BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy7399
(107 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|156547173|ref|XP_001603910.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 578
Score = 128 bits (322), Expect = 4e-28, Method: Composition-based stats.
Identities = 60/101 (59%), Positives = 78/101 (77%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+S+GR++ SL + G+L NS++VF+SDNGG T+ N ASNWPLRGTKY+
Sbjct: 247 MVARLDDSLGRIVSSLGDRGMLKNSVIVFMSDNGGATIGKFR---NWASNWPLRGTKYTL 303
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR+VAA+WSP LL KG+V E L H+TDWLPTL+ +AG
Sbjct: 304 FEGGVRSVAAMWSPKLLSKGRVSEQLFHVTDWLPTLFQVAG 344
>gi|193641058|ref|XP_001942872.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 575
Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/101 (64%), Positives = 74/101 (73%), Gaps = 1/101 (0%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+SVG++I+SL E +L NS+VVFISDNG PT D L H N SNWPLRG K S
Sbjct: 263 MVTSLDDSVGKIIESLHEHEMLENSIVVFISDNGAPTDDPLWGHDNFGSNWPLRGKKASV 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGGVR VAA+WSP L+ K V NL HITDWLPTLY AG
Sbjct: 323 LEGGVRGVAAIWSPWLKEKNTVSNNLFHITDWLPTLYTAAG 363
>gi|328726458|ref|XP_001949756.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 402
Score = 125 bits (315), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 64/101 (63%), Positives = 74/101 (73%), Gaps = 1/101 (0%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+SVG++I+SL E +L NS+VVF+SDNG PT D L H N SNWPLRG K S
Sbjct: 90 MVTSLDDSVGKIIESLHEHEMLENSIVVFMSDNGAPTDDPLWGHDNFGSNWPLRGKKASV 149
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGGVR VAA+WSP L+ K V NL HITDWLPTLY AG
Sbjct: 150 LEGGVRGVAAIWSPWLKEKNTVSNNLFHITDWLPTLYTAAG 190
>gi|193641124|ref|XP_001950120.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 599
Score = 125 bits (314), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 61/101 (60%), Positives = 77/101 (76%), Gaps = 1/101 (0%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML SLDESVG +I+SL+ G+L +S+V+FISDNG P D + +GN+ SNWPLRG K +
Sbjct: 263 MLTSLDESVGSIIESLDNNGMLEDSIVLFISDNGAPADDPIWGYGNSGSNWPLRGEKGAV 322
Query: 61 HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
+GGVR VAA+WSP L+K ++ ENL HITDWLPTLY AG
Sbjct: 323 LDGGVRGVAAIWSPWLKKKHRIFENLFHITDWLPTLYTAAG 363
>gi|328788250|ref|XP_624148.3| PREDICTED: arylsulfatase B-like isoform 2 [Apis mellifera]
Length = 564
Score = 118 bits (295), Expect = 5e-25, Method: Composition-based stats.
Identities = 57/101 (56%), Positives = 76/101 (75%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDESVGRV+ +L E+G+L +SL++F++DNG ++ + N SN+PLRGTKY+
Sbjct: 251 MVSKLDESVGRVVHALGEKGMLRDSLILFLTDNGAASIGR---YRNYGSNYPLRGTKYTL 307
Query: 61 HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
+EGGVR VAALWS L KG +V + L+HITDWLPTLY AG
Sbjct: 308 YEGGVRGVAALWSSRLEKGARVFKKLIHITDWLPTLYSAAG 348
>gi|383847821|ref|XP_003699551.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
Length = 575
Score = 116 bits (291), Expect = 2e-24, Method: Composition-based stats.
Identities = 56/101 (55%), Positives = 77/101 (76%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVGR++ +L E+G+L +SL++F++DNG +V + N SN+PLRGTKY+
Sbjct: 248 MVSRLDDSVGRIVHALGEKGMLRDSLILFLTDNGAASVGR---YRNWGSNYPLRGTKYTL 304
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
+EGGVR+VAALWSP L+K +V L+HITDWLPTLY AG
Sbjct: 305 YEGGVRSVAALWSPRLQKVARVSNQLIHITDWLPTLYSAAG 345
>gi|380025784|ref|XP_003696648.1| PREDICTED: arylsulfatase B-like [Apis florea]
Length = 579
Score = 115 bits (288), Expect = 3e-24, Method: Composition-based stats.
Identities = 55/101 (54%), Positives = 75/101 (74%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDESVGR++ +L E+G+L +SL++F++DNG ++ + N SN+PLRGTKY+
Sbjct: 251 MVSKLDESVGRIVHALGEKGMLRDSLILFLTDNGAASIGR---YRNYGSNYPLRGTKYTL 307
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
+EGGVR VAALWS L K +V + L+HITDWLPTLY AG
Sbjct: 308 YEGGVRGVAALWSSRLEKAARVFKKLIHITDWLPTLYSAAG 348
>gi|307187655|gb|EFN72627.1| Arylsulfatase B [Camponotus floridanus]
Length = 591
Score = 115 bits (287), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 56/101 (55%), Positives = 74/101 (73%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD SVGR++Q+L +G+L NSL++F++DNG + +G SN+PLRG KY+
Sbjct: 252 MVLRLDNSVGRIVQALGNKGMLKNSLILFLTDNGAAPIGKFRNYG---SNYPLRGMKYTL 308
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
+EGGVR VA LWSP LRK +V + LMH+TDWLPTLY IAG
Sbjct: 309 YEGGVRGVALLWSPRLRKTARVSDELMHVTDWLPTLYSIAG 349
>gi|242008416|ref|XP_002425002.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212508631|gb|EEB12264.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 532
Score = 113 bits (283), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 55/100 (55%), Positives = 77/100 (77%), Gaps = 1/100 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
L+ LD+SVG V+++L+E+ +L NS+++ +SDNGG TV S+ N +SNWPL GTK++
Sbjct: 261 CLEELDKSVGEVMEALKEKNMLDNSIILIMSDNGGHTV-SVDLPPNWSSNWPLGGTKFTL 319
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR+VA +WSPLL KG + ++ +HITDWLPTLY AG
Sbjct: 320 FEGGVRSVALIWSPLLPKGVINDDFIHITDWLPTLYSAAG 359
>gi|307215080|gb|EFN89887.1| Arylsulfatase B [Harpegnathos saltator]
Length = 593
Score = 112 bits (279), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 55/101 (54%), Positives = 70/101 (69%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDESVGR++ +L G+L +SL++F++DNG PT+ N SN PLRG KY+
Sbjct: 255 MVTRLDESVGRIVHTLGNRGMLRDSLILFLTDNGAPTIGKFR---NWGSNLPLRGMKYTL 311
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
+EGGVR VA LWSP LRK +V L+HITDWLPT Y AG
Sbjct: 312 YEGGVRGVAVLWSPRLRKAARVCNELVHITDWLPTFYSAAG 352
>gi|350422929|ref|XP_003493332.1| PREDICTED: arylsulfatase B-like [Bombus impatiens]
Length = 581
Score = 111 bits (278), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 56/103 (54%), Positives = 75/103 (72%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDESVGR+ +L E+G+L +SL++F++DNG +V + N SN+PLRGTKY+
Sbjct: 252 MVSKLDESVGRIFHALGEKGMLKDSLILFLTDNGAASVGR---YRNYGSNYPLRGTKYTL 308
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKE 102
+EGGVR VAALWS L+K + N L+HITDWLPTLY AG +
Sbjct: 309 YEGGVRGVAALWSSRLQKAARVSNELVHITDWLPTLYSAAGGD 351
>gi|340710385|ref|XP_003393772.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
Length = 545
Score = 111 bits (277), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 57/101 (56%), Positives = 72/101 (71%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDESVG+V+++L ++G+L NS++VF +DNGGP F+ N ASNWPLRGTK +
Sbjct: 251 MLSKLDESVGQVVEALRKKGMLKNSVIVFSTDNGGPPAG---FNLNAASNWPLRGTKNTL 307
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR LWSP L+R G+V L HI+DWLPTL AG
Sbjct: 308 WEGGVRGAGLLWSPRLVRPGRVSRQLFHISDWLPTLITAAG 348
>gi|350415537|ref|XP_003490674.1| PREDICTED: arylsulfatase J-like [Bombus impatiens]
Length = 545
Score = 111 bits (277), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 57/101 (56%), Positives = 72/101 (71%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDESVG+V+++L ++G+L NS++VF +DNGGP F+ N ASNWPLRGTK +
Sbjct: 251 MLSKLDESVGQVVEALRKKGMLKNSVIVFSTDNGGPPAG---FNLNAASNWPLRGTKNTL 307
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR LWSP L+R G+V L HI+DWLPTL AG
Sbjct: 308 WEGGVRGAGLLWSPRLVRPGRVSRQLFHISDWLPTLITAAG 348
>gi|340727296|ref|XP_003401982.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
Length = 579
Score = 110 bits (276), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 56/101 (55%), Positives = 73/101 (72%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDESVGR+ +L E+G+L +SL++F++DNG +V + N SN+PLRGTKY+
Sbjct: 252 MVSKLDESVGRIFHALGEKGMLKDSLILFLTDNGAASVGR---YRNYGSNYPLRGTKYTL 308
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
+EGGVR VAALWS L R +V L+HITDWLPTLY AG
Sbjct: 309 YEGGVRGVAALWSSRLQRAARVSNELVHITDWLPTLYSAAG 349
>gi|383853606|ref|XP_003702313.1| PREDICTED: arylsulfatase J-like [Megachile rotundata]
Length = 544
Score = 110 bits (276), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 56/105 (53%), Positives = 73/105 (69%), Gaps = 4/105 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDESVG V+++L+ +G+L NS++VF +DNGGP F+ N ASNWPLRGTK +
Sbjct: 250 MLSKLDESVGEVVEALQRKGMLKNSIIVFSTDNGGPPAG---FNLNAASNWPLRGTKNTL 306
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR +WSP L+R G+V L HI+DWLPTL AG + +
Sbjct: 307 WEGGVRGAGLIWSPRLIRPGRVSRQLFHISDWLPTLLTAAGGDPS 351
>gi|157103779|ref|XP_001648126.1| arylsulfatase b [Aedes aegypti]
gi|108880481|gb|EAT44706.1| AAEL003960-PA [Aedes aegypti]
Length = 472
Score = 108 bits (271), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/105 (53%), Positives = 71/105 (67%), Gaps = 4/105 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LD+SVG +I +L E G+L NS++VF +DNGGP F+ N ASNWPLRGTK +
Sbjct: 170 MLTELDQSVGAIIVALSERGMLDNSIIVFSTDNGGPAEG---FNNNAASNWPLRGTKNTL 226
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR +WSPL+ K +V +MHI+DWLPTL AG + N
Sbjct: 227 WEGGVRGAGCIWSPLIEEKRRVSHQIMHISDWLPTLLDAAGYDLN 271
>gi|241708194|ref|XP_002413315.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215507129|gb|EEC16623.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 294
Score = 108 bits (270), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 55/102 (53%), Positives = 70/102 (68%), Gaps = 5/102 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDESVG V ++L++ G+L NS+VVF SDNGG + + N+ S WPLRG K S
Sbjct: 43 MVSALDESVGAVFEALDKRGMLSNSIVVFSSDNGGAPDEETY---NSGSPWPLRGQKMSL 99
Query: 61 HEGGVRNVAALWSPLLR--KGQVLENLMHITDWLPTLYFIAG 100
EGGVR A LWSPLL +G + + L HITDWLPT Y +AG
Sbjct: 100 SEGGVRAPALLWSPLLSATEGSIFDGLFHITDWLPTFYELAG 141
>gi|156547171|ref|XP_001603886.1| PREDICTED: arylsulfatase B [Nasonia vitripennis]
Length = 581
Score = 108 bits (269), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 52/101 (51%), Positives = 71/101 (70%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDESVGRV+++L+E +L NS+++F+SDNG PTV + NT SN+P+RG K
Sbjct: 268 MMAALDESVGRVVKALKEAEMLENSIIIFMSDNGAPTVG---LYNNTGSNYPMRGIKGGM 324
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EG R A ++SPL++ +V E LMHI DWLPTLY AG
Sbjct: 325 FEGAARAAACIFSPLIKAHSRVSEELMHIVDWLPTLYTAAG 365
>gi|307215079|gb|EFN89886.1| Arylsulfatase B [Harpegnathos saltator]
Length = 557
Score = 107 bits (267), Expect = 8e-22, Method: Composition-based stats.
Identities = 52/98 (53%), Positives = 68/98 (69%), Gaps = 3/98 (3%)
Query: 4 SLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEG 63
++DES+GRV+ +L +L NS++VF+SDNG PT D + + N SN+PLRG K SF EG
Sbjct: 268 TMDESIGRVVDALRRADMLKNSIIVFMSDNGAPTKDQILY--NFGSNYPLRGMKQSFFEG 325
Query: 64 GVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
VR VA ++SPL+ +V L HITDWLPTLY AG
Sbjct: 326 AVRGVACIYSPLIDFPSRVSTQLFHITDWLPTLYAAAG 363
>gi|189236319|ref|XP_975218.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
Length = 536
Score = 107 bits (267), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 74/106 (69%), Gaps = 4/106 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+S+G V ++LE + +L N++VVFISDNG PTV H N SN+PLRG K +
Sbjct: 235 MVSKLDDSIGAVFEALEAKNMLQNTIVVFISDNGAPTVGP---HHNWGSNYPLRGIKDTL 291
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKNI 105
EGGVR VA +WSPLL + +V +L+HITDWLPTL+ G + ++
Sbjct: 292 FEGGVRTVACIWSPLLVQSSRVSTDLIHITDWLPTLFTAVGGDLSV 337
>gi|270005853|gb|EFA02301.1| hypothetical protein TcasGA2_TC007966 [Tribolium castaneum]
Length = 558
Score = 107 bits (267), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 74/106 (69%), Gaps = 4/106 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+S+G V ++LE + +L N++VVFISDNG PTV H N SN+PLRG K +
Sbjct: 257 MVSKLDDSIGAVFEALEAKNMLQNTIVVFISDNGAPTVGP---HHNWGSNYPLRGIKDTL 313
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKNI 105
EGGVR VA +WSPLL + +V +L+HITDWLPTL+ G + ++
Sbjct: 314 FEGGVRTVACIWSPLLVQSSRVSTDLIHITDWLPTLFTAVGGDLSV 359
>gi|391330458|ref|XP_003739677.1| PREDICTED: arylsulfatase J-like [Metaseiulus occidentalis]
Length = 633
Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats.
Identities = 53/104 (50%), Positives = 76/104 (73%), Gaps = 4/104 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPT-VDSLHFHGNTASNWPLRGTKYS 59
M+++LD++ G+ + +L+E +L NS+++F SDNGGP+ VD + G ASNWPLRG K+S
Sbjct: 285 MIRALDDAFGQTVVALKEAEMLQNSVILFTSDNGGPSIVDESYKTG--ASNWPLRGQKFS 342
Query: 60 FHEGGVRNVAALWSPLLRKGQVL-ENLMHITDWLPTLYFIAGKE 102
EGGVR + LWSPLL+ + + ++L HITDWLPT Y +AG E
Sbjct: 343 MWEGGVRVSSFLWSPLLKSQRYINKHLYHITDWLPTFYKLAGGE 386
>gi|332016485|gb|EGI57378.1| Arylsulfatase I [Acromyrmex echinatior]
Length = 502
Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats.
Identities = 52/100 (52%), Positives = 69/100 (69%), Gaps = 4/100 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG ++ +L G+L +S+++F++DNG V +G SN+PLRG KY+
Sbjct: 160 MVSKLDDSVGLIVNTLGSRGMLRDSVILFLTDNGAAPVGKFRNYG---SNFPLRGMKYTL 216
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIA 99
+EGGVR VA LWSP L K +V NLMHITDWLPTLY A
Sbjct: 217 YEGGVRGVAVLWSPRLSKAARVCNNLMHITDWLPTLYAAA 256
>gi|390360370|ref|XP_791935.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 374
Score = 106 bits (265), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 52/103 (50%), Positives = 72/103 (69%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG + Q+L++ GL NS++VF +DNGGPT + GN ASNWPLRG K++
Sbjct: 257 MVAGLDDSVGNISQTLQDAGLYNNSIIVFTTDNGGPTNG---YDGNHASNWPLRGCKHTL 313
Query: 61 HEGGVRNVAALWSPLLRKGQVL-ENLMHITDWLPTLYFIAGKE 102
EGGVR A + SPL+ K + + +MH+ DWLPTLY +AG +
Sbjct: 314 WEGGVRGTAFVNSPLIEKPRRFSDRMMHVCDWLPTLYGVAGGD 356
>gi|427793479|gb|JAA62191.1| Putative arylsulfatase b, partial [Rhipicephalus pulchellus]
Length = 512
Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats.
Identities = 56/104 (53%), Positives = 70/104 (67%), Gaps = 7/104 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDESVG V ++L + G+L N+L VF SDNG D+ + N AS+WPLRG KY+
Sbjct: 217 MVSALDESVGLVFEALAKNGMLSNTLFVFTSDNGA---DAAGPNANYASSWPLRGQKYTP 273
Query: 61 HEGGVRNVAALWS----PLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR A LWS P +R G E+L HITDWLPTLY +AG
Sbjct: 274 WEGGVRAAALLWSHALGPRVRAGGDYEHLFHITDWLPTLYQLAG 317
>gi|307187654|gb|EFN72626.1| Arylsulfatase B [Camponotus floridanus]
Length = 525
Score = 106 bits (265), Expect = 2e-21, Method: Composition-based stats.
Identities = 54/101 (53%), Positives = 71/101 (70%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ +LDESVGRVI +L++ +L NS++VFISDNG T L +G SN+PLRG K+S
Sbjct: 232 VVATLDESVGRVIDALKKTDMLNNSIIVFISDNGAQTEGFLQNYG---SNYPLRGLKFSL 288
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGG+R A ++SPL+ R +V L HITDWLPTLY AG
Sbjct: 289 FEGGIRGAACIYSPLIDRPSRVSNQLFHITDWLPTLYSAAG 329
>gi|170040779|ref|XP_001848165.1| arylsulfatase B [Culex quinquefasciatus]
gi|167864376|gb|EDS27759.1| arylsulfatase B [Culex quinquefasciatus]
Length = 585
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 52/106 (49%), Positives = 72/106 (67%), Gaps = 4/106 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+ VG+++Q+L++ G+L N++V+F +DNG PT+ H N SN+PLRG KYS
Sbjct: 259 MISKLDDGVGQIVQALKKSGMLNNTVVLFYADNGAPTIGK---HSNGGSNFPLRGQKYSP 315
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKNI 105
EG VR AA+WSPLL G+V +H++DWLPTL AG E N
Sbjct: 316 WEGAVRTAAAIWSPLLNNTGRVSNQWIHVSDWLPTLARAAGIEANF 361
>gi|270008609|gb|EFA05057.1| hypothetical protein TcasGA2_TC015152 [Tribolium castaneum]
Length = 563
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 53/101 (52%), Positives = 72/101 (71%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ LD S+GR++ L+E+ +L NS+V+F SDNG TV + N+ SNWPLRG K+S
Sbjct: 270 VVSHLDASIGRIMAKLDEKQMLDNSIVLFFSDNGAQTVG---MYENSGSNWPLRGVKFSD 326
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR A ++SPL +KG V E+L+HI+DWLPTLY AG
Sbjct: 327 FEGGVRVAATIYSPLFHKKGYVSEHLIHISDWLPTLYSAAG 367
>gi|321470034|gb|EFX81012.1| hypothetical protein DAPPUDRAFT_303738 [Daphnia pulex]
Length = 557
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 52/106 (49%), Positives = 74/106 (69%), Gaps = 6/106 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD SVG++++SL + +L N+++VF +DNGGP F+ N ASNWPL+G K +
Sbjct: 250 MVHELDVSVGKIVKSLGDNNMLENTVIVFSTDNGGPAEG---FNQNAASNWPLKGVKNTP 306
Query: 61 HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAGKEK 103
EGGVR +WSPL+ R+GQV+ NLM I+DWLPTL+ AG+ +
Sbjct: 307 WEGGVRAAGLIWSPLIPKSRRGQVMSNLMDISDWLPTLFEAAGQSR 352
>gi|91084737|ref|XP_970917.1| PREDICTED: similar to arylsulfatase B [Tribolium castaneum]
Length = 531
Score = 105 bits (263), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 53/101 (52%), Positives = 72/101 (71%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ LD S+GR++ L+E+ +L NS+V+F SDNG TV + N+ SNWPLRG K+S
Sbjct: 255 VVSHLDASIGRIMAKLDEKQMLDNSIVLFFSDNGAQTVG---MYENSGSNWPLRGVKFSD 311
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR A ++SPL +KG V E+L+HI+DWLPTLY AG
Sbjct: 312 FEGGVRVAATIYSPLFHKKGYVSEHLIHISDWLPTLYSAAG 352
>gi|390361962|ref|XP_789345.3| PREDICTED: arylsulfatase I-like, partial [Strongylocentrotus
purpuratus]
Length = 514
Score = 105 bits (263), Expect = 3e-21, Method: Composition-based stats.
Identities = 51/101 (50%), Positives = 71/101 (70%), Gaps = 3/101 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+SVG + Q+L + GL NS++VF +DNGG T D F G ASNWPLRG K++
Sbjct: 236 MVSALDDSVGSISQTLRDAGLYDNSIIVFTTDNGGATYD--FFDGTHASNWPLRGGKHTL 293
Query: 61 HEGGVRNVAALWSPLLRKGQVL-ENLMHITDWLPTLYFIAG 100
EGGVR A + SPL++K + + +MH+ DWLPTL+ +AG
Sbjct: 294 WEGGVRGTAFVNSPLIKKPRRFSDQMMHVCDWLPTLHSVAG 334
>gi|312382061|gb|EFR27642.1| hypothetical protein AND_05535 [Anopheles darlingi]
Length = 881
Score = 105 bits (263), Expect = 3e-21, Method: Composition-based stats.
Identities = 53/101 (52%), Positives = 70/101 (69%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG+V ++L++ G+L NS++VFISDNGG T H N+ SN+PLRG K+S
Sbjct: 261 MMSKLDDSVGQVYRALQQHGMLENSIIVFISDNGGVTRG---MHSNSGSNYPLRGQKHSP 317
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EG VR A +WSPLL+ Q + N HI+DWLPTL AG
Sbjct: 318 WEGAVRTAALIWSPLLKDRQRVSNQWFHISDWLPTLASAAG 358
>gi|427779723|gb|JAA55313.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
Length = 593
Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats.
Identities = 52/101 (51%), Positives = 69/101 (68%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+SVG+V+++L + G+L N+++VF SDNGG H + NWPLRG K +
Sbjct: 299 MVDALDQSVGQVVKALSDAGMLENTIIVFSSDNGGGPWGE---HNSRGFNWPLRGAKGTV 355
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGG R A +WSPLL R+ +V LMHITDWLPT Y IAG
Sbjct: 356 WEGGTRAAAFVWSPLLARRRRVSHQLMHITDWLPTFYSIAG 396
>gi|350422910|ref|XP_003493325.1| PREDICTED: arylsulfatase I-like [Bombus impatiens]
Length = 563
Score = 105 bits (262), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 53/112 (47%), Positives = 78/112 (69%), Gaps = 9/112 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ ++DESVGRV+Q+L + +L NS++VF+SDNG TV L+ +G SN+PLRG K++
Sbjct: 267 VVTAMDESVGRVVQALRQTNMLENSIIVFMSDNGAQTVGLLNNYG---SNYPLRGLKFTL 323
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGK-----EKNID 106
EGGVR VA ++SP ++ + N L+H+TDWLPT Y AG E+N+D
Sbjct: 324 FEGGVRGVACVYSPAIKNPSRISNQLIHVTDWLPTFYSAAGGNLEDLEENLD 375
>gi|427781895|gb|JAA56399.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
Length = 554
Score = 105 bits (262), Expect = 4e-21, Method: Composition-based stats.
Identities = 52/101 (51%), Positives = 69/101 (68%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+SVG+V+++L + G+L N+++VF SDNGG H + NWPLRG K +
Sbjct: 260 MVDALDQSVGQVVKALSDAGMLENTIIVFSSDNGGGPWGE---HNSRGFNWPLRGAKGTV 316
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGG R A +WSPLL R+ +V LMHITDWLPT Y IAG
Sbjct: 317 WEGGTRAAAFVWSPLLARRRRVSHQLMHITDWLPTFYSIAG 357
>gi|241779012|ref|XP_002399826.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215508524|gb|EEC17978.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 103
Score = 105 bits (261), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 53/102 (51%), Positives = 70/102 (68%), Gaps = 4/102 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDES+G V Q+L + G+LGN+++ SDNGG + + H N N+PLRG K +
Sbjct: 1 MVDALDESIGEVFQALGDAGMLGNTIIALSSDNGGLPIGA---HSNRGFNFPLRGGKGTL 57
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGK 101
EGG R A +WSPLL RKG V + +MHITDWLPTLY AG+
Sbjct: 58 WEGGCRATAFIWSPLLKRKGVVSDQMMHITDWLPTLYSAAGQ 99
>gi|307187653|gb|EFN72625.1| Arylsulfatase B [Camponotus floridanus]
Length = 525
Score = 105 bits (261), Expect = 4e-21, Method: Composition-based stats.
Identities = 54/101 (53%), Positives = 70/101 (69%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ +LDESVGR+I +L + +L NS++VFISDNG T L +G SN+PLRG K+S
Sbjct: 244 VVATLDESVGRIIDALRKTDMLKNSIIVFISDNGAQTEGFLQNYG---SNYPLRGLKFSL 300
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR A ++SPL+ R +V L HITDWLPTLY AG
Sbjct: 301 FEGGVRGAAYIYSPLIDRLSRVSTQLFHITDWLPTLYSAAG 341
>gi|91084739|ref|XP_970972.1| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
gi|270008608|gb|EFA05056.1| hypothetical protein TcasGA2_TC015151 [Tribolium castaneum]
Length = 558
Score = 104 bits (260), Expect = 7e-21, Method: Composition-based stats.
Identities = 49/101 (48%), Positives = 70/101 (69%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ LD S+G+V++ L E +L NS+++F SDNG PTV + N+ SNWPLRG K +
Sbjct: 254 IVAELDRSIGQVVRKLSERQMLENSIILFFSDNGAPTVGP---YTNSGSNWPLRGIKLTN 310
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGG+R A ++SPLL+K G V + L+H++DWLPT Y AG
Sbjct: 311 FEGGIRGTATIFSPLLKKRGYVNKELIHVSDWLPTFYAAAG 351
>gi|307207313|gb|EFN85063.1| Arylsulfatase B [Harpegnathos saltator]
Length = 532
Score = 104 bits (259), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 55/110 (50%), Positives = 74/110 (67%), Gaps = 7/110 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+L LD+SVG+V++ L ++G+L +S+++F +DNGGP F+ N ASNWPLRG K +
Sbjct: 239 ILSKLDKSVGQVVEGLRKKGMLRDSVIIFSTDNGGPAAG---FNLNAASNWPLRGVKNTL 295
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEK---NID 106
EGGVR +WSP L R G+V L+HITDWLPTL AG + NID
Sbjct: 296 WEGGVRGAGLIWSPKLTRSGRVSRQLLHITDWLPTLITAAGGDPSNLNID 345
>gi|332024600|gb|EGI64798.1| Arylsulfatase J [Acromyrmex echinatior]
Length = 528
Score = 104 bits (259), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 52/105 (49%), Positives = 71/105 (67%), Gaps = 4/105 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LD+SVG+V+ +L + +L +S+++F SDNGGP F+ N ASNWPLRG K +
Sbjct: 234 MLSKLDQSVGQVVDALRKRDMLRDSIIIFSSDNGGPAAG---FNLNAASNWPLRGVKNTL 290
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR +WSP L+R G+V ++HITDWLPTL AG + +
Sbjct: 291 WEGGVRGTGLIWSPRLVRPGRVSRQMLHITDWLPTLITAAGGDSS 335
>gi|307191747|gb|EFN75189.1| Arylsulfatase B [Harpegnathos saltator]
Length = 583
Score = 104 bits (259), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 52/104 (50%), Positives = 72/104 (69%), Gaps = 4/104 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ LDESVG+V+++L+ +L NS+++F+SDNG PT F N SN+P RG K +
Sbjct: 267 VVSKLDESVGKVVEALKNRNMLSNSIILFMSDNGAPTSG---FLSNGGSNYPFRGIKKTL 323
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAGKEK 103
EGGVR AA+WSPL++K + V +LMH+TDWLPTL AG K
Sbjct: 324 WEGGVRGAAAIWSPLIKKRERVSYHLMHVTDWLPTLLSAAGGNK 367
>gi|242025544|ref|XP_002433184.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212518725|gb|EEB20446.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 610
Score = 104 bits (259), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 55/111 (49%), Positives = 75/111 (67%), Gaps = 8/111 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ LDESVGR+I +LEE +L NS++VF +DNG PT+ + N SN+PLRG K +
Sbjct: 271 MVQKLDESVGRLIAALEESNMLHNSIIVFSADNGAPTIGG---YPNWGSNYPLRGIKETL 327
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKE----KNID 106
EGG R V+ +WSPL++ + N L+HI+DWLPTLY AG KN+D
Sbjct: 328 WEGGTRGVSFIWSPLIKNTPRVSNELIHISDWLPTLYAAAGGNMENVKNLD 378
>gi|270006267|gb|EFA02715.1| hypothetical protein TcasGA2_TC008439 [Tribolium castaneum]
Length = 648
Score = 103 bits (258), Expect = 9e-21, Method: Composition-based stats.
Identities = 55/104 (52%), Positives = 72/104 (69%), Gaps = 4/104 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG VI +L ++ +L NS++VF+SDNG D +H N SN+PLRG K S
Sbjct: 249 MVSMLDKSVGSVIAALRDKHMLENSIIVFMSDNGAKP-DGIH--ANHGSNYPLRGNKNSA 305
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKEK 103
EG +R VAA+WSPL++K Q + N LMHI+DWLPT Y AG K
Sbjct: 306 WEGAMRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNK 349
>gi|350414673|ref|XP_003490384.1| PREDICTED: hypothetical protein LOC100748821, partial [Bombus
impatiens]
Length = 413
Score = 103 bits (258), Expect = 9e-21, Method: Composition-based stats.
Identities = 56/111 (50%), Positives = 74/111 (66%), Gaps = 8/111 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ LD+SVG V+ +L G+L NS+VVF+SDNG PT L N SN+PLRG K S
Sbjct: 1 VVSKLDQSVGDVMDALRSRGMLENSIVVFMSDNGAPTNGIL---SNQGSNYPLRGIKNSP 57
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKEK----NID 106
EGG R VAA+WSPL+RK + + N +M ++DWLPTL AG ++ NID
Sbjct: 58 WEGGTRGVAAIWSPLIRKSKRVSNQMMFMSDWLPTLLSAAGVDRIQLGNID 108
>gi|340727298|ref|XP_003401983.1| PREDICTED: arylsulfatase B-like [Bombus terrestris]
Length = 563
Score = 103 bits (258), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/112 (46%), Positives = 77/112 (68%), Gaps = 9/112 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ +LDESVGRV+Q+L++ +L NS+++F+SDNG TV L N+ SN+PLRG K++
Sbjct: 267 VVTALDESVGRVVQALKQANMLENSIILFMSDNGAQTVGLL---DNSGSNYPLRGMKFTL 323
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGK-----EKNID 106
EGGVR A ++SP ++ + N L+H+TDWLPT Y AG E+N+D
Sbjct: 324 FEGGVRGAACVYSPAIKNPSRISNQLIHVTDWLPTFYSAAGGNLEDLEENLD 375
>gi|170040781|ref|XP_001848166.1| arylsulfatase b [Culex quinquefasciatus]
gi|167864377|gb|EDS27760.1| arylsulfatase b [Culex quinquefasciatus]
Length = 657
Score = 103 bits (258), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/101 (51%), Positives = 69/101 (68%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG + +SL + ++ NS+++FISDNG P S H N+ SN+PLRG K S
Sbjct: 238 MVSKLDDSVGAIFKSLRAKNMIQNSIILFISDNGAP---SFGLHSNSGSNFPLRGQKNSP 294
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EG RNVAA+WSPLL + Q + N +HI+DWLPT IAG
Sbjct: 295 WEGATRNVAAIWSPLLEERQRVSNQYIHISDWLPTFASIAG 335
>gi|241999506|ref|XP_002434396.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215497726|gb|EEC07220.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 458
Score = 103 bits (258), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/102 (51%), Positives = 68/102 (66%), Gaps = 5/102 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDESVG V +SL G+L N++ VF SDNG D+ + NTAS+WP +G KY+
Sbjct: 161 MVSALDESVGSVFESLNHRGMLDNTVFVFSSDNGA---DTDSANANTASSWPFKGQKYTP 217
Query: 61 HEGGVRNVAALWSPLLRK--GQVLENLMHITDWLPTLYFIAG 100
EGGVR A +WSPL G V N+ HI+DWLPTLY +AG
Sbjct: 218 WEGGVRAPAVIWSPLFSGLLGSVYNNMFHISDWLPTLYQLAG 259
>gi|189236827|ref|XP_972832.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
Length = 646
Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 55/104 (52%), Positives = 72/104 (69%), Gaps = 4/104 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG VI +L ++ +L NS++VF+SDNG D +H N SN+PLRG K S
Sbjct: 247 MVSMLDKSVGSVIAALRDKHMLENSIIVFMSDNGA-KPDGIH--ANHGSNYPLRGNKNSA 303
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKEK 103
EG +R VAA+WSPL++K Q + N LMHI+DWLPT Y AG K
Sbjct: 304 WEGAMRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNK 347
>gi|242025556|ref|XP_002433190.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212518731|gb|EEB20452.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 570
Score = 103 bits (256), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 53/101 (52%), Positives = 69/101 (68%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDESVG+V+ +LE+ +L NS+++F+SDNG T H N SN+PLRG K S
Sbjct: 272 MVTKLDESVGQVVSALEKNKMLNNSIIIFMSDNGAATYG---LHSNRGSNYPLRGLKESP 328
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGGVR AA+WSP L K +V + LMH++DWLPTL AG
Sbjct: 329 WEGGVRGTAAIWSPFLNKTKRVSKQLMHMSDWLPTLLTAAG 369
>gi|198466304|ref|XP_002135153.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
gi|198150538|gb|EDY73780.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
Length = 577
Score = 103 bits (256), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/101 (51%), Positives = 68/101 (67%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+SVG+ +++L + G+L NS+V+ SDNG PTV H N SN+P RG K S
Sbjct: 255 MISSLDKSVGQTMRALADNGMLNNSIVLLYSDNGAPTVG---IHSNAGSNYPFRGQKESP 311
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGG+R+V ALWSPLL ++G V +H DWLPTL AG
Sbjct: 312 WEGGIRSVGALWSPLLQQRGYVSNQAIHAIDWLPTLAAAAG 352
>gi|158287209|ref|XP_564139.3| AGAP011347-PA [Anopheles gambiae str. PEST]
gi|157019541|gb|EAL41524.3| AGAP011347-PA [Anopheles gambiae str. PEST]
Length = 634
Score = 103 bits (256), Expect = 2e-20, Method: Composition-based stats.
Identities = 53/101 (52%), Positives = 66/101 (65%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG V +SL G+L NS++VF+SDNGG T H NT SN+P RG K+S
Sbjct: 261 MMSKLDDSVGAVYESLRANGMLDNSIIVFLSDNGGVTRG---MHSNTGSNYPFRGQKHSP 317
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EG VR A +WSPLL+ Q + N HI+DWLPTL AG
Sbjct: 318 WEGAVRTAALIWSPLLKDTQRVSNQWFHISDWLPTLASAAG 358
>gi|241601300|ref|XP_002405277.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215502504|gb|EEC11998.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 234
Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 70/101 (69%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+SVG V+ +L++ +L ++++VF +DNG F N SNWPLRGTK++
Sbjct: 137 MVDALDQSVGSVVDALQKADMLTDTILVFCTDNGAL---PWGFKSNRGSNWPLRGTKFTL 193
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAG 100
+EGGVR+ A +WSPL+ + V LMHITDWLPTLY AG
Sbjct: 194 YEGGVRSTAFIWSPLVNNSKRVSSQLMHITDWLPTLYSAAG 234
>gi|241834402|ref|XP_002414990.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215509202|gb|EEC18655.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 425
Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/101 (51%), Positives = 68/101 (67%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +LDES+GRV+++L G+L ++++VF SDNGG N NWPLRG K++
Sbjct: 176 MLDALDESMGRVVEALSSAGMLEDTIIVFSSDNGG---GPYGIESNRGFNWPLRGAKFTV 232
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR A +WSP L+K +V LMHI+DWLPTLY AG
Sbjct: 233 WEGGVRVPAFVWSPKFLKKSRVSNQLMHISDWLPTLYTAAG 273
>gi|170050440|ref|XP_001861313.1| arylsulfatase b [Culex quinquefasciatus]
gi|167872047|gb|EDS35430.1| arylsulfatase b [Culex quinquefasciatus]
Length = 552
Score = 102 bits (255), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 71/101 (70%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDESVG ++++L+ +G+L N+++VF SDNGGP F+ N ASNWPLRG K +
Sbjct: 249 MLSELDESVGAIVEALKIKGMLDNTIIVFSSDNGGPAEG---FNSNAASNWPLRGVKNTL 305
Query: 61 HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
EGGVR +WSPL+ + +V + ++HI+DWLPTL AG
Sbjct: 306 WEGGVRAAGFIWSPLIAESRRVSQQMIHISDWLPTLLDAAG 346
>gi|390356459|ref|XP_003728793.1| PREDICTED: arylsulfatase I-like [Strongylocentrotus purpuratus]
Length = 613
Score = 102 bits (254), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 70/101 (69%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+S+G + QSL+E L N+L+VF +DNGGP F N ASNWPLRG K++
Sbjct: 286 MVAALDDSIGNITQSLKESSLYNNTLIVFTTDNGGPANG---FDFNYASNWPLRGMKHTT 342
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGG+R V LW L+ K G++ + +MH+ DW+PTLY +AG
Sbjct: 343 WEGGLRGVGFLWGALIEKPGRMSDGMMHVCDWVPTLYGLAG 383
>gi|332016484|gb|EGI57377.1| Arylsulfatase B [Acromyrmex echinatior]
Length = 438
Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats.
Identities = 49/101 (48%), Positives = 71/101 (70%), Gaps = 3/101 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ +LD+S+GRV+ +L ++ +L N++++FISDNG T L+ N SN+PLRG K+S
Sbjct: 185 VVTTLDKSIGRVVDALNKKDMLKNTIILFISDNGAQTEGPLY--QNFGSNYPLRGLKFSL 242
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
+EGGVR A ++SPL+ +V L HITDWLPTLY AG
Sbjct: 243 YEGGVRGAACIYSPLIENSSKVSTQLFHITDWLPTLYSAAG 283
>gi|307167595|gb|EFN61139.1| Arylsulfatase B [Camponotus floridanus]
Length = 519
Score = 102 bits (253), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 70/101 (69%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+L LD+SVG+V+++L + +L +S++VF +DNGGP F+ N ASNWPLRG K +
Sbjct: 223 ILSKLDQSVGQVVEALYKNNMLQDSVIVFSTDNGGPAAG---FNLNAASNWPLRGVKNTL 279
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR +WSP L+R G+V ++HITDWLPTL AG
Sbjct: 280 WEGGVRGAGLIWSPRLVRAGRVSRQMIHITDWLPTLITAAG 320
>gi|291244830|ref|XP_002742299.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
partial [Saccoglossus kowalevskii]
Length = 559
Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats.
Identities = 48/101 (47%), Positives = 69/101 (68%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+SVG + ++L+ GL N +++F +DNGGP F N ASNWPLRG K++
Sbjct: 252 MVSALDDSVGNITRALKHSGLYENCIILFSTDNGGPAAG---FDANYASNWPLRGIKHTL 308
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGGVR + SPLL K G++ +++H+ DWLPT+Y IAG
Sbjct: 309 WEGGVRGDGFIHSPLLEKPGRISTDMIHVCDWLPTIYHIAG 349
>gi|241156195|ref|XP_002407716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215494207|gb|EEC03848.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 548
Score = 102 bits (253), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 55/101 (54%), Positives = 64/101 (63%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD SVGRV+++L G+L NS+V F SDNG F N NWPLRG K S
Sbjct: 263 MVDALDVSVGRVVEALSRAGMLENSIVAFSSDNGA---VPFGFRSNRGFNWPLRGIKASV 319
Query: 61 HEGGVRNVAALWSPLLRKGQVLE-NLMHITDWLPTLYFIAG 100
EGGVR +WSPLLRK L +MHITDWLPTLY AG
Sbjct: 320 WEGGVRVPGFVWSPLLRKSARLSTQMMHITDWLPTLYAAAG 360
>gi|241244123|ref|XP_002402253.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215496292|gb|EEC05932.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 196
Score = 102 bits (253), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 52/100 (52%), Positives = 71/100 (71%), Gaps = 4/100 (4%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+ +LD+SVG V Q+LE++ +L N+++VF+SDNG N SNWPLRG K++
Sbjct: 95 VDALDQSVGIVFQALEKKDMLQNTVLVFLSDNGAIPWGEW---SNAGSNWPLRGAKFTLW 151
Query: 62 EGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EGGVR A +WSPLLRK Q + N LMH++DW+PTLY +AG
Sbjct: 152 EGGVRVPAFVWSPLLRKPQRVSNQLMHVSDWVPTLYSLAG 191
>gi|241143299|ref|XP_002404958.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215493668|gb|EEC03309.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 166
Score = 102 bits (253), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 52/100 (52%), Positives = 70/100 (70%), Gaps = 4/100 (4%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+ +LD+SVG V Q+LE++ +L N+++VFISDNG N SNWPLRG K++
Sbjct: 45 VDALDQSVGIVFQALEKKDMLQNTMLVFISDNGAIPWGEWS---NAGSNWPLRGAKFTLW 101
Query: 62 EGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EGGVR A +W PLLRK Q + N LMH++DW+PTLY +AG
Sbjct: 102 EGGVRVPAFVWGPLLRKPQRVSNQLMHVSDWVPTLYSLAG 141
>gi|241654408|ref|XP_002411325.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215503955|gb|EEC13449.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 510
Score = 101 bits (252), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 51/103 (49%), Positives = 69/103 (66%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +LDESVGRV+++L+ G+L ++++VF SDNGG N NWPLRG K++
Sbjct: 223 MLDALDESVGRVVEALDNAGMLEDTIIVFSSDNGGAPCGEA---SNQGFNWPLRGAKFTL 279
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKE 102
EG VR A +WSP L++ +V LMHI+DWLPTLY AG +
Sbjct: 280 WEGSVRVPAFVWSPKFLKQSRVSNQLMHISDWLPTLYTAAGGD 322
>gi|242023422|ref|XP_002432133.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212517507|gb|EEB19395.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 514
Score = 101 bits (252), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 49/105 (46%), Positives = 68/105 (64%), Gaps = 3/105 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDESVG+V+++L + ++ N +++F +DNGGP F+ N ASNWPLRG K +
Sbjct: 256 MLSKLDESVGKVVEALADSNMINNCVILFSTDNGGPAGG---FNLNAASNWPLRGVKDTL 312
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
EGGVR V +WSP L +V ++HITDWLPTL + +I
Sbjct: 313 WEGGVRGVGFIWSPFLPSSKVSNAMIHITDWLPTLLSLTNASNSI 357
>gi|195057745|ref|XP_001995315.1| GH22700 [Drosophila grimshawi]
gi|193899521|gb|EDV98387.1| GH22700 [Drosophila grimshawi]
Length = 542
Score = 101 bits (252), Expect = 5e-20, Method: Composition-based stats.
Identities = 49/101 (48%), Positives = 70/101 (69%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ ++DESVG++++ L + +L NS++VF +DNGGP + F+ N ASN+PLRG K +
Sbjct: 240 LVTAMDESVGKIVEQLRKSRMLENSIIVFSTDNGGP---AEGFNSNFASNYPLRGVKNTL 296
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR A LWSP L ++ + E LMH+ DWLPTL AG
Sbjct: 297 WEGGVRGAALLWSPQLTKRPRTAEQLMHMVDWLPTLVEAAG 337
>gi|328788246|ref|XP_395125.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
Length = 562
Score = 101 bits (252), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 76/112 (67%), Gaps = 9/112 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ ++DESVGRVI++L + +L NS++VFISDNG T L +G SN+PLRG K++
Sbjct: 267 VVTAMDESVGRVIKALGQSSMLENSIIVFISDNGAQTEGLLENYG---SNYPLRGLKFTL 323
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGK-----EKNID 106
EGG+R VA ++S L++ + N LMHITDWLPT Y AG E+N+D
Sbjct: 324 FEGGIRGVACVYSRLIQNSSRISNELMHITDWLPTFYSAAGGNLENLEENMD 375
>gi|241756057|ref|XP_002401379.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215508427|gb|EEC17881.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 287
Score = 101 bits (252), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 52/98 (53%), Positives = 68/98 (69%), Gaps = 6/98 (6%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYSFHEG 63
+D+S+G VI++L + +LGNS+V+F +DNGG PT N NWPLRG K + EG
Sbjct: 5 MDQSIGLVIEALHKRNMLGNSIVIFSTDNGGLPT----GLGSNYGFNWPLRGFKATLWEG 60
Query: 64 GVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
GVR A +WSPLLR+ G++ +MHITDWLPTLY AG
Sbjct: 61 GVRGAAFIWSPLLRRSGRISSQMMHITDWLPTLYSAAG 98
>gi|380026538|ref|XP_003697007.1| PREDICTED: arylsulfatase J-like [Apis florea]
Length = 543
Score = 101 bits (252), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 54/105 (51%), Positives = 71/105 (67%), Gaps = 4/105 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+L LDESVG V+++L +L NS++VF +DNGG F+ N ASNWPLRGTK +
Sbjct: 249 ILSKLDESVGLVVEALRRRKMLENSVIVFSTDNGGAPAG---FNLNAASNWPLRGTKNTL 305
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V LWSP L++ G+V L+HI+DWLPTL AG + +
Sbjct: 306 WEGGVRGVGLLWSPKLIKPGRVGRQLIHISDWLPTLLTAAGGDPS 350
>gi|291244839|ref|XP_002742301.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 263
Score = 101 bits (252), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 70/101 (69%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDESVG + + L++ G+ N++++F +DNGGP F+ N ASNWPLRG K +
Sbjct: 83 MVSALDESVGNLTKVLKDSGMYNNTIIIFTTDNGGPAAG---FNNNHASNWPLRGIKATL 139
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGGVR + SPLL K G+V +++H+ DWLPT+Y IAG
Sbjct: 140 WEGGVRGDGFVHSPLLEKPGRVSTDMIHVCDWLPTIYHIAG 180
>gi|391327192|ref|XP_003738089.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 594
Score = 101 bits (251), Expect = 7e-20, Method: Composition-based stats.
Identities = 48/101 (47%), Positives = 71/101 (70%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LD SVGRV+++L ++G+L N++++F SDNGG + NT +N+PLRG K +
Sbjct: 305 MLAELDVSVGRVVEALAKKGILDNTIILFSSDNGGQATAPME---NTGTNFPLRGQKRTL 361
Query: 61 HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
EGG R A +WSPL+R+ +V +++HI DWLPT+Y +AG
Sbjct: 362 FEGGTRVPAFVWSPLIRRPRRVFYDMVHIVDWLPTIYSLAG 402
>gi|241836593|ref|XP_002415110.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215509322|gb|EEC18775.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 254
Score = 101 bits (251), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 53/97 (54%), Positives = 65/97 (67%), Gaps = 4/97 (4%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD+SVG V++SL G+L NS+VVF SDNG L N SNWPLRG K + EGG
Sbjct: 11 LDQSVGMVMESLHRRGMLENSVVVFSSDNGA---RPLGLGANAGSNWPLRGGKSTLWEGG 67
Query: 65 VRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
VR A +WSPLL R G++ + +MHI+DWLPT Y AG
Sbjct: 68 VRGAAFVWSPLLSRVGRLSDQMMHISDWLPTFYTAAG 104
>gi|241998988|ref|XP_002434137.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215495896|gb|EEC05537.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 167
Score = 100 bits (250), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 71/101 (70%), Gaps = 4/101 (3%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD+SVG V+++L + G+L NS++VF +DNGG N+ NWPLRG+K++ EGG
Sbjct: 65 LDKSVGLVMEALNKRGMLKNSIIVFSTDNGGLPWGK---EPNSGYNWPLRGSKFTLWEGG 121
Query: 65 VRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKN 104
VR A +WSPLL K G++ LMHITDWLPT Y AG++++
Sbjct: 122 VRGAAFVWSPLLSKTGRLSNQLMHITDWLPTFYSAAGEKEH 162
>gi|328705055|ref|XP_001946210.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 470
Score = 100 bits (250), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 52/101 (51%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+K+LD SV V +L +L N++ VF+SDNG PT H N SNWPL+G K +
Sbjct: 171 MMKNLDNSVAEVFNALHTNNMLDNTIFVFVSDNGAPTNG---IHRNYGSNWPLKGEKATP 227
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EGG+R A +WS LL K + L N LMHI+DWLPTLY AG
Sbjct: 228 WEGGIRTAAFVWSKLLSKKKTLPNPLMHISDWLPTLYQAAG 268
>gi|241676246|ref|XP_002411524.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215504222|gb|EEC13716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 490
Score = 100 bits (250), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 68/101 (67%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +LDES+GRV+++L G+L ++++VF SDNGG N NWPLRG K++
Sbjct: 255 MLDALDESMGRVVEALSSAGILEDTIIVFSSDNGG---SPYGLESNRGYNWPLRGAKFTL 311
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EG VR A +W+P LL+K +V LMHI+DWLPTLY AG
Sbjct: 312 WEGSVRVPAFVWNPKLLKKSRVSNQLMHISDWLPTLYTAAG 352
>gi|157108842|ref|XP_001650409.1| arylsulfatase b [Aedes aegypti]
gi|108879187|gb|EAT43412.1| AAEL005134-PA [Aedes aegypti]
Length = 675
Score = 100 bits (250), Expect = 9e-20, Method: Composition-based stats.
Identities = 50/101 (49%), Positives = 67/101 (66%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG++ SL + +L NS+++F+SDNG PT H NT SN+PLRG K
Sbjct: 259 MVSKLDDSVGQIFNSLRSKNMLDNSIILFMSDNGAPTA---ALHANTGSNYPLRGIKSVP 315
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
E R VAA+WSPLL++ Q + N +HI+DWLPTL AG
Sbjct: 316 WEAATRCVAAIWSPLLQERQRVSNQFIHISDWLPTLASAAG 356
>gi|383859596|ref|XP_003705279.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
Length = 689
Score = 100 bits (250), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 52/101 (51%), Positives = 70/101 (69%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ +D+SVG V+ +L + G+L NS+V+F+SDNG PT SL N SN+PLRG K S
Sbjct: 288 VVSKVDQSVGEVMDALRKRGMLENSIVLFMSDNGAPTQGSL---SNQGSNYPLRGIKDSP 344
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EGG R VAA+WSPL+++ Q + N +M I DWLPTL AG
Sbjct: 345 WEGGTRGVAAIWSPLIKESQRVSNQMMFIADWLPTLLSAAG 385
>gi|157108840|ref|XP_001650408.1| arylsulfatase b [Aedes aegypti]
gi|108879186|gb|EAT43411.1| AAEL005134-PB [Aedes aegypti]
Length = 607
Score = 100 bits (249), Expect = 1e-19, Method: Composition-based stats.
Identities = 50/101 (49%), Positives = 67/101 (66%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG++ SL + +L NS+++F+SDNG PT H NT SN+PLRG K
Sbjct: 259 MVSKLDDSVGQIFNSLRSKNMLDNSIILFMSDNGAPTA---ALHANTGSNYPLRGIKSVP 315
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
E R VAA+WSPLL++ Q + N +HI+DWLPTL AG
Sbjct: 316 WEAATRCVAAIWSPLLQERQRVSNQFIHISDWLPTLASAAG 356
>gi|270008947|gb|EFA05395.1| hypothetical protein TcasGA2_TC015567 [Tribolium castaneum]
Length = 513
Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 73/101 (72%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ LD+SVG++++ L+E+ LL NS+++F+SDNG T+ + N+ SNWPLRG K +
Sbjct: 226 IVSRLDDSVGQIVEKLDEKNLLDNSIILFLSDNGAQTIG---VYENSGSNWPLRGLKLTE 282
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGG+R AA++SPL +KG V + L+HITD+LPT Y AG
Sbjct: 283 FEGGIRAPAAIYSPLFHQKGYVSKELIHITDFLPTFYAAAG 323
>gi|322778941|gb|EFZ09355.1| hypothetical protein SINV_05168 [Solenopsis invicta]
Length = 775
Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 48/101 (47%), Positives = 70/101 (69%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ +LD+SVGRV+ +L+ + +L N++++F++DNG T L N SN+PLRG K++
Sbjct: 464 VVATLDKSVGRVVDALKRKDILNNTIIIFMADNGAQTEGYL---ANYGSNYPLRGVKFTL 520
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EG VR VA ++SPL+ R +V L HITDWLPTLY AG
Sbjct: 521 FEGAVRGVACIYSPLIDRSSRVSTKLFHITDWLPTLYSAAG 561
Score = 92.0 bits (227), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 47/100 (47%), Positives = 65/100 (65%), Gaps = 11/100 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVGR++ +L G+L +SL++F++DNG + N SN+PLRG
Sbjct: 158 MVSKLDDSVGRIVHTLGNRGMLRDSLILFLTDNGAAPIGKFR---NFGSNYPLRG----- 209
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIA 99
GGVR V+ LWSP L K +V +NL+HI+DWLPTLY A
Sbjct: 210 --GGVRGVSVLWSPRLNKPARVCDNLIHISDWLPTLYAAA 247
>gi|195333848|ref|XP_002033598.1| GM21416 [Drosophila sechellia]
gi|194125568|gb|EDW47611.1| GM21416 [Drosophila sechellia]
Length = 542
Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats.
Identities = 49/103 (47%), Positives = 69/103 (66%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D+SVG+++ L + +L NS+++F SDNGGP + F+ N ASN+PL+G K +
Sbjct: 231 MVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGP---AQGFNLNFASNYPLKGVKNTL 287
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKE 102
EGGVR +WSPLL+K Q + N MHI DWLPTL AG +
Sbjct: 288 WEGGVRAAGLMWSPLLKKSQRVSNQTMHIVDWLPTLLEAAGGQ 330
>gi|328699373|ref|XP_001945817.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 567
Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats.
Identities = 52/120 (43%), Positives = 75/120 (62%), Gaps = 14/120 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD--------SLHFHGNTASNWP 52
M+ LDESVGRV+++L E+ +L N+++VF+SDNG P+ D N SN+P
Sbjct: 257 MVSKLDESVGRVVEALTEKKMLQNTIIVFMSDNGSPSFDGSGRNFRPEAGVTANWGSNFP 316
Query: 53 LRGTKYSFHEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKE-----KNID 106
RG K + EGGV++ + +W+P ++ +V + LMHITDWLPTLY AG KN+D
Sbjct: 317 YRGIKNTLWEGGVKSASFIWAPYFQENPRVSKQLMHITDWLPTLYSAAGGNPGFLPKNLD 376
>gi|156537546|ref|XP_001607560.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 571
Score = 99.8 bits (247), Expect = 2e-19, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 74/101 (73%), Gaps = 3/101 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ ++D+SVGRV+++L++ +L NS+++F+SDNG PT ++ + N SN+PLRG K +
Sbjct: 254 VVTAMDDSVGRVVKALKDANMLSNSIIIFMSDNGSPTAEAP--YTNYGSNYPLRGIKATV 311
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGGVR A ++SP L+ + +V + L HITDW PTLY +AG
Sbjct: 312 FEGGVRVPACVFSPRLKDRFRVSDELFHITDWFPTLYKLAG 352
>gi|195485249|ref|XP_002091013.1| GE12487 [Drosophila yakuba]
gi|194177114|gb|EDW90725.1| GE12487 [Drosophila yakuba]
Length = 544
Score = 99.8 bits (247), Expect = 2e-19, Method: Composition-based stats.
Identities = 49/101 (48%), Positives = 68/101 (67%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D+SVG+++ L + +L NS+++F SDNGGP + F+ N ASN+PL+G K +
Sbjct: 231 MVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGP---AQGFNLNFASNYPLKGVKNTL 287
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EGGVR +WSPLL+K Q + N MHI DWLPTL AG
Sbjct: 288 WEGGVRAAGLMWSPLLKKSQRVSNQTMHIIDWLPTLLEAAG 328
>gi|194883566|ref|XP_001975872.1| GG20328 [Drosophila erecta]
gi|190659059|gb|EDV56272.1| GG20328 [Drosophila erecta]
Length = 478
Score = 99.8 bits (247), Expect = 2e-19, Method: Composition-based stats.
Identities = 49/101 (48%), Positives = 68/101 (67%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D+SVG+++ L + +L NS+++F SDNGGP + F+ N ASN+PL+G K +
Sbjct: 231 MVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGP---AQGFNLNFASNYPLKGVKNTL 287
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EGGVR +WSPLL+K Q + N MHI DWLPTL AG
Sbjct: 288 WEGGVRAAGLMWSPLLKKSQRVSNQTMHIIDWLPTLLEAAG 328
>gi|270005303|gb|EFA01751.1| hypothetical protein TcasGA2_TC007349 [Tribolium castaneum]
Length = 543
Score = 99.8 bits (247), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 71/110 (64%), Gaps = 7/110 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LD+SVG V++SL + +L NS+++F +DNGG F+ N ASN+PLRG K +
Sbjct: 247 MLSKLDDSVGAVVESLSKRKMLKNSIIIFSTDNGGAAAG---FNLNAASNFPLRGVKNTV 303
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAGKE---KNID 106
EGGVR LWSPL+++ Q V + MHI+DWLPTL G KNID
Sbjct: 304 WEGGVRGAGLLWSPLIKQPQRVAQQFMHISDWLPTLLSAVGANSSLKNID 353
>gi|241574524|ref|XP_002403353.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215500210|gb|EEC09704.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 255
Score = 99.8 bits (247), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/103 (48%), Positives = 68/103 (66%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDES+G V Q+L + G+L N+++V SDNGG + F N N+PLRG K +
Sbjct: 1 MVDALDESIGEVFQALGDAGILENTIIVLSSDNGGLPI---GFQSNRGFNFPLRGGKGTL 57
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKE 102
EGG R +WSPLL RKG + +MH+TDWLPTLY AG++
Sbjct: 58 WEGGCRATGFIWSPLLKRKGVASDQMMHVTDWLPTLYSAAGED 100
>gi|390364993|ref|XP_003730725.1| PREDICTED: arylsulfatase I-like isoform 1 [Strongylocentrotus
purpuratus]
Length = 479
Score = 99.8 bits (247), Expect = 2e-19, Method: Composition-based stats.
Identities = 46/101 (45%), Positives = 68/101 (67%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+S+G + Q+L+E L N+++VF +DNGGP + F N A+NWPLRG K +
Sbjct: 257 MVSALDDSLGNITQTLKESSLYNNTVIVFTTDNGGP---AHGFDANYANNWPLRGVKDTT 313
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGG+R LW L+ K G+ + +MH+ DW+PTLY +AG
Sbjct: 314 WEGGLRGAGFLWGALIEKPGRTSDGMMHVCDWVPTLYGLAG 354
>gi|443705024|gb|ELU01769.1| hypothetical protein CAPTEDRAFT_23096, partial [Capitella teleta]
Length = 354
Score = 99.4 bits (246), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 71/106 (66%), Gaps = 11/106 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT---ASNWPLRGTK 57
M+ +LDESVG + +SL G+L N++++F SDNG P HGNT +SN+PLRG K
Sbjct: 225 MISALDESVGNLTKSLHANGMLENTIIIFSSDNGAP-------HGNTDVCSSNYPLRGAK 277
Query: 58 YSFHEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKE 102
S EGG R A +WSPLL+K G V + +MHI+DWLPTL AG +
Sbjct: 278 SSVWEGGTRVPAFVWSPLLKKSGYVSKQMMHISDWLPTLLKAAGYD 323
>gi|346464573|gb|AEO32131.1| hypothetical protein [Amblyomma maculatum]
Length = 418
Score = 99.4 bits (246), Expect = 2e-19, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 68/102 (66%), Gaps = 7/102 (6%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYSF 60
+ +LDESVGR++ +L + +L NS++VF SDNGG P D NT SNWPLRGTK +
Sbjct: 115 VDALDESVGRIVAALHQRRMLANSVLVFTSDNGGIPWGD----FSNTGSNWPLRGTKGTM 170
Query: 61 HEGGVRNVAALWSPLLRKGQ--VLENLMHITDWLPTLYFIAG 100
EGGVR A +WS +L G V+ + MH+ DWLPTLY AG
Sbjct: 171 WEGGVRLPAVVWSSMLAPGSRVVVPHPMHLVDWLPTLYSAAG 212
>gi|403182690|gb|EJY57566.1| AAEL017192-PA [Aedes aegypti]
Length = 1007
Score = 99.4 bits (246), Expect = 3e-19, Method: Composition-based stats.
Identities = 55/125 (44%), Positives = 71/125 (56%), Gaps = 26/125 (20%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGT---- 56
M+ +DE VG++IQ+L E +L NS+V+F +DNG PTV H N+ SN+PLRG
Sbjct: 682 MISKVDEGVGQIIQTLAERDMLDNSIVLFYADNGAPTVG---MHSNSGSNFPLRGVCIDN 738
Query: 57 -----------------KYSFHEGGVRNVAALWSPLLR--KGQVLENLMHITDWLPTLYF 97
KYS EG VR VAA+WSPLL G+V +H++DWLPTL
Sbjct: 739 WSTLHIRCTNGCFSFQQKYSPWEGAVRTVAAIWSPLLNLTAGRVSNQWIHVSDWLPTLAH 798
Query: 98 IAGKE 102
AG E
Sbjct: 799 AAGIE 803
>gi|390364995|ref|XP_798154.3| PREDICTED: arylsulfatase I-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 476
Score = 99.4 bits (246), Expect = 3e-19, Method: Composition-based stats.
Identities = 46/101 (45%), Positives = 68/101 (67%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+S+G + Q+L+E L N+++VF +DNGGP + F N A+NWPLRG K +
Sbjct: 254 MVSALDDSLGNITQTLKESSLYNNTVIVFTTDNGGP---AHGFDANYANNWPLRGVKDTT 310
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGG+R LW L+ K G+ + +MH+ DW+PTLY +AG
Sbjct: 311 WEGGLRGAGFLWGALIEKPGRTSDGMMHVCDWVPTLYGLAG 351
>gi|241861175|ref|XP_002416317.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215510531|gb|EEC19984.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 99
Score = 99.0 bits (245), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 66/101 (65%), Gaps = 5/101 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDESVG V Q+L +L N++ VF SDNGG D+ + N AS+WPL+G KY+
Sbjct: 1 MVSALDESVGAVFQALNRRNMLSNTVFVFTSDNGG---DTESENANFASSWPLKGQKYTP 57
Query: 61 HEGGVRNVAALWSPLLRK--GQVLENLMHITDWLPTLYFIA 99
EGGV A +WSP + G V NL HI+DWLPTLY +A
Sbjct: 58 WEGGVHAPAVIWSPRFSRLLGSVYSNLFHISDWLPTLYQLA 98
>gi|241844556|ref|XP_002415496.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215509708|gb|EEC19161.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 328
Score = 99.0 bits (245), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 53/97 (54%), Positives = 66/97 (68%), Gaps = 4/97 (4%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD+SVG V+++L + G+L NS+VVF +DNGG N+ NWPLRG K + EGG
Sbjct: 47 LDKSVGLVVEALNKRGMLSNSIVVFSTDNGG---LPWGIDPNSGYNWPLRGAKATLWEGG 103
Query: 65 VRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
VR A +WSPLL K G++ LMHITDWLPTLY AG
Sbjct: 104 VRGAAFVWSPLLFKSGRLSNQLMHITDWLPTLYSAAG 140
>gi|240977207|ref|XP_002402636.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215491217|gb|EEC00858.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 159
Score = 98.6 bits (244), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 68/102 (66%), Gaps = 4/102 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+S+G ++++L+E L+GN++VVF SDNG N NWPLRG K++
Sbjct: 60 MVDALDQSIGTIMEALQEADLIGNTIVVFTSDNGA---LPWGVKSNRGYNWPLRGGKFTL 116
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGK 101
EGG+R +WSPLL + + N +MHITDWLPTLY AG+
Sbjct: 117 WEGGLRTTTFIWSPLLEMSRRVSNQMMHITDWLPTLYSAAGE 158
>gi|195441662|ref|XP_002068622.1| GK20325 [Drosophila willistoni]
gi|194164707|gb|EDW79608.1| GK20325 [Drosophila willistoni]
Length = 550
Score = 98.6 bits (244), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 49/103 (47%), Positives = 68/103 (66%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+SVG+ +++L + G+L NS+++ SDNG PTV H N SN+P RG K S
Sbjct: 254 MISSLDKSVGQTVKALSDNGMLNNSIILLYSDNGAPTVG---IHSNQGSNYPFRGQKESP 310
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R+ A+WSPLL ++G V ++H DWLPTL A E
Sbjct: 311 WEGGIRSAGAIWSPLLQQRGYVSNQIIHAIDWLPTLAAAANVE 353
>gi|380025315|ref|XP_003696421.1| PREDICTED: arylsulfatase B-like [Apis florea]
Length = 546
Score = 98.6 bits (244), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 52/112 (46%), Positives = 76/112 (67%), Gaps = 9/112 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ ++DESVGRVI++L + +L NS++VF+SDNG T L +G SN+PLRG K++
Sbjct: 252 VVTAMDESVGRVIKALGQSSMLENSIIVFMSDNGAQTEGLLENYG---SNYPLRGLKFTL 308
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGK-----EKNID 106
EGG+R VA ++S L++ + N L+HITDWLPT Y AG E+N+D
Sbjct: 309 FEGGIRGVACVYSRLIQNSSRISNELIHITDWLPTFYSAAGGNLENLEENMD 360
>gi|156408341|ref|XP_001641815.1| predicted protein [Nematostella vectensis]
gi|156228955|gb|EDO49752.1| predicted protein [Nematostella vectensis]
Length = 512
Score = 98.6 bits (244), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 48/101 (47%), Positives = 69/101 (68%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ S+D ++ +V+ SL+ + NS++VF +DNGGP F N ASN+PLRG K +
Sbjct: 248 MVSSMDGAIKKVVDSLKARSMYNNSIIVFTTDNGGPANG---FDSNMASNFPLRGVKRTL 304
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGG+R A + SPL+ K G+V+ LMH++DWLPTLY +AG
Sbjct: 305 WEGGIRGTAFIHSPLITKPGRVMTELMHVSDWLPTLYTVAG 345
>gi|241847930|ref|XP_002400856.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215509831|gb|EEC19284.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 275
Score = 98.2 bits (243), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 51/96 (53%), Positives = 66/96 (68%), Gaps = 4/96 (4%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD+SVG V+++L + G+L NS++VF +DNGG N+ NWPLRG K++ EGG
Sbjct: 5 LDKSVGLVMEALNKRGMLKNSIIVFSTDNGGLPWGE---EPNSGYNWPLRGAKFTLWEGG 61
Query: 65 VRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIA 99
VR A +WSPLL K G++ LMHITDWLPTLY A
Sbjct: 62 VRGAAFVWSPLLSKSGRLSNQLMHITDWLPTLYSAA 97
>gi|198466274|ref|XP_001353949.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
gi|198150525|gb|EAL29685.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
Length = 575
Score = 98.2 bits (243), Expect = 5e-19, Method: Composition-based stats.
Identities = 48/103 (46%), Positives = 65/103 (63%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVGR++ +L G L NS+V+F SDNG P+V NT SNWPLRG K +
Sbjct: 265 MVSKLDQSVGRIVSALNSTGQLENSIVIFYSDNGAPSVGMF---ANTGSNWPLRGQKNTP 321
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
EGGVR A+WS L+ +G + +++ DWLP+L AG E
Sbjct: 322 WEGGVRVAGAIWSAQLQARGNIFTQPIYVADWLPSLAHAAGIE 364
>gi|380012883|ref|XP_003690503.1| PREDICTED: arylsulfatase J-like [Apis florea]
Length = 671
Score = 98.2 bits (243), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 54/111 (48%), Positives = 74/111 (66%), Gaps = 8/111 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ LD+SVG V+ +L G+L NS+VVF+SDNG P+ L N SN+PLRG K S
Sbjct: 257 VVSKLDQSVGDVMDALRNRGMLENSIVVFMSDNGAPSQGLL---SNEGSNYPLRGIKNSP 313
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKEK----NID 106
EGG R VAA+WSPL++K + + N +M ++DWLPTL AG ++ NID
Sbjct: 314 WEGGTRGVAAIWSPLIKKSKRVSNQMMFMSDWLPTLLSAAGVDRRQLGNID 364
>gi|328783191|ref|XP_396281.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
Length = 713
Score = 98.2 bits (243), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 54/111 (48%), Positives = 74/111 (66%), Gaps = 8/111 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ LD+SVG V+ +L G+L NS+V+F+SDNG P+ L+ N SN+PLRG K S
Sbjct: 301 VVSKLDQSVGDVMDALRNRGMLENSIVLFMSDNGAPSQGILN---NKGSNYPLRGIKNSP 357
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKEK----NID 106
EGG R VAA+WSPL++K + + N +M I+DWLPTL AG + NID
Sbjct: 358 WEGGTRGVAAIWSPLIKKSKRVSNQMMFISDWLPTLLSAAGVNRKQLGNID 408
>gi|195148952|ref|XP_002015426.1| GL11077 [Drosophila persimilis]
gi|194109273|gb|EDW31316.1| GL11077 [Drosophila persimilis]
Length = 545
Score = 98.2 bits (243), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 68/101 (67%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DESVG ++ L + +L NS+++F SDNGGP + F+ N ASN+PLRG K +
Sbjct: 234 MVAKMDESVGVIVDQLRKSNMLENSIIIFSSDNGGP---AQGFNLNFASNYPLRGVKNTL 290
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAG 100
EGGVR +WSPLL K Q V++ MHI+DWLPTL AG
Sbjct: 291 WEGGVRAAGLVWSPLLEKRQRVVDETMHISDWLPTLIEAAG 331
>gi|198455736|ref|XP_001360091.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
gi|198135374|gb|EAL24665.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
Length = 545
Score = 98.2 bits (243), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 68/101 (67%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DESVG ++ L + +L NS+++F SDNGGP + F+ N ASN+PLRG K +
Sbjct: 234 MVAKMDESVGVIVDQLRKSNMLENSIIIFSSDNGGP---AQGFNLNFASNYPLRGVKNTL 290
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAG 100
EGGVR +WSPLL K Q V++ MHI+DWLPTL AG
Sbjct: 291 WEGGVRAAGLVWSPLLEKRQRVVDETMHISDWLPTLIEAAG 331
>gi|194748066|ref|XP_001956470.1| GF24578 [Drosophila ananassae]
gi|190623752|gb|EDV39276.1| GF24578 [Drosophila ananassae]
Length = 542
Score = 98.2 bits (243), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 67/101 (66%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+SVG+ +++L++ +L N++V+ SDNG PTV H N SN+P RG K S
Sbjct: 218 MISSLDKSVGQTMRALKDNDMLNNTIVLLYSDNGAPTVG---IHSNAGSNYPYRGQKESP 274
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGG+R+V ALWSPLL K G V +H DWLPTL AG
Sbjct: 275 WEGGIRSVGALWSPLLEKQGYVSNQAIHAIDWLPTLAAAAG 315
>gi|328789569|ref|XP_624454.2| PREDICTED: arylsulfatase J-like [Apis mellifera]
Length = 546
Score = 98.2 bits (243), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 51/103 (49%), Positives = 69/103 (66%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDESVG V+++L + +L NS++VF +DNGGP F+ N ASN+PLRGTK +
Sbjct: 252 MLSKLDESVGLVVEALRKRKMLENSVIVFSTDNGGPPAG---FNLNAASNFPLRGTKNTL 308
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGGVR +WSP L+ G++ L+HI+DWLPTL G +
Sbjct: 309 WEGGVRGTGLVWSPKLINPGRISRQLIHISDWLPTLLTAVGGD 351
>gi|195021983|ref|XP_001985495.1| GH14468 [Drosophila grimshawi]
gi|193898977|gb|EDV97843.1| GH14468 [Drosophila grimshawi]
Length = 619
Score = 98.2 bits (243), Expect = 6e-19, Method: Composition-based stats.
Identities = 51/103 (49%), Positives = 66/103 (64%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDESVGRVI +L L NS+V+F SDNG P+V NT SNWPL+G K S
Sbjct: 261 MVSKLDESVGRVITALARTDQLENSIVIFYSDNGAPSVGMF---SNTGSNWPLKGQKNSP 317
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R A+WSPLL+ +G + +++ D+LPTL AG E
Sbjct: 318 WEGGLRVAGAIWSPLLKARGNLFTQPIYVGDFLPTLAHAAGIE 360
>gi|195582835|ref|XP_002081231.1| GD10911 [Drosophila simulans]
gi|194193240|gb|EDX06816.1| GD10911 [Drosophila simulans]
Length = 633
Score = 98.2 bits (243), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 49/103 (47%), Positives = 68/103 (66%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D+SVG+++ L + +L NS+++F SDNGGP F+ N ASN+PL+G K +
Sbjct: 231 MVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGPAQG---FNLNFASNYPLKGVKNTL 287
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKE 102
EGGVR +WSPLL+K Q + N MHI DWLPTL AG +
Sbjct: 288 WEGGVRAAGLMWSPLLKKSQRVSNQTMHIVDWLPTLLEAAGGQ 330
>gi|241595184|ref|XP_002404450.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215502341|gb|EEC11835.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 311
Score = 97.4 bits (241), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 68/101 (67%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+SVG V+ +L++ G+L ++++VF SDNG F N NWPLRG K++
Sbjct: 214 MVDALDQSVGIVVDTLQKAGMLNDTIIVFCSDNG---ALPWGFKSNRGYNWPLRGGKFTL 270
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
+EGGVR A LWS L+ + + N LMHI+DWLPTLY AG
Sbjct: 271 YEGGVRGTAFLWSRRLKNTKRVSNQLMHISDWLPTLYSAAG 311
>gi|195166525|ref|XP_002024085.1| GL22750 [Drosophila persimilis]
gi|194107440|gb|EDW29483.1| GL22750 [Drosophila persimilis]
Length = 575
Score = 97.4 bits (241), Expect = 1e-18, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 64/101 (63%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVGR++ +L G L NS+V+F SDNG P+V NT SNWPLRG K +
Sbjct: 265 MVSKLDQSVGRIVSALNSTGQLENSIVIFYSDNGAPSVGMF---ANTGSNWPLRGQKNTP 321
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGGVR A+WS L+ +G + +++ DWLP+L AG
Sbjct: 322 WEGGVRVAGAIWSAQLQARGNIFTQPIYVADWLPSLAHAAG 362
>gi|345495280|ref|XP_001606377.2| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 545
Score = 97.4 bits (241), Expect = 1e-18, Method: Composition-based stats.
Identities = 50/101 (49%), Positives = 66/101 (65%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M ++D SVG V+++LE+ +L + L+VF +DNGGP + F+ N ASNWPLRG K +
Sbjct: 251 MASAMDASVGAVVEALEKRNMLRDCLIVFSTDNGGP---AEGFNLNAASNWPLRGVKDTL 307
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR V +WS L R G++ E HI DWLPTL AG
Sbjct: 308 FEGGVRGVGLVWSERLARPGRLSEATFHIADWLPTLLRAAG 348
>gi|21483362|gb|AAM52656.1| HL07902p [Drosophila melanogaster]
Length = 365
Score = 97.1 bits (240), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+SV + I +L++ G+L NS+++ SDNG PT+ H N SN+P RG K S
Sbjct: 41 MISSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIG---IHSNAGSNYPYRGQKESP 97
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R+ ALWSPLL+ +G V +H DWLPTL AG
Sbjct: 98 WEGGIRSAGALWSPLLKERGYVSNQAIHAVDWLPTLAGAAG 138
>gi|443732842|gb|ELU17406.1| hypothetical protein CAPTEDRAFT_127365 [Capitella teleta]
Length = 502
Score = 97.1 bits (240), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/104 (51%), Positives = 72/104 (69%), Gaps = 5/104 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTV-DSLHFHGNTASNWPLRGTKYS 59
ML +LDESVG+V ++L +G+L N+++VF SDNG P + +S H G SN+PLRG K S
Sbjct: 245 MLTALDESVGKVTKALHAKGMLENTIIVFSSDNGAPHIGNSDHTVG---SNFPLRGGKPS 301
Query: 60 FHEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKE 102
EGG R A +WSPL +K G + + LMHI+DWLPTL AG +
Sbjct: 302 IWEGGSRVPAFVWSPLFKKSGYISKQLMHISDWLPTLLQAAGYD 345
>gi|195441668|ref|XP_002068625.1| GK20324 [Drosophila willistoni]
gi|194164710|gb|EDW79611.1| GK20324 [Drosophila willistoni]
Length = 525
Score = 97.1 bits (240), Expect = 1e-18, Method: Composition-based stats.
Identities = 52/101 (51%), Positives = 70/101 (69%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVGRVI++L +L NS+V+F+SDNGGP+V H NTASN+PLRG K++
Sbjct: 223 MVSRLDKSVGRVIEALANREMLRNSIVLFLSDNGGPSVGE---HANTASNYPLRGQKHTP 279
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R+ AA+WS R G V + ++I D LPTL AG
Sbjct: 280 WEGGIRSSAAIWSTEFERLGSVWKQRIYIGDLLPTLCSAAG 320
>gi|281363223|ref|NP_610807.3| CG8646 [Drosophila melanogaster]
gi|17945274|gb|AAL48694.1| RE14504p [Drosophila melanogaster]
gi|272432448|gb|AAF58475.2| CG8646 [Drosophila melanogaster]
Length = 562
Score = 97.1 bits (240), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/103 (47%), Positives = 68/103 (66%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D SVG+++ L + +L NS+++F SDNGGP + F+ N ASN+PL+G K +
Sbjct: 251 MVSKMDNSVGQIVDQLRKSNMLENSIIIFSSDNGGP---AQGFNLNFASNYPLKGVKNTL 307
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKE 102
EGGVR +WSPLL+K Q + N MHI DWLPTL AG +
Sbjct: 308 WEGGVRAAGLMWSPLLKKSQRVSNQTMHIIDWLPTLLEAAGGQ 350
>gi|195021979|ref|XP_001985494.1| GH14469 [Drosophila grimshawi]
gi|193898976|gb|EDV97842.1| GH14469 [Drosophila grimshawi]
Length = 560
Score = 97.1 bits (240), Expect = 1e-18, Method: Composition-based stats.
Identities = 51/103 (49%), Positives = 66/103 (64%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDESVGRVI +L L NS+V+F SDNG P+V NT SNWPL+G K S
Sbjct: 261 MVSKLDESVGRVITALARTDQLENSIVIFYSDNGAPSVG---IFANTGSNWPLKGQKDSP 317
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R A+WSPLL+ +G + +++ D+LPTL AG E
Sbjct: 318 WEGGLRVAGAIWSPLLKARGNLFTQPIYVGDFLPTLAHAAGIE 360
>gi|241237879|ref|XP_002401191.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215496126|gb|EEC05767.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 117
Score = 97.1 bits (240), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/104 (48%), Positives = 67/104 (64%), Gaps = 4/104 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +LDES+GRV++ L G+LG++++ F SDNGG N NWPLRG K +
Sbjct: 1 MLDALDESIGRVVEVLGNAGMLGDTIIAFSSDNGGSPFGPFS---NRGFNWPLRGAKRTL 57
Query: 61 HEGGVRNVAALW-SPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGGVR A +W S L+K +V + LMHI+DWLPTLY A ++
Sbjct: 58 WEGGVRVPAFVWSSKFLKKPRVSDQLMHISDWLPTLYSAAVDQR 101
>gi|443722750|gb|ELU11510.1| hypothetical protein CAPTEDRAFT_23094, partial [Capitella teleta]
Length = 549
Score = 96.7 bits (239), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/104 (50%), Positives = 69/104 (66%), Gaps = 11/104 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT---ASNWPLRGTK 57
M+ +LDESVG + +SL G+L N++++F SDNG P H NT +SN+PLRG K
Sbjct: 225 MISALDESVGNLTKSLHTNGMLENTIIIFSSDNGAP-------HENTDVCSSNYPLRGAK 277
Query: 58 YSFHEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
S EGG R A +WSPLL+K G V + +MHI+DWLPTL AG
Sbjct: 278 TSVWEGGTRVPAFVWSPLLKKSGYVSKQMMHISDWLPTLLEAAG 321
>gi|24666175|ref|NP_649023.1| CG7402 [Drosophila melanogaster]
gi|7293925|gb|AAF49287.1| CG7402 [Drosophila melanogaster]
Length = 579
Score = 96.7 bits (239), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+SV + I +L++ G+L NS+++ SDNG PT+ H N SN+P RG K S
Sbjct: 255 MISSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIG---IHSNAGSNYPYRGQKESP 311
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R+ ALWSPLL+ +G V +H DWLPTL AG
Sbjct: 312 WEGGIRSAGALWSPLLKERGYVSNQAIHAVDWLPTLAGAAG 352
>gi|443724925|gb|ELU12719.1| hypothetical protein CAPTEDRAFT_140387 [Capitella teleta]
Length = 542
Score = 96.3 bits (238), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/104 (50%), Positives = 69/104 (66%), Gaps = 11/104 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT---ASNWPLRGTK 57
M+ +LDESVG + +SL G+L N++++F SDNG P H NT +SN+PLRG K
Sbjct: 248 MISALDESVGNLTKSLHTNGMLENTIIIFSSDNGAP-------HENTDVCSSNYPLRGAK 300
Query: 58 YSFHEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
S EGG R A +WSPLL+K G V + +MHI+DWLPTL AG
Sbjct: 301 TSVWEGGTRVPAFVWSPLLKKSGYVSKQMMHISDWLPTLLEAAG 344
>gi|195328507|ref|XP_002030956.1| GM25726 [Drosophila sechellia]
gi|194119899|gb|EDW41942.1| GM25726 [Drosophila sechellia]
Length = 579
Score = 96.3 bits (238), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+SV + I +L++ G+L NS+++ SDNG PT+ H N SN+P RG K S
Sbjct: 255 MVSSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIG---IHSNAGSNYPYRGQKESP 311
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R+ ALWSPLL+ +G V +H DWLPTL AG
Sbjct: 312 WEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG 352
>gi|195380485|ref|XP_002049001.1| GJ21349 [Drosophila virilis]
gi|194143798|gb|EDW60194.1| GJ21349 [Drosophila virilis]
Length = 531
Score = 96.3 bits (238), Expect = 2e-18, Method: Composition-based stats.
Identities = 46/96 (47%), Positives = 66/96 (68%), Gaps = 4/96 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ ++DESVG++++ L + +L S++VF +DNGGP + F+ N ASN+PLRG K +
Sbjct: 229 LVTAMDESVGKIVEQLRKSRMLETSIIVFSTDNGGP---AEGFNSNFASNYPLRGVKNTL 285
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
EGGVR A LWSP L ++ + E MHI DWLPTL
Sbjct: 286 WEGGVRGAALLWSPQLTKRPRTAEQTMHIVDWLPTL 321
>gi|195591209|ref|XP_002085335.1| GD14734 [Drosophila simulans]
gi|194197344|gb|EDX10920.1| GD14734 [Drosophila simulans]
Length = 579
Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+SV + I +L++ G+L NS+++ SDNG PT+ H N SN+P RG K S
Sbjct: 255 MVSSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIG---IHSNAGSNYPYRGQKESP 311
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R+ ALWSPLL+ +G V +H DWLPTL AG
Sbjct: 312 WEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG 352
>gi|241601772|ref|XP_002405049.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215500554|gb|EEC10048.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 451
Score = 95.9 bits (237), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 67/101 (66%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML SLDES+GRV++ L G+L ++++ F SDNGG ++ N NWPLRG K +
Sbjct: 164 MLDSLDESIGRVVEILGNAGMLEDTIIAFSSDNGGLPYNAF---SNRGFNWPLRGAKGTL 220
Query: 61 HEGGVRNVAALW-SPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR A +W S L+K +V + LMHI+DWLPTLY AG
Sbjct: 221 WEGGVRVPAFVWSSKFLKKPRVSDQLMHISDWLPTLYSAAG 261
>gi|194871664|ref|XP_001972882.1| GG13640 [Drosophila erecta]
gi|190654665|gb|EDV51908.1| GG13640 [Drosophila erecta]
Length = 578
Score = 95.9 bits (237), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 48/101 (47%), Positives = 66/101 (65%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+SV + + +L++ G+L NS+++ SDNG PT+ H N SN+P RG K S
Sbjct: 255 MISSLDKSVAQTVGALKDNGMLNNSIILLYSDNGAPTIG---IHSNAGSNYPYRGQKESP 311
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R+ ALWSPLL+ +G V +H DWLPTL AG
Sbjct: 312 WEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG 352
>gi|443701814|gb|ELU00075.1| hypothetical protein CAPTEDRAFT_177949 [Capitella teleta]
Length = 545
Score = 95.9 bits (237), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 49/103 (47%), Positives = 68/103 (66%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+SVG + ++L +G+L N+++VF +DNGGP + F N ASNWPLRG K +
Sbjct: 259 MISALDDSVGNITRTLHAKGMLNNTIIVFSTDNGGP---ADRFDDNCASNWPLRGGKRTM 315
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAGKE 102
EGG+R +WSPLL+ V E+ M I DWLPTL AG +
Sbjct: 316 WEGGLRGNGFVWSPLLKHSNYVSEHFMSIMDWLPTLLDAAGYD 358
>gi|357612332|gb|EHJ67925.1| hypothetical protein KGM_21236 [Danaus plexippus]
Length = 563
Score = 95.5 bits (236), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+L LDESVG V+ +L+ +G+L +S+VVF +DNGG F+ N ASN+PLRG K +
Sbjct: 273 VLTKLDESVGEVVAALKAKGVLNDSIVVFTTDNGGAAAG---FNDNAASNYPLRGVKNTL 329
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR LWSP + ++ +V MH+ DWLPTL AG
Sbjct: 330 WEGGVRGAGWLWSPFIDKRSRVATQRMHLVDWLPTLLSAAG 370
>gi|241596950|ref|XP_002404637.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215500440|gb|EEC09934.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 406
Score = 95.5 bits (236), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 65/102 (63%), Gaps = 4/102 (3%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
L LD+SVG ++++L G L N+++ F +DNGG V F NT+ NWPLRGTK +
Sbjct: 259 LAELDKSVGLIVEALYVSGQLNNTVIAFSTDNGGAPVG---FSANTSPNWPLRGTKGTVA 315
Query: 62 EGGVRNVAALW-SPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGGVR LW S L +G+V + L H+TDW+PT Y AG +
Sbjct: 316 EGGVRGPGFLWSSSLTTRGRVTQQLFHVTDWMPTFYTAAGGQ 357
>gi|241176208|ref|XP_002399501.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215495150|gb|EEC04791.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 418
Score = 95.1 bits (235), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 67/101 (66%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ S+D++ GR++++L G+L N++VVF SDNG V + F N NWPLRG K +
Sbjct: 162 MVDSMDDAFGRLVEALSGAGVLNNTIVVFSSDNGA--VPIIEF-PNRGFNWPLRGAKKTL 218
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EG VR A +WSPLL G+V + +MHI DWLPT Y +AG
Sbjct: 219 WEGAVRVPAFIWSPLLESSGRVSDQMMHIVDWLPTFYSVAG 259
>gi|326677480|ref|XP_003200848.1| PREDICTED: arylsulfatase B-like, partial [Danio rerio]
Length = 358
Score = 95.1 bits (235), Expect = 5e-18, Method: Composition-based stats.
Identities = 48/105 (45%), Positives = 70/105 (66%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+VG + +L+E GL N++++F +DNGG T+ + GN NWPLRG K++
Sbjct: 247 MVSAMDEAVGNITHTLQETGLWDNTVLIFSTDNGGQTL----YGGN---NWPLRGRKWTL 299
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR VA + PL+ K G V L+HI+DWLPT+ +AG N
Sbjct: 300 WEGGVRGVAFVSGPLIEKPGAVNRELIHISDWLPTIAGLAGASTN 344
>gi|156552077|ref|XP_001604760.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 710
Score = 95.1 bits (235), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG V+ +L + +L NS+VVF++DNG T H N SN+PLRG K S
Sbjct: 273 MMSKLDQSVGEVVSALRRKNMLQNSIVVFMADNGAATQG---IHYNRGSNYPLRGIKASA 329
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EG VR AA+WSPL+++ + + N LM I DWLPTL +G
Sbjct: 330 WEGAVRGAAAVWSPLIQRPKRIYNELMSIADWLPTLLSASG 370
>gi|241844558|ref|XP_002415497.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215509709|gb|EEC19162.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 529
Score = 94.7 bits (234), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 50/97 (51%), Positives = 66/97 (68%), Gaps = 4/97 (4%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD+SVG V+++L + G+L NS+V+F +DNGG N+ NWPLRG+K + EGG
Sbjct: 248 LDKSVGLVMEALNKRGMLSNSIVIFSTDNGG---LPWGVEPNSGYNWPLRGSKETNWEGG 304
Query: 65 VRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
R A +WSPLL K G++ +MHITDWLPTLY AG
Sbjct: 305 ARGAAFVWSPLLFKSGRLSNQMMHITDWLPTLYSAAG 341
>gi|118779434|ref|XP_309303.3| AGAP011348-PA [Anopheles gambiae str. PEST]
gi|116131546|gb|EAA05277.3| AGAP011348-PA [Anopheles gambiae str. PEST]
Length = 573
Score = 94.4 bits (233), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D VGR+ +SLE +L N++++F +DNG PT L H N+ SN+PLRG K S
Sbjct: 254 MVSRIDTGVGRIYRSLERRRMLNNTIILFYADNGAPT---LGIHANSGSNYPLRGQKESP 310
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EG VR A +WS LL RKG V +H++DWLPTL AG
Sbjct: 311 WEGAVRGAALIWSRLLPRKGIVSNQWLHVSDWLPTLGHAAG 351
>gi|195436072|ref|XP_002066002.1| GK11604 [Drosophila willistoni]
gi|194162087|gb|EDW76988.1| GK11604 [Drosophila willistoni]
Length = 567
Score = 94.4 bits (233), Expect = 8e-18, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVGR+I +L + L NS+V+F SDNG P+V NT SNWPLRG K +
Sbjct: 256 MVAKLDQSVGRIITTLAQTEQLENSIVIFYSDNGAPSVGQF---ANTGSNWPLRGQKNTP 312
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R A+WS L+ +G +++ DWLPTL AG
Sbjct: 313 WEGGIRVAGAIWSTQLQARGNKFTQPIYVADWLPTLAHAAG 353
>gi|241378410|ref|XP_002409154.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215497456|gb|EEC06950.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 511
Score = 94.4 bits (233), Expect = 9e-18, Method: Composition-based stats.
Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +LDES+G V ++L + +L N+++V +DNGG F N N+PLRG K +
Sbjct: 215 MTDALDESIGEVFRALFDADMLDNTIIVLSADNGGL---PFGFESNRGYNFPLRGGKGTL 271
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGG R A +WSPLL RKG V + +MHITDW+PTLY AG
Sbjct: 272 WEGGCRASAFVWSPLLKRKGVVSDQMMHITDWVPTLYSAAG 312
>gi|195128415|ref|XP_002008659.1| GI13615 [Drosophila mojavensis]
gi|193920268|gb|EDW19135.1| GI13615 [Drosophila mojavensis]
Length = 576
Score = 94.0 bits (232), Expect = 1e-17, Method: Composition-based stats.
Identities = 50/103 (48%), Positives = 65/103 (63%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVGRVI +L L NS+V+F SDNG P+V NT SNWPL+G K S
Sbjct: 263 MVSKLDKSVGRVIGALARAEQLENSIVIFYSDNGAPSVGMF---ANTGSNWPLKGQKNSP 319
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R A+WSPLL+ + V +++ D+LPTL AG E
Sbjct: 320 WEGGLRVAGAIWSPLLKARSSVFTQPIYVGDFLPTLAHAAGIE 362
>gi|195494692|ref|XP_002094947.1| GE19935 [Drosophila yakuba]
gi|194181048|gb|EDW94659.1| GE19935 [Drosophila yakuba]
Length = 577
Score = 94.0 bits (232), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/101 (47%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+SV + I +L++ +L NS+++ SDNG PT+ H N SN+P RG K S
Sbjct: 255 MISSLDKSVAQTIGALKDNDMLNNSIILLYSDNGAPTIG---IHSNAGSNYPYRGQKESP 311
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R+ ALWSPLL+ +G V +H DWLPTL AG
Sbjct: 312 WEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG 352
>gi|195328473|ref|XP_002030939.1| GM24306 [Drosophila sechellia]
gi|194119882|gb|EDW41925.1| GM24306 [Drosophila sechellia]
Length = 554
Score = 94.0 bits (232), Expect = 1e-17, Method: Composition-based stats.
Identities = 48/103 (46%), Positives = 64/103 (62%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVGR+I +L L NS+V+F SDNG P+V NT SN+PLRG K +
Sbjct: 255 MISKLDQSVGRIISTLSSTDQLDNSIVIFYSDNGAPSVGMF---SNTGSNFPLRGQKNTP 311
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
EGGVR A+WS L+ +G + +++ DWLPTL AG E
Sbjct: 312 WEGGVRVAGAIWSSKLQARGSIFSQPLYVADWLPTLSRAAGIE 354
>gi|348535399|ref|XP_003455188.1| PREDICTED: arylsulfatase B [Oreochromis niloticus]
Length = 519
Score = 93.6 bits (231), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 69/101 (68%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+VG + +L++EGL N+++VF +DNGG T+ SNWPLRG K+S
Sbjct: 254 MVSAMDEAVGNITMALQKEGLWNNTVLVFSTDNGGQTLSG-------GSNWPLRGRKWSL 306
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGGVR V + SPLL++ G V L+HI+DWLPTL +AG
Sbjct: 307 WEGGVRGVGFVASPLLKQPGTVNRELIHISDWLPTLVGLAG 347
>gi|195431744|ref|XP_002063888.1| GK15669 [Drosophila willistoni]
gi|194159973|gb|EDW74874.1| GK15669 [Drosophila willistoni]
Length = 556
Score = 93.2 bits (230), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/106 (47%), Positives = 67/106 (63%), Gaps = 4/106 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDESVG ++ L +L NS++VF +DNGG F+ N ASN+PLRG K +
Sbjct: 234 MVSKLDESVGIIVDQLRRSNMLENSIIVFATDNGGAAEG---FNLNFASNYPLRGVKNTL 290
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
EGGVR V LWS L++ +V E MHITDWLPT +AG ++ +
Sbjct: 291 WEGGVRGVGLLWSQRYLKRPRVAEQTMHITDWLPTFVEVAGGKEAL 336
>gi|242002216|ref|XP_002435751.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215499087|gb|EEC08581.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 433
Score = 93.2 bits (230), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/106 (46%), Positives = 71/106 (66%), Gaps = 5/106 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDESVG V+++L G+LG++++VF SDNG +S + N AS WP +G K +
Sbjct: 141 MVSALDESVGAVVEALGRRGMLGDTVLVFSSDNG---ANSRGENPNYASAWPFKGQKITP 197
Query: 61 HEGGVRNVAALWSPLLR--KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR A +WSP L +G+ N+ HI+DWLPTLY +AG + +
Sbjct: 198 WEGGVRAPAIIWSPSLSGTQGRDYSNIFHISDWLPTLYQLAGGDPS 243
>gi|291190498|ref|NP_001167123.1| Arylsulfatase B precursor [Salmo salar]
gi|223648254|gb|ACN10885.1| Arylsulfatase B precursor [Salmo salar]
Length = 528
Score = 93.2 bits (230), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/105 (47%), Positives = 69/105 (65%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+VG + +L+E GL N+++VF +DNGG T+ SNWPLRG K+S
Sbjct: 253 MVSAMDEAVGNITLALQETGLWDNTVLVFSTDNGGQTLAG-------GSNWPLRGRKWSL 305
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL++ G V L+HI+DWLPTL +AG N
Sbjct: 306 WEGGVRGVGFVTSPLLKQPGTVSRQLIHISDWLPTLVGLAGGSTN 350
>gi|291242646|ref|XP_002741217.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 526
Score = 93.2 bits (230), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/101 (47%), Positives = 67/101 (66%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+SVG + ++L++ GL NS++VF +DNGGP H A N PLRGTK +
Sbjct: 254 MVSALDDSVGNITRALKKSGLYNNSIIVFSTDNGGP-------HECEAFNTPLRGTKNTL 306
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAG 100
EGG+R A + S LL K + V E +MH+ DWLPT+Y +AG
Sbjct: 307 WEGGIRGAAFIHSVLLEKPKRVCEGMMHVVDWLPTMYHVAG 347
>gi|443734861|gb|ELU18717.1| hypothetical protein CAPTEDRAFT_218441 [Capitella teleta]
Length = 500
Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/101 (46%), Positives = 67/101 (66%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+SV V ++L E+G+L +++++F +DNGGP + N A NWPLRG+K +
Sbjct: 252 MVSALDDSVSNVTKALHEKGILNDTIIIFSTDNGGPANRQDY---NDACNWPLRGSKRTM 308
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGGVR +WSPLL G V E+LM I DW+PT+ AG
Sbjct: 309 WEGGVRGNGFIWSPLLENSGYVSEHLMQIVDWVPTVLEAAG 349
>gi|158300602|ref|XP_552160.3| AGAP012047-PA [Anopheles gambiae str. PEST]
gi|157013239|gb|EAL38777.3| AGAP012047-PA [Anopheles gambiae str. PEST]
Length = 564
Score = 92.4 bits (228), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 46/101 (45%), Positives = 64/101 (63%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD +VG ++ +L +L N+++VF SDNGGP F+ N ASNWPLRG K +
Sbjct: 263 MMVELDRTVGSLVDALHARDMLENTIIVFSSDNGGPADG---FNDNAASNWPLRGVKNTL 319
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGG+R +WSPLL+ +V ++ + DWLPTLY AG
Sbjct: 320 WEGGLRAAGFIWSPLLQNVSRVSHQMVQVCDWLPTLYEAAG 360
>gi|195591175|ref|XP_002085318.1| GD12374 [Drosophila simulans]
gi|194197327|gb|EDX10903.1| GD12374 [Drosophila simulans]
Length = 554
Score = 92.4 bits (228), Expect = 3e-17, Method: Composition-based stats.
Identities = 48/103 (46%), Positives = 63/103 (61%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVGR+I +L L NS+V+F SDNG P+V NT SN+PLRG K +
Sbjct: 255 MISKLDQSVGRIISTLSSTDQLENSIVIFYSDNGAPSVGMF---SNTGSNFPLRGQKNTP 311
Query: 61 HEGGVRNVAALW-SPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGGVR A+W S L +G + +++ DWLPTL AG E
Sbjct: 312 WEGGVRVAGAIWNSKLQARGSIFSQPLYVADWLPTLSRAAGIE 354
>gi|194748096|ref|XP_001956485.1| GF25237 [Drosophila ananassae]
gi|190623767|gb|EDV39291.1| GF25237 [Drosophila ananassae]
Length = 570
Score = 92.4 bits (228), Expect = 3e-17, Method: Composition-based stats.
Identities = 48/103 (46%), Positives = 62/103 (60%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVGR+I +L L NS+V+F SDNG P+V NT SNWPLRG K +
Sbjct: 258 MVSKLDQSVGRIISTLASTDQLENSIVIFYSDNGAPSVGMF---ANTGSNWPLRGQKNTP 314
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
EGGVR A+WS L +G + +++ DWLPTL A E
Sbjct: 315 WEGGVRVAGAVWSSKLEARGSIFTQPIYVGDWLPTLAHAADIE 357
>gi|449680632|ref|XP_004209636.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
Length = 336
Score = 92.0 bits (227), Expect = 4e-17, Method: Composition-based stats.
Identities = 47/106 (44%), Positives = 68/106 (64%), Gaps = 4/106 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +D +G++ L ++ +L +S+++F SDNGGP + N A+N+PLRGTK +
Sbjct: 80 MLGVMDHWIGQITSELLKQNMLDDSIIIFTSDNGGPANG---LNNNWATNYPLRGTKTTV 136
Query: 61 HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKEKNI 105
+EGGVR A +WS L K +V +LMHITDWLPTL AG + +
Sbjct: 137 YEGGVRGAACIWSKELSKNPRVSNDLMHITDWLPTLLSAAGVDTTL 182
>gi|195166553|ref|XP_002024099.1| GL22854 [Drosophila persimilis]
gi|194107454|gb|EDW29497.1| GL22854 [Drosophila persimilis]
Length = 548
Score = 92.0 bits (227), Expect = 4e-17, Method: Composition-based stats.
Identities = 52/101 (51%), Positives = 67/101 (66%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD SVGRVI +L + +L +S+V+F+SDNGGPTV H +TASN+PLRG K S
Sbjct: 225 MVSRLDRSVGRVIDALARQQMLQDSIVLFLSDNGGPTVGE---HSSTASNYPLRGQKNSP 281
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R+ AA+WS R G V + +I D LPTL AG
Sbjct: 282 WEGGLRSSAAIWSTEFERLGTVWKQQTYIGDLLPTLAAAAG 322
>gi|323454325|gb|EGB10195.1| hypothetical protein AURANDRAFT_62802 [Aureococcus anophagefferens]
Length = 2048
Score = 92.0 bits (227), Expect = 4e-17, Method: Composition-based stats.
Identities = 50/103 (48%), Positives = 62/103 (60%), Gaps = 10/103 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD +V +V+ +L LL N++V F SDNGG D SNWPLRGTK+S
Sbjct: 412 MVMNLDNNVRKVVDALAAAKLLQNTVVAFASDNGGCPRDG-------GSNWPLRGTKFSQ 464
Query: 61 HEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIAG 100
+EGGVR A LWS L+K G V E L H+ DW PTL AG
Sbjct: 465 YEGGVRVPAFLWSSTLQKRGAGTVFEGLFHVADWFPTLVASAG 507
>gi|195494733|ref|XP_002094965.1| GE22117 [Drosophila yakuba]
gi|194181066|gb|EDW94677.1| GE22117 [Drosophila yakuba]
Length = 565
Score = 92.0 bits (227), Expect = 4e-17, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVGR++ SL L NS+V+F SDNG P+V NT SN+PLRG K +
Sbjct: 255 MISKLDQSVGRIMSSLAATDQLENSIVIFYSDNGAPSVGMF---ANTGSNFPLRGQKNTP 311
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGGVR A+WS L+ +G + +++ DWLPTL AG
Sbjct: 312 WEGGVRVAGAIWSSKLQARGSIFSQPLYVADWLPTLCRAAG 352
>gi|198466297|ref|XP_002135151.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
gi|198150535|gb|EDY73778.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
Length = 548
Score = 91.7 bits (226), Expect = 5e-17, Method: Composition-based stats.
Identities = 52/101 (51%), Positives = 66/101 (65%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD SVGRVI +L + +L +S+V+F+SDNGGPTV H TASN+PLRG K S
Sbjct: 225 MVSRLDRSVGRVIDALARQQMLQDSIVLFLSDNGGPTVGE---HSTTASNYPLRGQKNSP 281
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R+ AA+WS R G V + +I D LPTL AG
Sbjct: 282 WEGGLRSSAAIWSTEFERLGTVWKQQTYIGDLLPTLAAAAG 322
>gi|241638976|ref|XP_002410783.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215503545|gb|EEC13039.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 527
Score = 91.7 bits (226), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 50/100 (50%), Positives = 66/100 (66%), Gaps = 4/100 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDES G V+++L E G+L N+++V S NGG S N SN+PLRG K +
Sbjct: 241 MVDALDESAGDVLEALYEAGMLANTIIVMSSANGG---LSSGVESNAGSNFPLRGGKGAL 297
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
EGG R A +WSPLL +K +V + +MHITDWLPTLY A
Sbjct: 298 WEGGTRASAFIWSPLLYQKNRVSDQMMHITDWLPTLYAAA 337
>gi|346465011|gb|AEO32350.1| hypothetical protein [Amblyomma maculatum]
Length = 500
Score = 91.7 bits (226), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 49/102 (48%), Positives = 67/102 (65%), Gaps = 5/102 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
M+ +LD+SVG +++ LE+ G+L ++++VF +DNGG P S N NWPLRG K +
Sbjct: 249 MVDALDQSVGELVEELEKAGILEDTILVFSTDNGGHPYTRS---DPNRGFNWPLRGIKGT 305
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R A +WSP L +V LMHI+DWLPTLY AG
Sbjct: 306 VWEGGIRGTAFIWSPRLELSSRVSTQLMHISDWLPTLYSAAG 347
>gi|194919176|ref|XP_001983034.1| GG19815 [Drosophila erecta]
gi|190647645|gb|EDV45033.1| GG19815 [Drosophila erecta]
Length = 565
Score = 91.7 bits (226), Expect = 5e-17, Method: Composition-based stats.
Identities = 47/103 (45%), Positives = 64/103 (62%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ LD+SVGR++ +L L NS+V+F SDNG P SL NT SN+PLRG K +
Sbjct: 255 IISKLDQSVGRIMSTLAATDQLENSIVIFYSDNGAP---SLGMFANTGSNFPLRGQKNTP 311
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
EGGVR A+WS L+ +G + +++ DWLPTL AG E
Sbjct: 312 WEGGVRVAGAIWSSKLQARGSIFSQPLYVADWLPTLSHAAGIE 354
>gi|156406805|ref|XP_001641235.1| predicted protein [Nematostella vectensis]
gi|156228373|gb|EDO49172.1| predicted protein [Nematostella vectensis]
Length = 498
Score = 91.3 bits (225), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 70/101 (69%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLDES+ +++ SL+ +GL NS+++F +DNGG L + SN+PLRG K +
Sbjct: 224 MVSSLDESMRQLVTSLKRKGLYQNSIIIFTTDNGG-AAGGLDM--SAGSNFPLRGNKNTL 280
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR VA + SPL+ R G+V + LMH +DWLPTL+ +AG
Sbjct: 281 WEGGVRGVAFVHSPLIKRPGRVYDGLMHASDWLPTLHLLAG 321
>gi|443694453|gb|ELT95582.1| hypothetical protein CAPTEDRAFT_115907 [Capitella teleta]
Length = 561
Score = 90.9 bits (224), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 47/103 (45%), Positives = 65/103 (63%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+SVG + ++L +G+L N+++ F +DNGGP F N ASNWPLRG K S
Sbjct: 259 MVSALDDSVGNITKALHAKGILENTIICFSTDNGGPASG---FDFNDASNWPLRGVKESL 315
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
EGGVR +WSPL++ G +M I D+LPTL AG +
Sbjct: 316 WEGGVRGTGFVWSPLMKTSGYTSPAMMQIFDFLPTLLKAAGYD 358
>gi|194871740|ref|XP_001972898.1| GG15780 [Drosophila erecta]
gi|190654681|gb|EDV51924.1| GG15780 [Drosophila erecta]
Length = 565
Score = 90.9 bits (224), Expect = 8e-17, Method: Composition-based stats.
Identities = 46/103 (44%), Positives = 64/103 (62%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ LD+SVGR++ +L L NS+V+F SDNG P+V NT SN+PLRG K +
Sbjct: 255 IISKLDQSVGRIMSTLAATDQLENSIVIFYSDNGAPSVGMF---ANTGSNFPLRGQKNTP 311
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
EGGVR A+WS L+ +G + +++ DWLPTL AG E
Sbjct: 312 WEGGVRVAGAIWSSKLQARGSIFSQPLYVADWLPTLSHAAGIE 354
>gi|410929555|ref|XP_003978165.1| PREDICTED: arylsulfatase B-like [Takifugu rubripes]
Length = 516
Score = 90.9 bits (224), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 68/105 (64%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+VG + +L++ GL N+++VF +DNGG T+ SNWPLRG K+S
Sbjct: 249 MVSAMDEAVGNITLALQKRGLWENTVLVFSTDNGGQTLSG-------GSNWPLRGRKWSL 301
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL + G + L+HI+DWLPTL +AG N
Sbjct: 302 WEGGVRGVGFVASPLLEQPGTISRQLIHISDWLPTLVGLAGGSTN 346
>gi|386771363|ref|NP_730304.2| CG32191 [Drosophila melanogaster]
gi|229368437|gb|ACQ59088.1| MIP05773p [Drosophila melanogaster]
gi|383291992|gb|AAN11683.2| CG32191 [Drosophila melanogaster]
Length = 564
Score = 90.5 bits (223), Expect = 1e-16, Method: Composition-based stats.
Identities = 45/96 (46%), Positives = 61/96 (63%), Gaps = 4/96 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVGR+I +L L NS+V+F SDNG P+V NT SN+PLRG K +
Sbjct: 255 MISKLDQSVGRIITALSSTDQLENSIVIFYSDNGAPSVGMF---SNTGSNFPLRGQKNTP 311
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTL 95
EGGVR A+WS L+ +G + +++ DWLPTL
Sbjct: 312 WEGGVRVAGAIWSSGLQARGSIFRQPLYVADWLPTL 347
>gi|195124259|ref|XP_002006611.1| GI18487 [Drosophila mojavensis]
gi|193911679|gb|EDW10546.1| GI18487 [Drosophila mojavensis]
Length = 528
Score = 90.5 bits (223), Expect = 1e-16, Method: Composition-based stats.
Identities = 42/95 (44%), Positives = 65/95 (68%), Gaps = 4/95 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ ++D+SVGR+++ L + +L NS+++F +DNGGP + F+ N ASN+PLRG K +
Sbjct: 226 LVTAMDDSVGRIVEQLRKSHMLDNSIIIFSTDNGGP---AEGFNSNFASNYPLRGVKNTL 282
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPT 94
EGGVR +WS L ++ +V + MHI DWLPT
Sbjct: 283 WEGGVRGAGLVWSAQLSKRPRVADQTMHIADWLPT 317
>gi|195379278|ref|XP_002048407.1| GJ13952 [Drosophila virilis]
gi|194155565|gb|EDW70749.1| GJ13952 [Drosophila virilis]
Length = 574
Score = 90.5 bits (223), Expect = 1e-16, Method: Composition-based stats.
Identities = 49/103 (47%), Positives = 63/103 (61%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVGRVI +L L NS+V+F SDNG P+V NT SNWPL+G K S
Sbjct: 263 MVSKLDQSVGRVITALARAEQLENSIVIFYSDNGAPSVGMF---ANTGSNWPLKGQKNSP 319
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
EGGVR +WS LL +G + +++ D+LPTL AG E
Sbjct: 320 WEGGVRVAGVIWSQLLTARGHLFTQPIYVGDFLPTLAHAAGIE 362
>gi|346464549|gb|AEO32119.1| hypothetical protein [Amblyomma maculatum]
Length = 531
Score = 90.5 bits (223), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 67/102 (65%), Gaps = 5/102 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
M+ +LD+SVG +++ LE+ G+L ++++VF +DNGG P + N NWPLRG K +
Sbjct: 249 MMDALDQSVGELVEELEKAGILEDTILVFSTDNGGHPYTRA---DPNRGFNWPLRGIKGT 305
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R A +WSP L +V LMHI+DWLPTLY AG
Sbjct: 306 VWEGGIRGTAFIWSPRLEVSRRVSTQLMHISDWLPTLYSAAG 347
>gi|194756524|ref|XP_001960527.1| GF13402 [Drosophila ananassae]
gi|190621825|gb|EDV37349.1| GF13402 [Drosophila ananassae]
Length = 541
Score = 90.5 bits (223), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/96 (47%), Positives = 63/96 (65%), Gaps = 4/96 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DESVG ++ L + +L NS+++F SDNGGP F+ N ASN+PL+G K +
Sbjct: 232 MVTKMDESVGLIVDQLRKSNMLENSIIIFSSDNGGPAQG---FNLNFASNYPLKGVKNTL 288
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTL 95
EGGVR +WSPLL+ Q V +HI+DWLPTL
Sbjct: 289 WEGGVRAAGLIWSPLLKSRQRVSTQTIHISDWLPTL 324
>gi|296483766|tpg|DAA25881.1| TPA: arylsulfatase B [Bos taurus]
Length = 429
Score = 90.1 bits (222), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 50/105 (47%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE+VG V +LE GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 269 MASVMDEAVGNVTAALERRGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 321
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL RKG L+HI+DWLPTL +AG N
Sbjct: 322 WEGGVRGVGFVASPLLKRKGVKTRELIHISDWLPTLVKLAGGSTN 366
>gi|449680619|ref|XP_002157149.2| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
Length = 502
Score = 90.1 bits (222), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/100 (47%), Positives = 66/100 (66%), Gaps = 3/100 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +D VGRV +L ++ +L NS+++F +DNGGP F N A+N+PLRG K +
Sbjct: 244 MLGYMDYGVGRVHDALAKKKMLDNSIIIFTTDNGGPANG---FDYNWANNFPLRGVKATL 300
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR V+ ++S L+ +V L+HITDWLPTL +AG
Sbjct: 301 FEGGVRGVSFVYSKLIESPRVSHELIHITDWLPTLVNLAG 340
>gi|260795396|ref|XP_002592691.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
gi|229277914|gb|EEN48702.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
Length = 485
Score = 90.1 bits (222), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/111 (42%), Positives = 70/111 (63%), Gaps = 8/111 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD++VG V ++L G+L NS+++F +DNGGP F N ASNWPLRG K +
Sbjct: 222 MVSALDDAVGNVTKALSARGMLENSVIIFTTDNGGPAAG---FDQNYASNWPLRGVKNTL 278
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAGKE----KNID 106
EGGV + SPL+++ + L+H+ D LPT+Y +AG + KN+D
Sbjct: 279 WEGGVHGTGFVHSPLIKQPKRTTHELLHVCDLLPTIYELAGGDSTELKNLD 329
>gi|156402612|ref|XP_001639684.1| predicted protein [Nematostella vectensis]
gi|156226814|gb|EDO47621.1| predicted protein [Nematostella vectensis]
Length = 380
Score = 90.1 bits (222), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/102 (45%), Positives = 72/102 (70%), Gaps = 6/102 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLH-FHGNTASNWPLRGTKYS 59
M+ +D+S+G V ++LE+ +L NS+++F +DNGG + H F+ N SN+PLRG K
Sbjct: 230 MVTCVDDSIGEVFRALEKNRMLNNSVILFTTDNGG----APHGFNRNQGSNYPLRGGKDM 285
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGGVR A ++S L++ KG+V +L+ +TDW+PTLY++AG
Sbjct: 286 MWEGGVRGTAFIYSDLIKHKGRVSTDLIDVTDWVPTLYYLAG 327
>gi|155372077|ref|NP_001094645.1| arylsulfatase B precursor [Bos taurus]
gi|151554899|gb|AAI48140.1| ARSB protein [Bos taurus]
Length = 533
Score = 90.1 bits (222), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/105 (49%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE+VG V +LE GL N++ +F +DNGG T+ GN NWPLRG K+S
Sbjct: 269 MASVMDEAVGNVTAALERRGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 321
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL RKG L+HI+DWLPTL +AG N
Sbjct: 322 WEGGVRGVGFVASPLLKRKGVKTRELIHISDWLPTLVKLAGGSTN 366
>gi|194748074|ref|XP_001956474.1| GF24576 [Drosophila ananassae]
gi|190623756|gb|EDV39280.1| GF24576 [Drosophila ananassae]
Length = 583
Score = 89.7 bits (221), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/107 (46%), Positives = 69/107 (64%), Gaps = 4/107 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG V+ +L + +L NS+++F+SDNGGPT+ H TASN+PLRG K S
Sbjct: 264 MVSRLDKSVGHVVDALARQDMLQNSILLFLSDNGGPTLGE---HATTASNYPLRGQKNSP 320
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R+ AA+WS R G V + ++I D LPTL AG + D
Sbjct: 321 WEGGIRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAGISPDPD 367
>gi|391345592|ref|XP_003747069.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 557
Score = 89.7 bits (221), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/101 (44%), Positives = 68/101 (67%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG+V+++L + LL ++++VF+SDNGG T + NT SN+P RG K +
Sbjct: 305 MVDELDQSVGQVVEALYRKRLLDSTMIVFLSDNGGQTTGVM---NNTGSNYPFRGQKRTL 361
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGG R A +WS +++K ++ LMH+TDWLPT+ AG
Sbjct: 362 FEGGTRVSAFVWSTDIVKKPRIESGLMHVTDWLPTILKRAG 402
>gi|348501876|ref|XP_003438495.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
Length = 571
Score = 89.7 bits (221), Expect = 2e-16, Method: Composition-based stats.
Identities = 46/105 (43%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LD V +V+Q L+ +GL NS++++ SDNGG + SNWPLRG K ++
Sbjct: 265 MLSCLDSGVAQVVQELKTQGLYENSVLIYSSDNGGQPLSG-------GSNWPLRGGKGTY 317
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R V + SPLL +KG V L+H++DW PTL +AG +++
Sbjct: 318 WEGGIRAVGFVHSPLLKKKGIVSRALIHVSDWYPTLLGLAGAQQS 362
>gi|440902784|gb|ELR53530.1| Arylsulfatase B, partial [Bos grunniens mutus]
Length = 431
Score = 89.7 bits (221), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/105 (49%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE+VG V +LE GL N++ +F +DNGG T+ GN NWPLRG K+S
Sbjct: 167 MASVMDEAVGNVTAALERRGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 219
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL RKG L+HI+DWLPTL +AG N
Sbjct: 220 WEGGVRGVGFVASPLLKRKGVKTRELIHISDWLPTLVKLAGGSTN 264
>gi|432104048|gb|ELK30879.1| Arylsulfatase J [Myotis davidii]
Length = 496
Score = 89.4 bits (220), Expect = 2e-16, Method: Composition-based stats.
Identities = 49/101 (48%), Positives = 64/101 (63%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE+VG V +L GL NS++++ SDNGG P+ SNWPLRG+K +
Sbjct: 193 MLSCLDEAVGNVTLALRAHGLYNNSILIYSSDNGGQPSAG--------GSNWPLRGSKGT 244
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIA 99
+ EGG+R V + SPLLR +G V L+HITDW PTL +A
Sbjct: 245 YWEGGIRAVGFVHSPLLRQRGTVCRELVHITDWYPTLVALA 285
>gi|241619161|ref|XP_002407085.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215500931|gb|EEC10425.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 588
Score = 89.4 bits (220), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 70/106 (66%), Gaps = 5/106 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDESVG V+++L + G+L ++++VF SDNGG D+ + N AS+WP +G K +
Sbjct: 292 MVSALDESVGAVVEALGKRGMLSDTVLVFSSDNGG---DTNGENPNYASSWPFKGQKRTL 348
Query: 61 HEGGVRNVAALWSPLLR--KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+ +WSPL +G N+ HI+DWLPTLY +AG + +
Sbjct: 349 WEGGIHVPGFIWSPLFSGMRGFDYNNIFHISDWLPTLYQLAGGDPS 394
>gi|37182416|gb|AAQ89010.1| APRG372 [Homo sapiens]
Length = 515
Score = 89.4 bits (220), Expect = 3e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393
>gi|291232535|ref|XP_002736216.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 784
Score = 89.0 bits (219), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/103 (47%), Positives = 65/103 (63%), Gaps = 13/103 (12%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VGRV+ +L+ GL NS++VF +DNGG NWPLRG+K S
Sbjct: 263 MVTCMDEAVGRVVTALKHTGLWENSVLVFSTDNGGI----------LGGNWPLRGSKASL 312
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
+EGGVR V + SPLL +G V L+H+TDWLPTL +AG
Sbjct: 313 YEGGVRGVGFVTSPLLPSRVRGTVNRELLHMTDWLPTLVRLAG 355
>gi|241619164|ref|XP_002407086.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215500932|gb|EEC10426.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 383
Score = 89.0 bits (219), Expect = 3e-16, Method: Composition-based stats.
Identities = 44/102 (43%), Positives = 66/102 (64%), Gaps = 5/102 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDESVG V+++L + G+L ++++VF SDNG T + N AS+WP +G K +
Sbjct: 126 MVSALDESVGAVVEALGKRGMLNDTVLVFSSDNGADTNSE---NANYASSWPFKGQKMTP 182
Query: 61 HEGGVRNVAALWSPLLR--KGQVLENLMHITDWLPTLYFIAG 100
EGGV +WSPL +G ++ H++DWLPTLY +AG
Sbjct: 183 WEGGVHVPGIIWSPLFSGMRGIDYTHIFHVSDWLPTLYQLAG 224
>gi|344277505|ref|XP_003410541.1| PREDICTED: arylsulfatase J [Loxodonta africana]
Length = 599
Score = 89.0 bits (219), Expect = 3e-16, Method: Composition-based stats.
Identities = 51/108 (47%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +LE G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLACLDEAINNVTLALETYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLITLA--EGQID 393
>gi|241619159|ref|XP_002407084.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215500930|gb|EEC10424.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 502
Score = 89.0 bits (219), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/106 (44%), Positives = 63/106 (59%), Gaps = 5/106 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDESVG V ++L G+L ++VV +DNG G S WP RG K +
Sbjct: 254 MLSVLDESVGAVFEALGRRGILNETIVVLTTDNGAGITTQFSSFG---SGWPFRGQKGTA 310
Query: 61 HEGGVRNVAALWSPLL--RKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR A +WSPL +G V+E+L H++DWLPT Y +AG + +
Sbjct: 311 WEGGVRVPAVVWSPLFSEHRGAVVESLFHVSDWLPTFYELAGGDPS 356
>gi|117558328|gb|AAI27451.1| Arsb protein [Rattus norvegicus]
Length = 523
Score = 89.0 bits (219), Expect = 3e-16, Method: Composition-based stats.
Identities = 49/101 (48%), Positives = 64/101 (63%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDE+VG V ++L+ GL N++++F +DNGG T GN NWPLRG K +
Sbjct: 259 MVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQTRSG----GN---NWPLRGRKGTL 311
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R + SPLL+ KG LMHITDWLPTL +AG
Sbjct: 312 WEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAG 352
>gi|410956991|ref|XP_003985119.1| PREDICTED: arylsulfatase J [Felis catus]
Length = 621
Score = 88.6 bits (218), Expect = 4e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 273 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 324
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 325 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 370
>gi|291235057|ref|XP_002737462.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
partial [Saccoglossus kowalevskii]
Length = 355
Score = 88.6 bits (218), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 68/108 (62%), Gaps = 9/108 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE++ + ++LEE+ +L NS++VF SDNGG V+S +NWPLRG K++
Sbjct: 240 MTSCVDEAIANITEALEEKNMLKNSIIVFTSDNGGALVNS------AGNNWPLRGGKHTS 293
Query: 61 HEGGVRNVAALWSPLL-RKGQVLEN--LMHITDWLPTLYFIAGKEKNI 105
EGGVR V ++ LL R + EN LMHITDW PTL +AG ++
Sbjct: 294 WEGGVRAVGFVYGDLLPRHSRGTENTGLMHITDWFPTLITLAGGNPHV 341
>gi|291238953|ref|XP_002739390.1| PREDICTED: arylsulfatase B-like [Saccoglossus kowalevskii]
Length = 393
Score = 88.6 bits (218), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 46/103 (44%), Positives = 64/103 (62%), Gaps = 9/103 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE++ + ++LEE+ +L NS++VF SDNGG V G+ +NWPLRG K++
Sbjct: 133 MTSCVDEAIANITEALEEKNMLKNSIIVFTSDNGGALV------GSAGNNWPLRGGKHTS 186
Query: 61 HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR V ++ LL +G LMHITDW PTL +AG
Sbjct: 187 WEGGVRAVGFVYGDLLPCHSRGTENTGLMHITDWFPTLITLAG 229
>gi|345316675|ref|XP_001517879.2| PREDICTED: arylsulfatase B-like [Ornithorhynchus anatinus]
Length = 782
Score = 88.6 bits (218), Expect = 5e-16, Method: Composition-based stats.
Identities = 48/105 (45%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++++F +DNGG T+ GN NWPLRG K++
Sbjct: 518 MVSIMDEAVGNVTAALKRHGLWDNTVLIFSTDNGGQTLAG----GN---NWPLRGRKWTL 570
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR + + SPLL+ KG L+HI+DWLPTL +AG N
Sbjct: 571 WEGGVRGIGFVASPLLKQKGVKNRELIHISDWLPTLVNLAGGHVN 615
>gi|291238558|ref|XP_002739195.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 495
Score = 88.6 bits (218), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 68/108 (62%), Gaps = 9/108 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE++ + ++LEE+ +L NS++VF SDNGG V+S +NWPLRG K++
Sbjct: 240 MTSCVDEAIANITEALEEKNMLKNSIIVFTSDNGGALVNS------AGNNWPLRGGKHTS 293
Query: 61 HEGGVRNVAALWSPLL-RKGQVLEN--LMHITDWLPTLYFIAGKEKNI 105
EGGVR V ++ LL R + EN LMHITDW PTL +AG ++
Sbjct: 294 WEGGVRAVGFVYGDLLPRHSRGTENTGLMHITDWFPTLITLAGGNPHV 341
>gi|109389362|ref|NP_078866.3| arylsulfatase J precursor [Homo sapiens]
gi|74722580|sp|Q5FYB0.1|ARSJ_HUMAN RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
gi|58201086|gb|AAW66666.1| arylsulfatase J [Homo sapiens]
gi|124376924|gb|AAI32880.1| ARSJ protein [Homo sapiens]
gi|124376926|gb|AAI32882.1| ARSJ protein [Homo sapiens]
gi|219521550|gb|AAI44266.1| ARSJ protein [Homo sapiens]
Length = 599
Score = 88.2 bits (217), Expect = 5e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393
>gi|100801472|emb|CAJ18095.1| arylsulfatase J [Homo sapiens]
Length = 596
Score = 88.2 bits (217), Expect = 5e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393
>gi|58477551|gb|AAH89445.1| ARSJ protein [Homo sapiens]
Length = 578
Score = 88.2 bits (217), Expect = 5e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393
>gi|326437895|gb|EGD83465.1| hypothetical protein PTSG_04073 [Salpingoeca sp. ATCC 50818]
Length = 562
Score = 88.2 bits (217), Expect = 6e-16, Method: Composition-based stats.
Identities = 47/107 (43%), Positives = 64/107 (59%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTV--DSLHFHGNTASNWPLRGTKY 58
ML LDE +G V ++L+ G N+L+VF +DNGGP + DS+ A N+P+RG K+
Sbjct: 237 MLSCLDEGIGNVTRALKRNGFYNNTLIVFTTDNGGPILGGDSV-----GARNYPMRGGKH 291
Query: 59 SFHEGGVRNVAALWSP---LLRKGQVLENLMHITDWLPTLYFIAGKE 102
+ +EGGVR +A + P + G NLMH DWLPTL AG E
Sbjct: 292 AVYEGGVRGIAFVHPPQNTFVETGAGYHNLMHAADWLPTLCDAAGVE 338
>gi|241598569|ref|XP_002404905.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215502397|gb|EEC11891.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 533
Score = 88.2 bits (217), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 48/106 (45%), Positives = 67/106 (63%), Gaps = 8/106 (7%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHF--HGNTASNWPLRGTKYS 59
+ +LD+SVG V+++L E +L N+++V +DNGG L F N N+PLRG K +
Sbjct: 246 VDALDQSVGEVLETLYEADMLDNTIIVLSTDNGG-----LPFGVQSNRGYNFPLRGAKGT 300
Query: 60 FHEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKEKN 104
EGG R A +WSPLL + +V + ++HITDWLPTLY AG N
Sbjct: 301 LWEGGTRGSAFIWSPLLNQNRRVSDQMLHITDWLPTLYSAAGMRGN 346
>gi|426345299|ref|XP_004040357.1| PREDICTED: arylsulfatase J [Gorilla gorilla gorilla]
Length = 599
Score = 88.2 bits (217), Expect = 6e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393
>gi|410038636|ref|XP_526667.3| PREDICTED: arylsulfatase J isoform 2 [Pan troglodytes]
gi|410210212|gb|JAA02325.1| arylsulfatase family, member J [Pan troglodytes]
gi|410253696|gb|JAA14815.1| arylsulfatase family, member J [Pan troglodytes]
gi|410298378|gb|JAA27789.1| arylsulfatase family, member J [Pan troglodytes]
gi|410351985|gb|JAA42596.1| arylsulfatase family, member J [Pan troglodytes]
Length = 598
Score = 88.2 bits (217), Expect = 6e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393
>gi|355687554|gb|EHH26138.1| hypothetical protein EGK_16035 [Macaca mulatta]
Length = 599
Score = 88.2 bits (217), Expect = 6e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393
>gi|355669614|gb|AER94587.1| arylsulfatase family, member J [Mustela putorius furo]
Length = 600
Score = 88.2 bits (217), Expect = 6e-16, Method: Composition-based stats.
Identities = 51/108 (47%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++V+ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIVYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393
>gi|68437903|ref|XP_692213.1| PREDICTED: arylsulfatase I-like [Danio rerio]
Length = 562
Score = 88.2 bits (217), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DESVG ++ L E G NS++++ SDNGG + + NWPLRG K S+
Sbjct: 249 MVSGVDESVGEIVSELRERGYYDNSVLIYSSDNGGQPL-------SGGCNWPLRGGKGSY 301
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL RKG V L+HI+DW PTL +AG ++
Sbjct: 302 WEGGVRAVGFVHSPLLKRKGVVSHALIHISDWYPTLLSLAGYRES 346
>gi|397519903|ref|XP_003830091.1| PREDICTED: arylsulfatase J [Pan paniscus]
Length = 596
Score = 88.2 bits (217), Expect = 6e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393
>gi|109075466|ref|XP_001096903.1| PREDICTED: arylsulfatase J [Macaca mulatta]
Length = 596
Score = 87.8 bits (216), Expect = 7e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393
>gi|441658369|ref|XP_003269374.2| PREDICTED: arylsulfatase J [Nomascus leucogenys]
Length = 597
Score = 87.8 bits (216), Expect = 7e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 295 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 346
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 347 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 392
>gi|355749520|gb|EHH53919.1| hypothetical protein EGM_14634 [Macaca fascicularis]
Length = 599
Score = 87.8 bits (216), Expect = 7e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393
>gi|114145538|ref|NP_001041352.1| arylsulfatase J [Rattus norvegicus]
gi|81158024|tpe|CAI84986.1| TPA: arylsulfatase J [Rattus norvegicus]
gi|149025900|gb|EDL82143.1| arylsulfatase J [Rattus norvegicus]
Length = 597
Score = 87.8 bits (216), Expect = 7e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 294 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 345
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 346 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 391
>gi|148680337|gb|EDL12284.1| arylsulfatase J [Mus musculus]
Length = 572
Score = 87.8 bits (216), Expect = 7e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 268 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 319
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 320 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 365
>gi|26330047|dbj|BAC28762.1| unnamed protein product [Mus musculus]
Length = 614
Score = 87.8 bits (216), Expect = 7e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 294 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 345
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 346 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 391
>gi|291232045|ref|XP_002735970.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 500
Score = 87.8 bits (216), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 68/108 (62%), Gaps = 9/108 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE++ + ++LEE+ +L NS++VF SDNGG V+S +NWPLRG K++
Sbjct: 240 MTSCVDEAIANITEALEEKNMLKNSIIVFTSDNGGALVNS------AGNNWPLRGGKHTS 293
Query: 61 HEGGVRNVAALWSPLL-RKGQVLEN--LMHITDWLPTLYFIAGKEKNI 105
EGGVR + ++ LL R + EN LMHITDW PTL +AG ++
Sbjct: 294 WEGGVRAIGFVYGDLLPRHSRGTENTGLMHITDWFPTLITLAGGNPHV 341
>gi|402870284|ref|XP_003899162.1| PREDICTED: arylsulfatase J [Papio anubis]
Length = 597
Score = 87.8 bits (216), Expect = 8e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393
>gi|449680623|ref|XP_004209633.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
Length = 327
Score = 87.8 bits (216), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 48/100 (48%), Positives = 70/100 (70%), Gaps = 3/100 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +D +GRV ++L E+ +L NS+++F +DNGGP S F+ N A+N+PLRG K +
Sbjct: 62 MLAYMDYGIGRVHEALIEKKMLDNSVIIFTTDNGGP---SNGFNNNWANNFPLRGVKATL 118
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR V ++S L+ K +V +L+H+TDWLPTL +AG
Sbjct: 119 FEGGVRGVGFVYSKLIEKSRVSHDLIHVTDWLPTLVSLAG 158
>gi|403275516|ref|XP_003929486.1| PREDICTED: arylsulfatase J [Saimiri boliviensis boliviensis]
Length = 601
Score = 87.8 bits (216), Expect = 8e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 298 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 349
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 350 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 395
>gi|296195717|ref|XP_002745502.1| PREDICTED: arylsulfatase J [Callithrix jacchus]
Length = 605
Score = 87.8 bits (216), Expect = 8e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 302 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 353
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 354 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 399
>gi|194871676|ref|XP_001972885.1| GG13639 [Drosophila erecta]
gi|190654668|gb|EDV51911.1| GG13639 [Drosophila erecta]
Length = 584
Score = 87.8 bits (216), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG VI +L + +L NS+V+F+SDNGGPT H TASN+PLRG K S
Sbjct: 264 MVSKLDQSVGSVIDALARQEMLQNSIVLFLSDNGGPTQGQ---HSTTASNYPLRGQKDSP 320
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EG +R+ AA+WS R G V + ++I D LPTL AG
Sbjct: 321 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAG 361
>gi|241025894|ref|XP_002406215.1| arylsulfatase J, putative [Ixodes scapularis]
gi|215491896|gb|EEC01537.1| arylsulfatase J, putative [Ixodes scapularis]
Length = 437
Score = 87.8 bits (216), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 47/102 (46%), Positives = 67/102 (65%), Gaps = 3/102 (2%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
L +D+S+G++ ++L LL N+++VF+S +GG V F N + N+PLRGTK +
Sbjct: 247 LTEVDQSLGQIFEALHNRRLLNNTILVFVSASGGMPVGD--FTNNLSFNFPLRGTKGTCF 304
Query: 62 EGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKE 102
EGGVR A +WSPLL K + LMH+TDWLPTL+ AG +
Sbjct: 305 EGGVRVPAFVWSPLLNKTRRTSSQLMHVTDWLPTLFSAAGGD 346
>gi|114326206|ref|NP_001041581.1| arylsulfatase J [Canis lupus familiaris]
gi|81158066|tpe|CAI85007.1| TPA: arylsulfatase J [Canis lupus familiaris]
Length = 598
Score = 87.8 bits (216), Expect = 8e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 294 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 345
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 346 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 391
>gi|149698442|ref|XP_001503367.1| PREDICTED: arylsulfatase J [Equus caballus]
Length = 598
Score = 87.8 bits (216), Expect = 8e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393
>gi|301775027|ref|XP_002922933.1| PREDICTED: arylsulfatase J-like [Ailuropoda melanoleuca]
Length = 600
Score = 87.4 bits (215), Expect = 9e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393
>gi|27734088|ref|NP_775627.1| arylsulfatase J precursor [Mus musculus]
gi|77416378|sp|Q8BM89.1|ARSJ_MOUSE RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
gi|26329953|dbj|BAC28715.1| unnamed protein product [Mus musculus]
gi|81158042|tpe|CAI84995.1| TPA: arylsulfatase J [Mus musculus]
gi|109734872|gb|AAI17814.1| Arylsulfatase J [Mus musculus]
Length = 598
Score = 87.4 bits (215), Expect = 9e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 294 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 345
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 346 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 391
>gi|301609482|ref|XP_002934299.1| PREDICTED: arylsulfatase J-like [Xenopus (Silurana) tropicalis]
Length = 564
Score = 87.4 bits (215), Expect = 9e-16, Method: Composition-based stats.
Identities = 44/103 (42%), Positives = 66/103 (64%), Gaps = 8/103 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LD++V V +L++ G NS++++ SDNGG + +NWPLRG+K ++
Sbjct: 267 MLSCLDDAVNNVTTALKKYGFYDNSIIIYSSDNGGQPMAG-------GNNWPLRGSKGTY 319
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R V + SP+L+ KG + + L+HITDW PTL +AG E
Sbjct: 320 WEGGIRAVGFVHSPILKMKGYICKELVHITDWFPTLVTLAGGE 362
>gi|281339106|gb|EFB14690.1| hypothetical protein PANDA_011975 [Ailuropoda melanoleuca]
Length = 595
Score = 87.4 bits (215), Expect = 9e-16, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393
>gi|291401248|ref|XP_002717219.1| PREDICTED: arylsulfatase J [Oryctolagus cuniculus]
Length = 601
Score = 87.4 bits (215), Expect = 9e-16, Method: Composition-based stats.
Identities = 51/108 (47%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE+V V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 298 MLSCLDEAVSNVTLALKTYGFYNNSILIYSSDNGGQPTAG--------GSNWPLRGSKGT 349
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 350 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 395
>gi|291227813|ref|XP_002733877.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 490
Score = 87.4 bits (215), Expect = 1e-15, Method: Composition-based stats.
Identities = 44/101 (43%), Positives = 62/101 (61%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +G V ++L+E G+ N+++VF+SDNGG NWPLRG K S
Sbjct: 232 MVTCMDEGIGNVTEALKESGMWDNTVLVFVSDNGGEL-------SAGGRNWPLRGGKGSV 284
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLM-HITDWLPTLYFIAG 100
EGG+R V+ + S L+ + Q + N M HITDW PTL +AG
Sbjct: 285 FEGGIRTVSFVTSSLIERPQRISNAMIHITDWFPTLTHLAG 325
>gi|26338057|dbj|BAC32714.1| unnamed protein product [Mus musculus]
Length = 570
Score = 87.4 bits (215), Expect = 1e-15, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 294 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 345
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 346 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 391
>gi|26343103|dbj|BAC35208.1| unnamed protein product [Mus musculus]
Length = 555
Score = 87.4 bits (215), Expect = 1e-15, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 294 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 345
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 346 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 391
>gi|119626708|gb|EAX06303.1| hCG20717, isoform CRA_a [Homo sapiens]
Length = 427
Score = 87.0 bits (214), Expect = 1e-15, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 124 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 175
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 176 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 221
>gi|291227280|ref|XP_002733615.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 499
Score = 87.0 bits (214), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/103 (47%), Positives = 67/103 (65%), Gaps = 11/103 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V+ +L+ GL NS++VF +DNGG + SL NWPLRG K S
Sbjct: 243 MVTCMDEAVGGVVATLKHSGLWENSVLVFSTDNGG--IHSL------GGNWPLRGGKASL 294
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
+EGGVR VA + SPLL +G + + L+H+TDW PTL +AG
Sbjct: 295 YEGGVRGVAFVTSPLLPSRVRGTINKELLHMTDWFPTLVRLAG 337
>gi|323449751|gb|EGB05637.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 533
Score = 87.0 bits (214), Expect = 1e-15, Method: Composition-based stats.
Identities = 43/100 (43%), Positives = 59/100 (59%), Gaps = 3/100 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDE+VG V +L+ +++VVF+SDNGGPT G ++NWPLRG K +
Sbjct: 268 MVSALDEAVGNVTGALKARRAYDDAVVVFVSDNGGPTNGD---EGTASNNWPLRGGKNTL 324
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG R V L +P + K +H DW PTLY AG
Sbjct: 325 YEGGTRVVGLLKAPGVAKSSASRAKLHAVDWFPTLYRFAG 364
>gi|363733898|ref|XP_420639.3| PREDICTED: arylsulfatase J [Gallus gallus]
Length = 573
Score = 87.0 bits (214), Expect = 1e-15, Method: Composition-based stats.
Identities = 48/107 (44%), Positives = 67/107 (62%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDE++ V +L++ G NS++++ SDNGG + SNWPLRG+K ++
Sbjct: 270 MLACLDEAINNVTLALKKYGYYDNSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 323 WEGGIRAVGFVHSPLLKNKGSVCKELVHITDWFPTLITLA--EGQID 367
>gi|449265842|gb|EMC76972.1| Arylsulfatase J, partial [Columba livia]
Length = 467
Score = 87.0 bits (214), Expect = 1e-15, Method: Composition-based stats.
Identities = 48/107 (44%), Positives = 67/107 (62%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDE++ V +L++ G NS++++ SDNGG + SNWPLRG+K ++
Sbjct: 164 MLACLDEAINNVTLALKKYGYYDNSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 216
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 217 WEGGIRAVGFVHSPLLKNKGSVCKELVHITDWFPTLITLA--EGQID 261
>gi|363744029|ref|XP_003642960.1| PREDICTED: arylsulfatase B [Gallus gallus]
Length = 514
Score = 87.0 bits (214), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 50/105 (47%), Positives = 68/105 (64%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG + +L+E GL N+++VF +DNGG T+ GN NWPLRG K++
Sbjct: 248 MVSLMDEAVGNLTDALKEYGLWNNTVLVFSTDNGGQTMAG----GN---NWPLRGRKWTL 300
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +AG N
Sbjct: 301 WEGGVRGVGFVASPLLKQKGVESHELIHISDWLPTLVHLAGGHTN 345
>gi|351698063|gb|EHB00982.1| Arylsulfatase J [Heterocephalus glaber]
Length = 593
Score = 87.0 bits (214), Expect = 1e-15, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 290 MLSCLDEAIHNVTLALKAYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 341
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 342 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 387
>gi|426231237|ref|XP_004009646.1| PREDICTED: arylsulfatase J [Ovis aries]
Length = 599
Score = 87.0 bits (214), Expect = 1e-15, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 297 MLSCLDEAINNVTLALKMYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 348
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 349 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 394
>gi|326919013|ref|XP_003205778.1| PREDICTED: arylsulfatase J-like [Meleagris gallopavo]
Length = 573
Score = 86.7 bits (213), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 48/107 (44%), Positives = 67/107 (62%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDE++ V +L++ G NS++++ SDNGG + SNWPLRG+K ++
Sbjct: 270 MLACLDEAINNVTLALKKYGYYDNSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 323 WEGGIRAVGFVHSPLLKNKGSVCKELVHITDWFPTLITLA--EGQID 367
>gi|119626709|gb|EAX06304.1| hCG20717, isoform CRA_b [Homo sapiens]
Length = 354
Score = 86.7 bits (213), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 51 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 102
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 103 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 148
>gi|114326200|ref|NP_001041598.1| arylsulfatase B precursor [Canis lupus familiaris]
gi|81158050|tpe|CAI84999.1| TPA: arylsulfatase B [Canis lupus familiaris]
Length = 535
Score = 86.7 bits (213), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 64/101 (63%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ VF +DNGG T+ +NWPLRG K+S
Sbjct: 271 MVSLMDEAVGNVTAALKSHGLWNNTVFVFSTDNGGQTLAG-------GNNWPLRGRKWSL 323
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR V + SPLL RKG L+HI+DWLPTL +AG
Sbjct: 324 WEGGVRGVGFVASPLLKRKGVKSRELVHISDWLPTLVGLAG 364
>gi|332822312|ref|XP_527073.3| PREDICTED: arylsulfatase I isoform 2 [Pan troglodytes]
Length = 569
Score = 86.7 bits (213), Expect = 2e-15, Method: Composition-based stats.
Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + LMHITDW PTL +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 359
>gi|397517762|ref|XP_003829075.1| PREDICTED: arylsulfatase I [Pan paniscus]
gi|410214522|gb|JAA04480.1| arylsulfatase family, member I [Pan troglodytes]
gi|410261150|gb|JAA18541.1| arylsulfatase family, member I [Pan troglodytes]
gi|410300016|gb|JAA28608.1| arylsulfatase family, member I [Pan troglodytes]
gi|410336277|gb|JAA37085.1| arylsulfatase family, member I [Pan troglodytes]
Length = 569
Score = 86.7 bits (213), Expect = 2e-15, Method: Composition-based stats.
Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + LMHITDW PTL +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 359
>gi|21430588|gb|AAM50972.1| RE13542p [Drosophila melanogaster]
Length = 300
Score = 86.7 bits (213), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG VI +L + +L NS+++F+SDNGGPT H TASN+PLRG K S
Sbjct: 184 MVSRLDKSVGSVIDALARQEMLQNSIILFLSDNGGPTQGQ---HSTTASNYPLRGQKNSP 240
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EG +R+ AA+WS R G V + ++I D LPTL AG
Sbjct: 241 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAG 281
>gi|354502405|ref|XP_003513277.1| PREDICTED: arylsulfatase J [Cricetulus griseus]
Length = 597
Score = 86.7 bits (213), Expect = 2e-15, Method: Composition-based stats.
Identities = 48/107 (44%), Positives = 66/107 (61%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDE++ V +L+ G NS++++ SDNGG + SNWPLRG+K ++
Sbjct: 294 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 346
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 347 WEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 391
>gi|344239533|gb|EGV95636.1| Arylsulfatase J [Cricetulus griseus]
Length = 571
Score = 86.7 bits (213), Expect = 2e-15, Method: Composition-based stats.
Identities = 48/107 (44%), Positives = 66/107 (61%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDE++ V +L+ G NS++++ SDNGG + SNWPLRG+K ++
Sbjct: 268 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 320
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 321 WEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 365
>gi|24666163|ref|NP_649020.1| CG7408, isoform B [Drosophila melanogaster]
gi|281366395|ref|NP_001163462.1| CG7408, isoform C [Drosophila melanogaster]
gi|281366397|ref|NP_001163463.1| CG7408, isoform D [Drosophila melanogaster]
gi|23093214|gb|AAF49290.2| CG7408, isoform B [Drosophila melanogaster]
gi|272455230|gb|ACZ94733.1| CG7408, isoform C [Drosophila melanogaster]
gi|272455231|gb|ACZ94734.1| CG7408, isoform D [Drosophila melanogaster]
Length = 585
Score = 86.7 bits (213), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG VI +L + +L NS+++F+SDNGGPT H TASN+PLRG K S
Sbjct: 264 MVSRLDKSVGSVIDALARQEMLQNSIILFLSDNGGPTQGQ---HSTTASNYPLRGQKNSP 320
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EG +R+ AA+WS R G V + ++I D LPTL AG
Sbjct: 321 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAG 361
>gi|350587859|ref|XP_003129285.2| PREDICTED: arylsulfatase J [Sus scrofa]
Length = 539
Score = 86.7 bits (213), Expect = 2e-15, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 237 MLSCLDEAINNVTLALKMYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 288
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 289 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 334
>gi|410039868|ref|XP_003950702.1| PREDICTED: arylsulfatase I isoform 1 [Pan troglodytes]
Length = 426
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 123 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 175
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + LMHITDW PTL +AG
Sbjct: 176 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 216
>gi|338713661|ref|XP_003362935.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Equus
caballus]
Length = 523
Score = 86.3 bits (212), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 51/105 (48%), Positives = 66/105 (62%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDE+VG V +L+ GL N++ +F +DNGG T+ GN NWPLRG K+S
Sbjct: 259 MVSLLDEAVGNVTAALKSRGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 311
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +AG N
Sbjct: 312 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGSTN 356
>gi|119893510|ref|XP_611819.3| PREDICTED: arylsulfatase J [Bos taurus]
gi|297475606|ref|XP_002688145.1| PREDICTED: arylsulfatase J [Bos taurus]
gi|296486797|tpg|DAA28910.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase-like [Bos taurus]
Length = 599
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 49/108 (45%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 297 MLSCLDEAINNVTLALKMYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 348
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R + + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 349 YWEGGIRAIGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 394
>gi|403256717|ref|XP_003921000.1| PREDICTED: arylsulfatase B [Saimiri boliviensis boliviensis]
Length = 551
Score = 86.3 bits (212), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/105 (47%), Positives = 66/105 (62%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ GN NWPLRG K+S
Sbjct: 287 MVSLMDEAVGNVTAALKSRGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 339
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +AG N
Sbjct: 340 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTN 384
>gi|440894324|gb|ELR46807.1| Arylsulfatase J, partial [Bos grunniens mutus]
Length = 466
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 49/108 (45%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 164 MLSCLDEAINNVTLALKMYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 215
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R + + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 216 YWEGGIRAIGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 261
>gi|109079349|ref|XP_001108178.1| PREDICTED: arylsulfatase I-like [Macaca mulatta]
gi|402873074|ref|XP_003900411.1| PREDICTED: arylsulfatase I [Papio anubis]
gi|355691752|gb|EHH26937.1| hypothetical protein EGK_17023 [Macaca mulatta]
Length = 569
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + LMHITDW PTL +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 359
>gi|355750329|gb|EHH54667.1| hypothetical protein EGM_15550 [Macaca fascicularis]
Length = 569
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + LMHITDW PTL +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 359
>gi|195328499|ref|XP_002030952.1| GM25725 [Drosophila sechellia]
gi|194119895|gb|EDW41938.1| GM25725 [Drosophila sechellia]
Length = 585
Score = 86.3 bits (212), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG VI +L + +L NS+++F+SDNGGPT H TASN+PLRG K S
Sbjct: 264 MVSRLDKSVGSVIDALARQEMLQNSIILFLSDNGGPTEGQ---HSTTASNYPLRGQKNSP 320
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EG +R+ AA+WS R G V + ++I D LPTL AG
Sbjct: 321 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAG 361
>gi|195591201|ref|XP_002085331.1| GD14733 [Drosophila simulans]
gi|194197340|gb|EDX10916.1| GD14733 [Drosophila simulans]
Length = 585
Score = 86.3 bits (212), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG VI +L + +L NS+++F+SDNGGPT H TASN+PLRG K S
Sbjct: 264 MVSRLDKSVGSVIDALARQEMLQNSIILFLSDNGGPTEGQ---HSTTASNYPLRGQKNSP 320
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EG +R+ AA+WS R G V + ++I D LPTL AG
Sbjct: 321 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAG 361
>gi|348564635|ref|XP_003468110.1| PREDICTED: arylsulfatase J-like [Cavia porcellus]
Length = 543
Score = 86.3 bits (212), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 240 MLSCLDEAIHNVTLALKAYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 291
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 292 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 337
>gi|426350600|ref|XP_004042858.1| PREDICTED: arylsulfatase I [Gorilla gorilla gorilla]
Length = 569
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + LMHITDW PTL +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 359
>gi|432885639|ref|XP_004074694.1| PREDICTED: arylsulfatase B-like [Oryzias latipes]
Length = 520
Score = 86.3 bits (212), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/101 (46%), Positives = 64/101 (63%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+VG + +L + GL N+++VF +DNGG T SNWPLRG K++
Sbjct: 253 MVSAMDEAVGNISLALRQAGLWDNTVLVFSTDNGGQTRAG-------GSNWPLRGRKWAL 305
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R V + SPLL G V LMHI+DWLPTL +AG
Sbjct: 306 WEGGIRGVGFVSSPLLENPGTVNNELMHISDWLPTLVRLAG 346
>gi|296193239|ref|XP_002744413.1| PREDICTED: arylsulfatase I [Callithrix jacchus]
Length = 569
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + LMHITDW PTL +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKRRTSRALMHITDWYPTLVGLAG 359
>gi|59797060|ref|NP_001012301.1| arylsulfatase I precursor [Homo sapiens]
gi|74722581|sp|Q5FYB1.1|ARSI_HUMAN RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
gi|58201084|gb|AAW66665.1| arylsulfatase I [Homo sapiens]
gi|120538357|gb|AAI29997.1| Arylsulfatase family, member I [Homo sapiens]
gi|120538621|gb|AAI29996.1| Arylsulfatase family, member I [Homo sapiens]
gi|220983388|dbj|BAH11166.1| arylsulfatase I [Homo sapiens]
Length = 569
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + LMHITDW PTL +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 359
>gi|390369306|ref|XP_003731620.1| PREDICTED: uncharacterized protein LOC763377 [Strongylocentrotus
purpuratus]
Length = 784
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 44/98 (44%), Positives = 63/98 (64%), Gaps = 10/98 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDE+VG+++ SL++ GL N++++F +DNGG GN NWPLRG K S
Sbjct: 242 MMTCLDEAVGKIVHSLQQAGLWDNTVLIFSTDNGGEVAAG----GN---NWPLRGWKRSI 294
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
EGG+R V + SPLL +G V + L+H++DW PTL
Sbjct: 295 WEGGMRGVGFVNSPLLPASVQGTVNKQLIHVSDWFPTL 332
>gi|126331176|ref|XP_001365999.1| PREDICTED: arylsulfatase J [Monodelphis domestica]
Length = 607
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 44/100 (44%), Positives = 63/100 (63%), Gaps = 8/100 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDE++ V +L+ G NS++++ SDNGG + SNWPLRG+K ++
Sbjct: 304 MLSCLDEAINNVTLALKRYGFYDNSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 356
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIA 99
EGG+R V + SPLL+ KG + + L+HITDW PTL +A
Sbjct: 357 WEGGIRAVGFVHSPLLKNKGTICKELVHITDWYPTLITLA 396
>gi|126317548|ref|XP_001381590.1| PREDICTED: arylsulfatase B [Monodelphis domestica]
Length = 522
Score = 86.3 bits (212), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 70/105 (66%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V ++L++ GL N++ +F +DNGG T+ GN NWPLRG K++
Sbjct: 258 MVTLMDEAVGNVTRALQKYGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWTL 310
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKEKN 104
EGG+R V + SPLL++ +V + L+HI+DWLPTL +AG N
Sbjct: 311 WEGGIRGVGFVSSPLLKRKRVKSSELIHISDWLPTLVNLAGGRTN 355
>gi|195494699|ref|XP_002094950.1| GE19934 [Drosophila yakuba]
gi|194181051|gb|EDW94662.1| GE19934 [Drosophila yakuba]
Length = 591
Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG VI +L + +L NS+V+F+SDNGGPT H TASN+PLRG K S
Sbjct: 264 MVSKLDQSVGYVIDALARQEMLQNSIVLFLSDNGGPTQGQ---HSTTASNYPLRGQKDSP 320
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EG +R+ AA+WS R G V + ++I D LPTL AG
Sbjct: 321 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAGAG 361
>gi|395736371|ref|XP_003780537.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Pongo abelii]
Length = 481
Score = 85.9 bits (211), Expect = 3e-15, Method: Composition-based stats.
Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 254 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 306
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + LMHITDW PTL +AG
Sbjct: 307 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 347
>gi|403285505|ref|XP_003934063.1| PREDICTED: arylsulfatase I [Saimiri boliviensis boliviensis]
Length = 572
Score = 85.9 bits (211), Expect = 3e-15, Method: Composition-based stats.
Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + LMHITDW PTL +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKRRTSRALMHITDWYPTLVGLAG 359
>gi|345326826|ref|XP_003431088.1| PREDICTED: arylsulfatase J [Ornithorhynchus anatinus]
Length = 504
Score = 85.9 bits (211), Expect = 3e-15, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 198 MLSCLDEAIHNVTLALKRYGFYDNSILIYSSDNGGQPTAG--------GSNWPLRGSKGT 249
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIA 99
+ EGG+R V + SPLL+ KG V L+HITDW PTL +A
Sbjct: 250 YWEGGIRAVGFVHSPLLKNKGTVCRELVHITDWYPTLITLA 290
>gi|193784090|dbj|BAG53634.1| unnamed protein product [Homo sapiens]
Length = 426
Score = 85.9 bits (211), Expect = 3e-15, Method: Composition-based stats.
Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 123 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 175
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + LMHITDW PTL +AG
Sbjct: 176 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 216
>gi|22450117|emb|CAC86342.1| glucosinolate sulfatase [Plutella xylostella]
gi|22450119|emb|CAC86343.1| glucosinolate sulfatase [Plutella xylostella]
Length = 532
Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 69/108 (63%), Gaps = 9/108 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+K LD+S+G ++ +LE++G+L N+++ F +DNG PTV N+ SN+PLRG K S
Sbjct: 250 MVKKLDDSIGEIVATLEKKGMLENTIITFSTDNGAPTVG---LGANSGSNYPLRGVKKSP 306
Query: 61 HEGGVRNVAALWS-PLLR-----KGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R A +W+ P + +G+V + MH DW+PTL G++
Sbjct: 307 WEGGIRGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIGEK 354
>gi|449499600|ref|XP_002193905.2| PREDICTED: arylsulfatase J [Taeniopygia guttata]
Length = 488
Score = 85.9 bits (211), Expect = 3e-15, Method: Composition-based stats.
Identities = 48/107 (44%), Positives = 66/107 (61%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDE++ V +L + G NS++++ SDNGG + SNWPLRG+K ++
Sbjct: 185 MLACLDEAINNVTLALRKYGYYENSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 237
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 238 WEGGIRAVGFVHSPLLKNKGSVCKELVHITDWFPTLITLA--EGQID 282
>gi|443709644|gb|ELU04236.1| hypothetical protein CAPTEDRAFT_53259, partial [Capitella teleta]
Length = 476
Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 65/101 (64%), Gaps = 7/101 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG + + ++ GL N++VVF +DNGG + N+ +NWPLRG K+S
Sbjct: 227 MVTCMDEAVGNITAAFKDTGLWDNTVVVFTTDNGG------DVNENSGNNWPLRGWKHSL 280
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGG+R V + SPLL+ G + LMH++DW PTL +AG
Sbjct: 281 WEGGMRGVGFVNSPLLKSFGYPVNGLMHVSDWFPTLLNLAG 321
>gi|355691424|gb|EHH26609.1| Arylsulfatase B, partial [Macaca mulatta]
Length = 430
Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 166 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 218
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +AG N
Sbjct: 219 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTN 263
>gi|355750020|gb|EHH54358.1| Arylsulfatase B, partial [Macaca fascicularis]
Length = 430
Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 166 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 218
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +AG N
Sbjct: 219 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTN 263
>gi|119582144|gb|EAW61740.1| arylsulfatase family, member I [Homo sapiens]
Length = 426
Score = 85.9 bits (211), Expect = 3e-15, Method: Composition-based stats.
Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 123 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 175
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + LMHITDW PTL +AG
Sbjct: 176 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 216
>gi|348557289|ref|XP_003464452.1| PREDICTED: arylsulfatase B-like [Cavia porcellus]
Length = 520
Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/101 (45%), Positives = 64/101 (63%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +LE GL N++ +F +DNGG T+ +NWPLRG K++
Sbjct: 256 MVSLMDEAVGNVTAALESHGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWTL 308
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R V + SPLL+ KG L+HI+DWLPTL +AG
Sbjct: 309 WEGGIRGVGFVASPLLKQKGVRTRELIHISDWLPTLVKLAG 349
>gi|22450115|emb|CAC86338.1| glucosinolate sulfatase [Plutella xylostella]
Length = 532
Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 69/108 (63%), Gaps = 9/108 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+K LD+S+G ++ +LE++G+L N+++ F +DNG PTV N+ SN+PLRG K S
Sbjct: 250 MVKKLDDSIGEIVATLEKKGMLENTIITFSTDNGAPTVG---LGANSGSNYPLRGVKKSP 306
Query: 61 HEGGVRNVAALWS-PLLR-----KGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R A +W+ P + +G+V + MH DW+PTL G++
Sbjct: 307 WEGGIRGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIGEK 354
>gi|22450123|emb|CAD33828.1| glucosinolate sulphatase [Plutella xylostella]
Length = 547
Score = 85.5 bits (210), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 69/108 (63%), Gaps = 9/108 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+K LD+S+G ++ +LE++G+L N+++ F +DNG PTV N+ SN+PLRG K S
Sbjct: 250 MVKKLDDSIGEIVATLEKKGMLENTIITFSTDNGAPTVG---LGANSGSNYPLRGVKKSP 306
Query: 61 HEGGVRNVAALWS-PLLR-----KGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R A +W+ P + +G+V + MH DW+PTL G++
Sbjct: 307 WEGGIRGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIGEK 354
>gi|395825538|ref|XP_003785985.1| PREDICTED: arylsulfatase B [Otolemur garnettii]
Length = 532
Score = 85.5 bits (210), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE++G V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 268 MVSLMDEAIGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 320
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +AG N
Sbjct: 321 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTN 365
>gi|426231093|ref|XP_004009578.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Ovis aries]
Length = 597
Score = 85.5 bits (210), Expect = 3e-15, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 297 MVTCMDEAVRNITWALKRHGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 349
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + L+HITDW PTL +AG
Sbjct: 350 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVALAG 390
>gi|395545568|ref|XP_003774672.1| PREDICTED: arylsulfatase J-like, partial [Sarcophilus harrisii]
Length = 466
Score = 85.5 bits (210), Expect = 4e-15, Method: Composition-based stats.
Identities = 44/100 (44%), Positives = 63/100 (63%), Gaps = 8/100 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDE++ V +L+ G NS++++ SDNGG + SNWPLRG+K ++
Sbjct: 163 MLSCLDEAINNVTLALKRYGFYDNSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 215
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIA 99
EGG+R V + SPLL+ KG + + L+HITDW PTL +A
Sbjct: 216 WEGGIRAVGFVHSPLLKNKGTICKELVHITDWYPTLITLA 255
>gi|327274122|ref|XP_003221827.1| PREDICTED: arylsulfatase J-like [Anolis carolinensis]
Length = 564
Score = 85.5 bits (210), Expect = 4e-15, Method: Composition-based stats.
Identities = 51/108 (47%), Positives = 68/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT GN NWPLRG+K +
Sbjct: 263 MLACLDEAINNVTLALKRYGYYNNSVIIYSSDNGGQPTAG-----GN---NWPLRGSKGT 314
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 315 YWEGGIRAVGFVHSPLLKNKGCVCKELIHITDWFPTLITLA--EGQID 360
>gi|61876881|ref|XP_593725.1| PREDICTED: arylsulfatase I [Bos taurus]
gi|297477411|ref|XP_002689338.1| PREDICTED: arylsulfatase I [Bos taurus]
gi|296485188|tpg|DAA27303.1| TPA: arylsulfatase I-like [Bos taurus]
Length = 574
Score = 85.5 bits (210), Expect = 4e-15, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 269 MVTCMDEAVRNITWALKRHGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 321
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + L+HITDW PTL +AG
Sbjct: 322 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVALAG 362
>gi|299473382|emb|CBN77780.1| Formylglycine-dependent sulfatase, C-terminal fragment
Formylglycine-dependent sulfatase, N-terminal
[Ectocarpus siliculosus]
Length = 623
Score = 85.5 bits (210), Expect = 4e-15, Method: Composition-based stats.
Identities = 50/115 (43%), Positives = 68/115 (59%), Gaps = 17/115 (14%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
+L LD+S+GR++ LE EG + NS++V SDNGG PT SN+PLRGTK S
Sbjct: 302 VLMYLDKSIGRLVDYLESEGWMDNSIIVVASDNGGCPTAG--------GSNFPLRGTKAS 353
Query: 60 FHEGGVRNVAALWSPLL----RKGQVLENLMHITDWLPTLYFIAG----KEKNID 106
+ EGG + A ++SP R+G + LMH+TDWLPTL G +N+D
Sbjct: 354 YWEGGTKVPALVYSPSHIPKERRGGQYDGLMHVTDWLPTLMAATGSVVESHRNLD 408
>gi|449514410|ref|XP_002188440.2| PREDICTED: arylsulfatase B, partial [Taeniopygia guttata]
Length = 491
Score = 85.5 bits (210), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 50/105 (47%), Positives = 66/105 (62%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG + +L+ GL N++ VF +DNGG T+ GN NWPLRG K++
Sbjct: 225 MVTLMDEAVGNLTDALKTYGLWDNTVFVFSTDNGGQTLAG----GN---NWPLRGRKWTL 277
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL RKG L+HI+DWLPTL +AG N
Sbjct: 278 WEGGVRGVGFVASPLLKRKGVESHELIHISDWLPTLVHLAGGHTN 322
>gi|440901665|gb|ELR52564.1| Arylsulfatase I, partial [Bos grunniens mutus]
Length = 565
Score = 85.5 bits (210), Expect = 4e-15, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 260 MVTCMDEAVRNITWALKRHGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 312
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + L+HITDW PTL +AG
Sbjct: 313 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVALAG 353
>gi|444723687|gb|ELW64328.1| Arylsulfatase I [Tupaia chinensis]
Length = 613
Score = 85.5 bits (210), Expect = 4e-15, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 311 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 363
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + L+HITDW PTL +AG
Sbjct: 364 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 404
>gi|344272682|ref|XP_003408160.1| PREDICTED: arylsulfatase B [Loxodonta africana]
Length = 532
Score = 85.5 bits (210), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 66/105 (62%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L++ GL N++ +F +DNGG T+ +NWPLRG K++
Sbjct: 268 MVSLMDEAVGNVTAALKKRGLWDNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWTL 320
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +AG N
Sbjct: 321 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTN 365
>gi|291236973|ref|XP_002738412.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 843
Score = 85.5 bits (210), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 45/108 (41%), Positives = 64/108 (59%), Gaps = 9/108 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE++ + ++LEE+ +L NS++VF SDNGG S SNWPLRG K++
Sbjct: 583 MTSCVDEAIANITEALEEKDMLKNSIIVFTSDNGGAVERS------AGSNWPLRGGKHTI 636
Query: 61 HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAGKEKNI 105
EGGV+ V ++ LL +G LMHITDW PTL +AG ++
Sbjct: 637 WEGGVKAVGFVYGELLPHHSRGTENTGLMHITDWFPTLVTLAGANPHV 684
>gi|291239530|ref|XP_002739676.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 507
Score = 85.5 bits (210), Expect = 4e-15, Method: Composition-based stats.
Identities = 45/103 (43%), Positives = 64/103 (62%), Gaps = 10/103 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE++G V ++L GLL N+++VF SDNGG + +G ASN+PLRG K +
Sbjct: 251 MTTCMDEAIGNVTRTLARRGLLDNTIIVFSSDNGGAKL-----YG--ASNYPLRGQKGTL 303
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
+EGGVR A ++SPLL +G LM++ DW PT +AG
Sbjct: 304 YEGGVRAPAFVYSPLLSNRVRGTATHELMYVGDWFPTFLHLAG 346
>gi|355669602|gb|AER94582.1| arylsulfatase B [Mustela putorius furo]
Length = 418
Score = 85.5 bits (210), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 156 MVSLMDEAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 208
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R V + SPLL+ KG L+HI+DWLPTL +AG N
Sbjct: 209 WEGGIRGVGFVASPLLKQKGIKNRELIHISDWLPTLVTLAGGSTN 253
>gi|109077724|ref|XP_001108177.1| PREDICTED: arylsulfatase B isoform 2 [Macaca mulatta]
Length = 414
Score = 85.1 bits (209), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 270 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +AG N
Sbjct: 323 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHIN 367
>gi|327291280|ref|XP_003230349.1| PREDICTED: arylsulfatase J-like, partial [Anolis carolinensis]
Length = 471
Score = 85.1 bits (209), Expect = 4e-15, Method: Composition-based stats.
Identities = 49/108 (45%), Positives = 67/108 (62%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT +NWPLRG+K +
Sbjct: 170 MLACLDEAINNVTLALKRYGYYNNSVIIYSSDNGGQPTAG--------GNNWPLRGSKGT 221
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ KG V + L+HITDW PTL +A E ID
Sbjct: 222 YWEGGIRAVGFVHSPLLKNKGCVCKELIHITDWFPTLITLA--EGQID 267
>gi|390364061|ref|XP_792027.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 524
Score = 85.1 bits (209), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 63/98 (64%), Gaps = 10/98 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE VG V ++L+E GL N++++F +DNGGP + A+NWPLRG+K S
Sbjct: 253 MVSCMDEGVGNVTRALKEAGLYDNTIIIFSTDNGGPI-------QSGANNWPLRGSKGSL 305
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
EGG+ V + SPLL KG V L+H++DWLPT+
Sbjct: 306 WEGGIHGVGFVHSPLLPTSVKGTVNHELIHVSDWLPTI 343
>gi|390360193|ref|XP_788463.2| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 537
Score = 85.1 bits (209), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 42/98 (42%), Positives = 64/98 (65%), Gaps = 10/98 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+VG+V+ SL+E GL ++++VF +DNGG G + +NWPLRGTK +
Sbjct: 266 MVATMDEAVGKVVDSLQEAGLWNDTVLVFTTDNGGTP-------GKSGNNWPLRGTKNTL 318
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
EGGV V + P++ +G V ++ MHI+DW PTL
Sbjct: 319 WEGGVHGVGFITGPMIPAGVQGTVSKHFMHISDWFPTL 356
>gi|126697310|gb|ABO26612.1| arylsulfatase [Haliotis discus discus]
Length = 481
Score = 85.1 bits (209), Expect = 5e-15, Method: Composition-based stats.
Identities = 43/101 (42%), Positives = 67/101 (66%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDE+VG V ++L + GL+ ++L++F +DNGG +S GN N+PLRG+K++
Sbjct: 240 MVSALDEAVGNVTKTLRQRGLMDDTLILFTADNGGGVDES----GN---NYPLRGSKFTV 292
Query: 61 HEGGVRNVAALW-SPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG R V ++ S L + G V + ++H DWLPTL AG
Sbjct: 293 YEGGTRAVGFMYGSGLQKTGTVFDGMIHAVDWLPTLTAAAG 333
>gi|26350439|dbj|BAC38859.1| unnamed protein product [Mus musculus]
Length = 431
Score = 85.1 bits (209), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 63/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V ++L+ GL N++ +F +DNGG T +NWPLRG K +
Sbjct: 270 MVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSG-------GNNWPLRGRKGTL 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R + SPLL+ KG LMHITDWLPTL +AG N
Sbjct: 323 WEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTN 367
>gi|114326196|ref|NP_001041583.1| arylsulfatase I precursor [Canis lupus familiaris]
gi|81158064|tpe|CAI85006.1| TPA: arylsulfatase I [Canis lupus familiaris]
Length = 575
Score = 85.1 bits (209), Expect = 5e-15, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 269 MVTCMDEAVRNITSALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 321
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + L+HITDW PTL +AG
Sbjct: 322 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 362
>gi|395851353|ref|XP_003798225.1| PREDICTED: arylsulfatase J [Otolemur garnettii]
Length = 661
Score = 85.1 bits (209), Expect = 5e-15, Method: Composition-based stats.
Identities = 47/107 (43%), Positives = 64/107 (59%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDE++ V +L+ G NS++++ SDNGG SNWPLRG+K ++
Sbjct: 358 MLSCLDEAIHNVTLALKAYGFYNNSIIIYSSDNGGQPAAG-------GSNWPLRGSKGTY 410
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R + + SPLL+ KG V L+HITDW PTL +A E ID
Sbjct: 411 WEGGIRAIGFVHSPLLKNKGTVCRELVHITDWYPTLISLA--EGQID 455
>gi|218563492|sp|Q32KH7.2|ARSI_CANFA RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
Length = 573
Score = 85.1 bits (209), Expect = 5e-15, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 267 MVTCMDEAVRNITSALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 319
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + L+HITDW PTL +AG
Sbjct: 320 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 360
>gi|380795845|gb|AFE69798.1| arylsulfatase B isoform 1 precursor, partial [Macaca mulatta]
Length = 506
Score = 85.1 bits (209), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 242 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 294
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +AG N
Sbjct: 295 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHIN 339
>gi|109077718|ref|XP_001108389.1| PREDICTED: arylsulfatase B isoform 6 [Macaca mulatta]
Length = 534
Score = 85.1 bits (209), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 270 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +AG N
Sbjct: 323 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHIN 367
>gi|449278684|gb|EMC86475.1| Arylsulfatase B, partial [Columba livia]
Length = 431
Score = 85.1 bits (209), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 49/105 (46%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG + +L+ GL N+++VF +DNGG T+ GN NWPLRG K++
Sbjct: 165 MVTLMDEAVGNLTDALKRYGLWNNTVLVFSTDNGGQTLAG----GN---NWPLRGRKWTL 217
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +AG N
Sbjct: 218 WEGGVRGVGFVASPLLKQKGVESHELIHISDWLPTLVHLAGGHTN 262
>gi|242024962|ref|XP_002432895.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
gi|212518404|gb|EEB20157.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
Length = 533
Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 50/103 (48%), Positives = 68/103 (66%), Gaps = 6/103 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD SVG +I +L++ +L NS+V+F +DNGGP + L++ N ASNWP+RG K +
Sbjct: 234 MVSELDTSVGNIISALKDANMLSNSIVIFSTDNGGP-ANGLNY--NYASNWPMRGVKNTL 290
Query: 61 HEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIAG 100
EGGVR V +WS + G+V +MHI DWLPTLY AG
Sbjct: 291 WEGGVRGVGVVWSSKFKPNSGGRVSNRMMHICDWLPTLYAAAG 333
>gi|390361685|ref|XP_789654.2| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like
[Strongylocentrotus purpuratus]
Length = 401
Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 65/98 (66%), Gaps = 10/98 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+K +DE+VG + ++L++ GL N++++F SDNG + ++ G SNWPLRG K S
Sbjct: 144 MVKCMDEAVGNLTRTLKKTGLWNNTIIIFSSDNGA----NFNYGG---SNWPLRGMKRSL 196
Query: 61 HEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTL 95
EGG+++V + SPLL K G V NL H+TDW PTL
Sbjct: 197 WEGGIKSVGFIASPLLPKLVRGTVNNNLFHVTDWFPTL 234
>gi|390362801|ref|XP_793927.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 739
Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 67/103 (65%), Gaps = 12/103 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +G V ++L++ GL N++++F +DNGGP + A+NWPLRG+K +
Sbjct: 486 MVSCMDEGIGNVTRALKDAGLYDNTIIIFSTDNGGPI-------QSGANNWPLRGSKGNM 538
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
EGG+ V + SPLL KG V +LMH++DWLPT+ +AG
Sbjct: 539 WEGGIHGVGFVHSPLLPTSVKGTVNHDLMHVSDWLPTI--VAG 579
>gi|354473078|ref|XP_003498763.1| PREDICTED: arylsulfatase B [Cricetulus griseus]
Length = 460
Score = 84.7 bits (208), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 48/101 (47%), Positives = 65/101 (64%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ VF +DNGG T+ GN NWPLRG K++
Sbjct: 196 MVSLMDEAVGNVTAALKSHGLWNNTVFVFSTDNGGQTLSG----GN---NWPLRGRKWTL 248
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R V + SPLL+ KG L+HI+DWLPTL +AG
Sbjct: 249 WEGGIRGVGFVASPLLKQKGVKSRELIHISDWLPTLVNLAG 289
>gi|301765544|ref|XP_002918191.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Ailuropoda
melanoleuca]
Length = 573
Score = 84.7 bits (208), Expect = 7e-15, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 267 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 319
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + L+HITDW PTL +AG
Sbjct: 320 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 360
>gi|344265150|ref|XP_003404649.1| PREDICTED: arylsulfatase I-like [Loxodonta africana]
Length = 573
Score = 84.7 bits (208), Expect = 7e-15, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 268 MVTCMDEAVRNITWALKRYGFYNNSIIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 320
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + L+HITDW PTL +AG
Sbjct: 321 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 361
>gi|1089794|dbj|BAA08412.1| Arylsulfatase B [Rattus norvegicus]
Length = 473
Score = 84.7 bits (208), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDE+VG V ++L+ GL N++++F +DNGG T +NWPLRG K +
Sbjct: 209 MVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQTRSG-------GNNWPLRGRKGTL 261
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R + SPLL+ KG LMHITDWLPTL +AG
Sbjct: 262 WEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAG 302
>gi|432904444|ref|XP_004077334.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
Length = 572
Score = 84.3 bits (207), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 65/101 (64%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LD+ V +V++ L+ G+ NS++++ SDNGG + SNWPLRG K S+
Sbjct: 263 MLSCLDDGVAQVVKELKTTGIYDNSILIYSSDNGGQPLSG-------GSNWPLRGGKGSY 315
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGG+R V + SPLL +KG + + ++H++DW PTL +AG
Sbjct: 316 WEGGIRAVGFVHSPLLKKKGVISKEMIHVSDWYPTLLGLAG 356
>gi|148668607|gb|EDL00926.1| arylsulfatase B [Mus musculus]
Length = 556
Score = 84.3 bits (207), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 63/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V ++L+ GL N++ +F +DNGG T +NWPLRG K +
Sbjct: 292 MVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSG-------GNNWPLRGRKGTL 344
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R + SPLL+ KG LMHITDWLPTL +AG N
Sbjct: 345 WEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTN 389
>gi|77993374|ref|NP_254278.1| arylsulfatase B precursor [Rattus norvegicus]
gi|148887336|sp|P50430.2|ARSB_RAT RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S
gi|81158016|tpe|CAI84982.1| TPA: arylsulfatase B [Rattus norvegicus]
gi|195539740|gb|AAI68241.1| Arylsulfatase B [Rattus norvegicus]
Length = 528
Score = 84.3 bits (207), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDE+VG V ++L+ GL N++++F +DNGG T +NWPLRG K +
Sbjct: 264 MVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQTRSG-------GNNWPLRGRKGTL 316
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R + SPLL+ KG LMHITDWLPTL +AG
Sbjct: 317 WEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAG 357
>gi|291239534|ref|XP_002739678.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 648
Score = 84.3 bits (207), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 46/103 (44%), Positives = 64/103 (62%), Gaps = 10/103 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE++G V Q+L++ G+ N++VVF +DNGG T G ASN+PLRG K
Sbjct: 250 MTSCMDEAIGNVTQTLKDRGMWDNTVVVFSTDNGGAT-------GFGASNYPLRGQKCGH 302
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
+EGGVR A + SPLL+ +G LM++ DWLPT +AG
Sbjct: 303 YEGGVRGPAFVSSPLLKPHVRGTKNTELMYVGDWLPTFVHLAG 345
>gi|149059062|gb|EDM10069.1| arylsulfatase B [Rattus norvegicus]
Length = 517
Score = 84.3 bits (207), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDE+VG V ++L+ GL N++++F +DNGG T +NWPLRG K +
Sbjct: 255 MVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQTRSG-------GNNWPLRGRKGTL 307
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R + SPLL+ KG LMHITDWLPTL +AG
Sbjct: 308 WEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAG 348
>gi|122065132|sp|P50429.3|ARSB_MOUSE RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
Flags: Precursor
gi|74152170|dbj|BAE32375.1| unnamed protein product [Mus musculus]
Length = 534
Score = 84.3 bits (207), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 63/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V ++L+ GL N++ +F +DNGG T +NWPLRG K +
Sbjct: 270 MVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSG-------GNNWPLRGRKGTL 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R + SPLL+ KG LMHITDWLPTL +AG N
Sbjct: 323 WEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTN 367
>gi|198415046|ref|XP_002127641.1| PREDICTED: similar to Arylsulfatase B precursor (ASB)
(N-acetylgalactosamine-4-sulfatase) (G4S) [Ciona
intestinalis]
Length = 522
Score = 84.3 bits (207), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 47/101 (46%), Positives = 67/101 (66%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE++G V+ +L+ +G+ ++LVVF +DNGG T+ F GN NWPLRG K S
Sbjct: 253 MVTAMDEAIGSVVDALKWKGIWQDTLVVFTTDNGGQTL----FGGN---NWPLRGRKASL 305
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGGVR V + +R KG+ L+HI+DW PTL +IAG
Sbjct: 306 WEGGVRGVGLVRGYGIRDKGRSSNELVHISDWFPTLLYIAG 346
>gi|291231206|ref|XP_002735556.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 516
Score = 84.3 bits (207), Expect = 8e-15, Method: Composition-based stats.
Identities = 42/107 (39%), Positives = 64/107 (59%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG + Q+L++ GL NS+++F +DNGG + +NWPLRG K S
Sbjct: 259 MVSCMDEAVGNITQTLKDTGLYDNSVIIFSTDNGGQVIAG-------GNNWPLRGWKGSL 311
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+ V + SPL+ +G + L+H+TDW PT+ +AG N
Sbjct: 312 WEGGMHGVGFVNSPLINHQARGTTSKELIHVTDWFPTIVHLAGGSVN 358
>gi|125656171|ref|NP_033842.3| arylsulfatase B precursor [Mus musculus]
gi|81158036|tpe|CAI84992.1| TPA: arylsulfatase B [Mus musculus]
gi|195934801|gb|AAI68412.1| Arylsulfatase B [synthetic construct]
Length = 534
Score = 84.3 bits (207), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 63/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V ++L+ GL N++ +F +DNGG T +NWPLRG K +
Sbjct: 270 MVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSG-------GNNWPLRGRKGTL 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R + SPLL+ KG LMHITDWLPTL +AG N
Sbjct: 323 WEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTN 367
>gi|432098813|gb|ELK28308.1| Arylsulfatase I [Myotis davidii]
Length = 571
Score = 84.3 bits (207), Expect = 9e-15, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + L+HITDW PTL +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 359
>gi|410949653|ref|XP_003981535.1| PREDICTED: arylsulfatase I, partial [Felis catus]
Length = 570
Score = 84.3 bits (207), Expect = 9e-15, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 264 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 316
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + L+HITDW PTL +AG
Sbjct: 317 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 357
>gi|149726732|ref|XP_001501521.1| PREDICTED: arylsulfatase I-like [Equus caballus]
Length = 476
Score = 84.3 bits (207), Expect = 9e-15, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 170 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 222
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + L+HITDW PTL +AG
Sbjct: 223 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 263
>gi|313212736|emb|CBY36668.1| unnamed protein product [Oikopleura dioica]
Length = 602
Score = 84.3 bits (207), Expect = 9e-15, Method: Composition-based stats.
Identities = 41/102 (40%), Positives = 66/102 (64%), Gaps = 10/102 (9%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+ +LD +GR++ +L+E+ N+++VF SDNGG + ASNWPLRG+K + +
Sbjct: 270 ITTLDHHIGRMVTALKEKNYWNNTVLVFTSDNGGQPNEG-------ASNWPLRGSKGTIY 322
Query: 62 EGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
+GG+++ A + SPLL+ KGQ L H++DW PTL ++G
Sbjct: 323 DGGLKSRAFVASPLLQNRMKGQNYNYLFHVSDWFPTLLTLSG 364
>gi|291241933|ref|XP_002740864.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 496
Score = 84.3 bits (207), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 62/103 (60%), Gaps = 12/103 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE++ + ++LE +G+L NS++V +DNGG G+ +NWPLRG K S
Sbjct: 242 MASCVDEAIANITEALENKGMLKNSVIVLTTDNGG---------GSVGNNWPLRGRKNSH 292
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
EGGVR VA ++S LL +G LMHITDW PTL + G
Sbjct: 293 WEGGVRGVAFVYSDLLPMDVRGTENSELMHITDWFPTLVKLGG 335
>gi|431918040|gb|ELK17268.1| Arylsulfatase I [Pteropus alecto]
Length = 547
Score = 84.0 bits (206), Expect = 9e-15, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 243 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 295
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + L+HITDW PTL +AG
Sbjct: 296 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 336
>gi|395817250|ref|XP_003782086.1| PREDICTED: arylsulfatase I [Otolemur garnettii]
Length = 572
Score = 84.0 bits (206), Expect = 9e-15, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + L+HITDW PTL +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 359
>gi|327263080|ref|XP_003216349.1| PREDICTED: arylsulfatase B-like [Anolis carolinensis]
Length = 521
Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDE+VG + ++L+ GL N++++F +DNGG T+ GN NWPLRG K++
Sbjct: 257 MVSILDEAVGNITEALKTRGLWNNTVLIFSTDNGGQTLAG----GN---NWPLRGRKWTL 309
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR + SPLL+ +G L+HI+DWLPTL +AG N
Sbjct: 310 WEGGVRGAGFVASPLLKQRGVESHELIHISDWLPTLVKLAGGSTN 354
>gi|291222022|ref|XP_002731018.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 1410
Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 62/103 (60%), Gaps = 11/103 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D+++G V+ L E GL N++++ +DNGG + + +NWPL+G+K S
Sbjct: 242 MVTCMDDAIGNVVDQLREAGLWNNTVLIVSTDNGGAKI--------SGNNWPLKGSKASL 293
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
EGGVR VA + SP L +G LMH+TDW PT+ +AG
Sbjct: 294 WEGGVRGVAFVTSPFLADHVRGTSNHQLMHVTDWFPTMLHVAG 336
>gi|291231208|ref|XP_002735557.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 490
Score = 84.0 bits (206), Expect = 1e-14, Method: Composition-based stats.
Identities = 42/103 (40%), Positives = 62/103 (60%), Gaps = 10/103 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG + Q+LE+ GL NS+++F +DNGG +NWPLRG K S
Sbjct: 233 MVSCMDEAVGNITQTLEDAGLYDNSVIIFSTDNGGQVTAG-------GNNWPLRGWKGSL 285
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
EGG+ V + SPL+ +G + L+H+TDW PT+ +AG
Sbjct: 286 WEGGMHGVGFVNSPLINHQARGTTSKELIHVTDWFPTIVHLAG 328
>gi|1030057|emb|CAA63067.1| murine arylsulfatase B [Mus musculus]
Length = 252
Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/105 (43%), Positives = 63/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V ++L+ GL N++ +F +DNGG T +NWPLRG K +
Sbjct: 134 MVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSG-------GNNWPLRGRKGTL 186
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R + SPLL+ KG LMHI+DWLPTL +AG N
Sbjct: 187 WEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTN 231
>gi|291387626|ref|XP_002710353.1| PREDICTED: arylsulfatase I-like [Oryctolagus cuniculus]
Length = 571
Score = 84.0 bits (206), Expect = 1e-14, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + L+HITDW PTL +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 359
>gi|323452003|gb|EGB07878.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 1818
Score = 84.0 bits (206), Expect = 1e-14, Method: Composition-based stats.
Identities = 43/95 (45%), Positives = 59/95 (62%), Gaps = 8/95 (8%)
Query: 4 SLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEG 63
++D ++GRV+ +L+ G N+LVVF SDNG ++ SNWPLRG K+S EG
Sbjct: 296 AMDRAIGRVVDALQATGAYENALVVFASDNG-----AIPGQMGGGSNWPLRGGKFSAWEG 350
Query: 64 GVRNVAALWSPLL---RKGQVLENLMHITDWLPTL 95
GVR A + SPLL +G V + L H+ DWLPT+
Sbjct: 351 GVRVPAFVHSPLLPAHMRGAVYDGLFHVADWLPTI 385
Score = 78.6 bits (192), Expect = 5e-13, Method: Composition-based stats.
Identities = 48/110 (43%), Positives = 64/110 (58%), Gaps = 15/110 (13%)
Query: 5 LDESVGRVIQSLEE-EGLLGNSLVVFISDNGGPTVDSLHFHGNTA--SNWPLRGTKYSFH 61
+D ++ V+ L+ +G NS+++F SDNGG G A SNWPLRG K+S
Sbjct: 869 MDRAMEAVVDELKAVDGAYENSVIMFASDNGG-------LPGQIAGGSNWPLRGQKFSPF 921
Query: 62 EGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG--KEKNID 106
EGGVR A + SPLL R+G L H+TDWLPTL +AG +N+D
Sbjct: 922 EGGVRVPAFVHSPLLPAARRGGAYGGLFHVTDWLPTLVRLAGGAPPRNVD 971
>gi|348514291|ref|XP_003444674.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
Length = 570
Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 43/106 (40%), Positives = 68/106 (64%), Gaps = 8/106 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+V + +L + G NS+++F +DNGG + SNWPLRG K ++
Sbjct: 268 MVSAVDEAVRNITYALRKYGYYQNSVIIFSTDNGGQPLSG-------GSNWPLRGRKGTY 320
Query: 61 HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKEKNI 105
EGG+R + + SPLLRK +V + L+HITDW PTL +AG ++++
Sbjct: 321 WEGGIRGLGFVHSPLLRKKRRVSKALVHITDWYPTLVGLAGGDESL 366
>gi|313242955|emb|CBY39683.1| unnamed protein product [Oikopleura dioica]
Length = 581
Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 40/102 (39%), Positives = 66/102 (64%), Gaps = 10/102 (9%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+ +LD +G+++ +L+E+ N+++VF SDNGG + ASNWPLRG+K + +
Sbjct: 249 ITTLDHHIGKMVTALKEKNYWNNTVLVFTSDNGGQPKEG-------ASNWPLRGSKGTIY 301
Query: 62 EGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
+GG+++ A + SPLL+ KGQ L H++DW PTL ++G
Sbjct: 302 DGGLKSRAFIASPLLQNRMKGQNYNYLFHVSDWFPTLLTLSG 343
>gi|126291233|ref|XP_001378869.1| PREDICTED: arylsulfatase I [Monodelphis domestica]
Length = 584
Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 46/101 (45%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F +DNGG T F G SNWPLRG K ++
Sbjct: 273 MVTCMDEAVRNITWALKRYGYYDNSVIIFSTDNGGQT-----FSG--GSNWPLRGRKGTY 325
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK + L+HITDW PTL +AG
Sbjct: 326 WEGGVRGIGFVHSPLLKRKRRTSRALVHITDWYPTLVTLAG 366
>gi|187956367|gb|AAI50662.1| Arsb protein [Mus musculus]
Length = 431
Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/105 (43%), Positives = 63/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V ++L+ GL N++ +F +DNGG T +NWPLRG K +
Sbjct: 270 MVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSG-------GNNWPLRGRKGTL 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R + SPLL+ KG LMHI+DWLPTL +AG N
Sbjct: 323 WEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTN 367
>gi|410260410|gb|JAA18171.1| arylsulfatase B [Pan troglodytes]
gi|410341767|gb|JAA39830.1| arylsulfatase B [Pan troglodytes]
Length = 414
Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 270 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 323 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTN 367
>gi|114599506|ref|XP_001140908.1| PREDICTED: arylsulfatase B isoform 2 [Pan troglodytes]
Length = 415
Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 271 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 323
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 324 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTN 368
>gi|351697185|gb|EHB00104.1| Arylsulfatase B, partial [Heterocephalus glaber]
Length = 503
Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/105 (43%), Positives = 66/105 (62%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K++
Sbjct: 240 MVSLMDEAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWTL 292
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +AG + +
Sbjct: 293 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGDTH 337
>gi|291227815|ref|XP_002733878.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 508
Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 41/101 (40%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE +G V +L++ G+ N++++F+SDNG FH NWPLRG K S
Sbjct: 248 MATCMDEGIGNVTSTLKQRGMWDNTVLIFVSDNGA------QFH-TGGRNWPLRGGKGSV 300
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGG+R V+ + SPL+ R ++ E+++H+ DW PTL +AG
Sbjct: 301 FEGGIRVVSFVTSPLIERPRRISEDIVHLMDWFPTLTHLAG 341
>gi|444523022|gb|ELV13435.1| Arylsulfatase J [Tupaia chinensis]
Length = 383
Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 49/108 (45%), Positives = 66/108 (61%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
ML LDE++ V +L+ G NS++++ SDNGG PT SNWPLRG+K +
Sbjct: 80 MLACLDEAIHNVTLALKTYGFYDNSILIYSSDNGGQPTAG--------GSNWPLRGSKGT 131
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
+ EGG+R V + SPLL+ +G V L+HITDW PTL +A E ID
Sbjct: 132 YWEGGIRAVGFVHSPLLKNRGTVCRELVHITDWYPTLISLA--EGQID 177
>gi|74140818|dbj|BAE34455.1| unnamed protein product [Mus musculus]
Length = 458
Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/105 (43%), Positives = 63/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V ++L+ GL N++ +F +DNGG T +NWPLRG K +
Sbjct: 270 MVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSG-------GNNWPLRGRKGTL 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R + SPLL+ KG LMHI+DWLPTL +AG N
Sbjct: 323 WEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTN 367
>gi|410260404|gb|JAA18168.1| arylsulfatase B [Pan troglodytes]
gi|410260406|gb|JAA18169.1| arylsulfatase B [Pan troglodytes]
gi|410260408|gb|JAA18170.1| arylsulfatase B [Pan troglodytes]
gi|410341765|gb|JAA39829.1| arylsulfatase B [Pan troglodytes]
gi|410341769|gb|JAA39831.1| arylsulfatase B [Pan troglodytes]
Length = 534
Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 270 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 323 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTN 367
>gi|410260412|gb|JAA18172.1| arylsulfatase B [Pan troglodytes]
gi|410341771|gb|JAA39832.1| arylsulfatase B [Pan troglodytes]
Length = 535
Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 271 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 323
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 324 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTN 368
>gi|410932865|ref|XP_003979813.1| PREDICTED: arylsulfatase I-like, partial [Takifugu rubripes]
Length = 316
Score = 83.6 bits (205), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 68/108 (62%), Gaps = 9/108 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+V + L + G NS+++F +DNGG + SNWPLRG K ++
Sbjct: 162 MVSAVDEAVRNITYGLRKYGYYENSIMIFSTDNGGQPLSG-------GSNWPLRGRKGTY 214
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIA-GKEKNID 106
EGGVR + + SPLLRK + V + L+HITDW PTL +A GKE +I+
Sbjct: 215 WEGGVRGLGFIHSPLLRKKKRVSKALVHITDWYPTLVGLAGGKESHIE 262
>gi|301621823|ref|XP_002940244.1| PREDICTED: arylsulfatase B-like [Xenopus (Silurana) tropicalis]
Length = 502
Score = 83.6 bits (205), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/105 (42%), Positives = 66/105 (62%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDE+V V +L+++GL N++ +F +DNGG T+ +NWPLRG K++
Sbjct: 237 MVSVLDEAVANVTNALKKQGLWNNTVFIFSTDNGGQTLSG-------GNNWPLRGRKWTL 289
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R V + S LL+ KG + L+HI+DWLPTL +AG N
Sbjct: 290 WEGGIRGVGFVTSSLLKQKGVTSKELIHISDWLPTLVNLAGGSTN 334
>gi|426233825|ref|XP_004023235.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Ovis aries]
Length = 475
Score = 83.6 bits (205), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 60/96 (62%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE+VG V +LE GL N++ +F +DNGG T+ GN NWPLRG K+S
Sbjct: 216 MASLMDEAVGNVTAALESRGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 268
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTL 95
EGGVR V + SPLL RKG L+HI+DWLP L
Sbjct: 269 WEGGVRGVGFVASPLLKRKGVKTRELIHISDWLPPL 304
>gi|410924964|ref|XP_003975951.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
Length = 574
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 45/101 (44%), Positives = 65/101 (64%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+V V +L + G NS++++ +DNG F G SNWPLRG K ++
Sbjct: 266 MVSTVDEAVRNVTYALRKYGFYRNSVIIYSTDNGAQP-----FTG--GSNWPLRGRKGTY 318
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGG+R VA + SPLL R+ +V + L+HITDW PTL +AG
Sbjct: 319 WEGGIRGVAFVHSPLLKRRRRVSKALLHITDWFPTLVGLAG 359
>gi|426384277|ref|XP_004058697.1| PREDICTED: arylsulfatase B-like [Gorilla gorilla gorilla]
Length = 408
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 271 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 323
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 324 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 368
>gi|390361328|ref|XP_780209.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 469
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/98 (42%), Positives = 62/98 (63%), Gaps = 9/98 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDES+G++ +L+EEGL +++ +F +DNGG N +NWPLRG K ++
Sbjct: 207 MVTMLDESIGKLTDTLKEEGLWNDTVFIFSTDNGGVG------KKNAGNNWPLRGVKGNY 260
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
EGG+R V + PLL +G + +LMHI+DW PTL
Sbjct: 261 FEGGIRGVGFVAGPLLSTNVQGTISTDLMHISDWYPTL 298
>gi|441598315|ref|XP_004087449.1| PREDICTED: arylsulfatase B isoform 2 [Nomascus leucogenys]
Length = 415
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 271 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 323
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 324 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 368
>gi|291236588|ref|XP_002738221.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 504
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/103 (44%), Positives = 66/103 (64%), Gaps = 10/103 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE++ +I++LE + +L NS++VF SDNGG D L+ + +NWPLRG K S
Sbjct: 237 MVSCVDEAIANIIETLENKDMLKNSIIVFTSDNGG---DPLY----SGNNWPLRGWKASN 289
Query: 61 HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + ++S +L +G LMHITDW PTL I+G
Sbjct: 290 WEGGVRALGFVYSEILPTNARGTDNNELMHITDWFPTLVDISG 332
>gi|301621596|ref|XP_002940132.1| PREDICTED: arylsulfatase I-like [Xenopus (Silurana) tropicalis]
Length = 575
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/101 (45%), Positives = 64/101 (63%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D +V + ++L++ G NS++VF SDNGG T F G SNWPLRG K ++
Sbjct: 268 MVTCMDAAVKNITRALKKYGYYSNSVIVFSSDNGGQT-----FSG--GSNWPLRGRKGTY 320
Query: 61 HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPL++K + LMHITDW PTL +AG
Sbjct: 321 WEGGVRGLGFVHSPLIKKKRRTSRALMHITDWYPTLVKLAG 361
>gi|114145559|ref|NP_001041346.1| arylsulfatase I precursor [Rattus norvegicus]
gi|123779983|sp|Q32KJ8.1|ARSI_RAT RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
gi|81158022|tpe|CAI84985.1| TPA: arylsulfatase I [Rattus norvegicus]
gi|149064375|gb|EDM14578.1| similar to RIKEN cDNA 9330196J05 [Rattus norvegicus]
Length = 573
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 46/101 (45%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318
Query: 61 HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL+K + L+HITDW PTL +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKKKRRTSRALVHITDWYPTLVGLAG 359
>gi|311250496|ref|XP_003124150.1| PREDICTED: arylsulfatase I-like [Sus scrofa]
Length = 573
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 45/101 (44%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 268 MVTCMDEAVRNITGALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 320
Query: 61 HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL++ + L+HITDW PTL +AG
Sbjct: 321 WEGGVRGLGFVHSPLLKRTRRTSRALLHITDWYPTLVGLAG 361
>gi|397466741|ref|XP_003805104.1| PREDICTED: arylsulfatase B [Pan paniscus]
Length = 513
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 49/105 (46%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ GN NWPLRG K+S
Sbjct: 249 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 301
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 302 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 346
>gi|410226854|gb|JAA10646.1| arylsulfatase B [Pan troglodytes]
gi|410292330|gb|JAA24765.1| arylsulfatase B [Pan troglodytes]
Length = 414
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 270 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 323 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 367
>gi|156378148|ref|XP_001631006.1| predicted protein [Nematostella vectensis]
gi|156218038|gb|EDO38943.1| predicted protein [Nematostella vectensis]
Length = 584
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 68/101 (67%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+SVG + +L + +L +S+V+F +DNGG + N SN+PLRG K
Sbjct: 224 MVTSLDQSVGDIFAALSKRHMLRDSVVIFTTDNGGA---PYGLNWNRGSNFPLRGGKDML 280
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGGV+ VA ++S L++ KG+V + L+ +TDW+PT+Y +AG
Sbjct: 281 WEGGVKGVAFVYSDLIKQKGRVSKELIDVTDWVPTIYHLAG 321
>gi|38569407|ref|NP_942002.1| arylsulfatase B isoform 2 precursor [Homo sapiens]
gi|20809799|gb|AAH29051.1| Arylsulfatase B [Homo sapiens]
gi|119616228|gb|EAW95822.1| arylsulfatase B, isoform CRA_b [Homo sapiens]
Length = 413
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 269 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 321
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 366
>gi|84370328|ref|NP_001033588.1| arylsulfatase I precursor [Mus musculus]
gi|123779975|sp|Q32KI9.1|ARSI_MOUSE RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
gi|81158040|tpe|CAI84994.1| TPA: arylsulfatase I [Mus musculus]
gi|148677850|gb|EDL09797.1| mCG6034 [Mus musculus]
gi|187954139|gb|AAI38971.1| Arylsulfatase i [Mus musculus]
gi|187954429|gb|AAI41170.1| Arylsulfatase i [Mus musculus]
Length = 573
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 46/101 (45%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318
Query: 61 HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL+K + L+HITDW PTL +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKKKRRTSRALVHITDWYPTLVGLAG 359
>gi|158255166|dbj|BAF83554.1| unnamed protein product [Homo sapiens]
Length = 413
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 269 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 321
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 366
>gi|348583281|ref|XP_003477401.1| PREDICTED: arylsulfatase I [Cavia porcellus]
Length = 572
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 62/101 (61%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 267 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 319
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL RK L+HITDW PTL +AG
Sbjct: 320 WEGGVRGLGFIHSPLLKRKRWTSHALVHITDWYPTLVGLAG 360
>gi|825628|emb|CAA51272.1| arylsulfatase [Homo sapiens]
gi|189067435|dbj|BAG37417.1| unnamed protein product [Homo sapiens]
Length = 533
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 269 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 321
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLMKLARGHTN 366
>gi|296194262|ref|XP_002744878.1| PREDICTED: arylsulfatase B [Callithrix jacchus]
Length = 534
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 49/105 (46%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ GN NWPLRG K+S
Sbjct: 270 MVSLMDEAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 323 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 367
>gi|157831133|pdb|1FSU|A Chain A, 4-Sulfatase (Human)
Length = 492
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 49/105 (46%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ GN NWPLRG K+S
Sbjct: 228 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 280
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 281 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 325
>gi|87306948|ref|ZP_01089094.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
gi|87290321|gb|EAQ82209.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
Length = 455
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 42/101 (41%), Positives = 61/101 (60%), Gaps = 7/101 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDE+VG+++ ++ + +L N+L +F SDNGGP G N PLRG K++
Sbjct: 240 MVAALDEAVGQIVDEIQRQEMLDNTLFIFSSDNGGPE------PGKLTDNGPLRGGKHTL 293
Query: 61 HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGGVR A A W + G +E +HI DW PTL +AG
Sbjct: 294 YEGGVRVCAFASWKGRIAPGSKVEAPLHIVDWYPTLIELAG 334
>gi|332224806|ref|XP_003261559.1| PREDICTED: arylsulfatase B isoform 1 [Nomascus leucogenys]
Length = 535
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 271 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 323
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 324 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 368
>gi|179030|gb|AAA51779.1| arylsulfatase B precursor [Homo sapiens]
Length = 533
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 269 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 321
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 366
>gi|38569405|ref|NP_000037.2| arylsulfatase B isoform 1 precursor [Homo sapiens]
gi|114223|sp|P15848.1|ARSB_HUMAN RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
Flags: Precursor
gi|179077|gb|AAA51784.1| arylsulfatase B precursor (EC 3.1.6.1) [Homo sapiens]
gi|119616227|gb|EAW95821.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
gi|119616229|gb|EAW95823.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
Length = 533
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 269 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 321
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 366
>gi|410226860|gb|JAA10649.1| arylsulfatase B [Pan troglodytes]
gi|410292332|gb|JAA24766.1| arylsulfatase B [Pan troglodytes]
Length = 535
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 271 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 323
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 324 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 368
>gi|410226852|gb|JAA10645.1| arylsulfatase B [Pan troglodytes]
gi|410226856|gb|JAA10647.1| arylsulfatase B [Pan troglodytes]
gi|410226858|gb|JAA10648.1| arylsulfatase B [Pan troglodytes]
gi|410292328|gb|JAA24764.1| arylsulfatase B [Pan troglodytes]
Length = 534
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 270 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 323 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 367
>gi|301769831|ref|XP_002920339.1| PREDICTED: arylsulfatase B-like [Ailuropoda melanoleuca]
Length = 519
Score = 82.8 bits (203), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ GN NWPLRG K+S
Sbjct: 255 MVSLMDEAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 307
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 308 WEGGIRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGSTN 352
>gi|298710054|emb|CBJ31771.1| Formylglycine-dependent sulfatase, C-terminal fragment
Formylglycine-dependent sulfatase, N-terminal
[Ectocarpus siliculosus]
Length = 588
Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 49/106 (46%), Positives = 65/106 (61%), Gaps = 11/106 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+L LD+SVG +I LE+EG + NS+VV SDNGG + + SN+PLRG K S+
Sbjct: 291 VLMFLDKSVGHLIHYLEDEGWMENSIVVVASDNGGDSNEG-------GSNYPLRGKKASY 343
Query: 61 HEGGVRNVAALWS----PLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG + A ++S P R+G + LMH+TDWLPTL AG E
Sbjct: 344 WEGGSKVPAFVYSTSHIPEARRGSEYDGLMHVTDWLPTLAAGAGLE 389
>gi|171910063|ref|ZP_02925533.1| twin-arginine translocation pathway signal precursor
[Verrucomicrobium spinosum DSM 4136]
Length = 496
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 43/101 (42%), Positives = 62/101 (61%), Gaps = 7/101 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD +VGR++Q++ +E L+ N+L +F SDNGGP L SN PLR K +
Sbjct: 264 MVTALDAAVGRIVQAVRDEKLVENTLFIFSSDNGGPNPKQL------TSNGPLRAGKGTV 317
Query: 61 HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGGVR A A W ++ G ++ +HI DW PTL +AG
Sbjct: 318 YEGGVRVCAFATWPGKIKPGTSVQAPIHIADWFPTLLGLAG 358
>gi|351713085|gb|EHB16004.1| Arylsulfatase I [Heterocephalus glaber]
Length = 573
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 46/100 (46%), Positives = 62/100 (62%), Gaps = 8/100 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L+ G NS+++F SDNGG T F G SNWPLRG K ++
Sbjct: 267 MVTCMDEAVRNITWALKRYGFYNNSIIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 319
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
EGGVR + + SPLL RK + L+HITDW PTL +A
Sbjct: 320 WEGGVRGLGFIHSPLLKRKRRTSHALVHITDWYPTLVGLA 359
>gi|297675538|ref|XP_002815731.1| PREDICTED: arylsulfatase B [Pongo abelii]
Length = 534
Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 46/105 (43%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 270 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 322
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 323 WEGGIRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 367
>gi|47228126|emb|CAF97755.1| unnamed protein product [Tetraodon nigroviridis]
Length = 650
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 45/101 (44%), Positives = 65/101 (64%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+V V +L + G NS++++ +DNG F G SNWPLRG K ++
Sbjct: 278 MVSTVDEAVRNVTYALRKYGYYRNSVIIYSTDNGAQP-----FTG--GSNWPLRGRKGTY 330
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGG+R VA + SPLL R+ +V + L+HITDW PTL +AG
Sbjct: 331 WEGGIRGVAFVHSPLLRRRRRVSKALLHITDWFPTLVGLAG 371
>gi|405975640|gb|EKC40194.1| Arylsulfatase B [Crassostrea gigas]
Length = 484
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 43/106 (40%), Positives = 66/106 (62%), Gaps = 8/106 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDE++G + L E G+ ++L+VF SDNGG +D +GN N+PLRG K++
Sbjct: 240 MVSALDEAIGNITDQLRESGMYDDTLIVFTSDNGGSPMD----YGN---NYPLRGAKFTV 292
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKNI 105
+EGG R V+ + L+K G+ E++MH DW TL AG E ++
Sbjct: 293 YEGGTRVVSFVHGAGLQKTGEAYESMMHAVDWSDTLVAAAGGEPDM 338
>gi|281353470|gb|EFB29054.1| hypothetical protein PANDA_009046 [Ailuropoda melanoleuca]
Length = 431
Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 46/105 (43%), Positives = 64/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ +NWPLRG K+S
Sbjct: 167 MVSLMDEAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 219
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R V + SPLL+ KG L+HI+DWLPTL +A N
Sbjct: 220 WEGGIRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGSTN 264
>gi|260794559|ref|XP_002592276.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
gi|229277492|gb|EEN48287.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
Length = 520
Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/105 (41%), Positives = 66/105 (62%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M ++DE+VG + +L++ GL N+++VF +DNGG + +NWPLRG K S
Sbjct: 266 MTTAMDEAVGNITDALKQYGLWDNTVLVFSTDNGGQVMAG-------GNNWPLRGWKGSL 318
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R V + SPL++ KG+ + L+HI+DW PTL +AG N
Sbjct: 319 WEGGIRGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLAGGSTN 363
>gi|410913855|ref|XP_003970404.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
Length = 570
Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 68/108 (62%), Gaps = 9/108 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+V + L + G NS+++F +DNGG + SNWPLRG K ++
Sbjct: 268 MVSAVDEAVRNITYGLRKYGYYENSIMIFSTDNGGQPLSG-------GSNWPLRGRKGTY 320
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIA-GKEKNID 106
EGGVR + + SPLLRK + V + L+HITDW PTL +A GKE +I+
Sbjct: 321 WEGGVRGLGFIHSPLLRKKKRVSKALVHITDWYPTLVGLAGGKESHIE 368
>gi|214010121|ref|NP_001135731.1| arylsulfatase B precursor [Felis catus]
gi|461542|sp|P33727.1|ARSB_FELCA RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
Flags: Precursor
gi|258856|gb|AAB23941.1| arylsulfatase B [Felis catus]
Length = 535
Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 47/100 (47%), Positives = 64/100 (64%), Gaps = 8/100 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V +L+ GL N++ +F +DNGG T+ GN NWPLRG K+S
Sbjct: 271 MVSLMDEAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 323
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIA 99
EGG+R V + SPLL+ KG L+HI+DWLPTL +A
Sbjct: 324 WEGGIRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 363
>gi|47215546|emb|CAG06276.1| unnamed protein product [Tetraodon nigroviridis]
Length = 527
Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/105 (41%), Positives = 66/105 (62%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+V + L + G NS+++F +DNGG + SNWPLRG K ++
Sbjct: 268 MVSAVDEAVRNITYGLRKYGYYQNSIIIFSTDNGGQPLSG-------GSNWPLRGRKGTY 320
Query: 61 HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR + + SPLLRK +V + L+HITDW PTL +AG +++
Sbjct: 321 WEGGVRGLGFVHSPLLRKKRRVSKALVHITDWYPTLVGLAGGDES 365
>gi|291227809|ref|XP_002733875.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 505
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 41/101 (40%), Positives = 62/101 (61%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE +G + ++L+ G+L N+++VF SDNG FH NWPLRG K +
Sbjct: 247 MATCMDEGIGNITETLKASGMLNNTILVFSSDNGA------QFH-TGGRNWPLRGGKGTM 299
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EGGVR +A + SPLL+K + + + ++H +DW PT +AG
Sbjct: 300 WEGGVRVLAFVNSPLLKKSRTVNHEMIHTSDWFPTFVRLAG 340
>gi|390336076|ref|XP_784530.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 545
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 62/98 (63%), Gaps = 10/98 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +G V Q+L++ GL N++++F +DNGG + GN NWPLRG+K S
Sbjct: 292 MVSCMDEGIGNVTQALKDAGLYDNTIIIFSTDNGG----HIDRGGN---NWPLRGSKGSM 344
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
EGG+ V + SPLL KG V LMH++DWLPT+
Sbjct: 345 WEGGIHGVGFVHSPLLPDAVKGTVNHELMHVSDWLPTI 382
>gi|126697478|gb|ABO26696.1| sulfatase 1B precursor [Haliotis discus discus]
Length = 382
Score = 81.3 bits (199), Expect = 7e-14, Method: Composition-based stats.
Identities = 44/104 (42%), Positives = 61/104 (58%), Gaps = 9/104 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDE+V + +LEE GL+ N L+VF +DNGGP F+G A+N PLRG K +
Sbjct: 243 MISALDEAVANITNALEETGLIDNLLLVFTTDNGGP------FNG-AANNLPLRGCKATL 295
Query: 61 HEGGVRNVAALWSPLLRK--GQVLENLMHITDWLPTLYFIAGKE 102
EGG + ++S L K G + +MH DW PT +AG E
Sbjct: 296 WEGGTKGAGFIYSKTLLKKTGYLNTGMMHAVDWYPTFVELAGGE 339
>gi|443690889|gb|ELT92899.1| hypothetical protein CAPTEDRAFT_165852 [Capitella teleta]
Length = 484
Score = 81.3 bits (199), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 68/110 (61%), Gaps = 11/110 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D ++G I++L++ +L N+++VF+SDNGG + F GN NWPLRG+K +
Sbjct: 244 MVSAMDSAIGMTIKALKQRNMLTNTMIVFLSDNGG----TPFFGGN---NWPLRGSKATL 296
Query: 61 HEGGVRNVAALWSPLL--RKGQVLENLMHITDWLPTLYFIAG--KEKNID 106
EGG R + +W +L + G ++H DW PTL +AG EK++D
Sbjct: 297 WEGGTRVPSFVWGKMLLQKAGYTSNEMIHAVDWFPTLVSLAGGTPEKDVD 346
>gi|443698985|gb|ELT98690.1| hypothetical protein CAPTEDRAFT_103525, partial [Capitella teleta]
gi|443734460|gb|ELU18442.1| hypothetical protein CAPTEDRAFT_129771, partial [Capitella teleta]
Length = 333
Score = 80.9 bits (198), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 62/99 (62%), Gaps = 9/99 (9%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD VG++++ L E G++ N+ V+F+SDNG + L N P RG K S EGG
Sbjct: 233 LDAMVGKIVEKLIENGMMNNTYVIFVSDNGADPPEGL--------NTPFRGGKSSLFEGG 284
Query: 65 VRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKE 102
++ + ++SPLL+K + + LMHITDWLPTL +AG +
Sbjct: 285 TKSNSFIYSPLLKKTEYENDGLMHITDWLPTLVRLAGGQ 323
>gi|327265410|ref|XP_003217501.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Anolis
carolinensis]
Length = 580
Score = 80.9 bits (198), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 64/100 (64%), Gaps = 8/100 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L++ G NS++VF +DNGG T F G SNWPLRG K ++
Sbjct: 277 MVTCMDEAVKNITWALKKYGYYDNSVIVFSTDNGGQT-----FSG--GSNWPLRGRKGTY 329
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIA 99
EGGVR + + SPL++ K ++ L+HITDW PTL +A
Sbjct: 330 WEGGVRGIGFVHSPLIKHKRRISRALIHITDWYPTLVTLA 369
>gi|443705385|gb|ELU01963.1| hypothetical protein CAPTEDRAFT_143986, partial [Capitella teleta]
Length = 345
Score = 80.9 bits (198), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 9/97 (9%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD VG++++ LE GL+ N+ V+F+SDNG + L N P RG K S EGG
Sbjct: 245 LDAMVGKIVEKLENNGLMDNTYVIFVSDNGADPPEGL--------NTPFRGGKGSLFEGG 296
Query: 65 VRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
++ + ++SPLL K G + LMHITDWLPTL +AG
Sbjct: 297 TKSNSFVYSPLLNKTGYENDGLMHITDWLPTLVKLAG 333
>gi|432963754|ref|XP_004086820.1| PREDICTED: uncharacterized protein LOC101172478 [Oryzias latipes]
Length = 873
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 41/96 (42%), Positives = 60/96 (62%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDE++ + +L+ G N+++V+ SDNGG + SNWPLRG+K ++
Sbjct: 584 MVSCLDEAIHNLTLALKRYGYYDNTVIVYSSDNGGQPLAG-------GSNWPLRGSKATY 636
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTL 95
EGG+R V + SPLL +KG +L+HITDW PTL
Sbjct: 637 WEGGIRAVGFVHSPLLMKKGTKCRSLIHITDWFPTL 672
>gi|410906623|ref|XP_003966791.1| PREDICTED: arylsulfatase J-like [Takifugu rubripes]
Length = 560
Score = 80.5 bits (197), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 60/96 (62%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDE++ + +L+ G N+++V+ SDNGG + SNWPLRG+K S+
Sbjct: 259 MVSCLDEAIRNLTLALKRYGYYDNTVLVYSSDNGGQPLLG-------GSNWPLRGSKASY 311
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTL 95
EGG+R V + SPLLR KG +L+HITDW PTL
Sbjct: 312 WEGGIRAVGFVHSPLLRNKGTKCRSLIHITDWFPTL 347
>gi|340384741|ref|XP_003390869.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 490
Score = 80.5 bits (197), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDE + + L+ +GLL +++++ +DNGG T +SNWPLRG K +
Sbjct: 236 MMLALDEGISNITSLLQTKGLLDDTIIILTTDNGGQTALG-------SSNWPLRGNKATV 288
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTL 95
EGGVR ++ +WS LRK N +MHITDW PT+
Sbjct: 289 FEGGVRGISFVWSTKLRKSNYDNNAMMHITDWYPTI 324
>gi|340373449|ref|XP_003385254.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 491
Score = 80.5 bits (197), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDE + + L+ +GLL +++++ +DNGG T +SNWPLRG K +
Sbjct: 236 MMLALDEGISNITSLLQTKGLLDDTIIILTTDNGGQTALG-------SSNWPLRGNKATV 288
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTL 95
EGGVR ++ +WS LRK N +MHITDW PT+
Sbjct: 289 FEGGVRGISFVWSTKLRKSNYDNNAMMHITDWYPTI 324
>gi|260803290|ref|XP_002596523.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
gi|229281781|gb|EEN52535.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
Length = 492
Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 40/98 (40%), Positives = 63/98 (64%), Gaps = 10/98 (10%)
Query: 4 SLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEG 63
++DE+VG + ++L++ G+ N++++F +DNG SNWPLRG K + EG
Sbjct: 243 AMDEAVGNITRALKDHGMWENTVLIFSTDNGA---------SKAGSNWPLRGFKNTLWEG 293
Query: 64 GVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
GVR V + SPLL+ K ++ + L+HI+DW PT+ IAG
Sbjct: 294 GVRGVGFVSSPLLKVKQRISDALLHISDWFPTIVRIAG 331
>gi|291227811|ref|XP_002733876.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 539
Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 62/103 (60%), Gaps = 12/103 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG--PTVDSLHFHGNTASNWPLRGTKY 58
M +DE +G V +L++ G+ N++ +F+SDNG PT NWPLRG K
Sbjct: 281 MATCMDEGIGNVTNALKQRGMWDNTVFIFVSDNGAQFPT---------GGRNWPLRGGKG 331
Query: 59 SFHEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
S EGG+R V+ + SPL+ + Q N ++H+TDWLPTL +AG
Sbjct: 332 SVFEGGIRVVSFVTSPLIERPQRSSNEMIHLTDWLPTLTHLAG 374
>gi|391330456|ref|XP_003739676.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 631
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 63/109 (57%), Gaps = 5/109 (4%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
++SLD+S ++ L+ +L S+++F SDNGG T +F ASNWPLRG K +
Sbjct: 274 VESLDDSFAHMVDVLQRNNILNESVILFSSDNGGTTFKDPYFK-TGASNWPLRGQKNTQW 332
Query: 62 EGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE----KNID 106
EGGVR + +WS + + V E L H DWLPT + G + K+ID
Sbjct: 333 EGGVRVSSFVWSSQITEPYVSEELYHFVDWLPTFRRLGGGDIGDLKDID 381
>gi|449267146|gb|EMC78112.1| Arylsulfatase I [Columba livia]
Length = 573
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 63/100 (63%), Gaps = 8/100 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L++ G NS++VF +DNGG T F G SNWPLRG K ++
Sbjct: 269 MVTCMDEAVKNITWALKKYGYYDNSVIVFSTDNGGQT-----FSG--GSNWPLRGRKGTY 321
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
EGGVR + + SPL+ RK + L+HITDW PTL +A
Sbjct: 322 WEGGVRGIGFVHSPLIKRKRRTSWALVHITDWYPTLVSLA 361
>gi|326928585|ref|XP_003210457.1| PREDICTED: arylsulfatase I-like [Meleagris gallopavo]
Length = 574
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 63/100 (63%), Gaps = 8/100 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L++ G NS++VF +DNGG T F G SNWPLRG K ++
Sbjct: 269 MVTCMDEAVKNITWALKKYGYYDNSVIVFSTDNGGQT-----FSG--GSNWPLRGRKGTY 321
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
EGGVR + + SPL+ RK + L+HITDW PTL +A
Sbjct: 322 WEGGVRGIGFVHSPLIKRKRRTSWALVHITDWYPTLVSLA 361
>gi|125820285|ref|XP_692237.2| PREDICTED: arylsulfatase I-like [Danio rerio]
Length = 568
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 41/101 (40%), Positives = 62/101 (61%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L + G NS+V+F +DNG + SNWPLRG K ++
Sbjct: 264 MVSIVDEAVRNITYALRKYGFYRNSVVIFSTDNGAQPLTG-------GSNWPLRGCKGTY 316
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R V + SPL+R + ++ +L+HITDW PTL +AG
Sbjct: 317 WEGGIRGVGFVHSPLIRHRRRISRDLIHITDWYPTLVGLAG 357
>gi|50755099|ref|XP_425212.1| PREDICTED: arylsulfatase I [Gallus gallus]
Length = 574
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 63/100 (63%), Gaps = 8/100 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L++ G NS++VF +DNGG T F G SNWPLRG K ++
Sbjct: 269 MVTCMDEAVKNITWALKKYGYYDNSVIVFSTDNGGQT-----FSG--GSNWPLRGRKGTY 321
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
EGGVR + + SPL+ RK + L+HITDW PTL +A
Sbjct: 322 WEGGVRGIGFVHSPLIKRKRRTSWALVHITDWYPTLVSLA 361
>gi|348520018|ref|XP_003447526.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
Length = 732
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 44/100 (44%), Positives = 63/100 (63%), Gaps = 8/100 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+V V +L + G NS++++ +DNG F G +NWPLRG K ++
Sbjct: 269 MVSTVDEAVRNVTYALRKYGYYRNSVIIYSTDNGAQP-----FTG--GNNWPLRGRKGTY 321
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
EGGVR V + SPLL R+ +V + LMHITDW PTL +A
Sbjct: 322 WEGGVRGVGFVHSPLLRRRRRVSKALMHITDWFPTLVGLA 361
>gi|432911274|ref|XP_004078601.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
Length = 572
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 64/102 (62%), Gaps = 10/102 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
M+ ++DE+V V +L + G NS++++ +DNG P V SNWPLRG K +
Sbjct: 264 MVSTVDEAVRNVTYALRKYGYYKNSVIIYSTDNGAQPYVG--------GSNWPLRGRKGT 315
Query: 60 FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
+ EGGVR VA + SPLL+ + +V L+HITDW PTL +AG
Sbjct: 316 YWEGGVRGVAFVHSPLLKHRRRVSTALLHITDWFPTLVGLAG 357
>gi|260788430|ref|XP_002589253.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
gi|229274428|gb|EEN45264.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
Length = 449
Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/105 (42%), Positives = 63/105 (60%), Gaps = 9/105 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+VG V +L+ GL N++++F +DNG N SNWPLRG K +
Sbjct: 196 MVTAMDEAVGNVTDALKRTGLWDNTVLIFSTDNGA--------RENAGSNWPLRGWKNTL 247
Query: 61 HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + S LLRKG + + L+HI+DW PTL I+ N
Sbjct: 248 WEGGVRGVGFVNSKLLRKGNRKSDALIHISDWFPTLLRISRSVSN 292
>gi|432879612|ref|XP_004073512.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
Length = 673
Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 64/101 (63%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+V + +L + G NS+++F +DNGG + SNWPLRG K ++
Sbjct: 268 MVSTVDEAVRNITYALRKYGYYQNSVLIFSTDNGGQPLSG-------GSNWPLRGRKGTY 320
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGG+R + + SPLL+K +V + L+HITDW PTL +AG
Sbjct: 321 WEGGIRGLGFVHSPLLKKRKRVSKALVHITDWYPTLVGLAG 361
>gi|72159051|ref|XP_791089.1| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 545
Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 62/98 (63%), Gaps = 10/98 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE+VG + + L+++GL ++++VF SDNGG +D ASNWPLRG+K +
Sbjct: 286 MTYCMDEAVGNITKKLKKQGLWDDTVLVFSSDNGG-NID------QGASNWPLRGSKTTL 338
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
EGGVR V + SPLL KG V L+ I+DW PTL
Sbjct: 339 WEGGVRAVGFVTSPLLSERMKGTVSRELIDISDWYPTL 376
>gi|348542810|ref|XP_003458877.1| PREDICTED: arylsulfatase J [Oreochromis niloticus]
Length = 551
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 59/96 (61%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDE++ + +L+ G N+++V+ SDNGG + SNWPLRG+K ++
Sbjct: 245 MVSCLDEAIHNLTLALKHYGYYDNTVIVYSSDNGGQPLAG-------GSNWPLRGSKATY 297
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTL 95
EGG+R V + SPLL KG +L+HITDW PTL
Sbjct: 298 WEGGIRTVGFVHSPLLMNKGTKCRSLIHITDWFPTL 333
>gi|241680395|ref|XP_002412684.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215506486|gb|EEC15980.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 461
Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats.
Identities = 39/93 (41%), Positives = 58/93 (62%), Gaps = 3/93 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+SVG V ++L+ +G+L +++V+ SD+GG SNWPLRGTK +
Sbjct: 226 MVAALDQSVGDVFRALQRKGILEDTIVLLTSDSGGAAGG---VEQGVGSNWPLRGTKKTL 282
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLP 93
EGGVR V LWS L +G+V L+H + + P
Sbjct: 283 WEGGVRAVGLLWSAKLARGRVASQLIHRSAYFP 315
>gi|430746415|ref|YP_007205544.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430018135|gb|AGA29849.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 474
Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats.
Identities = 42/101 (41%), Positives = 61/101 (60%), Gaps = 7/101 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML ++DE+VG++ +++ +G+ N++ +F SDNGGP G SN PLRG K +
Sbjct: 248 MLAAMDEAVGQIASAIDGKGMRSNTIFLFSSDNGGPA------PGQVTSNGPLRGQKGTL 301
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGGVR A + W LR G V+ +HI D PTL +AG
Sbjct: 302 YEGGVRVPAFISWEGHLRPGTVVNAPLHIVDLFPTLLTLAG 342
>gi|224067708|ref|XP_002198824.1| PREDICTED: arylsulfatase I [Taeniopygia guttata]
Length = 575
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 63/100 (63%), Gaps = 8/100 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V + +L++ G NS++VF +DNGG T F G SNWPLRG K ++
Sbjct: 269 MVTCMDEAVKNITWALKKYGYYDNSVMVFSTDNGGQT-----FSG--GSNWPLRGRKGTY 321
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
EGGVR + + SPL+ RK + L+HITDW PTL +A
Sbjct: 322 WEGGVRGIGFVHSPLIKRKRRTSWALVHITDWYPTLVSLA 361
>gi|443692244|gb|ELT93884.1| hypothetical protein CAPTEDRAFT_107177, partial [Capitella teleta]
Length = 328
Score = 79.0 bits (193), Expect = 4e-13, Method: Composition-based stats.
Identities = 42/102 (41%), Positives = 62/102 (60%), Gaps = 11/102 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+ LD VG+V++ L + ++ N+ ++F+SDNG + L N P RG K S
Sbjct: 228 LTSGLDSMVGKVVELLNDNHMMDNTYLIFLSDNGADPAEGL--------NAPFRGGKGSL 279
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN--LMHITDWLPTLYFIAG 100
EGG ++ + ++SPLL+K + EN LMHITDWLPTL +AG
Sbjct: 280 FEGGTKSSSFIYSPLLKKTE-YENDGLMHITDWLPTLVKLAG 320
>gi|443692243|gb|ELT93883.1| hypothetical protein CAPTEDRAFT_107171, partial [Capitella teleta]
Length = 330
Score = 78.6 bits (192), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 40/99 (40%), Positives = 60/99 (60%), Gaps = 9/99 (9%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD VG+++ L E ++ N+ V+F+SDNG ++ N P RG K S EGG
Sbjct: 231 LDAMVGKIVDKLTENDMMNNTYVIFVSDNGADPIE--------GRNAPFRGGKGSLFEGG 282
Query: 65 VRNVAALWSPLLRKGQVL-ENLMHITDWLPTLYFIAGKE 102
++ + ++SPLL+K + + LMHITDWLPTL +AG +
Sbjct: 283 TKSSSFIYSPLLKKTEYENDGLMHITDWLPTLVKLAGGQ 321
>gi|291236518|ref|XP_002738186.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 473
Score = 78.6 bits (192), Expect = 4e-13, Method: Composition-based stats.
Identities = 41/103 (39%), Positives = 62/103 (60%), Gaps = 10/103 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +DE++G + ++L+++G+ N+++VF +DNGG ASN+PLRG K S
Sbjct: 215 MTSCMDEAIGNITRTLKDKGIWNNTVIVFSTDNGGARTFG-------ASNYPLRGQKASN 267
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
EGG+R A + SPLL +G + LM+I DW PT +AG
Sbjct: 268 FEGGIRGPAFVSSPLLDPLVRGTINNELMYIGDWFPTFMNLAG 310
>gi|260813923|ref|XP_002601665.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
gi|229286967|gb|EEN57677.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
Length = 478
Score = 78.6 bits (192), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 61/100 (61%), Gaps = 9/100 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+VG V +L+ L N++++F +DNG N SNWPLRG K +
Sbjct: 225 MVSAMDEAVGNVTDALKRTRLWDNTVLIFSTDNGA--------RRNAGSNWPLRGWKNTL 276
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIA 99
EGGVR V + S LLRKG+ + L+HI+DW PTL I+
Sbjct: 277 WEGGVRGVGFVTSKLLRKGKRKCDALIHISDWFPTLLRIS 316
>gi|335058614|gb|AEH26503.1| arylsulphatase [uncultured Acidobacteria bacterium A11]
Length = 491
Score = 78.6 bits (192), Expect = 5e-13, Method: Composition-based stats.
Identities = 50/117 (42%), Positives = 67/117 (57%), Gaps = 17/117 (14%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-------------GPTVDS-LHFHGNT 47
L+ LD SVGR++ +L+E GL +LVVF+SDNG GP D+ LH
Sbjct: 238 LEHLDNSVGRLMAALDELGLRERTLVVFMSDNGGVSRSYDPKPFTEGPGTDTQLHLANAE 297
Query: 48 ASNWPLRGTKYSFHEGGVRNVAAL--WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
N PLRG K S +EGG+R V L W ++ G+V+E H+ DWLPTL +AG +
Sbjct: 298 FPNAPLRGWKGSPYEGGIR-VPCLVRWPGVVAAGRVVEAPAHVVDWLPTLLEVAGAK 353
>gi|156359506|ref|XP_001624809.1| predicted protein [Nematostella vectensis]
gi|156211610|gb|EDO32709.1| predicted protein [Nematostella vectensis]
Length = 488
Score = 78.6 bits (192), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 58/101 (57%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE++G V +L+ GL N+L+VF +DNGG H N+PLRG K +
Sbjct: 239 MVDIMDEAIGNVTSALKSAGLWENTLLVFSTDNGG-------IHTAGGYNYPLRGEKNTL 291
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + P+ R G + LMH+TDW PTL +AG
Sbjct: 292 WEGGVRGAGFVSGPMAPRHGMIYNGLMHVTDWYPTLVHLAG 332
>gi|405977794|gb|EKC42228.1| Arylsulfatase I [Crassostrea gigas]
Length = 545
Score = 78.2 bits (191), Expect = 5e-13, Method: Composition-based stats.
Identities = 44/112 (39%), Positives = 69/112 (61%), Gaps = 13/112 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D+++G V+ + + +GN +VVF +DNGG + H GN N+PLRG+K +
Sbjct: 243 MVTAMDDAIGAVMAAFRKHKFMGNLIVVFTTDNGG----AAHIVGN---NFPLRGSKTTV 295
Query: 61 HEGGVRNVAALWSP-LLRK-GQVLENLMHITDWLPTLYFIAG----KEKNID 106
EGG R V+ ++S LL+K G + L H TDW PT+ +AG +EK +D
Sbjct: 296 WEGGTRAVSFVYSKNLLKKTGYTHDGLFHATDWFPTILAVAGGKPKQEKQLD 347
>gi|126697470|gb|ABO26692.1| sulfatase 1A precursor [Haliotis discus discus]
Length = 477
Score = 78.2 bits (191), Expect = 6e-13, Method: Composition-based stats.
Identities = 43/106 (40%), Positives = 59/106 (55%), Gaps = 9/106 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD++V + LE+ G + N L+VF +DNGGP A+NWPLRG K +
Sbjct: 243 MVSALDDAVYNITTVLEDLGFMDNLLLVFTTDNGGPPTRG-------ANNWPLRGGKNTL 295
Query: 61 HEGGVRNVAALWSPLL--RKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG R A ++S L +KG L H DW PTL IAG + +
Sbjct: 296 WEGGTRGAAFVYSKTLLKKKGYTHPGLFHAVDWYPTLLDIAGGDSS 341
>gi|167519809|ref|XP_001744244.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777330|gb|EDQ90947.1| predicted protein [Monosiga brevicollis MX1]
Length = 328
Score = 78.2 bits (191), Expect = 6e-13, Method: Composition-based stats.
Identities = 45/103 (43%), Positives = 63/103 (61%), Gaps = 5/103 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+ VG + ++L+ G+ N+++VF SDNGG V HG + +N PLRG K ++
Sbjct: 228 MVNVLDDIVGNITRALKANGMWNNTVLVFSSDNGG-AVPGAPQHG-SMNNHPLRGGKVAY 285
Query: 61 HEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIAG 100
EGGVR A + SPLL K G L L+HI+DW T +AG
Sbjct: 286 FEGGVRTAAFVASPLLPKSIRGTELRGLIHISDWYATFCHLAG 328
>gi|313219878|emb|CBY30794.1| unnamed protein product [Oikopleura dioica]
Length = 481
Score = 78.2 bits (191), Expect = 6e-13, Method: Composition-based stats.
Identities = 42/108 (38%), Positives = 62/108 (57%), Gaps = 13/108 (12%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
+D +G V++ L+E+G ++++V SDNGG T + ASNWPLRG K EGG
Sbjct: 246 VDHFIGEVVKKLKEKGFWKDTILVITSDNGGQTREG-------ASNWPLRGRKGDVFEGG 298
Query: 65 VRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG---KEKNID 106
+R+ A + SP L KG ++ H+TDW PTL +G + N+D
Sbjct: 299 IRSRAFIHSPKLPNSLKGSSFPHVFHVTDWFPTLLRFSGCQQPDSNLD 346
>gi|443321855|ref|ZP_21050894.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
gi|442788399|gb|ELR98093.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
Length = 469
Score = 78.2 bits (191), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 60/103 (58%), Gaps = 6/103 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++S+DE++G++IQSLE + + N+ V F+SDNGG + N N PLRG K S
Sbjct: 250 QVQSMDEAIGKIIQSLENQQVCDNTFVFFVSDNGGSVM-----RANRGDNRPLRGGKNSL 304
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
+EGGVR A + + P L Q + + I D PTL +AG E
Sbjct: 305 YEGGVRVPAIVSYPPKLSANQEINQIFSIVDLYPTLAKLAGVE 347
>gi|189521775|ref|XP_688265.2| PREDICTED: hypothetical protein LOC559800 [Danio rerio]
Length = 1542
Score = 77.8 bits (190), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 61/99 (61%), Gaps = 8/99 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDE+V + +L++ G N ++V+ SDNGG + SNWPLRG+K S+
Sbjct: 1247 MVSCLDEAVRNLTLALKQYGYYDNMVMVYSSDNGGQPMAG-------GSNWPLRGSKGSY 1299
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFI 98
EGG+R V + SPLL +KG + L+HITDW PTL +
Sbjct: 1300 WEGGIRAVGFVHSPLLVKKGTKSKALIHITDWYPTLVML 1338
>gi|346472067|gb|AEO35878.1| hypothetical protein [Amblyomma maculatum]
Length = 514
Score = 77.4 bits (189), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 63/103 (61%), Gaps = 8/103 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +D +VG +++ L+ + + N+L+VFISDNGG + + GN NWPLRG K +
Sbjct: 247 MLAVMDHAVGLIVKKLKMKDMWRNTLLVFISDNGG----QILYGGN---NWPLRGNKNTL 299
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKE 102
EGG R A + PL+R G+ +++H+ DW PTL AG E
Sbjct: 300 FEGGTRVPAFVAGPLIRNGGRNSSSIIHVVDWFPTLLSAAGGE 342
>gi|260794561|ref|XP_002592277.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
gi|229277493|gb|EEN48288.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
Length = 598
Score = 77.4 bits (189), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/105 (42%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +D++VG V ++L++ GL N++++F +DNGG VD NWPLRG K S
Sbjct: 346 MTTIMDQAVGNVTRALKQYGLWDNTVLIFSTDNGG-RVD------RGGINWPLRGWKGSL 398
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR V + SPL++ KG+ + L+HI+DW PTL +A N
Sbjct: 399 WEGGVRGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLASGSTN 443
>gi|313232487|emb|CBY24155.1| unnamed protein product [Oikopleura dioica]
Length = 481
Score = 77.0 bits (188), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 61/102 (59%), Gaps = 10/102 (9%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
+D +G V++ L+E+G ++++V SDNGG T + ASNWPLRG K EGG
Sbjct: 246 VDHFIGEVVKKLKEKGFWKDTILVITSDNGGQTREG-------ASNWPLRGRKGDVFEGG 298
Query: 65 VRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAGKEK 103
+R+ A + SP L KG +++ H+TDW PTL +G ++
Sbjct: 299 IRSRAFIHSPKLPDSLKGSSFQHVFHVTDWFPTLLRFSGCQQ 340
>gi|323454531|gb|EGB10401.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 530
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 39/98 (39%), Positives = 57/98 (58%), Gaps = 8/98 (8%)
Query: 6 DESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFH----GNTASNWPLRGTKYSFH 61
D SVGRV+++L+ G +S+++F SDNG H NT SNWPLRG K ++
Sbjct: 263 DRSVGRVVEALKATGAYNDSVILFASDNGAVPTTCNQSHCTSESNTGSNWPLRGMKATYW 322
Query: 62 EGGVRNVAALWSPLL----RKGQVLENLMHITDWLPTL 95
EGG R A + +P G + + ++HITDW+PT+
Sbjct: 323 EGGCRVPAFVHAPKYLGDRASGSLYQGIVHITDWIPTI 360
>gi|220906870|ref|YP_002482181.1| sulfatase [Cyanothece sp. PCC 7425]
gi|219863481|gb|ACL43820.1| sulfatase [Cyanothece sp. PCC 7425]
Length = 495
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 47/119 (39%), Positives = 67/119 (56%), Gaps = 15/119 (12%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT--------ASNWP 52
M+ ++D+ +G+V+ +LE+ G+ N+L+VF SDNGGP S F G A N P
Sbjct: 260 MITAMDDQIGQVVAALEKRGMRNNTLIVFQSDNGGPR--SAQFTGEVDTSGGTIPADNGP 317
Query: 53 LRGTKYSFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIA----GKEKNID 106
R K S +EGG R VA A W ++ G V+ + +HI D PTL +A GK K +D
Sbjct: 318 YRDGKASLYEGGTRVVALANWPGKIQPGTVVNHPIHIVDMYPTLTGLASVSVGKNKPLD 376
>gi|323454261|gb|EGB10131.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 635
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 42/99 (42%), Positives = 58/99 (58%), Gaps = 8/99 (8%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
+D +V +++++ G NS+V+F SDNG + SNWPLRG+K++ +EGG
Sbjct: 363 MDRAVESIVKTMHAVGAYDNSIVLFASDNGATLAQT-----GGGSNWPLRGSKFTPYEGG 417
Query: 65 VRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
VR A L SPLL R+G L H+TD LPTL AG
Sbjct: 418 VRVPAFLHSPLLGSGRRGITHAGLFHVTDILPTLVHAAG 456
>gi|405964717|gb|EKC30169.1| Arylsulfatase B [Crassostrea gigas]
Length = 898
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE++G + + +++G+ N+L+VF +DNGG ++ +N+PLRG K S
Sbjct: 645 MVAAMDEAIGNLTEVFKQKGIWNNTLMVFSTDNGGQILEG-------GNNYPLRGWKASL 697
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+ V + +L RKG V + LMH+TDW PTL +AG N
Sbjct: 698 WEGGMHGVGFVHGQMLKRKGMVSKALMHVTDWFPTLVSLAGGNLN 742
>gi|115947271|ref|XP_790151.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 500
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/98 (40%), Positives = 61/98 (62%), Gaps = 10/98 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDE+VG+++ SL++ G+ N++++F +DNGG +NWPLRG K S
Sbjct: 252 MMTCLDEAVGKIVHSLQQAGVWDNTVLIFSTDNGGEVAAG-------GNNWPLRGWKRSI 304
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
EGG+R V + S LL +G V + L+H++DW PTL
Sbjct: 305 WEGGMRGVGFVNSRLLPASVQGTVNKQLIHVSDWFPTL 342
>gi|260788446|ref|XP_002589261.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
gi|229274436|gb|EEN45272.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
Length = 470
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 62/100 (62%), Gaps = 9/100 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+ ++DE+VG V +L+ GL N++++F +DNG +T SNWPLRG K +
Sbjct: 218 QVTAMDEAVGNVTNALKNSGLWENTVLIFSTDNGA--------RRHTGSNWPLRGWKNTL 269
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
EGGVR V + S LL RKG+ + L+HI+DW PTL I+
Sbjct: 270 WEGGVRGVGFVNSNLLKRKGKTSDALIHISDWFPTLLRIS 309
>gi|260794113|ref|XP_002592054.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
gi|229277268|gb|EEN48065.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
Length = 478
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 61/100 (61%), Gaps = 9/100 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+VG V +L+ GL N++++F +DNG + SNWPLRG K +
Sbjct: 227 MVTAMDEAVGNVTDALKGSGLWDNTVLIFSTDNGA--------RRSAGSNWPLRGWKNTL 278
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
EGG+R V + S LL RKG + L+HI+DW PTL +A
Sbjct: 279 WEGGIRGVCFVTSNLLERKGTKSDALIHISDWFPTLIHLA 318
>gi|443706557|gb|ELU02545.1| hypothetical protein CAPTEDRAFT_109345 [Capitella teleta]
Length = 370
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 59/101 (58%), Gaps = 9/101 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ LD VG ++ L+ GL+ N+ ++F+SDNG + L N P RG K S
Sbjct: 217 LVSGLDRMVGGIVDGLKRNGLMNNTYIIFVSDNGADPEEGL--------NDPFRGGKGSL 268
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EGG ++ + ++SPLL K N LMHITDW+PT+ +AG
Sbjct: 269 FEGGTKSASFIYSPLLNKAGYENNGLMHITDWMPTILKLAG 309
>gi|313211935|emb|CBY17782.1| unnamed protein product [Oikopleura dioica]
Length = 342
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 62/101 (61%), Gaps = 10/101 (9%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
+D +G ++ +L+++G N+++V SDNGG + ASNWPLRG+K + EGG
Sbjct: 50 VDHYIGLIVNALKKKGFWQNTVMVVTSDNGGAREEG-------ASNWPLRGSKGTILEGG 102
Query: 65 VRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAGKE 102
+R A L SP++ KG+ +MH+TDW PTL ++G E
Sbjct: 103 IRGRAFLHSPIIPENLKGKSFPYVMHVTDWYPTLLRMSGCE 143
>gi|445495948|ref|ZP_21462992.1| sulfatase [Janthinobacterium sp. HH01]
gi|444792109|gb|ELX13656.1| sulfatase [Janthinobacterium sp. HH01]
Length = 471
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 10/96 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLD S+GRV+++LE++GL N++V+F SDNGG + WPL G K
Sbjct: 257 MVESLDASIGRVLKALEDKGLAENTIVIFTSDNGGERF---------SKTWPLTGQKTEL 307
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
EGG+R A + W + GQV E + DWLPTL
Sbjct: 308 LEGGIRVPAIVRWPAKIAAGQVSEQVAISMDWLPTL 343
>gi|443696989|gb|ELT97571.1| hypothetical protein CAPTEDRAFT_178894 [Capitella teleta]
Length = 503
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 60/101 (59%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+V V + ++ GL N+++VF +DNGG + +NWPLRG K S
Sbjct: 255 MVSCMDEAVKNVTDAFKKYGLWDNTILVFSTDNGGQVAEG-------GNNWPLRGWKGSL 307
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
+GG+R V + S LL+ KG V LMH++DW PTL AG
Sbjct: 308 WDGGMRGVGFVHSTLLKQKGAVAHQLMHVSDWFPTLLNQAG 348
>gi|115644393|ref|XP_781330.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 588
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 42/99 (42%), Positives = 62/99 (62%), Gaps = 10/99 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
++ LDE+V V ++L+ GL NS+++F +DNGG P S+ GN NWPLRG K +
Sbjct: 321 LVNILDEAVRNVTEALKYNGLYDNSVIIFTTDNGGRPKPRSV---GN---NWPLRGGKST 374
Query: 60 FHEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
EGG+R V + SPL+ +G V L+H++DW PT+
Sbjct: 375 LWEGGIRGVGFVHSPLIPWELRGTVNRQLIHVSDWFPTI 413
>gi|291233195|ref|XP_002736539.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 513
Score = 76.3 bits (186), Expect = 3e-12, Method: Composition-based stats.
Identities = 40/102 (39%), Positives = 60/102 (58%), Gaps = 10/102 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE++ + ++L + G+ N++++F +DNGG T GN NWPLRG K S
Sbjct: 255 MVTCMDEAIWNITKALMDTGMWDNTVLIFSTDNGGAT----SVGGN---NWPLRGGKRSM 307
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIA 99
EGG+R V + SPLL +G L+H+TDW PT +A
Sbjct: 308 FEGGIRGVGFVTSPLLDEAVRGTENNQLIHVTDWFPTFVHLA 349
>gi|443706067|gb|ELU02328.1| hypothetical protein CAPTEDRAFT_179702 [Capitella teleta]
Length = 501
Score = 75.9 bits (185), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 63/101 (62%), Gaps = 9/101 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ LD V + +LE++ LL N+ ++F++DNG SN P RGTK +
Sbjct: 237 LITGLDGMVKTIKDTLEKKNLLNNTYIIFLADNGADV--------GIGSNEPFRGTKSTL 288
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGGV++++ ++SPLL+K G + L+HITDW+PTL +AG
Sbjct: 289 WEGGVKSLSFIYSPLLKKTGYENDGLIHITDWMPTLVSLAG 329
>gi|405975641|gb|EKC40195.1| Arylsulfatase B [Crassostrea gigas]
Length = 684
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 40/106 (37%), Positives = 65/106 (61%), Gaps = 8/106 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDE++G + L E G+ ++L++F SDNGG ++GN N+PLRG K++
Sbjct: 440 MVSALDEAIGNITDQLRESGMYDDTLIIFTSDNGGWP----KYYGN---NYPLRGGKFTV 492
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKNI 105
+EGG R V+ + L+K G+ E++MH DW TL AG + ++
Sbjct: 493 YEGGTRVVSFVHGAGLQKTGEAYESIMHAVDWSDTLVEAAGGKPDL 538
>gi|291236278|ref|XP_002738066.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 508
Score = 75.5 bits (184), Expect = 4e-12, Method: Composition-based stats.
Identities = 43/105 (40%), Positives = 61/105 (58%), Gaps = 10/105 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +D+S+ V +L++ GL N++ +F +DNG T +G +WPL+G K S
Sbjct: 252 MATCMDDSIADVTGALKDTGLWDNTIFIFSTDNGAATA-----YG--GCSWPLKGGKTSL 304
Query: 61 HEGGVRNVAALWSPLLRKG-QVLEN--LMHITDWLPTLYFIAGKE 102
EGG+R V + SPLL Q EN LMH++DW PTL +AG E
Sbjct: 305 FEGGIRAVGFVTSPLLPPDMQGTENMELMHVSDWFPTLVHVAGGE 349
>gi|119504674|ref|ZP_01626753.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
gi|119459696|gb|EAW40792.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
Length = 545
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 36/96 (37%), Positives = 60/96 (62%), Gaps = 6/96 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ ++D+++GRV+ +L++EG+ N++V+F SDNGG + A+N PLRG K
Sbjct: 269 VVDAMDQAIGRVLDTLDQEGISDNTIVLFFSDNGGAA-----YSYGGANNAPLRGGKGET 323
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
EGG+R + + W +L GQ+ E +M + D PTL
Sbjct: 324 FEGGIRVTSLMRWPAMLEPGQIFEQIMSVMDVFPTL 359
>gi|156353132|ref|XP_001622929.1| predicted protein [Nematostella vectensis]
gi|156209565|gb|EDO30829.1| predicted protein [Nematostella vectensis]
Length = 175
Score = 75.1 bits (183), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 61/100 (61%), Gaps = 8/100 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE++G V ++L++ GL N+L+VF SDNGG + NWPLRG K +
Sbjct: 56 MVSAVDEAIGNVTKTLQQRGLWNNTLLVFSSDNGGVPLGG-------GYNWPLRGYKGNL 108
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIA 99
EGGVR A + +L+ KG + L+H++DW PT +A
Sbjct: 109 WEGGVRAAAFVHGKMLKNKGSRSKELLHVSDWYPTFTALA 148
>gi|402821074|ref|ZP_10870630.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
IMCC14465]
gi|402510105|gb|EJW20378.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
IMCC14465]
Length = 526
Score = 75.1 bits (183), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 63/111 (56%), Gaps = 12/111 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ S+D+++G+V+ +L+EEG+ N++V F SDNGG F + N P RG K
Sbjct: 268 MVDSMDQAIGKVLTALDEEGVADNTIVFFFSDNGG-------FENYGSDNGPYRGGKLEV 320
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA----GKEKNID 106
+EGG+R A + W L G ++ ++ + D LPTL A G EK ID
Sbjct: 321 YEGGIRVTAVMRWPEKLAAGSEVDEIVSVMDLLPTLTHAAGVENGTEKKID 371
>gi|298712440|emb|CBJ33216.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 726
Score = 74.7 bits (182), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 58/99 (58%), Gaps = 11/99 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+L LD+S+GR++ LE EG L NS++V SDNGG SN+PLRG K+S
Sbjct: 378 VLMYLDKSIGRLVDYLETEGWLENSIIVVASDNGG-------CPSTGGSNYPLRGLKHSN 430
Query: 61 HEGGVRNVAALWSPL----LRKGQVLENLMHITDWLPTL 95
EGGV+ A ++SP G LMH+TDWLPTL
Sbjct: 431 WEGGVKVPAFVYSPSHIPEEHWGTEYRGLMHVTDWLPTL 469
>gi|168701793|ref|ZP_02734070.1| twin-arginine translocation pathway signal precursor [Gemmata
obscuriglobus UQM 2246]
Length = 459
Score = 74.7 bits (182), Expect = 7e-12, Method: Composition-based stats.
Identities = 39/101 (38%), Positives = 59/101 (58%), Gaps = 7/101 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML ++DE+VG+++ ++E+ G+ N+L VF+SDNGGP G SN R K +
Sbjct: 237 MLAAMDEAVGQIVAAVEKAGVRKNTLFVFLSDNGGPQ------PGVVTSNGKFRAGKGTL 290
Query: 61 HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGGVR A A + +R G + +H+ DW PTL + G
Sbjct: 291 YEGGVRVAACAAFDGHIRPGTTITEPLHVVDWYPTLLKLGG 331
>gi|156380740|ref|XP_001631925.1| predicted protein [Nematostella vectensis]
gi|156218974|gb|EDO39862.1| predicted protein [Nematostella vectensis]
Length = 540
Score = 74.7 bits (182), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 44/107 (41%), Positives = 60/107 (56%), Gaps = 7/107 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ + D S+GR+I + GL N+L+++ SDNG + NWPLRG K S
Sbjct: 251 MMGAADWSIGRLIDVYKRNGLWNNTLMIWASDNGAQPGKGGGY------NWPLRGYKSSL 304
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGGVR A + +L RKG + +L H+TDW PTL +AG E D
Sbjct: 305 FEGGVRVPAFVHGEMLQRKGGTVNDLFHVTDWYPTLVKLAGGEVEPD 351
>gi|443704600|gb|ELU01579.1| hypothetical protein CAPTEDRAFT_176799 [Capitella teleta]
Length = 476
Score = 74.3 bits (181), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 62/101 (61%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+VG++ ++ + +++ F +DNGG +H GN NWPLRG K S
Sbjct: 227 MVAAMDEAVGKIRDLFKKYEMWDETVLAFSTDNGG----QIHRGGN---NWPLRGWKISL 279
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGG+R V + SPLL G + LMH++DWLPTL +AG
Sbjct: 280 WEGGMRGVGFVHSPLLPNSGGKSKGLMHVSDWLPTLVHVAG 320
>gi|325109241|ref|YP_004270309.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969509|gb|ADY60287.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 485
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 44/103 (42%), Positives = 60/103 (58%), Gaps = 7/103 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +LD+SVG V+Q+LE+ L N++V+F+SDNGGPT + T+SN PLRG K
Sbjct: 295 MLANLDDSVGEVLQTLEQHQLTSNTIVIFLSDNGGPTREL------TSSNAPLRGEKGDM 348
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
+EGG+R V W + G+V E + D PT A E
Sbjct: 349 YEGGIRVPVLMQWPGQIPAGRVEERPIISLDLFPTCLAAAEAE 391
>gi|194919165|ref|XP_001983032.1| GG19813 [Drosophila erecta]
gi|190647643|gb|EDV45031.1| GG19813 [Drosophila erecta]
Length = 299
Score = 74.3 bits (181), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 40/91 (43%), Positives = 54/91 (59%), Gaps = 4/91 (4%)
Query: 13 IQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGVRNVAALW 72
+ +L L NS+V+F SDNG P+V NT SN+PLRG K + EGGVR A+W
Sbjct: 1 MSTLAATDQLENSIVIFYSDNGAPSVG---MFANTGSNFPLRGQKNTPWEGGVRVAGAIW 57
Query: 73 SPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
S L+ +G + +++ DWLPTL AG E
Sbjct: 58 SSKLQARGSIFSQPLYVADWLPTLSHAAGIE 88
>gi|391325967|ref|XP_003737498.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 513
Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 56/100 (56%), Gaps = 3/100 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D ++G V+ ++ G GN+LV F +DNGGP + G SN+PLR K++
Sbjct: 260 MVSAVDTAIGNVMGAIRANGFAGNTLVFFTNDNGGPINANDRSPG---SNYPLRAGKFTL 316
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR W+P + K L HI D LPT+ AG
Sbjct: 317 WEGGVRGTGIFWAPQVLKPGKFGGLSHIVDVLPTILSAAG 356
>gi|326428223|gb|EGD73793.1| arylsulfatase [Salpingoeca sp. ATCC 50818]
Length = 697
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 42/105 (40%), Positives = 61/105 (58%), Gaps = 11/105 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE++ V +L GL ++L++F SDNGGP H H N N+P RG K +
Sbjct: 288 MVSAVDEAINNVTLALRRAGLFDDTLIIFSSDNGGP-----HDHAN---NYPFRGAKTAD 339
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN---LMHITDWLPTLYFIAGKE 102
EGGVR VA + ++ K V N LMH+ DW TL +AG++
Sbjct: 340 FEGGVRAVAFVSGGVVPKNMVGTNVHGLMHVCDWYSTLARLAGED 384
>gi|291235506|ref|XP_002737685.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 658
Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/103 (44%), Positives = 60/103 (58%), Gaps = 12/103 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
M +DE++G V +L+E GL ++ +F SDNGG PT F G SNWPLRG K
Sbjct: 419 MTSCMDEAIGNVTDTLKETGLWNKTVFIFSSDNGGTPT-----FGG---SNWPLRGRKGL 470
Query: 60 FHEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIA 99
++EGG+R V + S LL G L+HITDW PTL +A
Sbjct: 471 WYEGGIRAVGFVNSYLLSNHVTGSTNHELIHITDWFPTLLRLA 513
>gi|405956212|gb|EKC22964.1| Arylsulfatase B [Crassostrea gigas]
Length = 491
Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 42/105 (40%), Positives = 63/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE++G + ++ G+ N+L+VF +DNGG +H GN N+PLRG K S
Sbjct: 241 MVSAMDEAIGNLTDVFKKRGMWNNTLMVFSTDNGG----QIHAGGN---NFPLRGWKDSL 293
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGV V + +L RKG V +L+H++DW PTL + G N
Sbjct: 294 WEGGVHGVGLVHGQMLKRKGSVSRDLIHVSDWFPTLISMVGGTLN 338
>gi|6863176|gb|AAF30402.1|AF109924_1 sulfatase 1 precursor [Helix pomatia]
Length = 503
Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 63/105 (60%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDE V + Q+L+++GL N++++F +DNGG +H GN N+PLRG K S
Sbjct: 252 MVSALDEGVANLTQALKDKGLWNNTVLIFSTDNGG----QIHAGGN---NYPLRGWKASL 304
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG V + L R G V + L+H++DW PTL +AG N
Sbjct: 305 WEGGFHGVGFVSGGALKRSGAVSKGLIHVSDWFPTLVTLAGGNLN 349
>gi|326426859|gb|EGD72429.1| hypothetical protein PTSG_00448 [Salpingoeca sp. ATCC 50818]
Length = 540
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 43/117 (36%), Positives = 66/117 (56%), Gaps = 19/117 (16%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGN-------------- 46
M+ +DE+V V +L+ G+ ++L+V+ SDNG PT+ S F+G
Sbjct: 257 MISGVDEAVRNVTDALKAAGMWNDTLLVWTSDNGSPTMGS--FNGGCKNVVPDSRGGACM 314
Query: 47 TASNWPLRGTKYSFHEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
SN+PLRG+K++ EGGVR + P+L +G+ L+ LMH+ DW TL +AG
Sbjct: 315 CGSNYPLRGSKHTNFEGGVRTPTFVSGPVLPAHMRGKRLDGLMHVCDWYSTLCSVAG 371
>gi|313219585|emb|CBY30507.1| unnamed protein product [Oikopleura dioica]
Length = 617
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 46/110 (41%), Positives = 62/110 (56%), Gaps = 13/110 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD ++GR+I+S ++ G N++ F SDNGG SL + N N P RG K
Sbjct: 374 MVTGLDRAIGRIIESFKKYGKYKNTIFAFASDNGG----SLTYRAN---NMPFRGGKNDL 426
Query: 61 HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAGK--EKNI 105
EGGVR ++ L S L+ + G NL ITDW PTL AGK +KN+
Sbjct: 427 LEGGVRTLSFLHSSLIPNDKIGSRYSNLFDITDWFPTL-LDAGKCSQKNL 475
>gi|443321854|ref|ZP_21050893.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
gi|442788398|gb|ELR98092.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
Length = 476
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 44/111 (39%), Positives = 66/111 (59%), Gaps = 7/111 (6%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTV-DSLHFHGNTASNWPLRGTKYSF 60
++S+DE++G +IQSL+ + + N+L+VF SDNGG V + + F G N PLRG K +
Sbjct: 249 VQSMDEAIGSIIQSLQAKQVWNNTLLVFTSDNGGEIVANDIRFTGR-GDNRPLRGGKRNL 307
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG----KEKNID 106
+EGGVR + P L G+ +E + I D PT +AG +E+ ID
Sbjct: 308 YEGGVRVPTIISYPGHLSSGKTVEQMFSIVDLYPTFAKLAGLKINQEQQID 358
>gi|167534419|ref|XP_001748885.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772565|gb|EDQ86215.1| predicted protein [Monosiga brevicollis MX1]
Length = 690
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 41/104 (39%), Positives = 58/104 (55%), Gaps = 10/104 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPT---VDSLHFHG--NTASNWPLRG 55
M+ ++DE++G + +L + N+L++F +DNGGP D G + N+PLRG
Sbjct: 287 MVAAMDEAIGNITSALHARSMFDNTLIIFTTDNGGPVGSHNDKPQGIGCATGSQNYPLRG 346
Query: 56 TKYSFHEGGVRNVA----ALWSPLLRKGQVLENLMHITDWLPTL 95
K ++ +GGVR A A P LR G NLMH DWLPTL
Sbjct: 347 GKGAYFQGGVRGTAWVHGAQVHPALR-GTTNYNLMHAVDWLPTL 389
>gi|443693750|gb|ELT95037.1| hypothetical protein CAPTEDRAFT_126817, partial [Capitella teleta]
Length = 318
Score = 73.6 bits (179), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 60/101 (59%), Gaps = 9/101 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+ +LD VG+++Q L++ ++ N+ ++F+SDNG N P RG K S
Sbjct: 218 LTSALDSMVGKIVQHLKDNCMMDNTYLIFMSDNGADP--------GEGQNTPFRGGKTSL 269
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EGG ++ + ++S LL+K + + LMHITDWLPTL +AG
Sbjct: 270 FEGGTKSNSFIYSQLLKKTEYENDGLMHITDWLPTLVKLAG 310
>gi|300433302|gb|ADK13094.1| arylsulfatase [Dicathais orbita]
Length = 571
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 41/108 (37%), Positives = 65/108 (60%), Gaps = 10/108 (9%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+ ++D +G+V+ +L++ LL N+L+VF SDNGG T HF +A+N PLR K + +
Sbjct: 261 ITAMDTGIGKVVDALKQHNLLDNTLIVFQSDNGGDT----HF---SANNGPLRDRKTTLY 313
Query: 62 EGGVRNVAALWSP--LLRKGQVLENLMHITDWLPTLYFIAGKEK-NID 106
EGGV+ + + P L + ++L HI+DW T+ AG E N+D
Sbjct: 314 EGGVKVPSMAYGPGLLTNTPRTSDDLFHISDWFTTILSAAGLEAGNVD 361
>gi|196231555|ref|ZP_03130413.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196224408|gb|EDY18920.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 467
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/113 (40%), Positives = 62/113 (54%), Gaps = 12/113 (10%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGG--------PTVDSLHFHGNTA--SNWPLR 54
LD+SVG+++ +LE+ G GN+LV+F SDNG P H+ A SN PL
Sbjct: 255 LDDSVGKILVALEKTGKAGNTLVIFGSDNGAIPGVENNDPLYPPDHYPPGPAGGSNEPLH 314
Query: 55 GTKYSFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
G K +EGG+ A A W L+ G+ L L HITDW+PT +AG + D
Sbjct: 315 GMKGEVYEGGIHTAAVARWPGQLKPGKFL-GLAHITDWMPTFCALAGYKPEKD 366
>gi|449684458|ref|XP_002164438.2| PREDICTED: arylsulfatase I-like, partial [Hydra magnipapillata]
Length = 784
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 66/101 (65%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDESV + +L ++GLL N+++VF +D GG ++ASN+PLRG+K +
Sbjct: 214 MVLALDESVLNITIALAQKGLLDNTVIVFTTDAGGAVGGQ---ELSSASNFPLRGSKLTV 270
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG+R VA ++S L++ KG+V + H TDWL T +AG
Sbjct: 271 WEGGIRAVAFVYSDLIKNKGRVSLEMYHSTDWLLTTLGLAG 311
>gi|260819479|ref|XP_002605064.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
gi|229290394|gb|EEN61074.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
Length = 485
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 39/101 (38%), Positives = 59/101 (58%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDE+VG V ++++ GL N+LV+F +DNGG + + +N+PLRG K +
Sbjct: 200 MVSALDEAVGNVTMAMKKAGLWDNTLVIFTTDNGG-------WISASGNNYPLRGGKVTL 252
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGG R VA +L+K G ++H DW PT+ AG
Sbjct: 253 WEGGTRGVAFAHGKMLQKTGYTNNEMIHAVDWFPTILAAAG 293
>gi|388257120|ref|ZP_10134300.1| sulfatase [Cellvibrio sp. BR]
gi|387939324|gb|EIK45875.1| sulfatase [Cellvibrio sp. BR]
Length = 474
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 61/101 (60%), Gaps = 7/101 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D ++GRV+Q+L++EGL N+LV+F +DNGGP+ N + N PL GTK +
Sbjct: 261 MMLNMDRAIGRVLQTLQQEGLSENTLVIFTNDNGGPS------DHNGSINLPLSGTKANL 314
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + W + + G + ++ D LPT Y AG
Sbjct: 315 LEGGIRVPMIMRWPGVTKPGSSYDPMVSTLDLLPTFYAAAG 355
>gi|260786699|ref|XP_002588394.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
gi|229273556|gb|EEN44405.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
Length = 353
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 61/105 (58%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+++ RV +L+ +GL N+L +F+SDNGG +D SNWPLRG K +
Sbjct: 214 MVAALDDAIKRVTDALQGKGLWNNTLTIFMSDNGGDYLDG-------QSNWPLRGAKGTV 266
Query: 61 HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG R A A + L R G +MH D LPTL +AG ++
Sbjct: 267 WEGGTRVPAFAHGNMLERTGYTYHGMMHGVDILPTLVSVAGGTED 311
>gi|313212374|emb|CBY36362.1| unnamed protein product [Oikopleura dioica]
Length = 247
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/99 (38%), Positives = 61/99 (61%), Gaps = 11/99 (11%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
+D ++GRV+++L+ + NS++VF +DNGG + ++NWP + K S EGG
Sbjct: 101 MDRAIGRVVETLQRANMWENSIIVFANDNGGKS--------GLSNNWPFQRAKSSILEGG 152
Query: 65 VRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIAG 100
+R + + S LL K G+ +NLM +TDWLPT+ +AG
Sbjct: 153 IRVPSFIHSNLLPKSRLGERTDNLMDMTDWLPTMLSMAG 191
>gi|260832084|ref|XP_002610988.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
gi|229296357|gb|EEN66998.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
Length = 494
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 59/101 (58%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDE+VG V ++++ GL N+L++F +DNGG + S +N+PLRG K +
Sbjct: 242 MVSALDEAVGNVTMAMKKTGLWDNTLLIFSTDNGGMVIQS-------GNNYPLRGGKTTL 294
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGG R VA +L+K G ++H DW PTL AG
Sbjct: 295 WEGGTRGVAFAHGKMLQKTGYTNNEMIHAVDWFPTLLAAAG 335
>gi|351729940|ref|ZP_08947631.1| sulfatase [Acidovorax radicis N35]
Length = 446
Score = 72.8 bits (177), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 56/106 (52%), Gaps = 9/106 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLD +VG+V+ +L+E+GL N++VVF SDNGG + WP G K
Sbjct: 234 MVRSLDGAVGQVLHALQEQGLADNTIVVFTSDNGGERF---------SKTWPFTGQKTEL 284
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R L P QV + + DWLPTL AG + D
Sbjct: 285 LEGGIRVPTLLRWPARIAPQVSDQVTATMDWLPTLLAAAGVAPDAD 330
>gi|292620475|ref|XP_002664306.1| PREDICTED: arylsulfatase I-like [Danio rerio]
Length = 558
Score = 72.8 bits (177), Expect = 3e-11, Method: Composition-based stats.
Identities = 44/101 (43%), Positives = 63/101 (62%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+V V +L + G N+++ F +DNGG + F G SNWPLRG K ++
Sbjct: 258 MVSAVDEAVRNVTYALRKYGYYKNTVIFFSTDNGGQPL----FGG---SNWPLRGRKGTY 310
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SP L R+ +V L+HITDW PTL +AG
Sbjct: 311 WEGGVRGIGFVHSPLLRRRRRVSRALIHITDWYPTLMRLAG 351
>gi|326433715|gb|EGD79285.1| hypothetical protein PTSG_12912 [Salpingoeca sp. ATCC 50818]
Length = 562
Score = 72.8 bits (177), Expect = 3e-11, Method: Composition-based stats.
Identities = 45/118 (38%), Positives = 68/118 (57%), Gaps = 19/118 (16%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTV------DSLH-------FHGNT 47
M+K LD+++ + ++++G+ N+L+VF SDNGGP + D+ H ++G
Sbjct: 289 MVKYLDDNLYYITTLMQKKGMWNNTLMVFSSDNGGPVMAFDGDCDASHPSRGYMCYNGEA 348
Query: 48 -ASNWPLRGTKYSFHEGGVRNVAALWS----PLLRKGQVLENLMHITDWLPTLYFIAG 100
A+N+PLRG KYSF EGG+R V A S P ++G +MHI DW T +AG
Sbjct: 349 GANNYPLRGGKYSFFEGGIR-VNAFVSGGLIPADQRGSNRTGIMHIADWYATFCALAG 405
>gi|313232584|emb|CBY19254.1| unnamed protein product [Oikopleura dioica]
Length = 506
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 60/95 (63%), Gaps = 8/95 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M KS+D++VGR+I S + +G+ +++V F +DNGG T+ + GN NWPLRG K +
Sbjct: 237 MTKSMDDTVGRIINSFKNKGIWEDTIVYFTTDNGGSTL----YGGN---NWPLRGLKNTL 289
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
EGG+R + ++ P + + + LMH+ D PTL
Sbjct: 290 WEGGIRGIGSIKIPGVSADK-RDQLMHVVDMFPTL 323
>gi|313219945|emb|CBY43646.1| unnamed protein product [Oikopleura dioica]
Length = 1683
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 60/95 (63%), Gaps = 8/95 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M KS+D++VGR+I S + +G+ +++V F +DNGG T+ + GN NWPLRG K +
Sbjct: 1414 MTKSMDDTVGRIINSFKNKGIWEDTIVYFTTDNGGSTL----YGGN---NWPLRGLKNTL 1466
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
EGG+R + ++ P + + + LMH+ D PTL
Sbjct: 1467 WEGGIRGIGSIKIPGVSADK-RDQLMHVVDMFPTL 1500
>gi|158424485|ref|YP_001525777.1| twin-arginine translocation pathway signal [Azorhizobium
caulinodans ORS 571]
gi|158331374|dbj|BAF88859.1| twin-arginine translocation pathway signal precursor [Azorhizobium
caulinodans ORS 571]
Length = 490
Score = 72.4 bits (176), Expect = 4e-11, Method: Composition-based stats.
Identities = 46/113 (40%), Positives = 67/113 (59%), Gaps = 11/113 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPT----VDSLHFHGN-TASNWPLRG 55
M+ ++D+ +G+V+ +LE+ G+ N+L+VF SDNGG V FHG ASN P R
Sbjct: 259 MISAMDDQIGKVVAALEQRGMRENTLIVFHSDNGGTRSKMFVGEGAFHGELPASNAPYRD 318
Query: 56 TKYSFHEGGVRNVAAL--WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
K + +EGG R VAAL W + G + +MH+ D +PTL +AG N+D
Sbjct: 319 GKGTLYEGGTR-VAALANWPGRITPGAA-DGVMHVVDMMPTLARLAG--ANLD 367
>gi|374619563|ref|ZP_09692097.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374302790|gb|EHQ56974.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 539
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 40/103 (38%), Positives = 60/103 (58%), Gaps = 6/103 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ +D+S+GRV+ +L+ EGL N++V+F SDNGG + A N PLRG K
Sbjct: 263 VVDGMDQSIGRVLDTLDAEGLTDNTIVLFFSDNGGAV-----YAIGGADNAPLRGGKGDT 317
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R VA + W + G ++++M + D PTL AG E
Sbjct: 318 FEGGIRVVATMRWPEKIAPGGKVDSIMSVMDVFPTLLAAAGIE 360
>gi|407938758|ref|YP_006854399.1| sulfatase [Acidovorax sp. KKS102]
gi|407896552|gb|AFU45761.1| sulfatase [Acidovorax sp. KKS102]
Length = 452
Score = 72.0 bits (175), Expect = 5e-11, Method: Composition-based stats.
Identities = 40/106 (37%), Positives = 57/106 (53%), Gaps = 9/106 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLD++VG+V+++L+ +GL N++VVF SDNGG + WP G K
Sbjct: 240 MVRSLDKAVGQVLEALKAQGLADNTIVVFTSDNGGERF---------SKTWPFTGQKTEL 290
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R L P QV + + DWLPTL AG + D
Sbjct: 291 LEGGIRVPTLLRWPARIAPQVSDQVTATMDWLPTLLAAAGVAPHAD 336
>gi|223938010|ref|ZP_03629909.1| sulfatase [bacterium Ellin514]
gi|223893411|gb|EEF59873.1| sulfatase [bacterium Ellin514]
Length = 500
Score = 71.6 bits (174), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 60/101 (59%), Gaps = 5/101 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLDESVGRV+ L+E GL N++++F SDNGG + G SN PLR K
Sbjct: 278 MMQSLDESVGRVLDKLKELGLDKNTVIIFTSDNGGLSTA----EGAPTSNMPLRAGKGWP 333
Query: 61 HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGGVR + W + + V ++ + TD+ PTL IAG
Sbjct: 334 YEGGVREPLVVKWPGVTKAASVSDHQVMSTDYYPTLLEIAG 374
>gi|340367647|ref|XP_003382365.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 490
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 39/103 (37%), Positives = 60/103 (58%), Gaps = 11/103 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ D G +++ L+E+ + N+++V +DNGG ASN+PLRG KY+F
Sbjct: 236 MVSVADNVTGHLVELLKEKEMWDNTIMVISADNGGDQC--------VASNYPLRGCKYTF 287
Query: 61 HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR++A + LL R+GQ + +HI+DW T +AG
Sbjct: 288 FEGGVRSLAFVNGGLLPESRRGQSTDGFIHISDWYTTFCKLAG 330
>gi|340373733|ref|XP_003385394.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 389
Score = 71.6 bits (174), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 63/107 (58%), Gaps = 11/107 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ D G V++ L+++G+ N+L+V +DNGG + SN+PL+G+K +F
Sbjct: 236 MVSVADNVTGFVVKMLKDKGMWDNTLIVVSADNGGAPCE--------GSNYPLKGSKMTF 287
Query: 61 HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAGKEKN 104
+EGGVR++A + LL R+G+ E +HI DW T +AG + +
Sbjct: 288 YEGGVRSLAFVSGGLLPSDRRGKKTEGFIHIADWYTTFCQLAGVDSD 334
>gi|340367645|ref|XP_003382364.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 493
Score = 71.2 bits (173), Expect = 6e-11, Method: Composition-based stats.
Identities = 38/103 (36%), Positives = 59/103 (57%), Gaps = 11/103 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ D G +++ L+++G+ N+++V +DNGG SN+PL+G K +F
Sbjct: 235 MISVADNVTGHLVELLKKKGIWDNTIMVISADNGGAPC--------AGSNYPLKGCKTTF 286
Query: 61 HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR+VA + LL R+GQ + +HI DW PT +AG
Sbjct: 287 FEGGVRSVAFVNGGLLPESRRGQSTDGFIHIADWYPTFCKLAG 329
>gi|260816811|ref|XP_002603281.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
gi|229288599|gb|EEN59292.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
Length = 357
Score = 71.2 bits (173), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 57/101 (56%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +LD+++ RV +L+ +GL N+L +F+SDNGG V SNWPLRG K +
Sbjct: 223 MATALDDAIKRVTDALKGKGLWDNTLTIFMSDNGGNYVQG-------QSNWPLRGAKGTL 275
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGG R A + +L R G +MH D LPTL +AG
Sbjct: 276 WEGGTRVPAFVHGNMLERTGYTYHGMMHGVDILPTLVSVAG 316
>gi|109897220|ref|YP_660475.1| sulfatase [Pseudoalteromonas atlantica T6c]
gi|109699501|gb|ABG39421.1| sulfatase [Pseudoalteromonas atlantica T6c]
Length = 471
Score = 71.2 bits (173), Expect = 8e-11, Method: Composition-based stats.
Identities = 42/104 (40%), Positives = 58/104 (55%), Gaps = 7/104 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +LD + G V+ L+E GL ++LVVF +DNGGPT N +SN+PL GTK +F
Sbjct: 257 MTLALDRASGAVLNKLKELGLEDDTLVVFSNDNGGPT------DKNASSNYPLAGTKSNF 310
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGG+R W L G+V + + D LPT + G E+
Sbjct: 311 LEGGIRVPFLVKWPAKLAAGKVYDKPVSTLDLLPTFFKAGGGEE 354
>gi|374620849|ref|ZP_09693383.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374304076|gb|EHQ58260.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 551
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 36/96 (37%), Positives = 58/96 (60%), Gaps = 6/96 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ ++D+++G+V+ +L+EEGL N++V+F SDNGG + A N PLRG K
Sbjct: 272 VVDAMDQAIGQVLDTLDEEGLADNTIVLFFSDNGGAA-----YSYGGADNAPLRGGKGET 326
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
EGG+R V+ + W +L Q + +M + D PTL
Sbjct: 327 FEGGIRVVSLMRWPGVLEPAQSFDQIMTVMDVFPTL 362
>gi|260824679|ref|XP_002607295.1| hypothetical protein BRAFLDRAFT_88244 [Branchiostoma floridae]
gi|229292641|gb|EEN63305.1| hypothetical protein BRAFLDRAFT_88244 [Branchiostoma floridae]
Length = 226
Score = 70.9 bits (172), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 59/101 (58%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDE+VG V ++++ GL N+LV+F +DNGG + + +N+PLRG K +
Sbjct: 16 MVSALDEAVGNVTMAMKKAGLWDNTLVIFTTDNGG-------WISASGNNFPLRGGKTTL 68
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGG R VA +L+K G ++H DW PT+ AG
Sbjct: 69 WEGGTRGVAFAHGKMLQKTGYTNNEMIHAVDWFPTILEAAG 109
>gi|298706923|emb|CBJ29750.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 706
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 39/100 (39%), Positives = 60/100 (60%), Gaps = 13/100 (13%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
+L LD ++G +++ LE G + NS+++ SDNGG P+ SN+PLRG K+S
Sbjct: 407 VLMYLDYTIGELVEYLETNGWMENSIIIVASDNGGCPSCG--------GSNYPLRGIKHS 458
Query: 60 FHEGGVRNVAALWS----PLLRKGQVLENLMHITDWLPTL 95
+ EGG + + ++S P R G + LMH+TDWLPT+
Sbjct: 459 YWEGGTKVPSFVYSKSHIPEERWGTEYDGLMHVTDWLPTI 498
>gi|260794509|ref|XP_002592251.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
gi|229277467|gb|EEN48262.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
Length = 487
Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 57/96 (59%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDE+VG V +++ GL N+LV+F +DNGG + S +N+PLRG K +
Sbjct: 243 MVSALDEAVGNVTMAMKNAGLWDNTLVIFTTDNGGWIIAS-------GNNYPLRGGKVTL 295
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTL 95
EGG R VA + +L+K G ++H DW PT+
Sbjct: 296 WEGGTRGVAFVHGKMLQKTGYTNNEMIHAVDWFPTI 331
>gi|149176385|ref|ZP_01854999.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
gi|148844737|gb|EDL59086.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
Length = 498
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 39/101 (38%), Positives = 61/101 (60%), Gaps = 5/101 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++S+D++VG+V+Q LEE G+ N++V+ +DNGG + G+ SN PLRG K
Sbjct: 283 MVESMDKAVGKVLQQLEESGVAENTVVMLTADNGGLSTS----EGSPTSNLPLRGGKGWL 338
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R V + W G V + + TD+ PT+ +AG
Sbjct: 339 YEGGIREVFLIRWPGGTEPGSVCDEPVITTDFYPTILDLAG 379
>gi|392966318|ref|ZP_10331737.1| sulfatase [Fibrisoma limi BUZ 3]
gi|387845382|emb|CCH53783.1| sulfatase [Fibrisoma limi BUZ 3]
Length = 461
Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 57/101 (56%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD+++GRV+ +L G L N+LVVF SDNGG + A+N RG K F
Sbjct: 256 LIEHLDDNIGRVLATLRANGQLANTLVVFTSDNGG-------WGPGKANNGQFRGVKGQF 308
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R A+WS + G L+ + DW PTL AG
Sbjct: 309 YEGGIRIPTGAMWSGHIEAGSESGALLQLMDWFPTLLQAAG 349
>gi|219110117|ref|XP_002176810.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411345|gb|EEC51273.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 564
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 43/116 (37%), Positives = 60/116 (51%), Gaps = 15/116 (12%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML + DES+G V ++L++ G+ N+LVVF +DNGGPT +SN+P RG K +
Sbjct: 296 MLSAADESIGNVTKALQDAGMWENTLVVFTTDNGGPTAVCA---AQGSSNYPKRGGKCTV 352
Query: 61 HEGGVRNVAALWSPLL------RKGQVLENL------MHITDWLPTLYFIAGKEKN 104
+EGG + P RK + E L H+ DWLPTL + G N
Sbjct: 353 YEGGTTGDGFVSGPAWNKVARSRKKEYSETLELYSKVFHVVDWLPTLARMTGATPN 408
>gi|391326893|ref|XP_003737944.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 528
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 53/100 (53%), Gaps = 3/100 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D S+G ++ + G N+++ +DNGGPT + +SN PLRG KY+
Sbjct: 252 MVSAMDASIGELMHGISTNGFAENTIIFLTNDNGGPTNG---MASSGSSNHPLRGCKYTL 308
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG R A W P + + HITD LPTL +G
Sbjct: 309 WEGGTRGSALFWYPKKLGAGTYKGMAHITDVLPTLLTASG 348
>gi|291220870|ref|XP_002730451.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 519
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 60/107 (56%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +G + Q+L+E L N++++F SDNGG D +NWPLRG K +
Sbjct: 239 MVTCMDEGIGNITQTLKETELWNNTIIIFSSDNGGDPNDG-------GNNWPLRGEKGTH 291
Query: 61 HEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIAGKEKN 104
EG + + ++S + K G V ++H++DW PT+ +AG N
Sbjct: 292 WEGAIHGLGFVYSSDIAKDVRGTVNTEMIHVSDWFPTIVNLAGGSLN 338
>gi|149177349|ref|ZP_01855954.1| N-acetylgalactosamine-4-sulfatase precursor [Planctomyces maris DSM
8797]
gi|148843874|gb|EDL58232.1| N-acetylgalactosamine-4-sulfatase precursor [Planctomyces maris DSM
8797]
Length = 472
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 66/109 (60%), Gaps = 9/109 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML S+D+S+G++++ +++ GL +L+VF+SDNGGPT + T+SN PLRG K S
Sbjct: 279 MLSSMDQSIGKILKQVQQSGLDEKTLIVFLSDNGGPTREL------TSSNLPLRGEKGSM 332
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE--KNID 106
+EGG+R + W+ L Q ++ + D PT +AG +N+D
Sbjct: 333 YEGGLRVPFLMRWTGTLAPKQTIDVPVSSLDIFPTSVALAGASLPQNLD 381
>gi|260821541|ref|XP_002606091.1| hypothetical protein BRAFLDRAFT_88001 [Branchiostoma floridae]
gi|229291429|gb|EEN62101.1| hypothetical protein BRAFLDRAFT_88001 [Branchiostoma floridae]
Length = 311
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 61/101 (60%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE++G V ++++ GL N+L++F +DNGG + F GN N+PLRG K +
Sbjct: 75 MVSAMDEAIGNVTMAMKKAGLWDNTLLIFTADNGGWPL----FSGN---NFPLRGGKITL 127
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGG R A + +L+K G + ++H DW PT+ AG
Sbjct: 128 WEGGTRAAAFVHGKMLQKTGYTSDEMIHAVDWFPTILAAAG 168
>gi|405964467|gb|EKC29949.1| Arylsulfatase B [Crassostrea gigas]
Length = 482
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 35/101 (34%), Positives = 61/101 (60%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE++G + ++L+ +G+ ++L++F +DNGG ++ F GN N+PLRG K +
Sbjct: 238 MVTAMDEAIGNITKALKTKGIFNDTLIIFTTDNGG----AVKFAGN---NYPLRGGKATL 290
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
EGG R A + ++K L+H DW+PT+ AG
Sbjct: 291 FEGGTRATAFVTGAGIQKFNYEYSGLIHAVDWMPTVLSAAG 331
>gi|32473691|ref|NP_866685.1| N-acetylgalactosamine-4-sulfatase precursor [Rhodopirellula baltica
SH 1]
gi|32444227|emb|CAD74224.1| N-acetylgalactosamine-4-sulfatase precursor [Rhodopirellula baltica
SH 1]
Length = 480
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 37/73 (50%), Positives = 50/73 (68%), Gaps = 7/73 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +LD VGR+I+ L+++ L ++LVVF SDNGGPT + T+SN PLRG K S
Sbjct: 287 MLIALDRGVGRIIEKLDQQKLRQDTLVVFFSDNGGPTAEL------TSSNAPLRGGKGSL 340
Query: 61 HEGGVRNVAALWS 73
+EGGVR + +WS
Sbjct: 341 YEGGVR-IPMIWS 352
>gi|440718712|ref|ZP_20899155.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SWK14]
gi|436436039|gb|ELP29830.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SWK14]
Length = 480
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 37/73 (50%), Positives = 50/73 (68%), Gaps = 7/73 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +LD VGR+I+ L+++ L ++LVVF SDNGGPT + T+SN PLRG K S
Sbjct: 287 MLIALDRGVGRIIEKLDQQKLRQDTLVVFFSDNGGPTAEL------TSSNAPLRGGKGSL 340
Query: 61 HEGGVRNVAALWS 73
+EGGVR + +WS
Sbjct: 341 YEGGVR-IPMIWS 352
>gi|291226838|ref|XP_002733395.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 498
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 46/107 (42%), Positives = 66/107 (61%), Gaps = 6/107 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D++V VI +LE L N+L+VF +DNGGP + N +NWPLRG+K +
Sbjct: 247 MVTAMDDAVKVVIAALERYELWDNTLLVFTTDNGGPANPN-----NAGNNWPLRGSKLTL 301
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG R V + +L +KG V +MH+TDW PTL IAG + + D
Sbjct: 302 WEGGTRGVGFVHGKMLEKKGYVNNEMMHVTDWYPTLLHIAGGKADSD 348
>gi|323448512|gb|EGB04410.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 572
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 41/98 (41%), Positives = 56/98 (57%), Gaps = 8/98 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ D+ VGR++ +LE G+L N++V F SDNGG + + GN NWPL G K+S
Sbjct: 292 MVRYADDEVGRLVDALERRGMLANAVVTFHSDNGGEIMGAGACGGN---NWPLTGGKFSN 348
Query: 61 HEGGVRNVAALWS----PLLRKGQVLENLMHITDWLPT 94
EGG+R V A S P R+G L L + DW T
Sbjct: 349 FEGGIR-VNAFVSGGAVPEARRGAALSQLSTLWDWYAT 385
>gi|421613374|ref|ZP_16054460.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula baltica SH28]
gi|408495968|gb|EKK00541.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula baltica SH28]
Length = 480
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 37/73 (50%), Positives = 50/73 (68%), Gaps = 7/73 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +LD VGR+I+ L+++ L ++LVVF SDNGGPT + T+SN PLRG K S
Sbjct: 287 MLIALDRGVGRIIEKLDQQKLRQDTLVVFFSDNGGPTAEL------TSSNAPLRGGKGSL 340
Query: 61 HEGGVRNVAALWS 73
+EGGVR + +WS
Sbjct: 341 YEGGVR-IPMIWS 352
>gi|405964464|gb|EKC29946.1| Arylsulfatase B [Crassostrea gigas]
Length = 482
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 35/101 (34%), Positives = 61/101 (60%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE++G + ++L+ +G+ ++L++F +DNGG + F GN N+PLRG K +
Sbjct: 238 MVTAMDEAIGNITKALKTKGMFNDTLIIFTADNGG----WITFAGN---NYPLRGGKATV 290
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAG 100
EGG R A + ++K V ++H DW+PT+ AG
Sbjct: 291 FEGGTRATAFVSGAGIQKSNYVYPGMIHAVDWMPTVLRAAG 331
>gi|149175233|ref|ZP_01853855.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
gi|148845842|gb|EDL60183.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
Length = 459
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 41/100 (41%), Positives = 58/100 (58%), Gaps = 1/100 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLD+SVG+++ +L+ L N+LV+F SDNGG +F N +SN PLRG K +
Sbjct: 263 MIESLDQSVGKILSALKRLDLEQNTLVIFTSDNGGYLTYGKNFQ-NISSNGPLRGQKATL 321
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG R + P + V + H D LPTL AG
Sbjct: 322 YEGGHRVPCLISWPGVITAGVTDQTAHSVDLLPTLAQAAG 361
>gi|417302808|ref|ZP_12089892.1| arylsulfatase B [Rhodopirellula baltica WH47]
gi|327540882|gb|EGF27442.1| arylsulfatase B [Rhodopirellula baltica WH47]
Length = 480
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 37/73 (50%), Positives = 50/73 (68%), Gaps = 7/73 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +LD VGR+I+ L+++ L ++LVVF SDNGGPT + T+SN PLRG K S
Sbjct: 287 MLIALDRGVGRIIEKLDQQKLRQDTLVVFFSDNGGPTAEL------TSSNAPLRGGKGSL 340
Query: 61 HEGGVRNVAALWS 73
+EGGVR + +WS
Sbjct: 341 YEGGVR-IPMIWS 352
>gi|323452769|gb|EGB08642.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 1517
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 40/95 (42%), Positives = 58/95 (61%), Gaps = 8/95 (8%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
L+ +D++VG V+ +LEE + NS+++F SDNG HG SN PLRG+K
Sbjct: 326 LRLVDDAVGNVVAALEEARMDANSILIFASDNGANPE-----HG--GSNAPLRGSKGYLF 378
Query: 62 EGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
EGGVR A + +P L +G ++ H+TDW+PTL
Sbjct: 379 EGGVRVPAFVRAPKYLPRGATYDHPFHVTDWVPTL 413
>gi|406661473|ref|ZP_11069592.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
gi|405554747|gb|EKB49822.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
Length = 478
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 60/103 (58%), Gaps = 5/103 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD SVG V++ LEE+G+L N+L+VF SDNG V + HG +A + LR K
Sbjct: 248 VIQELDWSVGEVLRKLEEKGILENTLIVFSSDNGPWLV--MEDHGGSAGH--LREGKQFT 303
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGGVR A+W + GQV E+L DW PT + G E
Sbjct: 304 FEGGVRVPTVAMWKGKIEPGQVYEDLATQMDWFPTFSQLVGAE 346
>gi|325110760|ref|YP_004271828.1| Cerebroside-sulfatase,N-acetylgalactosamine-6-sulfatase
[Planctomyces brasiliensis DSM 5305]
gi|324971028|gb|ADY61806.1| Cerebroside-sulfatase,N-acetylgalactosamine-6-sulfatase
[Planctomyces brasiliensis DSM 5305]
Length = 932
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 38/100 (38%), Positives = 58/100 (58%), Gaps = 9/100 (9%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+++LD VG ++Q+++E G N+LV+F+SDNGG H +N PLRG+K++ +
Sbjct: 708 VETLDHHVGTILQAIDEAGQRENTLVIFLSDNGG--------HPEYCANAPLRGSKWNLY 759
Query: 62 EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R A W + V E + D LPTL +AG
Sbjct: 760 EGGIRVPFIARWPKTIAAASVCETPVVGYDLLPTLVDVAG 799
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 31/96 (32%), Positives = 52/96 (54%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ +D +GR++ ++ GL ++V+F SDNG + F N LRG+KY
Sbjct: 249 MIEVMDRGIGRLLDEIDALGLAEETIVLFASDNGPDPLTGERF------NLELRGSKYQV 302
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W+ + G V + ++ D LPTL
Sbjct: 303 YEGGLRVPLLIRWTNQVIPG-VRDQMVSFVDLLPTL 337
>gi|319899988|ref|YP_004159716.1| sulfatase [Bacteroides helcogenes P 36-108]
gi|319415019|gb|ADV42130.1| sulfatase [Bacteroides helcogenes P 36-108]
Length = 461
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 8/66 (12%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++++D SVGR++ ++E GL GN+LVVF SDNGGP + WPLR K SF
Sbjct: 251 MVENMDMSVGRILDAVERNGLTGNTLVVFTSDNGGPY--------EISHQWPLRAGKGSF 302
Query: 61 HEGGVR 66
+EGG+R
Sbjct: 303 YEGGIR 308
>gi|443700441|gb|ELT99395.1| hypothetical protein CAPTEDRAFT_208054 [Capitella teleta]
Length = 558
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 55/101 (54%), Gaps = 9/101 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ +D +G V+ L+ +G+ N+ + F SDNGG N N P RG K S
Sbjct: 286 LVSGMDAVIGNVVAKLKAKGMFDNTYIFFSSDNGGDV--------NEGDNSPWRGAKSSL 337
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
EGG + SPL+R KG + L+H+TDWLPT+ +AG
Sbjct: 338 FEGGCHAHSWASSPLIRKKGVEADGLLHMTDWLPTILTLAG 378
>gi|428202415|ref|YP_007081004.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
gi|427979847|gb|AFY77447.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
Length = 538
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 63/101 (62%), Gaps = 5/101 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++S+DE+VGR++ +L+E+ + N+L+ F SDNGG + D G+ N PLRG K +
Sbjct: 260 QVQSMDEAVGRIVAALKEKQVWDNTLLFFTSDNGGGS-DQPWTRGD---NRPLRGQKGTL 315
Query: 61 HEGGVRNVAALWSPL-LRKGQVLENLMHITDWLPTLYFIAG 100
++GGVR + P L+ GQV+E + + D PTL +AG
Sbjct: 316 YDGGVRVPTIISFPAQLKGGQVIEQVFSVVDLYPTLAKLAG 356
>gi|443702858|gb|ELU00682.1| hypothetical protein CAPTEDRAFT_125641 [Capitella teleta]
Length = 370
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 60/103 (58%), Gaps = 9/103 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ LD VG +++ + +L N+ ++F SDNGG V++L N P RG K S
Sbjct: 228 LVSGLDWMVGELLELVIANNMLDNTYIIFQSDNGGDAVEAL--------NGPYRGGKGSL 279
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKE 102
EGG + V ++SPLL+K G MHITDW+PT+ +AG E
Sbjct: 280 FEGGSKVVNFIYSPLLKKVGYENNGWMHITDWMPTIISLAGGE 322
>gi|283779108|ref|YP_003369863.1| sulfatase [Pirellula staleyi DSM 6068]
gi|283437561|gb|ADB16003.1| sulfatase [Pirellula staleyi DSM 6068]
Length = 468
Score = 69.3 bits (168), Expect = 2e-10, Method: Composition-based stats.
Identities = 37/95 (38%), Positives = 55/95 (57%), Gaps = 7/95 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D++VG+V+ LE E LL N+L+ F+SDNGGPT NT N PLRG K +
Sbjct: 245 MMSAMDDAVGQVVAKLEREKLLENTLIFFVSDNGGPTA------ANTGDNTPLRGFKATT 298
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPT 94
EGG+R W + G+ + + D++PT
Sbjct: 299 WEGGIRVPYFVSWKGKIPAGKTYDQPVIQIDFVPT 333
>gi|182677550|ref|YP_001831696.1| sulfatase [Beijerinckia indica subsp. indica ATCC 9039]
gi|182633433|gb|ACB94207.1| sulfatase [Beijerinckia indica subsp. indica ATCC 9039]
Length = 486
Score = 69.3 bits (168), Expect = 3e-10, Method: Composition-based stats.
Identities = 36/105 (34%), Positives = 56/105 (53%), Gaps = 10/105 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D +GR++Q+L+ G+ N++V+F SDNGG A WP G K
Sbjct: 260 MIAAMDLQIGRIVQALQANGISENTIVIFTSDNGGERF---------ADTWPFTGRKTEL 310
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R A + W ++ Q ++ + DWLPTL AG E +
Sbjct: 311 LEGGLRIPALVSWPARIKADQTIDQVSISMDWLPTLLAAAGSEPD 355
>gi|405964468|gb|EKC29950.1| Arylsulfatase B [Crassostrea gigas]
Length = 483
Score = 69.3 bits (168), Expect = 3e-10, Method: Composition-based stats.
Identities = 35/101 (34%), Positives = 61/101 (60%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE++G + ++L+ +G+ ++L+VF +DNGG + + GN N+PLRG K +
Sbjct: 239 MVTAMDEAIGNITKALKTKGMFNDTLIVFTADNGG----WITYAGN---NYPLRGGKTTV 291
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAG 100
EGG R A + ++K V ++H DW+PT+ AG
Sbjct: 292 FEGGTRATAFVSGAGIQKSNYVYPGMIHAVDWMPTVLSAAG 332
>gi|313228605|emb|CBY07397.1| unnamed protein product [Oikopleura dioica]
Length = 492
Score = 69.3 bits (168), Expect = 3e-10, Method: Composition-based stats.
Identities = 36/79 (45%), Positives = 51/79 (64%), Gaps = 10/79 (12%)
Query: 25 SLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGVRNVAALWSPLLR---KGQV 81
SL+VF SDNGG + ASNWPLRG+K + ++GG+++ A + SPLL+ KGQ
Sbjct: 284 SLLVFTSDNGGRPKEG-------ASNWPLRGSKGTIYDGGLKSRAFVASPLLQNRMKGQN 336
Query: 82 LENLMHITDWLPTLYFIAG 100
L H++DW PTL ++G
Sbjct: 337 YNYLFHVSDWFPTLLTLSG 355
>gi|443704179|gb|ELU01354.1| hypothetical protein CAPTEDRAFT_182406 [Capitella teleta]
Length = 548
Score = 69.3 bits (168), Expect = 3e-10, Method: Composition-based stats.
Identities = 38/103 (36%), Positives = 57/103 (55%), Gaps = 9/103 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ +D +G+V+ L+E+ + N+ + F SDNGG + N P RG K S
Sbjct: 244 LVSGMDAVIGKVVAKLKEKDMFDNTYIFFSSDNGGDV--------HEGDNSPKRGAKSSL 295
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKE 102
+GG + SPLL+K G + L+HITDWLPT+ +AG E
Sbjct: 296 FDGGCHAHSWASSPLLKKKGVEADGLLHITDWLPTIVKLAGGE 338
>gi|195166561|ref|XP_002024103.1| GL22855 [Drosophila persimilis]
gi|194107458|gb|EDW29501.1| GL22855 [Drosophila persimilis]
Length = 559
Score = 69.3 bits (168), Expect = 3e-10, Method: Composition-based stats.
Identities = 38/100 (38%), Positives = 52/100 (52%), Gaps = 24/100 (24%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+SVG+ +++L + G+L NS+V+ SDNG PTV H N SN+P RG
Sbjct: 255 MISSLDKSVGQTMRALADNGMLNNSIVLLYSDNGAPTVG---IHSNAGSNYPFRG----- 306
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
++G V +H DWLPTL AG
Sbjct: 307 ----------------QRGYVSNQAIHAIDWLPTLAAAAG 330
>gi|408793054|ref|ZP_11204664.1| type I phosphodiesterase/nucleotide pyrophosphatase [Leptospira
meyeri serovar Hardjo str. Went 5]
gi|408464464|gb|EKJ88189.1| type I phosphodiesterase/nucleotide pyrophosphatase [Leptospira
meyeri serovar Hardjo str. Went 5]
Length = 583
Score = 68.9 bits (167), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 62/107 (57%), Gaps = 6/107 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLD SVG+++ SL+EEGL N++V+F SDNG P ++ G N P RG K +
Sbjct: 362 MIRSLDRSVGKILASLKEEGLDENTIVIFTSDNGAP-----NYIGLPDVNSPYRGWKLTL 416
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
+GG+R A W ++ G +N + D LPT+ AG + +D
Sbjct: 417 FQGGIRVPYVAKWPGHIKPGTKYQNAITNIDILPTVANAAGIKLPLD 463
>gi|443724469|gb|ELU12468.1| hypothetical protein CAPTEDRAFT_223869 [Capitella teleta]
Length = 441
Score = 68.9 bits (167), Expect = 3e-10, Method: Composition-based stats.
Identities = 38/103 (36%), Positives = 57/103 (55%), Gaps = 9/103 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ +D +G+V+ L+E+ + N+ + F SDNGG + N P RG K S
Sbjct: 139 LVSGMDAVIGKVVAKLKEKDMFDNTYIFFSSDNGGDV--------HEGDNSPKRGAKSSL 190
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKE 102
+GG + SPLL+K G + L+HITDWLPT+ +AG E
Sbjct: 191 FDGGCHAHSWASSPLLKKKGVEADGLLHITDWLPTIVKLAGGE 233
>gi|340378605|ref|XP_003387818.1| PREDICTED: hypothetical protein LOC100637044 [Amphimedon
queenslandica]
Length = 2318
Score = 68.9 bits (167), Expect = 4e-10, Method: Composition-based stats.
Identities = 38/107 (35%), Positives = 56/107 (52%), Gaps = 11/107 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ D G V+ L E + N+L+V +DNGG SN+PL+G+KY+F
Sbjct: 1353 MVSVADNVTGHVVNLLRENDMWDNTLMVVSADNGGADC--------KGSNYPLKGSKYTF 1404
Query: 61 HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR +A L+ R+G+ + +HI DW T +AG + N
Sbjct: 1405 FEGGVRVIAFASGGLIPAGRRGKSTDGFIHIADWYTTFCKMAGVDHN 1451
>gi|398955521|ref|ZP_10676483.1| arylsulfatase A family protein [Pseudomonas sp. GM33]
gi|398150946|gb|EJM39513.1| arylsulfatase A family protein [Pseudomonas sp. GM33]
Length = 570
Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats.
Identities = 39/103 (37%), Positives = 62/103 (60%), Gaps = 6/103 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K LD+++G V++ LE+ G L N++VVF +DNG T+ + G T P +G K +
Sbjct: 305 MKQLDDNIGYVLKKLEDMGQLDNTIVVFTTDNGAETI-TFPDGGTT----PFKGGKLTTW 359
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGG+R + W +++ G VL ++ DW+PTL IAG +K
Sbjct: 360 EGGMRAPLVIRWPGVIKPGTVLTDIFASLDWVPTLVSIAGGKK 402
>gi|386821789|ref|ZP_10109005.1| arylsulfatase A family protein [Joostella marina DSM 19592]
gi|386426895|gb|EIJ40725.1| arylsulfatase A family protein [Joostella marina DSM 19592]
Length = 474
Score = 68.6 bits (166), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 61/108 (56%), Gaps = 8/108 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M LD + G V+ L+E G+ N+L+VF +DNGGP+ N +SN PL GTK +
Sbjct: 261 MTLGLDRACGNVLAKLDELGVRENTLIVFTNDNGGPS------GTNASSNKPLGGTKANH 314
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA-GKEKNID 106
EGG+R + W L++ V + + D LPT Y IA G EKNI+
Sbjct: 315 LEGGIRVPFLMSWPKQLKENTVYDYPISTLDLLPTFYQIAGGNEKNIN 362
>gi|418207595|gb|AFX62894.1| sulfatase 1A precursor, partial [Haliotis rufescens]
Length = 143
Score = 68.6 bits (166), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 47/85 (55%), Gaps = 10/85 (11%)
Query: 20 GLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGVRNVAALWSPLL--R 77
G + N L+VF +DNGGPT N ASNWPLRG K + EGG R ++S L +
Sbjct: 3 GFMDNLLLVFTTDNGGPT--------NIASNWPLRGGKGTLWEGGTRGAGFVYSKTLLKK 54
Query: 78 KGQVLENLMHITDWLPTLYFIAGKE 102
KG L H DW PTL IAG +
Sbjct: 55 KGYTHPGLFHAVDWYPTLLDIAGGD 79
>gi|323453557|gb|EGB09428.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 1605
Score = 68.6 bits (166), Expect = 5e-10, Method: Composition-based stats.
Identities = 37/98 (37%), Positives = 57/98 (58%), Gaps = 10/98 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+L +LD+SVGRV++ ++ + ++L VF SDNGG +N+PLRG K+
Sbjct: 337 LLLALDQSVGRVVEEMKRSDVWRDTLFVFASDNGGCWAQG-------GTNYPLRGGKHFL 389
Query: 61 HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTL 95
EGGV+ A ++S L+ + + L H+TDW PTL
Sbjct: 390 WEGGVKVPAFVYSELIPSSYRNRAHRGLFHVTDWYPTL 427
>gi|406834549|ref|ZP_11094143.1| Cerebroside-sulfatase [Schlesneria paludicola DSM 18645]
Length = 470
Score = 68.6 bits (166), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 62/110 (56%), Gaps = 6/110 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+ S+DE +G +++ ++E GL ++VVF SD+G T + H G +A + RG K+S
Sbjct: 263 FVSSIDERIGGLLKRVDELGLREKTIVVFQSDHGHSTEERAHLGGGSAGIY--RGAKFSL 320
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG---KEKNID 106
EGG+R A + W L +G+ L H DWLPTL +A E +ID
Sbjct: 321 FEGGIRVPAIISWPGRLPQGEARSQLAHGCDWLPTLAALAAVPLVEPDID 370
>gi|257092922|ref|YP_003166563.1| sulfatase [Candidatus Accumulibacter phosphatis clade IIA str.
UW-1]
gi|257045446|gb|ACV34634.1| sulfatase [Candidatus Accumulibacter phosphatis clade IIA str.
UW-1]
Length = 574
Score = 68.6 bits (166), Expect = 5e-10, Method: Composition-based stats.
Identities = 39/106 (36%), Positives = 61/106 (57%), Gaps = 6/106 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K +D+++G V++ LE+ G L N++VVF +DNG + + GNT P +G K +
Sbjct: 306 MKQMDDNIGYVLKKLEDMGQLDNTIVVFTTDNGAEVI-TYPDGGNT----PFKGGKLTTW 360
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R A + W ++ G VL ++ DW+PT IAG K D
Sbjct: 361 EGGMRAPAVIRWPGHIKPGTVLNDIFASYDWMPTFVEIAGGAKGND 406
>gi|357023853|ref|ZP_09086021.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
gi|355544286|gb|EHH13394.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
Length = 501
Score = 68.6 bits (166), Expect = 5e-10, Method: Composition-based stats.
Identities = 45/115 (39%), Positives = 71/115 (61%), Gaps = 15/115 (13%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-------ASNWPL 53
M+ +D+ VG+V+ +LE++G+ N+++VF SDNGG V + F G++ ASN P
Sbjct: 272 MISVMDDGVGKVVAALEKKGMRDNTIIVFHSDNGG--VKNSLFAGDSKVGGELPASNGPY 329
Query: 54 RGTKYSFHEGGVRNVAAL--WSPLLRKGQVLENLMHITDWLPTLYFIAGK--EKN 104
R K + +EGG R VAAL W ++ G V+++++H+ D PTL +A EKN
Sbjct: 330 RDGKGTLYEGGTR-VAALANWPGKIKPG-VVDSMVHVVDMYPTLAAVADAKLEKN 382
>gi|426408644|ref|YP_007028743.1| arylsulfatase [Pseudomonas sp. UW4]
gi|426266861|gb|AFY18938.1| arylsulfatase [Pseudomonas sp. UW4]
Length = 570
Score = 68.2 bits (165), Expect = 5e-10, Method: Composition-based stats.
Identities = 39/103 (37%), Positives = 62/103 (60%), Gaps = 6/103 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K LD+++G V++ LE+ G L N++VVF +DNG T+ + G T P +G K +
Sbjct: 305 MKQLDDNIGYVLKKLEDMGQLDNTIVVFTTDNGAETI-TFPDGGTT----PFKGGKLTTW 359
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGG+R + W +++ G VL ++ DW+PTL IAG +K
Sbjct: 360 EGGMRAPLVIRWPGVIKPGTVLTDIFASLDWVPTLVNIAGGKK 402
>gi|336413800|ref|ZP_08594149.1| hypothetical protein HMPREF1017_01257 [Bacteroides ovatus
3_8_47FAA]
gi|335934817|gb|EGM96800.1| hypothetical protein HMPREF1017_01257 [Bacteroides ovatus
3_8_47FAA]
Length = 465
Score = 68.2 bits (165), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/109 (36%), Positives = 66/109 (60%), Gaps = 4/109 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D++VG++IQ L+E+G+L N++++F SDNG V H SN P RG K
Sbjct: 258 MVSEIDKNVGQIIQLLKEKGILENTIIIFSSDNGVHCVGG-HEPDFFDSNGPYRGYKRDL 316
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE--KNID 106
+EGGVR + W +++K + +E++ D+LPT+ + G + K ID
Sbjct: 317 YEGGVRAPFIVSWPKMIKKKRTVEHITTFWDFLPTVTELVGADMVKGID 365
>gi|154248610|ref|YP_001419568.1| sulfatase [Xanthobacter autotrophicus Py2]
gi|154162695|gb|ABS69911.1| sulfatase [Xanthobacter autotrophicus Py2]
Length = 491
Score = 68.2 bits (165), Expect = 6e-10, Method: Composition-based stats.
Identities = 45/116 (38%), Positives = 61/116 (52%), Gaps = 11/116 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-----ASNWPLRG 55
M+ ++D+ +G V+ +L G+ N+L+VF SDNGG G ASN P R
Sbjct: 261 MITAMDDQIGHVVAALTSRGMRENTLIVFHSDNGGTRSKMFAGEGAVAGDLPASNAPYRD 320
Query: 56 TKYSFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG----KEKNID 106
K S +EGG R VA A W + G E +MH+ D LPTL +AG K K +D
Sbjct: 321 GKGSLYEGGTRVVALANWPGRIAPGAA-EGVMHVVDMLPTLAKLAGASLAKSKPLD 375
>gi|221068581|ref|ZP_03544686.1| sulfatase [Comamonas testosteroni KF-1]
gi|220713604|gb|EED68972.1| sulfatase [Comamonas testosteroni KF-1]
Length = 440
Score = 68.2 bits (165), Expect = 6e-10, Method: Composition-based stats.
Identities = 40/107 (37%), Positives = 59/107 (55%), Gaps = 11/107 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++++LD++VG+V+Q+L++ G N++V+F SDNGG + WP G K
Sbjct: 229 IVEALDKAVGQVLQALDDTGQADNTIVIFTSDNGGERF---------SKTWPFTGQKTEL 279
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R L WS +R QV E + DWLPTL AG + D
Sbjct: 280 LEGGIRVPTLLRWSARIRP-QVQEQVTASFDWLPTLLAAAGARPHPD 325
>gi|313233524|emb|CBY09696.1| unnamed protein product [Oikopleura dioica]
Length = 609
Score = 68.2 bits (165), Expect = 6e-10, Method: Composition-based stats.
Identities = 37/100 (37%), Positives = 58/100 (58%), Gaps = 17/100 (17%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D ++ +V+++ ++ +++++F SDNGG + G ASNWPLRG K S
Sbjct: 321 MVTAMDTAIYKVVKAYKKFKFWDDTILIFSSDNGG------NLKGG-ASNWPLRGAKGSL 373
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + + SPLL +TDW PTL +AG
Sbjct: 374 FEGGVRTIGFVHSPLLP----------VTDWFPTLLQLAG 403
>gi|430741545|ref|YP_007200674.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430013265|gb|AGA24979.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 474
Score = 68.2 bits (165), Expect = 7e-10, Method: Composition-based stats.
Identities = 43/101 (42%), Positives = 57/101 (56%), Gaps = 3/101 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +LDE+ G VI+ L GL +L++F SDNGGPT+ +G + N PLRG+K +
Sbjct: 251 MLSALDEAAGNVIERLRAAGLEEKTLIIFFSDNGGPTMAGTTTNG--SKNDPLRGSKRTT 308
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR L W L G+V + + D LPT AG
Sbjct: 309 LEGGVRVPFVLSWKGTLPAGKVYDQPIIQLDILPTALAAAG 349
>gi|406833200|ref|ZP_11092794.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 476
Score = 68.2 bits (165), Expect = 7e-10, Method: Composition-based stats.
Identities = 41/107 (38%), Positives = 56/107 (52%), Gaps = 2/107 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDE VGR++ L GL N+LV+F SDN GP +S H + + PLRG K S
Sbjct: 268 MISRLDEYVGRMLDVLRANGLAENTLVIFTSDN-GPHQESNHDTARFSPSGPLRGMKRSL 326
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R A W + G V ++ + DW+ T +AG D
Sbjct: 327 TEGGIRVPFIAWWPGRISAGSVSNHVGYFGDWMATAAELAGTAPTAD 373
>gi|323455793|gb|EGB11661.1| hypothetical protein AURANDRAFT_70842 [Aureococcus anophagefferens]
Length = 1489
Score = 68.2 bits (165), Expect = 7e-10, Method: Composition-based stats.
Identities = 35/94 (37%), Positives = 55/94 (58%), Gaps = 10/94 (10%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD ++ +V+ ++++ G NS++V SDNGG + G+ N+P RG K +EGG
Sbjct: 308 LDNAMKKVVGAMKKSGFYDNSILVVASDNGGCS-------GSGGYNYPYRGGKQYLYEGG 360
Query: 65 VRNVAALWSPLLRK---GQVLENLMHITDWLPTL 95
+ A + SPL+ K G + L H++DWLPTL
Sbjct: 361 IHVNAFIHSPLIPKSARGTSYQGLFHVSDWLPTL 394
>gi|302535739|ref|ZP_07288081.1| arylsulfatase [Streptomyces sp. C]
gi|302444634|gb|EFL16450.1| arylsulfatase [Streptomyces sp. C]
Length = 495
Score = 67.8 bits (164), Expect = 7e-10, Method: Composition-based stats.
Identities = 43/102 (42%), Positives = 53/102 (51%), Gaps = 12/102 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ LD SVG V+Q+L G N+LVVF SDNGG + NWPL G K S
Sbjct: 308 MVEDLDRSVGEVLQALRRSGQEENTLVVFSSDNGGERF---------SYNWPLSGNKASL 358
Query: 61 HEGGVR--NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R N+ W L GQV + DW TL +AG
Sbjct: 359 QEGGIRVPNIVR-WPARLDGGQVSRVPVFSPDWTATLLEVAG 399
>gi|241634070|ref|XP_002410502.1| arylsulfatase J, putative [Ixodes scapularis]
gi|215503435|gb|EEC12929.1| arylsulfatase J, putative [Ixodes scapularis]
Length = 480
Score = 67.8 bits (164), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 54/95 (56%), Gaps = 4/95 (4%)
Query: 9 VGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGVRNV 68
+G + SL + + L++ SDNGG N NWPLRG K + EGGVR
Sbjct: 222 LGLALVSLVKTKNYVDILLMGFSDNGG---SPFGLFSNRGFNWPLRGAKANLWEGGVRVP 278
Query: 69 AALWS-PLLRKGQVLENLMHITDWLPTLYFIAGKE 102
A +WS L+K +V + LMHI+DWLPTLY AG +
Sbjct: 279 AFVWSSKFLKKPRVSDQLMHISDWLPTLYSAAGGD 313
>gi|260062070|ref|YP_003195150.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
gi|88783632|gb|EAR14803.1| arylsulphatase A [Robiginitalea biformata HTCC2501]
Length = 459
Score = 67.8 bits (164), Expect = 7e-10, Method: Composition-based stats.
Identities = 43/108 (39%), Positives = 63/108 (58%), Gaps = 10/108 (9%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
++ LD SVGRV+++L+ GL N+LVVF+SDNGG +L + SN PLRG K +
Sbjct: 263 VEHLDYSVGRVMEALKTTGLEENTLVVFVSDNGG----ALWY---AQSNGPLRGGKQDMY 315
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE--KNID 106
EGG+R A W + G +N + D PT +AG++ +N+D
Sbjct: 316 EGGIRVPAIFYWKGKIAPGTTSDNTALLMDLFPTFCELAGRKPPENVD 363
>gi|284035516|ref|YP_003385446.1| sulfatase [Spirosoma linguale DSM 74]
gi|283814809|gb|ADB36647.1| sulfatase [Spirosoma linguale DSM 74]
Length = 522
Score = 67.8 bits (164), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 62/104 (59%), Gaps = 1/104 (0%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D+S+G V+ L+E L GN++++F+SDNGG ++ L N PLR K S
Sbjct: 290 LVEGMDKSLGDVLAFLDERKLTGNTVIIFMSDNGGLSMSPLRGGQAHTHNLPLRVGKGSV 349
Query: 61 HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
+EGG+R + W +++ G + E + I D+ PT+ IAG K
Sbjct: 350 YEGGIREPMLVRWPGVVKAGSLTEQPVIIEDFFPTILAIAGGPK 393
>gi|343086462|ref|YP_004775757.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354996|gb|AEL27526.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 472
Score = 67.8 bits (164), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 61/104 (58%), Gaps = 9/104 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+++LD ++GR+I L+E+ L N+ + F++DNGG L+ PLR K S+
Sbjct: 251 MVENLDRNIGRLINLLQEKKLFENTFIFFVTDNGG-----LY---KVTKQRPLRAGKGSY 302
Query: 61 HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
+EGG+R A A+W +R GQV E + D PTL +AG EK
Sbjct: 303 YEGGIRIPAFAVWPGEIRPGQVSEVPITNLDIFPTLLELAGIEK 346
>gi|299530773|ref|ZP_07044188.1| sulfatase [Comamonas testosteroni S44]
gi|298721289|gb|EFI62231.1| sulfatase [Comamonas testosteroni S44]
Length = 440
Score = 67.8 bits (164), Expect = 8e-10, Method: Composition-based stats.
Identities = 39/107 (36%), Positives = 60/107 (56%), Gaps = 11/107 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++++LD++VG+V+Q+L++ G N++V+F SDNGG + WP G K
Sbjct: 229 IVEALDKAVGQVLQALDDSGQADNTIVIFTSDNGGERF---------SKTWPFTGQKTEL 279
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R L WS ++ QV E + DWLPTL AG ++ D
Sbjct: 280 LEGGIRVPTLLRWSARIQP-QVQEQITASFDWLPTLLAAAGARQHPD 325
>gi|114798452|ref|YP_760375.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
gi|114738626|gb|ABI76751.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
Length = 508
Score = 67.8 bits (164), Expect = 8e-10, Method: Composition-based stats.
Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 6/101 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML+++D SVGR+++ LEEEGL N+++VF SDNGG + G N P RG K +
Sbjct: 285 MLRAVDRSVGRIMEKLEEEGLADNTIIVFSSDNGGAG-----YIGIPEVNAPYRGWKITL 339
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + W ++ G V++ + D +PTL G
Sbjct: 340 FEGGIRVPLFMKWPGTIQPGTVIDEPVAHIDLMPTLAAATG 380
>gi|149197407|ref|ZP_01874458.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149139425|gb|EDM27827.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 454
Score = 67.8 bits (164), Expect = 8e-10, Method: Composition-based stats.
Identities = 36/96 (37%), Positives = 57/96 (59%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML ++D+++GRV +LE +G+L N+L+VF SDNGG N + N PL+G K F
Sbjct: 240 MLSAMDDNIGRVFDALEHQGILDNTLIVFWSDNGGKP------DNNYSLNHPLKGQKTQF 293
Query: 61 HEGGVRNVAALWSP--LLRKGQVLENLMHITDWLPT 94
+EGG+R A + P + G+ L+ + + D P+
Sbjct: 294 YEGGIRVPACVRWPKGQIPAGKTLDQPVFLMDIFPS 329
>gi|383780433|ref|YP_005464999.1| putative sulfatase [Actinoplanes missouriensis 431]
gi|381373665|dbj|BAL90483.1| putative sulfatase [Actinoplanes missouriensis 431]
Length = 464
Score = 67.8 bits (164), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 42/107 (39%), Positives = 55/107 (51%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+++LD +G+V+ +L G N+LVVF SDNGG + NWPL G K S
Sbjct: 280 MVENLDARIGQVLNALRRSGQEENTLVVFASDNGGERF---------SYNWPLTGNKGSL 330
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R L W +R GQV + DW TL IAG + D
Sbjct: 331 SEGGIRVPTILRWPARIRSGQVSHEPVVTHDWTATLLEIAGAAPSPD 377
>gi|291241212|ref|XP_002740506.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 534
Score = 67.8 bits (164), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 42/113 (37%), Positives = 58/113 (51%), Gaps = 21/113 (18%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFIS-----------DNGGPTVDSLHFHGNTAS 49
M +DE++G V +L+E+GL N++ +F S DNG D S
Sbjct: 284 MASCMDEAIGNVTATLKEKGLWNNTVFIFSSARLNFKHDFCLDNGATPTDG-------GS 336
Query: 50 NWPLRGTKYSFHEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIA 99
NWPLRG K ++EGG+R V + S LL G L+H+TDW PTL +A
Sbjct: 337 NWPLRGRKGLWYEGGIRAVGFVNSHLLSDHVTGTTNHELIHVTDWFPTLLRLA 389
>gi|149200548|ref|ZP_01877558.1| aryl-sulphate sulphohydrolase [Lentisphaera araneosa HTCC2155]
gi|149136366|gb|EDM24809.1| aryl-sulphate sulphohydrolase [Lentisphaera araneosa HTCC2155]
Length = 461
Score = 67.8 bits (164), Expect = 8e-10, Method: Composition-based stats.
Identities = 43/109 (39%), Positives = 60/109 (55%), Gaps = 12/109 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML+SLDE++GR+++ LEE L N+L++F +DNGG + NWPLR K S+
Sbjct: 242 MLESLDENIGRLVKVLEEFKLDKNTLIIFTTDNGGVW--------KFSKNWPLRAGKGSY 293
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFI---AGKEKNI 105
EGG+R A W ++ G + + D PTL I A KEK I
Sbjct: 294 FEGGIRVPTFAYWPGKIKAGSSSDIPITALDMYPTLLDIAESAKKEKKI 342
>gi|308512479|ref|XP_003118422.1| CRE-SUL-3 protein [Caenorhabditis remanei]
gi|308239068|gb|EFO83020.1| CRE-SUL-3 protein [Caenorhabditis remanei]
Length = 500
Score = 67.8 bits (164), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 61/108 (56%), Gaps = 10/108 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML ++D ++GR+++ L+ L N+++VF SDNGG + ASN PLRG K +
Sbjct: 278 MLTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGGTS-------NFGASNAPLRGEKDTI 330
Query: 61 HEGGVRNVAALWSPL-LRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
EGG + + SP+ + +G E + H+ DW T+ I G E IDS
Sbjct: 331 WEGGTKTTTFVHSPMYIEEGGTREMMFHVVDWHATILSITGLE--IDS 376
>gi|296124181|ref|YP_003631959.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296016521|gb|ADG69760.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 470
Score = 67.4 bits (163), Expect = 9e-10, Method: Composition-based stats.
Identities = 40/103 (38%), Positives = 57/103 (55%), Gaps = 7/103 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D+++G+V+ + E G N+LV FISDNGGPT G T+ N PLRG K +
Sbjct: 251 MMSAMDDAIGQVLGKVRELGQEENTLVFFISDNGGPT------QGTTSQNGPLRGFKMTT 304
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG R W L G+ +N + D LPT+ AG +
Sbjct: 305 FEGGTRVPFLVQWKGKLPAGKTYDNPVINLDVLPTVLTAAGSK 347
>gi|418530519|ref|ZP_13096442.1| sulfatase [Comamonas testosteroni ATCC 11996]
gi|371452238|gb|EHN65267.1| sulfatase [Comamonas testosteroni ATCC 11996]
Length = 440
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 39/107 (36%), Positives = 60/107 (56%), Gaps = 11/107 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++++LD++VG+V+Q+L++ G N++V+F SDNGG + WP G K
Sbjct: 229 IVEALDKAVGQVLQALDDSGQADNTIVIFTSDNGGERF---------SKTWPFTGQKTEL 279
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R L WS ++ QV E + DWLPTL AG ++ D
Sbjct: 280 LEGGIRVPTLLRWSARIQP-QVQEQVTASFDWLPTLLAAAGARQHPD 325
>gi|294053963|ref|YP_003547621.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613296|gb|ADE53451.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 478
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 49/85 (57%), Gaps = 2/85 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLH-FHGNTASNWPLRGTKYS 59
M+ +D +G V+ +L+ GL N+L+ F+SDNGGP H GN +SN P RG K
Sbjct: 242 MVDVMDRGIGEVVTALDRNGLRENTLIFFLSDNGGPVSSKKHPGKGNGSSNGPFRGGKTD 301
Query: 60 FHEGGVR-NVAALWSPLLRKGQVLE 83
F+EGGV A W ++ GQV +
Sbjct: 302 FYEGGVHVPFIASWPAKIQPGQVYD 326
>gi|326433598|gb|EGD79168.1| hypothetical protein PTSG_09899 [Salpingoeca sp. ATCC 50818]
Length = 515
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 61/109 (55%), Gaps = 7/109 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +D+++G++ +L E + ++LVV+ISDNGG + + N +N+PLRG K++
Sbjct: 255 MSSVMDDAIGKITTALWETTMWEDTLVVYISDNGGAS----GINANNGNNYPLRGGKHTD 310
Query: 61 HEGGVRNVAAL---WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGGVR A + + P ++G + MH DW TL +AG D
Sbjct: 311 FEGGVRTAALVSGGFLPDDQRGSTRDGYMHACDWYTTLAKLAGASPADD 359
>gi|340367689|ref|XP_003382386.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 493
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 37/103 (35%), Positives = 59/103 (57%), Gaps = 11/103 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ D G +++ L++ G+ N+L+V +DNGG SN+PL+G+K +F
Sbjct: 234 MVSVADNVTGHLVKLLQDNGMWDNTLLVISADNGGAPCG--------GSNYPLKGSKGTF 285
Query: 61 HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
+EGGVR++A + LL RKG+ + +HI DW T +AG
Sbjct: 286 YEGGVRSLAFVNGGLLPTNRKGKSTQGFIHIADWYTTFCQLAG 328
>gi|285808548|gb|ADC36070.1| sulfatase [uncultured bacterium 213]
Length = 478
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 52/101 (51%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ +D +GRV+ +L+ G N++VVF SDNGG A WP G K
Sbjct: 262 MIEQMDLQIGRVLGALDANGATDNTIVVFTSDNGGERF---------ADTWPFTGRKTEL 312
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R A + W +R+G + + DWLPTL AG
Sbjct: 313 LEGGLRIPALISWPARIRQGATTDQVAMSMDWLPTLLAAAG 353
>gi|398872055|ref|ZP_10627361.1| arylsulfatase A family protein [Pseudomonas sp. GM74]
gi|398204372|gb|EJM91174.1| arylsulfatase A family protein [Pseudomonas sp. GM74]
Length = 570
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 38/103 (36%), Positives = 62/103 (60%), Gaps = 6/103 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K LD+++G V++ L++ G L N++VVF +DNG T+ + G T P +G K +
Sbjct: 305 MKQLDDNIGYVLKKLQDMGQLDNTIVVFTTDNGAETI-TFPDGGTT----PFKGGKLTTW 359
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGG+R + W +++ G VL ++ DW+PTL IAG +K
Sbjct: 360 EGGMRAPLVIRWPGVIKPGTVLTDIFASLDWVPTLVSIAGGKK 402
>gi|384218661|ref|YP_005609827.1| arylsulfatase protein [Bradyrhizobium japonicum USDA 6]
gi|354957560|dbj|BAL10239.1| arylsulfatase protein [Bradyrhizobium japonicum USDA 6]
Length = 591
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 38/103 (36%), Positives = 60/103 (58%), Gaps = 6/103 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K LD+++G V++ L++ G L N++VVF +DNG T+ + G T P +G K S
Sbjct: 325 MKQLDDNIGYVLKKLQDMGQLDNTIVVFTTDNGAETI-TFPDGGTT----PFKGGKLSTW 379
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGG+R A + W +++ G + ++ DWLPT IAG K
Sbjct: 380 EGGMRAPALVRWPGVIKPGTIKNDIFASLDWLPTFVSIAGGAK 422
>gi|154250816|ref|YP_001411640.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
gi|154154766|gb|ABS61983.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
Length = 553
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 43/103 (41%), Positives = 55/103 (53%), Gaps = 6/103 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD SVGRV+QSL+E GL N+LV+F SDNG P + G N P RG K +F
Sbjct: 332 MIVALDRSVGRVLQSLKENGLEENTLVIFSSDNGAPG-----YIGLPDVNKPYRGWKLTF 386
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R A W + G + D PT+ AG E
Sbjct: 387 FEGGIRVPFFAKWPARIPAGTERTTPVAHLDMFPTIVAAAGGE 429
>gi|449136530|ref|ZP_21771910.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
gi|448884847|gb|EMB15319.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
Length = 480
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/73 (50%), Positives = 48/73 (65%), Gaps = 7/73 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +LD VGR+I+ L+ + L +LVVF SDNGGPT + T+SN PLRG K S
Sbjct: 287 MLIALDRGVGRIIKKLDRQKLRQETLVVFFSDNGGPTAEL------TSSNAPLRGGKGSL 340
Query: 61 HEGGVRNVAALWS 73
+EGGVR + +WS
Sbjct: 341 YEGGVR-IPMIWS 352
>gi|325110839|ref|YP_004271907.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324971107|gb|ADY61885.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 494
Score = 67.0 bits (162), Expect = 1e-09, Method: Composition-based stats.
Identities = 41/106 (38%), Positives = 59/106 (55%), Gaps = 4/106 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG--PTVDSLHFHGNTASNW-PLRGTK 57
M++ D+ VGR++ S+EE GL N+L+VF SDNGG D + S+ PL+G K
Sbjct: 248 MIEHCDDMVGRIVDSVEELGLTDNTLIVFTSDNGGLISRYDYREAADDIVSDLSPLKGEK 307
Query: 58 YSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
S HEGG+R + + PL+ G V + D+ PT +AG E
Sbjct: 308 GSLHEGGIRVPLIVKYPPLVNAGSVTDEPAISYDFYPTFVDLAGGE 353
>gi|427388367|ref|ZP_18884250.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
12058]
gi|425724950|gb|EKU87824.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
12058]
Length = 457
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 60/95 (63%), Gaps = 7/95 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD ++GRV ++LEE G L N+L++F SDNGG G A+N P+RG+K
Sbjct: 257 LIEHLDYNIGRVYKALEESGQLDNTLIIFCSDNGGDA-------GAEANNGPVRGSKGDM 309
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
+EGG++ A++ K + +++L+ ++D PTL
Sbjct: 310 YEGGIKVPCAVYWRNHLKPRSIDDLVIMSDIFPTL 344
>gi|167515780|ref|XP_001742231.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778855|gb|EDQ92469.1| predicted protein [Monosiga brevicollis MX1]
Length = 241
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/107 (40%), Positives = 59/107 (55%), Gaps = 2/107 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+ VGR+ +L G L N+++VF+SDNGG VD L G K+
Sbjct: 121 MVAALDDQVGRLYDTLNATGQLDNTVIVFLSDNGG-WVDRLDGLACLLCLPLSAGWKHWV 179
Query: 61 HEGGVRNVAALWS-PLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGGVR+ A +WS L G V L H DWLPTL +AG + + +
Sbjct: 180 FEGGVRSAAFVWSRQLTNPGSVHHGLFHSVDWLPTLARLAGADTSAN 226
>gi|156362330|ref|XP_001625732.1| predicted protein [Nematostella vectensis]
gi|156212578|gb|EDO33632.1| predicted protein [Nematostella vectensis]
Length = 491
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 60/101 (59%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D+++G + ++ ++ GL N++++F +DNGG N ++PLRG K +
Sbjct: 239 MVTIMDDALGNLTRAFDKAGLWENTILIFSTDNGG-------VPKNGGYDYPLRGRKDTL 291
Query: 61 HEGGVRNVAALWSPLLRKGQV-LENLMHITDWLPTLYFIAG 100
EGGVR VA + L + V + LMH+TDW PTL +AG
Sbjct: 292 WEGGVRGVAFVHGVALEQSGVKCKALMHVTDWYPTLVSLAG 332
>gi|221640917|ref|YP_002527179.1| twin-arginine translocation pathway signal protein [Rhodobacter
sphaeroides KD131]
gi|221161698|gb|ACM02678.1| Twin-arginine translocation pathway signal [Rhodobacter sphaeroides
KD131]
Length = 509
Score = 67.0 bits (162), Expect = 1e-09, Method: Composition-based stats.
Identities = 42/108 (38%), Positives = 58/108 (53%), Gaps = 7/108 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTAS-----NWPLRG 55
M+ +D+ VG V+Q+LE G+ ++LV+F SDNGG G A N PLR
Sbjct: 276 MISCMDDQVGLVLQALERRGMREDTLVIFHSDNGGTRSKMFAGEGAVAGELPPRNDPLRE 335
Query: 56 TKYSFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
K + +EGG R VA A W + G+ +MH+ D LPTL +A E
Sbjct: 336 GKGTLYEGGTRVVALANWPGRIPAGET-HGMMHVVDMLPTLAGLAQAE 382
>gi|119713178|gb|ABL97246.1| sulfatase [uncultured marine bacterium EB0_50A10]
Length = 544
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 57/103 (55%), Gaps = 6/103 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD SVG++I+ L+E + G +L++F SDNGG L+ N P RG K SF
Sbjct: 324 MINSLDRSVGKIIEKLKELDIYGKTLIIFTSDNGGANYIELN-----DINKPYRGWKISF 378
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
+GG+R + W + G+ EN +H D PT+ AG E
Sbjct: 379 FDGGIRVPYIISWPDEINPGKKSENAVHHFDIFPTILKAAGIE 421
>gi|431899642|gb|ELK07596.1| Arylsulfatase J [Pteropus alecto]
Length = 637
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 36/78 (46%), Positives = 46/78 (58%), Gaps = 7/78 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML LDE+V V +L+ G NS++V+ SDNGG SNWPLRG K ++
Sbjct: 272 MLSCLDEAVANVTLALKAHGFYDNSVIVYSSDNGGQPAAG-------GSNWPLRGRKGTY 324
Query: 61 HEGGVRNVAALWSPLLRK 78
EGGVR V + SPLLR+
Sbjct: 325 WEGGVRAVGFVHSPLLRQ 342
>gi|430741911|ref|YP_007201040.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430013631|gb|AGA25345.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 469
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 44/111 (39%), Positives = 61/111 (54%), Gaps = 5/111 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGP---TVDSLHFHGNTASNWPLRGTK 57
ML S+DESVGRV+++L+E GL ++LV+F SDNGG + + G N PLRG K
Sbjct: 249 MLASVDESVGRVVKTLDELGLSDDTLVIFSSDNGGVGGYEREGIGKAGAVTDNAPLRGGK 308
Query: 58 YSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA-GKEKNID 106
+EGG+R W + G V + ++ D PTL +A GK D
Sbjct: 309 GMLYEGGIRAPYLFRWPGKIPAGTVCDRAINSVDLYPTLVELAEGKAPESD 359
>gi|323456816|gb|EGB12682.1| hypothetical protein AURANDRAFT_60668 [Aureococcus anophagefferens]
Length = 534
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 36/107 (33%), Positives = 62/107 (57%), Gaps = 8/107 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD +V + +L + + + ++V+ DNGGP ++ H++G LRG K+++
Sbjct: 279 MVTALDAAVANMTTALASKNM-ADFVLVYSGDNGGPEIEG-HWNGG------LRGGKWTW 330
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
EGGVR A + SPLL + +H+ DWLPT +AG ++ D+
Sbjct: 331 FEGGVRPAAFVHSPLLTRRGWHNGTLHLVDWLPTFLALAGAARDDDA 377
>gi|429206655|ref|ZP_19197919.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
AKP1]
gi|428190241|gb|EKX58789.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
AKP1]
Length = 498
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 42/108 (38%), Positives = 58/108 (53%), Gaps = 7/108 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTAS-----NWPLRG 55
M+ +D+ VG V+Q+LE G+ ++LV+F SDNGG G A N PLR
Sbjct: 265 MISCMDDQVGPVLQALERRGMREDTLVIFHSDNGGTRSKMFAGEGAVAGELPPRNDPLRE 324
Query: 56 TKYSFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
K + +EGG R VA A W + G+ +MH+ D LPTL +A E
Sbjct: 325 GKGTLYEGGTRVVALANWPGRIPAGET-HGMMHVVDMLPTLAGLAQAE 371
>gi|171915961|ref|ZP_02931431.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Verrucomicrobium
spinosum DSM 4136]
Length = 527
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 36/96 (37%), Positives = 59/96 (61%), Gaps = 5/96 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++++D++VG+V+ L+E GL N++VVF SDNGG + G+ +N P RG K
Sbjct: 292 MVEAMDQAVGKVLAKLDELGLKENTIVVFTSDNGGLSTS----EGSPTANLPWRGGKGWM 347
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W +++ G V+ + TD+ PTL
Sbjct: 348 YEGGIREALLVRWPGVVQPGSVIHEPVIGTDFYPTL 383
>gi|346992020|ref|ZP_08860092.1| arylsulfatase [Ruegeria sp. TW15]
Length = 522
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 40/111 (36%), Positives = 59/111 (53%), Gaps = 16/111 (14%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD-----SLHFHGNTASNWPLRG 55
++K LD++VG+V LEE G+ N++V+F SDNG T+ + FHG + W
Sbjct: 277 VMKELDDNVGKVTAHLEELGIADNTIVIFTSDNGPETMTWPDGGTTPFHGEKGTTW---- 332
Query: 56 TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
EGG R A + W + +GQVL + DW+PTL AG +N+
Sbjct: 333 ------EGGFRVPAIIRWPGKVAEGQVLNGIFDGMDWMPTLVAAAGGSQNL 377
>gi|440716553|ref|ZP_20897058.1| arylsulfatase B [Rhodopirellula baltica SWK14]
gi|436438412|gb|ELP31962.1| arylsulfatase B [Rhodopirellula baltica SWK14]
Length = 498
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 54/103 (52%), Gaps = 8/103 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD VGR+ + LEE G GN+L+VF SDNGG T N + N PLRG K S
Sbjct: 257 MMYALDRGVGRIREHLEESGQWGNTLLVFFSDNGGAT-------NNGSWNGPLRGVKGSM 309
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R + W G + ++ D LPT AG E
Sbjct: 310 REGGIRVPMIWTWPAKFPAGVRYDGVVSSLDLLPTFCSAAGAE 352
>gi|255014753|ref|ZP_05286879.1| arylsulfatase A [Bacteroides sp. 2_1_7]
gi|410102838|ref|ZP_11297763.1| hypothetical protein HMPREF0999_01535 [Parabacteroides sp. D25]
gi|409237965|gb|EKN30760.1| hypothetical protein HMPREF0999_01535 [Parabacteroides sp. D25]
Length = 468
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 8/99 (8%)
Query: 3 KSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHE 62
+ LD S+G ++++L+EEGL N+LV+F SDNG HF G +N PLRG K + +E
Sbjct: 246 EELDWSMGVLLETLKEEGLDQNTLVIFTSDNGADE----HFGG---TNRPLRGQKGTTYE 298
Query: 63 GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
GG R + W + GQ +NL+ D+LPTL G
Sbjct: 299 GGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYCG 337
>gi|423330334|ref|ZP_17308118.1| hypothetical protein HMPREF1075_00131 [Parabacteroides distasonis
CL03T12C09]
gi|409231950|gb|EKN24798.1| hypothetical protein HMPREF1075_00131 [Parabacteroides distasonis
CL03T12C09]
Length = 468
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 8/99 (8%)
Query: 3 KSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHE 62
+ LD S+G ++++L+EEGL N+LV+F SDNG HF G +N PLRG K + +E
Sbjct: 246 EELDWSMGVLLETLKEEGLDQNTLVIFTSDNGADE----HFGG---TNRPLRGQKGTTYE 298
Query: 63 GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
GG R + W + GQ +NL+ D+LPTL G
Sbjct: 299 GGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYCG 337
>gi|323456975|gb|EGB12841.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 536
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 37/94 (39%), Positives = 51/94 (54%), Gaps = 8/94 (8%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD +V R ++ + G +++ F+SDNG S SNWPLRG+K++ +EGG
Sbjct: 265 LDRAVERFVEFTKARGAYDDAVFFFMSDNGATLAQS-----GGGSNWPLRGSKFTAYEGG 319
Query: 65 VRNVAALWS---PLLRKGQVLENLMHITDWLPTL 95
VR A L S P R+G L H+ D LPTL
Sbjct: 320 VRVPAFLHSARVPPARRGAAHAGLFHVVDVLPTL 353
>gi|149198310|ref|ZP_01875356.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149138606|gb|EDM27013.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 578
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 34/96 (35%), Positives = 56/96 (58%), Gaps = 1/96 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLD++VG +++ LE GL+ N++++F SDNGG L SN+PL G K S
Sbjct: 260 MVQSLDDAVGALLEGLESRGLMDNTVIIFTSDNGGNIYSQLDEGIVPTSNFPLSGGKASM 319
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
EGGVR +W + + G + ++ +D+ T+
Sbjct: 320 CEGGVRVPCTVVWPGVTKAGSRSDEIVQTSDFYTTI 355
>gi|323456611|gb|EGB12478.1| hypothetical protein AURANDRAFT_5553, partial [Aureococcus
anophagefferens]
Length = 190
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/69 (49%), Positives = 45/69 (65%), Gaps = 4/69 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +D+SVGRV +L G+ +SLVVF +DNGG ++ F ++ +NWPLRG KYS
Sbjct: 123 MASFVDDSVGRVEAALRARGMWRDSLVVFATDNGG----AVGFDADSGNNWPLRGGKYSD 178
Query: 61 HEGGVRNVA 69
EGGVR V
Sbjct: 179 FEGGVRGVC 187
>gi|119505846|ref|ZP_01627912.1| sulfatase family protein [marine gamma proteobacterium HTCC2080]
gi|119458344|gb|EAW39453.1| sulfatase family protein [marine gamma proteobacterium HTCC2080]
Length = 558
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/105 (40%), Positives = 60/105 (57%), Gaps = 8/105 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++++D SVGR++ +LEEEG+ N++VVF SDNGG + G N P RG K +
Sbjct: 332 MIRAVDRSVGRILDTLEEEGIANNTVVVFTSDNGG-----AGYIGIPEVNSPFRGFKITM 386
Query: 61 HEGGVR-NVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKEK 103
EGG+R + W + G V E + HI D +PTL AG +
Sbjct: 387 FEGGLRVPLFVRWPAKIAPGISVNEPVAHI-DVMPTLAAAAGASE 430
>gi|423214668|ref|ZP_17201196.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692574|gb|EIY85811.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
CL03T12C04]
Length = 453
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 57/96 (59%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD ++G+VIQSLEE G L N+LV+F SDNGG G+ A+N P RG K
Sbjct: 251 LIEHLDYNIGKVIQSLEESGQLNNTLVIFASDNGGD-------RGSMANNGPTRGAKGDM 303
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTL 95
EGG+ AL P + +G +N + + D +PT+
Sbjct: 304 FEGGIHVACALNMPGVFEGGRRDNHFVVMMDLMPTI 339
>gi|291237236|ref|XP_002738543.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 514
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 40/107 (37%), Positives = 62/107 (57%), Gaps = 8/107 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D+ VG + Q+L+++G+ ++L VF+SDNGG + F G SN+P RG K +
Sbjct: 260 MMAAMDQGVGNITQALKDKGMWDDTLFVFLSDNGGDKM----FAG---SNYPYRGNKATM 312
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGGV+ A + +L+ G V L H TD TL +AG + D
Sbjct: 313 WEGGVKVPAFIRGSMLKTSGTVNNELYHFTDIFATLLKVAGGTPDED 359
>gi|431796835|ref|YP_007223739.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
gi|430787600|gb|AGA77729.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
Length = 470
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 58/103 (56%), Gaps = 7/103 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD +VGR+++++EE LL N++ VFISDNGGP + N + N P G K +
Sbjct: 255 MVHRLDINVGRIVEAVEEADLLENTIFVFISDNGGPVMT------NGSINAPYNGKKGTL 308
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R + W L G+V E+ + D PT +AG E
Sbjct: 309 LEGGIRVPFFISWPGHLEPGKVFEHPVSSLDLTPTFVALAGGE 351
>gi|255690671|ref|ZP_05414346.1| twin-arginine translocation pathway signal [Bacteroides finegoldii
DSM 17565]
gi|260623695|gb|EEX46566.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
Length = 453
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 57/96 (59%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD ++G+VIQSLEE G L N+LV+F SDNGG G+ A+N P RG K
Sbjct: 251 LIEHLDYNIGKVIQSLEESGQLNNTLVIFASDNGGD-------RGSMANNGPTRGAKGDM 303
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTL 95
EGG+ AL P + +G +N + + D +PT+
Sbjct: 304 FEGGIHVACALNMPGVFEGGRRDNHFVVMMDLMPTI 339
>gi|115533416|ref|NP_001041231.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
gi|351060347|emb|CCD68015.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
Length = 488
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 63/108 (58%), Gaps = 10/108 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML ++D ++GR+++ L+ L N+++VF SDNGG + +F ASN PLRG K +
Sbjct: 266 MLTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGG----TANF---GASNAPLRGEKDTI 318
Query: 61 HEGGVRNVAALWSPL-LRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
EGG + + SP+ + +G + + H+ DW T+ I G E IDS
Sbjct: 319 WEGGTKTTTFVHSPMYIEEGGTRDMMFHVVDWHATILSITGLE--IDS 364
>gi|343086301|ref|YP_004775596.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354835|gb|AEL27365.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 481
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 4/102 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D SVG+++ +L+EEG+ N+LVVF SDNG V HG TA LRG K
Sbjct: 257 VIEEIDWSVGQILSTLKEEGIAENTLVVFTSDNGPWHV--FKTHGGTAG--LLRGAKGGT 312
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R W P K V+ ++ D LPT I+G E
Sbjct: 313 FEGGMREPTVFWWPAQIKPGVVMDMGTTMDLLPTFCAISGTE 354
>gi|374619517|ref|ZP_09692051.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374302744|gb|EHQ56928.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 556
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 55/96 (57%), Gaps = 6/96 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLD SVGR+ ++LEE G+ N+LVVF +DNGGP + G N P RG K S
Sbjct: 333 MIRSLDRSVGRINKALEELGIADNTLVVFTNDNGGPG-----YIGLPDINKPFRGWKISQ 387
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
EGG+R + W + G V E + D +PTL
Sbjct: 388 FEGGIRVPLMMKWPARISPGTVSEEPVAHIDVMPTL 423
>gi|27379852|ref|NP_771381.1| arylsulfatase [Bradyrhizobium japonicum USDA 110]
gi|27353005|dbj|BAC50006.1| bll4741 [Bradyrhizobium japonicum USDA 110]
Length = 556
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 36/103 (34%), Positives = 58/103 (56%), Gaps = 6/103 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K +D+++G V++ L++ G L N++VVF +DNG T F P +G+K +
Sbjct: 290 MKQMDDNIGYVLKKLQDMGQLDNTIVVFTTDNGAET-----FTYPDGGTTPFKGSKMNTW 344
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGG+R + W +++ G V ++ DWLPTL IAG K
Sbjct: 345 EGGMRAPCVIRWPGVIKPGTVKNDIFASLDWLPTLVNIAGGAK 387
>gi|326331907|ref|ZP_08198194.1| twin-arginine translocation pathway signal [Nocardioidaceae
bacterium Broad-1]
gi|325950404|gb|EGD42457.1| twin-arginine translocation pathway signal [Nocardioidaceae
bacterium Broad-1]
Length = 463
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ LD+S+G+VI++LE G L +++V F SDNGG ++ WP G K
Sbjct: 278 MVEDLDQSIGQVIKALERTGQLRDTVVFFASDNGGERFSNV---------WPFTGAKSGL 328
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
+EGG+R L W +LR QV + + TDW T +AG D
Sbjct: 329 NEGGIRVPTILSWPGILRGRQVSDEPVVTTDWTATFLELAGATPAED 375
>gi|115533418|ref|NP_001041232.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
gi|351060348|emb|CCD68016.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
Length = 452
Score = 66.2 bits (160), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 63/108 (58%), Gaps = 10/108 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML ++D ++GR+++ L+ L N+++VF SDNGG + +F ASN PLRG K +
Sbjct: 230 MLTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGG----TANF---GASNAPLRGEKDTI 282
Query: 61 HEGGVRNVAALWSPL-LRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
EGG + + SP+ + +G + + H+ DW T+ I G E IDS
Sbjct: 283 WEGGTKTTTFVHSPMYIEEGGTRDMMFHVVDWHATILSITGLE--IDS 328
>gi|326431402|gb|EGD76972.1| hypothetical protein PTSG_07315 [Salpingoeca sp. ATCC 50818]
Length = 439
Score = 66.2 bits (160), Expect = 3e-09, Method: Composition-based stats.
Identities = 39/117 (33%), Positives = 64/117 (54%), Gaps = 17/117 (14%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-------------PTVDSLHFHGNT 47
M+ LD+++ + L+ +G+ N+L+V SDNGG P+ + ++G
Sbjct: 168 MVNFLDDNLANITALLKAKGMWENTLMVLTSDNGGYVNPLHGDCDLSDPSRGYMCYNGEA 227
Query: 48 -ASNWPLRGTKYSFHEGGVRN---VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
A+N+PLRG KYSF EGG+R V+ + P +G + + ++HI DW T +AG
Sbjct: 228 GANNYPLRGGKYSFFEGGIRANAFVSGGFVPKHLRGTINDGIVHIADWYATFAALAG 284
>gi|399037695|ref|ZP_10734370.1| arylsulfatase A family protein [Rhizobium sp. CF122]
gi|398064717|gb|EJL56393.1| arylsulfatase A family protein [Rhizobium sp. CF122]
Length = 568
Score = 66.2 bits (160), Expect = 3e-09, Method: Composition-based stats.
Identities = 39/103 (37%), Positives = 57/103 (55%), Gaps = 6/103 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K +D+++G V++ LEE G L N++VVF +DNG T+ + G T P +G K S
Sbjct: 305 MKQMDDNIGYVLKRLEEMGELDNTIVVFTTDNGAETI-TFPDGGTT----PFKGGKLSTW 359
Query: 62 EGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGG+R + W + G V ++ DWLPT IAG K
Sbjct: 360 EGGMRAPLVVRWPGHIEPGTVKNDMFSALDWLPTFVNIAGGPK 402
>gi|340367649|ref|XP_003382366.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 495
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 39/103 (37%), Positives = 59/103 (57%), Gaps = 11/103 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ D G +++ L+++GL N+++V +DNGG H G SN+PLRG K +
Sbjct: 236 MVSVADNVTGHLVELLKKKGLWDNTIMVISADNGGA-----HCSG---SNYPLRGCKETL 287
Query: 61 HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR++A + LL R+GQ + +HI DW T +AG
Sbjct: 288 FEGGVRSLAFVNGGLLPESRRGQSTDGFIHIADWYTTFCKLAG 330
>gi|224537344|ref|ZP_03677883.1| hypothetical protein BACCELL_02222 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521037|gb|EEF90142.1| hypothetical protein BACCELL_02222 [Bacteroides cellulosilyticus
DSM 14838]
Length = 500
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 35/101 (34%), Positives = 59/101 (58%), Gaps = 5/101 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ S+DE++GR+I L+ GL N++++F SDNGG + G+ +N PLRG K
Sbjct: 291 MIASMDENIGRIIDELKRLGLYENTIIIFTSDNGGLSTS----EGSPTANLPLRGGKGFT 346
Query: 61 HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
+EGG+R + P + G ++++ TD+ PT+ + G
Sbjct: 347 YEGGIREPLIVRIPHMENAGNRCDSMVISTDYYPTILEMTG 387
>gi|325108489|ref|YP_004269557.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324968757|gb|ADY59535.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 455
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 55/95 (57%), Gaps = 3/95 (3%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+ +LDE +G V+ +LEE G N++VVF SD+G T + F G +A P RG K
Sbjct: 249 VSTLDEKLGEVLATLEETGQRENTIVVFQSDHGHSTEERAFFGGGSAG--PYRGAKGCLF 306
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
EGG+R + + W L +G+V + + DWLPTL
Sbjct: 307 EGGIRVPSVISWPAGLPQGEVRDQMAVGCDWLPTL 341
>gi|256841081|ref|ZP_05546588.1| arylsulfatase A [Parabacteroides sp. D13]
gi|256736924|gb|EEU50251.1| arylsulfatase A [Parabacteroides sp. D13]
Length = 468
Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 8/99 (8%)
Query: 3 KSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHE 62
+ LD S+G ++++L+EEGL N+LV+F SDNG HF G +N PLRG K + +E
Sbjct: 246 EELDWSMGVLLETLKEEGLDQNTLVIFTSDNGA----DEHFGG---TNRPLRGQKGTTYE 298
Query: 63 GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
GG R + W + GQ +NL+ D+LPTL G
Sbjct: 299 GGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYCG 337
>gi|256421633|ref|YP_003122286.1| sulfatase [Chitinophaga pinensis DSM 2588]
gi|256036541|gb|ACU60085.1| sulfatase [Chitinophaga pinensis DSM 2588]
Length = 452
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 53/95 (55%), Gaps = 3/95 (3%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+ ++DE +G+V+Q L+ GL N++VVF SD G T D G P RG K+S
Sbjct: 247 VSTMDEKIGQVLQQLDRLGLAENTIVVFQSDQGHSTED--RSFGGGGFTGPYRGAKFSLF 304
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
EGG+R A + W+ L K +V + L DW PTL
Sbjct: 305 EGGIRVPAIIRWTGHLPKNEVRDQLCVNIDWYPTL 339
>gi|398866290|ref|ZP_10621789.1| arylsulfatase A family protein [Pseudomonas sp. GM78]
gi|398241341|gb|EJN26995.1| arylsulfatase A family protein [Pseudomonas sp. GM78]
Length = 570
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 38/103 (36%), Positives = 61/103 (59%), Gaps = 6/103 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K LD+++G V++ L++ G L N++VVF +DNG T+ + G T P +G K +
Sbjct: 305 MKQLDDNIGYVLKKLQDMGQLDNTIVVFTTDNGAETI-TFPDGGTT----PFKGGKLTTW 359
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGG+R + W ++ G VL ++ DW+PTL IAG +K
Sbjct: 360 EGGMRAPLVIRWPGKIKPGTVLTDIFASLDWVPTLVNIAGGKK 402
>gi|301311963|ref|ZP_07217885.1| sulfatase family protein [Bacteroides sp. 20_3]
gi|423339481|ref|ZP_17317222.1| hypothetical protein HMPREF1059_03147 [Parabacteroides distasonis
CL09T03C24]
gi|300830065|gb|EFK60713.1| sulfatase family protein [Bacteroides sp. 20_3]
gi|409230862|gb|EKN23723.1| hypothetical protein HMPREF1059_03147 [Parabacteroides distasonis
CL09T03C24]
Length = 468
Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 8/99 (8%)
Query: 3 KSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHE 62
+ LD S+G ++++L+EEGL N+LV+F SDNG HF G +N PLRG K + +E
Sbjct: 246 EELDWSMGVLLETLKEEGLDQNTLVIFTSDNGA----DEHFGG---TNRPLRGQKGTTYE 298
Query: 63 GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
GG R + W + GQ +NL+ D+LPTL G
Sbjct: 299 GGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYCG 337
>gi|262383797|ref|ZP_06076933.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
gi|262294695|gb|EEY82627.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
Length = 468
Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 8/99 (8%)
Query: 3 KSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHE 62
+ LD S+G ++++L+EEGL N+LV+F SDNG HF G +N PLRG K + +E
Sbjct: 246 EELDWSMGVLLETLKEEGLDQNTLVIFTSDNGA----DEHFGG---TNRPLRGQKGTTYE 298
Query: 63 GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
GG R + W + GQ +NL+ D+LPTL G
Sbjct: 299 GGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYCG 337
>gi|170727985|ref|YP_001762011.1| sulfatase [Shewanella woodyi ATCC 51908]
gi|169813332|gb|ACA87916.1| sulfatase [Shewanella woodyi ATCC 51908]
Length = 489
Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 59/104 (56%), Gaps = 15/104 (14%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKY-- 58
M++SLD SVGR++Q+LE+ G N++V+F SD GG +F +N PLRG K
Sbjct: 292 MVESLDTSVGRILQALEDSGQADNTVVIFYSDQGG------YF-----TNAPLRGGKIGG 340
Query: 59 -SFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+ +EGG R + W + Q+ E L+ TD LPT IAG
Sbjct: 341 RALYEGGARVPLIVRWPGVTPNSQISEQLVLSTDILPTFVDIAG 384
>gi|405379584|ref|ZP_11033433.1| arylsulfatase A family protein [Rhizobium sp. CF142]
gi|397323967|gb|EJJ28356.1| arylsulfatase A family protein [Rhizobium sp. CF142]
Length = 502
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 42/110 (38%), Positives = 61/110 (55%), Gaps = 11/110 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-------ASNWPL 53
M+ +D+ +GRV+++LE G+ N+L+VF SDNGG VD+ F G + A N P
Sbjct: 273 MISVMDDGIGRVVEALEMRGMRENTLIVFHSDNGG-VVDAF-FAGESNVKGKLPADNGPY 330
Query: 54 RGTKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
R K + +EGG R V + W + G +MH+ D PTL +AG E
Sbjct: 331 REGKGTTYEGGTRVVGLVNWPGKIEPGTA-NGMMHVVDMYPTLAALAGAE 379
>gi|32470862|ref|NP_863855.1| arylsulfatase B [Rhodopirellula baltica SH 1]
gi|32443007|emb|CAD71528.1| arylsulfatase B [Rhodopirellula baltica SH 1]
Length = 520
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 8/103 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD VGR+ + LEE G N+L+VF SDNGG T N + N PLRG K S
Sbjct: 279 MMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGAT-------NNGSWNGPLRGVKGSM 331
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R + W G + + ++ D LPT AG E
Sbjct: 332 REGGIRVPMIWTWPAKFPAGVLYDGVVSSLDLLPTFCSAAGAE 374
>gi|291224485|ref|XP_002732234.1| PREDICTED: jumonji domain containing 2c [Saccoglossus kowalevskii]
Length = 1941
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 8/106 (7%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
L +D++VG VI +L+ G+ +L++FISDNG F +N PLRG +
Sbjct: 266 LSLMDDAVGHVIDALKSRGMWEETLLIFISDNGA-------FDFMEGTNLPLRGAGTTLF 318
Query: 62 EGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R A +++K G + L HITDW T+ +AG + D
Sbjct: 319 EGGMRVPAIAHGKMIKKTGYINNELNHITDWHSTILSLAGIQPEPD 364
>gi|264677229|ref|YP_003277135.1| sulfatase [Comamonas testosteroni CNB-2]
gi|262207741|gb|ACY31839.1| sulfatase [Comamonas testosteroni CNB-2]
Length = 440
Score = 65.5 bits (158), Expect = 3e-09, Method: Composition-based stats.
Identities = 39/107 (36%), Positives = 59/107 (55%), Gaps = 11/107 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++++LD++VG+V+Q+L++ G N++V+F SDNGG + WP G K
Sbjct: 229 IVEALDKAVGQVLQALDDTGQADNTIVIFTSDNGGERF---------SKTWPFTGQKTEL 279
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R L WS ++ QV E + DWLPTL AG + D
Sbjct: 280 LEGGIRVPTLLRWSARIQP-QVQEQVTVSFDWLPTLLAAAGARPHPD 325
>gi|340373299|ref|XP_003385179.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 508
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 37/102 (36%), Positives = 60/102 (58%), Gaps = 8/102 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+ VG ++ +L+++GL N L V SDNGGP + G A+N+PL+G K S
Sbjct: 260 MVNYLDDVVGELVDALKKKGLWENLLFVTSSDNGGPV-----YPGGGANNYPLKGGKASD 314
Query: 61 HEGGVRN---VAALWSPLLRKGQVLENLMHITDWLPTLYFIA 99
+GG+R+ V+ + P +G+ L+ +H+ DW T +A
Sbjct: 315 WQGGIRSNAFVSGGYLPEKMRGKKLDGYIHLADWYATFCALA 356
>gi|449138178|ref|ZP_21773473.1| arylsulfatase B [Rhodopirellula europaea 6C]
gi|448883202|gb|EMB13740.1| arylsulfatase B [Rhodopirellula europaea 6C]
Length = 489
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 56/103 (54%), Gaps = 8/103 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M ++D +GR++ +++ G+ N+LVVF SDNG D H SN PLRG K++
Sbjct: 286 MTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNG---ADPKH----GGSNEPLRGNKFTT 338
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGGVR VA + W G E++ D LPT+ AG +
Sbjct: 339 WEGGVRVVATMRWPREFPAGVTYESVASYVDLLPTMVAAAGGQ 381
>gi|340367643|ref|XP_003382363.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 493
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 35/103 (33%), Positives = 59/103 (57%), Gaps = 11/103 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ D G +++ L+++G+ N+++V +DNGG + SN+PL+G K +F
Sbjct: 235 MVSVADNVTGHLVELLKKKGMWDNTIMVISADNGGAPC--------SGSNYPLKGCKSTF 286
Query: 61 HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
EGG+R++A + LL R+GQ + +HI DW T +AG
Sbjct: 287 FEGGIRSLAFVNGGLLPESRRGQSTDGFIHIADWYTTFCKLAG 329
>gi|149197416|ref|ZP_01874467.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139434|gb|EDM27836.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 455
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 63/104 (60%), Gaps = 7/104 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT----ASNWPLRGT 56
ML S+D+ VGR+IQ+L++EG+ N+LV+F+SDNG P ++LH ASN LRG
Sbjct: 245 MLISMDDGVGRLIQTLKDEGIYENTLVIFMSDNGAP--NNLHEAERAGDYLASNGSLRGR 302
Query: 57 KYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA 99
K +EGG+R + W ++ K ++ + D +PTL I+
Sbjct: 303 KGDTYEGGIRVPYIMSWPQVIPKQSTYQHPVSGLDIVPTLIHIS 346
>gi|198434445|ref|XP_002131042.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 512
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 63/114 (55%), Gaps = 17/114 (14%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LDE+V V L++ G+ ++L++F +DNGG HG +NWP RGTK +
Sbjct: 255 MVSVLDETVRNVTNYLKDAGMWEDTLLIFSTDNGGEVK-----HG--GNNWPYRGTKNTL 307
Query: 61 HEGGVRNVAALWSPLL----RKGQVLENLMHITDWLPTLY------FIAGKEKN 104
+EGGV+ V + +L +V L+H+TDW PT+ +++G +N
Sbjct: 308 YEGGVKAVGFVHGKILGDLSPAQRVNRELIHVTDWYPTIMAAADCPYVSGNSRN 361
>gi|296121469|ref|YP_003629247.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296013809|gb|ADG67048.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 487
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 39/107 (36%), Positives = 62/107 (57%), Gaps = 2/107 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE++G+V+ LE G +LV+FISDNGGPT+ + +G+ N PLRG+K +
Sbjct: 242 MMLAMDEAIGKVLTQLETTGQKQKTLVMFISDNGGPTMPGVTINGSI--NTPLRGSKRTT 299
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
EGG+R + P V ++ + D T +AG EK++ S
Sbjct: 300 LEGGIRVPFVVSWPGKIAPAVFDSPVIQLDLTATALAVAGVEKDVKS 346
>gi|431799800|ref|YP_007226704.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
gi|430790565|gb|AGA80694.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
Length = 497
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 54/92 (58%), Gaps = 9/92 (9%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
+D VGR++++LE G N++V+F SDNGG H A N P RG+K++ +EGG
Sbjct: 280 MDHYVGRILKALERTGKSKNTIVIFTSDNGG--------HPEIADNGPFRGSKWNLYEGG 331
Query: 65 VRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+R + W +++G V + ++ D++PT
Sbjct: 332 IRVPMIMSWPAHIQEGSVCDQIVSQLDFMPTF 363
>gi|340620621|ref|YP_004739074.1| sulfatase [Zobellia galactanivorans]
gi|339735418|emb|CAZ98795.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
Length = 462
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 39/105 (37%), Positives = 57/105 (54%), Gaps = 7/105 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M ++D + G+V+ ++ GL N+L+VF +DNGGP+ N +SN PL GTK +
Sbjct: 251 MTIAMDRACGKVLDRIKALGLDKNTLIVFTNDNGGPS------DANESSNLPLSGTKANH 304
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R + W L+ V EN + D LPT AG + N
Sbjct: 305 LEGGIRVPFLMSWPGKLKPNTVYENPISTLDLLPTFVTAAGGDVN 349
>gi|198417507|ref|XP_002121051.1| PREDICTED: similar to arylsulfatase B [Ciona intestinalis]
Length = 518
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 41/111 (36%), Positives = 58/111 (52%), Gaps = 11/111 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LDE VG+++ L+E G+ N++ VF SDNGG S G + +N+PLRG K S
Sbjct: 249 MISALDEQVGQLVDKLKEVGMWSNTVFVFYSDNGGTQPQS----GQSGNNFPLRGKKGSL 304
Query: 61 HEGGVRNVAALWSPLLRKGQV------LENLMHITDWLPTLYFIAGKEKNI 105
EGG R V A R G + L HI+D T +AG + +
Sbjct: 305 FEGGYR-VPAFLIARTRAGNLELIASTSSTLFHISDMFATFIALAGGDAAV 354
>gi|298375847|ref|ZP_06985803.1| sulfatase family protein [Bacteroides sp. 3_1_19]
gi|298266884|gb|EFI08541.1| sulfatase family protein [Bacteroides sp. 3_1_19]
Length = 468
Score = 65.5 bits (158), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 40/99 (40%), Positives = 58/99 (58%), Gaps = 8/99 (8%)
Query: 3 KSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHE 62
+ LD S+G ++++L+EEGL N+L++F SDNG HF G +N PLRG K + +E
Sbjct: 246 EELDWSMGVLLETLKEEGLDQNTLIIFTSDNGA----DEHFGG---TNRPLRGQKGTTYE 298
Query: 63 GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
GG R + W + GQ +NL+ D+LPTL G
Sbjct: 299 GGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYCG 337
>gi|198420473|ref|XP_002123848.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 517
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 58/99 (58%), Gaps = 11/99 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+V + + L+ GL N+++VF +DNGG T+ +NWPLRG K +
Sbjct: 246 MVAAMDEAVKNITEHLQAAGLWDNTILVFSADNGGQTLSG-------GNNWPLRGRKLTL 298
Query: 61 HEGGVRNVAALWSPLL---RKGQVLEN-LMHITDWLPTL 95
EGG++ V + +L ++ N ++HI+DW PT+
Sbjct: 299 WEGGIKGVGFVHGKILNVPNPNYIVNNEMIHISDWFPTI 337
>gi|421614608|ref|ZP_16055661.1| arylsulfatase B [Rhodopirellula baltica SH28]
gi|408494617|gb|EKJ99222.1| arylsulfatase B [Rhodopirellula baltica SH28]
Length = 472
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 56/101 (55%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M ++D +GR++ +++ G+ N+LVVF SDNG D H SN PLRG K++
Sbjct: 269 MTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNG---ADPKH----GGSNEPLRGNKFTT 321
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR VA + W L G +++ D LPT+ AG
Sbjct: 322 WEGGVRVVAMMRWPRELPAGIAYDSVTSYVDLLPTMVAAAG 362
>gi|340367651|ref|XP_003382367.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 494
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 38/103 (36%), Positives = 59/103 (57%), Gaps = 11/103 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ D G +++ L+++G+ N+++V +DNGG H G SN+PLRG K +
Sbjct: 235 MISVADNVTGHLVELLKKKGIWDNTIMVISADNGGA-----HCSG---SNYPLRGCKETL 286
Query: 61 HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR++A + LL R+GQ + +HI DW T +AG
Sbjct: 287 FEGGVRSLAFVNGGLLPESRRGQSTDGFIHIADWYTTFCKLAG 329
>gi|221113984|ref|XP_002154333.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
Length = 479
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 33/101 (32%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD +VG +++ L + G+L +++++F + GG ++ + S +P RG +
Sbjct: 207 MIMQLDRAVGHLVRILYDTGILHDAILIFTTLTGGTKGNNFY---TWPSTYPFRGGNGTL 263
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGG R + ++S + R+G+V L+HI DWLPT++ +AG
Sbjct: 264 WEGGSRALGFVYSNRIARRGRVSYGLVHINDWLPTIFRLAG 304
>gi|32471071|ref|NP_864064.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH 1]
gi|32396773|emb|CAD71738.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH 1]
Length = 490
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 44/115 (38%), Positives = 62/115 (53%), Gaps = 11/115 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDS--------LHFHGNTASNWP 52
+++ +D +VGR++ SL E L N+ V+F SDNG V + L HG +A P
Sbjct: 227 VIEEIDFNVGRILDSLNELNLADNTYVLFTSDNGPWLVKNKGHADGHRLGDHGGSAG--P 284
Query: 53 LRGTKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
LR K S EGGVR A LW+P + G V +++ D +PTL +AG E D
Sbjct: 285 LRSGKVSTFEGGVRVPAILWAPGKVPAGTVCDSIATTMDVMPTLAALAGAEIPTD 339
>gi|375088716|ref|ZP_09735054.1| hypothetical protein HMPREF9703_01136 [Dolosigranulum pigrum ATCC
51524]
gi|374561681|gb|EHR33020.1| hypothetical protein HMPREF9703_01136 [Dolosigranulum pigrum ATCC
51524]
Length = 483
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 36/102 (35%), Positives = 54/102 (52%), Gaps = 8/102 (7%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
L +D +GR++ +E G N+L+++++DNGG N +N PL GTKYS +
Sbjct: 268 LDIMDREIGRILDKIEAAGEQENTLIIYLTDNGGSPC-------NYGNNTPLFGTKYSLY 320
Query: 62 EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGGVR A W ++ NL+ D LPT ++A E
Sbjct: 321 EGGVRVPFIASWPGVVEANSQSRNLVSSLDLLPTFAYLASGE 362
>gi|311748319|ref|ZP_07722104.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
gi|126576822|gb|EAZ81070.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
Length = 472
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 59/101 (58%), Gaps = 7/101 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD +VG+++ SLEE+GL N+LVVF SDNGGPT DS N + N P RG K
Sbjct: 255 MVHRLDLNVGKIMTSLEEQGLSENTLVVFFSDNGGPT-DS-----NASLNAPYRGQKGIL 308
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+ + P LL +G + + + D +PT +AG
Sbjct: 309 LEGGIHVPFVMNLPGLLPEGLIYQEQVTSLDVVPTFLALAG 349
>gi|343085191|ref|YP_004774486.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342353725|gb|AEL26255.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 478
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 59/103 (57%), Gaps = 3/103 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+VGR++Q L+ +GL N+ V+ SDNGG + +L+ G SN PLR K
Sbjct: 275 MVAAMDENVGRILQELKAQGLDDNTWVILTSDNGG--LSTLYGEGAPTSNGPLRAGKGWC 332
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHIT-DWLPTLYFIAGKE 102
+EGG+R + P ++ L I+ D PTL IAG E
Sbjct: 333 YEGGIRVPMLIKGPGIKNPGTAPELPVISMDIFPTLLSIAGAE 375
>gi|340367653|ref|XP_003382368.1| PREDICTED: arylsulfatase J-like [Amphimedon queenslandica]
Length = 504
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 36/103 (34%), Positives = 57/103 (55%), Gaps = 11/103 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ D G ++Q L+++G+ N++++ +DNGG SN+PL+G K SF
Sbjct: 248 MVSVADNVTGYLVQLLKKKGMWDNTIIIISADNGGAACG--------GSNYPLKGCKGSF 299
Query: 61 HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
EGGVR +A + LL R+G+ + +HI DW T +AG
Sbjct: 300 FEGGVRALAFVNGGLLPESRRGESTDGFIHIADWYTTFCKLAG 342
>gi|294053904|ref|YP_003547562.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613237|gb|ADE53392.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 566
Score = 65.1 bits (157), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 62/103 (60%), Gaps = 7/103 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ S+D++VGR++QSL E L N+LV+F SDNG +V +L N N P RG+K
Sbjct: 347 MIDSVDQNVGRLMQSLVETEELDNTLVLFFSDNG--SVSTL----NEYMNAPYRGSKALV 400
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG++ + A W ++ + + + I DW+PT+ +AG E
Sbjct: 401 WEGGLKTHCIAYWKGKIQPASISHDQVWIGDWMPTVLELAGVE 443
>gi|440713713|ref|ZP_20894310.1| arylsulfatase B [Rhodopirellula baltica SWK14]
gi|436441429|gb|ELP34656.1| arylsulfatase B [Rhodopirellula baltica SWK14]
Length = 472
Score = 65.1 bits (157), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 56/101 (55%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M ++D +GR++ +++ G+ N+LVVF SDNG D H SN PLRG K++
Sbjct: 269 MTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNG---ADPKH----GGSNEPLRGNKFTT 321
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR VA + W L G +++ D LPT+ AG
Sbjct: 322 WEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPTMVAAAG 362
>gi|340618199|ref|YP_004736652.1| sulfatase [Zobellia galactanivorans]
gi|339732996|emb|CAZ96371.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
Length = 506
Score = 65.1 bits (157), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 58/99 (58%), Gaps = 8/99 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++S+DE++GR+I L++ L N+L++ SDNG +HG + N LRG+K
Sbjct: 306 MVQSMDENIGRIITYLKKNNLEENTLIIITSDNG--------YHGIASENLKLRGSKGEI 357
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIA 99
+EGG++ A + P K + E + +TD+ PTL +A
Sbjct: 358 YEGGIKVPALIHWPGKVKARRTEIPISVTDYFPTLMDVA 396
>gi|190891646|ref|YP_001978188.1| sulfatase [Rhizobium etli CIAT 652]
gi|190696925|gb|ACE91010.1| putative sulfatase protein [Rhizobium etli CIAT 652]
Length = 498
Score = 64.7 bits (156), Expect = 6e-09, Method: Composition-based stats.
Identities = 40/108 (37%), Positives = 61/108 (56%), Gaps = 11/108 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-------ASNWPL 53
M+ +D+ +GRV+++LE G+ N+L++F SDNGG VD+ F G + A N P
Sbjct: 269 MISVMDDGIGRVVEALETRGMRENTLILFHSDNGG-VVDAF-FAGESDVKGKLPADNGPY 326
Query: 54 RGTKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
R K + +EGG R V + W + G + +MH+ D PTL +AG
Sbjct: 327 RQGKGTTYEGGTRVVGLINWPGKIEPGSA-DGMMHVVDMYPTLAALAG 373
>gi|340368073|ref|XP_003382577.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 507
Score = 64.7 bits (156), Expect = 6e-09, Method: Composition-based stats.
Identities = 37/103 (35%), Positives = 59/103 (57%), Gaps = 8/103 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+ VG ++ +L+++G+ N L V SDNGGP + G A+N+PL+G K +
Sbjct: 258 MVNYLDDVVGDLVAALKQKGMWDNLLYVTSSDNGGPI-----YAGGGANNYPLKGGKLTD 312
Query: 61 HEGGVRN---VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+GG+R V+ + P +GQ + +H+ DW T IAG
Sbjct: 313 WQGGIRVNAFVSGGYLPEKMRGQKTDGYVHLADWYGTFCAIAG 355
>gi|218679088|ref|ZP_03526985.1| putative sulfatase protein [Rhizobium etli CIAT 894]
Length = 295
Score = 64.7 bits (156), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/109 (39%), Positives = 62/109 (56%), Gaps = 13/109 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-------ASNWPL 53
M+ +D+ +GRV+++LE G+ N+L+VF SDNGG VD+ F G + A N P
Sbjct: 66 MISVMDDGIGRVVEALETRGMRENTLIVFHSDNGG-VVDAF-FAGESDVKGKLPADNGPY 123
Query: 54 RGTKYSFHEGGVRNVAAL--WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
R K + +EGG R VA L W + G + +MH+ D PTL +AG
Sbjct: 124 RQGKGTTYEGGTR-VAGLINWPGKIVPGSA-DGMMHVVDMYPTLAALAG 170
>gi|291221493|ref|XP_002730757.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 585
Score = 64.7 bits (156), Expect = 6e-09, Method: Composition-based stats.
Identities = 37/107 (34%), Positives = 61/107 (57%), Gaps = 8/107 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++DE+VG + +L+++G+ ++L +F+SDNGG + + GN N+P RG K S
Sbjct: 331 MISAMDEAVGNITNALKDKGMWNDTLFIFLSDNGG----DMFYAGN---NYPFRGNKASL 383
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EG V+ ++ +L +KG L+H TD TL AG + D
Sbjct: 384 WEGSVKIPGFIYGNMLEKKGYSNNELIHFTDLFATLLSAAGGTPDDD 430
>gi|423293952|ref|ZP_17272079.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
CL03T12C18]
gi|392677173|gb|EIY70592.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
CL03T12C18]
Length = 453
Score = 64.7 bits (156), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 58/96 (60%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD+++G+V+Q+LEE G L N+LV+F SDNGG G+ A+N P RG K
Sbjct: 251 LIEHLDDNIGKVMQALEESGQLSNTLVIFASDNGGD-------RGSMANNGPTRGAKGDM 303
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTL 95
EGG+ AL P + +G +N + + D +PT+
Sbjct: 304 FEGGIHVPCALNMPGVFEGGRRDNHFVVMMDLMPTI 339
>gi|283781860|ref|YP_003372615.1| sulfatase [Pirellula staleyi DSM 6068]
gi|283440313|gb|ADB18755.1| sulfatase [Pirellula staleyi DSM 6068]
Length = 513
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 56/96 (58%), Gaps = 5/96 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D++VG+V+ L+E GL N+LV+F SDNGG + G SN PLRG K
Sbjct: 283 MVDAMDQAVGKVLAKLDELGLRENTLVIFTSDNGGLSTS----EGWPTSNLPLRGGKGWM 338
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W ++ G ++ + D++ TL
Sbjct: 339 YEGGIREPLVMRWPAKVKAGSTIDTPVSSPDFMATL 374
>gi|392410845|ref|YP_006447452.1| arylsulfatase A family protein [Desulfomonile tiedjei DSM 6799]
gi|390623981|gb|AFM25188.1| arylsulfatase A family protein [Desulfomonile tiedjei DSM 6799]
Length = 571
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 38/103 (36%), Positives = 58/103 (56%), Gaps = 6/103 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K +D+++G V++ LE+ G L N++VVF +DNG T S G T P +G+K +
Sbjct: 305 MKQMDDNIGLVLKKLEDMGQLDNTIVVFTTDNGAETW-SFPDGGTT----PFKGSKMNTW 359
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGG+R + W ++ G V ++ DWLPT IAG K
Sbjct: 360 EGGMRAPCVIRWPGHIKPGTVFNDIFASLDWLPTFVEIAGGPK 402
>gi|340380159|ref|XP_003388591.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 500
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 35/107 (32%), Positives = 56/107 (52%), Gaps = 11/107 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ D G V+ L+E + N+++V +DNGG SN+PL+G+K++F
Sbjct: 240 MVSVADNVTGHVVNLLKENDMWDNTIMVISADNGGADC--------KGSNYPLKGSKHTF 291
Query: 61 HEGGVRNVAALWS---PLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR +A P R+G+ + +H+ DW T +AG + N
Sbjct: 292 FEGGVRVIAFASGGLIPAARRGKSTDGFIHVADWYTTFCKMAGVDHN 338
>gi|365118348|ref|ZP_09336988.1| hypothetical protein HMPREF1033_00334 [Tannerella sp.
6_1_58FAA_CT1]
gi|363649879|gb|EHL88976.1| hypothetical protein HMPREF1033_00334 [Tannerella sp.
6_1_58FAA_CT1]
Length = 508
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 35/101 (34%), Positives = 61/101 (60%), Gaps = 5/101 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ S+DE+VG+V+ L++EGL N++V +SDNGG + G+ SN+PLRG K
Sbjct: 296 LIWSMDENVGKVLDYLKKEGLYDNTVVCLVSDNGGLSTA----EGSPTSNFPLRGGKGWL 351
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
+EGG+R + +P ++ + + + TD+ PT+ + G
Sbjct: 352 YEGGIRVPYIIKTPYMKGSAGICKTPVSSTDFYPTILELTG 392
>gi|326428402|gb|EGD73972.1| arylsulfatase B [Salpingoeca sp. ATCC 50818]
Length = 545
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 40/107 (37%), Positives = 60/107 (56%), Gaps = 7/107 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT----ASNWPLRGT 56
M+ + D+ VG + +L+ +G+ N+L+V SDNGG L + ASNWPLRG
Sbjct: 282 MVLAADDVVGNITAALKAKGMWENTLLVVHSDNGGAIYQCLDGMDASLCGGASNWPLRGG 341
Query: 57 KYSFHEGGVRNVAAL---WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
K S +GG+R A + + P R+GQV L+ + DW+ T +AG
Sbjct: 342 KLSPFQGGIRVNAFVNGGFLPSHRRGQVETGLVVLADWMATFCELAG 388
>gi|149196713|ref|ZP_01873767.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
HTCC2155]
gi|149140393|gb|EDM28792.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
HTCC2155]
Length = 483
Score = 64.7 bits (156), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 53/100 (53%), Gaps = 9/100 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLDESVGRV+ LEE GL N+ + F DNG GN L+G K
Sbjct: 268 MVQSLDESVGRVLAKLEETGLRDNTYIFFTGDNGS---------GNNLYTGGLKGAKAKA 318
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
HEGG R V + P ++ G + + TD+ PT+ +AG
Sbjct: 319 HEGGTREVYLISGPGIKAGGKSDVPVIGTDFYPTMLDLAG 358
>gi|329896998|ref|ZP_08271791.1| Arylsulfatase [gamma proteobacterium IMCC3088]
gi|328921506|gb|EGG28891.1| Arylsulfatase [gamma proteobacterium IMCC3088]
Length = 560
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 37/96 (38%), Positives = 56/96 (58%), Gaps = 6/96 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+++LD SVGRV+ +L++ GL N+LV+F SDNGGP + G + N P RG K +
Sbjct: 334 MIRALDRSVGRVLDALDQRGLSENTLVIFTSDNGGPG-----YIGLSDINKPFRGWKITQ 388
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
EGG+R L W ++ G + + D +PT+
Sbjct: 389 FEGGIRVPLFLRWPRVIESGVRINTPVAHIDMMPTI 424
>gi|402871939|ref|XP_003899903.1| PREDICTED: arylsulfatase B-like [Papio anubis]
Length = 242
Score = 64.7 bits (156), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 38/76 (50%), Positives = 47/76 (61%), Gaps = 8/76 (10%)
Query: 30 ISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGVRNVAALWSPLLR-KGQVLENLMHI 88
+SDNGG T+ +NWPLRG K+S EGGVR V + SPLL+ KG L+HI
Sbjct: 7 VSDNGGQTLAG-------GNNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHI 59
Query: 89 TDWLPTLYFIAGKEKN 104
+DWLPTL +AG N
Sbjct: 60 SDWLPTLVKLAGGHTN 75
>gi|283778949|ref|YP_003369704.1| sulfatase [Pirellula staleyi DSM 6068]
gi|283437402|gb|ADB15844.1| sulfatase [Pirellula staleyi DSM 6068]
Length = 486
Score = 64.7 bits (156), Expect = 8e-09, Method: Composition-based stats.
Identities = 41/109 (37%), Positives = 58/109 (53%), Gaps = 11/109 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA---------SNW 51
M+ LDE++G+V+ +L+++GL N+L+ F SDNGG T D + F G A N
Sbjct: 242 MVDCLDENIGKVVAALDQKGLRENTLIFFHSDNGG-TKDKM-FAGQMADMSKVVLPCDNG 299
Query: 52 PLRGTKYSFHEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
P R K S EGG R A P K Q ++ ++H D PT +AG
Sbjct: 300 PYRNGKGSLFEGGSRVCALANWPGKIKAQTVDGMIHAVDLYPTFAALAG 348
>gi|149178575|ref|ZP_01857162.1| aryl-sulphate sulphohydrolase [Planctomyces maris DSM 8797]
gi|148842593|gb|EDL56969.1| aryl-sulphate sulphohydrolase [Planctomyces maris DSM 8797]
Length = 467
Score = 64.7 bits (156), Expect = 8e-09, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 45/66 (68%), Gaps = 8/66 (12%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++S+D+S+GRV+Q+L E+ L N++VVF SDNGG +G S PLRG+K
Sbjct: 250 MIRSMDQSIGRVLQTLREQQLDQNTIVVFTSDNGG--------YGPATSMLPLRGSKGML 301
Query: 61 HEGGVR 66
+EGG+R
Sbjct: 302 YEGGIR 307
>gi|421738848|ref|ZP_16177186.1| arylsulfatase A family protein [Streptomyces sp. SM8]
gi|406692707|gb|EKC96390.1| arylsulfatase A family protein [Streptomyces sp. SM8]
Length = 485
Score = 64.7 bits (156), Expect = 8e-09, Method: Composition-based stats.
Identities = 40/101 (39%), Positives = 54/101 (53%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ LD SVG+V+++LE G N+LV F SDNGG + NWPL G K S
Sbjct: 296 MVEDLDRSVGQVLRALERSGQEENTLVFFASDNGGERF---------SYNWPLDGNKGSL 346
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + L W + QV + ++ DW TL I G
Sbjct: 347 KEGGIRVPSVLRWPARIDPRQVSDVPVYSPDWTATLLDIGG 387
>gi|421611065|ref|ZP_16052220.1| arylsulfatase B [Rhodopirellula baltica SH28]
gi|408498167|gb|EKK02671.1| arylsulfatase B [Rhodopirellula baltica SH28]
Length = 498
Score = 64.3 bits (155), Expect = 8e-09, Method: Composition-based stats.
Identities = 41/103 (39%), Positives = 53/103 (51%), Gaps = 8/103 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD VGR+ + LEE G N+L+VF SDNGG T N + N PLRG K S
Sbjct: 257 MMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGAT-------NNGSWNGPLRGVKGSM 309
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R + W G + ++ D LPT AG E
Sbjct: 310 REGGIRVPMIWTWPAKFPAGVRYDGVISSLDLLPTFCSAAGAE 352
>gi|408371886|ref|ZP_11169642.1| arylsulfatase A family protein [Galbibacter sp. ck-I2-15]
gi|407742679|gb|EKF54270.1| arylsulfatase A family protein [Galbibacter sp. ck-I2-15]
Length = 480
Score = 64.3 bits (155), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 62/113 (54%), Gaps = 9/113 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDN-----GGPTVDSLHFHGNTASNWPLRG 55
M+ SLD VG++++ L+ +GL N+LV+F SDN GG T + + F +TA LRG
Sbjct: 267 MVTSLDSYVGKLLEKLKSKGLEENTLVIFTSDNGTHIEGGRTKEDVKFFNSTAG---LRG 323
Query: 56 TKYSFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
K +EGG+R A W + KG + D LPT +AG +K +++
Sbjct: 324 VKRDLYEGGIRTPFIAHWPGTIPKGSISSVTAAYWDLLPTFTELAGYKKRLET 376
>gi|340619489|ref|YP_004737942.1| sulfatase [Zobellia galactanivorans]
gi|339734286|emb|CAZ97663.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
Length = 483
Score = 64.3 bits (155), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 61/102 (59%), Gaps = 5/102 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++S+D +VG+++ L+E+G+ N++V+F SDNGG + G +N PLR K
Sbjct: 272 MVESMDMAVGKILGKLQEKGIADNTVVIFFSDNGGLSTSE----GIPTANTPLRAGKGWL 327
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGK 101
+EGG+R A + W ++ G V + ++ D+ PT+ + G+
Sbjct: 328 YEGGIREPAIIRWPGTIKPGTVSDAVITSMDFYPTILEMTGQ 369
>gi|256841083|ref|ZP_05546590.1| arylsulphatase A [Parabacteroides sp. D13]
gi|256736926|gb|EEU50253.1| arylsulphatase A [Parabacteroides sp. D13]
Length = 452
Score = 64.3 bits (155), Expect = 9e-09, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 55/96 (57%), Gaps = 9/96 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD +VGRV ++LE+ G L N++++F SDNGG A+N P RG K
Sbjct: 249 LIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD-------DAGANNGPFRGAKQDM 301
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W +R V +N + ++D PTL
Sbjct: 302 YEGGIRVAGGIYWKNQIRPA-VRDNFVMLSDMFPTL 336
>gi|393784381|ref|ZP_10372546.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
CL02T12C01]
gi|392666157|gb|EIY59674.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
CL02T12C01]
Length = 456
Score = 64.3 bits (155), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 58/95 (61%), Gaps = 7/95 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D+ VGRVI SLE+ G L N+L++F SDNGG + A+N P+RG K
Sbjct: 256 LIEHMDDGVGRVINSLEKSGQLDNTLIIFCSDNGGD-------RKSEANNGPVRGDKGDM 308
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
++GG++ +L+ + + + NL+ ++D PTL
Sbjct: 309 YDGGIKVACSLYWKGHLEHRRVNNLVMMSDIFPTL 343
>gi|313247306|emb|CBY15582.1| unnamed protein product [Oikopleura dioica]
Length = 486
Score = 64.3 bits (155), Expect = 9e-09, Method: Composition-based stats.
Identities = 41/108 (37%), Positives = 57/108 (52%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +G + L+E GL N++ +F +DNGG + GN N PLRG K +
Sbjct: 214 MVSVMDEGIGNITSHLKESGLWNNTIFIFSTDNGGQA----YVGGN---NLPLRGNKAGY 266
Query: 61 HEGGVRN---VAALWSPLLRKGQVLEN--LMHITDWLPTLYFIAGKEK 103
EGG+ VA + RKG L N L+HI+DW PT+ EK
Sbjct: 267 WEGGIHGNGFVAGGYFEDRRKGSELVNNELIHISDWYPTIMEATSCEK 314
>gi|257067328|ref|YP_003153583.1| arylsulfatase A family protein [Brachybacterium faecium DSM 4810]
gi|256558146|gb|ACU83993.1| arylsulfatase A family protein [Brachybacterium faecium DSM 4810]
Length = 480
Score = 64.3 bits (155), Expect = 9e-09, Method: Composition-based stats.
Identities = 40/102 (39%), Positives = 52/102 (50%), Gaps = 8/102 (7%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
L+ +D +GR++ LE G N+LVV+++DNGG T N N PL GTKYS
Sbjct: 272 LEIMDREIGRMLDHLEATGRRENTLVVYLTDNGGSTC-------NYGINTPLHGTKYSLF 324
Query: 62 EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGGVR W + G + L+ D LPT AG E
Sbjct: 325 EGGVRVPFMVSWPGEVPAGARCDELVSSMDLLPTFLAAAGAE 366
>gi|167520280|ref|XP_001744479.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776810|gb|EDQ90428.1| predicted protein [Monosiga brevicollis MX1]
Length = 314
Score = 64.3 bits (155), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 59/95 (62%), Gaps = 12/95 (12%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
M++++DESV + + ++ GL +++++F +DNGG P SN+PLRG K +
Sbjct: 228 MMRAVDESVRNITATYKQLGLWNDTVLIFTTDNGGNPETG--------GSNYPLRGQKAT 279
Query: 60 FHEGGVRNVAALWSPLL---RKGQVLENLMHITDW 91
EGG+R V + SPLL ++G + + L+H++DW
Sbjct: 280 TFEGGMRGVGFVNSPLLNESQRGFISDELIHVSDW 314
>gi|150008925|ref|YP_001303668.1| arylsulfatase [Parabacteroides distasonis ATCC 8503]
gi|149937349|gb|ABR44046.1| arylsulfatase A [Parabacteroides distasonis ATCC 8503]
Length = 468
Score = 64.3 bits (155), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 57/94 (60%), Gaps = 8/94 (8%)
Query: 3 KSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHE 62
+ LD S+G ++++L+EEGL N+L++F SDNG HF G +N PLRG K + +E
Sbjct: 246 EELDWSMGVLLETLKEEGLDQNTLIIFTSDNGA----DEHFGG---TNRPLRGQKGTTYE 298
Query: 63 GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
GG R + W + GQ +NL+ D+LPTL
Sbjct: 299 GGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTL 332
>gi|149174163|ref|ZP_01852791.1| putative secreted sulfatase ydeN precursor [Planctomyces maris DSM
8797]
gi|148847143|gb|EDL61478.1| putative secreted sulfatase ydeN precursor [Planctomyces maris DSM
8797]
Length = 470
Score = 64.3 bits (155), Expect = 9e-09, Method: Composition-based stats.
Identities = 40/107 (37%), Positives = 56/107 (52%), Gaps = 8/107 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++S+DE++GRV L+E G+ +L+VF SDNGG T++N PLRG K
Sbjct: 263 MIQSVDEAIGRVRAQLKESGIADRTLIVFTSDNGG-------VRRKTSNNDPLRGEKGQH 315
Query: 61 HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG R A LW + G V + D+ PT+ I G N +
Sbjct: 316 WEGGTRVPAIVLWPGVTPAGSVCAEPIITMDFYPTILNITGVAGNTE 362
>gi|298375849|ref|ZP_06985805.1| arylsulfatase [Bacteroides sp. 3_1_19]
gi|423330336|ref|ZP_17308120.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
CL03T12C09]
gi|298266886|gb|EFI08543.1| arylsulfatase [Bacteroides sp. 3_1_19]
gi|409231952|gb|EKN24800.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
CL03T12C09]
Length = 452
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 55/96 (57%), Gaps = 9/96 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD +VGRV ++LE+ G L N++++F SDNGG A+N P RG K
Sbjct: 249 LIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD-------DAGANNGPFRGAKQDM 301
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W +R V +N + ++D PTL
Sbjct: 302 YEGGIRVAGGIYWKNQIRPA-VRDNFVMLSDMFPTL 336
>gi|323452295|gb|EGB08169.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 614
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 40/99 (40%), Positives = 57/99 (57%), Gaps = 9/99 (9%)
Query: 1 MLKSLDESVGRVI-QSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYS 59
++ + DES+ + + L+ GL +++V SDNGG V ++ SN+PLRG K
Sbjct: 292 LMMNADESLANLTNRGLKATGLYDTAVIVIASDNGG-QVSAM----GGGSNYPLRGEKKY 346
Query: 60 FHEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTL 95
EGGVR A + SPLL +G LMH++DWLPTL
Sbjct: 347 LFEGGVRAHAVIHSPLLPRAARGSSYGKLMHMSDWLPTL 385
>gi|149199917|ref|ZP_01876945.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
gi|149136986|gb|EDM25411.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
Length = 499
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 2/106 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ +D SVG VI++L+E G+ +++ F SDNG + + H SN PL+G K S
Sbjct: 257 MIEEIDWSVGEVIKALQENGIEKKTIIAFTSDNGADSKPNKE-HAEKGSNLPLKGWKGSS 315
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
EGGVR L W L +G+ + + D PT +AG E +
Sbjct: 316 EEGGVRVPFVLSWPGTLPEGKKTNEIASLMDIFPTYAALAGIEPEV 361
>gi|218131663|ref|ZP_03460467.1| hypothetical protein BACEGG_03284 [Bacteroides eggerthii DSM 20697]
gi|317474715|ref|ZP_07933989.1| sulfatase [Bacteroides eggerthii 1_2_48FAA]
gi|217985966|gb|EEC52305.1| arylsulfatase [Bacteroides eggerthii DSM 20697]
gi|316909396|gb|EFV31076.1| sulfatase [Bacteroides eggerthii 1_2_48FAA]
Length = 457
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 58/101 (57%), Gaps = 5/101 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D +VGR + L+++GL N+LV+F SDNG L F G S PLRG K+S+
Sbjct: 267 VVEEIDWNVGRFLDYLDKKGLAENTLVIFASDNG----PWLSFKGEGGSADPLRGGKFSY 322
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGGVR + W ++ G + ++ D PT+ AG
Sbjct: 323 YEGGVRVPCIIRWKGVVPAGVTSDAIIASIDLFPTIMHYAG 363
>gi|291243527|ref|XP_002741646.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 506
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 59/108 (54%), Gaps = 10/108 (9%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
L ++D S+G ++++L+ G+ ++L +F SDNG F + SNWPLRG +
Sbjct: 256 LSAMDNSIGLLVEALKTRGMWDDTLFIFTSDNGA-------FATESGSNWPLRGNIATLF 308
Query: 62 EGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGK--EKNID 106
EGG R + +L+K G V L+HI D T+ +AG E +ID
Sbjct: 309 EGGTRVPTFFYGSMLKKAGYVNNELIHIVDLHKTIIELAGATTESDID 356
>gi|301311961|ref|ZP_07217883.1| putative arylsulfatase [Bacteroides sp. 20_3]
gi|423339483|ref|ZP_17317224.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
CL09T03C24]
gi|300830063|gb|EFK60711.1| putative arylsulfatase [Bacteroides sp. 20_3]
gi|409230864|gb|EKN23725.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
CL09T03C24]
Length = 452
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 55/96 (57%), Gaps = 9/96 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD +VGRV ++LE+ G L N++++F SDNGG A+N P RG K
Sbjct: 249 LIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD-------DAGANNGPFRGAKQDM 301
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W +R V +N + ++D PTL
Sbjct: 302 YEGGIRVAGGIYWKNQIRPA-VRDNFVMLSDMFPTL 336
>gi|255014751|ref|ZP_05286877.1| putative secreted sulfatase precursor [Bacteroides sp. 2_1_7]
gi|410102840|ref|ZP_11297765.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
gi|409237967|gb|EKN30762.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
Length = 452
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 55/96 (57%), Gaps = 9/96 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD +VGRV ++LE+ G L N++++F SDNGG A+N P RG K
Sbjct: 249 LIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD-------DAGANNGPFRGAKQDM 301
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W +R V +N + ++D PTL
Sbjct: 302 YEGGIRVAGGIYWKNQIRPA-VRDNFVMLSDMFPTL 336
>gi|254501736|ref|ZP_05113887.1| sulfatase, putative [Labrenzia alexandrii DFL-11]
gi|222437807|gb|EEE44486.1| sulfatase, putative [Labrenzia alexandrii DFL-11]
Length = 518
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 58/111 (52%), Gaps = 16/111 (14%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD-----SLHFHGNTASNWPLRG 55
++K LD +VG+V+ L+E G+ N++V+F SDNG T+ + FHG + W
Sbjct: 273 VMKELDNNVGKVLSRLDELGIADNTIVMFTSDNGPETMTWPDGGTTPFHGEKGTTW---- 328
Query: 56 TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
EGG R A + W + +GQVL + DW+PTL AG N+
Sbjct: 329 ------EGGFRVPAIIRWPGKIAEGQVLNGIFSGLDWMPTLVAAAGGPDNL 373
>gi|386850106|ref|YP_006268119.1| sulfatase [Actinoplanes sp. SE50/110]
gi|359837610|gb|AEV86051.1| sulfatase [Actinoplanes sp. SE50/110]
Length = 462
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 39/101 (38%), Positives = 53/101 (52%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+++LD VG+V+ +L + G N+LV+F SDNGG + WPL GTK S
Sbjct: 276 MVENLDVRVGQVLSALRQTGQEENTLVIFSSDNGGE---------RYSYQWPLTGTKASL 326
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R W + GQV + DW TL IAG
Sbjct: 327 NEGGIRVPTIVRWPARIHGGQVSHEPVVTPDWTATLLEIAG 367
>gi|398828648|ref|ZP_10586848.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
gi|398217506|gb|EJN04023.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
Length = 470
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 57/107 (53%), Gaps = 12/107 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D VGR++Q+L ++ + N++VVF SDNGG ++ WP G K
Sbjct: 254 IVERMDFQVGRILQALTDKKIRENTIVVFTSDNGGERF---------SNTWPFSGKKSEL 304
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R + + W + GQ E + DWLPTL + E N+D
Sbjct: 305 LEGGIRIPSLICWPAKIPSGQTSEQVTLSMDWLPTL--LEAAETNVD 349
>gi|372210598|ref|ZP_09498400.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 468
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 57/101 (56%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +LD++ G++++ L+EEGL ++L+VF +DNGGP G N+PL G K +
Sbjct: 252 MTLALDKACGQILEKLKEEGLEDDTLIVFSNDNGGPD-------GTKTCNYPLSGCKSNN 304
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + P +++ G E + D LPT IAG
Sbjct: 305 LEGGIRVPFVMKLPKVIQPGSSYEYPVSTLDLLPTFVNIAG 345
>gi|220919599|ref|YP_002494902.1| sulfatase [Methylobacterium nodulans ORS 2060]
gi|219952019|gb|ACL62410.1| sulfatase [Methylobacterium nodulans ORS 2060]
Length = 482
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 51/101 (50%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ +D +GRV+ +LE+ G+ +++VVF SDNGG + WP G K
Sbjct: 260 IVAEMDAQIGRVLDALEQHGIAQDTIVVFTSDNGGERF---------SDTWPFTGRKTEL 310
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R A + W L G V M DWLPTL AG
Sbjct: 311 LEGGLRIPAIVSWPRRLPAGVVSAQPMITMDWLPTLLAAAG 351
>gi|423220023|ref|ZP_17206519.1| hypothetical protein HMPREF1061_03292 [Bacteroides caccae
CL03T12C61]
gi|392623855|gb|EIY17953.1| hypothetical protein HMPREF1061_03292 [Bacteroides caccae
CL03T12C61]
Length = 455
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 37/112 (33%), Positives = 61/112 (54%), Gaps = 12/112 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-----ASNWPLRG 55
M+ +D +VG++IQ L+EEG+ N+++ F SDNG +H G SN P RG
Sbjct: 250 MVSEIDRNVGQIIQLLKEEGIWENTIIFFSSDNG------VHLTGGHDYKFFNSNGPFRG 303
Query: 56 TKYSFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
K +EGG+R W ++++ +V ++L D++PT+ I G + D
Sbjct: 304 HKRDLYEGGIRAPFIVSWPMIIKEKRVTDHLSAFWDFMPTVAEIVGADVKSD 355
>gi|302543697|ref|ZP_07296039.1| LOW QUALITY PROTEIN: N-acetylgalactosamine-6-sulfate sulfatase
[Streptomyces hygroscopicus ATCC 53653]
gi|302461315|gb|EFL24408.1| LOW QUALITY PROTEIN: N-acetylgalactosamine-6-sulfate sulfatase
[Streptomyces himastatinicus ATCC 53653]
Length = 318
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 50/105 (47%), Gaps = 10/105 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ LD SVG V+ +L G ++LV F SDNGG + WPL G K+S
Sbjct: 130 MVEDLDRSVGEVLAALRRSGQERDTLVFFASDNGGE---------RWSYQWPLTGAKFSL 180
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
HEGG+R W L GQV + DW TL + G +
Sbjct: 181 HEGGIRVPTIVRWPARLAPGQVSHEPVFTPDWTATLLELGGARPD 225
>gi|323452297|gb|EGB08171.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 889
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 11/104 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DES+ + L+ E + N+L+ F+SDNGG + HG +SN LRG K S+
Sbjct: 223 MVSMVDESLASIHDDLKAEAMWENTLLFFLSDNGG-----ILRHG--SSNLDLRGEKGSY 275
Query: 61 HEGGVRNVA----ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R A + LL G+V E L+HITD T +AG
Sbjct: 276 YEGGIRVPAFVSGGYLAKLLSAGRVFEGLVHITDLHATALRVAG 319
>gi|332529144|ref|ZP_08405108.1| sulfatase [Hylemonella gracilis ATCC 19624]
gi|332041367|gb|EGI77729.1| sulfatase [Hylemonella gracilis ATCC 19624]
Length = 454
Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats.
Identities = 36/95 (37%), Positives = 50/95 (52%), Gaps = 9/95 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++++LD +VG+V+ +L+E G N+LVVF SDNGG + WP G K
Sbjct: 235 LVEALDAAVGQVLAALDETGQADNTLVVFTSDNGGERF---------SKTWPFTGQKTEL 285
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
EGG+R L P + QV + DWLPTL
Sbjct: 286 LEGGLRVPTLLRWPARVRPQVQAQVTITADWLPTL 320
>gi|409198451|ref|ZP_11227114.1| arylsulfatase [Marinilabilia salmonicolor JCM 21150]
Length = 465
Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats.
Identities = 42/114 (36%), Positives = 59/114 (51%), Gaps = 14/114 (12%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-----ASNWPLRG 55
M+ +D +GR++Q L++ GL N+LVVF SDNG H G SN P RG
Sbjct: 261 MVTHMDRDIGRLMQLLKDLGLDENTLVVFTSDNG------THVEGGNDPDFFDSNGPFRG 314
Query: 56 TKYSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGK--EKNID 106
K +EGG+R A W ++ G+ +++ D PT +AGK EKN D
Sbjct: 315 YKRDLYEGGIRVPFIARWKNVIEPGRESDHISAFWDMYPTFCELAGKQPEKNTD 368
>gi|359146688|ref|ZP_09180156.1| sulfatase [Streptomyces sp. S4]
Length = 508
Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats.
Identities = 39/101 (38%), Positives = 54/101 (53%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ LD SVG+V+++LE G N+LV F SDNGG + NWPL G K S
Sbjct: 319 MVEDLDRSVGQVLRALERSGQEENTLVFFASDNGGERF---------SYNWPLDGNKGSL 369
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + L W + QV + ++ DW TL + G
Sbjct: 370 KEGGIRVPSVLRWPARIDPRQVSDVPVYSPDWTATLLELGG 410
>gi|325109217|ref|YP_004270285.1| sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324969485|gb|ADY60263.1| sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 485
Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats.
Identities = 39/103 (37%), Positives = 55/103 (53%), Gaps = 5/103 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++++D S+GRV+ LE+ GL N+++ F SDNGG + G+ SN P RG K
Sbjct: 272 MVETMDTSIGRVLDHLEKLGLTDNTVICFTSDNGGLSTS----EGSPTSNLPFRGGKGWV 327
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
+EGG+R + W G V E + D PTL IA E
Sbjct: 328 YEGGIREPLLIKWPGTTPAGSVCETPICSIDMHPTLCDIADAE 370
>gi|421612348|ref|ZP_16053456.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408496803|gb|EKK01354.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 482
Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats.
Identities = 43/115 (37%), Positives = 61/115 (53%), Gaps = 11/115 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHF--------HGNTASNWP 52
+++ +D +VGR++ SL E L N+ V+F SDNG V + HG +A P
Sbjct: 219 VIEEIDFNVGRILDSLNELNLADNTYVLFTSDNGPWLVKNKGHADGHRPGDHGGSAG--P 276
Query: 53 LRGTKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
LR K S EGGVR A LW+P + G V +++ D +PTL +AG E D
Sbjct: 277 LRSGKVSTFEGGVRVPAILWAPGKVPAGTVCDSIATTMDVMPTLAALAGAEIPTD 331
>gi|449138580|ref|ZP_21773837.1| arylsulfatase B [Rhodopirellula europaea 6C]
gi|448882842|gb|EMB13399.1| arylsulfatase B [Rhodopirellula europaea 6C]
Length = 498
Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 54/103 (52%), Gaps = 8/103 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD VGR+ + LEE G N+L+VF SDNGG T N + N PLRG K S
Sbjct: 257 MMFALDRGVGRIRKHLEETGQWKNTLLVFFSDNGGAT-------NNGSWNGPLRGVKGSM 309
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R + W G+ + ++ D LPT AG +
Sbjct: 310 REGGIRVPMIWTWPAKFPAGKRYDGVISSLDLLPTFCSAAGAQ 352
>gi|325108394|ref|YP_004269462.1| sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324968662|gb|ADY59440.1| sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 475
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 38/108 (35%), Positives = 60/108 (55%), Gaps = 8/108 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG----PTVDSLHFHGNTASNWPLRGT 56
ML+ LD +VGR++ ++E+ N+ V F +DNGG P D+ T +N PL G+
Sbjct: 253 MLEDLDNNVGRLLDAIEKLEAEENTYVFFTADNGGRGTVPGGDT----SRTETNHPLTGS 308
Query: 57 KYSFHEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
K+S +EGG+R P +++ V + D+LPT Y +AG + N
Sbjct: 309 KHSLYEGGIRVPFLARGPGIKQNSVCHQPVVGYDFLPTFYSLAGGKPN 356
>gi|417301290|ref|ZP_12088451.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
gi|327542405|gb|EGF28888.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
Length = 482
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 43/115 (37%), Positives = 61/115 (53%), Gaps = 11/115 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHF--------HGNTASNWP 52
+++ +D +VGR++ SL E L N+ V+F SDNG V + HG +A P
Sbjct: 219 VIEEIDFNVGRILDSLNELNLADNTYVLFTSDNGPWLVKNKGHADGHRPGDHGGSAG--P 276
Query: 53 LRGTKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
LR K S EGGVR A LW+P + G V +++ D +PTL +AG E D
Sbjct: 277 LRSGKVSTFEGGVRVPAILWAPGKVPAGTVCDSIATTMDVMPTLAALAGAELPTD 331
>gi|410617069|ref|ZP_11328045.1| arylsulfatase B [Glaciecola polaris LMG 21857]
gi|410163338|dbj|GAC32183.1| arylsulfatase B [Glaciecola polaris LMG 21857]
Length = 482
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 39/101 (38%), Positives = 56/101 (55%), Gaps = 7/101 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M S+D +G+V+ +L+E GL N+LVVF +DNGGP+ N ++N PL GTK +
Sbjct: 263 MTLSMDREIGKVLNTLDELGLSDNTLVVFTNDNGGPS------DTNASNNGPLSGTKANH 316
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAG 100
EGG+R + P + Q N + D LPT + AG
Sbjct: 317 LEGGIRVPFIMRWPAGKVSQGEYPNPISTLDLLPTFFSAAG 357
>gi|421612380|ref|ZP_16053488.1| N-acetyl-galactosamine-6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408496835|gb|EKK01386.1| N-acetyl-galactosamine-6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 889
Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 60/97 (61%), Gaps = 4/97 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA-SNWPLRGTKYS 59
M++S+D+++G ++ +L+ GL +++VF SDNGG + + G TA SN PLRG K +
Sbjct: 504 MIESMDDAIGTLLDTLDRLGLADETIIVFASDNGGNMYNEV--DGTTATSNAPLRGGKAT 561
Query: 60 FHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
+EGGVR A + P ++ G + ++ D+ PTL
Sbjct: 562 MYEGGVRGPAIVVQPGVVESGSRSDAIIQSIDFYPTL 598
>gi|417301111|ref|ZP_12088281.1| arylsulfatase B [Rhodopirellula baltica WH47]
gi|327542540|gb|EGF29014.1| arylsulfatase B [Rhodopirellula baltica WH47]
Length = 498
Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 53/103 (51%), Gaps = 8/103 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD VGR+ + LEE G N+L+VF SDNGG T N + N PLRG K S
Sbjct: 257 MMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGAT-------NNGSWNGPLRGVKGSM 309
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R + W G + ++ D LPT AG E
Sbjct: 310 REGGIRVPMIWTWPAKFPAGVRYDGVISSLDLLPTFCSAAGAE 352
>gi|262383795|ref|ZP_06076931.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
gi|262294693|gb|EEY82625.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
Length = 452
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 55/96 (57%), Gaps = 9/96 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD +VGRV ++L++ G L N++++F SDNGG A+N P RG K
Sbjct: 249 LIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNGGQD-------DAGANNGPFRGAKQDM 301
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W +R V EN + ++D PTL
Sbjct: 302 YEGGIRVAGGIYWKNRIRPA-VRENFVMLSDMFPTL 336
>gi|198275209|ref|ZP_03207740.1| hypothetical protein BACPLE_01368 [Bacteroides plebeius DSM 17135]
gi|198271792|gb|EDY96062.1| arylsulfatase [Bacteroides plebeius DSM 17135]
Length = 509
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 58/101 (57%), Gaps = 5/101 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ S+DE+VGRV+Q+L++ GL N++V +SDNGG + G+ N PLR K
Sbjct: 299 LIYSMDENVGRVMQALKDNGLDKNTIVCLLSDNGGLSTA----EGSPTCNAPLRAGKGWL 354
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R + P ++ G V + D+ PTL +AG
Sbjct: 355 YEGGIREPFIIKYPQMVEAGSVCHTPVVAVDFYPTLLDMAG 395
>gi|255034790|ref|YP_003085411.1| sulfatase [Dyadobacter fermentans DSM 18053]
gi|254947546|gb|ACT92246.1| sulfatase [Dyadobacter fermentans DSM 18053]
Length = 517
Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 59/101 (58%), Gaps = 1/101 (0%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D+S+G V++ L+E+ + N++V+F+SDNGG + N PL+ K S
Sbjct: 288 LVEGMDKSLGDVLRYLDEQKIADNTVVLFMSDNGGLSTSPARGGKAWTHNLPLKAGKGSV 347
Query: 61 HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R + W + + G V E + I D+ PT+ IAG
Sbjct: 348 YEGGIREPMLVRWPGVTKAGSVTEQYVIIEDFFPTILDIAG 388
>gi|372208819|ref|ZP_09496621.1| sulfatase [Flavobacteriaceae bacterium S85]
Length = 459
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 55/96 (57%), Gaps = 9/96 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ +D+++G V+Q+L+E GL N++VVF SDNGG H SN PLR K
Sbjct: 241 MIEHMDDALGAVVQALKETGLENNTIVVFHSDNGG--------HLKPTSNAPLRSGKGDL 292
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHIT-DWLPTL 95
+EGGVR PL K + N + I+ D +PTL
Sbjct: 293 YEGGVRVPLVFKWPLKIKAGAVSNELAISADIMPTL 328
>gi|404406436|ref|ZP_10998020.1| sulfatase [Alistipes sp. JC136]
Length = 468
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 39/101 (38%), Positives = 56/101 (55%), Gaps = 11/101 (10%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYSF 60
LK +D+ VG +++ LEE G+ N++V+F SDNGG P V N PLRG K +
Sbjct: 273 LKVIDDGVGMIVEKLEELGIADNTIVIFTSDNGGSPQV---------TDNGPLRGGKGTL 323
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG R +W P + G+V E + D+ PTL + G
Sbjct: 324 YEGGTREPMIVWQPGRICGGRVSELPTNNYDFYPTLCQLTG 364
>gi|313231682|emb|CBY08795.1| unnamed protein product [Oikopleura dioica]
Length = 509
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 54/94 (57%), Gaps = 7/94 (7%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K+LD ++G + ++LE G L N++V+++ DNG V++ G+ LRG K
Sbjct: 220 VKALDIAIGEIFEALENAGKLDNTIVIYVGDNGA-AVENFQSSGD------LRGYKGGAF 272
Query: 62 EGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
EGGV+ A L P + +G + + TDWLPTL
Sbjct: 273 EGGVKTFALLSGPGISQGASYKGIFQSTDWLPTL 306
>gi|149197396|ref|ZP_01874447.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139414|gb|EDM27816.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 465
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 36/107 (33%), Positives = 60/107 (56%), Gaps = 7/107 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D+ V +V+QSL+E + N++V F+SDNGGP+ H N + N+PL+G K
Sbjct: 248 MVSAVDDGVSQVMQSLKETNIADNTIVFFLSDNGGPS------HKNKSDNFPLKGQKSDV 301
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG R A+ P ++ QV ++ + D T+ +A + D
Sbjct: 302 WEGGFRVPFAMQYPAAIQAKQVYDHPVSSLDIFATIASLAQSPTHAD 348
>gi|311745994|ref|ZP_07719779.1| N-acetylgalactosamine-6-sulfatase [Algoriphagus sp. PR1]
gi|126576208|gb|EAZ80486.1| N-acetylgalactosamine-6-sulfatase [Algoriphagus sp. PR1]
Length = 504
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 57/101 (56%), Gaps = 9/101 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ +D+S+G ++ LE+ L+ N+++VF+SDNG H N PLRG K S
Sbjct: 290 MIEGMDKSLGDIMDFLEDSKLVENTIIVFMSDNGA--------HKQVPQNSPLRGWKLSP 341
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R + W + + G V+E + I D PT ++G
Sbjct: 342 YEGGIRVPLIVKWPGVTKPGSVVEKQVIIEDIYPTFLDMSG 382
>gi|313234414|emb|CBY24613.1| unnamed protein product [Oikopleura dioica]
Length = 532
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 59/108 (54%), Gaps = 10/108 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLDE VG VI + LL N++ F++DNGG T D +NWPL G K++
Sbjct: 249 MVASLDEIVGDVIDLYHKNRLLKNTIGFFVADNGGETRDG-------GNNWPLSGQKWTL 301
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLY--FIAGKEKNID 106
EGGV+ ++ L+ G+ L H++D LPT+ F +K ID
Sbjct: 302 LEGGVKATGFMFGGLVPLGK-YNGLFHVSDILPTVMEAFNCPIKKKID 348
>gi|430741674|ref|YP_007200803.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430013394|gb|AGA25108.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 454
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 39/103 (37%), Positives = 59/103 (57%), Gaps = 4/103 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT--ASNWPLRGTKY 58
M+ LD++VGR++ +LE GL ++LVVF+SD+G T +S + + SN P RG K
Sbjct: 255 MVTQLDKNVGRIMTALEGLGLAADTLVVFVSDHGA-TFESGNLGASDYHDSNRPFRGQKR 313
Query: 59 SFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+ EGG+R A W + G + ++H+TD PTL G
Sbjct: 314 TLWEGGIRVPGVACWPGHVPAGVISNEVVHMTDLFPTLLAATG 356
>gi|410633554|ref|ZP_11344196.1| N-acetylgalactosamine-6-sulfatase [Glaciecola arctica BSs20135]
gi|410146795|dbj|GAC21063.1| N-acetylgalactosamine-6-sulfatase [Glaciecola arctica BSs20135]
Length = 440
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 56/100 (56%), Gaps = 4/100 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD SVG+V+Q+L E+G+ N+LVVF SDNG L F S P RG K
Sbjct: 226 VMEELDWSVGQVLQTLREQGIAENTLVVFSSDNG----PWLLFETMGGSAGPFRGGKNET 281
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R W P K +++ ++ D LPTL +AG
Sbjct: 282 LEGGLRVPGFFWWPGHIKPELVHDIGSTIDLLPTLVSVAG 321
>gi|383116283|ref|ZP_09937035.1| hypothetical protein BSHG_3339 [Bacteroides sp. 3_2_5]
gi|251945469|gb|EES85907.1| hypothetical protein BSHG_3339 [Bacteroides sp. 3_2_5]
Length = 493
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLDE++GR++ +L GL ++VVF SDNGG + + T SN PLR K
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
+EGG++ A + WS L+ QV + + TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSRHLKGRQVSDTPIIGTDYYPT 363
>gi|149196406|ref|ZP_01873461.1| aryl-sulphate sulphohydrolase [Lentisphaera araneosa HTCC2155]
gi|149140667|gb|EDM29065.1| aryl-sulphate sulphohydrolase [Lentisphaera araneosa HTCC2155]
Length = 493
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 37/103 (35%), Positives = 59/103 (57%), Gaps = 9/103 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+++LD +VGR++ +LEE+GL +L++F SDNGG H + PLR K S+
Sbjct: 245 MIEALDHNVGRLVAALEEQGLREKTLIMFTSDNGG--------HMKFSRQEPLRAGKGSY 296
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
+EGG+R A W ++ G + + D+ PT+ +AG E
Sbjct: 297 YEGGIRVPFFASWPGVIEAGSRSQVPVTGLDFYPTVCELAGVE 339
>gi|423260671|ref|ZP_17241593.1| hypothetical protein HMPREF1055_03870 [Bacteroides fragilis
CL07T00C01]
gi|423266807|ref|ZP_17245809.1| hypothetical protein HMPREF1056_03496 [Bacteroides fragilis
CL07T12C05]
gi|387775225|gb|EIK37334.1| hypothetical protein HMPREF1055_03870 [Bacteroides fragilis
CL07T00C01]
gi|392699582|gb|EIY92757.1| hypothetical protein HMPREF1056_03496 [Bacteroides fragilis
CL07T12C05]
Length = 493
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLDE++GR++ +L GL ++VVF SDNGG + + T SN PLR K
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
+EGG++ A + WS L+ QV + + TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363
>gi|375092146|ref|ZP_09738431.1| hypothetical protein HMPREF9709_01293 [Helcococcus kunzii ATCC
51366]
gi|374561912|gb|EHR33249.1| hypothetical protein HMPREF9709_01293 [Helcococcus kunzii ATCC
51366]
Length = 472
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 53/103 (51%), Gaps = 8/103 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
L+ +D+ +G +I +EE G N+L+V+ +DNGG N +N PL GTKY+
Sbjct: 264 QLELMDKKIGEIIDKVEEMGQKDNTLIVYTTDNGGSPC-------NYGNNAPLTGTKYTL 316
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
+EGG R W ++ K Q NL D L T ++ G E
Sbjct: 317 YEGGTRVPFIFSWDGVIPKNQSSRNLSSTLDLLTTFAYLGGVE 359
>gi|375360006|ref|YP_005112778.1| putative exported sulfatase [Bacteroides fragilis 638R]
gi|301164687|emb|CBW24246.1| putative exported uslfatase [Bacteroides fragilis 638R]
Length = 493
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLDE++GR++ +L GL ++VVF SDNGG + + T SN PLR K
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
+EGG++ A + WS L+ QV + + TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363
>gi|392390175|ref|YP_006426778.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
15997]
gi|390521253|gb|AFL96984.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
15997]
Length = 467
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 41/110 (37%), Positives = 61/110 (55%), Gaps = 11/110 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML ++D ++GRV+Q L+ +G+ N+LV+F++DNGG T N + N LRG K S
Sbjct: 252 MLSNMDYNIGRVMQELKNKGIDDNTLVIFLNDNGGATT-------NYSDNGQLRGMKGSV 304
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG---KEKNID 106
EGGVR W+ + V + + D LPT AG K+K++D
Sbjct: 305 WEGGVRVGYIMRWNGKIPANVVYDKAVSSLDILPTSLAAAGNTRKDKHLD 354
>gi|60683167|ref|YP_213311.1| sulfatase [Bacteroides fragilis NCTC 9343]
gi|423270164|ref|ZP_17249135.1| hypothetical protein HMPREF1079_02217 [Bacteroides fragilis
CL05T00C42]
gi|423276123|ref|ZP_17255065.1| hypothetical protein HMPREF1080_03718 [Bacteroides fragilis
CL05T12C13]
gi|60494601|emb|CAH09402.1| putative exported uslfatase [Bacteroides fragilis NCTC 9343]
gi|392698088|gb|EIY91270.1| hypothetical protein HMPREF1079_02217 [Bacteroides fragilis
CL05T00C42]
gi|392699263|gb|EIY92444.1| hypothetical protein HMPREF1080_03718 [Bacteroides fragilis
CL05T12C13]
Length = 493
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLDE++GR++ +L GL ++VVF SDNGG + + T SN PLR K
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
+EGG++ A + WS L+ QV + + TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363
>gi|423285640|ref|ZP_17264522.1| hypothetical protein HMPREF1204_04060 [Bacteroides fragilis HMW
615]
gi|404579155|gb|EKA83873.1| hypothetical protein HMPREF1204_04060 [Bacteroides fragilis HMW
615]
Length = 493
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLDE++GR++ +L GL ++VVF SDNGG + + T SN PLR K
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
+EGG++ A + WS L+ QV + + TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363
>gi|423251817|ref|ZP_17232830.1| hypothetical protein HMPREF1066_03840 [Bacteroides fragilis
CL03T00C08]
gi|423255138|ref|ZP_17236068.1| hypothetical protein HMPREF1067_02712 [Bacteroides fragilis
CL03T12C07]
gi|392649242|gb|EIY42921.1| hypothetical protein HMPREF1066_03840 [Bacteroides fragilis
CL03T00C08]
gi|392652579|gb|EIY46238.1| hypothetical protein HMPREF1067_02712 [Bacteroides fragilis
CL03T12C07]
Length = 493
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLDE++GR++ +L GL ++VVF SDNGG + + T SN PLR K
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
+EGG++ A + WS L+ QV + + TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363
>gi|440713548|ref|ZP_20894148.1| N-acetylgalactosamine-6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
gi|436441706|gb|ELP34912.1| N-acetylgalactosamine-6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
Length = 889
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 60/97 (61%), Gaps = 4/97 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA-SNWPLRGTKYS 59
M++S+D+++G ++ +L+ G+ +++VF SDNGG + + G TA SN PLRG K +
Sbjct: 504 MIESMDDAIGTLLDTLDRLGIADETIIVFASDNGGNMYNEV--DGTTATSNAPLRGGKAT 561
Query: 60 FHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
+EGGVR A + P ++ G + ++ D+ PTL
Sbjct: 562 MYEGGVRGPAIVVQPGVVESGSRSDAIIQSIDFYPTL 598
>gi|408676338|ref|YP_006876165.1| Arylsulfatase [Streptomyces venezuelae ATCC 10712]
gi|328880667|emb|CCA53906.1| Arylsulfatase [Streptomyces venezuelae ATCC 10712]
Length = 467
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 38/101 (37%), Positives = 54/101 (53%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ LD S+G+V+ +L+ G ++LV F SDNGG + NWPL G K S
Sbjct: 282 MVEDLDRSIGQVLTALKRSGQEQDTLVFFASDNGGERF---------SYNWPLAGNKGSL 332
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + L W + GQV + + DW TL +AG
Sbjct: 333 QEGGIRVPSILRWPARIDGGQVSDLPVFTPDWTATLLDLAG 373
>gi|53715233|ref|YP_101225.1| sulfatase ydeN [Bacteroides fragilis YCH46]
gi|52218098|dbj|BAD50691.1| putative secreted sulfatase ydeN precursor [Bacteroides fragilis
YCH46]
Length = 493
Score = 62.8 bits (151), Expect = 3e-08, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLDE++GR++ +L GL ++VVF SDNGG + + T SN PLR K
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
+EGG++ A + WS L+ QV + + TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363
>gi|343084295|ref|YP_004773590.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342352829|gb|AEL25359.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 499
Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 63/103 (61%), Gaps = 2/103 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D+S+G ++Q L+E+ L+ N++++F+SDNGG +V S +T N PL K S
Sbjct: 275 LIEGMDKSLGDIMQHLKEKNLMENTIILFMSDNGGLSVHSRGGEPHT-HNKPLSSGKGSI 333
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
HEGG+R + W + G +N + I D+ P++ +AG E
Sbjct: 334 HEGGIRVPMIVKWPGVTIPGSTNDNYLIIEDFYPSILEMAGVE 376
>gi|407940996|ref|YP_006856637.1| sulfatase [Acidovorax sp. KKS102]
gi|407898790|gb|AFU47999.1| sulfatase [Acidovorax sp. KKS102]
Length = 444
Score = 62.8 bits (151), Expect = 3e-08, Method: Composition-based stats.
Identities = 38/107 (35%), Positives = 52/107 (48%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +GR++ LE +GL ++LVVF SDNGG + NWPL G K
Sbjct: 240 MIHHMDEGIGRIMALLEAKGLARDTLVVFTSDNGGERF---------SDNWPLVGGKMDL 290
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R A W ++ G V DW T+ AG + D
Sbjct: 291 TEGGIRVPWIAHWPAVIAPGGVSAQTCMTMDWSATMLDAAGAATHTD 337
>gi|32471039|ref|NP_864032.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
gi|32396741|emb|CAD71706.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
Length = 889
Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 60/97 (61%), Gaps = 4/97 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA-SNWPLRGTKYS 59
M++S+D+++G ++ +L+ G+ +++VF SDNGG + + G TA SN PLRG K +
Sbjct: 504 MIESMDDAIGTLLDTLDRLGIADETIIVFASDNGGNMYNEV--DGTTATSNAPLRGGKAT 561
Query: 60 FHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
+EGGVR A + P ++ G + ++ D+ PTL
Sbjct: 562 MYEGGVRGPAIVVQPGVVESGSRSDAIIQSIDFYPTL 598
>gi|239816550|ref|YP_002945460.1| sulfatase [Variovorax paradoxus S110]
gi|239803127|gb|ACS20194.1| sulfatase [Variovorax paradoxus S110]
Length = 438
Score = 62.8 bits (151), Expect = 3e-08, Method: Composition-based stats.
Identities = 37/107 (34%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +G ++ +L + G+ N+LVVF SDNGG + NWPL G K
Sbjct: 234 MIHHMDEGIGWIMAALRKHGMADNTLVVFTSDNGGERF---------SDNWPLVGGKMDL 284
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R A W ++ KG L DW T+ AG + + D
Sbjct: 285 TEGGIRVPWIAHWPAVIAKGGESRQLCMTMDWSATMLDAAGVKADAD 331
>gi|265767063|ref|ZP_06094892.1| sulfatase [Bacteroides sp. 2_1_16]
gi|263253440|gb|EEZ24916.1| sulfatase [Bacteroides sp. 2_1_16]
Length = 493
Score = 62.8 bits (151), Expect = 3e-08, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLDE++GR++ +L GL ++VVF SDNGG + + T SN PLR K
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
+EGG++ A + WS L+ QV + + TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363
>gi|417301514|ref|ZP_12088666.1| arylsulfatase B [Rhodopirellula baltica WH47]
gi|327542201|gb|EGF28693.1| arylsulfatase B [Rhodopirellula baltica WH47]
Length = 489
Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 55/101 (54%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M ++D +GR++ +++ G+ N+LVVF SDNG D H SN PLRG K++
Sbjct: 286 MTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNG---ADPKH----GGSNEPLRGNKFTT 338
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR VA + W L G +++ D LP + AG
Sbjct: 339 WEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPIMVAAAG 379
>gi|167524284|ref|XP_001746478.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775240|gb|EDQ88865.1| predicted protein [Monosiga brevicollis MX1]
Length = 520
Score = 62.8 bits (151), Expect = 3e-08, Method: Composition-based stats.
Identities = 38/115 (33%), Positives = 59/115 (51%), Gaps = 15/115 (13%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLH------------FHGNTA 48
M+ LD+++ L+ +G+ N+L+V SDNGG L F+G
Sbjct: 248 MVAFLDDNLANFTSLLKSKGMWNNTLMVLTSDNGGYVKSFLGDCDNNTSHGYACFNGEAG 307
Query: 49 SNWPLRGTKYSFHEGGVRN---VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+N+PL+G KYS EGGVR V+ + P R+G + ++H+ DW T +AG
Sbjct: 308 ANYPLQGGKYSTLEGGVRVNAFVSGGFVPEERRGTREDEMVHVADWYATFAALAG 362
>gi|313213139|emb|CBY36997.1| unnamed protein product [Oikopleura dioica]
Length = 532
Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 59/108 (54%), Gaps = 10/108 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLDE VG V+ + LL N++ F++DNGG T D +NWPL G K++
Sbjct: 249 MVASLDEIVGDVVDLYHKNRLLKNTIGFFVADNGGETRDG-------GNNWPLSGQKWTL 301
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLY--FIAGKEKNID 106
EGGV+ ++ L+ G+ L H++D LPT+ F +K ID
Sbjct: 302 LEGGVKATGFMFGGLVPLGK-YNGLFHVSDILPTVMEAFNCPIKKKID 348
>gi|313246656|emb|CBY35538.1| unnamed protein product [Oikopleura dioica]
Length = 351
Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 59/108 (54%), Gaps = 10/108 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLDE VG V+ + LL N++ F++DNGG T D +NWPL G K++
Sbjct: 110 MVASLDEIVGDVVDLYHKNRLLKNTIGFFVADNGGETRDG-------GNNWPLSGQKWTL 162
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLY--FIAGKEKNID 106
EGGV+ ++ L+ G+ L H++D LPT+ F +K ID
Sbjct: 163 LEGGVKATGFMFGGLVPLGK-YNGLFHVSDILPTVMEAFNCPIKKKID 209
>gi|325106503|ref|YP_004276157.1| sulfatase [Pedobacter saltans DSM 12145]
gi|324975351|gb|ADY54335.1| sulfatase [Pedobacter saltans DSM 12145]
Length = 470
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 36/95 (37%), Positives = 55/95 (57%), Gaps = 8/95 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+ VG ++++L+E+ L N+L+VF++DNG T GN A N LRG K S
Sbjct: 251 MMVSLDDGVGELVKTLKEQNLYDNTLIVFVNDNGAAT-------GNYADNGKLRGLKGSK 303
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPT 94
EGG+R + P + G+V ++ D +PT
Sbjct: 304 WEGGIRVAYIMQYPDQIPAGKVYSKMVSGLDIMPT 338
>gi|320103547|ref|YP_004179138.1| sulfatase [Isosphaera pallida ATCC 43644]
gi|319750829|gb|ADV62589.1| sulfatase [Isosphaera pallida ATCC 43644]
Length = 502
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 38/96 (39%), Positives = 57/96 (59%), Gaps = 3/96 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+L+ +D S+GR++ +L + GL N+ VVF+SDNGG +S + G SN PLR K +
Sbjct: 265 LLEEMDTSIGRILNALNQRGLDSNTWVVFLSDNGGLERESGGWPG--TSNRPLRDQKGTL 322
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + A L + G V+E + D PTL
Sbjct: 323 YEGGIRIPMIARIPGLTKPGTVVETPVVTHDLAPTL 358
>gi|423239862|ref|ZP_17220977.1| hypothetical protein HMPREF1065_01600 [Bacteroides dorei
CL03T12C01]
gi|392644851|gb|EIY38585.1| hypothetical protein HMPREF1065_01600 [Bacteroides dorei
CL03T12C01]
Length = 464
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG +I+ L+E+GL ++++VF SDNG + + H SN P RG K
Sbjct: 261 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 319
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W ++ +G V ++ D+LPT+
Sbjct: 320 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 355
>gi|333907388|ref|YP_004480974.1| N-acetylgalactosamine-6-sulfatase [Marinomonas posidonica
IVIA-Po-181]
gi|333477394|gb|AEF54055.1| N-acetylgalactosamine-6-sulfatase [Marinomonas posidonica
IVIA-Po-181]
Length = 437
Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 49/96 (51%), Gaps = 10/96 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +GR++ +LEE GL ++L+VF SDNGG + NWPL G K
Sbjct: 233 MIHHMDEGIGRIMAALEEHGLTDDTLIVFTSDNGGERF---------SDNWPLVGGKMDL 283
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
EGG+R A W + G+V DW TL
Sbjct: 284 TEGGIRVPWIARWPKKIDAGKVSTQHCMTMDWSRTL 319
>gi|198274831|ref|ZP_03207363.1| hypothetical protein BACPLE_00990 [Bacteroides plebeius DSM 17135]
gi|198272278|gb|EDY96547.1| arylsulfatase [Bacteroides plebeius DSM 17135]
Length = 459
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 38/104 (36%), Positives = 60/104 (57%), Gaps = 5/104 (4%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
++ +D +VGR++ LE+EGL N++V+F SDNG L N S PLR K+S +
Sbjct: 266 VEEIDWNVGRLLDFLEKEGLSENTMVIFTSDNG----PWLGMKENGGSALPLRDGKFSAY 321
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGGVR + W + G+V ++++ D PTL +A +K+
Sbjct: 322 EGGVRTPCIIKWKNHIPAGKVSDHIIASIDIFPTLLELAQMDKS 365
>gi|431907856|gb|ELK11463.1| Arylsulfatase B [Pteropus alecto]
Length = 267
Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/77 (49%), Positives = 47/77 (61%), Gaps = 8/77 (10%)
Query: 29 FISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGVRNVAALWSPLLR-KGQVLENLMH 87
+ DNGG T+ GN NWPLRG K+S EGG+R V + SPLL+ KG L+H
Sbjct: 31 LVHDNGGQTLAG----GN---NWPLRGRKWSLWEGGIRGVGFVTSPLLKQKGVKNRELIH 83
Query: 88 ITDWLPTLYFIAGKEKN 104
I+DWLPTL +AG N
Sbjct: 84 ISDWLPTLVNLAGGSTN 100
>gi|212690903|ref|ZP_03299031.1| hypothetical protein BACDOR_00391 [Bacteroides dorei DSM 17855]
gi|237712609|ref|ZP_04543090.1| arylsulfatase [Bacteroides sp. 9_1_42FAA]
gi|212666135|gb|EEB26707.1| arylsulfatase [Bacteroides dorei DSM 17855]
gi|229453930|gb|EEO59651.1| arylsulfatase [Bacteroides sp. 9_1_42FAA]
Length = 464
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG +I+ L+E+GL ++++VF SDNG + + H SN P RG K
Sbjct: 261 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 319
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W ++ +G V ++ D+LPT+
Sbjct: 320 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 355
>gi|265752292|ref|ZP_06088085.1| arylsulfatase [Bacteroides sp. 3_1_33FAA]
gi|423229021|ref|ZP_17215426.1| hypothetical protein HMPREF1063_01246 [Bacteroides dorei
CL02T00C15]
gi|423244861|ref|ZP_17225935.1| hypothetical protein HMPREF1064_02141 [Bacteroides dorei
CL02T12C06]
gi|263237084|gb|EEZ22554.1| arylsulfatase [Bacteroides sp. 3_1_33FAA]
gi|392634774|gb|EIY28686.1| hypothetical protein HMPREF1063_01246 [Bacteroides dorei
CL02T00C15]
gi|392640902|gb|EIY34693.1| hypothetical protein HMPREF1064_02141 [Bacteroides dorei
CL02T12C06]
Length = 464
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG +I+ L+E+GL ++++VF SDNG + + H SN P RG K
Sbjct: 261 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 319
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W ++ +G V ++ D+LPT+
Sbjct: 320 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 355
>gi|149198336|ref|ZP_01875382.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149138632|gb|EDM27039.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 512
Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 61/107 (57%), Gaps = 2/107 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++S+D++VG ++ +++E G+ ++++F+SDNGG + T N+PLRG K S
Sbjct: 268 MIESMDDAVGSLLNAVDEAGIADETVIIFVSDNGGNMYSKIDGVFPT-DNYPLRGGKASM 326
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R + W + + E L+ +D+ PT+ G E +D
Sbjct: 327 CEGGIRVPCVISWPGVTQDETRSEELIQTSDFYPTILNQLGIEIPVD 373
>gi|345512484|ref|ZP_08792010.1| arylsulfatase [Bacteroides dorei 5_1_36/D4]
gi|345456366|gb|EEO45525.2| arylsulfatase [Bacteroides dorei 5_1_36/D4]
Length = 464
Score = 62.4 bits (150), Expect = 4e-08, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG +I+ L+E+GL ++++VF SDNG + + H SN P RG K
Sbjct: 261 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 319
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W ++ +G V ++ D+LPT+
Sbjct: 320 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 355
>gi|421613867|ref|ZP_16054938.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408495453|gb|EKK00041.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 501
Score = 62.4 bits (150), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 58/107 (54%), Gaps = 2/107 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D +GR++ LEE L +LV+F SDNG P + H S+ PL+G+K S
Sbjct: 293 MITRMDRDMGRLMDLLEELKLSEKTLVIFTSDNG-PHHEGGHSDLFFNSSGPLQGSKRSM 351
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
HEGG+R A WS + G + ++ D+LPT +AG E D
Sbjct: 352 HEGGIRVPFIAKWSGTIEPGTISDHPSAFWDFLPTACELAGAEPPAD 398
>gi|402820941|ref|ZP_10870501.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
IMCC14465]
gi|402510173|gb|EJW20442.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
IMCC14465]
Length = 496
Score = 62.4 bits (150), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 12/111 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ S+D+++GR++ L+ +G+ N++++F SDNGG F G N P RG K
Sbjct: 238 MVDSMDQAIGRIMDELDRQGVSDNTIILFFSDNGG-------FAGFGGVNKPYRGGKLET 290
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG----KEKNID 106
EGG+R N W + G + + D PTL A EK ID
Sbjct: 291 FEGGIRVNAIMRWPGAIPAGSKTAQAISVMDVFPTLAAAADIEMHNEKKID 341
>gi|417302348|ref|ZP_12089451.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
gi|327541354|gb|EGF27895.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
Length = 501
Score = 62.4 bits (150), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 58/107 (54%), Gaps = 2/107 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D +GR++ LEE L +LV+F SDNG P + H S+ PL+G+K S
Sbjct: 293 MITRMDRDMGRLMDLLEELKLSEKTLVIFTSDNG-PHHEGGHSDLFFNSSGPLQGSKRSM 351
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
HEGG+R A WS + G + ++ D+LPT +AG E D
Sbjct: 352 HEGGIRVPFIAKWSGTIEPGTISDHPSAFWDFLPTACELAGAEPPAD 398
>gi|410628681|ref|ZP_11339399.1| sulfatase [Glaciecola mesophila KMM 241]
gi|410151685|dbj|GAC26168.1| sulfatase [Glaciecola mesophila KMM 241]
Length = 502
Score = 62.4 bits (150), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 59/111 (53%), Gaps = 19/111 (17%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M S+D +G+V+ SL+E GL N+L+VF +DNGGP+ N + N PL GTK +
Sbjct: 285 MTLSMDREIGKVLASLDELGLSENTLLVFTNDNGGPS------DTNASDNTPLSGTKANH 338
Query: 61 HEGGVRNVAALWSPLLRK---GQVLENLMH----ITDWLPTLYFIAGKEKN 104
EGG+R P L + G + N H D LPT + AG +K+
Sbjct: 339 LEGGIR------VPFLMRWPAGNIGPNTYHEPISTLDLLPTFFAAAGGKKS 383
>gi|32471439|ref|NP_864432.1| arylsulfatase B [precursor] [Rhodopirellula baltica SH 1]
gi|32443280|emb|CAD72111.1| Arylsulfatase B [Precursor] [Rhodopirellula baltica SH 1]
Length = 579
Score = 62.4 bits (150), Expect = 4e-08, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 56/101 (55%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M ++D +GR++ +++ G+ N+LVVF SDNG D H +N PLRG K++
Sbjct: 376 MTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGA---DPKH----GGNNEPLRGNKFTT 428
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR VA + W L G +++ D LP++ AG
Sbjct: 429 WEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPSMVGAAG 469
>gi|410096140|ref|ZP_11291130.1| hypothetical protein HMPREF1076_00308 [Parabacteroides goldsteinii
CL02T12C30]
gi|409227211|gb|EKN20112.1| hypothetical protein HMPREF1076_00308 [Parabacteroides goldsteinii
CL02T12C30]
Length = 516
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 54/104 (51%), Gaps = 8/104 (7%)
Query: 4 SLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFH------GNTASNWPLRGTK 57
D SVG+++ L+ GL N+L++ SDN GP VD + G+ PLRG K
Sbjct: 303 QFDWSVGQILDELDRLGLRENTLIILSSDN-GPVVDDGYADQAEELLGDHKPGGPLRGGK 361
Query: 58 YSFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
YS E G R A A W ++KGQV + LM DW +L + G
Sbjct: 362 YSAFEAGTRIPAIASWPKEIKKGQVSDALMSQVDWFASLAALTG 405
>gi|398996733|ref|ZP_10699583.1| arylsulfatase A family protein [Pseudomonas sp. GM21]
gi|398126037|gb|EJM15485.1| arylsulfatase A family protein [Pseudomonas sp. GM21]
Length = 571
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 37/103 (35%), Positives = 59/103 (57%), Gaps = 6/103 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K +D+++G V++ LE G L N++VVF +DNG +S F + P + K +
Sbjct: 305 MKQMDDNIGLVLKKLESMGQLDNTIVVFTTDNGA---ESQTFPDGGMT--PFKAGKLTTW 359
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGG+R A + W ++ G V E+++ DWLPTL +AG K
Sbjct: 360 EGGMRVPALIRWPGQIKPGTVYEDIVASYDWLPTLVEVAGGPK 402
>gi|340619477|ref|YP_004737930.1| sulfatase [Zobellia galactanivorans]
gi|339734274|emb|CAZ97651.1| Sulfatase, family S1-20 [Zobellia galactanivorans]
Length = 488
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 41/106 (38%), Positives = 57/106 (53%), Gaps = 8/106 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPT---VDSLHFHGNTASNWPLRGTK 57
M+ LD VG ++ +LE+ G+ N+LVVF SDNG T D +F+ S+ P RGTK
Sbjct: 276 MVHLLDAQVGEIMATLEKLGIADNTLVVFTSDNGPHTEGGADPEYFN----SSGPFRGTK 331
Query: 58 YSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
+EGG+R + A W ++ G E + D PT IAG E
Sbjct: 332 RDLYEGGIRVPMVAAWPGKIKPGSRTELVSAFWDVFPTFSDIAGIE 377
>gi|294778194|ref|ZP_06743624.1| arylsulfatase [Bacteroides vulgatus PC510]
gi|294447960|gb|EFG16530.1| arylsulfatase [Bacteroides vulgatus PC510]
Length = 466
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG +I+ L+E+GL ++++VF SDNG + + H SN P RG K
Sbjct: 263 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 321
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W ++ +G V ++ D+LPT+
Sbjct: 322 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 357
>gi|410636000|ref|ZP_11346606.1| predicted sulfatase [Glaciecola lipolytica E3]
gi|410144354|dbj|GAC13811.1| predicted sulfatase [Glaciecola lipolytica E3]
Length = 474
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 55/104 (52%), Gaps = 4/104 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLDESVG ++ L L N+L++F SDNGG + F N A PLR K +
Sbjct: 261 MVASLDESVGTLLNHLTSNNLTHNTLILFYSDNGGLDKNDTGFPTNNA---PLRNGKATL 317
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
+EGG+R A W + G ++ TD+ PTL + G +K
Sbjct: 318 YEGGIRVPFIASWPGKIAAGSESNEMITSTDFYPTLAKLVGSKK 361
>gi|340369111|ref|XP_003383092.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 507
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 56/103 (54%), Gaps = 8/103 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+ VG + +L+ +G+ N L V SDNGGP G A+N+PL+G K +
Sbjct: 258 MVNYLDDVVGELTDALKNKGMWDNLLFVTSSDNGGPV-----HEGTGANNYPLKGGKTTD 312
Query: 61 HEGGVR---NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+GGVR V+ + P +GQ + +H+ DW T IAG
Sbjct: 313 WQGGVRVNAFVSGGYLPEKMRGQKTDGYIHLADWYGTFCAIAG 355
>gi|227540472|ref|ZP_03970521.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
spiritivorum ATCC 33300]
gi|227239796|gb|EEI89811.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
spiritivorum ATCC 33300]
Length = 466
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 33/95 (34%), Positives = 56/95 (58%), Gaps = 8/95 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ ++D+ +GR+++ L++ G L N+L++F++DNG T N A N PLRG K S
Sbjct: 249 LMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAAT-------NNGADNGPLRGLKGSK 301
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
EGG+R + W L+ G+ + L+ D +PT
Sbjct: 302 WEGGIRVPMIMHWPKQLQAGKDYKGLVSSLDLVPT 336
>gi|323450479|gb|EGB06360.1| hypothetical protein AURANDRAFT_13369, partial [Aureococcus
anophagefferens]
Length = 304
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 56/108 (51%), Gaps = 11/108 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ + D ++G + +L++ L N+L+V +SDNG SN PLRG K +
Sbjct: 188 VVAATDGAIGDLQAALQKHALWPNTLLVLLSDNGASAAGG--------SNAPLRGAKGGY 239
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAGKEKNI 105
EGGVR A L LL +G E L+H+ DW PTL AG ++
Sbjct: 240 FEGGVRTRALLAGSLLPQNVRGSSHEGLVHVADWAPTLLNAAGLRPDL 287
>gi|423315320|ref|ZP_17293250.1| hypothetical protein HMPREF1058_03862 [Bacteroides vulgatus
CL09T03C04]
gi|423315328|ref|ZP_17293258.1| hypothetical protein HMPREF1058_03870 [Bacteroides vulgatus
CL09T03C04]
gi|392679557|gb|EIY72938.1| hypothetical protein HMPREF1058_03862 [Bacteroides vulgatus
CL09T03C04]
gi|392679565|gb|EIY72946.1| hypothetical protein HMPREF1058_03870 [Bacteroides vulgatus
CL09T03C04]
Length = 464
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG +I+ L+E+GL ++++VF SDNG + + H SN P RG K
Sbjct: 261 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 319
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W ++ +G V ++ D+LPT+
Sbjct: 320 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 355
>gi|325109684|ref|YP_004270752.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969952|gb|ADY60730.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 498
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++S+D++VGR++ +L+ L N+LV+F+SDNGG +++ SN PLRG K S
Sbjct: 266 MIESMDDAVGRLLDTLDRLQLTENTLVIFMSDNGGNMYNTVD-KTTPTSNAPLRGGKASM 324
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
EGG+R + P + G E ++ D+ PT+
Sbjct: 325 FEGGIRVPCIISYPGHVEAGSRSEAIIQSCDFYPTI 360
>gi|150008923|ref|YP_001303666.1| secreted sulfatase [Parabacteroides distasonis ATCC 8503]
gi|149937347|gb|ABR44044.1| putative secreted sulfatase precursor [Parabacteroides distasonis
ATCC 8503]
Length = 452
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 34/96 (35%), Positives = 55/96 (57%), Gaps = 9/96 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD +VGRV ++L++ G L N++++F SDNGG A+N P RG K
Sbjct: 249 LIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNGGQD-------DAGANNGPFRGAKQDM 301
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W +R V +N + ++D PTL
Sbjct: 302 YEGGIRVAGGIYWKNQIRPA-VRDNFVMLSDMFPTL 336
>gi|150002770|ref|YP_001297514.1| arylsulfatase [Bacteroides vulgatus ATCC 8482]
gi|149931194|gb|ABR37892.1| arylsulfatase precursor [Bacteroides vulgatus ATCC 8482]
Length = 464
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG +I+ L+E+GL ++++VF SDNG + + H SN P RG K
Sbjct: 261 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 319
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W ++ +G V ++ D+LPT+
Sbjct: 320 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 355
>gi|375143552|ref|YP_005005993.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
gi|361057598|gb|AEV96589.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
Length = 482
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 42/109 (38%), Positives = 60/109 (55%), Gaps = 4/109 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD VG+V+Q L+E G+ N+LV+F SDN GP + + SN PLRG K
Sbjct: 272 MVSRLDVYVGQVLQKLKEAGIDKNTLVIFSSDN-GPHKEGGYDPDYFNSNGPLRGIKRDL 330
Query: 61 HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG--KEKNID 106
+EGG+R + A W ++ G + + D+LPT +A K NID
Sbjct: 331 YEGGMREPMIAWWPGKIKAGATSDYVGAFWDFLPTFAELAKALKPANID 379
>gi|319643165|ref|ZP_07997794.1| arylsulfatase [Bacteroides sp. 3_1_40A]
gi|345520543|ref|ZP_08799930.1| arylsulfatase [Bacteroides sp. 4_3_47FAA]
gi|317385206|gb|EFV66156.1| arylsulfatase [Bacteroides sp. 3_1_40A]
gi|345456873|gb|EET15372.2| arylsulfatase [Bacteroides sp. 4_3_47FAA]
Length = 464
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+SVG +I+ L+E+GL ++++VF SDNG + + H SN P RG K
Sbjct: 261 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 319
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + W ++ +G V ++ D+LPT+
Sbjct: 320 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 355
>gi|170726755|ref|YP_001760781.1| sulfatase [Shewanella woodyi ATCC 51908]
gi|169812102|gb|ACA86686.1| sulfatase [Shewanella woodyi ATCC 51908]
Length = 548
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 60/103 (58%), Gaps = 6/103 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML+S+D SV RV+ LE++G+ N++V+ SDNGG +++ N P RG K +F
Sbjct: 325 MLRSIDRSVERVMAKLEKQGIADNTIVILSSDNGGADYVAIN-----DLNKPYRGWKNTF 379
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R + W ++ + V+E ++ D +PT+ +A +
Sbjct: 380 FEGGIRVPFSVTWPNVIDESTVIEEPVNHIDLMPTIINMANAD 422
>gi|108761533|ref|YP_634630.1| sulfatase [Myxococcus xanthus DK 1622]
gi|108465413|gb|ABF90598.1| sulfatase family protein [Myxococcus xanthus DK 1622]
Length = 553
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 38/105 (36%), Positives = 60/105 (57%), Gaps = 6/105 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
++ LD+ VG ++ L+E G+ N+LVVF +DNG V+ + + N P RG K S
Sbjct: 268 MRELDDIVGVLLAKLDELGIADNTLVVFSTDNG---VEKMGW--PDGGNSPFRGEKGSTW 322
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
EGGVR + W ++ G+V+ ++ DW+PTL AG K++
Sbjct: 323 EGGVRVPCMVRWPGVVEPGRVINDIFAHEDWMPTLVSAAGGPKDL 367
>gi|405345812|ref|ZP_11022551.1| Arylsulfatase [Chondromyces apiculatus DSM 436]
gi|397093455|gb|EJJ24162.1| Arylsulfatase [Myxococcus sp. (contaminant ex DSM 436)]
Length = 554
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 38/105 (36%), Positives = 60/105 (57%), Gaps = 6/105 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
++ LD+ VG ++ L+E G+ N+LVVF +DNG V+ + + N P RG K S
Sbjct: 269 MRELDDIVGVLLAKLDELGIADNTLVVFSTDNG---VEKMGW--PDGGNSPFRGEKGSTW 323
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
EGGVR + W ++ G+V+ ++ DW+PTL AG K++
Sbjct: 324 EGGVRVPCMVRWPGVVEPGRVINDIFAHEDWMPTLVSAAGGPKDL 368
>gi|223936836|ref|ZP_03628745.1| sulfatase [bacterium Ellin514]
gi|223894405|gb|EEF60857.1| sulfatase [bacterium Ellin514]
Length = 477
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 41/106 (38%), Positives = 57/106 (53%), Gaps = 4/106 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGP---TVDSLHFHGNTASNWPLRGTK 57
M+ S+DESVGRV+ L+E L N+LV+F SDNGG + + G+ N PLRG K
Sbjct: 240 MIASVDESVGRVVALLDELKLSDNTLVIFSSDNGGVGGYQREGIKKAGDVTDNNPLRGGK 299
Query: 58 YSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
+EGG R W + G+V + + D PTL +AG +
Sbjct: 300 GMLYEGGHRVPYIFRWPGKIPAGKVCDQPIISIDLYPTLLELAGAK 345
>gi|417305741|ref|ZP_12092689.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
gi|327537968|gb|EGF24664.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
Length = 889
Score = 62.0 bits (149), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 59/97 (60%), Gaps = 4/97 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA-SNWPLRGTKYS 59
M++S+D+++G ++ +L+ G+ +++VF SDNGG + + G TA SN PLRG K +
Sbjct: 504 MIESMDDAIGTLLDTLDRLGIADETIIVFASDNGGNMYNEV--DGTTATSNAPLRGGKAT 561
Query: 60 FHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
+EGGVR A + P + G + ++ D+ PTL
Sbjct: 562 MYEGGVRGPAIVVQPGAVESGSRSDAIIQSIDFYPTL 598
>gi|443705042|gb|ELU01787.1| hypothetical protein CAPTEDRAFT_153777 [Capitella teleta]
Length = 551
Score = 62.0 bits (149), Expect = 5e-08, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 54/101 (53%), Gaps = 9/101 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ +D +G ++ ++E G+ N+ ++F SDNGG N P RG K +
Sbjct: 243 LVSGMDYIIGGIVDKIKEIGIYENTYILFSSDNGGDV--------GEGDNSPHRGGKSTL 294
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EGG + + SPLL V N LMH+TDWLPT+ +AG
Sbjct: 295 WEGGCKANSWTHSPLLGATGVENNGLMHVTDWLPTIVDLAG 335
>gi|443691100|gb|ELT93060.1| hypothetical protein CAPTEDRAFT_21969 [Capitella teleta]
Length = 529
Score = 62.0 bits (149), Expect = 5e-08, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 54/101 (53%), Gaps = 9/101 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ +D +G ++ ++E G+ N+ ++F SDNGG N P RG K +
Sbjct: 221 LVSGMDYIIGGIVDKIKEIGIYENTYILFSSDNGGDV--------GEGDNSPHRGGKSTL 272
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EGG + + SPLL V N LMH+TDWLPT+ +AG
Sbjct: 273 WEGGCKANSWTHSPLLGATGVENNGLMHVTDWLPTIVDLAG 313
>gi|323455471|gb|EGB11339.1| hypothetical protein AURANDRAFT_2359, partial [Aureococcus
anophagefferens]
Length = 367
Score = 62.0 bits (149), Expect = 5e-08, Method: Composition-based stats.
Identities = 37/103 (35%), Positives = 58/103 (56%), Gaps = 8/103 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D+ G V+ +L GL +L+++ SDNGG ++H G ++ WPLRG Y+
Sbjct: 240 MVALMDDVAGDVVGALRTAGLWAKTLLLWSSDNGG----AVHL-GGGSNTWPLRGGYYNN 294
Query: 61 HEGGVRNVAALWS---PLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR A L P + +G++L ++I DW T +AG
Sbjct: 295 FEGGVRAPAFLAGGALPEVARGKLLTEWIYIADWYATFARLAG 337
>gi|330505678|ref|YP_004382547.1| putative sulfatase [Pseudomonas mendocina NK-01]
gi|328919964|gb|AEB60795.1| probable sulfatase precursor [Pseudomonas mendocina NK-01]
Length = 629
Score = 62.0 bits (149), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 54/100 (54%), Gaps = 7/100 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+L LD ++ R++++LEE G N++V+F SDNGG + N +N+PL G K ++
Sbjct: 253 ILSQLDHNMARLLKALEESGQADNTVVIFASDNGGANI-------NMDNNYPLIGKKATY 305
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR L P + Q + D PTL IAG
Sbjct: 306 TEGGVRTPLLLRWPGHYENQDIREPTVFMDIFPTLVSIAG 345
>gi|149200261|ref|ZP_01877282.1| putative exported uslfatase [Lentisphaera araneosa HTCC2155]
gi|149136625|gb|EDM25057.1| putative exported uslfatase [Lentisphaera araneosa HTCC2155]
Length = 472
Score = 62.0 bits (149), Expect = 5e-08, Method: Composition-based stats.
Identities = 36/100 (36%), Positives = 62/100 (62%), Gaps = 2/100 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++S+DE++GR+ + L++ GL N++V+F SDNGG + + G T+S +PLRG K
Sbjct: 261 MVESMDENIGRLNKFLKQSGLDKNTVVIFTSDNGGLSTNKNPKAGPTSS-FPLRGGKAWV 319
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIA 99
+EGG+R + W + G + + + TD+ PT+ +A
Sbjct: 320 YEGGIRVPLIIKWPESTQAGSEISSPVISTDFYPTILSMA 359
>gi|313212310|emb|CBY36307.1| unnamed protein product [Oikopleura dioica]
Length = 661
Score = 62.0 bits (149), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 53/94 (56%), Gaps = 7/94 (7%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K+LD ++G + ++LE G L N++V++ DNG V++ G+ LRG K
Sbjct: 372 VKALDIAIGEIFEALENAGKLDNTIVIYAGDNGA-AVENFQSSGD------LRGYKGGAF 424
Query: 62 EGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
EGGV+ A L P + +G + + TDWLPTL
Sbjct: 425 EGGVKTFALLSGPGISQGASYKGIFQSTDWLPTL 458
>gi|329956963|ref|ZP_08297531.1| arylsulfatase [Bacteroides clarus YIT 12056]
gi|328523720|gb|EGF50812.1| arylsulfatase [Bacteroides clarus YIT 12056]
Length = 460
Score = 61.6 bits (148), Expect = 5e-08, Method: Composition-based stats.
Identities = 38/101 (37%), Positives = 54/101 (53%), Gaps = 12/101 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-----ASNWPLRG 55
M+ LD SVG +I L+E GL N++VVF SDNG +H G SN P RG
Sbjct: 256 MVTYLDRSVGIIIDKLKELGLYDNTIVVFTSDNG------VHAEGGHDPYFFDSNGPFRG 309
Query: 56 TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
K +EGG+R + W ++ G V +++ D+LPT+
Sbjct: 310 YKRDLYEGGIRTPFVIQWPRIIPAGVVTDHVSAFWDFLPTI 350
>gi|410631840|ref|ZP_11342513.1| arylsulfatase B [Glaciecola arctica BSs20135]
gi|410148741|dbj|GAC19380.1| arylsulfatase B [Glaciecola arctica BSs20135]
Length = 544
Score = 61.6 bits (148), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 57/99 (57%), Gaps = 12/99 (12%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGP---TVDSLHFHGNTASNWPLRGTK 57
ML+SLD SV RV+ LEE+G+ N++V+ SDNGG +D+L N P RG K
Sbjct: 321 MLRSLDRSVERVMAKLEEQGIADNTIVILSSDNGGADYVAIDNL--------NKPYRGWK 372
Query: 58 YSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
+F EGG+R + W + G +++ ++ D +PT+
Sbjct: 373 STFFEGGIRVPFSITWPGEIEAGLIIDEPVNHFDLMPTI 411
>gi|196231892|ref|ZP_03130748.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196224014|gb|EDY18528.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 486
Score = 61.6 bits (148), Expect = 5e-08, Method: Composition-based stats.
Identities = 41/110 (37%), Positives = 60/110 (54%), Gaps = 3/110 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPT-VDSLHFHG-NTASNWPLRGTKY 58
M K LDE+VGRV++ L+E GL N++V+F SDNGG VD + +N PLR K
Sbjct: 260 MNKELDENVGRVLEHLKERGLDKNTVVIFASDNGGYIGVDKVSGKNMPVTNNAPLRSGKG 319
Query: 59 SFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
+ +EGG+R + W + G + + +TD L T I G+ D+
Sbjct: 320 ALYEGGIRVPLIIRWPGVTPNGATCDEPVILTDMLQTFLHITGQPPATDA 369
>gi|171910116|ref|ZP_02925586.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
Length = 480
Score = 61.6 bits (148), Expect = 5e-08, Method: Composition-based stats.
Identities = 39/101 (38%), Positives = 59/101 (58%), Gaps = 2/101 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+L++LD ++GRV+++LEE+ L ++VVF SDNGG + G T SN PLR K
Sbjct: 266 LLENLDTNIGRVLKALEEQKLREKTIVVFTSDNGGLCTLAKGRTGPT-SNLPLRCGKGWN 324
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R + W L+ G V+ + D PTL ++G
Sbjct: 325 YEGGIRTPCYISWPGHLKGGTVIGTPAYTPDLYPTLLELSG 365
>gi|406832516|ref|ZP_11092110.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 453
Score = 61.6 bits (148), Expect = 5e-08, Method: Composition-based stats.
Identities = 39/96 (40%), Positives = 52/96 (54%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML ++D++VG+V+ L E L +LVVFISDNGGP N ++N PL G K +
Sbjct: 247 MLTAMDDAVGQVLDKLAAEKLEQKTLVVFISDNGGPP-------ANGSTNTPLNGQKATT 299
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
EGGVR W L G+ E + D LPT+
Sbjct: 300 WEGGVRVPFVVSWPGQLAAGKKYEEPVIQLDLLPTI 335
>gi|397643272|gb|EJK75757.1| hypothetical protein THAOC_02512 [Thalassiosira oceanica]
Length = 870
Score = 61.6 bits (148), Expect = 5e-08, Method: Composition-based stats.
Identities = 39/110 (35%), Positives = 60/110 (54%), Gaps = 10/110 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ +D+ +QS+++ G N+LV+F SDNGG ++ +G + N+PLRG+K SF
Sbjct: 343 MMRYIDDLFDGTMQSIKDAGQWENTLVLFTSDNGG----AVFPNGQSNQNYPLRGSKRSF 398
Query: 61 HEGGVRNVAALWSPLL--RKGQVLENLMHI----TDWLPTLYFIAGKEKN 104
+GG R L + R G+ N H DW PT IAG ++N
Sbjct: 399 FDGGNRVNQFLTGGWIDARLGKDRSNESHTYVFGLDWAPTFVSIAGGDRN 448
>gi|423288214|ref|ZP_17267065.1| hypothetical protein HMPREF1069_02108 [Bacteroides ovatus
CL02T12C04]
gi|392671103|gb|EIY64579.1| hypothetical protein HMPREF1069_02108 [Bacteroides ovatus
CL02T12C04]
Length = 473
Score = 61.6 bits (148), Expect = 5e-08, Method: Composition-based stats.
Identities = 35/106 (33%), Positives = 61/106 (57%), Gaps = 15/106 (14%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKY--- 58
++++DESVGR+ +++++ G+ N++V++ SD GG SN+PLRG K
Sbjct: 274 IEAVDESVGRIRKAIKDRGIEDNTIVIYFSDQGG-----------LYSNYPLRGCKLIDD 322
Query: 59 SFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEK 103
+ EGG+R A++ P + +KG V + + D+ PT IA +K
Sbjct: 323 TMAEGGIRVPLAIYYPGVTKKGSVCQTPVQALDFFPTFMEIATGKK 368
>gi|443716207|gb|ELU07840.1| hypothetical protein CAPTEDRAFT_136127, partial [Capitella teleta]
Length = 311
Score = 61.6 bits (148), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 54/101 (53%), Gaps = 9/101 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ +D +G ++ ++E G+ N+ ++F SDNGG N P RG K +
Sbjct: 169 LVSGMDYIIGGIVDKIKEIGIYENTYILFSSDNGGDV--------GEGDNSPHRGGKSTL 220
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EGG + + SPLL V N LMH+TDWLPT+ +AG
Sbjct: 221 WEGGCKANSWTHSPLLGASGVENNGLMHVTDWLPTIVDLAG 261
>gi|398810130|ref|ZP_10568960.1| arylsulfatase A family protein [Variovorax sp. CF313]
gi|398083821|gb|EJL74525.1| arylsulfatase A family protein [Variovorax sp. CF313]
Length = 438
Score = 61.6 bits (148), Expect = 5e-08, Method: Composition-based stats.
Identities = 37/107 (34%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +G ++ +L++ G+ N+LVVF SDNGG + NWPL G K
Sbjct: 234 MIHHMDEGIGWIMAALQKHGMADNTLVVFTSDNGGERF---------SDNWPLVGGKMDL 284
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R A W ++ KG L DW T+ AG + D
Sbjct: 285 TEGGIRVPWIAHWPAVIGKGGESTQLCMTMDWSATMLDAAGVAAHAD 331
>gi|14518340|ref|NP_116823.1| MS135, putative arylsulfatase [Microscilla sp. PRE1]
gi|14484975|gb|AAK62857.1| MS135, putative arylsulfatase [Microscilla sp. PRE1]
Length = 249
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 56/101 (55%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +++ + G+++ L+E GL N+LVVF +DNGGP G ASN+PL G K +
Sbjct: 35 MTLAMNRACGQILDKLKEHGLEENTLVVFSNDNGGPV-------GTMASNYPLSGAKSNN 87
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + P ++ G + + + D LPT +AG
Sbjct: 88 LEGGIRVPCIMKLPGVIDPGSEYPHPISMLDMLPTFVSVAG 128
>gi|196231680|ref|ZP_03130537.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196224152|gb|EDY18665.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 474
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 40/103 (38%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+L LDE +G+V L +GL N+LVVF+SDNGGPT + T+SN PLRG K
Sbjct: 283 VLAHLDEDIGKVRAQLRADGLEENTLVVFLSDNGGPTKEL------TSSNLPLRGGKGDL 336
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
+GG+R A W + G ++ D T +AG E
Sbjct: 337 WDGGIRIPFAVSWKGQIPAGHTIDAPAISMDLTATALKLAGAE 379
>gi|410446790|ref|ZP_11300893.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
gi|409980462|gb|EKO37213.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
Length = 540
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 61/101 (60%), Gaps = 6/101 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ LD+++G+V+ +L +EG+ ++++F SDNG + G A N+PLRG K
Sbjct: 258 VVDGLDQAIGQVLDTLTKEGIEEETIILFSSDNG-----GAAYAGGGADNFPLRGGKGDT 312
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R VAA+ W L G+ E++M + D PTL AG
Sbjct: 313 YEGGIRVVAAMRWKGTLDAGKSFESIMTVMDVFPTLASAAG 353
>gi|320101855|ref|YP_004177446.1| sulfatase [Isosphaera pallida ATCC 43644]
gi|319749137|gb|ADV60897.1| sulfatase [Isosphaera pallida ATCC 43644]
Length = 493
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 38/104 (36%), Positives = 54/104 (51%), Gaps = 4/104 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGP---TVDSLHFHGNTASNWPLRGTK 57
M+ S+DESVGR++ ++ L +LV+F SDNGG + L + N PLRG K
Sbjct: 254 MIASVDESVGRIVAQIDRLALDRQTLVIFTSDNGGVGGYAREGLRRSQDVTDNAPLRGGK 313
Query: 58 YSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R A W + G V + + D PTL +AG
Sbjct: 314 GMLYEGGIRVPFLARWPGRVPSGVVCDEPIQSIDLFPTLLELAG 357
>gi|386287174|ref|ZP_10064349.1| sulfatase family protein [gamma proteobacterium BDW918]
gi|385279706|gb|EIF43643.1| sulfatase family protein [gamma proteobacterium BDW918]
Length = 555
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 36/96 (37%), Positives = 55/96 (57%), Gaps = 6/96 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++++D SVGR++ +L+EEGL N++VVF SDNGG + G N P RG K +
Sbjct: 332 MVRAVDRSVGRILATLKEEGLSDNTIVVFSSDNGGAG-----YIGIPDINAPYRGWKLTM 386
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
EGG+R + W + + ++ D+LPTL
Sbjct: 387 FEGGIRVPLFMRWPAHIARNTSIDTPAAHIDFLPTL 422
>gi|319794759|ref|YP_004156399.1| sulfatase [Variovorax paradoxus EPS]
gi|315597222|gb|ADU38288.1| sulfatase [Variovorax paradoxus EPS]
Length = 439
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 51/101 (50%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +G ++ +L++ G+ N+LVVF SDNGG + NWPL G K
Sbjct: 235 MIHHMDEGIGWIMAALQKHGMSDNTLVVFTSDNGGERF---------SDNWPLVGGKMDL 285
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R A W ++ KG L DW T+ AG
Sbjct: 286 TEGGIRVPWIAHWPAVIAKGGESAQLCMTMDWSATMLEAAG 326
>gi|149199924|ref|ZP_01876952.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149136993|gb|EDM25418.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 455
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 58/101 (57%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
MLK LD VG ++++L++ + N++++++SDNGG A+N PLRG K+
Sbjct: 239 MLKHLDLGVGEIVKTLKKHDIYENTIIIYLSDNGGAK-------SMVANNKPLRGVKHDI 291
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
++GG+R + W ++ GQ ++ + D LPTL AG
Sbjct: 292 YDGGIRVPFLMSWPAQIKAGQDTQSPVISLDILPTLLDAAG 332
>gi|406830958|ref|ZP_11090552.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 441
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 38/101 (37%), Positives = 56/101 (55%), Gaps = 3/101 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML ++D+++G+V+ SL E N+L+ FISDNGGPT+ N++ N PLRG+K +
Sbjct: 236 MLSAMDDAIGKVLASLRETAQDENTLIFFISDNGGPTMPGTTI--NSSINAPLRGSKRTT 293
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R W + G V + + D PT AG
Sbjct: 294 LEGGIRVPFVVRWPGHVPAGAVYAHPVIQLDIQPTALAAAG 334
>gi|340617020|ref|YP_004735473.1| sulfatase [Zobellia galactanivorans]
gi|339731817|emb|CAZ95082.1| Sulfatase, family S1-20 [Zobellia galactanivorans]
Length = 452
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 41/113 (36%), Positives = 61/113 (53%), Gaps = 13/113 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDN-----GGPTVDSLHFHGNTASNWPLRG 55
M+ LD VG+++ L++ L GN+LV+F SDN GG VD SN PLRG
Sbjct: 258 MITRLDSYVGQILDLLKKNNLDGNTLVIFTSDNGPHQEGGWQVDYFD------SNGPLRG 311
Query: 56 TKYSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
K +EGG+R A W + KG+ + + D+LPT ++G++K I +
Sbjct: 312 MKRDLYEGGIRVPFIAKWPGKIEKGET-DQVATFWDFLPTACELSGQDKPIST 363
>gi|333911967|ref|YP_004485699.1| N-acetylgalactosamine-6-sulfatase [Delftia sp. Cs1-4]
gi|333742167|gb|AEF87344.1| N-acetylgalactosamine-6-sulfatase [Delftia sp. Cs1-4]
Length = 442
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +G V+ +LE++G+ N+L+VF SDNGG ++NWP G K
Sbjct: 236 MIHHMDEGIGWVLDALEKKGMAENTLIVFTSDNGGERF---------SNNWPFIGQKMDL 286
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R + A W + G V + DW T AG + D
Sbjct: 287 LEGGIRVPLVARWPDRMPAGAVSDTPNLTMDWSATFLAAAGVHGHAD 333
>gi|311746014|ref|ZP_07719799.1| sulfatase family protein [Algoriphagus sp. PR1]
gi|126576230|gb|EAZ80508.1| sulfatase family protein [Algoriphagus sp. PR1]
Length = 489
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 61/106 (57%), Gaps = 4/106 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++S+D++VG+V+ LE GL N+LV+F SDNGG + + +N+PL+ K
Sbjct: 285 LVESVDQNVGKVLSELERMGLRENTLVIFTSDNGGLIGN---YDNPITNNYPLKSQKGYP 341
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
+EGG+R + W + +G V E + DW+PT+ G++ +
Sbjct: 342 YEGGIRIPTIVSWPGKIPQGFVDETPIITMDWIPTILDFMGEDPTL 387
>gi|160895745|ref|YP_001561327.1| sulfatase [Delftia acidovorans SPH-1]
gi|160361329|gb|ABX32942.1| sulfatase [Delftia acidovorans SPH-1]
Length = 442
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +G V+ +LE++G+ N+L+VF SDNGG ++NWP G K
Sbjct: 236 MIHHMDEGIGWVLDALEKKGMAENTLIVFTSDNGGERF---------SNNWPFIGQKMDL 286
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R + A W + G V + DW T AG + D
Sbjct: 287 LEGGIRVPLVARWPDRMPAGAVSDTPNLTMDWSATFLAAAGVHGHAD 333
>gi|339503592|ref|YP_004691012.1| arylsulfatase [Roseobacter litoralis Och 149]
gi|338757585|gb|AEI94049.1| putative arylsulfatase [Roseobacter litoralis Och 149]
Length = 525
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 58/106 (54%), Gaps = 6/106 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++K LD++VG+V+ L+E G N++VVF SDN GP + G T P RG K +
Sbjct: 280 VMKELDDNVGKVLARLDELGQRDNTIVVFTSDN-GPETMTWPDGGTT----PFRGEKGTT 334
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
EGG R A + W + +G+V + DW+PTL AG N+
Sbjct: 335 WEGGFRVPAIIRWPGKVPEGRVGNGIFSGMDWMPTLVAAAGGPDNL 380
>gi|264676341|ref|YP_003276247.1| sulfatase [Comamonas testosteroni CNB-2]
gi|262206853|gb|ACY30951.1| sulfatase [Comamonas testosteroni CNB-2]
Length = 442
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 36/107 (33%), Positives = 52/107 (48%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +G ++ +LEE GL N+L+VF SDNGG ++ WP G K
Sbjct: 236 MIHHMDEGIGWLLDALEENGLSENTLIVFTSDNGGERF---------SNTWPFVGQKMDL 286
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R + A W + G V + DW T AG + + D
Sbjct: 287 LEGGIRVPLLARWPARMTAGAVCDTPSLTMDWSATFLAAAGVDAHAD 333
>gi|343082868|ref|YP_004772163.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342351402|gb|AEL23932.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 462
Score = 61.6 bits (148), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 56/101 (55%), Gaps = 12/101 (11%)
Query: 4 SLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEG 63
+D+S+G +++ ++E GL N+LV+F+SDNGG ++ A N+PL+G K F EG
Sbjct: 272 QMDQSIGTILERIKEMGLEENTLVIFLSDNGGMSL---------ADNYPLKGGKAQFFEG 322
Query: 64 GVRNVAAL--WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
GVR V L W + GQ + + + P + AG E
Sbjct: 323 GVR-VPCLVKWPEKIDAGQTNDAFLSSLELFPMIVAAAGLE 362
>gi|443703066|gb|ELU00815.1| hypothetical protein CAPTEDRAFT_95989 [Capitella teleta]
Length = 382
Score = 61.2 bits (147), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 54/101 (53%), Gaps = 9/101 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ +D +G ++ ++E G+ N+ ++F SDNGG N P RG K +
Sbjct: 240 LVSGMDYIIGGIVDKIKEIGIYENTYILFSSDNGGDV--------GEGDNSPHRGGKSTL 291
Query: 61 HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
EGG + + SPLL V N LMH+TDWLPT+ +AG
Sbjct: 292 WEGGCKANSWTHSPLLGASGVENNGLMHVTDWLPTIVDLAG 332
>gi|386395695|ref|ZP_10080473.1| arylsulfatase A family protein [Bradyrhizobium sp. WSM1253]
gi|385736321|gb|EIG56517.1| arylsulfatase A family protein [Bradyrhizobium sp. WSM1253]
Length = 589
Score = 61.2 bits (147), Expect = 7e-08, Method: Composition-based stats.
Identities = 36/103 (34%), Positives = 57/103 (55%), Gaps = 6/103 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K +D+++G V++ LE+ G L N+++VF +DNG + + G T +G K +
Sbjct: 326 MKQMDDNIGYVLKKLEDMGQLDNTIIVFTTDNGAENI-TFPDGGTTL----FKGGKLTTW 380
Query: 62 EGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGG+R + W ++ G V +M DWLPTL IAG K
Sbjct: 381 EGGMRAPMVVRWPGHIKPGTVKSEMMSALDWLPTLVDIAGGPK 423
>gi|340616348|ref|YP_004734801.1| sulfatase [Zobellia galactanivorans]
gi|339731145|emb|CAZ94409.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
Length = 489
Score = 61.2 bits (147), Expect = 7e-08, Method: Composition-based stats.
Identities = 36/106 (33%), Positives = 62/106 (58%), Gaps = 7/106 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD++VG ++++LEE+G++ N+++VF SDNGG + HG T N+PLR K S
Sbjct: 267 LVEHLDDAVGAILKALEEKGIIDNTIIVFTSDNGGEIL-----HGIT-DNFPLRDGKGSS 320
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
+EGG R + W + + V + D+ PT +AG ++
Sbjct: 321 YEGGTRVPLLVKWPGVTQVNTVSHERIIGFDYYPTFLSMAGANPDL 366
>gi|75910438|ref|YP_324734.1| twin-arginine translocation pathway signal protein [Anabaena
variabilis ATCC 29413]
gi|75704163|gb|ABA23839.1| Twin-arginine translocation pathway signal [Anabaena variabilis
ATCC 29413]
Length = 457
Score = 61.2 bits (147), Expect = 7e-08, Method: Composition-based stats.
Identities = 35/89 (39%), Positives = 49/89 (55%), Gaps = 9/89 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+KSLD+ VGRV+ +LE G N+LV+F SDNGG + P RG K S
Sbjct: 266 MVKSLDDGVGRVLDALEASGQADNTLVIFTSDNGGERFSNFG---------PFRGQKASL 316
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHIT 89
+EGG+R A + P + + + N + IT
Sbjct: 317 YEGGIRVPAIIRYPGVTQANQVSNQVIIT 345
>gi|340377481|ref|XP_003387258.1| PREDICTED: arylsulfatase I-like [Amphimedon queenslandica]
Length = 507
Score = 61.2 bits (147), Expect = 7e-08, Method: Composition-based stats.
Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 10/104 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+ VG ++ +L+++G+ N L V SDNGGP F G A+N+PL+G K S
Sbjct: 258 MVNYLDDVVGDLVSALKKKGMWDNLLYVTSSDNGGPVYP---FGG--ANNYPLKGGKLSD 312
Query: 61 HEGGVRNVAALWS----PLLRKGQVLENLMHITDWLPTLYFIAG 100
+GG+R V A S P +GQ + +H+ DW T IAG
Sbjct: 313 WQGGIR-VNAFVSGGHLPEKMRGQKTDGYIHLADWYGTFCAIAG 355
>gi|291449944|ref|ZP_06589334.1| n-acetylgalactosamine-6-sulfate sulfatase [Streptomyces albus
J1074]
gi|291352893|gb|EFE79795.1| n-acetylgalactosamine-6-sulfate sulfatase [Streptomyces albus
J1074]
Length = 464
Score = 61.2 bits (147), Expect = 7e-08, Method: Composition-based stats.
Identities = 39/101 (38%), Positives = 53/101 (52%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ LD SVG+V+++LE G N+LV F SDNGG + NWPL G K S
Sbjct: 275 MVEDLDRSVGQVLRALERSGQEENTLVFFASDNGGERF---------SYNWPLDGNKGSL 325
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
E G+R + L W + QV + ++ DW TL I G
Sbjct: 326 KERGIRVPSVLRWPARIDPRQVSDVPVYSPDWTATLLDIGG 366
>gi|393786360|ref|ZP_10374496.1| hypothetical protein HMPREF1068_00776 [Bacteroides nordii
CL02T12C05]
gi|392659989|gb|EIY53606.1| hypothetical protein HMPREF1068_00776 [Bacteroides nordii
CL02T12C05]
Length = 467
Score = 61.2 bits (147), Expect = 7e-08, Method: Composition-based stats.
Identities = 38/109 (34%), Positives = 57/109 (52%), Gaps = 4/109 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD + RVI L E+G+L N++++F SDNG + H G SN P RG K
Sbjct: 261 MVTRLDRDIQRVIDLLAEKGVLENTIIIFTSDNGTHK-EGGHDPGYFDSNGPFRGIKRDL 319
Query: 61 HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE--KNID 106
+EGG+R W ++ G V + D++PT+ + G + NID
Sbjct: 320 YEGGIRTPFIVRWPGVIPAGSVSYQISTFWDFMPTMCELIGAKAPANID 368
>gi|167515556|ref|XP_001742119.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778743|gb|EDQ92357.1| predicted protein [Monosiga brevicollis MX1]
Length = 339
Score = 61.2 bits (147), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 53/96 (55%), Gaps = 5/96 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M LD++VG + +L+ + N++++F SDNGGP +H ++N+PLRG K +
Sbjct: 247 MTSILDDAVGNITDALKRSRIADNTILIFTSDNGGP----VHDENTESNNYPLRGGKNTL 302
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
GG + V + + G +MH +DWLP+L
Sbjct: 303 WNGGTQVVGMIAGKGIENPGTDCHGMMHASDWLPSL 338
>gi|149196669|ref|ZP_01873723.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
gi|149140349|gb|EDM28748.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
Length = 472
Score = 61.2 bits (147), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 52/96 (54%), Gaps = 5/96 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++K +D GR+I+ L+E+ L N+LV+F SDNG L + S+ PLR K++
Sbjct: 269 VIKEIDFHTGRLIKHLKEKELDQNTLVIFTSDNG----PWLSYGDEGGSSGPLRDGKFTS 324
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
+EGGVR W P L++ V L D LPT
Sbjct: 325 YEGGVRMPTVFWGPGLIKANSVCNQLASTIDLLPTF 360
>gi|149197139|ref|ZP_01874191.1| putative exported uslfatase [Lentisphaera araneosa HTCC2155]
gi|149139685|gb|EDM28086.1| putative exported uslfatase [Lentisphaera araneosa HTCC2155]
Length = 493
Score = 61.2 bits (147), Expect = 7e-08, Method: Composition-based stats.
Identities = 31/100 (31%), Positives = 53/100 (53%), Gaps = 5/100 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ LD VG ++ +E++G+ N+ ++F SDNGG + + N PL+G K
Sbjct: 249 MVEDLDIGVGMILDEVEKQGIKDNTYIIFTSDNGG-----FSYANTSGQNTPLKGGKRWL 303
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R + P ++ G + D+LPT Y + G
Sbjct: 304 YEGGIRVPFVIQGPKIKAGTYCNQPIINWDFLPTFYDLVG 343
>gi|383458406|ref|YP_005372395.1| sulfatase family protein [Corallococcus coralloides DSM 2259]
gi|380732108|gb|AFE08110.1| sulfatase family protein [Corallococcus coralloides DSM 2259]
Length = 552
Score = 61.2 bits (147), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 56/100 (56%), Gaps = 6/100 (6%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
++ LD+ VG ++ LEE G+ N+LVVF +DNG V+ + + N P RG K S
Sbjct: 267 MRELDDIVGVLLDKLEELGIADNTLVVFSTDNG---VEKMGW--PDGGNSPFRGEKGSTW 321
Query: 62 EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR W ++ G+V+ ++ DW+PT+ AG
Sbjct: 322 EGGVRVPCVVRWPGVVEPGRVINDIFAHEDWMPTMVAAAG 361
>gi|167764178|ref|ZP_02436305.1| hypothetical protein BACSTE_02562 [Bacteroides stercoris ATCC
43183]
gi|167698294|gb|EDS14873.1| arylsulfatase [Bacteroides stercoris ATCC 43183]
Length = 502
Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats.
Identities = 34/101 (33%), Positives = 59/101 (58%), Gaps = 7/101 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+L+ +D+S+G ++ ++E+G+ N++++F+SDNGG T+ G N+PL K S
Sbjct: 281 LLEGMDKSLGDLMDYVDEKGIADNTVIIFMSDNGGYTI------GRHDKNYPLSEGKGSL 334
Query: 61 HEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTLYFIAG 100
EGG+R + P + K + + + I D+ PTL IAG
Sbjct: 335 KEGGIREPMIVCYPHVAKPSTINDTPVIIEDFFPTLLEIAG 375
>gi|336410426|ref|ZP_08590905.1| hypothetical protein HMPREF1018_02922 [Bacteroides sp. 2_1_56FAA]
gi|335945158|gb|EGN06973.1| hypothetical protein HMPREF1018_02922 [Bacteroides sp. 2_1_56FAA]
Length = 493
Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 56/95 (58%), Gaps = 2/95 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLDE++GR++ L GL ++VVF SDNGG + + T SN PLR K
Sbjct: 270 MVESLDENIGRLLDILHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
+EGG++ A + WS L+ QV + + TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363
>gi|116623176|ref|YP_825332.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
gi|116226338|gb|ABJ85047.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
Length = 560
Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats.
Identities = 38/103 (36%), Positives = 54/103 (52%), Gaps = 6/103 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K +D+++G V+ LE+ G L N++VVF +DNG T+ S G T P +G K
Sbjct: 293 MKQMDDNIGVVLAKLEQMGQLDNTIVVFTTDNGAETI-SFPDGGIT----PFKGQKGEAW 347
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGG R + W ++ G V + L DWLPT AG K
Sbjct: 348 EGGYRAPCVIRWPGHIKPGTVYKELFAALDWLPTFVEFAGGPK 390
>gi|224536571|ref|ZP_03677110.1| hypothetical protein BACCELL_01446 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521827|gb|EEF90932.1| hypothetical protein BACCELL_01446 [Bacteroides cellulosilyticus
DSM 14838]
Length = 450
Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats.
Identities = 35/112 (31%), Positives = 60/112 (53%), Gaps = 10/112 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDN----GGPTVDSLH-----FHGNTASNW 51
++ DE++G ++ LE L+ N+++VF+SDN GGP + + F
Sbjct: 241 LIACADENIGILLNYLENRKLIDNTIIVFVSDNGPAKGGPELKTWEGYDPSFEYVFGQMK 300
Query: 52 PLRGTKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
LRG K +EGG+R + + PLLR+G+V +++ D+ PT+ + G E
Sbjct: 301 TLRGHKVDLYEGGIRTPMIIAYKPLLREGKVFRDMISTLDFYPTICEMTGSE 352
>gi|254282184|ref|ZP_04957152.1| arylsulfatase A [gamma proteobacterium NOR51-B]
gi|219678387|gb|EED34736.1| arylsulfatase A [gamma proteobacterium NOR51-B]
Length = 535
Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 59/101 (58%), Gaps = 6/101 (5%)
Query: 6 DESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGV 65
D+++ R+ +SL+ G+ N+LV++ISDNG F+ +T +W LRG K +EGGV
Sbjct: 324 DKNMARLFESLDNMGIDENTLVLWISDNGPMN----KFYPSTGFSW-LRGYKSEVYEGGV 378
Query: 66 RNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
R A W + GQ +++H++DW T+ +AG + I
Sbjct: 379 RTPGIAKWPGSIAPGQTPIDIVHVSDWYTTIANLAGAKAAI 419
>gi|149195594|ref|ZP_01872651.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
HTCC2155]
gi|149141056|gb|EDM29452.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
HTCC2155]
Length = 481
Score = 61.2 bits (147), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 64/111 (57%), Gaps = 5/111 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ +D++VG V+Q++++ G+ N++++F SDNGG + + SN+PLR K
Sbjct: 253 MVEHMDDAVGTVMQAVKDAGIEDNTIIIFNSDNGGLRGNYENNRQKVTSNYPLRSGKGDM 312
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA----GKEKNID 106
+EGGVR + WS ++ GQ + + D PTL + K+++ID
Sbjct: 313 YEGGVRVPMIIKWSRKIKAGQTSSSPVISHDIYPTLLDLCKIDVSKKQDID 363
>gi|421593685|ref|ZP_16038213.1| sulfatase [Rhizobium sp. Pop5]
gi|403700318|gb|EJZ17522.1| sulfatase [Rhizobium sp. Pop5]
Length = 497
Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats.
Identities = 40/109 (36%), Positives = 61/109 (55%), Gaps = 13/109 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-------ASNWPL 53
M+ +D+ +G+V+ +LE+ G+ N+L+VF SDNGG V + F G + A N P
Sbjct: 268 MISVMDDEIGKVVAALEKRGMRDNTLIVFHSDNGG--VVNAFFAGESEVKGKLPADNGPY 325
Query: 54 RGTKYSFHEGGVRNVAAL--WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
R K + +EGG R VA L W ++ G + + MH+ D PTL + G
Sbjct: 326 REGKGTTYEGGTR-VAGLVNWPGKIKPGTI-DGPMHVVDMYPTLAGLVG 372
>gi|372209156|ref|ZP_09496958.1| sulfatase [Flavobacteriaceae bacterium S85]
Length = 529
Score = 61.2 bits (147), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 61/102 (59%), Gaps = 7/102 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++S+D++VGRV+Q L++ GL N+LVVF SDNGG + SN PLRG K
Sbjct: 277 LVESMDDAVGRVLQGLKDLGLDENTLVVFTSDNGGVAAGDAF----STSNLPLRGGKGYQ 332
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHIT--DWLPTLYFIAG 100
+EGG++ + P + K QV + + ++ D+ PTL + G
Sbjct: 333 YEGGIKEPYFIKVPWM-KSQVKQTNVPVSGVDFYPTLLDLVG 373
>gi|198428954|ref|XP_002125106.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 562
Score = 60.8 bits (146), Expect = 9e-08, Method: Composition-based stats.
Identities = 32/97 (32%), Positives = 56/97 (57%), Gaps = 9/97 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D + ++++ L+ + N++ +F +DNGG T GN NWPLRG K +
Sbjct: 261 MVYAMDRGIRQLVKHLKRARMWKNTIFIFSTDNGGQTTRG----GN---NWPLRGKKGTL 313
Query: 61 HEGGVRNVAALWSPLLR--KGQVLENLMHITDWLPTL 95
EGG+R V + L+ +V + L+H++DW PT+
Sbjct: 314 WEGGIRGVGFVHGKPLQVTTPRVNKELLHVSDWYPTI 350
>gi|449138581|ref|ZP_21773838.1| arylsulfatase [Rhodopirellula europaea 6C]
gi|448882843|gb|EMB13400.1| arylsulfatase [Rhodopirellula europaea 6C]
Length = 459
Score = 60.8 bits (146), Expect = 9e-08, Method: Composition-based stats.
Identities = 35/100 (35%), Positives = 56/100 (56%), Gaps = 9/100 (9%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
L++ D VG+++ SLEE G ++VVF SDNGG H +N PLRG+K++ +
Sbjct: 243 LETFDHHVGQILDSLEESGQADRTIVVFCSDNGG--------HPEYTANAPLRGSKWNLY 294
Query: 62 EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + W +++ ++ + D LPT+ AG
Sbjct: 295 EGGIRVPMIVRWPDVVQPQTEIKRPVIGYDLLPTMVDWAG 334
>gi|421611066|ref|ZP_16052221.1| arylsulfatase [Rhodopirellula baltica SH28]
gi|408498168|gb|EKK02672.1| arylsulfatase [Rhodopirellula baltica SH28]
Length = 484
Score = 60.8 bits (146), Expect = 9e-08, Method: Composition-based stats.
Identities = 33/100 (33%), Positives = 56/100 (56%), Gaps = 9/100 (9%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
L++ D VG+++ SLE G ++V+F SDNGG H +N PLRG+K++ +
Sbjct: 268 LETFDHHVGQILDSLEASGRADRTIVIFYSDNGG--------HPEYTANAPLRGSKWNLY 319
Query: 62 EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + W +++ ++ + D LPT+ +AG
Sbjct: 320 EGGIRVPMIVRWPGVVQPKTEIDRPVIGYDLLPTMVALAG 359
>gi|323450061|gb|EGB05945.1| hypothetical protein AURANDRAFT_66030 [Aureococcus anophagefferens]
Length = 611
Score = 60.8 bits (146), Expect = 9e-08, Method: Composition-based stats.
Identities = 36/98 (36%), Positives = 53/98 (54%), Gaps = 13/98 (13%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
+D S+ + Q+L+ G+ ++L+ F SDNGG T G SN+PLRG K + EGG
Sbjct: 312 VDRSLETLFQTLDAAGMTDDTLLFFGSDNGGLT-------GFGGSNYPLRGEKLTLFEGG 364
Query: 65 VRNVAALWS----PLLR--KGQVLENLMHITDWLPTLY 96
VR A W+ P R G ++H++D PTL+
Sbjct: 365 VRVPAFFWAGASFPRFRATAGAAYGGIVHVSDVAPTLF 402
>gi|414344125|ref|YP_006985646.1| sulfatase [Gluconobacter oxydans H24]
gi|411029460|gb|AFW02715.1| sulfatase [Gluconobacter oxydans H24]
Length = 478
Score = 60.8 bits (146), Expect = 9e-08, Method: Composition-based stats.
Identities = 34/101 (33%), Positives = 50/101 (49%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+K++D+ +GRV+ +L+ L N++VVF SDNGG + WP G K
Sbjct: 268 MIKAMDDQIGRVLATLDSYHLTENTIVVFTSDNGGERF---------SDTWPFTGMKGEL 318
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R A L W + G + + DWLP+ G
Sbjct: 319 LEGGLRVPAVLRWPARVAAGSESDQTLITMDWLPSFLAATG 359
>gi|319952005|ref|YP_004163272.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
gi|319420665|gb|ADV47774.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
Length = 484
Score = 60.8 bits (146), Expect = 9e-08, Method: Composition-based stats.
Identities = 38/111 (34%), Positives = 59/111 (53%), Gaps = 12/111 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D VG ++ SL++ G N+L++F+SDNGG T G+ A+N+PL G K
Sbjct: 263 MVYAVDRGVGEIVASLKKTGQFDNTLIIFLSDNGGHT-------GHGANNYPLTGRKGDT 315
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA----GKEKNID 106
EGG R W ++KGQ + + D PT+ I G +K +D
Sbjct: 316 WEGGFRVPMFFHWPKKIKKGQKFDYPVSALDLYPTIAHIGQAKIGSDKILD 366
>gi|449132692|ref|ZP_21768698.1| arylsulfatase A [Rhodopirellula europaea 6C]
gi|448888163|gb|EMB18494.1| arylsulfatase A [Rhodopirellula europaea 6C]
Length = 544
Score = 60.8 bits (146), Expect = 9e-08, Method: Composition-based stats.
Identities = 33/66 (50%), Positives = 46/66 (69%), Gaps = 3/66 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+K++D SVGR++Q+L+E G+ N+LVVF SDNGG + +L G T N PLR K
Sbjct: 319 MVKAVDTSVGRIMQALKEHGVDDNTLVVFFSDNGG--LSTLRKFGPTC-NSPLRAGKGWL 375
Query: 61 HEGGVR 66
+EGG+R
Sbjct: 376 YEGGIR 381
>gi|346225028|ref|ZP_08846170.1| sulfatase [Anaerophaga thermohalophila DSM 12881]
gi|346226998|ref|ZP_08848140.1| sulfatase [Anaerophaga thermohalophila DSM 12881]
Length = 488
Score = 60.8 bits (146), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 56/100 (56%), Gaps = 4/100 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D SVG+++++LEEEGL N+LVVF SDNG + HG TA LRG K
Sbjct: 267 VIEEIDWSVGQILKTLEEEGLDENTLVVFTSDNGPWHI--FETHGGTAGL--LRGAKGGT 322
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R W P K V+ ++ D PT ++G
Sbjct: 323 FEGGMREPTIFWWPGKLKQGVVMDMATTMDLFPTFCNLSG 362
>gi|283782011|ref|YP_003372766.1| sulfatase [Pirellula staleyi DSM 6068]
gi|283440464|gb|ADB18906.1| sulfatase [Pirellula staleyi DSM 6068]
Length = 516
Score = 60.8 bits (146), Expect = 9e-08, Method: Composition-based stats.
Identities = 40/111 (36%), Positives = 59/111 (53%), Gaps = 10/111 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNW-----PLRG 55
M+ +D +GR+++ LE+ GL ++VVF SDN GP D L G T +++ LRG
Sbjct: 283 MITRMDREIGRILERLEKYGLQRRTIVVFSSDN-GPLYDKL---GGTDADFFQSALDLRG 338
Query: 56 TKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
K S +EGG+R + P ++ G L DW+PTL +AG I
Sbjct: 339 RKGSVYEGGIRVPTIVKFPGVVPAGTTSSTLGGFEDWMPTLLSLAGMSTKI 389
>gi|395510440|ref|XP_003759483.1| PREDICTED: arylsulfatase B [Sarcophilus harrisii]
Length = 659
Score = 60.8 bits (146), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/83 (46%), Positives = 49/83 (59%), Gaps = 9/83 (10%)
Query: 24 NSLVVF-ISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGVRNVAALWSPLLRK-GQV 81
NS F DNGG T+ GN NWPLRG K++ EGG+R V + SPLL++ G
Sbjct: 417 NSSSAFPFPDNGGQTLAG----GN---NWPLRGRKWTLWEGGIRGVGFVASPLLKQTGVQ 469
Query: 82 LENLMHITDWLPTLYFIAGKEKN 104
L+HI+DWLPTL +AG N
Sbjct: 470 SRELIHISDWLPTLVNLAGGHTN 492
>gi|374620757|ref|ZP_09693291.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374303984|gb|EHQ58168.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 519
Score = 60.8 bits (146), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 54/97 (55%), Gaps = 3/97 (3%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
++ +D SVG +I +LE+ G+ N+LV F SDNG + S HF G+T P +G+K +
Sbjct: 304 IEEIDWSVGEIIGALEKAGIDDNTLVFFSSDNGPWQLASTHFAGSTG---PFKGSKQEIY 360
Query: 62 EGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFI 98
EGGVR W P +V+ ++ + D T+ I
Sbjct: 361 EGGVRVPGIFWWPGTVAPEVISDMGSVLDLYETIATI 397
>gi|189465112|ref|ZP_03013897.1| hypothetical protein BACINT_01456 [Bacteroides intestinalis DSM
17393]
gi|189437386|gb|EDV06371.1| arylsulfatase [Bacteroides intestinalis DSM 17393]
Length = 467
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 35/101 (34%), Positives = 57/101 (56%), Gaps = 5/101 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D +VGR + L+++GL N+LV+F SDNG L + ++ S PLRG K+S+
Sbjct: 270 VVEEIDWNVGRFLDYLDQQGLAENTLVIFASDNG----PWLGYKEDSGSADPLRGGKFSY 325
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGGVR L W + G + ++ D PT+ G
Sbjct: 326 YEGGVRVPCILRWKGTIPAGVTSDAIIASIDLFPTIMHYVG 366
>gi|423339136|ref|ZP_17316877.1| hypothetical protein HMPREF1059_02802 [Parabacteroides distasonis
CL09T03C24]
gi|409231038|gb|EKN23895.1| hypothetical protein HMPREF1059_02802 [Parabacteroides distasonis
CL09T03C24]
Length = 515
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 8/101 (7%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD------SLHFHGNTASNWPLRG 55
++ D SVG+++++L+E GL N+L++ SDN GP VD ++ G+ PLRG
Sbjct: 301 IEQFDWSVGQLMKTLDEMGLTENTLIILSSDN-GPVVDDGYADRAVELLGDHKPARPLRG 359
Query: 56 TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
KYS EGG R A + W +++ V + L+ DW +L
Sbjct: 360 NKYSAFEGGTRIPAIVHWPKEIKQAAVSDALVSQIDWFASL 400
>gi|340616998|ref|YP_004735451.1| hypothetical protein zobellia_1003 [Zobellia galactanivorans]
gi|339731795|emb|CAZ95060.1| Pseudogene of sulfatase [Zobellia galactanivorans]
Length = 339
Score = 60.8 bits (146), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 58/102 (56%), Gaps = 11/102 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTK-YS 59
ML S+DES+GR+ L+ G+ N+LV+F SDNGG N+ SN+PL G K YS
Sbjct: 97 MLVSVDESIGRINTELKRLGIENNTLVIFTSDNGG--------LNNSTSNYPLLGGKSYS 148
Query: 60 FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
F EG +R + W +++ G+ + + D+ PT IAG
Sbjct: 149 F-EGAMRVPLIVKWPAMIKPGRKSKKRVIGMDFYPTFLDIAG 189
>gi|150009207|ref|YP_001303950.1| arylsulfatase [Parabacteroides distasonis ATCC 8503]
gi|298377006|ref|ZP_06986960.1| arylsulfatase A [Bacteroides sp. 3_1_19]
gi|149937631|gb|ABR44328.1| putative arylsulfatase precursor [Parabacteroides distasonis ATCC
8503]
gi|298265990|gb|EFI07649.1| arylsulfatase A [Bacteroides sp. 3_1_19]
Length = 515
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 8/101 (7%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD------SLHFHGNTASNWPLRG 55
++ D SVG+++++L+E GL N+L++ SDN GP VD ++ G+ PLRG
Sbjct: 301 IEQFDWSVGQLMKTLDEMGLTENTLIILSSDN-GPVVDDGYADRAVELLGDHKPAGPLRG 359
Query: 56 TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
KYS EGG R A + W +++ V + L+ DW +L
Sbjct: 360 NKYSAFEGGTRIPAIVHWPKEIKQAAVSDALVSQIDWFASL 400
>gi|262384095|ref|ZP_06077231.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|262294993|gb|EEY82925.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
Length = 515
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 8/101 (7%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD------SLHFHGNTASNWPLRG 55
++ D SVG+++++L+E GL N+L++ SDN GP VD ++ G+ PLRG
Sbjct: 301 IEQFDWSVGQLMKTLDEMGLTENTLIILSSDN-GPVVDDGYADRAVELLGDHKPAGPLRG 359
Query: 56 TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
KYS EGG R A + W +++ V + L+ DW +L
Sbjct: 360 NKYSAFEGGTRIPAIVHWPKEIKQAAVSDALVSQIDWFASL 400
>gi|255015814|ref|ZP_05287940.1| putative arylsulfatase precursor [Bacteroides sp. 2_1_7]
gi|410104728|ref|ZP_11299640.1| hypothetical protein HMPREF0999_03412 [Parabacteroides sp. D25]
gi|423334479|ref|ZP_17312258.1| hypothetical protein HMPREF1075_03781 [Parabacteroides distasonis
CL03T12C09]
gi|409225670|gb|EKN18588.1| hypothetical protein HMPREF1075_03781 [Parabacteroides distasonis
CL03T12C09]
gi|409233740|gb|EKN26574.1| hypothetical protein HMPREF0999_03412 [Parabacteroides sp. D25]
Length = 515
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 8/101 (7%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD------SLHFHGNTASNWPLRG 55
++ D SVG+++++L+E GL N+L++ SDN GP VD ++ G+ PLRG
Sbjct: 301 IEQFDWSVGQLMKTLDEMGLTENTLIILSSDN-GPVVDDGYADRAVELLGDHKPAGPLRG 359
Query: 56 TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
KYS EGG R A + W +++ V + L+ DW +L
Sbjct: 360 NKYSAFEGGTRIPAIVHWPKEIKQAAVSDALVSQIDWFASL 400
>gi|301311001|ref|ZP_07216930.1| arylsulfatase A [Bacteroides sp. 20_3]
gi|300831064|gb|EFK61705.1| arylsulfatase A [Bacteroides sp. 20_3]
Length = 515
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 8/101 (7%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD------SLHFHGNTASNWPLRG 55
++ D SVG+++++L+E GL N+L++ SDN GP VD ++ G+ PLRG
Sbjct: 301 IEQFDWSVGQLMKTLDEMGLTENTLIILSSDN-GPVVDDGYADRAVELLGDHKPAGPLRG 359
Query: 56 TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
KYS EGG R A + W +++ V + L+ DW +L
Sbjct: 360 NKYSAFEGGTRIPAIVHWPKEIKQAAVSDALVSQIDWFASL 400
>gi|256841763|ref|ZP_05547269.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|256736657|gb|EEU49985.1| conserved hypothetical protein [Parabacteroides sp. D13]
Length = 515
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 8/101 (7%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD------SLHFHGNTASNWPLRG 55
++ D SVG+++++L+E GL N+L++ SDN GP VD ++ G+ PLRG
Sbjct: 301 IEQFDWSVGQLMKTLDEMGLTENTLIILSSDN-GPVVDDGYADRAVELLGDHKPAGPLRG 359
Query: 56 TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
KYS EGG R A + W +++ V + L+ DW +L
Sbjct: 360 NKYSAFEGGTRIPAIVHWPKEIKQAAVSDALVSQIDWFASL 400
>gi|149196742|ref|ZP_01873796.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
HTCC2155]
gi|149140422|gb|EDM28821.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
HTCC2155]
Length = 482
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 37/108 (34%), Positives = 58/108 (53%), Gaps = 20/108 (18%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDS-------------LHFHGNTA 48
++ LD SVG+ IQ+LE+EGL+ N+L +F SDNGG S H++G T
Sbjct: 264 IQDLDYSVGQFIQALEQEGLMENTLFIFTSDNGGDIPASNKDWPEYQAYDMGFHYNGKT- 322
Query: 49 SNWPLRGTKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
RG K+ +EGG++ + W ++KGQ ++L+ D+ T
Sbjct: 323 -----RGDKHQIYEGGLKVPFIIHWPKKVKKGQESDHLVTTADFFSTF 365
>gi|418528067|ref|ZP_13094017.1| sulfatase [Comamonas testosteroni ATCC 11996]
gi|371454443|gb|EHN67445.1| sulfatase [Comamonas testosteroni ATCC 11996]
Length = 442
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 35/107 (32%), Positives = 52/107 (48%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +G ++ +LEE+G+ N+L+VF SDNGG ++ WP G K
Sbjct: 236 MIHHMDEGIGWLLDALEEKGMSENTLIVFTSDNGGERF---------SNTWPFVGQKMDL 286
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R + A W + G V + DW T AG + D
Sbjct: 287 LEGGIRVPLLARWPARMTAGAVCDTPSLTMDWSATFLAAAGVSADAD 333
>gi|357412212|ref|YP_004923948.1| sulfatase [Streptomyces flavogriseus ATCC 33331]
gi|320009581|gb|ADW04431.1| sulfatase [Streptomyces flavogriseus ATCC 33331]
Length = 484
Score = 60.8 bits (146), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 53/105 (50%), Gaps = 10/105 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ LD SVG V+++L+ G ++LV F SDNGG + NWPL G K S
Sbjct: 295 MVEDLDRSVGEVLRALKRSGQEEDTLVFFASDNGGERF---------SYNWPLSGNKSSL 345
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R + L W + QV + + DW TL + G +
Sbjct: 346 QEGGIRVPSVLRWPARIEANQVSDLPVFSPDWTATLLELGGARPD 390
>gi|87309613|ref|ZP_01091747.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
gi|87287377|gb|EAQ79277.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
Length = 496
Score = 60.8 bits (146), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 58/102 (56%), Gaps = 5/102 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++++D +VG+V+ +L+ L N+LV+F SDNGG + G+ SN PLRG K
Sbjct: 287 MVEAMDAAVGKVLDALDRLKLTDNTLVIFTSDNGGLSTS----EGHPTSNLPLRGGKGWM 342
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGK 101
+EGG+R + P + G + L+ D+LPT+ + K
Sbjct: 343 YEGGIREPLVVRYPGVTSPGSESDALVTSPDFLPTILAVVDK 384
>gi|421609369|ref|ZP_16050565.1| arylsulfatase A [Rhodopirellula baltica SH28]
gi|408499866|gb|EKK04329.1| arylsulfatase A [Rhodopirellula baltica SH28]
Length = 491
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 54/101 (53%), Gaps = 2/101 (1%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYSF 60
+ ++D +VGR+++ LEE G N+L+VF SDNG T+D + S PLRG K
Sbjct: 253 VANVDAAVGRLVKGLEEIGKRDNTLIVFTSDNGPETLDRYRSANRSYGSPGPLRGMKLHT 312
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
E G R + W + GQ L+ ++ D LPT +AG
Sbjct: 313 TEAGFRVPGIVNWPGEIEPGQTLDTVVSSLDLLPTFCRLAG 353
>gi|384098832|ref|ZP_09999944.1| putative secreted sulfatase ydeN [Imtechella halotolerans K1]
gi|383834975|gb|EID74406.1| putative secreted sulfatase ydeN [Imtechella halotolerans K1]
Length = 504
Score = 60.8 bits (146), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 63/106 (59%), Gaps = 4/106 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTAS-NWPLRGTKYS 59
M++ +D+S+G V+ LE G+ N++++FISDNGG + ++ G S N PL+ K S
Sbjct: 282 MIEGMDKSLGDVMSFLENRGIADNTIILFISDNGG--LSAVGRGGEAHSHNRPLKSGKGS 339
Query: 60 FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
+EGG+R + A W + + V+ + + I D+ P++ +A E N
Sbjct: 340 MYEGGIRVPMLAYWPGVTKPSTVINDYVIIEDFFPSILEMAQIEPN 385
>gi|300771261|ref|ZP_07081137.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
gi|300761931|gb|EFK58751.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
Length = 466
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 33/95 (34%), Positives = 55/95 (57%), Gaps = 8/95 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ ++D+ +GR+++ L++ G L N+L++F++DNG T N A N PLRG K S
Sbjct: 249 LMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAAT-------NNGADNGPLRGLKGSK 301
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
EGG+R + W L G+ + L+ D +PT
Sbjct: 302 WEGGIRVPMIMHWPKQLPAGKDYKGLVSSLDLVPT 336
>gi|149177652|ref|ZP_01856253.1| arylsulfatase [Planctomyces maris DSM 8797]
gi|148843470|gb|EDL57832.1| arylsulfatase [Planctomyces maris DSM 8797]
Length = 494
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 60/121 (49%), Gaps = 19/121 (15%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG----------------GPTVDSLHF- 43
M+ S+D+++GR++ +L+E G L N+LV+F+SDNG GP D +
Sbjct: 269 MIDSMDQNIGRIMTTLKETGQLDNTLVLFLSDNGGCSEEPGGRNPQERTPGPKDDYVAVG 328
Query: 44 -HGNTASNWPLRGTKYSFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGK 101
A N P R K HEGG+ + A W + G + + HI D +PT +AG
Sbjct: 329 PAWGWAQNSPFRRYKSWVHEGGISTPMIAWWPGKVPAGAINRSPAHIIDLMPTFLEMAGA 388
Query: 102 E 102
E
Sbjct: 389 E 389
>gi|32475023|ref|NP_868017.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH 1]
gi|32445563|emb|CAD75564.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH 1]
Length = 501
Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 2/107 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D +GR++ LEE L +LV+F SDNG P + H S+ PL+G+K S
Sbjct: 293 MITRMDRDMGRLMDLLEELKLSEKTLVIFTSDNG-PHHEGGHSDLFFNSSGPLQGSKRSM 351
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
HEGG+R A W + G + ++ D+LPT +AG E D
Sbjct: 352 HEGGIRVPFIAKWPGTIEPGTISDHPSAFWDFLPTACELAGAEPPAD 398
>gi|423221012|ref|ZP_17207506.1| hypothetical protein HMPREF1061_04279 [Bacteroides caccae
CL03T12C61]
gi|392622251|gb|EIY16383.1| hypothetical protein HMPREF1061_04279 [Bacteroides caccae
CL03T12C61]
Length = 520
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 39/104 (37%), Positives = 58/104 (55%), Gaps = 10/104 (9%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLH-------FHGNTASNWPLRGTK 57
D VG ++ +LE+ GL N+L++ SDN GP VD + +G++ + PLRG K
Sbjct: 307 FDYCVGEILNTLEKLGLRENTLIILSSDN-GPVVDDGYDDKAEELLNGHSPAG-PLRGNK 364
Query: 58 YSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
YS EGG R A L W ++KG+ + L+ DWL +L + G
Sbjct: 365 YSAFEGGTRIPAILSWPKEVKKGKTSDLLVSQVDWLASLASLTG 408
>gi|417301112|ref|ZP_12088282.1| arylsulfatase A [Rhodopirellula baltica WH47]
gi|327542541|gb|EGF29015.1| arylsulfatase A [Rhodopirellula baltica WH47]
Length = 484
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 33/100 (33%), Positives = 56/100 (56%), Gaps = 9/100 (9%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
L++ D VG+++ SLE G ++V+F SDNGG H +N PLRG+K++ +
Sbjct: 268 LETFDHHVGQILDSLEASGRADRTIVIFYSDNGG--------HPEYTANAPLRGSKWNLY 319
Query: 62 EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + W +++ ++ + D LPT+ +AG
Sbjct: 320 EGGIRVPMIVRWPGVVQPKTEIDRPVIGYDLLPTMVALAG 359
>gi|153809423|ref|ZP_01962091.1| hypothetical protein BACCAC_03737 [Bacteroides caccae ATCC 43185]
gi|149128009|gb|EDM19231.1| arylsulfatase [Bacteroides caccae ATCC 43185]
Length = 520
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 39/104 (37%), Positives = 58/104 (55%), Gaps = 10/104 (9%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLH-------FHGNTASNWPLRGTK 57
D VG ++ +LE+ GL N+L++ SDN GP VD + +G++ + PLRG K
Sbjct: 307 FDYCVGEILNTLEKLGLRENTLIILSSDN-GPVVDDGYDDKAEELLNGHSPAG-PLRGNK 364
Query: 58 YSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
YS EGG R A L W ++KG+ + L+ DWL +L + G
Sbjct: 365 YSAFEGGTRIPAILSWPKEVKKGKTSDLLVSQVDWLASLASLTG 408
>gi|260819473|ref|XP_002605061.1| hypothetical protein BRAFLDRAFT_85207 [Branchiostoma floridae]
gi|229290391|gb|EEN61071.1| hypothetical protein BRAFLDRAFT_85207 [Branchiostoma floridae]
Length = 289
Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 56/101 (55%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D +VGR++ +++ GL N+L++F +DNGG + +N P +G K++
Sbjct: 1 MVNAMDLNVGRIVDAMKGAGLWDNTLLIFSTDNGG-------WPDQGGNNDPYKGGKFTL 53
Query: 61 HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
EGG + VA + +L G ++H DW PT+ AG
Sbjct: 54 WEGGTKGVAFVHGNMLVETGYTNNEMIHAVDWFPTIVAAAG 94
>gi|149197772|ref|ZP_01874821.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149138993|gb|EDM27397.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 441
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 32/74 (43%), Positives = 45/74 (60%), Gaps = 6/74 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++K+LD+ G ++Q+L++ N+L+ F+SDNGGPT N +SNWPLRG K S
Sbjct: 241 LMKNLDDQTGLLLQALKDSKQYENTLIFFMSDNGGPTTH------NGSSNWPLRGFKGSE 294
Query: 61 HEGGVRNVAALWSP 74
EGG R L P
Sbjct: 295 FEGGNRTPFLLQWP 308
>gi|325109705|ref|YP_004270773.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324969973|gb|ADY60751.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 443
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 38/101 (37%), Positives = 56/101 (55%), Gaps = 9/101 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD S G V+ +LE G N+LV F+SDNG HG TA LRG K+
Sbjct: 245 VVECLDWSTGEVLDALERTGQAENTLVAFLSDNGPKK-----GHGTTAG---LRGFKHDP 296
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R + + L+ GQ + + ++ DW PTL +AG
Sbjct: 297 YEGGMRVPCIIAAQGLIPAGQTSDEICNVMDWYPTLVNLAG 337
>gi|440716552|ref|ZP_20897057.1| arylsulfatase [Rhodopirellula baltica SWK14]
gi|436438411|gb|ELP31961.1| arylsulfatase [Rhodopirellula baltica SWK14]
Length = 459
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 33/100 (33%), Positives = 56/100 (56%), Gaps = 9/100 (9%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
L++ D VG+++ SLE G ++V+F SDNGG H +N PLRG+K++ +
Sbjct: 243 LETFDHHVGQILDSLEASGRADRTIVIFYSDNGG--------HPEYTANAPLRGSKWNLY 294
Query: 62 EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + W +++ ++ + D LPT+ +AG
Sbjct: 295 EGGIRVPMIVRWPGVVQPKTEIDRPVIGYDLLPTMVALAG 334
>gi|406836223|ref|ZP_11095817.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 456
Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 54/103 (52%), Gaps = 5/103 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D VGR++ L+E L N++V F SDNGGP F N A LRG K
Sbjct: 251 MVTRMDADVGRIVALLDELKLRDNTIVFFCSDNGGPAPFGETFQTNGA----LRGKKGQV 306
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
+EGG+R + A W + G++ + + D LPTL + G E
Sbjct: 307 YEGGLRVPMIARWPGHVPAGKISDVPWYFADVLPTLAELGGGE 349
>gi|338532447|ref|YP_004665781.1| sulfatase family protein [Myxococcus fulvus HW-1]
gi|337258543|gb|AEI64703.1| sulfatase family protein [Myxococcus fulvus HW-1]
Length = 554
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 8/101 (7%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
++ LD+ VG ++ L+E G+ N+LVVF +DNG V+ + + N P RG K S
Sbjct: 269 MRELDDIVGALLAKLDELGIADNTLVVFSTDNG---VEKMGW--PDGGNCPFRGEKGSTW 323
Query: 62 EGGVRNVAAL--WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR V L W ++ G+V+ ++ DW+PTL AG
Sbjct: 324 EGGVR-VPCLVRWPGVVEAGRVINDIFAHEDWMPTLVSAAG 363
>gi|427385168|ref|ZP_18881673.1| hypothetical protein HMPREF9447_02706 [Bacteroides oleiciplenus YIT
12058]
gi|425727336|gb|EKU90196.1| hypothetical protein HMPREF9447_02706 [Bacteroides oleiciplenus YIT
12058]
Length = 455
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 35/101 (34%), Positives = 58/101 (57%), Gaps = 5/101 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D +VGR + L+++GL ++LV+F SDNG L + ++ S PLRG K+S+
Sbjct: 258 VVEEIDWNVGRFLDYLDQQGLAESTLVIFASDNG----PWLGYKEDSGSADPLRGGKFSY 313
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGGVR L W + G + ++ D PT+ AG
Sbjct: 314 YEGGVRVPCILRWKGTIPAGVTSDAIIASIDLFPTIMHYAG 354
>gi|284040883|ref|YP_003390813.1| sulfatase [Spirosoma linguale DSM 74]
gi|283820176|gb|ADB42014.1| sulfatase [Spirosoma linguale DSM 74]
Length = 453
Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 54/96 (56%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D+ +G+VIQ+L +GL N+L+VF+SDNGG D A+N PLR K
Sbjct: 249 LIEHMDDGIGKVIQTLRAKGLYENTLIVFVSDNGGKLFDG-------ATNGPLRSGKGHM 301
Query: 61 HEGGVRNVAA-LWSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R A +W + + + + D PTL
Sbjct: 302 YEGGIRIPACVVWPGKVAAQSQSQQPLLLMDIFPTL 337
>gi|196228589|ref|ZP_03127455.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196226870|gb|EDY21374.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 631
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 38/103 (36%), Positives = 57/103 (55%), Gaps = 6/103 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTAS--NWPLRGTKY 58
M++SLD+ VGR+++ ++E GL ++ +F SDNGG V L NT S N P R K
Sbjct: 236 MIESLDDCVGRIMKKVDELGLTEKTIFIFTSDNGGLHVYEL---PNTPSTYNAPFRAGKG 292
Query: 59 SFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + W ++ G E + + D++PTL AG
Sbjct: 293 YLEEGGLREPLIVRWPGKIKAGATNETPVVLYDFMPTLMTAAG 335
>gi|372210513|ref|ZP_09498315.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 465
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 55/101 (54%), Gaps = 7/101 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +LD + G+V+ L+E G+ N++VVF +DNGGP+ N ++N PL GTK +
Sbjct: 245 MTLALDRACGKVLNKLKELGIDRNTIVVFSNDNGGPS------DKNASNNKPLSGTKSNH 298
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + W ++K + D LPT Y AG
Sbjct: 299 LEGGIRVPFLMSWPKEIKKKSRFNYPVVTFDLLPTFYAAAG 339
>gi|195403369|ref|XP_002060263.1| GJ19825 [Drosophila virilis]
gi|194140907|gb|EDW57358.1| GJ19825 [Drosophila virilis]
Length = 324
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 29/60 (48%), Positives = 39/60 (65%), Gaps = 3/60 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+SVG+ I++L + G+L NS+V+ SDNG PT L H N+ SN P RG F
Sbjct: 255 MVSSLDKSVGQTIRALSDRGMLNNSIVLLYSDNGAPT---LGIHSNSGSNHPFRGVSTYF 311
>gi|167646636|ref|YP_001684299.1| sulfatase [Caulobacter sp. K31]
gi|167349066|gb|ABZ71801.1| sulfatase [Caulobacter sp. K31]
Length = 559
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 31/67 (46%), Positives = 46/67 (68%), Gaps = 5/67 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+++LD +VGR++Q+L+EEGL N+LV+F SDNGG ++ G N P RG K +F
Sbjct: 337 MVRNLDRNVGRLLQALKEEGLDQNTLVIFTSDNGGA-----NYIGLPDINRPYRGWKATF 391
Query: 61 HEGGVRN 67
EGG+ +
Sbjct: 392 FEGGIHS 398
>gi|372210445|ref|ZP_09498247.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 474
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 39/111 (35%), Positives = 57/111 (51%), Gaps = 13/111 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D +G+VIQ L+E G+ N+L+VF SDNGG H HG AS+ P RG K
Sbjct: 249 MVAGMDAGIGKVIQKLKETGVYNNTLIVFYSDNGG------HSHG--ASSAPYRGHKGML 300
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE----KNID 106
EGG+R + W ++ + + + D PT+ + KNID
Sbjct: 301 FEGGIREPFVISWPNKIKGNRTYKEPIIALDIFPTILAATNTQVPASKNID 351
>gi|295690083|ref|YP_003593776.1| sulfatase [Caulobacter segnis ATCC 21756]
gi|295431986|gb|ADG11158.1| sulfatase [Caulobacter segnis ATCC 21756]
Length = 551
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 30/66 (45%), Positives = 45/66 (68%), Gaps = 5/66 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+++LD +VGRV+Q+L++ GL N+LV+F SDNGG ++ G N P RG K +F
Sbjct: 329 MVRNLDRNVGRVLQALKDRGLDDNTLVIFTSDNGGA-----NYVGLPDLNRPYRGWKATF 383
Query: 61 HEGGVR 66
EGG++
Sbjct: 384 FEGGIK 389
>gi|374373039|ref|ZP_09630700.1| N-acetylgalactosamine-6-sulfatase [Niabella soli DSM 19437]
gi|373235115|gb|EHP54907.1| N-acetylgalactosamine-6-sulfatase [Niabella soli DSM 19437]
Length = 469
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 36/107 (33%), Positives = 58/107 (54%), Gaps = 9/107 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ +D VG ++++L++EG+ N++V F SDNGG +N LR K
Sbjct: 261 MIERIDNGVGLIMETLKKEGIDKNTIVFFFSDNGGVW--------KLGNNGILRAGKSWL 312
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
+EGGVR N+ A W ++R V+ + TD+ PT +A +K D
Sbjct: 313 YEGGVRENLLARWPGVIRPNTVVNTRVMATDFYPTFLELAKTKKKKD 359
>gi|334145289|ref|YP_004538499.1| sulfatase [Novosphingobium sp. PP1Y]
gi|333937173|emb|CCA90532.1| sulfatase [Novosphingobium sp. PP1Y]
Length = 461
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 31/96 (32%), Positives = 49/96 (51%), Gaps = 10/96 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D +GR+++ L+ GL ++++VF SDNGG + WP G K
Sbjct: 246 MVTRMDMEIGRILERLDRLGLAEDTIIVFTSDNGGE---------RYSKTWPYNGLKTEL 296
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
EGG+R A + W + GQ E ++ DW+PT
Sbjct: 297 LEGGLRVPAVMRWPRRIAAGQESEQVIMSMDWMPTF 332
>gi|332663783|ref|YP_004446571.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
gi|332332597|gb|AEE49698.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
Length = 550
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 39/124 (31%), Positives = 55/124 (44%), Gaps = 24/124 (19%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA------------ 48
ML D+ VG+V+ L+E N+L++FISDNG P D H N A
Sbjct: 303 MLDRADQGVGKVLAKLKELKKDDNTLIIFISDNGAPAEDMAHRDKNHAGRNTGPVGTAGS 362
Query: 49 -----------SNWPLRGTKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLY 96
SN P R K +EGG+ + W P ++ G + + H+ D PT Y
Sbjct: 363 FESQGKPWSYVSNTPFRSFKSFAYEGGISSPFIAWFPGKIKAGSIAKGTAHLIDLAPTFY 422
Query: 97 FIAG 100
+AG
Sbjct: 423 ELAG 426
>gi|323454643|gb|EGB10513.1| hypothetical protein AURANDRAFT_62515 [Aureococcus anophagefferens]
Length = 981
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 39/103 (37%), Positives = 56/103 (54%), Gaps = 10/103 (9%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
LK++D VG V ++L++ ++V SDNGG T + H +GN N+P RG K+
Sbjct: 323 LKAVDRGVGAVRRALDDRAR--EYVLVVHSDNGGNTCGT-HCNGN---NYPRRGAKFFEF 376
Query: 62 EGGVRNVAALWSPLL----RKGQVLENLMHITDWLPTLYFIAG 100
EGG+R ++SP L R+G LMH DWL T + G
Sbjct: 377 EGGLRVPGVVYSPTLIPARRRGAAYAGLMHHVDWLATFVALGG 419
>gi|150008689|ref|YP_001303432.1| arylsulfatase [Parabacteroides distasonis ATCC 8503]
gi|149937113|gb|ABR43810.1| arylsulfatase A [Parabacteroides distasonis ATCC 8503]
Length = 452
Score = 60.1 bits (144), Expect = 1e-07, Method: Composition-based stats.
Identities = 37/99 (37%), Positives = 54/99 (54%), Gaps = 5/99 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
++ LD SVG+++Q+L+ EGL N+LV+F SDNG L S PL+ K S
Sbjct: 246 VEELDWSVGQILQTLKSEGLDKNTLVIFTSDNG----PWLLCKQEGGSPGPLKDGKASMF 301
Query: 62 EGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG R +W +++ G + ++ D LPT IAG
Sbjct: 302 EGGFRVPCIMWGAMVKPGYI-TDMASTLDLLPTFCEIAG 339
>gi|325286704|ref|YP_004262494.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
gi|324322158|gb|ADY29623.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
Length = 484
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 39/101 (38%), Positives = 53/101 (52%), Gaps = 7/101 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +LD + G V+ L+E GL N++VVF +DNGGPT N + N PL GTK +
Sbjct: 248 MTLALDRACGTVLNKLKELGLDKNTIVVFSNDNGGPT------DKNASLNLPLSGTKSNH 301
Query: 61 HEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTLYFIAG 100
EGG+R + P K + + N + D LPT Y G
Sbjct: 302 LEGGIRVPFLISWPKQIKSKTVYNFPVSTLDLLPTFYAAGG 342
>gi|256841279|ref|ZP_05546786.1| arylsulfatase A [Parabacteroides sp. D13]
gi|262383570|ref|ZP_06076706.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
gi|423330544|ref|ZP_17308328.1| hypothetical protein HMPREF1075_00341 [Parabacteroides distasonis
CL03T12C09]
gi|256737122|gb|EEU50449.1| arylsulfatase A [Parabacteroides sp. D13]
gi|262294468|gb|EEY82400.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
gi|409232160|gb|EKN25008.1| hypothetical protein HMPREF1075_00341 [Parabacteroides distasonis
CL03T12C09]
Length = 452
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 37/99 (37%), Positives = 54/99 (54%), Gaps = 5/99 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
++ LD SVG+++Q+L+ EGL N+LV+F SDNG L S PL+ K S
Sbjct: 246 VEELDWSVGQILQTLKSEGLDKNTLVIFTSDNG----PWLLCKQEGGSPGPLKDGKASMF 301
Query: 62 EGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG R +W +++ G + ++ D LPT IAG
Sbjct: 302 EGGFRVPCIMWGAMVKPGYI-TDMASTLDLLPTFCEIAG 339
>gi|255533274|ref|YP_003093646.1| sulfatase [Pedobacter heparinus DSM 2366]
gi|255346258|gb|ACU05584.1| sulfatase [Pedobacter heparinus DSM 2366]
Length = 513
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 34/103 (33%), Positives = 64/103 (62%), Gaps = 6/103 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTAS--NWPLRGTKY 58
+++ +D+S+G ++ L+++G+ N++++F+SDNGG +D H G + N+PLR K
Sbjct: 284 LVEGMDKSLGDIMDYLKKKGVDKNTIIIFMSDNGG--LDH-HQRGGALNTHNYPLRSGKG 340
Query: 59 SFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
S +EGG+R + W + G V +N + I D+ P++ +AG
Sbjct: 341 SVYEGGIREPMIVRWPGVTSAGSVYKNPVIIEDFFPSILEMAG 383
>gi|87309459|ref|ZP_01091594.1| arylsulphatase A [Blastopirellula marina DSM 3645]
gi|87287767|gb|EAQ79666.1| arylsulphatase A [Blastopirellula marina DSM 3645]
Length = 457
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 30/66 (45%), Positives = 44/66 (66%), Gaps = 9/66 (13%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++++D SVGR++ +L+E L N+L VF+SDNGG + S+ PLRG+K S
Sbjct: 259 MMETVDNSVGRLMATLDELNLADNTLFVFMSDNGGERITSMA---------PLRGSKGSL 309
Query: 61 HEGGVR 66
+EGGVR
Sbjct: 310 YEGGVR 315
>gi|90408762|ref|ZP_01216908.1| arylsulfatase [Psychromonas sp. CNPT3]
gi|90310142|gb|EAS38281.1| arylsulfatase [Psychromonas sp. CNPT3]
Length = 507
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 56/96 (58%), Gaps = 6/96 (6%)
Query: 6 DESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGV 65
DE VGR+++++++ G+ N++V++ +DN GP + + G T P RG K + EG
Sbjct: 272 DEQVGRILKTIKDLGIEDNTIVIYTTDN-GPMIATWPDAGMT----PFRGEKNTGWEGSF 326
Query: 66 RNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
R A + W + KG+ L +M + DWLPTL AG
Sbjct: 327 RVPAMVKWPNHIPKGKNLNGIMSLEDWLPTLLAAAG 362
>gi|294053770|ref|YP_003547428.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613103|gb|ADE53258.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 491
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 35/101 (34%), Positives = 53/101 (52%), Gaps = 2/101 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D +G V+ +LE+ G L N+L+ F+SDNGG + + + A N P + K SF
Sbjct: 258 MIDEMDAGIGLVVDALEQSGKLDNTLIFFLSDNGGCVPQAWDSYSDFADNGPFKDGKTSF 317
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG A W P K Q +N++ D T +AG
Sbjct: 318 YEGGTHVPFIAHW-PAKLKPQTYDNIISSLDLAATAVALAG 357
>gi|329956451|ref|ZP_08297048.1| putative cerebroside-sulfatase [Bacteroides clarus YIT 12056]
gi|328524348|gb|EGF51418.1| putative cerebroside-sulfatase [Bacteroides clarus YIT 12056]
Length = 457
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 36/100 (36%), Positives = 55/100 (55%), Gaps = 5/100 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
++ +D +VGR + L+ +GL N+LV+F SDNG L F S PLRG K+S++
Sbjct: 268 VEEIDWNVGRFLDYLDTKGLAENTLVIFASDNG----PWLSFKEEGGSADPLRGGKFSYY 323
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + W ++ G + ++ D PT+ AG
Sbjct: 324 EGGVRVPCIIRWKGVVPAGVTSDAIIASIDLFPTIMHYAG 363
>gi|442317872|ref|YP_007357893.1| sulfatase family protein [Myxococcus stipitatus DSM 14675]
gi|441485514|gb|AGC42209.1| sulfatase family protein [Myxococcus stipitatus DSM 14675]
Length = 559
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 37/100 (37%), Positives = 55/100 (55%), Gaps = 6/100 (6%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+ LD+ VG ++ L+E G+ N++VVF +DNG V+ + + N P RG K S
Sbjct: 275 MTELDDIVGTLLAKLDELGIADNTIVVFSTDNG---VEKMSW--PDGGNAPFRGEKGSTW 329
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + W ++ G V+ +L DW+PTL AG
Sbjct: 330 EGGVRVPCVVRWPGVIEPGTVINDLFAHEDWMPTLVAAAG 369
>gi|332662429|ref|YP_004445217.1| Steryl-sulfatase [Haliscomenobacter hydrossis DSM 1100]
gi|332331243|gb|AEE48344.1| Steryl-sulfatase [Haliscomenobacter hydrossis DSM 1100]
Length = 506
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 34/96 (35%), Positives = 54/96 (56%), Gaps = 5/96 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML+++D +VGR++ L GL N++V+F SDNGG + G+ +N PLR K
Sbjct: 285 MLENMDYNVGRILDQLAASGLTENTIVLFTSDNGGLSTA----EGSPTTNDPLRAGKGWL 340
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R V W + GQ+ + ++ D+ PT
Sbjct: 341 YEGGIRVPVIMRWPAKIVAGQISDVPVNTIDFYPTF 376
>gi|254430517|ref|ZP_05044220.1| sulfatase [Cyanobium sp. PCC 7001]
gi|197624970|gb|EDY37529.1| sulfatase [Cyanobium sp. PCC 7001]
Length = 601
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 6/103 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K +D+++G V++ L++ G L N+L+VF +DNG + S G T P +G K +
Sbjct: 337 MKQMDDNIGYVLKKLDDMGQLDNTLIVFTTDNGAEKI-SFPDGGVT----PFKGQKGEAY 391
Query: 62 EGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGG R + W + G V + L DW+PTL I G K
Sbjct: 392 EGGYRAPMVVRWPGHIEPGTVKDQLFAALDWVPTLVDIGGGAK 434
>gi|424864690|ref|ZP_18288593.1| steryl-sulfatase [SAR86 cluster bacterium SAR86B]
gi|400759436|gb|EJP73618.1| steryl-sulfatase [SAR86 cluster bacterium SAR86B]
Length = 549
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 54/103 (52%), Gaps = 6/103 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD SVG++++ L++ + G +L++F SDNGG L N P RG K SF
Sbjct: 329 MINALDRSVGKIVEKLKDLNIYGKTLIIFTSDNGGANYIEL-----DDINKPYRGWKISF 383
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R W + G+ + +H D PT+ AG E
Sbjct: 384 FEGGIRVPYIVSWPSEISPGKKSDIAVHHFDIFPTIAKAAGIE 426
>gi|149197500|ref|ZP_01874551.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139518|gb|EDM27920.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 472
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 35/101 (34%), Positives = 56/101 (55%), Gaps = 2/101 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA-SNWPLRGTKYS 59
M K LD++VGR+++ ++E G+ N++++F SDNGG S +G+ SN+P R K S
Sbjct: 249 MTKHLDDTVGRLVKKVDELGIAENTIIIFTSDNGGLGYKSDGGYGDKGTSNYPYRSAKSS 308
Query: 60 FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIA 99
+EGG R + W + + ++ D PTL IA
Sbjct: 309 HYEGGSRVPLIFHWPGVTEANSLSHEVVSGIDIYPTLLKIA 349
>gi|430746124|ref|YP_007205253.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430017844|gb|AGA29558.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 478
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 37/107 (34%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++++D SVGR++ +LEE+GL +LVVF SDNG L + + S P R K +
Sbjct: 249 VIEAIDGSVGRILATLEEQGLDQKTLVVFTSDNG----PWLSYGNHAGSAGPFREGKGTS 304
Query: 61 HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R A W + G + D LPT +AG +++ D
Sbjct: 305 FEGGIREPFVARWPGKIPAGSICREPAMTIDLLPTFAELAGADRSGD 351
>gi|319954036|ref|YP_004165303.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
gi|319422696|gb|ADV49805.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
Length = 467
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 52/96 (54%), Gaps = 7/96 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M ++D ++G V++ LE +GL N+L+VF +DNGGP+ N + N PL GTK +
Sbjct: 252 MTLAMDRAIGTVLKELETQGLSNNTLIVFTNDNGGPS------DANDSDNSPLSGTKANH 305
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
EGG+R + W L+K + + D PT
Sbjct: 306 LEGGIRVPFLMSWPKQLQKNKNYNYPISTLDLFPTF 341
>gi|365860802|ref|ZP_09400595.1| putative sulfatase [Streptomyces sp. W007]
gi|364009756|gb|EHM30703.1| putative sulfatase [Streptomyces sp. W007]
Length = 435
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 53/101 (52%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ LD SVG V+++L+ G ++LV F SDNGG + NWPL G K S
Sbjct: 246 MVEDLDRSVGEVLKALKRSGQEKDTLVFFASDNGGERF---------SYNWPLAGNKGSL 296
Query: 61 HEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTLYFIAG 100
EGG+R + L P +G + +L + DW TL + G
Sbjct: 297 QEGGIRVPSVLRWPARVEGHQVSDLPVFSPDWTATLLELGG 337
>gi|340619116|ref|YP_004737569.1| sulfatase [Zobellia galactanivorans]
gi|339733913|emb|CAZ97290.1| Sulfatase, family S1-17 [Zobellia galactanivorans]
Length = 596
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 64/113 (56%), Gaps = 9/113 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLH--FHGNTASNWPLRGTKY 58
M+ ++D +VGRV++ L++ G+ N+LV+F+SDNG + + + G SN LRGTK
Sbjct: 254 MIDNIDANVGRVLKRLKDLGIADNTLVIFLSDNGPRHRRTKNDVYPGRWVSN--LRGTKT 311
Query: 59 SFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFI----AGKEKNID 106
S +E G+R W L+KGQ + + D LPT+ + A KE +D
Sbjct: 312 SVYECGIRVPFFVKWPTQLKKGQKSSTMGAVIDILPTILDVCDIKAPKEVKLD 364
>gi|410907337|ref|XP_003967148.1| PREDICTED: arylsulfatase A-like [Takifugu rubripes]
Length = 501
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 37/102 (36%), Positives = 59/102 (57%), Gaps = 6/102 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
L LD ++G ++++LE G+L N+L++F SDN GP + + GN PLR K + +
Sbjct: 253 LLELDNTIGSLLETLEGTGVLNNTLILFTSDN-GPELMRMSRGGNAG---PLRCGKGTTY 308
Query: 62 EGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R A A W +++ G V + D LPT+ +AG +
Sbjct: 309 EGGMREPAIAYWRGIIQPG-VTHEMASTLDILPTIASLAGAK 349
>gi|348029892|ref|YP_004872578.1| sulfatase [Glaciecola nitratireducens FR1064]
gi|347947235|gb|AEP30585.1| sulfatase [Glaciecola nitratireducens FR1064]
Length = 617
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 56/97 (57%), Gaps = 4/97 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSL-HFHGNTASNWPLRGTKYS 59
M+K D++VG + ++L E + N++++F SDNGG D L H SN+PLRG K +
Sbjct: 155 MVKQFDDAVGILWEALVEAQVEDNTIIIFTSDNGGNMYDVLGTIHAT--SNFPLRGGKAT 212
Query: 60 FHEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTL 95
++GG+R A+ P L + L N + D+ PTL
Sbjct: 213 NYDGGLRVPTAIIWPGLTQPNTLSNAPIQTVDYFPTL 249
>gi|386820648|ref|ZP_10107864.1| arylsulfatase A family protein [Joostella marina DSM 19592]
gi|386425754|gb|EIJ39584.1| arylsulfatase A family protein [Joostella marina DSM 19592]
Length = 512
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 60/108 (55%), Gaps = 10/108 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDS----LHFHGNTASNWPLRGT 56
M++ +D+S+G ++ L+E L N++++F+SDNGG + LH H N PL
Sbjct: 288 MVEGMDKSLGDIMNYLDENDLAENTIILFMSDNGGLSAHKRGGELHTH-----NKPLASG 342
Query: 57 KYSFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
K S +EGG+R + W +++ V + I D+ PT+ +AG EK
Sbjct: 343 KGSMYEGGIREPMLVKWPRVIKPESVTNEYLIIEDFFPTILEMAGIEK 390
>gi|374369744|ref|ZP_09627765.1| sulfatase [Cupriavidus basilensis OR16]
gi|373098668|gb|EHP39768.1| sulfatase [Cupriavidus basilensis OR16]
Length = 433
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 36/105 (34%), Positives = 51/105 (48%), Gaps = 10/105 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +G ++Q+L + G L N+L+VF SDNGG + NWPL G K
Sbjct: 229 MISHMDEGIGWLVQALRQLGQLDNTLIVFTSDNGGERF---------SDNWPLVGGKMDL 279
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R A W ++ G DW TL AG + +
Sbjct: 280 TEGGIRVPWIAHWPAMIAPGGTSAQHCMTMDWSATLLDAAGVQAD 324
>gi|311745970|ref|ZP_07719755.1| arylsulfatase [Algoriphagus sp. PR1]
gi|311302452|gb|EAZ80458.2| arylsulfatase [Algoriphagus sp. PR1]
Length = 476
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 40/114 (35%), Positives = 58/114 (50%), Gaps = 14/114 (12%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-----ASNWPLRG 55
M++ +D VG V+Q LE GL N++++F SDNG H G SN RG
Sbjct: 264 MVERIDRYVGEVLQKLESNGLSENTVIIFTSDNGA------HREGGADPEFFDSNGKFRG 317
Query: 56 TKYSFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE--KNID 106
K +EGGVR + A W ++ G +++ D LPT I G++ K+ID
Sbjct: 318 FKRDLYEGGVRAPMIAWWPNKIKAGSTTDHVSAFWDLLPTFAAIGGEKITKSID 371
>gi|443734458|gb|ELU18440.1| hypothetical protein CAPTEDRAFT_208445 [Capitella teleta]
Length = 240
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 46/76 (60%), Gaps = 8/76 (10%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD VG++++ L E G++ N+ V+F+SDNG + L N P RG K S EGG
Sbjct: 168 LDAMVGKIVEKLIENGMMNNTYVIFVSDNGADPPEGL--------NTPFRGGKSSLFEGG 219
Query: 65 VRNVAALWSPLLRKGQ 80
++ + ++SPLL+K +
Sbjct: 220 TKSNSFIYSPLLKKTE 235
>gi|87301451|ref|ZP_01084291.1| putative arylsulfatase protein [Synechococcus sp. WH 5701]
gi|87283668|gb|EAQ75622.1| putative arylsulfatase protein [Synechococcus sp. WH 5701]
Length = 374
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 6/103 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K +D+++G V++ L++ G L N+L+VF +DNG + S G T P +G K +
Sbjct: 110 MKQMDDNIGYVLKKLDDMGQLDNTLIVFTTDNGAEKI-SFPDGGVT----PFKGQKGEAY 164
Query: 62 EGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGG R + W + G V + L DW+PTL I G K
Sbjct: 165 EGGYRAPMVVRWPGHIEPGTVKDQLFAALDWVPTLVDIGGGAK 207
>gi|149173445|ref|ZP_01852075.1| putative arylsulfatase [Planctomyces maris DSM 8797]
gi|148847627|gb|EDL61960.1| putative arylsulfatase [Planctomyces maris DSM 8797]
Length = 459
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 53/101 (52%), Gaps = 1/101 (0%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD+ VG ++ LE+ G+ N+L++F SDNGG N PLRG K S
Sbjct: 245 MVSRLDDHVGEIVTLLEDLGIRDNTLIIFTSDNGGQGGTWKEMTDFFNGNAPLRGHKGSM 304
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R A W + G+ + + D LPTL +AG
Sbjct: 305 YEGGIRVPFIANWPGKIAAGKTSDLQIAFWDVLPTLAQVAG 345
>gi|449136003|ref|ZP_21771428.1| arylsulfatase A [Rhodopirellula europaea 6C]
gi|448885345|gb|EMB15791.1| arylsulfatase A [Rhodopirellula europaea 6C]
Length = 480
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 40/111 (36%), Positives = 59/111 (53%), Gaps = 11/111 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHF--------HGNTASNWP 52
+++ +D +VGR++ +L E L N+ V+F SDNG V + HG +A P
Sbjct: 217 VIEEIDFNVGRILDTLSELKLADNTYVLFTSDNGPWLVKNKGHVDGHRPRDHGGSAG--P 274
Query: 53 LRGTKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKE 102
LR K S EGGVR A LW P + G V +++ D +PTL +AG +
Sbjct: 275 LRSGKVSSFEGGVRVPAILWGPGKVPAGTVCDSIATTMDVMPTLAALAGAD 325
>gi|255035424|ref|YP_003086045.1| sulfatase [Dyadobacter fermentans DSM 18053]
gi|254948180|gb|ACT92880.1| sulfatase [Dyadobacter fermentans DSM 18053]
Length = 457
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 6/103 (5%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA----SNWPLRGTKYSF 60
LDE+VG++ + L E GL N++V F SDN GP + A SN LRG K
Sbjct: 259 LDENVGKIKKYLTESGLAKNTIVFFCSDN-GPRSEGTPQQTAIAEFFDSNGRLRGYKRDM 317
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKE 102
+EGG+R +W+P +++ G V + D +PT IAG +
Sbjct: 318 YEGGIRVPMVVWAPGIVKPGSVSSEPAYFADIMPTFADIAGSK 360
>gi|325110465|ref|YP_004271533.1| sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324970733|gb|ADY61511.1| sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 535
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 38/110 (34%), Positives = 56/110 (50%), Gaps = 19/110 (17%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNS--------LVVFISDNGGPTVDSLHFHGNTASNWPL 53
+ S+DE++GR+ L + G+S L++F SDNGG ASN PL
Sbjct: 303 ISSVDENLGRLRSLLNDPNQDGDSSDSISEQTLILFTSDNGG----------THASNAPL 352
Query: 54 RGTKYSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
RG K F+EGG+R + A W ++ V L+H D+ PT +AG +
Sbjct: 353 RGEKGEFYEGGIRVPLIACWPGTIQADSVNNRLVHCVDYYPTFLELAGNQ 402
>gi|449136701|ref|ZP_21772070.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
gi|448884660|gb|EMB15143.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
Length = 890
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 59/97 (60%), Gaps = 4/97 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA-SNWPLRGTKYS 59
M++S+D+++G ++ +L+ + +++VF SDNGG + + G TA SN PLRG K +
Sbjct: 505 MIESMDDAIGTLLDTLDRLEIADETIIVFASDNGGNMYNEV--DGTTATSNAPLRGGKAT 562
Query: 60 FHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
+EGGVR A + P ++ G + ++ D+ PTL
Sbjct: 563 MYEGGVRGPAIVVQPGVVEAGSRSDAIIQSIDFYPTL 599
>gi|149195948|ref|ZP_01873004.1| arylsulfatase [Lentisphaera araneosa HTCC2155]
gi|149140795|gb|EDM29192.1| arylsulfatase [Lentisphaera araneosa HTCC2155]
Length = 510
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 36/119 (30%), Positives = 62/119 (52%), Gaps = 19/119 (15%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG----PTVDS--LHFHG--------- 45
M+ S+D+++GR++++L++ G+ N+LV+FISDNGG P D+ H G
Sbjct: 292 MVDSMDQNIGRILKTLDQTGVSENTLVLFISDNGGCAEMPGGDNNISHIPGPKEFYTACG 351
Query: 46 ---NTASNWPLRGTKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
A N P + K + HEGG+ + W ++ +++ H+ D PT +AG
Sbjct: 352 PSWANAQNTPFKRYKVNMHEGGICTPCVVRWPAKIKANSWTDSIAHVIDLQPTCMALAG 410
>gi|149176837|ref|ZP_01855447.1| arylsulfatase (A or B) [Planctomyces maris DSM 8797]
gi|148844274|gb|EDL58627.1| arylsulfatase (A or B) [Planctomyces maris DSM 8797]
Length = 476
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 34/99 (34%), Positives = 55/99 (55%), Gaps = 9/99 (9%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+++LD VG+V+ +L+++GL ++LVVF SDNGG H A N PLRG+K++ +
Sbjct: 256 IETLDAYVGQVLTALDQQGLRESTLVVFTSDNGG--------HPEYADNAPLRGSKWNLY 307
Query: 62 EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIA 99
E G+R + W ++ G + D PT +A
Sbjct: 308 EAGIRVPLLVRWPQHVQAGSGCTKPVSGIDLFPTFCDVA 346
>gi|119477026|ref|ZP_01617307.1| sulfatase family protein [marine gamma proteobacterium HTCC2143]
gi|119449833|gb|EAW31070.1| sulfatase family protein [marine gamma proteobacterium HTCC2143]
Length = 551
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 37/96 (38%), Positives = 51/96 (53%), Gaps = 6/96 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+++LD SVGRV+QSLE+ G+ N++VVF SDNGG +L N P RG K +
Sbjct: 329 MIRALDRSVGRVLQSLEDNGIADNTIVVFTSDNGGAGYIAL-----PDINKPYRGWKLTQ 383
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
EGG A W + G ++ +H D T
Sbjct: 384 FEGGTHVPFMAKWPAKIAAGTTFDSPIHHIDLFHTF 419
>gi|423294186|ref|ZP_17272313.1| hypothetical protein HMPREF1070_00978 [Bacteroides ovatus
CL03T12C18]
gi|392676443|gb|EIY69879.1| hypothetical protein HMPREF1070_00978 [Bacteroides ovatus
CL03T12C18]
Length = 450
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 41/105 (39%), Positives = 62/105 (59%), Gaps = 10/105 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
+L +D+ +GR+I++L+E GL N++V+F SDNG P+ S A LRGTK S
Sbjct: 241 VLAEMDKQIGRLIKALDELGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293
Query: 60 FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAGKE 102
+EGG+R + P ++ GQV E+++ D P+L IAG E
Sbjct: 294 LYEGGIRMPFLIKYPKKIKAGQVNNESVLCAVDLYPSLCAIAGIE 338
>gi|343086211|ref|YP_004775506.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354745|gb|AEL27275.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 471
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 60/105 (57%), Gaps = 9/105 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT----ASNWPLRGT 56
M++SLD++VGR+++ LE EGL N+L+ F SDNG L++ G+T SN LRG
Sbjct: 272 MIESLDKNVGRLVEYLEAEGLDKNTLIFFTSDNG----QYLNYDGSTWPKVGSNGVLRGE 327
Query: 57 KYSFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
K +EGG R + A W + + + + + D LPTL I G
Sbjct: 328 KGDLYEGGHRVPSIAYWPEKIPENVISDAPLMSMDILPTLLEIIG 372
>gi|332665500|ref|YP_004448288.1| Cerebroside-sulfatase [Haliscomenobacter hydrossis DSM 1100]
gi|332334314|gb|AEE51415.1| Cerebroside-sulfatase [Haliscomenobacter hydrossis DSM 1100]
Length = 464
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 36/97 (37%), Positives = 52/97 (53%), Gaps = 7/97 (7%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD +G+++ L++EGL +LVVF+SDNG D+ G LRG K +EGG
Sbjct: 268 LDREIGKLLAQLKKEGLDKKTLVVFVSDNGPKQRDAFESTGG------LRGKKRDMYEGG 321
Query: 65 VR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+R A W ++ G V E + D+LPT +AG
Sbjct: 322 IRVPFIARWPGKIKAGTVNETPTALWDFLPTACALAG 358
>gi|302554271|ref|ZP_07306613.1| arylsulfatase [Streptomyces viridochromogenes DSM 40736]
gi|302471889|gb|EFL34982.1| arylsulfatase [Streptomyces viridochromogenes DSM 40736]
Length = 474
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 55/105 (52%), Gaps = 10/105 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLD +VG V+ +L G +++V+F SDNGG L WPL G K+S
Sbjct: 287 MVESLDAAVGEVLAALRRSGQEDDTVVLFASDNGGERWSYL---------WPLSGGKFSL 337
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHIT-DWLPTLYFIAGKEKN 104
EGG+R A L P G+ + + H + DW TL +AG +
Sbjct: 338 QEGGIRVPAILRWPDRIDGRQVSHEPHFSPDWTATLLELAGARHD 382
>gi|262406944|ref|ZP_06083493.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294805693|ref|ZP_06764573.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
gi|345507695|ref|ZP_08787342.1| hypothetical protein BSAG_05002 [Bacteroides sp. D1]
gi|262355647|gb|EEZ04738.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294447109|gb|EFG15696.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
gi|345455385|gb|EGX26397.1| hypothetical protein BSAG_05002 [Bacteroides sp. D1]
Length = 853
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 58/103 (56%), Gaps = 5/103 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D +VGR+I L+++ L N+LV+F SDNG L F + S PLRG K+S+
Sbjct: 266 VVEEIDWNVGRLIDYLDKKKLAENTLVIFASDNG----PWLSFKEDGGSAEPLRGGKFSY 321
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
+EGGVR + W + G + ++ D PT+ AG +
Sbjct: 322 YEGGVRVPCIIRWKGSIPAGVTSDAIVASIDLFPTIMHYAGCQ 364
>gi|340620633|ref|YP_004739086.1| sulfatase [Zobellia galactanivorans]
gi|339735430|emb|CAZ98807.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
Length = 538
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 60/104 (57%), Gaps = 7/104 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTV-DSLHFHGNTASNWPLRGTKYS 59
+++++D++VGRV+ +L+ GL N+LVVF SDNGG + DS + SN PLRG K
Sbjct: 286 LVETMDDAVGRVLNTLDSLGLDKNTLVVFTSDNGGVSAGDSF-----STSNLPLRGGKGY 340
Query: 60 FHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R + +P L++ G + D PT+ + G +
Sbjct: 341 QFEGGIREPYFIKAPWLVKGGAYTSEPVTGADLYPTILDVVGAD 384
>gi|323456753|gb|EGB12619.1| hypothetical protein AURANDRAFT_70521 [Aureococcus anophagefferens]
Length = 913
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M D+ V +L+ +G+ ++L+V SDNGGP+ ++++N+PLRG KYS
Sbjct: 632 MTAMADDVVRNATAALKAKGMWDDTLIVLTSDNGGPSGTD----ADSSNNFPLRGGKYSD 687
Query: 61 HEGGVRN---VAALWSPLLRKGQVLEN---LMHITDWLPTL 95
EGGVR VA + P +G+ L+ +HI DW T
Sbjct: 688 LEGGVRAAGLVAGGFLPDAARGRTLDGAGAYVHICDWYATF 728
>gi|291230656|ref|XP_002735281.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 522
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 54/105 (51%), Gaps = 23/105 (21%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE+VG V L++ L NS+V+F + LRG K +
Sbjct: 277 MVTCMDEAVGYVTTVLKQSALWENSVVIFSTX--------------------LRGGKGTL 316
Query: 61 HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAGKE 102
+EGGVR V + SPLL +G + L+HI+DW PTL +AG +
Sbjct: 317 YEGGVRGVGFVTSPLLHEQVRGTINHELIHISDWFPTLTRLAGGD 361
>gi|410642189|ref|ZP_11352707.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
gi|410648635|ref|ZP_11359039.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
gi|410131832|dbj|GAC07438.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
gi|410138506|dbj|GAC10894.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
Length = 473
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 52/95 (54%), Gaps = 7/95 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +D+ +G V+ +L++ G+ N++VVF+SDNGG +L N S L+G K S
Sbjct: 252 MTSDVDQDIGEVLAALDKRGIAENTIVVFMSDNGG----NLEAGANNGS---LQGGKSSV 304
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
+EGGVR + P + + + M DW PTL
Sbjct: 305 YEGGVRVPVLIRWPAQLQSKTMTQPMFTQDWAPTL 339
>gi|255530697|ref|YP_003091069.1| sulfatase [Pedobacter heparinus DSM 2366]
gi|255343681|gb|ACU03007.1| sulfatase [Pedobacter heparinus DSM 2366]
Length = 472
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 30/66 (45%), Positives = 44/66 (66%), Gaps = 7/66 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ SLD+ +G+V+ +L+ L N+L++FI+DNGG TV N++ N PLRG K S
Sbjct: 251 MMTSLDDGIGKVMATLKANQLDKNTLIIFINDNGGATV-------NSSDNGPLRGMKGSK 303
Query: 61 HEGGVR 66
EGG+R
Sbjct: 304 WEGGIR 309
>gi|149196555|ref|ZP_01873609.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149140235|gb|EDM28634.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 507
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 51/96 (53%), Gaps = 3/96 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
L +DE VGR+ L E G+ N+++ F SDNG + T S LRG K S
Sbjct: 245 CLTEMDEQVGRIRAKLREMGVEKNTVLFFCSDNGPEGKKAKGAKAGTTSG--LRGRKRSL 302
Query: 61 HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTL 95
++GGVR A A W ++ G V++ M D+LPT+
Sbjct: 303 YDGGVRVPALAEWPGKIQAGSVIDAAMSTLDYLPTV 338
>gi|440718731|ref|ZP_20899174.1| arylsulfatase [Rhodopirellula baltica SWK14]
gi|436436058|gb|ELP29849.1| arylsulfatase [Rhodopirellula baltica SWK14]
Length = 490
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 37/107 (34%), Positives = 56/107 (52%), Gaps = 9/107 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++S+DE VGR++++L E GL N+ +VF SDNGG G S PLRG K ++
Sbjct: 255 MIESVDEGVGRMVETLRELGLEENTAIVFTSDNGG--------FGPATSMKPLRGYKGTY 306
Query: 61 HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
+EGG+R W ++ G + + D PT + G + D
Sbjct: 307 YEGGIREPFFVTWPGVVDAGTKSDVPVIAADLYPTFIEMTGAKLPAD 353
>gi|294644129|ref|ZP_06721904.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|292640494|gb|EFF58737.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
Length = 743
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 5/101 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D +VGR+I L+++ L N+LV+F SDNG L F + S PLRG K+S+
Sbjct: 266 VVEEIDWNVGRLIDYLDKKKLAENTLVIFASDNG----PWLSFKEDGGSAEPLRGGKFSY 321
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGGVR + W + G + ++ D PT+ AG
Sbjct: 322 YEGGVRVPCIIRWKGSIPAGVTSDAIVASIDLFPTIMHYAG 362
>gi|283781484|ref|YP_003372239.1| sulfatase [Pirellula staleyi DSM 6068]
gi|283439937|gb|ADB18379.1| sulfatase [Pirellula staleyi DSM 6068]
Length = 475
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 36/107 (33%), Positives = 58/107 (54%), Gaps = 7/107 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLD+SVGR++ L+ L ++V+F SDNGG + + SN P R K S
Sbjct: 269 MVESLDQSVGRIMAKLDALDLTKKTIVIFTSDNGGLVLREI------TSNLPARAGKGSA 322
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
+EGGVR + + P+++ G + D PTL ++G + + D
Sbjct: 323 YEGGVRVPLIVSYPPMIKPGTTCDVPAISMDLFPTLAELSGAKYSHD 369
>gi|421611130|ref|ZP_16052281.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408497944|gb|EKK02452.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 496
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 59/102 (57%), Gaps = 5/102 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++++D++VGRV++ +++ GL ++V+ SDNGG + G SN PLRG K
Sbjct: 281 MVEAMDQAVGRVLKQIDDLGLAEETIVILTSDNGGLSTS----EGWPTSNLPLRGGKGWV 336
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGK 101
+EGG+R + +P + + G + + D+ PT+ + G+
Sbjct: 337 YEGGIREAFMICAPGVTQPGTTCKEPVCSIDFYPTILDLCGQ 378
>gi|326778304|ref|ZP_08237569.1| N-acetylgalactosamine-4-sulfatase [Streptomyces griseus XylebKG-1]
gi|326658637|gb|EGE43483.1| N-acetylgalactosamine-4-sulfatase [Streptomyces griseus XylebKG-1]
Length = 487
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 52/101 (51%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ LD SVG V+++L+ G ++LV F SDNGG + NWPL G K S
Sbjct: 298 MVEDLDRSVGEVLKALKRSGQEKDTLVFFASDNGGERF---------SYNWPLAGNKGSL 348
Query: 61 HEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTLYFIAG 100
EGG+R + L P G + +L + DW TL + G
Sbjct: 349 QEGGIRVPSVLRWPARVDGHQVSDLPVFSPDWTATLLELGG 389
>gi|149175234|ref|ZP_01853856.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
gi|148845843|gb|EDL60184.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
Length = 459
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 38/109 (34%), Positives = 55/109 (50%), Gaps = 7/109 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ LD VGR++ + E L +L++F SDNGG H +N PLRG K
Sbjct: 250 MIHRLDRDVGRIMSLVNELQLRERTLIIFTSDNGGHRGVPAQLH----TNGPLRGFKRDL 305
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE--KNID 106
EGG+R A W + G+V + ++ D LPT +AG + N+D
Sbjct: 306 TEGGIRVPFIANWPGTIPAGKVSDEVIAFQDMLPTFAELAGAQVSANLD 354
>gi|325285326|ref|YP_004261116.1| arylsulfatase [Cellulophaga lytica DSM 7489]
gi|324320780|gb|ADY28245.1| Arylsulfatase [Cellulophaga lytica DSM 7489]
Length = 466
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/100 (35%), Positives = 54/100 (54%), Gaps = 7/100 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M ++D ++G V+ L+++GL N+LVVF +DNGGP+ N ++N PL GTK +
Sbjct: 252 MTLAMDRAIGTVLDELKKQGLDKNTLVVFTNDNGGPS------DANQSNNNPLSGTKANH 305
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA 99
EGG+R + W L+K + D PT +A
Sbjct: 306 LEGGIRVPFLMSWPGKLKKNTTYNYPISTLDLFPTFLNVA 345
>gi|32473667|ref|NP_866661.1| aryl-sulphate sulphohydrolase [Rhodopirellula baltica SH 1]
gi|32444203|emb|CAD74200.1| aryl-sulphate sulphohydrolase [Rhodopirellula baltica SH 1]
Length = 490
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 9/103 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++S+DE VGR++++L E GL N+ +VF SDNGG G S PLRG K ++
Sbjct: 255 MIESVDEGVGRMVETLRELGLEENTAIVFTSDNGG--------FGPATSMKPLRGYKGTY 306
Query: 61 HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
+EGG+R W ++ G + + D PT + G +
Sbjct: 307 YEGGIREPFFVTWPGVVDAGTKSDVPVIAADLYPTFIEMTGAK 349
>gi|298706919|emb|CBJ29746.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 616
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 37/98 (37%), Positives = 51/98 (52%), Gaps = 15/98 (15%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYSFHEG 63
LD+ + R+ L+ G L N+++V SDNG PT SN+PLRG K++ EG
Sbjct: 362 LDKEMRRLHDELDALGALDNTVLVVASDNGACPTAG--------GSNYPLRGYKHTIFEG 413
Query: 64 GVRNVAALWS------PLLRKGQVLENLMHITDWLPTL 95
GVR A ++S P +G +MH TDW PT
Sbjct: 414 GVRVPAFVYSKSTDLIPEEARGTRYSGMMHSTDWTPTF 451
>gi|440718016|ref|ZP_20898485.1| arylsulfatase A [Rhodopirellula baltica SWK14]
gi|436436817|gb|ELP30517.1| arylsulfatase A [Rhodopirellula baltica SWK14]
Length = 491
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 54/101 (53%), Gaps = 2/101 (1%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYSF 60
+ ++D +VGR+++ LEE G ++L+VF SDNG T+D + S PLRG K
Sbjct: 253 VANVDAAVGRLVKGLEEIGKRDDTLIVFTSDNGPETLDRYRSANRSYGSPGPLRGMKLHT 312
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
E G R + W + GQ L+ ++ D LPT +AG
Sbjct: 313 TEAGFRVPGIVNWPGEIEPGQTLDTVVSSLDLLPTFCRLAG 353
>gi|146275662|ref|YP_001165822.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
gi|145322353|gb|ABP64296.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
Length = 462
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 34/101 (33%), Positives = 54/101 (53%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++++D++V +V+ +++ G N++VVF SDNGG + WP G K
Sbjct: 260 MVETMDQNVAKVLAAIDRSGKADNTVVVFTSDNGGERF---------SDTWPFVGHKGEV 310
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + W ++ G E +M D+LPTL +AG
Sbjct: 311 LEGGVRVPLMVRWPRRIKAGSRSEQVMVSMDFLPTLLGMAG 351
>gi|32473340|ref|NP_866334.1| arylsulphatase A [Rhodopirellula baltica SH 1]
gi|32398020|emb|CAD78114.1| arylsulphatase A [Rhodopirellula baltica SH 1]
Length = 491
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 54/101 (53%), Gaps = 2/101 (1%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYSF 60
+ ++D +VGR+++ LEE G ++L+VF SDNG T+D + S PLRG K
Sbjct: 253 VANVDAAVGRLVKGLEELGKRDDTLIVFTSDNGPETLDRYRSANRSYGSPGPLRGMKLHT 312
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
E G R + W + GQ L+ ++ D LPT +AG
Sbjct: 313 TEAGFRVPGIVNWPGEIEPGQTLDTVVSSLDLLPTFCRLAG 353
>gi|168701882|ref|ZP_02734159.1| Arylsulfatase [Gemmata obscuriglobus UQM 2246]
Length = 467
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 43/108 (39%), Positives = 59/108 (54%), Gaps = 11/108 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPT-----VDSLHFHGNTASNWPLRG 55
M+ +D SVGRV++ L GL N+LV+F SDN GPT DS F N+A LRG
Sbjct: 259 MVTRMDRSVGRVVEKLNALGLEKNTLVLFTSDN-GPTHNVGGADSSFF--NSAGK--LRG 313
Query: 56 TKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKE 102
K S +EGG+R + P ++ G + ++ D LPTL AG +
Sbjct: 314 LKGSVYEGGIRVPFIAYQPGTIKAGTESDAPLYFPDVLPTLCAFAGTK 361
>gi|254483658|ref|ZP_05096879.1| sulfatase, putative [marine gamma proteobacterium HTCC2148]
gi|214036091|gb|EEB76777.1| sulfatase, putative [marine gamma proteobacterium HTCC2148]
Length = 558
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 35/104 (33%), Positives = 56/104 (53%), Gaps = 6/104 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++++D SV R++ +LE EG+ +++VVF SDNGG + G N P RG K +
Sbjct: 332 MIRAVDRSVSRILDTLEAEGIADDTVVVFTSDNGGAG-----YIGIQDVNSPFRGWKITM 386
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
EGG+R + W + G ++ + D +PTL AG +
Sbjct: 387 FEGGLRVPMFVKWPKRIAAGSSVDTPVAHIDMMPTLAAAAGASQ 430
>gi|182437669|ref|YP_001825388.1| sulfatase [Streptomyces griseus subsp. griseus NBRC 13350]
gi|178466185|dbj|BAG20705.1| putative sulfatase [Streptomyces griseus subsp. griseus NBRC 13350]
Length = 449
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 52/101 (51%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ LD SVG V+++L+ G ++LV F SDNGG + NWPL G K S
Sbjct: 260 MVEDLDRSVGEVLKALKRSGQEKDTLVFFASDNGGERF---------SYNWPLAGNKGSL 310
Query: 61 HEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTLYFIAG 100
EGG+R + L P G + +L + DW TL + G
Sbjct: 311 QEGGIRVPSVLRWPARVDGHQVSDLPVFSPDWTATLLELGG 351
>gi|299147171|ref|ZP_07040238.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Bacteroides sp. 3_1_23]
gi|298515056|gb|EFI38938.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Bacteroides sp. 3_1_23]
Length = 450
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 41/105 (39%), Positives = 61/105 (58%), Gaps = 10/105 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
+L +D+ +GR I++L+E GL N++V+F SDNG P+ S A LRGTK S
Sbjct: 241 VLAEMDKQIGRFIKALDELGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293
Query: 60 FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAGKE 102
+EGG+R + P ++ GQV E+++ D P+L IAG E
Sbjct: 294 LYEGGIRMPFLIKYPKRIKAGQVNNESVLCAVDLYPSLCAIAGIE 338
>gi|374620943|ref|ZP_09693477.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374304170|gb|EHQ58354.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 524
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 5/102 (4%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
L D VG+V+ +L+E GL N++VV+ SDNG + H G+T P RG + H
Sbjct: 262 LAEHDYRVGQVLDALDELGLTENTVVVYASDNGPDRAEFPHI-GDTG---PYRGYLGTVH 317
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EG VR L W + G+V +++ I D++PT I G+E
Sbjct: 318 EGSVRTPMMLRWPGKVEPGRVTNDIVAIHDFMPTFAAIVGEE 359
>gi|296121751|ref|YP_003629529.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296014091|gb|ADG67330.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 536
Score = 58.9 bits (141), Expect = 3e-07, Method: Composition-based stats.
Identities = 39/103 (37%), Positives = 52/103 (50%), Gaps = 8/103 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDS-LHFHGNTA------SNWPL 53
M++ LD VG VI SLE+ GLL +L+VF SDNG + FH A S L
Sbjct: 292 MIRDLDNHVGDVITSLEKHGLLEKTLIVFTSDNGATHASANPDFHVGGADPLFFNSTREL 351
Query: 54 RGTKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
+G K S +EGG+R A + W + + + DW PTL
Sbjct: 352 KGFKGSIYEGGLRVPAIVSWPGQIPPATTINTPSYFPDWFPTL 394
>gi|423290540|ref|ZP_17269389.1| hypothetical protein HMPREF1069_04432 [Bacteroides ovatus
CL02T12C04]
gi|392665927|gb|EIY59450.1| hypothetical protein HMPREF1069_04432 [Bacteroides ovatus
CL02T12C04]
Length = 450
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 41/105 (39%), Positives = 61/105 (58%), Gaps = 10/105 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
+L +D+ +GR I++L+E GL N++V+F SDNG P+ S A LRGTK S
Sbjct: 241 VLAEMDKQIGRFIKALDELGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293
Query: 60 FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAGKE 102
+EGG+R + P ++ GQV E+++ D P+L IAG E
Sbjct: 294 LYEGGIRMPFLIKYPKRIKAGQVNNESVLCAVDLYPSLCAIAGIE 338
>gi|32474361|ref|NP_867355.1| arylsulfatase A [Rhodopirellula baltica SH 1]
gi|32444899|emb|CAD74901.1| arylsulfatase A [Rhodopirellula baltica SH 1]
Length = 525
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 38/111 (34%), Positives = 61/111 (54%), Gaps = 14/111 (12%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+K++D SVGR++++L+E G+ N+LV+F SDNGG + +L G T N PLR K
Sbjct: 300 MVKAVDTSVGRIMKALKEHGVDDNTLVIFFSDNGG--LSTLRKFGPTC-NSPLRAGKGWL 356
Query: 61 HEGGVRNVAALWSPLLRKG-----------QVLENLMHITDWLPTLYFIAG 100
+EGG+R + P G + ++++ TD PT+ + G
Sbjct: 357 YEGGIREPLLVRLPKTMPGGATNETVSHQPKTVDSVACSTDLFPTILDVVG 407
>gi|297180066|gb|ADI16291.1| arylsulfatase a and related enzymes [uncultured bacterium
HF0010_16H03]
Length = 543
Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 53/96 (55%), Gaps = 6/96 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+++LD S+G+++ +LEE GL N+L++F SDNGG + G N P RG K +
Sbjct: 321 MIEALDRSIGKIVTTLEENGLTENTLIIFTSDNGGAG-----YIGLDNINKPYRGWKLTH 375
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
EGG+ A W ++K + +H TD T+
Sbjct: 376 FEGGMHIPFFAKWPAKIKKDMRYDKRIHHTDIFSTI 411
>gi|171909641|ref|ZP_02925111.1| twin-arginine translocation pathway signal precursor
[Verrucomicrobium spinosum DSM 4136]
Length = 486
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 40/109 (36%), Positives = 57/109 (52%), Gaps = 11/109 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA---------SNW 51
M+ LD+++GRV+ +L+++GL N+L++F SDNGG + F G A N
Sbjct: 243 MVACLDDNIGRVVAALDKKGLRENTLILFHSDNGG--THNPMFAGVMADVSKIKIPCDNS 300
Query: 52 PLRGTKYSFHEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
P R K S EGGVR A P K ++ L+H TD T +AG
Sbjct: 301 PYRDGKGSLFEGGVRVCALANWPGKIKPATVDGLIHATDIFTTFTKLAG 349
>gi|393784539|ref|ZP_10372702.1| hypothetical protein HMPREF1071_03570 [Bacteroides salyersiae
CL02T12C01]
gi|392665520|gb|EIY59044.1| hypothetical protein HMPREF1071_03570 [Bacteroides salyersiae
CL02T12C01]
Length = 511
Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 61/108 (56%), Gaps = 7/108 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D+S+G ++ LEE+G+ N++V+F+SDNGG T+ G N+PL K S
Sbjct: 283 LVEGMDKSLGDLMDYLEEKGIAQNTVVIFMSDNGGYTI------GRPDKNFPLSEGKGSL 336
Query: 61 HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
EGG+R + W + + + I D+ P++ +AG +K I +
Sbjct: 337 KEGGIREPMIVYWPGVTQPSAENATPVIIEDFFPSILELAGVKKGIHT 384
>gi|417302363|ref|ZP_12089465.1| arylsulfatase A [Rhodopirellula baltica WH47]
gi|327541297|gb|EGF27839.1| arylsulfatase A [Rhodopirellula baltica WH47]
Length = 544
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 38/111 (34%), Positives = 61/111 (54%), Gaps = 14/111 (12%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+K++D SVGR++++L+E G+ N+LV+F SDNGG + +L G T N PLR K
Sbjct: 319 MVKAVDTSVGRIMKALKEHGVDDNTLVIFFSDNGG--LSTLRKFGPTC-NSPLRAGKGWL 375
Query: 61 HEGGVRNVAALWSPLLRKG-----------QVLENLMHITDWLPTLYFIAG 100
+EGG+R + P G + ++++ TD PT+ + G
Sbjct: 376 YEGGIREPLLVRLPKTMPGGATNETVSHQPKTVDSVACSTDLFPTILDVVG 426
>gi|171912351|ref|ZP_02927821.1| aryl-sulphate sulphohydrolase [Verrucomicrobium spinosum DSM 4136]
Length = 491
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 45/107 (42%), Positives = 59/107 (55%), Gaps = 7/107 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTAS----NWPLRGT 56
ML S+DESVGR+++ L++ L N+LVVF SDNGG V G A N PLRG
Sbjct: 248 MLASVDESVGRILKLLDDLKLSQNTLVVFSSDNGG--VGGYEREGIDAKGITDNAPLRGG 305
Query: 57 KYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
K +EGGVR W ++ GQV ++ D PTL +AG +
Sbjct: 306 KGMLYEGGVRVPYVFQWKGHIKAGQVNTTPINSVDLYPTLVELAGAQ 352
>gi|170727981|ref|YP_001762007.1| sulfatase [Shewanella woodyi ATCC 51908]
gi|169813328|gb|ACA87912.1| sulfatase [Shewanella woodyi ATCC 51908]
Length = 482
Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 58/104 (55%), Gaps = 7/104 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGN----TASNWPLRGT 56
M+ ++D+SVGR+++SLE++GL N+LVV SDNGG S G+ + +N PLR
Sbjct: 275 MIWAVDKSVGRILESLEKQGLDKNTLVVLTSDNGG---FSTRHQGDERVTSTANLPLRSG 331
Query: 57 KYSFHEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
K +EGGVR + P + + L D PTL +AG
Sbjct: 332 KGWVYEGGVRIPLLIHQPGQQIQSQHDTLTTSADLYPTLANVAG 375
>gi|393783639|ref|ZP_10371811.1| hypothetical protein HMPREF1071_02679 [Bacteroides salyersiae
CL02T12C01]
gi|392668564|gb|EIY62059.1| hypothetical protein HMPREF1071_02679 [Bacteroides salyersiae
CL02T12C01]
Length = 553
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 33/101 (32%), Positives = 59/101 (58%), Gaps = 2/101 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D+S+G ++ LEE+ L N++++F+SDNGG + +T N+PL K S
Sbjct: 329 LVEGMDKSLGDLMDYLEEKNLQENTVILFMSDNGGLAAHTRAGQLHT-QNYPLNSGKGSA 387
Query: 61 HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGGVR + W ++ G + + I D+ P++ IAG
Sbjct: 388 YEGGVREPMIVSWPGTVKPGSKCDRYLMIEDFFPSILEIAG 428
>gi|336415446|ref|ZP_08595786.1| hypothetical protein HMPREF1017_02894 [Bacteroides ovatus
3_8_47FAA]
gi|335941042|gb|EGN02904.1| hypothetical protein HMPREF1017_02894 [Bacteroides ovatus
3_8_47FAA]
Length = 450
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 41/105 (39%), Positives = 61/105 (58%), Gaps = 10/105 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
+L +D+ +GR I++L+E GL N++V+F SDNG P+ S A LRGTK S
Sbjct: 241 VLAEMDKQIGRFIKALDELGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293
Query: 60 FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAGKE 102
+EGG+R + P ++ GQV E+++ D P+L IAG E
Sbjct: 294 LYEGGIRMPFLIKYPKKIKAGQVNNESVLCAVDLYPSLCAIAGIE 338
>gi|160885324|ref|ZP_02066327.1| hypothetical protein BACOVA_03323 [Bacteroides ovatus ATCC 8483]
gi|156108946|gb|EDO10691.1| arylsulfatase [Bacteroides ovatus ATCC 8483]
Length = 450
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 41/105 (39%), Positives = 61/105 (58%), Gaps = 10/105 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
+L +D+ +GR I++L+E GL N++V+F SDNG P+ S A LRGTK S
Sbjct: 241 VLAEMDKQIGRFIKALDELGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293
Query: 60 FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAGKE 102
+EGG+R + P ++ GQV E+++ D P+L IAG E
Sbjct: 294 LYEGGIRMPFLIKYPKKIKAGQVNNESVLCAVDLYPSLCAIAGIE 338
>gi|149178145|ref|ZP_01856740.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
gi|148843065|gb|EDL57433.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
Length = 460
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 38/108 (35%), Positives = 57/108 (52%), Gaps = 12/108 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M SLD+ +GRV+ SL+ GL N+LV+F++D+GG V + GN N P RG K +
Sbjct: 260 MTVSLDDGIGRVMSSLKNNGLDQNTLVIFMTDHGGDYV----YGGN---NQPFRGAKATL 312
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
EGG+R + W ++ G + D PT+ A N+D+
Sbjct: 313 FEGGIRVPCIIRWPGKIKAGTETNEVAWALDLFPTICHFA----NVDT 356
>gi|427388135|ref|ZP_18884018.1| hypothetical protein HMPREF9447_05051 [Bacteroides oleiciplenus YIT
12058]
gi|425724718|gb|EKU87592.1| hypothetical protein HMPREF9447_05051 [Bacteroides oleiciplenus YIT
12058]
Length = 557
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 34/104 (32%), Positives = 57/104 (54%), Gaps = 1/104 (0%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ +D+S+G V+ LE +G+ N++++F SDNGG +V + N PLR K S
Sbjct: 303 MVEGMDKSLGDVLDFLETKGVADNTIIIFYSDNGGHSVGNEKGGEPHTQNLPLREGKGSV 362
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
+EGG+R + W + G + + D PT+ +AG +K
Sbjct: 363 YEGGIRVPMLFCWPGKVSAGTRINTPVSTEDMFPTILEMAGVKK 406
>gi|423214930|ref|ZP_17201458.1| hypothetical protein HMPREF1074_02990 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692193|gb|EIY85431.1| hypothetical protein HMPREF1074_02990 [Bacteroides xylanisolvens
CL03T12C04]
Length = 450
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 41/105 (39%), Positives = 61/105 (58%), Gaps = 10/105 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
+L +D+ +GR I++L+E GL N++V+F SDNG P+ S A LRGTK S
Sbjct: 241 VLAEMDKQIGRFIKALDELGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293
Query: 60 FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAGKE 102
+EGG+R + P ++ GQV E+++ D P+L IAG E
Sbjct: 294 LYEGGIRMPFLIKYPKKIKAGQVNNESVLCAVDLYPSLCAIAGIE 338
>gi|390573121|ref|ZP_10253308.1| sulfatase [Burkholderia terrae BS001]
gi|389934853|gb|EIM96794.1| sulfatase [Burkholderia terrae BS001]
Length = 563
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 52/96 (54%), Gaps = 6/96 (6%)
Query: 6 DESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGV 65
D+ VG++++SL+E G+ N++VV+ +DN GP F A+ P R K S +EG
Sbjct: 301 DQDVGKLLKSLDEMGIADNTIVVYTTDN-GPN----QFTWPDAATTPFRNEKDSNYEGAF 355
Query: 66 RNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
R A + W ++ GQV + DW PTL AG
Sbjct: 356 RVPAMVRWPGHVKAGQVSNEIFSGMDWFPTLLAAAG 391
>gi|187736242|ref|YP_001878354.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
gi|187426294|gb|ACD05573.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
Length = 562
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 34/104 (32%), Positives = 61/104 (58%), Gaps = 3/104 (2%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D+S+G + + L++ L N++++F++DNGG + GN SN+PL K S
Sbjct: 326 LIQGMDKSLGDIREYLKKNNLDKNTVIIFMADNGGLAISGR--MGNKESNYPLSFGKGSN 383
Query: 61 HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAGKEK 103
EGG+R ++ P + K + V + I D+ PT+ IAG +K
Sbjct: 384 REGGIREPMIVYWPGVTKAESVCTTPVIIEDFFPTILEIAGAKK 427
>gi|32474453|ref|NP_867447.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH 1]
gi|32444991|emb|CAD74993.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH 1]
Length = 496
Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 63/111 (56%), Gaps = 9/111 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++++D++VGRV++ +++ GL ++V+ SDNGG + G SN PLRG K
Sbjct: 281 MVEAMDQAVGRVLKQIDDLGLAEETIVILTSDNGGLSTS----EGWPTSNLPLRGGKGWV 336
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGK----EKNID 106
+EGG+R + +P + + G + + D+ PT+ + G+ EK +D
Sbjct: 337 YEGGIREAFMIRAPGVTQPGTTCKEPVCSIDFYPTILDLCGQSPPAEKIVD 387
>gi|420253579|ref|ZP_14756627.1| arylsulfatase A family protein [Burkholderia sp. BT03]
gi|398051875|gb|EJL44184.1| arylsulfatase A family protein [Burkholderia sp. BT03]
Length = 563
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 52/96 (54%), Gaps = 6/96 (6%)
Query: 6 DESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGV 65
D+ VG++++SL+E G+ N++VV+ +DN GP F A+ P R K S +EG
Sbjct: 301 DQDVGKLLKSLDEMGIADNTIVVYTTDN-GPN----QFTWPDAATTPFRNEKDSNYEGAF 355
Query: 66 RNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
R A + W ++ GQV + DW PTL AG
Sbjct: 356 RVPAMVRWPGHVKAGQVSNEIFSGMDWFPTLLAAAG 391
>gi|440716917|ref|ZP_20897419.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
gi|436437985|gb|ELP31567.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
Length = 496
Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 63/111 (56%), Gaps = 9/111 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++++D++VGRV++ +++ GL ++V+ SDNGG + G SN PLRG K
Sbjct: 281 MVEAMDQAVGRVLKQIDDLGLTEETIVILTSDNGGLSTS----EGWPTSNLPLRGGKGWV 336
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGK----EKNID 106
+EGG+R + +P + + G + + D+ PT+ + G+ EK +D
Sbjct: 337 YEGGIREAFMIRAPGVTQPGTTCKEPVCSIDFYPTILDLCGQSPPAEKIVD 387
>gi|421613610|ref|ZP_16054683.1| arylsulfatase A [Rhodopirellula baltica SH28]
gi|408495569|gb|EKK00155.1| arylsulfatase A [Rhodopirellula baltica SH28]
Length = 503
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 38/111 (34%), Positives = 61/111 (54%), Gaps = 14/111 (12%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+K++D SVGR++++L+E G+ N+LV+F SDNGG + +L G T N PLR K
Sbjct: 278 MVKAVDTSVGRIMKALKEHGVDDNTLVIFFSDNGG--LSTLRKFGPTC-NSPLRAGKGWL 334
Query: 61 HEGGVRNVAALWSPLLRKG-----------QVLENLMHITDWLPTLYFIAG 100
+EGG+R + P G + ++++ TD PT+ + G
Sbjct: 335 YEGGIREPLLVRLPKTMPGGATNETVSHQPKTVDSVACSTDLFPTILDVVG 385
>gi|431797524|ref|YP_007224428.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
gi|430788289|gb|AGA78418.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
Length = 442
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 36/96 (37%), Positives = 53/96 (55%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M SLDE+VG++ LE G L N+++ FI+DNGG S H +A N PL+G K
Sbjct: 243 MTHSLDENVGKIFNRLEAMGELDNTIIFFINDNGG----SKHI---SADNSPLKGYKGQL 295
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R A W ++ V ++ D LP++
Sbjct: 296 YEGGIRVPFAVRWPGKIKGKTVCHSITSSLDILPSI 331
>gi|351732525|ref|ZP_08950216.1| sulfatase [Acidovorax radicis N35]
Length = 444
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 35/107 (32%), Positives = 52/107 (48%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +GR++ +L+ G+ ++LVVF SDNGG + +WPL G K
Sbjct: 240 MIHHMDEGIGRIMAALQRHGIDRDTLVVFTSDNGGERF---------SDSWPLVGGKMDL 290
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R A W ++ G V DW T+ AG + D
Sbjct: 291 TEGGIRVPWIAHWPAVIAPGSVSPQTCMTMDWSATMLDAAGAMPHAD 337
>gi|319951998|ref|YP_004163265.1| n-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
gi|319420658|gb|ADV47767.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
Length = 471
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 7/101 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M +LD + GRV+ L++ GL N+++VF +DNGGPT N + N PL GTK +
Sbjct: 247 MTLALDRACGRVLDKLKQLGLDKNTIIVFSNDNGGPT------DKNASLNLPLSGTKSNH 300
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + W ++ + D LPT Y G
Sbjct: 301 LEGGIRVPFLMSWPKHIQPKTTYTFPISTMDLLPTFYAAGG 341
>gi|325108172|ref|YP_004269240.1| sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324968440|gb|ADY59218.1| sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 520
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 41/120 (34%), Positives = 65/120 (54%), Gaps = 18/120 (15%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLH-----FHGN-------- 46
M+ LDE++GR++Q LE++G L N+L++F SDNG P S H + G
Sbjct: 270 MVDRLDENIGRLLQHLEQKGQLENTLILFCSDNGACPFERSKHIDIAPWKGGSFYLYDAS 329
Query: 47 --TASNWPLRGTKYSFHEGGVRNVAALWSP--LLRKGQVLENLMHITDWLPTLYFIAGKE 102
T N PLR K + HEGG+ ++ + P + + G+ ++ H+ D + TL +AG E
Sbjct: 330 WATVGNTPLRHYKQTQHEGGISSLLIVHWPGKIKQPGRWEDSPGHLVDVMATLLEVAGAE 389
>gi|149176594|ref|ZP_01855206.1| arylsulfatase A [Planctomyces maris DSM 8797]
gi|148844473|gb|EDL58824.1| arylsulfatase A [Planctomyces maris DSM 8797]
Length = 510
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 35/100 (35%), Positives = 51/100 (51%), Gaps = 2/100 (2%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
++ LD S G+++ L E G+ N+LV++ SDNG P ++ +N PL G Y+
Sbjct: 275 IEELDWSTGQILDKLVELGIDKNTLVIWTSDNGSPMAKDMN-STERGTNKPLNGRGYTTS 333
Query: 62 EGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EG R +W P + G V E L D LPT +AG
Sbjct: 334 EGAFRVPTIVWWPETVPAGTVCEELATTMDLLPTFARLAG 373
>gi|414072362|ref|ZP_11408307.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410805226|gb|EKS11247.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 473
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 7/100 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M LD + G V+ L+E GL N++VVF +DNGGP+ N ++N PL GTK +
Sbjct: 259 MTLGLDRASGYVLDKLKELGLDDNTIVVFSNDNGGPS------DKNASNNAPLAGTKSNQ 312
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA 99
EGG+R + W ++ G + + D LPT Y A
Sbjct: 313 LEGGIRVPFLISWPKHIKPGSTYDYPVSTLDLLPTFYSAA 352
>gi|417304244|ref|ZP_12091273.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
gi|327539456|gb|EGF26071.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
Length = 496
Score = 58.5 bits (140), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 59/102 (57%), Gaps = 5/102 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++++D++VGRV++ +++ GL ++V+ SDNGG + G SN PLRG K
Sbjct: 281 MVEAMDQAVGRVLKQIDDLGLTEETIVILTSDNGGLSTS----EGWPTSNLPLRGGKGWV 336
Query: 61 HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGK 101
+EGG+R + +P + + G + + D+ PT+ + G+
Sbjct: 337 YEGGIREAFMIRAPGVTQPGTTCKEPVCSIDFYPTILDLCGQ 378
>gi|160891516|ref|ZP_02072519.1| hypothetical protein BACUNI_03967 [Bacteroides uniformis ATCC 8492]
gi|317478375|ref|ZP_07937539.1| sulfatase [Bacteroides sp. 4_1_36]
gi|156858923|gb|EDO52354.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
gi|316905534|gb|EFV27324.1| sulfatase [Bacteroides sp. 4_1_36]
Length = 525
Score = 58.5 bits (140), Expect = 4e-07, Method: Composition-based stats.
Identities = 38/106 (35%), Positives = 56/106 (52%), Gaps = 11/106 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHF--------HGNT--ASN 50
M+ +LD+ +GRV++SL + GL N+L + ISDNG P + GNT +S
Sbjct: 253 MVLALDDGIGRVMESLRKNGLEENTLFILISDNGSPRGQGIECSTGYEYKDRGNTTMSSP 312
Query: 51 WPLRGTKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
P RG K +EGG+R + W L +G V +N + D PT+
Sbjct: 313 GPFRGYKADTYEGGIRVPYIMSWPSELPQGMVYDNPVISLDIFPTV 358
>gi|119505013|ref|ZP_01627089.1| Twin-arginine translocation pathway signal [marine gamma
proteobacterium HTCC2080]
gi|119458995|gb|EAW40094.1| Twin-arginine translocation pathway signal [marine gamma
proteobacterium HTCC2080]
Length = 653
Score = 58.5 bits (140), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 52/96 (54%), Gaps = 8/96 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
ML +D +VGRV+++L+ L ++LV+ +SDNGG SN P G K +F
Sbjct: 271 MLMQVDAAVGRVLETLDASDLTRSTLVIVVSDNGGT-------EKQLPSNQPFIGVKNTF 323
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
EGG+R + W ++ K V++ + D+ PTL
Sbjct: 324 TEGGLRTPLLMRWPEVIPKNMVIDETVSYLDYFPTL 359
>gi|395804316|ref|ZP_10483556.1| sulfatase [Flavobacterium sp. F52]
gi|395433415|gb|EJF99368.1| sulfatase [Flavobacterium sp. F52]
Length = 539
Score = 58.5 bits (140), Expect = 4e-07, Method: Composition-based stats.
Identities = 33/103 (32%), Positives = 57/103 (55%), Gaps = 7/103 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +LD+++GR+ + +++EGL N+L++F SDNGG + +N PL+G K+S
Sbjct: 330 MVSALDDAIGRIREKVKKEGLEDNTLIIFASDNGGAD------YTFATTNAPLKGGKFSH 383
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+ AL W ++ V +N + D T+ G +
Sbjct: 384 FEGGINVPFALSWKGKIKPNTVYKNPVSTLDIFTTIATAIGSD 426
>gi|440715767|ref|ZP_20896296.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
gi|436439253|gb|ELP32723.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
Length = 826
Score = 58.5 bits (140), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 51/105 (48%), Gaps = 9/105 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +GRV+ + E GL N++V+F SDNGG A N+P RG K
Sbjct: 266 MVAGMDEGIGRVVDQIRESGLGENTMVIFYSDNGG--------RREHAVNFPYRGHKGML 317
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R W +R G E+ + D PT AG + +
Sbjct: 318 FEGGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGMDPS 362
>gi|421502322|ref|ZP_15949276.1| putative sulfatase [Pseudomonas mendocina DLHK]
gi|400346754|gb|EJO95110.1| putative sulfatase [Pseudomonas mendocina DLHK]
Length = 500
Score = 58.5 bits (140), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 54/105 (51%), Gaps = 7/105 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+L LD ++ R++ +LEE G N++V+F SDNGG +D +N+PL G K ++
Sbjct: 124 ILSQLDHNIARLLTALEESGQAKNTVVIFASDNGGANID-------MDNNYPLVGKKATY 176
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
EGGVR L P + + D PTL +AG + +
Sbjct: 177 TEGGVRTPLLLRWPGRYENLDIREPTVFMDLFPTLVAMAGGKPPV 221
>gi|119475675|ref|ZP_01616028.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
gi|119451878|gb|EAW33111.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
Length = 479
Score = 58.5 bits (140), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 56/100 (56%), Gaps = 4/100 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D SVG+V+ +L E+G+ N+LVVF SDNG + + HG +A L+ K +
Sbjct: 262 VIEEIDWSVGQVLSTLSEQGISENTLVVFTSDNGPWLI--MGAHGGSAGL--LKSGKGTS 317
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R A W P K V N D PT+ IAG
Sbjct: 318 YEGGMREPAIFWWPEKIKPAVAHNTASTLDLFPTIMSIAG 357
>gi|449136514|ref|ZP_21771894.1| arylsulfatase [Rhodopirellula europaea 6C]
gi|448884831|gb|EMB15303.1| arylsulfatase [Rhodopirellula europaea 6C]
Length = 490
Score = 58.5 bits (140), Expect = 4e-07, Method: Composition-based stats.
Identities = 36/103 (34%), Positives = 54/103 (52%), Gaps = 9/103 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++S+DE VGR++ +L E GL ++ +VF SDNGG G S PLRG K ++
Sbjct: 255 MIESVDEGVGRMVDTLRELGLEESTAIVFTSDNGG--------FGPATSMKPLRGYKGTY 306
Query: 61 HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
+EGG+R W ++ G E + D PT + G +
Sbjct: 307 YEGGIREPFFVTWPGVVDAGTKCEIPIIAADLYPTFVEMTGAK 349
>gi|343086300|ref|YP_004775595.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354834|gb|AEL27364.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 454
Score = 58.5 bits (140), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 55/101 (54%), Gaps = 8/101 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
++ LD+ +G+VI +L+E G N+L++F SDNGG D A+N LR K S
Sbjct: 252 FIEHLDDGIGQVINALKETGQYENTLIIFTSDNGGHLPD-------LANNGLLRDGKQSM 304
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R A+ W ++ G + + D PTL IAG
Sbjct: 305 YEGGLRVPTAISWPAKIKAGSTSDQVNLSMDIFPTLTEIAG 345
>gi|392969626|ref|ZP_10335041.1| sulfatase [Fibrisoma limi BUZ 3]
gi|387841820|emb|CCH57099.1| sulfatase [Fibrisoma limi BUZ 3]
Length = 477
Score = 58.5 bits (140), Expect = 4e-07, Method: Composition-based stats.
Identities = 39/97 (40%), Positives = 54/97 (55%), Gaps = 5/97 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTAS-NWPLRGTKYS 59
ML S DE VGRV+Q L++ GL N+LVVF SDNGG + G T + N PLR K
Sbjct: 264 MLLSFDEGVGRVLQKLKDTGLADNTLVVFTSDNGGLGMPEA---GPTPTDNSPLRKWKGH 320
Query: 60 FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
+EGG+R + S + + + + TD++PT
Sbjct: 321 VYEGGIRVPLIMAGSGITARNATCDRYLTGTDFVPTF 357
>gi|298706913|emb|CBJ29740.1| Formylglycine-dependent sulfatase, C-terminal fragment [Ectocarpus
siliculosus]
Length = 597
Score = 58.5 bits (140), Expect = 4e-07, Method: Composition-based stats.
Identities = 36/98 (36%), Positives = 53/98 (54%), Gaps = 15/98 (15%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYSFHEG 63
LD+ V R+ +L+ G+L N+++VF SDNG P+ SN+PLRG K++ +G
Sbjct: 328 LDKEVRRLHTALDALGVLDNAVIVFASDNGACPS--------KGGSNYPLRGFKHTTFQG 379
Query: 64 GVRNVAALWS------PLLRKGQVLENLMHITDWLPTL 95
GVR ++S P +G +MH DW PTL
Sbjct: 380 GVRVPGFVYSKSADLIPEEARGTRYSGMMHSMDWTPTL 417
>gi|325110466|ref|YP_004271534.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324970734|gb|ADY61512.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 536
Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats.
Identities = 39/111 (35%), Positives = 60/111 (54%), Gaps = 9/111 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++S+D++VG V++ L++ GL N++V F SDNGG + SN P RG K
Sbjct: 285 MIESMDDAVGIVLKKLDDLGLRENTIVCFTSDNGGVSSGDAF----ATSNLPFRGGKGRQ 340
Query: 61 HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG----KEKNID 106
EGG+R + +P G ++ + DW PTL +AG KE+ +D
Sbjct: 341 WEGGIRVPFYISAPSAAVAGTTVDTPANGIDWYPTLLELAGVDQPKEQQVD 391
>gi|298479797|ref|ZP_06997997.1| LOW QUALITY PROTEIN: sulfatase family protein [Bacteroides sp. D22]
gi|298274187|gb|EFI15748.1| LOW QUALITY PROTEIN: sulfatase family protein [Bacteroides sp. D22]
Length = 456
Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats.
Identities = 35/101 (34%), Positives = 57/101 (56%), Gaps = 5/101 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D +VGR++ L+++ L N+LV+F SDNG L F + S PLRG K+S+
Sbjct: 266 VVEEIDWNVGRLLDYLDKKKLAENTLVIFASDNG----PWLSFKEDGGSAEPLRGGKFSY 321
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGGVR + W + G + ++ D PT+ AG
Sbjct: 322 YEGGVRVPCIIRWKGSIPAGVTSDAIVASIDLFPTIMHYAG 362
>gi|168702101|ref|ZP_02734378.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Gemmata
obscuriglobus UQM 2246]
Length = 596
Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats.
Identities = 36/97 (37%), Positives = 58/97 (59%), Gaps = 4/97 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
M++S+D +VGRV++ L++ L N+LV+F SDNGG T++ + F N PLR K
Sbjct: 250 MVESMDAAVGRVLKRLDDLKLSDNTLVLFTSDNGGLATLEGMPFAPTI--NAPLREGKGY 307
Query: 60 FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
+EGGVR + A W ++ G V++ + D+ T+
Sbjct: 308 LYEGGVRVPLIAKWPGKVKPGTVMDQVACSIDFFDTI 344
>gi|293372063|ref|ZP_06618458.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
gi|292632967|gb|EFF51552.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
Length = 450
Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats.
Identities = 40/103 (38%), Positives = 61/103 (59%), Gaps = 10/103 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
+L +D+ +GR+I++L+E GL N++V+F SDNG P+ S A LRGTK S
Sbjct: 241 VLAEMDKQIGRLIKALDELGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293
Query: 60 FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAG 100
+EGG+R + P ++ GQV E+++ D P+L IAG
Sbjct: 294 LYEGGIRMPFLIKYPKKIKAGQVNNESVLCAVDLYPSLCAIAG 336
>gi|296123895|ref|YP_003631673.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296016235|gb|ADG69474.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 637
Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 4/101 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGN-TASNWPLRGTKYS 59
M+ +LD+ VG ++ +LE L +++VF SDNGG D++ G+ SN PLRG K S
Sbjct: 282 MIAALDQRVGLIVAALERLQLTQRTIIVFTSDNGGNMTDTI--EGDLLTSNRPLRGGKGS 339
Query: 60 FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIA 99
+EGG R + +W + + ++ + D LPTL +A
Sbjct: 340 MYEGGSRVPLIVVWPGVATPARSCDDAVSAVDLLPTLVDMA 380
>gi|440716652|ref|ZP_20897156.1| arylsulfatase A [Rhodopirellula baltica SWK14]
gi|436438149|gb|ELP31709.1| arylsulfatase A [Rhodopirellula baltica SWK14]
Length = 521
Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats.
Identities = 38/111 (34%), Positives = 60/111 (54%), Gaps = 14/111 (12%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+K++D SVGR++++L+E G+ N+L+VF SDNGG + +L G T N PLR K
Sbjct: 296 MVKAVDTSVGRIMKALKEHGVDDNTLIVFFSDNGG--LSTLRKFGPTC-NSPLRAGKGWL 352
Query: 61 HEGGVRNVAALWSPLLRKG-----------QVLENLMHITDWLPTLYFIAG 100
+EGG+R + P G + +++ TD PT+ + G
Sbjct: 353 YEGGIREPLLVRLPKTMPGGATNETVSHQPKTVDSFACSTDLFPTILDVVG 403
>gi|372210595|ref|ZP_09498397.1| sulfatase [Flavobacteriaceae bacterium S85]
Length = 472
Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats.
Identities = 36/102 (35%), Positives = 55/102 (53%), Gaps = 2/102 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHG-NTASNWPLRGTKYS 59
M+ +D +G VIQSL++ G+ N+L+ F+SDNGGP G N + N P RG K +
Sbjct: 246 MVDVMDRGIGTVIQSLKDSGIYENTLIFFLSDNGGPQSKGGSTKGWNGSDNQPFRGGKGN 305
Query: 60 FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
++GGV A W ++ G V+ ++ D T +AG
Sbjct: 306 LYDGGVHVPFFASWPNKIKPGTVVNYPVNSLDIGRTAVALAG 347
>gi|326431091|gb|EGD76661.1| hypothetical protein PTSG_08011 [Salpingoeca sp. ATCC 50818]
Length = 511
Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats.
Identities = 31/82 (37%), Positives = 50/82 (60%), Gaps = 6/82 (7%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGP-TVDSLHFHGNT-----ASNWPLRG 55
L ++D ++ + ++ + G+ N L+V+ +DNGGP TV S G +SN+PLRG
Sbjct: 251 LTTVDSALANITEAYKRNGMYDNLLIVYTADNGGPITVQSGPNKGKQEDSIGSSNYPLRG 310
Query: 56 TKYSFHEGGVRNVAALWSPLLR 77
K++ +EGGVR+ A L P L+
Sbjct: 311 GKHNAYEGGVRSTAFLSGPALK 332
>gi|386381787|ref|ZP_10067486.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
gi|385670744|gb|EIF93788.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
Length = 426
Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 53/101 (52%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ LD SVG+V+ +L+ G ++LV F SDNGG + NWPL G K S
Sbjct: 240 LVEDLDRSVGKVLAALKRSGQEEDTLVFFASDNGGERF---------SYNWPLSGNKGSL 290
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+GG+R + L W + GQV ++ DW TL + G
Sbjct: 291 LDGGIRVPSILRWPARIEGGQVSGVPVYSPDWTATLLDVGG 331
>gi|291227067|ref|XP_002733509.1| PREDICTED: steroid sulfatase-like, partial [Saccoglossus
kowalevskii]
Length = 321
Score = 58.5 bits (140), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 59/107 (55%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +G + ++L+E L N++++F S D+ + +NWPLRG K ++
Sbjct: 41 MVTCMDEGIGNITRTLKETELWNNTIIIFSS-------DNGGNPNHGGNNWPLRGGKGTY 93
Query: 61 HEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIAGKEKN 104
EG + + ++S + K G V ++H++DW PT+ +AG N
Sbjct: 94 WEGAIHGLGFVYSSAIAKEMRGTVNTEMIHVSDWFPTIVNLAGGSLN 140
>gi|417302824|ref|ZP_12089908.1| sulfatase [Rhodopirellula baltica WH47]
gi|327540898|gb|EGF27458.1| sulfatase [Rhodopirellula baltica WH47]
Length = 490
Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats.
Identities = 37/107 (34%), Positives = 55/107 (51%), Gaps = 9/107 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++S+DE VGR++ +L E GL N+ +VF SDNGG G S PLRG K ++
Sbjct: 255 MIESVDEGVGRMVDALRELGLEENTAIVFTSDNGG--------FGPATSMKPLRGYKGTY 306
Query: 61 HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
+EGG+R W ++ G + + D PT + G + D
Sbjct: 307 YEGGIREPFFVTWPGVVDAGTKSDVPVIAADLYPTFIEMTGAKLPAD 353
>gi|410638759|ref|ZP_11349312.1| hypothetical protein GLIP_3906 [Glaciecola lipolytica E3]
gi|410141287|dbj|GAC16517.1| hypothetical protein GLIP_3906 [Glaciecola lipolytica E3]
Length = 508
Score = 58.5 bits (140), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 63/113 (55%), Gaps = 14/113 (12%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
L LD+++G ++ LE+ LL N+++V++SDNGG T A+N RG+KY+
Sbjct: 296 QLYYLDQAIGEIVNELEKNNLLENTIIVYLSDNGGST-------PIYANNGRFRGSKYTL 348
Query: 61 HEGGVRNVAALWSPLLRK-----GQVLENLMHITDWLPTLYFIAGKE--KNID 106
+EGG+R ++ P + + +++++ D PTL AG + +NID
Sbjct: 349 YEGGIRVPMIVYIPEEFRHNTTMNRRIDDVVSTMDIYPTLLQAAGLQAPENID 401
>gi|410617068|ref|ZP_11328044.1| sulfatase [Glaciecola polaris LMG 21857]
gi|410163337|dbj|GAC32182.1| sulfatase [Glaciecola polaris LMG 21857]
Length = 488
Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats.
Identities = 28/65 (43%), Positives = 41/65 (63%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ S+D+ +GR++ L++ G N+L+ F+SDNGG +S + N ASN P R K S
Sbjct: 253 MVDSMDQGIGRLVNGLKQSGKFENTLIFFLSDNGGVYPESWMPNSNWASNKPFRRGKVSL 312
Query: 61 HEGGV 65
EGGV
Sbjct: 313 TEGGV 317
>gi|126663705|ref|ZP_01734701.1| arylsulfatase A precursor [Flavobacteria bacterium BAL38]
gi|126624288|gb|EAZ94980.1| arylsulfatase A precursor [Flavobacteria bacterium BAL38]
Length = 512
Score = 58.5 bits (140), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 6 DESVGRVIQSLEEEGLLGNSLVVFISDNG-----GPTVDSLHFHGNTASNWPLRGTKYSF 60
D VG +I++L+EEG+L N+L+VF SDNG G D++ GN N PLRG KYS
Sbjct: 300 DYCVGEIIKTLKEEGVLDNTLIVFSSDNGPVLNDGYYDDAVEKLGNHKPNGPLRGGKYSL 359
Query: 61 HEGGVR 66
E G R
Sbjct: 360 LEAGTR 365
>gi|440717770|ref|ZP_20898247.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
gi|436437072|gb|ELP30746.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
Length = 480
Score = 58.5 bits (140), Expect = 6e-07, Method: Composition-based stats.
Identities = 38/109 (34%), Positives = 57/109 (52%), Gaps = 7/109 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG------GPTVDSLHFHGNTASNWPLR 54
+++ +D +VG+++ +L E L N+ V+F SDNG VD + S PLR
Sbjct: 217 VIEEIDFNVGKILDTLSELKLADNTYVLFTSDNGPWLVKNKGHVDGHRPRDHGGSAGPLR 276
Query: 55 GTKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKE 102
K S EGGVR A LW P + G V +++ D +PTL +AG +
Sbjct: 277 SGKVSSFEGGVRVPAVLWGPGRVPAGTVCDSIATTMDVMPTLAALAGAD 325
>gi|393788830|ref|ZP_10376956.1| hypothetical protein HMPREF1068_03236 [Bacteroides nordii
CL02T12C05]
gi|392653936|gb|EIY47586.1| hypothetical protein HMPREF1068_03236 [Bacteroides nordii
CL02T12C05]
Length = 509
Score = 58.5 bits (140), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 60/106 (56%), Gaps = 7/106 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D+S+G ++ LE +G+ N++++F+SDNGG T+ G T N+PL K S
Sbjct: 284 LVEGMDKSLGDLMDYLEVKGIAQNTVIIFMSDNGGYTI------GRTNKNYPLSEGKGSL 337
Query: 61 HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
EGG+R + W + + V + I D+ P++ +AG + I
Sbjct: 338 KEGGIREPMIVCWPGVTQPATVNVTPVIIEDFFPSILELAGMKGRI 383
>gi|298375848|ref|ZP_06985804.1| sulfatase family protein [Bacteroides sp. 3_1_19]
gi|298266885|gb|EFI08542.1| sulfatase family protein [Bacteroides sp. 3_1_19]
Length = 459
Score = 58.5 bits (140), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 55/106 (51%), Gaps = 4/106 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D SVG V+++L E GL N+ V+F SDNG L H N S PL+ K ++
Sbjct: 249 VVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG----PWLTKHENGGSAGPLKDGKGTW 304
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG R A W P + + +M D PT +AG E+ D
Sbjct: 305 WEGGFRVPAICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQPKD 350
>gi|256840089|ref|ZP_05545598.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|256739019|gb|EEU52344.1| conserved hypothetical protein [Parabacteroides sp. D13]
Length = 514
Score = 58.5 bits (140), Expect = 6e-07, Method: Composition-based stats.
Identities = 33/102 (32%), Positives = 60/102 (58%), Gaps = 2/102 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYS 59
+++ +D+S+G ++ LE+ G + N++V+F+SDNGG + + G N+PL K S
Sbjct: 287 LIEGMDKSLGDLMDWLEKNGEVDNTIVIFMSDNGGLSSEPGWRDGEIHTQNYPLNSGKGS 346
Query: 60 FHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGGVR + W +++ G + + I D+ PT+ +AG
Sbjct: 347 AYEGGVREPMIVRWPGVVKPGSKCDKYLIIEDFYPTILEMAG 388
>gi|313232711|emb|CBY19381.1| unnamed protein product [Oikopleura dioica]
Length = 453
Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 52/96 (54%), Gaps = 10/96 (10%)
Query: 2 LKSLDESVGRVIQSLEEEGL--LGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYS 59
+KS+D +GR + L+ + N++V+F SDNGGP ASNWPLRG K
Sbjct: 173 VKSMDLEIGRYLDVLKRIYIPDYNNTIVIFSSDNGGP-------RKLGASNWPLRGGKGC 225
Query: 60 FHEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
+G R+ A + P + K + ++ H DW+PTL
Sbjct: 226 IFDGAFRSRALIHIPGM-KHKNYPHMFHAVDWMPTL 260
>gi|294053962|ref|YP_003547620.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613295|gb|ADE53450.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 494
Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 40/64 (62%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ S+D+ +GRV+ +LE+ G L N+L+ F SDNGGP + + A N P RG K +
Sbjct: 255 MIDSMDQGIGRVVDALEKSGKLENTLIFFFSDNGGPRPKPGYEGEDFADNTPYRGGKGAM 314
Query: 61 HEGG 64
EGG
Sbjct: 315 LEGG 318
>gi|336403079|ref|ZP_08583800.1| hypothetical protein HMPREF0127_01113 [Bacteroides sp. 1_1_30]
gi|335947153|gb|EGN08947.1| hypothetical protein HMPREF0127_01113 [Bacteroides sp. 1_1_30]
Length = 853
Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 58/103 (56%), Gaps = 5/103 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D +VGR++ L+++ L N+LV+F SDNG L F + S PLRG K+S+
Sbjct: 266 VVEEIDWNVGRLLDYLDKKKLAENTLVIFASDNG----PWLSFKEDGGSAEPLRGGKFSY 321
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
+EGGVR + W + G + ++ D PT+ AG +
Sbjct: 322 YEGGVRVPCIIRWKGSIPAGVTSDAIVASIDLFPTIMHYAGCQ 364
>gi|119713775|gb|ABL97823.1| putative secreted sulfatase [uncultured marine bacterium
HF10_49E08]
Length = 667
Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 60/111 (54%), Gaps = 11/111 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGN----------TASN 50
ML++ DES+GR+ + L++ GL N++V+F +DNGG + + + N +SN
Sbjct: 286 MLEATDESLGRIRKKLKDLGLADNTIVIFTADNGGMSASNQYRGINHPIESLDSRFASSN 345
Query: 51 WPLRGTKYSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
PLRG K +EGG+R + W ++ L+ TD+ PTL + G
Sbjct: 346 LPLRGAKGWNYEGGIRVPLVVYWPGRIKPDSTSNALVTGTDFYPTLLEMIG 396
>gi|87308971|ref|ZP_01091109.1| Arylsulfatase [Blastopirellula marina DSM 3645]
gi|87288314|gb|EAQ80210.1| Arylsulfatase [Blastopirellula marina DSM 3645]
Length = 482
Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats.
Identities = 38/101 (37%), Positives = 57/101 (56%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNW-----PLRG 55
M+ +D +G+++ +E GL G++LV+F SDNG T D L G + S++ LRG
Sbjct: 269 MITHMDRDIGKILSKIETLGLDGDTLVIFTSDNGA-TYDRL---GGSDSDFFESVANLRG 324
Query: 56 TKYSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
K S +EGGVR + A W ++ GQV + D LPT+
Sbjct: 325 LKGSLYEGGVRVPLVARWPGKIKSGQVSHVPSALWDMLPTI 365
>gi|325109525|ref|YP_004270593.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969793|gb|ADY60571.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 453
Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 60/101 (59%), Gaps = 6/101 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D VG ++Q+L++ GL N+LVVF SDN GP ++ + HG+ PLR K++
Sbjct: 238 VVQEIDWQVGELMQTLDKLGLTDNTLVVFASDN-GPQLN-VEGHGSAG---PLRDGKWTN 292
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + W ++ G V ++ I D LPT IAG
Sbjct: 293 FEGGIRVPCLMRWPSVVPAGSVNNEIVGIIDMLPTFCEIAG 333
>gi|149173056|ref|ZP_01851687.1| arylsulphatase A [Planctomyces maris DSM 8797]
gi|148847862|gb|EDL62194.1| arylsulphatase A [Planctomyces maris DSM 8797]
Length = 442
Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats.
Identities = 36/117 (30%), Positives = 63/117 (53%), Gaps = 12/117 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWP-------- 52
M+ S+D+ VG+V+ SL E G+ N+L++F +DNG P L + N P
Sbjct: 236 MIASMDDMVGKVVASLNEMGVRDNTLIIFTTDNGTPAASYLTVNENGKMVRPKVVSVQNG 295
Query: 53 --LRGTKYSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG-KEKNI 105
+ G K + G R + A W ++ G +++++ ++D+LPT+ IAG KE ++
Sbjct: 296 KIVPGGKGKLDDTGTRVPLIANWPGHIKAGTEVDDMVDMSDYLPTVAEIAGLKEADV 352
>gi|423330335|ref|ZP_17308119.1| hypothetical protein HMPREF1075_00132 [Parabacteroides distasonis
CL03T12C09]
gi|409231951|gb|EKN24799.1| hypothetical protein HMPREF1075_00132 [Parabacteroides distasonis
CL03T12C09]
Length = 459
Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 55/106 (51%), Gaps = 4/106 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D SVG V+++L E GL N+ V+F SDNG L H N S PL+ K ++
Sbjct: 249 VVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG----PWLTEHENGGSAGPLKDGKGTW 304
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG R A W P + + +M D PT +AG E+ D
Sbjct: 305 WEGGFRVPAICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQPKD 350
>gi|421613320|ref|ZP_16054406.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408495914|gb|EKK00487.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 826
Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 51/105 (48%), Gaps = 9/105 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +GRV+ + E GL N++++F SDNGG A N+P RG K
Sbjct: 266 MVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG--------RREHAVNFPYRGHKGML 317
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R W +R G E+ + D PT AG + +
Sbjct: 318 FEGGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGMDPS 362
>gi|323451422|gb|EGB07299.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 512
Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats.
Identities = 36/104 (34%), Positives = 55/104 (52%), Gaps = 12/104 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M LD+SVG V ++L+++ + +SL+V+ DNGG + + P RG K+S
Sbjct: 243 MAAFLDDSVGNVTKALKKQNMYDSSLIVYTPDNGG-------YLTQGGDDAPFRGGKFSD 295
Query: 61 HEGGVRNVAALWS----PLLRKGQVLENLMHITDWLPTLYFIAG 100
+GGVR VAA S P +G L+H+ DW+ T + G
Sbjct: 296 FQGGVR-VAAFISGGLVPAGLRGTASAALVHVCDWMATFAALGG 338
>gi|423294501|ref|ZP_17272628.1| hypothetical protein HMPREF1070_01293 [Bacteroides ovatus
CL03T12C18]
gi|392675692|gb|EIY69133.1| hypothetical protein HMPREF1070_01293 [Bacteroides ovatus
CL03T12C18]
Length = 518
Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 59/102 (57%), Gaps = 2/102 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYS 59
+++ +D+S+G ++ LE+ G + N++++F+SDNGG +S G N+PL K S
Sbjct: 291 LIEGMDKSLGDLMNWLEKNGEVNNTIIIFMSDNGGLASESGWRDGKLHTQNYPLNSGKGS 350
Query: 60 FHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R + W ++ G N + I D+ PT+ +AG
Sbjct: 351 TYEGGIREPMIVSWPGVVAPGSKCNNYLLIEDFYPTILEMAG 392
>gi|343086294|ref|YP_004775589.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354828|gb|AEL27358.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 511
Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 61/115 (53%), Gaps = 16/115 (13%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYS- 59
M+ +DE VG +++ L+ +GL N+L++F SDN GPT F+G T S W G +
Sbjct: 299 MVSYMDEQVGELVEQLKSKGLYENTLIIFTSDN-GPT-----FNGGTDSPWFNSGGAFRE 352
Query: 60 --------FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
+EGG+R + A W +++G V ++ D +PTL +AGK I
Sbjct: 353 EKGFGKGFLYEGGIRVPMIASWPGKIKEGSVSDHASVFYDVMPTLAEVAGKPLEI 407
>gi|398909246|ref|ZP_10654456.1| arylsulfatase A family protein [Pseudomonas sp. GM49]
gi|398188431|gb|EJM75734.1| arylsulfatase A family protein [Pseudomonas sp. GM49]
Length = 559
Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats.
Identities = 35/106 (33%), Positives = 51/106 (48%), Gaps = 6/106 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+ LD+SVG +++ +E+ G N++VVF +DNG F P +GTK +
Sbjct: 300 MAQLDDSVGAILKCVEDAGEADNTIVVFTTDNGAEV-----FTWPDGGMTPFKGTKGTVM 354
Query: 62 EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG R A + W ++ G V + DW PTL AG D
Sbjct: 355 EGGFRAPAIIRWPGKIKPGTVENGIYSALDWFPTLVAAAGNPNITD 400
>gi|294817955|ref|ZP_06776597.1| putative sulfatase [Streptomyces clavuligerus ATCC 27064]
gi|326446919|ref|ZP_08221653.1| putative sulfatase [Streptomyces clavuligerus ATCC 27064]
gi|294322770|gb|EFG04905.1| putative sulfatase [Streptomyces clavuligerus ATCC 27064]
Length = 473
Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 50/101 (49%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ LD SVG V+ +L G ++LVVF SDNGG + WPL G K S
Sbjct: 286 MVEDLDRSVGAVLAALRRSGQEDDTLVVFASDNGGERF---------SYTWPLSGGKGSL 336
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+GG+R L W + GQV ++ DW TL + G
Sbjct: 337 LDGGIRVPTILRWPARIEGGQVSRVPVYSPDWTATLLEVGG 377
>gi|32473617|ref|NP_866611.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
gi|32398297|emb|CAD78392.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
Length = 543
Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 51/105 (48%), Gaps = 9/105 (8%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +GRV+ + E GL N++++F SDNGG A N+P RG K
Sbjct: 266 MVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG--------RREHAVNFPYRGHKGML 317
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
EGG+R W +R G E+ + D PT AG + +
Sbjct: 318 FEGGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGMDPS 362
>gi|340619607|ref|YP_004738060.1| sulfatase [Zobellia galactanivorans]
gi|339734404|emb|CAZ97781.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
Length = 463
Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats.
Identities = 38/111 (34%), Positives = 55/111 (49%), Gaps = 13/111 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D +VGR+ +L G+ N+++VF+SDNGG G A N P RG K
Sbjct: 243 MVNAVDANVGRIDSTLVANGMKENTILVFLSDNGG--------RGEHADNRPYRGHKGML 294
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG----KEKNID 106
EGG++ W L+ GQ E ++ D PT AG KE +D
Sbjct: 295 FEGGIKVPFFITWPKKLKGGQHYEKMVSSLDLFPTFLEAAGGKVKKESQLD 345
>gi|304309759|ref|YP_003809357.1| sulfatase [gamma proteobacterium HdN1]
gi|301795492|emb|CBL43690.1| probable sulfatase precursor [gamma proteobacterium HdN1]
Length = 661
Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 56/100 (56%), Gaps = 7/100 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+L LD++V R++ +L+++G+ +++VVF SDNGGP + +N PL+G K ++
Sbjct: 280 VLNQLDDNVARLLAALKQKGVDDDTIVVFASDNGGP-------NRARNNNLPLQGLKATY 332
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR + P + + + D PTL +AG
Sbjct: 333 TEGGVRTPVMVRWPARFSSRDVVQPTQLVDLFPTLVALAG 372
>gi|47216038|emb|CAG11369.1| unnamed protein product [Tetraodon nigroviridis]
Length = 474
Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats.
Identities = 36/102 (35%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
L D+++G ++ +LE G++ N+L+ F SDN GP + + GN PLR K + +
Sbjct: 226 LLEFDQTIGSLLATLERTGVINNTLIFFTSDN-GPELMRMSRGGNAG---PLRCGKGTTY 281
Query: 62 EGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+R A A W L++ G V + D LPT +AG +
Sbjct: 282 EGGMREPAIAYWQGLIQPG-VTHEMASTLDILPTFASLAGAK 322
>gi|256841082|ref|ZP_05546589.1| arylsulfatase A [Parabacteroides sp. D13]
gi|256736925|gb|EEU50252.1| arylsulfatase A [Parabacteroides sp. D13]
Length = 459
Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 55/106 (51%), Gaps = 4/106 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D SVG V+++L E GL N+ V+F SDNG L H N S PL+ K ++
Sbjct: 249 VVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG----PWLTEHENGGSAGPLKDGKGTW 304
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG R A W P + + +M D PT +AG E+ D
Sbjct: 305 WEGGFRVPAICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQPKD 350
>gi|393784389|ref|ZP_10372554.1| hypothetical protein HMPREF1071_03422 [Bacteroides salyersiae
CL02T12C01]
gi|392666165|gb|EIY59682.1| hypothetical protein HMPREF1071_03422 [Bacteroides salyersiae
CL02T12C01]
Length = 463
Score = 58.2 bits (139), Expect = 7e-07, Method: Composition-based stats.
Identities = 35/109 (32%), Positives = 57/109 (52%), Gaps = 12/109 (11%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-----ASNWPLRG 55
M+ LD +VG+++ L+E+ +L N++++F SDNG +H G SN RG
Sbjct: 261 MVSCLDHAVGKIVGKLKEKNILENTIIIFSSDNG------VHQEGGHDPEYFNSNGIFRG 314
Query: 56 TKYSFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
K +EGG+ W ++ KGQ ++ D+LPT+ I G +K
Sbjct: 315 IKRDLYEGGIHTPFIVQWENVIAKGQTSSHISAFWDFLPTVADIVGCDK 363
>gi|87198499|ref|YP_495756.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
gi|87134180|gb|ABD24922.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
Length = 491
Score = 58.2 bits (139), Expect = 7e-07, Method: Composition-based stats.
Identities = 34/101 (33%), Positives = 50/101 (49%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ LD +GRV+++L+ +++VVF SDNGG + WP G K
Sbjct: 283 MVRRLDYQIGRVLEALKANRAEQDTIVVFTSDNGGERF---------SDTWPFSGRKTEL 333
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R A + W + R G + + DWLPT AG
Sbjct: 334 LEGGLRIPAIVRWPGVTRAGTTSDAQIISMDWLPTFLAAAG 374
>gi|343087660|ref|YP_004776955.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342356194|gb|AEL28724.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 513
Score = 58.2 bits (139), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 61/117 (52%), Gaps = 19/117 (16%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSL-----HFHGNTAS---------- 49
LD +G ++Q LE G+ N+L++F SDNG + + H N+ S
Sbjct: 300 LDYYIGELLQELERLGMKENTLIIFASDNGSQFITTHAGQEGESHSNSPSNVNKEINPDA 359
Query: 50 ---NWPLRGTKYSFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
N+P +GTK+S +EGGVR + A W +G V + L+ + D LP+L + G++
Sbjct: 360 HQPNYPYKGTKWSVNEGGVRTPLIASWKGNFPEGLVSDQLIALNDVLPSLAALVGEK 416
>gi|343084004|ref|YP_004773299.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342352538|gb|AEL25068.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 445
Score = 58.2 bits (139), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 55/103 (53%), Gaps = 10/103 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ +DE VGR++QSL+E G N++V+FISDNG + N LRG K
Sbjct: 253 MVEVMDEGVGRIMQSLKETGQYENTIVIFISDNGA---------NHYGDNGGLRGFKAGP 303
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHIT-DWLPTLYFIAGKE 102
+EGG R A P KG + N ++ D LPTL G++
Sbjct: 304 YEGGSRVPAIFSFPKEVKGGSVNNQTVLSMDLLPTLLDFIGRK 346
>gi|198432447|ref|XP_002128343.1| PREDICTED: similar to galactosamine (N-acetyl)-6-sulfate sulfatase
[Ciona intestinalis]
Length = 513
Score = 58.2 bits (139), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 56/107 (52%), Gaps = 8/107 (7%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD VG +IQ L++ GL N+LV+F SDNG + S +G SN P K + EGG
Sbjct: 252 LDYGVGVIIQKLKQLGLDKNTLVLFSSDNGAAMIGSAFENG---SNGPFLCGKQTTFEGG 308
Query: 65 VRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG----KEKNID 106
+R W P +++GQV + + DW T +A K++ ID
Sbjct: 309 IREPTIAWGPGFVQQGQVSHQVGSLMDWFATALDLASIAVPKDRIID 355
>gi|16125424|ref|NP_419988.1| sulfatase [Caulobacter crescentus CB15]
gi|13422494|gb|AAK23156.1| sulfatase family protein [Caulobacter crescentus CB15]
Length = 521
Score = 58.2 bits (139), Expect = 7e-07, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 45/66 (68%), Gaps = 5/66 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+++LD +VG+V+Q+L++ GL N+LV+F SDNGG ++ G N P RG K +F
Sbjct: 299 MVRNLDRNVGKVLQALKDRGLDDNTLVIFTSDNGGA-----NYIGLPDINKPYRGWKATF 353
Query: 61 HEGGVR 66
EGG++
Sbjct: 354 FEGGIK 359
>gi|149196983|ref|ZP_01874036.1| arylsulphatase A [Lentisphaera araneosa HTCC2155]
gi|149140093|gb|EDM28493.1| arylsulphatase A [Lentisphaera araneosa HTCC2155]
Length = 485
Score = 58.2 bits (139), Expect = 7e-07, Method: Composition-based stats.
Identities = 34/100 (34%), Positives = 54/100 (54%), Gaps = 9/100 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +D+S+G +++ +EE +LV+F+SDNGG + N PLR K S
Sbjct: 242 MIDYMDQSIGLLMKGIEEMKSDRETLVIFMSDNGGLS--------RVWENEPLRRGKGSA 293
Query: 61 HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA 99
+EGG+R + W ++ GQ + +HITD P+ IA
Sbjct: 294 YEGGIRVPLIMHWPERIKAGQTISEPVHITDLFPSFLSIA 333
>gi|443325558|ref|ZP_21054248.1| arylsulfatase A family protein [Xenococcus sp. PCC 7305]
gi|442794839|gb|ELS04236.1| arylsulfatase A family protein [Xenococcus sp. PCC 7305]
Length = 523
Score = 58.2 bits (139), Expect = 7e-07, Method: Composition-based stats.
Identities = 35/100 (35%), Positives = 56/100 (56%), Gaps = 12/100 (12%)
Query: 6 DESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT---ASNWPLRGTKYSFHE 62
D++VG+++ +L+E GL N++V++ +DNG H NT A+ P RG K + E
Sbjct: 278 DKNVGQILVALDEAGLTNNTIVIYGTDNGP--------HMNTWPDAAMTPFRGEKNTGWE 329
Query: 63 GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGK 101
GG R + W ++ G V ++M DW+PTL AG+
Sbjct: 330 GGFRTPCMIRWPGHIQAGTVSNDIMSNLDWMPTLVAAAGE 369
>gi|262383796|ref|ZP_06076932.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
gi|262294694|gb|EEY82626.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
Length = 459
Score = 58.2 bits (139), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 55/106 (51%), Gaps = 4/106 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D SVG V+++L E GL N+ V+F SDNG L H N S PL+ K ++
Sbjct: 249 VVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG----PWLTEHENGGSAGPLKDGKGTW 304
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG R A W P + + +M D PT +AG E+ D
Sbjct: 305 WEGGFRVPAICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQPKD 350
>gi|301310141|ref|ZP_07216080.1| N-acetylgalactosamine-6-sulfatase [Bacteroides sp. 20_3]
gi|423336382|ref|ZP_17314129.1| hypothetical protein HMPREF1059_00081 [Parabacteroides distasonis
CL09T03C24]
gi|300831715|gb|EFK62346.1| N-acetylgalactosamine-6-sulfatase [Bacteroides sp. 20_3]
gi|409240857|gb|EKN33631.1| hypothetical protein HMPREF1059_00081 [Parabacteroides distasonis
CL09T03C24]
Length = 510
Score = 58.2 bits (139), Expect = 7e-07, Method: Composition-based stats.
Identities = 33/102 (32%), Positives = 59/102 (57%), Gaps = 2/102 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYS 59
+++ +D+S+G ++ LE+ G N++V+F+SDNGG + + G N+PL K S
Sbjct: 284 LIEGMDKSLGDLMDWLEKNGEADNTIVIFMSDNGGLSSEPRWRDGEIHTQNYPLNSGKGS 343
Query: 60 FHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGGVR + W +++ G + + I D+ PT+ +AG
Sbjct: 344 AYEGGVREPMIVRWPGVVKPGSKCDKYLIIEDFYPTILEMAG 385
>gi|307152367|ref|YP_003887751.1| sulfatase [Cyanothece sp. PCC 7822]
gi|306982595|gb|ADN14476.1| sulfatase [Cyanothece sp. PCC 7822]
Length = 570
Score = 58.2 bits (139), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 55/110 (50%), Gaps = 16/110 (14%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-----PTVDSLHFHGNTASNWPLRGT 56
++ D+ VG ++ L+E+GL +++V++ +DNG P + FHG +NW
Sbjct: 308 MEEHDQDVGILLDYLDEQGLTDDTIVIYTTDNGAEVFSWPDGGTTPFHGEKNTNW----- 362
Query: 57 KYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
EGG R A + W + GQ+ ++ DWLPTL AG +I
Sbjct: 363 -----EGGFRVPAMIRWPGYIEAGQISNEIISHQDWLPTLLAAAGAPDDI 407
>gi|448409003|ref|ZP_21574483.1| sulfatase [Halosimplex carlsbadense 2-9-1]
gi|445673783|gb|ELZ26342.1| sulfatase [Halosimplex carlsbadense 2-9-1]
Length = 496
Score = 58.2 bits (139), Expect = 7e-07, Method: Composition-based stats.
Identities = 37/105 (35%), Positives = 58/105 (55%), Gaps = 5/105 (4%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGN---TASNWPLRGT 56
M+ +LD +VGR++ +L+E G+ +LVVF SDNGG D + + S PLRG
Sbjct: 275 MVSNLDANVGRLLAALDERGIREETLVVFTSDNGGLGDYDEIGLPRDWKRVTSQGPLRGG 334
Query: 57 KYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
K + +EGG+R + W ++ G+ + TD+ PT +AG
Sbjct: 335 KRTLYEGGIRVPTVVSWPGVVPGGRRSSTPIAGTDFFPTFVDLAG 379
>gi|262405065|ref|ZP_06081615.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294643590|ref|ZP_06721394.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294808689|ref|ZP_06767423.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
gi|345511384|ref|ZP_08790927.1| hypothetical protein BSAG_02032 [Bacteroides sp. D1]
gi|229444530|gb|EEO50321.1| hypothetical protein BSAG_02032 [Bacteroides sp. D1]
gi|262355940|gb|EEZ05030.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292641085|gb|EFF59299.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294444128|gb|EFG12861.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
Length = 518
Score = 58.2 bits (139), Expect = 7e-07, Method: Composition-based stats.
Identities = 33/102 (32%), Positives = 58/102 (56%), Gaps = 2/102 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYS 59
+++ +D+S+G ++ LE+ G N++++F+SDNGG +S G N+PL K S
Sbjct: 291 LIEGMDKSLGDLMNWLEKNGEANNTIIIFMSDNGGLASESGWRDGKLHTQNYPLNSGKGS 350
Query: 60 FHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R + W ++ G N + I D+ PT+ +AG
Sbjct: 351 TYEGGIREPMIVSWPGVVAPGSKCNNYLLIEDFYPTILEMAG 392
>gi|255014752|ref|ZP_05286878.1| arylsulfatase A [Bacteroides sp. 2_1_7]
gi|410102839|ref|ZP_11297764.1| hypothetical protein HMPREF0999_01536 [Parabacteroides sp. D25]
gi|409237966|gb|EKN30761.1| hypothetical protein HMPREF0999_01536 [Parabacteroides sp. D25]
Length = 459
Score = 58.2 bits (139), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 55/106 (51%), Gaps = 4/106 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D SVG V+++L E GL N+ V+F SDNG L H N S PL+ K ++
Sbjct: 249 VVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG----PWLTEHENGGSAGPLKDGKGTW 304
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG R A W P + + +M D PT +AG E+ D
Sbjct: 305 WEGGFRVPAICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQPKD 350
>gi|150008924|ref|YP_001303667.1| arylsulfatase [Parabacteroides distasonis ATCC 8503]
gi|149937348|gb|ABR44045.1| arylsulfatase A [Parabacteroides distasonis ATCC 8503]
Length = 459
Score = 58.2 bits (139), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 55/106 (51%), Gaps = 4/106 (3%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
+++ +D SVG V+++L E GL N+ V+F SDNG L H N S PL+ K ++
Sbjct: 249 VVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG----PWLTEHENGGSAGPLKDGKGTW 304
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG R A W P + + +M D PT +AG E+ D
Sbjct: 305 WEGGFRVPAICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQPKD 350
>gi|405379201|ref|ZP_11033103.1| arylsulfatase A family protein [Rhizobium sp. CF142]
gi|397324334|gb|EJJ28697.1| arylsulfatase A family protein [Rhizobium sp. CF142]
Length = 560
Score = 58.2 bits (139), Expect = 7e-07, Method: Composition-based stats.
Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 6/103 (5%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD+SVG +++ +E+ G N++VVF +DNG F P +GTK + EGG
Sbjct: 304 LDDSVGAILKCVEDAGETDNTIVVFTTDNGAEV-----FTWPDGGMTPFKGTKGTVMEGG 358
Query: 65 VRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
R A + W ++ G V + DW PTL AG D
Sbjct: 359 FRAPAIIRWPGKVKPGTVENGIFSALDWFPTLVAAAGNPNITD 401
>gi|293371631|ref|ZP_06618042.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
gi|292633328|gb|EFF51898.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
Length = 494
Score = 58.2 bits (139), Expect = 7e-07, Method: Composition-based stats.
Identities = 33/102 (32%), Positives = 58/102 (56%), Gaps = 2/102 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYS 59
+++ +D+S+G ++ LE+ G N++++F+SDNGG +S G N+PL K S
Sbjct: 267 LIEGMDKSLGDLMNWLEKNGEANNTIIIFMSDNGGLASESGWRDGKLHTQNYPLNSGKGS 326
Query: 60 FHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG+R + W ++ G N + I D+ PT+ +AG
Sbjct: 327 TYEGGIREPMIVSWPGVVAPGSKCNNYLLIEDFYPTILEMAG 368
>gi|221234167|ref|YP_002516603.1| sulfatase family protein [Caulobacter crescentus NA1000]
gi|220963339|gb|ACL94695.1| sulfatase family protein [Caulobacter crescentus NA1000]
Length = 551
Score = 58.2 bits (139), Expect = 7e-07, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 45/66 (68%), Gaps = 5/66 (7%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+++LD +VG+V+Q+L++ GL N+LV+F SDNGG ++ G N P RG K +F
Sbjct: 329 MVRNLDRNVGKVLQALKDRGLDDNTLVIFTSDNGGA-----NYIGLPDINKPYRGWKATF 383
Query: 61 HEGGVR 66
EGG++
Sbjct: 384 FEGGIK 389
>gi|357061552|ref|ZP_09122301.1| hypothetical protein HMPREF9332_01859 [Alloprevotella rava F0323]
gi|355373735|gb|EHG21044.1| hypothetical protein HMPREF9332_01859 [Alloprevotella rava F0323]
Length = 513
Score = 57.8 bits (138), Expect = 7e-07, Method: Composition-based stats.
Identities = 32/102 (31%), Positives = 57/102 (55%), Gaps = 2/102 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLG-NSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYS 59
+++ +D+S+G ++ LE+ N++++FISDNGG V + NWP R K S
Sbjct: 285 LVEGMDKSLGDLMDFLEQHPKTAQNTIIIFISDNGGQGVAVRQGREDYDQNWPARAGKGS 344
Query: 60 FHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+ GG++ + W + + G V +N + I D+ P+L +AG
Sbjct: 345 AYLGGIKEPMIVYWPKVTKAGTVNDNRVMIEDFFPSLLEMAG 386
>gi|229818890|ref|YP_002880416.1| sulfatase [Beutenbergia cavernae DSM 12333]
gi|229564803|gb|ACQ78654.1| sulfatase [Beutenbergia cavernae DSM 12333]
Length = 464
Score = 57.8 bits (138), Expect = 7e-07, Method: Composition-based stats.
Identities = 37/107 (34%), Positives = 55/107 (51%), Gaps = 21/107 (19%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
L+ +D + ++ +LEE GL +++VV+++DNGG T N SN PL GTKY+
Sbjct: 258 LELMDAEIAALLDTLEERGLADDTIVVYLTDNGGSTC-------NYGSNAPLAGTKYTLW 310
Query: 62 EGGVRNVAALWSPLLRK--------GQVLENLMHITDWLPTLYFIAG 100
EGG+R P L + G+ L+ D +PTL AG
Sbjct: 311 EGGIR------VPFLVRWPGGGWAGGRTTTALVSSLDLVPTLVAAAG 351
>gi|261880234|ref|ZP_06006661.1| N-acetylgalactosamine-6-sulfatase [Prevotella bergensis DSM 17361]
gi|270333067|gb|EFA43853.1| N-acetylgalactosamine-6-sulfatase [Prevotella bergensis DSM 17361]
Length = 513
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 31/102 (30%), Positives = 57/102 (55%), Gaps = 2/102 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLG-NSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYS 59
+++ +D+S+G ++ LE + N++++F+SDNGG ++ N N+P RG K S
Sbjct: 294 LIEGMDKSLGDIMNYLEARPEVARNTIIIFMSDNGGQAINIRQGRANYDQNYPARGGKGS 353
Query: 60 FHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+ + W +++ G N + I D+ PT+ +AG
Sbjct: 354 ALEGGIHEPMLVSWPGVIKGGTTNGNRVMIEDFYPTIIEMAG 395
>gi|399043292|ref|ZP_10737678.1| arylsulfatase A family protein [Rhizobium sp. CF122]
gi|398058225|gb|EJL50129.1| arylsulfatase A family protein [Rhizobium sp. CF122]
Length = 561
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 34/97 (35%), Positives = 50/97 (51%), Gaps = 6/97 (6%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD+SVG +++ +E+ G N++V+F +DNG F P +GTK + EGG
Sbjct: 305 LDDSVGALMKCVEDAGETDNTIVIFTTDNGAEV-----FTWPDGGMTPFKGTKGTVMEGG 359
Query: 65 VRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
R A + WS ++ G V + DW PTL AG
Sbjct: 360 FRAPAIIRWSGKVKPGTVENGIFSALDWFPTLAAAAG 396
>gi|149197012|ref|ZP_01874065.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
HTCC2155]
gi|149140122|gb|EDM28522.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
HTCC2155]
Length = 463
Score = 57.8 bits (138), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 56/100 (56%), Gaps = 10/100 (10%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++SLDESVGRV+ L+E G+ N++V+F DNG V + + G L+G K
Sbjct: 257 MVQSLDESVGRVLDKLDELGIADNTIVIFTGDNGA--VGTNYCGG-------LKGAKALS 307
Query: 61 HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
HEGGVR + P + G +M TD+ PT+ +AG
Sbjct: 308 HEGGVREPFFIKGPGIEPGVSSVPVM-ATDFYPTILDLAG 346
>gi|383110969|ref|ZP_09931787.1| hypothetical protein BSGG_2074 [Bacteroides sp. D2]
gi|313694539|gb|EFS31374.1| hypothetical protein BSGG_2074 [Bacteroides sp. D2]
Length = 450
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 40/105 (38%), Positives = 61/105 (58%), Gaps = 10/105 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
+L +D+ +GR I++L++ GL N++V+F SDNG P+ S A LRGTK S
Sbjct: 241 VLAEMDKQIGRFIKALDDMGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293
Query: 60 FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAGKE 102
+EGG+R + P ++ GQV E+++ D P+L IAG E
Sbjct: 294 LYEGGIRMPFLIKYPKKIKAGQVNNESVLCAVDLYPSLCAIAGIE 338
>gi|344339408|ref|ZP_08770337.1| sulfatase [Thiocapsa marina 5811]
gi|343800712|gb|EGV18657.1| sulfatase [Thiocapsa marina 5811]
Length = 653
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 34/100 (34%), Positives = 54/100 (54%), Gaps = 6/100 (6%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K +D+++G V++ LE+ G L N++VVF +DNG T+ + G P RG K +
Sbjct: 387 MKQMDDNIGYVLKKLEDMGELDNTIVVFTTDNGAETIT--YPDGGVT---PFRGGKLTTW 441
Query: 62 EGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + W + G + + DW+PT IAG
Sbjct: 442 EGGMRAPMLVRWPGHVEAGSNPDQIFSSLDWVPTFVNIAG 481
>gi|340369799|ref|XP_003383435.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Amphimedon
queenslandica]
Length = 523
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 36/100 (36%), Positives = 54/100 (54%), Gaps = 5/100 (5%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
+K LD SVG+++ +L+ G+ ++LV+F SDNGG T + + SN P K +
Sbjct: 261 VKELDYSVGQILDTLKTLGIDKDTLVIFSSDNGGAT----YAKESGGSNSPFLCGKETTF 316
Query: 62 EGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R W P ++ GQV L + DW T +AG
Sbjct: 317 EGGMREPTIAWWPGTIQPGQVSHQLGSLMDWYSTALDLAG 356
>gi|409437884|ref|ZP_11264988.1| Sulfatase [Rhizobium mesoamericanum STM3625]
gi|408750579|emb|CCM76148.1| Sulfatase [Rhizobium mesoamericanum STM3625]
Length = 560
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 34/103 (33%), Positives = 52/103 (50%), Gaps = 6/103 (5%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD+SVG +++ +E+ G N++V+F +DNG F P +GTK + EGG
Sbjct: 304 LDDSVGALMKCVEDAGEADNTIVIFTTDNGAEV-----FTWPDGGMTPFKGTKGTVMEGG 358
Query: 65 VRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
R A + W+ ++ G V + DW PTL AG + D
Sbjct: 359 FRAPAIIRWTGKVKPGSVENGIFSALDWFPTLAAAAGNTEITD 401
>gi|340619473|ref|YP_004737926.1| sulfatase [Zobellia galactanivorans]
gi|339734270|emb|CAZ97647.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
Length = 477
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 34/100 (34%), Positives = 57/100 (57%), Gaps = 9/100 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ ++D++VG+++++L++ GL GN+LV+F SDNGG S PLR K S+
Sbjct: 256 MVYAMDQNVGKLLKALKQNGLEGNTLVIFTSDNGGIRAIS--------EQSPLRAGKGSY 307
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIA 99
+EGG+R + W ++ + + D+ PTL IA
Sbjct: 308 YEGGIRVPLIIKWPGKIQAQSISTAPVSHLDFYPTLQSIA 347
>gi|325109524|ref|YP_004270592.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324969792|gb|ADY60570.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 486
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 39/100 (39%), Positives = 53/100 (53%), Gaps = 8/100 (8%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
++ LD S G ++++L+E GL N+LV+F SDNG T G S PL+G K S +
Sbjct: 260 IQELDWSTGEILRALKELGLDSNTLVIFTSDNGPNT-------GKGGSAGPLKGGKGSTY 312
Query: 62 EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGGVR A W + G + M D PTL +AG
Sbjct: 313 EGGVRVPFVARWPGAIPAGIESDEAMTGMDLPPTLTRLAG 352
>gi|120613101|ref|YP_972779.1| sulfatase [Acidovorax citrulli AAC00-1]
gi|120591565|gb|ABM35005.1| sulfatase [Acidovorax citrulli AAC00-1]
Length = 453
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 35/107 (32%), Positives = 50/107 (46%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+ +DE +GR++ +L GL ++LVVF SDNGG + NWPL G K
Sbjct: 249 MIHHMDEGIGRIMDTLRAHGLERDTLVVFTSDNGGERF---------SDNWPLVGGKMDL 299
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
EGG+R A W + G DW T+ + G + + D
Sbjct: 300 TEGGIRVPWIAHWPAAIAPGGTSAQPCLTMDWSATMLELGGAQPHPD 346
>gi|297180722|gb|ADI16930.1| arylsulfatase a and related enzymes [uncultured SAR406 cluster
bacterium HF0010_18O13]
Length = 585
Score = 57.8 bits (138), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 53/103 (51%), Gaps = 6/103 (5%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M+K+LD S+GR++ SL++ GL N+L++ SDNGG + L + N P RG K +
Sbjct: 363 MIKALDRSIGRILDSLQQNGLSENTLIILTSDNGGASYIEL-----SDINKPYRGWKLTH 417
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
EGG+ A W + G ++H D T+ A E
Sbjct: 418 FEGGMHIPFMARWPKEIEAGTSFSPVVHHNDIFKTIAAAAKAE 460
>gi|340619110|ref|YP_004737563.1| sulfatase [Zobellia galactanivorans]
gi|339733907|emb|CAZ97284.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
Length = 511
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 57/101 (56%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHF-HGNTASNWPLRGTKYS 59
M+ ++D VG+++Q+L+E N+L+VF+SDNGG +F HG A+N+PL+GTK
Sbjct: 283 MVYAVDRGVGKIVQTLKETKQFDNTLIVFLSDNGG------NFNHG--ANNYPLKGTKGD 334
Query: 60 FHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA 99
EGG R W ++K Q + + D PT +A
Sbjct: 335 TWEGGYRVPMFFHWPKKIKKDQRFDFPVSSLDLYPTFTGLA 375
>gi|109899035|ref|YP_662290.1| sulfatase [Pseudoalteromonas atlantica T6c]
gi|109701316|gb|ABG41236.1| sulfatase [Pseudoalteromonas atlantica T6c]
Length = 724
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 35/98 (35%), Positives = 57/98 (58%), Gaps = 6/98 (6%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT--ASNWPLRGTKY 58
M+K D+++G + Q+L E + N++++F SDNGG + GNT SN+PL+G K
Sbjct: 261 MVKQFDDAIGVLWQALVEAKVEKNTIIIFTSDNGGNMYTVV---GNTHATSNFPLKGGKA 317
Query: 59 SFHEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTL 95
+ +EGG++ A+ P L + L N + D+ PTL
Sbjct: 318 TEYEGGLKVPTAVIWPGLTQPNTLSNTPIQTADFFPTL 355
>gi|149199413|ref|ZP_01876449.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149137491|gb|EDM25908.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 608
Score = 57.8 bits (138), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 56/101 (55%), Gaps = 2/101 (1%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++S+D+S+G+VI +LE L+ +++VF SDNGG + S+ SN P RG K S
Sbjct: 253 MVESMDDSIGKVIDALETNKLMEKTIIVFFSDNGG-NIHSVVDGTTATSNKPFRGGKASI 311
Query: 61 HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
+EGG A +W + G ++L+ D ++ +A
Sbjct: 312 YEGGTHVPAIVVWPNQTKTGVRNDSLIQSEDLYASILEMAA 352
>gi|299148032|ref|ZP_07041095.1| arylsulfatase A [Bacteroides sp. 3_1_23]
gi|298514215|gb|EFI38101.1| arylsulfatase A [Bacteroides sp. 3_1_23]
Length = 520
Score = 57.8 bits (138), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 57/104 (54%), Gaps = 10/104 (9%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLH-------FHGNTASNWPLRGTK 57
D VG ++ +LE+ GL N+L++ SDN GP VD + +G++ + PLRG K
Sbjct: 307 FDYCVGEILNTLEKLGLRENTLIILSSDN-GPVVDDGYDDKAEELLNGHSPAG-PLRGNK 364
Query: 58 YSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
YS EGG R A + W ++KG + L+ DWL +L + G
Sbjct: 365 YSAFEGGTRIPAIVSWPAGVKKGMTSDLLVSQVDWLASLASLTG 408
>gi|443734654|gb|ELU18562.1| hypothetical protein CAPTEDRAFT_195389, partial [Capitella teleta]
Length = 330
Score = 57.8 bits (138), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 44/76 (57%), Gaps = 8/76 (10%)
Query: 5 LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
LD VG+++ L E ++ N+ V+F+SDNG ++ N P RG K S EGG
Sbjct: 258 LDAMVGKIVDKLTENDMMNNTYVIFVSDNGADPIE--------GRNAPFRGGKGSLFEGG 309
Query: 65 VRNVAALWSPLLRKGQ 80
++ + ++SPLL+K +
Sbjct: 310 TKSSSFIYSPLLKKTE 325
>gi|393782821|ref|ZP_10371003.1| hypothetical protein HMPREF1071_01871 [Bacteroides salyersiae
CL02T12C01]
gi|392672206|gb|EIY65676.1| hypothetical protein HMPREF1071_01871 [Bacteroides salyersiae
CL02T12C01]
Length = 904
Score = 57.8 bits (138), Expect = 9e-07, Method: Composition-based stats.
Identities = 30/65 (46%), Positives = 38/65 (58%), Gaps = 8/65 (12%)
Query: 2 LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
LK +D VG ++Q L + LL N+LV+F SDNGG T +N PLRG K S +
Sbjct: 269 LKHIDTGVGEILQCLRDNDLLNNTLVIFTSDNGGAT--------GVTTNTPLRGGKTSMY 320
Query: 62 EGGVR 66
EGG R
Sbjct: 321 EGGNR 325
>gi|225013127|ref|ZP_03703540.1| sulfatase [Flavobacteria bacterium MS024-2A]
gi|225002747|gb|EEG40730.1| sulfatase [Flavobacteria bacterium MS024-2A]
Length = 509
Score = 57.8 bits (138), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 61/107 (57%), Gaps = 10/107 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSL----HFHGNTASNWPLRGT 56
M++S+DE++G V+ +EEE + ++V+F+SDNGG + + H H N PL
Sbjct: 283 MIESMDEALGSVLDFVEEEQIEKETVVLFMSDNGGLSAVARGGRKHIH-----NAPLNSG 337
Query: 57 KYSFHEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTLYFIAGKE 102
K S +EGG+R ++ P + K +E + I D+ P+L +AG E
Sbjct: 338 KGSIYEGGIRVPMMVYWPGVTKANSIEKTPVIIEDFFPSLLSVAGIE 384
>gi|390167238|ref|ZP_10219235.1| putative arylsulfatase A [Sphingobium indicum B90A]
gi|389590183|gb|EIM68184.1| putative arylsulfatase A [Sphingobium indicum B90A]
Length = 468
Score = 57.8 bits (138), Expect = 9e-07, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 51/101 (50%), Gaps = 10/101 (9%)
Query: 1 MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
M++ +D +VGRV+ L GL +++V+F SDNGG + +WPL G K
Sbjct: 263 MMEGMDANVGRVLAELAALGLQEDTIVIFTSDNGGERF---------SDSWPLTGMKGEL 313
Query: 61 HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
EGG+R + W L KG+ + D LPTL AG
Sbjct: 314 LEGGIRVPLIVSWPAHLPKGRDSAQVTMSMDALPTLLSAAG 354
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.136 0.413
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,801,250,933
Number of Sequences: 23463169
Number of extensions: 69097476
Number of successful extensions: 164576
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3840
Number of HSP's successfully gapped in prelim test: 2499
Number of HSP's that attempted gapping in prelim test: 156458
Number of HSP's gapped (non-prelim): 6628
length of query: 107
length of database: 8,064,228,071
effective HSP length: 75
effective length of query: 32
effective length of database: 6,304,490,396
effective search space: 201743692672
effective search space used: 201743692672
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 69 (31.2 bits)