BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy5200
         (271 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|380010998|ref|XP_003689601.1| PREDICTED: LOW QUALITY PROTEIN: extracellular sulfatase SULF-1
           homolog [Apis florea]
          Length = 1346

 Score =  421 bits (1082), Expect = e-115,   Method: Composition-based stats.
 Identities = 184/217 (84%), Positives = 208/217 (95%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNS+YYNYS+NMNG+KIKHG +Y  DYYPDLIAN
Sbjct: 112 GYFGKYLNKYNGSYIPPGWREWGGLIMNSRYYNYSVNMNGKKIKHGFEYSKDYYPDLIAN 171

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DSV FLRQSK +F++KP+MLV SFPAPHGPEDSAPQ++++FFNVT+HHTPAYDYAPNPDK
Sbjct: 172 DSVTFLRQSKHNFARKPVMLVASFPAPHGPEDSAPQFSHLFFNVTTHHTPAYDYAPNPDK 231

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVDDAV++I +ELKDLGEL+NTYIIYTSDH
Sbjct: 232 QWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDDAVDRIYQELKDLGELDNTYIIYTSDH 291

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI PGT+
Sbjct: 292 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGIEPGTI 328


>gi|110760118|ref|XP_393715.3| PREDICTED: extracellular sulfatase SULF-1 homolog [Apis mellifera]
          Length = 1436

 Score =  421 bits (1082), Expect = e-115,   Method: Composition-based stats.
 Identities = 184/217 (84%), Positives = 208/217 (95%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNS+YYNYS+NMNG+KIKHG +Y  DYYPDLIAN
Sbjct: 170 GYFGKYLNKYNGSYIPPGWREWGGLIMNSRYYNYSVNMNGKKIKHGFEYSKDYYPDLIAN 229

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DSV FLRQSK +F++KP+MLV SFPAPHGPEDSAPQ++++FFNVT+HHTPAYDYAPNPDK
Sbjct: 230 DSVTFLRQSKHNFARKPVMLVASFPAPHGPEDSAPQFSHLFFNVTTHHTPAYDYAPNPDK 289

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVDDAV++I +ELKDLGEL+NTYIIYTSDH
Sbjct: 290 QWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDDAVDRIYQELKDLGELDNTYIIYTSDH 349

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI PGT+
Sbjct: 350 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGIEPGTI 386


>gi|350410548|ref|XP_003489072.1| PREDICTED: extracellular sulfatase SULF-1 homolog [Bombus
           impatiens]
          Length = 1416

 Score =  420 bits (1079), Expect = e-115,   Method: Composition-based stats.
 Identities = 184/217 (84%), Positives = 208/217 (95%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNS+YYNYS+NMNG+KIKHG +Y  DYYPDLIAN
Sbjct: 165 GYFGKYLNKYNGSYIPPGWREWGGLIMNSRYYNYSVNMNGKKIKHGFEYSKDYYPDLIAN 224

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DSV FLRQSK +F++KP+MLV SFPAPHGPEDSAPQ++++FFNVT+HHTPAYDYAPNPDK
Sbjct: 225 DSVAFLRQSKHNFARKPVMLVASFPAPHGPEDSAPQFSHLFFNVTTHHTPAYDYAPNPDK 284

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVDDAV++I +ELKDLGEL+NTYIIYTSDH
Sbjct: 285 QWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDDAVDRIYQELKDLGELDNTYIIYTSDH 344

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI PGT+
Sbjct: 345 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGIEPGTI 381


>gi|340719524|ref|XP_003398201.1| PREDICTED: LOW QUALITY PROTEIN: extracellular sulfatase SULF-1
           homolog [Bombus terrestris]
          Length = 1416

 Score =  420 bits (1079), Expect = e-115,   Method: Composition-based stats.
 Identities = 184/217 (84%), Positives = 208/217 (95%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNS+YYNYS+NMNG+KIKHG +Y  DYYPDLIAN
Sbjct: 165 GYFGKYLNKYNGSYIPPGWREWGGLIMNSRYYNYSVNMNGKKIKHGFEYSKDYYPDLIAN 224

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DSV FLRQSK +F++KP+MLV SFPAPHGPEDSAPQ++++FFNVT+HHTPAYDYAPNPDK
Sbjct: 225 DSVAFLRQSKHNFARKPVMLVASFPAPHGPEDSAPQFSHLFFNVTTHHTPAYDYAPNPDK 284

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVDDAV++I +ELKDLGEL+NTYIIYTSDH
Sbjct: 285 QWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDDAVDRIYQELKDLGELDNTYIIYTSDH 344

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI PGT+
Sbjct: 345 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGIEPGTI 381


>gi|383862653|ref|XP_003706798.1| PREDICTED: extracellular sulfatase SULF-1 homolog [Megachile
           rotundata]
          Length = 1395

 Score =  419 bits (1077), Expect = e-115,   Method: Composition-based stats.
 Identities = 183/217 (84%), Positives = 208/217 (95%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNS+YYNYS+NMNG+KIKHG +Y  DYYPDLIAN
Sbjct: 164 GYFGKYLNKYNGSYIPPGWREWGGLIMNSRYYNYSVNMNGKKIKHGFEYSKDYYPDLIAN 223

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DSV FLRQSK +F++KP+MLV SFPAPHGPEDSAPQ++++FFNVT+HHTPAYDYAPNPDK
Sbjct: 224 DSVAFLRQSKHNFARKPVMLVASFPAPHGPEDSAPQFSHLFFNVTTHHTPAYDYAPNPDK 283

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVD+AV++I +ELKDLGEL+NTYIIYTSDH
Sbjct: 284 QWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDNAVDRIYQELKDLGELDNTYIIYTSDH 343

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI PGT+
Sbjct: 344 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGIEPGTI 380


>gi|345495672|ref|XP_001606010.2| PREDICTED: extracellular sulfatase SULF-1 homolog [Nasonia
           vitripennis]
          Length = 1355

 Score =  419 bits (1076), Expect = e-115,   Method: Composition-based stats.
 Identities = 183/217 (84%), Positives = 206/217 (94%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNS+YYNYS+NMNG+KIKHG +Y  DYYPDLIAN
Sbjct: 162 GYFGKYLNKYNGSYIPPGWREWGGLIMNSRYYNYSVNMNGKKIKHGFEYNKDYYPDLIAN 221

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DSV FLRQSK +F++KP+MLV SFPAPHGPEDSAPQY++MFFNVT+HHTPAYDYAPNPDK
Sbjct: 222 DSVAFLRQSKHNFARKPVMLVASFPAPHGPEDSAPQYSDMFFNVTTHHTPAYDYAPNPDK 281

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWILQVT+KMQP+H++FTDLLMTKRLQTLQSVD AVE+I +ELK LGEL+NTYIIYTSDH
Sbjct: 282 QWILQVTQKMQPIHKEFTDLLMTKRLQTLQSVDAAVERIYQELKSLGELDNTYIIYTSDH 341

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI PG++
Sbjct: 342 GYHLGQFGLIKGKSFPFEFDVRVPFLVRGPGIAPGSV 378


>gi|347969800|ref|XP_314283.5| AGAP003374-PA [Anopheles gambiae str. PEST]
 gi|333469277|gb|EAA09641.5| AGAP003374-PA [Anopheles gambiae str. PEST]
          Length = 1361

 Score =  410 bits (1053), Expect = e-112,   Method: Composition-based stats.
 Identities = 179/217 (82%), Positives = 202/217 (93%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSINMNG+KIKHG DY  DYYPDLIAN
Sbjct: 35  GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINMNGQKIKHGFDYAKDYYPDLIAN 94

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLRQSK    +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTPAYD+APNPDK
Sbjct: 95  DSIAFLRQSKHQNHRKPVLLAMSFPAPHGPEDSAPQYSHLFFNVTTHHTPAYDHAPNPDK 154

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT++M+P+HR+FTDLLMTKRLQTLQSVD AVE++ +ELK LGEL+NTYIIYTSDH
Sbjct: 155 QWILRVTQQMEPIHRKFTDLLMTKRLQTLQSVDVAVERVYQELKALGELDNTYIIYTSDH 214

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYHLGQFGL+KGKSFPFEFD+RVPFLMRGPGI P T+
Sbjct: 215 GYHLGQFGLIKGKSFPFEFDVRVPFLMRGPGIEPATV 251


>gi|270007459|gb|EFA03907.1| hypothetical protein TcasGA2_TC014039 [Tribolium castaneum]
          Length = 1049

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 180/217 (82%), Positives = 206/217 (94%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSY+PPGWREWG LIMNSKYYNYSINMNG+KIKHG DY+ DYYPDLIAN
Sbjct: 141 GYFGKYLNKYNGSYVPPGWREWGGLIMNSKYYNYSINMNGKKIKHGFDYHKDYYPDLIAN 200

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLRQSK++ + KP+ML MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YDYAPNPDK
Sbjct: 201 DSIAFLRQSKKNSAHKPVMLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDYAPNPDK 260

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVD AV+ IV+EL+ LGEL+NTYIIYTSDH
Sbjct: 261 QWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDAAVQLIVEELRALGELDNTYIIYTSDH 320

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYHLGQFGL+KGKSFPFEFD+R+PFL+RGPG+ PGT+
Sbjct: 321 GYHLGQFGLIKGKSFPFEFDVRIPFLVRGPGVEPGTV 357


>gi|189237136|ref|XP_973189.2| PREDICTED: similar to Sulfated CG6725-PA [Tribolium castaneum]
          Length = 1049

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 180/217 (82%), Positives = 206/217 (94%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSY+PPGWREWG LIMNSKYYNYSINMNG+KIKHG DY+ DYYPDLIAN
Sbjct: 141 GYFGKYLNKYNGSYVPPGWREWGGLIMNSKYYNYSINMNGKKIKHGFDYHKDYYPDLIAN 200

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLRQSK++ + KP+ML MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YDYAPNPDK
Sbjct: 201 DSIAFLRQSKKNSAHKPVMLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDYAPNPDK 260

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVD AV+ IV+EL+ LGEL+NTYIIYTSDH
Sbjct: 261 QWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDAAVQLIVEELRALGELDNTYIIYTSDH 320

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYHLGQFGL+KGKSFPFEFD+R+PFL+RGPG+ PGT+
Sbjct: 321 GYHLGQFGLIKGKSFPFEFDVRIPFLVRGPGVEPGTV 357


>gi|242005522|ref|XP_002423613.1| sulfatase-1, sulf-1, putative [Pediculus humanus corporis]
 gi|212506773|gb|EEB10875.1| sulfatase-1, sulf-1, putative [Pediculus humanus corporis]
          Length = 1037

 Score =  400 bits (1027), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 178/217 (82%), Positives = 201/217 (92%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNS+YYNYSIN+NG+KIKHG DY  DYYPDLIAN
Sbjct: 176 GYFGKYLNKYNGSYIPPGWREWGGLIMNSRYYNYSINLNGKKIKHGFDYSKDYYPDLIAN 235

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLRQSKQ FS+KP+MLV+SFPAPHGPEDSAPQY+ MFFNVT+HHTP+YD+APN DK
Sbjct: 236 DSIAFLRQSKQYFSRKPVMLVVSFPAPHGPEDSAPQYSKMFFNVTTHHTPSYDFAPNLDK 295

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT KMQ +H+QFTD LMTKRLQTLQSVD AVE+I +ELK++GEL NTYI+YTSDH
Sbjct: 296 QWILRVTNKMQDIHKQFTDTLMTKRLQTLQSVDSAVERIYQELKNIGELENTYILYTSDH 355

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYHLGQFGLVKGKSFPFEFDIRVPFL+RGPG+  G++
Sbjct: 356 GYHLGQFGLVKGKSFPFEFDIRVPFLVRGPGVEAGSI 392


>gi|218506045|gb|ACK77664.1| RE44752p [Drosophila melanogaster]
          Length = 1114

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 173/212 (81%), Positives = 197/212 (92%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY  DYYPDLIAN
Sbjct: 149 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 208

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLR SKQ   +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 209 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 268

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT  MQPVH++FT+LLMTKRLQTLQSVD AVE++  ELK+LGEL+NTYI+YTSDH
Sbjct: 269 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 328

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 329 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 360


>gi|195451083|ref|XP_002072760.1| GK13512 [Drosophila willistoni]
 gi|194168845|gb|EDW83746.1| GK13512 [Drosophila willistoni]
          Length = 1148

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 173/212 (81%), Positives = 196/212 (92%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY  DYYPDLIAN
Sbjct: 153 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 212

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLR SKQ   +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 213 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 272

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT  MQPVH++FT+LLMTKRLQTLQSVD AVE++  ELK LGEL+NTYI+YTSDH
Sbjct: 273 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKALGELDNTYIVYTSDH 332

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 333 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 364


>gi|24647401|ref|NP_524987.1| sulfated, isoform A [Drosophila melanogaster]
 gi|442619385|ref|NP_001262626.1| sulfated, isoform B [Drosophila melanogaster]
 gi|442619387|ref|NP_001262627.1| sulfated, isoform C [Drosophila melanogaster]
 gi|33112451|sp|Q9VEX0.1|SULF1_DROME RecName: Full=Extracellular sulfatase SULF-1 homolog;
           Short=DmSulf-1; Flags: Precursor
 gi|7300128|gb|AAF55296.1| sulfated, isoform A [Drosophila melanogaster]
 gi|270289773|gb|ACZ67866.1| FI12873p [Drosophila melanogaster]
 gi|440217485|gb|AGB96006.1| sulfated, isoform B [Drosophila melanogaster]
 gi|440217486|gb|AGB96007.1| sulfated, isoform C [Drosophila melanogaster]
          Length = 1114

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 173/212 (81%), Positives = 197/212 (92%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY  DYYPDLIAN
Sbjct: 149 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 208

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLR SKQ   +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 209 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 268

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT  MQPVH++FT+LLMTKRLQTLQSVD AVE++  ELK+LGEL+NTYI+YTSDH
Sbjct: 269 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 328

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 329 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 360


>gi|195501026|ref|XP_002097625.1| GE26324 [Drosophila yakuba]
 gi|194183726|gb|EDW97337.1| GE26324 [Drosophila yakuba]
          Length = 1108

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 173/212 (81%), Positives = 197/212 (92%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY  DYYPDLIAN
Sbjct: 149 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 208

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLR SKQ   +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 209 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 268

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT  MQPVH++FT+LLMTKRLQTLQSVD AVE++  ELK+LGEL+NTYI+YTSDH
Sbjct: 269 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 328

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 329 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 360


>gi|194901252|ref|XP_001980166.1| GG20070 [Drosophila erecta]
 gi|190651869|gb|EDV49124.1| GG20070 [Drosophila erecta]
          Length = 1115

 Score =  390 bits (1003), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 173/212 (81%), Positives = 197/212 (92%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY  DYYPDLIAN
Sbjct: 150 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 209

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLR SKQ   +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 210 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 269

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT  MQPVH++FT+LLMTKRLQTLQSVD AVE++  ELK+LGEL+NTYI+YTSDH
Sbjct: 270 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 329

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 330 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 361


>gi|21429186|gb|AAM50312.1| SD04414p [Drosophila melanogaster]
          Length = 1006

 Score =  390 bits (1003), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 173/212 (81%), Positives = 197/212 (92%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY  DYYPDLIAN
Sbjct: 41  GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 100

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLR SKQ   +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 101 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 160

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT  MQPVH++FT+LLMTKRLQTLQSVD AVE++  ELK+LGEL+NTYI+YTSDH
Sbjct: 161 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 220

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 221 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 252


>gi|195152788|ref|XP_002017318.1| GL22250 [Drosophila persimilis]
 gi|194112375|gb|EDW34418.1| GL22250 [Drosophila persimilis]
          Length = 1173

 Score =  390 bits (1003), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 173/212 (81%), Positives = 197/212 (92%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY  DYYPDLIAN
Sbjct: 163 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 222

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLR SKQ   +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 223 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 282

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT  MQPVH++FT+LLMTKRLQTLQSVD AVE++  EL+DLGEL+NTYI+YTSDH
Sbjct: 283 QWILRVTDPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELRDLGELDNTYIVYTSDH 342

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 343 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 374


>gi|198454076|ref|XP_001359462.2| GA19814 [Drosophila pseudoobscura pseudoobscura]
 gi|198132640|gb|EAL28608.2| GA19814 [Drosophila pseudoobscura pseudoobscura]
          Length = 1160

 Score =  390 bits (1003), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 173/212 (81%), Positives = 197/212 (92%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY  DYYPDLIAN
Sbjct: 162 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 221

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLR SKQ   +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 222 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 281

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT  MQPVH++FT+LLMTKRLQTLQSVD AVE++  EL+DLGEL+NTYI+YTSDH
Sbjct: 282 QWILRVTDPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELRDLGELDNTYIVYTSDH 341

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 342 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 373


>gi|195349414|ref|XP_002041240.1| GM15446 [Drosophila sechellia]
 gi|194122845|gb|EDW44888.1| GM15446 [Drosophila sechellia]
          Length = 1114

 Score =  390 bits (1002), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 173/212 (81%), Positives = 197/212 (92%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY  DYYPDLIAN
Sbjct: 149 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 208

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLR SKQ   +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 209 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 268

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT  MQPVH++FT+LLMTKRLQTLQSVD AVE++  ELK+LGEL+NTYI+YTSDH
Sbjct: 269 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 328

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 329 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 360


>gi|195570418|ref|XP_002103204.1| GD20299 [Drosophila simulans]
 gi|194199131|gb|EDX12707.1| GD20299 [Drosophila simulans]
          Length = 1114

 Score =  390 bits (1001), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 173/212 (81%), Positives = 197/212 (92%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY  DYYPDLIAN
Sbjct: 149 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 208

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLR SKQ   +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 209 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 268

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT  MQPVH++FT+LLMTKRLQTLQSVD AVE++  ELK+LGEL+NTYI+YTSDH
Sbjct: 269 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 328

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 329 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 360


>gi|195388950|ref|XP_002053141.1| GJ23505 [Drosophila virilis]
 gi|194151227|gb|EDW66661.1| GJ23505 [Drosophila virilis]
          Length = 1135

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 171/212 (80%), Positives = 196/212 (92%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY  DYYPDLIAN
Sbjct: 164 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 223

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLR SKQ   +KP++L MSFPAPHGPEDSAPQY+++F+NVT+HHTP+YD+APNPDK
Sbjct: 224 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFYNVTTHHTPSYDHAPNPDK 283

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT  MQPVH++FT++LMTKRLQTLQSVD AVE++  ELK LGEL+NTYI+YTSDH
Sbjct: 284 QWILRVTEPMQPVHKRFTNILMTKRLQTLQSVDVAVERVYNELKALGELDNTYIVYTSDH 343

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 344 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 375


>gi|194744781|ref|XP_001954871.1| GF16526 [Drosophila ananassae]
 gi|190627908|gb|EDV43432.1| GF16526 [Drosophila ananassae]
          Length = 1145

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 172/212 (81%), Positives = 196/212 (92%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KI+HG DY  DYYPDLIAN
Sbjct: 154 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIRHGFDYAKDYYPDLIAN 213

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLR SKQ   +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 214 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 273

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT  MQPVH++FT+LLMTKRLQTLQSVD AVE++  ELK LGEL+NTYI+YTSDH
Sbjct: 274 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKSLGELDNTYIVYTSDH 333

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 334 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 365


>gi|328703992|ref|XP_001948094.2| PREDICTED: extracellular sulfatase SULF-1 homolog [Acyrthosiphon
           pisum]
          Length = 929

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 169/216 (78%), Positives = 195/216 (90%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSY+PPGWR+WG LIMNSKYYNYS+N+NG+KIKHG+DY+ DYYP+L+ N
Sbjct: 170 GYFGKYLNKYNGSYVPPGWRQWGGLIMNSKYYNYSVNLNGKKIKHGDDYHKDYYPNLVTN 229

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FL+ +KQ    +P MLVMSFP+PHGPEDSAP+Y++MFFNVTSHHTP YDYAPNPDK
Sbjct: 230 DSINFLQHAKQHHPHRPFMLVMSFPSPHGPEDSAPEYSDMFFNVTSHHTPTYDYAPNPDK 289

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWILQ T +M P+H++FT+LLMTKRLQTLQSVDDAV ++  ELK LGEL NTYIIYTSDH
Sbjct: 290 QWILQATGQMLPIHKKFTNLLMTKRLQTLQSVDDAVSRVYGELKQLGELENTYIIYTSDH 349

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           GYHLGQFGLVKGKSFPFEFD+RVP L+RGPGI PGT
Sbjct: 350 GYHLGQFGLVKGKSFPFEFDVRVPMLVRGPGIEPGT 385


>gi|195110209|ref|XP_001999674.1| GI24652 [Drosophila mojavensis]
 gi|193916268|gb|EDW15135.1| GI24652 [Drosophila mojavensis]
          Length = 1155

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 171/212 (80%), Positives = 194/212 (91%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY  DYYPDLIAN
Sbjct: 164 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 223

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLR SKQ   +KP++L MSFPAPHGPEDSAPQY+++F NVT+HHTP+YD+APNPDK
Sbjct: 224 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFHNVTTHHTPSYDHAPNPDK 283

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT  MQPVH +FT++LMTKRLQTLQSVD AVE++  ELK LGEL+NTYI+YTSDH
Sbjct: 284 QWILRVTEPMQPVHMRFTNILMTKRLQTLQSVDVAVERVYNELKALGELDNTYIVYTSDH 343

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 344 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 375


>gi|195038641|ref|XP_001990765.1| GH18078 [Drosophila grimshawi]
 gi|193894961|gb|EDV93827.1| GH18078 [Drosophila grimshawi]
          Length = 1191

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 170/212 (80%), Positives = 195/212 (91%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY  DYYPDLIAN
Sbjct: 176 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 235

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLR SKQ   +KP++L MSFPAPHGPEDSAPQY+++F+NVT+HHTP+YD+APNPDK
Sbjct: 236 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFYNVTTHHTPSYDHAPNPDK 295

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT  M PVH++FT++LMTKRLQTLQSVD AVE++  ELK LGEL+NTYI+YTSDH
Sbjct: 296 QWILRVTDPMLPVHKRFTNVLMTKRLQTLQSVDVAVERVYNELKALGELDNTYIVYTSDH 355

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 356 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 387


>gi|321466795|gb|EFX77788.1| hypothetical protein DAPPUDRAFT_305315 [Daphnia pulex]
          Length = 856

 Score =  361 bits (927), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 163/217 (75%), Positives = 186/217 (85%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSY+PPGWR W   +MNS+YYNY+IN NG+KIKHG DY  DYYPDLI N
Sbjct: 76  GYFGKYLNKYNGSYVPPGWRVWAGQVMNSRYYNYTINFNGQKIKHGADYQTDYYPDLITN 135

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ FLR+SKQ +  +P  LVMSFP+PHGPEDSAPQY++MFFNVT H TP+YD+APNPDK
Sbjct: 136 DSISFLRESKQHYDHQPFHLVMSFPSPHGPEDSAPQYSDMFFNVTRHRTPSYDFAPNPDK 195

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+   KM+PVH +FTDLLMTKRLQTLQSVD+AVEKI  EL  LG+L NTYI YTSDH
Sbjct: 196 QWILRWLGKMKPVHHRFTDLLMTKRLQTLQSVDEAVEKIFSELVALGQLENTYIFYTSDH 255

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYHLGQFGL+KGKSFPFE DIRVPFL+RGPG+  G +
Sbjct: 256 GYHLGQFGLIKGKSFPFESDIRVPFLVRGPGVGRGRV 292


>gi|307202326|gb|EFN81784.1| Extracellular sulfatase SULF-1-like protein [Harpegnathos saltator]
          Length = 1241

 Score =  361 bits (927), Expect = 2e-97,   Method: Composition-based stats.
 Identities = 157/191 (82%), Positives = 181/191 (94%)

Query: 46  MNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPA 105
           MNS+YYNYS+NMNG+KIKHG +Y  DYYPDLIANDSV FLRQSK +F++KP+MLV SFPA
Sbjct: 1   MNSRYYNYSVNMNGKKIKHGFEYSRDYYPDLIANDSVAFLRQSKHNFARKPVMLVASFPA 60

Query: 106 PHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL 165
           PHGPEDSAPQY+++FFNVT+HHTPAYDYAPNPDKQWILQVT+KMQP+H+QFTDLLMTKRL
Sbjct: 61  PHGPEDSAPQYSHLFFNVTTHHTPAYDYAPNPDKQWILQVTQKMQPIHKQFTDLLMTKRL 120

Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
           QTLQSVD AV++I +ELKDLGEL+NTYIIYTSDHGYHLGQFGL+KGKSFPFEFD+RVPFL
Sbjct: 121 QTLQSVDAAVDRIYQELKDLGELDNTYIIYTSDHGYHLGQFGLIKGKSFPFEFDVRVPFL 180

Query: 226 MRGPGIVPGTM 236
           +RGPGI  G++
Sbjct: 181 IRGPGIEAGSV 191


>gi|332031465|gb|EGI70950.1| Extracellular sulfatase SULF-1-like protein [Acromyrmex echinatior]
          Length = 1215

 Score =  356 bits (913), Expect = 7e-96,   Method: Compositional matrix adjust.
 Identities = 158/191 (82%), Positives = 182/191 (95%)

Query: 46  MNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPA 105
           MNS+YYNYS+NMNG+KIKHG DY  DYYPDLIANDSV FLRQSK +F++KP+MLV SFPA
Sbjct: 1   MNSRYYNYSVNMNGKKIKHGFDYSRDYYPDLIANDSVAFLRQSKHNFARKPVMLVASFPA 60

Query: 106 PHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL 165
           PHGPEDSAPQ++++FFNVT+HHTPAYDYAPNPDKQWILQVT+KMQP+H+QFTDLLMTKRL
Sbjct: 61  PHGPEDSAPQFSHLFFNVTTHHTPAYDYAPNPDKQWILQVTQKMQPIHKQFTDLLMTKRL 120

Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
           QTLQSVD AV++I +ELKDLGEL+NTYIIYTSDHGYHLGQFGL+KGKSFPFEFD+RVPFL
Sbjct: 121 QTLQSVDSAVDRIYQELKDLGELDNTYIIYTSDHGYHLGQFGLIKGKSFPFEFDVRVPFL 180

Query: 226 MRGPGIVPGTM 236
           +RGPGI PG++
Sbjct: 181 IRGPGIEPGSI 191


>gi|6941989|gb|AAF32278.1| SULF1 [Drosophila melanogaster]
          Length = 983

 Score =  353 bits (906), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 160/212 (75%), Positives = 185/212 (87%), Gaps = 1/212 (0%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+K KHG DY  DYYPDLIAN
Sbjct: 46  GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKXKHGFDYAKDYYPDLIAN 105

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
            S+ FLR SKQ   +KP++L MSFPAPHGPEDS P  +++ +     H+ +YD+APNPDK
Sbjct: 106 XSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSLPVLSSLLYCDNPSHS-SYDHAPNPDK 164

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWIL+VT  MQPVH++FT+LLMTKRLQTLQSVD AVE++  ELK+LGEL+NTYI+YTSDH
Sbjct: 165 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 224

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 225 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 256


>gi|443730659|gb|ELU16083.1| hypothetical protein CAPTEDRAFT_174568 [Capitella teleta]
          Length = 923

 Score =  345 bits (885), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 152/215 (70%), Positives = 186/215 (86%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLN+Y G+YIPPGW EW  LI N+++YNYS+N NG+KIKHG++YY DY  DLIAN
Sbjct: 152 GYFGKYLNEYTGTYIPPGWTEWVGLIRNTRFYNYSLNFNGKKIKHGDNYYKDYLTDLIAN 211

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DSV FL+QSKQ F  +P+ +V+S PAPHGPED+APQY +MF N+T+H TP+Y++APNPDK
Sbjct: 212 DSVTFLKQSKQYFPNRPVAMVLSMPAPHGPEDAAPQYQHMFPNITTHRTPSYNFAPNPDK 271

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWILQ T KM+P+H +FTD+L  KRLQTLQSVDDAVEK+  ELK+LGEL+NTY IY++DH
Sbjct: 272 QWILQYTGKMEPIHMEFTDMLHRKRLQTLQSVDDAVEKVFNELKELGELDNTYFIYSADH 331

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           GYHLGQ+GLVKGK+ P++FDIRVPFLMRGP I PG
Sbjct: 332 GYHLGQYGLVKGKAMPYDFDIRVPFLMRGPKIPPG 366


>gi|427780443|gb|JAA55673.1| Putative sulfatase [Rhipicephalus pulchellus]
          Length = 474

 Score =  343 bits (880), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 159/245 (64%), Positives = 198/245 (80%), Gaps = 5/245 (2%)

Query: 3   WPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMN 58
           W  E  SR+   +  + GY    FGKYLN+YNG++IP GWREW AL+ NS++YNY++N+N
Sbjct: 171 WQQEFESRSFATYLSAAGYRTAYFGKYLNEYNGNHIPTGWREWAALVRNSRFYNYTVNVN 230

Query: 59  GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYAN 118
           G K KHG DY  DYYPDL+ NDSV FLRQSKQ F  +P+M+V+S+P+PHGPEDSAPQY +
Sbjct: 231 GHKRKHGLDYARDYYPDLVTNDSVAFLRQSKQLFPNRPVMMVVSYPSPHGPEDSAPQYQH 290

Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
           +F NVT+H TP+++YAPN DKQW+L+ T +M+P+H +FTD+L TKRLQTLQ+VDDAVEK+
Sbjct: 291 LFHNVTTHRTPSWNYAPNMDKQWLLRYTGQMEPIHIKFTDMLHTKRLQTLQTVDDAVEKL 350

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
             EL  LGELNNTYI YTSDHGYHLGQFGLVKGKS PFEFDIRVPF +RGP + PGT  L
Sbjct: 351 YNELLSLGELNNTYIFYTSDHGYHLGQFGLVKGKSMPFEFDIRVPFYVRGPKVPPGTR-L 409

Query: 239 LDVLI 243
            D+++
Sbjct: 410 KDIVL 414


>gi|241153279|ref|XP_002407031.1| sulfatase-1, sulf-1, putative [Ixodes scapularis]
 gi|215494013|gb|EEC03654.1| sulfatase-1, sulf-1, putative [Ixodes scapularis]
          Length = 630

 Score =  341 bits (875), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 153/237 (64%), Positives = 194/237 (81%), Gaps = 4/237 (1%)

Query: 3   WPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMN 58
           W  E  SR+   +  S GY    FGKYLN+YNG++IP GWREW AL+ NS++YNY++++N
Sbjct: 60  WQQEFESRSFATYLSSAGYRTAYFGKYLNEYNGNHIPTGWREWAALVRNSRFYNYTVSVN 119

Query: 59  GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYAN 118
           G K KHGEDY  DYYPDL+AN+S+ FLRQSKQ F  +P+M+V+SFP+PHGPEDSAPQY +
Sbjct: 120 GHKRKHGEDYARDYYPDLVANESLAFLRQSKQLFPTRPVMMVVSFPSPHGPEDSAPQYQH 179

Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
           +F NVT+H TP++++APN DKQW+L+ T +M+P+H +FTD+L TKRLQTLQ+VDDAVE++
Sbjct: 180 LFHNVTTHRTPSWNHAPNQDKQWLLRYTGQMEPIHIRFTDMLHTKRLQTLQTVDDAVERL 239

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
            KEL  LGEL++TYI YTSDHGYHLGQFGLVKGKS PFEFDIRVPF +RGP +   T
Sbjct: 240 YKELVSLGELDDTYIFYTSDHGYHLGQFGLVKGKSMPFEFDIRVPFFVRGPKVPRAT 296


>gi|357615219|gb|EHJ69539.1| hypothetical protein KGM_19329 [Danaus plexippus]
          Length = 1117

 Score =  327 bits (838), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 144/180 (80%), Positives = 168/180 (93%)

Query: 57  MNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY 116
           MNG++IKHG+DY  DYYPDLIANDS+ FLR SK+ FS+KP++LVMSFPAPHGPEDSAPQY
Sbjct: 1   MNGKRIKHGDDYNKDYYPDLIANDSIAFLRASKRRFSRKPVLLVMSFPAPHGPEDSAPQY 60

Query: 117 ANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVE 176
           +++FFNVT+HHTP YD APNPDKQWIL+VT KM+P+HRQFTDLLMTKRLQTLQSVD AVE
Sbjct: 61  SHLFFNVTTHHTPTYDMAPNPDKQWILRVTEKMKPIHRQFTDLLMTKRLQTLQSVDVAVE 120

Query: 177 KIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           ++ +ELK LGEL+NTY++YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL+RGPG+ PGT+
Sbjct: 121 RVYQELKALGELDNTYLVYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLVRGPGVEPGTV 180


>gi|405952082|gb|EKC19933.1| Extracellular sulfatase Sulf-1 [Crassostrea gigas]
          Length = 914

 Score =  325 bits (834), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 144/214 (67%), Positives = 181/214 (84%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLN+YNG+YIPPGWREW  LI N+++YNY++N NG K+KH ++YY DY+ DLIAN
Sbjct: 90  GYFGKYLNEYNGTYIPPGWREWVGLIKNTRFYNYTVNFNGNKMKHEDNYYADYFTDLIAN 149

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DSV FL+ SKQ F K+P+++V+S PAPHGPEDSAPQY ++F+N T H TP ++ APNPDK
Sbjct: 150 DSVTFLKNSKQYFPKRPVLMVLSVPAPHGPEDSAPQYQHLFYNNTLHRTPTWNMAPNPDK 209

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           Q++L+VT KM+P+ + FTD+L  KRLQTLQSVDD VEK+ +EL +L EL NTYIIYTSDH
Sbjct: 210 QFLLKVTGKMEPIEQTFTDVLQQKRLQTLQSVDDLVEKVYEELWNLDELENTYIIYTSDH 269

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           GYHLGQFGLVKGK+ P++FD+RVP  +RGPGI P
Sbjct: 270 GYHLGQFGLVKGKALPYDFDVRVPLYVRGPGIKP 303


>gi|391332766|ref|XP_003740800.1| PREDICTED: extracellular sulfatase Sulf-2-like [Metaseiulus
           occidentalis]
          Length = 761

 Score =  323 bits (829), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 145/227 (63%), Positives = 184/227 (81%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLN+YNG+++P GW+EW  L+ NSK+YNY++++NG K+KHG DY  DY PDLI N
Sbjct: 124 GYFGKYLNEYNGNHVPLGWKEWLGLVRNSKFYNYTVSVNGNKVKHGNDYSKDYLPDLITN 183

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
            S+ FLR+SKQ +  KP+++VM++P PHGPEDSAPQY NMF N T H TP+++YAPN DK
Sbjct: 184 HSLSFLRRSKQFYPDKPVLMVMAYPGPHGPEDSAPQYQNMFLNETQHRTPSWNYAPNYDK 243

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QWILQ T KMQ +H +FTD+L TKRLQTLQSVDDAV++++ ELK L E++NT+I YTSDH
Sbjct: 244 QWILQHTGKMQDIHMKFTDMLNTKRLQTLQSVDDAVDQLITELKRLDEVDNTFIFYTSDH 303

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYHLGQFGLVKGK+ PFEFD+RVPF +RGP I  G      + IP++
Sbjct: 304 GYHLGQFGLVKGKAMPFEFDVRVPFYVRGPKISRG------IQIPEI 344


>gi|51467917|ref|NP_001003846.1| extracellular sulfatase Sulf-1 precursor [Danio rerio]
 gi|37812238|gb|AAR04055.1| sulfatase FP1c [Danio rerio]
          Length = 1099

 Score =  309 bits (792), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 141/238 (59%), Positives = 178/238 (74%), Gaps = 4/238 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +   R+   +  + GY    FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  
Sbjct: 115 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 174

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY+ DLI NDS+ + R SK+ F  +P+M+V+S  APHGPEDSAPQY+
Sbjct: 175 NGNKEKHGADYAKDYFTDLITNDSINYFRTSKRMFPHRPVMMVISHAAPHGPEDSAPQYS 234

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPN DK WI+Q T  M+P+H +FT+ L  KRLQTL SVDD+VEK
Sbjct: 235 ELFPNASQHITPSYNYAPNMDKHWIMQYTGPMKPIHMEFTNYLHRKRLQTLMSVDDSVEK 294

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           +   L D GEL+NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG 
Sbjct: 295 VYNALVDTGELDNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFVRGPNVEPGA 352


>gi|37812240|gb|AAR04056.1| sulfatase FP1a [Danio rerio]
          Length = 874

 Score =  309 bits (792), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 141/238 (59%), Positives = 178/238 (74%), Gaps = 4/238 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +   R+   +  + GY    FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  
Sbjct: 115 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 174

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY+ DLI NDS+ + R SK+ F  +P+M+V+S  APHGPEDSAPQY+
Sbjct: 175 NGNKEKHGADYAKDYFTDLITNDSINYFRTSKRMFPHRPVMMVISHAAPHGPEDSAPQYS 234

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPN DK WI+Q T  M+P+H +FT+ L  KRLQTL SVDD+VEK
Sbjct: 235 ELFPNASQHITPSYNYAPNMDKHWIMQYTGPMKPIHMEFTNYLHRKRLQTLMSVDDSVEK 294

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           +   L D GEL+NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG 
Sbjct: 295 VYNALVDTGELDNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFVRGPNVEPGA 352


>gi|348511948|ref|XP_003443505.1| PREDICTED: extracellular sulfatase Sulf-1-like [Oreochromis
           niloticus]
          Length = 1165

 Score =  309 bits (792), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 142/239 (59%), Positives = 178/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +   R+   +    GY    FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  
Sbjct: 117 SWQAQHEPRSFAVYLNETGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY+ DLI NDS+ F R SK+ +  +P+M+V+S  APHGPEDSAPQYA
Sbjct: 177 NGYKEKHGADYAKDYFTDLITNDSISFFRMSKKMYPHRPVMMVVSHAAPHGPEDSAPQYA 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
             F N + H TP+Y+YAPN DK WI+Q T  M+P+H +FT+ L  KRLQTL SVDD+V+K
Sbjct: 237 EFFPNASQHITPSYNYAPNMDKHWIMQYTGPMKPIHMEFTNFLHRKRLQTLMSVDDSVQK 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L++ GEL+NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF MRGP + PG +
Sbjct: 297 IYDMLEETGELDNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFMRGPNVEPGAV 355


>gi|37812242|gb|AAR04057.1| sulfatase FP1b [Danio rerio]
          Length = 892

 Score =  309 bits (791), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 141/238 (59%), Positives = 178/238 (74%), Gaps = 4/238 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +   R+   +  + GY    FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  
Sbjct: 115 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 174

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY+ DLI NDS+ + R SK+ F  +P+M+V+S  APHGPEDSAPQY+
Sbjct: 175 NGNKEKHGADYAKDYFTDLITNDSINYFRTSKRMFPHRPVMMVISHAAPHGPEDSAPQYS 234

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPN DK WI+Q T  M+P+H +FT+ L  KRLQTL SVDD+VEK
Sbjct: 235 ELFPNASQHITPSYNYAPNMDKHWIMQYTGPMKPIHMEFTNYLHRKRLQTLMSVDDSVEK 294

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           +   L D GEL+NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG 
Sbjct: 295 VYNALVDTGELDNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFVRGPNVEPGA 352


>gi|427794433|gb|JAA62668.1| Putative sulfatase, partial [Rhipicephalus pulchellus]
          Length = 603

 Score =  308 bits (790), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 147/245 (60%), Positives = 184/245 (75%), Gaps = 21/245 (8%)

Query: 3   WPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMN 58
           W  E  SR+   +  + GY    FGKYLN+YNG++IP GWREW AL+ NS++YNY++N+N
Sbjct: 152 WQQEFESRSFATYLSAAGYRTAYFGKYLNEYNGNHIPTGWREWAALVRNSRFYNYTVNVN 211

Query: 59  GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYAN 118
           G K KHG DY  DYYPDL+ NDSV FLRQSKQ F  +P+M+V+S+P+PHG          
Sbjct: 212 GHKRKHGLDYARDYYPDLVTNDSVAFLRQSKQLFPNRPVMMVVSYPSPHG---------- 261

Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
                 +H TP+++YAPN DKQW+L+ T +M+P+H +FTD+L TKRLQTLQ+VDDAVEK+
Sbjct: 262 ------THRTPSWNYAPNMDKQWLLRYTGQMEPIHIKFTDMLHTKRLQTLQTVDDAVEKL 315

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
             EL  LGELNNTYI YTSDHGYHLGQFGLVKGKS PFEFDIRVPF +RGP + PGT  L
Sbjct: 316 YNELLSLGELNNTYIFYTSDHGYHLGQFGLVKGKSMPFEFDIRVPFYVRGPKVPPGTR-L 374

Query: 239 LDVLI 243
            D+++
Sbjct: 375 KDIVL 379


>gi|432930070|ref|XP_004081305.1| PREDICTED: extracellular sulfatase Sulf-1-like [Oryzias latipes]
          Length = 1150

 Score =  307 bits (787), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 141/239 (58%), Positives = 178/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +   R+   +  + GY    FGKYLN+YNGSYIPPGWREW  L+ NS++YNY++  
Sbjct: 117 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLVKNSRFYNYTVCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY+ DLI NDS+ F R SK+ +  +P+M+V+S  APHGPEDSAPQYA
Sbjct: 177 NGYKEKHGADYAKDYFTDLITNDSISFFRMSKKMYPHRPVMMVISHAAPHGPEDSAPQYA 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
             F N + H TP+Y+YAPN DK WI+Q T  M+P+H +FT+ L  KRLQTL SVDD+V++
Sbjct: 237 EHFPNASQHITPSYNYAPNMDKHWIMQYTGPMKPIHMEFTNFLHRKRLQTLMSVDDSVQR 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L D GEL NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PGT+
Sbjct: 297 IHDMLADTGELQNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFLRGPNVEPGTV 355


>gi|317419889|emb|CBN81925.1| Extracellular sulfatase Sulf-1 [Dicentrarchus labrax]
          Length = 843

 Score =  307 bits (786), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 140/239 (58%), Positives = 180/239 (75%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +   R+   +  + GY    FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  
Sbjct: 117 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY+ DLI NDS+ F R SK+ +  +P+M+V+S  APHGPEDSAPQYA
Sbjct: 177 NGYKEKHGADYSKDYFTDLITNDSINFFRVSKRMYPHRPVMMVISHAAPHGPEDSAPQYA 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
             F N + H TP+Y+YAPN DK WI+Q T  M+P+H +FT+ L  KRLQTL SVDD+V+K
Sbjct: 237 EHFPNASQHITPSYNYAPNMDKHWIMQYTGPMRPIHMEFTNFLHRKRLQTLMSVDDSVQK 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           + + L++ GEL+NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG +
Sbjct: 297 VYEMLEETGELDNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFLRGPNVEPGAV 355


>gi|317419888|emb|CBN81924.1| Extracellular sulfatase Sulf-1 [Dicentrarchus labrax]
          Length = 1124

 Score =  307 bits (786), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 140/239 (58%), Positives = 180/239 (75%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +   R+   +  + GY    FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  
Sbjct: 117 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY+ DLI NDS+ F R SK+ +  +P+M+V+S  APHGPEDSAPQYA
Sbjct: 177 NGYKEKHGADYSKDYFTDLITNDSINFFRVSKRMYPHRPVMMVISHAAPHGPEDSAPQYA 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
             F N + H TP+Y+YAPN DK WI+Q T  M+P+H +FT+ L  KRLQTL SVDD+V+K
Sbjct: 237 EHFPNASQHITPSYNYAPNMDKHWIMQYTGPMRPIHMEFTNFLHRKRLQTLMSVDDSVQK 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           + + L++ GEL+NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG +
Sbjct: 297 VYEMLEETGELDNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFLRGPNVEPGAV 355


>gi|317419890|emb|CBN81926.1| Extracellular sulfatase Sulf-1 [Dicentrarchus labrax]
          Length = 832

 Score =  306 bits (785), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 140/239 (58%), Positives = 180/239 (75%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +   R+   +  + GY    FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  
Sbjct: 117 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY+ DLI NDS+ F R SK+ +  +P+M+V+S  APHGPEDSAPQYA
Sbjct: 177 NGYKEKHGADYSKDYFTDLITNDSINFFRVSKRMYPHRPVMMVISHAAPHGPEDSAPQYA 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
             F N + H TP+Y+YAPN DK WI+Q T  M+P+H +FT+ L  KRLQTL SVDD+V+K
Sbjct: 237 EHFPNASQHITPSYNYAPNMDKHWIMQYTGPMRPIHMEFTNFLHRKRLQTLMSVDDSVQK 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           + + L++ GEL+NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG +
Sbjct: 297 VYEMLEETGELDNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFLRGPNVEPGAV 355


>gi|312080544|ref|XP_003142644.1| hypothetical protein LOAG_07062 [Loa loa]
 gi|307762190|gb|EFO21424.1| hypothetical protein LOAG_07062 [Loa loa]
          Length = 827

 Score =  306 bits (783), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 136/211 (64%), Positives = 171/211 (81%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            YFGKYLN+Y GSYIPPGW  W  L+ NS++YNY+IN+NG KIKHG DY  DY+ DLIAN
Sbjct: 124 AYFGKYLNEYTGSYIPPGWDHWMGLLRNSRFYNYTINVNGDKIKHGWDYEKDYFTDLIAN 183

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           D++ F+R+       KP M+VMSFPAPHGPED APQY+  F +V +H T A++YAPNPDK
Sbjct: 184 DTIAFIRELHSRDLFKPYMIVMSFPAPHGPEDPAPQYSTWFQDVETHRTEAWNYAPNPDK 243

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QW+LQ T +M+PVH  FTD+L  +RLQTLQSVD ++++++ EL+DLG+L+NT  IYTSDH
Sbjct: 244 QWLLQHTGRMEPVHVIFTDVLHRRRLQTLQSVDSSIQRLINELRDLGDLSNTVFIYTSDH 303

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           GYHLGQFGLVKGK+ P+EFDIRVP+ +RGPG
Sbjct: 304 GYHLGQFGLVKGKNMPYEFDIRVPYFIRGPG 334


>gi|395829467|ref|XP_003787880.1| PREDICTED: extracellular sulfatase Sulf-2 [Otolemur garnettii]
          Length = 972

 Score =  306 bits (783), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 138/239 (57%), Positives = 180/239 (75%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 219 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 278

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 279 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 338

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H QFT++L  KRLQTL SVDD++E+
Sbjct: 339 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMQFTNMLQRKRLQTLMSVDDSMER 398

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 399 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 457


>gi|410909097|ref|XP_003968027.1| PREDICTED: extracellular sulfatase Sulf-1-like [Takifugu rubripes]
          Length = 1172

 Score =  304 bits (778), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 136/217 (62%), Positives = 171/217 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           G+FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG +Y  DY+ DLI N
Sbjct: 137 GFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCRNGYKEKHGGEYAKDYFTDLITN 196

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ F R SK+ F  +P+M+V+S  APHGPEDSAPQYA+ F N + H TP+Y+YAPN DK
Sbjct: 197 DSINFFRISKRMFPHRPVMMVISHAAPHGPEDSAPQYADHFPNASQHITPSYNYAPNMDK 256

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M+P+H +FT+ L  KRLQTL SVDD+V+K+   L++ GEL NTYIIYT+DH
Sbjct: 257 HWIMQYTGPMRPIHMEFTNFLHRKRLQTLMSVDDSVQKVYDMLEETGELENTYIIYTADH 316

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P++FDIRVPF +RGP +  G +
Sbjct: 317 GYHIGQFGLVKGKSMPYDFDIRVPFFLRGPNVESGAV 353


>gi|297259626|ref|XP_001106412.2| PREDICTED: extracellular sulfatase Sulf-2-like [Macaca mulatta]
          Length = 948

 Score =  303 bits (777), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 198 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 257

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 258 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 317

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 318 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNILQRKRLQTLMSVDDSMET 377

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 378 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 436


>gi|81158032|tpe|CAI84990.1| TPA: sulfatase 2 [Rattus norvegicus]
          Length = 866

 Score =  303 bits (777), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGMKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYDMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPSVEAGSL 355


>gi|397511408|ref|XP_003826067.1| PREDICTED: extracellular sulfatase Sulf-2 [Pan paniscus]
 gi|410207924|gb|JAA01181.1| sulfatase 2 [Pan troglodytes]
 gi|410291484|gb|JAA24342.1| sulfatase 2 [Pan troglodytes]
 gi|410345916|gb|JAA40669.1| sulfatase 2 [Pan troglodytes]
          Length = 870

 Score =  303 bits (777), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|332207736|ref|XP_003252952.1| PREDICTED: extracellular sulfatase Sulf-2 [Nomascus leucogenys]
          Length = 869

 Score =  303 bits (776), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|119596097|gb|EAW75691.1| sulfatase 2, isoform CRA_b [Homo sapiens]
          Length = 852

 Score =  303 bits (776), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 178/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G +
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCL 355


>gi|440893777|gb|ELR46430.1| Extracellular sulfatase Sulf-2 [Bos grunniens mutus]
          Length = 865

 Score =  303 bits (776), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 138/239 (57%), Positives = 180/239 (75%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRASKKMYPHRPVLMVLSHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
           ++F N + H TP+Y+YAPNPDK WI++ T  M+P+H QFT++L  KRLQTL SVDD++E 
Sbjct: 237 SLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMQFTNMLQRKRLQTLLSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|355563054|gb|EHH19616.1| Extracellular sulfatase Sulf-2 [Macaca mulatta]
 gi|380787465|gb|AFE65608.1| extracellular sulfatase Sulf-2 isoform a precursor [Macaca mulatta]
 gi|380787469|gb|AFE65610.1| extracellular sulfatase Sulf-2 isoform a precursor [Macaca mulatta]
 gi|383409655|gb|AFH28041.1| extracellular sulfatase Sulf-2 isoform a precursor [Macaca mulatta]
 gi|384939934|gb|AFI33572.1| extracellular sulfatase Sulf-2 isoform a precursor [Macaca mulatta]
 gi|384939936|gb|AFI33573.1| extracellular sulfatase Sulf-2 isoform a precursor [Macaca mulatta]
          Length = 870

 Score =  303 bits (776), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNILQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|110626131|ref|NP_001030099.1| extracellular sulfatase Sulf-2 precursor [Rattus norvegicus]
 gi|58012992|gb|AAW62950.1| sulfatase FP2 [Rattus norvegicus]
 gi|149042868|gb|EDL96442.1| sulfatase 2, isoform CRA_b [Rattus norvegicus]
          Length = 875

 Score =  303 bits (776), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGMKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYDMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPSVEAGSL 355


>gi|240255478|ref|NP_940998.2| extracellular sulfatase Sulf-2 isoform b precursor [Homo sapiens]
 gi|37182046|gb|AAQ88826.1| GPPS559 [Homo sapiens]
 gi|119596098|gb|EAW75692.1| sulfatase 2, isoform CRA_c [Homo sapiens]
          Length = 867

 Score =  303 bits (776), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 178/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G +
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCL 355


>gi|183986511|gb|AAI66410.1| Sulf2 protein [Rattus norvegicus]
          Length = 875

 Score =  303 bits (776), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGMKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYDMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPSVEAGSL 355


>gi|29789100|ref|NP_061325.1| extracellular sulfatase Sulf-2 isoform a precursor [Homo sapiens]
 gi|240255483|ref|NP_001155313.1| extracellular sulfatase Sulf-2 isoform a precursor [Homo sapiens]
 gi|33112446|sp|Q8IWU5.1|SULF2_HUMAN RecName: Full=Extracellular sulfatase Sulf-2; Short=hSulf-2; Flags:
           Precursor
 gi|27356934|gb|AAM76861.1| extracellular sulfatase SULF-2 [Homo sapiens]
 gi|51476368|emb|CAH18174.1| hypothetical protein [Homo sapiens]
 gi|83405533|gb|AAI10540.1| Sulfatase 2 [Homo sapiens]
 gi|119596096|gb|EAW75690.1| sulfatase 2, isoform CRA_a [Homo sapiens]
 gi|119596099|gb|EAW75693.1| sulfatase 2, isoform CRA_a [Homo sapiens]
 gi|119596100|gb|EAW75694.1| sulfatase 2, isoform CRA_a [Homo sapiens]
 gi|119596101|gb|EAW75695.1| sulfatase 2, isoform CRA_a [Homo sapiens]
 gi|190690055|gb|ACE86802.1| sulfatase 2 protein [synthetic construct]
 gi|190691429|gb|ACE87489.1| sulfatase 2 protein [synthetic construct]
 gi|190692019|gb|ACE87784.1| sulfatase 2 protein [synthetic construct]
 gi|254071383|gb|ACT64451.1| sulfatase 2 protein [synthetic construct]
          Length = 870

 Score =  303 bits (776), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 178/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G +
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCL 355


>gi|168269766|dbj|BAG10010.1| extracellular sulfatase Sulf-2 precursor [synthetic construct]
          Length = 870

 Score =  303 bits (776), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 178/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G +
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCL 355


>gi|348563931|ref|XP_003467760.1| PREDICTED: extracellular sulfatase Sulf-2-like [Cavia porcellus]
          Length = 872

 Score =  303 bits (775), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLLSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|14133245|dbj|BAA86561.2| KIAA1247 protein [Homo sapiens]
          Length = 885

 Score =  303 bits (775), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 178/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 132 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 191

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 192 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 251

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 252 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 311

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G +
Sbjct: 312 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCL 370


>gi|300793847|ref|NP_001179867.1| extracellular sulfatase Sulf-2 precursor [Bos taurus]
 gi|296480968|tpg|DAA23083.1| TPA: sulfatase 2-like [Bos taurus]
          Length = 862

 Score =  303 bits (775), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 138/239 (57%), Positives = 180/239 (75%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRASKKMYPHRPVLMVLSHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
           ++F N + H TP+Y+YAPNPDK WI++ T  M+P+H QFT++L  KRLQTL SVDD++E 
Sbjct: 237 SLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMQFTNMLQRKRLQTLLSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|354476744|ref|XP_003500583.1| PREDICTED: extracellular sulfatase Sulf-2 [Cricetulus griseus]
 gi|344241558|gb|EGV97661.1| Extracellular sulfatase Sulf-2 [Cricetulus griseus]
          Length = 875

 Score =  303 bits (775), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 136/239 (56%), Positives = 180/239 (75%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  + GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNNTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+++  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKTYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYDMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|363730828|ref|XP_003640872.1| PREDICTED: extracellular sulfatase Sulf-1 [Gallus gallus]
          Length = 867

 Score =  302 bits (774), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 178/227 (78%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  L+ NS++YNY+I+ NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWVGLVKNSRFYNYTISRNGNKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + R SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD++E++ + L ++GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSMERLYQMLAEMGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 358


>gi|33112449|sp|Q90XB6.1|SULF1_COTCO RecName: Full=Extracellular sulfatase Sulf-1; Short=qSulf1; Flags:
           Precursor
 gi|15430244|gb|AAK98515.1|AF410802_1 N-acetylglucosamine-6-sulfatase [Coturnix coturnix]
          Length = 867

 Score =  302 bits (774), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 178/227 (78%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  L+ NS++YNY+I+ NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWVGLVKNSRFYNYTISRNGNKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + R SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD++E++ + L ++GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSMERLYQMLAEMGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 358


>gi|357588455|ref|NP_001239508.1| extracellular sulfatase Sulf-2 isoform 2 [Mus musculus]
          Length = 908

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 150 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 209

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 210 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 269

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 270 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 329

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 330 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 388


>gi|350595021|ref|XP_003484028.1| PREDICTED: extracellular sulfatase Sulf-2-like [Sus scrofa]
          Length = 527

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|410920477|ref|XP_003973710.1| PREDICTED: extracellular sulfatase Sulf-2-like [Takifugu rubripes]
          Length = 883

 Score =  302 bits (773), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 131/216 (60%), Positives = 172/216 (79%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           G+FGKYLN+YNGSY+PPGW+EW AL+ NS++YNY++  NG + KH  DY  DY  D+I N
Sbjct: 146 GFFGKYLNEYNGSYVPPGWKEWVALVKNSRFYNYTLCRNGVREKHSSDYPKDYLTDIITN 205

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + R SK+++  +P+M+V+S  APHGPEDSAPQY++ F N + H TP+Y+YAPNPDK
Sbjct: 206 ESINYFRTSKRTYPNRPVMMVLSHVAPHGPEDSAPQYSSAFPNASQHITPSYNYAPNPDK 265

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WIL+ T  M+PVH QFT++L  +R+QTL SVDD+VEK+   L + GEL+NTY+IYTSDH
Sbjct: 266 HWILRYTGAMKPVHMQFTNMLQRRRMQTLLSVDDSVEKVFNMLAETGELDNTYLIYTSDH 325

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G 
Sbjct: 326 GYHIGQFGLVKGKSMPYEFDIRVPFFIRGPNVEQGA 361


>gi|390462707|ref|XP_002747675.2| PREDICTED: extracellular sulfatase Sulf-2 [Callithrix jacchus]
          Length = 931

 Score =  302 bits (773), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 136/239 (56%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  + GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 179 SWQAQHESRTFAVYLNNTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 238

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 239 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 298

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 299 CLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 358

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 359 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 417


>gi|148674529|gb|EDL06476.1| sulfatase 2, isoform CRA_a [Mus musculus]
          Length = 943

 Score =  302 bits (773), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 185 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 244

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 245 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 304

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 305 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 364

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 365 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 423


>gi|148674531|gb|EDL06478.1| sulfatase 2, isoform CRA_c [Mus musculus]
          Length = 908

 Score =  302 bits (773), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 150 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 209

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 210 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 269

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 270 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 329

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 330 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 388


>gi|27356938|gb|AAM76862.1| extracellular sulfatase SULF-2 [Mus musculus]
 gi|187954897|gb|AAI41087.1| Sulfatase 2 [Mus musculus]
          Length = 875

 Score =  302 bits (773), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|61656167|ref|NP_082348.2| extracellular sulfatase Sulf-2 isoform 1 precursor [Mus musculus]
 gi|357588451|ref|NP_001239507.1| extracellular sulfatase Sulf-2 isoform 1 precursor [Mus musculus]
 gi|341942093|sp|Q8CFG0.2|SULF2_MOUSE RecName: Full=Extracellular sulfatase Sulf-2; Short=mSulf-2; Flags:
           Precursor
 gi|26331608|dbj|BAC29534.1| unnamed protein product [Mus musculus]
 gi|74138510|dbj|BAE38065.1| unnamed protein product [Mus musculus]
 gi|74197259|dbj|BAC28804.2| unnamed protein product [Mus musculus]
 gi|74223521|dbj|BAE21607.1| unnamed protein product [Mus musculus]
          Length = 875

 Score =  302 bits (773), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|26349279|dbj|BAC38279.1| unnamed protein product [Mus musculus]
          Length = 875

 Score =  302 bits (773), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYFNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|37360296|dbj|BAC98126.1| mKIAA1247 protein [Mus musculus]
          Length = 948

 Score =  302 bits (773), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 190 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 249

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 250 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 309

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 310 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 369

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 370 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 428


>gi|281343859|gb|EFB19443.1| hypothetical protein PANDA_016742 [Ailuropoda melanoleuca]
          Length = 776

 Score =  301 bits (772), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 180/239 (75%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 59  SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 118

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 119 NGLKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 178

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
           ++F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 179 SLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 238

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 239 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 297


>gi|224046351|ref|XP_002199068.1| PREDICTED: extracellular sulfatase Sulf-1 [Taeniopygia guttata]
          Length = 867

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 178/227 (78%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  L+ NS++YNY+I+ NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWVGLVKNSRFYNYTISRNGNKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + R SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD++E++ + L ++GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSMERLHQMLAEMGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 358


>gi|426392035|ref|XP_004062367.1| PREDICTED: extracellular sulfatase Sulf-2 [Gorilla gorilla gorilla]
          Length = 870

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 136/239 (56%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++ +
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMXQ 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|114326190|ref|NP_001041555.1| extracellular sulfatase Sulf-2 precursor [Canis lupus familiaris]
 gi|81158076|tpe|CAI85012.1| TPA: sulfatase 2 [Canis lupus familiaris]
          Length = 869

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 GLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|355784413|gb|EHH65264.1| Extracellular sulfatase Sulf-2 [Macaca fascicularis]
          Length = 870

 Score =  301 bits (771), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 136/239 (56%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG +Y  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSNYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNILQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|444706942|gb|ELW48257.1| Extracellular sulfatase Sulf-2 [Tupaia chinensis]
          Length = 1080

 Score =  301 bits (771), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 136/239 (56%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 221 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 280

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 281 NGVKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 340

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  +RLQTL SVDD++E 
Sbjct: 341 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRRRLQTLMSVDDSMET 400

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 401 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 459


>gi|417515840|gb|JAA53728.1| E3 extracellular sulfatase Sulf-2 [Sus scrofa]
          Length = 865

 Score =  301 bits (771), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|149733333|ref|XP_001503564.1| PREDICTED: extracellular sulfatase Sulf-2 [Equus caballus]
          Length = 869

 Score =  301 bits (771), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 136/239 (56%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG +Y  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSEYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|403282315|ref|XP_003932598.1| PREDICTED: extracellular sulfatase Sulf-2 [Saimiri boliviensis
           boliviensis]
          Length = 869

 Score =  301 bits (771), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 136/239 (56%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  + GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNNTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 CLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|301783255|ref|XP_002927045.1| PREDICTED: extracellular sulfatase Sulf-2-like [Ailuropoda
           melanoleuca]
          Length = 828

 Score =  301 bits (771), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 180/239 (75%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 75  SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 134

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 135 NGLKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 194

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
           ++F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 195 SLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 254

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 255 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 313


>gi|351710483|gb|EHB13402.1| Extracellular sulfatase Sulf-2 [Heterocephalus glaber]
          Length = 871

 Score =  301 bits (770), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 180/239 (75%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
           ++F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 HLFPNSSQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLLSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLLETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|410953784|ref|XP_003983550.1| PREDICTED: extracellular sulfatase Sulf-2 [Felis catus]
          Length = 854

 Score =  301 bits (770), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 GLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLVETGELDNTYIMYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|348588460|ref|XP_003479984.1| PREDICTED: extracellular sulfatase Sulf-1-like [Cavia porcellus]
          Length = 1082

 Score =  300 bits (768), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 134/227 (59%), Positives = 176/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      +IPQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VIPQI 358


>gi|26324822|dbj|BAC26165.1| unnamed protein product [Mus musculus]
          Length = 875

 Score =  300 bits (768), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 136/239 (56%), Positives = 178/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRL TL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLHTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|344280062|ref|XP_003411804.1| PREDICTED: extracellular sulfatase Sulf-2 [Loxodonta africana]
          Length = 874

 Score =  300 bits (768), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 136/239 (56%), Positives = 178/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +   R    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHEGRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNVLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|397522689|ref|XP_003831390.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 1 [Pan paniscus]
 gi|397522691|ref|XP_003831391.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Pan paniscus]
 gi|397522693|ref|XP_003831392.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 3 [Pan paniscus]
 gi|397522695|ref|XP_003831393.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 4 [Pan paniscus]
          Length = 869

 Score =  300 bits (767), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 176/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           G+FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 GFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|297683035|ref|XP_002819205.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 1 [Pongo abelii]
 gi|297683037|ref|XP_002819206.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Pongo abelii]
 gi|297683039|ref|XP_002819207.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 3 [Pongo abelii]
 gi|297683041|ref|XP_002819208.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 4 [Pongo abelii]
          Length = 868

 Score =  299 bits (766), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF MRGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFMRGPSVEPGS------IVPQI 358


>gi|126321290|ref|XP_001378749.1| PREDICTED: extracellular sulfatase Sulf-1 [Monodelphis domestica]
          Length = 1134

 Score =  299 bits (765), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 177/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ+++++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSDLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|327271782|ref|XP_003220666.1| PREDICTED: extracellular sulfatase Sulf-2-like [Anolis
           carolinensis]
          Length = 882

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 130/217 (59%), Positives = 172/217 (79%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  NG K KHG DY  DY  DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSRDYLTDLITN 198

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ F R SK+ +  +P+++V+S  APHGPEDSAPQY+++F N + H TP+Y+YAPNPDK
Sbjct: 199 DSISFFRISKKMYPHRPVLMVLSHAAPHGPEDSAPQYSHLFPNASQHITPSYNYAPNPDK 258

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  KRLQTL SVDD++E I   L + GEL+NTY+IYT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQQKRLQTLMSVDDSMEAIYNTLVETGELDNTYVIYTADH 318

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEAGSL 355


>gi|395506869|ref|XP_003757752.1| PREDICTED: extracellular sulfatase Sulf-2 [Sarcophilus harrisii]
          Length = 875

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 131/217 (60%), Positives = 171/217 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  NG K KHG DY  DY  DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSKDYLTDLITN 198

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+ +F N + H TP+Y+YAPNPDK
Sbjct: 199 DSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSQLFPNASQHITPSYNYAPNPDK 258

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  KRLQTL SVDD++E I   L + GEL+NTYI+YT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGELDNTYIVYTADH 318

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEAGSL 355


>gi|126303328|ref|XP_001379302.1| PREDICTED: extracellular sulfatase Sulf-2 [Monodelphis domestica]
          Length = 878

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 131/217 (60%), Positives = 171/217 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  NG K KHG DY  DY  DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSKDYLTDLITN 198

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+ +F N + H TP+Y+YAPNPDK
Sbjct: 199 DSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSQLFPNASQHITPSYNYAPNPDK 258

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  KRLQTL SVDD++E I   L + GEL+NTYI+YT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGELDNTYIVYTADH 318

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEAGSL 355


>gi|26336991|dbj|BAC32179.1| unnamed protein product [Mus musculus]
          Length = 862

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 134/227 (59%), Positives = 176/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVESGELDNTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGS------IVPQI 358


>gi|351705551|gb|EHB08470.1| Extracellular sulfatase Sulf-1 [Heterocephalus glaber]
          Length = 866

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 176/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|395511033|ref|XP_003759767.1| PREDICTED: extracellular sulfatase Sulf-1 [Sarcophilus harrisii]
          Length = 1130

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 133/223 (59%), Positives = 175/223 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ+++++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSDLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG++    VL
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGSIVSQIVL 360


>gi|291409979|ref|XP_002721269.1| PREDICTED: sulfatase 2 [Oryctolagus cuniculus]
          Length = 869

 Score =  298 bits (764), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 136/239 (56%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW +L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVSLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K +HG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKERHGFDYAKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
             F N + H TP+Y+YAPNPDK WI++ T  M+P+H QFT++L  KRLQTL SVDD++E 
Sbjct: 237 RFFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMQFTNVLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           I   L + GEL+NTY++YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 IYNMLVETGELDNTYLVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|449283961|gb|EMC90544.1| Extracellular sulfatase Sulf-2 [Columba livia]
          Length = 875

 Score =  298 bits (764), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 131/217 (60%), Positives = 172/217 (79%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  NG K KHG DY  DY  DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSRDYLTDLITN 198

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ F R SK+ +  +P+++V+S  APHGPEDSAPQY+++F N + H TP+Y+YAPNPDK
Sbjct: 199 DSITFFRISKKMYPHRPVLMVISHAAPHGPEDSAPQYSHLFPNASQHITPSYNYAPNPDK 258

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  KRLQTL SVDD++E I   L + GEL+NTYIIYT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMEMIYNTLVETGELDNTYIIYTADH 318

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 319 GYHVGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|26986617|ref|NP_758498.1| extracellular sulfatase Sulf-1 precursor [Mus musculus]
 gi|310616752|ref|NP_001185494.1| extracellular sulfatase Sulf-1 precursor [Mus musculus]
 gi|310616754|ref|NP_001185495.1| extracellular sulfatase Sulf-1 precursor [Mus musculus]
 gi|33112448|sp|Q8K007.1|SULF1_MOUSE RecName: Full=Extracellular sulfatase Sulf-1; Short=mSulf-1; Flags:
           Precursor
 gi|21961489|gb|AAH34547.1| Sulfatase 1 [Mus musculus]
 gi|27356936|gb|AAM76863.1| extracellular sulfatase SULF-1 [Mus musculus]
 gi|29165845|gb|AAH49276.1| Sulfatase 1 [Mus musculus]
 gi|148682368|gb|EDL14315.1| sulfatase 1 [Mus musculus]
          Length = 870

 Score =  298 bits (764), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 134/227 (59%), Positives = 176/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVESGELDNTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGS------IVPQI 358


>gi|118100629|ref|XP_417386.2| PREDICTED: extracellular sulfatase Sulf-2 [Gallus gallus]
          Length = 877

 Score =  298 bits (764), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 131/217 (60%), Positives = 172/217 (79%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  NG K KHG DY  DY  DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSRDYLTDLITN 198

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ F R SK+ +  +P+++V+S  APHGPEDSAPQY+++F N + H TP+Y+YAPNPDK
Sbjct: 199 DSITFFRISKKMYPHRPVLMVISHAAPHGPEDSAPQYSHLFPNASQHITPSYNYAPNPDK 258

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  KRLQTL SVDD++E I   L + GEL+NTYIIYT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMEMIYNTLVETGELDNTYIIYTADH 318

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|26390193|dbj|BAC25858.1| unnamed protein product [Mus musculus]
          Length = 870

 Score =  298 bits (764), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 134/227 (59%), Positives = 176/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVESGELDNTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGS------IVPQI 358


>gi|326931861|ref|XP_003212042.1| PREDICTED: extracellular sulfatase Sulf-2-like [Meleagris
           gallopavo]
          Length = 877

 Score =  298 bits (764), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 131/217 (60%), Positives = 172/217 (79%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  NG K KHG DY  DY  DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSRDYLTDLITN 198

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ F R SK+ +  +P+++V+S  APHGPEDSAPQY+++F N + H TP+Y+YAPNPDK
Sbjct: 199 DSITFFRISKKMYPHRPVLMVISHAAPHGPEDSAPQYSHLFPNASQHITPSYNYAPNPDK 258

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  KRLQTL SVDD++E I   L + GEL+NTYIIYT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMEMIYNTLVETGELDNTYIIYTADH 318

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|54660730|gb|AAV37455.1| heparan sulfate 6-O endosulfatase 2 [Coturnix coturnix]
          Length = 877

 Score =  298 bits (764), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 131/217 (60%), Positives = 172/217 (79%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  NG K KHG DY  DY  DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSRDYLTDLITN 198

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ F R SK+ +  +P+++V+S  APHGPEDSAPQY+++F N + H TP+Y+YAPNPDK
Sbjct: 199 DSITFFRISKRMYPHRPVLMVISHAAPHGPEDSAPQYSHLFPNASQHITPSYNYAPNPDK 258

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  KRLQTL SVDD++E I   L + GEL+NTYIIYT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMEMIYNTLVETGELDNTYIIYTADH 318

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|354495022|ref|XP_003509631.1| PREDICTED: extracellular sulfatase Sulf-1 [Cricetulus griseus]
          Length = 1011

 Score =  298 bits (764), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+       +PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGST------VPQI 358


>gi|296226652|ref|XP_002759021.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Callithrix
           jacchus]
          Length = 869

 Score =  298 bits (764), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +SV + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESVNYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|37360220|dbj|BAC98088.1| mKIAA1077 protein [Mus musculus]
          Length = 1079

 Score =  298 bits (764), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 134/227 (59%), Positives = 176/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 85  AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 144

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 145 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 204

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL+NTYIIYT+DH
Sbjct: 205 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVESGELDNTYIIYTADH 264

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+      ++PQ+
Sbjct: 265 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGS------IVPQI 305


>gi|431894471|gb|ELK04271.1| Extracellular sulfatase Sulf-2 [Pteropus alecto]
          Length = 814

 Score =  298 bits (763), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 133/239 (55%), Positives = 179/239 (74%), Gaps = 4/239 (1%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +   R    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHEGRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DL+ NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLVTNDSVNFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
           ++F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 237 HLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           +   L + GEL+NTY++YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 297 LYNMLVETGELDNTYLVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|41055578|ref|NP_957230.1| sulfatase 2 precursor [Danio rerio]
 gi|28279577|gb|AAH45403.1| Sulfatase 2 [Danio rerio]
 gi|182890734|gb|AAI65233.1| Sulf2 protein [Danio rerio]
          Length = 873

 Score =  298 bits (763), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 131/217 (60%), Positives = 170/217 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGWREW AL+ NS++YNY++  NG + KHG  Y  DY  D+I N
Sbjct: 141 AFFGKYLNEYNGSYVPPGWREWVALVKNSRFYNYTLCRNGIREKHGTQYPKDYLTDVITN 200

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ F R SK+ +  +P+M+V+S  APHGPED+APQY++ F N + H TP+Y++APNPDK
Sbjct: 201 DSINFFRMSKRMYPHRPVMMVLSHAAPHGPEDAAPQYSSAFPNASQHITPSYNHAPNPDK 260

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WIL+ T  M+PVH QFT++L  +RLQTL SVDD+VEK+   L + GEL+NTYIIY SDH
Sbjct: 261 HWILRYTGPMKPVHMQFTNMLQRRRLQTLLSVDDSVEKVYNMLVETGELDNTYIIYMSDH 320

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIR+PF +RGP +  G +
Sbjct: 321 GYHIGQFGLVKGKSMPYEFDIRIPFYVRGPNVEAGAI 357


>gi|119607360|gb|EAW86954.1| sulfatase 1, isoform CRA_b [Homo sapiens]
          Length = 882

 Score =  298 bits (763), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|19705485|ref|NP_599205.1| extracellular sulfatase Sulf-1 precursor [Rattus norvegicus]
 gi|33112437|sp|Q8VI60.1|SULF1_RAT RecName: Full=Extracellular sulfatase Sulf-1; AltName:
           Full=RSulfFP1; AltName: Full=Sulfatase FP; Flags:
           Precursor
 gi|18461371|gb|AAL71906.1|AF230072_1 sulfatase FP [Rattus norvegicus]
          Length = 870

 Score =  298 bits (763), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELGNTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGS------IVPQI 358


>gi|387018856|gb|AFJ51546.1| Extracellular sulfatase Sulf-2-like [Crotalus adamanteus]
          Length = 888

 Score =  298 bits (762), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 129/217 (59%), Positives = 170/217 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  NG K KHG DY  DY  DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGMKEKHGSDYTRDYLTDLITN 198

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ F R SK+ +  +P+++V+S  APHGPEDSAP Y+++F N + H TP+Y+YAPNPDK
Sbjct: 199 DSISFFRISKKMYPHRPVLMVLSHAAPHGPEDSAPHYSHLFPNASQHITPSYNYAPNPDK 258

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  KRLQTL SVDD++E I   L + GEL NTY+IYT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQQKRLQTLMSVDDSMEAIYNTLVETGELENTYVIYTADH 318

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEAGSL 355


>gi|327269757|ref|XP_003219659.1| PREDICTED: extracellular sulfatase Sulf-1-like [Anolis
           carolinensis]
          Length = 1122

 Score =  298 bits (762), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 134/223 (60%), Positives = 173/223 (77%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCRNGLKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 DSIHYFKMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD++E++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLLSVDDSMERLYHMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
           GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG++    VL
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGSVVSQIVL 360


>gi|426242121|ref|XP_004014923.1| PREDICTED: extracellular sulfatase Sulf-2 [Ovis aries]
          Length = 912

 Score =  298 bits (762), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 132/219 (60%), Positives = 173/219 (78%)

Query: 18  SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           +  +FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  NG K KHG DY  DY  DLI
Sbjct: 184 ARAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSKDYLTDLI 243

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
            NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+++F N + H TP+Y+YAPNP
Sbjct: 244 TNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSSLFPNASQHITPSYNYAPNP 303

Query: 138 DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
           DK WI++ T  M+P+H QFT++L  KRLQTL SVDD++E I   L + GEL+NTYI+YT+
Sbjct: 304 DKHWIMRYTGPMKPIHMQFTNMLQRKRLQTLLSVDDSMETIYNMLVETGELDNTYIVYTA 363

Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 364 DHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 402


>gi|344256682|gb|EGW12786.1| Extracellular sulfatase Sulf-1 [Cricetulus griseus]
          Length = 787

 Score =  298 bits (762), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 76  AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 135

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 136 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 195

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL+NTYIIYT+DH
Sbjct: 196 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADH 255

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+       +PQ+
Sbjct: 256 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGST------VPQI 296


>gi|350583123|ref|XP_003481437.1| PREDICTED: LOW QUALITY PROTEIN: extracellular sulfatase Sulf-1-like
           [Sus scrofa]
          Length = 1119

 Score =  298 bits (762), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 198

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 199 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 258

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 259 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 318

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 319 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 359


>gi|344273147|ref|XP_003408388.1| PREDICTED: extracellular sulfatase Sulf-1 [Loxodonta africana]
          Length = 869

 Score =  298 bits (762), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRLYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|149060925|gb|EDM11535.1| sulfatase 1, isoform CRA_a [Rattus norvegicus]
 gi|149060926|gb|EDM11536.1| sulfatase 1, isoform CRA_a [Rattus norvegicus]
          Length = 658

 Score =  298 bits (762), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELGNTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGS------IVPQI 358


>gi|189571636|ref|NP_055985.2| extracellular sulfatase Sulf-1 precursor [Homo sapiens]
 gi|189571638|ref|NP_001121676.1| extracellular sulfatase Sulf-1 precursor [Homo sapiens]
 gi|189571641|ref|NP_001121677.1| extracellular sulfatase Sulf-1 precursor [Homo sapiens]
 gi|189571643|ref|NP_001121678.1| extracellular sulfatase Sulf-1 precursor [Homo sapiens]
 gi|33112447|sp|Q8IWU6.1|SULF1_HUMAN RecName: Full=Extracellular sulfatase Sulf-1; Short=hSulf-1; Flags:
           Precursor
 gi|27356932|gb|AAM76860.1| extracellular sulfatase SULF-1 [Homo sapiens]
 gi|119607359|gb|EAW86953.1| sulfatase 1, isoform CRA_a [Homo sapiens]
 gi|119607361|gb|EAW86955.1| sulfatase 1, isoform CRA_a [Homo sapiens]
          Length = 871

 Score =  298 bits (762), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|28191290|gb|AAO33315.1| sulfatase SULF1 precursor [Homo sapiens]
          Length = 871

 Score =  298 bits (762), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|355779744|gb|EHH64220.1| Extracellular sulfatase Sulf-1 [Macaca fascicularis]
          Length = 869

 Score =  298 bits (762), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|355698008|gb|EHH28556.1| Extracellular sulfatase Sulf-1 [Macaca mulatta]
 gi|380810974|gb|AFE77362.1| extracellular sulfatase Sulf-1 precursor [Macaca mulatta]
          Length = 869

 Score =  298 bits (762), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|426359865|ref|XP_004047178.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 1 [Gorilla
           gorilla gorilla]
 gi|426359867|ref|XP_004047179.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Gorilla
           gorilla gorilla]
          Length = 869

 Score =  298 bits (762), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|426235614|ref|XP_004011775.1| PREDICTED: LOW QUALITY PROTEIN: extracellular sulfatase Sulf-1
           [Ovis aries]
          Length = 867

 Score =  298 bits (762), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|403304744|ref|XP_003942951.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 1 [Saimiri
           boliviensis boliviensis]
 gi|403304746|ref|XP_003942952.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 869

 Score =  297 bits (761), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|114326202|ref|NP_001041580.1| extracellular sulfatase Sulf-1 precursor [Canis lupus familiaris]
 gi|81158074|tpe|CAI85011.1| TPA: sulfatase 1 [Canis lupus familiaris]
          Length = 869

 Score =  297 bits (761), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 176/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|33096729|emb|CAE11871.1| hypothetical protein [Homo sapiens]
 gi|190689601|gb|ACE86575.1| sulfatase 1 protein [synthetic construct]
 gi|190690967|gb|ACE87258.1| sulfatase 1 protein [synthetic construct]
          Length = 871

 Score =  297 bits (761), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 132/226 (58%), Positives = 175/226 (77%), Gaps = 6/226 (2%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N+
Sbjct: 139 FFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITNE 198

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQ 140
           S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK 
Sbjct: 199 SINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKH 258

Query: 141 WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
           WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DHG
Sbjct: 259 WIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHG 318

Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           YH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 319 YHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|402878437|ref|XP_003902891.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 1 [Papio anubis]
 gi|402878439|ref|XP_003902892.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Papio anubis]
 gi|402878441|ref|XP_003902893.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 3 [Papio anubis]
 gi|402878443|ref|XP_003902894.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 4 [Papio anubis]
          Length = 869

 Score =  297 bits (761), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 358


>gi|332826233|ref|XP_003311792.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 1 [Pan
           troglodytes]
 gi|332826235|ref|XP_003311793.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Pan
           troglodytes]
 gi|332826237|ref|XP_003311794.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 3 [Pan
           troglodytes]
 gi|332826239|ref|XP_003311795.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 4 [Pan
           troglodytes]
 gi|410266244|gb|JAA21088.1| sulfatase 1 [Pan troglodytes]
 gi|410266246|gb|JAA21089.1| sulfatase 1 [Pan troglodytes]
 gi|410266248|gb|JAA21090.1| sulfatase 1 [Pan troglodytes]
          Length = 869

 Score =  297 bits (761), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCHNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|47227306|emb|CAF96855.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1239

 Score =  297 bits (761), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 130/217 (59%), Positives = 171/217 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGW+EW AL+ NS++YNY++  NG + KH  DY  DY  D+I N
Sbjct: 173 AFFGKYLNEYNGSYVPPGWKEWVALVKNSRFYNYTLCRNGVREKHSSDYPKDYLTDIITN 232

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + R SK+ +  +P+M+V+S  APHGPEDSAPQY++ F + + H TP+Y+YAPNPDK
Sbjct: 233 ESIHYFRTSKRMYPNRPVMMVLSHVAPHGPEDSAPQYSSAFPSASQHITPSYNYAPNPDK 292

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WIL+ T  M+PVH QFT++L  +R+QTL SVDD+VEK+   L + GEL+NTY+IYTSDH
Sbjct: 293 HWILRYTGPMKPVHMQFTNMLQRRRMQTLLSVDDSVEKVFNMLVETGELDNTYLIYTSDH 352

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G M
Sbjct: 353 GYHIGQFGLVKGKSMPYEFDIRVPFFIRGPNVEQGAM 389


>gi|410336657|gb|JAA37275.1| sulfatase 1 [Pan troglodytes]
 gi|410336659|gb|JAA37276.1| sulfatase 1 [Pan troglodytes]
 gi|410336661|gb|JAA37277.1| sulfatase 1 [Pan troglodytes]
          Length = 869

 Score =  297 bits (761), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCHNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|281351669|gb|EFB27253.1| hypothetical protein PANDA_007018 [Ailuropoda melanoleuca]
          Length = 862

 Score =  297 bits (761), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 358


>gi|224078734|ref|XP_002189215.1| PREDICTED: extracellular sulfatase Sulf-2 isoform 1 [Taeniopygia
           guttata]
          Length = 879

 Score =  297 bits (761), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 130/217 (59%), Positives = 170/217 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  NG K KHG DY  DY  DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGYDYSRDYLTDLITN 198

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ F R SK+ +  +P+++V+S  APHGPEDSAPQY+ +F N + H TP+Y+YAPNPDK
Sbjct: 199 DSITFFRISKKMYPHRPVLMVISHAAPHGPEDSAPQYSQLFPNASEHITPSYNYAPNPDK 258

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  KRLQTL SVDD++E I   L + GEL NTY+IYT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMEMIYNMLAETGELENTYLIYTADH 318

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355


>gi|358415387|ref|XP_003583093.1| PREDICTED: extracellular sulfatase Sulf-1-like [Bos taurus]
          Length = 868

 Score =  297 bits (761), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|444518093|gb|ELV11952.1| Extracellular sulfatase Sulf-1 [Tupaia chinensis]
          Length = 1035

 Score =  297 bits (761), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYI+YT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIVYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|301766256|ref|XP_002918548.1| PREDICTED: extracellular sulfatase Sulf-1-like [Ailuropoda
           melanoleuca]
          Length = 867

 Score =  297 bits (761), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 358


>gi|5689491|dbj|BAA83029.1| KIAA1077 protein [Homo sapiens]
          Length = 818

 Score =  297 bits (761), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 85  AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 144

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 145 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 204

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 205 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 264

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 265 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 305


>gi|431891827|gb|ELK02361.1| Extracellular sulfatase Sulf-1 [Pteropus alecto]
          Length = 958

 Score =  297 bits (761), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 196 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 255

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 256 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 315

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 316 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 375

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 376 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 416


>gi|317418991|emb|CBN81029.1| Extracellular sulfatase Sulf-2 [Dicentrarchus labrax]
          Length = 885

 Score =  297 bits (760), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 130/217 (59%), Positives = 171/217 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGW+EW AL+ NS++YNY++  NG + KH  DY  DY  D+I N
Sbjct: 146 AFFGKYLNEYNGSYVPPGWKEWVALVKNSRFYNYTLCRNGVREKHSSDYSKDYLTDIITN 205

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ + R SK+ +  +P+M+V+S  APHGPEDSAPQY++ F N + H TP+Y+YAPNPDK
Sbjct: 206 DSINYFRMSKKMYPHRPVMMVLSHVAPHGPEDSAPQYSSAFPNASQHITPSYNYAPNPDK 265

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WIL+ T  M+PVH QFT++L  +R+QTL SVDD++EK+   L + GEL+NTY+IYTSDH
Sbjct: 266 HWILRYTGPMKPVHMQFTNMLQRRRMQTLLSVDDSMEKMYNMLMETGELDNTYLIYTSDH 325

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G +
Sbjct: 326 GYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEQGAI 362


>gi|410987255|ref|XP_003999920.1| PREDICTED: LOW QUALITY PROTEIN: extracellular sulfatase Sulf-1
           [Felis catus]
          Length = 865

 Score =  297 bits (760), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|432859487|ref|XP_004069132.1| PREDICTED: extracellular sulfatase Sulf-2-like [Oryzias latipes]
          Length = 885

 Score =  297 bits (760), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 130/212 (61%), Positives = 169/212 (79%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGW+EW AL+ NS++YNY++  NG + KH  DY  DY  D+I N
Sbjct: 146 AFFGKYLNEYNGSYVPPGWKEWVALVKNSRFYNYTLCRNGVREKHSGDYRKDYLTDIITN 205

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ + R SK+ +  +P+M+V+S  APHGPEDSAPQY++ F N + H TP+Y+YAPNPDK
Sbjct: 206 DSINYFRMSKRMYPHRPVMMVLSHVAPHGPEDSAPQYSSAFPNASQHITPSYNYAPNPDK 265

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WIL+ T  M+PVH QFT++L  +R+QTL SVDD+VEK+   L + GEL+NTYIIYTSDH
Sbjct: 266 HWILRYTGPMKPVHMQFTNMLQRRRMQTLLSVDDSVEKVYNMLLETGELDNTYIIYTSDH 325

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYH+GQFGLVKGKS P+EFDIR+PF +RGP +
Sbjct: 326 GYHIGQFGLVKGKSMPYEFDIRIPFYVRGPDV 357


>gi|432096980|gb|ELK27479.1| Extracellular sulfatase Sulf-1 [Myotis davidii]
          Length = 1137

 Score =  297 bits (760), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|395849374|ref|XP_003797303.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 1 [Otolemur
           garnettii]
 gi|395849376|ref|XP_003797304.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Otolemur
           garnettii]
          Length = 863

 Score =  296 bits (759), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCHNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PI++V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPILMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|297299552|ref|XP_002808528.1| PREDICTED: LOW QUALITY PROTEIN: extracellular sulfatase Sulf-1-like
           [Macaca mulatta]
          Length = 1042

 Score =  296 bits (759), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|359072436|ref|XP_003586945.1| PREDICTED: extracellular sulfatase Sulf-1-like [Bos taurus]
          Length = 826

 Score =  296 bits (759), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 96  AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 155

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 156 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 215

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 216 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 275

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 276 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 316


>gi|440904303|gb|ELR54836.1| Extracellular sulfatase Sulf-1, partial [Bos grunniens mutus]
          Length = 869

 Score =  296 bits (759), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 174/227 (76%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+       +PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGST------VPQI 358


>gi|51491872|ref|NP_001003833.2| extracellular sulfatase Sulf-2 precursor [Danio rerio]
 gi|37812244|gb|AAR04058.1| sulfatase FP2b [Danio rerio]
          Length = 885

 Score =  296 bits (758), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 129/215 (60%), Positives = 169/215 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNG+YIPPGWREW A++ NS++YNY++  NG + KHG +Y  DY  DLI N
Sbjct: 141 AFFGKYLNEYNGTYIPPGWREWVAMVKNSRFYNYTLCRNGVREKHGFEYPKDYLTDLITN 200

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ + R SK+ +  +P+++V+S  APHGPED+APQY   F N + H TP+Y+YAPNPDK
Sbjct: 201 DSINYFRMSKKIYPHRPVLMVISHAAPHGPEDAAPQYTTAFPNASQHITPSYNYAPNPDK 260

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  KRLQTL SVDD+VEK+   L D GEL+NTY+IYT+DH
Sbjct: 261 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLLSVDDSVEKVYNMLVDTGELDNTYVIYTADH 320

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G
Sbjct: 321 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEAG 355


>gi|37812246|gb|AAR04059.1| sulfatase FP2a [Danio rerio]
          Length = 867

 Score =  296 bits (758), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 129/215 (60%), Positives = 169/215 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNG+YIPPGWREW A++ NS++YNY++  NG + KHG +Y  DY  DLI N
Sbjct: 141 AFFGKYLNEYNGTYIPPGWREWVAMVKNSRFYNYTLCRNGVREKHGFEYPKDYLTDLITN 200

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ + R SK+ +  +P+++V+S  APHGPED+APQY   F N + H TP+Y+YAPNPDK
Sbjct: 201 DSINYFRMSKKIYPHRPVLMVISHAAPHGPEDAAPQYTTAFPNASQHITPSYNYAPNPDK 260

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  KRLQTL SVDD+VEK+   L D GEL+NTY+IYT+DH
Sbjct: 261 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLLSVDDSVEKVYNMLVDTGELDNTYVIYTADH 320

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G
Sbjct: 321 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEAG 355


>gi|260788079|ref|XP_002589078.1| hypothetical protein BRAFLDRAFT_120897 [Branchiostoma floridae]
 gi|229274252|gb|EEN45089.1| hypothetical protein BRAFLDRAFT_120897 [Branchiostoma floridae]
          Length = 922

 Score =  296 bits (758), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 136/223 (60%), Positives = 168/223 (75%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLN+YNGSY+PPGWREW   I NS++YNY +N NG K KH   Y  DY+ DLIAN
Sbjct: 147 GYFGKYLNEYNGSYVPPGWREWVGQIRNSRFYNYKVNRNGLKFKHDSSYERDYFTDLIAN 206

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DSV F + SK++F  +P+ +V++FPAPHGPED APQY++MF N T H    Y+ APN DK
Sbjct: 207 DSVSFFKLSKKTFPHRPVAMVLAFPAPHGPEDGAPQYSHMFANNTDHRIQGYNMAPNMDK 266

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WIL+ T+ +      FTD+L  KRLQTLQSVDDAVEK+   L + G+L NTY+IYTSDH
Sbjct: 267 HWILRWTQPLDNTQLAFTDMLQQKRLQTLQSVDDAVEKVYNMLIEQGQLENTYMIYTSDH 326

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
           GYHLGQFGLVKGKSFP+++D RVP+L+RGP + P T+    VL
Sbjct: 327 GYHLGQFGLVKGKSFPYDWDTRVPYLIRGPNVQPNTVMSQIVL 369


>gi|148225416|ref|NP_001088414.1| sulfatase 2 precursor [Xenopus laevis]
 gi|119850718|gb|AAI27412.1| LOC495271 protein [Xenopus laevis]
          Length = 872

 Score =  296 bits (757), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 128/217 (58%), Positives = 170/217 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNG+Y+PPGW+EW  L+ NS++YNY++  NG K KHG DY  DY  DLI N
Sbjct: 138 AFFGKYLNEYNGTYVPPGWKEWVGLVKNSRFYNYTLCRNGIKEKHGFDYAQDYLTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ F R SK+ +  +P+++V+S  APHGPEDSAPQY+ MF N + H TP+Y+YAPNPDK
Sbjct: 198 DSISFFRMSKKIYPHRPVLMVLSHAAPHGPEDSAPQYSQMFQNASQHITPSYNYAPNPDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  +RLQTL SVD+++E I   L + GEL NTY+IYT+DH
Sbjct: 258 HWIMRYTGPMKPIHMEFTNMLQRRRLQTLMSVDNSMEMIYNMLVETGELENTYVIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 318 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEAGSL 354


>gi|345306776|ref|XP_001510791.2| PREDICTED: extracellular sulfatase Sulf-1-like [Ornithorhynchus
           anatinus]
          Length = 815

 Score =  295 bits (756), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 129/217 (59%), Positives = 172/217 (79%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGWREW  L+ NS++YNY+I  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYVPPGWREWLGLVKNSRFYNYTICRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +P+M+V+S  APHGPEDSAPQ+++ F N + H TP+Y+YAPN DK
Sbjct: 198 ESISYFKMSKRMYPHRPVMMVVSHAAPHGPEDSAPQFSDRFPNASQHVTPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WILQ T  M P+H +FT++L  KRLQTL SVDD++E++   L + GEL+NTY+IYT+DH
Sbjct: 258 HWILQYTGPMLPIHMEFTNMLQRKRLQTLLSVDDSMERLFAMLLETGELDNTYVIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG++
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGSI 354


>gi|348510327|ref|XP_003442697.1| PREDICTED: extracellular sulfatase Sulf-2-like [Oreochromis
           niloticus]
          Length = 884

 Score =  295 bits (756), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 129/217 (59%), Positives = 170/217 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+PPGW+EW AL+ NS++YNY++  NG + KH  DY  DY  D+I N
Sbjct: 146 AFFGKYLNEYNGSYVPPGWKEWVALVKNSRFYNYTLCRNGVREKHSSDYTKDYLTDVITN 205

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + R SK+ +  +P+++V+S  APHGPEDSAPQY+  F N + H TP+Y+YAPNPDK
Sbjct: 206 NSIDYFRMSKRLYPHRPVLMVLSHVAPHGPEDSAPQYSAAFSNASQHITPSYNYAPNPDK 265

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WIL+ T  M+PVH QFT++L  +R+QTL SVDD+VEK+   L + GEL+NTY+IYTSDH
Sbjct: 266 HWILRYTGPMKPVHMQFTNMLQRRRMQTLLSVDDSVEKVYNMLVETGELDNTYLIYTSDH 325

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G +
Sbjct: 326 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEQGAI 362


>gi|338728266|ref|XP_003365645.1| PREDICTED: extracellular sulfatase Sulf-1 [Equus caballus]
          Length = 1218

 Score =  295 bits (754), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 132/227 (58%), Positives = 174/227 (76%), Gaps = 6/227 (2%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L +  EL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETRELENTYIIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358


>gi|147901831|ref|NP_001090848.1| sulfatase 1 precursor [Xenopus (Silurana) tropicalis]
 gi|125719330|gb|ABN54443.1| extracellular 6-O-endosulfatase [Xenopus (Silurana) tropicalis]
          Length = 884

 Score =  295 bits (754), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 131/217 (60%), Positives = 170/217 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  L+ NS++YNY++  NG K KHG +Y  DY+ DLI N
Sbjct: 137 AFFGKYLNEYNGSYIPPGWREWLGLVKNSRFYNYTMCRNGFKEKHGFEYEKDYFTDLITN 196

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++  F N + H TP+Y+YAPN DK
Sbjct: 197 DSISYFKLSKKLYPHRPIMMVISHAAPHGPEDSAPQFSEFFPNASQHITPSYNYAPNMDK 256

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD++EK+   L D GE+ NTY+IYTSDH
Sbjct: 257 HWIMQYTGAMLPIHMEFTNVLHRKRLQTLLSVDDSMEKLYNMLVDTGEVENTYLIYTSDH 316

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG++
Sbjct: 317 GYHIGQFGLVKGKSMPYDFDIRVPFFVRGPNVEPGSI 353


>gi|115530840|emb|CAL49339.1| sulfatase 1 [Xenopus (Silurana) tropicalis]
          Length = 697

 Score =  294 bits (753), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 131/217 (60%), Positives = 170/217 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  L+ NS++YNY++  NG K KHG +Y  DY+ DLI N
Sbjct: 137 AFFGKYLNEYNGSYIPPGWREWLGLVKNSRFYNYTMCRNGFKEKHGFEYEKDYFTDLITN 196

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ + + SK+ +  +PIM+V+S  APHGPEDSAPQ++  F N + H TP+Y+YAPN DK
Sbjct: 197 DSISYFKLSKKLYPHRPIMMVISHAAPHGPEDSAPQFSEFFPNASQHITPSYNYAPNMDK 256

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL SVDD++EK+   L D GE+ NTY+IYTSDH
Sbjct: 257 HWIMQYTGAMLPIHMEFTNVLHRKRLQTLLSVDDSMEKLYNMLVDTGEVENTYLIYTSDH 316

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG++
Sbjct: 317 GYHIGQFGLVKGKSMPYDFDIRVPFFVRGPNVEPGSI 353


>gi|170571849|ref|XP_001891890.1| Sulfatase family protein [Brugia malayi]
 gi|158603342|gb|EDP39296.1| Sulfatase family protein [Brugia malayi]
          Length = 595

 Score =  293 bits (751), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 136/224 (60%), Positives = 171/224 (76%), Gaps = 13/224 (5%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            YFGKYLN+Y GSYIPPGW  W  L+ NS++YNY++N+NG KIKHG DY  DY+ DLIAN
Sbjct: 124 AYFGKYLNEYIGSYIPPGWDYWMGLLRNSRFYNYTVNINGNKIKHGWDYGKDYFTDLIAN 183

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           D++ F+RQ +     KP M+V+SFPAPHGPED APQY+  F +V +H T A++YAPNPDK
Sbjct: 184 DTIAFIRQLRDKDPFKPYMIVISFPAPHGPEDPAPQYSAWFQDVETHRTEAWNYAPNPDK 243

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVE-------------KIVKELKDLG 186
           QW+LQ T +M+PVH  FTD+L  +RLQTLQSVD  ++             K++ EL+DLG
Sbjct: 244 QWLLQHTGRMEPVHVIFTDVLHRRRLQTLQSVDSNIQRVGFISYFYFPLAKLINELRDLG 303

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           +L+NT  IYTSDHGYHLGQFGLVKGK+ P+EFDIRVP+ +RGPG
Sbjct: 304 DLSNTVFIYTSDHGYHLGQFGLVKGKNMPYEFDIRVPYFIRGPG 347


>gi|348517056|ref|XP_003446051.1| PREDICTED: extracellular sulfatase Sulf-2-like [Oreochromis
           niloticus]
          Length = 870

 Score =  293 bits (750), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 129/217 (59%), Positives = 168/217 (77%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+P GWREW  L+ NS++YNY++  NG + KHG  Y  DY  DLI  
Sbjct: 134 AFFGKYLNEYNGSYVPAGWREWLGLVKNSRFYNYTLCRNGVREKHGAQYPQDYLTDLITA 193

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+R+ R SK+ +  +P+++V+S  APHGPEDSAPQY+  F N + H TP+Y+YAPNPDK
Sbjct: 194 ESIRYFRSSKRVYPHRPVLMVLSHAAPHGPEDSAPQYSTAFPNASQHITPSYNYAPNPDK 253

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  KRLQTL SVDD+VEK+   L + GEL NTYIIYTSDH
Sbjct: 254 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLLSVDDSVEKLYNMLVETGELENTYIIYTSDH 313

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 314 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEQGSI 350


>gi|54020831|ref|NP_001005661.1| sulfatase 2 precursor [Xenopus (Silurana) tropicalis]
 gi|49250556|gb|AAH74686.1| sulfatase 1 [Xenopus (Silurana) tropicalis]
          Length = 875

 Score =  291 bits (746), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 127/217 (58%), Positives = 169/217 (77%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNG+Y+PPGW+EW  L+ NS++YNY++  NG K KHG DY  DY  DLI N
Sbjct: 140 AFFGKYLNEYNGTYVPPGWKEWVGLVKNSRFYNYTLCRNGIKEKHGFDYAQDYLTDLITN 199

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ F R SK+ +  +P+++V+S  APHGPEDSAPQY+ MF N + H TP+Y+YAPNPDK
Sbjct: 200 NSINFFRTSKKIYPHRPVLMVLSHVAPHGPEDSAPQYSQMFPNASQHITPSYNYAPNPDK 259

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  +RLQTL SVD ++E I   L + GE+ NTYIIYT+DH
Sbjct: 260 HWIMRYTGPMKPIHMEFTNMLQRRRLQTLMSVDSSMEMIYNMLVETGEVENTYIIYTADH 319

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 320 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVDAGSL 356


>gi|47939687|gb|AAH72073.1| LOC432214 protein, partial [Xenopus laevis]
          Length = 679

 Score =  290 bits (742), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 126/217 (58%), Positives = 169/217 (77%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNG+Y+PPGW+EW  L+ NS++YNY++  NG K KHG DY  DY  DLI N
Sbjct: 138 AFFGKYLNEYNGTYVPPGWKEWVGLVKNSRFYNYTLCRNGIKEKHGFDYAQDYLTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ F R SK+ +  +P+++V+S  APHGPEDSAPQY+ MF N + H TP+Y+YAPN DK
Sbjct: 198 DSISFFRTSKKVYPHRPVLMVLSHVAPHGPEDSAPQYSQMFPNASQHITPSYNYAPNLDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  +RLQTL SVD+++E +   L + GEL NTY+IYT+DH
Sbjct: 258 HWIMRYTGPMRPIHMEFTNMLQRRRLQTLMSVDNSMEMVYNMLVETGELENTYVIYTADH 317

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 318 GYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVDAGSL 354


>gi|449265602|gb|EMC76771.1| Extracellular sulfatase Sulf-1, partial [Columba livia]
          Length = 856

 Score =  288 bits (737), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 131/227 (57%), Positives = 174/227 (76%), Gaps = 8/227 (3%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  L+ NS++YNY+I+ NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWVGLVKNSRFYNYTISRNGNKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + R SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+Q T  M P+H +FT++L  KRLQTL S   AV ++ + L ++GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLFSY--AVLQLYQMLAEMGELENTYIIYTADH 315

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 316 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 356


>gi|432959714|ref|XP_004086377.1| PREDICTED: extracellular sulfatase Sulf-2-like [Oryzias latipes]
          Length = 938

 Score =  288 bits (736), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 128/215 (59%), Positives = 166/215 (77%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSY+P GWREW  L+ NS++YNY+++ NG + KHG  Y  DY  DLI  
Sbjct: 134 AFFGKYLNEYNGSYVPNGWREWLGLVKNSRFYNYTLSRNGVREKHGARYPEDYLTDLITE 193

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +SVR+ R SK+ +  +P+++V+S  APHGPEDSAPQY+  F N + H TP+Y+YAPN DK
Sbjct: 194 ESVRYFRSSKRVYPHRPVLMVLSHAAPHGPEDSAPQYSEAFPNASHHITPSYNYAPNLDK 253

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++ T  M+P+H +FT++L  KRLQTL SVDD++EK+   L + GEL NTYIIYTSDH
Sbjct: 254 HWIMRYTGPMKPIHMEFTNVLQRKRLQTLLSVDDSMEKLYNMLTETGELENTYIIYTSDH 313

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           GYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G
Sbjct: 314 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEQG 348


>gi|268578729|ref|XP_002644347.1| C. briggsae CBR-SUL-1 protein [Caenorhabditis briggsae]
          Length = 681

 Score =  288 bits (736), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 129/212 (60%), Positives = 167/212 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            Y GKYLN+Y+GSYIPPGW EW A++ NSK+YNYS+N NG + K G +Y  DY+ DL+ N
Sbjct: 101 AYLGKYLNEYDGSYIPPGWDEWHAIVKNSKFYNYSMNSNGEREKFGFEYEKDYFTDLVTN 160

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
            S+RF+ +  ++ + +P  L++S+PAPHGPED APQ+A+MF N  SH T ++++APNPDK
Sbjct: 161 RSLRFIEKHIKTRAWQPFALIISYPAPHGPEDPAPQFAHMFENEFSHRTGSWNFAPNPDK 220

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QW+LQ T KM  VH  FTDLL  +RLQTLQSVD+ +E++   L++L +L NTY IYTSDH
Sbjct: 221 QWLLQRTGKMNEVHISFTDLLHRRRLQTLQSVDEGIERLFNLLRELNQLWNTYAIYTSDH 280

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGK+ P+EFDIRVPF MRGPGI
Sbjct: 281 GYHLGQFGLLKGKNMPYEFDIRVPFFMRGPGI 312


>gi|341873966|gb|EGT29901.1| CBN-SUL-1 protein [Caenorhabditis brenneri]
          Length = 691

 Score =  285 bits (730), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 129/218 (59%), Positives = 168/218 (77%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            Y GKYLN+Y+GSYIPPGW EW A++ NSK+YNY++N NG + K G +Y  DY+ DL+ N
Sbjct: 123 AYLGKYLNEYDGSYIPPGWDEWHAIVKNSKFYNYTMNSNGEREKFGFEYEKDYFTDLVTN 182

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
            S+RF+ +  ++ + +P  L++S+PAPHGPED APQ+A+MF N  SH T ++++APNPDK
Sbjct: 183 RSLRFIEKHIKTRAWQPFALIISYPAPHGPEDPAPQFAHMFENEFSHRTGSWNFAPNPDK 242

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QW+LQ T KM  VH  FTDLL  +RLQTLQSVDD + ++   L++L +L NTY IYTSDH
Sbjct: 243 QWLLQRTGKMNDVHITFTDLLHRRRLQTLQSVDDGLYRLFNLLRELNQLWNTYAIYTSDH 302

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY 237
           GYHLGQFGL+KGK+ P+EFDIRVPF MRGPGI   T +
Sbjct: 303 GYHLGQFGLLKGKNMPYEFDIRVPFFMRGPGIPRNTSF 340


>gi|17568795|ref|NP_508560.1| Protein SUL-1 [Caenorhabditis elegans]
 gi|33112442|sp|Q21376.1|SULF1_CAEEL RecName: Full=Putative extracellular sulfatase Sulf-1 homolog;
           Short=CeSulf-1; Flags: Precursor
 gi|351061478|emb|CCD69246.1| Protein SUL-1 [Caenorhabditis elegans]
          Length = 709

 Score =  285 bits (728), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 127/212 (59%), Positives = 166/212 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            Y GKYLN+Y+GSYIPPGW EW A++ NSK+YNY++N NG + K G +Y  DY+ DL+ N
Sbjct: 130 AYLGKYLNEYDGSYIPPGWDEWHAIVKNSKFYNYTMNSNGEREKFGSEYEKDYFTDLVTN 189

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
            S++F+ +  +  + +P  L++S+PAPHGPED APQ+A+MF N  SH T ++++APNPDK
Sbjct: 190 RSLKFIDKHIKIRAWQPFALIISYPAPHGPEDPAPQFAHMFENEISHRTGSWNFAPNPDK 249

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           QW+LQ T KM  VH  FTDLL  +RLQTLQSVD+ +E++   L++L +L NTY IYTSDH
Sbjct: 250 QWLLQRTGKMNDVHISFTDLLHRRRLQTLQSVDEGIERLFNLLRELNQLWNTYAIYTSDH 309

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQFGL+KGK+ P+EFDIRVPF MRGPGI
Sbjct: 310 GYHLGQFGLLKGKNMPYEFDIRVPFFMRGPGI 341


>gi|198427333|ref|XP_002120498.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 1001

 Score =  280 bits (715), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 131/240 (54%), Positives = 171/240 (71%), Gaps = 7/240 (2%)

Query: 1   MAWPVELTSRNLLYF----DISHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSIN 56
           +AW V+  SR+   +      S  YFGKYLN+YNGSYIP GW+ W  L+ NS+YYNYS+ 
Sbjct: 139 IAWQVKHESRSFATYLNKTGYSTAYFGKYLNEYNGSYIPQGWQYWMGLVRNSRYYNYSLR 198

Query: 57  MNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY 116
            N  K  H ++Y +DY+ DLI N S+ + R+ K      PI+ V+SFPAPHG ED APQY
Sbjct: 199 HNDVKESHRDNYRDDYFTDLIVNRSMTYFRRKKHEEPDSPILSVLSFPAPHGSEDGAPQY 258

Query: 117 ANMFFNVTSHHTPAYDYAPNPDKQWILQVTRK--MQPVHRQFTDLLMTKRLQTLQSVDDA 174
            +++ NVTSH TP++DY PNPDK WI+  +RK  M     +F+ +L  KRLQTL+SVDDA
Sbjct: 259 QHLYANVTSHITPSFDYGPNPDKHWIIS-SRKVPMDETQHRFSSILQQKRLQTLRSVDDA 317

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           V++ V  L+D GEL+NTY++YTSDHG+H+GQFGL KGKS P++FD+RVP  MRGPGI  G
Sbjct: 318 VDRFVSMLQDTGELDNTYLLYTSDHGFHIGQFGLAKGKSMPYDFDVRVPLFMRGPGIQAG 377


>gi|284424942|dbj|BAI67117.1| heparan sulfate 6-O endosulfatase [Hemicentrotus pulcherrimus]
          Length = 983

 Score =  279 bits (713), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 125/215 (58%), Positives = 163/215 (75%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLN+Y G YIPPGW EW  L+ NSK+YNY++  NG+ ++HG +Y+ DY PDLIAN
Sbjct: 158 GYFGKYLNEYRGDYIPPGWVEWVGLVKNSKFYNYTLCRNGQLMRHGFNYHRDYLPDLIAN 217

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           DS+ F + SK+++ + P+ +V+SFP+PHGPED APQY +MF     H TPA++Y PN DK
Sbjct: 218 DSMAFFKLSKRAYPRLPVAMVLSFPSPHGPEDGAPQYQDMFEGNKLHRTPAWNYGPNEDK 277

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++   +M  + R FTD L  KRL TL SVDDA+EK+ +EL+ LGEL+NTYI+ TSDH
Sbjct: 278 HWIIRQASRMTKLERDFTDALQRKRLITLLSVDDAIEKLCRELEALGELDNTYILMTSDH 337

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           GYHLGQF ++KGKS  ++ DIRVPF +RGP I  G
Sbjct: 338 GYHLGQFNMLKGKSNAYDTDIRVPFYIRGPDIPHG 372


>gi|308512451|ref|XP_003118408.1| CRE-SUL-1 protein [Caenorhabditis remanei]
 gi|308239054|gb|EFO83006.1| CRE-SUL-1 protein [Caenorhabditis remanei]
          Length = 680

 Score =  276 bits (707), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 126/227 (55%), Positives = 167/227 (73%), Gaps = 15/227 (6%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDY----------- 68
            Y GKYLN+Y+GSYIPPGW EW A++ NSK+YNY++N NG + + G DY           
Sbjct: 124 AYLGKYLNEYDGSYIPPGWDEWHAIVKNSKFYNYTMNSNGERERFGSDYEKVRFRFWKNI 183

Query: 69  ----YNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
               + DY+ DL+ N S++F+ +  ++ + +P  L++S+PAPHGPED APQ+A+MF N  
Sbjct: 184 LILRFQDYFTDLVTNRSLKFIEKHIKTRAWQPFALIISYPAPHGPEDPAPQFAHMFENEL 243

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
           SH T ++++APNPDKQW+LQ T KM  VH  FTDLL  +RLQTLQSVDD + ++   L++
Sbjct: 244 SHRTGSWNFAPNPDKQWLLQRTGKMNDVHITFTDLLHRRRLQTLQSVDDGIARLFNLLRE 303

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           L +L NT+ IYTSDHGYHLGQFGL+KGK+ P+EFDIRVPF MRGPGI
Sbjct: 304 LNQLWNTFAIYTSDHGYHLGQFGLLKGKNMPYEFDIRVPFFMRGPGI 350


>gi|47211781|emb|CAF94091.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 879

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 131/273 (47%), Positives = 180/273 (65%), Gaps = 37/273 (13%)

Query: 1   MAWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSIN 56
           M+W  +   R    +  + GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY+++
Sbjct: 111 MSWQRQHEPRTFGVYLNNTGYRTAFFGKYLNEYNGSYVPPGWKEWLGLVKNSRFYNYTLS 170

Query: 57  MNGRKIKHGEDY-------YNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
            NG + KHG +        + DY  DLI  +S+R+ R SK+ +  +P+++V+S  APHGP
Sbjct: 171 RNGFREKHGAECVCMCVCVFQDYLTDLITAESMRYFRYSKRVYPHRPVLMVLSHAAPHGP 230

Query: 110 EDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQ 169
           EDSAPQY+  F N + H TP+Y+YAPNPDK WI++    M+P+H +FT++L  KRLQTL 
Sbjct: 231 EDSAPQYSTAFQNASQHITPSYNYAPNPDKHWIMRYIGPMKPIHMEFTNMLQRKRLQTLL 290

Query: 170 SVDDAVEK--------------------------IVKELKDLGELNNTYIIYTSDHGYHL 203
           SVDD++EK                          +   L ++GEL+NTYIIYTSDHGYH+
Sbjct: 291 SVDDSMEKAGPADQDASPPGALLRLLTPSVSVLQLYNMLVEMGELDNTYIIYTSDHGYHI 350

Query: 204 GQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 351 GQFGLVKGKSMPYEFDIRVPFFIRGPNVEQGSI 383


>gi|195997457|ref|XP_002108597.1| hypothetical protein TRIADDRAFT_18386 [Trichoplax adhaerens]
 gi|190589373|gb|EDV29395.1| hypothetical protein TRIADDRAFT_18386 [Trichoplax adhaerens]
          Length = 655

 Score =  249 bits (637), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 116/212 (54%), Positives = 151/212 (71%), Gaps = 1/212 (0%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           G+FGKYLN+Y+GSYIP GW  W   + NS+YYNY++N NG+ +KHG     DY+ DLI  
Sbjct: 125 GFFGKYLNEYDGSYIPVGWDHWIGQVKNSRYYNYTLNRNGKLVKHGNRCQQDYFTDLIVR 184

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+RFL+++ + + KKPI++V+S PAPHG EDSAPQY N +       TP Y+Y  + DK
Sbjct: 185 ESIRFLKKTTEKYPKKPILMVLSMPAPHGTEDSAPQYQNRYARAKVPRTPNYNYH-SFDK 243

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI+++   M  V   FTDLL  KRLQTL SVDDAVEK+ + ++ L     TY+ +TSDH
Sbjct: 244 HWIVRIAPPMSSVKMNFTDLLYRKRLQTLLSVDDAVEKLYRSIRKLDLKEKTYLFFTSDH 303

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLG F +VKGKS P+E DIRVPFL+RGP I
Sbjct: 304 GYHLGHFSIVKGKSMPYETDIRVPFLVRGPMI 335


>gi|47213980|emb|CAG00671.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1160

 Score =  248 bits (632), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 125/257 (48%), Positives = 166/257 (64%), Gaps = 22/257 (8%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +   R+   +  + GY    FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  
Sbjct: 115 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 174

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG +Y  DY+ DLI NDS+ F R SK+ F  +P+M+V+S  APHGPEDSAPQYA
Sbjct: 175 NGYKEKHGAEYAKDYFTDLITNDSINFFRISKRMFPHRPVMMVISHAAPHGPEDSAPQYA 234

Query: 118 NMFFNVTSHHTPAYDYAPN------PDKQWILQVTRK------------MQPVHRQFTDL 159
           + F N + H  P +    +      P K   LQ+  K            M+P+  +FT+ 
Sbjct: 235 DHFPNASQHIPPDWWVKSSSISHSRPRKHAQLQLRSKYGQTLDHAIPGPMRPIPMEFTNF 294

Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
              KRLQTL +   +  K+   L++ GEL+NTYI+YT+DHGYH+GQFGLVKGKS P++FD
Sbjct: 295 PHRKRLQTLMAYHYSAPKVYDMLEETGELDNTYIMYTADHGYHIGQFGLVKGKSMPYDFD 354

Query: 220 IRVPFLMRGPGIVPGTM 236
           IRVPF +RGP + PG +
Sbjct: 355 IRVPFFLRGPNVEPGAV 371


>gi|449674493|ref|XP_002155025.2| PREDICTED: extracellular sulfatase Sulf-1-like [Hydra
           magnipapillata]
          Length = 497

 Score =  239 bits (610), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 111/215 (51%), Positives = 146/215 (67%), Gaps = 1/215 (0%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            YFGKYLN YNG  +PPGW  W  L+ NS++YNYS+N+NG+ +K+  DY  DY+ D++ N
Sbjct: 134 AYFGKYLNNYNGDRVPPGWNHWSGLVRNSRFYNYSLNVNGKIVKYKNDYKKDYFTDVVTN 193

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
            S+ + +  K      PIM+V+S  APHGPEDSAPQY + F ++ +   P Y+Y  + DK
Sbjct: 194 ASIDYFKNVKSQTLSAPIMMVLSMSAPHGPEDSAPQYQDSFKDIKAPRHPNYNYT-SSDK 252

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            W+++ T  +      FTD+L  KRLQTL SVDDA+EK+   L  +G L+ TYII+TSDH
Sbjct: 253 HWVIRQTPPLDETTALFTDILYRKRLQTLLSVDDAIEKVYNMLLQVGLLDETYIIFTSDH 312

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           GYHLGQF  VKGKS P+E DIR+P  MRGP I  G
Sbjct: 313 GYHLGQFNQVKGKSQPYESDIRIPLYMRGPHIPKG 347


>gi|324535141|gb|ADY49408.1| Extracellular sulfatase Sulf-1, partial [Ascaris suum]
          Length = 170

 Score =  236 bits (603), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 107/169 (63%), Positives = 138/169 (81%)

Query: 43  ALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMS 102
           ALI NS++YNY+IN NG KIKHG DY  DY+ DLIAND++ F+ + ++    KP + V+S
Sbjct: 2   ALIRNSRFYNYTINFNGDKIKHGFDYEKDYFTDLIANDTIAFIHRHREREPTKPFLAVLS 61

Query: 103 FPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMT 162
           FPAPHGPED APQ+A+ F +V +H T A++YAPNPDKQW+LQ T +M+P+H  FTD+L  
Sbjct: 62  FPAPHGPEDPAPQFADWFEDVETHRTEAWNYAPNPDKQWLLQHTGRMEPIHVVFTDVLHR 121

Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
           +RLQTLQSVD +V+++  EL+DLGEL+NT+IIYTSDHGYHLGQFGLVKG
Sbjct: 122 RRLQTLQSVDHSVQRLANELRDLGELSNTFIIYTSDHGYHLGQFGLVKG 170


>gi|156364294|ref|XP_001626284.1| predicted protein [Nematostella vectensis]
 gi|156213155|gb|EDO34184.1| predicted protein [Nematostella vectensis]
          Length = 369

 Score =  236 bits (603), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 113/212 (53%), Positives = 148/212 (69%), Gaps = 1/212 (0%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLN Y+GSYIP GW  W  LI NS++YNY +  N    KH  +Y NDY+ ++I +
Sbjct: 89  GYFGKYLNAYDGSYIPYGWHRWAGLIRNSRFYNYVLRHNTFYKKHQNNYENDYFTNVITD 148

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
            ++ F + SK     +PIM+V+S  APHGPED APQY + F N T+  TP +++  +PDK
Sbjct: 149 HAISFFKNSKARKPDRPIMMVVSHSAPHGPEDGAPQYQDHFPNATAPRTPNWNFT-SPDK 207

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            WI++VT  M      F+D+L  KRLQTL SVDD +++I   LK+ G   +TYI ++SDH
Sbjct: 208 HWIVRVTPPMTQQKSDFSDMLQRKRLQTLLSVDDNIQRIWDMLKETGYDRDTYIFFSSDH 267

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GYHLGQ+GLVKGKS P+E DIRVPF +RGPGI
Sbjct: 268 GYHLGQYGLVKGKSMPYESDIRVPFFVRGPGI 299


>gi|402882321|ref|XP_003904694.1| PREDICTED: extracellular sulfatase Sulf-2 [Papio anubis]
          Length = 1165

 Score =  227 bits (579), Expect = 4e-57,   Method: Composition-based stats.
 Identities = 99/183 (54%), Positives = 132/183 (72%), Gaps = 4/183 (2%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 448 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 507

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG K KHG DY  DY  DLI NDSV F R SK+ +  +P+++V+S  APHGPEDSAPQY+
Sbjct: 508 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 567

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F N + H TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E 
Sbjct: 568 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNILQRKRLQTLMSVDDSMET 627

Query: 178 IVK 180
            V+
Sbjct: 628 QVR 630


>gi|307174994|gb|EFN65195.1| Extracellular sulfatase SULF-1-like protein [Camponotus floridanus]
          Length = 1148

 Score =  213 bits (543), Expect = 5e-53,   Method: Composition-based stats.
 Identities = 93/109 (85%), Positives = 106/109 (97%)

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
           TPAYDYAPNPDKQWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVD AV++I +ELKDLGE
Sbjct: 23  TPAYDYAPNPDKQWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDAAVDRIYQELKDLGE 82

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           L+NTYIIYTSDHGYHLGQFGL+KGKSFPFEFD+RVPFL+RGPG+ PG++
Sbjct: 83  LDNTYIIYTSDHGYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGVEPGSI 131


>gi|312374901|gb|EFR22368.1| hypothetical protein AND_15355 [Anopheles darlingi]
          Length = 1369

 Score =  207 bits (526), Expect = 5e-51,   Method: Composition-based stats.
 Identities = 92/119 (77%), Positives = 105/119 (88%)

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
           N  F      TPAYD+APNPDKQWIL+VT+ M+P+HR+FTDLLMTKRLQTLQSVD AVE+
Sbjct: 38  NGSFFTFQQSTPAYDHAPNPDKQWILRVTQPMEPIHRKFTDLLMTKRLQTLQSVDVAVER 97

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           + +ELK LGEL+NTYIIYTSDHGYHLGQFGL+KGKSFPFEFD+RVPFLMRGPGI P T+
Sbjct: 98  VYQELKALGELDNTYIIYTSDHGYHLGQFGLIKGKSFPFEFDVRVPFLMRGPGIEPATI 156


>gi|170065419|ref|XP_001867931.1| sulfatase-1 [Culex quinquefasciatus]
 gi|167882509|gb|EDS45892.1| sulfatase-1 [Culex quinquefasciatus]
          Length = 1090

 Score =  206 bits (524), Expect = 1e-50,   Method: Composition-based stats.
 Identities = 88/111 (79%), Positives = 105/111 (94%)

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
           TPAYD+APNPDKQWIL+VT+ M+P+HR+FTDLLMTKRLQTLQSVD AVE++ +ELK LGE
Sbjct: 3   TPAYDHAPNPDKQWILRVTKPMEPIHRKFTDLLMTKRLQTLQSVDVAVERVYQELKALGE 62

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
           L+NTYIIYTSDHGYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI P ++++
Sbjct: 63  LDNTYIIYTSDHGYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGIEPASVFV 113


>gi|170065421|ref|XP_001867932.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167882510|gb|EDS45893.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 376

 Score =  202 bits (514), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 95/140 (67%), Positives = 108/140 (77%), Gaps = 1/140 (0%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSINMNG+KIKHG DY  DYYPDLIAN
Sbjct: 206 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINMNGQKIKHGFDYGKDYYPDLIAN 265

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHT-PAYDYAPNPD 138
           DS+ FLRQSK    +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HH    Y  + N  
Sbjct: 266 DSIAFLRQSKHQNHRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHCLTLYRVSDNLS 325

Query: 139 KQWILQVTRKMQPVHRQFTD 158
           +  I   +   Q    + TD
Sbjct: 326 RDRIQMTSVTTQQTQGRETD 345


>gi|312374902|gb|EFR22369.1| hypothetical protein AND_15356 [Anopheles darlingi]
          Length = 358

 Score =  199 bits (506), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 88/107 (82%), Positives = 97/107 (90%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSINMNG+KIKHG DY  DYYPDLIAN
Sbjct: 251 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINMNGQKIKHGFDYAKDYYPDLIAN 310

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSH 126
           DS+ FLRQSK    +KP++L MSFPAPHGPEDSAPQY+++FFNVT+H
Sbjct: 311 DSIAFLRQSKHQNHRKPVLLAMSFPAPHGPEDSAPQYSHLFFNVTTH 357


>gi|339245873|ref|XP_003374570.1| extracellular sulfatase Sulf-1 [Trichinella spiralis]
 gi|316972167|gb|EFV55855.1| extracellular sulfatase Sulf-1 [Trichinella spiralis]
          Length = 645

 Score =  199 bits (505), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 92/159 (57%), Positives = 119/159 (74%), Gaps = 8/159 (5%)

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
           DY+ DLI NDS+ F ++S   F +KPI+LV+SFPAPHGPED AP+Y + F N T+H T +
Sbjct: 23  DYFTDLIVNDSINFFKRSTLLFPQKPILLVLSFPAPHGPEDPAPKYNDFFANNTAHRTAS 82

Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV--------KEL 182
           ++YAPNPDKQW+L+ T KM+PVH  FTD L  +RLQTL SVDDAV K+         KE 
Sbjct: 83  WNYAPNPDKQWLLRHTGKMEPVHVAFTDFLQRRRLQTLLSVDDAVYKVTPVVLYHLYKEF 142

Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
           +  G L+NTY +YTSDHGYHLGQ+GL+KGK+ P++ DI+
Sbjct: 143 RQFGLLDNTYFLYTSDHGYHLGQYGLIKGKTLPYDCDIK 181


>gi|291190064|ref|NP_001167172.1| N-acetylglucosamine-6-sulfatase precursor [Salmo salar]
 gi|223648450|gb|ACN10983.1| N-acetylglucosamine-6-sulfatase precursor [Salmo salar]
          Length = 518

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 109/239 (45%), Positives = 149/239 (62%), Gaps = 16/239 (6%)

Query: 2   AWPVELTSRNLLYFDISHGYFGKYLNKY------NGSYIPPGWREWGALIMNSKYYNYSI 55
           A+PV L+ ++   F     Y GKYLN+Y      N  Y+PPGW +W AL+ NS+YYNY++
Sbjct: 127 AFPVYLSKQHYQTF-----YAGKYLNQYGKKETGNVGYVPPGWDQWHALMGNSQYYNYTL 181

Query: 56  NMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ 115
           ++NG+  +HG++Y  DY  DLI N S+ FL        ++P  +++S PAPH P  +APQ
Sbjct: 182 SVNGKAEQHGDNYEKDYLTDLILNRSLHFLDDRS---PQRPFFIMLSPPAPHSPWTAAPQ 238

Query: 116 YANMFFNVTSHHTPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
           Y   F +V +    ++D  P  DK W+L Q +  M      F D    KR QTL SVDD 
Sbjct: 239 YEKNFADVKAPRDGSFD-KPGKDKHWLLRQPSNPMPSTSLNFLDNAYRKRWQTLLSVDDM 297

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           VE +VK+L++L ELNNTYI YTSD+GYH GQF L   K   ++FDIR+P ++RGPGI P
Sbjct: 298 VEVLVKKLEELKELNNTYIFYTSDNGYHTGQFSLPIDKRQLYDFDIRIPLMVRGPGIKP 356


>gi|157118710|ref|XP_001653223.1| sulfatase-1, sulf-1 [Aedes aegypti]
 gi|108875645|gb|EAT39870.1| AAEL008372-PA [Aedes aegypti]
          Length = 250

 Score =  197 bits (500), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 88/107 (82%), Positives = 97/107 (90%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSINMNG+KIKHG DY  DYYPDLIAN
Sbjct: 143 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINMNGQKIKHGFDYAKDYYPDLIAN 202

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSH 126
           DS+ FLRQSK    +KP++L MSFPAPHGPEDSAPQY+++FFNVT+H
Sbjct: 203 DSIAFLRQSKHQNHRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTH 249


>gi|157118712|ref|XP_001653224.1| sulfatase-1, sulf-1 [Aedes aegypti]
 gi|108875646|gb|EAT39871.1| AAEL008362-PA, partial [Aedes aegypti]
          Length = 782

 Score =  197 bits (500), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 88/109 (80%), Positives = 101/109 (92%)

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
           TPAYD APNPDKQWIL+VT+ M+P+HR+FTDLLMTKRLQTLQSVD AVE++ +ELK L E
Sbjct: 1   TPAYDLAPNPDKQWILRVTQPMEPIHRKFTDLLMTKRLQTLQSVDVAVERVYQELKALDE 60

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           L+NTYIIYTSDHGYHLGQFGL+KGKSFPFEFD+RVPFLMRGPGI P ++
Sbjct: 61  LDNTYIIYTSDHGYHLGQFGLIKGKSFPFEFDVRVPFLMRGPGIEPASV 109


>gi|410903982|ref|XP_003965472.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Takifugu rubripes]
          Length = 520

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 106/229 (46%), Positives = 139/229 (60%), Gaps = 11/229 (4%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         +IPPGW  W AL+ NS+YYNYS+++NG++ KHG++Y +DY  
Sbjct: 143 FAGKYLNQYGQKDAGHVGHIPPGWDHWHALVGNSQYYNYSLSVNGKEEKHGDNYGDDYLT 202

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           DLI N S+ FL          P  L++S PAPH P  +APQ+   + ++ +    ++D  
Sbjct: 203 DLITNRSLTFLDNRSPQL---PFFLLLSPPAPHAPWTAAPQHQKDYADIKAPRDGSFD-K 258

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
           P  DK W+L Q    M      + D    KR QTL SVDD VEK+V++LK + ELNNTYI
Sbjct: 259 PGKDKHWLLRQPANPMTASSLTYLDNAYRKRWQTLASVDDMVEKLVEKLKSIDELNNTYI 318

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
            YTSD+GYH GQF L   K   +EFDIRVP L+RGPGI P       VL
Sbjct: 319 FYTSDNGYHTGQFSLPIDKRQLYEFDIRVPLLVRGPGIAPNQTVEAPVL 367


>gi|432884623|ref|XP_004074510.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Oryzias latipes]
          Length = 513

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 106/229 (46%), Positives = 142/229 (62%), Gaps = 11/229 (4%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y G      ++IPPGW +W AL+ NS+YYNY++++NG++ KHG++Y  DY  
Sbjct: 136 FAGKYLNQYGGKDAGGVAHIPPGWDQWNALVGNSQYYNYTLSVNGKEEKHGDNYEKDYLT 195

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           DLI N S+ FL        + P  +++S PAPH P  +APQY   F NV +    ++D  
Sbjct: 196 DLILNRSLHFLDDRS---PQHPFFIMLSPPAPHSPWTAAPQYGKEFSNVKAPRDGSFD-K 251

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
           P  DK W+L Q    M      + D    +R QTL SVDD VEK+VK+L+ + ELN+TYI
Sbjct: 252 PGKDKHWLLRQPPNPMSNSSLVYLDDAYRRRWQTLLSVDDMVEKLVKKLESVKELNDTYI 311

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
            YTSD+GYH GQF L   K   ++FDIRVP L+RGPGI P       VL
Sbjct: 312 FYTSDNGYHTGQFSLPIDKRQLYDFDIRVPLLVRGPGIKPNQTLQAPVL 360


>gi|156408455|ref|XP_001641872.1| predicted protein [Nematostella vectensis]
 gi|156229012|gb|EDO49809.1| predicted protein [Nematostella vectensis]
          Length = 506

 Score =  195 bits (495), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 103/218 (47%), Positives = 135/218 (61%), Gaps = 9/218 (4%)

Query: 21  YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLNKY  S      Y+PPGW  W  L+ NSKYYNY++++NG+  KHG DY  DY+ 
Sbjct: 131 YAGKYLNKYGMSATGGTQYVPPGWDSWLGLVGNSKYYNYTVSVNGKAEKHGGDYKEDYFT 190

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+I   +V FL +  QS SK+P  ++++ PA HGP   APQY N F   T+  T +Y+  
Sbjct: 191 DVITRSAVSFLEE--QSSSKQPFFIMLATPACHGPFTPAPQYNNSFNKTTAPRTQSYNKP 248

Query: 135 PNPDKQWILQVT-RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
               K W+L+ T   M      ++D +   R +TL SVDD VE++V  L     L+NTYI
Sbjct: 249 GGKSKHWLLRSTPSPMSASSVAYSDDVFRNRWRTLLSVDDLVEQVVVTLDKHKHLSNTYI 308

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           I+TSD+GYH GQF L   K   +EFDIRVP L+RGPGI
Sbjct: 309 IFTSDNGYHHGQFSLPIDKRQLYEFDIRVPLLVRGPGI 346


>gi|198427210|ref|XP_002128609.1| PREDICTED: similar to N-acetylglucosamine-6-sulfatase precursor
           (G6S) (Glucosamine-6-sulfatase) [Ciona intestinalis]
          Length = 545

 Score =  195 bits (495), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 95/218 (43%), Positives = 142/218 (65%), Gaps = 9/218 (4%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y G       ++P GW +W  L+ NSKYYNY+I+ NG  ++HG +Y+ DY  
Sbjct: 127 FAGKYLNQYGGKSVGGPQHVPVGWNQWFGLVGNSKYYNYTISDNGVPVQHGANYHEDYLT 186

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           DL+AN SV F+   K  +++ P  +++S PAPH P DSAPQY+ M+ N T+ HTP+Y+  
Sbjct: 187 DLLANRSVDFIHNHKMRYTQ-PFFMMISTPAPHSPWDSAPQYSKMYENNTAPHTPSYN-T 244

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q T  M      ++D     R + L+SVDD VE+++  L  + +L+NTY+
Sbjct: 245 KAVNKHWLVRQATHPMTKESMDYSDNAFRSRWRALKSVDDLVERVINALSKMKQLDNTYV 304

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
            ++SD+GYHLGQF L   K   +EFD+RVP ++RGPG+
Sbjct: 305 FFSSDNGYHLGQFSLPIDKRQLYEFDLRVPLIIRGPGV 342


>gi|402591933|gb|EJW85862.1| hypothetical protein WUBG_03229 [Wuchereria bancrofti]
          Length = 256

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 86/150 (57%), Positives = 112/150 (74%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            YFGKYLN+Y GSYIPPGW  W  L+ NS++YNY++N+NG KIKHG DY  DY+ DLIAN
Sbjct: 106 AYFGKYLNEYTGSYIPPGWDYWMGLLRNSRFYNYTVNVNGNKIKHGWDYEKDYFTDLIAN 165

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           D++ F+RQ +     KP M+V+SFPAPHGPED APQY+  F +V +H T A++YAPNPDK
Sbjct: 166 DTIAFIRQLRSKDPFKPYMIVISFPAPHGPEDPAPQYSAWFQDVETHRTEAWNYAPNPDK 225

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQ 169
           QW+LQ T +M+PVH  FTD+ +    +  +
Sbjct: 226 QWLLQHTGRMEPVHVIFTDVYIDDDCKHCK 255


>gi|41054147|ref|NP_956135.1| glucosamine (N-acetyl)-6-sulfatase b precursor [Danio rerio]
 gi|28277517|gb|AAH45323.1| Glucosamine (N-acetyl)-6-sulfatase (Sanfilippo disease IIID), b
           [Danio rerio]
 gi|182891244|gb|AAI64157.1| Gnsb protein [Danio rerio]
          Length = 347

 Score =  191 bits (485), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 99/218 (45%), Positives = 136/218 (62%), Gaps = 11/218 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN+Y        +++PPGW +W AL+ NSKYYNY++++NG++ KHG+ Y  DY  
Sbjct: 131 YCGKYLNQYGSKDAGGVAHVPPGWDQWHALVGNSKYYNYTLSVNGKEEKHGDSYEKDYLT 190

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           DL+ N S+ FL +   S    P  +++  PAPH P  +APQY+  F  V +    +++  
Sbjct: 191 DLVLNRSLHFLEERSPS---HPFFMMLCPPAPHSPWTAAPQYSGSFSGVKAPRNGSFN-K 246

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
           P  DK W+L Q    M      + D    +R QTL SVDD VE+++K+L  + EL+NTYI
Sbjct: 247 PGTDKHWLLRQPANPMPNSSIDYLDNAFRRRWQTLLSVDDLVERLLKKLDSVKELDNTYI 306

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
            YTSDHGYH GQF L   K   +EFDIR+P L+RGPGI
Sbjct: 307 FYTSDHGYHTGQFSLPIDKRQLYEFDIRIPLLVRGPGI 344


>gi|326911518|ref|XP_003202105.1| PREDICTED: n-acetylglucosamine-6-sulfatase-like [Meleagris
           gallopavo]
          Length = 488

 Score =  191 bits (485), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 105/220 (47%), Positives = 143/220 (65%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y        S++PPGW  W AL  NSKYYNY++++NG+  +HGE+Y  DY  
Sbjct: 80  FAGKYLNEYGAEDAGGVSHVPPGWSFWYALEKNSKYYNYTLSVNGKARRHGENYSVDYLT 139

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL + K +F  +P  +++S PAPH P  +APQY N F NV++     ++  
Sbjct: 140 DVLANMSLDFL-EYKSNF--EPFFMMISTPAPHSPWTAAPQYKNSFQNVSAPRNSNFNIH 196

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK+L++ GEL+NTYI
Sbjct: 197 -GKNKHWLIRQAKTPMTNSSIQFLDDAYRKRWQTLLSVDDLVEKLVKKLENHGELDNTYI 255

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+G+H GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 256 FYTSDNGFHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 295


>gi|47228544|emb|CAG05364.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 473

 Score =  191 bits (485), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 108/239 (45%), Positives = 143/239 (59%), Gaps = 16/239 (6%)

Query: 2   AWPVELTSRNLLYFDISHGYFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSI 55
           A+PV L+      F     + GKYLN+Y         YIPPGW +W AL+ NS+YYNYS+
Sbjct: 84  AFPVLLSKLGYQTF-----FAGKYLNQYGMKEAGGVDYIPPGWDQWHALVGNSQYYNYSL 138

Query: 56  NMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ 115
           ++NG++ KHG++Y  DY  DLI N S+ FL          P  L++S PAPH P  +APQ
Sbjct: 139 SVNGKEEKHGDNYSADYLTDLITNRSLNFLDTRS---PVHPFFLMLSPPAPHAPWTAAPQ 195

Query: 116 YANMFFNVTSHHTPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
           Y   F +V +    ++D  P  DK W+L Q    M      + D    KR QTL SVDD 
Sbjct: 196 YQKDFGDVKAPRDGSFD-KPGKDKHWLLRQPANPMTASSLAYLDNAYRKRWQTLVSVDDM 254

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           VE++VK+L  + ELN+TYI +TSD+GYH GQF L   K   +EFDI+VP ++RGPGI P
Sbjct: 255 VEQLVKKLDSINELNSTYIFFTSDNGYHTGQFSLPIDKRQLYEFDIKVPLMVRGPGIAP 313


>gi|313760582|ref|NP_001186488.1| N-acetylglucosamine-6-sulfatase precursor [Gallus gallus]
          Length = 538

 Score =  189 bits (481), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 105/220 (47%), Positives = 142/220 (64%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y        S++PPGW  W AL  NSKYYNY++++NG+  +HGE+Y  DY  
Sbjct: 130 FAGKYLNEYGAEDAGGVSHVPPGWSFWYALEKNSKYYNYTLSVNGKARRHGENYSVDYLT 189

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL + K +F  +P  +++S PAPH P  +APQY N F NV++     ++  
Sbjct: 190 DVLANMSLDFL-EYKSNF--EPFFMMISTPAPHSPWTAAPQYKNDFLNVSAPRNSNFNIH 246

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK+L+  GEL+NTYI
Sbjct: 247 -GKNKHWLIRQAKTPMTNSSIQFLDDAYRKRWQTLLSVDDLVEKLVKKLEIHGELDNTYI 305

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+G+H GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 306 FYTSDNGFHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 345


>gi|348504250|ref|XP_003439675.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Oreochromis
           niloticus]
          Length = 524

 Score =  189 bits (481), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 105/229 (45%), Positives = 139/229 (60%), Gaps = 11/229 (4%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         +IPPGW +W AL+ NS+YYNY++++NG++ KHG+ Y +DY  
Sbjct: 145 FSGKYLNQYGSKAAGSVGHIPPGWDQWHALVGNSQYYNYTLSVNGKEEKHGDSYEDDYLT 204

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           DLI N S+ FL        + P  L++S PA H P  +APQY   F +V +    +++  
Sbjct: 205 DLILNRSLHFLDDRS---PQHPFFLMLSPPACHSPWTAAPQYQKEFKDVQAPRDGSFN-K 260

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
           P  DK W+L Q    M      F D    KR QTL SVDD VEK+VK+L  + ELNNTYI
Sbjct: 261 PGKDKHWLLRQPLNPMPNSSVTFLDNAYRKRWQTLLSVDDMVEKLVKKLDSIKELNNTYI 320

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
            YTSD+GYH GQF L   K   +EFDIR+P ++RGPGI P  +    VL
Sbjct: 321 FYTSDNGYHTGQFSLPIDKRQLYEFDIRIPLMVRGPGIKPKQVLQAPVL 369


>gi|348521278|ref|XP_003448153.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Oreochromis
           niloticus]
          Length = 532

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 107/220 (48%), Positives = 135/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y  S      ++P GW  W  L  NSKYYNY++++NG+  KHG DY  DY  
Sbjct: 126 FAGKYLNQYGHSEAGGVEHVPLGWSYWVGLDKNSKYYNYTLSVNGKAQKHGADYSKDYLT 185

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           DL+AN S+ FL Q K S+  +P  +++S PAPH P  +APQY N F N  +   P ++  
Sbjct: 186 DLLANMSLDFL-QYKTSY--QPFFMMVSTPAPHSPWTAAPQYQNSFKNTKAPRDPNFNIH 242

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              DK W++ Q    M     QF D     R QTL SVDD VEKIVK L+  GELNNTYI
Sbjct: 243 -GKDKHWLIRQAKTPMSNSSIQFLDDAFRGRWQTLLSVDDLVEKIVKRLEVRGELNNTYI 301

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            +TSD+GYH GQF L   K   +EFDIRVP ++RGP I P
Sbjct: 302 FFTSDNGYHTGQFSLPVDKRQLYEFDIRVPLMVRGPNIKP 341


>gi|345324687|ref|XP_001507725.2| PREDICTED: N-acetylglucosamine-6-sulfatase, partial
           [Ornithorhynchus anatinus]
          Length = 488

 Score =  188 bits (477), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 103/220 (46%), Positives = 134/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++PPGW  W AL  NSKYYNY++++NG+  +HGE+Y  DY  
Sbjct: 82  FAGKYLNEYGAEGAGGLEHVPPGWSYWFALEKNSKYYNYTLSINGKARRHGENYSQDYLT 141

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 142 DVLANVSLDFLESKSNS---EPFFMMISTPAPHSPWIAAPQYEKSFQNVNAPRNSNFNIH 198

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR +TL SVDD VEK+VK L   GELNNTYI
Sbjct: 199 -GKNKHWLIRQAKTPMTNSSIQFLDDAFRKRWRTLLSVDDLVEKLVKRLDFRGELNNTYI 257

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 258 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 297


>gi|313230182|emb|CBY07886.1| unnamed protein product [Oikopleura dioica]
          Length = 526

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 94/227 (41%), Positives = 143/227 (62%), Gaps = 24/227 (10%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNG+Y+P GW +W AL  NS+YYNY+    G K +       +Y+ D+I  
Sbjct: 136 AFFGKYLNEYNGTYVPKGWSKWYALRGNSRYYNYNF---GNKPE-------EYFTDVITK 185

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFN--------VTSHHTPAY 131
           +++ +   +      +P+++V++FPAPHGPED+ P Y + +          +T+H    +
Sbjct: 186 NALEYFEANADK--PEPLLMVLNFPAPHGPEDAHPDYRHYYDEPSLEQQSILTAHRELVF 243

Query: 132 D---YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
           +   + P  +K W L   + M      FTDLL  +RLQTL SVD+ +E+I++ +++   +
Sbjct: 244 NRTSWQPG-EKHWFLTSQKDMTEGDGYFTDLLQRRRLQTLFSVDNGIERIIRLVEEKNAI 302

Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
            NTY +YTSDHGYHLGQFG+VKGK+ P++FD R+PF MRGP I  G+
Sbjct: 303 ENTYFLYTSDHGYHLGQFGVVKGKALPYDFDSRIPFYMRGPSIPAGS 349


>gi|432093890|gb|ELK25743.1| N-acetylglucosamine-6-sulfatase [Myotis davidii]
          Length = 476

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 105/220 (47%), Positives = 137/220 (62%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         Y+PPGW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 70  FAGKYLNEYGAPAAGGLEYVPPGWSYWVALEKNSKYYNYTLSVNGQARKHGENYSVDYLT 129

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 130 DVLANVSLDFL-DYKSNF--EPFFMMISTPAPHSPWIAAPQYQKTFQNVFAPRNKNFNIH 186

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
               K W++ Q    M     QF D    KR QTL SVDD VEK++K L+ +GEL+NTYI
Sbjct: 187 -GTSKHWLIRQAKTPMTNSSIQFLDSAFRKRWQTLLSVDDLVEKLIKRLEFIGELDNTYI 245

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 246 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 285


>gi|313218282|emb|CBY41541.1| unnamed protein product [Oikopleura dioica]
          Length = 509

 Score =  187 bits (475), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 94/227 (41%), Positives = 143/227 (62%), Gaps = 24/227 (10%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNG+Y+P GW +W AL  NS+YYNY+    G K +       +Y+ D+I  
Sbjct: 136 AFFGKYLNEYNGTYVPKGWSKWYALRGNSRYYNYNF---GNKPE-------EYFTDVITK 185

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFN--------VTSHHTPAY 131
           +++ +   +      +P+++V++FPAPHGPED+ P Y + +          +T+H    +
Sbjct: 186 NALEYFEANADK--PEPLLMVLNFPAPHGPEDAHPDYRHYYDEPSLEQQSILTAHRELVF 243

Query: 132 D---YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
           +   + P  +K W L   + M      FTDLL  +RLQTL SVD+ +E+I++ +++   +
Sbjct: 244 NRTSWQPG-EKHWFLTSQKDMTEGDGYFTDLLQRRRLQTLFSVDNGIERIIRLVEEKNAI 302

Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
            NTY +YTSDHGYHLGQFG+VKGK+ P++FD R+PF MRGP I  G+
Sbjct: 303 ENTYFLYTSDHGYHLGQFGVVKGKALPYDFDSRIPFYMRGPSIPAGS 349


>gi|166158272|ref|NP_001107314.1| glucosamine (N-acetyl)-6-sulfatase precursor [Xenopus (Silurana)
           tropicalis]
 gi|161612194|gb|AAI55715.1| LOC100135105 protein [Xenopus (Silurana) tropicalis]
          Length = 533

 Score =  187 bits (475), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 105/230 (45%), Positives = 142/230 (61%), Gaps = 13/230 (5%)

Query: 13  LYFDISHGYF--GKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKH 64
           L F I +  F  GKYLN+Y        +++PPGW  W AL  NSKYYNY+++ NGR   H
Sbjct: 119 LLFSIGYQTFFAGKYLNQYGSEEAGGINHVPPGWSYWFALEKNSKYYNYTLSENGRPKTH 178

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
           GE+Y  DY  D++AN S+ FL   K +++  P  ++++ PAPH P  +APQY   F NV+
Sbjct: 179 GENYSQDYLTDVLANVSLDFLNY-KSNYA--PFFMMIATPAPHSPWTAAPQYETAFLNVS 235

Query: 125 SHHTPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
           +     ++     DK W++ Q    M     Q+ D    KR QTL SVDD VEK+VK+L+
Sbjct: 236 APRGSNFN-VHGKDKHWLIRQAKSPMSNSSIQYLDNAFQKRWQTLLSVDDLVEKLVKQLE 294

Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
             GEL+NT+I +TSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 295 ARGELDNTFIFFTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 344


>gi|417515891|gb|JAA53750.1| N-acetylglucosamine-6-sulfatase precursor [Sus scrofa]
          Length = 552

 Score =  187 bits (474), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 105/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y        +++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLAHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY N F NV +     ++  
Sbjct: 206 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQNTFQNVFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361


>gi|335288215|ref|XP_003355555.1| PREDICTED: N-acetylglucosamine-6-sulfatase, partial [Sus scrofa]
          Length = 366

 Score =  187 bits (474), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 105/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y        +++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLAHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY N F NV +     ++  
Sbjct: 206 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQNTFQNVFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361


>gi|296487655|tpg|DAA29768.1| TPA: glucosamine (N-acetyl)-6-sulfatase precursor [Bos taurus]
          Length = 560

 Score =  186 bits (473), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 105/220 (47%), Positives = 135/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 154 FAGKYLNEYGAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 213

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY N F NV +     ++  
Sbjct: 214 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQNAFQNVFAPRNKNFNIH 270

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 271 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 329

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 330 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 369


>gi|115497422|ref|NP_001069030.1| N-acetylglucosamine-6-sulfatase precursor [Bos taurus]
 gi|122134311|sp|Q1LZH9.1|GNS_BOVIN RecName: Full=N-acetylglucosamine-6-sulfatase; AltName:
           Full=Glucosamine-6-sulfatase; Short=G6S; Flags:
           Precursor
 gi|94534741|gb|AAI15991.1| Glucosamine (N-acetyl)-6-sulfatase [Bos taurus]
          Length = 560

 Score =  186 bits (473), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 105/220 (47%), Positives = 135/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 154 FAGKYLNEYGAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 213

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY N F NV +     ++  
Sbjct: 214 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQNAFQNVFAPRNKNFNIH 270

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 271 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 329

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 330 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 369


>gi|167520137|ref|XP_001744408.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777494|gb|EDQ91111.1| predicted protein [Monosiga brevicollis MX1]
          Length = 486

 Score =  186 bits (472), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 97/254 (38%), Positives = 144/254 (56%), Gaps = 20/254 (7%)

Query: 21  YFGKYLNKYN--------------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGE 66
           Y GKYLN Y                + IPPGW  W  L  NSKYYNY+++ +G  + HG+
Sbjct: 107 YAGKYLNDYALPGSPNCTHETPEACAAIPPGWDRWLGLFGNSKYYNYTVSDDGVPVDHGD 166

Query: 67  DYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSH 126
           DY NDY+PDL+AN ++ F+ ++ +    +PI+ V ++P PHGP   AP  A  +    + 
Sbjct: 167 DYTNDYFPDLVANRTIGFIYEAARVAPDRPILAVNAWPTPHGPHPPAPWAAGRYAGYQAP 226

Query: 127 HTPAYDYAPN--PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
            TP+Y+ +      K + ++    + P   Q  D+   +R +TL SVD+ VE IV+ L D
Sbjct: 227 RTPSYNASSEYMATKNYFMRQLDMIWPEDAQSIDVNYQQRWETLLSVDNHVESIVQALAD 286

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL-- 242
            G LNNT+IIY SDHG+ LGQ  L+  K   +E DIR+PF++RGPG+ P +     VL  
Sbjct: 287 TGRLNNTFIIYASDHGFQLGQHRLLGDKRHQYENDIRIPFIVRGPGVPPNSTETRTVLNI 346

Query: 243 --IPQVRKFSSGSL 254
              P + +  +G++
Sbjct: 347 DIAPTLTELGTGAV 360


>gi|426224775|ref|XP_004006544.1| PREDICTED: N-acetylglucosamine-6-sulfatase [Ovis aries]
          Length = 552

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 135/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY N F N+ +     ++  
Sbjct: 206 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQNAFQNIFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361


>gi|47218297|emb|CAG04129.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 532

 Score =  185 bits (470), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 102/218 (46%), Positives = 134/218 (61%), Gaps = 11/218 (5%)

Query: 21  YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y  +      ++PPGW  W  L  NSKYYNY++++NG+  KHG DY  DY  
Sbjct: 136 FAGKYLNQYGHAEAGGMEHVPPGWNVWVGLEKNSKYYNYTLSVNGKAQKHGSDYNKDYLT 195

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL Q K SF  +P  +++S PAPH P  +APQY N F    +   P ++  
Sbjct: 196 DVLANMSLDFL-QYKSSF--QPFFMMVSTPAPHSPWTAAPQYQNSFNGTKAPRDPNFNVH 252

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              DK W++ Q    M     Q  D     R +TL SVDD VEK+VK L+  GEL+NTYI
Sbjct: 253 -GTDKHWLIRQAKTPMSNSSVQLLDDAFRSRWRTLLSVDDLVEKLVKRLEIRGELDNTYI 311

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           I+TSD+GYH GQF L   K   +EFDIRVP ++RGP I
Sbjct: 312 IFTSDNGYHTGQFSLPLDKRQLYEFDIRVPLMVRGPNI 349


>gi|291389495|ref|XP_002711358.1| PREDICTED: glucosamine (N-acetyl)-6-sulfatase [Oryctolagus
           cuniculus]
          Length = 554

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 148 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 207

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 208 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 264

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 265 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 323

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 324 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 363


>gi|62898700|dbj|BAD97204.1| glucosamine (N-acetyl)-6-sulfatase precursor variant [Homo sapiens]
          Length = 552

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVCAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361


>gi|1707906|sp|P50426.1|GNS_CAPHI RecName: Full=N-acetylglucosamine-6-sulfatase; AltName:
           Full=Glucosamine-6-sulfatase; Short=G6S; Flags:
           Precursor
 gi|841158|gb|AAB01663.1| N-acetylglucosamine 6-sulfatase precursor [Capra hircus]
 gi|1580855|prf||2116212A GlcNAc 6-sulfatase
          Length = 559

 Score =  184 bits (468), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 135/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 153 FAGKYLNEYGAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 212

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY N F NV +     ++  
Sbjct: 213 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQNAFQNVFAPRNKNFNIH 269

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    +R QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 270 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRERWQTLLSVDDLVEKLVKRLEFNGELNNTYI 328

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 329 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 368


>gi|313227555|emb|CBY22702.1| unnamed protein product [Oikopleura dioica]
          Length = 824

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 98/239 (41%), Positives = 141/239 (58%), Gaps = 17/239 (7%)

Query: 15  FDISHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRK------------I 62
           F      FGKYLN+Y+GSY+P GW  W  L+ NS++YNY + ++G              I
Sbjct: 136 FQYKTAMFGKYLNEYDGSYVPSGWDTWHGLVHNSRFYNYKVRVHGWSENNTDINADNSFI 195

Query: 63  KHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF-F 121
           +  +D   DY+ D+IAN S +++R   +     P +LV++FP PHGP D+AP++   F +
Sbjct: 196 ERHKDQPLDYFTDVIANRSSQWVRDQHRLDPDSPKLLVVNFPGPHGPADAAPRHTEAFKY 255

Query: 122 NVTSHHTPAYDY---APNPDKQWILQVTR-KMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
              S    +  +     N  + W L+ T+ K+    +QF   L  KRLQTL S+D+AV+K
Sbjct: 256 EDESQLKKSRAFNNVDDNEHRNWFLRATKEKLTSEEQQFGGQLRRKRLQTLLSLDEAVKK 315

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           +    KDLG L  TY+I+TSDHG+H+GQFG+VKGKS P+EFD+RVP  + G  I   T 
Sbjct: 316 VYFTFKDLGLLEKTYMIFTSDHGFHIGQFGMVKGKSTPYEFDVRVPMYVLGKDIKRTTQ 374


>gi|71834556|ref|NP_001025379.1| N-acetylglucosamine-6-sulfatase precursor [Danio rerio]
 gi|66910454|gb|AAH97128.1| Zgc:114066 [Danio rerio]
          Length = 538

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 97/220 (44%), Positives = 137/220 (62%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NGR  +HG++Y  DY  
Sbjct: 132 FAGKYLNEYGSKKAGGVEHVPLGWDHWFALERNSKYYNYTLSVNGRAQRHGQNYSEDYLT 191

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL       +++P  +++S PAPH P  +APQY + F ++ +   P ++  
Sbjct: 192 DVLANVSIDFLENKS---NRRPFFMMVSTPAPHSPWTAAPQYDSSFPDLKAPRDPNFNIH 248

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              DK W++ Q    M     +F D    KR +TL SVDD VEK+V++L   GEL+NTY+
Sbjct: 249 -GKDKHWLIRQAKTPMSNASVEFLDNAYRKRWRTLLSVDDLVEKLVRKLDIRGELSNTYV 307

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           I+TSD+GYH GQF L   K   +EFDIRVP L+RGP I P
Sbjct: 308 IFTSDNGYHTGQFSLPMDKRQLYEFDIRVPLLVRGPNIKP 347


>gi|410918631|ref|XP_003972788.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Takifugu rubripes]
          Length = 542

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 102/218 (46%), Positives = 134/218 (61%), Gaps = 11/218 (5%)

Query: 21  YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y  +      +IPPGW  W  L  NSKYYNY++++NG+  KHG DY  DY  
Sbjct: 136 FAGKYLNQYGHAQAGGVEHIPPGWSFWVGLEKNSKYYNYTLSVNGKAQKHGSDYSKDYLT 195

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL Q K S+  +P  +++S PAPH P  +APQY N F    +   P ++  
Sbjct: 196 DVLANMSLEFL-QYKSSY--QPFFMMVSTPAPHSPWTAAPQYQNSFNGTKAPRDPNFNVH 252

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              DK W++ Q    M     QF D    KR +TL SVDD VE +VK L+  GEL+NTYI
Sbjct: 253 -GKDKHWLIRQAKTPMSNSSVQFLDDAFRKRWRTLLSVDDLVETLVKRLEIRGELDNTYI 311

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
            +TSD+GYH GQF L   K   +EFDIRVP ++RGP I
Sbjct: 312 FFTSDNGYHTGQFSLPLDKRQLYEFDIRVPLMVRGPKI 349


>gi|432943754|ref|XP_004083269.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Oryzias latipes]
          Length = 539

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 137/220 (62%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y  +      ++PPGW  W AL  NSKYYNY++++NG+  KHG DY  DY  
Sbjct: 133 FAGKYLNQYGRADAGGVQHVPPGWSYWFALEKNSKYYNYTMSVNGKAQKHGADYSRDYLT 192

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++ N S+ FL Q K S+  +P  +++S PAPH P  +APQY + F    +   P ++  
Sbjct: 193 DVLTNVSLDFL-QYKSSY--QPFFMMVSTPAPHSPWTAAPQYQDRFNATKAPREPNFNIH 249

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEKIVK+L+  GEL+NTYI
Sbjct: 250 -GANKHWLIRQAKTPMTNSSIQFLDEAFRKRWQTLLSVDDLVEKIVKKLEVQGELDNTYI 308

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            +TSD+GYH GQF L   K   +EFDI+VP ++RGP I P
Sbjct: 309 FFTSDNGYHTGQFSLPLDKRQLYEFDIKVPLMVRGPNIKP 348


>gi|410965016|ref|XP_003989048.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 1 [Felis catus]
          Length = 553

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 147 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 206

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 207 DVLANISLGFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 263

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 264 -GTNKHWLIRQAKTPMTNSSIQFLDSAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 322

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 323 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 362


>gi|417402596|gb|JAA48140.1| Putative sulfatase [Desmodus rotundus]
          Length = 548

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 142 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSMDYLT 201

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 202 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQQTFQNVFAPRNKNFNIH 258

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 259 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 317

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 318 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 357


>gi|81158070|tpe|CAI85009.1| TPA: glucosamine (N-acetyl)-6-sulfatase [Canis lupus familiaris]
          Length = 488

 Score =  184 bits (466), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 82  FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 141

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 142 DVLANISLGFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 198

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 199 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 257

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 258 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 297


>gi|410965018|ref|XP_003989049.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 2 [Felis catus]
          Length = 533

 Score =  184 bits (466), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 127 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 186

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 187 DVLANISLGFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 243

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 244 -GTNKHWLIRQAKTPMTNSSIQFLDSAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 302

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 303 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 342


>gi|395852131|ref|XP_003798594.1| PREDICTED: N-acetylglucosamine-6-sulfatase [Otolemur garnettii]
          Length = 548

 Score =  184 bits (466), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 103/220 (46%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHG +Y  DY  
Sbjct: 142 FAGKYLNEYGAPAAGGLEHVPLGWSYWFALEKNSKYYNYTLSINGKARKHGANYSVDYLT 201

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 202 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 258

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+V+ L+  GELNNTYI
Sbjct: 259 -GTNKHWLIRQAKTPMTNSSIQFLDDAFRKRWQTLLSVDDLVEKLVRRLESTGELNNTYI 317

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           IYTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 318 IYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIRP 357


>gi|281341126|gb|EFB16710.1| hypothetical protein PANDA_008787 [Ailuropoda melanoleuca]
          Length = 527

 Score =  184 bits (466), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 206 DVLANISLGFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 263 -GANKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361


>gi|338726197|ref|XP_001492201.3| PREDICTED: LOW QUALITY PROTEIN: n-acetylglucosamine-6-sulfatase
           [Equus caballus]
          Length = 554

 Score =  184 bits (466), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 148 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 207

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 208 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 264

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 265 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 323

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 324 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 363


>gi|301769353|ref|XP_002920093.1| PREDICTED: n-acetylglucosamine-6-sulfatase-like [Ailuropoda
           melanoleuca]
          Length = 552

 Score =  184 bits (466), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 206 DVLANISLGFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 263 -GANKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361


>gi|359320559|ref|XP_531660.3| PREDICTED: N-acetylglucosamine-6-sulfatase [Canis lupus familiaris]
          Length = 469

 Score =  184 bits (466), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 63  FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 122

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 123 DVLANISLGFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 179

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 180 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 238

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 239 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 278


>gi|297692359|ref|XP_002823525.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 1 [Pongo abelii]
          Length = 584

 Score =  183 bits (465), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 238 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 294

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 353

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393


>gi|26350659|dbj|BAC38966.1| unnamed protein product [Mus musculus]
          Length = 544

 Score =  183 bits (465), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 134/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         +IP GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 138 FAGKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 197

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH 254

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     +F D    +R QTL SVDD VEK+VK L   GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           +YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 314 LYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353


>gi|119617545|gb|EAW97139.1| glucosamine (N-acetyl)-6-sulfatase (Sanfilippo disease IIID),
           isoform CRA_b [Homo sapiens]
          Length = 496

 Score =  183 bits (465), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 90  FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 149

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 150 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 206

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 207 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 265

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 266 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 305


>gi|31873233|emb|CAD97608.1| hypothetical protein [Homo sapiens]
          Length = 496

 Score =  183 bits (465), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 90  FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 149

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 150 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 206

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 207 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 265

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 266 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 305


>gi|148234488|ref|NP_001080261.1| glucosamine (N-acetyl)-6-sulfatase [Xenopus laevis]
 gi|27503350|gb|AAH42274.1| Gns-prov protein [Xenopus laevis]
          Length = 547

 Score =  183 bits (465), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 98/223 (43%), Positives = 137/223 (61%), Gaps = 11/223 (4%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y        +++PPGW  W AL  NSKYYNY+++ NGR   HGE+Y  DY  
Sbjct: 141 FAGKYLNQYGSEEAGGINHVPPGWSYWFALEKNSKYYNYTLSENGRPKTHGENYSQDYLT 200

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+++N S+ FL       + +P  ++++ PAPH P  +APQY N F NV++     ++  
Sbjct: 201 DVLSNVSLDFLNYKS---NHEPFFMMIATPAPHSPWTAAPQYENAFLNVSAPRGGNFN-V 256

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              DK W++ Q    M     Q+ D    KR QTL SVDD VE ++K+L+  GEL+NT+I
Sbjct: 257 HGKDKHWLIRQAKSPMSNSSIQYLDHAFRKRWQTLLSVDDLVENLLKQLEVRGELDNTFI 316

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
            +TSD+GYH GQF L   K   +EFDI+VP L+RGPGI P  +
Sbjct: 317 FFTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKPNQI 359


>gi|410046400|ref|XP_003952183.1| PREDICTED: N-acetylglucosamine-6-sulfatase [Pan troglodytes]
          Length = 584

 Score =  183 bits (465), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 238 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 294

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 353

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393


>gi|21321594|gb|AAM47270.1|AF510336_1 N-acetylglucosamine-6-sulfatase [Equus caballus]
          Length = 427

 Score =  183 bits (465), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 147 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 206

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 207 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 263

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 264 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 322

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 323 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 362


>gi|426373316|ref|XP_004053552.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 3 [Gorilla
           gorilla gorilla]
          Length = 583

 Score =  183 bits (465), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 238 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 294

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 353

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393


>gi|397508844|ref|XP_003824850.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 3 [Pan paniscus]
 gi|194385338|dbj|BAG65046.1| unnamed protein product [Homo sapiens]
          Length = 584

 Score =  183 bits (465), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 238 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 294

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 353

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393


>gi|332207367|ref|XP_003252768.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 3 [Nomascus
           leucogenys]
          Length = 584

 Score =  183 bits (465), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 238 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 294

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 353

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393


>gi|297692361|ref|XP_002823526.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 2 [Pongo abelii]
          Length = 552

 Score =  183 bits (465), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361


>gi|297262879|ref|XP_001116940.2| PREDICTED: n-acetylglucosamine-6-sulfatase isoform 1 [Macaca
           mulatta]
          Length = 584

 Score =  183 bits (465), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 238 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 294

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 353

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393


>gi|4504061|ref|NP_002067.1| N-acetylglucosamine-6-sulfatase precursor [Homo sapiens]
 gi|55638455|ref|XP_522454.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 5 [Pan
           troglodytes]
 gi|397508840|ref|XP_003824848.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 1 [Pan paniscus]
 gi|232126|sp|P15586.3|GNS_HUMAN RecName: Full=N-acetylglucosamine-6-sulfatase; AltName:
           Full=Glucosamine-6-sulfatase; Short=G6S; Flags:
           Precursor
 gi|31867|emb|CAA78164.1| N-acetylglucosamine-6-sulphatase [Homo sapiens]
 gi|15214696|gb|AAH12482.1| Glucosamine (N-acetyl)-6-sulfatase [Homo sapiens]
 gi|119617544|gb|EAW97138.1| glucosamine (N-acetyl)-6-sulfatase (Sanfilippo disease IIID),
           isoform CRA_a [Homo sapiens]
 gi|119617546|gb|EAW97140.1| glucosamine (N-acetyl)-6-sulfatase (Sanfilippo disease IIID),
           isoform CRA_a [Homo sapiens]
 gi|325463309|gb|ADZ15425.1| glucosamine (N-acetyl)-6-sulfatase [synthetic construct]
 gi|410264478|gb|JAA20205.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
 gi|410264480|gb|JAA20206.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
 gi|410264482|gb|JAA20207.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
 gi|410264484|gb|JAA20208.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
 gi|410264486|gb|JAA20209.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
 gi|410264488|gb|JAA20210.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
 gi|410307956|gb|JAA32578.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
 gi|410307958|gb|JAA32579.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
 gi|410335013|gb|JAA36453.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
 gi|410335015|gb|JAA36454.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
 gi|410335017|gb|JAA36455.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
 gi|410335019|gb|JAA36456.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
          Length = 552

 Score =  183 bits (465), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361


>gi|355564432|gb|EHH20932.1| hypothetical protein EGK_03884 [Macaca mulatta]
          Length = 584

 Score =  183 bits (465), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 238 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 294

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 353

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393


>gi|332207363|ref|XP_003252766.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 1 [Nomascus
           leucogenys]
          Length = 552

 Score =  183 bits (464), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361


>gi|426373312|ref|XP_004053550.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 1 [Gorilla
           gorilla gorilla]
          Length = 551

 Score =  183 bits (464), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361


>gi|194390666|dbj|BAG62092.1| unnamed protein product [Homo sapiens]
          Length = 469

 Score =  183 bits (464), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 63  FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 122

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 123 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 179

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 180 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 238

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 239 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 278


>gi|158256974|dbj|BAF84460.1| unnamed protein product [Homo sapiens]
          Length = 552

 Score =  183 bits (464), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361


>gi|380787195|gb|AFE65473.1| N-acetylglucosamine-6-sulfatase precursor [Macaca mulatta]
 gi|383411757|gb|AFH29092.1| N-acetylglucosamine-6-sulfatase precursor [Macaca mulatta]
 gi|384939736|gb|AFI33473.1| N-acetylglucosamine-6-sulfatase precursor [Macaca mulatta]
          Length = 552

 Score =  183 bits (464), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361


>gi|297692363|ref|XP_002823527.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 3 [Pongo abelii]
          Length = 532

 Score =  183 bits (464), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 126 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 185

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 186 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 242

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 243 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 301

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 302 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 341


>gi|332207365|ref|XP_003252767.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 2 [Nomascus
           leucogenys]
          Length = 532

 Score =  183 bits (464), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 126 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 185

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 186 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 242

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 243 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 301

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 302 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 341


>gi|395540906|ref|XP_003772391.1| PREDICTED: N-acetylglucosamine-6-sulfatase [Sarcophilus harrisii]
          Length = 556

 Score =  183 bits (464), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 101/218 (46%), Positives = 134/218 (61%), Gaps = 11/218 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++PPGW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 149 FAGKYLNEYGALDAGGVEHVPPGWSYWYALEKNSKYYNYTLSVNGKARKHGENYSLDYLT 208

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV++     ++  
Sbjct: 209 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWIAAPQYQKSFQNVSAPRNSNFNIH 265

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+++ L   GELNNT+I
Sbjct: 266 -GKNKHWLIRQAKTPMTNSSIQFLDEAFRKRWQTLLSVDDLVEKLIRRLDFHGELNNTFI 324

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI
Sbjct: 325 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGI 362


>gi|449481578|ref|XP_004176153.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylglucosamine-6-sulfatase
           [Taeniopygia guttata]
          Length = 533

 Score =  183 bits (464), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 101/220 (45%), Positives = 139/220 (63%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++PPGW  W AL  NSKYYNY++++NG+  +HGE+Y  DY  
Sbjct: 128 FAGKYLNEYGAEDAGGVGHVPPGWSFWYALEKNSKYYNYTLSVNGKARRHGENYSVDYLT 187

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL + K +F  +P  ++++ PAPH P   APQY   F NV++     ++  
Sbjct: 188 DVLANMSLDFL-EYKSNF--EPFFMMIATPAPHSPWTPAPQYTKSFQNVSAPRNSNFNIH 244

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD +EK+VK+L+  GEL+NTYI
Sbjct: 245 -GKNKHWLIRQAKTPMSNSSIQFLDDAYRKRWQTLLSVDDLIEKLVKKLEVNGELDNTYI 303

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+G+H GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 304 FYTSDNGFHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 343


>gi|397508842|ref|XP_003824849.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 2 [Pan paniscus]
 gi|410046398|ref|XP_003952182.1| PREDICTED: N-acetylglucosamine-6-sulfatase [Pan troglodytes]
 gi|194379548|dbj|BAG63740.1| unnamed protein product [Homo sapiens]
          Length = 532

 Score =  183 bits (464), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 126 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 185

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 186 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 242

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 243 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 301

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 302 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 341


>gi|297262881|ref|XP_002798713.1| PREDICTED: n-acetylglucosamine-6-sulfatase isoform 2 [Macaca
           mulatta]
          Length = 532

 Score =  183 bits (464), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 126 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 185

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 186 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 242

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 243 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 301

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 302 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 341


>gi|426373314|ref|XP_004053551.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 2 [Gorilla
           gorilla gorilla]
          Length = 531

 Score =  183 bits (464), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 126 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 185

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 186 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 242

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 243 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 301

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 302 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 341


>gi|148692469|gb|EDL24416.1| glucosamine (N-acetyl)-6-sulfatase, isoform CRA_a [Mus musculus]
          Length = 614

 Score =  182 bits (463), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 133/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         +IP GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 138 FAGKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 197

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH 254

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     +F D    +R QTL SVDD VEK+VK L   GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 314 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353


>gi|148692470|gb|EDL24417.1| glucosamine (N-acetyl)-6-sulfatase, isoform CRA_b [Mus musculus]
          Length = 574

 Score =  182 bits (463), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 133/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         +IP GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 168 FAGKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 227

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 228 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH 284

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     +F D    +R QTL SVDD VEK+VK L   GEL+NTYI
Sbjct: 285 -GTNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 343

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 344 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 383


>gi|74208040|dbj|BAE29132.1| unnamed protein product [Mus musculus]
          Length = 404

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 133/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         +IP GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 138 FAGKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 197

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH 254

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     +F D    +R QTL SVDD VEK+VK L   GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 314 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353


>gi|355786273|gb|EHH66456.1| hypothetical protein EGM_03453, partial [Macaca fascicularis]
          Length = 436

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361


>gi|29789239|ref|NP_083640.1| N-acetylglucosamine-6-sulfatase precursor [Mus musculus]
 gi|81873637|sp|Q8BFR4.1|GNS_MOUSE RecName: Full=N-acetylglucosamine-6-sulfatase; AltName:
           Full=Glucosamine-6-sulfatase; Short=G6S; Flags:
           Precursor
 gi|26326771|dbj|BAC27129.1| unnamed protein product [Mus musculus]
 gi|26339830|dbj|BAC33578.1| unnamed protein product [Mus musculus]
 gi|33243973|gb|AAH55328.1| Glucosamine (N-acetyl)-6-sulfatase [Mus musculus]
 gi|74138505|dbj|BAE38063.1| unnamed protein product [Mus musculus]
 gi|74143704|dbj|BAE41197.1| unnamed protein product [Mus musculus]
 gi|74218111|dbj|BAE42031.1| unnamed protein product [Mus musculus]
          Length = 544

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 133/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         +IP GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 138 FAGKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 197

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH 254

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     +F D    +R QTL SVDD VEK+VK L   GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 314 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353


>gi|26343951|dbj|BAC35632.1| unnamed protein product [Mus musculus]
          Length = 544

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 133/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         +IP GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 138 FAGKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 197

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH 254

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     +F D    +R QTL SVDD VEK+VK L   GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 314 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353


>gi|74198008|dbj|BAE35186.1| unnamed protein product [Mus musculus]
          Length = 544

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 102/218 (46%), Positives = 132/218 (60%), Gaps = 11/218 (5%)

Query: 23  GKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           GKYLN+Y         +IP GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  D+
Sbjct: 140 GKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLTDV 199

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           +AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++    
Sbjct: 200 LANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH-G 255

Query: 137 PDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
            +K W++ Q    M     +F D    +R QTL SVDD VEK+VK L   GEL+NTYI Y
Sbjct: 256 TNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYIFY 315

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           TSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 316 TSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353


>gi|431892015|gb|ELK02462.1| N-acetylglucosamine-6-sulfatase [Pteropus alecto]
          Length = 550

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 103/218 (47%), Positives = 133/218 (61%), Gaps = 11/218 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 206 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK+VK L+  GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGI 359


>gi|74181578|dbj|BAE30054.1| unnamed protein product [Mus musculus]
          Length = 451

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 133/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         +IP GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 45  FAGKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 104

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 105 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH 161

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     +F D    +R QTL SVDD VEK+VK L   GEL+NTYI
Sbjct: 162 -GTNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 220

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 221 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 260


>gi|225719510|gb|ACO15601.1| N-acetylglucosamine-6-sulfatase precursor [Caligus clemensi]
          Length = 475

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 100/233 (42%), Positives = 143/233 (61%), Gaps = 15/233 (6%)

Query: 20  GYFGKYLNKY---NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
            YFGKYLN+Y   +   IP GW  W  LI NSKYYNY++++NGR  +HG+ Y NDY  D+
Sbjct: 120 AYFGKYLNQYGRIDPLRIPKGWDYWAGLIGNSKYYNYTLSVNGRLERHGDIYDNDYLTDV 179

Query: 77  IANDSVRFLR---QSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
           I   S+ FL+   +S +   K P+++V++ P+PH P   APQY+  F    S   P+++Y
Sbjct: 180 IKRKSLDFLKDHFKSTEDQKKDPLLMVLATPSPHAPFTPAPQYSRNFSESKSPRIPSFNY 239

Query: 134 APNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
             +  K W+ Q     +P+        D +   R +TL SVDD V+ I++ L     L+N
Sbjct: 240 VHDK-KHWLAQ----KEPLSDYLIGDIDEVFRNRWRTLLSVDDLVDDILEFLHKKKVLDN 294

Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
           TY+I+TSDHGYHLGQ+GLV  K  P+E DIR+PF+M GPGI P +    D+++
Sbjct: 295 TYVIFTSDHGYHLGQYGLVLDKRMPYETDIRIPFVMYGPGI-PASSESNDIIL 346


>gi|403269060|ref|XP_003926575.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 3 [Saimiri
           boliviensis boliviensis]
          Length = 584

 Score =  181 bits (459), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F +V +     ++  
Sbjct: 238 DVLANVSLDFL-NYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQSVFAPRNKNFNIH 294

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK++K L+  GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYI 353

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393


>gi|354498717|ref|XP_003511460.1| PREDICTED: N-acetylglucosamine-6-sulfatase [Cricetulus griseus]
          Length = 522

 Score =  181 bits (459), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 101/220 (45%), Positives = 132/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+   HGE+Y  DY  
Sbjct: 116 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARTHGENYSVDYLT 175

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 176 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKEFQNVFAPRNKNFNIH 232

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    +R QTL SVDD VEK+VK L   GEL+NTYI
Sbjct: 233 -GTNKHWLIRQAKTPMTNSSIQFLDDAFRRRWQTLLSVDDLVEKLVKRLDGTGELDNTYI 291

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 292 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 331


>gi|392349421|ref|XP_003750373.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Rattus norvegicus]
 gi|81158028|tpe|CAI84988.1| TPA: glucosamine (N-acetyl)-6-sulfatase [Rattus norvegicus]
          Length = 544

 Score =  181 bits (458), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 100/220 (45%), Positives = 133/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  +HGE+Y  DY  
Sbjct: 138 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARRHGENYSVDYLT 197

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFPNVIAPRNKNFNIH 254

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     +F D    +R QTL SVDD VEK+VK L   GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIKFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 314 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353


>gi|403269056|ref|XP_003926573.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 1 [Saimiri
           boliviensis boliviensis]
          Length = 552

 Score =  181 bits (458), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F +V +     ++  
Sbjct: 206 DVLANVSLDFLNY-KSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQSVFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK++K L+  GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361


>gi|58865546|ref|NP_001011989.1| N-acetylglucosamine-6-sulfatase precursor [Rattus norvegicus]
 gi|56611129|gb|AAH87741.1| Glucosamine (N-acetyl)-6-sulfatase [Rattus norvegicus]
          Length = 519

 Score =  181 bits (458), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 100/220 (45%), Positives = 133/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  +HGE+Y  DY  
Sbjct: 138 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARRHGENYSVDYLT 197

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFPNVIAPRNKNFNIH 254

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     +F D    +R QTL SVDD VEK+VK L   GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIKFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 314 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353


>gi|348580785|ref|XP_003476159.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Cavia porcellus]
          Length = 553

 Score =  180 bits (457), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 137/220 (62%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 147 FAGKYLNEYGSLDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 206

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  +++S PAPH P  +APQY   F NV +  +  ++  
Sbjct: 207 DVLANVSLDFL-DYKSNF--EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRSKNFNIH 263

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     Q+ D    KR QTL SVDD VEK++K L+  GELNNTYI
Sbjct: 264 -GMNKHWLIRQAKTPMTNSSIQYLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYI 322

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            +TSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 323 FFTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 362


>gi|344255848|gb|EGW11952.1| N-acetylglucosamine-6-sulfatase [Cricetulus griseus]
          Length = 474

 Score =  180 bits (457), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 101/220 (45%), Positives = 132/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+   HGE+Y  DY  
Sbjct: 68  FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARTHGENYSVDYLT 127

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 128 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKEFQNVFAPRNKNFNIH 184

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    +R QTL SVDD VEK+VK L   GEL+NTYI
Sbjct: 185 -GTNKHWLIRQAKTPMTNSSIQFLDDAFRRRWQTLLSVDDLVEKLVKRLDGTGELDNTYI 243

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 244 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 283


>gi|403269058|ref|XP_003926574.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 532

 Score =  180 bits (457), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 126 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 185

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F +V +     ++  
Sbjct: 186 DVLANVSLDFLNY-KSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQSVFAPRNKNFNIH 242

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK++K L+  GELNNTYI
Sbjct: 243 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYI 301

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 302 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 341


>gi|296212248|ref|XP_002752750.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 1 [Callithrix
           jacchus]
          Length = 552

 Score =  180 bits (456), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F +V +     ++  
Sbjct: 206 DVLANVSLDFL-NYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQSVFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK++K L+  GELNNTYI
Sbjct: 263 -GMNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYI 321

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361


>gi|351696736|gb|EHA99654.1| N-acetylglucosamine-6-sulfatase [Heterocephalus glaber]
          Length = 553

 Score =  179 bits (455), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 147 FAGKYLNEYGSLDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 206

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 207 DVLANVSLDFL-DYKSNF--EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 263

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     Q+ D    KR QTL SVDD VEK++K L+  GELNNTYI
Sbjct: 264 -GMNKHWLIRQAKTPMTNSSIQYLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYI 322

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            +TSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 323 FFTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 362


>gi|296212250|ref|XP_002752751.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 2 [Callithrix
           jacchus]
          Length = 532

 Score =  179 bits (455), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 136/220 (61%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 126 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 185

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F +V +     ++  
Sbjct: 186 DVLANVSLDFLNY-KSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQSVFAPRNKNFNIH 242

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD VEK++K L+  GELNNTYI
Sbjct: 243 -GMNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYI 301

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 302 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 341


>gi|344267548|ref|XP_003405628.1| PREDICTED: N-acetylglucosamine-6-sulfatase [Loxodonta africana]
          Length = 555

 Score =  179 bits (455), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 132/220 (60%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 149 FAGKYLNEYGAPGAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 208

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 209 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKSFQNVFAPRNKNFNIH 265

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D     R QTL SVDD VEK++K L   GELNNTYI
Sbjct: 266 -GMNKHWLIRQAKTPMSNSSIQFLDNAFRGRWQTLLSVDDLVEKLIKRLDFNGELNNTYI 324

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 325 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 364


>gi|327273075|ref|XP_003221308.1| PREDICTED: n-acetylglucosamine-6-sulfatase-like [Anolis
           carolinensis]
          Length = 550

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 98/220 (44%), Positives = 138/220 (62%), Gaps = 11/220 (5%)

Query: 21  YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++PPGW  W AL  NSKYYNY++++NG+  +HG +Y  DY  
Sbjct: 145 FAGKYLNEYGAEDAGGIGHVPPGWSFWYALEKNSKYYNYTLSVNGKPRRHGGNYSVDYLT 204

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL + K +F  +P  ++++ PAPH P  +APQY   F N+++     ++  
Sbjct: 205 DVLANMSLDFL-EYKSNF--EPFFMMIATPAPHSPWVAAPQYKKNFANISAPRNSNFNIH 261

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              DK W++ Q    M     QF D    KR QTL SVDD +EK++K L+  GEL+NTYI
Sbjct: 262 -GKDKHWLIRQAKTPMTNSSIQFLDDAFRKRWQTLLSVDDLIEKLLKRLEFHGELDNTYI 320

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+G+H GQF L   K   +EFDI+VP L+RGPG+ P
Sbjct: 321 FYTSDNGFHTGQFSLPIDKRQLYEFDIKVPLLVRGPGVKP 360


>gi|443733552|gb|ELU17869.1| hypothetical protein CAPTEDRAFT_122995, partial [Capitella teleta]
          Length = 229

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 83/122 (68%), Positives = 100/122 (81%), Gaps = 4/122 (3%)

Query: 118 NMFFNVT----SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
           N F  +T       TP+Y++APNPDKQWILQ T KM+P+H +FTD+L  KRLQTLQSVDD
Sbjct: 5   NAFLIITLLYFPRRTPSYNFAPNPDKQWILQYTGKMEPIHMEFTDMLHRKRLQTLQSVDD 64

Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           AVEK+  ELK+LGEL+NTY IY++DHGYHLGQ+GLVKGK+ P++FDIRVPFLMRGP I P
Sbjct: 65  AVEKVFNELKELGELDNTYFIYSADHGYHLGQYGLVKGKAMPYDFDIRVPFLMRGPKIPP 124

Query: 234 GT 235
           G 
Sbjct: 125 GV 126


>gi|313224745|emb|CBY20536.1| unnamed protein product [Oikopleura dioica]
          Length = 503

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 94/231 (40%), Positives = 135/231 (58%), Gaps = 28/231 (12%)

Query: 21  YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN Y         ++P GW  W AL+ NS+YYNYSI+ NG++ KHG+++  DY  
Sbjct: 117 YAGKYLNMYGFKKAGGVEHVPKGWDSWNALVGNSRYYNYSISRNGKEEKHGDNFTLDYLT 176

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           +L+ANDSV F+   K      P  ++++ PAPH P DS P+YA+++ N+T         A
Sbjct: 177 NLVANDSVNFINAMK---PDSPFFMIVAPPAPHEPWDSDPKYADLYKNLT---------A 224

Query: 135 PNPDKQW----------ILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
           P  +KQW          +  V   M      ++D     R + L+SVDD VE I+  L +
Sbjct: 225 PRDNKQWNIHRADAHWVVRNVPNPMSQKSTDWSDNAFRSRWRCLKSVDDLVESIMNALNE 284

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
              L++TYI+++SDHGYH GQF L   K  P+EFD+R+P L+RGPG+ P +
Sbjct: 285 TDLLDDTYIMFSSDHGYHTGQFSLPYDKRQPYEFDLRIPLLIRGPGVKPNS 335


>gi|313241330|emb|CBY33605.1| unnamed protein product [Oikopleura dioica]
          Length = 503

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 94/231 (40%), Positives = 135/231 (58%), Gaps = 28/231 (12%)

Query: 21  YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN Y         ++P GW  W AL+ NS+YYNYSI+ NG++ KHG+++  DY  
Sbjct: 117 YAGKYLNMYGFKKAGGVEHVPKGWDSWNALVGNSRYYNYSISRNGKEEKHGDNFTLDYLT 176

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           +L+ANDSV F+   K      P  ++++ PAPH P DS P+YA+++ N+T         A
Sbjct: 177 NLVANDSVNFINAMK---PDAPFFMIVAPPAPHEPWDSDPKYADLYKNLT---------A 224

Query: 135 PNPDKQW----------ILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
           P  +KQW          +  V   M      ++D     R + L+SVDD VE I+  L +
Sbjct: 225 PRDNKQWNIHRADAHWVVRNVPNPMSQKSTDWSDNAFRSRWRCLKSVDDLVESIMNALNE 284

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
              L++TYI+++SDHGYH GQF L   K  P+EFD+R+P L+RGPG+ P +
Sbjct: 285 TDLLDDTYIMFSSDHGYHTGQFSLPYDKRQPYEFDLRIPLLIRGPGVKPNS 335


>gi|444721213|gb|ELW61958.1| N-acetylglucosamine-6-sulfatase [Tupaia chinensis]
          Length = 514

 Score =  177 bits (449), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 98/202 (48%), Positives = 128/202 (63%), Gaps = 5/202 (2%)

Query: 33  YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSF 92
           ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  D++AN S+ FL   K +F
Sbjct: 53  HVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLTDVLANVSLDFL-DYKSNF 111

Query: 93  SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWIL-QVTRKMQP 151
             +P  +++S PAPH P  +APQY   F NV +     ++     +K W++ Q    M  
Sbjct: 112 --EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH-GTNKHWLIRQAKTPMTN 168

Query: 152 VHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
              QF D    KR QTL SVDD VEK++K L+  GELNNTYI YTSD+GYH GQF L   
Sbjct: 169 SSIQFLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYIFYTSDNGYHTGQFSLPID 228

Query: 212 KSFPFEFDIRVPFLMRGPGIVP 233
           K   +EFDI+VP L+RGPGI P
Sbjct: 229 KRQLYEFDIKVPLLVRGPGIKP 250


>gi|260803665|ref|XP_002596710.1| hypothetical protein BRAFLDRAFT_219258 [Branchiostoma floridae]
 gi|229281969|gb|EEN52722.1| hypothetical protein BRAFLDRAFT_219258 [Branchiostoma floridae]
          Length = 513

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 95/221 (42%), Positives = 135/221 (61%), Gaps = 10/221 (4%)

Query: 21  YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         +IPPGW  W  LI NS+YY+Y++++NG K +HG+ Y  DY  
Sbjct: 124 FAGKYLNQYGFPSTGGPGHIPPGWDWWFGLIGNSRYYDYALSVNGTKQQHGKLYSMDYLT 183

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+I N S+ FL       S  P  +++S PA H P DSAPQY   F +  +    +Y+  
Sbjct: 184 DVIKNQSLHFL--DTVGASPNPFFMMLSTPAAHSPFDSAPQYEQDFLDWKAPRGGSYNVY 241

Query: 135 PNPDKQWILQVT-RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              DK W+L+     M+    Q+ D +  KR++TL SV+D V+ ++ +L+D G L NTY+
Sbjct: 242 -GKDKHWLLRFAPSPMKDTSVQYADSIFRKRMRTLLSVEDLVQAVMTKLEDQGLLYNTYV 300

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           I++SD+GYH GQF L + K   +EFDIRVP L+RGPGI  G
Sbjct: 301 IFSSDNGYHYGQFSLPEDKRQLYEFDIRVPLLIRGPGIKAG 341


>gi|327286464|ref|XP_003227950.1| PREDICTED: n-acetylglucosamine-6-sulfatase-like [Anolis
           carolinensis]
          Length = 506

 Score =  174 bits (441), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 101/229 (44%), Positives = 138/229 (60%), Gaps = 12/229 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN+Y         ++P GW  W  L+ NSKYYNY+++ NGR+ KHG+ Y  DY  
Sbjct: 133 YAGKYLNQYGDPKAGGVQHVPLGWNYWLGLVGNSKYYNYTLSSNGREEKHGDLYEEDYLT 192

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           DLI N S+ FL    Q  S+ P ++V++ PA H P  +AP Y + F  + +    +++  
Sbjct: 193 DLITNRSLAFL----QRLSRPPFLMVLAPPASHSPWTAAPHYKSAFSTIKAPRNGSFNVY 248

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              DK W++ Q    M     +F D +  +R QTL SVDD VEK++ +LK LG L++TY+
Sbjct: 249 -GKDKHWLIRQAKSPMLNTSIEFLDNVYRQRWQTLLSVDDMVEKVLLQLKSLGLLDSTYV 307

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
            YTSDHGYH GQF L   K   +EFDIRVP ++RGPGI P   YL  +L
Sbjct: 308 FYTSDHGYHTGQFSLPIDKRQLYEFDIRVPLMVRGPGIKPNQTYLEPIL 356


>gi|390333874|ref|XP_003723795.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Strongylocentrotus
           purpuratus]
          Length = 680

 Score =  174 bits (440), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 106/266 (39%), Positives = 151/266 (56%), Gaps = 48/266 (18%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y        ++IPPGW +W AL +NS YYNYS+++NG++  HG+DY+ DY  
Sbjct: 158 FAGKYLNQYGQKEVGGLAHIPPGWDDWIALKLNSVYYNYSLSVNGKEEVHGDDYHKDYLT 217

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           DLI N S+ FL+  KQ+ S  P  ++MS PA H P DSAPQY + F +  +   P+++  
Sbjct: 218 DLINNRSMEFLQ--KQTVSS-PFFMMMSTPACHSPFDSAPQYMSHFTDKKAPRGPSFN-- 272

Query: 135 PNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
                            VH        + R +TL SVDD V+ +V  L     ++NTYII
Sbjct: 273 -----------------VH--------SARWRTLLSVDDMVKNLVTLLTTKKMIDNTYII 307

Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL-----LDVLIPQVRKF 249
           ++SD+G+HLGQF L   K   +EFDIRVP ++RGPGI PGT        +D++ P + + 
Sbjct: 308 FSSDNGFHLGQFSLPNDKRQLYEFDIRVPLIIRGPGISPGTTSKQIAVNIDIM-PTIVEM 366

Query: 250 SSGSL------IFIMSFLINLNTMRA 269
           ++GS       I +M  LI  N   A
Sbjct: 367 ATGSAPSEVDGISLMPLLIARNESMA 392


>gi|323652548|gb|ADX98525.1| N-acetylglucosamine-6-sulfatase [Gallus gallus]
          Length = 263

 Score =  173 bits (438), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 74/125 (59%), Positives = 98/125 (78%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYIPPGWREW  L+ NS++YNY+I+ NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWVGLVKNSRFYNYTISRNGNKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           +S+ + R SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257

Query: 140 QWILQ 144
            WI+Q
Sbjct: 258 HWIMQ 262


>gi|157106230|ref|XP_001649229.1| sulfatase [Aedes aegypti]
 gi|108879923|gb|EAT44148.1| AAEL004460-PA [Aedes aegypti]
          Length = 495

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 92/225 (40%), Positives = 134/225 (59%), Gaps = 9/225 (4%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           Y GKYLN+Y    +PPGW +W  L  NSKYYNY++N NG+ +   E+Y  D    +++N 
Sbjct: 125 YAGKYLNEYYSKEVPPGWSDWHGLHGNSKYYNYTLNENGQIVSFTEEYLTD----VLSNR 180

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP-DK 139
           +V FL +++Q     P   +++ PAPH P  +A ++ + F +V +  T  Y+    P +K
Sbjct: 181 TVDFLSKAEQGV---PFFAMVAPPAPHAPYTAATRHEDTFADVNAPRTKNYNLPCGPLEK 237

Query: 140 QWILQVTRKMQPVHRQFT-DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
            W+L +     P     + D++  KR Q+L +VD+ VE IV  L+    L +TYIIYTSD
Sbjct: 238 HWLLTMPPSPLPADILASIDIIYRKRWQSLMAVDEMVESIVATLEKRNMLTDTYIIYTSD 297

Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
           +GYH+GQFGL   K  P+E DIRVP LM GP I P T+    V++
Sbjct: 298 NGYHMGQFGLPYDKRQPYETDIRVPLLMTGPKIPPKTLVSAPVVL 342


>gi|149066818|gb|EDM16551.1| rCG48973 [Rattus norvegicus]
          Length = 388

 Score =  169 bits (427), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 96/220 (43%), Positives = 130/220 (59%), Gaps = 18/220 (8%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  +HGE+Y  DY  
Sbjct: 138 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARRHGENYSVDYLT 197

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFPNVIAPRNKNFNIH 254

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     +F D    +R QTL SVDD VEK+VK L   GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIKFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            YTSD+GYH  +  L       +EFDI+VP L+RGPGI P
Sbjct: 314 FYTSDNGYHTDKRQL-------YEFDIKVPLLVRGPGIKP 346


>gi|195425576|ref|XP_002061073.1| GK10744 [Drosophila willistoni]
 gi|194157158|gb|EDW72059.1| GK10744 [Drosophila willistoni]
          Length = 505

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 91/234 (38%), Positives = 133/234 (56%), Gaps = 10/234 (4%)

Query: 2   AWPVELTSRNLLYFDISHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRK 61
           A PV L  R    F     Y GKYLN++ G ++PPGW E+  L  NS+YYNY++  NGR 
Sbjct: 118 ALPVLLAERGYRTF-----YAGKYLNEFRGEHVPPGWTEFHGLHGNSRYYNYTLLQNGRN 172

Query: 62  IKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF 121
           +     Y + Y  DLI++ +V F++Q  Q     P  ++++ PA H P   AP++   F 
Sbjct: 173 VS----YTDTYLTDLISDLAVDFIQQEVQWKRMNPFFVMLAPPAAHAPYTPAPRHEGTFA 228

Query: 122 NVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
              +  TP+++  P  DK W++  ++ +        D    KR ++L +VD+ V  +V  
Sbjct: 229 KWKALRTPSFN-VPTKDKHWLVGSSKVLPNETIATIDKYFQKRWESLLAVDEMVSTLVAL 287

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           L +   L+NTYI+YTSD+GYHLGQF     K  P+E DIRVP L+RGPGI  G+
Sbjct: 288 LNETQLLDNTYIVYTSDNGYHLGQFSQPFDKRQPYETDIRVPLLVRGPGIAKGS 341


>gi|326432134|gb|EGD77704.1| hypothetical protein PTSG_12796 [Salpingoeca sp. ATCC 50818]
          Length = 843

 Score =  164 bits (415), Expect = 4e-38,   Method: Composition-based stats.
 Identities = 86/217 (39%), Positives = 122/217 (56%), Gaps = 7/217 (3%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           Y GKYLN Y G   PPGW EW  L  NS+YYNY+I  N + +K G+ Y  DY  D++A  
Sbjct: 447 YGGKYLNLYYGPSNPPGWTEWFGLYGNSRYYNYTIKDNDKTVKFGDKYDTDYLTDVLARR 506

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD--YAPNPD 138
           +VRF+ +S  + + +P  L +  P+ H     APQY N    + +  TP Y+  YA   D
Sbjct: 507 AVRFINES--AAADEPFFLWLGTPSAHASFTPAPQYENTVTGIKAPRTPNYNKVYA---D 561

Query: 139 KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
           +   ++    M     + +D +  KRL+TL+SVDD V  I   L    +L+NT+  +T D
Sbjct: 562 RHRPVRDLSPMTDEQIRQSDEIYRKRLETLRSVDDLVGDIFDTLAATNQLSNTWFFFTGD 621

Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           HG+HLGQF +   K   ++ DIRVP+ +RGPG    T
Sbjct: 622 HGFHLGQFAMGFDKRQLYDTDIRVPYFVRGPGAAANT 658


>gi|443689097|gb|ELT91586.1| hypothetical protein CAPTEDRAFT_122955 [Capitella teleta]
          Length = 537

 Score =  164 bits (414), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 85/221 (38%), Positives = 132/221 (59%), Gaps = 9/221 (4%)

Query: 21  YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++PPGW +W  L+ NS YYNYS+++NG++ KHG+ Y +DY  
Sbjct: 122 FAGKYLNQYGYKPSGGVEHVPPGWDDWNGLVGNSVYYNYSLSVNGKEEKHGDKYPDDYLT 181

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+I+  + RFL    ++F   P  +++S PA H P   APQ+ + F NVT+    +++  
Sbjct: 182 DVISKKAERFLENEHETFGS-PFFMMLSTPACHAPFTPAPQHRDKFANVTAPRDGSFNKY 240

Query: 135 PNPDKQWILQVTRK-MQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
            + ++ W+++  +  M     +  D +   R +TL SVDD + K+   L+  G L +TY+
Sbjct: 241 -DANRHWLVRNAKHPMGNESMKLIDPVFRNRWRTLLSVDDLITKLFTTLETKGFLEDTYV 299

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            Y +D+GYHLGQ  L   K   +EFDIRVP ++RGPGI  G
Sbjct: 300 FYAADNGYHLGQLSLPYDKRQLYEFDIRVPLMVRGPGIKAG 340


>gi|350414629|ref|XP_003490372.1| PREDICTED: LOW QUALITY PROTEIN:
           N-acetylglucosamine-6-sulfatase-like [Bombus impatiens]
          Length = 598

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 89/226 (39%), Positives = 138/226 (61%), Gaps = 14/226 (6%)

Query: 21  YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN+Y        S++P GW  W  LI NSKYYNYS+++NG + K G +  NDY  
Sbjct: 119 YGGKYLNQYGEKIVGGSSHVPEGWDWWVGLIGNSKYYNYSLSINGTEKKFGSEP-NDYLT 177

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+I++ ++ F++    + + +P ++V++ PAPH P   A ++ + + +V +  TP ++  
Sbjct: 178 DVISDLAIDFIK--AHNSNSEPFLMVLAPPAPHAPFIPAMRHIDKYKDVIAKRTPNFNTQ 235

Query: 135 PNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
              DK W++Q  R   P+        D +  +R +TL +VD+ VE + + LK    LNNT
Sbjct: 236 TQMDKHWLVQ--RGPSPLPDNLLPKLDEIYRRRWETLLAVDELVENVYEALKVGNLLNNT 293

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY 237
           YII+TSD+GYH+GQF +   K  P+E DI+VP L+RGPGI P  ++
Sbjct: 294 YIIFTSDNGYHIGQFSMPIDKRQPYETDIKVPLLIRGPGITPSKIF 339


>gi|326434077|gb|EGD79647.1| hypothetical protein PTSG_10494 [Salpingoeca sp. ATCC 50818]
          Length = 513

 Score =  161 bits (408), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 87/227 (38%), Positives = 125/227 (55%), Gaps = 20/227 (8%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN+Y         ++P GW  W AL+ NS YY+Y I+ NG    HG    N Y P
Sbjct: 127 YTGKYLNQYGAPSAGGVQHVPRGWDNWQALVGNSVYYDYVISNNGTPESHG----NTYLP 182

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           +L+ + +  FLR  + +    P  +++  P+ H P + AP+YA++  N T+   P Y  A
Sbjct: 183 NLLLDKTSDFLRTLEGT--DDPFFIMVGTPSCHDPTEPAPEYAHLLPNATAPRPPNYGKA 240

Query: 135 PNPDKQWILQVTRKMQ-------PVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
              DK W +    ++        P+  ++ DL   +R  TL +VDD V  +   LK +G 
Sbjct: 241 FR-DKHWFVAKQGEVAGFGSGYAPIEEKYNDLQYRRRALTLMTVDDIVANVTATLKSMGR 299

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           LNNTY  Y SDHGYHLG F ++K K  P+E D+R+P ++RGP I PG
Sbjct: 300 LNNTYFFYASDHGYHLGNFAVLKDKRLPYETDLRIPAVVRGPHIKPG 346


>gi|328776879|ref|XP_623263.2| PREDICTED: n-acetylglucosamine-6-sulfatase-like isoform 2 [Apis
           mellifera]
          Length = 533

 Score =  161 bits (407), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 91/225 (40%), Positives = 134/225 (59%), Gaps = 15/225 (6%)

Query: 21  YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN+Y        ++IP GW  W  LI NSKYYNY +++NG + K G D  +DY  
Sbjct: 110 YAGKYLNQYGDKIVGGAAHIPIGWDWWAGLIGNSKYYNYILSINGTEKKFGNDS-SDYLT 168

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+I++ +  F++    +   +P ++V++ PAPH P   A ++ N + NV +  TP ++  
Sbjct: 169 DVISDMATNFIKTHNPN---QPFLMVLAPPAPHAPFIPAQRHINKYKNVKAKRTPNFNTQ 225

Query: 135 PNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
              DK W+  V R+  P+        D +   R +TL +VD+ V+ I + LK    LNNT
Sbjct: 226 TQMDKHWL--VKREPSPLPDNLLPKLDEIYRNRWETLLAVDELVKNIYQVLKLQSFLNNT 283

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           YII+TSD+GYH+GQF +   K  P+E DIRVP L+RGPGI+P  +
Sbjct: 284 YIIFTSDNGYHIGQFSMPIDKRQPYETDIRVPLLIRGPGIMPSKI 328


>gi|441647472|ref|XP_003274834.2| PREDICTED: extracellular sulfatase Sulf-1 [Nomascus leucogenys]
          Length = 1023

 Score =  160 bits (405), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 72/119 (60%), Positives = 93/119 (78%), Gaps = 6/119 (5%)

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
           TP+Y+YAPN DK WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GE
Sbjct: 400 TPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLMETGE 459

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           L NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 460 LENTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 512



 Score = 80.9 bits (198), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 34/49 (69%), Positives = 40/49 (81%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDY 68
            +FGKYLN+YNGSYIPPGWREW  LI NS++YNY++  NG K KHG DY
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDY 186


>gi|326917734|ref|XP_003205151.1| PREDICTED: extracellular sulfatase Sulf-1-like, partial [Meleagris
           gallopavo]
          Length = 622

 Score =  160 bits (404), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 71/119 (59%), Positives = 95/119 (79%), Gaps = 6/119 (5%)

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
           TP+Y+YAPN DK WI+Q T  M P+H +FT++L  KRLQTL SVDD++E++ + L ++GE
Sbjct: 1   TPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSMERLYQMLAEMGE 60

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           L NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 61  LENTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 113


>gi|340715199|ref|XP_003396106.1| PREDICTED: LOW QUALITY PROTEIN:
           n-acetylglucosamine-6-sulfatase-like [Bombus terrestris]
          Length = 497

 Score =  160 bits (404), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 89/225 (39%), Positives = 136/225 (60%), Gaps = 14/225 (6%)

Query: 21  YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN+Y        S++P GW  W  LI NSKYYNYS+++NG + K G +  NDY  
Sbjct: 119 YGGKYLNQYGEKIVGGSSHVPEGWDWWVGLIGNSKYYNYSLSINGTEKKFGSEP-NDYLT 177

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+I++ ++ F++    + + +P ++V++ PAPH P   A ++ + +  V +  TP ++  
Sbjct: 178 DVISDLAIDFIKA--HNPNSEPFLMVLAPPAPHAPFIPAMRHIDKYKYVIAKRTPNFNTQ 235

Query: 135 PNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
              DK W++Q  R   P+        D +  +R +TL +VD+ VE + + LK    LNNT
Sbjct: 236 TQMDKHWLVQ--RGPSPLPDNLLPKLDEIYRRRWETLLAVDELVENVYEALKTGNLLNNT 293

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           YII+TSD+GYH+GQF +   K  P+E DI+VP L+RGPGI P  +
Sbjct: 294 YIIFTSDNGYHIGQFSMPIDKRQPYETDIKVPLLIRGPGITPSKI 338


>gi|46249932|gb|AAH68565.1| SULF1 protein [Homo sapiens]
          Length = 644

 Score =  160 bits (404), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 72/119 (60%), Positives = 93/119 (78%), Gaps = 6/119 (5%)

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
           TP+Y+YAPN DK WI+Q T  M P+H +FT++L  KRLQTL SVDD+VE++   L + GE
Sbjct: 5   TPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGE 64

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           L NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+      ++PQ+
Sbjct: 65  LENTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 117


>gi|383853828|ref|XP_003702424.1| PREDICTED: LOW QUALITY PROTEIN:
           N-acetylglucosamine-6-sulfatase-like [Megachile
           rotundata]
          Length = 594

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 89/225 (39%), Positives = 134/225 (59%), Gaps = 14/225 (6%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN+Y        +++P GW  W  L+ NS+YYNYSI++NG + K G +   DY  
Sbjct: 112 YAGKYLNQYGHQKAGGPAHVPVGWDWWAGLVGNSRYYNYSISINGTRKKFGNEPA-DYLT 170

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+I N ++ F+    ++ + +P ++V++ PAPH P   A ++ + F +V +  TP ++  
Sbjct: 171 DVINNLAMEFI--GTRNANNQPFLMVLAPPAPHAPFTPATRHIDKFKDVKAKRTPNFNTL 228

Query: 135 PNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
              DK W++Q  +   P+        D +  +R +TL +VD+ V  I + LK    LNNT
Sbjct: 229 IQSDKHWLVQ--QGPSPLPNDLLPKLDDIYRRRWETLLAVDELVTNIYQALKFQNLLNNT 286

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           Y+I+TSD+GYH+GQF     K  P+E DIRVP LMRGPGIVP  +
Sbjct: 287 YMIFTSDNGYHIGQFSQPIDKRQPYETDIRVPLLMRGPGIVPSKI 331


>gi|380011321|ref|XP_003689757.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Apis florea]
          Length = 528

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 89/225 (39%), Positives = 132/225 (58%), Gaps = 14/225 (6%)

Query: 21  YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN+Y         +IP GW  W  LI NSKYYNY +++NG + K G D  +DY  
Sbjct: 110 YAGKYLNQYGDKIVGGAGHIPTGWDWWAGLIGNSKYYNYILSINGTEKKFGNDS-SDYLT 168

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+I++ +  F++    +   +P ++V++ PAPH P   A ++ N + NV +  TP ++  
Sbjct: 169 DVISDMATNFIKIYNPN--GQPFLMVLAPPAPHAPFTPAQRHINKYKNVKAKRTPNFNTQ 226

Query: 135 PNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
              DK W+  V R   P+        D +   R +TL +VD+ V+ + + LK    LNNT
Sbjct: 227 TQMDKHWL--VERGPSPLPDNLLPKLDEIYRNRWETLLAVDELVKNVYQVLKLQNLLNNT 284

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           YII+TSD+GYH+GQF +   K  P+E DIRVP L+RGPG++P  +
Sbjct: 285 YIIFTSDNGYHIGQFSMPIDKRQPYETDIRVPLLIRGPGVMPSKI 329


>gi|350595017|ref|XP_003484027.1| PREDICTED: extracellular sulfatase Sulf-2-like, partial [Sus
           scrofa]
          Length = 281

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 69/109 (63%), Positives = 90/109 (82%)

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
           TP+Y+YAPNPDK WI++ T  M+P+H +FT++L  KRLQTL SVDD++E I   L + GE
Sbjct: 1   TPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGE 60

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           L+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP +  G++
Sbjct: 61  LDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 109


>gi|195151379|ref|XP_002016625.1| GL10403 [Drosophila persimilis]
 gi|194110472|gb|EDW32515.1| GL10403 [Drosophila persimilis]
          Length = 499

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 83/219 (37%), Positives = 124/219 (56%), Gaps = 9/219 (4%)

Query: 18  SHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYY 73
           +HGY     GKYLN++ G+ +P GW E+  L  NS+YYNY++  N + +     Y + Y 
Sbjct: 115 NHGYRTFFAGKYLNQFRGAKVPRGWNEFYGLHGNSRYYNYTLRENSQNVS----YTSTYL 170

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
            DL+ + +V FLR + Q    +P   +++ PAPH P   AP++  +F N  +  TP+++ 
Sbjct: 171 TDLLRDRAVHFLRSATQGDVHRPFFAMVAPPAPHAPYTPAPRHRGVFSNTKALRTPSFN- 229

Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
            P  DK W++  +  +        D     R ++L +VD+ V  +V  L +   L NTYI
Sbjct: 230 NPTEDKHWLVGSSAHLSNETIATIDRYFQGRWESLLAVDELVAALVAVLNETKSLENTYI 289

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           IYTSD+GYH+GQF     K  P+E DI VP L+RGPGI 
Sbjct: 290 IYTSDNGYHMGQFAQPFDKRQPYETDIHVPLLIRGPGIA 328


>gi|198459803|ref|XP_002138742.1| GA24970, partial [Drosophila pseudoobscura pseudoobscura]
 gi|198136813|gb|EDY69300.1| GA24970, partial [Drosophila pseudoobscura pseudoobscura]
          Length = 450

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 83/219 (37%), Positives = 124/219 (56%), Gaps = 9/219 (4%)

Query: 18  SHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYY 73
           +HGY     GKYLN++ G+ +P GW E+  L  NS+YYNY++  N + +     Y + Y 
Sbjct: 114 NHGYRTFFAGKYLNQFRGAKVPRGWNEFYGLHGNSRYYNYTLRENSQNVS----YTSTYL 169

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
            DL+ + +V FLR + Q    +P   +++ PAPH P   AP++  +F N  +  TP+++ 
Sbjct: 170 TDLLRDRAVDFLRSATQGDVHRPFFAMVAPPAPHAPYTPAPRHRGVFSNTKALRTPSFN- 228

Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
            P  DK W++  +  +        D     R ++L +VD+ V  +V  L +   L NTYI
Sbjct: 229 NPTEDKHWLVGSSAHLSNETIATIDRYFQGRWESLLAVDELVAALVAVLNETKSLENTYI 288

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           IYTSD+GYH+GQF     K  P+E DI VP L+RGPGI 
Sbjct: 289 IYTSDNGYHMGQFAQPFDKRQPYETDIHVPLLIRGPGIA 327


>gi|163793311|ref|ZP_02187286.1| sulfatase [alpha proteobacterium BAL199]
 gi|159181113|gb|EDP65628.1| sulfatase [alpha proteobacterium BAL199]
          Length = 464

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 84/218 (38%), Positives = 130/218 (59%), Gaps = 8/218 (3%)

Query: 20  GYFGKYLNKYN---GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           G  GKY+N+Y     +Y+PPGW EW A      YYNY +N NGR + +G     DY  D+
Sbjct: 108 GLIGKYMNQYEPNEQTYVPPGWDEWYA--AGHDYYNYVLNENGRHVSYGSKP-EDYLTDV 164

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           IA  +V  +R++  +   KP    ++  APHGP  +AP++A++F +       ++D A  
Sbjct: 165 IAGHTVSAIRRAADT--GKPFFFFVTPYAPHGPATAAPRHADLFADTPYPRPASFDEADV 222

Query: 137 PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
            DK   ++    +      + + +  +RL++LQ++DD VE IV  L+ +G+L NTY+IYT
Sbjct: 223 SDKPAYIRDLPPLSADQIAWIEDMYRRRLRSLQAMDDMVESIVNTLEAVGQLENTYLIYT 282

Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           SD+G+H+G+     GK+  +E DIRVPF+MRGPG+  G
Sbjct: 283 SDNGWHMGEHRQQYGKTTAYEEDIRVPFVMRGPGVPKG 320


>gi|194757651|ref|XP_001961076.1| GF11188 [Drosophila ananassae]
 gi|190622374|gb|EDV37898.1| GF11188 [Drosophila ananassae]
          Length = 501

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 90/241 (37%), Positives = 131/241 (54%), Gaps = 14/241 (5%)

Query: 3   WPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMN 58
           W   L  R L       GY     GKYLN+Y+G   P GW  +  L  NS+YYNY++  N
Sbjct: 102 WRESLEPRTLAMHLQRRGYTTFFAGKYLNQYSGLETPVGWNSFYGLQGNSRYYNYTLREN 161

Query: 59  GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSK-----KPIMLVMSFPAPHGPEDSA 113
              + + E Y  D   DL    +V FL    QS +K     +P   +++ PA H P   A
Sbjct: 162 KNNVHYKEVYLTDKLRDL----TVDFLESVSQSANKSKEVERPFFAMITPPAAHAPFTPA 217

Query: 114 PQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
           P++  +F  + +  TP+++  P  DK W+++ +  +Q       D    KR +TL +VD+
Sbjct: 218 PRHDGVFNKIKALRTPSFN-KPVDDKHWLVRSSWPLQDKTIDTIDSYYRKRWETLLAVDE 276

Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            VEK+V +L   G L+NTY+I++SD+GYH+GQF     K  P+E DI +P L+RGPGI P
Sbjct: 277 LVEKVVTKLNQTGVLDNTYVIFSSDNGYHVGQFAQPFDKRQPYETDIHIPLLIRGPGIAP 336

Query: 234 G 234
           G
Sbjct: 337 G 337


>gi|195134107|ref|XP_002011479.1| GI14019 [Drosophila mojavensis]
 gi|193912102|gb|EDW10969.1| GI14019 [Drosophila mojavensis]
          Length = 493

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 84/224 (37%), Positives = 135/224 (60%), Gaps = 12/224 (5%)

Query: 19  HGYF----GKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           HGY     GKYLN++ G+ +PPGW ++  L  NS+YYNY+I  N R +     Y + Y  
Sbjct: 115 HGYHTFFAGKYLNQFKGAEVPPGWDKFYGLHGNSRYYNYTIRENRRNVS----YADIYLT 170

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++ + +++F+   ++   + P   ++S PA H P   AP++  +F +V +  TP+++ A
Sbjct: 171 DMLRDRTLKFIHNVRKG--QNPFFAMISPPAAHAPFTPAPRHKGVFSHVRALRTPSFN-A 227

Query: 135 PNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
           P+ DK W+++  +++        D    KR +TL +VD+ +  I+  LK    L+NTYII
Sbjct: 228 PSKDKHWLVRTAKRLPNSTVLTVDKYFQKRWETLLAVDEMMVAIITLLKKYQCLDNTYII 287

Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
           YTSD+GYHLGQF     K  P++ DI+VP L+RGP  VPG++ +
Sbjct: 288 YTSDNGYHLGQFAQPFDKRQPYQTDIKVPLLIRGPD-VPGSILM 330


>gi|405977217|gb|EKC41676.1| N-acetylglucosamine-6-sulfatase [Crassostrea gigas]
          Length = 1011

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 88/229 (38%), Positives = 128/229 (55%), Gaps = 10/229 (4%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++PPGW  W  L+ NSKYYNY +++NG    H +D   DY  
Sbjct: 126 FAGKYLNQYGTPMAGGVEHVPPGWDWWIGLVGNSKYYNYHLSVNGTLEVHKDDPKTDYLT 185

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+I   +  FL  S Q+    P  +++S PA HGP   AP+Y N F N T+    +Y+  
Sbjct: 186 DIINRRATSFL--SLQNSDSAPFFMMLSTPACHGPFTPAPKYQNNFPNKTAPRDGSYNKH 243

Query: 135 PNPDKQWILQVTRKMQPVHR-QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
            + DK W++Q      P    +  D +   R +TL SVDD +E IV  L+    L NTYI
Sbjct: 244 GH-DKHWLIQQAITPLPNDTVKSEDQVFRNRWRTLLSVDDMMENIVNILQTKNLLENTYI 302

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
           +++SD+G+HLGQF +   K   +EFD+RVP ++RGP +  G +    V+
Sbjct: 303 VFSSDNGFHLGQFSMPSDKRQLYEFDVRVPLMIRGPNVTAGQVVKEQVM 351


>gi|307174678|gb|EFN65061.1| N-acetylglucosamine-6-sulfatase [Camponotus floridanus]
          Length = 521

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 87/225 (38%), Positives = 131/225 (58%), Gaps = 15/225 (6%)

Query: 21  YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN+Y        +++P GW  W  LI NSKYYNYS+++NG + K+G +  +DY  
Sbjct: 132 YAGKYLNEYGNRKVGGAAHVPMGWDWWAGLIGNSKYYNYSLSINGTEAKYG-NSTSDYLT 190

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+I N +V F+         +P ++V++ PAPH P   A ++ + +    +  TP ++  
Sbjct: 191 DVINNLAVNFINGHSDD---QPFLMVLAPPAPHAPFTPADRHNDKYNGTKAKRTPNFNVP 247

Query: 135 PNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
            + DK W+  V +   P+        D +  KR +TL +VD+ V+ +   LK+   LN+T
Sbjct: 248 VHQDKHWL--VRKGPTPLPNSVLPKLDEIYRKRWETLLAVDELVKNVHDLLKERNLLNDT 305

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           Y IYTSD+GYH+GQF +   K  P+E DIRVP L+ GPGI P T+
Sbjct: 306 YFIYTSDNGYHIGQFSMPMDKRQPYETDIRVPLLISGPGINPSTI 350


>gi|195041317|ref|XP_001991230.1| GH12546 [Drosophila grimshawi]
 gi|193900988|gb|EDV99854.1| GH12546 [Drosophila grimshawi]
          Length = 501

 Score =  153 bits (387), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 87/246 (35%), Positives = 138/246 (56%), Gaps = 15/246 (6%)

Query: 18  SHGYF----GKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYY 73
           +HGY     GKYLN++ G+ +PPGW E+  L  NS+YYNY++  N + +     Y + Y 
Sbjct: 123 NHGYHTFFAGKYLNQFKGAEVPPGWDEFYGLHGNSRYYNYTMRENQQNVS----YTDTYL 178

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
            DL+ + +V F+R       +KP   ++S PA H P   AP++   F  V +  TP ++ 
Sbjct: 179 TDLLKDRAVTFIRNMASKQYQKPFFAMISPPAAHAPFTPAPRHKGFFSEVHALRTPNFN- 237

Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
           AP+ DK W++   + +     +  D     R +TL +VD+ ++ ++  L +   L+NTYI
Sbjct: 238 APDTDKHWLVGSAQNLSEATVRTIDEYFQSRWETLLAVDEMMQNLLTVLNETQYLDNTYI 297

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG-----TMYLLDVLIPQVRK 248
           +YTSD+GYHLGQF     K  P+E DI+VP L+ GP +  G      + LLD L P + +
Sbjct: 298 VYTSDNGYHLGQFAQPFDKRQPYETDIKVPLLISGPDVPAGGSTDIIVSLLD-LAPTILE 356

Query: 249 FSSGSL 254
           ++  S+
Sbjct: 357 WTGMSV 362


>gi|158298003|ref|XP_318108.3| AGAP004721-PA [Anopheles gambiae str. PEST]
 gi|157014599|gb|EAA13233.3| AGAP004721-PA [Anopheles gambiae str. PEST]
          Length = 494

 Score =  153 bits (386), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 89/243 (36%), Positives = 139/243 (57%), Gaps = 19/243 (7%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           + GKYLN+Y  + IPPGW +W  L  NS+YYN+++  NG+       Y N+Y+ D +   
Sbjct: 125 FAGKYLNEYFSASIPPGWSQWFGLHGNSRYYNFTVTENGKVAT----YTNEYFTDYLDTK 180

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP-DK 139
           +V FL+  +   + +    +++ PAPH P   A ++ ++F NV +  TP ++ + +P  K
Sbjct: 181 TVNFLKNLR---ADRHFFAMVTPPAPHAPFTPADRHKDLFPNVKAIQTPNFNISSSPLQK 237

Query: 140 QWILQVTRKMQPVHRQFTDLLMT---KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
            W+L +     P+  +F D + T   KR QTL +VD+ VE IV  L++   L  T+I YT
Sbjct: 238 HWLLTMEPSTLPL--KFLDEMDTIHRKRWQTLMAVDNMVENIVNILREKKLLEKTFIFYT 295

Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP-----GTMYLLDVLIPQVRKFSS 251
           SD+G+HLGQF     K  P+E DIRVP  +RGP I P       + LLD+ +P + + ++
Sbjct: 296 SDNGFHLGQFSQAYDKRQPYETDIRVPLFVRGPTIKPKAVQESAVALLDI-VPTILQLAN 354

Query: 252 GSL 254
             L
Sbjct: 355 LEL 357


>gi|229577317|ref|NP_001153351.1| glucosamine (N-acetyl)-6-sulfatase precursor [Nasonia vitripennis]
          Length = 501

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 85/219 (38%), Positives = 133/219 (60%), Gaps = 13/219 (5%)

Query: 21  YFGKYLNKY---NG-SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           Y GKYLNKY   NG S +PPGW +W  LI NS+YY+YS+++NG + K+G     DY  D+
Sbjct: 125 YAGKYLNKYGQRNGVSRVPPGWDKWYGLIGNSRYYDYSLSINGAEKKYGSSP-TDYLTDV 183

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           I   ++ FL+  K+    +P ++V++ PAPH P   AP++ + +    +  T  ++ +  
Sbjct: 184 IQGFAIDFLKTQKKD---QPFLMVLAPPAPHEPFIPAPRHNDKYKGTKAKRTKNFNTSSQ 240

Query: 137 PDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
            DK W+  V R   P+        D +  KR ++L +VD+ V+K+  +LK    L++TYI
Sbjct: 241 RDKHWL--VRRGPSPLPDSVVARLDDIYRKRWESLLAVDELVQKVYSQLKSQNLLDDTYI 298

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           I+TSD+G+H+GQF L   K  P+E DI++P ++RGP I 
Sbjct: 299 IFTSDNGFHVGQFSLPYDKRQPYETDIKIPLMIRGPNIT 337


>gi|195334065|ref|XP_002033705.1| GM20282 [Drosophila sechellia]
 gi|194125675|gb|EDW47718.1| GM20282 [Drosophila sechellia]
          Length = 492

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 91/236 (38%), Positives = 129/236 (54%), Gaps = 12/236 (5%)

Query: 3   WPVELTSRNLLYFDISHGY----FGKYLNKYNGSY-IPPGWREWGALIMNSKYYNYSINM 57
           W   L  R L Y    HGY     GKYLN+Y G+  +P GW  +  L  NS+YYNY++  
Sbjct: 100 WRRALEPRALPYILQQHGYNTFFGGKYLNQYWGAGDVPKGWNHFYGLHGNSRYYNYTLRE 159

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           N   +     Y + Y  DL+ + +  FLR + QS   +P   +++ PA H P   AP++ 
Sbjct: 160 NAGNVH----YESTYLTDLLRDRAADFLRNATQS--SEPFFAMVAPPAAHEPFTPAPRHE 213

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F +V +  TP+++     DK W+++  R++        D    KR +TL +VD+ V  
Sbjct: 214 GVFSHVEALRTPSFNQVKQ-DKHWLVRAARRLPNETINTIDKYFQKRWETLLAVDELVVT 272

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           +V  L D   L NTYIIYTSD+GYH+GQF     K  P+E DI VP L+RGPGI P
Sbjct: 273 LVGVLNDTESLENTYIIYTSDNGYHVGQFAQPFDKRQPYETDINVPLLIRGPGIAP 328


>gi|195403316|ref|XP_002060238.1| GJ19791 [Drosophila virilis]
 gi|194141891|gb|EDW58302.1| GJ19791 [Drosophila virilis]
          Length = 500

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 83/238 (34%), Positives = 134/238 (56%), Gaps = 9/238 (3%)

Query: 3   WPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMN 58
           W      R+L      HGY     GKYLN++ G+ +P GW E+  L  NS+YYNY++  N
Sbjct: 103 WRTMFEPRSLPMLLQKHGYQTFFAGKYLNQFKGAEVPHGWDEFYGLHGNSRYYNYTLREN 162

Query: 59  GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYAN 118
            + +     + + Y  DL+ + +  F+R S ++   KP   ++S PA H P   AP++  
Sbjct: 163 TKNVS----FTDIYLTDLLRDKAAEFIRNSTRNRHSKPFFAMVSPPAAHEPFTPAPRHEG 218

Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
           +F  V +  TP+++ AP+ +K W+++  + +        D    +R +TL +VD+ +  +
Sbjct: 219 VFSQVRALRTPSFN-APDKNKHWLVRSAQHLSNDSVFTIDKYFQRRWETLLAVDEMMVTL 277

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           V  L +   L+NTYII+TSD+GYHLGQF     K  P++ DI+VP L+RGPG+  G +
Sbjct: 278 VTVLNETQCLDNTYIIFTSDNGYHLGQFAQPFDKRQPYDTDIKVPLLIRGPGVPAGRL 335


>gi|402886724|ref|XP_003906772.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylglucosamine-6-sulfatase
           [Papio anubis]
          Length = 551

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 92/220 (41%), Positives = 127/220 (57%), Gaps = 12/220 (5%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL   K +F  +P  ++++ PAPH P  +APQY   F NV +     ++  
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              +K W++ Q    M     QF D    KR QTL SVDD   K++K+L     L ++  
Sbjct: 263 -GMNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDXXXKVLKQLP-YTXLQDSTP 320

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            Y SD GY+ GQF L   K   +EFDI+VP L+RGPGI P
Sbjct: 321 DYASDSGYYTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 360


>gi|297565681|ref|YP_003684653.1| sulfatase [Meiothermus silvanus DSM 9946]
 gi|296850130|gb|ADH63145.1| sulfatase [Meiothermus silvanus DSM 9946]
          Length = 501

 Score =  150 bits (380), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 133/230 (57%), Gaps = 21/230 (9%)

Query: 23  GKYLNKY----------------NGSYIPPGWREWGALIMNSKY--YNYSINMNGRKIKH 64
           GKYLN Y                + +Y+PPGW EW A + N+ Y  YNYS+N NG+ +++
Sbjct: 131 GKYLNGYGQDNPRRAQRQGLNLPDPTYVPPGWTEWYAGVGNAPYQDYNYSLNENGQLVRY 190

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
           G     DY  D+IA  +V+ +  + +    KP  L ++   PH P + AP++A++F +  
Sbjct: 191 GSSP-EDYLTDVIARKAVQAITNATRE--GKPFFLYLAPFTPHAPANFAPRHASLFKDAE 247

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
               P +D A   DK  +L+   ++          L  KRL++LQ++DD VE IV+ L+ 
Sbjct: 248 LPRPPNFDEADVSDKPPLLRRLPRLSERELARMRELYVKRLRSLQAIDDLVESIVQALRQ 307

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            G+L NTYI+YTSD+G+H+G   + +GK+ P+E DIRVP ++RGPG+  G
Sbjct: 308 NGQLANTYIVYTSDNGFHMGNHRMPQGKNMPYEEDIRVPLVVRGPGVPAG 357


>gi|195583054|ref|XP_002081339.1| GD25763 [Drosophila simulans]
 gi|194193348|gb|EDX06924.1| GD25763 [Drosophila simulans]
          Length = 1772

 Score =  150 bits (379), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 91/238 (38%), Positives = 130/238 (54%), Gaps = 12/238 (5%)

Query: 3   WPVELTSRNLLYFDISHGY----FGKYLNKYNGS-YIPPGWREWGALIMNSKYYNYSINM 57
           W   L  R L Y    HGY     GKYLN+Y G+  +P GW  +  L  NS+YYNY++  
Sbjct: 60  WRRALEPRALPYILQQHGYNTFFGGKYLNQYWGAGDVPKGWNHFYGLHGNSRYYNYTLRE 119

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           N   +     Y + Y  DL+ + +  FLR + QS   +P   +++ PA H P   AP++ 
Sbjct: 120 NTGNVH----YQSTYLTDLLRDRAADFLRNATQS--SEPFFAMVAPPAAHEPFTPAPRHE 173

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F +V +  TP+++     DK W+++  R++        D    KR +TL +VD+ V  
Sbjct: 174 GVFSHVEALRTPSFNQVKQ-DKHWLVRAARRLPKETINTIDKYFQKRWETLLAVDELVVT 232

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           +V  L D   L NTYIIYTSD+GYH+GQF     K  P+E DI VP L+RGPGI P +
Sbjct: 233 LVGVLNDTQSLENTYIIYTSDNGYHVGQFAQPFDKRQPYETDINVPLLIRGPGIAPES 290


>gi|24653364|ref|NP_725289.1| CG18278 [Drosophila melanogaster]
 gi|21627222|gb|AAG22274.2| CG18278 [Drosophila melanogaster]
 gi|60677693|gb|AAX33353.1| SD21168p [Drosophila melanogaster]
 gi|220952048|gb|ACL88567.1| CG18278-PA [synthetic construct]
          Length = 492

 Score =  150 bits (379), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 89/236 (37%), Positives = 129/236 (54%), Gaps = 12/236 (5%)

Query: 3   WPVELTSRNLLYFDISHGY----FGKYLNKYNGSY-IPPGWREWGALIMNSKYYNYSINM 57
           W   L  R L Y    HGY     GKYLN+Y G+  +P GW  +  L  NS+YYNY++  
Sbjct: 100 WRRALEPRALPYILQQHGYNTFFGGKYLNQYWGAGDVPKGWNHFYGLHGNSRYYNYTLRE 159

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           N   +     Y + Y  DL+ + +  FLR + QS   +P   +++ PA H P   AP++ 
Sbjct: 160 NSGNVH----YESTYLTDLLRDRAADFLRNATQS--SEPFFAMVAPPAAHEPFTPAPRHE 213

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F ++ +  TP+++     DK W+++  R++        D    KR +TL +VD+ V  
Sbjct: 214 GVFSHIEALRTPSFNQVKQ-DKHWLVRAARRLPNETINTIDTYFQKRWETLLAVDELVVT 272

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           ++  L D   L NTYIIYTSD+GYH+GQF     K  P+E DI VP L+RGPGI P
Sbjct: 273 LMGVLNDTQSLENTYIIYTSDNGYHVGQFAQPFDKRQPYETDINVPLLIRGPGIAP 328


>gi|321470213|gb|EFX81190.1| hypothetical protein DAPPUDRAFT_50373 [Daphnia pulex]
          Length = 506

 Score =  150 bits (379), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 81/219 (36%), Positives = 127/219 (57%), Gaps = 13/219 (5%)

Query: 21  YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN+Y  +      ++PPG+  W  L+ NS+YYNY+++++G+   HG+DY+ DY  
Sbjct: 123 YGGKYLNQYGKNETGGVEHVPPGYDRWVGLVGNSRYYNYTLSVDGQPRDHGDDYHLDYLT 182

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++   +   L       + +P+  V++ PA H P   APQ+A  F    +   P+++  
Sbjct: 183 DVLVFIATSALNHGS---NNRPLFAVIAPPACHSPFTPAPQFARKFAGRRAPRWPSFNRH 239

Query: 135 PNPDKQWILQVTRKMQPVHRQFTDL---LMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
           P  DK W+++      P+ ++  D    +  +R +TL SVD  + KIVK L   G+L+ T
Sbjct: 240 PGLDKHWLVRQA-PQGPLPKEIIDQVDHIFRQRWRTLLSVDQMIGKIVKLLGWKGQLDRT 298

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           +++ TSDHGYHLGQF L   K  P+E D+RVP L+  PG
Sbjct: 299 WLVVTSDHGYHLGQFTLPVDKRLPYETDVRVPLLVIAPG 337


>gi|28317151|gb|AAO39590.1| LD10990p, partial [Drosophila melanogaster]
          Length = 515

 Score =  150 bits (379), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 89/236 (37%), Positives = 129/236 (54%), Gaps = 12/236 (5%)

Query: 3   WPVELTSRNLLYFDISHGY----FGKYLNKYNGSY-IPPGWREWGALIMNSKYYNYSINM 57
           W   L  R L Y    HGY     GKYLN+Y G+  +P GW  +  L  NS+YYNY++  
Sbjct: 123 WRRALEPRALPYILQQHGYNTFFGGKYLNQYWGAGDVPKGWNNFYGLHGNSRYYNYTLRE 182

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           N   +     Y + Y  DL+ + +  FLR + QS   +P   +++ PA H P   AP++ 
Sbjct: 183 NTGNVH----YESTYLSDLLRDRAADFLRNATQS--SEPFFAMVAPPAAHEPFTPAPRHE 236

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F ++ +  TP+++     DK W+++  R++        D    KR +TL +VD+ V  
Sbjct: 237 GVFSHIEALRTPSFNQVKQ-DKHWLVRAARRLPNETINTIDTYFQKRWETLLAVDELVVT 295

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           ++  L D   L NTYIIYTSD+GYH+GQF     K  P+E DI VP L+RGPGI P
Sbjct: 296 LMGVLNDTQSLENTYIIYTSDNGYHVGQFAQPFDKRQPYETDINVPLLIRGPGIAP 351


>gi|19922168|ref|NP_610872.1| CG30059 [Drosophila melanogaster]
 gi|16769828|gb|AAL29133.1| SD03412p [Drosophila melanogaster]
 gi|21627223|gb|AAM68572.1| CG30059 [Drosophila melanogaster]
 gi|220946552|gb|ACL85819.1| CG30059-PA [synthetic construct]
          Length = 492

 Score =  150 bits (378), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 89/236 (37%), Positives = 129/236 (54%), Gaps = 12/236 (5%)

Query: 3   WPVELTSRNLLYFDISHGY----FGKYLNKYNGSY-IPPGWREWGALIMNSKYYNYSINM 57
           W   L  R L Y    HGY     GKYLN+Y G+  +P GW  +  L  NS+YYNY++  
Sbjct: 100 WRRALEPRALPYILQQHGYNTFFGGKYLNQYWGAGDVPKGWNNFYGLHGNSRYYNYTLRE 159

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           N   +     Y + Y  DL+ + +  FLR + QS   +P   +++ PA H P   AP++ 
Sbjct: 160 NTGNVH----YESTYLSDLLRDRAADFLRNATQS--SEPFFAMVAPPAAHEPFTPAPRHE 213

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F ++ +  TP+++     DK W+++  R++        D    KR +TL +VD+ V  
Sbjct: 214 GVFSHIEALRTPSFNQVKQ-DKHWLVRAARRLPNETINTIDTYFQKRWETLLAVDELVVT 272

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           ++  L D   L NTYIIYTSD+GYH+GQF     K  P+E DI VP L+RGPGI P
Sbjct: 273 LMGVLNDTQSLENTYIIYTSDNGYHVGQFAQPFDKRQPYETDINVPLLIRGPGIAP 328


>gi|195484992|ref|XP_002090907.1| GE13365 [Drosophila yakuba]
 gi|194177008|gb|EDW90619.1| GE13365 [Drosophila yakuba]
          Length = 492

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 88/236 (37%), Positives = 129/236 (54%), Gaps = 12/236 (5%)

Query: 3   WPVELTSRNLLYFDISHGY----FGKYLNKYNGSY-IPPGWREWGALIMNSKYYNYSINM 57
           W   L  R L +    HGY     GKYLN+Y G+  +P GW  +  L  NS+YYNY++  
Sbjct: 100 WWRSLEPRALPHILQQHGYNTFFGGKYLNQYWGAGDVPKGWNHFYGLHGNSRYYNYTLRE 159

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           N   +     Y + Y  DL+ + +  FLR + QS   +P   +++ PA H P   AP++ 
Sbjct: 160 NSGNVH----YESTYLTDLLKDRAADFLRNATQS--SEPFFAMVAPPAAHEPFTPAPRHE 213

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F ++ +  TP+++     DK W+++  R++        D    KR +TL +VD+ V  
Sbjct: 214 GVFSHIEALRTPSFNQVKQ-DKHWLVRAARRLPNETINTIDKYFQKRWETLLAVDELVVT 272

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           +V  L D   L NTYIIYTSD+GYH+GQF     K  P+E DI +P L+RGPGI P
Sbjct: 273 LVGVLNDTQSLENTYIIYTSDNGYHVGQFAQPFDKRQPYETDINIPLLIRGPGIAP 328


>gi|170028855|ref|XP_001842310.1| N-acetylglucosamine-6-sulfatase [Culex quinquefasciatus]
 gi|167877995|gb|EDS41378.1| N-acetylglucosamine-6-sulfatase [Culex quinquefasciatus]
          Length = 472

 Score =  147 bits (370), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 82/220 (37%), Positives = 130/220 (59%), Gaps = 12/220 (5%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           Y GKYLN+Y+   +P GW +W  L  NS+YYNY++N NG +I+    Y ++Y  DL+   
Sbjct: 79  YAGKYLNEYHSDRVPSGWTQWYGLHGNSRYYNYTLNENG-QIRF---YQDEYLTDLLKTR 134

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP-DK 139
           ++ F+  +    +K P   +++ PAPH P  +A ++  +F +V +  T  ++    P +K
Sbjct: 135 TLEFI--TSIDATKTPFFAMVAPPAPHAPFTAADRHQQLFPDVEALKTKNFNLPCGPLEK 192

Query: 140 QWILQVTRKMQPVHRQ---FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
            W+L  T    P+ ++   + D +   R ++L +VD+ VE ++  L+    LNNTYI YT
Sbjct: 193 HWLL--TMPPSPLSKKTLTYLDTIYRNRWRSLMAVDELVEAVISLLESKNLLNNTYIFYT 250

Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           SD+GYH+GQF L   K  P+E DIRVPFL+ GP I   ++
Sbjct: 251 SDNGYHIGQFSLPYDKRQPYETDIRVPFLVAGPKIASKSL 290


>gi|271968605|ref|YP_003342801.1| N-acetylglucosamine-6-sulfatase [Streptosporangium roseum DSM
           43021]
 gi|270511780|gb|ACZ90058.1| N-acetylglucosamine-6-sulfatase [Streptosporangium roseum DSM
           43021]
          Length = 513

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 81/223 (36%), Positives = 127/223 (56%), Gaps = 16/223 (7%)

Query: 20  GYFGKYLNKYNG-----SYIPPGWREWGALIMNS-KYYNYSINMNGRKIKHGEDYYNDYY 73
           G  GKYLN Y G     +Y+PPGW EW   +    + Y Y++N NG     G     DY 
Sbjct: 128 GLMGKYLNHYPGGSAEPTYVPPGWDEWDVPVRKLYEEYGYTLNENGVLTTRGS-APEDYL 186

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
            D+++  +  F+ QS       P  L ++  APH P + AP+YAN F +  +  TP++D 
Sbjct: 187 TDVLSQKAGAFVSQSPD-----PFFLYLAPTAPHNPANHAPRYANAFADAIAPRTPSFDQ 241

Query: 134 AP-NPDKQWILQVTR-KMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
           A  + + +W+    R   + + +   D    +RL+++  VDD V  ++K L+D G+L++T
Sbjct: 242 ADVSAEPRWLSSRPRFSARTIEK--IDERYRRRLRSMLGVDDMVGALIKTLRDTGKLDDT 299

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           YI + SD+G+HLGQ  L +GK+ PF+  I+VP ++RGPG+ PG
Sbjct: 300 YIFFASDNGFHLGQHRLAQGKTTPFDESIKVPLVVRGPGVRPG 342


>gi|296269863|ref|YP_003652495.1| sulfatase [Thermobispora bispora DSM 43833]
 gi|296092650|gb|ADG88602.1| sulfatase [Thermobispora bispora DSM 43833]
          Length = 493

 Score =  146 bits (369), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 84/222 (37%), Positives = 126/222 (56%), Gaps = 14/222 (6%)

Query: 20  GYFGKYLNKY----NGSYIPPGWREWGALIMNS-KYYNYSINMNGRKIKHGEDYYNDYYP 74
           G  GKYLN+Y    +  YIPPGW EW   +    + Y Y++N NGR  K+GE    DY  
Sbjct: 134 GLLGKYLNQYPKGADRRYIPPGWDEWAVPVTRLYQEYGYALNDNGRIRKYGE-APEDYLE 192

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+++  +  F+          P  L ++  APH P + AP++A+ F    +  TP++D A
Sbjct: 193 DVLSAKANAFVS------GGGPFFLYLAPVAPHLPANHAPRHASAFAGEIAPRTPSFDQA 246

Query: 135 P-NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
             + + +W+ +  R      R+  D +   RL+ +  +DD V  ++  L+  G L++TYI
Sbjct: 247 DLSAEPRWLRERPRLTDRDIRRI-DRVHRNRLRAMLGIDDMVGALIAALERAGRLDDTYI 305

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
            +TSD+G+HLGQ  L  GK+ PFE DIRVP ++RGPGI PGT
Sbjct: 306 FFTSDNGFHLGQHRLRPGKTTPFEEDIRVPLVVRGPGIAPGT 347


>gi|242014810|ref|XP_002428078.1| N-acetylglucosamine-6-sulfatase precursor, putative [Pediculus
           humanus corporis]
 gi|212512597|gb|EEB15340.1| N-acetylglucosamine-6-sulfatase precursor, putative [Pediculus
           humanus corporis]
          Length = 555

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 84/219 (38%), Positives = 131/219 (59%), Gaps = 11/219 (5%)

Query: 21  YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN+Y         ++P GW  W  L+ NS YYNY++++NG K++ G    +DY  
Sbjct: 157 YAGKYLNRYGYKSSGGVGHVPKGWDWWIGLVGNSAYYNYTLSVNGSKVRRGTQE-SDYLT 215

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+I   S++FL +  Q+F+ + +M +   P+PH P   A ++   +  + +  TP ++  
Sbjct: 216 DVIKEYSLKFLNEH-QTFNNEFLMFINP-PSPHEPFTPAKRHLGKYDKIKAPRTPNFNKL 273

Query: 135 PNPD-KQWILQVTRKMQPVHR-QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
            + + K W+L++     P +  +  D +   R +TL SVD+ VE I+ +LK L  LN TY
Sbjct: 274 TDKNGKHWLLRLPPNELPSNVIERIDTIYRNRWETLLSVDELVETIIMKLKKLNILNETY 333

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           II+TSD+G+H+GQF L   K  P+E DIRVP ++RGPGI
Sbjct: 334 IIFTSDNGFHMGQFELPWDKRQPYETDIRVPLMIRGPGI 372


>gi|270008450|gb|EFA04898.1| hypothetical protein TcasGA2_TC014962 [Tribolium castaneum]
          Length = 482

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 83/226 (36%), Positives = 132/226 (58%), Gaps = 14/226 (6%)

Query: 21  YFGKYLNKYNGS-----YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           Y GKYLN+Y  S     ++PPG+  W  L  NSKYYNY++++NG     G  +  DY  D
Sbjct: 116 YAGKYLNQYGKSGKGVKHVPPGYDWWLGLKGNSKYYNYTLSINGS----GHFFEKDYLTD 171

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
            I   ++ FL Q+ +        ++++ PA H P   A ++  +F ++ +  TP ++  P
Sbjct: 172 KITKYALDFLNQTDEG----NFFMMLAPPACHAPFTPADRHRRLFPDLETLKTPPFNATP 227

Query: 136 NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
           + DK WI+ +     P + Q  D +   R++TLQSVD+ V+ ++ +L+++  L+NTY I 
Sbjct: 228 S-DKHWIVAMPPMSLPQNVQILDEIYKNRIRTLQSVDEMVQALITKLQEIRVLDNTYFIV 286

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
           TSD+G+H+GQF     K  P+E DIRVPF++RGP I   T+  + V
Sbjct: 287 TSDNGFHIGQFTQPWDKRQPYESDIRVPFMIRGPNIRKKTVSEVSV 332


>gi|91085257|ref|XP_973432.1| PREDICTED: similar to sulfatase [Tribolium castaneum]
          Length = 475

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 83/226 (36%), Positives = 132/226 (58%), Gaps = 14/226 (6%)

Query: 21  YFGKYLNKYNGS-----YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           Y GKYLN+Y  S     ++PPG+  W  L  NSKYYNY++++NG     G  +  DY  D
Sbjct: 116 YAGKYLNQYGKSGKGVKHVPPGYDWWLGLKGNSKYYNYTLSINGS----GHFFEKDYLTD 171

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
            I   ++ FL Q+ +        ++++ PA H P   A ++  +F ++ +  TP ++  P
Sbjct: 172 KITKYALDFLNQTDEG----NFFMMLAPPACHAPFTPADRHRRLFPDLETLKTPPFNATP 227

Query: 136 NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
           + DK WI+ +     P + Q  D +   R++TLQSVD+ V+ ++ +L+++  L+NTY I 
Sbjct: 228 S-DKHWIVAMPPMSLPQNVQILDEIYKNRIRTLQSVDEMVQALITKLQEIRVLDNTYFIV 286

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
           TSD+G+H+GQF     K  P+E DIRVPF++RGP I   T+  + V
Sbjct: 287 TSDNGFHIGQFTQPWDKRQPYESDIRVPFMIRGPNIRKKTVSEVSV 332


>gi|167538032|ref|XP_001750682.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770816|gb|EDQ84495.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1428

 Score =  145 bits (367), Expect = 1e-32,   Method: Composition-based stats.
 Identities = 78/195 (40%), Positives = 107/195 (54%), Gaps = 10/195 (5%)

Query: 50  YYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
           YY+Y+I+ NG+  KHG+D   DY P ++   ++ FL     S    P  ++++ P+ H P
Sbjct: 753 YYHYTISNNGQAEKHGDDPEADYLPKVLQRKAMAFLENVTSSDDNAPFFMMVATPSCHDP 812

Query: 110 EDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVH--------RQFTDLLM 161
            + A +YA +  N T+  TP Y    + DK W     +     H          + DL  
Sbjct: 813 TEPAAEYAGLMVNATAPRTPNYGGFAS-DKHW-YAAQQNCAAGHCDGYTKQQEAYNDLQY 870

Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
            +R  TL SVDD V  I   L+    L+NTYI Y+SDHGYHLGQF L+K K  P+E DIR
Sbjct: 871 RRRALTLMSVDDIVGNITFYLESKRLLDNTYIFYSSDHGYHLGQFALMKDKRNPYESDIR 930

Query: 222 VPFLMRGPGIVPGTM 236
           +P L RGPGI PGT+
Sbjct: 931 IPGLARGPGIAPGTI 945


>gi|194883353|ref|XP_001975767.1| GG22495 [Drosophila erecta]
 gi|190658954|gb|EDV56167.1| GG22495 [Drosophila erecta]
          Length = 492

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 88/236 (37%), Positives = 127/236 (53%), Gaps = 12/236 (5%)

Query: 3   WPVELTSRNLLYFDISHGY----FGKYLNKYNGSY-IPPGWREWGALIMNSKYYNYSINM 57
           W   L  R L      HGY     GKYLN+Y G+  +P GW  +  L  NS+YYNY++  
Sbjct: 100 WRRALEPRALPQVLQQHGYNTFFGGKYLNQYWGAGDVPKGWNHFYGLHGNSRYYNYTLRE 159

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           N   +     Y + Y  DL+ + +  FLR + QS   +P   +++ PA H P   AP++ 
Sbjct: 160 NTGNVH----YESTYLTDLLRDRAADFLRNATQS--AEPFFAMVAPPAAHEPFTPAPRHE 213

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +F +V +  TP+++     DK W+++   ++        D    KR +TL +VD+ V  
Sbjct: 214 GVFSHVEALRTPSFNQVKQ-DKHWLVRAASRLPNETISTIDKYFQKRWETLLAVDELVVT 272

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           +V  L D   L NTYIIYTSD+GYH+GQF     K  P+E DI +P L+RGPGI P
Sbjct: 273 LVGVLNDTQSLENTYIIYTSDNGYHVGQFAQPFDKRQPYETDINIPLLIRGPGIAP 328


>gi|307206693|gb|EFN84648.1| N-acetylglucosamine-6-sulfatase [Harpegnathos saltator]
          Length = 521

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 134/224 (59%), Gaps = 12/224 (5%)

Query: 21  YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN+Y        +++P GW  W  L+ NSKYY+YS+++NG    +G    +DY  
Sbjct: 98  YAGKYLNEYGNNKVGGAAHVPVGWDWWAGLLGNSKYYDYSLSINGTGRLYGNGS-SDYLT 156

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+I+N +V F+  +K + + +P ++V++ PAPH P   A ++ + +    +  TP ++  
Sbjct: 157 DVISNLAVDFI--NKHTDNTQPFLMVLAPPAPHAPFTPADRHNHKYNGTKAKRTPNFNVP 214

Query: 135 PNPDKQWILQVTRKMQP--VHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
            + +K W+++      P  V  +  D+  ++R +TL +VD+ VE + + LK    L++TY
Sbjct: 215 VHENKHWLVRNGPTPLPDDVLPRLDDIY-SRRWETLLAVDELVETVHESLKRRNLLHDTY 273

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
            IYTSD+GYH+GQF +   K   +E DIRVP ++ GPGI P T+
Sbjct: 274 FIYTSDNGYHIGQFSMPMDKRQLYETDIRVPLMISGPGIAPATI 317


>gi|332023429|gb|EGI63672.1| N-acetylglucosamine-6-sulfatase [Acromyrmex echinatior]
          Length = 465

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 86/240 (35%), Positives = 135/240 (56%), Gaps = 36/240 (15%)

Query: 21  YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN+Y        +++P GW  W  LI NSKYYNYS+++NG + K+G D  NDY  
Sbjct: 82  YAGKYLNQYGKKETGGAAHVPLGWDWWAGLIGNSKYYNYSLSINGTERKYG-DSSNDYLT 140

Query: 75  DLIA---------NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS 125
           D+I          N +V F+   K+    +P +++++ PAPHGP   A ++ + +    +
Sbjct: 141 DVIVILYIYIYIYNLAVNFI---KEYSGDQPFLMMLAPPAPHGPFIPAVRHKDKYIGTKA 197

Query: 126 HHTPAYDYAPNPDKQWILQ---------VTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVE 176
             TP ++   N DK W+++         +  K+  ++R        +R +TL +VD+ ++
Sbjct: 198 KRTPNFNIPVNQDKHWLVRKGPIPLPDDILPKLDDIYR--------RRWETLLAVDELIK 249

Query: 177 KIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
            I   L++   L++TY IYTSD+GYH+GQF +   K  P+E DIRVP L+ GPGI   T+
Sbjct: 250 NIHDLLEERNLLDDTYFIYTSDNGYHVGQFSMPIDKRQPYETDIRVPLLISGPGIELSTV 309


>gi|323450161|gb|EGB06044.1| hypothetical protein AURANDRAFT_29931, partial [Aureococcus
           anophagefferens]
          Length = 410

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 80/222 (36%), Positives = 120/222 (54%), Gaps = 13/222 (5%)

Query: 21  YFGKYLNKY-NG-----SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN Y NG      ++PPGW +W  L+ NS +Y+Y +++NG +  HG     DY  
Sbjct: 137 YAGKYLNMYGNGKMGGPGHVPPGWDDWHGLLGNSVFYDYDLSVNGVREAHGT-APEDYLT 195

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           DL+A  +V FL  +       P   ++  PA H P +  P +       +   TPA++ A
Sbjct: 196 DLVARRAVNFLEAAA-----APWFAILGTPASHDPFEPRPDHFGALDGASRPRTPAWN-A 249

Query: 135 PNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
              DK  I++    ++     + D    +RL+ L SVD+A+   +  +   GE + TY I
Sbjct: 250 SAADKHAIVRRQGFLKAPVADYIDRTYRRRLECLLSVDEAMNATLAAVAARGEADATYTI 309

Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           +TSDHG+HLG+FGL   K   +EFD RVPF++ GPG+  G +
Sbjct: 310 FTSDHGWHLGEFGLNYDKRQLYEFDTRVPFVLAGPGVQEGVV 351


>gi|390337306|ref|XP_003724525.1| PREDICTED: extracellular sulfatase Sulf-2-like [Strongylocentrotus
           purpuratus]
          Length = 935

 Score =  141 bits (356), Expect = 3e-31,   Method: Composition-based stats.
 Identities = 64/107 (59%), Positives = 81/107 (75%)

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
           TPA++Y PN DK WI++   +M  + R FTD L  KRL TL SVDDA+EK+ +EL+ LGE
Sbjct: 211 TPAWNYGPNEDKHWIIRQASRMTKLERDFTDALQRKRLITLLSVDDAIEKLCRELEALGE 270

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           L+NTYI+ TSDHGYHLGQF ++KGKS  ++ DIRVPF +RGP I  G
Sbjct: 271 LDNTYILMTSDHGYHLGQFNMLKGKSNAYDTDIRVPFYIRGPDIPHG 317


>gi|322790248|gb|EFZ15247.1| hypothetical protein SINV_08807 [Solenopsis invicta]
          Length = 499

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 82/225 (36%), Positives = 126/225 (56%), Gaps = 15/225 (6%)

Query: 21  YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN+Y        +++P GW  W  LI NSKYY+YS+++NG + K+G +  +DY  
Sbjct: 118 YAGKYLNQYGNEKVGGAAHVPVGWDWWAGLIGNSKYYDYSLSINGTERKYG-NSSSDYLT 176

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+I N +V F+         +P +LV++ PAPH P   A ++   +    +  TP ++  
Sbjct: 177 DVINNLAVDFIEGHS---GDQPFLLVLAPPAPHAPFTPADRHDGKYNGTKAKRTPNFNVP 233

Query: 135 PNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
            + DK W+  V +   P+        D +  +R + L +VD+ V+ +   L+    L +T
Sbjct: 234 IHQDKHWL--VRKGPTPLPDDILPKLDEIYRRRWEALLAVDELVKNVHDLLEKRNLLKDT 291

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           Y IYTSD+GYH+GQF +   K  P+E DIRVP L+ GPGI   T+
Sbjct: 292 YFIYTSDNGYHIGQFSMPMDKRQPYETDIRVPLLISGPGIESSTI 336


>gi|403731455|ref|ZP_10949313.1| putative sulfatase [Gordonia rhizosphera NBRC 16068]
 gi|403202234|dbj|GAB93644.1| putative sulfatase [Gordonia rhizosphera NBRC 16068]
          Length = 531

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 84/220 (38%), Positives = 121/220 (55%), Gaps = 9/220 (4%)

Query: 20  GYFGKYLNKYNGS--YIPPGWREWGALIMNSKY--YNYSINMNGRKIKHGEDYYNDYYPD 75
           G  GKYLN    S  +IPPGW +W A +  S Y  Y Y++N NGR + +G    +DY  D
Sbjct: 161 GMAGKYLNGLENSPAHIPPGWDDWNAGVTLSLYTGYGYTLNENGRMVPYGI-RPDDYETD 219

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
           ++A  SV F+R S  + S KP     +  APH P  +AP++        +HH P +  A 
Sbjct: 220 VVARKSVDFIRAS--AHSGKPFFWYAASTAPHFPLPAAPRHLASSPWTPAHHLPNFQEAD 277

Query: 136 NPDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
             DK  W++              DL  T RL +L+++D+ VE IV EL   G+L+NTY++
Sbjct: 278 VSDKPSWLIDTAGVRAATIAATNDLDNTNRLGSLKALDEMVEAIVGELSRTGQLDNTYLV 337

Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +TSD+GY LG   L + K  P+E  IR+P ++ GPG+ P 
Sbjct: 338 FTSDNGYSLGAHRLTQ-KMVPYEESIRIPLVITGPGVRPA 376


>gi|357631807|gb|EHJ79275.1| sulfatase [Danaus plexippus]
          Length = 496

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 80/220 (36%), Positives = 123/220 (55%), Gaps = 23/220 (10%)

Query: 21  YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN+Y          +PPGW EW  L+ NS YYNY+I+ NG      + Y  D   
Sbjct: 118 YAGKYLNEYGVHASGGPEQVPPGWSEWHGLVGNSVYYNYTISNNGVPTFSTDLYLTDIIR 177

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           DL    S+ ++    +S   +P ++V++ PAPH P   A ++  ++ N T   TP ++ A
Sbjct: 178 DL----SLNYIENQTES---RPFLMVLAPPAPHQPATPAERHRGVYDNTTVLKTPNFNIA 230

Query: 135 PNPDKQWILQ-----VTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
            + +K W+++     +  K+ P      D +   R ++L +VD+ V  +V+ L   G L 
Sbjct: 231 DD-NKHWLIRMPPSPLPEKIMPE----LDRVYRSRWESLLAVDEMVADVVESLDSSGLLQ 285

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
           NTY+I+TSD+GYH+GQF  V  K  P+E D++VP L+RGP
Sbjct: 286 NTYLIFTSDNGYHIGQFSQVYDKRQPYEADVKVPLLIRGP 325


>gi|108804933|ref|YP_644870.1| sulfatase [Rubrobacter xylanophilus DSM 9941]
 gi|108766176|gb|ABG05058.1| sulfatase [Rubrobacter xylanophilus DSM 9941]
          Length = 484

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 80/218 (36%), Positives = 124/218 (56%), Gaps = 8/218 (3%)

Query: 21  YFGKYLNKYNGS--YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIA 78
           YFGKY+N Y  S   +PPGW EW A+  N  Y +   N NGR   +    YND   DLIA
Sbjct: 136 YFGKYMNGYGRSETRVPPGWDEWHAVAGN--YLSSWYNDNGRVRYYSPALYNDT--DLIA 191

Query: 79  NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
             +  +LR++  +    P  +V++  APH P    P+YA+ F        P++D     D
Sbjct: 192 EKATSYLRRT--AGRGAPFFVVLAPRAPHQPAVPPPRYADAFPEAPLPRGPSFDERDVSD 249

Query: 139 KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
           K   ++   ++     +F   L  +RL+++ +V+D VE++++ L++ G+L NTYI +TSD
Sbjct: 250 KPRWVRDNPRLGRKKLEFLGSLYRRRLRSMLAVEDLVERLLRTLRESGQLENTYIFFTSD 309

Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           +G+H+G   L +GK   +E DIRVP L+RGPG+  G +
Sbjct: 310 NGFHMGHHRLPEGKWTAYEEDIRVPLLVRGPGVPEGRV 347


>gi|108805892|ref|YP_645829.1| sulfatase [Rubrobacter xylanophilus DSM 9941]
 gi|108767135|gb|ABG06017.1| sulfatase [Rubrobacter xylanophilus DSM 9941]
          Length = 492

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 77/219 (35%), Positives = 126/219 (57%), Gaps = 8/219 (3%)

Query: 20  GYFGKYLNKY-NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIA 78
           G+ GKYLN Y   +++PPGW  W        Y+++++N NGR + +     + Y  D++ 
Sbjct: 142 GFVGKYLNGYLRTTHVPPGWDRWYGF-NGGGYHDFTLNENGRNVSYRGP--SSYQTDVLG 198

Query: 79  NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
             ++ F+R + +    +P  L +S  APHGP + AP++A +F        P++D     D
Sbjct: 199 RKALGFVRWAARR--DRPFFLHLSPWAPHGPAEPAPRHARLFARTPLPRPPSFDERDVSD 256

Query: 139 K-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
           K +W+    R  +   R+   L    RL+TL++VD+ V ++V  L++ G+L NTYI +TS
Sbjct: 257 KPRWVRDNPRLGREEVREMGRLYRN-RLRTLRAVDELVGRLVAALRESGQLENTYIFFTS 315

Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           D+G+H+G   L +GK   +E DIRVP L+RGPG+  G +
Sbjct: 316 DNGFHMGHHRLPEGKWTAYEEDIRVPLLVRGPGVPEGRV 354


>gi|345314989|ref|XP_003429575.1| PREDICTED: extracellular sulfatase Sulf-2-like, partial
           [Ornithorhynchus anatinus]
          Length = 107

 Score =  136 bits (343), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 60/107 (56%), Positives = 84/107 (78%)

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
           DY  DLI NDSV F R SK+++  +P+++V+S  APHGPEDSAPQY+ +F N + H TP+
Sbjct: 1   DYLTDLITNDSVSFFRTSKRTYPHRPVLMVISHAAPHGPEDSAPQYSQLFPNASQHITPS 60

Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
           Y+YAP+PDK WI++ T  M+P+H +FT++L  KRLQTL SVDD+++K
Sbjct: 61  YNYAPSPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMDK 107


>gi|434398901|ref|YP_007132905.1| N-acetylglucosamine-6-sulfatase [Stanieria cyanosphaera PCC 7437]
 gi|428269998|gb|AFZ35939.1| N-acetylglucosamine-6-sulfatase [Stanieria cyanosphaera PCC 7437]
          Length = 508

 Score =  136 bits (342), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 78/226 (34%), Positives = 126/226 (55%), Gaps = 15/226 (6%)

Query: 20  GYFGKYLNKY----NGSYIPPGWREWGALIMNSKY--YNYSINMNGRKIKHG---EDYYN 70
            + GKYLN Y      +Y+PPGW EW + +  S Y  Y+Y++N NG+ + +G   +DY N
Sbjct: 135 AFIGKYLNLYPYTSKMNYVPPGWDEWDSPVHGSAYVQYDYTVNENGKLVHYGNQPQDYGN 194

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
           D Y     N + +FL Q  ++   +P  L +S+ APH P   AP++ N+F +       A
Sbjct: 195 DVY----TNKAKQFLTQVAKT--NQPFFLFLSYYAPHQPATPAPRHQNLFPDAQVPRRQA 248

Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
           ++     DK   L+    +    ++  D L  KRL++LQ+VD+ +  +   L+   +L+N
Sbjct: 249 FNEIDVSDKPQYLRELPLLNQEQQEKIDYLHQKRLRSLQAVDEGLATLYNTLQATNQLDN 308

Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           TYI ++SD+G+HLGQ  L+ GK   +E DI +P L+R P +  G +
Sbjct: 309 TYIFFSSDNGFHLGQHRLLPGKETAYEEDIHLPLLVRSPHVPAGKV 354


>gi|345013357|ref|YP_004815711.1| sulfatase [Streptomyces violaceusniger Tu 4113]
 gi|344039706|gb|AEM85431.1| sulfatase [Streptomyces violaceusniger Tu 4113]
          Length = 506

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 75/226 (33%), Positives = 126/226 (55%), Gaps = 18/226 (7%)

Query: 20  GYFGKYLNKY-----NGS---YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYND 71
           G+ GKYLN Y     NG+   Y+P GW EW       K Y Y +N NG+ + +G+    D
Sbjct: 143 GFMGKYLNGYQPADRNGTSKPYVPSGWDEWDVAGDGYKEYKYDLNENGKVVPYGK-APQD 201

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
           Y  D+++  +  F+  S    +KKP ML ++  APHGP   AP+  + F  + +  T AY
Sbjct: 202 YLTDVLSKKATSFIGSSAD--AKKPFMLEVATFAPHGPSTPAPRDKDKFPGLKAPRTDAY 259

Query: 132 DYAPNPDKQWILQVTRKMQPV---HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
           D A     +W     R + P+    ++ TD    KR++++Q+VD  + ++ K L++ G  
Sbjct: 260 DKATRNAPKW----QRALSPLTAKEKEETDQKFAKRVRSVQAVDAMIGQLEKTLEEKGLA 315

Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           ++TY+++ SD+G+H+G+  L  GK   ++ D++VP ++ GPG+  G
Sbjct: 316 DDTYLVFGSDNGFHMGEHRLRPGKQTAYDTDVKVPMMVTGPGVPAG 361


>gi|326917732|ref|XP_003205150.1| PREDICTED: extracellular sulfatase Sulf-1-like, partial [Meleagris
           gallopavo]
          Length = 242

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 58/105 (55%), Positives = 79/105 (75%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGKYLN+YNGSYI   WREW  L+ NS++YNY+I+ NG K KHG DY  DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIXXXWREWVGLVKNSRFYNYTISRNGNKEKHGFDYAKDYFTDLITN 197

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
           +S+ + R SK+ +  +PIM+V+S  APHGPEDSAPQ++ ++ N +
Sbjct: 198 ESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSELYPNAS 242


>gi|302548207|ref|ZP_07300549.1| arylsulfatase [Streptomyces hygroscopicus ATCC 53653]
 gi|302465825|gb|EFL28918.1| arylsulfatase [Streptomyces himastatinicus ATCC 53653]
          Length = 507

 Score =  133 bits (334), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 72/223 (32%), Positives = 124/223 (55%), Gaps = 12/223 (5%)

Query: 20  GYFGKYLNKY-----NGS---YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYND 71
           G+ GKYLN Y     NG+   Y+P GW EW       + ++Y +N NG+ + +G+D   D
Sbjct: 144 GFMGKYLNGYQPGDKNGTSRPYVPSGWDEWNVAGDGYQEFDYDLNENGKVVPYGDD-PED 202

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
           Y  D+++  +  F+  S +  +KKP ML ++  APHGP   AP+  + F  + +  T AY
Sbjct: 203 YLTDVLSRKATSFVDSSAK--AKKPFMLEVATFAPHGPSTPAPRDEDTFPGLKAPRTDAY 260

Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
           D A     +W  Q    +    ++  D    +R++++Q+VD  + ++   L+  G  ++T
Sbjct: 261 DEATRNAPEW-QQALSPLTSKEKKRIDRKFAQRVRSVQAVDKMIGRLKSTLEAKGLADDT 319

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           YI++ SD+G+H+G+  L  GK   ++ D+RVP ++ GPG+  G
Sbjct: 320 YIMFGSDNGFHMGEHRLRPGKQTAYDTDVRVPMMVTGPGVPSG 362


>gi|167536101|ref|XP_001749723.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771871|gb|EDQ85532.1| predicted protein [Monosiga brevicollis MX1]
          Length = 493

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 113/222 (50%), Gaps = 48/222 (21%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN Y        ++IP GW+EW AL+ NSKYY+YS++ NGR+   G+DY  DYY 
Sbjct: 130 YSGKYLNTYGAPSVGGVAHIPDGWKEWHALVGNSKYYDYSVSDNGRQRNFGQDYATDYYT 189

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++ N+SV +L  +       P  ++++ P PH        YAN+F ++T+  TP ++ A
Sbjct: 190 DVLKNESVAWL--THNWTQDDPFFMMVATPCPH--------YANLFPDMTAPRTPNWNKA 239

Query: 135 PNPD-----KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
           P        K W+++    M+P H   +D     R + L                     
Sbjct: 240 PAASSGPGAKHWLMRQFGPMEPQHIINSDSTFVDRWRCLH-------------------- 279

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
                  SDHG HLGQFG+   K   +E DIRVP ++RGPGI
Sbjct: 280 -------SDHGQHLGQFGMEFDKRQLYETDIRVPLVVRGPGI 314


>gi|134101323|ref|YP_001106984.1| sulfatase [Saccharopolyspora erythraea NRRL 2338]
 gi|291008139|ref|ZP_06566112.1| sulfatase [Saccharopolyspora erythraea NRRL 2338]
 gi|133913946|emb|CAM04059.1| sulfatase [Saccharopolyspora erythraea NRRL 2338]
          Length = 504

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 69/223 (30%), Positives = 120/223 (53%), Gaps = 11/223 (4%)

Query: 21  YFGKYLNKY------NGS--YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDY 72
           + GKY+N Y      +G+  Y+PPGW  W       K Y+Y +N NG+ + HG   + DY
Sbjct: 143 FMGKYMNGYRPDGTVDGTPNYVPPGWNTWAVAGDGYKQYDYQLNENGQVVDHGHAPH-DY 201

Query: 73  YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD 132
             D++      F+R +  + + +P  L ++  +PHGP   AP++  +F  +T+   PA++
Sbjct: 202 LTDVLNRKGTDFIRSN--AAANRPFYLQLNTFSPHGPSTPAPRHEGLFPGLTAPRGPAFN 259

Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
            A   DK   L+    +    +   D     R Q++QS++D +  I   L+  G+ +NTY
Sbjct: 260 EADMSDKPGWLRGMPPLDQKQQAKIDEQYRLRAQSIQSINDMLADIRDTLQQTGQADNTY 319

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           ++++SD+G+H+G+  L  GK   F+ D+ VP  + GPG+  G+
Sbjct: 320 VVFSSDNGFHMGEHRLRNGKQTAFQTDVVVPLTITGPGVPAGS 362


>gi|443287713|ref|ZP_21026808.1| Arylsulfatase [Micromonospora lupini str. Lupac 08]
 gi|385885974|emb|CCH18154.1| Arylsulfatase [Micromonospora lupini str. Lupac 08]
          Length = 497

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 74/225 (32%), Positives = 118/225 (52%), Gaps = 12/225 (5%)

Query: 20  GYFGKYLNKY------NGS--YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYND 71
           G+ GKYLN Y       G+  Y+PPGW EW         YNY +N N   + +G     D
Sbjct: 134 GFLGKYLNGYLPADTQGGALPYVPPGWSEWYVAGNGYPEYNYDLNENHTVVHYGSQPA-D 192

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
           Y  ++++     F+++S  + +  P +L +S  APH P   A Q  + F  +T+  T AY
Sbjct: 193 YLTNVLSRKGKDFIQRS--AAANTPFLLEVSTFAPHSPFTPANQDLDDFPGLTAPRTAAY 250

Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
           D  P     W L     +    +   D    KR Q++Q+VD  +  +  +L   G  +NT
Sbjct: 251 DRLPTNPPSW-LAGNGPLTAQEKSTLDTNFRKRAQSVQAVDRLIGNLRAQLTTSGIADNT 309

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           Y++++SD+G+H+G++ L  GK   F+ D+RVP ++ GPG+VPG+ 
Sbjct: 310 YVVFSSDNGFHMGEYRLTGGKQTAFDTDVRVPLVVAGPGVVPGSQ 354


>gi|297623492|ref|YP_003704926.1| sulfatase [Truepera radiovictrix DSM 17093]
 gi|297164672|gb|ADI14383.1| sulfatase [Truepera radiovictrix DSM 17093]
          Length = 522

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/228 (32%), Positives = 124/228 (54%), Gaps = 25/228 (10%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALI------MNSKY--YNYSINMNGRKIKHGEDYYND 71
           GY G  +N     Y+PPGW EW   I       NS Y  Y+Y IN NGR I+H      D
Sbjct: 157 GYRGPRVN-----YVPPGWSEWYGFIDVPRDPRNSPYAMYDYKINANGR-IRHYGSRPQD 210

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
           Y  D++A  +V F+R+S +   + P  L+++  APH P   AP++A+ F ++ +   P++
Sbjct: 211 YQTDVLAGLAVDFVRRSAR---RGPFFLLLAPTAPHLPAVPAPRHADAFRDLQAPRPPSF 267

Query: 132 DYAPNPDKQWILQVTRKMQPVHRQ---FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
           +  P    +W+     ++ P+        D     + Q L +VD+ +  ++  L++ G L
Sbjct: 268 NADPQGKARWL-----ELPPLSDADIAAIDATYRAQAQMLLAVDELLGSLLDALRETGTL 322

Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           +NTY+++TSDHG+H G+  L   K  P++  +R+P ++RGPGI  G +
Sbjct: 323 DNTYLVFTSDHGWHSGEHRLFHLKLTPYDASVRLPLVIRGPGIPKGEV 370


>gi|383776154|ref|YP_005460720.1| putative sulfatase [Actinoplanes missouriensis 431]
 gi|381369386|dbj|BAL86204.1| putative sulfatase [Actinoplanes missouriensis 431]
          Length = 508

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 77/237 (32%), Positives = 120/237 (50%), Gaps = 18/237 (7%)

Query: 20  GYFGKYLNKY--------NGSYIPPGWREWGALIMNSKY--YNYSINMNGRKIKHGEDYY 69
           G+FGKYLN+Y           Y+PPGW +W   +  + Y  +NY++N NG   K+GE   
Sbjct: 145 GFFGKYLNEYLPATSFGGGKPYVPPGWDQW--FVGGNAYDNFNYALNENGTVKKYGEKPA 202

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
            DY  D++A  +  F+  S  + +  P ML ++  APH P   AP+ AN+F  + +  T 
Sbjct: 203 -DYLTDVLAARTTDFITTSAATGT--PFMLEVATYAPHAPYTPAPRDANLFPGLKAPRTA 259

Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
           AYD  P     W L    K+    +   D    KR Q++ SVD  +  +   L   G  +
Sbjct: 260 AYDKVPADAPAW-LAPHPKLNAQQKARMDQEFRKRAQSVVSVDRMLGALRSTLTKAGVAD 318

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
            T +I++SD+GYH+G+  L  GK   F+ D+ VP +  G G+  G    +D ++  +
Sbjct: 319 ETVVIFSSDNGYHMGEHSLPSGKQTAFDTDVNVPLIAAGHGVKAGQT--VDAVVANI 373


>gi|340372047|ref|XP_003384556.1| PREDICTED: extracellular sulfatase Sulf-2-like [Amphimedon
           queenslandica]
          Length = 497

 Score =  123 bits (308), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 78/247 (31%), Positives = 132/247 (53%), Gaps = 17/247 (6%)

Query: 20  GYFGKYLNKYNG--------SYIP-PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYN 70
           G FGK LN  NG        + +P PG+  +  +  +++Y+  +   NG  ++ G+D   
Sbjct: 124 GQFGKLLNP-NGVAAYCAKTNAVPVPGFDSYLTMCNDNRYFGNTFTSNGTFVEKGKDP-E 181

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
           DY   +I N S+ F+  S Q+   +P    +S  APH P   AP YA+ F +  +  TP 
Sbjct: 182 DYLTSVIGNHSLDFIEASLQA--GRPFFAYISPHAPHVPATPAPWYADKFSDKQAPRTPN 239

Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
           Y+Y+   D  ++++    +       +D L   R ++L SVDD  + +V  L+   +++N
Sbjct: 240 YNYSA-ADHHYVIRSQPILTSKEGNESDALFRDRWRSLLSVDDITKDVVSLLEKYDQVDN 298

Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI---PQVR 247
           TYI++TSDHG+ LGQF L   K  P++ DIRVPF ++GPG+  G+   +  ++   P + 
Sbjct: 299 TYILWTSDHGFQLGQFRLPSCKLQPYDHDIRVPFRIKGPGLKQGSHSFVASMVDVAPTII 358

Query: 248 KFSSGSL 254
           +   G++
Sbjct: 359 ELGGGTV 365


>gi|284042770|ref|YP_003393110.1| sulfatase [Conexibacter woesei DSM 14684]
 gi|283946991|gb|ADB49735.1| sulfatase [Conexibacter woesei DSM 14684]
          Length = 498

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 66/224 (29%), Positives = 120/224 (53%), Gaps = 14/224 (6%)

Query: 22  FGKYLNKYNGS---------YIPPGWREWGALIMNSKYYNYSINM-NGRKIKHGEDYYND 71
            GKY+N+Y+           Y+PPGW +W   +  + Y  Y   + +G  ++       D
Sbjct: 131 MGKYMNQYSPGRVRDALGRPYVPPGWTDW--RVAGNGYPGYGYRLTSGEGVERRGHAAQD 188

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
           Y  D++  D++ F+  S    + +P +L +S  APH P   AP+  + F N  +  T ++
Sbjct: 189 YLTDVLRRDAIGFV--SGAVAAGQPFLLQLSTFAPHTPATPAPRDEDRFGNAMAPRTASF 246

Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
           D A   DK   L+    +    +   D L  +R++++Q+VD A+ ++ ++L+ LG   +T
Sbjct: 247 DEADLSDKPRWLRGHPPLTAAQQWRIDDLFRERVRSVQAVDRAIGRLREQLRRLGVARST 306

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           Y++++SD+G+H+GQ  L  GK   ++ D+RVP ++ GPG+  G 
Sbjct: 307 YVVFSSDNGFHMGQHRLTPGKLTAYDADVRVPLIVAGPGVPAGA 350


>gi|340385547|ref|XP_003391271.1| PREDICTED: extracellular sulfatase Sulf-2-like [Amphimedon
           queenslandica]
          Length = 491

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 78/253 (30%), Positives = 130/253 (51%), Gaps = 13/253 (5%)

Query: 11  NLLYFDISHGYFGKYLN---KYNGSYIP---PGWREWGALIMNSKYYNYSINMNGRKIKH 64
           NL     + G FGK LN   +Y     P   PG+  +  +   ++Y+  + + NG   K 
Sbjct: 111 NLQSLGYTTGQFGKLLNGMKEYCSEKDPLRLPGFDSYLTMCNYARYFENTFSSNGTLYKT 170

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
           G     DY   +I N S+ F+  S Q+   +P    +S  APH P   AP YA+ F ++ 
Sbjct: 171 GTKP-EDYLTSVIGNHSLDFIEASLQA--GRPFFAYISPHAPHVPATPAPWYADKFSDMK 227

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
           +  TP Y+Y+   D  ++++    +       +D L   R ++L SVDD  + +V  L+ 
Sbjct: 228 APRTPNYNYSAT-DHHYVIRSQPILTSDEGDISDALFRDRWRSLLSVDDITKDVVSLLEK 286

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI- 243
             + +NTYI++TSDHGY LGQF L   K  P++ D++VPF ++GPG+  G+   +  ++ 
Sbjct: 287 YDQADNTYILWTSDHGYQLGQFRLPSCKLQPYDHDLKVPFRIKGPGLKQGSHSFVASIVD 346

Query: 244 --PQVRKFSSGSL 254
             P + +   G++
Sbjct: 347 VAPTIIELGGGTV 359


>gi|26339842|dbj|BAC33584.1| unnamed protein product [Mus musculus]
          Length = 604

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 54/84 (64%), Positives = 69/84 (82%)

Query: 153 HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGK 212
           H +FT++L  KRLQTL SVDD++E I   L + GEL+NTYI+YT+DHGYH+GQFGLVKGK
Sbjct: 1   HMEFTNMLQRKRLQTLMSVDDSMETIYDMLVETGELDNTYILYTADHGYHIGQFGLVKGK 60

Query: 213 SFPFEFDIRVPFLMRGPGIVPGTM 236
           S P+EFDIRVPF +RGP +  G++
Sbjct: 61  SMPYEFDIRVPFYVRGPNVEAGSL 84


>gi|224613406|gb|ACN60282.1| N-acetylglucosamine-6-sulfatase precursor [Salmo salar]
          Length = 291

 Score =  120 bits (300), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 65/130 (50%), Positives = 82/130 (63%), Gaps = 2/130 (1%)

Query: 105 APHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTK 163
           APH P  +APQY   F +V +    ++D  P  DK W+L Q +  M      F D    K
Sbjct: 1   APHSPWTAAPQYEKNFADVKAPRDGSFD-KPGKDKHWLLRQPSNPMPSTSLNFLDNAYRK 59

Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
           R QTL SVDD VE +VK+L++L ELNNTYI YTSD+GYH GQF L   K   ++FDIR+P
Sbjct: 60  RWQTLLSVDDMVEVLVKKLEELKELNNTYIFYTSDNGYHTGQFSLPIDKRQLYDFDIRIP 119

Query: 224 FLMRGPGIVP 233
            ++RGPGI P
Sbjct: 120 LMVRGPGIKP 129


>gi|410931337|ref|XP_003979052.1| PREDICTED: extracellular sulfatase Sulf-2-like, partial [Takifugu
           rubripes]
          Length = 613

 Score =  117 bits (292), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 56/130 (43%), Positives = 79/130 (60%), Gaps = 19/130 (14%)

Query: 1   MAWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSIN 56
           M+W  +   R    +  + GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY+  
Sbjct: 111 MSWQRQHEPRTFGVYLNNTGYRTAFFGKYLNEYNGSYVPPGWKEWLGLVKNSRFYNYT-- 168

Query: 57  MNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY 116
                         DY  DLI  +S+R+ R SK+ +  +P+++V+S  APHGPEDSAPQY
Sbjct: 169 -------------QDYLTDLITAESMRYFRYSKRVYPHRPVLMVLSHAAPHGPEDSAPQY 215

Query: 117 ANMFFNVTSH 126
           +  F N + H
Sbjct: 216 STAFPNASQH 225


>gi|37520209|ref|NP_923586.1| glucosamine-6-sulfatase [Gloeobacter violaceus PCC 7421]
 gi|35211202|dbj|BAC88581.1| gll0640 [Gloeobacter violaceus PCC 7421]
          Length = 834

 Score =  116 bits (290), Expect = 1e-23,   Method: Composition-based stats.
 Identities = 74/214 (34%), Positives = 109/214 (50%), Gaps = 7/214 (3%)

Query: 20  GYFGKYLNKYNGS--YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           G FGKY N Y+ S  Y PPGW EW    + + YYNY IN NG    +G    N Y  D++
Sbjct: 127 GLFGKYFNGYSYSAFYTPPGWDEWQTFQL-AGYYNYRINANGTIEDYGRSESN-YSTDVL 184

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
              +V F+  S  + S KP  L ++  APH P   AP++A  + ++     P Y+     
Sbjct: 185 TQKAVAFITNS--AASDKPFFLFLAPFAPHAPYTPAPRHAGRYADIPPWRPPNYNEQDVL 242

Query: 138 DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
           DK   +Q  R   P  +   D      L+ L +VDD VE I++ L+  G+  NT +I+TS
Sbjct: 243 DKPTWVQKLRPASPQTQTDYDKERQAYLEMLLAVDDGVESILQALESTGQRENTLVIFTS 302

Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           D+G   G+    + K   +E  +RVP ++  PG+
Sbjct: 303 DNGLTWGEHRWWE-KGCSYEESLRVPMVVSFPGV 335


>gi|323451516|gb|EGB07393.1| hypothetical protein AURANDRAFT_27956 [Aureococcus anophagefferens]
          Length = 447

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 83/244 (34%), Positives = 121/244 (49%), Gaps = 19/244 (7%)

Query: 3   WPVELTSRNLLYFDISHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKI 62
           W V    R+L Y   S GYFGK+LN  N    PPG   W A      Y + S  ++G ++
Sbjct: 65  WHVATHFRDLGY---SVGYFGKHLNNDNPLAAPPGVDRWVA-NGGGTYLDPSFAVDGVEV 120

Query: 63  KHGEDYYNDYYP-DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH---GP----EDSAP 114
              +      Y   +I N +V +LR        KP + V +  APH   GP       AP
Sbjct: 121 AFDDCAGRPCYSTSIIGNKTVDWLRGD----FGKPFVAVAAVKAPHVEDGPGWPVAVPAP 176

Query: 115 QY--ANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVD 172
            Y  + +F       TP ++ A  PD  W++     M       +D L   RL++L SVD
Sbjct: 177 WYNSSALFAGEKPPRTPNWN-ASCPDHHWLVASQPPMTREQAHRSDELFRARLRSLLSVD 235

Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           D V  +V  L+    L++T++++TSDHG+  GQF + +GK   +E D+RVPF +RGPG+ 
Sbjct: 236 DLVAGVVDALEASQRLDDTFVVFTSDHGFRFGQFRMPEGKWNAYENDLRVPFWIRGPGVK 295

Query: 233 PGTM 236
           PG +
Sbjct: 296 PGGV 299


>gi|307105104|gb|EFN53355.1| hypothetical protein CHLNCDRAFT_53914 [Chlorella variabilis]
          Length = 818

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 71/222 (31%), Positives = 118/222 (53%), Gaps = 12/222 (5%)

Query: 20  GYFGKYLNKYNGSYIP---PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           G  GKYL  YN ++IP   PGW  W  +  ++++YNY+++  G K K   +  +DY  D+
Sbjct: 455 GMIGKYLTGYNSTFIPHVPPGWDRW-FVFSDNEFYNYTVSDQG-KSKSFANAEDDYSTDV 512

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           +    V F++Q K     +P  L ++  A H P   A ++ N    +     PAY+ +  
Sbjct: 513 VTAHGVEFIQQVKDD--SRPFFLYLAPYACHYPHIPANRHLNKLKGLKIPRVPAYNESQQ 570

Query: 137 PDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
             ++ +      + P+  +     D     R +TL +VDD +E +   L+++GEL+NTY+
Sbjct: 571 HLRKKV-SFINSLPPIDEEMAAGLDFKYQARAETLLAVDDMIEAVFLALEEVGELDNTYV 629

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
            +TSD+GY LG   L  GK   +E D+++PF +RGPG+  G 
Sbjct: 630 FFTSDNGYKLGHHRL-SGKMSGYETDVKLPFYVRGPGVPRGV 670


>gi|159462408|ref|XP_001689434.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283422|gb|EDP09172.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 620

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 75/228 (32%), Positives = 118/228 (51%), Gaps = 25/228 (10%)

Query: 24  KYLNKYNGSYIPPGWREWG-----ALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIA 78
           K+LN+Y  +Y P G+   G     AL +N+  YN + +           Y  +Y  +LI 
Sbjct: 132 KFLNQYLETYAPSGYAPAGFTHFEALTVNA--YNMTNSCFSLSGAASTCYPGEYQTELIM 189

Query: 79  NDSVRFLRQSKQSFSKKPIMLVMSFPAPHG---------PEDSAPQYANMFFNVTSHHTP 129
             ++ ++     + +  P +L ++  APH          P   A QYAN++ + + +  P
Sbjct: 190 TKALDYI--GSYAAANAPFLLYVAPTAPHRSSTDGILWYPPTPAKQYANLY-SASENQVP 246

Query: 130 A---YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
               +D+    ++    +    +      F D L   RL+ L+SVDD +  IV +L+ LG
Sbjct: 247 RGLNWDFR---NRNLPRRGAVAVDATFEAFMDSLYLYRLRALRSVDDLIGSIVGKLQRLG 303

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            LNNTY+I+TSD+GYH+G F L+ GK+ P E D+RVPF +RGPGI  G
Sbjct: 304 VLNNTYVIFTSDNGYHMGAFSLLDGKNLPIEEDVRVPFFIRGPGIPAG 351


>gi|355690576|gb|AER99199.1| glucosamine -6-sulfatase [Mustela putorius furo]
          Length = 220

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 67/163 (41%), Positives = 91/163 (55%), Gaps = 11/163 (6%)

Query: 21  YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           + GKYLN+Y         ++P GW  W AL  NSKYYNY++++NG+  KHGE+Y  DY  
Sbjct: 62  FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 121

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D++AN S+ FL     S   +P  +++S PAPH P  +APQY   F NV +     ++  
Sbjct: 122 DVLANISLGFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKSFQNVFAPRNKNFNIH 178

Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVE 176
              +K W++ Q    M     QF D    KR QTL SVDD VE
Sbjct: 179 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVE 220


>gi|336176655|ref|YP_004582030.1| N-acetylglucosamine-6-sulfatase [Frankia symbiont of Datisca
           glomerata]
 gi|334857635|gb|AEH08109.1| N-acetylglucosamine-6-sulfatase [Frankia symbiont of Datisca
           glomerata]
          Length = 540

 Score =  110 bits (274), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 77/221 (34%), Positives = 127/221 (57%), Gaps = 12/221 (5%)

Query: 23  GKYLN--KYNGSYIPPGWREWGALIMNSKY--YNYSINMNGRKIKHG--EDYYNDYYPDL 76
           GKY+N  + + +++PPGW EW     N  Y  YNY++N NG  + +G  E+  N Y  D+
Sbjct: 170 GKYMNGIEADPTHVPPGWSEWYGSADNFFYTGYNYALNENGAIVDYGGPENPAN-YSTDV 228

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA-NMFFNVTSHHTPAYDYAP 135
           +A  S  F+R++  +   +P  L  +  APH P   AP++A N F +  +  TP ++   
Sbjct: 229 VAAKSTDFIRRA--AAGDEPFFLYAASTAPHLPLPPAPRHADNPFADDAAPRTPNFNEPD 286

Query: 136 NPDKQWILQVTRKMQPVHRQFTD-LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
             DK W LQ + + +       +      R+ +L ++D+ V  IV+ L+ +GEL++TY++
Sbjct: 287 VSDKPWWLQQSARERSAQVAVVNNQDYQNRMGSLYALDEMVRDIVETLRSVGELDDTYLV 346

Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           +TSD+GY+LG   L+  K  P++  +RVP ++ GPGI PGT
Sbjct: 347 FTSDNGYNLGAHRLIN-KQAPYDESMRVPLVVAGPGIAPGT 386


>gi|159468317|ref|XP_001692329.1| hypothetical protein CHLREDRAFT_145838 [Chlamydomonas reinhardtii]
 gi|158278515|gb|EDP04279.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 651

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 75/234 (32%), Positives = 116/234 (49%), Gaps = 27/234 (11%)

Query: 23  GKYLNKY----------NGSYIPPGWREWGALIMNS-KYYNYSINMNGRKIKHGEDYYND 71
           GK+LN Y          +G Y P GW  + AL   +  Y+N     NG   K    +   
Sbjct: 167 GKFLNAYMTTAAFMGADSGDYYPRGWTIFDALTQGTYAYWNSCFAYNGGSDKC---FPKQ 223

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--------GPEDSAPQYANMFFNV 123
           Y  D+I + ++ +LR +       P  + ++ PAPH         P   A ++AN++ N 
Sbjct: 224 YQTDIIRDKALSYLRTATAK-KDTPFFMYIAPPAPHVELTNNGWQPPSPAARHANLYAND 282

Query: 124 TSHHTPAYDYA-PNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
                   ++  PNP+   I      M P      +    +RL++L++VD+ ++ +V +L
Sbjct: 283 KIRMPKGANFGVPNPN---IPMEVDDMTPDFVAQLEFHYLQRLRSLRAVDEMLDAVVNQL 339

Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
              G+L+NTY++YTSD+G HLGQF L  GK+   E D RVPF +RGPGI  G +
Sbjct: 340 SASGQLDNTYVLYTSDNGLHLGQFSLGDGKATAVEEDSRVPFYIRGPGIPAGQV 393


>gi|169786467|ref|XP_001827694.1| arylsulfatase [Aspergillus oryzae RIB40]
 gi|83776442|dbj|BAE66561.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|391866120|gb|EIT75392.1| sulfatase [Aspergillus oryzae 3.042]
          Length = 576

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 74/233 (31%), Positives = 120/233 (51%), Gaps = 23/233 (9%)

Query: 21  YFGKYLNKYNGSY----IPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           Y GK +N ++ S      P G+      +++   Y+Y  +   R  +  ++Y   Y  D+
Sbjct: 131 YTGKLMNAHSLSTYNKPFPKGFNG-SDFLLDPHTYSYYNSTYQRNREPPKNYAGHYTTDV 189

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS---------------APQYANMFF 121
           I   ++  L ++ +S   +P  + +S  APH   D                AP++ ++F 
Sbjct: 190 ITEKALGLLDEALES--DRPFFVAVSPVAPHSNIDPNTMGTAASIMSEPIPAPRHRHLFQ 247

Query: 122 NVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
            V    TP+++        W+  +  + Q V   + D    +RL+ LQ VD+ VEK+V  
Sbjct: 248 GVKVPRTPSFNPLNRTGVSWVSNLALQNQTV-IDYEDHYYRQRLRALQGVDELVEKLVSR 306

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           L++ GE++NTYIIYTSD+G+H+GQ  +  GK+  FE DIRVPF +RGPGI  G
Sbjct: 307 LEESGEIDNTYIIYTSDNGFHIGQHRMPPGKTTGFEEDIRVPFFIRGPGIPEG 359


>gi|238507485|ref|XP_002384944.1| arylsulfatase, putative [Aspergillus flavus NRRL3357]
 gi|220689657|gb|EED46008.1| arylsulfatase, putative [Aspergillus flavus NRRL3357]
          Length = 581

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 74/233 (31%), Positives = 120/233 (51%), Gaps = 23/233 (9%)

Query: 21  YFGKYLNKYNGSY----IPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           Y GK +N ++ S      P G+      +++   Y+Y  +   R  +  ++Y   Y  D+
Sbjct: 131 YTGKLMNAHSLSTYNKPFPKGFNG-SDFLLDPHTYSYYNSTYQRNREPPKNYAGHYTTDV 189

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS---------------APQYANMFF 121
           I   ++  L ++ +S   +P  + +S  APH   D                AP++ ++F 
Sbjct: 190 ITEKALGLLDEALES--DRPFFVAVSPVAPHSNIDPNTMGTAASIMSEPIPAPRHRHLFQ 247

Query: 122 NVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
            V    TP+++        W+  +  + Q V   + D    +RL+ LQ VD+ VEK+V  
Sbjct: 248 GVKVPRTPSFNPLNRTGVSWVSNLALQNQTV-IDYEDHYYRQRLRALQGVDELVEKLVSR 306

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           L++ GE++NTYIIYTSD+G+H+GQ  +  GK+  FE DIRVPF +RGPGI  G
Sbjct: 307 LEESGEIDNTYIIYTSDNGFHIGQHRMPPGKTTGFEEDIRVPFFIRGPGIPEG 359


>gi|453082749|gb|EMF10796.1| Arylsulphatase, partial [Mycosphaerella populorum SO2202]
          Length = 538

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 79/267 (29%), Positives = 127/267 (47%), Gaps = 25/267 (9%)

Query: 19  HGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIA 78
           H +    +N +NGS            +++   Y+Y  +   R       Y   Y PD++A
Sbjct: 90  HNHRNPPVNGFNGS----------DFLLDPYTYSYWKSQMSRNHADPVSYEGQYSPDVVA 139

Query: 79  NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-------SAPQYANMFFNVTSHHTPAY 131
             +  +L ++  S   +P  +V +  APH  +D       +APQYA    ++   +    
Sbjct: 140 EKTYGWLDEAA-SHHDRPFFIVAAPIAPHSDQDPEARGVMTAPQYAPRHAHLFQDYQIPR 198

Query: 132 DYAPNPDKQWILQVTRKMQPVHR---QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
               NP+    +   + + P++    ++ D     RL+ LQSVD+ VE+IV+ L   G L
Sbjct: 199 TANFNPEAASGVSWVKNLAPLNATVVEYNDEYQRSRLRALQSVDEMVEQIVERLDSHGLL 258

Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI----VPGTMYLLDVLIP 244
           + TYIIYT+D+GYH+ Q  L  GK  PFE DI VP L+RGPG+    V G +     L P
Sbjct: 259 DTTYIIYTTDNGYHISQHRLPPGKECPFETDINVPLLIRGPGVPRGQVAGIVSSHTDLAP 318

Query: 245 QVRKFSSGSLIFIMSFLINLNTMRAEK 271
            + K +  +   +    I L++  AE+
Sbjct: 319 TILKLAGQNRPDLDGTPIPLSSTEAEQ 345


>gi|67903834|ref|XP_682173.1| hypothetical protein AN8904.2 [Aspergillus nidulans FGSC A4]
 gi|40744962|gb|EAA64118.1| hypothetical protein AN8904.2 [Aspergillus nidulans FGSC A4]
          Length = 1264

 Score =  108 bits (269), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 65/198 (32%), Positives = 103/198 (52%), Gaps = 18/198 (9%)

Query: 50   YYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
            Y+N S   N    K    Y   Y  D+    ++ F+  + +   ++P  L ++  APH  
Sbjct: 852  YWNSSYQRNHEAPK---SYAGQYTTDVTEEKALGFVDDALED-KERPFFLTVAPIAPHFE 907

Query: 110  EDS-------------APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF 156
            +D              AP++A++F +      P+++        WI  +  + Q V   +
Sbjct: 908  QDPGHSSDTPPQAPIPAPRHAHLFPDAKVPRVPSFNPLNQTGPSWIRDLKHQNQSV-VDY 966

Query: 157  TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
             D    +RL+ LQSVD+ V+K++  L+  G+LNNTY+IY+SD+G+H+G   L  GKS  +
Sbjct: 967  EDFFYRQRLRALQSVDEMVDKLLDRLERSGQLNNTYVIYSSDNGFHIGHHRLPPGKSTSY 1026

Query: 217  EFDIRVPFLMRGPGIVPG 234
            E DIRVPF +RGPGI  G
Sbjct: 1027 EEDIRVPFFIRGPGIKSG 1044


>gi|340372495|ref|XP_003384779.1| PREDICTED: arylsulfatase-like [Amphimedon queenslandica]
          Length = 723

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 72/223 (32%), Positives = 115/223 (51%), Gaps = 22/223 (9%)

Query: 22  FGKYLNKYNGSY---IPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIA 78
            GKYLN Y+ ++   +PPGW  W  +  ++ YY    + NG+ +K  ++ Y     D I+
Sbjct: 273 IGKYLNGYSDAHADVVPPGWDYWHGM-TDTAYYGPHFSDNGKLLKVPKEVYQ---TDYIS 328

Query: 79  NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
             +   L++   +   KP  + +S  APH P   A ++ N+F +  +  T ++    NPD
Sbjct: 329 ETATTLLKKRSPN---KPFFMYLSPFAPHAPSIPAHRHENLFNDKQAPRTVSF----NPD 381

Query: 139 KQ-------WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
            +       W+  +  K+        D     RL+ LQ+VD+ +  + + LK  GE +NT
Sbjct: 382 DKVQKEKPSWLGHLP-KLTDKQIDSIDNFYRNRLRALQAVDEMLVSLEETLKGTGEWDNT 440

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           Y+ Y  D+G HLG + L  GK   ++ DIRVPFL+RGPGI  G
Sbjct: 441 YVFYMGDNGQHLGDYRLPGGKRQAYDTDIRVPFLVRGPGIKGG 483


>gi|355722614|gb|AES07631.1| sulfatase 2 [Mustela putorius furo]
          Length = 596

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 50/83 (60%), Positives = 64/83 (77%), Gaps = 5/83 (6%)

Query: 159 LLMTKRLQTLQS-----VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
           +L  KRLQTL S     VDD++E +   L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS
Sbjct: 1   MLQRKRLQTLMSXTLMSVDDSMETLYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKS 60

Query: 214 FPFEFDIRVPFLMRGPGIVPGTM 236
            P+EFDIRVPF +RGP +  G++
Sbjct: 61  MPYEFDIRVPFYVRGPNVEAGSL 83


>gi|429848907|gb|ELA24343.1| arylsulfatase [Colletotrichum gloeosporioides Nara gc5]
          Length = 576

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 76/216 (35%), Positives = 110/216 (50%), Gaps = 22/216 (10%)

Query: 35  PPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSK 94
           P GW      +++   Y Y      R     + Y  +Y  DL+A  ++ FL  + +    
Sbjct: 145 PKGWTG-SDFLLDPNTYLYLNATFQRNDSPPQTYPGEYSNDLVAERALGFLDDALKFNGS 203

Query: 95  KPIMLVMSFPAPH----GPEDSA--------PQYANMFFNVTSHHTPAYDYAPNPDK--- 139
           KP  + ++   PH    G E SA         ++ N+F N+T   T    Y  NPDK   
Sbjct: 204 KPFFVGIAPIGPHNNGLGGETSAGATPPIPAKRHENLFPNITVPRT----YNFNPDKPSM 259

Query: 140 -QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
             W L +  K+      + D    +RLQTLQSVD+ V+ + K L+D G L+NTY+I+TSD
Sbjct: 260 ANWGL-IIPKLNSSEIAYGDNYYRRRLQTLQSVDEMVDAVFKRLEDAGVLDNTYVIFTSD 318

Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +G+H+GQ  L  GK+   E DI VPFL+RGPG+  G
Sbjct: 319 NGFHIGQHRLKPGKTCAIEEDINVPFLIRGPGVPKG 354


>gi|330916911|ref|XP_003297607.1| hypothetical protein PTT_08067 [Pyrenophora teres f. teres 0-1]
 gi|311329644|gb|EFQ94320.1| hypothetical protein PTT_08067 [Pyrenophora teres f. teres 0-1]
          Length = 476

 Score =  107 bits (267), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 77/235 (32%), Positives = 118/235 (50%), Gaps = 23/235 (9%)

Query: 21  YFGKYLNKYNGS-YIPP---GWREWGALIMNSKY--YNYSINMNGRKIKHGEDYYNDYYP 74
           Y GK +N ++ + Y  P   GW +   LI    Y  YN S+  N    K    +  +Y  
Sbjct: 33  YTGKLMNGHSTTTYNKPRAAGWNQSDFLIDPGTYVFYNTSMTRNNDPYKF---FPGEYST 89

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------PEDSAPQYANMFFN 122
           DL++  +V FL  +  + S++P  L ++  APH             P   A ++ ++F N
Sbjct: 90  DLVSKAAVGFLDDAIAAASERPFFLGVAPVAPHSETITDPRPAKFNPPVPAKRHEHLFPN 149

Query: 123 VTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
           VT   TP ++    P      +  R++      + D+   KRLQ+LQSVD+ V+ I+  L
Sbjct: 150 VTVPRTPNFN-PEKPGTASYFKTLRQLNRTELDYNDVWYRKRLQSLQSVDELVDSIMDRL 208

Query: 183 KDLGE-LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
               E + NTY+IYT+D+G+H+GQ  L  GKS   E D+ +PF MRGPGI    +
Sbjct: 209 GASPEVIENTYMIYTTDNGFHIGQHRLGPGKSCGIEEDVNIPFFMRGPGIAKAAV 263


>gi|86740887|ref|YP_481287.1| sulfatase [Frankia sp. CcI3]
 gi|86567749|gb|ABD11558.1| sulfatase [Frankia sp. CcI3]
          Length = 524

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 76/225 (33%), Positives = 113/225 (50%), Gaps = 23/225 (10%)

Query: 22  FGKYLNKYNGSYIPP-------GWREWGALIMNSKYY---NYSINMNGRKIKH--GEDYY 69
            GKYLN Y    I P       GW +W   + N+  Y   N+  N NG  ++H  G+D  
Sbjct: 154 LGKYLNGYGDPTITPTTGPVPRGWSDW--HVSNTTGYAELNFDQNDNG-VVRHYAGQD-- 208

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
            +Y  D++  D+  F+R+S    + KP  L ++  APH P   AP+ A+ F  +T    P
Sbjct: 209 -NYGVDVLNADAQAFIRRS----AGKPFALEVATYAPHQPYTPAPRNADDFPGLTEPRDP 263

Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
           +++        W+ Q    + P      D    +R Q ++SVD  V      L     L+
Sbjct: 264 SFNTNNTDAPAWLGQRA-PLAPSVLTNLDQAYRERAQAVESVDKLVGDTEATLAAEHLLD 322

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           NTY +++SD+GYHLGQ  LV+GK   F+ DIRVP ++ GPG+  G
Sbjct: 323 NTYFVFSSDNGYHLGQHRLVRGKQTAFDTDIRVPLIVTGPGVPHG 367


>gi|259486661|tpe|CBF84693.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
          Length = 565

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 65/199 (32%), Positives = 103/199 (51%), Gaps = 18/199 (9%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
            Y+N S   N    K    Y   Y  D+    ++ F+  + +   ++P  L ++  APH 
Sbjct: 156 SYWNSSYQRNHEAPK---SYAGQYTTDVTEEKALGFVDDALED-KERPFFLTVAPIAPHF 211

Query: 109 PEDS-------------APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQ 155
            +D              AP++A++F +      P+++        WI  +  + Q V   
Sbjct: 212 EQDPGHSSDTPPQAPIPAPRHAHLFPDAKVPRVPSFNPLNQTGPSWIRDLKHQNQSV-VD 270

Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
           + D    +RL+ LQSVD+ V+K++  L+  G+LNNTY+IY+SD+G+H+G   L  GKS  
Sbjct: 271 YEDFFYRQRLRALQSVDEMVDKLLDRLERSGQLNNTYVIYSSDNGFHIGHHRLPPGKSTS 330

Query: 216 FEFDIRVPFLMRGPGIVPG 234
           +E DIRVPF +RGPGI  G
Sbjct: 331 YEEDIRVPFFIRGPGIKSG 349


>gi|361131542|gb|EHL03215.1| putative Arylsulfatase [Glarea lozoyensis 74030]
          Length = 371

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 81/230 (35%), Positives = 116/230 (50%), Gaps = 22/230 (9%)

Query: 21  YFGKYLNKYN-GSYIPP---GWREWGALIMNS--KYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GKYLN +   +Y  P   G+     L+  S  +YYN S++ NG       +Y   Y  
Sbjct: 39  YVGKYLNGHTLDNYNKPLAKGFTGSEFLLDPSTYQYYNASMSRNGGIPV---NYPGQYSN 95

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP------EDSAPQY----ANMFFNVT 124
           DL+AN +  FL  +  +   KP  L ++   PH        E SAPQY    AN+F  +T
Sbjct: 96  DLVANKAYDFLDDAIAA--GKPFFLGIAPIGPHSEFQFATLEASAPQYPPRHANLFNTIT 153

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
              TP ++ +      WI  +   + P    + D     RL++LQS+D+ +E +   L  
Sbjct: 154 VPRTPNFNPSTPGGTSWISTLP-PLTPEALAYGDTFYRARLRSLQSIDEMIESLTLRLNT 212

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
              L NTYIIYTSD+G+H+ Q  L  GK+  FE DI +P L+RGPG+  G
Sbjct: 213 KKLLENTYIIYTSDNGFHISQHRLPPGKNCGFEEDINIPLLIRGPGVSKG 262


>gi|258572460|ref|XP_002544992.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237905262|gb|EEP79663.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 580

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 66/223 (29%), Positives = 114/223 (51%), Gaps = 15/223 (6%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGA--LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIA 78
           Y GK++N  + +   PG + W     ++    Y+Y   +  R      DY   Y  D++A
Sbjct: 126 YVGKFMNGLSTNNYNPGPKGWTGSDFLLGMTTYDYWGAVMSRNRGAPVDYTGQYSGDVVA 185

Query: 79  NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP----------QYANMFFNVTSHHT 128
           +  + F+  + Q    KP  +  +  APHG     P          ++A++F +     T
Sbjct: 186 DKVLGFMDDAIQD--DKPFFVAAAPVAPHGQVKPRPLWFDKPEYPERHAHLFKDYKIPRT 243

Query: 129 PAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
           P ++        W+  + R  Q  H  + D     RL+ LQSVD+ V ++V++L++ G L
Sbjct: 244 PNFNPDTPSGAGWVAAMPRLNQ-THIDYHDEYQRCRLRALQSVDEMVGQMVEKLEENGLL 302

Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           +NTY +YT+D+G+H+ Q+ L  GK+  F+ DIR+P ++RGPG+
Sbjct: 303 DNTYFMYTTDNGFHISQYRLPPGKTCGFDTDIRIPMIVRGPGV 345


>gi|302886649|ref|XP_003042214.1| hypothetical protein NECHADRAFT_42509 [Nectria haematococca mpVI
           77-13-4]
 gi|256723123|gb|EEU36501.1| hypothetical protein NECHADRAFT_42509 [Nectria haematococca mpVI
           77-13-4]
          Length = 591

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 67/197 (34%), Positives = 105/197 (53%), Gaps = 17/197 (8%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
           +Y N ++  NG       +Y   Y PDL A  +  FL  +  +  ++P  LV +  APHG
Sbjct: 155 QYLNATMTRNGAPPV---NYAGLYSPDLTAEKAYGFLDDALSN--EEPWFLVHAPIAPHG 209

Query: 109 PED-------SAPQYANMFFNVTSHHTPAYDYAPNPDKQ----WILQVTRKMQPVHRQFT 157
             D        AP+YA    ++   +    D + NP+ Q    W+ ++ R    V   + 
Sbjct: 210 TVDLTPNFFSDAPKYAERHTHLFKDYIIPRDESFNPEVQGGVSWVKELPRLNDTV-IAYN 268

Query: 158 DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFE 217
           D     RL+ LQSVD+ VE ++K+++  G+L+NTYIIYT+D+GYH+ Q  +  GK   FE
Sbjct: 269 DEYQRARLRALQSVDEMVEAMIKKVEAAGQLDNTYIIYTTDNGYHISQHRMHPGKECGFE 328

Query: 218 FDIRVPFLMRGPGIVPG 234
            D+ +P ++RGPG+  G
Sbjct: 329 TDVHIPLIIRGPGVPAG 345


>gi|67901294|ref|XP_680903.1| hypothetical protein AN7634.2 [Aspergillus nidulans FGSC A4]
 gi|40742630|gb|EAA61820.1| hypothetical protein AN7634.2 [Aspergillus nidulans FGSC A4]
 gi|259483966|tpe|CBF79787.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
          Length = 562

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 60/201 (29%), Positives = 106/201 (52%), Gaps = 18/201 (8%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
           +YYN  +  NG +     DY   Y  D++   ++ FL ++ ++   +P ML ++  APH 
Sbjct: 124 RYYNAKMTRNGAEPV---DYSGQYVTDVLTEKAIGFLNETLKN-PDRPWMLTVAPNAPHS 179

Query: 108 ------------GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQ 155
                       G  + AP++AN+F +       +++        W+  +++  Q     
Sbjct: 180 NGSATADGTHWFGEPEYAPRHANLFKDYKVPRDKSFNTVIEGAVGWVKDISQLSQK-EVD 238

Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
           + D     RL+ LQ+VD+ V++++  L + G L+NTYI +++D+GYHLGQ     GK+  
Sbjct: 239 YIDEFQRCRLRALQAVDEMVDRLISRLDEAGVLDNTYIFFSTDNGYHLGQHAFRPGKNCG 298

Query: 216 FEFDIRVPFLMRGPGIVPGTM 236
           +E DI VP ++RGPG+  GT+
Sbjct: 299 YETDINVPLIVRGPGVPKGTV 319


>gi|255957079|ref|XP_002569292.1| Pc21g23250 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591003|emb|CAP97222.1| Pc21g23250 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 571

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 62/189 (32%), Positives = 105/189 (55%), Gaps = 19/189 (10%)

Query: 68  YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--GPEDS------------A 113
           Y   Y  D+IA  +  F+ ++ +   ++P ML ++  APH  G  DS            A
Sbjct: 166 YDGQYSTDVIAEKAEGFIDEAVKH--EQPWMLTVAPNAPHSNGSHDSTRNANWFGEPEFA 223

Query: 114 PQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
           P++A++F +V +    +++        W+  V  ++   H  + D     RL+ LQ+VD+
Sbjct: 224 PRHADLFKDVRAPRDESFNTLIENAVSWVQNVP-ELNQTHIDYIDEFQRCRLRALQAVDE 282

Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            + K+V++L+ +GEL+NTYI Y++D+GYHLGQ  +  GK+  +E DI VP ++RGPG+  
Sbjct: 283 MIGKLVEKLEKVGELDNTYIFYSTDNGYHLGQHRMQPGKNCGYETDINVPLIVRGPGM-- 340

Query: 234 GTMYLLDVL 242
           G    LD +
Sbjct: 341 GKNQTLDAI 349


>gi|452977923|gb|EME77687.1| hypothetical protein MYCFIDRAFT_65580 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 583

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 77/263 (29%), Positives = 121/263 (46%), Gaps = 30/263 (11%)

Query: 21  YFGKYLNKYN-GSYIPPGWREW--GALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           Y GK  N  N  +Y  P    W     +++   Y Y      R       Y   Y  D++
Sbjct: 136 YVGKLFNAQNVDNYASPHAAGWTGSEFLLDPYTYEYLNATFQRNHDPPVSYEGLYSTDVL 195

Query: 78  ANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHG--------------------PEDSAPQ 115
           AN S  FL     ++  S KP  L ++  APH                     P  SA +
Sbjct: 196 ANKSFEFLADGLDAYKASGKPFFLTIAPTAPHSNVHMNTAIDGNFSEGSNTQSPPVSAER 255

Query: 116 YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAV 175
           +A++F +V    TP ++ +      WI ++  +    +  F D    +RL++LQ+VD+ V
Sbjct: 256 HAHLFEDVIVPRTPHFNPSNPHGVSWISRLPEQNN-TNLAFNDHFYRQRLRSLQAVDEMV 314

Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
            ++ K L+D   L++TY+ +++D+GYH+GQ  L  GK  PFE DI +PF++RGPGI  G 
Sbjct: 315 GELFKRLEDARVLDDTYVFFSTDNGYHIGQHRLQPGKQCPFEEDINIPFMVRGPGIGKGV 374

Query: 236 MYLLDV----LIPQVRKFSSGSL 254
              +      L P   K + G +
Sbjct: 375 TADIVTTHTDLAPTFLKLAGGEI 397


>gi|389750805|gb|EIM91878.1| alkaline phosphatase-like protein [Stereum hirsutum FP-91666 SS1]
          Length = 618

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 77/239 (32%), Positives = 114/239 (47%), Gaps = 23/239 (9%)

Query: 21  YFGKYLNKYNGSYI----PPGWREWGALIMNSKY--YNYSINMNGRKIKHGEDYYNDYYP 74
           Y GK +N    + I      GW     L+  + Y  YN S + NG       +Y  +Y  
Sbjct: 135 YTGKLMNGMKTTNIQELPAAGWTNADFLLDPNTYLFYNASFSANGEA---SINYEGEYQT 191

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED---------SAPQYANMFFNVTS 125
            LI   S   L+Q+  S S+ P  L ++  APH              +P+YA  F +   
Sbjct: 192 ALIQQKSTSMLQQALASPSQ-PFFLGIAPTAPHLEVQFNGSFTEPLPSPEYAGSFNDSIV 250

Query: 126 HHTPAYDYAPNPDK---QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
             +P Y+           W+ ++  ++      + D +  +RL++LQSVDD VE IV+++
Sbjct: 251 PRSPNYNVPVGGPGGGVSWLKELP-ELNDTVVDYIDHVYRQRLRSLQSVDDLVENIVQQV 309

Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
           +D G  NNTYIIY +D+GY LG       K+ PFE D+ +P  +RGPGI P T    DV
Sbjct: 310 EDAGIANNTYIIYVADNGYALGSHRRQPSKTLPFEEDVLIPLFIRGPGITPNTTNSADV 368


>gi|350633016|gb|EHA21383.1| hypothetical protein ASPNIDRAFT_44607 [Aspergillus niger ATCC 1015]
          Length = 588

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 73/234 (31%), Positives = 118/234 (50%), Gaps = 37/234 (15%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           Y   ++N +NGS            +++   Y+Y      R  +    Y   Y  D++   
Sbjct: 154 YNAPFVNGFNGS----------DFLLDPHTYSYWNATYQRNHEPPRSYEGQYTTDVMKEK 203

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDS----APQYANMFFNVT 124
           +   L  +  S    P  L ++  APH            GP+ +    AP++A++F +  
Sbjct: 204 ASGLLADALDS--DAPFFLTVAPIAPHTNIDVEGLSGAGGPKMTEPLPAPRHAHLFADAK 261

Query: 125 SHHTPAYDYAPNPDKQ----WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
              TP +    NPDK     WI  +  + Q V   + D L  +RL+TLQ+VD+ V+ ++ 
Sbjct: 262 VPRTPNF----NPDKDSGAGWIQTMELQNQTV-IDYEDHLYRQRLRTLQAVDEMVDALIT 316

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +L++ G+++NTYIIY++D+GYH+G   L  GK+  +E DIRVPF +RGPGI  G
Sbjct: 317 QLEESGQIDNTYIIYSADNGYHIGHHRLPPGKTTGYEEDIRVPFYIRGPGIPEG 370


>gi|37523000|ref|NP_926377.1| sulfatase [Gloeobacter violaceus PCC 7421]
 gi|35214003|dbj|BAC91372.1| gll3431 [Gloeobacter violaceus PCC 7421]
          Length = 521

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 76/242 (31%), Positives = 111/242 (45%), Gaps = 35/242 (14%)

Query: 20  GYFGKYLNKY-----------NGSYIPPGWREWGALIMNSKY--YNYSINMNGRKIKHGE 66
           G+ GKYLN Y           + +Y+PPGW  W  L+  + Y  YN+ IN NGR   +G 
Sbjct: 141 GFLGKYLNGYGANKDKSSPRDDATYVPPGWDVWQGLVDPTTYQVYNFKINENGRVANYGT 200

Query: 67  DYYN---DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-------------GPE 110
           D      +Y  D+++  +V F+ Q        P  L ++  APH              P+
Sbjct: 201 DQAEPPEEYQTDVLSARAVNFVEQYGSG--DAPFFLWVNPIAPHFELRGTRRCTVNPRPQ 258

Query: 111 DS---APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT 167
           +S   AP++A     V     PA++     DK   +Q    M          L   RL+ 
Sbjct: 259 NSIRPAPRHAGSAAAVPLPRGPAFNEQDVSDKPTWVQQNPAMSERTIDCLQNLYRNRLEA 318

Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
           +++VDD V  +   L+  G L +T ++ TSD+GY LGQ  L   K  P+E  IRVP L+R
Sbjct: 319 MRAVDDLVAALFDALERTGALGDTIVLLTSDNGYLLGQHRLT-AKVLPYEESIRVPLLVR 377

Query: 228 GP 229
            P
Sbjct: 378 VP 379


>gi|145235761|ref|XP_001390529.1| arylsulfatase [Aspergillus niger CBS 513.88]
 gi|134058218|emb|CAK38410.1| unnamed protein product [Aspergillus niger]
          Length = 581

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 73/234 (31%), Positives = 118/234 (50%), Gaps = 37/234 (15%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           Y   ++N +NGS            +++   Y+Y      R  +    Y   Y  D++   
Sbjct: 147 YNAPFVNGFNGS----------DFLLDPHTYSYWNATYQRNHEPPRSYEGQYTTDVMKEK 196

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDS----APQYANMFFNVT 124
           +   L  +  S    P  L ++  APH            GP+ +    AP++A++F +  
Sbjct: 197 ASGLLADALDS--DAPFFLTVAPIAPHTNIDVEGLSGAGGPKMTEPLPAPRHAHLFADAK 254

Query: 125 SHHTPAYDYAPNPDKQ----WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
              TP +    NPDK     WI  +  + Q V   + D L  +RL+TLQ+VD+ V+ ++ 
Sbjct: 255 VPRTPNF----NPDKDSGAGWIQTMELQNQTV-IDYEDHLYRQRLRTLQAVDEMVDALIT 309

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +L++ G+++NTYIIY++D+GYH+G   L  GK+  +E DIRVPF +RGPGI  G
Sbjct: 310 QLEESGQIDNTYIIYSADNGYHIGHHRLPPGKTTGYEEDIRVPFYIRGPGIPEG 363


>gi|269124890|ref|YP_003298260.1| sulfatase [Thermomonospora curvata DSM 43183]
 gi|268309848|gb|ACY96222.1| sulfatase [Thermomonospora curvata DSM 43183]
          Length = 474

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 73/240 (30%), Positives = 122/240 (50%), Gaps = 24/240 (10%)

Query: 15  FDISHGYFGKYLNKY--NGSYIPPGWREWGALIMNSKY--YNYSINMNGRKIKHGEDYYN 70
           +D  H   GKYLN Y  +GS IP GW++W  L+  S Y  + Y++N NG+  K+G+    
Sbjct: 119 YDTVH--IGKYLNGYGRHGSEIPQGWKDWYGLMGPSTYSMWGYTLNENGKLRKYGKKKGG 176

Query: 71  ---DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-------------GPEDSAP 114
              +Y  D++A  +V +L  ++ +  K+P  +  +  APH             G    AP
Sbjct: 177 KPANYQTDVLARKAVAYL--NRPTTGKRPFFMWFAPLAPHLEAPWNKKMGDKWGGPRPAP 234

Query: 115 QYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
           ++ N + N     +P+++     DK   ++  ++               RL++L +VD+A
Sbjct: 235 RHRNAYDNAVMWRSPSFNEKDISDKPRHVRRNKRFGKKKIASIQAEYRNRLESLLAVDEA 294

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           V KI + +K  G+L+ T  ++TSD+GY LGQ      K  P+E  +R+P ++RGPG   G
Sbjct: 295 VAKIYRAVKRRGQLDRTLFVFTSDNGYMLGQHRFAHQKIHPYEESVRIPLIVRGPGFPKG 354


>gi|158316689|ref|YP_001509197.1| sulfatase [Frankia sp. EAN1pec]
 gi|158112094|gb|ABW14291.1| sulfatase [Frankia sp. EAN1pec]
          Length = 534

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 73/222 (32%), Positives = 114/222 (51%), Gaps = 12/222 (5%)

Query: 22  FGKYLNKYNGS--YIPPGWREWGALIMNSKY--YNYSINMNGRKIKHG--EDYYNDYYPD 75
            GKY+N    +  ++PPGW EW   + N  Y  YNY++N NG  + +G   D  N Y  D
Sbjct: 155 VGKYMNGIEDAPDHVPPGWTEWYGSVDNFFYTGYNYALNENGTIVHYGGPSDPAN-YSTD 213

Query: 76  LIANDSVRFLRQSKQSFSKKPIML-VMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           ++A  SV FL ++  +   +P ML   S         +     N F +  +  +P Y   
Sbjct: 214 VVAAKSVDFLERA--AAKDEPFMLYTASTAPHLPLPPAPRDSNNPFTDDLAPRSPNYQEP 271

Query: 135 PNPDKQWILQVTRKMQPVHRQF-TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              DK   L+ +  ++        D     R+ +L ++DD V  IV  L+D GEL++TY+
Sbjct: 272 DVSDKPAWLRTSAGVRSAQVNLINDNDYRNRMGSLLALDDMVGDIVTTLRDTGELDHTYL 331

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           ++TSD+GY+LG   L+  K  P+E  +RVP ++ GPG+  GT
Sbjct: 332 VFTSDNGYNLGAHRLIH-KMAPYEESLRVPLVVAGPGVTRGT 372


>gi|323450570|gb|EGB06451.1| hypothetical protein AURANDRAFT_65638 [Aureococcus anophagefferens]
          Length = 515

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 60/170 (35%), Positives = 94/170 (55%), Gaps = 11/170 (6%)

Query: 73  YPDLIANDSVRFLRQSKQSFSKK----PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHT 128
           Y +L+   +  F RQ   + +++    P    +   APH P   AP YA+ F +V    T
Sbjct: 130 YEELLNETAPVFDRQPDAALARRAHTGPFFAYVGPHAPHFPATPAPWYAHAFDDVGIPET 189

Query: 129 PAYDYAPNPDKQWILQVTRKMQPVH---RQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
           P Y+ + + DK   +Q  R+  P+    + +       R  +L SVDD V  +  +L+ +
Sbjct: 190 PNYNCS-HEDK---VQHVRQTPPIDDRVKCWQSQHFRDRWASLLSVDDLVFSLYHKLEAM 245

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           G L++TYI+YTSDHGY  GQ+     K+ P+E D+R+PF++RGPGI PG+
Sbjct: 246 GVLDDTYIVYTSDHGYKFGQWRQTTSKTHPYETDVRIPFIVRGPGIAPGS 295


>gi|325981531|ref|YP_004293933.1| N-acetylglucosamine-6-sulfatase [Nitrosomonas sp. AL212]
 gi|325531050|gb|ADZ25771.1| N-acetylglucosamine-6-sulfatase [Nitrosomonas sp. AL212]
          Length = 489

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 71/233 (30%), Positives = 119/233 (51%), Gaps = 20/233 (8%)

Query: 18  SHGYFGKYLNKYNG-------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYN 70
           S  YFGKY N+Y         SYIPPGW  W   + ++KY+NY +N NG    +G     
Sbjct: 120 STAYFGKYFNQYGTTTLVDPESYIPPGWNYWYTFV-STKYFNYRLNENGITKSYGS-APE 177

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS---------APQYANMFF 121
           DY  D+IA  +   L  +  + S  P  ++++  A H   +S         AP++ N+F 
Sbjct: 178 DYGTDVIAKHATNHLSTALSNHS--PFFMMIAPYAVHSSTESTESPFQPIPAPRHKNLFN 235

Query: 122 NVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
            VT+    +++     +K   ++ +  + P    + D    +R+++LQSVD+ + K++ +
Sbjct: 236 GVTAPRHGSFNQKDVSNKSPSVRNSPLLLPDDIDWIDNNFRERIRSLQSVDEMIGKLIDQ 295

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           LK+  +   TYII+TSD+GY  G+  L   K+  ++   RVP  + GP +  G
Sbjct: 296 LKNSPKGKETYIIFTSDNGYWFGEHRLKFAKAGLYDQSHRVPLAIAGPKVQRG 348


>gi|189209101|ref|XP_001940883.1| arylsulfatase precursor [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187976976|gb|EDU43602.1| arylsulfatase precursor [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 571

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 72/235 (30%), Positives = 116/235 (49%), Gaps = 23/235 (9%)

Query: 21  YFGKYLNKYNGS-YIPP---GWREWGALIMNSKY--YNYSINMNGRKIKHGEDYYNDYYP 74
           Y GK +N ++ + Y  P   GW +   LI    Y  YN S+  N    K    +  +Y  
Sbjct: 128 YTGKLMNGHSTTTYNTPRAAGWNQSDFLIDPGTYVFYNTSMTRNNEMYKF---FPGEYST 184

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------PEDSAPQYANMFFN 122
           DLI+  +V FL  +  + S++P  L ++  APH             P   A ++ ++F N
Sbjct: 185 DLISKAAVGFLDDAIAAASERPFFLGVAPVAPHSETIIDPRPAKFNPPVPAKRHEHLFPN 244

Query: 123 VTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
           VT    P ++    P      +  R++      + D    KRLQ+LQSV++ ++ ++  L
Sbjct: 245 VTVPRRPNFN-PEKPGTASYFKTLRQLNQTEIDYNDAWYRKRLQSLQSVNELIDSVMDRL 303

Query: 183 KDLGE-LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
               E L NTY++YT+D+G+H+GQ  L  GKS   E D+ +PF +RGPG+    +
Sbjct: 304 SASPEVLENTYVLYTTDNGFHIGQHRLGPGKSCGIEEDVNIPFFIRGPGVAKAAV 358


>gi|302835590|ref|XP_002949356.1| hypothetical protein VOLCADRAFT_59221 [Volvox carteri f.
           nagariensis]
 gi|300265183|gb|EFJ49375.1| hypothetical protein VOLCADRAFT_59221 [Volvox carteri f.
           nagariensis]
          Length = 593

 Score =  100 bits (250), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 67/238 (28%), Positives = 114/238 (47%), Gaps = 25/238 (10%)

Query: 21  YFGKYLNKY---NGSYIPPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           Y GK+L  Y   N    P GW  + AL+     +YY+ + ++NG   +     Y  Y  D
Sbjct: 134 YTGKFLVDYTIRNYDPAPSGWSAFDALVHPFTFEYYSPAFSLNGGTPRMYPGMY--YSTD 191

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------------PEDSAPQYA 117
           ++   ++  LR +    + +P  L ++ PAPH                   P   A ++ 
Sbjct: 192 VVGAKALAMLRSAATDPAGRPFYLQVAPPAPHHSMHYDFNSSGAFLARHWYPPIPAARHW 251

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
                      P+++      K   L+    +   +  F + +   RL+ +++VD+ VE 
Sbjct: 252 EELSGAALPRPPSFNDPWVAGKPSWLRSLAPLTSANETFLEEIYRLRLRCMRAVDELVEG 311

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           IV+E++ LG    TY+++TSD+GYH G   +  GK+ P+E DIRVP +M GPG++PG+
Sbjct: 312 IVREVESLGLSRRTYVLFTSDNGYHEGVHRMGSGKTTPYEEDIRVPLIMAGPGLIPGS 369


>gi|212538775|ref|XP_002149543.1| arylsulfatase, putative [Talaromyces marneffei ATCC 18224]
 gi|210069285|gb|EEA23376.1| arylsulfatase, putative [Talaromyces marneffei ATCC 18224]
          Length = 497

 Score =  100 bits (250), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 74/235 (31%), Positives = 114/235 (48%), Gaps = 38/235 (16%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           Y   Y+N +NGS            +++   Y+Y  +   R  +    Y   +  D+I + 
Sbjct: 159 YDKPYVNGFNGS----------DFLLDPFTYSYLNSTYQRNHEPPVSYEGQHTTDVITDK 208

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS-------AP----------QYANMFFNV 123
           ++ FL         +P  LV++  +PH   D        AP          ++ N+F +V
Sbjct: 209 ALGFLEDGLAG--DQPFFLVVAPISPHSNVDGGALSGHQAPTVMTEPIPLSRHENLFGDV 266

Query: 124 TSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
                  +    NPD+     WI  + ++ +P   ++ D    +RL+ LQ VD+ V  I+
Sbjct: 267 KVPRMAHF----NPDEPSGVSWIRDLPKQDEPT-MEYNDHFYRQRLRALQGVDELVRDII 321

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
             L+D  +L+NTYIIYTSD+GYH+GQ  L  GK+  FE DIRVPF +RGPGI  G
Sbjct: 322 TRLEDSNQLDNTYIIYTSDNGYHIGQHRLPPGKTCGFEEDIRVPFFIRGPGIAKG 376


>gi|37520215|ref|NP_923592.1| N-acetylglucosamine 6-sulfatase [Gloeobacter violaceus PCC 7421]
 gi|35211208|dbj|BAC88587.1| gll0646 [Gloeobacter violaceus PCC 7421]
          Length = 515

 Score =  100 bits (250), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 73/242 (30%), Positives = 110/242 (45%), Gaps = 37/242 (15%)

Query: 20  GYFGKYLNKYNG-----------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDY 68
           GY GKYLN Y             +Y+PPGW +W ALI  +  YNY +N NG  I +GED 
Sbjct: 142 GYVGKYLNGYGSDRNRRSPLDDPTYVPPGWDDWQALITGT-MYNYVLNDNGTLIDYGEDP 200

Query: 69  YN---DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV-- 123
                +Y  D++A  +V F+     + +  P  L ++  APH           +F N   
Sbjct: 201 AEPPEEYQTDVLAERAVAFV----DAAATAPFFLTVAPVAPHFEVAPGVSSCRLFPNSLR 256

Query: 124 ----TSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTK-----------RLQTL 168
                  H  +    P P      +     +P   Q   LL  +           ++++L
Sbjct: 257 SIRPAPRHQGSAGQVPLPQGTAFNEQDVTDKPTWVQAAPLLSERSIDCLGNIYRDQIESL 316

Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           ++VDD    ++  L+  G L NT +++TSD+GY  GQ  L  GK+  +E  IRVP ++R 
Sbjct: 317 RAVDDLTASLIAALERRGVLGNTIVVFTSDNGYLFGQHRLT-GKNVIYEESIRVPLVVRL 375

Query: 229 PG 230
           PG
Sbjct: 376 PG 377


>gi|342886504|gb|EGU86314.1| hypothetical protein FOXB_03181 [Fusarium oxysporum Fo5176]
          Length = 1194

 Score =  100 bits (250), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 82/246 (33%), Positives = 118/246 (47%), Gaps = 45/246 (18%)

Query: 21  YFGKYLNK-----YNGSYIPPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYY 73
           Y GK  N      YN  + P GW     L+      Y+N +   +       E  YN   
Sbjct: 727 YTGKLFNAHTVDNYNDPH-PAGWTSSDFLLDPFTYSYWNATWQRDSEPPISHEGTYNT-- 783

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG--------------PEDSA------ 113
            D +A  ++ ++R++  +   KP  L  +  APH               P+D A      
Sbjct: 784 -DDLAEKTINYIREAHAA--GKPFFLTSAPIAPHNEAVVPEMPDIWTSTPDDPALRPRLL 840

Query: 114 -PQYA----NMFFNVTSHHTPAYDYAPNPDK-QWILQVTR--KMQPVHRQFTDLLMTKRL 165
            PQ A    ++F +V    TP +    NPDK   +  V R  K+      + D     RL
Sbjct: 841 PPQPAKRHEHLFLDVKVPRTPNF----NPDKPSGVNGVARLPKLNQTSLDYNDNYFRLRL 896

Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
           Q+LQ+VD+ VE++V EL+ LG  ++TYIIYT+D+GYH+GQ  L  GK+  FE DI VP L
Sbjct: 897 QSLQAVDEMVERVVLELESLGIADDTYIIYTTDNGYHIGQHRLQPGKTCGFETDINVPLL 956

Query: 226 MRGPGI 231
           +RGPG+
Sbjct: 957 IRGPGV 962


>gi|358374273|dbj|GAA90866.1| arylsulfatase [Aspergillus kawachii IFO 4308]
          Length = 581

 Score =  100 bits (249), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 72/234 (30%), Positives = 117/234 (50%), Gaps = 37/234 (15%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           Y   ++N +NGS            +++   Y+Y      R  +    Y   Y  D++   
Sbjct: 147 YNAPFVNGFNGS----------DFLLDPHTYSYWNATYQRNHEPPRSYEGQYTTDVMREK 196

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDS----APQYANMFFNVT 124
           +   L  +  S    P  L ++  APH            GP+ +    AP++A++F +  
Sbjct: 197 ASGLLADALDS--DAPFFLTVAPIAPHTNIDVEGLGGSGGPKMTEPLPAPRHAHLFADAK 254

Query: 125 SHHTPAYDYAPNPDKQ----WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
              TP +    NPDK     WI  +  + Q V   + D L  +RL+ LQ+VD+ V+ ++ 
Sbjct: 255 VPRTPNF----NPDKDSGAGWIQTMELQNQTV-IDYEDHLYRQRLRALQAVDEMVDSLIT 309

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +L++ G+++NTYIIY++D+GYH+G   L  GK+  +E DIRVPF +RGPGI  G
Sbjct: 310 QLEESGQIDNTYIIYSADNGYHIGHHRLPPGKTTGYEEDIRVPFYIRGPGIPEG 363


>gi|159489378|ref|XP_001702674.1| arylsulfatase-like protein [Chlamydomonas reinhardtii]
 gi|158280696|gb|EDP06453.1| arylsulfatase-like protein [Chlamydomonas reinhardtii]
          Length = 635

 Score =  100 bits (249), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 73/237 (30%), Positives = 118/237 (49%), Gaps = 36/237 (15%)

Query: 21  YFGKYLNKYN-----GSYIPPGWREWGALIMNS--KYYNYSINMNGRKIKHGEDYYNDYY 73
           + GK++N ++         P GW  +  L   +   Y NY       +    + + N Y 
Sbjct: 178 FTGKFINLFDVPVGDTKNCPKGWDMFDPLTDKTVYDYINYDFT---PQCARTDSFTNAYQ 234

Query: 74  PDLIANDSVRFLRQSKQSFSK-KPIMLVMSFPA-----PHGPEDS--------APQYANM 119
            D ++  +  ++     + SK KP  L ++  A     P G E+         +P+Y  +
Sbjct: 235 TDTVSQRASNYI---DDAVSKGKPFYLQVNPAACHTACPKGAEEGGSCVPPVPSPKYDGV 291

Query: 120 FFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
           +  +       +D  P P+    L +   ++ V + +      +R++T+ SVDD +E +V
Sbjct: 292 WSGLKVPRLANFD-VPLPNA---LGIGNDVKGVDKHYQ-----RRVETMMSVDDMIESLV 342

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
            +L+ LG L+NTY+IY SD+GYHLG  GL KGK+ P+E DIRVPF MRGPGI  G +
Sbjct: 343 TKLETLGVLDNTYVIYMSDNGYHLGNHGLAKGKTLPYEEDIRVPFYMRGPGIPAGVV 399


>gi|310795554|gb|EFQ31015.1| sulfatase [Glomerella graminicola M1.001]
          Length = 515

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 79/242 (32%), Positives = 111/242 (45%), Gaps = 30/242 (12%)

Query: 21  YFGKYLNKYN-GSY---IPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           Y GK  N ++ G++    P GW      +++   Y Y      R     + +  +Y  DL
Sbjct: 115 YTGKLFNAHDTGNWNNPFPKGW-TGSDFLLDPNTYLYLNATFQRNQSPSQTFPGEYSNDL 173

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH----GPEDS--------APQYANMFFNVT 124
            A   + FL  +      KP  L ++   PH    G E S        A ++ N+F NVT
Sbjct: 174 AAERVLGFLDDAFGFNGSKPFFLGVAPIGPHNNGLGGETSPGATPPIPAKRHENLFANVT 233

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
              T    Y  NP          K+      + D+   +RLQ LQSVD+ V+ I+  L  
Sbjct: 234 VPRT----YNSNP---------VKLNATEVAYGDIYYHRRLQALQSVDEMVDTIINRLDR 280

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIP 244
            G LNNTY+I+TSD+G+H GQ  L  GK+   E DI VPFL+RGPG+  G     D   P
Sbjct: 281 AGILNNTYVIFTSDNGFHTGQHRLKPGKTCSIEEDINVPFLIRGPGVPKGKTVDFDFDGP 340

Query: 245 QV 246
            +
Sbjct: 341 PI 342


>gi|302837253|ref|XP_002950186.1| hypothetical protein VOLCADRAFT_90537 [Volvox carteri f.
           nagariensis]
 gi|300264659|gb|EFJ48854.1| hypothetical protein VOLCADRAFT_90537 [Volvox carteri f.
           nagariensis]
          Length = 672

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 44/75 (58%), Positives = 59/75 (78%)

Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
           +RL+T+ SVDD +E +V +L++ G LNNTY+IY SD+GYHLG  GL KGK+ P+E DIRV
Sbjct: 355 RRLETMLSVDDMIEDVVNKLEERGVLNNTYVIYVSDNGYHLGNHGLPKGKTLPYEEDIRV 414

Query: 223 PFLMRGPGIVPGTMY 237
           PF +RGPGI  G ++
Sbjct: 415 PFYIRGPGIPAGVVH 429


>gi|322693417|gb|EFY85278.1| arylsulfatase precursor [Metarhizium acridum CQMa 102]
          Length = 614

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 73/223 (32%), Positives = 114/223 (51%), Gaps = 12/223 (5%)

Query: 21  YFGKYLNKYNG-SYIPPGWREW--GALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           Y GK +N+++  +Y  P  + W     +++   Y+Y   +              Y  D+I
Sbjct: 172 YTGKLMNQHSTRTYNQPFAKGWTRSDFLLDPGTYSYLDGIMALDKNEYRSIKGQYSTDVI 231

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-----SAPQYANMFFNVTSHHTPAYD 132
           AN S+ FL  +  +   KP  L ++   PH          A ++ ++F NV+   +P+++
Sbjct: 232 ANRSLEFLDNALAA--GKPFFLGVAPIGPHANTGFTTPIPADRHKHLFPNVSIPRSPSFN 289

Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE-LNNT 191
               P     L+   ++     Q++D     RLQTLQ+VD+ V  I+  L+ + E L NT
Sbjct: 290 -PITPGHVNYLKNLPRLNSTQIQYSDNFYRARLQTLQAVDELVGGIIDALEKVPEVLANT 348

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           YIIYTSD+GYH+GQ  L  GK+   E DI VPF++RGPGI  G
Sbjct: 349 YIIYTSDNGYHIGQHRLPPGKTCNIEEDINVPFVIRGPGIAAG 391


>gi|322703651|gb|EFY95257.1| arylsulfatase precursor [Metarhizium anisopliae ARSEF 23]
          Length = 602

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 73/223 (32%), Positives = 115/223 (51%), Gaps = 12/223 (5%)

Query: 21  YFGKYLNKYNG-SYIPPGWREW--GALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           Y GK +N+++  +Y  P  + W     +++   Y+Y   +              Y  D+I
Sbjct: 160 YTGKLMNQHSTRTYNQPFAKGWTRSDFLLDPGTYSYLDAIMALDKNEYRSIKGQYSTDVI 219

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-----SAPQYANMFFNVTSHHTPAYD 132
           AN S+ FL  +  +   KP  L ++   PH          A ++ ++F NV+   +P+++
Sbjct: 220 ANRSLEFLDNALAA--GKPFFLGVTPIGPHANTGFTTPIPADRHKHLFPNVSIPRSPSFN 277

Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE-LNNT 191
            +  P     L+   ++     Q++D     RLQTLQ+VD+ V  I+  L+ + E L NT
Sbjct: 278 PS-TPGHVNYLKNLPQLTSTQIQYSDNFYRARLQTLQAVDELVGGIIDVLEKVPEVLANT 336

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           YIIYTSD+GYH+GQ  L  GK+   E DI VPF++RGPGI  G
Sbjct: 337 YIIYTSDNGYHIGQHRLPPGKTCNIEEDINVPFIIRGPGIAAG 379


>gi|398401790|ref|XP_003853188.1| hypothetical protein MYCGRDRAFT_40096 [Zymoseptoria tritici IPO323]
 gi|339473070|gb|EGP88164.1| hypothetical protein MYCGRDRAFT_40096 [Zymoseptoria tritici IPO323]
          Length = 576

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/184 (35%), Positives = 97/184 (52%), Gaps = 24/184 (13%)

Query: 68  YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-----GPEDSA----PQYA- 117
           Y   Y PD++A  +  FL ++ QS   KP  L  +  APH      P+D      PQYA 
Sbjct: 171 YKGQYSPDVVAEKTYGFLDEAAQS--TKPFFLTAAPVAPHFELTNSPKDGVRLNPPQYAQ 228

Query: 118 ---NMFFNVTSHHTPAYDYAPNPD----KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQS 170
              ++F +     TP +    NPD      WI  +  K+     ++ D     RL++LQ+
Sbjct: 229 RHAHLFQDYKIPRTPDF----NPDIPSGVSWIKGMP-KLNDTVIEYYDEFQRARLRSLQA 283

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           VD+ VE++V+ L   G L+NTYI YT+D+G+H+GQ  +  GK   FE D+ +P  +RGPG
Sbjct: 284 VDEMVEQLVQRLDAKGLLDNTYIFYTTDNGFHIGQHRMHPGKECAFESDVHIPLAVRGPG 343

Query: 231 IVPG 234
           +  G
Sbjct: 344 VPAG 347


>gi|398390355|ref|XP_003848638.1| hypothetical protein MYCGRDRAFT_76800 [Zymoseptoria tritici IPO323]
 gi|339468513|gb|EGP83614.1| hypothetical protein MYCGRDRAFT_76800 [Zymoseptoria tritici IPO323]
          Length = 610

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 76/244 (31%), Positives = 109/244 (44%), Gaps = 37/244 (15%)

Query: 21  YFGKYLNK-----YNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           Y GK  N      YN      GW      +++   Y Y      R       Y   Y  D
Sbjct: 147 YVGKLFNAQTVDNYNDPEPAAGWTG-SEFLLDPYTYEYLNASFTRNAGPPTSYPGQYSTD 205

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG---------------------PEDSAP 114
           LI + S  FL  S  +    P  L ++  APH                      P   A 
Sbjct: 206 LIRSKSFAFL-SSALANPSSPFFLTIAPTAPHSNVHQLQGLIHGNFTEKSNIQSPPVPAA 264

Query: 115 QYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQS 170
           ++A++F NVT   TP +    NPD      WI ++ ++ Q  +  F D     RL+ LQS
Sbjct: 265 RHAHLFANVTVPRTPHF----NPDSPHGVSWISRLPQQNQ-TNIDFNDHFYRSRLRALQS 319

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           VD+ V+++   L++ G L+ TYI +++D+GYH+GQ  L  GK   FE DI +P ++RGPG
Sbjct: 320 VDEMVDELFTRLEEAGVLDETYIFFSTDNGYHIGQHRLQPGKQCAFEEDINIPLIVRGPG 379

Query: 231 IVPG 234
           I  G
Sbjct: 380 IPRG 383


>gi|440466456|gb|ELQ35723.1| arylsulfatase [Magnaporthe oryzae Y34]
 gi|440488158|gb|ELQ67898.1| arylsulfatase [Magnaporthe oryzae P131]
          Length = 640

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 75/242 (30%), Positives = 123/242 (50%), Gaps = 28/242 (11%)

Query: 18  SHGYFGKYLNKYN-GSYIPP--GWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           S  Y GK+LN +N G+Y PP   W E  ALI +   Y+++  +  +  +H  +Y   +  
Sbjct: 121 STNYIGKFLNGHNLGNYNPPPKAWTEIDALI-DPYMYDFNRAVFSKNGQHPVNYPGWHQT 179

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH------GPEDSAPQ--------YANMF 120
           D++   +V  ++Q  Q    KP    +S  APH      GP D+ PQ        + ++ 
Sbjct: 180 DIVRIKAVERIKQLAQD--DKPFFYWISPTAPHTVPPINGPSDNMPQPLTRHKDLFPDLV 237

Query: 121 FNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF---TDLLMTKRLQTLQSVDDAVEK 177
                +  P  +YA    KQ +  V R   P++      T++L   R+Q+LQ +D+ +E 
Sbjct: 238 LPKKGNWNPPDEYA----KQKVNWVGR-TPPLNESMLAETEVLYKTRIQSLQGIDEIIED 292

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY 237
           +V  L+  G L+NTYIIYTSD+G+ +G   +   KS  ++   + PF++RGPG+  G + 
Sbjct: 293 VVATLEQEGILDNTYIIYTSDNGFMIGTHRITAMKSLAYKEAGQTPFIVRGPGVPEGVVS 352

Query: 238 LL 239
            L
Sbjct: 353 RL 354


>gi|302889078|ref|XP_003043425.1| hypothetical protein NECHADRAFT_53397 [Nectria haematococca mpVI
           77-13-4]
 gi|256724341|gb|EEU37712.1| hypothetical protein NECHADRAFT_53397 [Nectria haematococca mpVI
           77-13-4]
          Length = 601

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 82/275 (29%), Positives = 125/275 (45%), Gaps = 50/275 (18%)

Query: 21  YFGKYLN-----KYNGSYIPPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYY 73
           Y GK  N      YN  + P GW +   L+      YYN S   N       + Y   + 
Sbjct: 142 YTGKLFNAHTVQNYNKPW-PRGWTQSDFLLDPFTYAYYNSSFQHNHDP---PQSYEGQHS 197

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------------------- 108
            D++A  S  FL ++ ++   +P  L ++ PAPH                          
Sbjct: 198 VDVLAEKSYGFLEEAAKA--DRPFFLGIAPPAPHTDVDFNWDTVRDNSALHLGGINATVT 255

Query: 109 --PEDSAPQYANMFFNVTSHHTPAYD-YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL 165
             P   A ++ ++F +     T  ++   P+   +W+ Q    + P   +  D    +RL
Sbjct: 256 FRPPVPAHRHEHLFQDARIPRTENFNPKTPSDSARWVQQ-RPPVSPSDLELYDHFYRQRL 314

Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
           +TLQSVD+ V+ +   L+ LG LNNTY+ YT+D+G+H+GQ  L  GK   FE DI VP L
Sbjct: 315 RTLQSVDELVDGLFDRLERLGLLNNTYVFYTTDNGFHIGQHRLPPGKECGFEEDINVPLL 374

Query: 226 MRGPGIVPGTMYLLDV------LIPQVRKFSSGSL 254
           +RGPG+  GT    DV      L+P + K +  +L
Sbjct: 375 VRGPGVPAGTR--TDVVTTHTDLVPTILKLAGAAL 407


>gi|389631020|ref|XP_003713163.1| arylsulfatase [Magnaporthe oryzae 70-15]
 gi|351645495|gb|EHA53356.1| arylsulfatase [Magnaporthe oryzae 70-15]
          Length = 633

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 75/242 (30%), Positives = 123/242 (50%), Gaps = 28/242 (11%)

Query: 18  SHGYFGKYLNKYN-GSYIPP--GWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           S  Y GK+LN +N G+Y PP   W E  ALI +   Y+++  +  +  +H  +Y   +  
Sbjct: 121 STNYIGKFLNGHNLGNYNPPPKAWTEIDALI-DPYMYDFNRAVFSKNGQHPVNYPGWHQT 179

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH------GPEDSAPQ--------YANMF 120
           D++   +V  ++Q  Q    KP    +S  APH      GP D+ PQ        + ++ 
Sbjct: 180 DIVRIKAVERIKQLAQD--DKPFFYWISPTAPHTVPPINGPSDNMPQPLTRHKDLFPDLV 237

Query: 121 FNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF---TDLLMTKRLQTLQSVDDAVEK 177
                +  P  +YA    KQ +  V R   P++      T++L   R+Q+LQ +D+ +E 
Sbjct: 238 LPKKGNWNPPDEYA----KQKVNWVGR-TPPLNESMLAETEVLYKTRIQSLQGIDEIIED 292

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY 237
           +V  L+  G L+NTYIIYTSD+G+ +G   +   KS  ++   + PF++RGPG+  G + 
Sbjct: 293 VVATLEQEGILDNTYIIYTSDNGFMIGTHRITAMKSLAYKEAGQTPFIVRGPGVPEGVVS 352

Query: 238 LL 239
            L
Sbjct: 353 RL 354


>gi|383779199|ref|YP_005463765.1| putative sulfatase [Actinoplanes missouriensis 431]
 gi|381372431|dbj|BAL89249.1| putative sulfatase [Actinoplanes missouriensis 431]
          Length = 488

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 69/235 (29%), Positives = 115/235 (48%), Gaps = 12/235 (5%)

Query: 20  GYFGKYLNKY--NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
            + GKYLN+Y     Y+PPGW EW A        NY++N NG   ++G +  +DY  D++
Sbjct: 131 AFLGKYLNEYLPESGYVPPGWDEWMAGGNAYDNVNYTLNENGTVKRYGSEP-SDYLTDVL 189

Query: 78  ANDSVRFLRQSK------QSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
           +  +  ++R         +S   KP  + +S   PH P   AP+ A  F  + +    AY
Sbjct: 190 SAKAQDWIRTGSHEQGLIKSTQVKPFFIEISTYTPHLPYPPAPRDAQSFPGLIAPRGDAY 249

Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
              P     W L     +        D    KR+Q++QS+D  +  +   +   G   NT
Sbjct: 250 GKVPKNAPAW-LAPHSPLTAAQIAHMDRGFRKRVQSIQSIDRMIGDLRATIDRAGIAENT 308

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
            I+++SD+GYH+G++ L  GK   F+ D+ VP ++ G G+  G  +++D L+  +
Sbjct: 309 VIVFSSDNGYHMGEYRLNPGKQTAFDVDVNVPLVVVGAGVRRG--HVVDALVENI 361


>gi|452838367|gb|EME40308.1| hypothetical protein DOTSEDRAFT_66113 [Dothistroma septosporum
           NZE10]
          Length = 580

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 76/245 (31%), Positives = 111/245 (45%), Gaps = 41/245 (16%)

Query: 21  YFGKYLNKYN-GSYIPPGWREW--GALIMNSKYYNYSINMNGRKIKHGED---YYNDYYP 74
           Y GK  N     +Y  P    W     +++   Y Y   +N    ++GE+   Y   Y  
Sbjct: 97  YVGKLFNAQTIDNYASPHAAGWTGSEFLLDPYTYEY---LNATFTRNGEEPVSYEGQYTT 153

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG---------------------PEDSA 113
           DL+A  S  FL  +     KKP  L ++  APH                      P   A
Sbjct: 154 DLVAEKSFGFLDDALAE--KKPFFLTIAPTAPHSNVHIGSNIIDGNFSENTNVQSPPVPA 211

Query: 114 PQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQ 169
            ++  +F +VT   TP +    NPD      WI ++ R+ +  + +  D     RL+ LQ
Sbjct: 212 KRHERLFQDVTVPRTPHF----NPDDAHSVSWIGRLPRQNR-TNIEANDHFYRSRLRALQ 266

Query: 170 SVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
           SVD+  + + K L D G L  TYI +++D+GYH+GQ  L  GK   FE DI VP ++RGP
Sbjct: 267 SVDEIFDDLTKRLDDAGILEETYIFFSTDNGYHIGQHRLQPGKQCAFEEDINVPLIVRGP 326

Query: 230 GIVPG 234
           G+  G
Sbjct: 327 GVAHG 331


>gi|389750799|gb|EIM91872.1| arylsulfatase precursor [Stereum hirsutum FP-91666 SS1]
          Length = 586

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 73/239 (30%), Positives = 108/239 (45%), Gaps = 24/239 (10%)

Query: 21  YFGKYLNKYNGSYI----PPGWREWGALIMNSKY--YNYSINMNGRKIKHGEDYYNDYYP 74
           Y GK +N      I      GW     L+  + Y  YN S + NG    + E  Y     
Sbjct: 137 YTGKLMNGMQVDNIQQLPAAGWTNADFLVDPNTYLFYNSSFSANGEAAINHEGTYQ---M 193

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED---------SAPQYANMFFNVTS 125
           DLI   S+   +Q+  S   +P  L ++  APH               P+YA +F     
Sbjct: 194 DLIVQKSIAMFQQALSS--SQPFFLGIAPTAPHMEVQLNGSFTEPIPPPEYAGLFNETVV 251

Query: 126 HHTPAYDYAPNPDK---QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
              P Y+           W+ ++    + V   + D +  +RL+TL++VDD VE  V+++
Sbjct: 252 PRVPNYNVPVGGPGGGVSWLKELEALNETV-LSYVDHVYRQRLRTLRTVDDLVENTVQQV 310

Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
           +D G  NNTYIIY +D+GY LG       K+ PF+ D+ +P  +RGPGI P      DV
Sbjct: 311 EDAGIANNTYIIYVADNGYALGSHRRQPSKTLPFQEDVLIPLFIRGPGIAPNVTNAADV 369


>gi|323454692|gb|EGB10562.1| hypothetical protein AURANDRAFT_12528, partial [Aureococcus
           anophagefferens]
          Length = 327

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 70/232 (30%), Positives = 107/232 (46%), Gaps = 25/232 (10%)

Query: 20  GYFGKYLN-----KYNGSYI--------PPGWREWGALIMNSK-----YY--NYSINMNG 59
            Y GKYLN     +Y  +          PPGW  +  +   +      YY  N+  +  G
Sbjct: 101 AYVGKYLNPPAMVRYCRNETLGPLEHGWPPGWDAFFGMCDQASTPQGAYYSVNWVDSAAG 160

Query: 60  RKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANM 119
             +  G +   DY   ++ N SV +L+ + +    +P  L  +  APH P+   P +A+ 
Sbjct: 161 AVVFTGSEP-ADYTTSIVGNRSVAYLKLAPKD---RPFFLAAAVRAPHAPQLPPPWHASA 216

Query: 120 FFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
           F +       AY+  P     W+ +          +F D +   R + L +VDD V  + 
Sbjct: 217 FPDARVPRDGAYNATPTGKPYWMSRNPPLSDEDAAEF-DAVYADRWRALLAVDDLVAGVF 275

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
            EL+    +++TY+ YTSDHGYH+G F L   K   +EFDIRVP L  GPG+
Sbjct: 276 DELRASSLIDDTYVFYTSDHGYHMGHFRLPPLKMHKYEFDIRVPLLATGPGL 327


>gi|452979586|gb|EME79348.1| hypothetical protein MYCFIDRAFT_97143, partial [Pseudocercospora
           fijiensis CIRAD86]
          Length = 548

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 73/249 (29%), Positives = 117/249 (46%), Gaps = 21/249 (8%)

Query: 21  YFGKYLNKYN-GSYIPPGWREWGA--LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           Y GK  N +N  +Y  P    +     +++   Y+Y  +   R  +    Y   Y PD+I
Sbjct: 100 YTGKLFNSHNVDNYNDPPVNGFNGSDFLLDPYTYSYYGSYMSRNFEPPVSYEGLYSPDVI 159

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-----------SAPQYANMFFNVTSH 126
           A  +  FL ++ Q     P  + ++  APH   D           SA ++ ++F +    
Sbjct: 160 AEKAYGFLEEASQH--ANPFFIGVAPIAPHADADPRKGRQMSIPLSAARHQHLFKDYKIP 217

Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
            T  ++ +      W  Q+ R    V  ++ D     R++ LQ+VD+ VE IV +L+  G
Sbjct: 218 RTANFNPSRASGVGWTKQLERLNDTV-IEYNDEFQRGRIRALQAVDEMVEGIVSKLEAKG 276

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI----VPGTMYLLDVL 242
            L+NTY+IYT+D+GYH+    L  GK  PFE DI +P  +RGPG+    V G +     L
Sbjct: 277 LLDNTYVIYTTDNGYHISHHRLPPGKECPFETDIHIPLAIRGPGVPIRHVAGVVSSHTDL 336

Query: 243 IPQVRKFSS 251
            P + K + 
Sbjct: 337 TPTILKLAG 345


>gi|402086553|gb|EJT81451.1| hypothetical protein GGTG_01430 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 631

 Score = 97.1 bits (240), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 68/232 (29%), Positives = 115/232 (49%), Gaps = 17/232 (7%)

Query: 21  YFGKYLNKYN---GSYIPPGWREWGALIMNSKYYNYSINM--NGRKIKHGEDYYNDYYPD 75
           Y GK+LN  N       P GW      +   +    S+ M  NG      + ++     D
Sbjct: 131 YLGKFLNGVNILNWVSQPRGWNHVDVFLEPYQTTPNSVVMSENGNTPIWFKGFHQT---D 187

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP-----QYANMFFNVTSHHTPA 130
           +I   ++  L +   +   +P  L +S  APH  +++A      ++  +F N T   TP+
Sbjct: 188 IIRAKALDRLDKLTATEQDQPFFLFLSPTAPHVDDETATTVPCQRHEGLFLNATVPRTPS 247

Query: 131 YDYAPNPDKQWILQVTRKMQPV---HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
           ++  P+  ++  +   R ++P+      + D    +R Q L  +D+ V  +V +L++ G 
Sbjct: 248 FN-PPDAVQRNKVSWLRDLEPMPDADVAWADAEYRRRAQALMGIDEMVADVVAKLEEKGV 306

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLL 239
           ++NTYIIYTSD+GYHLGQ  +  GK+ PF  D  +PF++RGPG+  G    L
Sbjct: 307 IDNTYIIYTSDNGYHLGQHRMAVGKTTPFAEDTNLPFVVRGPGVPAGARSTL 358


>gi|374854282|dbj|BAL57168.1| mucin-desulfating sulfatase [uncultured planctomycete]
          Length = 490

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 79/240 (32%), Positives = 116/240 (48%), Gaps = 23/240 (9%)

Query: 3   WPVELTS--RNLLYFDISHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNG- 59
           +P +L S  R L        Y GK+          PG+  W +      Y++   N+NG 
Sbjct: 113 YPRDLPSYPRRLQEIGYETAYIGKWHMGEQEDQPRPGFDFWMSHKGQGNYFDNEFNINGQ 172

Query: 60  RKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG-PEDSAPQYAN 118
           R I  G      YY  ++ + +V +LR+ +Q    KP  L++   APHG P    P+Y  
Sbjct: 173 RTIIKG------YYTHVVTDYAVNWLRKPRQ----KPFCLILGHKAPHGGPIVPEPKYEQ 222

Query: 119 MFFNVT-SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL-----LMTKRLQTLQSVD 172
           +F N T +    A DY   PD  W+ Q       +      L      +   L T+ SVD
Sbjct: 223 IFDNQTITRPATADDYTGKPD--WLKQCRFTWHGIGGPLYGLKEYPKFVRYYLATIVSVD 280

Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           D+V +I K L+D G+L+NT IIYTSD+G+ LG+ G V  ++  +E  IRVP L+R P + 
Sbjct: 281 DSVGRIYKVLQDTGQLDNTLIIYTSDNGFALGEHGRVDKRTM-YEESIRVPLLVRYPKLA 339


>gi|159487144|ref|XP_001701595.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158271536|gb|EDO97353.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 643

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 74/233 (31%), Positives = 115/233 (49%), Gaps = 29/233 (12%)

Query: 22  FGKYLNKYN-----GSYIPPGWREWGALIMNS-KYYNYSINMNGRKIKHGEDYYNDYYPD 75
            GK+LN +      G   P GW  + AL   +   YN + ++N             Y  D
Sbjct: 104 VGKFLNSFGVPLPAGVVCPRGWDSFEALTTGTYALYNSTFSLNCGPST--PPLSGQYQTD 161

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG--------PEDSAPQYANMFFNVTSHH 127
           LI + ++ ++  +  +  +KP+ L ++   PH         P   A ++A ++  +    
Sbjct: 162 LIRDKALAYIDTAVAA--RKPLYLHLTPATPHKDNNGEGWVPPPPAARHAGLYPGLQLPA 219

Query: 128 TPAYDYAPNPDKQWILQVTR----KMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
            P     PNP    ++  TR    K Q V +Q TDL +  RL+ L++VD+ +E +V  L+
Sbjct: 220 NPTL-RTPNP----LIPDTRIDMTKAQSV-QQMTDLYVA-RLRALRAVDEMLEALVNRLE 272

Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
             G LNNTYII+TSD+G H+GQ  L+ GK    E D RVP  +RGPG+  G +
Sbjct: 273 ARGILNNTYIIFTSDNGLHMGQHSLLDGKGTNLEEDNRVPLFIRGPGVEAGVV 325


>gi|171680767|ref|XP_001905328.1| hypothetical protein [Podospora anserina S mat+]
 gi|27764278|emb|CAD60558.1| unnamed protein product [Podospora anserina]
 gi|170940011|emb|CAP65237.1| unnamed protein product [Podospora anserina S mat+]
          Length = 618

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 75/253 (29%), Positives = 119/253 (47%), Gaps = 46/253 (18%)

Query: 21  YFGKYLN-----KYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           Y GK  N      YN  Y P GW      +++   YNY  +   R     + Y   +  D
Sbjct: 143 YTGKLFNAHTVENYNSPY-PAGWN-GSDFLLDPYTYNYLNSSFQRNQDPPKSYEGFHSVD 200

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH---------------------------- 107
           ++A  S+ F+ ++ ++    P  L ++  APH                            
Sbjct: 201 VLAEKSLGFVDEAVRA--DGPFFLGIAPVAPHSNVESNTLGDDMGDDWGNLPDIEEFAKF 258

Query: 108 GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTK 163
           GP   A ++ ++F +V    TP +    NPDK     W+ ++ +K+   +  + D    +
Sbjct: 259 GPPVPAKRHEHLFKDVKIPRTPNF----NPDKPTGANWV-RLRKKLNQENIDYNDHFYRQ 313

Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
           RL+TLQSVD+ V  +V  L++ G L+NTYI YT+D+GYH+ Q  L  GK   FE DI +P
Sbjct: 314 RLRTLQSVDELVASVVGRLEEHGVLDNTYIFYTTDNGYHISQHRLNPGKECGFEEDINIP 373

Query: 224 FLMRGPGIVPGTM 236
            ++RGPG+  G +
Sbjct: 374 LIIRGPGVAKGVV 386


>gi|116200710|ref|XP_001226167.1| hypothetical protein CHGG_10900 [Chaetomium globosum CBS 148.51]
 gi|88175614|gb|EAQ83082.1| hypothetical protein CHGG_10900 [Chaetomium globosum CBS 148.51]
          Length = 575

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 51/129 (39%), Positives = 77/129 (59%), Gaps = 2/129 (1%)

Query: 109 PEDSAPQYANMFFNVTSHHTPAYDYAPNPD-KQWILQVTRKMQPVHRQFTDLLMTKRLQT 167
           P   A ++A +F NVT   TP ++ A       W+  + R+ +  +  + D    +RL+T
Sbjct: 210 PPIPAKRHARLFGNVTVPRTPHFNPAGGASGASWVRGLARQSEE-NVAWNDHYYRQRLRT 268

Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
           LQSVD+ VE +V+ L DLG L  TY+ YT+D+G+H+GQ  L  GK   FE DI +P ++R
Sbjct: 269 LQSVDELVEAVVRRLGDLGVLEETYVFYTTDNGFHIGQHRLQPGKECGFEEDINIPLIVR 328

Query: 228 GPGIVPGTM 236
           GPG+  G +
Sbjct: 329 GPGVPRGEV 337


>gi|320594028|gb|EFX06431.1| arylsulfatase [Grosmannia clavigera kw1407]
          Length = 615

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 73/247 (29%), Positives = 118/247 (47%), Gaps = 42/247 (17%)

Query: 21  YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGE---DYYNDYY 73
           Y GK  N +      S  P GW      +++   Y+Y   +N    ++GE    Y   Y 
Sbjct: 145 YTGKLFNAHTIDNYDSPRPAGW-TGSDFLLDPYTYSY---LNATYQRNGEPPVSYEGHYT 200

Query: 74  PDLIANDSVRFLRQS-KQSFSKKPIMLVMSFPAPHG----------PEDS---------- 112
            D++A  ++ FL  +       +P  L ++  APH             DS          
Sbjct: 201 GDVLAEKALGFLDDATADGVDARPFFLGIAPVAPHSNVVVLEGEPVSNDSSRAVFSAPVA 260

Query: 113 APQYANMFFNVTSHHTPAYDYAPNPDKQ-----WILQVTRKMQPVHRQFTDLLMTKRLQT 167
           A ++A++F +V    TP +    NP +Q     W+ Q+ R+    +  + D    +RL+ 
Sbjct: 261 ATRHAHLFADVKVPRTPHF----NPARQAAGGSWVRQLARQ-DDANVAYNDYFYRQRLRA 315

Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
           LQSVD+ V+++   L  LG L +TY+ YT+D+G+H+GQ  L  GK+  FE DI +P ++R
Sbjct: 316 LQSVDELVDELFARLDRLGLLASTYVFYTTDNGFHIGQHRLQPGKTCGFEEDINIPLIVR 375

Query: 228 GPGIVPG 234
           GPG+  G
Sbjct: 376 GPGVAAG 382


>gi|402074765|gb|EJT70274.1| hypothetical protein GGTG_12447 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 614

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 60/189 (31%), Positives = 97/189 (51%), Gaps = 27/189 (14%)

Query: 68  YYNDYYPDLIANDSVRFLRQSKQSF----SKKPIMLVMSFPAPHGPEDS----------- 112
           Y   Y PD++A+ ++ F+  + +S       +P  + ++  APH    S           
Sbjct: 199 YEGQYSPDVLASKALGFIDDAAESLKPENGGRPFFIGLAPIAPHSDVASDRGNGKAAISA 258

Query: 113 ---APQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRL 165
              A ++ N+F +V    T  +    NPD      W+ ++ +K+   + +  D    +RL
Sbjct: 259 PVAAKRHENLFADVKVPRTANF----NPDAPSGVSWVSRL-QKLNDTNVEANDHFYRQRL 313

Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
           + LQSVD+ V+ +   L+  G L+NTY+ YTSD+G+H+ Q  L  GK+  FE DI VP L
Sbjct: 314 RALQSVDEMVDNVFASLEKHGLLDNTYVFYTSDNGFHISQHRLQPGKTCGFEEDINVPLL 373

Query: 226 MRGPGIVPG 234
           +RGPGI  G
Sbjct: 374 VRGPGIAAG 382


>gi|358394598|gb|EHK43991.1| hypothetical protein TRIATDRAFT_37606 [Trichoderma atroviride IMI
           206040]
          Length = 596

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 65/199 (32%), Positives = 97/199 (48%), Gaps = 24/199 (12%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
           +YYN S   NG    +       Y  D++A  +  FL  + Q   K P  + ++  APH 
Sbjct: 161 QYYNVSTTKNGAPPINP---VGKYSTDIVAESAEEFLNHALQD-DKSPFFITLAPIAPHA 216

Query: 109 -----------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVH 153
                      P   A ++  +F       TP +    NPD      WI ++ +    V 
Sbjct: 217 LVGEGSTGAFDPPQVAERHEGLFPEYKIPRTPNF----NPDSPSGVGWISKLPKFNDSV- 271

Query: 154 RQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
             + D     RL++LQ+VD+ +E +V  L + G L+NT+I YTSD+GYH+GQ  L  GK+
Sbjct: 272 VDYGDEYQRLRLRSLQAVDEIIESVVTRLDEAGVLDNTFIFYTSDNGYHIGQHRLHPGKT 331

Query: 214 FPFEFDIRVPFLMRGPGIV 232
             FE DI VP ++RGPG+ 
Sbjct: 332 CGFETDINVPLVVRGPGVA 350


>gi|242805691|ref|XP_002484584.1| arylsulfatase, putative [Talaromyces stipitatus ATCC 10500]
 gi|218715209|gb|EED14631.1| arylsulfatase, putative [Talaromyces stipitatus ATCC 10500]
          Length = 586

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 113/245 (46%), Gaps = 38/245 (15%)

Query: 12  LLYFDISHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYND 71
           L  F     Y   Y+N + GS            +++   Y+Y      R  +    Y   
Sbjct: 144 LFNFHTVDNYDSPYVNGFTGS----------DFLLDPFTYSYLNATFQRNREPPVSYKGH 193

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-------SAP---------- 114
           +  ++I + ++ FL         +P  LV++  APH   D        AP          
Sbjct: 194 HTTEVITDKALGFLEDGLAE--DQPFFLVIAPIAPHSNIDVGALDDHGAPTVMTEPIPLS 251

Query: 115 QYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQS 170
           ++ N+F +     T  +    NPD+     W+  + ++      ++ D    +RL+ LQ 
Sbjct: 252 RHENLFQDAKVPRTAHF----NPDEPNGVSWVRNLPKQDDQT-IEYNDHFYRQRLRALQG 306

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           VD+ V  +V  L+D  +L+NTYIIYTSD+GYH+GQ  L  GK+  FE DIRVPFL+RGPG
Sbjct: 307 VDELVGAVVNRLEDSDQLDNTYIIYTSDNGYHIGQHRLPPGKACGFEEDIRVPFLIRGPG 366

Query: 231 IVPGT 235
           I  G 
Sbjct: 367 IAKGA 371


>gi|46117250|ref|XP_384643.1| hypothetical protein FG04467.1 [Gibberella zeae PH-1]
          Length = 703

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 74/229 (32%), Positives = 113/229 (49%), Gaps = 28/229 (12%)

Query: 26  LNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYY-NDYYPDLIANDSVRF 84
           +N YN  Y P GW      +++   Y+Y +N   +  +    Y+  ++  DLIA  S   
Sbjct: 150 VNNYNSPY-PAGWTGTN-FLLDPGTYDY-LNPIYQHNQEAPVYHKGEHTSDLIAKYSHDL 206

Query: 85  LRQSKQSFSKKPIMLVMSFPAPHGPED-------------SAPQYANMFFNVTSHHTPAY 131
           L+++  S    P  + ++  APH                  A +++++F  V    T  +
Sbjct: 207 LKEAIDS--NNPFFVAIAPIAPHSDISVDRVAGSGMTIPIPAERHSHLFEGVKVPRTENF 264

Query: 132 DYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
               NPD      WI ++ ++         D     RLQ LQ VD+ VE++ ++L+D G 
Sbjct: 265 ----NPDSPSGVSWIKEL-KQFNESSVSHLDNYYRARLQALQGVDEIVEQVTQQLEDAGL 319

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           L++TYIIY+SD+GYHLGQ  L  GK   FE DIRVP  +RGPGI  G +
Sbjct: 320 LDDTYIIYSSDNGYHLGQHRLPPGKECGFEEDIRVPLFIRGPGISSGAV 368


>gi|367032166|ref|XP_003665366.1| hypothetical protein MYCTH_2095444 [Myceliophthora thermophila ATCC
           42464]
 gi|347012637|gb|AEO60121.1| hypothetical protein MYCTH_2095444 [Myceliophthora thermophila ATCC
           42464]
          Length = 621

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 71/251 (28%), Positives = 110/251 (43%), Gaps = 40/251 (15%)

Query: 21  YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           Y GK  N +      S  P GW      +++   Y Y      R       Y N +  D+
Sbjct: 144 YTGKLFNAHTVDNYDSPYPAGWNG-TEFLLDPYTYMYLNASFQRNRDPPVSYPNQHSVDV 202

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG---------------------------- 108
           +   ++ FL ++ QS   +P  L ++  APH                             
Sbjct: 203 LTQKALSFLDEASQS--PRPFFLGIAPVAPHSNVAQAPTPDGDGGWDEDGDYDDIESRVT 260

Query: 109 --PEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP---VHRQFTDLLMTK 163
             P   A ++A++F +     TP ++ A      W     R++ P    +  F D    +
Sbjct: 261 FTPPIPAARHAHLFADAVVPRTPHFNPAEPAVATWGAGWLRQLPPQTAANVAFNDHFYRQ 320

Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
           RL+ LQSVD+ V+ +V+ L  LG L +TY+ YT+D+G+H+GQ  L  GK   FE DI VP
Sbjct: 321 RLRVLQSVDELVDAVVRRLDQLGLLADTYLFYTTDNGFHIGQHRLQPGKECGFEEDINVP 380

Query: 224 FLMRGPGIVPG 234
            ++RGPG+ P 
Sbjct: 381 LIVRGPGVAPA 391


>gi|238498084|ref|XP_002380277.1| sulfatase-1, sulf-1, putative [Aspergillus flavus NRRL3357]
 gi|220693551|gb|EED49896.1| sulfatase-1, sulf-1, putative [Aspergillus flavus NRRL3357]
          Length = 564

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 67/195 (34%), Positives = 99/195 (50%), Gaps = 15/195 (7%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
           +Y N +I  N  K +    Y   Y  DL+AN S+ ++  + +   KKP  L ++   PH 
Sbjct: 158 QYTNTTIQHNFEKPR---SYPGVYATDLLANKSMAWMDDAAKE--KKPFFLAINPVNPHN 212

Query: 108 ----GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDL 159
               G   + P  AN               + NPD+     W+ ++ + +     +  DL
Sbjct: 213 NYQWGKGWTKPVPANRHEGTFPDAKVPRSVSFNPDRPSGAAWVHELPQ-LSDAAIKDNDL 271

Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
              +RLQ LQ+VDD VE  +  LK    L+NTYIIYTSD+G+H+ Q  L+ GK  P+E D
Sbjct: 272 YYRRRLQALQAVDDLVETTIGTLKRHKMLDNTYIIYTSDNGFHISQHRLMPGKRCPYEED 331

Query: 220 IRVPFLMRGPGIVPG 234
           + VP ++RGPGI  G
Sbjct: 332 VNVPMIIRGPGIPKG 346


>gi|425773153|gb|EKV11523.1| hypothetical protein PDIG_49660 [Penicillium digitatum PHI26]
          Length = 587

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 71/235 (30%), Positives = 116/235 (49%), Gaps = 36/235 (15%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           Y   ++N +NGS            +++   Y+Y  +   R  +    +  ++  D+I   
Sbjct: 151 YHSPHINGFNGS----------DFLLDPYTYSYLNSTYQRNHEPPVSHEGEHTIDVITGK 200

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-----------SAP----QYANMFFNVTS 125
           ++ FL  +     ++P  L +S  APH   D           SAP    ++ ++F +VT 
Sbjct: 201 ALGFLDDALAG--ERPFFLAVSPVAPHSNIDPGNVASGKTYMSAPIPLQRHTHLFQDVTV 258

Query: 126 HHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
             T  +    NPD+     W+  +  + Q V   + D     RL+ LQ VD+ V+ +V  
Sbjct: 259 PRTANF----NPDQPSGVSWVHDLPLQNQSV-VDYHDHFYRSRLRALQGVDELVDGLVTR 313

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           L+  G+L++TYIIYTSD+G+H+GQ  L  GK+  FE DIRVP  +RGPG+  G +
Sbjct: 314 LEKSGQLDHTYIIYTSDNGFHIGQHRLPPGKTCGFEEDIRVPLFIRGPGVAKGDV 368


>gi|408392195|gb|EKJ71554.1| hypothetical protein FPSE_08295 [Fusarium pseudograminearum CS3096]
          Length = 586

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 74/229 (32%), Positives = 113/229 (49%), Gaps = 28/229 (12%)

Query: 26  LNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYY-NDYYPDLIANDSVRF 84
           +N YN  Y P GW      +++   Y+Y +N   +  +    Y+  ++  DLIA  +   
Sbjct: 150 VNNYNSPY-PAGWTGTN-FLLDPGTYDY-LNPIYQHNQEAPVYHKGEHTSDLIAKYAHEL 206

Query: 85  LRQSKQSFSKKPIMLVMSFPAPHGPED-------------SAPQYANMFFNVTSHHTPAY 131
           L+++  S    P  + ++  APH   D              A +++++F  V    T  +
Sbjct: 207 LKEAIDS--NNPFFVGIAPIAPHSNIDVDRVAGSGMTIPIPAERHSHLFEGVKVPRTENF 264

Query: 132 DYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
               NPD      WI ++  +         D     RLQ LQ VD+ VE++ ++L+D G 
Sbjct: 265 ----NPDSPSGVSWIKEL-EQFNESSVSHLDNYYRARLQALQGVDEIVEQVTQQLEDAGL 319

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           L++TYIIY+SD+GYHLGQ  L  GK   FE DIRVP  +RGPGI  G +
Sbjct: 320 LDDTYIIYSSDNGYHLGQHRLPPGKECGFEEDIRVPLFIRGPGISSGAV 368


>gi|425776527|gb|EKV14743.1| hypothetical protein PDIP_42520 [Penicillium digitatum Pd1]
          Length = 587

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 71/235 (30%), Positives = 116/235 (49%), Gaps = 36/235 (15%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           Y   ++N +NGS            +++   Y+Y  +   R  +    +  ++  D+I   
Sbjct: 151 YHSPHINGFNGS----------DFLLDPYTYSYLNSTYQRNHEPPVSHEGEHTIDVITGK 200

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-----------SAP----QYANMFFNVTS 125
           ++ FL  +     ++P  L +S  APH   D           SAP    ++ ++F +VT 
Sbjct: 201 ALGFLDDALAG--ERPFFLAVSPVAPHSNIDPGNVASGKTYMSAPIPLQRHTHLFQDVTV 258

Query: 126 HHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
             T  +    NPD+     W+  +  + Q V   + D     RL+ LQ VD+ V+ +V  
Sbjct: 259 PRTANF----NPDQPSGVSWVHDLPLQNQSV-VDYHDHFYRSRLRALQGVDELVDGLVTR 313

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           L+  G+L++TYIIYTSD+G+H+GQ  L  GK+  FE DIRVP  +RGPG+  G +
Sbjct: 314 LEKSGQLDHTYIIYTSDNGFHIGQHRLPPGKTCGFEEDIRVPLFIRGPGVAKGDV 368


>gi|255939408|ref|XP_002560473.1| Pc16g00510 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211585096|emb|CAP92721.1| Pc16g00510 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 623

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 63/187 (33%), Positives = 97/187 (51%), Gaps = 20/187 (10%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-----------SAP----QYANM 119
           D+I   ++ FL  +     ++P  L +S  APH   D           SAP    ++ ++
Sbjct: 237 DVITGKALGFLDDALAG--ERPFFLAVSPVAPHSNVDPGNITSENFYMSAPIPLERHEHL 294

Query: 120 FFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
           F +V    T  ++        W+  +  + Q V   + D     RL+ LQ VD+ V+ +V
Sbjct: 295 FQDVRIPRTANFNSDQPSGVSWVHDLPLQNQSV-VDYHDHFYRSRLRALQGVDELVDGLV 353

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLL 239
             L++ G+L+NTYIIYTSD+G+H+GQ  L  GK+  FE DIRVP  +RGPG+  G  Y+ 
Sbjct: 354 TRLEESGQLDNTYIIYTSDNGFHIGQHRLPPGKTCGFEEDIRVPLFIRGPGVTKG--YVQ 411

Query: 240 DVLIPQV 246
           D +   V
Sbjct: 412 DAVTTHV 418


>gi|453080905|gb|EMF08955.1| Arylsulphatase [Mycosphaerella populorum SO2202]
          Length = 588

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 73/256 (28%), Positives = 117/256 (45%), Gaps = 37/256 (14%)

Query: 4   PVELTSRNLLYFDISHGYFGKYLNKYN-GSYIPPGWREW--GALIMNSKYYNYSINMNGR 60
           PV L S N   +     Y GK  N ++  +Y  P    W     +++   Y Y   +N  
Sbjct: 86  PVWLQSLNYATY-----YVGKLFNAHSVHNYASPRAAGWTGSEFLLDPYTYEY---LNAT 137

Query: 61  KIK-HGE--DYYNDYYPDLIANDSVRFLRQSKQSFSK--KPIMLVMSFPAPHG------- 108
            ++ HGE   Y   Y  DLIA  +  FL  +   + K  KP  L ++  APH        
Sbjct: 138 FVRNHGEPKSYEGQYSTDLIAAKAFGFLDTALDEYQKEGKPFFLTIAPTAPHSNVHMNTQ 197

Query: 109 -------------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQ 155
                        P   A +++++F +V    TP ++        WI ++ R+ Q  +  
Sbjct: 198 IDGNFSEGSNTQSPPIPAARHSHLFEDVVVPRTPHFNPVDAHGVSWISRLPRQNQ-TNID 256

Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
           F D     RL+ LQ+VD+ +  +  +L   G    T++ +++D+GYH+GQ  L  GK   
Sbjct: 257 FNDHFYRSRLRALQAVDEMIHDLFMKLDAAGVTEETFVFFSTDNGYHIGQHRLQPGKQCA 316

Query: 216 FEFDIRVPFLMRGPGI 231
           FE D+ +PF++RGPGI
Sbjct: 317 FEEDVNIPFMVRGPGI 332


>gi|380477057|emb|CCF44357.1| sulfatase [Colletotrichum higginsianum]
          Length = 459

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 79/263 (30%), Positives = 120/263 (45%), Gaps = 52/263 (19%)

Query: 21  YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           Y GK  N ++     +  P GW      +++   Y Y      R     + +  +Y  DL
Sbjct: 128 YTGKLFNSHDVDNWNNPFPKGWTG-SDFLLDPNTYLYLNATFQRNQSPPQSFPGEYSNDL 186

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH----GPEDS--------APQYANMFFNVT 124
           +++ ++ FL  + +    KP  L ++   PH    G E S        A ++ N+F NVT
Sbjct: 187 VSDRALGFLDDALKYNGSKPFFLGVAPIGPHNNGLGGETSPGATPPIPAKRHENLFSNVT 246

Query: 125 SHHTPAYDYAPNP---------------DKQWILQVT----RKMQP-------------- 151
              T  Y++ P+                D    LQV+    ++ QP              
Sbjct: 247 VPRT--YNFNPDKLCPASLACLPGPGRIDPSSSLQVSNTAPQQQQPSMANWGLIIPQLNA 304

Query: 152 VHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
               + D    +RLQTLQSVD+ V+ I+  L   G L+NTY+I+TSD+G+H+GQ  L  G
Sbjct: 305 TEVAYGDNYYRRRLQTLQSVDEMVDAIIDRLDRAGVLDNTYVIFTSDNGFHIGQHRLKPG 364

Query: 212 KSFPFEFDIRVPFLMRGPGIVPG 234
           K+   E DI VPFL+RGPG+  G
Sbjct: 365 KTCAIEEDINVPFLIRGPGVPKG 387


>gi|83766705|dbj|BAE56845.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 615

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 67/195 (34%), Positives = 98/195 (50%), Gaps = 15/195 (7%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
           +Y N +I  N  K +    Y   Y  DL+AN S+ ++  + +   KKP  L ++   PH 
Sbjct: 158 QYTNTTIQHNFEKPR---SYPGVYATDLLANKSMAWMDDAAKE--KKPFFLAINPVNPHN 212

Query: 108 ----GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDL 159
               G   + P  A                + NPD+     W+ ++ +    V +   DL
Sbjct: 213 NYQWGKGWTKPVPAKRHEGTFPDAKVPRSVSFNPDRPSGAAWVHELPQLSDAVIKD-NDL 271

Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
              +RLQ LQ+VDD VE  +  LK    L+NTYIIYTSD+G+H+ Q  L+ GK  P+E D
Sbjct: 272 YYRRRLQALQAVDDLVETTIDTLKRHKMLDNTYIIYTSDNGFHISQHRLMPGKRCPYEED 331

Query: 220 IRVPFLMRGPGIVPG 234
           + VP ++RGPGI  G
Sbjct: 332 VNVPMIIRGPGIPKG 346


>gi|317141886|ref|XP_001818847.2| arylsulfatase [Aspergillus oryzae RIB40]
 gi|391863140|gb|EIT72453.1| sulfatase [Aspergillus oryzae 3.042]
          Length = 564

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 67/195 (34%), Positives = 98/195 (50%), Gaps = 15/195 (7%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
           +Y N +I  N  K +    Y   Y  DL+AN S+ ++  + +   KKP  L ++   PH 
Sbjct: 158 QYTNTTIQHNFEKPR---SYPGVYATDLLANKSMAWMDDAAKE--KKPFFLAINPVNPHN 212

Query: 108 ----GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDL 159
               G   + P  A                + NPD+     W+ ++ +    V +   DL
Sbjct: 213 NYQWGKGWTKPVPAKRHEGTFPDAKVPRSVSFNPDRPSGAAWVHELPQLSDAVIKD-NDL 271

Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
              +RLQ LQ+VDD VE  +  LK    L+NTYIIYTSD+G+H+ Q  L+ GK  P+E D
Sbjct: 272 YYRRRLQALQAVDDLVETTIDTLKRHKMLDNTYIIYTSDNGFHISQHRLMPGKRCPYEED 331

Query: 220 IRVPFLMRGPGIVPG 234
           + VP ++RGPGI  G
Sbjct: 332 VNVPMIIRGPGIPKG 346


>gi|291301372|ref|YP_003512650.1| sulfatase [Stackebrandtia nassauensis DSM 44728]
 gi|290570592|gb|ADD43557.1| sulfatase [Stackebrandtia nassauensis DSM 44728]
          Length = 522

 Score = 94.0 bits (232), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 72/259 (27%), Positives = 121/259 (46%), Gaps = 36/259 (13%)

Query: 16  DISHGYFGKYLNKY---NGSYIPPGWREWGALIMNS-KYYNYSINM-NGRKIKHGEDYYN 70
           D    Y GKYLN Y    G  +P GW EW          Y+Y ++   G   K   D   
Sbjct: 129 DYHTAYLGKYLNGYKMKEGYKVPAGWDEWHVADGGGYNEYDYKLSEYTGGDDKPISDGDG 188

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP--EDSAPQYANM--------- 119
            Y  DL+A+ +V  + +S+ +  +K   + ++  +PH    +D  P++            
Sbjct: 189 KYLVDLMADRAVESIDRSRDA--EKSFFVQVAPFSPHSGVGKDGGPRFPPAKRDRPGADE 246

Query: 120 ---------FFNVTSHHTPAYDYAPNPDKQWILQ---VTRKMQPVHRQFTDLLMTKRLQT 167
                      ++     P ++        W+ Q     ++++ ++R F +     R + 
Sbjct: 247 KHGDCGKVDCADLDVTKLPGFNEDTKDKPDWVRQKPLTDKEIKELNRDFRN-----RARM 301

Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
           +QSVDD VEK+ K L    EL+NTYI++ SD+G+HLGQ  L++GK+  ++ D+R PFL++
Sbjct: 302 VQSVDDMVEKVTKSLS-QSELDNTYIMFGSDNGFHLGQHRLMRGKTTAYDHDVRTPFLVK 360

Query: 228 GPGIVPGTMYLLDVLIPQV 246
            PG   G     D ++  V
Sbjct: 361 RPGSSGGDSIKSDEIVQNV 379


>gi|346319604|gb|EGX89205.1| arylsulfatase precursor [Cordyceps militaris CM01]
          Length = 565

 Score = 93.6 bits (231), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 70/224 (31%), Positives = 110/224 (49%), Gaps = 16/224 (7%)

Query: 21  YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           Y GK +N        S  P GW      +++   Y Y+        K  + +  +Y  DL
Sbjct: 123 YTGKLMNNLGVRTYQSPPPSGWNR-SDFLLDPNTYVYNNAYFTLDNKKWQAFPGEYSTDL 181

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP-EDS------APQYANMFFNVTSHHTP 129
           IA  S+ FL++   +  +KP  + ++  APH    D+      A ++ ++F +       
Sbjct: 182 IAKRSLEFLKEGIDT--QKPFFIGVAPIAPHSELTDTFREPVPAARHQDLFADAQVPRNK 239

Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE-L 188
            ++  P P      +   ++     Q+ D    +RLQ+LQ+VDD V+ I   L+D    L
Sbjct: 240 NFN-PPQPGTASYFKNLPRLSNSQVQYLDNFHRRRLQSLQAVDDLVDDIFDALRDHPSVL 298

Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
            NTYIIYT+D+G+HLGQ  L  GK+   E D+ VPF++RGPGI 
Sbjct: 299 ANTYIIYTADNGFHLGQHRLPAGKTCGIEEDVNVPFIIRGPGIT 342


>gi|347840483|emb|CCD55055.1| similar to arylsulfatase [Botryotinia fuckeliana]
          Length = 585

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 79/262 (30%), Positives = 114/262 (43%), Gaps = 40/262 (15%)

Query: 21  YFGKYLNKYNGSYI----PPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GK  N +  S      P GW     L+     +YYN +   N       E    +Y  
Sbjct: 128 YTGKLFNHHTVSNWNNPHPAGWTSSDFLLDPFTYQYYNSTFQRNEHPPVSHE---GEYST 184

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED----------------SAPQYAN 118
           D++A  +   L +  QS   KP  LV +  APH   D                 A ++ +
Sbjct: 185 DVLAQKAYGLLDEGVQS--GKPFFLVAAPIAPHANVDLAAFKPGEGVLMTTPLPAERHKH 242

Query: 119 MFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
           +F          +    NPDK     WI  + R    V   + D     RLQTLQ+VD+ 
Sbjct: 243 LFPGAKVPRNANF----NPDKPSGASWIKHLDRLTDEV-IAYNDEFYRARLQTLQAVDEL 297

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +E +V  L D G L+NTY I+++D+GYH+ Q  L  GK   FE DI +P ++RGPG+  G
Sbjct: 298 IEGLVDRLTDYGILDNTYFIFSTDNGYHISQHRLAPGKECGFEEDINIPLVIRGPGVPQG 357

Query: 235 TMYLLDV----LIPQVRKFSSG 252
               +      L P + K ++G
Sbjct: 358 EETSIVTAHTDLAPTILKIANG 379


>gi|218132019|ref|ZP_03460823.1| hypothetical protein BACEGG_03644 [Bacteroides eggerthii DSM 20697]
 gi|217985779|gb|EEC52119.1| arylsulfatase [Bacteroides eggerthii DSM 20697]
          Length = 520

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 65/249 (26%), Positives = 114/249 (45%), Gaps = 36/249 (14%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGK+    +     PG+  W  +     Y+N  IN+NG K K  +D  + Y  DL+ +
Sbjct: 135 AFFGKWHMGNDTGEPQPGFTHWEGIRGQGVYWNPEINING-KWKEFKD--STYLGDLLTD 191

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV-----TSHHTPAYDYA 134
            ++ F+R+ K++   KP  + +S    H P  +  +Y   + N      TS   P Y   
Sbjct: 192 HAIDFIREQKKA--DKPFFVYLSHKGVHDPFQAPKRYEGCYANKKVPLPTSFENPHYGIT 249

Query: 135 PNPDKQ--------------------WILQVTRKMQPVH-----RQFTDLLMTKRLQTLQ 169
           P P+K                     W+         V      R+  +  + K  +TL+
Sbjct: 250 PTPNKSVQTGKPLSGVDYYGEQMKPDWVKMQRESWHGVDFCYNGRRNWEEEVRKYCETLR 309

Query: 170 SVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
           +VD+++ +++  L+++G   NT +IY  D+G+  G+ GL+  + F +E  +RVP L+R P
Sbjct: 310 AVDESIGRVIDSLQEMGLDENTVVIYMGDNGFCWGEHGLIDKRQF-YEASVRVPMLIRAP 368

Query: 230 GIVPGTMYL 238
           G+ P    L
Sbjct: 369 GLFPAGQVL 377


>gi|310801414|gb|EFQ36307.1| sulfatase [Glomerella graminicola M1.001]
          Length = 607

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 72/250 (28%), Positives = 118/250 (47%), Gaps = 41/250 (16%)

Query: 21  YFGKYLNKYN-GSYIPPGWREWGA--LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           Y GK  N +   +Y  P  + W +   +++   Y+Y      R      ++  +Y  DL+
Sbjct: 141 YTGKMFNAHTIWNYDSPHLKGWTSSDFLLDPNTYDYYNAAFQRNHDQPVNHPGEYSTDLV 200

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDSAP----------- 114
           A  +  FL  + Q    KP  L ++  APH            GP+DS P           
Sbjct: 201 AKKAYGFLDDAVQE--DKPFFLAIAPIAPHSNVNSSVLNPVKGPDDSVPVSDGKFRVSIP 258

Query: 115 ----QYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQ 166
               ++ ++F +     T  +    NPDK     WI +   ++   + ++ D     RLQ
Sbjct: 259 LAAKRHEHLFKDAKVPRTENF----NPDKPSGADWIRR-QPQLTNENIEYNDRHYQGRLQ 313

Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
            LQ+VD+ V+ ++K+L   G L+NTYI +T+D+GYH+GQ  L  GK   +E DI +P ++
Sbjct: 314 ALQAVDELVDGVIKKLGKHGILDNTYIFFTTDNGYHIGQHRLQPGKECGYEEDINIPLIV 373

Query: 227 RGPGIVPGTM 236
           RGPG+  G +
Sbjct: 374 RGPGVPKGGI 383


>gi|367047595|ref|XP_003654177.1| hypothetical protein THITE_2116964 [Thielavia terrestris NRRL 8126]
 gi|347001440|gb|AEO67841.1| hypothetical protein THITE_2116964 [Thielavia terrestris NRRL 8126]
          Length = 609

 Score = 93.2 bits (230), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 69/248 (27%), Positives = 116/248 (46%), Gaps = 38/248 (15%)

Query: 21  YFGKYLNK-----YNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGE-DYYNDYYP 74
           Y GK  N      YN  + P GW   G+  +   Y    +N + ++ +     Y   +  
Sbjct: 144 YTGKLFNAHTVENYNAPF-PAGWN--GSDFLLDPYTYMYLNASFQRNREAPVSYEGQHSV 200

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG--------------PED--------- 111
           D++A  ++ FL ++ +    +P  L ++  APH               P+D         
Sbjct: 201 DVLAQKALGFLDEAARG--DRPFFLGIAPVAPHSNVQASLLTNHTFDNPDDINKIVGAFT 258

Query: 112 ---SAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
               A ++ ++F +     TP ++        W+ ++ R+    +  F D    +RL+TL
Sbjct: 259 PPIPAERHKHLFADARVPRTPNFNPEAATGANWVRRL-RRQSAENVAFNDHFYRQRLRTL 317

Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           QSVD+ V  +V  L + G L++TY+ YT+D+GYH+GQ  L  GK   FE DI VP ++RG
Sbjct: 318 QSVDELVAAVVDRLAERGLLDSTYVFYTTDNGYHIGQHRLQPGKECGFEEDINVPLIVRG 377

Query: 229 PGIVPGTM 236
           PG+  G +
Sbjct: 378 PGVPRGEV 385


>gi|390605164|gb|EIN14555.1| Arylsulphatase [Punctularia strigosozonata HHB-11173 SS5]
          Length = 581

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 55/168 (32%), Positives = 92/168 (54%), Gaps = 11/168 (6%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG--------PEDSAPQYANMFFNV 123
           Y  DLIA  +  FL  S    + KP  L ++  APH             A ++  +F N 
Sbjct: 201 YQTDLIAEKAYGFL--SDAVNAGKPFFLAVAPTAPHANVKDGVFTEPQPARRHEKLFQNA 258

Query: 124 TSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
               T +++   +    W+L++ ++ +  +  + D L  +RL+ LQSVD+ ++ ++  L+
Sbjct: 259 RVPRTDSFNPEKSSGANWVLELPQQNK-SNVDYNDHLYRQRLRALQSVDEMIDGLITRLE 317

Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
             G L++TYIIYT+D+GYH+GQ  L  GK+  +E DI VP ++RGP +
Sbjct: 318 QRGLLDSTYIIYTADNGYHIGQHRLQPGKTCGYEEDINVPLIVRGPEV 365


>gi|317149839|ref|XP_001823212.2| arylsulfatase precursor [Aspergillus oryzae RIB40]
          Length = 776

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 58/197 (29%), Positives = 99/197 (50%), Gaps = 19/197 (9%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
           +YYN  +  NG        Y   Y  D+I N S+ FL ++  +   +P ML ++  APH 
Sbjct: 343 RYYNAKMTRNGETPV---SYAGQYSTDVIKNKSLGFLDEALAN-PDRPWMLTIAPNAPHS 398

Query: 108 -GPEDS------------APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
            G +D+            AP++A +F         +++   +    W+  +   +     
Sbjct: 399 NGSQDAVTGATWFGEPEYAPRHAQLFKEYKIPRDESFNKIIDGAVGWVADLP-TLTEEEI 457

Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
            + D     RL+ LQ+VD+ + ++ ++L   G LNNTYI +++D+GYH+GQ  +  GK+ 
Sbjct: 458 NYIDEFQRCRLRALQAVDEMIGELFEKLDKAGVLNNTYIFFSTDNGYHIGQHAMQPGKNC 517

Query: 215 PFEFDIRVPFLMRGPGI 231
            +E DI +P  +RGPGI
Sbjct: 518 GYETDINIPLFIRGPGI 534


>gi|350636401|gb|EHA24761.1| hypothetical protein ASPNIDRAFT_183005 [Aspergillus niger ATCC
           1015]
          Length = 591

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 67/235 (28%), Positives = 115/235 (48%), Gaps = 28/235 (11%)

Query: 21  YFGKYLN-----KYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           Y GK  N      YN  +  P        +++   Y+Y  +   R  +  + Y  +Y  D
Sbjct: 133 YTGKLFNVHTVDNYNAPF--PAGFTGNDFLLDPFTYDYLNSTFQRDHEIPQSYEGEYSTD 190

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE-------DSAPQYA----------- 117
           ++A  + R L ++  +  +KP  L ++  APH          D+ P+++           
Sbjct: 191 VLAQKAYRLLNEAVTA--QKPFFLTVAPIAPHCNVFMNGTGLDANPKFSFSAPIPAKRHE 248

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
           ++F +V    TP+++        WI +  ++    +  + D    +RL+TLQ+VD+ V+ 
Sbjct: 249 HLFPDVKVPRTPSFNPEIPSGANWI-KTLKRQNDTNVDYNDHFYRQRLRTLQAVDELVDG 307

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           +   LK+   L+NTYI+Y+SD+GYH+GQ  L  GKS  +E DI VP ++RGP I 
Sbjct: 308 LFTRLKEYNILDNTYIVYSSDNGYHIGQHRLQPGKSCGYEEDINVPLIIRGPDIA 362


>gi|391871485|gb|EIT80645.1| sulfatase [Aspergillus oryzae 3.042]
          Length = 644

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 58/197 (29%), Positives = 100/197 (50%), Gaps = 19/197 (9%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
           +YYN  +  NG        Y   Y  D+I N S+ FL ++  +   +P ML ++  APH 
Sbjct: 152 RYYNAKMTRNGETPV---SYAGQYSTDVIKNKSLGFLDEALAN-PDRPWMLTIAPNAPHS 207

Query: 108 -GPEDS------------APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
            G +D+            AP++A +F +       +++   +    W+  +   +     
Sbjct: 208 NGSQDAVTGATWFGEPEYAPRHAQLFKDYKIPRDESFNKIIDGAVGWVADLP-TLTEEEI 266

Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
            + D     RL+ LQ+VD+ + ++ ++L   G LNNTYI +++D+GYH+GQ  +  GK+ 
Sbjct: 267 NYIDEFQRCRLRALQAVDEMIGELFEKLDKAGVLNNTYIFFSTDNGYHIGQHAMQPGKNC 326

Query: 215 PFEFDIRVPFLMRGPGI 231
            +E DI +P  +RGPGI
Sbjct: 327 GYETDINIPLFIRGPGI 343


>gi|134082572|emb|CAK42487.1| unnamed protein product [Aspergillus niger]
          Length = 585

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 67/235 (28%), Positives = 115/235 (48%), Gaps = 28/235 (11%)

Query: 21  YFGKYLN-----KYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           Y GK  N      YN  +  P        +++   Y+Y  +   R  +  + Y  +Y  D
Sbjct: 131 YTGKLFNVHTVDNYNAPF--PAGFTGNDFLLDPFTYDYLNSTFQRDHEIPQSYEGEYSTD 188

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE-------DSAPQYA----------- 117
           ++A  + R L ++  +  +KP  L ++  APH          D+ P+++           
Sbjct: 189 VLAQKAYRLLNEAVTA--QKPFFLTVAPIAPHCNVFMNGTGLDANPKFSFSAPIPAKRHE 246

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
           ++F +V    TP+++        WI +  ++    +  + D    +RL+TLQ+VD+ V+ 
Sbjct: 247 HLFPDVKVPRTPSFNPEIPSGANWI-KTLKRQNDTNVDYNDHFYRQRLRTLQAVDELVDG 305

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           +   LK+   L+NTYI+Y+SD+GYH+GQ  L  GKS  +E DI VP ++RGP I 
Sbjct: 306 LFTRLKEYNILDNTYIVYSSDNGYHIGQHRLQPGKSCGYEEDINVPLIIRGPDIA 360


>gi|340518976|gb|EGR49216.1| predicted protein [Trichoderma reesei QM6a]
          Length = 593

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 97/198 (48%), Gaps = 24/198 (12%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
           +YYN S   NG    +       Y  D++A  +  FL+ + +  S  P  + ++  APH 
Sbjct: 161 QYYNVSTTRNGAPPTNP---VGKYSTDILAESAQEFLQHALEDESS-PFFITLAPIAPHA 216

Query: 109 -----------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVH 153
                      P   A ++  +F       TP +    NPD      W+  + +    V 
Sbjct: 217 LVGEGSVGAFDPPQVAERHQGLFPEYKIPRTPNF----NPDSPSGVNWVSNLPKLDDEV- 271

Query: 154 RQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
             + D     RL++LQ+VD+ V  +V  L++ G L+NT+I YTSD+GYH+GQ  L  GK+
Sbjct: 272 VDYNDEYQRLRLRSLQAVDEIVGSVVARLEEAGVLDNTFIFYTSDNGYHIGQHRLHPGKT 331

Query: 214 FPFEFDIRVPFLMRGPGI 231
             FE DI VP ++RGPGI
Sbjct: 332 CGFETDINVPLIVRGPGI 349


>gi|317035851|ref|XP_001397044.2| arylsulfatase [Aspergillus niger CBS 513.88]
          Length = 589

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 67/235 (28%), Positives = 115/235 (48%), Gaps = 28/235 (11%)

Query: 21  YFGKYLN-----KYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           Y GK  N      YN  +  P        +++   Y+Y  +   R  +  + Y  +Y  D
Sbjct: 131 YTGKLFNVHTVDNYNAPF--PAGFTGNDFLLDPFTYDYLNSTFQRDHEIPQSYEGEYSTD 188

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE-------DSAPQYA----------- 117
           ++A  + R L ++  +  +KP  L ++  APH          D+ P+++           
Sbjct: 189 VLAQKAYRLLNEAVTA--QKPFFLTVAPIAPHCNVFMNGTGLDANPKFSFSAPIPAKRHE 246

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
           ++F +V    TP+++        WI +  ++    +  + D    +RL+TLQ+VD+ V+ 
Sbjct: 247 HLFPDVKVPRTPSFNPEIPSGANWI-KTLKRQNDTNVDYNDHFYRQRLRTLQAVDELVDG 305

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           +   LK+   L+NTYI+Y+SD+GYH+GQ  L  GKS  +E DI VP ++RGP I 
Sbjct: 306 LFTRLKEYNILDNTYIVYSSDNGYHIGQHRLQPGKSCGYEEDINVPLIIRGPDIA 360


>gi|302839150|ref|XP_002951132.1| periplasmic arylsulfatase, sulfate-repressible [Volvox carteri f.
           nagariensis]
 gi|300263461|gb|EFJ47661.1| periplasmic arylsulfatase, sulfate-repressible [Volvox carteri f.
           nagariensis]
          Length = 652

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 70/238 (29%), Positives = 117/238 (49%), Gaps = 33/238 (13%)

Query: 21  YFGKYLNKYNGSY---IPPGWREWGALIM--NSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           Y GK+L  Y+ S    +P GW +  AL+      Y N   + NG        +Y+    D
Sbjct: 126 YVGKFLVDYSVSNYQNVPAGWTDIDALVTPYTFDYNNPGFSRNGATPNIYPGFYST---D 182

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------------PEDSAPQYA 117
           +IA+ +V  ++ +  +   KP    +S  APH                   P   AP++ 
Sbjct: 183 VIADKAVAQIKTAVAA--GKPFYAQISPIAPHTSTQIYFDPVANATKTFFYPPIPAPRHW 240

Query: 118 NMFFNVT----SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
            +F + T    + H   Y+   +    WI  +    Q  +R + + +   RL++L SVD+
Sbjct: 241 ELFSDATLPEGTSHKNLYEADVSDKPAWIRALPLAQQN-NRTYLEEVYRLRLRSLASVDE 299

Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
            ++++V  L++ G L+NTY+IY++D+GYH+G      GK   ++ D+RVPFL+RGPGI
Sbjct: 300 LIDRVVATLQEAGVLDNTYLIYSADNGYHVGTHRFGAGKVTAYDEDLRVPFLIRGPGI 357


>gi|342874897|gb|EGU76804.1| hypothetical protein FOXB_12701 [Fusarium oxysporum Fo5176]
          Length = 601

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 70/232 (30%), Positives = 114/232 (49%), Gaps = 30/232 (12%)

Query: 26  LNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFL 85
           +N YN  Y P GW      +++   Y+Y   +     +    +   +  DLI+  +   L
Sbjct: 162 INNYNSPY-PAGWTGTN-FLLDPGTYDYLNPIYQHNQEPPVQHKGVHTSDLISKYAHELL 219

Query: 86  RQSKQSFSKKPIMLVMSFPAPHGPED--------SAP---------QYANMFFNVTSHHT 128
           +++  +  + P  + ++  APH   +        SAP         +++++F  V    T
Sbjct: 220 KEAIDA--ENPFFVAIAPVAPHSNVNLRRQPGNPSAPLMTIPIPLERHSHLFEGVKVPRT 277

Query: 129 PAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
             +    NPD      WI ++  ++      + D     RLQ LQ VD+ VE++ K+L+D
Sbjct: 278 ENF----NPDSPSGVSWIHKLA-QLNDSSVSYLDDFYRARLQALQGVDEIVEQVTKQLED 332

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
            G L+ TYIIY+SD+GYHLGQ  L  GK   F+ DIRVP  +RGPG+  G++
Sbjct: 333 AGILDETYIIYSSDNGYHLGQHRLPPGKECGFDEDIRVPLFIRGPGVSSGSI 384


>gi|1703431|sp|Q10723.1|ARS_VOLCA RecName: Full=Arylsulfatase; Short=AS; AltName: Full=Aryl-sulfate
           sulphohydrolase; Flags: Precursor
 gi|452339|emb|CAA54426.1| arylsulfatase [Volvox carteri f. nagariensis]
 gi|110174603|gb|ABG54272.1| arylsulfatase precursor [Volvox carteri f. nagariensis]
          Length = 649

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 70/238 (29%), Positives = 116/238 (48%), Gaps = 33/238 (13%)

Query: 21  YFGKYLNKY---NGSYIPPGWREWGALIM--NSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           Y GK+L  Y   N   +P GW +  AL+      Y N   + NG        +Y+    D
Sbjct: 126 YVGKFLVDYSVSNYQNVPAGWTDIDALVTPYTFDYNNPGFSRNGATPNIYPGFYST---D 182

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------------PEDSAPQYA 117
           +IA+ +V  ++ +  +   KP    +S  APH                   P   AP++ 
Sbjct: 183 VIADKAVAQIKTAVAA--GKPFYAQISPIAPHTSTQIYFDPVANATKTFFYPPIPAPRHW 240

Query: 118 NMFFNVT----SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
            +F + T    + H   Y+   +    WI  +    Q  +R + + +   RL++L SVD+
Sbjct: 241 ELFSDATLPEGTSHKNLYEADVSDKPAWIRALPLAQQN-NRTYLEEVYRLRLRSLASVDE 299

Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
            ++++V  L++ G L+NTY+IY++D+GYH+G      GK   ++ D+RVPFL+RGPGI
Sbjct: 300 LIDRVVATLQEAGVLDNTYLIYSADNGYHVGTHRFGAGKVTAYDEDLRVPFLIRGPGI 357


>gi|58265540|ref|XP_569926.1| Arylsulfatase precursor [Cryptococcus neoformans var. neoformans
           JEC21]
 gi|134108857|ref|XP_776543.1| hypothetical protein CNBC0370 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50259223|gb|EAL21896.1| hypothetical protein CNBC0370 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57226158|gb|AAW42619.1| Arylsulfatase precursor, putative [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 604

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 71/237 (29%), Positives = 109/237 (45%), Gaps = 30/237 (12%)

Query: 21  YFGKYLNKY---NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           Y GK++N +   N   +P        ++++   Y+Y      R     + +  +Y  DL+
Sbjct: 127 YTGKFMNDHTDANCESLPVSGFNSSDILVDPYTYDYWTPGFSRDNGPVKVHAGEYSTDLV 186

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED----------------SAPQYANMFF 121
              +V +L  + Q    +P  L ++  A H   D                S P++A +F 
Sbjct: 187 HEKAVNYLEDALQE--DRPFYLTVAPVACHSWLDYKQQGDIKKFVTDIPASHPRHARLFP 244

Query: 122 NVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
                 T  +    NPD+     W+ ++  K+      + D     RL+ LQ+VD+ VE 
Sbjct: 245 VEQIERTENW----NPDEPSGVSWVKELP-KLNSTEEAYLDEFFRGRLRALQAVDELVED 299

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           IV  L+  GEL+NTYI YT+D+GY LG      GK+  FE DI VPF+ RGPGI  G
Sbjct: 300 IVDRLEKAGELDNTYIFYTADNGYALGSHRRQPGKTLGFEEDIHVPFIARGPGIKKG 356


>gi|340931779|gb|EGS19312.1| arylsulfatase-like protein [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 630

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 72/248 (29%), Positives = 117/248 (47%), Gaps = 46/248 (18%)

Query: 21  YFGKYLN-----KYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           Y GK  N      YN  Y P GW +    +++   Y Y      R  +  + Y   +  D
Sbjct: 165 YTGKLFNAHTVENYNKPY-PAGWNQ-SDFLLDPYTYMYLNATYQRNREPPKSYKGQHTVD 222

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG--------------------------- 108
           ++A  ++ F+  + ++  ++P  L ++  APH                            
Sbjct: 223 VLAEKALGFIDIAAKN--ERPFFLGIAPVAPHSNVQSKLLDDIDESDPWKDIDDEDETLF 280

Query: 109 -PEDSAPQYANMFFNVTSHHTPAYDYAPNPD----KQWILQVTRKMQPVHRQFTDLLMTK 163
            P   A ++ ++F +      P +    NPD      W+ ++  K+   + +F D    +
Sbjct: 281 SPPIPAERHKHLFPDAKVPRNPNF----NPDVPSGANWVRKLP-KLSDENVEFNDHFYRQ 335

Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
           RL+ LQSVD+ VE +V+ L++LG L+NTYIIYT+D+GYH+ Q  L  GK   FE DI VP
Sbjct: 336 RLRALQSVDELVEGVVERLEELGILDNTYIIYTTDNGYHISQHRLQPGKECGFEEDINVP 395

Query: 224 FLMRGPGI 231
            ++RGPG+
Sbjct: 396 LIIRGPGV 403


>gi|302902097|ref|XP_003048580.1| hypothetical protein NECHADRAFT_95823 [Nectria haematococca mpVI
           77-13-4]
 gi|256729513|gb|EEU42867.1| hypothetical protein NECHADRAFT_95823 [Nectria haematococca mpVI
           77-13-4]
          Length = 571

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 67/229 (29%), Positives = 112/229 (48%), Gaps = 27/229 (11%)

Query: 26  LNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFL 85
           +N YN  + P GW      +++   Y+Y   +  R  +    ++  +  DLI+  +   L
Sbjct: 134 INNYNSPF-PAGWTGTN-FLLDPGTYSYLNPIYQRNDEPPVQHHGTHTLDLISKYAHDLL 191

Query: 86  RQSKQSFSKKPIMLVMSFPAPHGPED--------------SAPQYANMFFNVTSHHTPAY 131
            ++  S   KP  + ++  APH   D               A +++++F  V    T  +
Sbjct: 192 DEAISS--DKPFFVAIAPVAPHSNIDVNRGSGPPTMTIPIPAERHSHLFEGVKIPRTANF 249

Query: 132 DYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
               NPD      WI Q++ ++      + D     RLQ LQ VD+ VE+++ +L   G 
Sbjct: 250 ----NPDSPSGVSWIRQLS-QLNETAVSYLDDYYRARLQVLQGVDEIVEQVIHQLDSAGL 304

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           L++TY++Y+SD+G+HLGQ  L  GK   F+ DIRVP  +RGPG+  G +
Sbjct: 305 LDDTYVVYSSDNGFHLGQHRLPPGKECGFDEDIRVPLFIRGPGVSAGAV 353


>gi|83771949|dbj|BAE62079.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 644

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/197 (29%), Positives = 99/197 (50%), Gaps = 19/197 (9%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
           +YYN  +  NG        Y   Y  D+I N S+ FL ++  +   +P ML ++  APH 
Sbjct: 152 RYYNAKMTRNGETPV---SYAGQYSTDVIKNKSLGFLDEALAN-PDRPWMLTIAPNAPHS 207

Query: 108 -GPEDS------------APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
            G +D+            AP++A +F         +++   +    W+  +   +     
Sbjct: 208 NGSQDAVTGATWFGEPEYAPRHAQLFKEYKIPRDESFNKIIDGAVGWVADLP-TLTEEEI 266

Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
            + D     RL+ LQ+VD+ + ++ ++L   G LNNTYI +++D+GYH+GQ  +  GK+ 
Sbjct: 267 NYIDEFQRCRLRALQAVDEMIGELFEKLDKAGVLNNTYIFFSTDNGYHIGQHAMQPGKNC 326

Query: 215 PFEFDIRVPFLMRGPGI 231
            +E DI +P  +RGPGI
Sbjct: 327 GYETDINIPLFIRGPGI 343


>gi|358385978|gb|EHK23574.1| hypothetical protein TRIVIDRAFT_190530 [Trichoderma virens Gv29-8]
          Length = 576

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 63/198 (31%), Positives = 95/198 (47%), Gaps = 24/198 (12%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
           +YYN S + NG       +    Y  D++A  +  FL  +       P  + ++  APH 
Sbjct: 148 QYYNVSTSKNGAPPT---NPVGKYSTDIVAESANEFLDHALLD-DGSPFFITLAPIAPHA 203

Query: 109 -----------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVH 153
                      P   A ++  +F       TP +    NPD      W+ ++  K+    
Sbjct: 204 LVGEGSLGAFDPPQVAERHEGLFPEYKIPRTPNF----NPDSPSGVNWVSKLP-KLNDTI 258

Query: 154 RQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
             + D    KRL++LQ+VD+ VE +V  L   G L +T+I YTSD+GYH+GQ  L  GK+
Sbjct: 259 LDYNDEYQRKRLRSLQAVDEIVESVVARLDQAGVLEDTFIFYTSDNGYHIGQHRLHPGKT 318

Query: 214 FPFEFDIRVPFLMRGPGI 231
             FE DI VP ++RGPG+
Sbjct: 319 CGFETDINVPLVVRGPGV 336


>gi|159467885|ref|XP_001692122.1| periplasmic arylsulfatase [Chlamydomonas reinhardtii]
 gi|158278849|gb|EDP04612.1| periplasmic arylsulfatase [Chlamydomonas reinhardtii]
          Length = 654

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 73/238 (30%), Positives = 114/238 (47%), Gaps = 33/238 (13%)

Query: 21  YFGKYLNKY---NGSYIPPGWREWGALIMNSKYYNYSINMNG--RKIKHGEDYYNDYYPD 75
           Y GK+L  Y   N   +P GW +  AL+     Y +  N  G  R       Y  +Y  D
Sbjct: 132 YVGKFLVDYSVSNYQQVPAGWDDIDALVTP---YTFDYNTPGFSRNGATPNIYPGEYSTD 188

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------------PEDSAPQYA 117
           +I +  V  ++ +  +   KP    +S  APH                   P   AP++ 
Sbjct: 189 VIRDKGVAQIKSAVAA--GKPFYAQISPIAPHTSTQISTNPATGVTRSYFFPPIPAPRHW 246

Query: 118 NMFFNVT-SHHTPA---YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
            +F +      TP    Y+   +    WI  +    Q  +R + + +   RL++L +VD+
Sbjct: 247 QLFSDANLPGGTPNKNLYEVDVSDKPAWIRALPLAQQ-NNRTYLEEIYRLRLRSLAAVDE 305

Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
            +E++VK L + G L+NTYIIY++D+GYH+G      GK+  +E D+RVPFL+RGPGI
Sbjct: 306 LIEQVVKTLDEAGVLDNTYIIYSADNGYHVGAHRFGAGKTTGYEEDLRVPFLIRGPGI 363


>gi|154293528|ref|XP_001547281.1| hypothetical protein BC1G_13903 [Botryotinia fuckeliana B05.10]
          Length = 585

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 78/262 (29%), Positives = 114/262 (43%), Gaps = 40/262 (15%)

Query: 21  YFGKYLNKYNGSYI----PPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GK  N +  S      P GW     L+     +YYN +   N       E    +Y  
Sbjct: 128 YTGKLFNHHTVSNWNNPHPAGWTSSDFLLDPFTYQYYNSTFQRNEHPPVSHE---GEYST 184

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED----------------SAPQYAN 118
           D++A  +   L +  QS   KP  LV +  APH   D                 A ++ +
Sbjct: 185 DVLAQKAYGLLDEGVQS--GKPFFLVAAPIAPHANVDLAAFKPGEGVLMTTPLPAERHKH 242

Query: 119 MFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
           +F          +    NPDK     WI  + +    V   + D     RLQTLQ+VD+ 
Sbjct: 243 LFPGAKVPRNANF----NPDKPSGASWIKHLDQLTDEV-IAYNDEFYRARLQTLQAVDEL 297

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +E +V  L D G L+NTY I+++D+GYH+ Q  L  GK   FE DI +P ++RGPG+  G
Sbjct: 298 IEGLVDRLTDYGILDNTYFIFSTDNGYHISQHRLAPGKECGFEEDINIPLVIRGPGVPQG 357

Query: 235 TMYLLDV----LIPQVRKFSSG 252
               +      L P + K ++G
Sbjct: 358 EETSIVTAHTDLAPTILKIANG 379


>gi|169779723|ref|XP_001824326.1| arylsulfatase [Aspergillus oryzae RIB40]
 gi|83773065|dbj|BAE63193.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 598

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 63/180 (35%), Positives = 91/180 (50%), Gaps = 25/180 (13%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED------------SAP----QYAN 118
           D++   ++  L  +     ++P  L ++  APH   +            SAP    ++ +
Sbjct: 200 DVLRKKALDLLDDAVAESHERPFFLTIAPIAPHSNFEMTNASDYTTFRFSAPIPLERHKD 259

Query: 119 MFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
           +F  V    T  +    NPD+     WI  + ++ Q       D     RL+ LQ VD+ 
Sbjct: 260 LFPEVKVPRTEHF----NPDQPSGVNWISTLPQQNQS-SIDSNDEFYRARLRALQGVDEI 314

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           VE+IV+ L+D G L+NTYI YTSD+GYH+GQ  L  GK   FE DIRVP  +RGPGI  G
Sbjct: 315 VEQIVQRLEDAGVLDNTYIFYTSDNGYHIGQHRLHPGKECGFEEDIRVPMFIRGPGIPSG 374


>gi|407921643|gb|EKG14784.1| Sulfatase [Macrophomina phaseolina MS6]
          Length = 368

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 49/124 (39%), Positives = 73/124 (58%), Gaps = 3/124 (2%)

Query: 111 DSAPQYANMFFNVTSHHTPAYDYAPN--PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
           ++A ++ N+FFNVT   TP ++ A        WI  +   +      + D     RL+ L
Sbjct: 25  EAAKRHQNLFFNVTVPRTPHFNPAGGNAGAASWIKDLPL-LNASEVAYGDEFYRLRLEAL 83

Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           Q+VD+ ++ +V  L  LG LN+TY+ YTSD+G+H+GQ  L  GKS  +E DI VPF +RG
Sbjct: 84  QAVDELIDALVARLGALGVLNDTYVFYTSDNGFHIGQHRLQPGKSCGYEEDILVPFFVRG 143

Query: 229 PGIV 232
           PG+ 
Sbjct: 144 PGVA 147


>gi|358373836|dbj|GAA90432.1| arylsulfatase [Aspergillus kawachii IFO 4308]
          Length = 820

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 63/235 (26%), Positives = 116/235 (49%), Gaps = 28/235 (11%)

Query: 21  YFGKYLN-----KYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           Y GK  N      YN  + P G+      +++   Y+Y  +   R  +  + Y  +Y  D
Sbjct: 130 YTGKLFNVHTVDNYNAPF-PAGFTG-NDFLLDPFTYDYLNSTFQRDREPPQSYEGEYSTD 187

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE-------DSAPQYA----------- 117
           ++A  + R L ++  +  +KP  L ++  APH          D+ P+++           
Sbjct: 188 VLAQKAYRLLDEAVAA--QKPFFLTVAPIAPHCNVFMNGTGLDANPKFSFSAPIPAKRHE 245

Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
           ++F ++    TP+++        WI +  ++    +  + D    +RL+ LQ++D+ V+ 
Sbjct: 246 DLFSDIKVPRTPSFNPEKPSGANWI-KTLKRQNDTNIDYNDHFYRQRLRALQAIDELVDG 304

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           +   L+D   L+NTY++Y+SD+GYH+GQ  L  GKS  +E DI VP ++RGP + 
Sbjct: 305 LFSRLEDYDILDNTYVVYSSDNGYHIGQHRLQPGKSCGYEEDINVPLIVRGPSVA 359


>gi|67902708|ref|XP_681610.1| hypothetical protein AN8341.2 [Aspergillus nidulans FGSC A4]
 gi|40747747|gb|EAA66903.1| hypothetical protein AN8341.2 [Aspergillus nidulans FGSC A4]
 gi|259484268|tpe|CBF80344.1| TPA: arylsulfatase, putative (AFU_orthologue; AFUA_8G02520)
           [Aspergillus nidulans FGSC A4]
          Length = 608

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 70/233 (30%), Positives = 110/233 (47%), Gaps = 39/233 (16%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           Y   Y+N + GS            +++   Y+Y  +   R       Y   +  D+I   
Sbjct: 169 YHSPYVNGFTGS----------DFLLDPFTYSYLNSTYQRNGDEPVSYEGRHTVDVITEK 218

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPH--------GPEDSAP----------QYANMFFN 122
           ++ FL         +P  L ++  APH        G +  AP          ++ ++F +
Sbjct: 219 ALGFLDDGLNG--DRPFFLTVAPVAPHSNVDVSALGADRRAPTIMTEPIPLDRHKSLFQD 276

Query: 123 VTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
           V    T  +    NPD+     WI  + ++ +    ++ D    +RL+ LQ VD+ V+ I
Sbjct: 277 VKVPRTKHF----NPDEPSGVSWIRDLPQQDEST-IEYNDHFYRQRLRALQGVDELVDLI 331

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           V  L+  G+L+NTYIIYTSD+GYH+GQ  L  GK+  FE DIRVP  +RGPG+
Sbjct: 332 VTRLEASGQLDNTYIIYTSDNGYHIGQHRLPPGKACGFEEDIRVPLFIRGPGV 384


>gi|121714837|ref|XP_001275028.1| arylsulfatase, putative [Aspergillus clavatus NRRL 1]
 gi|119403184|gb|EAW13602.1| arylsulfatase, putative [Aspergillus clavatus NRRL 1]
          Length = 584

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 68/238 (28%), Positives = 114/238 (47%), Gaps = 26/238 (10%)

Query: 21  YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           Y GK +N +      S  P GW      +++   Y+Y  +   R     + Y   +  ++
Sbjct: 134 YTGKLMNGHRLTNYDSPFPKGWTG-SDFLLDPNTYSYHNSTYQRNQDPPKSYAGHHTSEV 192

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--------------GPEDSAP----QYAN 118
           IA  ++ FL  +  S   KP  L +S  APH              GP  + P    ++ +
Sbjct: 193 IAEKAMGFLEDALGS--DKPFFLGVSPVAPHTNIEPIDLGPNAVNGPRMTEPVPLEKHMH 250

Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
           +F +     TP ++        ++L +    Q V   + D    +RL++LQSVD+ V+ +
Sbjct: 251 LFKDTKVPRTPNFNPTTASGASYVLGLPLLNQTV-IDYQDHFYRQRLRSLQSVDELVDSL 309

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           V+ L++ G  +NTYI YT+D+G+H+G   L  GK+  +E DI VP  +RGPG+  G +
Sbjct: 310 VRRLEESGRADNTYIFYTADNGFHIGNHRLPPGKTTGYEEDIHVPLFVRGPGVPKGKV 367


>gi|159467585|ref|XP_001691972.1| periplasmic arylsulfatase [Chlamydomonas reinhardtii]
 gi|12698891|gb|AAK01720.1|AF333184_1 arylsulfatase 2 precursor [Chlamydomonas reinhardtii]
 gi|20750294|gb|AAM23308.1| arylsuflatase 2 [Chlamydomonas reinhardtii]
 gi|158278699|gb|EDP04462.1| periplasmic arylsulfatase [Chlamydomonas reinhardtii]
          Length = 649

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 71/238 (29%), Positives = 115/238 (48%), Gaps = 33/238 (13%)

Query: 21  YFGKYLNKYNGS---YIPPGWREWGALIMNSKYYNYSINMNG--RKIKHGEDYYNDYYPD 75
           Y GK+L  Y+ S    +P GW +  AL+     Y +  N  G  R       Y  +Y  D
Sbjct: 127 YVGKFLVDYSVSNYQQVPAGWDDIDALVTP---YTFDYNTPGFSRNGATPNIYPGEYSTD 183

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------------PEDSAPQYA 117
           +I +  +  ++ +  +   KP    +S  APH                   P   AP++ 
Sbjct: 184 VIRDKGIAQIKSAVAA--GKPFYAQISPIAPHTSTQISTDPVTGVTRSFFYPPIPAPRHW 241

Query: 118 NMFFNVT-SHHTPA---YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
            +F +      TP    Y+   +    W+  +    Q  +R + + +   RL++L +VD+
Sbjct: 242 QLFSDANLPGGTPNKNLYEVDVSDKPAWVRALPLAQQ-NNRTYLEEIYRLRLRSLAAVDE 300

Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
            +E++VK L + G L+NTYIIY++D+GYH+G      GK+  +E D+RVPFL+RGPGI
Sbjct: 301 LIEQVVKTLDEAGVLDNTYIIYSADNGYHVGAHRFGAGKTTGYEEDLRVPFLIRGPGI 358


>gi|115388119|ref|XP_001211565.1| hypothetical protein ATEG_02387 [Aspergillus terreus NIH2624]
 gi|114195649|gb|EAU37349.1| hypothetical protein ATEG_02387 [Aspergillus terreus NIH2624]
          Length = 576

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 100/199 (50%), Gaps = 19/199 (9%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
            YYN S   N    K    Y   Y  D+I    + FL  + +S   +P  L ++  APH 
Sbjct: 167 SYYNSSYQRNHEPPK---SYAGQYTTDVITEKGLGFLDDALES--DRPFFLTVAPIAPHS 221

Query: 109 PEDS-------------APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQ 155
             D              AP++ ++F +V    T  ++         I  +  + + V   
Sbjct: 222 NIDPSAAGSGRMTEPIPAPRHRHLFKDVKVPRTANFNPLNRTGVSSIWDLPLQNKTV-VD 280

Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
           + D    +RL+ LQSVD+ V+K++  L+  G ++NTY+I+++D+G+H+GQ  L  GK+  
Sbjct: 281 YEDHFYRQRLRALQSVDEMVDKLIDRLEKSGAIDNTYVIFSADNGFHIGQHRLGPGKTTG 340

Query: 216 FEFDIRVPFLMRGPGIVPG 234
           ++ DIRVPF +RGPGI  G
Sbjct: 341 YDEDIRVPFFIRGPGIPEG 359


>gi|452842014|gb|EME43950.1| hypothetical protein DOTSEDRAFT_171886 [Dothistroma septosporum
           NZE10]
          Length = 591

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 68/231 (29%), Positives = 111/231 (48%), Gaps = 32/231 (13%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGED---YYNDYYPDLI 77
           Y   ++N +NGS            +++   Y Y    N    ++GE    Y   Y PD++
Sbjct: 134 YDDPFVNGFNGS----------DFLLDPYTYEY---FNAHMTRNGEGPVAYTGLYSPDVV 180

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG-----------PEDS---APQYANMFFNV 123
           A  +  FL ++      +P  + ++  APH             +D+   A ++A++F   
Sbjct: 181 AQKAYGFLEEAVLH-KDRPFFIGVAPIAPHSNMRIDVENGEFSKDTPFYAERHAHLFKEY 239

Query: 124 TSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
               T  ++        W+  + +    V + + D     RL++LQSVD+ VE++V  L+
Sbjct: 240 KIPRTANFNPGVPSGVSWVKDLPQLNDTVIK-YHDEFQRARLRSLQSVDEMVEEVVHRLE 298

Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
             G L+NTYI YT+D+GYH+GQ  L  GK  PFE DI +P  +RGPG+  G
Sbjct: 299 AHGLLDNTYIFYTTDNGYHIGQHRLPPGKECPFEEDIHIPLAVRGPGVPAG 349


>gi|391873570|gb|EIT82595.1| sulfatase [Aspergillus oryzae 3.042]
          Length = 598

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 61/176 (34%), Positives = 88/176 (50%), Gaps = 17/176 (9%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED--SAPQYANMFFNV---TSHHTP 129
           D++   ++  L  +     ++P  L ++  APH   +  +A  Y    F+       H  
Sbjct: 200 DVLRKKALGLLDDAVTESHERPFFLTIAPIAPHSNFEMTNASDYTTFRFSAPIPLERHKD 259

Query: 130 AYDYAP-------NPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
            +  A        NPD+     WI  + ++ Q       D     RL+ LQ VD+ VE++
Sbjct: 260 LFPEAKVPRTEHFNPDQPSGVNWISTLPQQNQS-SIDSNDEFYRARLRALQGVDEIVERV 318

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           V+ L+D G L+NTYI YTSD+GYH+GQ  L  GK   FE DIRVP  +RGPGI  G
Sbjct: 319 VQRLEDAGVLDNTYIFYTSDNGYHIGQHRLHPGKECGFEEDIRVPMFIRGPGIPSG 374


>gi|159479262|ref|XP_001697712.1| hypothetical protein CHLREDRAFT_192731 [Chlamydomonas reinhardtii]
 gi|158274080|gb|EDO99864.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 686

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 79/245 (32%), Positives = 117/245 (47%), Gaps = 38/245 (15%)

Query: 23  GKYLNKYNGSYI-----PPGWREWGALIMN----SKYY---------NYSINMNGRKIKH 64
           GK+LN +    +     P GW     LI +    S+Y          NY  N  G     
Sbjct: 165 GKFLNGFTYKAVKKFGCPKGWTVADPLIQDFTTDSEYEFSEDYPRAPNYMTNCQGEV--- 221

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH---GP---EDSAPQYAN 118
            + Y+ DY+ ++I + ++ +L     + S KP  L +S  APH   GP       P Y +
Sbjct: 222 -QTYFGDYHEEVIRSKALTYLDDG--AASGKPFFLYISTVAPHDAKGPGAYPQVPPAYRD 278

Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL--LMTKRLQTLQSVDDAVE 176
           +F  V +  T  +   P P          K  P     TD+      RLQ+L++VDD ++
Sbjct: 279 LFPGVKAPRTGNWG-KPVPAAVGF----SKQTPNDFNVTDIDGRYRARLQSLRAVDDTID 333

Query: 177 KIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
            ++K L   G L+NT I+YTSD+G+ LG   + + K   FE D+RVPFLM GPG VP  +
Sbjct: 334 AVMKRLACHGLLDNTVIMYTSDNGFKLGNHNMAQEKFTYFEEDVRVPFLMAGPG-VPRGV 392

Query: 237 YLLDV 241
           Y+ +V
Sbjct: 393 YVPEV 397


>gi|302905774|ref|XP_003049336.1| hypothetical protein NECHADRAFT_45116 [Nectria haematococca mpVI
           77-13-4]
 gi|256730271|gb|EEU43623.1| hypothetical protein NECHADRAFT_45116 [Nectria haematococca mpVI
           77-13-4]
          Length = 580

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 58/200 (29%), Positives = 98/200 (49%), Gaps = 20/200 (10%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
           +Y+N  I  NG +      Y   Y  D+IA  ++ +L ++       P  L +S  APH 
Sbjct: 152 QYWNAKITHNGEEPV---SYAGQYSTDVIAKKALAWLDEALAE--PDPFFLTVSPIAPHS 206

Query: 109 --------------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
                            SAP++ ++F +       +++ A +    W+  +   +     
Sbjct: 207 NWVIDTERDLSYLEEPKSAPRHQHLFSDYVIPREKSFNAAIDGAAGWVGDLP-PLNETTV 265

Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
            + D    +RL+ LQSVD+ V ++V  L+  G+L+NT+I YT+D+GYH+ Q  +  GK  
Sbjct: 266 TYNDHYQRQRLRALQSVDEMVSELVGRLEKAGQLDNTFIFYTTDNGYHISQHRMNPGKEC 325

Query: 215 PFEFDIRVPFLMRGPGIVPG 234
            ++ DI +PF +RGPGI  G
Sbjct: 326 GYDTDIHIPFFVRGPGIPAG 345


>gi|167521003|ref|XP_001744840.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776454|gb|EDQ90073.1| predicted protein [Monosiga brevicollis MX1]
          Length = 589

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 64/208 (30%), Positives = 94/208 (45%), Gaps = 40/208 (19%)

Query: 61  KIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH---GP----EDSA 113
            I      Y +Y   +I N S++++    ++ + +P M  ++  APH   GP       A
Sbjct: 225 AITFNNGTYANYSTSVIGNVSLQWIE--SKARANQPFMAYIAPKAPHIQDGPGWPKPIPA 282

Query: 114 PQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFT---------------- 157
           P YAN F  V++   P ++ +  PD  W++     M       T                
Sbjct: 283 PWYANAFAQVSAPRPPNFNVSA-PDHHWLVASQAPMTAEEVSDTKSTTFASSLYSAQSIG 341

Query: 158 --------------DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHL 203
                         D L   R ++L SVDD VE +V  L    +L+ TYI+YTSDHGY  
Sbjct: 342 RLSGLVCVQQEVNMDELYRDRWRSLLSVDDLVESLVNTLAATNQLDRTYILYTSDHGYRF 401

Query: 204 GQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           GQF +  GK   +E D+R+PF+ RGPGI
Sbjct: 402 GQFRMPMGKWGLYENDVRIPFVARGPGI 429


>gi|238500393|ref|XP_002381431.1| arylsulfatase, putative [Aspergillus flavus NRRL3357]
 gi|220693184|gb|EED49530.1| arylsulfatase, putative [Aspergillus flavus NRRL3357]
          Length = 598

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 61/176 (34%), Positives = 88/176 (50%), Gaps = 17/176 (9%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED--SAPQYANMFFNV---TSHHTP 129
           D++   ++  L  +     ++P  L ++  APH   +  +A  Y    F+       H  
Sbjct: 200 DVLRKKALGLLDDAVTESHERPFFLTIAPIAPHSNFEMTNASDYTTFRFSAPIPLERHKD 259

Query: 130 AYDYAP-------NPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
            +  A        NPD+     WI  + ++ Q       D     RL+ LQ VD+ VE++
Sbjct: 260 LFPEAKVPRTEHFNPDQPSGVNWISTLPQQNQS-SIDSNDEFYRARLRALQGVDEIVEQV 318

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           V+ L+D G L+NTYI YTSD+GYH+GQ  L  GK   FE DIRVP  +RGPGI  G
Sbjct: 319 VQRLEDAGVLDNTYIFYTSDNGYHIGQHRLHPGKECGFEEDIRVPMFIRGPGIPSG 374


>gi|380477817|emb|CCF43942.1| sulfatase [Colletotrichum higginsianum]
          Length = 607

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 69/248 (27%), Positives = 117/248 (47%), Gaps = 41/248 (16%)

Query: 21  YFGKYLNKYN-GSYIPPGWREWGA--LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           Y GK  N +   +Y  P  + W +   +++   Y+Y      R  +   ++  +Y  DL+
Sbjct: 141 YTGKMFNAHTIWNYDSPHLKGWNSSDFLLDPNTYDYYNATFQRNHEKPVNHLGEYSTDLV 200

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDSAP----------- 114
           A  +  FL  + ++   KP  L ++  APH            GP+D+ P           
Sbjct: 201 AEKAYGFLDDAVEA--SKPFFLAIAPIAPHSNVNASILLPVKGPDDNVPVSDDKFRVNFP 258

Query: 115 ----QYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQ 166
               ++ ++F +     T  +    NPDK     WI +   K+   + ++ D     RL+
Sbjct: 259 LAAKRHEHLFTDAKVPRTDNF----NPDKPSGADWIRR-QPKLTDENVEYNDRHYQGRLR 313

Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
            LQ+VD+ V+ +V  L+  G L+NTY+ YT+D+GYH+ Q  L  GK   +E DI +P ++
Sbjct: 314 ALQAVDELVDGLVDRLEKHGILDNTYVFYTTDNGYHISQHRLQPGKECGYEEDINIPLIV 373

Query: 227 RGPGIVPG 234
           RGPG+  G
Sbjct: 374 RGPGVPKG 381


>gi|405122939|gb|AFR97704.1| arylsulfatase [Cryptococcus neoformans var. grubii H99]
          Length = 604

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 71/237 (29%), Positives = 107/237 (45%), Gaps = 30/237 (12%)

Query: 21  YFGKYLNKY---NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           Y GK +N +   N   +P        ++++   Y+Y      R     E +  +Y  DL+
Sbjct: 127 YTGKLMNDHTDANCESLPVSGFNSSDILVDPYTYDYWTPGFSRDNGPVEVHAGEYSTDLV 186

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED----------------SAPQYANMFF 121
              +V +L  + Q    +P  L ++  A H   D                S P++A +F 
Sbjct: 187 HEKAVNYLEDALQE--DQPFYLTVAPVACHSWLDHKHQVDLHKFVADIPASHPRHARLFP 244

Query: 122 NVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
                 T  +    NPD+     W+ ++  K+      + D     RL+ LQ+VD+ VE 
Sbjct: 245 VEQIERTENW----NPDEPSGVSWVKELP-KLNSTQEAYLDEFFRGRLRALQAVDEIVED 299

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           IV  L+  GEL+NTYI YT+D+GY LG      GK+  FE DI VP + RGPGI  G
Sbjct: 300 IVDRLEKAGELDNTYIFYTADNGYALGSHRRQPGKTLGFEEDIHVPLIARGPGIKKG 356


>gi|258564744|ref|XP_002583117.1| hypothetical protein UREG_07890 [Uncinocarpus reesii 1704]
 gi|237908624|gb|EEP83025.1| hypothetical protein UREG_07890 [Uncinocarpus reesii 1704]
          Length = 587

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 64/225 (28%), Positives = 106/225 (47%), Gaps = 18/225 (8%)

Query: 21  YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           Y GK +N +        +  GW      +++   Y+Y      R       Y   Y  D+
Sbjct: 130 YVGKLMNAHTVVNYNQPVAAGWTG-SDFLLDPTTYDYWGAKMSRNGAAPVSYAGQYSSDV 188

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP----------QYANMFFNVTSH 126
           +A   + F+  + +    KP  LV +  +PHG     P          ++A++F +    
Sbjct: 189 VAEKVLGFMDDALKG--DKPFFLVAAPVSPHGQIKPPPLWFDKPEYPERHAHLFKDYKIP 246

Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
            T  ++        WI  + R  Q     + D     RL++LQSVD+ V  ++++LK   
Sbjct: 247 RTSNFNPETPSGAGWIAAMPRLNQ-TQIDYHDEYQRCRLRSLQSVDEMVGDMIEKLKQHD 305

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           EL+NTY IYT+D+G+H+ Q+ L  GK+  F+ DIR+P + RGPG+
Sbjct: 306 ELDNTYFIYTTDNGFHISQYRLPPGKTCGFDTDIRIPMVARGPGV 350


>gi|119483994|ref|XP_001261900.1| arylsulfatase, putative [Neosartorya fischeri NRRL 181]
 gi|119410056|gb|EAW20003.1| arylsulfatase, putative [Neosartorya fischeri NRRL 181]
          Length = 583

 Score = 90.1 bits (222), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 68/238 (28%), Positives = 115/238 (48%), Gaps = 26/238 (10%)

Query: 21  YFGKYLNKYNGSY----IPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           Y GK +N ++ S      P GW      +++   Y Y  +   R  +  ++Y   Y  D+
Sbjct: 134 YTGKLMNGHSTSNYNSPFPKGWNG-SDFLLDPYTYAYLNSTYQRNREPPKNYAGQYTTDV 192

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--------------GPEDSAP----QYAN 118
           I   ++ FL  +  S   +P  L ++  APH              G   +AP    ++ +
Sbjct: 193 IREKAMGFLNDALSS--DRPFFLAVAPIAPHSNIEPSALGTGGAKGTLMTAPIPAERHKD 250

Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
           +F +     T  ++        W+  +    Q V   + D    +RL++LQ++D+ V+ +
Sbjct: 251 LFPDAKVPRTANFNPKVPTGGGWVQHLPLANQTV-IDYQDHFYRQRLRSLQALDEMVDAL 309

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           +  L++ G+  NTYIIYT+D+G+H+GQ  L  GKS  +E DI VPF +RGPGI  G +
Sbjct: 310 ITRLEESGQAENTYIIYTADNGFHIGQHRLPPGKSSGYEEDIHVPFYIRGPGIPKGKI 367


>gi|46126687|ref|XP_387897.1| hypothetical protein FG07721.1 [Gibberella zeae PH-1]
          Length = 603

 Score = 90.1 bits (222), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 56/192 (29%), Positives = 98/192 (51%), Gaps = 32/192 (16%)

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS------------------ 112
           +Y  D++A  +  FL ++ +     P  L ++  APH   D+                  
Sbjct: 194 EYVTDVLARKTQGFLDEALKE--DAPFFLTIAPNAPHSNVDASFAKINFTDTPRNWTFKM 251

Query: 113 -----APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTR---KMQPVHRQFTDLLMTKR 164
                AP++A++F +V    T  +    NPD Q +    R   K+   +  + D     R
Sbjct: 252 TTPIPAPRHAHLFKDVVVPRTENF----NPDSQIVPSWIRDQPKLTQENIDWNDDFYRNR 307

Query: 165 LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPF 224
           L+ LQ+VD+ ++ + K+L+D G L++TY++YT+D+G+H+ Q  L  GK   +E DI +P 
Sbjct: 308 LRALQAVDELIDDVFKKLEDAGVLDHTYVVYTTDNGFHISQHRLQPGKECGYEEDINIPL 367

Query: 225 LMRGPGIVPGTM 236
           ++RGPGI  G++
Sbjct: 368 IIRGPGIPAGSI 379


>gi|346321498|gb|EGX91097.1| arylsulfatase precursor [Cordyceps militaris CM01]
          Length = 603

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 74/231 (32%), Positives = 112/231 (48%), Gaps = 29/231 (12%)

Query: 21  YFGKYLN-----KYNGSYIPPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYY 73
           Y GK +N      YN  + P GW     L+     +Y N  ++++ ++ +        Y 
Sbjct: 158 YTGKLMNMHGVETYNNPF-PAGWNRSDYLLDPGTYRYLNSVMSLDEQEYR---SMAGTYS 213

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS--------APQYANMFFNVTS 125
            D I + +V FL     +   KP  L ++   PHG +          A ++ ++F ++  
Sbjct: 214 TDNIRDRAVEFLGNGIDA--GKPFFLGVAPIGPHGQDTPKGFDHPIPADRHKDLFPDLQV 271

Query: 126 HHTPAYDYAPNPDKQWILQ---VTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
             TP +    NPDK   +     T K+ P   ++ D    +R+Q LQSVD+ VE ++ +L
Sbjct: 272 PRTPNF----NPDKAGDVHYFASTPKLAPHQVEYLDEYYRRRIQALQSVDELVEALMLKL 327

Query: 183 KDLGE-LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           +     L NTY+ YTSD+GYHLGQ  L  GK    E DI VP L+RGPGI 
Sbjct: 328 EQHRHVLANTYVFYTSDNGYHLGQHRLPPGKCTSMEDDINVPLLVRGPGIA 378


>gi|119181412|ref|XP_001241920.1| hypothetical protein CIMG_05816 [Coccidioides immitis RS]
          Length = 806

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 63/229 (27%), Positives = 108/229 (47%), Gaps = 18/229 (7%)

Query: 21  YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           Y GK +N +        +  GW     L+  + Y  +   M+ R       Y   Y  D+
Sbjct: 349 YVGKLMNAHTIVNYNEPVAAGWTGSDFLLDPTTYDYWGAKMS-RNGAAPVSYAGQYSSDV 407

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP----------QYANMFFNVTSH 126
           +A   + F+  + +    KP  LV +  +PHG     P          ++A++F +    
Sbjct: 408 VAEKVLGFMDDAMKE--DKPFFLVAAPVSPHGQVKPPPLWFDKPEYPARHAHLFKDYKIP 465

Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
            T  ++        W+  + R  Q    ++ D     RL+ LQSVD+ ++ +V++L +  
Sbjct: 466 RTSNFNPKDPSGVGWVASMPRLNQ-TQIEYHDEYQRCRLRALQSVDEMIDSMVEKLAEND 524

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
            L+NTY IYT+D+G+H+ Q  L  GK+  F+ DIR+P ++RGPG+  G 
Sbjct: 525 LLDNTYFIYTTDNGFHISQHRLPPGKTCGFDTDIRIPMIVRGPGVAKGA 573


>gi|336178163|ref|YP_004583538.1| N-acetylglucosamine-6-sulfatase [Frankia symbiont of Datisca
           glomerata]
 gi|334859143|gb|AEH09617.1| N-acetylglucosamine-6-sulfatase [Frankia symbiont of Datisca
           glomerata]
          Length = 452

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 69/218 (31%), Positives = 104/218 (47%), Gaps = 29/218 (13%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           G FGKYLN +N +  PP + EW   ++   Y + + N+NG          N Y  ++I N
Sbjct: 143 GLFGKYLNAWNINDNPPHFEEWA--LLQPSYTDGTFNVNGTV-----QTINGYTTNVIRN 195

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
            ++ FL   K +   +P    ++  APH P   A QYA+          P ++  P+   
Sbjct: 196 RTLSFL--DKAATDTRPWFAYVAPYAPHEPNTPAAQYADTVV-------PTWNGRPS--- 243

Query: 140 QWILQVTRKMQPVHRQFT-------DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
             + +  R  +P + Q           L  ++L+TL SVDDAV+    +L  LG+LNNT 
Sbjct: 244 --VFETDRSDKPGYIQNATNTLADGQALRQRQLRTLLSVDDAVQAFRDKLVALGQLNNTL 301

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           +IY  D+GY     G + GKS P+     VPF +  PG
Sbjct: 302 VIYIPDNGYLWADHGRL-GKSTPYTPAHEVPFYLSWPG 338


>gi|115437586|ref|XP_001217849.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114188664|gb|EAU30364.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 579

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 62/200 (31%), Positives = 101/200 (50%), Gaps = 26/200 (13%)

Query: 49  KYYNYSINMNGRKIKHGE--DYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAP 106
           +YYN  +  NG     GE   Y   Y  D++   +  FL ++     ++P M+ ++  AP
Sbjct: 152 QYYNSMMTRNG-----GEPVSYAGQYITDVMTEKAHGFLDEALAD--ERPWMMTVAPVAP 204

Query: 107 HG---------------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP 151
           H                PE  AP++A+MF +       +++ A +    WI ++      
Sbjct: 205 HSNGTLEIHSGVFWQSQPE-YAPRHAHMFKDYKIPRDASFNAAIDGGVSWIGKLDALNDS 263

Query: 152 VHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
           V  ++ D     RL++LQ+VD+ VE +V  L++ G L+ TYI Y++D+GYH+ Q  L  G
Sbjct: 264 V-IEYHDEYQRCRLRSLQAVDEMVESLVSRLEEAGVLDTTYIFYSTDNGYHISQHRLAPG 322

Query: 212 KSFPFEFDIRVPFLMRGPGI 231
           K   F+ DI +P  +RGPGI
Sbjct: 323 KECGFDTDIHIPLYVRGPGI 342


>gi|156057213|ref|XP_001594530.1| hypothetical protein SS1G_04337 [Sclerotinia sclerotiorum 1980]
 gi|154702123|gb|EDO01862.1| hypothetical protein SS1G_04337 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 492

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 77/258 (29%), Positives = 113/258 (43%), Gaps = 32/258 (12%)

Query: 21  YFGKYLNKYN----GSYIPPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           Y GK  N +      S  P GW     L+     +YYN +   N       E    +Y  
Sbjct: 34  YTGKLFNHHTVDNWNSPHPAGWTSSDFLLDPFTYQYYNSTFQRNEHPPVSHE---GEYST 90

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH---GPEDSAPQYANMFFNV--TSHHTP 129
           D++A  +   L +  QS   KP  LV +  APH   GP+   P    +         H  
Sbjct: 91  DVLAQKAYGLLDEGVQS--GKPFFLVAAPIAPHSNVGPDAFKPGEGILMTTPLPAERHKH 148

Query: 130 AYDYAP-------NPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
            +  A        NPDK     W+  +  ++      + D     RLQTLQ+VD+ +E +
Sbjct: 149 LFPSAKVPRNANFNPDKPSGASWVKHLD-QLTDEAIAYNDEFYRARLQTLQAVDELIEGL 207

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
              L D G L+NTY I+++D+GYH+ Q  L  GK   FE DI +P ++RGPG+  G    
Sbjct: 208 FHRLTDYGILDNTYFIFSTDNGYHISQHRLAPGKECGFEEDINIPLVIRGPGVPQGEETS 267

Query: 239 LDV----LIPQVRKFSSG 252
           +      L P + K ++G
Sbjct: 268 IVTAHTDLAPTILKIANG 285


>gi|303318527|ref|XP_003069263.1| arylsulfatase precursor, putative [Coccidioides posadasii C735
           delta SOWgp]
 gi|240108949|gb|EER27118.1| arylsulfatase precursor, putative [Coccidioides posadasii C735
           delta SOWgp]
          Length = 658

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 62/229 (27%), Positives = 107/229 (46%), Gaps = 18/229 (7%)

Query: 21  YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           Y GK +N +        +  GW      +++   Y+Y      R       Y   Y  D+
Sbjct: 201 YVGKLMNAHTIVNYNEPVAAGWTG-SDFLLDPTTYDYWGAKMSRNGAAPVSYAGQYSSDV 259

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP----------QYANMFFNVTSH 126
           +A   + F+  + +    KP  LV +  +PHG     P          ++A++F +    
Sbjct: 260 VAEKVLGFMDDAMKE--DKPFFLVAAPVSPHGQVKPPPLWFDKPEYPARHAHLFKDYKIP 317

Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
            T  ++        W+  + R  Q    ++ D     RL+ LQSVD+ ++ +V++L +  
Sbjct: 318 RTSNFNPKDPSGVGWVASMPRLNQ-TQIEYHDEYQRCRLRALQSVDEMIDSMVEKLAEND 376

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
            L+NTY IYT+D+G+H+ Q  L  GK+  F+ DIR+P ++RGPG+  G 
Sbjct: 377 LLDNTYFIYTTDNGFHISQHRLPPGKTCGFDTDIRIPMVVRGPGVAKGA 425


>gi|392864834|gb|EAS30557.2| arylsulfatase [Coccidioides immitis RS]
          Length = 584

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 62/229 (27%), Positives = 107/229 (46%), Gaps = 18/229 (7%)

Query: 21  YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           Y GK +N +        +  GW      +++   Y+Y      R       Y   Y  D+
Sbjct: 127 YVGKLMNAHTIVNYNEPVAAGWTG-SDFLLDPTTYDYWGAKMSRNGAAPVSYAGQYSSDV 185

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP----------QYANMFFNVTSH 126
           +A   + F+  + +    KP  LV +  +PHG     P          ++A++F +    
Sbjct: 186 VAEKVLGFMDDAMKE--DKPFFLVAAPVSPHGQVKPPPLWFDKPEYPARHAHLFKDYKIP 243

Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
            T  ++        W+  + R  Q    ++ D     RL+ LQSVD+ ++ +V++L +  
Sbjct: 244 RTSNFNPKDPSGVGWVASMPRLNQ-TQIEYHDEYQRCRLRALQSVDEMIDSMVEKLAEND 302

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
            L+NTY IYT+D+G+H+ Q  L  GK+  F+ DIR+P ++RGPG+  G 
Sbjct: 303 LLDNTYFIYTTDNGFHISQHRLPPGKTCGFDTDIRIPMIVRGPGVAKGA 351


>gi|406858889|gb|EKD11969.1| arylsulfatase [Marssonina brunnea f. sp. 'multigermtubi' MB_m1]
          Length = 576

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 57/187 (30%), Positives = 97/187 (51%), Gaps = 29/187 (15%)

Query: 68  YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH------------------GP 109
           Y   Y  D++A  +   L  + Q+  ++P  L ++  APH                  G 
Sbjct: 193 YEGQYSTDVLAQKAYSLLDHAVQA--EEPFFLTLAPIAPHANIEMNGSALDDGHTLKFGS 250

Query: 110 EDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRL 165
             SA ++ ++F +V    T ++    NPD      W+L + ++    +  + D    +RL
Sbjct: 251 PVSAERHKHLFKDVKVPRTESF----NPDNPSGANWLLGL-KQQNKSNVDYNDHFYRQRL 305

Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
           + LQ+VD+ V+ + + L+  G ++NTYIIY+SD+G+H+GQ  L  GKS  +E DI VP +
Sbjct: 306 RALQAVDELVDGLFERLEAHGIMDNTYIIYSSDNGFHIGQHRLQPGKSCGYEEDINVPLI 365

Query: 226 MRGPGIV 232
           +RGPG+ 
Sbjct: 366 IRGPGVA 372


>gi|320036152|gb|EFW18091.1| arylsulfatase [Coccidioides posadasii str. Silveira]
          Length = 584

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 62/229 (27%), Positives = 107/229 (46%), Gaps = 18/229 (7%)

Query: 21  YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           Y GK +N +        +  GW      +++   Y+Y      R       Y   Y  D+
Sbjct: 127 YVGKLMNAHTIVNYNEPVAAGWTG-SDFLLDPTTYDYWGAKMSRNGAAPVSYAGQYSSDV 185

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP----------QYANMFFNVTSH 126
           +A   + F+  + +    KP  LV +  +PHG     P          ++A++F +    
Sbjct: 186 VAEKVLGFMDDAMKE--DKPFFLVAAPVSPHGQVKPPPLWFDKPEYPARHAHLFKDYKIP 243

Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
            T  ++        W+  + R  Q    ++ D     RL+ LQSVD+ ++ +V++L +  
Sbjct: 244 RTSNFNPKDPSGVGWVASMPRLNQ-TQIEYHDEYQRCRLRALQSVDEMIDSMVEKLAEND 302

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
            L+NTY IYT+D+G+H+ Q  L  GK+  F+ DIR+P ++RGPG+  G 
Sbjct: 303 LLDNTYFIYTTDNGFHISQHRLPPGKTCGFDTDIRIPMVVRGPGVAKGA 351


>gi|326435643|gb|EGD81213.1| hypothetical protein PTSG_11248 [Salpingoeca sp. ATCC 50818]
          Length = 698

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/167 (32%), Positives = 88/167 (52%), Gaps = 4/167 (2%)

Query: 71  DYYPDLIANDSVRFLRQ---SKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHH 127
           +Y  DL+   +  +LR+   + ++ + +P  L  +   PH P   + +YA+   NVT+  
Sbjct: 348 EYATDLLLEKASEWLRRDNVTGKASNGRPFFLYFAPHCPHIPSTPSDKYADACPNVTAPR 407

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
           TP ++Y+     + I +    ++   R   D L   R Q L SVDDA  +++  L+DLG 
Sbjct: 408 TPNFNYSTPLFHELISKQAPFVEEEARAI-DALARLRCQCLLSVDDAHAQLLSVLEDLGV 466

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            NNTY   +SDHGY+LG   L   K   +E  +R+P ++ GPG+  G
Sbjct: 467 ANNTYWFVSSDHGYNLGNHRLPSNKFLLYEHSLRIPLVVHGPGVPAG 513


>gi|321252792|ref|XP_003192520.1| arylsulfatase precursor [Cryptococcus gattii WM276]
 gi|317458989|gb|ADV20733.1| Arylsulfatase precursor, putative [Cryptococcus gattii WM276]
          Length = 604

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 61/187 (32%), Positives = 89/187 (47%), Gaps = 27/187 (14%)

Query: 68  YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED---------------- 111
           +  +Y  DL+   +V +L  + Q    +P  L ++  A H   D                
Sbjct: 177 HAGEYSTDLVHEKAVNYLEDALQE--DRPFYLTVAPIACHSWIDHRRMVDVHQFVADVPA 234

Query: 112 SAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQT 167
           S P++A +F       T  +    NPD+     W+ ++  K+      + D     RL+ 
Sbjct: 235 SHPRHARLFPVEQIERTENW----NPDEPSGVSWVKELP-KLNSTQEAYLDEFFRGRLRA 289

Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
           LQ+VD+ VE IV  L+  GEL+NTYI YT+D+GY LG      GK+  FE DI VP + R
Sbjct: 290 LQAVDEMVEDIVDRLEKAGELDNTYIFYTADNGYALGSHRRQPGKTLGFEEDIHVPLIAR 349

Query: 228 GPGIVPG 234
           GPGI  G
Sbjct: 350 GPGIKKG 356


>gi|345570135|gb|EGX52960.1| hypothetical protein AOL_s00007g296 [Arthrobotrys oligospora ATCC
           24927]
          Length = 594

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 62/191 (32%), Positives = 96/191 (50%), Gaps = 29/191 (15%)

Query: 68  YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED--------------SA 113
           Y+  +  D++ + S   L  +  S  +KP  L ++  APH   +              S 
Sbjct: 197 YHGQHTIDILIDKSNALLDDAIAS--EKPFFLALAPVAPHSNVEGDISDISDIEKVTFSE 254

Query: 114 P----QYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRL 165
           P    ++ ++F +     TP +    NPDK     WI +   ++        D    +RL
Sbjct: 255 PIPLKRHEHLFPDAKVPRTPHF----NPDKPSGVNWIAE-RPQLNKTWVDHNDYYYRQRL 309

Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
           + LQ VD+ V+ I K L++ G+L+NTYI YT+D+G+H+ Q  L  GK   FE DIRVPF+
Sbjct: 310 RALQGVDELVDGIFKRLEENGQLDNTYIFYTADNGFHVSQHRLNPGKECGFEDDIRVPFI 369

Query: 226 MRGPGIVPGTM 236
           +RGPGI  G +
Sbjct: 370 VRGPGIAEGAV 380


>gi|70982853|ref|XP_746954.1| arylsulfatase [Aspergillus fumigatus Af293]
 gi|66844579|gb|EAL84916.1| arylsulfatase, putative [Aspergillus fumigatus Af293]
 gi|159123838|gb|EDP48957.1| arylsulfatase, putative [Aspergillus fumigatus A1163]
          Length = 598

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 68/236 (28%), Positives = 111/236 (47%), Gaps = 26/236 (11%)

Query: 21  YFGKYLNKYNGSY----IPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           Y GK +N +  S      P GW      +++   Y Y  +   R  +  ++Y   Y  D+
Sbjct: 134 YTGKLMNGHTTSNYNSPFPKGWNG-SDFLLDPYTYAYLNSTYQRNREPPKNYAGQYTTDV 192

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--------------GPEDSAP----QYAN 118
           I   ++ FL  +  S   +P  L ++  APH              G   +AP    ++ +
Sbjct: 193 ITEKALGFLNDALSS--DRPFFLAVAPIAPHSNIEPSALGTGGAKGTLMTAPIPAERHKD 250

Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
           +F +     T  ++        W+  +    Q V   + D    +RL+ LQ++D+ V+ +
Sbjct: 251 LFPDAKVPRTANFNPKVPTGGGWVQHLPLANQTV-IDYQDHFYRQRLRALQALDEMVDAL 309

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +  L++ G   NTYIIYT+D+G+H+GQ  L  GKS  +E DI VPF +RGPGI  G
Sbjct: 310 ITRLEESGHAENTYIIYTADNGFHIGQHRLPPGKSSGYEEDIHVPFYIRGPGIPRG 365


>gi|269839628|ref|YP_003324320.1| sulfatase [Thermobaculum terrenum ATCC BAA-798]
 gi|269791358|gb|ACZ43498.1| sulfatase [Thermobaculum terrenum ATCC BAA-798]
          Length = 477

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 65/234 (27%), Positives = 107/234 (45%), Gaps = 31/234 (13%)

Query: 22  FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDS 81
           FGKYLN ++    PP +  WG  I    + N   N  GR ++    Y  D+    +A+D+
Sbjct: 130 FGKYLNYWDVYRAPPYFDRWG--IQEPVHVNALFNEQGR-VERVPGYSTDH----VADDA 182

Query: 82  VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANM---------------------F 120
           V F+R+   + + +P  L ++   PH P     +YA+                      F
Sbjct: 183 VEFIRERAAAGNGRPFFLYVAPHVPHAPFQPPERYADAPVPPFRGDPATCERDISDKPPF 242

Query: 121 FNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
                       Y   PD +  ++ +   +   R FT ++ T++ + L + DD V +++ 
Sbjct: 243 VRRLERTHRVEHYQLPPDDRGRMRWSSAQR--QRYFTQIIPTQQERMLMAADDLVNRVMS 300

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            L   G   NT +IY SD+G+  G+ GLV  KS+P+   ++VP +MR PG VP 
Sbjct: 301 TLHSTGLDRNTLVIYISDNGFLWGEHGLVS-KSWPYTQSVQVPMMMRWPGHVPA 353


>gi|119488189|ref|XP_001262637.1| arylsulfatase [Neosartorya fischeri NRRL 181]
 gi|119410795|gb|EAW20740.1| arylsulfatase [Neosartorya fischeri NRRL 181]
          Length = 575

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 58/184 (31%), Positives = 88/184 (47%), Gaps = 29/184 (15%)

Query: 77  IANDSVRFLRQSKQSF------SKKPIMLVMSFPAPHG-------------------PED 111
           +   +   LR+    F       K+P  L ++  APH                    PED
Sbjct: 180 LGRHTTEVLREKAMGFLEDALDGKRPFFLAVTPIAPHSNMNGTYGGGAGPMWMDEPIPED 239

Query: 112 SAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSV 171
              ++  +F +     TP ++        WI  +  + Q V   + D    +RL+ LQ V
Sbjct: 240 ---RHKRLFPDAKVPRTPNFNPKEPTGVSWIHDLPYRNQTV-VDYNDHYYRQRLRALQGV 295

Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           D+ V++++  L+   +L NTYIIYTSD+G+H+GQ  L  GK+  F+ DIRVPF +RGPGI
Sbjct: 296 DELVDQLITRLEQSDKLENTYIIYTSDNGFHVGQHRLPPGKTCGFDEDIRVPFFIRGPGI 355

Query: 232 VPGT 235
             G 
Sbjct: 356 PEGA 359


>gi|323455238|gb|EGB11107.1| hypothetical protein AURANDRAFT_62077 [Aureococcus anophagefferens]
          Length = 537

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 65/229 (28%), Positives = 104/229 (45%), Gaps = 25/229 (10%)

Query: 22  FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGE--DYYNDYYPDLIAN 79
            GKY+N     Y+P GW  + A      +    +  N   I  G      + Y   ++ N
Sbjct: 130 LGKYVNAMPEGYVPQGWSAYLANGGGDYFGPEFVARNAPGIPDGAYAAPKDAYTVSVVGN 189

Query: 80  DSVRFLRQSKQSFSK-KPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
            SV FL   ++  SK +P++L  +  A H P +  P YA+       H  P++   P   
Sbjct: 190 ASVAFL---ERHLSKSEPVLLYCAPKAVHEPFNPPPWYAD-------HWDPSWPAKPVRG 239

Query: 139 KQW--ILQVTRK------MQP----VHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
             W       R+       QP       +  D +   R ++L +VDD VE +V  +   G
Sbjct: 240 PAWNATFDARRRHAANVPTQPMITAAAAEVVDGVYRNRWRSLMAVDDLVEALVTTVDARG 299

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
             + TY ++TSDHG+ LGQ  ++  K   +++D RVP +++GPG+  GT
Sbjct: 300 ATDRTYFVFTSDHGFQLGQLNMLMDKRHVYDWDTRVPLVIKGPGVGRGT 348


>gi|159489062|ref|XP_001702516.1| arylsulfatase [Chlamydomonas reinhardtii]
 gi|158280538|gb|EDP06295.1| arylsulfatase [Chlamydomonas reinhardtii]
          Length = 705

 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 72/252 (28%), Positives = 123/252 (48%), Gaps = 53/252 (21%)

Query: 22  FGKYLNKYN---GSYIPPGWREWGALIMNSKYY----NYSINMNGRKIKHGEDYYNDYYP 74
            GK+LN +    GS  P G+    AL+    Y      +S N  G   K+G+   +DY  
Sbjct: 163 IGKFLNGFAVGPGSVCPRGFTVLDALVDPYVYSYFGPGFSKNC-GTVQKYGD---SDYST 218

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG-PEDS---------------APQYAN 118
           D+I + ++ +LR++  +   +P  + ++  APH   +DS               AP++A 
Sbjct: 219 DVIRDKAIGYLREAVAA--GQPFYMQVNPIAPHERCKDSEEGGGGRLACDNAIPAPRHAY 276

Query: 119 MFFNVT----------------SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMT 162
           +F N +                +    + D   +  +   +Q + ++   +RQ       
Sbjct: 277 LFRNASLPRSANWMTPLPESFRAAVAASTDSVRSGKELGAVQSSLQLNTAYRQ------- 329

Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
            RL+ + SVD+ +  IV+EL   G L++T+II+TSD+GYH+G     +GK+ P+E D+RV
Sbjct: 330 -RLRAMASVDEMLGDIVQELSAQGVLDSTHIIFTSDNGYHMGNHNFGQGKTLPYEEDVRV 388

Query: 223 PFLMRGPGIVPG 234
           PF +RGPG+  G
Sbjct: 389 PFFIRGPGLPRG 400


>gi|169598524|ref|XP_001792685.1| hypothetical protein SNOG_02066 [Phaeosphaeria nodorum SN15]
 gi|160704411|gb|EAT90278.2| hypothetical protein SNOG_02066 [Phaeosphaeria nodorum SN15]
          Length = 577

 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 69/233 (29%), Positives = 108/233 (46%), Gaps = 29/233 (12%)

Query: 21  YFGKYLNKYNGSY----IPPGWREWGALIMNSKY--YNYSINMNGRKIKHGEDYYNDYYP 74
           Y GK +N ++ +     +P GW     L+    Y  +N +   +          YN    
Sbjct: 140 YTGKLMNGHSTTTWNKPLPAGWNGTDFLVDPGTYIYWNATFQKDQAPPAPAPGQYNT--- 196

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS---------APQYANMFFNVTS 125
           DL+    + F+     +   KP  + ++   PH   +          A ++  +F +  +
Sbjct: 197 DLVKEKGLEFIDTVANT--SKPFFIGIAPIGPHAEFNGTGGFTKAVGAKRHQKLFLDAKA 254

Query: 126 HHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
             T  +    NPDK     WI ++  K+        D     RLQ+LQSVD+ V+++V  
Sbjct: 255 PRTSNW----NPDKPSGASWISKL-EKLNGTVIASHDEWHVGRLQSLQSVDELVDEVVNR 309

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           L+    L+NTYII+TSD+GYH+GQ  L  GK+  FE DI VPF +RGP +  G
Sbjct: 310 LEKYKLLDNTYIIFTSDNGYHIGQHRLQPGKTCAFEEDINVPFYVRGPNVPKG 362


>gi|171693357|ref|XP_001911603.1| hypothetical protein [Podospora anserina S mat+]
 gi|170946627|emb|CAP73430.1| unnamed protein product [Podospora anserina S mat+]
          Length = 577

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 56/174 (32%), Positives = 92/174 (52%), Gaps = 15/174 (8%)

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-------------GPEDSAPQYANMFFN 122
           LI+  ++ FL  +  S   KP  + ++  APH             GP     ++A++F +
Sbjct: 180 LISEKALGFLDDAI-SNPDKPFFIGIAPIAPHSNMYIDENDVRHWGPPIPLDKHAHLFPD 238

Query: 123 VTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
                TP ++        W+  + RK      ++ D    +RL++LQSVD+ V+ ++K L
Sbjct: 239 AQVPRTPNFNPDVESGGSWVRGL-RKQDDDVVEYNDHYYRERLRSLQSVDELVDDVIKRL 297

Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           +    L+NTYIIYTSD+G+H+GQ  +  GK   +E DI VP ++RGPG+  G +
Sbjct: 298 EAAKLLDNTYIIYTSDNGFHIGQHRMQPGKMCGYEEDINVPLIIRGPGVPKGEV 351


>gi|241601619|ref|XP_002405037.1| sulfatase, putative [Ixodes scapularis]
 gi|215500549|gb|EEC10043.1| sulfatase, putative [Ixodes scapularis]
          Length = 145

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 59/159 (37%), Positives = 86/159 (54%), Gaps = 21/159 (13%)

Query: 77  IANDSVRFLR-QSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
           ++  +  FLR ++ QS   KP  + +S PAPH P   A ++ N +  V +  TP+++ + 
Sbjct: 1   MSEKAANFLRPRTNQS---KPFFMFLSPPAPHQPFTPAVRHRNAYNGVQAPRTPSFNVSS 57

Query: 136 NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
           N        V+R   P  R+          +TL +VDD V ++V  L     L +TY+ +
Sbjct: 58  N-------AVSR---PYDRRIP-------WRTLLAVDDLVSEVVGILASKKLLASTYVFF 100

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           TSD+GYHLGQF   K K  P+E DI VP L+RGPGI  G
Sbjct: 101 TSDNGYHLGQFSQPKDKRQPYETDIHVPLLVRGPGIPAG 139


>gi|159489004|ref|XP_001702487.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280509|gb|EDP06266.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 825

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 66/222 (29%), Positives = 103/222 (46%), Gaps = 36/222 (16%)

Query: 21  YFGKYLNKYNGS----YIPPGWREWGALIMNSKYYNY----SINMNGRKIKHGEDYYNDY 72
           + GK+LN  + +      P GW  W AL+    Y  Y    S+N    ++       N Y
Sbjct: 133 HVGKFLNAMDPTDSRFRCPRGWSTWDALVEPYVYMYYTPAFSLNCGPTQVLE-----NQY 187

Query: 73  YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD 132
             D+I+ D   ++                 FP P      A ++  +F +      P ++
Sbjct: 188 STDVIS-DKCDYINARGTCV----------FPIP------AERHRTLFSHAVLPMNPNFN 230

Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
            AP  +   + ++T     V + F       R++ L +VD+ + ++V  L +LG L+NTY
Sbjct: 231 VAPPAELGLVNEMTSDTG-VQKHFL-----ARIRALAAVDEMIGRMVNTLTELGVLDNTY 284

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           I+YTSD+G+ LG     +GK F +E   RVPF MRGPGI PG
Sbjct: 285 ILYTSDNGFQLGNHAQKQGKQFHWEEITRVPFYMRGPGIPPG 326


>gi|407918049|gb|EKG11340.1| Sulfatase, partial [Macrophomina phaseolina MS6]
          Length = 431

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 73/234 (31%), Positives = 106/234 (45%), Gaps = 28/234 (11%)

Query: 21  YFGKY---LNKYNGSYIPPGWREWGALIMNS--KYYNYSINMNGRKIKHGEDYYNDYYPD 75
           Y GK    LN  N S  P GW     LI      Y N  ++MNG +    E Y+      
Sbjct: 95  YIGKLFNGLNLANYSPKPKGWDHVDVLIDPYIYSYNNVVMSMNGDRPIWYEGYHQTDV-- 152

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP----QYANMFFNVTSHHTPAY 131
           L A    R      Q   + P  L ++  +PH      P    ++A+ F N T       
Sbjct: 153 LRAKGLARLEHLISQ---EDPFYLTIAPASPHVEGSDLPVPLQRHAHDFPNATVPKPKGG 209

Query: 132 DYAPNPD-----KQWILQVT----RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
           ++ PN         W+  +T     ++   H  +       RL+ L+ +D+ V  +VK+L
Sbjct: 210 NFNPNDSFTEQKVSWLRNLTLLDDEQVNRTHEHYR-----HRLRALKGLDEIVHDVVKKL 264

Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
            + G LNNTY+IYT+D+GYHLG   +  GKS P+  D  +P  +RGPGI PG +
Sbjct: 265 NETGALNNTYVIYTTDNGYHLGNHRVPAGKSLPYIEDTNIPMAIRGPGIPPGKI 318


>gi|302892713|ref|XP_003045238.1| hypothetical protein NECHADRAFT_94512 [Nectria haematococca mpVI
           77-13-4]
 gi|256726163|gb|EEU39525.1| hypothetical protein NECHADRAFT_94512 [Nectria haematococca mpVI
           77-13-4]
          Length = 600

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 70/250 (28%), Positives = 116/250 (46%), Gaps = 41/250 (16%)

Query: 21  YFGKYLNKYN-GSYIPPGWREWGA--LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           Y GK  N +   +Y  P  + W +   +++   Y+Y      R  +  +++  +Y  DL+
Sbjct: 134 YTGKMFNAHTVWNYDSPHLKGWNSSDFLLDPNTYDYWNATFQRNHEKPKNHLGEYSTDLV 193

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS------------------------- 112
           A+ +  FL  +  +   +P  L ++  APH   +S                         
Sbjct: 194 ASKAYGFLDDAVAA--GEPFFLGIAPIAPHSNVNSTILNPVKGPDDDVPVSDDLFRVNFP 251

Query: 113 --APQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQ 166
             A ++ ++F +     T  +    NPDK     WI +   K+     +F D     RLQ
Sbjct: 252 KAAKRHEHLFPDAKVPRTENF----NPDKPSGGDWIRR-QPKLTDEMVEFNDRHYRGRLQ 306

Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
            LQ+VD+ VE +V+ L+  G L+NTYI YT+D+GYH+ Q  L  GK   +E DI +P ++
Sbjct: 307 ALQAVDELVEGLVERLEKHGILDNTYIFYTTDNGYHISQHRLQPGKECSYEEDINIPLIV 366

Query: 227 RGPGIVPGTM 236
           RGPG+  G +
Sbjct: 367 RGPGVPKGGI 376


>gi|336260685|ref|XP_003345136.1| hypothetical protein SMAC_07425 [Sordaria macrospora k-hell]
 gi|380096516|emb|CCC06564.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 618

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 47/125 (37%), Positives = 74/125 (59%), Gaps = 2/125 (1%)

Query: 113 APQYANMFFNVTSHHTPAYDYAPNPDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSV 171
           A ++A++F +V    TP ++ +       WI Q+  +    +  F D    +RL+TLQSV
Sbjct: 256 AARHAHLFPDVVVPRTPHFNPSSRASGVSWISQLPVQSAE-NVAFNDHFYRQRLRTLQSV 314

Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           D+ V  +++ L+  G L +TY+ YT+D+GYH+GQ  L  GK   FE DI VP ++RGPG+
Sbjct: 315 DELVAGVIERLERHGLLEDTYVFYTTDNGYHIGQHRLQPGKECGFEEDINVPLIVRGPGV 374

Query: 232 VPGTM 236
             G +
Sbjct: 375 KKGAV 379


>gi|391870792|gb|EIT79965.1| arylsulfatase [Aspergillus oryzae 3.042]
          Length = 586

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 41/81 (50%), Positives = 56/81 (69%)

Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
           + D    +RL+ LQ VD+ V+ +V  L+   +L+NTYIIYTSD+G+H+GQ  L  GK+  
Sbjct: 290 YNDHYYRQRLRALQGVDELVDSLVTRLEQSDKLDNTYIIYTSDNGFHIGQHRLPPGKTCG 349

Query: 216 FEFDIRVPFLMRGPGIVPGTM 236
           FE DIRVPF +RGPGI  G +
Sbjct: 350 FEEDIRVPFFIRGPGIPEGAV 370


>gi|350636581|gb|EHA24941.1| hypothetical protein ASPNIDRAFT_131173 [Aspergillus niger ATCC
           1015]
          Length = 566

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 42/79 (53%), Positives = 53/79 (67%)

Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
           + D    +RL+ LQ VD+ VE+IV  L D G L+NTYI YTSD+GYH+GQ  L  GK   
Sbjct: 267 YNDEFYRERLRALQGVDELVEEIVLRLDDAGILDNTYIFYTSDNGYHIGQHRLHPGKECG 326

Query: 216 FEFDIRVPFLMRGPGIVPG 234
           F+ DIRVP  +RGPG+  G
Sbjct: 327 FDEDIRVPMFVRGPGVPKG 345


>gi|395752419|ref|XP_002830441.2| PREDICTED: LOW QUALITY PROTEIN: extracellular sulfatase Sulf-2
           [Pongo abelii]
          Length = 974

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 37/71 (52%), Positives = 48/71 (67%), Gaps = 4/71 (5%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 273 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 332

Query: 58  NGRKIKHGEDY 68
           NG K KHG DY
Sbjct: 333 NGVKEKHGSDY 343


>gi|169776738|ref|XP_001822835.1| arylsulfatase [Aspergillus oryzae RIB40]
 gi|83771571|dbj|BAE61702.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 586

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 41/81 (50%), Positives = 56/81 (69%)

Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
           + D    +RL+ LQ VD+ V+ +V  L+   +L+NTYIIYTSD+G+H+GQ  L  GK+  
Sbjct: 290 YNDHYYRQRLRALQGVDELVDSLVTRLEQSDKLDNTYIIYTSDNGFHIGQHRLPPGKTCG 349

Query: 216 FEFDIRVPFLMRGPGIVPGTM 236
           FE DIRVPF +RGPGI  G +
Sbjct: 350 FEEDIRVPFFIRGPGIPEGAV 370


>gi|164425630|ref|XP_959804.2| hypothetical protein NCU06041 [Neurospora crassa OR74A]
 gi|157071002|gb|EAA30568.2| hypothetical protein NCU06041 [Neurospora crassa OR74A]
          Length = 615

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 71/261 (27%), Positives = 111/261 (42%), Gaps = 48/261 (18%)

Query: 21  YFGKYLNK-----YNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           Y GK  N      Y+  YI  GW      +++   Y+Y      R       Y   Y  D
Sbjct: 143 YTGKLFNAHTVDNYDAPYIA-GWN-GSDFLLDPYTYSYLNATFQRNRDPPISYEGQYSVD 200

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG--------------------------- 108
           ++A  +  FL ++ ++   +P  L ++  APH                            
Sbjct: 201 VLAEKAYGFLDEAAKNVHNRPFFLGIAPIAPHSNVEPRFPSSSSSSSSSDSATLHRRPTN 260

Query: 109 ------------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDK-QWILQVTRKMQPVHRQ 155
                       P   A ++A++F +V    TP ++ +       WI ++  +    +  
Sbjct: 261 EHDDIEKSVSFTPPIPAARHAHLFPDVIVPRTPHFNPSSRASGVSWIARLPHQSAE-NVA 319

Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
           F D    +RL+ LQSVD+ V  +V  L+  G L NTY+ YT+D+GYH+GQ  L  GK   
Sbjct: 320 FNDHFYRQRLRALQSVDELVAGVVARLERHGLLENTYVFYTADNGYHIGQHRLQPGKECG 379

Query: 216 FEFDIRVPFLMRGPGIVPGTM 236
           FE DI VP ++RGPG+  G +
Sbjct: 380 FEEDINVPLIVRGPGVKKGEV 400


>gi|448735634|ref|ZP_21717827.1| N-acetylglucosamine-6-sulfatase [Halococcus salifodinae DSM 8989]
 gi|445797667|gb|EMA48128.1| N-acetylglucosamine-6-sulfatase [Halococcus salifodinae DSM 8989]
          Length = 515

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 63/233 (27%), Positives = 115/233 (49%), Gaps = 27/233 (11%)

Query: 20  GYFGKYLNKY-----NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           GY   Y+ K+     N +   PG+  W +      Y++ ++N+NG+K++        Y  
Sbjct: 143 GYETAYIGKWHTYQLNSAEPRPGFDHWVSFEGQGHYFDQTLNVNGKKVER-----KGYIT 197

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY--- 131
           D++   +  +L+  +     KP  L +S  APH     AP++ N + N +  +       
Sbjct: 198 DILTEYARDWLKNRE---GDKPFFLFLSHKAPHAWFCPAPRHRNQYANASIEYPKTMADT 254

Query: 132 --DYAPNPDKQWILQVTRKMQPVHRQFT-----DLLMTKRLQTLQSVDDAVEKIVKELKD 184
             +Y   PD  W+      ++ V+  F      D L  +  +TL ++D+++  ++ +L +
Sbjct: 255 EENYTGKPD--WVQNQRESVRGVNYIFGGRFDFDELYRRYSETLLALDESIGAVMDQLNE 312

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGTM 236
            G  ++T ++Y SD+GY LG+ GL+ GK   +E  IRVP L   PG++ PGT+
Sbjct: 313 SGVADSTLMLYMSDNGYSLGEHGLI-GKQTAYEPSIRVPMLAYAPGMIEPGTV 364


>gi|145251529|ref|XP_001397278.1| arylsulfatase [Aspergillus niger CBS 513.88]
 gi|134082813|emb|CAK42644.1| unnamed protein product [Aspergillus niger]
          Length = 594

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 42/79 (53%), Positives = 53/79 (67%)

Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
           + D     RL++LQ VD+ VE+IV  L D G L+NTYI YTSD+GYH+GQ  L  GK   
Sbjct: 295 YNDEFYRDRLRSLQGVDELVEEIVLRLDDAGILDNTYIFYTSDNGYHIGQHRLHPGKECG 354

Query: 216 FEFDIRVPFLMRGPGIVPG 234
           F+ DIRVP  +RGPG+  G
Sbjct: 355 FDEDIRVPMFVRGPGVPKG 373


>gi|336467297|gb|EGO55461.1| arylsulfatase [Neurospora tetrasperma FGSC 2508]
 gi|350288073|gb|EGZ69309.1| arylsulfatase [Neurospora tetrasperma FGSC 2509]
          Length = 635

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 67/254 (26%), Positives = 109/254 (42%), Gaps = 49/254 (19%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           Y+  Y+  +NGS            +++   Y+Y      R       Y   Y  D++A  
Sbjct: 156 YYSPYVAGWNGS----------DFLLDPYTYSYLNATFQRNRDPPASYEGQYSVDVLAKK 205

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPHG-------------------------------- 108
           +  FL ++ ++   +P  L ++  APH                                 
Sbjct: 206 AYGFLDKAAKNVHNRPFFLGIAPIAPHSNVEPRFSSSSSSSSDPATLQRRPTKEHDDIEK 265

Query: 109 -----PEDSAPQYANMFFNVTSHHTPAYDYAPNPDK-QWILQVTRKMQPVHRQFTDLLMT 162
                P   A ++A++F +V    TP ++ +       WI ++  +    +    D    
Sbjct: 266 SVSFTPPIPAARHAHLFPDVIVPRTPHFNPSSRASGVSWIARLPHQSAE-NVACNDHFYR 324

Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
           +RL+TLQSVD+ V  +V  L+  G L NTY+ YT+D+GYH+GQ  L  GK   FE DI V
Sbjct: 325 QRLRTLQSVDELVAGVVARLERHGLLENTYVFYTADNGYHIGQHRLQPGKECGFEEDINV 384

Query: 223 PFLMRGPGIVPGTM 236
           P ++RGPG+  G +
Sbjct: 385 PLIVRGPGVKRGEV 398


>gi|301311577|ref|ZP_07217504.1| putative N-acetylglucosamine-6-sulfatase [Bacteroides sp. 20_3]
 gi|423340655|ref|ZP_17318393.1| hypothetical protein HMPREF1059_04318 [Parabacteroides distasonis
           CL09T03C24]
 gi|300830663|gb|EFK61306.1| putative N-acetylglucosamine-6-sulfatase [Bacteroides sp. 20_3]
 gi|409226897|gb|EKN19800.1| hypothetical protein HMPREF1059_04318 [Parabacteroides distasonis
           CL09T03C24]
          Length = 515

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 63/244 (25%), Positives = 114/244 (46%), Gaps = 35/244 (14%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGK+    +     PG+  W       +YYN  IN NG  I++ +   + Y  DL+ +
Sbjct: 131 AFFGKWHMGNDSGAPQPGFDHWEGFKGQGEYYNPRINTNGEWIQYKD---STYVTDLLTD 187

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-----SHHTPAYDYA 134
            +++F+++ KQ+   KP  + +S    H    +A ++ + + N       + +TP Y   
Sbjct: 188 HAIQFMKEQKQA--DKPFFVYLSHKGVHDNFSAAKRHKDCYKNKELVLPPNFNTPHYGIK 245

Query: 135 --PNPDKQ------------------WILQVTRKMQPV----HRQFTDLLMTKRLQTLQS 170
             P  DK+                  W+         V    H +  D+ + K  +TL+S
Sbjct: 246 ELPTIDKKTGKAAFGKEYYGEKMAPDWVKSQRESWHGVDYSYHGRPWDVQVRKYCETLRS 305

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           VD+++  ++  LK+ G  +NT +IY  D+G+  G+ GL+  + F +E  +RVP L+R P 
Sbjct: 306 VDESIGSVLDYLKEAGLDDNTLVIYMGDNGFAWGEHGLIDKRQF-YEESVRVPMLVRCPS 364

Query: 231 IVPG 234
           +  G
Sbjct: 365 MFEG 368


>gi|115433238|ref|XP_001216756.1| hypothetical protein ATEG_08135 [Aspergillus terreus NIH2624]
 gi|114189608|gb|EAU31308.1| hypothetical protein ATEG_08135 [Aspergillus terreus NIH2624]
          Length = 594

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 56/197 (28%), Positives = 98/197 (49%), Gaps = 19/197 (9%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
           +Y++  +  NG K      Y  +Y  D++   +  FL  +  +   +P ML ++  APH 
Sbjct: 153 RYFHAVMTRNGGKPV---SYAGNYSTDVLREKAEGFLDDALAN-PDRPWMLTVAPNAPHS 208

Query: 108 -------------GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
                        G  + AP++A++F + T     +++        W+  ++   Q    
Sbjct: 209 NGSHDPSRDVTWYGEPEYAPRHAHLFKDYTIPRDKSFNKVIEGAVGWVKDLSELSQK-EV 267

Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
            + D     RL+ L++VD+ V  +V +L   G L+NTYI Y++D+GYH GQ  +  GK+ 
Sbjct: 268 DYIDEFQRCRLRALRAVDEMVGALVDKLDKAGVLDNTYIFYSTDNGYHAGQHAMQPGKNC 327

Query: 215 PFEFDIRVPFLMRGPGI 231
            ++ DI VP ++RGPGI
Sbjct: 328 GYDTDIHVPLIVRGPGI 344


>gi|384496845|gb|EIE87336.1| hypothetical protein RO3G_12047 [Rhizopus delemar RA 99-880]
          Length = 378

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 66/234 (28%), Positives = 112/234 (47%), Gaps = 26/234 (11%)

Query: 21  YFGKYLNKY---NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           Y GK +N+Y   N ++  P   ++   +++   Y Y   +  +  +    Y N Y  D+I
Sbjct: 58  YIGKLMNQYSVNNYNHPTPKGFDYQDQLVDPYTYLYHTAVFSKDGQDPVYYSNQYQTDVI 117

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-------------GPEDSAPQYANMFFNVT 124
              +   L++ + +    P  L ++  APH             GP   A ++A++F +V 
Sbjct: 118 HAKAKAALKRLENA--TDPFFLWVAPMAPHAQFTIHANGSLSTGPAVPAARHAHLFQDVK 175

Query: 125 SHHTPAYDYAPNPDKQW----ILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
               P +    NP +Q       +   ++ P      D     RL+ LQ+VD+ V  +++
Sbjct: 176 IPRHPHF----NPKRQTKTASFWKHLDRLSPDEVDGLDEAYRNRLRALQAVDEMVGSLME 231

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           EL+  G L NT++IY++D+GYHLGQ     GK+   E DI VP ++RGPG+  G
Sbjct: 232 ELERQGRLENTFVIYSADNGYHLGQHRSFPGKTSNLEEDINVPLIVRGPGVAQG 285


>gi|2873363|gb|AAC02716.1| arylsulfatase [Neurospora crassa]
          Length = 639

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 58/209 (27%), Positives = 94/209 (44%), Gaps = 41/209 (19%)

Query: 68  YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------------- 108
           Y   Y  D++A  +  FL ++ ++   +P  L ++  APH                    
Sbjct: 193 YEGQYSVDVLAEKAYGFLDEAAKNVHNRPFFLGIAPIAPHSNVEPGFPSSSSSSSSSDSA 252

Query: 109 --------------------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDK-QWILQVTR 147
                               P   A ++A++F +V    TP ++ +       WI ++  
Sbjct: 253 TLHRRPTNEHDDIEKSVSFTPPIPAARHAHLFPDVIVPRTPHFNPSSRASGVSWIARLPH 312

Query: 148 KMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG 207
           +    +  F D    +RL+ L+SVD+ V  +V  L+  G L NTY+ YT+D+GYH+GQ  
Sbjct: 313 QSAE-NVAFNDHFYRQRLRALESVDELVAGVVARLERHGLLENTYVFYTADNGYHIGQHR 371

Query: 208 LVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           L  GK   FE DI VP ++RGPG+  G +
Sbjct: 372 LQPGKECGFEEDINVPLIVRGPGVKKGEV 400


>gi|238502181|ref|XP_002382324.1| gloeobacteria sulfatase, putative [Aspergillus flavus NRRL3357]
 gi|220691134|gb|EED47482.1| gloeobacteria sulfatase, putative [Aspergillus flavus NRRL3357]
          Length = 588

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 56/81 (69%)

Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
           + D    +RL+ LQ VD+ V+ +V  L+   +L+NTYIIYTSD+G+H+GQ  L  GK+  
Sbjct: 290 YNDHYYRQRLRALQGVDELVDSLVTRLEQSDKLDNTYIIYTSDNGFHIGQHRLPPGKTCG 349

Query: 216 FEFDIRVPFLMRGPGIVPGTM 236
           F+ DIRVPF +RGPGI  G +
Sbjct: 350 FDEDIRVPFFIRGPGIPEGAV 370


>gi|6822089|emb|CAB70953.1| Arylsulfatase [Colletotrichum gloeosporioides f. sp. malvae]
          Length = 511

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 52/150 (34%), Positives = 86/150 (57%), Gaps = 13/150 (8%)

Query: 113 APQYANMFFNVTSHHTPAYDYAPNPDKQ----WILQVTRKMQPVHRQFTDLLMTKRLQTL 168
           A ++A++F +V    T  +    NPDK+    W+ +  R+    +  F D     RL+ L
Sbjct: 167 AARHAHLFPDVVVPRTENF----NPDKESGANWVREQPRQSDE-NVAFNDHFYRSRLRAL 221

Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           Q+VD+ VE +V +L++ G L +TYI YT+D+G+H+GQ     GK   +E DI +P ++RG
Sbjct: 222 QAVDELVESVVLKLEEYGILEDTYIFYTTDNGFHIGQHRPQPGKECGYEEDINIPLIVRG 281

Query: 229 PGIVPG-TMYLLDV---LIPQVRKFSSGSL 254
           P +  G T+ ++ V   L+P +   +SG L
Sbjct: 282 PRVPEGKTVEIVTVHQDLVPTILGLASGPL 311


>gi|402075556|gb|EJT71027.1| hypothetical protein GGTG_12048 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 620

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 71/224 (31%), Positives = 114/224 (50%), Gaps = 19/224 (8%)

Query: 21  YFGKYLNKYN--GSYI-PPGWREWGALIMN-SKYYNYSI-NMNGRKIKHGEDYYNDYYPD 75
           Y GK LN YN   ++I P G+    AL+   + YYN  + + NG +  + + +++     
Sbjct: 123 YIGKLLNGYNVVNAHIRPKGFDHIDALLDPYTTYYNVPVMSQNGERPVYYKGFHSTDVVR 182

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED---SAP--QYANMFFNVTSHHTPA 130
           + A D ++FL    Q     P  L ++  +PH   +   +AP  ++ ++F +  +   P 
Sbjct: 183 IKALDRLQFLLSQDQ-----PYYLQIAPYSPHIQNNLNAAAPLARHMDLFPDAKAPRLPN 237

Query: 131 YDYAPN---PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
           Y+ A         W+  +   M    +Q TD    +R ++LQ VD+ +E +V  L   G 
Sbjct: 238 YNPADEFQAGKGSWVGGLPL-MNETTQQRTDYAYRRRAESLQGVDEIIEDVVAMLAAKGT 296

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           L+ TYIIYTSD+GYHLGQ  +  GK+  F  D  +P  +RGPGI
Sbjct: 297 LDTTYIIYTSDNGYHLGQHRIPAGKALFFGEDTNLPMAVRGPGI 340


>gi|325110459|ref|YP_004271527.1| sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324970727|gb|ADY61505.1| sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 501

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/191 (29%), Positives = 93/191 (48%), Gaps = 29/191 (15%)

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
           A   + +L + +Q   + P ++   F  PH   D  P+  + + +V  +H       P  
Sbjct: 182 AEQVLNYLNEREQQADEDPFLIYFGFSHPHDTRDGKPELLDKYGSV--NHKDKSSLPPAN 239

Query: 138 DKQWILQVT-RKMQPVHRQFTDLL--------------------MTKRLQTLQSVDDAVE 176
           DKQ  L     +  P H    +L                     + +     +++D  ++
Sbjct: 240 DKQPELPPNYLEAHPFHHGHPNLRDEVAVSGVWERRDERTIRNELGREFACSENIDIQLD 299

Query: 177 KIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI----- 231
           +++K L+++GEL+NTY+ YTSDHG  +G+ GL +GK   +E   RVPF+++GPGI     
Sbjct: 300 RVLKRLEEMGELDNTYVFYTSDHGMAIGRHGL-QGKQNLYEHTWRVPFIVKGPGIEAGSR 358

Query: 232 VPGTMYLLDVL 242
           V G +YLLDVL
Sbjct: 359 VQGNIYLLDVL 369


>gi|350630052|gb|EHA18425.1| hypothetical protein ASPNIDRAFT_176469 [Aspergillus niger ATCC
           1015]
          Length = 561

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 60/94 (63%), Gaps = 1/94 (1%)

Query: 141 WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
           WI  +  + Q V   + D    +RL+ LQ VD+ V+ ++  L+   E++NTYIIYTSD+G
Sbjct: 253 WIHNLPYRNQSV-IDYNDHYYRQRLRALQGVDELVDSLIARLEASSEIDNTYIIYTSDNG 311

Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +H+ Q  L  GK+  F+ DIRVPF +RGPGI  G
Sbjct: 312 FHISQHRLPPGKTCAFDEDIRVPFFIRGPGIPEG 345


>gi|336178164|ref|YP_004583539.1| N-acetylglucosamine-6-sulfatase [Frankia symbiont of Datisca
           glomerata]
 gi|334859144|gb|AEH09618.1| N-acetylglucosamine-6-sulfatase [Frankia symbiont of Datisca
           glomerata]
          Length = 475

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 67/227 (29%), Positives = 110/227 (48%), Gaps = 30/227 (13%)

Query: 19  HGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIA 78
           +G+FGKYLN +N +  PP + ++ AL+    Y +   N+NG          + Y   +I 
Sbjct: 137 NGFFGKYLNAWNVADAPPHFEDF-ALLDPVVYNDGQYNVNGTI-----QTISGYSTHVIR 190

Query: 79  NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
           + +  F++  K +   +P  L ++  APHGP    PQYAN          P +D  P+  
Sbjct: 191 DFAKGFIQ--KGTTDTRPWFLYIAPKAPHGPNTPEPQYANTVV-------PTWDGRPS-- 239

Query: 139 KQWILQVTRKMQPVHRQFTDLLMT-------KRLQTLQSVDDAVEKIVKELKDLGELNNT 191
              + +  +  +P + Q  +  +         +L+TL SVDDAV+ +  +L   G+L+NT
Sbjct: 240 ---VFETDKSDKPDYIQSANATLADGRAIREAQLRTLLSVDDAVQAVHDKLVATGQLDNT 296

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP--GIVPGTM 236
            +IY  D+G+     G  K KS P+   + VPF +  P  G+  GT 
Sbjct: 297 LVIYIGDNGFTWADHGWTK-KSVPYLPSVEVPFYLSWPAGGLGAGTA 342


>gi|255951092|ref|XP_002566313.1| Pc22g24250 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211593330|emb|CAP99713.1| Pc22g24250 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 550

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 41/91 (45%), Positives = 61/91 (67%), Gaps = 1/91 (1%)

Query: 141 WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
           WI  +  + Q V   + D    +RL+ LQ VD+ V+ ++  L+  G+++NT+IIYTSD+G
Sbjct: 230 WIHDLPYRNQTV-IDYNDHYYRQRLRALQGVDELVDNLITHLEKSGKIDNTFIIYTSDNG 288

Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           +H+ Q  L  GK+  F+ DIRVPF+MRGPGI
Sbjct: 289 FHISQHRLPPGKTCGFDEDIRVPFMMRGPGI 319


>gi|159462406|ref|XP_001689433.1| arylsulfatase [Chlamydomonas reinhardtii]
 gi|158283421|gb|EDP09171.1| arylsulfatase [Chlamydomonas reinhardtii]
          Length = 518

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 67/219 (30%), Positives = 97/219 (44%), Gaps = 53/219 (24%)

Query: 23  GKYLNKYNGS-----YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           GK+LN Y  S     Y+P G+  + AL  N+  YN + +   R       Y +DY  DLI
Sbjct: 120 GKFLNGYYTSEAPAGYVPKGFTVFDALTTNA--YNMTNSCFSRNGGRSTCYPSDYQTDLI 177

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
              +  ++  + ++  KKP    ++  APH              N+ + H P        
Sbjct: 178 GVKAKNYISAAVRA--KKPFFAYVTPTAPH----------RSSTNLVTWHPPT------- 218

Query: 138 DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
                        P  R   DL              A E ++  L+DL  L+NT +IY S
Sbjct: 219 -------------PASRH-KDLY-------------ADENVLGHLQDLEVLDNTIVIYMS 251

Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           D+GYH+G F L+ GK+ P E D+RVPF +RGPGI  G +
Sbjct: 252 DNGYHMGAFSLLDGKNLPIEEDVRVPFFIRGPGIPAGQV 290


>gi|134077589|emb|CAK96733.1| unnamed protein product [Aspergillus niger]
          Length = 573

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 60/94 (63%), Gaps = 1/94 (1%)

Query: 141 WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
           WI  +  + Q V   + D    +RL+ LQ VD+ V+ ++  L+   E++NTYIIYTSD+G
Sbjct: 267 WIHNLPYRNQSV-IDYNDHYYRQRLRALQGVDELVDSLIARLEASSEIDNTYIIYTSDNG 325

Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +H+ Q  L  GK+  F+ DIRVPF +RGPGI  G
Sbjct: 326 FHISQHRLPPGKTCAFDEDIRVPFFIRGPGIPEG 359


>gi|317030660|ref|XP_001393065.2| arylsulfatase [Aspergillus niger CBS 513.88]
          Length = 573

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 60/94 (63%), Gaps = 1/94 (1%)

Query: 141 WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
           WI  +  + Q V   + D    +RL+ LQ VD+ V+ ++  L+   E++NTYIIYTSD+G
Sbjct: 267 WIHNLPYRNQSV-IDYNDHYYRQRLRALQGVDELVDSLIARLEASSEIDNTYIIYTSDNG 325

Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +H+ Q  L  GK+  F+ DIRVPF +RGPGI  G
Sbjct: 326 FHISQHRLPPGKTCAFDEDIRVPFFIRGPGIPEG 359


>gi|346978743|gb|EGY22195.1| arylsulfatase [Verticillium dahliae VdLs.17]
          Length = 606

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 67/243 (27%), Positives = 111/243 (45%), Gaps = 40/243 (16%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           Y   +LN +NGS            +++   Y Y      R      ++   Y  DL+A  
Sbjct: 153 YDSPHLNGWNGS----------DFLLDPNTYAYWNATFQRNKDSPVNHLGQYSTDLVAQK 202

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDSAP----QYANMFFNVT 124
           +  FL  +  +   +P  L ++  APH            GP+++ P    ++   F    
Sbjct: 203 AYGFLDDAVAA--GEPFFLGIAPIAPHSNVNASTLNPVKGPDENVPVSDDEFRVSFPQPA 260

Query: 125 SHHTPAYDYAP-------NPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
             H   +  A        NPDK     WI +   K+   +  + DL    RLQ LQ+VD+
Sbjct: 261 KRHEHLFPDAKVPRTENFNPDKPSGADWIRR-QPKLSDENVAYNDLHYRGRLQALQAVDE 319

Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            V+ ++ +L++ G L+NTYI +T+D+G+H+ Q  L  GK   +E DI +P ++RGPG+  
Sbjct: 320 LVDGLIDKLEEHGVLDNTYIFFTTDNGFHISQHRLQPGKECGYEEDINIPLIVRGPGVPK 379

Query: 234 GTM 236
           G +
Sbjct: 380 GAI 382


>gi|429859197|gb|ELA33986.1| arylsulfatase [Colletotrichum gloeosporioides Nara gc5]
          Length = 608

 Score = 84.0 bits (206), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 67/248 (27%), Positives = 112/248 (45%), Gaps = 41/248 (16%)

Query: 21  YFGKYLNKYN-GSYIPPGWREWGA--LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           Y GK  N +   +Y  P  + W     +++   Y+Y      R      ++  +Y  DL+
Sbjct: 142 YTGKMFNAHTVWNYDSPHLKGWNGSDFLLDPNTYDYWNATFQRNKDKPVNHLGEYSTDLV 201

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS------------------------- 112
           A  +  FL  + ++   KP  L ++  APH   +S                         
Sbjct: 202 AQKAYDFLDDAVEA--GKPFFLAVAPIAPHSNVNSSVLLPVKGPDDDVPVSDDKFRTSIP 259

Query: 113 --APQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQ 166
             A ++  +F +     T  +    NPDK     WI +   K+   + ++ D     RL+
Sbjct: 260 KAAKRHEKLFPDAKVPRTENF----NPDKPSGADWIRR-QPKLSDENVEYNDEHYRGRLR 314

Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
            LQ+VD+ V+ ++ +L+  G L+NTYI YT+D+GYH+ Q  L  GK   +E DI +P ++
Sbjct: 315 ALQAVDELVDGLIDKLEKHGLLDNTYIFYTTDNGYHISQHRLQPGKECGYETDINIPLII 374

Query: 227 RGPGIVPG 234
           RGPG+  G
Sbjct: 375 RGPGVPKG 382


>gi|302830105|ref|XP_002946619.1| hypothetical protein VOLCADRAFT_86751 [Volvox carteri f.
           nagariensis]
 gi|300268365|gb|EFJ52546.1| hypothetical protein VOLCADRAFT_86751 [Volvox carteri f.
           nagariensis]
          Length = 976

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 61/196 (31%), Positives = 98/196 (50%), Gaps = 19/196 (9%)

Query: 51  YNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--- 107
           YN S ++N       +     Y  D+I + ++ ++  +  + S  P  L +   APH   
Sbjct: 225 YNSSFSLN---CGPSQAMMGQYQTDVIRDKALAYIEGAVAAGS--PFYLQLDPAAPHVDN 279

Query: 108 ------GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQ-PVHRQFTDLL 160
                  P   A ++A+++  +     P +    NP   +    TR M  P + +    L
Sbjct: 280 GGGEGWRPPPPAQRHADLYPGLELPDNPTF-LKVNPLNPF---RTRDMNNPENVKDATDL 335

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
              R+++L++VD+ V  IV +L DLG L+NTY+IY SD+G H+GQ  L  GK+ P E D 
Sbjct: 336 YVGRIRSLRAVDEMVGAIVAKLSDLGILDNTYVIYMSDNGLHMGQHSLDDGKATPIEEDS 395

Query: 221 RVPFLMRGPGIVPGTM 236
           RVP  +RGPG+  G +
Sbjct: 396 RVPLYLRGPGVPAGVV 411


>gi|159464461|ref|XP_001690460.1| hypothetical protein CHLREDRAFT_144396 [Chlamydomonas reinhardtii]
 gi|158279960|gb|EDP05719.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 692

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 76/273 (27%), Positives = 113/273 (41%), Gaps = 69/273 (25%)

Query: 23  GKYLNKY---NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYY--NDYYPDLI 77
           GK+LN++     +  P GW    AL+ +   Y Y   +  R        Y   DY  D+I
Sbjct: 113 GKFLNEFTPAGAAGCPRGWTALDALV-DPWVYTYFGPVFSRDCDPQLQRYADTDYSTDVI 171

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDSAPQYANMFFNVTS 125
            +  + ++R++  S   +P  L ++   PH            G    AP++A +F     
Sbjct: 172 RDKGLAYIREAVSS--GRPFFLELAPVGPHEACTQGESGQWCGDPLPAPRHAGLFPGAAI 229

Query: 126 HHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMT---------------KRLQTLQS 170
             T ++   P PD      +T K+QP  R     ++                 RLQ+LQ+
Sbjct: 230 PSTRSW-LTPFPDPA----LTGKLQPPQRASNGSVIGPYLPPSVRRLKQRQRARLQSLQA 284

Query: 171 VDDAVEKIV----------KELKDL-------------------GELNNTYIIYTSDHGY 201
           VD+ V  +V            +K                     G L NTY+IY SD+GY
Sbjct: 285 VDELVRDVVFPQCRQGYGITTMKHCHHRRRLLLLLPAVLELAAQGVLENTYVIYMSDNGY 344

Query: 202 HLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           H+GQ     GK  P+E D+RVPF MRGPG+  G
Sbjct: 345 HMGQHNANAGKYMPWEEDVRVPFFMRGPGVPSG 377


>gi|302417488|ref|XP_003006575.1| arylsulfatase [Verticillium albo-atrum VaMs.102]
 gi|261354177|gb|EEY16605.1| arylsulfatase [Verticillium albo-atrum VaMs.102]
          Length = 606

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 66/243 (27%), Positives = 110/243 (45%), Gaps = 40/243 (16%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           Y   +LN +NGS            +++   Y Y      R      ++   Y  DL+A  
Sbjct: 153 YDSPHLNGWNGS----------DFLLDPNTYAYWNATFQRNKDAPVNHLGQYSTDLVAQK 202

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDSAP----QYANMFFNVT 124
           +  FL  +  +   +P  L ++  APH            GP++  P    ++   F    
Sbjct: 203 AYGFLDDAVAA--GEPFFLGIAPIAPHSNVNASTLNPVKGPDEDVPVSDGEFRVNFPQPA 260

Query: 125 SHHTPAYDYAP-------NPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
             H   +  A        NPDK     W+ +   K+   +  + DL    RLQ LQ+VD+
Sbjct: 261 KRHEHLFPDAKVPRTENFNPDKPSGADWVRR-QPKLSDENVAYNDLHYRGRLQALQAVDE 319

Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            V+ ++ +L++ G L+NTYI +T+D+G+H+ Q  L  GK   +E DI +P ++RGPG+  
Sbjct: 320 LVDGLIDKLEEHGVLDNTYIFFTTDNGFHISQHRLQPGKECGYEEDINIPLIVRGPGVPK 379

Query: 234 GTM 236
           G +
Sbjct: 380 GAI 382


>gi|358375590|dbj|GAA92170.1| arylsulfatase [Aspergillus kawachii IFO 4308]
          Length = 578

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 40/71 (56%), Positives = 51/71 (71%)

Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
           +L++LQ VD+ VE+IV  L D G L+NTYI YTSD+GYH+GQ  L  GK   F+ DIRVP
Sbjct: 287 QLRSLQGVDELVEEIVLRLDDAGILDNTYIFYTSDNGYHIGQHRLHPGKECGFDEDIRVP 346

Query: 224 FLMRGPGIVPG 234
             +RGPG+  G
Sbjct: 347 MFVRGPGVPRG 357


>gi|410055272|ref|XP_514701.4| PREDICTED: extracellular sulfatase Sulf-2 isoform 3 [Pan
           troglodytes]
          Length = 552

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/99 (43%), Positives = 58/99 (58%), Gaps = 6/99 (6%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +  SR    +  S GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDYYNDYYPDLIAND--SVRFLRQSKQSFSK 94
           NG K KHG DY    Y  L      S+  L + K S+ +
Sbjct: 177 NGVKEKHGSDYSKMMYRHLFTRKQLSLEILLKYKASYVR 215


>gi|357394060|ref|YP_004908901.1| putative sulfatase [Kitasatospora setae KM-6054]
 gi|311900537|dbj|BAJ32945.1| putative sulfatase [Kitasatospora setae KM-6054]
          Length = 473

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 65/215 (30%), Positives = 101/215 (46%), Gaps = 22/215 (10%)

Query: 20  GYFGKYLNKYNGSYIPPGWREW---GALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           G FGKYLN +  +  PP + EW     ++ N   YN +  ++             Y   +
Sbjct: 135 GLFGKYLNAWQPADNPPHFDEWLLSAPVVYNDGQYNDNGTVH---------TIPGYSTSV 185

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           I + ++ FL   K +   +P    ++  A H P    P+YA+    V +   P+ D A  
Sbjct: 186 IRDHALAFL--DKAATDPRPWFAFVAPKASHEPNTPEPKYADTVVPVWNGR-PSVDEADR 242

Query: 137 PDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
            DK +WI   T  +    +     L  ++L+TL SVDDAV+ +  +L  LG+L+NT +IY
Sbjct: 243 SDKPEWIQNSTGTLATARQ-----LRARQLRTLLSVDDAVQALHDKLAALGQLDNTLVIY 297

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
             D+GY     G  K KS P+   + VPF +  P 
Sbjct: 298 LGDNGYTWADHGWSK-KSVPYLPSVEVPFYVSWPA 331


>gi|448726816|ref|ZP_21709206.1| N-acetylglucosamine-6-sulfatase [Halococcus morrhuae DSM 1307]
 gi|445793377|gb|EMA43955.1| N-acetylglucosamine-6-sulfatase [Halococcus morrhuae DSM 1307]
          Length = 516

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 65/227 (28%), Positives = 109/227 (48%), Gaps = 23/227 (10%)

Query: 21  YFGKYLNKYNGSYIP-PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           Y GK+      S  P PG+  W +     +Y++  +N++G ++K        Y  D++  
Sbjct: 144 YIGKWHTYQLDSATPRPGFDRWVSFEGQGRYFDPILNVDGNRVKR-----TGYITDILTE 198

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV-----TSHHTPAYDYA 134
            +  +L         KP  L +S  A H     AP++   + NV      +       YA
Sbjct: 199 YARDWLDSRD---GDKPFFLFLSHKAAHAWFRPAPRHRGRYSNVPIEYPQTMANTKQAYA 255

Query: 135 PNPDKQWILQVTRKMQPVHRQFT-----DLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
             PD  W+ +    ++ V+  F      D L  +  +TL ++DD++  ++ EL + G  +
Sbjct: 256 GKPD--WVRRQRNGVRGVNYLFGGRFDFDELYRRYSETLLALDDSIGAVMDELDESGMAD 313

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGT 235
           +T ++YTSD+GY LG+ GLV GK   +E  +RVP L   PG++ PGT
Sbjct: 314 STLLLYTSDNGYTLGEHGLV-GKQTAYESSVRVPLLAYAPGMIDPGT 359


>gi|302851682|ref|XP_002957364.1| hypothetical protein VOLCADRAFT_119669 [Volvox carteri f.
           nagariensis]
 gi|300257323|gb|EFJ41573.1| hypothetical protein VOLCADRAFT_119669 [Volvox carteri f.
           nagariensis]
          Length = 1147

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 66/238 (27%), Positives = 111/238 (46%), Gaps = 31/238 (13%)

Query: 21  YFGKYLNKYNGSYI-----PPGWREWGALIMNSK------YYNYSINMNGRKIKHGEDYY 69
           Y GK+LN      +     P GW    A+++         YY  +     R+ +  E + 
Sbjct: 136 YVGKFLNGLTHDSLLELGCPRGWTAMDAMLLGDTGGAEELYYKDTPRFM-RECRTLETFP 194

Query: 70  NDYYPDLIANDSVRFLRQSKQSFS---KKPIMLVMSFPAPH---GPEDSAPQ----YANM 119
           + +   ++   +++++ ++  + S   KKP  L++S  APH   G  D+ P+    Y  M
Sbjct: 195 DTFTEIVVRQKALQYIERAAATNSLRGKKPFFLMVSTFAPHDASGGTDAVPEVEVKYRGM 254

Query: 120 FFNVTSHHTPAYDYAPNPDK---QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVE 176
           + +V    +P +   P P+K    W       +  +   +       RL  L+SVDD V 
Sbjct: 255 YGDVELPESPNFGVRP-PNKIGYYWPYDSKDAIPRLTEHYR-----ARLAALRSVDDTVA 308

Query: 177 KIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            +V +L     LNNT I+YTSD+G+ LG   + + K   +E D+R+P L+ GPGI  G
Sbjct: 309 AVVSKLACRDLLNNTVIVYTSDNGFKLGSHNIAQEKFTQYEEDVRLPLLLAGPGIPRG 366


>gi|269124330|ref|YP_003297700.1| sulfatase [Thermomonospora curvata DSM 43183]
 gi|268309288|gb|ACY95662.1| sulfatase [Thermomonospora curvata DSM 43183]
          Length = 461

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 70/226 (30%), Positives = 109/226 (48%), Gaps = 16/226 (7%)

Query: 22  FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDS 81
           FGKYLN ++    PP + E+ AL+   KY     N+NG+  +        Y   LI + +
Sbjct: 128 FGKYLNGWDVHQAPPHFDEY-ALMHPPKYGETWWNVNGKVSRK-----RAYSTSLIRDHA 181

Query: 82  VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQW 141
           VRFLR+ + S   +P  L ++  APH P     +YAN+    +    PA   +   DK +
Sbjct: 182 VRFLRRHRAS--GRPWFLYLTPYAPHAPFTPEARYANLSVP-SWRGNPAVAESDKRDKPF 238

Query: 142 ILQVTRKMQP-VHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
            +   R+  P +HR     +   +L+TL+SVDD +  +  EL+    L++T II  SD+G
Sbjct: 239 YI---RRSDPDLHRARR--IRAGQLRTLRSVDDLLGAVRDELRAQRRLDDTLIIVISDNG 293

Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           Y  G  G    KS P+   +R+P  +  P    G     D L+  +
Sbjct: 294 YCWGDHGW-HAKSVPYSPAVRIPLYLSWPAGGLGRGATDDRLVANI 338


>gi|149195728|ref|ZP_01872785.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149141190|gb|EDM29586.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 474

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 96/182 (52%), Gaps = 15/182 (8%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
           ++ +++ N  + F+ ++K     KP  + +++ APH P  +  ++ +M+  + S   P  
Sbjct: 169 HWTEIVTNHGIDFIDEAKND--DKPFFMYIAYNAPHDPRQAPKEFVDMY-PLESISVPEN 225

Query: 132 DYAPNPDKQWI----LQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
             A  PDK+ I    LQ   ++ P  R    + + ++     +  +D  + +++K L D 
Sbjct: 226 FLAEYPDKELIGCGPLQRDARLAPFPRTEYAVKVNRQEYFACISYMDAEIGRLLKALADS 285

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYLLD 240
           G+ +NT +I+T+DHG  +G+ G + GK   ++  +RVPFL+ GPGI     +   +YL D
Sbjct: 286 GQADNTIVIFTADHGLAVGRHGFI-GKQNMYDHSVRVPFLISGPGIKKAQKISSPIYLQD 344

Query: 241 VL 242
            +
Sbjct: 345 AM 346


>gi|159479368|ref|XP_001697765.1| hypothetical protein CHLREDRAFT_150903 [Chlamydomonas reinhardtii]
 gi|158274133|gb|EDO99917.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 483

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 68/241 (28%), Positives = 107/241 (44%), Gaps = 33/241 (13%)

Query: 23  GKYLNKYNGSY----IPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDY------ 72
           GK+LN +         P GW     LI +    +      G +      Y  D       
Sbjct: 10  GKFLNSFKEPLARKGCPQGWTGISPLIQDVDTED-----GGEEAPRAPTYMEDCNGQLTS 64

Query: 73  YPDLIANDSVRFLRQS---KQSFSKKPIMLVMSFPAPHGPEDSA-------PQYANMFFN 122
           YP     + +R    S   + S S  P  L++S  APH  +          PQY +++  
Sbjct: 65  YPGAFQEEVIRDKALSYIDEASGSPLPFFLLISTVAPHDAKGKGSTYPQVLPQYKDLYPG 124

Query: 123 VTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL--LMTKRLQTLQSVDDAVEKIVK 180
           + +  T  +   P     +      ++       TD+      RLQ+L++VDD +E +V+
Sbjct: 125 LKAPRTANWGRPPPVTIGFAPSTPNRLN-----VTDINERYRARLQSLRAVDDTIEAVVR 179

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLD 240
            L   G+LNNT  ++TSD+G+ LG   + + K   +E D+RVPFLM GPG VP  +Y+ +
Sbjct: 180 RLACHGQLNNTVFMFTSDNGFKLGTHNIPREKFTWYEEDVRVPFLMAGPG-VPRGVYVPE 238

Query: 241 V 241
           V
Sbjct: 239 V 239


>gi|345328217|ref|XP_003431251.1| PREDICTED: extracellular sulfatase Sulf-2 [Ornithorhynchus
           anatinus]
          Length = 828

 Score = 80.9 bits (198), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 35/71 (49%), Positives = 48/71 (67%), Gaps = 4/71 (5%)

Query: 2   AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
           +W  +   R+   +  + GY    FGKYLN+YNGSY+PPGW+EW  L+ NS++YNY++  
Sbjct: 117 SWQAQHEVRSFAVYLNNTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176

Query: 58  NGRKIKHGEDY 68
           NG K KHG DY
Sbjct: 177 NGVKEKHGFDY 187


>gi|288917454|ref|ZP_06411820.1| sulfatase [Frankia sp. EUN1f]
 gi|288351157|gb|EFC85368.1| sulfatase [Frankia sp. EUN1f]
          Length = 475

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 62/218 (28%), Positives = 103/218 (47%), Gaps = 29/218 (13%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           G FGKYLN ++ +  PP + EW   ++   Y + + N+NG          N Y  +++ N
Sbjct: 140 GLFGKYLNGWDINASPPHFEEWA--LLKPSYVDGTYNVNGTV-----QTINGYTTNVLRN 192

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
            ++ FL   + +   +P    ++  APHGP   A QYA+          P ++  P    
Sbjct: 193 RTLSFL--DRAATDTRPWFAYVAPYAPHGPNTPAAQYADTVV-------PEWNGRPA--- 240

Query: 140 QWILQVTRKMQPVHRQFTDLLMT-------KRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
             + +  +  +P + Q +   +        ++L+TL SVDDAV+    +L  LG+L NT 
Sbjct: 241 --VHETDKSDKPPYVQNSTSTLADGQAVRERQLRTLLSVDDAVQAFRDKLVALGQLENTL 298

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           + Y  D+G+  G  G V  KS P+     +PF +  PG
Sbjct: 299 VFYVPDNGFLWGDHGRV-AKSVPYTPAHELPFYVSWPG 335


>gi|429848344|gb|ELA23841.1| arylsulfatase [Colletotrichum gloeosporioides Nara gc5]
          Length = 578

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 54/199 (27%), Positives = 98/199 (49%), Gaps = 18/199 (9%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
           +Y++  +  NG        Y   Y  D++A+ +   L Q+      +P  L ++  APH 
Sbjct: 150 RYWDTKMTHNGEPPV---SYAGKYNTDVVADKAYALLEQALNR--DEPWFLTVAPIAPHS 204

Query: 109 ----PEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTR------KMQPVHRQ--- 155
                E +   Y +   +   H    +DY     K +  Q++       +++ ++     
Sbjct: 205 NWVFDERTNTTYLSTPQSAYRHEHLFHDYKIPRGKSFNRQISGAASWPGRLEALNESVLA 264

Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
           + D    +RL+ LQ+VD+ V ++V+ L+  GE+ NTYI Y++D+GYH+ Q  +  GK   
Sbjct: 265 YNDHYQRQRLRALQAVDEMVHELVERLEVAGEIENTYIFYSTDNGYHISQHRMHPGKECG 324

Query: 216 FEFDIRVPFLMRGPGIVPG 234
           ++ DI +P  +RGPG+V G
Sbjct: 325 YDTDIHIPMFVRGPGLVAG 343


>gi|400592940|gb|EJP60966.1| arylsulfatase, putative [Beauveria bassiana ARSEF 2860]
          Length = 343

 Score = 80.1 bits (196), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 41/79 (51%), Positives = 53/79 (67%), Gaps = 1/79 (1%)

Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE-LNNTYIIYTSDHGYHLGQFGLVKGKS 213
           Q+ D    +RLQ+LQ VDD VE I + L D  + L NTYIIYT+D+G+HLGQ  L  GK+
Sbjct: 26  QYLDNFQRRRLQSLQDVDDLVEGIFETLLDYPDVLANTYIIYTADNGFHLGQHRLPAGKT 85

Query: 214 FPFEFDIRVPFLMRGPGIV 232
              E D+ VPF++RGP I 
Sbjct: 86  CGIEEDVNVPFIIRGPEIA 104


>gi|319954020|ref|YP_004165287.1| sulfatase [Cellulophaga algicola DSM 14237]
 gi|319422680|gb|ADV49789.1| sulfatase [Cellulophaga algicola DSM 14237]
          Length = 505

 Score = 80.1 bits (196), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 61/226 (26%), Positives = 110/226 (48%), Gaps = 17/226 (7%)

Query: 22  FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDS 81
             K  N  + S +P G+     L +    +N +    G   K G+     ++ +++ +D+
Sbjct: 169 LSKIPNATSESIMPNGYNR--PLSLTDTTWNPTDTTKGGYWKDGQ-----HWSEVLKDDA 221

Query: 82  VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPAYDYAPNP 137
           +RF+  +K S  +KP  + ++F APH P  +  +Y +M+     +V     P Y Y  + 
Sbjct: 222 IRFINTAKTS--EKPFFMYLAFNAPHDPRQAPQEYQDMYALKNLSVPKSFLPEYPYRHDI 279

Query: 138 DKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
                L+    + P  R    +   K+    ++  VD  +  I++ LK+ G+++NTYII+
Sbjct: 280 ANGDDLR-DEALAPFPRTELAIKTHKKEYYASITHVDAQIGNIIEALKNSGKMDNTYIIF 338

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
           T+DHG  +G+ GL+ GK   F+  IR P ++ GP I       +DV
Sbjct: 339 TADHGIAMGRHGLL-GKQSLFDHSIRPPIIILGPDIPKNKKTNVDV 383


>gi|429852351|gb|ELA27491.1| arylsulfatase precursor [Colletotrichum gloeosporioides Nara gc5]
          Length = 474

 Score = 80.1 bits (196), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 58/200 (29%), Positives = 100/200 (50%), Gaps = 28/200 (14%)

Query: 50  YYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSF---SKKPIMLVMSFPAP 106
           + N  ++ NG + +H        YP     D +R     +  +    +KP  L ++   P
Sbjct: 3   FNNIVMSENGGRPRH--------YPGFHQTDIIRAQSLDRLEYLTSQEKPFFLTIAPTVP 54

Query: 107 HGPED-SAP----QYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFT---- 157
           H   +  AP    ++  +F NVTS   P +    NP  +  +Q +  M+ + RQ      
Sbjct: 55  HENYNFYAPVPLLRHKGLFPNVTSPKPPNW----NPPDELQMQKSNWMKKL-RQLNETEI 109

Query: 158 ---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
              D L  +R+++LQ VD+ +E +V+ L+D   + NTYIIYT+D+G+ +G   +  GK+ 
Sbjct: 110 AQGDWLFRQRIESLQGVDEIIEDVVQLLEDKSIIENTYIIYTTDNGWTIGANRVAAGKAT 169

Query: 215 PFEFDIRVPFLMRGPGIVPG 234
           P+  D  +P ++RGPG+  G
Sbjct: 170 PYADDTNLPIVVRGPGVPSG 189


>gi|443897158|dbj|GAC74500.1| sulfatases [Pseudozyma antarctica T-34]
          Length = 576

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 52/198 (26%), Positives = 99/198 (50%), Gaps = 18/198 (9%)

Query: 50  YYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
           Y N S  ++G+++     Y   Y  D++ +  +++L ++  S   +P  + ++   PH  
Sbjct: 169 YNNASFAIDGKQLVQ---YEGQYSTDVVRDFGLQYLEEAASS--GQPFFVGIAPIGPHAE 223

Query: 110 ED------------SAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFT 157
                          A ++ N+F    +    +++ A +P     ++   ++      + 
Sbjct: 224 TTFINGSAIFSNPVPAKRHKNLFATEVAPRKSSFN-ADDPSGAGFIRDLPRLNSSVVDYI 282

Query: 158 DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFE 217
           D     RL++LQ+VD+ V  ++ + + LG L+NTY IY+SD+GY +G+     GK+  +E
Sbjct: 283 DEWYRDRLRSLQAVDELVGAVMDKAEQLGILDNTYFIYSSDNGYSVGEHRRQPGKTTGYE 342

Query: 218 FDIRVPFLMRGPGIVPGT 235
            DIRVP  +RGPGI  G+
Sbjct: 343 EDIRVPLYIRGPGINSGS 360


>gi|159468309|ref|XP_001692325.1| hypothetical protein CHLREDRAFT_145830 [Chlamydomonas reinhardtii]
 gi|158278511|gb|EDP04275.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 723

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 60/190 (31%), Positives = 97/190 (51%), Gaps = 14/190 (7%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
           KYY  +   + RK+   +D   +    LI  DSV          S  P  L ++  APH 
Sbjct: 129 KYYYTNCQYHERKVGLPDDNLFERAHSLI-EDSVD---------SGNPFFLKLAVSAPHD 178

Query: 108 GPEDSAPQYANMFFNV-TSHHTP-AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL 165
           GP D  P+ A+ + +       P   +Y    D ++ +   R +       +D+    R 
Sbjct: 179 GPVDGLPRIADRYLSAFPGLKVPRGPNYGKPIDDRFGMSKQRPLW-WDSADSDIRYRARA 237

Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
           Q++ ++D+ ++K+V +L+ LG LNNTYIIYTSD+G+ LGQ  + + K   +E D+ +PF 
Sbjct: 238 QSMLAIDEHLDKLVTKLECLGILNNTYIIYTSDNGFKLGQHNIPQEKWTYYEEDVALPFF 297

Query: 226 MRGPGIVPGT 235
           +RGPG+  G 
Sbjct: 298 IRGPGVPRGA 307


>gi|340619599|ref|YP_004738052.1| sulfatase [Zobellia galactanivorans]
 gi|339734396|emb|CAZ97773.1| Sulfatase, family S1-28 [Zobellia galactanivorans]
          Length = 505

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 56/214 (26%), Positives = 107/214 (50%), Gaps = 21/214 (9%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ +++ +D++ F+ Q+K S  +KP  + ++F APH P  +  ++ +++     +V    
Sbjct: 209 HWSEVLKDDALGFIEQTKTS--EKPFFMYLAFNAPHDPRQAPQEFVDLYPIENISVPESF 266

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIVKELK 183
            P Y Y         L+    + P  R  T+  + K  Q     +  +D  + +I+  L+
Sbjct: 267 LPLYPYKDGMGNGPSLR-DEALAPFPR--TEYAIKKHTQEYYALISHLDFQIGQILDGLE 323

Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYL 238
             G+++NTYII+T+DHG  +G+ GL+ GK   F+  IR PF++ GP I     +   +YL
Sbjct: 324 KTGKMDNTYIIFTADHGLAMGKHGLI-GKQSQFDHSIRAPFMIVGPKIPENKKIDADIYL 382

Query: 239 LDVLIP--QVRKFSSGSLIFIMSFLINLNTMRAE 270
            D ++   ++        +F  S +  +N  R E
Sbjct: 383 QDAMVTSLELAGIEKPEYVFFNSVIDLVNGERKE 416


>gi|389625515|ref|XP_003710411.1| arylsulfatase [Magnaporthe oryzae 70-15]
 gi|351649940|gb|EHA57799.1| arylsulfatase [Magnaporthe oryzae 70-15]
          Length = 643

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 34/71 (47%), Positives = 50/71 (70%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M +R + LQ VD+ +E +V  L+  G+L+NTY++YTSD+GYHLGQ  L  GK+  +  D 
Sbjct: 264 MARRAEALQGVDEIIEDVVAMLEANGQLDNTYVVYTSDNGYHLGQHRLPGGKALFYGEDT 323

Query: 221 RVPFLMRGPGI 231
            +PF +RGPG+
Sbjct: 324 NLPFAVRGPGV 334


>gi|149195700|ref|ZP_01872757.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149141162|gb|EDM29558.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 482

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 51/181 (28%), Positives = 92/181 (50%), Gaps = 13/181 (7%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ ++ A+D++ FL  +K     +P  + + F APH P  +  +Y +M+     ++  +H
Sbjct: 179 HWTEVTADDAIEFLDDAKDI--DQPFFMYVGFNAPHDPRQAPKEYVDMYPIDKISLPKNH 236

Query: 128 TPAYDYAPNPDKQWILQVTRKMQ-PVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
              Y Y         L+    M  P +   T +   +       +D  + +++K LKD G
Sbjct: 237 LGLYPYRDQIGCGKQLRDAAIMPFPRNEYATKINRQEYFAATTHMDHNIGRVLKALKDSG 296

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG-----TMYLLDV 241
           + +NTY+I+TSDHG  +G+ GLV GK   +E  +R PF++ GP I  G     ++Y+ D 
Sbjct: 297 KADNTYVIFTSDHGLAVGEHGLV-GKQNMYEHSMRAPFIIVGPSIPKGQRNATSIYIQDA 355

Query: 242 L 242
           +
Sbjct: 356 M 356


>gi|302837023|ref|XP_002950071.1| hypothetical protein VOLCADRAFT_120839 [Volvox carteri f.
           nagariensis]
 gi|300264544|gb|EFJ48739.1| hypothetical protein VOLCADRAFT_120839 [Volvox carteri f.
           nagariensis]
          Length = 1230

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 34/71 (47%), Positives = 51/71 (71%)

Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
           RL+TL++VD+ + ++   L  +G+L+NTY+I+TSD+G+ LG      GK F +E  +RVP
Sbjct: 246 RLRTLRAVDEMIGRLATTLASVGQLSNTYVIFTSDNGFQLGNHAQSSGKQFHWEEVVRVP 305

Query: 224 FLMRGPGIVPG 234
           F +RGPGI PG
Sbjct: 306 FYIRGPGIPPG 316


>gi|149197559|ref|ZP_01874610.1| probable sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149139577|gb|EDM27979.1| probable sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 505

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 37/79 (46%), Positives = 58/79 (73%), Gaps = 6/79 (7%)

Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           +++D  +++++K+L+ +GEL+NTYIIYTSDHG  +G+ GL +GK   +E   RVPF+++G
Sbjct: 281 ENIDIQIDRVLKKLESMGELDNTYIIYTSDHGIAIGRHGL-QGKQNLYEHTFRVPFIVKG 339

Query: 229 PGI-----VPGTMYLLDVL 242
           PGI     V G +YL D+L
Sbjct: 340 PGIPAGKRVKGNIYLTDIL 358


>gi|1703428|sp|P14217.2|ARS_CHLRE RecName: Full=Arylsulfatase; Short=AS; AltName: Full=Aryl-sulfate
           sulphohydrolase; Flags: Precursor
          Length = 647

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 60/204 (29%), Positives = 99/204 (48%), Gaps = 29/204 (14%)

Query: 53  YSINMNGRKIKHGED---YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG- 108
           Y+ + N R  ++G     Y  +Y  D+I +  V  ++ +  +   KP    +S  APH  
Sbjct: 157 YTFDYNTRLQRNGATPNIYPGEYSTDVIRDKGVAQIKSAVAA--GKPFYAQISPIAPHTS 214

Query: 109 -----------------PEDSAPQYANMFFNVT----SHHTPAYDYAPNPDKQWILQVTR 147
                            P   AP +  +F +      S +   Y+   +    WI  +  
Sbjct: 215 TQISTNPATGVTRSYFFPPIPAPPHWQLFSDANLPGGSXNKNLYEVDVSDKPAWIRALPL 274

Query: 148 KMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG 207
             Q  +R + + +   RL++L   D+ +E++VK L + G L+NTYIIY++D+GYH+G   
Sbjct: 275 AQQ-NNRTYQEEIYRLRLRSL-GPDELIEQVVKTLDEAGVLDNTYIIYSADNGYHVGAHR 332

Query: 208 LVKGKSFPFEFDIRVPFLMRGPGI 231
              GK+  +E D+RVPFL+RGPGI
Sbjct: 333 FGAGKTTGYEEDLRVPFLIRGPGI 356


>gi|340619425|ref|YP_004737878.1| sulfatase [Zobellia galactanivorans]
 gi|339734222|emb|CAZ97599.1| Sulfatase, family S1-28 [Zobellia galactanivorans]
          Length = 493

 Score = 77.8 bits (190), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 57/187 (30%), Positives = 92/187 (49%), Gaps = 29/187 (15%)

Query: 82  VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQW 141
           + +L   +++  + P  +   F  PH   D  P+    +  V  +HT      P  DKQ 
Sbjct: 176 LEYLGDREKTKEEDPFFIYFGFSHPHDTRDGTPELLAKYGAV--NHTDKNSLPPANDKQP 233

Query: 142 ILQVT-RKMQPVHRQFTDLLMTKRL---------QTL-----------QSVDDAVEKIVK 180
            LQ    +  P      +L   +R+         QT+           +++D  + K++K
Sbjct: 234 PLQDNYLEAHPFFHGHPELRDEERVSGVWKNRDEQTVRNELGREYACSENIDIQLGKVLK 293

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGT 235
           +L+ +GEL+NTYI+YTSDHG  +G+ GL+ GK   +E   RVP +++GPG+       G 
Sbjct: 294 KLEAMGELDNTYIVYTSDHGMSIGRHGLM-GKQNLYEHTWRVPMIIKGPGLPSGKRTKGN 352

Query: 236 MYLLDVL 242
           +YLLDVL
Sbjct: 353 VYLLDVL 359


>gi|159487921|ref|XP_001701971.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158281190|gb|EDP06946.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 567

 Score = 77.8 bits (190), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 67/249 (26%), Positives = 112/249 (44%), Gaps = 44/249 (17%)

Query: 21  YFGKYLNKY---NGSYIPPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           Y GK++  Y   N    P GW  + AL+     +YY  +  ++G        Y   Y  D
Sbjct: 125 YTGKFIVDYTIRNHRPPPAGWTAFDALVHPFTFEYYCPAFALDGGPPAM---YPAQYVTD 181

Query: 76  LIANDSVRFLRQS--KQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
           ++   +   LR +    + S +P  L ++ PAPH          +  +N T      + Y
Sbjct: 182 VVGAKARALLRSAAADAAVSGRPFYLQVAPPAPH-------HSMHYDYNATGGLRARHWY 234

Query: 134 APNPDKQ--------------------------WILQVTRKMQPVHRQFTDLLMTKRLQT 167
            P P ++                          W+  + R + P +  F + +   RL++
Sbjct: 235 PPVPAERHWEAFSGAALPRPASFNPRNLWGKPAWVRGLER-LTPGNVTFLEEVFRLRLRS 293

Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
           L +VD+ V  I+ +L+ LG   +TY+++TSD+GYH G      GK+  +E DIRVP L+ 
Sbjct: 294 LLAVDELVAGIMSDLEALGLTGSTYVLFTSDNGYHEGAHRQGSGKTTAYEEDIRVPLLVA 353

Query: 228 GPGIVPGTM 236
           GPG+ PG++
Sbjct: 354 GPGLRPGSV 362


>gi|326334165|ref|ZP_08200389.1| arylsulfatase [Nocardioidaceae bacterium Broad-1]
 gi|325948051|gb|EGD40167.1| arylsulfatase [Nocardioidaceae bacterium Broad-1]
          Length = 509

 Score = 77.8 bits (190), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 67/245 (27%), Positives = 110/245 (44%), Gaps = 28/245 (11%)

Query: 21  YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNY---SINMNGRKIKHGEDYYNDYY 73
           + GKYLN Y        +PPGW +W A   +   YN+    IN NG+   +    Y  Y 
Sbjct: 156 FAGKYLNGYGLRGTAKDVPPGWTDWRA-TTDMSTYNFIRPRINHNGKFTTY--HRYTTYV 212

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--GPEDSAPQYANMFFNVTSHHTPAY 131
               AND +     S    ++KP  + +++ APH  GP +             +    ++
Sbjct: 213 MRDQANDMI-----SAPERAEKPWYMWLNYVAPHHGGPGEPDDPEGTSTTRPAAEDKHSF 267

Query: 132 DYAPNPDKQWILQVTRKMQPVH--RQFTD--------LLMTKRLQTLQSVDDAVEKIVKE 181
           D    PD  ++ +       +   R+F D        +   +R++ +Q++D AV   V  
Sbjct: 268 DGLELPDLPYMFRTPTDAPRISASRKFFDAEGRAELRIAHEQRVEAVQALDRAVASHVLR 327

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
           L++ G+L+NT II++SD+GY  G    + GK   +    R+P ++RGPGI  G      V
Sbjct: 328 LQETGQLDNTIIIFSSDNGYTTGGHN-INGKLMHYRETQRIPLILRGPGIPVGRSLATGV 386

Query: 242 LIPQV 246
             P +
Sbjct: 387 GNPDI 391


>gi|429847721|gb|ELA23288.1| arylsulfatase precursor [Colletotrichum gloeosporioides Nara gc5]
          Length = 633

 Score = 77.4 bits (189), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 67/236 (28%), Positives = 102/236 (43%), Gaps = 27/236 (11%)

Query: 20  GYFGKYLNK-YNGSYI------PPGWREWGALIMNSKYYNYSINM--NGRKIKHGEDYYN 70
           GY  +Y+ K +NG  I      P GWR    L+      + ++ M  NG +         
Sbjct: 117 GYNAEYVGKLFNGDSIISYNPAPKGWRHIDVLLDPYTNVHNTVVMSENGARPV------- 169

Query: 71  DYYPDLIANDSVRFL---RQSKQSFSKKPIMLVMSFPAPH-----GPEDSAPQYANMFFN 122
            +Y      D VR     R       + P  L+++  APH      P     +Y   F N
Sbjct: 170 -WYQGFQQTDVVRIKALDRLDNLLKEEDPFFLMIAPTAPHVENLRDPPTPPARYLGSFAN 228

Query: 123 VTSHHTPAYDYAPN--PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
            T   TP ++ A      K   L+    +        D    +RL++L+ VDD ++ +V+
Sbjct: 229 STVPRTPNFNPADEYQHGKPSWLRTLPSLNSTQIAEIDHFYQRRLESLKGVDDIIDDVVE 288

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
            L     +++TYIIYT+D GYHLG      GKS P+  D   P ++RGPG+  G +
Sbjct: 289 MLTVNDAIDDTYIIYTTDQGYHLGIHRTGVGKSLPYLEDTNTPLIVRGPGVPKGAV 344


>gi|18119|emb|CAA34302.1| unnamed protein product [Chlamydomonas reinhardtii]
 gi|18174|emb|CAA36545.1| unnamed protein product [Chlamydomonas reinhardtii]
          Length = 646

 Score = 77.0 bits (188), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 33/61 (54%), Positives = 47/61 (77%)

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           VD+ +E++VK L + G L+NTYIIY++D+GYH+G      GK+  +E D+RVPFL+RGPG
Sbjct: 295 VDELIEQVVKTLDEAGVLDNTYIIYSADNGYHVGAHRFGAGKTTGYEEDLRVPFLIRGPG 354

Query: 231 I 231
           I
Sbjct: 355 I 355


>gi|429849912|gb|ELA25239.1| arylsulfatase [Colletotrichum gloeosporioides Nara gc5]
          Length = 502

 Score = 77.0 bits (188), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 51/166 (30%), Positives = 84/166 (50%), Gaps = 14/166 (8%)

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
           +Y  D++A  ++ FL  +  +   KP  L ++  APH    S     N+  N + H    
Sbjct: 174 EYSTDVLACKALDFLDDAVAA--SKPFFLGIAPVAPHSNVQS-----NVISNGSVHDPHG 226

Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
             + P     +         P+ R+ T  +  +RL+ LQ+VD+ V+ +V  L++ G L  
Sbjct: 227 IVFPPRGTPIF------SPMPLSRERTTSI-PRRLRALQAVDELVDSVVSRLEEYGILEE 279

Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           TYI YT+D+GYH+G   L  GK   +E DI +P ++RGP +  G +
Sbjct: 280 TYIFYTTDNGYHIGHHRLQPGKECGYEEDINIPLIIRGPQVPEGKV 325


>gi|254444649|ref|ZP_05058125.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
 gi|198258957|gb|EDY83265.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
          Length = 499

 Score = 76.6 bits (187), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 61/238 (25%), Positives = 109/238 (45%), Gaps = 33/238 (13%)

Query: 18  SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYS-INMNGRKIKHGEDYYNDYYPDL 76
           + GY    + K++    P G+  W  LI   +YY+   +   G+    G      Y  D+
Sbjct: 113 AAGYETAVIGKWHLGSDPTGFDFWKILIGQGQYYDAPFLTAEGQVETEG------YVTDV 166

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-----GPE-------DSAPQYANMFFNVT 124
           I + ++ +L   +    +KP ML+    APH     GP+       ++ P+   +F +  
Sbjct: 167 ITDLAIDWLNTRE---DEKPFMLMYQHKAPHANFQPGPDYLNWREDETIPEPETLFDDYA 223

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKM----------QPVHRQFTDLLMTKRLQTLQSVDDA 174
           +    A+D     D    LQ   ++              R      +   L+ ++SVDD 
Sbjct: 224 TRSPAAWDNEMRIDPTLELQYQGELNLKVPDGLRGHERSRWLYQFYIKNYLRCVKSVDDG 283

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           V ++ ++L+ +GEL+NT +IYTSD G+ LG+ G    K F +E  +++P L+R P ++
Sbjct: 284 VGRVFEQLEAMGELDNTIVIYTSDQGFFLGEHGYYD-KRFMYEESLQIPLLVRYPKMI 340


>gi|325109858|ref|YP_004270926.1| sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324970126|gb|ADY60904.1| sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 595

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 62/232 (26%), Positives = 111/232 (47%), Gaps = 28/232 (12%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            Y GK+          PG+  +        YY+ + N+NG++ K     +  YY  ++ +
Sbjct: 219 AYIGKWHMGEKNDEPRPGFDYFVTHTGQGLYYDTTFNINGKERK----TFPGYYTHVVTD 274

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHH-TPAYDYAPNPD 138
            +  +L Q +     +P ML++   APH      P+YA+ + ++  ++   A+D   NPD
Sbjct: 275 MAEEWLDQQQDD---EPFMLMVGQKAPHSFYLPEPKYAHRYDDIDVNYPQSAFDLDDNPD 331

Query: 139 KQWILQ-------------VTRKMQPVHRQ--FTDLLMTKRL--QTLQSVDDAVEKIVKE 181
             W+ +               RK  P  +    TD     R    T+ SVDD+V ++ K+
Sbjct: 332 --WMTKRLYTWHGIYGPLFAWRKDHPNDKAEAVTDFANMIRAYWGTINSVDDSVGRLTKK 389

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           L+++GEL++T  ++  D+G   G+ G+V  ++   E  IRVP ++R PG+ P
Sbjct: 390 LREMGELDSTVFVFMGDNGLLEGEHGMVDKRTM-HEPSIRVPMIVRYPGLTP 440


>gi|374606778|ref|ZP_09679601.1| sulfatase [Paenibacillus dendritiformis C454]
 gi|374387613|gb|EHQ59112.1| sulfatase [Paenibacillus dendritiformis C454]
          Length = 472

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 67/243 (27%), Positives = 110/243 (45%), Gaps = 35/243 (14%)

Query: 19  HGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYS-INMNGRKIKHGEDYYNDYY 73
           HGY     GK+   + G   P G+  W  L    +Y+N   +  NG +   G      Y 
Sbjct: 96  HGYQTAIVGKWHLGHGGHANPTGFDHWSVLDGQGEYFNSEFLEANGSRRIEG------YV 149

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD- 132
            D+IA+ S+ +LR+       KP ML+    APH P     +++ M+ +V   +   +D 
Sbjct: 150 TDVIADLSLDWLRKRD---PDKPFMLMCHHKAPHRPWQPDERHSRMYEDVEIPYPETFDD 206

Query: 133 -YAPNPDK--QWILQVTRKM---------------QPVHRQFTDLLMTKRLQTLQSVDDA 174
            YA       +  +++ R M               + V R      +   L+ + S+DD 
Sbjct: 207 DYANRSQAAVEAKMRIGRDMVTTDTKGDPPEGLTEEEVKRWKYQRYIKDYLRCVASIDDN 266

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP-GIVP 233
           V +++  L + G   +T +IYTSD G+ LG  G    K F +E  +R+PF++R P  I P
Sbjct: 267 VGRLLDYLDEEGIAEDTIVIYTSDQGFFLGDHGWFD-KRFMYEESLRMPFIVRYPRAIAP 325

Query: 234 GTM 236
           G++
Sbjct: 326 GSV 328


>gi|325285339|ref|YP_004261129.1| sulfatase [Cellulophaga lytica DSM 7489]
 gi|324320793|gb|ADY28258.1| sulfatase [Cellulophaga lytica DSM 7489]
          Length = 513

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 52/178 (29%), Positives = 90/178 (50%), Gaps = 14/178 (7%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ +++ +D++ F+  +KQ   +KP  + ++F APH P  +   Y  M+      +    
Sbjct: 218 HWSEVLKDDAITFINTAKQK--EKPFFMYLAFNAPHDPRQAPKAYQAMYNVNNITIPKSF 275

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELK 183
            P Y Y  N      L+    + P  R  T+L +   ++    ++  +D  +  I+  LK
Sbjct: 276 LPEYPYRHNIANGDDLR-DEALAPFPR--TELAIKTHIKEYYASITHLDAQIGDILTALK 332

Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
           + G++NNTYII+TSDHG  +G+ GL+ GK   F+  IR P ++ GP I        DV
Sbjct: 333 NSGKINNTYIIFTSDHGLAMGKHGLL-GKQNLFDHSIRPPMIIVGPDIPKNKKNTADV 389


>gi|326333280|ref|ZP_08199527.1| hypothetical protein NBCG_04717 [Nocardioidaceae bacterium Broad-1]
 gi|325948924|gb|EGD41017.1| hypothetical protein NBCG_04717 [Nocardioidaceae bacterium Broad-1]
          Length = 499

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 104/243 (42%), Gaps = 44/243 (18%)

Query: 21  YFGKYLNKYNG-----SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           + GKYLN Y       S   PGW +W A +  S Y   S   N          Y+     
Sbjct: 124 FLGKYLNNYGKGSERYSSGEPGWDDWQATVDPSTYNLLSPTFNSNGTLQSPAIYSSV--- 180

Query: 76  LIANDSVRFLRQSKQSFS-KKPIMLVMSFPAPH--GP------------------EDSAP 114
            + N S+R ++ +  +   +KP  L +++ APH  GP                  E++ P
Sbjct: 181 ALQNRSIRMIKHAGTNVGGRKPWFLWVNYVAPHAGGPVEASDPKKRYPDQPSLWLENTRP 240

Query: 115 QYAN--MFFNVTSHHTPAYDYAP----NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
             A+   F  +    TP   YA        K +     R M+ V++Q        R++  
Sbjct: 241 MGADKGRFSGLELPKTPDMFYAGPGQVTAGKPYTTAQKRVMRYVYQQ--------RIEAA 292

Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           QSVD AV + +  LKD G+   T II++ D+GY  G     +GK + F   +R+P +M G
Sbjct: 293 QSVDRAVARTIAHLKDSGQYRRTVIIFSGDNGYTTGHHNR-QGKLWEFSQSMRIPLIMVG 351

Query: 229 PGI 231
           PGI
Sbjct: 352 PGI 354


>gi|223936143|ref|ZP_03628057.1| sulfatase [bacterium Ellin514]
 gi|223895365|gb|EEF61812.1| sulfatase [bacterium Ellin514]
          Length = 456

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 65/223 (29%), Positives = 108/223 (48%), Gaps = 38/223 (17%)

Query: 36  PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKK 95
           PG+  W +      Y N ++N++G K+   E Y  D    ++ + +V F++Q      KK
Sbjct: 127 PGFDHWLSFKGQGVYENPNLNIDG-KVSRVEGYITD----ILNSRAVEFVKQE----HKK 177

Query: 96  PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD--YAPNPDKQWILQVTRKMQ--- 150
           P  L +   A HGP   A ++  ++      H P+ D   A  P       +TRK Q   
Sbjct: 178 PFCLYVGHKAVHGPFTPAERHKELYTKEQIPHPPSIDDDLAGKP------VLTRKEQQGP 231

Query: 151 -----PVHRQFTD-----------LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
                P    F D            L+ ++L+TL ++D+ V ++++ L++  +L NT II
Sbjct: 232 KDGQKPQKVGFDDEAERPMGKVPERLVRQQLRTLMAIDEGVGQLLRALEESRQLENTVII 291

Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGTM 236
           +TSD+GY  G+  L   K + +E  IR P L+R P ++ PGT+
Sbjct: 292 FTSDNGYFWGEHHL-GDKRWAYEESIRDPLLIRYPKLIKPGTV 333


>gi|449132149|ref|ZP_21768311.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula europaea 6C]
 gi|448888596|gb|EMB18910.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula europaea 6C]
          Length = 516

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 61/235 (25%), Positives = 106/235 (45%), Gaps = 20/235 (8%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY----NYSINMNGRKIKHGEDYYNDYYPD 75
           G+ GK+   +      PG+  W +     +YY     Y+IN NG+++       + Y   
Sbjct: 125 GFIGKWHMGHANDDPRPGFDFWFSFKGQGQYYPPGPQYTINDNGKRVPQ-----DGYITP 179

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
           L+   ++ FL Q  Q+   KP  L +S  A H P    P+Y +   +       + +   
Sbjct: 180 LLTRKAIEFLEQ--QTDRDKPFFLYLSHKAVHDPFTPEPKYKDSLKDQPYEFPASSELLK 237

Query: 136 NPDK---QWILQVTRKMQ----PVHR-QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
           +  K   +W+L            +H  Q  +    +  + L+SVDD++  +V++LK +G 
Sbjct: 238 DNLKNRPRWLLDQRNSWHGMDFALHTGQSVEGFYKRYCEALRSVDDSIGAVVQQLKMMGI 297

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
              T +IY  D+G+  G+ GL   K   +E   RVP LM+ P I+ G   + +V+
Sbjct: 298 YEETLVIYMGDNGFMFGEHGLFD-KRVAYETSSRVPMLMQCPAIIKGGTVVEEVV 351


>gi|261346415|ref|ZP_05974059.1| N-acetylglucosamine-6-sulfatase [Providencia rustigianii DSM 4541]
 gi|282565398|gb|EFB70933.1| N-acetylglucosamine-6-sulfatase [Providencia rustigianii DSM 4541]
          Length = 519

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 70/244 (28%), Positives = 102/244 (41%), Gaps = 36/244 (14%)

Query: 12  LLYFDISHGYFGK---------YLNKYNGSYIPPGWREWGALIMNS------KYYNYSIN 56
           L   D S GY GK         Y+  YN       W EW A            Y  Y ++
Sbjct: 136 LQSLDYSTGYIGKWHLDAPYAPYIESYNNPMEGRYWNEWAAPDRRHGFDFWYSYGTYDLH 195

Query: 57  MN----GRKIKHGEDYYNDYY-PDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGP 109
           M     G      +  Y D + P+  A+ +++FLR     F    KP  LV+S   PH P
Sbjct: 196 MKPMYWGNNTPREKPIYVDQWGPEHEADMAIKFLRNENGQFRDETKPFALVVSMNPPHSP 255

Query: 110 EDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
            D  PQ Y +++   TS      +    P+ QW  +      P +         + L  +
Sbjct: 256 YDQVPQKYLDVYKGKTSK-----ELNTRPNVQWDAEYQEGYGPQY-------FKEYLAMV 303

Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
             VDD   +I+ EL+  G  +NT +++ SDHG  LG  G    K+ P+E  +RVP + R 
Sbjct: 304 NGVDDQFGRIMNELEKQGLDDNTLVVFFSDHGSCLGANGQPT-KNNPYEESMRVPMMFRL 362

Query: 229 PGIV 232
           PG +
Sbjct: 363 PGKI 366


>gi|407923746|gb|EKG16811.1| Sulfatase [Macrophomina phaseolina MS6]
          Length = 298

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 51/190 (26%), Positives = 93/190 (48%), Gaps = 26/190 (13%)

Query: 49  KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
           KYY+  +  NG +      Y  +Y  D+I +  + FL +      ++P  +V +  APHG
Sbjct: 118 KYYDAVMTRNGEEPVS---YAGNYSTDVIKDKILGFLDEGLSH--ERPWFVVAAPIAPHG 172

Query: 109 PEDS--------------APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
              +              AP++A++F + T     +++        WI    + ++P++ 
Sbjct: 173 TTTTDPATNGTYFDFAQYAPRHADLFKDYTIPRDSSFNAKVEGGVSWI----KNLEPLND 228

Query: 155 Q---FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
               + D     RL++LQ+VD+ +E  V++L+D G L+ TYIIY++D+GYH+ Q  +  G
Sbjct: 229 SVIAYNDEFQRSRLRSLQAVDEMIEAFVEKLEDAGVLDETYIIYSTDNGYHISQHRMTPG 288

Query: 212 KSFPFEFDIR 221
           K   +    R
Sbjct: 289 KECGYGMSFR 298


>gi|410638706|ref|ZP_11349259.1| predicted sulfatase [Glaciecola lipolytica E3]
 gi|410141234|dbj|GAC16464.1| predicted sulfatase [Glaciecola lipolytica E3]
          Length = 522

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 55/182 (30%), Positives = 90/182 (49%), Gaps = 19/182 (10%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ-----YANMFFNVTSHHTP 129
           ++IA+ +V FL   K     KP M+ + F APH P   APQ     Y      +     P
Sbjct: 218 EVIADAAVDFLTSEKS----KPFMMYVGFLAPHDPR-QAPQAYMGKYPGDSIELPKSFAP 272

Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
           ++ +  N    +I     +  P         + +    ++  D  + +I++ LK  GEL+
Sbjct: 273 SHPF--NQGDYYIRDEILENNPRTPYVIQKAIGEYYAMIEHTDAQIGRIIEALKASGELD 330

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYLLDVLIP 244
           NT I++TSDHG  +G+ GL+ GK   ++  +R PF+M+GP +     VPG M+ L+ + P
Sbjct: 331 NTLIVFTSDHGLAVGRHGLL-GKQNQYDHSVRAPFIMQGPNVPKGKSVPG-MFYLNSVFP 388

Query: 245 QV 246
            V
Sbjct: 389 TV 390


>gi|302538002|ref|ZP_07290344.1| predicted protein [Streptomyces sp. C]
 gi|302446897|gb|EFL18713.1| predicted protein [Streptomyces sp. C]
          Length = 488

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 65/234 (27%), Positives = 105/234 (44%), Gaps = 29/234 (12%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           G FGKYLN +  +  PP + E+   ++   Y +   N+NG          N Y  ++I N
Sbjct: 153 GLFGKYLNSWALADNPPHFEEFA--LLQPGYVDAQWNVNGTV-----QTINGYTTNIIKN 205

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
            ++ FL   K +   +P    ++  A HGP     +YA          TP   ++  P  
Sbjct: 206 RTLSFL--DKAATDTRPWFAYVTPYASHGPRTPEAKYAG---------TPVPHWSGRPS- 253

Query: 140 QWILQVTRKMQPVH-RQFTDLLM------TKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
             + +  R  +P + +  T  L        ++L+TL SVDDAV+     L+ LG+L+NT 
Sbjct: 254 --VPENDRSDKPAYIKNATGTLADGQTIRAEQLRTLLSVDDAVQAFKDRLQALGQLDNTL 311

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           +IY  D+G+  G  G    KS P+     VPF +  P    G   + D ++  +
Sbjct: 312 VIYIGDNGFSWGDHGWT-AKSVPYRQAHEVPFYLSWPAGGRGAGAVDDRIVANI 364


>gi|255036000|ref|YP_003086621.1| sulfatase [Dyadobacter fermentans DSM 18053]
 gi|254948756|gb|ACT93456.1| sulfatase [Dyadobacter fermentans DSM 18053]
          Length = 506

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 72/256 (28%), Positives = 115/256 (44%), Gaps = 36/256 (14%)

Query: 18  SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYN--YSINMNGRKIKHGEDYYNDYYPD 75
           + GY    + K++    P G+  +  L    +Y N       + R+ + G   Y  Y  D
Sbjct: 113 TAGYQTALIGKWHLVKEPAGFDYYCVLPGQGRYRNPIMMTKEDFREDQKGGKVYEGYSTD 172

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
           +I + S+ +L +  +S   KP  L   F A H P D   +Y N   +V   + PA D+A 
Sbjct: 173 VITDQSIAWLEKRDKS---KPFYLSTHFKATHEPFDYPKRYENYLEDVEIPY-PA-DFAD 227

Query: 136 N-------PDKQWILQV--------TRKMQPVHR---QFTDLLMTKR----------LQT 167
                       W L +        T K  P H    Q  D +  ++          ++ 
Sbjct: 228 RGATGSGRTHDGWPLDLLGTRYEKGTGKEYPGHSFSLQGLDSVAARKKIYQKFVKDYIRC 287

Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
             +VDD + ++++ LKD GEL+NT IIYTSD GY LG+ G    K F +E  IR+PF++ 
Sbjct: 288 GAAVDDNIGRLIQYLKDAGELDNTIIIYTSDQGYFLGEHGFFD-KRFIYEPSIRMPFVIS 346

Query: 228 GPGIVPGTMYLLDVLI 243
            P  +P    + D+++
Sbjct: 347 YPKEIPKGKRVNDLIL 362


>gi|449134522|ref|ZP_21770018.1| choline sulfatase [Rhodopirellula europaea 6C]
 gi|448886797|gb|EMB17190.1| choline sulfatase [Rhodopirellula europaea 6C]
          Length = 473

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 52/182 (28%), Positives = 95/182 (52%), Gaps = 15/182 (8%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ ++IAN S  F   + +    +P  + ++F A H P  +  +Y + +      V +++
Sbjct: 183 HWSEVIANHSDDFFADAAKH--DQPAFMYLAFNATHDPRQAPQEYLDKYPAENILVPANY 240

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
            P + YA        L+  R + P  R    + + +R     L  +D  + +I+  ++  
Sbjct: 241 QPLHPYAEEIGCGRKLRDER-LAPFPRTEYAVRVHRREYYALLTHMDAMIGRILDSVEAS 299

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLD 240
           GE +NT+I +T+DHG  +GQ GL+ GK  P++  +RVPFL++GPG+  G      +YL D
Sbjct: 300 GEADNTWIFFTADHGLAVGQHGLL-GKQNPYDHSVRVPFLVKGPGVKAGVRIEEPIYLQD 358

Query: 241 VL 242
           V+
Sbjct: 359 VM 360


>gi|223936731|ref|ZP_03628641.1| sulfatase [bacterium Ellin514]
 gi|223894582|gb|EEF61033.1| sulfatase [bacterium Ellin514]
          Length = 515

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 66/238 (27%), Positives = 107/238 (44%), Gaps = 31/238 (13%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYS-INMNGRKIKHGEDYYNDYYPDLIA 78
           GY+   + K++    P G+  W  L    KY++   I M  RK   G      Y  ++I 
Sbjct: 149 GYYTGMVGKWHLESDPTGFDYWNVLPGQGKYHDPDFIEMGNRKKIEG------YATEIIT 202

Query: 79  NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY--DYA-- 134
           + S+ FL+   Q    KP  L+    APH P +   ++A M+ +VT      +  DY   
Sbjct: 203 DLSINFLKNRPQD---KPFFLMCHHKAPHRPWEPDEKHAKMYEDVTIPEPETFNDDYKTR 259

Query: 135 PNPDKQWILQVTRKMQP---------------VHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
            +   +  +++ R + P               + +      +   L+ + SVDD V +++
Sbjct: 260 SSAATEATMRIDRDLTPKDLKQAPPPGLAGEALKKWKYQRYIKDYLRCIASVDDNVGRLL 319

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVPGTM 236
           K L D G   NT +IYTSD G+ LG       K F +E  +R+PF++R P  I P T+
Sbjct: 320 KFLDDSGLAENTIVIYTSDQGFFLGDHNWFD-KRFMYEESLRMPFIVRYPNHIKPATV 376


>gi|149196998|ref|ZP_01874051.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Lentisphaera araneosa HTCC2155]
 gi|149140108|gb|EDM28508.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Lentisphaera araneosa HTCC2155]
          Length = 495

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 60/240 (25%), Positives = 106/240 (44%), Gaps = 43/240 (17%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           G+ GK+          PG+  W       KY++ + N+NG + K        YY   + +
Sbjct: 121 GWIGKWHMGEEDDSKRPGFDYWVTHKGQGKYWDTTFNVNGERKK-----VPGYYAHKVTD 175

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
            ++ FL +  +S   KP  L +   APHGP     +Y +++     + TP     P PD 
Sbjct: 176 MAIDFLNKVDKS---KPFALCLGHKAPHGPFIPEAKYDSIY-----NDTPV----PYPDS 223

Query: 140 QWILQ-----VTRKMQPVHRQFTDLLMTKR--------------------LQTLQSVDDA 174
            W L      +  ++   H  +  L   ++                      T+ SVDD+
Sbjct: 224 SWKLGDKPKWIVDRLPTWHGIYGPLYGFRKDFPNDKASAIVDFEHFVRSYTATINSVDDS 283

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           V +I   L+++G L+NT +I+TSD+G+ LG+ G++  ++   E  + +P  +R P  + G
Sbjct: 284 VGRIYDHLEEMGILDNTILIFTSDNGFLLGEHGMIDKRTM-HEASVSIPLTVRFPKKIKG 342


>gi|126728061|ref|ZP_01743877.1| choline sulfatase [Sagittula stellata E-37]
 gi|126711026|gb|EBA10076.1| choline sulfatase [Sagittula stellata E-37]
          Length = 506

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 53/185 (28%), Positives = 89/185 (48%), Gaps = 13/185 (7%)

Query: 69  YNDYYPDLIANDSVRFL---RQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS 125
           Y   Y D +   + R+L    + + +  + P  +V SF  PH P  + P++ +++ +   
Sbjct: 158 YQTDYDDEVEFAARRWLIDRGRDRAAGDQTPFCMVASFIHPHDPYVAKPEWWDLYSD-DD 216

Query: 126 HHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR----LQTLQSVDDAVEKIVKE 181
              P Y  AP        +V   ++  +   TD  + +     L  +   D  V  +VK 
Sbjct: 217 VDMPEYSLAPEEQDPGSRRVMDGIEASYIPLTDEEVHRARRAYLANVSYFDSKVGALVKT 276

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI----VPGTMY 237
           L+D+GEL+NT ++ T+DHG  LG+ GL    +F FE   R+P +M GPG+     P    
Sbjct: 277 LEDIGELDNTLLVVTADHGDMLGERGLWFKMNF-FEHSARIPMIMTGPGVAKGAAPNACS 335

Query: 238 LLDVL 242
           L+D+L
Sbjct: 336 LIDLL 340


>gi|305665307|ref|YP_003861594.1| choline sulfatase [Maribacter sp. HTCC2170]
 gi|88710062|gb|EAR02294.1| choline sulfatase [Maribacter sp. HTCC2170]
          Length = 503

 Score = 73.9 bits (180), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 50/184 (27%), Positives = 95/184 (51%), Gaps = 19/184 (10%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ +++ +D+V F+ Q+K      P  + ++F APH P  +  +Y +M+     ++    
Sbjct: 209 HWSEVLKDDAVGFIDQAK--VKDNPFFMYLAFNAPHDPRQAPQEYVDMYSLDKISLPKSW 266

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIVKELK 183
            P Y Y  +      L+    + P  R  T+    K +Q     +  +D+ + +I+  L+
Sbjct: 267 MPMYPYKDSIGNGPGLR-DEALAPFPR--TEYATKKHIQEYYALISHMDNQIGEILDALE 323

Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYL 238
           + G++ NTY+I+T+DHG  +G+ GL+ GK   F+  IR PF++ GP I     +   +YL
Sbjct: 324 NSGKMENTYVIFTADHGLAIGKHGLL-GKQSQFDHSIRPPFMIVGPDIPKDASIDKDIYL 382

Query: 239 LDVL 242
            D +
Sbjct: 383 QDAM 386


>gi|440479319|gb|ELQ60092.1| arylsulfatase precursor [Magnaporthe oryzae P131]
          Length = 668

 Score = 73.9 bits (180), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 35/78 (44%), Positives = 51/78 (65%), Gaps = 4/78 (5%)

Query: 154 RQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
           + FTD       + LQ VD+ +E +V  L+  G+L+NTY++YTSD+GYHLGQ  L  GK+
Sbjct: 286 QNFTDYAA----EALQGVDEIIEDVVAMLEANGQLDNTYVVYTSDNGYHLGQHRLPGGKA 341

Query: 214 FPFEFDIRVPFLMRGPGI 231
             +  D  +PF +RGPG+
Sbjct: 342 LFYGEDTNLPFAVRGPGV 359


>gi|436835021|ref|YP_007320237.1| sulfatase [Fibrella aestuarina BUZ 2]
 gi|384066434|emb|CCG99644.1| sulfatase [Fibrella aestuarina BUZ 2]
          Length = 522

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 64/244 (26%), Positives = 108/244 (44%), Gaps = 42/244 (17%)

Query: 36  PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQ--SKQSFS 93
           PG+  W +      YYN ++N+NG+++ + +   + Y  DL+ + ++ +L      Q  S
Sbjct: 143 PGFNHWVSFRGQGVYYNPTLNVNGKQVPYTD---SSYTTDLLTDYALNWLNSLNKSQGQS 199

Query: 94  KKPIMLVMSFPAPHGPEDSAPQYANMFFNV--------------TSHHTPAY----DYAP 135
            +P  L +S  A H     A ++  M+ N+              +S + PA         
Sbjct: 200 SQPFCLYLSHKAVHAEFQPAKRHRGMYRNMPINYPQSMFLTATDSSKNWPALAGNGQTRR 259

Query: 136 NPDK-----------QWILQVTRKMQPV----HRQ--FTDLLMTKRLQTLQSVDDAVEKI 178
           NP+             W+ Q       V    H Q  F D    +  +TL  VDD+V ++
Sbjct: 260 NPETGAMKANLADMPNWVKQQRYSWHGVDYMYHGQIGFNDFY-RQYCETLMGVDDSVGRV 318

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
           +K L D G+L NT ++Y  D+G+  G+ GL+  K   +E  +RVP L+  P +  G   L
Sbjct: 319 LKWLADNGQLENTLVVYMGDNGFSFGERGLID-KRHMYEESMRVPLLVHCPALTKGGTKL 377

Query: 239 LDVL 242
             V+
Sbjct: 378 EQVV 381


>gi|390958071|ref|YP_006421828.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
 gi|390958412|ref|YP_006422169.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
 gi|390412989|gb|AFL88493.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
 gi|390413330|gb|AFL88834.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
          Length = 450

 Score = 73.6 bits (179), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 48/187 (25%), Positives = 85/187 (45%), Gaps = 32/187 (17%)

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF---- 120
           G +    +  ++ A+ ++ FL + +    K P  + + F APH P  +  +Y  M+    
Sbjct: 148 GPEGSTKHSSEVYADSAISFLDRQRAEHDK-PFFMYVGFNAPHDPRQAPEEYQAMYPVDK 206

Query: 121 ------------FNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
                       FN    HT     AP P  +  ++  RK              +    +
Sbjct: 207 IALPPNYLPQHPFNQGDFHTRDEQLAPFPRTELDVKTHRK--------------EYYAII 252

Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
             +D  + +++  L   G+ +NTY+I T+DHG  +G+ GL+ GK   +E  +R+P ++RG
Sbjct: 253 THMDAQIGRVLDALDRSGQADNTYVIMTADHGLAVGEHGLM-GKQNQYECSMRMPLIVRG 311

Query: 229 PGIVPGT 235
           PGI PG+
Sbjct: 312 PGIKPGS 318


>gi|305666556|ref|YP_003862843.1| mucin-desulfating sulfatase [Maribacter sp. HTCC2170]
 gi|88708825|gb|EAR01060.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Maribacter sp. HTCC2170]
          Length = 502

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 56/226 (24%), Positives = 100/226 (44%), Gaps = 18/226 (7%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           G+FGK+    +G    PG+  W +      YYN ++N+NG ++ + +   + Y  DL+  
Sbjct: 135 GFFGKWHMGSHGDEPQPGFTHWESFPGQGVYYNPTLNINGERVSYKD---STYITDLLTE 191

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF------FNVTSHHTPAYDY 133
            ++ +L++  +    KP    +S  A H     A ++   +         T   T    Y
Sbjct: 192 HTIDWLKKRDKD---KPFFAYLSHKAVHAGFKPARRHKGKYKGKRIALPATYDQTKTGAY 248

Query: 134 APNPDKQWILQVTRKMQPVHRQFTD-----LLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
                 QW+         V   + D      ++    +TL  VD++V  I+  LK  G  
Sbjct: 249 RDLKWPQWVADQRISWHGVDYMYHDNRDIHEMVVDYCETLLGVDESVGAIMDYLKKEGLD 308

Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            +T +IY  D+G+  G+ GL+  + F +E  ++VP L+R P +  G
Sbjct: 309 ESTMVIYMGDNGFSWGEHGLIDKRHF-YEESVKVPLLVRCPELFDG 353


>gi|329928120|ref|ZP_08282066.1| arylsulfatase [Paenibacillus sp. HGF5]
 gi|328937997|gb|EGG34396.1| arylsulfatase [Paenibacillus sp. HGF5]
          Length = 497

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/157 (33%), Positives = 75/157 (47%), Gaps = 9/157 (5%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP- 135
           +   S+ FLR+       KP  L MSF  PH P D    Y N++ +      P  D+A  
Sbjct: 192 VVTQSIDFLRRRDPG---KPFFLWMSFVRPHPPFDPPQAYLNLYEDADIPEPPVGDWAET 248

Query: 136 -NPDKQWILQVT-RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
            +PD      +T R + P  R      M      +  +D  + + ++ L + GEL NT I
Sbjct: 249 EDPDLAGYSPITGRGLVPKRRLRK--AMAAYYALITHIDHQIGRFLQSLNEYGELGNTVI 306

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           ++TSDHG  LG   L + KS P+E   RVPF++  PG
Sbjct: 307 LFTSDHGELLGDHHLFR-KSLPYEGSTRVPFIVNDPG 342


>gi|417304251|ref|ZP_12091280.1| choline sulfatase [Rhodopirellula baltica WH47]
 gi|327539463|gb|EGF26078.1| choline sulfatase [Rhodopirellula baltica WH47]
          Length = 477

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/182 (28%), Positives = 95/182 (52%), Gaps = 15/182 (8%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ ++IAN S  F   + +    +P  + ++F A H P  +  +Y + +      V +++
Sbjct: 187 HWSEVIANHSDDFFADAAKH--DQPTFMYLAFNATHDPRQAPQEYLDKYPAENILVPANY 244

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
            P + YA        L+  R + P  R    + + +R     L  +D  + +I+  ++  
Sbjct: 245 QPLHPYAEEIGCGRNLRDER-LAPFPRTEYAVRVHRREYYALLTHMDAMIGRILDSVEAS 303

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLD 240
           G+ +NT+I +T+DHG  +GQ GL+ GK  P++  +RVPFL++GPG+  G      +YL D
Sbjct: 304 GKADNTWIFFTADHGLAVGQHGLL-GKQNPYDHSVRVPFLVKGPGVKAGGRVEEPIYLQD 362

Query: 241 VL 242
           V+
Sbjct: 363 VM 364


>gi|149177416|ref|ZP_01856020.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Planctomyces maris DSM 8797]
 gi|148843749|gb|EDL58108.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Planctomyces maris DSM 8797]
          Length = 510

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 92/192 (47%), Gaps = 42/192 (21%)

Query: 68  YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHH 127
           Y + +  D+I + S+++L+  KQ   +KP  L+  F APH   ++A +Y  ++ +V    
Sbjct: 171 YDSKHSSDVITDLSLKWLKNRKQK--EKPFFLMHHFKAPHDNFENAERYDWLYQDVV--- 225

Query: 128 TPAYDYAPNPDKQW----------------ILQVTRKMQPVHRQFTDLLMT--------- 162
                  P PD  W                I     +    H  F D L++         
Sbjct: 226 ------IPEPDSLWHRGQHGPLHHALYGTSIGLRNERRNMGHHMFVDPLLSSEQYTREAY 279

Query: 163 -----KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFE 217
                K L+ ++ VDD ++++V+ LK  GEL+NT IIYT+D G+ LG+   +  K + +E
Sbjct: 280 QRYLKKYLRCVRGVDDNIQRLVEYLKQTGELDNTIIIYTADQGFMLGEHDYID-KRWMYE 338

Query: 218 FDIRVPFLMRGP 229
             +R+PF++R P
Sbjct: 339 ESLRMPFIVRYP 350


>gi|440716907|ref|ZP_20897409.1| choline sulfatase [Rhodopirellula baltica SWK14]
 gi|436437975|gb|ELP31557.1| choline sulfatase [Rhodopirellula baltica SWK14]
          Length = 477

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/182 (28%), Positives = 95/182 (52%), Gaps = 15/182 (8%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ ++IAN S  F   + +    +P  + ++F A H P  +  +Y + +      V +++
Sbjct: 187 HWSEVIANHSDDFFADAAKH--DQPTFMYLAFNATHDPRQAPQEYLDKYPAENILVPANY 244

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
            P + YA        L+  R + P  R    + + +R     L  +D  + +I+  ++  
Sbjct: 245 QPLHPYAEEIGCGRNLRDER-LAPFPRTEYAVRVHRREYYALLTHMDAMIGRILDSVEAS 303

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLD 240
           G+ +NT+I +T+DHG  +GQ GL+ GK  P++  +RVPFL++GPG+  G      +YL D
Sbjct: 304 GKADNTWIFFTADHGLAVGQHGLL-GKQNPYDHSVRVPFLVKGPGVEAGGRVEEPVYLQD 362

Query: 241 VL 242
           V+
Sbjct: 363 VM 364


>gi|294053768|ref|YP_003547426.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613101|gb|ADE53256.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 444

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/177 (28%), Positives = 80/177 (45%), Gaps = 20/177 (11%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
           + Y  DL  N +V F+   +Q+   +P  L  +F APH P     ++ +       H+ P
Sbjct: 154 DTYSVDLFCNTAVDFIENYQQN---QPFFLYTAFTAPHDPRTPPKEFLD-------HYPP 203

Query: 130 AYDYAPNP---------DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
                PN          D + I        P   +     +      ++S+D  + +++ 
Sbjct: 204 QTVRVPNNYADEHPFDLDVRDIRDECLAPYPRTPERIQEELAAYYGMIESMDSGIARVLA 263

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY 237
            L+  GEL+NT I+YT DHG  +GQ GL+ GK   +E   RVP L+ GPG+  G  Y
Sbjct: 264 ALEARGELDNTIIVYTGDHGLAVGQHGLL-GKQNVYEHSARVPLLLSGPGVPKGEAY 319


>gi|90416259|ref|ZP_01224191.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [gamma proteobacterium HTCC2207]
 gi|90331984|gb|EAS47198.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [marine gamma proteobacterium HTCC2207]
          Length = 360

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 87/186 (46%), Gaps = 31/186 (16%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP--------------- 114
           N Y  D++   +VRF+  S    + +P  L +   A H PE S                 
Sbjct: 38  NGYMTDILTQRAVRFIHDS----ASQPFFLYIGHKAVH-PEASQRNDGSADLSVKREFIP 92

Query: 115 ------QYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP-----VHRQFTDLLMTK 163
                 +Y N   N    +    D++P  +   I  +  K        ++  F++  +  
Sbjct: 93  AKRHQGKYKNAEVNTVERYIATQDFSPLNESLAIRHMLEKSDERWYRWINSGFSEDTIRA 152

Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
           R + + SVD+++ +IVK L+++  L+NT II+TSD+GY  G+ GL   + +P+E  I+VP
Sbjct: 153 RAEMMLSVDESLGEIVKTLQNMDLLDNTLIIFTSDNGYFYGEHGLTIERRYPYEEAIKVP 212

Query: 224 FLMRGP 229
            ++R P
Sbjct: 213 LIIRKP 218


>gi|149199161|ref|ZP_01876200.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149137758|gb|EDM26172.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 451

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/185 (27%), Positives = 96/185 (51%), Gaps = 20/185 (10%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ +++AN+S+ F+ ++K     KP  + ++F A H P  +  ++ +M+     +V  + 
Sbjct: 168 HWSEVVANNSIDFIERAKNV--NKPFFMYIAFNATHDPRQAPKEFIDMYPLEDMDVPKNF 225

Query: 128 TPAYDY----APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQS-VDDAVEKIVKEL 182
              Y Y      +P +   L+  R M      F   +  +    L + +D  + +I+K +
Sbjct: 226 LANYPYDTEIGCDPKR---LRDERLMPTPRTDFAVKVHRQEYFALATHMDQQIGRILKAI 282

Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MY 237
           +  G+ +NTYII+T+DHG  +G  GLV GK   +E  +RVPF + GP +  G+     +Y
Sbjct: 283 ELSGKADNTYIIFTADHGLAVGNHGLV-GKQSMYEHSMRVPFFIAGPKMKKGSVFETPIY 341

Query: 238 LLDVL 242
           L D++
Sbjct: 342 LQDIM 346


>gi|406834550|ref|ZP_11094144.1| choline sulfatase [Schlesneria paludicola DSM 18645]
          Length = 486

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/182 (28%), Positives = 91/182 (50%), Gaps = 15/182 (8%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ +++A+D+V FLRQ+       P  + ++F APH P  S  ++  M+     ++  + 
Sbjct: 194 HWSEIVADDAVDFLRQAADE--PAPFFMYVAFNAPHDPRQSPREFVEMYPPEKISLPVNF 251

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
            P Y +      +  L+   K+ P  R    + + +      +  +D  + +I+  LK  
Sbjct: 252 LPEYPFKDLIGCEPTLR-DEKLAPFPRTERAIRVHRAEYYAIVSHLDRQIGRILDALKAS 310

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLD 240
           G+  NT+I +T+DHG  +G+ GLV GK   +E  +RVPF+  GP    G      +YL D
Sbjct: 311 GKEQNTWIFFTADHGLAVGRHGLV-GKQNLYEHSVRVPFVAVGPKAAAGERIDAPIYLQD 369

Query: 241 VL 242
           V+
Sbjct: 370 VM 371


>gi|260062063|ref|YP_003195143.1| N-acetylglucosamine-6-sulfatase [Robiginitalea biformata HTCC2501]
 gi|88783625|gb|EAR14796.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Robiginitalea biformata HTCC2501]
          Length = 490

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 63/242 (26%), Positives = 107/242 (44%), Gaps = 25/242 (10%)

Query: 11  NLLYFDI---SHGYFGKYLNKYNGSYIP----PGWREWGALIMNSKYYNYSINMNGRKIK 63
           NL YF       GY   +L K++ S       PG+  W +      YYN ++N+NG +I+
Sbjct: 107 NLTYFPQYLQEAGYQTAFLGKWHMSSHTDEPRPGFDHWESFFGQGVYYNPTLNINGERIE 166

Query: 64  HGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV 123
           + +   + Y  DL+   +V +L    +    KP  L +S  A H     A ++   +   
Sbjct: 167 YKD---STYITDLLTEHAVDWLESRDKD---KPFFLYLSHKAVHAEFQPARRHKGRYAGK 220

Query: 124 TSHHTPAYDYAPNP---DKQWILQVTRKMQPVH------RQFTDL--LMTKRLQTLQSVD 172
                P Y+        D +W   V  +    H          D+  L+    +TL  VD
Sbjct: 221 KIELPPTYEQTKTGAWRDLKWPEWVADQRVSWHGVDYMYHSNIDMQELVQAYCETLLGVD 280

Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           D+V  +++ L++ G    T +IY  D+G+  G+ GL+  + F +E  ++VP L+R P + 
Sbjct: 281 DSVGAVLEYLEEEGLDEETLVIYMGDNGFSWGEHGLIDKRHF-YEESVKVPLLVRCPELF 339

Query: 233 PG 234
            G
Sbjct: 340 EG 341


>gi|440717674|ref|ZP_20898156.1| choline sulfatase [Rhodopirellula baltica SWK14]
 gi|436437294|gb|ELP30950.1| choline sulfatase [Rhodopirellula baltica SWK14]
          Length = 507

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 47/174 (27%), Positives = 82/174 (47%), Gaps = 24/174 (13%)

Query: 82  VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP-NPDKQ 140
           + +L Q   +  + P ++   F  PH   D  P+    +    ++HT      P NP + 
Sbjct: 187 LEYLDQRDAAADEDPFLIYFGFSHPHDTRDGTPELLAKYG--ATNHTDRDTLPPANPKQP 244

Query: 141 WILQVTRKMQPVHRQFTDLL--------------------MTKRLQTLQSVDDAVEKIVK 180
            +     +  P H    +L                     + +     +++D  + K+++
Sbjct: 245 PLPPNHLEAHPFHHGHPNLRDEVSVKGVWDRRDERTIRNELGREFACSENIDIQIGKVLR 304

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            L+++GEL+NTYI+YTSDHG  +G+ GL +GK   +E   RVPF+++GPGI  G
Sbjct: 305 RLEEMGELDNTYIVYTSDHGMAIGRHGL-QGKQNLYEHTWRVPFIIQGPGIPKG 357


>gi|238921150|ref|YP_002934665.1| sulfatase [Edwardsiella ictaluri 93-146]
 gi|238870719|gb|ACR70430.1| sulfatase [Edwardsiella ictaluri 93-146]
          Length = 510

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 70/254 (27%), Positives = 107/254 (42%), Gaps = 37/254 (14%)

Query: 5   VELTSRNLLYFDISHGYFGK---------YLNKYNGSYIPPGWREWGALIMNSKY---YN 52
            +  S  L   D S GY GK         ++  YN       W EW A      +   Y+
Sbjct: 119 AQCWSDVLKKLDYSTGYIGKWHLDAPHAPFIESYNNPMEGRYWNEWTAPDRRHGFDFWYS 178

Query: 53  YSI-NMNGRKIKHGED-------YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMS 102
           Y   +++ R +  G +       Y + +  +  A+ +++FLR  K SF  + KP  LV+S
Sbjct: 179 YGTYDLHMRPMYWGNNTPRESPVYVDQWSAEHEADMAIKFLRNEKGSFRAADKPFALVVS 238

Query: 103 FPAPHGPEDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLM 161
              PH P D  PQ Y + F   +S      D    P+  W         P +        
Sbjct: 239 MNPPHSPYDQVPQKYLDRFPGKSSR-----DLNTRPNVDWQANYQEGYGPQY-------F 286

Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
            + +  +  VD+   +I+ EL   G  +NT +++ SDHG  LG  G    K+  +E  IR
Sbjct: 287 KEYMAMVNGVDEQFGRILAELDKQGLADNTLVVFFSDHGCCLGANGQPT-KNVAYEEAIR 345

Query: 222 VPFLMRGPG-IVPG 234
           +P + R PG I PG
Sbjct: 346 IPMMFRLPGKIAPG 359


>gi|296121821|ref|YP_003629599.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296014161|gb|ADG67400.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 523

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 48/167 (28%), Positives = 80/167 (47%), Gaps = 25/167 (14%)

Query: 83  RFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS----------HHTPAYD 132
           R ++QS  +   KP ++ + +  PH P    P++  M+ +  S            TP + 
Sbjct: 221 RQVQQSSSTGESKPFLMCVGYFLPHVPCYVTPEWLAMYPDDDSILPFIEKDDRKDTPRFS 280

Query: 133 Y-----APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
           +      P P  +W+ Q        H  +  L+ +  L +   VD  + +++  L+  GE
Sbjct: 281 WYLHWRLPEPRLKWLQQ--------HEHWRSLVRSY-LASTSYVDAQIGRLLAALEATGE 331

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            NNT I+  SDHG+HLG+ G + GK+  +E   RVP L  GPG++ G
Sbjct: 332 ANNTLIVLWSDHGWHLGEKG-ITGKNTLWERSTRVPLLFAGPGVLAG 377


>gi|340616913|ref|YP_004735366.1| sulfatase [Zobellia galactanivorans]
 gi|339731710|emb|CAZ94975.1| Sulfatase, family S1-28 [Zobellia galactanivorans]
          Length = 515

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 48/194 (24%), Positives = 96/194 (49%), Gaps = 36/194 (18%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ +++ +D++ F+ ++K+   + P  + ++F A H P  +  ++ +M+      V  + 
Sbjct: 219 HWSEVVRDDALSFIEKAKKK--EDPFFMYLAFNATHDPRQAPQRFLDMYPVKDIKVPDNF 276

Query: 128 TPAYDY---------------APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVD 172
            P Y +               AP P  ++ ++V       HRQ    L+T        +D
Sbjct: 277 MPEYPWKDDMGNPPSLRDEALAPYPRTEYAVKV-------HRQEYYALLT-------HMD 322

Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           + + KI+  L+  G+++NTYI + +DHG  +G  GL+ GK   F+  +R+P ++ GPGI 
Sbjct: 323 EQIGKILDALEASGKMDNTYIFFGADHGLSVGHHGLI-GKQSMFDHSVRIPMMVVGPGIP 381

Query: 233 PGTMYLLDVLIPQV 246
            G +   DV +  +
Sbjct: 382 KGKLLDQDVYLQDI 395


>gi|171915410|ref|ZP_02930880.1| iduronate-2-sulfatase [Verrucomicrobium spinosum DSM 4136]
          Length = 514

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 49/162 (30%), Positives = 77/162 (47%), Gaps = 8/162 (4%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D +  D  R+L     +   +P  + M F  PH P  +   Y +++        P  +  
Sbjct: 189 DGMVADDARWLLSRCAADKSRPFFVAMGFFRPHTPYVAPKAYFDLYPVEKMPLYPPVEKV 248

Query: 135 PNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELNN 190
           P       L   +K Q    Q TD L  + +Q    ++  +D  V ++V+ELK LG   +
Sbjct: 249 PAEVPAAALVSYKKEQ---DQLTDDLRQQAVQAYYASISFMDAQVGRVVEELKQLGLAES 305

Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           T I++TSDHGYH+G+ GL +  S  FE   RVP ++  PG+ 
Sbjct: 306 TIIVFTSDHGYHMGEHGLWQKMSL-FEASARVPLMIVAPGVA 346


>gi|326334100|ref|ZP_08200327.1| arylsulfatase [Nocardioidaceae bacterium Broad-1]
 gi|325948076|gb|EGD40189.1| arylsulfatase [Nocardioidaceae bacterium Broad-1]
          Length = 511

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 70/256 (27%), Positives = 112/256 (43%), Gaps = 37/256 (14%)

Query: 21  YFGKYLNKY--NGSY--IPPGWREWGALIMNSKYYNY---SINMNGRKIKHGEDYYNDYY 73
           + GKYLN+Y  +G+   IPPGW EW A + +   YNY     N+NGR +K        Y 
Sbjct: 137 FTGKYLNEYGEDGTDRDIPPGWDEWRATV-DPSTYNYLGAKFNVNGRLVKP-----KGYS 190

Query: 74  PDLIANDSVRFLR--QSKQSFSKKPIMLVMSFPAPH--GPEDSAPQYANMFFNVTSHHTP 129
             +I   +   L+  + K     KP    +++ APH  GP +             +   P
Sbjct: 191 TTVITQQAQAALQDARKKSGAEAKPWFQWVNYVAPHIGGPTEKGDPKQRFPGTKGAIGVP 250

Query: 130 --------AYDYAPNPDKQWILQVTRKMQP-----------VHRQFTDLLMTKRLQTLQS 170
                   AY   P P +  +    RK  P           V +   +L   +R+++ +S
Sbjct: 251 VPDQQDRDAYADKPIPQQPNLFPEGRKQFPKGSPSRKNPSQVEKDAYELSYQRRIESARS 310

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           +D  +  ++  LK  GEL  T +++TSD+G+  G      GK + +E  +R+P LM GPG
Sbjct: 311 LDRNIASLLGGLKKSGELARTLVVFTSDNGFSNGYHNF-NGKLWYYEESLRIPVLMSGPG 369

Query: 231 IVPGTMYLLDVLIPQV 246
           +  G      +  P +
Sbjct: 370 VPKGRKVATTLTNPDI 385


>gi|326334096|ref|ZP_08200323.1| arylsulfatase [Nocardioidaceae bacterium Broad-1]
 gi|325948072|gb|EGD40185.1| arylsulfatase [Nocardioidaceae bacterium Broad-1]
          Length = 514

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 69/249 (27%), Positives = 105/249 (42%), Gaps = 35/249 (14%)

Query: 23  GKYLNKYN-GSY---IPPGWREWGALIMNSKYYNYS--INMNGRKIKHGEDYYNDYYPDL 76
           GKYLN Y  GS    +PPGW +W A +  S Y   S   N+NG  IK        Y   +
Sbjct: 146 GKYLNGYGEGSSRGDVPPGWDQWRATVDPSTYNFRSPKFNVNGEIIKS-----KGYSSTV 200

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-----GPEDSAPQY----ANMFFNVTSHH 127
           I   +   +  + +  S KP    +++ APH     GP+D    Y    A +   V    
Sbjct: 201 ITEHAKAGI--AAERGSGKPWFTWVNYVAPHHGGPSGPDDPKKIYPGTDAALSVTVPDRQ 258

Query: 128 TPA-YDYAPNPDKQWILQVTRK-----------MQPVHRQFTDLLMTKRLQTLQSVDDAV 175
               YD  P P +  +                   P  +    +   +RL+  +S+D  +
Sbjct: 259 DRGFYDNVPIPARPNLFPEDTSGYAKGSPGRGTFNPRKKDALRIAYQRRLEATRSLDRNI 318

Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
             ++  LK  GEL  T +++TSD+G+  G   L  GK + ++  +R+P LM GPG+  G 
Sbjct: 319 ASLIGSLKKSGELKRTLVVFTSDNGFSNGYHNL-NGKLWHYDESLRIPVLMSGPGVPRGR 377

Query: 236 MYLLDVLIP 244
                V  P
Sbjct: 378 TVRTPVTNP 386


>gi|149177547|ref|ZP_01856150.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Planctomyces maris DSM 8797]
 gi|148843697|gb|EDL58057.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Planctomyces maris DSM 8797]
          Length = 633

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/234 (23%), Positives = 102/234 (43%), Gaps = 31/234 (13%)

Query: 20  GYFGKYLNKY---NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           GY   Y+ K+   N     PG+  W ++      ++ ++N+NG +I+     +  +  D+
Sbjct: 282 GYATAYVGKWHMGNDDTARPGFDHWVSMKGQGTSFDPTLNINGERIQ-----FKGHTTDV 336

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE------DSAPQYANMFFNVTSHHTPA 130
           +      F++       +KP  L ++  A H PE       S    +   F     H   
Sbjct: 337 LNQKVNEFVKAQ----GEKPFCLYIAHKALH-PELTQRDDGSITDPSAAKFMPAKRHEKL 391

Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQF------------TDLLMTKRLQTLQSVDDAVEKI 178
           Y     P +  ++      + + R              +D ++  RL+ L  +D+ V  +
Sbjct: 392 YSDDAIPRRLNVVDTLEGKRALKRTVPGLPPLSQKTGTSDEVIRDRLRMLAGIDEGVGSL 451

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
            + L+  G+L++T  ++TSDHGY  G+ GL   +  P+E  IRVP L+R P ++
Sbjct: 452 CELLESQGKLDDTVFVFTSDHGYWYGEHGLSVERRLPYEEGIRVPLLVRYPPVI 505


>gi|261407393|ref|YP_003243634.1| arylsulfatase [Paenibacillus sp. Y412MC10]
 gi|261283856|gb|ACX65827.1| sulfatase [Paenibacillus sp. Y412MC10]
          Length = 496

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/157 (32%), Positives = 76/157 (48%), Gaps = 9/157 (5%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP- 135
           +   S+ FLR+   +   KP  L MSF  PH P D    Y +++ +      P  D+A  
Sbjct: 191 VVTQSIDFLRRRDPA---KPFFLWMSFVRPHPPFDPPQAYLDLYEDAEIPEPPVGDWAET 247

Query: 136 -NPDKQWILQVT-RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
            +PD      +T R + P  R      M      +  +D  + + ++ L + GEL NT I
Sbjct: 248 EDPDLSGYSPITGRGLVPKRRLRK--AMAAYYALITHIDHQIGRFLQSLNEYGELGNTVI 305

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           ++TSDHG  LG   L + KS P+E   RVPF++  PG
Sbjct: 306 LFTSDHGELLGDHHLFR-KSLPYEGSTRVPFIVSDPG 341


>gi|417301585|ref|ZP_12088736.1| choline sulfatase [Rhodopirellula baltica WH47]
 gi|327542177|gb|EGF28670.1| choline sulfatase [Rhodopirellula baltica WH47]
          Length = 507

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 82/174 (47%), Gaps = 24/174 (13%)

Query: 82  VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP-NPDKQ 140
           + +L Q   +  + P ++   F  PH   D  P+    +    ++HT      P NP + 
Sbjct: 187 LEYLDQRDAAADEDPFLIYFGFSHPHDTRDGTPELLAKYG--ATNHTDRDTLPPANPKQP 244

Query: 141 WILQVTRKMQPVHRQFTDLL--------------------MTKRLQTLQSVDDAVEKIVK 180
            +     +  P H    +L                     + +     +++D  + K+++
Sbjct: 245 PLPPNHLEAHPFHHGHPNLRDEVSVNGVWDRRDERTIRNELGREFACSENIDIQIGKVLR 304

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            L+++GEL+NTYI+YTSDHG  +G+ GL +GK   +E   RVPF+++GPG+  G
Sbjct: 305 RLEEMGELDNTYIVYTSDHGMAIGRHGL-QGKQNLYEHTWRVPFIIQGPGVPKG 357


>gi|253575992|ref|ZP_04853325.1| sulfatase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251844567|gb|EES72582.1| sulfatase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 502

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 58/192 (30%), Positives = 96/192 (50%), Gaps = 20/192 (10%)

Query: 72  YYPDLIANDSVRFLR--QSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
           +Y   IAN SV FL   Q ++SF      L  SFP PH P  +   Y+ M+ +  S   P
Sbjct: 195 HYNRWIANRSVAFLENLQPEESF-----YLWCSFPDPHHPFAACRPYSEMY-DPQSLTLP 248

Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL-QT---LQSVDDAVEKIVKELKDL 185
              +    D    L+  RK+ P H QF +  + + L QT   +  +D  + +++++LK+L
Sbjct: 249 E-GWENEADSIDWLKGLRKVHPGHTQFDETTLREVLAQTYGMITHMDSCIGEVLEKLKEL 307

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY------LL 239
           G  +NT I++ +DHG +LG   L+    +P+E  + VP++ + P      +Y      LL
Sbjct: 308 GLYDNTVIVFMADHGEYLGSHRLLTKAEWPWEELLNVPYIWKAPQAAKQGVYRDRVVSLL 367

Query: 240 DVLIPQVRKFSS 251
           D  +P +  F+ 
Sbjct: 368 D-FVPTILDFAG 378


>gi|149196927|ref|ZP_01873980.1| putative N-acetylglucosamine-6-sulfatase [Lentisphaera araneosa
           HTCC2155]
 gi|149140037|gb|EDM28437.1| putative N-acetylglucosamine-6-sulfatase [Lentisphaera araneosa
           HTCC2155]
          Length = 489

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 103/205 (50%), Gaps = 15/205 (7%)

Query: 38  WREWGALIMNSKYYNYS-INMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKP 96
           W   G       Y+N S +  NG + K    Y  D + +L    +V ++   KQ+  +KP
Sbjct: 142 WDRGGPKSNKYAYFNNSLVRTNGGERKPLGGYTTDKFTEL----AVDYIH--KQNDKEKP 195

Query: 97  IMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF 156
             L + +P  HGP   A ++ + + +V     P+  + P   K   L+   + +   +  
Sbjct: 196 WYLWLCYPGVHGPYTPAKRHKDFYKDVEVK-VPSDIFGPRHSKPAHLKNMTRWKK-DKNG 253

Query: 157 TDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGK 212
           T +    +++     ++S+D+AV  ++K L+D G+L NT +I+TSD G+  GQ G  K K
Sbjct: 254 TPVGFASQVKKYHNAVKSLDEAVGTLMKALEDSGQLENTIVIFTSDQGFAWGQHG-SKEK 312

Query: 213 SFPFEFDIRVPFLMRGPGIV-PGTM 236
             P++ +I  P +++ PGI  PGT+
Sbjct: 313 WLPYDANIIAPLIIKAPGITQPGTV 337


>gi|269140257|ref|YP_003296958.1| sulfatase [Edwardsiella tarda EIB202]
 gi|387868773|ref|YP_005700242.1| Choline-sulfatase [Edwardsiella tarda FL6-60]
 gi|267985918|gb|ACY85747.1| putative sulfatase [Edwardsiella tarda EIB202]
 gi|304560086|gb|ADM42750.1| Choline-sulfatase [Edwardsiella tarda FL6-60]
          Length = 510

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 69/254 (27%), Positives = 107/254 (42%), Gaps = 37/254 (14%)

Query: 5   VELTSRNLLYFDISHGYFGK---------YLNKYNGSYIPPGWREWGALIMNSKY---YN 52
            +  S  L   D S GY GK         ++  YN       W +W A      +   Y+
Sbjct: 119 AQCWSDVLKKLDYSTGYIGKWHLDAPQAPFIESYNNPMEGRYWNDWTAPDRRHGFDFWYS 178

Query: 53  YSI-NMNGRKIKHGED-------YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMS 102
           Y   +++ R +  G +       Y + +  +  A+ +++FLR  K SF  + KP  LV+S
Sbjct: 179 YGTYDLHMRPMYWGNNTPRESPMYVDQWSAEHEADMAIKFLRNEKGSFRAADKPFALVVS 238

Query: 103 FPAPHGPEDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLM 161
              PH P D  PQ Y + F   +S      D    P+  W         P +        
Sbjct: 239 MNPPHSPYDQVPQKYLDRFPGKSSR-----DLNTRPNVDWQANYQEGYGPQY-------F 286

Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
            + +  +  VD+   +I+ EL   G   NT +++ SDHG  LG  G    K+ P+E  +R
Sbjct: 287 KEYMAMVNGVDEQFGRILAELDKQGLAENTLVVFFSDHGCCLGANGQPT-KNVPYEEAMR 345

Query: 222 VPFLMRGPG-IVPG 234
           +P + R PG I PG
Sbjct: 346 IPMMFRLPGKIAPG 359


>gi|197285971|ref|YP_002151843.1| exported sulfatase [Proteus mirabilis HI4320]
 gi|227356480|ref|ZP_03840868.1| exported sulfatase [Proteus mirabilis ATCC 29906]
 gi|425068874|ref|ZP_18471990.1| tat (twin-arginine translocation) pathway signal sequence [Proteus
           mirabilis WGLW6]
 gi|425071592|ref|ZP_18474698.1| tat (twin-arginine translocation) pathway signal sequence [Proteus
           mirabilis WGLW4]
 gi|194683458|emb|CAR44241.1| putative exported sulfatase [Proteus mirabilis HI4320]
 gi|227163590|gb|EEI48511.1| exported sulfatase [Proteus mirabilis ATCC 29906]
 gi|404598450|gb|EKA98920.1| tat (twin-arginine translocation) pathway signal sequence [Proteus
           mirabilis WGLW4]
 gi|404598774|gb|EKA99242.1| tat (twin-arginine translocation) pathway signal sequence [Proteus
           mirabilis WGLW6]
          Length = 519

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 67/247 (27%), Positives = 107/247 (43%), Gaps = 37/247 (14%)

Query: 12  LLYFDISHGYFGK---------YLNKYNGSYIPPGWREWGALIMNSKY---YNYSI-NMN 58
           L   D S GY GK         ++  YN       W EW A      +   Y+Y   +++
Sbjct: 136 LKSLDYSTGYIGKWHLDAPYEPFIESYNNPMEGRYWNEWAAPDRRHGFDFWYSYGTYDLH 195

Query: 59  GRKIKHGED-------YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGP 109
            + +    D       Y + + P+  A+ +++FL+     F  + KP  LV+S   PH P
Sbjct: 196 LKPMYWANDTPRDKPIYIDQWGPEHEADMAIKFLKNENNQFRDNSKPFALVVSMNPPHSP 255

Query: 110 EDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
            D  P+ Y + +   TS      +    P+ QW  +      P +         + +  +
Sbjct: 256 YDQVPKKYLDAYQGKTSK-----ELNTRPNVQWDSEYQEGYGPQY-------FKEYMAMV 303

Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
             VD+   +IV EL+  G  +NT +++ SDHG  LG  G    K+ P+E  +RVP + R 
Sbjct: 304 NGVDEQFGRIVDELEKQGLADNTLVVFFSDHGSCLGSNGQAT-KNNPYEESMRVPMMFRL 362

Query: 229 PG-IVPG 234
           PG I PG
Sbjct: 363 PGKIKPG 369


>gi|253576233|ref|ZP_04853564.1| sulfatase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251844360|gb|EES72377.1| sulfatase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 503

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 64/213 (30%), Positives = 101/213 (47%), Gaps = 17/213 (7%)

Query: 22  FGKYLNKYNGSYIPPGWREWGALIMNSKYY-NYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           FG YL      Y P G ++   L     YY +  I M+ R I   E+Y+ +++   IA  
Sbjct: 152 FGDYLRWLEQVY-PGGGKK---LSKEGAYYADDKIPMSWR-IDLPEEYHYNHW---IAER 203

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQ 140
           S+ FL Q  Q    +P  L  SFP PH P  +   Y+ M+ +  S   P +         
Sbjct: 204 SIDFLEQMSQQ--DQPFFLWCSFPDPHHPFAACRPYSEMY-DPASLTLPEHWDVEEDGIS 260

Query: 141 WILQVTRKMQPVHRQFTD----LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
           W L+  R + P +  F +     ++ +    +  VD  + +I K+LK+L    NT I++ 
Sbjct: 261 W-LKERRNIHPDYTSFDEHDLREILAQTYGMISHVDKTIGEITKKLKELELDQNTVIVFL 319

Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
           +DHG +LG   L+    +P+E  IRVPF+ + P
Sbjct: 320 ADHGEYLGSHHLITKGEWPWEELIRVPFIWKIP 352


>gi|308512815|gb|ADO33061.1| N-acetylglucosamine-6-sulfatase [Biston betularia]
          Length = 198

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 32/64 (50%), Positives = 47/64 (73%)

Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
           L +VD+ V  +V  L++   L+NTY+++TSD+GYH+GQF  V  K  P+E DI+VPFL++
Sbjct: 1   LLAVDEMVADVVGALENSALLDNTYLVFTSDNGYHIGQFSQVYDKRQPYETDIKVPFLIK 60

Query: 228 GPGI 231
           GP I
Sbjct: 61  GPKI 64


>gi|449132608|ref|ZP_21768623.1| choline sulfatase [Rhodopirellula europaea 6C]
 gi|448888287|gb|EMB18609.1| choline sulfatase [Rhodopirellula europaea 6C]
          Length = 492

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 47/174 (27%), Positives = 81/174 (46%), Gaps = 24/174 (13%)

Query: 82  VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP-NPDKQ 140
           + +L Q   +  + P ++   F  PH   D  P+    +    ++HT      P NP + 
Sbjct: 172 LEYLDQRDAASDEDPFLIYFGFSHPHDTRDGTPELLAKYG--ATNHTDRDTLPPANPKQP 229

Query: 141 WILQVTRKMQPVHRQFTDLL--------------------MTKRLQTLQSVDDAVEKIVK 180
            +        P H    +L                     + +     +++D  + K+++
Sbjct: 230 PLPPNHLDAHPFHHGHPNLRDEVSVKGVWDRRDERTIRNELGREFACSENIDIQIGKVLR 289

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            L+++GEL+NTYI+YTSDHG  +G+ GL +GK   +E   RVPF+++GPGI  G
Sbjct: 290 RLEEMGELDNTYIVYTSDHGMAIGRHGL-QGKQNLYEHTWRVPFIIQGPGIPKG 342


>gi|363581191|ref|ZP_09314001.1| sulfatase [Flavobacteriaceae bacterium HQM9]
          Length = 510

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 50/182 (27%), Positives = 92/182 (50%), Gaps = 15/182 (8%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
           ++ +++A+D++ F+  +  S    P  + ++F APH P  +  +Y +M+  ++    P  
Sbjct: 213 HWSEVLADDAITFMDTA--STMNNPFFMYLAFNAPHDPRQAPQKYQDMY-KLSDISLPKS 269

Query: 132 DYAPNPDKQWILQVT----RKMQPVHRQ--FTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
                PD+  I          + P  R    T + + +   ++  +D  + KI+K LK  
Sbjct: 270 WLPEYPDRHQIANGNNLRDEALAPFPRTEYATKVNIKEYYASITHLDAQIGKILKALKTS 329

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYLLD 240
           G+  NTYII+T+DHG  +G+ GL+ GK   ++  IR P ++ GP I     +   +YL D
Sbjct: 330 GKDKNTYIIFTADHGLAMGKHGLL-GKQNLYDHSIRPPLVITGPDIPKNKKITTDVYLQD 388

Query: 241 VL 242
           V+
Sbjct: 389 VM 390


>gi|383115489|ref|ZP_09936245.1| hypothetical protein BSGG_2640 [Bacteroides sp. D2]
 gi|313695105|gb|EFS31940.1| hypothetical protein BSGG_2640 [Bacteroides sp. D2]
          Length = 526

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 59/229 (25%), Positives = 100/229 (43%), Gaps = 43/229 (18%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
           DL+   ++R++   KQ   K+P ++   F A H P D   +  +++  VT        D+
Sbjct: 185 DLVTQKAIRWM---KQQDGKQPFLMCCHFKATHEPYDYPIRMEHLYDGVTFPEPENLLDW 241

Query: 134 AP------------------------NPDKQWILQ-----VTRKMQ--PVHRQFTDLLMT 162
            P                        +PDK W         T  MQ     R     L+ 
Sbjct: 242 GPETNGRSFIGQKLEELVRRWRTASKDPDKWWCRYPGLPFSTEGMQRTTARRAAYQKLIR 301

Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
             ++   +VDD + K++K L ++G  +NT ++Y SD GY LG+ G    + F +E   R+
Sbjct: 302 DYMRCGATVDDNIGKLLKALDEMGIADNTIVVYVSDQGYFLGEHGFFDKRMF-YEESARM 360

Query: 223 PFLMRGPGIVPGTMYLLDVLI-----PQVRKFSSGSLIFIM--SFLINL 264
           PF++R P  +P    L D+++     P + +FS   +  +   SF+ NL
Sbjct: 361 PFVIRYPKKIPAGQRLKDLVLNVDFAPTLAEFSGVKMDNVQGRSFVDNL 409


>gi|421611137|ref|ZP_16052288.1| choline sulfatase [Rhodopirellula baltica SH28]
 gi|408497951|gb|EKK02459.1| choline sulfatase [Rhodopirellula baltica SH28]
          Length = 326

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 51/182 (28%), Positives = 95/182 (52%), Gaps = 15/182 (8%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ ++IAN S  F   + +    +P  + ++F A H P  +  +Y + +      V +++
Sbjct: 36  HWSEVIANHSDDFFADAAKH--DQPTFMYLAFNATHDPRQAPQEYLDKYPAENILVPANY 93

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
            P + YA        L+  R + P  R    + + +R     L  +D  + +I+  ++  
Sbjct: 94  QPLHPYAEEIGCGRNLRDER-LAPFPRTEYAVRVHRREYYALLTHMDAMIGRILDSVEAS 152

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG-----TMYLLD 240
           G+ +NT+I +T+DHG  +GQ GL+ GK  P++  +RVPFL++GPG+  G      +YL D
Sbjct: 153 GKADNTWIFFTADHGLAVGQHGLL-GKQNPYDHSVRVPFLVKGPGVEAGGRVEEPVYLQD 211

Query: 241 VL 242
           V+
Sbjct: 212 VM 213


>gi|238587342|ref|XP_002391445.1| hypothetical protein MPER_09124 [Moniliophthora perniciosa FA553]
 gi|215456112|gb|EEB92375.1| hypothetical protein MPER_09124 [Moniliophthora perniciosa FA553]
          Length = 184

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 52/84 (61%), Gaps = 10/84 (11%)

Query: 158 DLLMTKRLQTLQSVDDAVEKIVKELKDLGE--LNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
           D +  +RL+ LQ VDD VE IV+E+++ G     NTYIIYTSD+GY LG      GKS P
Sbjct: 2   DHVYRQRLRVLQPVDDLVEAIVQEVENAGPEVAKNTYIIYTSDNGYALGSHQRSPGKSLP 61

Query: 216 FE--------FDIRVPFLMRGPGI 231
           +E         D+ +P ++RGPG+
Sbjct: 62  YEEGNKRFYFADVLIPLIIRGPGV 85


>gi|325285349|ref|YP_004261139.1| sulfatase [Cellulophaga lytica DSM 7489]
 gi|324320803|gb|ADY28268.1| sulfatase [Cellulophaga lytica DSM 7489]
          Length = 433

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 56/209 (26%), Positives = 95/209 (45%), Gaps = 26/209 (12%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY-DY 133
           D+ AN+++ F+   K   SKKP    ++F APH P  +   Y +++        P+  D 
Sbjct: 153 DITANEAINFI---KNYSSKKPFFAYVAFNAPHVPRQTTQNYYDLYPVDKVKLPPSVVDN 209

Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIVKELKDLGELN 189
            P       L    K Q  +   +   M KR+Q     +  +D  +  I+  LK+ G  N
Sbjct: 210 TP-------LNTNVKYQYTNNPLSAKTMQKRVQQNNAMVTHMDTRIGDIINTLKEKGIYN 262

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYLLDVL-- 242
           NT I++TSDHG + G+ G V GK   +E  +  P +++ P +     +   +YL DV+  
Sbjct: 263 NTIIVFTSDHGINFGENG-VAGKVCLYEPSVTAPLIIKAPTVKQNNKITARVYLQDVVPT 321

Query: 243 ---IPQVRKFSSGSLIFIMSFLINLNTMR 268
              + +++   +     +M  LIN    R
Sbjct: 322 LFSLLKIKDKEATDFKSLMPLLINSKKTR 350


>gi|340619483|ref|YP_004737936.1| sulfatase [Zobellia galactanivorans]
 gi|339734280|emb|CAZ97657.1| Sulfatase, family S1-27 [Zobellia galactanivorans]
          Length = 490

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 74/250 (29%), Positives = 110/250 (44%), Gaps = 47/250 (18%)

Query: 20  GYFGKYLNKYNGSY--IPPGWREWGALIMNSKY----YNYSINMNGRKIKHGEDYYNDYY 73
           GY GK+    +G    +PPG R  G     +      YN+S+  +       E    ++Y
Sbjct: 137 GYIGKWHLDGHGRLQNVPPGARRQGFDFWKANECTHDYNHSVYYDNLD---PEQKIWEHY 193

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ-YANMFFNVTSHHTPAYD 132
                 D+     Q+K   S  P M+++S+  PH P  +AP+ Y  MF        PA  
Sbjct: 194 DTFDQTDAALNYIQTKNGDSS-PFMMIVSYGTPHAPYHTAPEKYRKMF-------DPA-- 243

Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
                + +    V  +MQ   +  TDL        + ++DD + KI K+LK+ G+  NT 
Sbjct: 244 -----NIKLRENVPEEMQ--EKAKTDL--AGYYAHIAAIDDMIGKITKKLKEEGQWENTL 294

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP-----------------GIVPGT 235
           I++TSDHG  LG  G  K K  P+E  IRVP LM+ P                  I+P  
Sbjct: 295 ILFTSDHGDMLGSQGAYK-KQQPYEESIRVPMLMKLPKDMNLAPAVKKTVINTEDILPTL 353

Query: 236 MYLLDVLIPQ 245
           + L D+ IP+
Sbjct: 354 LGLCDIDIPK 363


>gi|406699829|gb|EKD03024.1| arylsulfatase precursor [Trichosporon asahii var. asahii CBS 8904]
          Length = 646

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 65/242 (26%), Positives = 98/242 (40%), Gaps = 35/242 (14%)

Query: 21  YFGKYLNKYN---GSYIPPGWREWGALIMNSKYYNY------SINMNGRKIKHGEDYYND 71
           Y GK +N +N    + +P      G  +++   Y+Y       +N     + +G      
Sbjct: 138 YTGKLMNGHNIDNCASLPVSGVTEGEFLVDPWTYDYWSPGFCDVNTGQIAVHNGS----- 192

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFP-APHG------------PEDSAPQYAN 118
           Y  D++A    + L     + SK     V++ P  PH             P  S PQY  
Sbjct: 193 YSSDVVAQ---KVLGHLDTALSKDEPFFVVAAPIGPHSHIPGAKNSNDYIPGMSIPQYRP 249

Query: 119 MFFNVTSHHTPAYDYAPNPD----KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
                 + H+       NPD      W+  + R  Q       D     RL+ LQ +D  
Sbjct: 250 QDAGKFADHSIERHEGFNPDFPTGVSWVKDLPRLNQ-TQIDHLDDWHRARLRALQPIDAL 308

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +  +  +L   G  + TY+IYTSD+GY LG      GK+  FE D RVP  +RGPG+  G
Sbjct: 309 IGDVFAKLDAAGVTDETYVIYTSDNGYALGSHRRQPGKTLGFEEDTRVPLFVRGPGVEKG 368

Query: 235 TM 236
            +
Sbjct: 369 RV 370


>gi|32477511|ref|NP_870505.1| sulfatase [Rhodopirellula baltica SH 1]
 gi|32448065|emb|CAD77582.1| probable sulfatase [Rhodopirellula baltica SH 1]
          Length = 524

 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 82/174 (47%), Gaps = 24/174 (13%)

Query: 82  VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP-NPDKQ 140
           + +L +   +  + P ++   F  PH   D  P+    +    ++HT      P NP + 
Sbjct: 204 LEYLDKRDAAADEDPFLIYFGFSHPHDTRDGTPELLAKYG--ATNHTDRDTLPPANPKQP 261

Query: 141 WILQVTRKMQPVHRQFTDLL--------------------MTKRLQTLQSVDDAVEKIVK 180
            +     +  P H    +L                     + +     +++D  + K+++
Sbjct: 262 PLPPNHLEAHPFHHGHPNLRDEVSVKGVWDRRDERTIRNELGREFACSENIDIQIGKVLR 321

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            L+++GEL+NTYI+YTSDHG  +G+ GL +GK   +E   RVPF+++GPGI  G
Sbjct: 322 RLEEMGELDNTYIVYTSDHGMAIGRHGL-QGKQNLYEHTWRVPFIIQGPGIPKG 374


>gi|357384309|ref|YP_004899033.1| choline-sulfatase [Pelagibacterium halotolerans B2]
 gi|351592946|gb|AEQ51283.1| choline-sulfatase [Pelagibacterium halotolerans B2]
          Length = 476

 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 58/218 (26%), Positives = 100/218 (45%), Gaps = 34/218 (15%)

Query: 18  SHGYFGKYLNKYNGSYIPPG--WREW--GALIMNSKYYNYSINMNGRKIKHGEDYYNDYY 73
           + G  G++L+ Y G  +  G   ++W  G+L      Y    N++ + +   +D YND +
Sbjct: 158 ARGLEGRFLDDYAGRGVVRGTLLQQWLDGSLGTPPDAY---ANVDPQDLP--DDAYNDNF 212

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
                    R L + K      P  ++++FP PH P D  P  A  +  V          
Sbjct: 213 V------GQRTLDELKGFDPNSPWFMIVNFPGPHEPMDVTPSMAEAWEGVEF-------- 258

Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
            P P +    +     Q + R++  ++        +++D  + + +  L++ G+L+NT I
Sbjct: 259 -PLP-RMRNTEDAELQQDIRRRYAAMI--------ENIDRWIGRYIAHLEETGQLDNTVI 308

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           ++ SDHG  LG   L K K  PFE  +RVP ++ GPGI
Sbjct: 309 VFASDHGEMLGDRNLWK-KQVPFEASVRVPMILAGPGI 345


>gi|334364983|ref|ZP_08513955.1| arylsulfatase [Alistipes sp. HGB5]
 gi|313158816|gb|EFR58199.1| arylsulfatase [Alistipes sp. HGB5]
          Length = 516

 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 56/246 (22%), Positives = 105/246 (42%), Gaps = 41/246 (16%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGK+          PG+  W        YYN  +N+NG ++K   + Y+    D++ +
Sbjct: 132 AFFGKWHMGNQDDMPQPGFDHWEGFRGQGTYYNTVLNINGERVKFDPELYST---DILTD 188

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----------FNVTSHHTP 129
            ++ F+R +++     P  + +S+ + H     +P    M+          FNV  +  P
Sbjct: 189 HAIDFVRDNEEG----PFFIYLSYKSVHSGFQPSPSRKGMYKDEKAVYPPSFNVPEYGIP 244

Query: 130 A----------------YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR-----LQTL 168
                            Y  +  PD  W+         V  Q+   L  +       +T+
Sbjct: 245 RLPGKDADGRPLAGRGWYGESRLPD--WVKNQRESWHGVDYQYHGALPYEEDFRNYCETV 302

Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
            S+DDA+ +++  L+  G   +T +IY  D+G+  G+ GL+  ++F +E  +RVP L   
Sbjct: 303 TSMDDAIGRLLDFLQAEGLGESTLVIYMGDNGFTWGEHGLIDKRNF-YEPSVRVPMLAYC 361

Query: 229 PGIVPG 234
           P ++P 
Sbjct: 362 PELIPA 367


>gi|421611911|ref|ZP_16053039.1| choline sulfatase [Rhodopirellula baltica SH28]
 gi|408497316|gb|EKK01847.1| choline sulfatase [Rhodopirellula baltica SH28]
          Length = 524

 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 82/174 (47%), Gaps = 24/174 (13%)

Query: 82  VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP-NPDKQ 140
           + +L +   +  + P ++   F  PH   D  P+    +    ++HT      P NP + 
Sbjct: 204 LEYLDKRDAAADEDPFLIYFGFSHPHDTRDGTPELLAKYG--ATNHTDRDTLPPANPKQP 261

Query: 141 WILQVTRKMQPVHRQFTDLL--------------------MTKRLQTLQSVDDAVEKIVK 180
            +     +  P H    +L                     + +     +++D  + K+++
Sbjct: 262 PLPPNHLEAHPFHHGHPNLRDEVSVKGVWDRRDERTIRNELGREFACSENIDIQIGKVLR 321

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            L+++GEL+NTYI+YTSDHG  +G+ GL +GK   +E   RVPF+++GPGI  G
Sbjct: 322 RLEEMGELDNTYIVYTSDHGMAIGRHGL-QGKQNLYEHTWRVPFIIQGPGIPKG 374


>gi|159487839|ref|XP_001701930.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158281149|gb|EDP06905.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 795

 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 35/72 (48%), Positives = 44/72 (61%)

Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
           RLQ L+SVDD V  +V  L   G LN T  IYTSD+G+ LG   + + K   +E D+R+P
Sbjct: 304 RLQALRSVDDTVSALVSRLACRGLLNRTVFIYTSDNGFKLGNHDITQEKFTQYEEDVRLP 363

Query: 224 FLMRGPGIVPGT 235
            LM GPGI  G 
Sbjct: 364 LLMTGPGIPVGA 375


>gi|440712658|ref|ZP_20893273.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica SWK14]
 gi|436442644|gb|ELP35762.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica SWK14]
          Length = 534

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 58/223 (26%), Positives = 104/223 (46%), Gaps = 37/223 (16%)

Query: 31  GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQ 90
           G+Y+PPG +             Y++N+NG ++K        Y  D + + +V +L++   
Sbjct: 177 GNYLPPGPK-------------YTLNVNGERVKQ-----KGYITDELTDYAVDWLKERDD 218

Query: 91  SFSKKPIMLVMSFPAPHG----PEDSAPQYAN--MFFNVTSHHTPAYDYAPNPDKQWILQ 144
               KP  L +S  A H      E    +YA+  + F  T     A    P    +W+  
Sbjct: 219 D---KPFFLYLSHKAVHSNFTPAERHQGRYADEDLSFLPTGKELSADKNTP----RWVRD 271

Query: 145 VTRKMQPVHRQF-----TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
                  +   +      D L  +  +++ +VDD+V +++++LKD+G  ++T IIY  D+
Sbjct: 272 QKNSWHGIDFSYHSDKGLDYLYRRYCESVLAVDDSVGRVLQQLKDMGIHDDTLIIYMGDN 331

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
           G+  G+ GL+  K   +E  IRVP LM+ P +  G   + +V+
Sbjct: 332 GFMWGEHGLID-KRVSYEASIRVPMLMQCPNLFDGGQPIENVV 373


>gi|32474463|ref|NP_867457.1| choline sulfatase [Rhodopirellula baltica SH 1]
 gi|32445001|emb|CAD75003.1| choline sulfatase [Rhodopirellula baltica SH 1]
          Length = 477

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 50/182 (27%), Positives = 94/182 (51%), Gaps = 15/182 (8%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ ++IAN S  F   + +    +P  + ++F A H P  +  +Y + +      V +++
Sbjct: 187 HWSEVIANHSDDFFADAAKH--DQPTFMYLAFNATHDPRQAPQEYLDKYPTENILVPANY 244

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
            P +  A        L+  R + P  R    + + +R     L  +D  + +I+  ++  
Sbjct: 245 QPLHPCAEEIGCGRNLRDER-LAPFPRTEYAVRVHRREYYALLTHMDAMIGRILDSVEAS 303

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLD 240
           G+ +NT+I +T+DHG  +GQ GL+ GK  P++  +RVPFL++GPG+  G      +YL D
Sbjct: 304 GKADNTWIFFTADHGLAVGQHGLL-GKQNPYDHSVRVPFLVKGPGVKAGGRVEEPIYLQD 362

Query: 241 VL 242
           V+
Sbjct: 363 VM 364


>gi|408529297|emb|CCK27471.1| sulfatase [Streptomyces davawensis JCM 4913]
          Length = 509

 Score = 70.1 bits (170), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 68/234 (29%), Positives = 102/234 (43%), Gaps = 34/234 (14%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNG--RKIKHGEDYYNDYYPDLI 77
           G FGKYLN +     PP + EW  L    +Y +   N NG  R+I         Y   +I
Sbjct: 155 GLFGKYLNFWRTGDNPPHFDEW-LLQEPVRYVDGHYNDNGAVRRIP-------GYNTTVI 206

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
            + ++ F+  S+    ++P    ++  + H      P+YA         HT   D+   P
Sbjct: 207 KDRALAFMEASRTD--QRPWFAYVATRSAHEVNVPEPKYA---------HTRVPDWKGRP 255

Query: 138 --------DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
                   DK   L+         R     L  ++L+TL SVDDA+    ++L+ LG+L 
Sbjct: 256 SVFETGKDDKPPFLRAAGHPFAAGRD----LRARQLRTLLSVDDAMHDFREKLRALGQLE 311

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
           NT +++TSDHG   G  G ++ KS P+   + VPF    P    GT    D L 
Sbjct: 312 NTLVVFTSDHGLCWGDHGWLR-KSVPYRPSLEVPFHASWPAGGLGTSRADDRLT 364


>gi|332665111|ref|YP_004447899.1| N-acetylglucosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
           1100]
 gi|332333925|gb|AEE51026.1| N-acetylglucosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
           1100]
          Length = 501

 Score = 70.1 bits (170), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 54/237 (22%), Positives = 102/237 (43%), Gaps = 37/237 (15%)

Query: 36  PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKK 95
           PG+  W +      YYN + N+NG+++ H +   + Y  DL+ + +V +++      +KK
Sbjct: 131 PGFDYWLSFKGQGVYYNPTFNINGQQVPHKD---SAYISDLLTDYAVDWIKAQP---AKK 184

Query: 96  PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQ 155
           P  L +S    H   + A ++  M+ N+     P+         +++     ++ P  + 
Sbjct: 185 PFFLYLSHKGVHALFEPAKRHKGMYKNLRFTPPPSMFMTARDTSKYLELKNSRISPELKP 244

Query: 156 FTD------------------------------LLMTKRLQTLQSVDDAVEKIVKELKDL 185
           F +                                  + L+TL +VD++V +++  +   
Sbjct: 245 FVENQRDQPNWVYDQRHSWHGVDYMYHGYISFYEFYQQYLETLMAVDESVGRVLDYVSSS 304

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
              NNT +IY  D+G+  G+ GL+  K   +E  +RVP LM  P +VP    L  V+
Sbjct: 305 ALANNTMVIYMGDNGFSFGEHGLID-KRHAYEESMRVPMLMYAPKLVPAGSKLTQVI 360


>gi|153807006|ref|ZP_01959674.1| hypothetical protein BACCAC_01283 [Bacteroides caccae ATCC 43185]
 gi|149130126|gb|EDM21336.1| arylsulfatase [Bacteroides caccae ATCC 43185]
          Length = 529

 Score = 70.1 bits (170), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 49/205 (23%), Positives = 93/205 (45%), Gaps = 44/205 (21%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY-DY 133
           DL+A  ++ +++Q     S +P ++   F A H P D   +  +++  VT      + D+
Sbjct: 189 DLVAEKAINWIKQQD---SNQPFLMCCHFKATHEPYDYPTRMEHLYDGVTFPEPENFLDW 245

Query: 134 AP------------------------NPDKQWI-----------LQVTRKMQPVHRQFTD 158
            P                        +PDK W            +Q T   + ++++   
Sbjct: 246 GPETNGRSFQGQTLEELGHRWRVASKDPDKWWCRYPGLPFNTDGMQRTTARRAIYQK--- 302

Query: 159 LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEF 218
            L+   L+   ++DD + K++K L ++G  +NT ++Y SD GY LG+ G    + F +E 
Sbjct: 303 -LIRDYLRCGATIDDNIGKLLKTLDEMGIADNTIVVYVSDQGYFLGEHGFFDKRMF-YEE 360

Query: 219 DIRVPFLMRGPGIVPGTMYLLDVLI 243
             R+PF++R P  +P    L D+++
Sbjct: 361 TARMPFVIRYPKKIPAGKRLKDLIL 385


>gi|149174110|ref|ZP_01852738.1| N-acetylgalactosamine 6-sulfate sulfatase [Planctomyces maris DSM
           8797]
 gi|148847090|gb|EDL61425.1| N-acetylgalactosamine 6-sulfate sulfatase [Planctomyces maris DSM
           8797]
          Length = 469

 Score = 70.1 bits (170), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 58/196 (29%), Positives = 88/196 (44%), Gaps = 37/196 (18%)

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
           PDL+ +D++ F+R SK     KP  L + F APH P    P+  +  +       P    
Sbjct: 180 PDLLVDDAINFIRTSKD----KPFALCLHFRAPHTPYGPVPEQDSAHYEGMKIDVP---I 232

Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
            P    + I Q  ++                  ++ SVD  + +++KEL  L    NT +
Sbjct: 233 TPGVIPEQIRQKNKEY---------------YASVSSVDRNIGRLLKELDQLRLAENTLV 277

Query: 194 IYTSDHGYHLGQFGL------------VKGKSFPFEFD--IRVPFLMRGPGIV-PGTMYL 238
           I+TSDHGY+ G+ G+            V G   P  +D  IRVP +MR P ++ PGT + 
Sbjct: 278 IFTSDHGYNNGRHGVSTKGNGHWIAGGVTGPKRPNMWDTSIRVPLVMRWPAVIKPGTQFD 337

Query: 239 LDVLIPQVRKFSSGSL 254
             V    + KF  G+L
Sbjct: 338 EIVSNIDMFKFVLGAL 353


>gi|423220051|ref|ZP_17206547.1| hypothetical protein HMPREF1061_03320 [Bacteroides caccae
           CL03T12C61]
 gi|392623883|gb|EIY17981.1| hypothetical protein HMPREF1061_03320 [Bacteroides caccae
           CL03T12C61]
          Length = 538

 Score = 70.1 bits (170), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 49/205 (23%), Positives = 93/205 (45%), Gaps = 44/205 (21%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY-DY 133
           DL+A  ++ +++Q     S +P ++   F A H P D   +  +++  VT      + D+
Sbjct: 198 DLVAEKAINWIKQQD---SNQPFLMCCHFKATHEPYDYPTRMEHLYDGVTFPEPENFLDW 254

Query: 134 AP------------------------NPDKQWI-----------LQVTRKMQPVHRQFTD 158
            P                        +PDK W            +Q T   + ++++   
Sbjct: 255 GPETNGRSFKGQTLEELGHRWRVASKDPDKWWCRYPGLPFNTDGMQRTTARRAIYQK--- 311

Query: 159 LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEF 218
            L+   L+   ++DD + K++K L ++G  +NT ++Y SD GY LG+ G    + F +E 
Sbjct: 312 -LIRDYLRCGATIDDNIGKLLKTLDEMGIADNTIVVYVSDQGYFLGEHGFFDKRMF-YEE 369

Query: 219 DIRVPFLMRGPGIVPGTMYLLDVLI 243
             R+PF++R P  +P    L D+++
Sbjct: 370 AARMPFVIRYPKKIPAGKRLKDLIL 394


>gi|421613439|ref|ZP_16054521.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica SH28]
 gi|408495803|gb|EKK00380.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica SH28]
          Length = 534

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 58/223 (26%), Positives = 104/223 (46%), Gaps = 37/223 (16%)

Query: 31  GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQ 90
           G+Y+PPG +             Y++N+NG ++K        Y  D + + +V +L++   
Sbjct: 177 GNYLPPGPK-------------YTLNVNGERVKQ-----KGYITDELTDYAVDWLKERDD 218

Query: 91  SFSKKPIMLVMSFPAPHG----PEDSAPQYAN--MFFNVTSHHTPAYDYAPNPDKQWILQ 144
               KP  L +S  A H      E    +YA+  + F  T     A    P    +W+  
Sbjct: 219 D---KPFFLYLSHKAVHSNFTPAERHQGRYADEDLSFLPTGKELSADKNTP----RWVRD 271

Query: 145 VTRKMQPVHRQF-----TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
                  +   +      D L  +  +++ +VDD+V +++++LKD+G  ++T IIY  D+
Sbjct: 272 QKNSWHGIDFSYHSDKGLDYLYRRYCESVLAVDDSVGRVLQQLKDMGIHDDTLIIYMGDN 331

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
           G+  G+ GL+  K   +E  IRVP LM+ P +  G   + +V+
Sbjct: 332 GFMWGEHGLID-KRVSYEASIRVPMLMQCPNLFDGGKPIENVV 373


>gi|268592966|ref|ZP_06127187.1| N-acetylglucosamine-6-sulfatase [Providencia rettgeri DSM 1131]
 gi|291311438|gb|EFE51891.1| N-acetylglucosamine-6-sulfatase [Providencia rettgeri DSM 1131]
          Length = 519

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 66/247 (26%), Positives = 104/247 (42%), Gaps = 37/247 (14%)

Query: 12  LLYFDISHGYFGK---------YLNKYNGSYIPPGWREWGALIMNS------KYYNYSIN 56
           L   D S GY GK         ++  YN       W EW A            Y  Y ++
Sbjct: 136 LKSLDYSTGYIGKWHLDAPYAPFIESYNNPMEGRYWNEWAAPNRRHGFDFWYSYGTYDLH 195

Query: 57  MN-----GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGP 109
           M          +    Y + + P+  A+ +++FL+     +  + KP  LV+S   PH P
Sbjct: 196 MKPMYWANNTPREKPIYVDQWGPEHEADMAIKFLQNKNGQYRDNTKPFALVVSMNPPHSP 255

Query: 110 EDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
            D  PQ Y + +   TS      +    P+ +W ++      P +         + L  +
Sbjct: 256 YDQVPQKYLDAYKGKTSQ-----ELNTRPNVKWDVEYQEGYGPQY-------FKEYLAMV 303

Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
             VD+   +IV EL+  G  +NT +++ SDHG  LG  G    K+ P++  +RVP + R 
Sbjct: 304 NGVDEQFGRIVDELEKQGLADNTLVVFFSDHGSCLGANGQPT-KNNPYDESMRVPMMFRL 362

Query: 229 PG-IVPG 234
           PG I PG
Sbjct: 363 PGKIQPG 369


>gi|386821832|ref|ZP_10109048.1| arylsulfatase A family protein [Joostella marina DSM 19592]
 gi|386426938|gb|EIJ40768.1| arylsulfatase A family protein [Joostella marina DSM 19592]
          Length = 434

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 55/184 (29%), Positives = 91/184 (49%), Gaps = 35/184 (19%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF--FNVT-------- 124
           D+  +++V+F+       + KP    +SF +PH P  +A  Y +++   N+T        
Sbjct: 153 DITGSEAVKFINNYN---NDKPFFAYVSFNSPHVPRQTAQNYYDLYPIENITLPPSVKDN 209

Query: 125 SHHTP--AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
           S   P   Y YAPNP     LQ+   ++ V +   + ++T        +D  +  I+K L
Sbjct: 210 SLLNPNVRYQYAPNP-----LQLKTMLERVQQN--NAMVTH-------MDKQIGDILKSL 255

Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MY 237
           KD G  +NT I++TSDHG + G+ G V GK   +E  +  P ++R P +  GT     +Y
Sbjct: 256 KDKGIYDNTIIVFTSDHGINFGENG-VAGKVCLYEPSVTAPLIIRAPNLEAGTTISERVY 314

Query: 238 LLDV 241
           L D+
Sbjct: 315 LQDI 318


>gi|160890613|ref|ZP_02071616.1| hypothetical protein BACUNI_03058 [Bacteroides uniformis ATCC 8492]
 gi|156859612|gb|EDO53043.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
          Length = 510

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/195 (28%), Positives = 102/195 (52%), Gaps = 17/195 (8%)

Query: 56  NMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ 115
           N NG+ +   +   N +  +L A+ ++RFL ++K S    P ++ ++F +PH P +  P 
Sbjct: 193 NTNGKWV---DASLNTFSSELYADAAIRFL-EAKAS-DDTPFLMYVAFTSPHDPRNVLPH 247

Query: 116 YANMFFNVTSHHTPAYDYAPNP-DKQWILQVTRKMQPVHRQFTDLLMTKRLQ---TLQSV 171
           Y + + + T    P    A +P D   + +   K+    R  +  ++T+R      +  V
Sbjct: 248 YGHKYAS-TEVPMPENFVAQHPFDNGDLNERDEKLLSTPR-ISAQVLTERANYYGMVSEV 305

Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           D  + +I+  L+  G+++NT I++TSD+G  +G+ GL+ GK   +E  +RVP ++ GPGI
Sbjct: 306 DFQIGRILDVLESSGKVDNTIIVFTSDNGLCVGEHGLL-GKQNLYEAAVRVPLVICGPGI 364

Query: 232 VPGTM-----YLLDV 241
               M     YL D+
Sbjct: 365 PQNAMCDAYCYLYDI 379


>gi|449133266|ref|ZP_21768913.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula europaea 6C]
 gi|448887952|gb|EMB18296.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula europaea 6C]
          Length = 554

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 58/223 (26%), Positives = 104/223 (46%), Gaps = 37/223 (16%)

Query: 31  GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQ 90
           G+Y+PPG +             Y++N+NG ++K        Y  D + + +V +L++   
Sbjct: 197 GNYLPPGPK-------------YTLNVNGERVKQ-----KGYITDELTDYAVDWLKERDD 238

Query: 91  SFSKKPIMLVMSFPAPHG----PEDSAPQYAN--MFFNVTSHHTPAYDYAPNPDKQWILQ 144
               KP  L +S  A H      E    +YA+  + F  T     A    P    +W+  
Sbjct: 239 D---KPFFLYLSHKAVHSNFTPAERHQGRYADEDLSFLPTGKELSADKNTP----RWVRD 291

Query: 145 VTRKMQPVHRQF-----TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
                  +   +      D L  +  +++ +VDD+V +++++LKD+G  ++T IIY  D+
Sbjct: 292 QKNSWHGIDFSYHSDKGLDYLYRRYCESVLAVDDSVGRVLQQLKDMGIHDDTLIIYMGDN 351

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
           G+  G+ GL+  K   +E  IRVP LM+ P +  G   + +V+
Sbjct: 352 GFMWGEHGLID-KRVSYEASIRVPMLMQCPNLFDGGKPIENVV 393


>gi|417304905|ref|ZP_12091902.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica WH47]
 gi|327538823|gb|EGF25470.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica WH47]
          Length = 534

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 101/219 (46%), Gaps = 29/219 (13%)

Query: 31  GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQ 90
           G+Y+PPG +             Y++N+NG ++K        Y  D + + +V +L++   
Sbjct: 177 GNYLPPGPK-------------YTLNVNGERVKQ-----KGYITDELTDYAVDWLKERDD 218

Query: 91  SFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD--YAPNPDKQWILQVTRK 148
               KP  L +S  A H     A ++   + +      P      A     +W+      
Sbjct: 219 D---KPFFLYLSHKAVHSNFTPAKRHQGRYADEDLSFLPTGKELSADKNTPRWVRDQKNS 275

Query: 149 MQPVHRQF-----TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHL 203
              +   +      D L  +  +++ +VDD+V +++++LKD+G  ++T IIY  D+G+  
Sbjct: 276 WHGIDFSYHSDKGLDYLYRRYCESVLAVDDSVGRVLQQLKDMGIHDDTLIIYMGDNGFMW 335

Query: 204 GQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
           G+ GL+  K   +E  IRVP LM+ P +  G   + +V+
Sbjct: 336 GEHGLID-KRVSYEASIRVPMLMQCPNLFDGGQPIENVV 373


>gi|317479849|ref|ZP_07938969.1| sulfatase [Bacteroides sp. 4_1_36]
 gi|316904055|gb|EFV25889.1| sulfatase [Bacteroides sp. 4_1_36]
          Length = 491

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/195 (28%), Positives = 102/195 (52%), Gaps = 17/195 (8%)

Query: 56  NMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ 115
           N NG+ +   +   N +  +L A+ ++RFL ++K S    P ++ ++F +PH P +  P 
Sbjct: 174 NTNGKWV---DASLNTFSSELYADAAIRFL-EAKAS-DDTPFLMYVAFTSPHDPRNVLPH 228

Query: 116 YANMFFNVTSHHTPAYDYAPNP-DKQWILQVTRKMQPVHRQFTDLLMTKRLQ---TLQSV 171
           Y + + + T    P    A +P D   + +   K+    R  +  ++T+R      +  V
Sbjct: 229 YGHKYAS-TEVPMPENFVAQHPFDNGDLNERDEKLLSTPR-ISAQVLTERANYYGMVSEV 286

Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           D  + +I+  L+  G+++NT I++TSD+G  +G+ GL+ GK   +E  +RVP ++ GPGI
Sbjct: 287 DFQIGRILDVLESSGKVDNTIIVFTSDNGLCVGEHGLL-GKQNLYEAAVRVPLVICGPGI 345

Query: 232 VPGTM-----YLLDV 241
               M     YL D+
Sbjct: 346 PQNAMCDAYCYLYDI 360


>gi|340619412|ref|YP_004737865.1| mucin-desulfating sulfatase [Zobellia galactanivorans]
 gi|339734209|emb|CAZ97586.1| Mucin-desulfating sulfatase, family S1-11 [Zobellia
           galactanivorans]
          Length = 505

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 56/227 (24%), Positives = 100/227 (44%), Gaps = 20/227 (8%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           +FGK+    +     PG+  W +      YY  ++N+NG+ I++ +D    Y  DL+   
Sbjct: 138 FFGKWHMGDHNDSPRPGFDHWESFAGQGDYYAPNLNINGKNIQYDKD---TYVTDLLTEH 194

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD------YA 134
           +V +L +       KP  + +S  A H     A ++  M+         +Y       Y 
Sbjct: 195 AVDWLDKRD---VDKPFFMYLSHKAVHAMFKPAKRHEGMYAGKKIEKPDSYTQTIGSAYK 251

Query: 135 PNPDKQWILQVTRKMQPVHRQFT-------DLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
                +W+ +       V   +        D L+ +  +TL  VD++V  ++  LK  G 
Sbjct: 252 KLNWPEWVKKQRDSWHGVDYMYHGRADGTFDDLVQRYCETLMGVDESVGAVMDWLKTNGL 311

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
             +T +IY  D+G+  G+ GL+  + F +E  ++VP L+R P I  G
Sbjct: 312 DESTMVIYMGDNGFSWGEHGLIDKRHF-YEESVKVPMLVRCPEIFEG 357


>gi|149198031|ref|ZP_01875079.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149138943|gb|EDM27348.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 647

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/185 (24%), Positives = 87/185 (47%), Gaps = 12/185 (6%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++  ++A++++ F  + K     KP  + ++F APH P  +  +Y +M+      +  ++
Sbjct: 178 HWTQVVADNTLTFFEKVKND--NKPFFMYLAFNAPHDPRQAPKEYVDMYPLDSIKIPENY 235

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
            P Y YA     + +        P           +   ++  +D  + +++  L+  G+
Sbjct: 236 MPEYPYAAEICGKKLRDEVLAPYPRTTYAVKRNRQEYYASITYMDHHIGRMLDALEASGK 295

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYLLDVL 242
             NTYII+T+DHG   G  GL+ GK   +E  +R PF++ GPGI     +   +YL D +
Sbjct: 296 AENTYIIFTADHGLAAGHHGLM-GKQSMYEHSMRPPFIVVGPGIKQNSKIDTPIYLQDAM 354

Query: 243 IPQVR 247
              + 
Sbjct: 355 ATAIE 359


>gi|372209721|ref|ZP_09497523.1| sulfatase [Flavobacteriaceae bacterium S85]
          Length = 505

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/184 (26%), Positives = 94/184 (51%), Gaps = 19/184 (10%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ +++ +D+++F+  +K S   KP  + ++F APH P  +  +Y +++      V    
Sbjct: 213 HWSEVVKDDALQFINHAKTS--DKPFFMYLAFNAPHDPRQAPQEYQDLYNVHEIQVPESF 270

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELK 183
            P Y Y  +      L+    + P  R  T+  +   ++    ++   D  +  I++ L+
Sbjct: 271 LPEYPYRKSIGNGNNLR-DEALAPFPR--TEFAVKTHIKEYYASITHADAQIGMILEALE 327

Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYL 238
             G+++NTYII+T+DHG  +G+ GL+ GK   F+  +R P ++ GP I     V   +YL
Sbjct: 328 KSGKMDNTYIIFTADHGLAMGRHGLL-GKQNLFDHSLRPPMVIVGPDIPKNKQVATDVYL 386

Query: 239 LDVL 242
            D +
Sbjct: 387 QDAM 390


>gi|406660077|ref|ZP_11068212.1| Arylsulfatase [Cecembia lonarensis LW9]
 gi|405556211|gb|EKB51161.1| Arylsulfatase [Cecembia lonarensis LW9]
          Length = 491

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 84/164 (51%), Gaps = 9/164 (5%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT----SHH 127
           Y  ++ A+ ++ F+R  K++  + P    ++F APH P      Y + + + T     ++
Sbjct: 175 YSTEIFADSAIDFIRSQKEA--ENPFFCYVAFTAPHDPYSPEATYIDYYPDGTIPLPGNY 232

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
            P + +    D   +        P   +   ++++     +  VD  V +I+K L++ G+
Sbjct: 233 KPFHPF--QFDDLTVRDENLAGWPRKPEVIQMILSDYYALVTHVDHHVGRILKVLEETGQ 290

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           L NT I+YT+D+G  +G  GL+ GK   +E  I+VP +++GPGI
Sbjct: 291 LENTIIVYTADNGLAVGSHGLL-GKQSLYEHSIKVPLIIKGPGI 333


>gi|149197403|ref|ZP_01874454.1| putative sulfatase yidj [Lentisphaera araneosa HTCC2155]
 gi|149139421|gb|EDM27823.1| putative sulfatase yidj [Lentisphaera araneosa HTCC2155]
          Length = 511

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 43/160 (26%), Positives = 74/160 (46%), Gaps = 15/160 (9%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
           N Y  D + N ++ F+ Q K S    P   ++S+P PHGP+     Y  M+ ++      
Sbjct: 203 NTYTTDWLTNKTIDFITQHKAS----PFCYMVSYPDPHGPDTVRAPYDTMYTHMNFQKPK 258

Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
                 +    W    T++      Q+  ++        + +DD + +I+  L + G L 
Sbjct: 259 TASKKQDDLPSWA--TTKRGAANQSQYYGMI--------KCIDDNIARIMTCLDEQGILE 308

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
           NT +++TSDHG   G+ G  + K  P E   +VPF++R P
Sbjct: 309 NTIVVFTSDHGDMRGEHGR-QNKGIPLEASAKVPFIVRYP 347


>gi|340751541|ref|ZP_08688352.1| arylsulfatase [Fusobacterium mortiferum ATCC 9817]
 gi|340562196|gb|EEO35554.2| arylsulfatase [Fusobacterium mortiferum ATCC 9817]
          Length = 473

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/174 (28%), Positives = 90/174 (51%), Gaps = 14/174 (8%)

Query: 66  EDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS 125
           E++   Y+  + + +++R     K  F+K P+ ++ +F  PH P D A ++A M+  V  
Sbjct: 172 EEFNQTYWIGMKSCEAIR-----KYDFNK-PLFMMTNFVDPHHPFDPAEKFARMYDGVEI 225

Query: 126 HHTPAYDYAPNPDKQWI-LQVTRKMQP---VHRQFTDLLMTKRLQ----TLQSVDDAVEK 177
               + D   N   +++  Q  R   P      + +D  + +  +     +  +D  + K
Sbjct: 226 DSPISKDKFCNERPEYLKRQGERGYWPGGGEQHKLSDEKVEEYTRYYYAMITFIDQEIGK 285

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           I KEL+  GEL+NT II+TSDHG ++G +GL++   F ++  I+VP L  G G+
Sbjct: 286 IRKELEKKGELDNTIIIFTSDHGEYMGDYGLLQKGPFMYDNLIKVPLLFWGKGV 339


>gi|409730693|ref|ZP_11272254.1| N-acetylglucosamine-6-sulfatase [Halococcus hamelinensis 100A6]
 gi|448723708|ref|ZP_21706224.1| N-acetylglucosamine-6-sulfatase [Halococcus hamelinensis 100A6]
 gi|445787247|gb|EMA37995.1| N-acetylglucosamine-6-sulfatase [Halococcus hamelinensis 100A6]
          Length = 536

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 57/235 (24%), Positives = 107/235 (45%), Gaps = 34/235 (14%)

Query: 20  GYFGKYLNKY-----NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
           GY   ++ K+     N +   P +  W +      Y++  +N++G  + H     + Y  
Sbjct: 142 GYETAFIGKWDTFRINSAEPRPAFDRWVSFEKQGNYFDPRLNVDGDWVDH-----SGYIT 196

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV------TSHHT 128
           D++   ++++LR        KP  L +S  APH     A ++   + +       T  ++
Sbjct: 197 DILTEYALQWLRNRS---GDKPFFLYLSHKAPHAWFRPARRHRGRYADAPIEYPKTMANS 253

Query: 129 PAYDYAPNPDKQWILQVTRKMQPVH-----------RQFTDLLMTKRLQTLQSVDDAVEK 177
           PA +Y   PD  W+ +    ++ V                D L  +  +TL S+D+++  
Sbjct: 254 PA-NYRDKPD--WVREQRDSVRGVDYVFGGNFGFGGPSGFDTLYRRYCETLLSLDESIGA 310

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           ++ +L   G  ++T  +Y SD+G+ LG+ GL+ GK   +E  +RVP L   PG+V
Sbjct: 311 VMDQLDTSGVADSTLTLYMSDNGFSLGEHGLI-GKQTAYEPSMRVPLLAWAPGLV 364


>gi|319953999|ref|YP_004165266.1| sulfatase [Cellulophaga algicola DSM 14237]
 gi|319422659|gb|ADV49768.1| sulfatase [Cellulophaga algicola DSM 14237]
          Length = 434

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 52/179 (29%), Positives = 85/179 (47%), Gaps = 23/179 (12%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D+ AN++V FL    ++   KP    ++F APH P  +  +Y +M+        PA +  
Sbjct: 153 DITANEAVSFLNTYDKT---KPFFAYVAFNAPHVPRQTEQKYYDMY--------PAEEMV 201

Query: 135 --PNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIVKELKDLGEL 188
             P+      L    K Q  +       M +R+Q     +  +D  +  I++ LKD G  
Sbjct: 202 LPPSVINNTPLNTNVKYQYTNDPLQSKTMQQRVQQNNAMVTHMDTGIGAIIQALKDKGIY 261

Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLDVL 242
           +NT I++TSDHG + G+ G V GK   +E  +  P +++ P   P T     +YL DV+
Sbjct: 262 DNTIIVFTSDHGINFGENG-VAGKVCLYEPSVTAPLIIKAPLAKPNTKIASRVYLQDVV 319


>gi|451853738|gb|EMD67031.1| hypothetical protein COCSADRAFT_168273 [Cochliobolus sativus
           ND90Pr]
          Length = 458

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 61/229 (26%), Positives = 95/229 (41%), Gaps = 50/229 (21%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKH-----GEDYYNDYYP 74
           GY       +NG Y+P   ++ G     + YY     +N   I++      + Y  +Y  
Sbjct: 66  GYPKSIREGHNGRYLPVWLQQAGY----NTYYTGKF-LNAHSIRNYNSPFPKGYPGNYST 120

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           DLI++ ++ FL ++  + +K P  L +    PH                           
Sbjct: 121 DLISSKAIEFLEEA--AAAKAPFFLGVMPIVPH--------------------------- 151

Query: 135 PNPDKQWILQVTRKMQPV-------HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
                Q IL       PV       +  + D     RL  L SVDD V+ + + L+  G 
Sbjct: 152 ----TQTILSTIPGGLPVFEPPVSAYVTYNDEFYRLRLAALASVDDHVDAMFERLESFGL 207

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           +NN YIIYTSD+G+ +GQ  L    S  +E D+ VP  +RG G+  G +
Sbjct: 208 MNNIYIIYTSDNGFPIGQHRLALENSCAYEEDVNVPMFIRGTGVPKGEV 256


>gi|32473376|ref|NP_866370.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica SH 1]
 gi|32398056|emb|CAD78151.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica SH 1]
          Length = 534

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 57/223 (25%), Positives = 104/223 (46%), Gaps = 37/223 (16%)

Query: 31  GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQ 90
           G+Y+PPG +             Y++N+NG ++K        Y  D + + +V +L++   
Sbjct: 177 GNYLPPGPK-------------YTLNVNGERVKQ-----KGYITDELTDYAVDWLKERDD 218

Query: 91  SFSKKPIMLVMSFPAPHG----PEDSAPQYAN--MFFNVTSHHTPAYDYAPNPDKQWILQ 144
               +P  L +S  A H      E    +YA+  + F  T     A    P    +W+  
Sbjct: 219 D---EPFFLYLSHKAVHSNFTPAERHQGRYADEDLSFLPTGKELSADKNTP----RWVRD 271

Query: 145 VTRKMQPVHRQF-----TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
                  +   +      D L  +  +++ +VDD+V +++++LKD+G  ++T IIY  D+
Sbjct: 272 QKNSWHGIDFSYHSDKGLDYLYRRYCESVLAVDDSVGRVLQQLKDMGIHDDTLIIYMGDN 331

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
           G+  G+ GL+  K   +E  IRVP LM+ P +  G   + +V+
Sbjct: 332 GFMWGEHGLID-KRVSYEASIRVPMLMQCPNLFDGGQPIENVV 373


>gi|410030244|ref|ZP_11280074.1| sulfatase [Marinilabilia sp. AK2]
          Length = 491

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 83/164 (50%), Gaps = 9/164 (5%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT----SHH 127
           Y  +L A+ ++ F+R  K++  + P    ++F APH P      Y + + + T     ++
Sbjct: 175 YSTELFADSAIDFIRSQKEA--ENPFFCYVAFTAPHDPYSPEAAYVDYYPDGTIPLPGNY 232

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
            P + +    D   +        P   +   ++++     +  VD  V +I+K L++ G 
Sbjct: 233 MPFHPF--QFDDLTVRDENLTGWPRKPEVIQMILSDYYALVTHVDHQVGRILKVLEETGL 290

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           L NT I+YT+D+G  +G  GL+ GK   +E  I+VP +++GPGI
Sbjct: 291 LENTIIVYTADNGLAVGSHGLL-GKQSLYEHSIKVPLIIKGPGI 333


>gi|83950157|ref|ZP_00958890.1| Sulfatase [Roseovarius nubinhibens ISM]
 gi|83838056|gb|EAP77352.1| Sulfatase [Roseovarius nubinhibens ISM]
          Length = 518

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/165 (29%), Positives = 84/165 (50%), Gaps = 13/165 (7%)

Query: 87  QSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT---SHHTPAYDYAPNPDKQWIL 143
           + + +  + P+ +V SF  PH P  + P++ +++ +         P  D+ P   ++ + 
Sbjct: 187 RDRAAGQEAPLCMVASFIHPHDPYVARPEWWDLYSDDEIELPEVLPLADHDPF-SRRLMD 245

Query: 144 QVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGY 201
            +     P+ R   +++  +R  L  +   D  +  +VK L + GEL+NT +I T+DHG 
Sbjct: 246 GIEASYVPLSRD--EVIRARRAYLANVSYFDSKIGALVKTLDETGELDNTVVIVTADHGD 303

Query: 202 HLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG----TMYLLDVL 242
            LG+ GL    +F FE   RVP +M GPG+V G       L+D+L
Sbjct: 304 MLGERGLWYKMNF-FEHSARVPLIMAGPGVVQGAAANACSLIDLL 347


>gi|392967509|ref|ZP_10332927.1| sulfatase [Fibrisoma limi BUZ 3]
 gi|387844306|emb|CCH54975.1| sulfatase [Fibrisoma limi BUZ 3]
          Length = 510

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 53/222 (23%), Positives = 104/222 (46%), Gaps = 24/222 (10%)

Query: 37  GWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKP 96
           G+  W +      YYN  +N++G++++H +   + Y  D++ + ++ FL +  +    KP
Sbjct: 156 GFNHWVSFRGQGVYYNPVLNVDGKRVEHRD---SAYIADVLTDYALNFLNRRDKD---KP 209

Query: 97  IMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQ-----------WILQV 145
             + +S  + H     +    N F      + PA  Y  +P  Q           W+ + 
Sbjct: 210 FFMYLSHKSVHHQFQPSRADRNRFAQSPIQY-PASFYQTDPRNQNPETKESGVPHWVRRQ 268

Query: 146 TRKMQPV----HRQFT-DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
                 V    H   T D  +    +TL+S+D ++ +++K L D G   NT +IY  D+G
Sbjct: 269 RYSWHGVDYLYHGSTTFDSFVRDYCETLRSMDTSIGRVLKYLDDNGLAENTLVIYMGDNG 328

Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
           +  G+ GL+  ++  +E  +RVPFL+R P ++   + +  ++
Sbjct: 329 FMFGEHGLIDKRT-AYEESMRVPFLVRCPALIKAGITVAQMI 369


>gi|430747734|ref|YP_007206863.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430019454|gb|AGA31168.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 475

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 53/177 (29%), Positives = 79/177 (44%), Gaps = 37/177 (20%)

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
           PDL+  + +RF+  ++     +P +L + F APH P    P+  +  +       P    
Sbjct: 182 PDLLVGEGIRFVEANRD----RPFLLSIHFRAPHVPYAPVPEQDSAHYETLDPTIPDVPG 237

Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
            P                  RQ    L  +   ++ SVD  V +++  L+ LG    T +
Sbjct: 238 LP------------------RQRVKTLTRRYYASVSSVDRNVGRLLDRLEALGLTGKTIV 279

Query: 194 IYTSDHGYHLGQFG---------LVKGKS---FPFEFD--IRVPFLMRGPGIV-PGT 235
           I+TSDHGY +G  G         LV+GK     P  FD  IRVP ++R PG+V PGT
Sbjct: 280 IFTSDHGYMIGHHGLWHKGNGTWLVEGKKEERRPNMFDDSIRVPLIVRWPGVVAPGT 336


>gi|354582496|ref|ZP_09001398.1| sulfatase [Paenibacillus lactis 154]
 gi|353199895|gb|EHB65357.1| sulfatase [Paenibacillus lactis 154]
          Length = 497

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 49/151 (32%), Positives = 70/151 (46%), Gaps = 5/151 (3%)

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD-K 139
           S+ FLR+       KP  L MSF  PH P D    Y +++        P  D+A   D  
Sbjct: 195 SIDFLRRRDPG---KPFFLWMSFVRPHPPFDPPQAYLDLYEEAEIPEPPVGDWAEKEDVS 251

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           Q  L        V ++     M      +  +D  + + ++ L + GELNNT I++TSDH
Sbjct: 252 QAGLSPVTGRGIVPKRRLRKAMAAYYALITHLDHQIGRFLQALHEYGELNNTVIVFTSDH 311

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           G  LG   L + KS P+E    VPF++  PG
Sbjct: 312 GELLGDHHLFR-KSLPYEGSAHVPFIVNDPG 341


>gi|390947095|ref|YP_006410855.1| arylsulfatase A family protein [Alistipes finegoldii DSM 17242]
 gi|390423664|gb|AFL78170.1| arylsulfatase A family protein [Alistipes finegoldii DSM 17242]
          Length = 515

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 55/246 (22%), Positives = 105/246 (42%), Gaps = 41/246 (16%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            +FGK+          PG+  W        YY+  +N+NG ++K   + Y+    D++ +
Sbjct: 131 AFFGKWHMGNQDDMPQPGFDHWEGFRGQGTYYDTVLNINGERVKFDPELYST---DILTD 187

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----------FNVTSHHTP 129
            ++ F+R +++     P  + +S+ + H     +P    M+          FNV  +  P
Sbjct: 188 HAIDFVRDNEEG----PFFIYLSYKSVHSGFQPSPSRKGMYKDEKAVYPPSFNVPEYGIP 243

Query: 130 A----------------YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR-----LQTL 168
                            Y  +  PD  W+         V  Q+   L  +       +T+
Sbjct: 244 RLPGKDADGRPLAGRGWYGESRLPD--WVKNQRESWHGVDYQYHGALPYEEDFRNYCETV 301

Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
            S+DDA+ +++  L+  G   +T +IY  D+G+  G+ GL+  ++F +E  +RVP L   
Sbjct: 302 TSMDDAIGRLLDFLQAEGLDESTLVIYMGDNGFTWGEHGLIDKRNF-YEPSVRVPMLAYC 360

Query: 229 PGIVPG 234
           P ++P 
Sbjct: 361 PELIPA 366


>gi|297180521|gb|ADI16734.1| arylsulfatase a and related enzymes [uncultured Verrucomicrobiales
           bacterium HF0010_05E02]
          Length = 535

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 63/233 (27%), Positives = 103/233 (44%), Gaps = 31/233 (13%)

Query: 51  YNYSINMNGRKIKHGEDYYNDYYPD-LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
           Y +++  +G + ++ E Y  D  PD L A D+VRFL++       KP +L + F  PH P
Sbjct: 213 YGWTLFHSGEQWEYREGYNRDLMPDELCAQDAVRFLKEKH----AKPFILTVGFSRPHSP 268

Query: 110 EDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTR---------KMQPVHRQFTDLL 160
             +  ++ + F   +   TP      + D   IL V +         K + +     D  
Sbjct: 269 WYAPQEFFDRFPLESLELTPLLKGDAD-DVSKILSVEQDIAQPWGWSKYKKIVNNGGDQQ 327

Query: 161 MTKRLQ----TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
           + K  Q     +  VD  V  ++  L+      NT +I+T DHGYH+G+   +  K  P+
Sbjct: 328 LLKWTQAYLACVAFVDHLVGVVLGVLESSEYAENTVVIFTGDHGYHMGEKDYLF-KYSPW 386

Query: 217 EFDIRVPFLMRGPGIVPGTMYLLDVLIPQVRKFSSGSLIFIMSFLINLNTMRA 269
           E  +RVP ++ GP I  G +    V           SL+ +    I+L  M+A
Sbjct: 387 EESVRVPLILAGPDIAEGAISRTPV-----------SLVDLYPTCIDLAAMQA 428


>gi|374605454|ref|ZP_09678382.1| arylsulfatase [Paenibacillus dendritiformis C454]
 gi|374388953|gb|EHQ60347.1| arylsulfatase [Paenibacillus dendritiformis C454]
          Length = 488

 Score = 68.6 bits (166), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/153 (32%), Positives = 75/153 (49%), Gaps = 9/153 (5%)

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD-- 138
           S+ F+R+       KP  L MSF  PH P D    Y +M+ +      P  D+A   D  
Sbjct: 193 SIDFVRRRDPG---KPFFLWMSFVRPHPPFDPPQAYLDMYKDEEIPDPPVGDWADTADEA 249

Query: 139 KQWILQVT-RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
           ++ +  +T R + P  R      M      +  +D  + + ++ L + GEL NT I++TS
Sbjct: 250 REGLSPITSRGIVPKRR--LKQAMAAYYALITHLDHQIGRFLQVLDEYGELRNTVILFTS 307

Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           DHG  LG   L + KS P+E   RVPF++  PG
Sbjct: 308 DHGELLGDHNLFR-KSLPYEGSARVPFIVNDPG 339


>gi|87306335|ref|ZP_01088482.1| choline sulfatase [Blastopirellula marina DSM 3645]
 gi|87290514|gb|EAQ82401.1| choline sulfatase [Blastopirellula marina DSM 3645]
          Length = 462

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/184 (27%), Positives = 92/184 (50%), Gaps = 19/184 (10%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ +++A+D+V FL  + +    KP  + ++F A H P  S   Y + +     +V +  
Sbjct: 170 HWSEVVADDAVDFLDTAAKR--DKPFFMYLAFNAVHDPRQSPQDYVDHYPAEKTDVPADF 227

Query: 128 TPAYDYAPNPDKQWILQVTR--KMQPVHR--QFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
            P Y   P  +     +  R  K+ P  R      +  ++    ++ +D  + +I+  L+
Sbjct: 228 QPLY---PEREAMGAGKSLRDEKLAPFPRTPHAIQVHRSEYYAIIEHMDAQIGRILDALQ 284

Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYL 238
             G+ +NTYI +T+DHG  +G  GL+ GK   ++  +RVP L+ GPG+  G      +YL
Sbjct: 285 KTGKADNTYIFFTADHGLAVGHHGLL-GKQNMYDHSVRVPLLVAGPGLPQGKVCETPVYL 343

Query: 239 LDVL 242
            DV+
Sbjct: 344 QDVM 347


>gi|349805009|gb|AEQ17977.1| putative glucosamine (n-acetyl)-6-sulfatase [Hymenochirus curtipes]
          Length = 110

 Score = 68.2 bits (165), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 65/114 (57%), Gaps = 6/114 (5%)

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
           DY  D++AN S+ FL       + +P  +++S PAPH P  +APQY   F NVT+     
Sbjct: 1   DYLTDVLANVSLDFLNYKS---NYQPFFMMISTPAPHSPWIAAPQYEKTFPNVTAPRGGN 57

Query: 131 YDYAPNPDKQWILQVTRK-MQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
           ++   + DK W+++  +  M     QF D    KR QTL SVDD +EK++K+L+
Sbjct: 58  FNV--HKDKHWLIRQDKSPMSNSSIQFLDNAFRKRWQTLLSVDDLIEKLLKQLE 109


>gi|294634825|ref|ZP_06713348.1| N-acetylglucosamine-6-sulfatase [Edwardsiella tarda ATCC 23685]
 gi|451966770|ref|ZP_21920021.1| putative sulfatase [Edwardsiella tarda NBRC 105688]
 gi|291091761|gb|EFE24322.1| N-acetylglucosamine-6-sulfatase [Edwardsiella tarda ATCC 23685]
 gi|451314442|dbj|GAC65383.1| putative sulfatase [Edwardsiella tarda NBRC 105688]
          Length = 521

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 67/254 (26%), Positives = 106/254 (41%), Gaps = 37/254 (14%)

Query: 5   VELTSRNLLYFDISHGYFGK---------YLNKYNGSYIPPGWREWGALIMNSKY---YN 52
            +  S  L   D S GY GK         Y+  YN       W +W A      +   Y+
Sbjct: 130 AQCWSDVLKKLDYSTGYIGKWHLDAPHAPYIESYNNPMEGRYWNDWTAPDRRHGFDFWYS 189

Query: 53  YSI-NMNGRKIKHGED-------YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMS 102
           Y   +++ R +  G         + + +  +  A+ +++FLR  + +F  + KP  LV+S
Sbjct: 190 YGTYDLHMRPMYWGNTTPREKPMFVDQWSAEHEADMAIKFLRNDEGNFRAADKPFALVVS 249

Query: 103 FPAPHGPEDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLM 161
              PH P D  PQ Y + F   TS      D    P+  W         P +        
Sbjct: 250 MNPPHSPYDQVPQKYLDRFAGKTSR-----DLNTRPNVDWQANYQEGYGPQY-------F 297

Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
            + +  +  VD+   +I+ EL   G  +NT +++ SDHG  LG  G    K+  +E  +R
Sbjct: 298 KEYMAMVNGVDEQFGRILAELDKQGLADNTLVVFFSDHGCCLGANGQPT-KNVAYEEAMR 356

Query: 222 VPFLMRGPG-IVPG 234
           +P + R PG I PG
Sbjct: 357 IPMMFRLPGKIAPG 370


>gi|423018509|ref|ZP_17009230.1| putative sulfatase [Achromobacter xylosoxidans AXX-A]
 gi|338778394|gb|EGP42868.1| putative sulfatase [Achromobacter xylosoxidans AXX-A]
          Length = 485

 Score = 67.8 bits (164), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 56/192 (29%), Positives = 91/192 (47%), Gaps = 11/192 (5%)

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
           GE  Y  Y  ++ A   V +LR++ +  S KP +L +S  APH P  + P++   +FN  
Sbjct: 154 GESSYTFYDREITARAQV-WLREAARRESDKPWVLFVSLVAPHFPLTAPPEHYYRYFNQN 212

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
                 YD    PD  +++         +  + +    KR           +D+ + KI+
Sbjct: 213 LPLPKLYDRRERPDHPYLVDYRNSFN--YDDYFEGDAVKRAVAGYYGLCSFLDENIGKIL 270

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLL 239
             L+D G  ++T ++YTSDHG +LG  GL  GKS  FE    VP +M G  I  G   ++
Sbjct: 271 NALEDAGLADDTRVMYTSDHGDNLGARGLW-GKSTMFEETAGVPLIMAGDDIPQGR--VV 327

Query: 240 DVLIPQVRKFSS 251
           DV +  +  + +
Sbjct: 328 DVPVTHIDAYPT 339


>gi|402494641|ref|ZP_10841381.1| sulfatase [Aquimarina agarilytica ZC1]
          Length = 506

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 91/186 (48%), Gaps = 23/186 (12%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ +++  D+  FL  +  S  + P  + ++F APH P  +  +Y +M+     ++    
Sbjct: 213 HWSEVLTQDATTFLDMA--SKKEDPFFMYLAFNAPHDPRQAPKKYQDMYDLDRISLPKSW 270

Query: 128 TPAYDYAPNPDKQWILQ----VTRKMQPVHRQ--FTDLLMTKRLQTLQSVDDAVEKIVKE 181
            P Y     PD+  I          + P  R    T + + +   ++  +D  + KI++ 
Sbjct: 271 LPEY-----PDRHQIANGDHLRDEALAPFPRTEYATKVHIKEYYASITHLDAQIGKILEA 325

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTM 236
           LK  G+  NTYII+T+DHG  +G+ GL+ GK   ++  +R P ++ GP I     V   +
Sbjct: 326 LKASGKDKNTYIIFTADHGLAMGRHGLL-GKQNLYDHSLRPPLIIMGPDIPKHKKVNADV 384

Query: 237 YLLDVL 242
           YL DV+
Sbjct: 385 YLQDVM 390


>gi|301309188|ref|ZP_07215132.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Bacteroides sp. 20_3]
 gi|423338919|ref|ZP_17316661.1| hypothetical protein HMPREF1059_02586 [Parabacteroides distasonis
           CL09T03C24]
 gi|300832870|gb|EFK63496.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Bacteroides sp. 20_3]
 gi|409233044|gb|EKN25885.1| hypothetical protein HMPREF1059_02586 [Parabacteroides distasonis
           CL09T03C24]
          Length = 524

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 46/201 (22%), Positives = 91/201 (45%), Gaps = 36/201 (17%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
           D++ + ++R+++++ +    +P M+   F A H P D   +  +++  V        +D+
Sbjct: 183 DIVTDKAIRWMKEADKD---RPFMMCCHFKATHEPWDFPERMRHLYDGVVFPEPENLFDW 239

Query: 134 AP------------------------NPDKQWIL--QVTRKMQPVHRQFT-----DLLMT 162
            P                        +PDK W    ++    + +HR          L+ 
Sbjct: 240 GPETNGRTFSGQPLEELARRWDVASQDPDKWWCRYPELPFTTKGMHRSAARSAAYQKLVC 299

Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
             L+   ++DD + K++K L D+G  +NT ++Y SD GY LG+ G    K   +E  +R+
Sbjct: 300 DYLRCGATIDDNIGKLLKALDDMGIADNTIVVYVSDQGYFLGEHGFFD-KRMMYEESLRM 358

Query: 223 PFLMRGPGIVPGTMYLLDVLI 243
           PF++R P  +P      D+++
Sbjct: 359 PFVIRYPKEIPAGTRNKDMIL 379


>gi|406833294|ref|ZP_11092888.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 520

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 110/264 (41%), Gaps = 59/264 (22%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GY    + K++    P G+  W  L     YYN  +  NG ++KH       Y  D+I +
Sbjct: 122 GYSTAIIGKWHLISDPTGFDHWHILPGQGLYYNPPMIRNGEEVKH-----EGYATDIITD 176

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA------------PQYANMFFNVTSHH 127
            S+ +L+   +S   KP +L+    APH     A            P+   +F +     
Sbjct: 177 LSIDWLKNRDKS---KPFVLMSQHKAPHREWAPALRDLGWDQDRQYPEPETLFDDYAGRS 233

Query: 128 TPAYDYAPNPDKQWI----------------LQVT-----------RKMQPVHRQFTDLL 160
               D+    D+ +                 LQV            RKM P   Q  DL+
Sbjct: 234 KAVSDHDMGIDRTFTDLDAKLKPPPNMTAEQLQVWNAYYGPRNDAFRKMNP---QGKDLV 290

Query: 161 -------MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
                  M   L  +++VDD V +++K L   G   +T I++TSD G++LG+ G    K 
Sbjct: 291 RWRYNRYMHDYLGCVKAVDDNVGRLLKFLDQEGLSQDTLIVFTSDQGFYLGEHGWFD-KR 349

Query: 214 FPFEFDIRVPFLMRGPGIV-PGTM 236
           + FE  +R P +MR PG+V PG +
Sbjct: 350 WIFEESLRSPLVMRMPGLVKPGQV 373


>gi|422320449|ref|ZP_16401509.1| sulfatase [Achromobacter xylosoxidans C54]
 gi|317404779|gb|EFV85161.1| sulfatase [Achromobacter xylosoxidans C54]
          Length = 485

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 56/192 (29%), Positives = 91/192 (47%), Gaps = 11/192 (5%)

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
           GE  Y  Y  ++ A   V +LR++ +  S KP +L +S  APH P  + P++   +FN  
Sbjct: 154 GESSYTFYDREITARAQV-WLREAARRESDKPWVLFVSLVAPHFPLTAPPEHYYRYFNQN 212

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
                 YD    PD  +++         +  + +    KR           +D+ + KI+
Sbjct: 213 LPLPKLYDRRERPDHPYLVDYRNSFN--YDDYFEGDAVKRAVAGYYGLCSFLDENIGKIL 270

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLL 239
             L+D G  ++T ++YTSDHG +LG  GL  GKS  FE    VP +M G  I  G   ++
Sbjct: 271 NALEDAGLADDTRVMYTSDHGDNLGARGLW-GKSTMFEETAGVPLIMAGDDIPRGR--VV 327

Query: 240 DVLIPQVRKFSS 251
           DV +  +  + +
Sbjct: 328 DVPVTHIDAYPT 339


>gi|71281899|ref|YP_269086.1| N-acetylglucosamine-6-sulfatase [Colwellia psychrerythraea 34H]
 gi|71147639|gb|AAZ28112.1| putative N-acetylglucosamine-6-sulfatase [Colwellia psychrerythraea
           34H]
          Length = 537

 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 62/236 (26%), Positives = 105/236 (44%), Gaps = 32/236 (13%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYY--------NYSINMNGRKIKHGEDYYNDY 72
           +FGK+    +G    PG+  W +      YY           IN+NG ++         Y
Sbjct: 128 FFGKWHMGGHGDSPQPGFDHWLSFAGQGHYYPKKDKKGRTNKININGERVDQ-----KGY 182

Query: 73  YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-----SHH 127
             D + + +V +L +     S KP  + +S  A H   D AP++ + + +V      S  
Sbjct: 183 ITDELTDYAVDWLDKRD---SDKPFFMYLSHKAVHSNFDPAPRHKDQYSDVAIEVPESQA 239

Query: 128 TPAYDYAPNPDKQWILQVTRKMQ----PVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKE 181
               +YA  P   W+            P H +  D+   KR   + L +VDD++ +++K 
Sbjct: 240 DTPENYAGKP--MWVKNQRNSWHGVDFPYHSEM-DVQEYKRQYHRALSAVDDSLGRVLKW 296

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVPGTM 236
           LKD    N+T ++   D+G+  G+ GL+  ++  +E  +RVP L   PG   PGT+
Sbjct: 297 LKDNNLENDTIVMLMGDNGFMFGEHGLIDKRN-AYEESMRVPLLAYAPGYFKPGTV 351


>gi|298375117|ref|ZP_06985074.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Bacteroides sp. 3_1_19]
 gi|298267617|gb|EFI09273.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Bacteroides sp. 3_1_19]
          Length = 524

 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 46/201 (22%), Positives = 91/201 (45%), Gaps = 36/201 (17%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
           D++ + ++R+++++ +    +P M+   F A H P D   +  +++  V        +D+
Sbjct: 183 DIVTDKAIRWMKEADKD---RPFMMCCHFKATHEPWDFPERMRHLYDGVVFPEPENLFDW 239

Query: 134 AP------------------------NPDKQWIL--QVTRKMQPVHRQFT-----DLLMT 162
            P                        +PDK W    ++    + +HR          L+ 
Sbjct: 240 GPETNGRTFSGQPLEELARRWDVASQDPDKWWCRYPELPFTTKGMHRSVARSAAYQKLVR 299

Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
             L+   ++DD + K++K L D+G  +NT ++Y SD GY LG+ G    K   +E  +R+
Sbjct: 300 DYLRCGATIDDNIGKLLKALDDMGIADNTIVVYVSDQGYFLGEHGFFD-KRMMYEESLRM 358

Query: 223 PFLMRGPGIVPGTMYLLDVLI 243
           PF++R P  +P      D+++
Sbjct: 359 PFVIRYPKEIPAGTRNKDMIL 379


>gi|223939156|ref|ZP_03631039.1| sulfatase [bacterium Ellin514]
 gi|223892205|gb|EEF58683.1| sulfatase [bacterium Ellin514]
          Length = 492

 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 46/153 (30%), Positives = 67/153 (43%), Gaps = 20/153 (13%)

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
            +D+ ++LR    S +KKP  L +S+  PH P  +AP      +                
Sbjct: 201 THDASQYLR--NHSHAKKPFFLYLSWGPPHDPYQTAPATYRSQYQAAK------------ 246

Query: 138 DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
                ++    + P  R      +        ++D  V  +++ LKD G   NT +I+TS
Sbjct: 247 -----IKTRLNVPPGMRASAQTNLAGYYSHCTAIDSCVGTLLQTLKDTGLETNTLVIFTS 301

Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           DHG  L   GLVK K  PF+  IRVP LMR P 
Sbjct: 302 DHGDMLHSHGLVK-KQHPFDESIRVPLLMRWPA 333


>gi|320103099|ref|YP_004178690.1| sulfatase [Isosphaera pallida ATCC 43644]
 gi|319750381|gb|ADV62141.1| sulfatase [Isosphaera pallida ATCC 43644]
          Length = 952

 Score = 67.4 bits (163), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 51/175 (29%), Positives = 85/175 (48%), Gaps = 10/175 (5%)

Query: 77  IANDSVRFL--RQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           IA+ +V FL  R +     ++P ++ ++   PH P    P        + +   P     
Sbjct: 645 IADAAVEFLEHRAALPPDQRRPFLMHLALECPHDPLAPDPADLAALPPIEAFPLPPSFLP 704

Query: 135 PNP-DKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELNNT 191
            +P D   +L    +++   R    +    R  L  +  +D  V ++++ L   GE +NT
Sbjct: 705 FHPFDNGEMLVRDERLEAFPRDPQAIRTRWRDYLAVVAGIDRVVGRLLEALTVAGERDNT 764

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG----TMYLLDVL 242
           Y+I+TSD G  LG  GL+ GK   +E  IRVPF++ GPG+ PG     ++LLD+ 
Sbjct: 765 YVIFTSDQGLALGDHGLL-GKQNLYEHSIRVPFIVAGPGLRPGRCAAMVWLLDLF 818


>gi|149178829|ref|ZP_01857409.1| putative sulfatase yidJ [Planctomyces maris DSM 8797]
 gi|148842304|gb|EDL56687.1| putative sulfatase yidJ [Planctomyces maris DSM 8797]
          Length = 394

 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 63/246 (25%), Positives = 100/246 (40%), Gaps = 70/246 (28%)

Query: 44  LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSF 103
           LI    YYN  +  NG ++KH       Y  D+I + ++ +L+  +     KP ML+   
Sbjct: 7   LIGQGPYYNPPMIKNGERVKH-----EGYTTDIITDLALDYLKNDRDP--DKPFMLMFQH 59

Query: 104 PAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK-------------QWILQVTRKMQ 150
            APH      P+Y +M+ +VT          P PD              Q  + + + M 
Sbjct: 60  KAPHRNWQPGPKYLHMYDDVT---------IPEPDNLFDNYEGRGTAAHQQDMTIAKTMT 110

Query: 151 PVHRQFT--------------------------------DLL-------MTKRLQTLQSV 171
           P   + T                                DL+       M   L+ + SV
Sbjct: 111 PFDLKLTPPTNLTPEQLATWNAAYEPKNEAFRKANLTGKDLVRWKYQRYMKDYLRCVASV 170

Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           D+ V +++  L+  G   NT +IY+SD G++LG  G    K F +E   R+P L R PG+
Sbjct: 171 DENVGRMLDYLESSGLAKNTVVIYSSDQGFYLGDHGWF-DKRFMYEESYRMPLLARWPGV 229

Query: 232 V-PGTM 236
           + PG++
Sbjct: 230 IKPGSV 235


>gi|315647334|ref|ZP_07900447.1| arylsulfatase [Paenibacillus vortex V453]
 gi|315277536|gb|EFU40865.1| arylsulfatase [Paenibacillus vortex V453]
          Length = 490

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 73/157 (46%), Gaps = 9/157 (5%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           +   S+ FLR+       KP  L MSF  PH P D    Y +++ +      P  D+A  
Sbjct: 181 VVTQSIDFLRRRDPG---KPFFLWMSFVRPHPPFDPPQAYLDLYEDADLPEPPVGDWART 237

Query: 137 PDKQ---WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
            D +   +     R + P  R      M      +  +D  + + ++ L + GEL+NT I
Sbjct: 238 EDPELAGYSPITGRGLVPKRRLRK--AMAAYYALITHLDHQIGRFLQSLNEYGELHNTVI 295

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           ++TSDHG  LG   L + KS P+E    VPF++  PG
Sbjct: 296 LFTSDHGELLGDHHLFR-KSLPYEGSTHVPFIVNDPG 331


>gi|390957885|ref|YP_006421642.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
 gi|390958226|ref|YP_006421983.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
 gi|390412803|gb|AFL88307.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
 gi|390413144|gb|AFL88648.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
          Length = 526

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 70/266 (26%), Positives = 111/266 (41%), Gaps = 56/266 (21%)

Query: 18  SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           S GY    + K++    P G+  W  ++     YN  +  +    K G      Y  D+I
Sbjct: 125 SAGYHTAMIGKWHLGSDPTGFDTWN-ILPGQGVYNDPVFYD----KDGSTQIKGYCTDII 179

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG-----------------PEDSA------- 113
           A+ ++R L++  Q    KP  ++    APH                  PE +        
Sbjct: 180 ADLTIRTLKERPQD---KPFFIMSHHKAPHREWTPDEKHRKQFENLHIPEPATLRDDYAG 236

Query: 114 ------PQYANMFFNVTSHH---TPAYDYAPNPDKQWI----LQVTRKMQPVHRQFT--- 157
                  Q  ++F ++T       P    +P   KQW+    ++V  +   V +  T   
Sbjct: 237 RTDALREQKQSVFRDLTRADLKIVPPAGLSPEDQKQWLAAKPMEVEIESDGVKKTITGRE 296

Query: 158 ------DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
                    M   L  +QSVDD V +++  L   G   NT +IYTSD G+ LG+ GL   
Sbjct: 297 LEDWKYQRYMRDYLACVQSVDDNVGRVLDWLDANGLRENTIVIYTSDQGFFLGEHGLFD- 355

Query: 212 KSFPFEFDIRVPFLMRGPG-IVPGTM 236
           K F +E  +++PF++R PG I PGT+
Sbjct: 356 KRFMYEESLQMPFVVRWPGFIKPGTV 381


>gi|168333887|ref|ZP_02692125.1| choline sulfatase [Epulopiscium sp. 'N.t. morphotype B']
          Length = 459

 Score = 67.0 bits (162), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 84/187 (44%), Gaps = 28/187 (14%)

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--------N 122
           ++  D+IA   ++FL Q       KP     SF APH P     ++ NM+         N
Sbjct: 170 EHSTDVIAESVIKFLDQKHD----KPFFAYTSFLAPHDPRTMPEEFLNMYNPEDIKLPPN 225

Query: 123 VTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF--TDLLMTKRLQTLQSVDDAVEKIVK 180
             S+H   Y         W  +    + P  R    T   + +    +  +D  + +I+ 
Sbjct: 226 FMSYHFIEY-------ANWECR-DETLAPYPRTLANTQKHIAEYYAMITHLDYQIGRILD 277

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT----- 235
           +L+++GE +NT I+Y  D+G  LGQ GL  GK   ++  +RVP L+ G GI  G      
Sbjct: 278 KLEEIGEKDNTIIVYAGDNGLALGQHGLF-GKQSLYDHSMRVPLLISGAGIKAGMKTDAL 336

Query: 236 MYLLDVL 242
           +YL D+ 
Sbjct: 337 VYLFDIF 343


>gi|159487449|ref|XP_001701735.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280954|gb|EDP06710.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 631

 Score = 67.0 bits (162), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 60/240 (25%), Positives = 101/240 (42%), Gaps = 35/240 (14%)

Query: 23  GKYLNKYNGSYI-----PPGWREWGALIMNSKYYNYSINMNGRKIKHGED----YYNDYY 73
           G++LN +N S +     P GW     L+     Y  +          G D    Y   Y+
Sbjct: 136 GQFLNGFNRSVLQQYGCPQGWTAADVLV---PAYGCA----------GPDGVVTYTGRYH 182

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA-------PQYANMFFNVTSH 126
            +LI     +  R  + +   +P  ++++  AP     +A       P+Y N+F    + 
Sbjct: 183 EELIREKVTK--RIDEAAGQDQPFFMMVATVAPRDSGRAATAYPEVMPEYKNLFLGERAP 240

Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
               +      +  +  Q T           D     RLQ L++VDD +  ++  L    
Sbjct: 241 RLADWGVPVPAEVGFTSQRTNNFNAAD---IDARFRARLQALRAVDDTLALMLSRLACHD 297

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
            L++T I+++SD G+ LG   + + KS  FE D+RVP L+ GPG+ P  +Y  +V    V
Sbjct: 298 LLDDTVIVFSSDSGFKLGSHNMAQDKSTYFEEDVRVPMLLAGPGM-PVGVYAAEVTAAAV 356


>gi|227495529|ref|ZP_03925845.1| sulfatase [Actinomyces coleocanis DSM 15436]
 gi|226831076|gb|EEH63459.1| sulfatase [Actinomyces coleocanis DSM 15436]
          Length = 482

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 67/230 (29%), Positives = 107/230 (46%), Gaps = 36/230 (15%)

Query: 19  HGYFGKYLNKYN--GSYIP-PGWREWGA-LIMNSKYYNYSI-NMNGRKIKHGEDYYNDYY 73
           +GY+   + K++   S  P PG+  W A       YYN  I + NG +    +     Y+
Sbjct: 101 NGYYCGMVGKWHVGTSQTPAPGFSYWYAHRYGGGPYYNAPIWDENGNEATEPK-----YF 155

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE-DSAPQ-----YANMFFNVTSHH 127
            D IA ++  F++ +     +KP  L+++F APH P  ++ PQ     YA+  F      
Sbjct: 156 TDAIAENACDFIQSAASVNEEKPFFLMVNFTAPHSPWINNHPQELMDLYADTDFPSIPRE 215

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
            P           W    T+        F D + + R    +L  VD+AV  I+K L++ 
Sbjct: 216 EP---------HPW----TKYYDDFADAFADPVPSLRGYAASLTGVDNAVGDILKALEEN 262

Query: 186 GELNNTYIIYTSDHGYHLGQFGL-VKGK-SFPFEF---DIRVPFLMRGPG 230
              +NT ++Y SD+G+  GQ G+  KG  +FP  F    +RVPF++  PG
Sbjct: 263 AYADNTVVMYMSDNGFSCGQHGIWGKGNGTFPLNFWENSVRVPFIIHLPG 312


>gi|150008305|ref|YP_001303048.1| mucin-desulfating sulfatase [Parabacteroides distasonis ATCC 8503]
 gi|255015459|ref|ZP_05287585.1| mucin-desulfating sulfatase [Bacteroides sp. 2_1_7]
 gi|256839409|ref|ZP_05544918.1| mucin-desulfating sulfatase [Parabacteroides sp. D13]
 gi|410101513|ref|ZP_11296441.1| hypothetical protein HMPREF0999_00213 [Parabacteroides sp. D25]
 gi|149936729|gb|ABR43426.1| mucin-desulfating sulfatase [Parabacteroides distasonis ATCC 8503]
 gi|256738339|gb|EEU51664.1| mucin-desulfating sulfatase [Parabacteroides sp. D13]
 gi|409239311|gb|EKN32095.1| hypothetical protein HMPREF0999_00213 [Parabacteroides sp. D25]
          Length = 524

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 46/201 (22%), Positives = 91/201 (45%), Gaps = 36/201 (17%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
           D++ + ++R+++++ +    +P M+   F A H P D   +  +++  V        +D+
Sbjct: 183 DIVTDKAIRWMKEADKD---RPFMMCCHFKATHEPWDFPERMRHLYDGVVFPEPENLFDW 239

Query: 134 AP------------------------NPDKQWIL--QVTRKMQPVHRQFT-----DLLMT 162
            P                        +PDK W    ++    + +HR          L+ 
Sbjct: 240 GPETNGRTFSGQPLEELARRWDVASQDPDKWWCRYPELPFTTKGMHRSAARSAAYQKLVR 299

Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
             L+   ++DD + K++K L D+G  +NT ++Y SD GY LG+ G    K   +E  +R+
Sbjct: 300 DYLRCGATIDDNIGKLLKALDDMGIADNTIVVYVSDQGYFLGEHGFFD-KRMMYEESLRM 358

Query: 223 PFLMRGPGIVPGTMYLLDVLI 243
           PF++R P  +P      D+++
Sbjct: 359 PFVIRYPKEIPAGTRNKDMIL 379


>gi|149197078|ref|ZP_01874130.1| sulfatase 1 precursor [Lentisphaera araneosa HTCC2155]
 gi|149139624|gb|EDM28025.1| sulfatase 1 precursor [Lentisphaera araneosa HTCC2155]
          Length = 555

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 96/205 (46%), Gaps = 33/205 (16%)

Query: 40  EWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIML 99
           E+  +  N  ++   IN+  ++    +D    Y    +  +S++FL        K+P  +
Sbjct: 157 EYEEIAPNGNFHGEYINIKNKESLPEDDPKRIYS---LTKESLKFLDDHA---GKRPFYM 210

Query: 100 VMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN-PDKQWILQVTRKMQPVHRQFTD 158
           ++S  A HGP  ++P       +V       YD  PN P+K       R+    +R  T 
Sbjct: 211 MISHYAVHGPHVASP-------DVLKKWQARYDALPNKPEK------GREKDVFYRLHTP 257

Query: 159 LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGY---------HLGQFGLV 209
           +  +     ++  D  + K++  LK+ G L+NTYII+TSD+G          H     L 
Sbjct: 258 VFAS----MIEESDTHLGKLIDSLKEKGVLDNTYIIFTSDNGAEWAPRNEKNHRYNGPLT 313

Query: 210 KGKSFPFEFDIRVPFLMRGPGIVPG 234
           +GK FPFE  +R+PF++ GPGI  G
Sbjct: 314 EGKYFPFEGGLRIPFVVAGPGIPAG 338


>gi|423332426|ref|ZP_17310210.1| hypothetical protein HMPREF1075_02223 [Parabacteroides distasonis
           CL03T12C09]
 gi|409229175|gb|EKN22055.1| hypothetical protein HMPREF1075_02223 [Parabacteroides distasonis
           CL03T12C09]
          Length = 524

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 46/201 (22%), Positives = 91/201 (45%), Gaps = 36/201 (17%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
           D++ + ++R+++++ +    +P M+   F A H P D   +  +++  V        +D+
Sbjct: 183 DIVTDKAIRWMKEADKD---RPFMMCCHFKATHEPWDFPERMRHLYDGVVFPEPENLFDW 239

Query: 134 AP------------------------NPDKQWIL--QVTRKMQPVHRQFT-----DLLMT 162
            P                        +PDK W    ++    + +HR          L+ 
Sbjct: 240 GPETNGRTFSGQPLEELARRWDVASQDPDKWWCRYPELPFTTKGMHRSAARSAAYQKLVR 299

Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
             L+   ++DD + K++K L D+G  +NT ++Y SD GY LG+ G    K   +E  +R+
Sbjct: 300 DYLRCGATIDDNIGKLLKALDDMGIADNTIVVYVSDQGYFLGEHGFFD-KRMMYEESLRM 358

Query: 223 PFLMRGPGIVPGTMYLLDVLI 243
           PF++R P  +P      D+++
Sbjct: 359 PFVIRYPKEIPAGTRNKDMIL 379


>gi|406835434|ref|ZP_11095028.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 523

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 55/201 (27%), Positives = 86/201 (42%), Gaps = 26/201 (12%)

Query: 50  YYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
           YY+  +  NG   +    Y +D Y    A        Q K   + KP  L + + A HGP
Sbjct: 170 YYDQLLTFNGED-RETPGYSSDNYSQWAAE-----YIQGKHRTADKPWYLWLCYGAIHGP 223

Query: 110 EDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQ 169
              A ++     + T+   P     P PDK   L  TR   P     T  LM +    ++
Sbjct: 224 TTPAERHRGTLADKTAS-LPMDILGPWPDKPAYLNKTRAWMP-GPDGTPALMKRNRGAVE 281

Query: 170 -----------------SVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGK 212
                            +VD+ V +++  LK+ G+L NT +IYT+D G+ LG+ G  + K
Sbjct: 282 PGKSLNAWVQQVNECNLAVDEGVGRVMTALKESGQLENTLVIYTADQGFGLGEHGFSQ-K 340

Query: 213 SFPFEFDIRVPFLMRGPGIVP 233
             P++  +  P ++  PG VP
Sbjct: 341 VAPYDATLSSPLIISWPGHVP 361


>gi|262384577|ref|ZP_06077711.1| mucin-desulfating sulfatase [Bacteroides sp. 2_1_33B]
 gi|262293870|gb|EEY81804.1| mucin-desulfating sulfatase [Bacteroides sp. 2_1_33B]
          Length = 524

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 46/201 (22%), Positives = 91/201 (45%), Gaps = 36/201 (17%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
           D++ + ++R+++++ +    +P M+   F A H P D   +  +++  V        +D+
Sbjct: 183 DIVTDKAIRWMKEADKD---RPFMMCCHFKATHEPWDFPERMRHLYDGVVFPEPENLFDW 239

Query: 134 AP------------------------NPDKQWIL--QVTRKMQPVHRQFT-----DLLMT 162
            P                        +PDK W    ++    + +HR          L+ 
Sbjct: 240 GPETNGRTFSGQPLEELARRWDVASQDPDKWWCRYPELPFTTKGMHRSAARSAAYQKLVR 299

Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
             L+   ++DD + K++K L D+G  +NT ++Y SD GY LG+ G    K   +E  +R+
Sbjct: 300 DYLRCGATIDDNIGKLLKALDDMGIADNTIVVYVSDQGYFLGEHGFFD-KRMMYEESLRM 358

Query: 223 PFLMRGPGIVPGTMYLLDVLI 243
           PF++R P  +P      D+++
Sbjct: 359 PFVIRYPKEIPAGTRNKDMIL 379


>gi|374602783|ref|ZP_09675771.1| sulfatase [Paenibacillus dendritiformis C454]
 gi|374391542|gb|EHQ62876.1| sulfatase [Paenibacillus dendritiformis C454]
          Length = 459

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/179 (24%), Positives = 89/179 (49%), Gaps = 21/179 (11%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPA 130
           +L  +++  F+R+  +   + P  L +++ +PH P  +   YA+ +      V  +  P 
Sbjct: 169 ELFTDEAESFIRRYDR---EDPFFLYVAYTSPHDPRTAPEPYASHYEGDRVTVPGNFLPE 225

Query: 131 YDYAPNP---DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
           + +         + +  + R    V R   D         +  +D  + ++++ L++ G 
Sbjct: 226 HPFDNGEMHIRDEGLAGLPRTRDEVRRHIADYYAM-----ITHMDAQMGRLMQALEETGR 280

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYLLDV 241
           +++T ++YTSDHG  +GQ GL+ GK   +E  IR+P+LM GPG+     + G +Y +D+
Sbjct: 281 MDDTIVVYTSDHGLAVGQHGLM-GKQNLYEHSIRIPWLMAGPGVPAKGPIRGQVYQMDI 338


>gi|296123653|ref|YP_003631431.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296015993|gb|ADG69232.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 526

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 107/262 (40%), Gaps = 56/262 (21%)

Query: 18  SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           S GY    + K++    P G+  W  L     YYN  +  NG+K++        Y  D+I
Sbjct: 125 SQGYSTALIGKWHLISEPTGFDHWEILPGQGIYYNPPMIANGQKVQR-----EGYVTDII 179

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS-HHTPAYDYAPN 136
            + S+ +L+   +S   KP +L+    APH   + +P   ++ FN       PA  +  +
Sbjct: 180 TDRSIDWLKNRDKS---KPFLLMAQHKAPH--REWSPALRHLGFNKDKPFAEPATLFDQH 234

Query: 137 PDK-------------------------------------QWILQVTRKMQPVHRQFTDL 159
            D+                                     ++ L      +  H Q  DL
Sbjct: 235 KDRAQAVVDHDMGIDRTFTKLDAKLVPPPGINSTQLEEWNKYYLPRNNAFEAAHLQGQDL 294

Query: 160 L-------MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGK 212
           +       M   L  +++VD++V ++++ L + G   NT ++ +SD G++LG+ G    K
Sbjct: 295 VRWRYQRYMHDYLACVKAVDESVGRLLQTLDEEGLAENTLVVVSSDQGFYLGEHGWFD-K 353

Query: 213 SFPFEFDIRVPFLMRGPGIVPG 234
            + FE  +R P L R P  +P 
Sbjct: 354 RWIFEESLRTPLLARWPAAIPA 375


>gi|407773361|ref|ZP_11120662.1| choline-sulfatase [Thalassospira profundimaris WP0211]
 gi|407283825|gb|EKF09353.1| choline-sulfatase [Thalassospira profundimaris WP0211]
          Length = 504

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 50/174 (28%), Positives = 83/174 (47%), Gaps = 21/174 (12%)

Query: 73  YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD 132
           Y D +A ++   +    +   ++P  L +SF  PH P  +  ++ +++ +         D
Sbjct: 162 YDDEVAYNATHKIYDLSRGNDERPWCLTVSFTHPHDPYVARKKFWDLYEDCPELEP---D 218

Query: 133 YAP------NPDKQWILQVTR------KMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
            AP      +P  Q +++ +       K + V R             +  +DD V +I++
Sbjct: 219 VAPFDFEDQDPHSQRLMKASDYTAFDIKNEDVRRS-----RQAYFANISYIDDKVGQIME 273

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            L+   +L+NT II+ SDHG  LG+ GL    SF FE   RVP +M GPGI+PG
Sbjct: 274 VLETTRQLDNTIIIFVSDHGDMLGERGLWFKMSF-FEGSARVPLMMAGPGIMPG 326


>gi|251795725|ref|YP_003010456.1| sulfatase [Paenibacillus sp. JDR-2]
 gi|247543351|gb|ACT00370.1| sulfatase [Paenibacillus sp. JDR-2]
          Length = 509

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 43/151 (28%), Positives = 66/151 (43%), Gaps = 5/151 (3%)

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
           D +  LRQ  Q     P    +SFP PH P      YA+M         P       P++
Sbjct: 187 DYLAHLRQENQ-----PFFTWLSFPDPHTPYQVPEPYASMIRPEDVPMPPVDSLEGKPER 241

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
             +  +   M     Q    +       ++ +DD + KI + +  L  L NT II+TSDH
Sbjct: 242 VKVAHLMDAMDTADEQLIRQVRAIHYGMIRFIDDTLAKIFERMDALSLLENTVIIFTSDH 301

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           G  +G  G+++  +F ++    VPF+M  PG
Sbjct: 302 GDSMGAHGIIQKHNFFYDSFTHVPFIMSLPG 332


>gi|87309127|ref|ZP_01091264.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Blastopirellula marina DSM 3645]
 gi|87288118|gb|EAQ80015.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Blastopirellula marina DSM 3645]
          Length = 493

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 67/235 (28%), Positives = 105/235 (44%), Gaps = 34/235 (14%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            Y GK+    +     PG+  +       KY++   N NG+  K  + YY     D+ A 
Sbjct: 116 AYIGKWHMGEDNDEPRPGFDYFVTHKGQGKYFDTEFNFNGQGRKVVDGYYTTVVTDM-AE 174

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHG---PEDSAPQYANMFFNVTSHH-TPAYDYAP 135
           D +     SKQ    KP ML++   APH    PE+   +Y + F      +   A+D   
Sbjct: 175 DWI-----SKQD-GDKPWMLMLGHKAPHSFYLPEE---KYEHTFDQADIQYPKSAFDLED 225

Query: 136 NPDKQWILQVTRKMQPVH-------RQFTD----------LLMTKRLQTLQSVDDAVEKI 178
            P  +W  +       ++       + F D           ++     T+ SVDD+V ++
Sbjct: 226 KP--EWFKKRLDTWHGIYGPLFDWRKNFPDESPAGVKDFARMVRAYWGTILSVDDSVGRL 283

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
              LK+ GEL+NT II+TSD+G   G+ G+V  K    E  IR+P ++R PG+ P
Sbjct: 284 YDFLKERGELDNTLIIFTSDNGLLEGEHGMVD-KRTGHEPSIRIPLVVRYPGLTP 337


>gi|222147868|ref|YP_002548825.1| choline sulfatase [Agrobacterium vitis S4]
 gi|221734856|gb|ACM35819.1| choline sulfatase [Agrobacterium vitis S4]
          Length = 496

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 48/164 (29%), Positives = 80/164 (48%), Gaps = 9/164 (5%)

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
            +AN  +  L +     +++P  L +SF  PH P  +  QY +++ + ++H  P      
Sbjct: 159 FLANQKLYHLSRENDDAARRPWCLTVSFTHPHDPYVARKQYWDLYED-SNHLLPDVGALA 217

Query: 136 NPD---KQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELNN 190
           + D   K+ I         V  +  D+  ++R     +  +DD V +++  L     L+N
Sbjct: 218 DQDPHSKRLIHACDYDNFNVTEE--DIRRSRRAYFANISYIDDKVGELIDTLTRTRMLDN 275

Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           T I++ SDHG  LG+ GL    +F FE   RVP ++ GPGI PG
Sbjct: 276 TTILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMVAGPGIAPG 318


>gi|343085191|ref|YP_004774486.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342353725|gb|AEL26255.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 478

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 52/184 (28%), Positives = 91/184 (49%), Gaps = 29/184 (15%)

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
           +Y  D + N+S++ ++++    +K+P +L +SF + H P   AP+  + F +   +   +
Sbjct: 194 EYLTDRLTNESLQLIKEN----NKEPFLLYLSFYSVHTPIQPAPKRYDYFLDKKENMIVS 249

Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGEL 188
            + A             K +P     T L+         + ++D+ V +I++ELK  G  
Sbjct: 250 EEQA-------------KYKPEGEGQTKLIQDNASYASMVAAMDENVGRILQELKAQGLD 296

Query: 189 NNTYIIYTSDHGYHLGQFG---------LVKGKSFPFEFDIRVPFLMRGPGIV-PGTMYL 238
           +NT++I TSD+G     +G         L  GK + +E  IRVP L++GPGI  PGT   
Sbjct: 297 DNTWVILTSDNGGLSTLYGEGAPTSNGPLRAGKGWCYEGGIRVPMLIKGPGIKNPGTAPE 356

Query: 239 LDVL 242
           L V+
Sbjct: 357 LPVI 360


>gi|441500082|ref|ZP_20982252.1| Choline-sulfatase [Fulvivirga imtechensis AK7]
 gi|441436173|gb|ELR69547.1| Choline-sulfatase [Fulvivirga imtechensis AK7]
          Length = 504

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 56/226 (24%), Positives = 102/226 (45%), Gaps = 26/226 (11%)

Query: 21  YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
           +FGK+    + +    G+  W +      YYN ++N++G ++++ +   + Y  DL+   
Sbjct: 138 FFGKWHMGEHHANPRKGFDYWVSFKGQGVYYNPTLNIDGEEVEYTD---STYITDLLTEK 194

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPHGP-----------EDSAPQYANMFFNVTSHHTP 129
           ++ +L + K+    KP  + +S    H             E + PQY    F  ++  T 
Sbjct: 195 AIEWLGKRKKD---KPFFMYLSHKGVHSEFSPAKRHQDVYEGTKPQYPPTMFPASAD-TT 250

Query: 130 AYDYAPNPDKQWILQVTRKMQPV----HRQFT-DLLMTKRLQTLQSVDDAVEKIVKELKD 184
            Y+Y   PD  W+ +       V    H Q   D       +TL S+D+++  ++  L++
Sbjct: 251 KYNYKDVPD--WVKKQRHSWHGVDYMYHGQINFDEFYENYNETLLSIDESIGDVLSYLEE 308

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
              L +T + Y  D+G+  G+ GL+  K   +E  IRVP L  G G
Sbjct: 309 NDLLESTLVFYMGDNGFSFGEHGLID-KRHAYEESIRVPLLAYGGG 353


>gi|338212322|ref|YP_004656377.1| N-acetylglucosamine-6-sulfatase [Runella slithyformis DSM 19594]
 gi|336306143|gb|AEI49245.1| N-acetylglucosamine-6-sulfatase [Runella slithyformis DSM 19594]
          Length = 509

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 62/256 (24%), Positives = 114/256 (44%), Gaps = 42/256 (16%)

Query: 20  GYFGKYLNKY---NGSYIP-PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
           GY   +L K+   N    P PG+  W +      YY+   N+NG+++ H + Y  D    
Sbjct: 124 GYKTAFLGKWHMGNADDAPQPGFDYWLSFKGQGVYYSPIFNINGKQVTHKDGYTTD---- 179

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
           L+ + ++ ++ +  ++   KP  L +S  A H     A ++A ++ ++   + PA  Y  
Sbjct: 180 LLTDYAIEWMGKQDKN---KPFFLYLSHKAVHADFQPAKRHAGVYKDMPIQY-PASMYLT 235

Query: 136 NPD--KQWILQV----TRKMQPVHRQFTDLLMTKR-----------------------LQ 166
             D  K W        T +++   R   + +  +R                        +
Sbjct: 236 KSDTSKIWGKNTKDPETGEVKSNLRDMPNWVKNQRYSWHGVDYLYHGSISFNDFYRQYCE 295

Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
           TL  VD++V +++K L+D G   NT +IY  D+G+  G+ GL+  K   +E  +RVP L+
Sbjct: 296 TLLGVDESVGRVMKYLEDNGLAENTLVIYMGDNGFSFGERGLID-KRHAYEESMRVPMLV 354

Query: 227 RGPGIVPGTMYLLDVL 242
             P ++     L +V+
Sbjct: 355 SCPSVIKPQTKLTNVI 370


>gi|260587313|ref|ZP_05853226.1| mucin-desulfating sulfatase [Blautia hansenii DSM 20583]
 gi|260542180|gb|EEX22749.1| mucin-desulfating sulfatase [Blautia hansenii DSM 20583]
          Length = 484

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 114/248 (45%), Gaps = 37/248 (14%)

Query: 4   PVELTSRNLLYFDI--SHGYFGKYLNKYN-GSYIPP--GWREWGAL-IMNSKYYNYSINM 57
           P+        Y D+   +GY      K++ G  + P  G+ +W  + +    Y++  I  
Sbjct: 100 PISYLEGKTTYTDVLNENGYRCALAGKWHLGDSVCPQHGFSKWYTIGLGGCDYFHPDIVE 159

Query: 58  NGR-KIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP---EDSA 113
           NG  K+ H       Y  ++IAN ++ +L + +    ++P  L + F APH P   E   
Sbjct: 160 NGNIKVLH-----EQYVTEVIANKAIEYLNEFQHQ--EEPFYLSVHFTAPHSPWGEEQHP 212

Query: 114 PQYANMFFNVTSHHTPAYDYAPNPDKQW--ILQVTRKMQPVHRQFTDLLMTKRLQTLQSV 171
            ++ + + N      P  D A +PD     +    ++ + +   F           + ++
Sbjct: 213 KKWMDYYENCDFQSIP--DEADHPDLTTGPVFGTEKRKENLRGYFA---------AISAM 261

Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGK---SFPF---EFDIRVPFL 225
           D+ + +I+  L+  G   NT ++YT+D+G  +G  G V GK   +FPF   E  ++VPFL
Sbjct: 262 DEQIGRILDTLEANGLRENTLVVYTADNGMSMGHHG-VWGKGNGTFPFNMYETSVKVPFL 320

Query: 226 MRGPGIVP 233
           M  PG++P
Sbjct: 321 MSLPGVIP 328


>gi|440719037|ref|ZP_20899471.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica SWK14]
 gi|436435625|gb|ELP29454.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica SWK14]
          Length = 735

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 70/133 (52%), Gaps = 7/133 (5%)

Query: 109 PEDSAPQYAN-MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL----LMTK 163
           PE   P+  N  +  +      A+D A  P+ Q  +   +  +   +  T       +  
Sbjct: 264 PEQFLPRLGNGEYRRMNEEQKAAWDAAYEPENQKFIADMKAGKLSSKDITKWKYQRYIKD 323

Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
            L T+Q+VDD+V +++  L + G   NT +IY+SD G++LG+ G    K + FE  +R+P
Sbjct: 324 YLGTVQAVDDSVGELLAYLDESGLAENTIVIYSSDQGFYLGEHGWYD-KRWMFEESLRMP 382

Query: 224 FLMRGPGIV-PGT 235
           FL+R PG++ PGT
Sbjct: 383 FLIRWPGVIDPGT 395


>gi|283781956|ref|YP_003372711.1| sulfatase [Pirellula staleyi DSM 6068]
 gi|283440409|gb|ADB18851.1| sulfatase [Pirellula staleyi DSM 6068]
          Length = 506

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 60/258 (23%), Positives = 109/258 (42%), Gaps = 55/258 (21%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GY    + K++    P G+  W  L     YYN ++      ++H       Y  D+I +
Sbjct: 123 GYQTAVIGKWHLGSDPTGFDYWHILQGQGPYYNPAMKTPNGIVRH-----TGYTTDIITD 177

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT--------------- 124
           +++ +L + +     KP  L+    APH     AP+Y + + +VT               
Sbjct: 178 EAIGWLEKGRDK--NKPFFLMYQHKAPHRNWQPAPRYLDKYKDVTIPEPTTLFDDYKGRG 235

Query: 125 -----SHHTPAYDYAP-----------NPDKQWILQVTRKMQPVHRQFTDLLMTKR---- 164
                   + A D +P           NP+++ I +  +     ++ F +  +T      
Sbjct: 236 TAARDQQMSIAKDLSPHDLKLVAQPGLNPEQKEIFE--KAYAEENKAFQEAKLTGDDKTR 293

Query: 165 ----------LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
                     L+ + +VD+ V ++++ L + GE  NT +IYTSD G++LG+ G    K +
Sbjct: 294 WMYQRYVKDYLRCVDAVDENVGRMLEWLDNSGEAANTLVIYTSDQGWYLGEHGWYD-KRW 352

Query: 215 PFEFDIRVPFLMRGPGIV 232
            +E   R P ++R PG V
Sbjct: 353 MYEESFRTPLIVRWPGQV 370


>gi|417306141|ref|ZP_12093068.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica WH47]
 gi|327537574|gb|EGF24291.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica WH47]
          Length = 747

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 70/133 (52%), Gaps = 7/133 (5%)

Query: 109 PEDSAPQYAN-MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL----LMTK 163
           PE   P+  N  +  +      A+D A  P+ Q  +   +  +   +  T       +  
Sbjct: 276 PEQFLPRLGNGEYRRMNEEQKAAWDAAYEPENQKFIADMKAGKLSSKDITKWKYQRYIKD 335

Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
            L T+Q+VDD+V +++  L + G   NT +IY+SD G++LG+ G    K + FE  +R+P
Sbjct: 336 YLGTVQAVDDSVGELLAYLDESGLAENTIVIYSSDQGFYLGEHGWYD-KRWMFEESLRMP 394

Query: 224 FLMRGPGIV-PGT 235
           FL+R PG++ PGT
Sbjct: 395 FLIRWPGVIDPGT 407


>gi|32471182|ref|NP_864175.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica SH 1]
 gi|32396884|emb|CAD71852.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica SH 1]
          Length = 747

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 70/133 (52%), Gaps = 7/133 (5%)

Query: 109 PEDSAPQYAN-MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL----LMTK 163
           PE   P+  N  +  +      A+D A  P+ Q  +   +  +   +  T       +  
Sbjct: 276 PEQFLPRLGNGEYRRMNEEQKAAWDAAYEPENQKFIADMKAGKLSSKDITKWKYQRYIKD 335

Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
            L T+Q+VDD+V +++  L + G   NT +IY+SD G++LG+ G    K + FE  +R+P
Sbjct: 336 YLGTVQAVDDSVGELLAYLDESGLAENTIVIYSSDQGFYLGEHGWYD-KRWMFEESLRMP 394

Query: 224 FLMRGPGIV-PGT 235
           FL+R PG++ PGT
Sbjct: 395 FLIRWPGVIEPGT 407


>gi|320106473|ref|YP_004182063.1| sulfatase [Terriglobus saanensis SP1PR4]
 gi|319924994|gb|ADV82069.1| sulfatase [Terriglobus saanensis SP1PR4]
          Length = 495

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 47/180 (26%), Positives = 81/180 (45%), Gaps = 33/180 (18%)

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF---------- 120
           ++  ++ A+ ++  L   K +    P  + + F APH P  +  +Y +++          
Sbjct: 200 EHSSEIYADAAIEHL--GKVAKRDAPFFMYVGFNAPHDPRQAPQEYLDLYPADKIAIPPN 257

Query: 121 ------FNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
                 F+   H T     AP P  Q  +QV R+              +    +  +D  
Sbjct: 258 YLPEHPFDQGEHKTRDELLAPFPRTQHDVQVHRR--------------EYYAIISHMDAQ 303

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           + +I   LK  G+ +NTYII T+DHG  +G+ GL+ GK   +E  +R+P ++ GPGI  G
Sbjct: 304 IGRIFDALKKSGKADNTYIILTADHGLSVGEHGLM-GKQNQYERSMRMPLILSGPGIKAG 362


>gi|237721982|ref|ZP_04552463.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229448851|gb|EEO54642.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
          Length = 517

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 80/162 (49%), Gaps = 7/162 (4%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
            + +P  I N+++ +++Q K++    P  + +SFP PH P      Y +MF    +    
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231

Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
           T   D A   +K  IL Q+     P   Q    +    +  ++ +DD ++++++ LK  G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 291

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           +  NT  +  SDHG + G++GL++  +   E  +R+P +  G
Sbjct: 292 QYENTIFVVLSDHGDYWGEYGLIRKGAGLSESLVRIPMVWAG 333


>gi|338210316|ref|YP_004654363.1| sulfatase [Runella slithyformis DSM 19594]
 gi|336304129|gb|AEI47231.1| sulfatase [Runella slithyformis DSM 19594]
          Length = 479

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 85/162 (52%), Gaps = 13/162 (8%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT--SHHTPAYD 132
           D  A  +++FL   KQ+ + KP +  ++F APH P    P+Y   +  V    +  P + 
Sbjct: 169 DHFAETAIQFL--DKQT-ANKPFVTYVAFTAPHDPRSPLPEYLKRYGTVPLPPNFMPIHP 225

Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLL---MTKRLQTLQSVDDAVEKIVKELKDLGELN 189
           +    D    + V  +M   + + TD++   +T+    +  +D+A+ KI+ +LK+ G   
Sbjct: 226 FNFGSD----MTVRDEMLAGYPRTTDVIKSQLTEYYAMITHLDEAIGKILAKLKEKGLDK 281

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           NT I++ +D+G  +G  GL+ GK   +E  +RVP +M G GI
Sbjct: 282 NTIIVFAADNGLAMGSHGLL-GKQNLYEHSMRVPLIMSGKGI 322


>gi|449136746|ref|ZP_21772114.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula europaea 6C]
 gi|448884630|gb|EMB15114.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula europaea 6C]
          Length = 747

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 70/133 (52%), Gaps = 7/133 (5%)

Query: 109 PEDSAPQYAN-MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL----LMTK 163
           PE   P+  N  +  +      A+D A  P+ Q  +   +  +   +  T       +  
Sbjct: 276 PEQFLPRLGNGEYRRMNDEQKAAWDAAYEPENQKFIADMKAGKLSSKDITKWKYQRYIKD 335

Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
            L T+Q+VDD+V +++  L + G   NT +IY+SD G++LG+ G    K + FE  +R+P
Sbjct: 336 YLGTVQAVDDSVGELLAYLDESGLAENTIVIYSSDQGFYLGEHGWYD-KRWMFEESLRMP 394

Query: 224 FLMRGPGIV-PGT 235
           FL+R PG++ PGT
Sbjct: 395 FLIRWPGVIAPGT 407


>gi|149195789|ref|ZP_01872846.1| mucin-desulfating sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149141251|gb|EDM29647.1| mucin-desulfating sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 524

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 65/235 (27%), Positives = 108/235 (45%), Gaps = 19/235 (8%)

Query: 20  GYFGKYLNKYNGSYIPPGWRE-------WGALIMNSKYYNYSINMNGRKIK--HGEDY-Y 69
           G FG +LN   G  +  G  E       W +  M    Y  + N   R+ K  +G+   +
Sbjct: 119 GKFGFHLNDEEGKALKGGATERVINSFDWWSGWMGQGSYQMNENREAREYKSVYGDKAEH 178

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
             Y   L+  D   F++++K S   KP  L +SF APH P  + P+Y  ++  V      
Sbjct: 179 TTYALGLMGRD---FIKKAKAS--GKPFCLSISFKAPHTPLATDPRYDALYEGVKFTKPA 233

Query: 130 AYDYAPN-PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
            ++   + P++    +   K +   + + D  M K    +  +D A+  I++ELK+    
Sbjct: 234 DFNEVDSLPEQAKSGRPMSKGKSWIKSYDDT-MKKYHTMIYGMDKAIAMIMEELKEQALD 292

Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY-LLDVL 242
           +NT II+TSD+G+H G  GL  GK + +E     P ++  P    GT +  +D L
Sbjct: 293 HNTIIIFTSDNGHHNGSKGL-GGKLYAYEVGSLAPAIIYDPRHDTGTQFKTIDAL 346


>gi|340616998|ref|YP_004735451.1| hypothetical protein zobellia_1003 [Zobellia galactanivorans]
 gi|339731795|emb|CAZ95060.1| Pseudogene of sulfatase [Zobellia galactanivorans]
          Length = 339

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 48/167 (28%), Positives = 76/167 (45%), Gaps = 39/167 (23%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
           Y  D + ++ + F+R++    SKKP M+V+S    H P  + P     F N  +      
Sbjct: 37  YVADALTDECIDFIRKN----SKKPFMVVLSHYLVHNPIQAKPALVKRFKNKATTDQDNP 92

Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
           +YA                                 L SVD+++ +I  ELK LG  NNT
Sbjct: 93  EYAA-------------------------------MLVSVDESIGRINTELKRLGIENNT 121

Query: 192 YIIYTSDHG---YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PG 234
            +I+TSD+G        + L+ GKS+ FE  +RVP +++ P ++ PG
Sbjct: 122 LVIFTSDNGGLNNSTSNYPLLGGKSYSFEGAMRVPLIVKWPAMIKPG 168


>gi|423288164|ref|ZP_17267015.1| hypothetical protein HMPREF1069_02058 [Bacteroides ovatus
           CL02T12C04]
 gi|392671053|gb|EIY64529.1| hypothetical protein HMPREF1069_02058 [Bacteroides ovatus
           CL02T12C04]
          Length = 482

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 43/163 (26%), Positives = 79/163 (48%), Gaps = 15/163 (9%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D   + ++  L + K     KP  L++S P PH P+ + P Y +M+ N+     P   + 
Sbjct: 197 DFFTDKALEILERDKN----KPFCLMLSIPDPHTPDYARPPYHDMYKNMK----PEMPFT 248

Query: 135 PNPD-----KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
             P+       W +   +      + F    + +    ++ +DD+V +I+K L+D   L+
Sbjct: 249 MKPEYASQRPNWGMGGEKDKNEA-KSFDANALKQYFGMVKHIDDSVGRILKFLEDNNLLD 307

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           NT +++TSDHG    +    + K  P+E   R+PF++R PG +
Sbjct: 308 NTIVVFTSDHGDMFYEHKR-RNKGVPYEASSRIPFVIRYPGKI 349


>gi|430744285|ref|YP_007203414.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430016005|gb|AGA27719.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 475

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 54/216 (25%), Positives = 100/216 (46%), Gaps = 15/216 (6%)

Query: 18  SHGYFGKY-LNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           + GY GK+ ++   G    PG+ E  + I   KY +    +NG ++   + + +D   D 
Sbjct: 135 TTGYVGKWHMDGQKGKR--PGFDESVSFIGQGKYVDCPFEVNG-ELTPTQGWVDDRSTDF 191

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
               ++ FL++++     KP  LV+ F A HGP D   +  + F   T    PA +    
Sbjct: 192 ----AIDFLKKNQ----AKPFALVVGFKAAHGPFDPPERARDRFAGETPR--PAANQGVR 241

Query: 137 PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
           P            +               + + +VDD + +++  L +LG   +T ++++
Sbjct: 242 PIFSKAAAAPAVAKKKKGATRPEGNIGYFRCVSAVDDNLGRLMTTLDELGLAEDTVLVFS 301

Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           SD+G++LG+ GL   +S  +E  +RVP L+R P + 
Sbjct: 302 SDNGFYLGEHGLGDKRS-AYEASLRVPLLLRYPRLA 336


>gi|149195819|ref|ZP_01872876.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Lentisphaera araneosa HTCC2155]
 gi|149141281|gb|EDM29677.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Lentisphaera araneosa HTCC2155]
          Length = 519

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 100/236 (42%), Gaps = 66/236 (27%)

Query: 62  IKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF 121
           I  G +    Y  D++ ++S+++L Q  Q+   KP ++++ F +PH P    P+Y NM+ 
Sbjct: 153 IDKGHEQIMGYSTDVVTDESIKWLDQRDQN---KPFLMMVQFKSPHVPRIPHPRYMNMYT 209

Query: 122 NVTSHHTPAYDYAPN-----------------------PDKQWILQVTRK-------MQP 151
              +     YD   N                       P K     V +K       M P
Sbjct: 210 EDVAEPATLYDNYQNRLKGASTAWMEINGQNEEVLAYFPPKNATEPVNKKQKKHLDRMTP 269

Query: 152 VHRQ---------------------FTDLLMTKRLQ----------TLQSVDDAVEKIVK 180
             R+                     F D +  ++L+           +Q++DD V +++K
Sbjct: 270 EQRKALLDAMDNQNSEYYELKKAGAFKDPVKARKLKYQFFIKNYLRCVQAIDDNVGRLLK 329

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVPGT 235
            L+D     NT +IY+SD  Y +G+ G  + K + +E  +++PF++R PG I PG+
Sbjct: 330 WLEDNELDENTIVIYSSDQSYFIGEHGWAE-KRWMYEEALKMPFVIRWPGKIKPGS 384


>gi|311748198|ref|ZP_07721983.1| choline sulfatase [Algoriphagus sp. PR1]
 gi|311302763|gb|EAZ80936.2| choline sulfatase [Algoriphagus sp. PR1]
          Length = 496

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 45/195 (23%), Positives = 86/195 (44%), Gaps = 21/195 (10%)

Query: 48  SKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH 107
           + +Y+  +   G   K GE     +  +  A  ++ F++   Q  + +P    ++F APH
Sbjct: 152 ADHYDLPLRDYGADGKLGEPTRKGFSTEQFAQAAIDFIKDHGQRNTDQPFFCYVAFTAPH 211

Query: 108 GPEDSAPQYANMF--------FNVTSHHTPAYDYAPNPDKQ---WILQVTRKMQPVHRQF 156
            P      Y N +         N   +H   +D+    D+    W         P   + 
Sbjct: 212 DPYSPEANYINHYPDGTLPLPGNYMPYHPFEFDHLTVRDENLTGW---------PRKPEV 262

Query: 157 TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
             ++++     +  +D  + KI+  LK+ G+ +NT I+Y +D+G   G  GL+ GK   +
Sbjct: 263 IQMILSDYYALVTHLDTQIAKILNTLKETGQYDNTIIVYAADNGLAAGSHGLL-GKQSLY 321

Query: 217 EFDIRVPFLMRGPGI 231
           E   +VP +++GPG+
Sbjct: 322 EHSSKVPLIIKGPGV 336


>gi|229819795|ref|YP_002881321.1| arylsulfatase [Beutenbergia cavernae DSM 12333]
 gi|229565708|gb|ACQ79559.1| sulfatase [Beutenbergia cavernae DSM 12333]
          Length = 486

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 52/199 (26%), Positives = 88/199 (44%), Gaps = 15/199 (7%)

Query: 44  LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSF 103
           L  ++ ++++ I+ N    +  +   + +  + +A++S  FLR+   +   KP +L  SF
Sbjct: 152 LGRDATFFDHGIDSNSWVARPWDKPEHLHPTNFVASESADFLRRRDPT---KPFLLFASF 208

Query: 104 PAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFT---DLL 160
            APH P D        +        P  D+A   D  W         P     T   DLL
Sbjct: 209 NAPHPPFDPPAWAFEQYLETDMPDPPVGDWAEAFDP-W----ANSADPTALVGTIPPDLL 263

Query: 161 MTKRLQ---TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFE 217
              R      +  VD  +  +++EL   G  +NT + + +DHG  LG   L + K FP+E
Sbjct: 264 QRARAGYYGHMTHVDQQINFLLEELSHRGLRDNTLVCFLADHGEMLGDHHLFR-KGFPYE 322

Query: 218 FDIRVPFLMRGPGIVPGTM 236
              R+P ++ GPG+  G +
Sbjct: 323 GSARIPMILSGPGVPAGQV 341


>gi|225619372|ref|YP_002720598.1| sulfatase [Brachyspira hyodysenteriae WA1]
 gi|225214191|gb|ACN82925.1| sulfatase [Brachyspira hyodysenteriae WA1]
          Length = 474

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 43/149 (28%), Positives = 76/149 (51%), Gaps = 14/149 (9%)

Query: 95  KPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKM----- 149
           KP+ + +SF  PH P D   +Y +++ ++      +       DK+    +T++      
Sbjct: 196 KPLFMWVSFVDPHHPFDPIKKYYDIYKDIKPKELNS---KLKLDKKRPEHLTKQGDRGYW 252

Query: 150 ----QPVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHL 203
               +  H    ++   K+L    +  +D  + +I+ +LK+  E +NT II+TSDHG +L
Sbjct: 253 PGGGEEHHYSQEEIKEIKKLYYGMISFIDSQIGRIIDKLKEKNEFDNTIIIFTSDHGEYL 312

Query: 204 GQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           G +GL+K   F ++  I+VP L  G GIV
Sbjct: 313 GDYGLLKKGPFMYDCLIKVPLLFYGKGIV 341


>gi|331270234|ref|YP_004396726.1| putative sulfatase [Clostridium botulinum BKT015925]
 gi|329126784|gb|AEB76729.1| probable sulfatase [Clostridium botulinum BKT015925]
          Length = 504

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 58/102 (56%), Gaps = 3/102 (2%)

Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
           Y P  DK W+      +Q       D+ +   + ++  +D  + KI+ +L++LG   +T 
Sbjct: 275 YEPQ-DKNWLHGCHSHLQDKEELKKDIAIYYGMVSM--MDHYIGKILDKLEELGMAEDTI 331

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +++TSDHG+  GQ GL+    F +E  I++PF++R PG+VP 
Sbjct: 332 VVFTSDHGHFFGQHGLIAKGPFHYEDMIKIPFIVREPGVVPA 373


>gi|149197012|ref|ZP_01874065.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
           HTCC2155]
 gi|149140122|gb|EDM28522.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
           HTCC2155]
          Length = 463

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 52/201 (25%), Positives = 86/201 (42%), Gaps = 40/201 (19%)

Query: 37  GWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKP 96
           G REWG       Y+ +  +M     K G     DY  D + + +V F+ + +    K P
Sbjct: 165 GGREWGQPKGRGIYF-HPFDMPNMTSKEG-----DYLTDRLTDYAVDFIEKHQ----KDP 214

Query: 97  IMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF 156
            +L  S+   HGP  + P     +                  K+W     +   PV+   
Sbjct: 215 FLLYFSYYTVHGPLQAKPDLVEKY-----------------KKKWQSGKYKHKNPVYAAM 257

Query: 157 TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQF--GLVKGKSF 214
                      +QS+D++V +++ +L +LG  +NT +I+T D+G     +  GL   K+ 
Sbjct: 258 -----------VQSLDESVGRVLDKLDELGIADNTIVIFTGDNGAVGTNYCGGLKGAKAL 306

Query: 215 PFEFDIRVPFLMRGPGIVPGT 235
             E  +R PF ++GPGI PG 
Sbjct: 307 SHEGGVREPFFIKGPGIEPGV 327


>gi|402075483|gb|EJT70954.1| hypothetical protein GGTG_11977 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 518

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 51/83 (61%), Gaps = 1/83 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M + L+T+QS+DD V +++  L+D G  +NT ++YTSD G+ LG+ G    K F +E   
Sbjct: 295 MQRYLRTIQSIDDNVGRMLDWLEDEGIADNTLVVYTSDQGFFLGEHGWFD-KRFMYEESF 353

Query: 221 RVPFLMRGPGIVPGTMYLLDVLI 243
           ++PFL R P +V G     D++ 
Sbjct: 354 QMPFLARFPKLVAGGSVCRDIVC 376


>gi|149197555|ref|ZP_01874606.1| N-acetylglucosamine-6-sulfatase precursor [Lentisphaera araneosa
           HTCC2155]
 gi|149139573|gb|EDM27975.1| N-acetylglucosamine-6-sulfatase precursor [Lentisphaera araneosa
           HTCC2155]
          Length = 515

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 71/280 (25%), Positives = 117/280 (41%), Gaps = 66/280 (23%)

Query: 10  RNLLYFDISH----------GYFGKYLNKYNGSYIPPGWREWGALIM---NSKYYNYSIN 56
           +N +YFD             GY    + K++ + +P G+  +  +       KYY+  +N
Sbjct: 90  KNDMYFDGRQITFPKLLRQAGYQTAVIGKWHLASLPTGFDHFEVITGYGGQGKYYHPVMN 149

Query: 57  MNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY 116
            NG   KH       Y  ++I   ++ +L+   Q    KP ML+M   APH     +P+Y
Sbjct: 150 RNGEPTKH-----RGYTTEVITKLNMEWLKN--QRDPNKPFMLMMQHKAPHRAWLPSPKY 202

Query: 117 ANMFFNV-----TSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFT--------DL---- 159
            N F +       + HT     A +  KQ ++ +   M P   + T        DL    
Sbjct: 203 MNAFKDKKFPKPANLHTDYQGKASHVKKQDMM-IKDSMNPGDLKLTPPKYLDGADLANWH 261

Query: 160 --------------LMTKRLQT-------------LQSVDDAVEKIVKELKDLGELNNTY 192
                         L  K L++             +QS+DD++ +++  L + G   NT 
Sbjct: 262 KAYDEENAAFAKAKLSGKALRSWNYQRYIRDYVRCVQSIDDSIGEVLNYLDESGLAENTL 321

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           +IY+SD G+ LG+ G    K F +E  +R P +MR PG +
Sbjct: 322 LIYSSDQGFFLGEHGWFD-KRFMYEEALRTPLVMRWPGKI 360


>gi|325110420|ref|YP_004271488.1| Iduronate-2-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324970688|gb|ADY61466.1| Iduronate-2-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 469

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 51/181 (28%), Positives = 82/181 (45%), Gaps = 14/181 (7%)

Query: 58  NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
           NG + +   D  N YY   +A  ++  L +   S  +KP  L + F  PH P ++  +Y 
Sbjct: 171 NGIECRDVPD--NAYYDGRVAEVAIDALDEI--SREEKPFFLAVGFWKPHTPFNAPKKYW 226

Query: 118 NMF----FNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
           +++      V  H TP  D       +  L   R       ++   L    L  +  +D 
Sbjct: 227 DLYDRDQIPVPEHVTPPTDV-----PEVALTSARYKGGPDTEYLRELHHGHLAAITYLDA 281

Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            V K++ +L + G  NNT +++ SDHG HLG+ GL + K+  FE D  VP ++  P    
Sbjct: 282 QVGKVLAKLDETGLRNNTIVVFWSDHGLHLGEHGLTR-KTTAFELDAGVPMIIATPEHTA 340

Query: 234 G 234
           G
Sbjct: 341 G 341


>gi|149177554|ref|ZP_01856156.1| putative N-acetylglucosamine-6-sulfatase [Planctomyces maris DSM
           8797]
 gi|148843534|gb|EDL57895.1| putative N-acetylglucosamine-6-sulfatase [Planctomyces maris DSM
           8797]
          Length = 513

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 59/234 (25%), Positives = 108/234 (46%), Gaps = 26/234 (11%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY------NYSINMNGRKIKHGEDYYNDYY 73
           G+FGK+    +     PG+ +W +      YY       +S+N++G+ +         Y 
Sbjct: 126 GFFGKWHMGGHSDDPRPGFDKWISFRGQGHYYPPRHLKKWSLNIDGKSVPQ-----KGYI 180

Query: 74  PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-----SHHT 128
            D + + ++ +L  + +  S KP  + +S    HG    A ++A  + + +     +   
Sbjct: 181 TDELTDYAIEWLDDTVKP-SGKPFFVYLSHKGVHGMFHPAERHAGRYKDKSMPIPKTMEN 239

Query: 129 PAYDYAPNPDKQWILQVTRKMQPV---HRQFTDLLMTKRL--QTLQSVDDAVEKIVKELK 183
            + +Y   P   W+         V   + Q TD+    RL  + L SVD+++ ++ K LK
Sbjct: 240 TSENYFNKP--MWLKNQRNSWHGVDFAYHQDTDIEEHYRLYCEALLSVDESIARVRKWLK 297

Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-VPGTM 236
           D G   NT ++Y  D+G+  G+ GL+  K   +E  +RVP +   PG+  PGT+
Sbjct: 298 DNGLAENTLVMYMGDNGFQWGEHGLID-KRTAYEASMRVPLVGVCPGLWKPGTV 350


>gi|332307886|ref|YP_004435737.1| sulfatase [Glaciecola sp. 4H-3-7+YE-5]
 gi|332175215|gb|AEE24469.1| sulfatase [Glaciecola sp. 4H-3-7+YE-5]
          Length = 554

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 62/255 (24%), Positives = 100/255 (39%), Gaps = 53/255 (20%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRK-IKHGEDYYNDYYPDLIA 78
           GY    + K++    P  +  +  L     Y++      G K     E  Y  +  D++ 
Sbjct: 141 GYETAIIGKWHLKAEPGAFDYYQVLESQGTYFDPEFRTRGAKPWPENETQYQGHSSDVVT 200

Query: 79  NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
           + S+ +L   K   S KP  L+  F APH     AP+Y +           A +  P PD
Sbjct: 201 DLSIEWL---KNRSSDKPFFLMHQFKAPHDMFQYAPRYEDFL---------AAETIPEPD 248

Query: 139 KQWILQVT---------------------------RKM------------QPVHRQFTDL 159
             + +  T                           R M            +   RQ    
Sbjct: 249 DLYGVAKTFGSIATRGKNDALRAEIGTSVSRRNNRRNMGIDLGVDPNLNEEEFTRQAYQK 308

Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
            +   L+ ++ VDD V ++++ LKD G+  NT IIYTSD G  LG+  L + K + ++  
Sbjct: 309 YLKAYLRCVKGVDDNVARLIQTLKDAGQYENTIIIYTSDQGMMLGEHDL-QDKRWIYDES 367

Query: 220 IRVPFLMRGPGIVPG 234
           IR+PF+++ P  V  
Sbjct: 368 IRMPFIVKHPDAVAA 382


>gi|343516047|ref|ZP_08753093.1| sulfatase [Vibrio sp. N418]
 gi|342797271|gb|EGU32926.1| sulfatase [Vibrio sp. N418]
          Length = 499

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 57/192 (29%), Positives = 96/192 (50%), Gaps = 37/192 (19%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
           NDY   ++A++S+++L    +   + P  L + FP  H P  S  +Y +M+ +  +    
Sbjct: 200 NDYAGGVVADESIQWLDTINK---RDPFFLYVPFPEVHTPIASPKKYLDMYSDYIT---- 252

Query: 130 AYDYAP-NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
             D+A  NPD   +     K QP   Q       +    +  +D+ V +IV +LK++GE 
Sbjct: 253 --DFAKQNPD---LYHWDWKGQPYRGQ------GEYYANISYMDEQVGRIVSKLKEIGEY 301

Query: 189 NNTYIIYTSDHG---------YHL---GQFGLVKG-KSFPFEFDIRVPFLMRGPGIVPGT 235
           +NT I+++SD+G         + L   G+ G ++G K   FE  IRVP +M+G GI   T
Sbjct: 302 DNTIILFSSDNGPVTREARKPWELNMAGETGGLRGRKDNLFEGGIRVPLIMKGSGITQNT 361

Query: 236 -----MYLLDVL 242
                +Y LD++
Sbjct: 362 QSHEPVYGLDIV 373


>gi|338212053|ref|YP_004656106.1| sulfatase [Runella slithyformis DSM 19594]
 gi|336305872|gb|AEI48974.1| sulfatase [Runella slithyformis DSM 19594]
          Length = 528

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 62/238 (26%), Positives = 101/238 (42%), Gaps = 39/238 (16%)

Query: 19  HGYFGKYLNKYN------GSYIPP----GWREWGALIMNSKYYN---YSINMNGRKIKHG 65
           HGY   ++ K++       +YIPP    G+  W AL     Y N   Y+ + N + +  G
Sbjct: 161 HGYTTGFVGKWHIDGHGRNTYIPPTRQQGFGYWKALECTHDYNNSAYYAGDSNKKLVWKG 220

Query: 66  EDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS 125
            D        +IA         + ++    P M+ +S  +PH P  +AP+     F    
Sbjct: 221 YD--------VIAQTDDVCNYMTAKAKEPNPFMVFISIGSPHDPYQTAPEKYRKLF---- 268

Query: 126 HHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
                       DK  ++     ++   +   DL        + ++DD + K+ + L DL
Sbjct: 269 -----------ADKDIMINDNVPVEKREKVKNDL--RGYYSHIAAIDDCIGKMWQTLIDL 315

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
           G  +NT I+++SDHG  LG  G    K  P+E  IRVPFL+  P +   T    + L+
Sbjct: 316 GIEDNTIIVFSSDHGDLLGAHGSWN-KQQPYEESIRVPFLLHYPKLFGKTGKTSEALL 372


>gi|410647586|ref|ZP_11358010.1| mucin-desulfating sulfatase [Glaciecola agarilytica NO2]
 gi|410132848|dbj|GAC06409.1| mucin-desulfating sulfatase [Glaciecola agarilytica NO2]
          Length = 554

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 62/255 (24%), Positives = 100/255 (39%), Gaps = 53/255 (20%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRK-IKHGEDYYNDYYPDLIA 78
           GY    + K++    P  +  +  L     Y++      G K     E  Y  +  D++ 
Sbjct: 141 GYETAIIGKWHLKAEPGAFDYYQVLESQGTYFDPEFRTRGAKPWPENETQYQGHSSDVVT 200

Query: 79  NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
           + S+ +L   K   S KP  L+  F APH     AP+Y +           A +  P PD
Sbjct: 201 DLSIEWL---KNRSSDKPFFLMHQFKAPHDMFQYAPRYEDFL---------AAETIPEPD 248

Query: 139 KQWILQVT---------------------------RKM------------QPVHRQFTDL 159
             + +  T                           R M            +   RQ    
Sbjct: 249 DLYGVAKTFGSIATRGKNDALRAEIGTSVSRRNNRRNMGIDLGVDPNLNEEEFTRQAYQK 308

Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
            +   L+ ++ VDD V ++++ LKD G+  NT IIYTSD G  LG+  L + K + ++  
Sbjct: 309 YLKAYLRCVKGVDDNVARLIQTLKDAGQYENTIIIYTSDQGMMLGEHDL-QDKRWIYDES 367

Query: 220 IRVPFLMRGPGIVPG 234
           IR+PF+++ P  V  
Sbjct: 368 IRMPFIVKHPDAVAA 382


>gi|311109197|ref|YP_003982050.1| sulfatase family protein 12 [Achromobacter xylosoxidans A8]
 gi|310763886|gb|ADP19335.1| sulfatase family protein 12 [Achromobacter xylosoxidans A8]
          Length = 508

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 76/175 (43%), Gaps = 21/175 (12%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPAYD 132
           I + ++ +LR+ +     +P  L  SFP PH P D    ++ +      ++  H    Y+
Sbjct: 205 IGDRTIDYLREHRD----QPFCLWASFPDPHHPFDCPEPWSRLHHPDEVDLPRHRATDYE 260

Query: 133 YAP-------------NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
             P             N + Q + Q   +M     +    ++      +  VD  V +I+
Sbjct: 261 RRPWWHRASMESKPMGNAEVQAVRQNFSRMPTQSDEALRRIIANYYGMISLVDHQVGRIM 320

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           K L + G   +T +I+TSDHG  LG  GL+     P+E  +RV  +  GPGI  G
Sbjct: 321 KALSEYGLDQDTIVIFTSDHGEWLGDHGLMLKGPMPYEGVLRVAMVASGPGIAAG 375


>gi|298482548|ref|ZP_07000733.1| sulfatase [Bacteroides sp. D22]
 gi|298271255|gb|EFI12831.1| sulfatase [Bacteroides sp. D22]
          Length = 517

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 7/162 (4%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
            + +P  I N+++ +++Q K++    P  + +SFP PH P      Y +MF    +    
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231

Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
           T   D A   +K  IL Q+     P   Q    +    +  ++ +DD ++++++ LK  G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDMPRIRANYIGMIRLIDDQIKRLIESLKASG 291

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           +  NT  +  SDHG + G++GL++  +   E   R+P +  G
Sbjct: 292 QYENTIFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333


>gi|196232752|ref|ZP_03131603.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196223212|gb|EDY17731.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 496

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 60/234 (25%), Positives = 103/234 (44%), Gaps = 26/234 (11%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            Y GK+    +     PG+  +       KY++ + N+NG     G      YY  ++ +
Sbjct: 121 AYMGKWHMGEDNDAPRPGFDFFATHKGQGKYWDTAWNINGA----GSKVIPGYYTTIVTD 176

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
            ++ +L   K+    KP  L +   APH       +YA++F NV   + P   +      
Sbjct: 177 MALDWL---KKDHGGKPWALCIGHKAPHSFYTPEEKYAHVFDNVRVPY-PESAFHLEDKP 232

Query: 140 QWILQVT-------------RKMQPVHRQFT----DLLMTKRLQTLQSVDDAVEKIVKEL 182
            W+ Q               RK  P  R       + ++     T+ SVDD+V +++K L
Sbjct: 233 TWMKQRLYTWHGIYGPLFEWRKKFPDDRPEAVKDFENMVHGYWGTILSVDDSVGRLLKYL 292

Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           +D  +L+NT I++  D+G   G+ G+V  K    E  +R+P L+R PG+  G +
Sbjct: 293 EDTKQLDNTIIVFMGDNGLLEGEHGMVD-KRTAHEPSMRIPMLVRYPGLAKGKV 345


>gi|168334519|ref|ZP_02692680.1| sulfatase [Epulopiscium sp. 'N.t. morphotype B']
          Length = 510

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 47/175 (26%), Positives = 83/175 (47%), Gaps = 25/175 (14%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT----- 124
            DYY + IAN++V+F+   K     +P+   +SF  PH P D AP    +   +T     
Sbjct: 155 EDYYEEWIANNAVKFIEDHKD----EPLFAWVSFQGPHQPYD-APAENGISTALTPPDPI 209

Query: 125 ------SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
                 S + P Y  A N     IL +       +R+       +  + +Q +D  V KI
Sbjct: 210 MLDFEPSCNVPKYRRASNISTPSILGMK-----AYRE----RRARYAEGVQIIDYEVGKI 260

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           + +LK+LG   +T II+++DHG  LG + + +  + P++  + +P ++     +P
Sbjct: 261 IAKLKELGIYKDTVIIFSTDHGCMLGDYDMYEKGAMPYKAQLEIPMIVVDADALP 315


>gi|423291167|ref|ZP_17270015.1| hypothetical protein HMPREF1069_05058 [Bacteroides ovatus
           CL02T12C04]
 gi|392664201|gb|EIY57743.1| hypothetical protein HMPREF1069_05058 [Bacteroides ovatus
           CL02T12C04]
          Length = 517

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 7/162 (4%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
            + +P  I N+++ +++Q K++    P  + +SFP PH P      Y +MF    +    
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231

Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
           T   D A   +K  IL Q+     P   Q    +    +  ++ +DD ++++++ LK  G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 291

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           +  NT  +  SDHG + G++GL++  +   E   R+P +  G
Sbjct: 292 QYENTLFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333


>gi|293368572|ref|ZP_06615180.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
 gi|292636369|gb|EFF54853.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
          Length = 517

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 7/162 (4%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
            + +P  I N+++ +++Q K++    P  + +SFP PH P      Y +MF    +    
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231

Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
           T   D A   +K  IL Q+     P   Q    +    +  ++ +DD ++++++ LK  G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDMPRIRANYIGMIRLIDDQIKRLIESLKASG 291

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           +  NT  +  SDHG + G++GL++  +   E   R+P +  G
Sbjct: 292 QYENTIFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333


>gi|421610218|ref|ZP_16051399.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica SH28]
 gi|408499015|gb|EKK03493.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica SH28]
          Length = 735

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 69/133 (51%), Gaps = 7/133 (5%)

Query: 109 PEDSAPQYAN-MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL----LMTK 163
           PE   P+  N  +  +      A+D A  P+ Q  +   +  +   +  T       +  
Sbjct: 264 PEQFLPRLGNGEYRRMNEEQKAAWDAAYEPENQKFIADMKAGKLSSKDITKWKYQRYIKD 323

Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
            L T+Q+VDD+V +++  L + G   NT +IY+SD G++LG+ G    K + FE  +R+P
Sbjct: 324 YLGTVQAVDDSVGELLAYLDESGLAENTIVIYSSDQGFYLGEHGWYD-KRWMFEESLRMP 382

Query: 224 FLMRGPGIV-PGT 235
           FL+R PG + PGT
Sbjct: 383 FLIRWPGAIDPGT 395


>gi|423293725|ref|ZP_17271852.1| hypothetical protein HMPREF1070_00517 [Bacteroides ovatus
           CL03T12C18]
 gi|392677683|gb|EIY71099.1| hypothetical protein HMPREF1070_00517 [Bacteroides ovatus
           CL03T12C18]
          Length = 517

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 7/162 (4%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
            + +P  I N+++ +++Q K++    P  + +SFP PH P      Y +MF    +    
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231

Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
           T   D A   +K  IL Q+     P   Q    +    +  ++ +DD ++++++ LK  G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 291

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           +  NT  +  SDHG + G++GL++  +   E   R+P +  G
Sbjct: 292 QYENTLFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333


>gi|160884808|ref|ZP_02065811.1| hypothetical protein BACOVA_02798 [Bacteroides ovatus ATCC 8483]
 gi|156109843|gb|EDO11588.1| arylsulfatase [Bacteroides ovatus ATCC 8483]
 gi|295085766|emb|CBK67289.1| Arylsulfatase A and related enzymes [Bacteroides xylanisolvens
           XB1A]
          Length = 517

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 7/162 (4%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
            + +P  I N+++ +++Q K++    P  + +SFP PH P      Y +MF    +    
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231

Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
           T   D A   +K  IL Q+     P   Q    +    +  ++ +DD ++++++ LK  G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 291

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           +  NT  +  SDHG + G++GL++  +   E   R+P +  G
Sbjct: 292 QYENTLFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333


>gi|423214424|ref|ZP_17200952.1| hypothetical protein HMPREF1074_02484 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692839|gb|EIY86075.1| hypothetical protein HMPREF1074_02484 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 517

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 7/162 (4%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
            + +P  I N+++ +++Q K++    P  + +SFP PH P      Y +MF    +    
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231

Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
           T   D A   +K  IL Q+     P   Q    +    +  ++ +DD ++++++ LK  G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 291

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           +  NT  +  SDHG + G++GL++  +   E   R+P +  G
Sbjct: 292 QYENTIFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333


>gi|336412903|ref|ZP_08593256.1| hypothetical protein HMPREF1017_00364 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942949|gb|EGN04791.1| hypothetical protein HMPREF1017_00364 [Bacteroides ovatus
           3_8_47FAA]
          Length = 517

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 7/162 (4%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
            + +P  I N+++ +++Q K++    P  + +SFP PH P      Y +MF    +    
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231

Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
           T   D A   +K  IL Q+     P   Q    +    +  ++ +DD ++++++ LK  G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 291

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           +  NT  +  SDHG + G++GL++  +   E   R+P +  G
Sbjct: 292 QYENTLFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333


>gi|320102664|ref|YP_004178255.1| sulfatase [Isosphaera pallida ATCC 43644]
 gi|319749946|gb|ADV61706.1| sulfatase [Isosphaera pallida ATCC 43644]
          Length = 1074

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 64/262 (24%), Positives = 113/262 (43%), Gaps = 54/262 (20%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYS-INMNGRKIKHGEDYYNDYYPDLIA 78
           GY      K++    P G+ +W  L     YYN   +   G+  K G      Y  DLI 
Sbjct: 212 GYQTALFGKWHLVSEPTGFDDWEILPGQGHYYNPDFLTPTGKITKTG------YVTDLIT 265

Query: 79  NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY--DYAP- 135
             ++++  QS+     +P + ++   APH   + A ++     +      P    DYA  
Sbjct: 266 ERALQWFTQSRDP--SRPFLWMVHHKAPHREWEPALRHLGHDNDRVYPEPPTLFDDYAGR 323

Query: 136 -NPDKQWILQVTRKM-------------------------QPVHRQF-------TDLL-- 160
              +++  + + R M                         +P +R F        DL+  
Sbjct: 324 GKAEREQDMTIARTMNERDLKLVPPARLNAEQRRVWDAYYEPRNRAFRQAKLEGNDLVRW 383

Query: 161 -----MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
                M   L  +++VD++V+++++ L+  G  +NT ++Y SD G++LG+ G    K F 
Sbjct: 384 KYNRYMHDYLGCVKAVDESVQRLLEALEAAGVADNTIVVYCSDQGFYLGEHGWFD-KRFI 442

Query: 216 FEFDIRVPFLMRGPGIV-PGTM 236
           FE  +R P L+R PG+V PG++
Sbjct: 443 FEESLRTPLLIRWPGVVAPGSV 464



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 42/163 (25%), Positives = 75/163 (46%), Gaps = 7/163 (4%)

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
           D+    +A++++R L Q +     +P+ L + F  PH P  +  +Y +++   T   TP 
Sbjct: 796 DHTDGKVASEAIRILNQRQD----QPLFLAVGFYRPHTPYVAPQKYFDLYPFETVSLTPR 851

Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKELKDLGEL 188
            +  P          TR               K+    ++  VD  + +++  +  LG  
Sbjct: 852 PNMIPEDVPAAAFASTRPWPSFGITPEQERRCKQAYWASISFVDAQLGRVLAAIDALGLR 911

Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           + T +++ SDHGYHLG+ GL   +S  FE   RVP ++ G G+
Sbjct: 912 DKTIVVFLSDHGYHLGEHGLWLKQS-NFEESARVPLVIAGYGV 953


>gi|149176052|ref|ZP_01854669.1| putative sulfatase [Planctomyces maris DSM 8797]
 gi|148845206|gb|EDL59552.1| putative sulfatase [Planctomyces maris DSM 8797]
          Length = 470

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 65/236 (27%), Positives = 99/236 (41%), Gaps = 42/236 (17%)

Query: 20  GYFGKYLNKYNG--SYIPPGWRE----WGALIMNSKYYN--YSINMNGRKIKHGEDYYND 71
           G+ GK+     G  ++ PP  R+    W AL     Y N  Y  +   R+   G   Y+ 
Sbjct: 118 GFIGKWHLDGRGRTAFTPPERRQGFEFWRALECTHNYNNSFYYADSPERRTWEG---YDA 174

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
           +    +A   +R      QS + +P +LVMS+ +PH P  +AP              P Y
Sbjct: 175 FAQTRVARQFIR-----NQSKAGQPFLLVMSYGSPHNPYHTAP--------------PEY 215

Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIVKELKDLGE 187
                P+K        K++P   +       K L        ++DD V  ++  LK+ G 
Sbjct: 216 QSMYEPEKI-------KLRPNVPKDQQAAAKKELAGYYAHCSALDDCVGDLLATLKETGV 268

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
            +NT I++TSDHG  L   G ++ K  P++  +RVP L R  G        LD  I
Sbjct: 269 DDNTIIVFTSDHGDMLHSHGQIR-KQKPWDESLRVPMLFRLNGAEHAKGRTLDTPI 323


>gi|54288350|gb|AAV31638.1| probable sulfatase [uncultured alpha proteobacterium EBAC2C11]
          Length = 512

 Score = 63.5 bits (153), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 51/177 (28%), Positives = 80/177 (45%), Gaps = 23/177 (12%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           +A+ S+ +L   ++S    P  L +SFP PH P D  P+  N+  N      P +     
Sbjct: 207 VADRSIHWLSNRRES---NPFCLWISFPDPHHPFD-CPEPWNLLHNPEDVDLPKFLEKDL 262

Query: 137 PDKQW---------------ILQVTRKM---QPVHRQFTDLLMTKRLQTLQS-VDDAVEK 177
            D+ W               +L+  RK     P   +     MT     + S +D  V +
Sbjct: 263 NDRPWWHRRSLESEPDLSDPVLKRFRKQGSRMPDQSEAQLREMTANYYGMISLIDHNVGR 322

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           ++  L++ G L+ T IIYTSDHG H+G+ GL       ++  I V  ++RGPG+  G
Sbjct: 323 VIACLREKGILDETIIIYTSDHGDHMGERGLYLKGPMLYDSLINVGMIVRGPGVAAG 379


>gi|149196160|ref|ZP_01873216.1| sulfatase family protein [Lentisphaera araneosa HTCC2155]
 gi|149141007|gb|EDM29404.1| sulfatase family protein [Lentisphaera araneosa HTCC2155]
          Length = 504

 Score = 63.5 bits (153), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 51/167 (30%), Positives = 77/167 (46%), Gaps = 13/167 (7%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           Y   +IA + VR +++  Q   KKP  + + F  PH P ++  +Y +++    F +  + 
Sbjct: 202 YEDGMIAREGVRLIKELGQK--KKPFFIAVGFKKPHLPFNAPKKYWDLYKETDFALEKYQ 259

Query: 128 TPAYDYAPNPDKQ--WILQ---VTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
            P    AP+   Q  W      V R    V   F   L       +  VD  + K++K L
Sbjct: 260 KPVQG-APHYAYQNSWEFSGYNVPRINGEVLESFQRKLKHAYAACISYVDAQIAKLLKTL 318

Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
           KD G   NT I++ SDHG+HLG  G+   K   +E   RVPF +  P
Sbjct: 319 KDQGLEKNTVIVFWSDHGFHLGDHGMW-CKHSNYEQATRVPFFVYDP 364


>gi|410639714|ref|ZP_11350259.1| mucin-desulfating sulfatase [Glaciecola chathamensis S18K6]
 gi|410140595|dbj|GAC08446.1| mucin-desulfating sulfatase [Glaciecola chathamensis S18K6]
          Length = 499

 Score = 63.5 bits (153), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 62/255 (24%), Positives = 100/255 (39%), Gaps = 53/255 (20%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRK-IKHGEDYYNDYYPDLIA 78
           GY    + K++    P  +  +  L     Y++      G K     E  Y  +  D++ 
Sbjct: 86  GYETAIIGKWHLKAEPGAFDYYQVLESQGTYFDPEFRTRGAKPWPENETQYQGHSSDVVT 145

Query: 79  NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
           + S+ +L   K   S KP  L+  F APH     AP+Y +           A +  P PD
Sbjct: 146 DLSIEWL---KNRSSDKPFFLMHQFKAPHDMFQYAPRYEDFL---------AAETIPEPD 193

Query: 139 KQWILQVT---------------------------RKM------------QPVHRQFTDL 159
             + +  T                           R M            +   RQ    
Sbjct: 194 DLYGVAKTFGSIATRGKNDALRAEIGTSVSRRNNRRNMGIDLGVDPNLNEEEFTRQAYQK 253

Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
            +   L+ ++ VDD V ++++ LKD G+  NT IIYTSD G  LG+  L + K + ++  
Sbjct: 254 YLKAYLRCVKGVDDNVARLIQTLKDAGQYENTIIIYTSDQGMMLGEHDL-QDKRWIYDES 312

Query: 220 IRVPFLMRGPGIVPG 234
           IR+PF+++ P  V  
Sbjct: 313 IRMPFIVKHPDAVAA 327


>gi|15964702|ref|NP_385055.1| choline sulfatase [Sinorhizobium meliloti 1021]
 gi|334315413|ref|YP_004548032.1| choline-sulfatase [Sinorhizobium meliloti AK83]
 gi|384528657|ref|YP_005712745.1| choline-sulfatase [Sinorhizobium meliloti BL225C]
 gi|384535063|ref|YP_005719148.1| choline sulfatase [Sinorhizobium meliloti SM11]
 gi|433612715|ref|YP_007189513.1| choline-sulfatase [Sinorhizobium meliloti GR4]
 gi|7387556|sp|O69787.2|BETC_RHIME RecName: Full=Choline-sulfatase
 gi|15073880|emb|CAC45521.1| Choline sulfatase [Sinorhizobium meliloti 1021]
 gi|333810833|gb|AEG03502.1| choline-sulfatase [Sinorhizobium meliloti BL225C]
 gi|334094407|gb|AEG52418.1| choline-sulfatase [Sinorhizobium meliloti AK83]
 gi|336031955|gb|AEH77887.1| choline sulfatase [Sinorhizobium meliloti SM11]
 gi|429550905|gb|AGA05914.1| choline-sulfatase [Sinorhizobium meliloti GR4]
          Length = 512

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 46/165 (27%), Positives = 81/165 (49%), Gaps = 9/165 (5%)

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
            +AN  +  L +     S++P  L +SF  PH P  +  ++ +++ +   H TP     P
Sbjct: 170 FLANQKLYQLSRENDDESRRPWCLTVSFTHPHDPYVARRKFWDLYEDC-EHLTPEVGAIP 228

Query: 136 ----NPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELN 189
               +P  Q I+ ++   Q       ++  ++R     +  +D+ V +++  L     L+
Sbjct: 229 LDEQDPHSQRIM-LSCDYQNFDVTEENVRRSRRAYFANISYLDEKVGELIDTLTRTRMLD 287

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +T I++ SDHG  LG+ GL    +F FE   RVP ++ GPGI PG
Sbjct: 288 DTLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGPGIAPG 331


>gi|388567294|ref|ZP_10153729.1| putative sulfatase YidJ [Hydrogenophaga sp. PBC]
 gi|388265505|gb|EIK91060.1| putative sulfatase YidJ [Hydrogenophaga sp. PBC]
          Length = 517

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 54/194 (27%), Positives = 80/194 (41%), Gaps = 32/194 (16%)

Query: 72  YYP-DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF--------FN 122
           YYP   I + +V FLR ++     +P    +SFP PH P     +Y  ++          
Sbjct: 209 YYPTSYIRDRAVDFLRTTRGQ--AQPFFAFVSFPDPHHPFTPPGRYWGLYDPADFPVRLP 266

Query: 123 VTSHHTPAYDYAPNPDKQWI----LQVTRKMQPVHRQF------TDLLMTKRLQTLQSVD 172
             +H  P       P  +W     L  +R  Q  H  F          M      +  +D
Sbjct: 267 FEAHRNPP------PPIRWCRERFLDGSRARQGPHDSFFATEREIREAMALSAGMITMID 320

Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL-----MR 227
           DAV  IV+ L++ G+  NT I++TSDHG +LG F LV   +       RVPF+       
Sbjct: 321 DAVGDIVRALQESGQAENTVIVFTSDHGDYLGDFNLVLKGALASRGITRVPFIWCDPRQA 380

Query: 228 GPGIVPGTMYLLDV 241
            P + P     +D+
Sbjct: 381 APAVSPALASTIDI 394


>gi|359408276|ref|ZP_09200747.1| arylsulfatase A family protein [SAR116 cluster alpha
           proteobacterium HIMB100]
 gi|356676686|gb|EHI49036.1| arylsulfatase A family protein [SAR116 cluster alpha
           proteobacterium HIMB100]
          Length = 509

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 45/181 (24%), Positives = 83/181 (45%), Gaps = 27/181 (14%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPAYD 132
            A+ ++ +L++  Q    KP    +SFP PH P D+   ++++      ++  H    ++
Sbjct: 205 TADRTINWLKEQAQQ--DKPFCAWVSFPDPHHPFDAPEPWSHLHKPENVDLPEHRQRTFE 262

Query: 133 YAPNPDKQWILQV-------TRKMQPVHRQFTDL----------LMTKRLQTLQSVDDAV 175
             P     W  QV       T++     + ++ +          ++      +  VD  +
Sbjct: 263 GRP----WWHEQVLTAEPAGTKENAETRKAYSRISHQSDEQLREIIANTYGQIALVDHQL 318

Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
            +I+  L++LG   NT I+Y SDHG  LG  GL+      +E  +RVP +M+GPG+  G 
Sbjct: 319 GRIMASLEELGLAENTLIVYASDHGDWLGDHGLILKGPMHYEGLLRVPLIMKGPGVPAGQ 378

Query: 236 M 236
           +
Sbjct: 379 V 379


>gi|3046314|gb|AAC13371.1| choline sulfatase [Sinorhizobium meliloti]
          Length = 512

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 46/165 (27%), Positives = 81/165 (49%), Gaps = 9/165 (5%)

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
            +AN  +  L +     S++P  L +SF  PH P  +  ++ +++ +   H TP     P
Sbjct: 170 FLANQKLYQLSRENDDESRRPWCLTVSFTHPHDPYVARRKFWDLYEDC-EHLTPEVGAIP 228

Query: 136 ----NPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELN 189
               +P  Q I+ ++   Q       ++  ++R     +  +D+ V +++  L     L+
Sbjct: 229 LDEQDPHSQRIM-LSCDYQNFDVTEENVRRSRRAYFANISYLDEKVGELIDTLTRTRMLD 287

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +T I++ SDHG  LG+ GL    +F FE   RVP ++ GPGI PG
Sbjct: 288 DTLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGPGIAPG 331


>gi|149199420|ref|ZP_01876456.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149137498|gb|EDM25915.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 476

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 46/173 (26%), Positives = 81/173 (46%), Gaps = 23/173 (13%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ +++ N  V F+ ++ +    KP  + ++F A H P  +  ++ +M+      V   +
Sbjct: 172 HWSEVVGNHGVDFINEAAKD--DKPFFMYIAFNATHDPRQAPKEFIDMYPLDSIEVPKSY 229

Query: 128 TPAYDYAPNPDKQWILQVTRKMQP---------VHRQFTDLLMTKRLQTLQSVDDAVEKI 178
              Y Y        +     ++ P         VHRQ    L T   + +  + DA+EK 
Sbjct: 230 LSNYPYDEEMGCNPVGLRDERLMPSPRTEFAVKVHRQEYFALATHMDREIGRIFDALEKS 289

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
                  G  +NTY+I+T+DHG  +G  GLV GK   ++  ++VPF + GPGI
Sbjct: 290 -------GMKDNTYVIFTADHGLSVGHHGLV-GKQNMYDHSMQVPFFIAGPGI 334


>gi|410456454|ref|ZP_11310315.1| arylsulfatase [Bacillus bataviensis LMG 21833]
 gi|409928123|gb|EKN65246.1| arylsulfatase [Bacillus bataviensis LMG 21833]
          Length = 486

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 49/162 (30%), Positives = 79/162 (48%), Gaps = 19/162 (11%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP- 135
           +  +S+ FLR+   +   KP  L MSF  PH P D    + +M+ ++     P  D+A  
Sbjct: 186 VVTESIDFLRRRDPT---KPFFLKMSFVRPHPPFDPPQVFFDMYKDLELPDAPVGDWAER 242

Query: 136 -NPDKQWILQVTRK-MQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIVKELKDLGEL 188
            +P K     +T K + P +R        KR Q      +  +D  + + +  +++ G L
Sbjct: 243 EDPKKSGYNPITSKGIVPKNR-------LKRAQAAYYALITHIDYQIGRFLMAMEEHGVL 295

Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           NNT I++ SDHG  LG   L + KS  +E   +VPF++  PG
Sbjct: 296 NNTVILFCSDHGELLGDHNLFR-KSLAYEGSAKVPFILADPG 336


>gi|288870307|ref|ZP_06113643.2| arylsulfatase [Clostridium hathewayi DSM 13479]
 gi|288867673|gb|EFC99971.1| arylsulfatase [Clostridium hathewayi DSM 13479]
          Length = 535

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 81/160 (50%), Gaps = 21/160 (13%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFN-----VTSHHTPAY 131
           +A  SV  L   K++  K+P  L  SF APH P D   ++A+++       +    TP  
Sbjct: 189 VAKRSVYHL---KENGGKRPFFLWSSFIAPHPPFDVPEKWADLYKGKELPPLKESKTPIS 245

Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKELKDLGELN 189
             A     +W     +K    +   + L   + L   ++  VD  +  I+++LKD+GE +
Sbjct: 246 GIA-----EW-----KKYIADYPNESYLRRARELYYASISFVDYNIGTILQQLKDMGEYD 295

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
           NT I++TSDHG  LG  G  + K  P++  +R+PF+MR P
Sbjct: 296 NTLILFTSDHGEMLGDHGTFQ-KMLPYDGSVRIPFIMRYP 334


>gi|29348460|ref|NP_811963.1| mucin-desulfating sulfatase [Bacteroides thetaiotaomicron VPI-5482]
 gi|383124650|ref|ZP_09945313.1| hypothetical protein BSIG_1599 [Bacteroides sp. 1_1_6]
 gi|29340364|gb|AAO78157.1| mucin-desulfating sulfatase [Bacteroides thetaiotaomicron VPI-5482]
 gi|382983652|gb|EES69274.2| hypothetical protein BSIG_1599 [Bacteroides sp. 1_1_6]
          Length = 518

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 54/233 (23%), Positives = 100/233 (42%), Gaps = 51/233 (21%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
           DL+   ++R++++       +P ++   F A H P D   +  +++  VT        D+
Sbjct: 177 DLVTEKAIRWMKEQD---GNQPFLMCCHFKATHEPYDYPIRMEHLYDGVTFPEPENLLDW 233

Query: 134 AP------------------------NPDKQWI-----------LQVTRKMQPVHRQFTD 158
            P                        +PDK W            +Q T   +  +++F  
Sbjct: 234 GPETNGRSFKGQTLEELERRWRIASQDPDKWWCRYPGLPFSTEGMQRTAARRASYQKF-- 291

Query: 159 LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEF 218
             +   L+   +VDD + K++  L ++   +NT +IY SD GY LG+ G    + F +E 
Sbjct: 292 --IRDYLRCGATVDDNIGKLLNALDEMNIADNTIVIYVSDQGYFLGEHGFFDKRMF-YEE 348

Query: 219 DIRVPFLMRGPGIVPGTMYLLDVLI-----PQVRKFSSGSLIFIM--SFLINL 264
             R+PF++R P  VP    L D+++     P + +F+   +  +   SF+ NL
Sbjct: 349 SARMPFVIRYPKKVPAGKRLDDLILNVDFAPTLAEFAGVKMENVQGDSFVSNL 401


>gi|254515103|ref|ZP_05127164.1| choline-sulfatase [gamma proteobacterium NOR5-3]
 gi|219677346|gb|EED33711.1| choline-sulfatase [gamma proteobacterium NOR5-3]
          Length = 464

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/230 (21%), Positives = 103/230 (44%), Gaps = 22/230 (9%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           G+FGK+   +      PG+  W + +    Y +  +N++G        + + Y  D++ +
Sbjct: 123 GFFGKWHMGHADDSPRPGFDRWVSFVGQGVYDDPEMNIDGVT-----SHASGYMTDILTD 177

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA------PQYA---NMFFNVTSHHTPA 130
            +V F+    +   ++P +  ++  A H PE  A      P  A   +++       +P 
Sbjct: 178 QAVSFI----EEIDERPFLAFIAHKASH-PEIHANYLRTFPPAAGDEDLYTQEPLQRSPN 232

Query: 131 YDY--APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
           +    A  P  +  +       P      D ++  RL+ L +VD  + +++  L+    L
Sbjct: 233 WQASTAGKPALERPVDYNDPRSP-EGGLPDDIILNRLRMLSAVDRGLGRLIDVLETKKLL 291

Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
           ++T ++ +SD G+  G+FGL + +   +E  I +P ++R P +VP    L
Sbjct: 292 DDTVVVLSSDQGFFYGEFGLAQERRLAYEASIHIPLIIRYPQMVPAGAEL 341


>gi|407719814|ref|YP_006839476.1| choline-sulfatase [Sinorhizobium meliloti Rm41]
 gi|407318046|emb|CCM66650.1| Choline-sulfatase [Sinorhizobium meliloti Rm41]
          Length = 512

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/165 (27%), Positives = 80/165 (48%), Gaps = 9/165 (5%)

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
            +AN  +  L +     S++P  L +SF  PH P  +  ++ +++     H TP     P
Sbjct: 170 FLANQKLYQLSRENDDESRRPWCLTVSFTHPHDPYVARRKFWDLYEGC-EHLTPEVGAIP 228

Query: 136 ----NPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELN 189
               +P  Q I+ ++   Q       ++  ++R     +  +D+ V +++  L     L+
Sbjct: 229 LDKQDPHSQRIM-LSCDYQNFDVTEENVRRSRRAYFANISYLDEKVGELIDTLTRTRMLD 287

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +T I++ SDHG  LG+ GL    +F FE   RVP ++ GPGI PG
Sbjct: 288 DTLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMLAGPGIAPG 331


>gi|89069432|ref|ZP_01156787.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Oceanicola granulosus HTCC2516]
 gi|89045062|gb|EAR51140.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Oceanicola granulosus HTCC2516]
          Length = 517

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 33/75 (44%), Positives = 48/75 (64%), Gaps = 2/75 (2%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M++ L+T+Q++DD V +++  L  L    NT +IYTSD G+ LG+ G    K F +E  +
Sbjct: 288 MSRYLRTVQAIDDGVGRMLDRLDALDLAENTLVIYTSDQGFFLGEHGWFD-KRFMYEESL 346

Query: 221 RVPFLMRGP-GIVPG 234
           R+PFL R P GI PG
Sbjct: 347 RMPFLARFPAGIAPG 361


>gi|298385837|ref|ZP_06995394.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Bacteroides sp. 1_1_14]
 gi|298261065|gb|EFI03932.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Bacteroides sp. 1_1_14]
          Length = 527

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 55/233 (23%), Positives = 100/233 (42%), Gaps = 51/233 (21%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
           DL+   ++R++   K+    +P ++   F A H P D   +  +++  VT        D+
Sbjct: 186 DLVTEKAIRWM---KEQDGNQPFLMCCHFKATHEPYDYPIRMEHLYDGVTFPEPENLLDW 242

Query: 134 AP------------------------NPDKQWI-----------LQVTRKMQPVHRQFTD 158
            P                        +PDK W            +Q T   +  +++F  
Sbjct: 243 GPETNGRSFKGQTLEELERRWRIASQDPDKWWCRYPGLPFSTEGMQRTAARRASYQKF-- 300

Query: 159 LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEF 218
             +   L+   +VDD + K++  L ++   +NT +IY SD GY LG+ G    + F +E 
Sbjct: 301 --IRDYLRCGATVDDNIGKLLNALDEMNIADNTIVIYVSDQGYFLGEHGFFDKRMF-YEE 357

Query: 219 DIRVPFLMRGPGIVPGTMYLLDVLI-----PQVRKFSSGSLIFIM--SFLINL 264
             R+PF++R P  VP    L D+++     P + +F+   +  +   SF+ NL
Sbjct: 358 SARMPFVIRYPKKVPAGKRLDDLILNVDFAPTLAEFAGVKMENVQGDSFVSNL 410


>gi|365134097|ref|ZP_09343135.1| hypothetical protein HMPREF1032_00931 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363614613|gb|EHL66099.1| hypothetical protein HMPREF1032_00931 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 477

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 49/183 (26%), Positives = 85/183 (46%), Gaps = 27/183 (14%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP------EDSAPQYANMFFNVTS 125
           Y  D I +D+V FL   K++  +KP  L +++ APH P      E+    Y N  F    
Sbjct: 152 YVTDAITDDAVEFLNSHKEA--EKPFYLHVAYTAPHAPWLNNHPEEYTKLYENCLFESVP 209

Query: 126 HHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
           H     D          + +T ++    R      +      + ++D  +  I++ L +L
Sbjct: 210 HLPRHLD---------SIYLTDEVAKDERAN----LIGYFAAVTAMDVGIGHILETLDEL 256

Query: 186 GELNNTYIIYTSDHGYHLGQFGL-VKGK-SFP---FEFDIRVPFLMRGPGIVPGTMYLLD 240
           G   +T +++TSD+G+  G  G   KG  +FP   +E  ++VPF+ R PG +P  + + D
Sbjct: 257 GLAEDTLVVFTSDNGFSCGHHGFWGKGNGTFPLNMYEESVKVPFIARHPGRIPAGV-VCD 315

Query: 241 VLI 243
            L+
Sbjct: 316 SLV 318


>gi|150395788|ref|YP_001326255.1| sulfatase [Sinorhizobium medicae WSM419]
 gi|150027303|gb|ABR59420.1| sulfatase [Sinorhizobium medicae WSM419]
          Length = 512

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 47/165 (28%), Positives = 81/165 (49%), Gaps = 9/165 (5%)

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
            +AN  +  L +     S++P  L +SF  PH P  +  ++ +++ +   H TP  +  P
Sbjct: 170 FLANQKLYQLSRENDDESRRPWCLTVSFTHPHDPYVARRKFWDLYEDC-EHLTPEVEAIP 228

Query: 136 NPDKQWILQVTRKMQPVHRQF----TDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELN 189
             DKQ        +   +R F     ++  ++R     +  +D+ V +++  L     L+
Sbjct: 229 -LDKQDSHSQRIMLSCDYRNFDVTEENVRRSRRAYFANISYLDEKVGELIDTLTRTRMLD 287

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +T I++ SDHG  LG+ GL    +F FE   RVP ++ GPGI PG
Sbjct: 288 DTLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGPGIAPG 331


>gi|393776751|ref|ZP_10365045.1| putative sulfatase [Ralstonia sp. PBA]
 gi|392716108|gb|EIZ03688.1| putative sulfatase [Ralstonia sp. PBA]
          Length = 485

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 51/175 (29%), Positives = 84/175 (48%), Gaps = 9/175 (5%)

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
           GE  Y  +Y   IA  +  +LR+   +   KP +L +SF APH P  + P++   +++  
Sbjct: 154 GESQYT-FYDREIAARAQVWLREQATAEHDKPWVLFVSFVAPHFPLTAPPEHFYKYYDQD 212

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
                 Y +   P   +  QV  +    +  + D +  K+           +D+ + K++
Sbjct: 213 LPLPKLYAHDQRPTHPY--QVDYRKSFNYDDYFDAVKLKKALAGYYGLCSFLDENIGKVL 270

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           + L D G  ++T +IYTSDHG +LG  G+  GKS  FE    VP +M GP +  G
Sbjct: 271 QALDDAGLTDDTRVIYTSDHGDNLGSRGMW-GKSTLFEEAAGVPLIMAGPDVPAG 324


>gi|296120919|ref|YP_003628697.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296013259|gb|ADG66498.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 535

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 50/81 (61%), Gaps = 5/81 (6%)

Query: 155 QFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVK 210
           Q TD +  + LQ    ++  +D  V +++  LK+ G  +NT +++TSDHGYHLG+ GL +
Sbjct: 283 QLTDPMRQEILQAYFASISFMDAQVGRVLASLKENGLADNTIVVFTSDHGYHLGEHGLWQ 342

Query: 211 GKSFPFEFDIRVPFLMRGPGI 231
            +S  FE   RVP L+  PGI
Sbjct: 343 KQSL-FEESARVPLLISAPGI 362


>gi|427825191|ref|ZP_18992253.1| putative sulfatase [Bordetella bronchiseptica Bbr77]
 gi|410590456|emb|CCN05545.1| putative sulfatase [Bordetella bronchiseptica Bbr77]
          Length = 485

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/176 (28%), Positives = 86/176 (48%), Gaps = 9/176 (5%)

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
           GE  Y  Y  ++++   + +LR+     + KP +L +SF +PH P  + P++   ++N  
Sbjct: 154 GESQYTFYDREIVSRAQI-WLREQAPRHADKPWVLFVSFVSPHFPLTAPPEHYYRYYNRD 212

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
                 YD +  PD  +  Q   +    +  + D  + K+ Q         +D+ + K++
Sbjct: 213 LPLPKLYDKSQRPDHPY--QQDYRGSFNYDDYFDPGLVKKAQAGYYGLCSFLDENIGKLL 270

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
             L DL  L++T ++YTSDHG +LG  G+  GKS  FE    VP ++ G  I  G 
Sbjct: 271 GTLDDLDILDSTRVVYTSDHGDNLGARGMW-GKSNMFEEAAAVPLIIAGKDIPSGV 325


>gi|410421771|ref|YP_006902220.1| sulfatase [Bordetella bronchiseptica MO149]
 gi|408449066|emb|CCJ60752.1| putative sulfatase [Bordetella bronchiseptica MO149]
          Length = 485

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/176 (28%), Positives = 86/176 (48%), Gaps = 9/176 (5%)

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
           GE  Y  Y  ++++   + +LR+     + KP +L +SF +PH P  + P++   ++N  
Sbjct: 154 GESQYTFYDREIVSRAQI-WLREQAPRHADKPWVLFVSFVSPHFPLTAPPEHYYRYYNRD 212

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
                 YD +  PD  +  Q   +    +  + D  + K+ Q         +D+ + K++
Sbjct: 213 LPLPKLYDKSQRPDHPY--QQDYRGSFNYDDYFDPGLVKKAQAGYYGLCSFLDENIGKLL 270

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
             L DL  L++T ++YTSDHG +LG  G+  GKS  FE    VP ++ G  I  G 
Sbjct: 271 GTLDDLDILDSTRVVYTSDHGDNLGARGMW-GKSNMFEEAAAVPLIIAGKDIPSGV 325


>gi|427819811|ref|ZP_18986874.1| putative sulfatase [Bordetella bronchiseptica D445]
 gi|410570811|emb|CCN19009.1| putative sulfatase [Bordetella bronchiseptica D445]
          Length = 485

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/176 (28%), Positives = 86/176 (48%), Gaps = 9/176 (5%)

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
           GE  Y  Y  ++++   + +LR+     + KP +L +SF +PH P  + P++   ++N  
Sbjct: 154 GESQYTFYDREIVSRAQI-WLREQAPRHADKPWVLFVSFVSPHFPLTAPPEHYYRYYNRD 212

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
                 YD +  PD  +  Q   +    +  + D  + K+ Q         +D+ + K++
Sbjct: 213 LPLPKLYDKSQRPDHPY--QQDYRGSFNYDDYFDPGLVKKAQAGYYGLCSFLDENIGKLL 270

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
             L DL  L++T ++YTSDHG +LG  G+  GKS  FE    VP ++ G  I  G 
Sbjct: 271 GTLDDLDILDSTRVVYTSDHGDNLGARGMW-GKSNMFEEAAAVPLIIAGKDIPSGV 325


>gi|149178047|ref|ZP_01856643.1| sulfatase [Planctomyces maris DSM 8797]
 gi|148843104|gb|EDL57471.1| sulfatase [Planctomyces maris DSM 8797]
          Length = 406

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 79/158 (50%), Gaps = 13/158 (8%)

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
           ++ ++ F+ Q  Q    +P +L ++   PH P     +YA MF +  +   P ++ +   
Sbjct: 112 SDRAIEFIEQDHQ----QPWLLNINVYDPHPPFTPPEKYAKMF-DPAAMPGPHFEESDKA 166

Query: 138 DKQWILQVTRKMQPVHRQFTDLLMTKRLQTL-----QSVDDAVEKIVKELKDLGELNNTY 192
            ++ + +V    QPV    T++   K++Q L       +DD   +I+  L   G+ +NT 
Sbjct: 167 TQELLSRV--DFQPV-TMSTEISELKKVQALYYAMIAQIDDQFARILSVLDSTGQRDNTV 223

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           II+TSDHG  LG  GLV+     +E  IRVP +   PG
Sbjct: 224 IIFTSDHGETLGDHGLVQKGCRFYEGLIRVPLIFSWPG 261


>gi|423226172|ref|ZP_17212638.1| hypothetical protein HMPREF1062_04824 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392630283|gb|EIY24277.1| hypothetical protein HMPREF1062_04824 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 491

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 54/224 (24%), Positives = 102/224 (45%), Gaps = 42/224 (18%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ +++A++++ ++ ++K S  K+P+ +  +F APH P  +  +Y +M+     +V  + 
Sbjct: 196 HWSEILADETITYMEENKDS--KEPLFMFCAFNAPHDPRQAPKEYVDMYDVDKISVPQNF 253

Query: 128 TPAY---------------DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVD 172
            P +                 AP P   + +Q        HRQ    L+T        +D
Sbjct: 254 LPVHPLGEQMKSGKNLRDEQLAPFPRTHYSVQK-------HRQEYYALITH-------MD 299

Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
             V +I++ LK  G   NT I++ SD+G  +G  GL+ GK   ++   +VP    G G+ 
Sbjct: 300 AQVGRIIEALKRNGLDKNTLIVFASDNGLSVGHHGLI-GKQSMYDHSAKVPLAFCGLGLP 358

Query: 233 PGT-----MYLLDVLIPQVRKFSSGSLIFIMSFLINLNTMRAEK 271
            G      +YL D L+P V           + F+  +N +++ K
Sbjct: 359 TGEVRNQLIYLQD-LVPTVYDLVGIKKPNHLEFISQINALKSSK 401


>gi|33598356|ref|NP_885999.1| sulfatase [Bordetella parapertussis 12822]
 gi|33603289|ref|NP_890849.1| sulfatase [Bordetella bronchiseptica RB50]
 gi|412341382|ref|YP_006970137.1| sulfatase [Bordetella bronchiseptica 253]
 gi|427816299|ref|ZP_18983363.1| putative sulfatase [Bordetella bronchiseptica 1289]
 gi|33566914|emb|CAE39130.1| putative sulfatase [Bordetella parapertussis]
 gi|33577413|emb|CAE34678.1| putative sulfatase [Bordetella bronchiseptica RB50]
 gi|408771216|emb|CCJ56016.1| putative sulfatase [Bordetella bronchiseptica 253]
 gi|410567299|emb|CCN24870.1| putative sulfatase [Bordetella bronchiseptica 1289]
          Length = 485

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/176 (28%), Positives = 86/176 (48%), Gaps = 9/176 (5%)

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
           GE  Y  Y  ++++   + +LR+     + KP +L +SF +PH P  + P++   ++N  
Sbjct: 154 GESQYTFYDREIVSRAQI-WLREQAPRHADKPWVLFVSFVSPHFPLTAPPEHYYRYYNRD 212

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
                 YD +  PD  +  Q   +    +  + D  + K+ Q         +D+ + K++
Sbjct: 213 LPLPKLYDRSQRPDHPY--QQDYRGSFNYDDYFDPGLVKKAQAGYYGLCSFLDENIGKLL 270

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
             L DL  L++T ++YTSDHG +LG  G+  GKS  FE    VP ++ G  I  G 
Sbjct: 271 GTLDDLDILDSTRVVYTSDHGDNLGARGMW-GKSNMFEEAAAVPLIIAGRDIPSGV 325


>gi|227821277|ref|YP_002825247.1| choline-sulfatase [Sinorhizobium fredii NGR234]
 gi|227340276|gb|ACP24494.1| choline-sulfatase [Sinorhizobium fredii NGR234]
          Length = 512

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/165 (27%), Positives = 78/165 (47%), Gaps = 9/165 (5%)

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
            +AN  +  L +     +++P  L +SF  PH P  +  ++ +++ N   H  P     P
Sbjct: 170 FLANQKLYQLARDNDDEARRPWCLTVSFTHPHDPYVARRKFWDLYENC-EHLRPEVGALP 228

Query: 136 ----NPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELN 189
               +P  Q I+      +       ++  ++R     +  +D+ V ++V  L     L+
Sbjct: 229 FDSQDPHSQRIMHSC-DYKNFELTEVNIRQSRRAYFANISYLDEKVGELVDTLTRTRMLD 287

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           NT I++ SDHG  LG+ GL    +F FE   RVP ++ GPG+ PG
Sbjct: 288 NTLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGPGVAPG 331


>gi|262408667|ref|ZP_06085213.1| sulfatase [Bacteroides sp. 2_1_22]
 gi|294646699|ref|ZP_06724324.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294807128|ref|ZP_06765946.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
 gi|345511274|ref|ZP_08790819.1| sulfatase [Bacteroides sp. D1]
 gi|229444367|gb|EEO50158.1| sulfatase [Bacteroides sp. D1]
 gi|262353532|gb|EEZ02626.1| sulfatase [Bacteroides sp. 2_1_22]
 gi|292637987|gb|EFF56380.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294445694|gb|EFG14343.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
          Length = 464

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 60/230 (26%), Positives = 104/230 (45%), Gaps = 36/230 (15%)

Query: 19  HGYFGKYLNKYN-GSYIP-PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           +GY+  ++ KY  G+Y+P   +  W  L     YY    + N R +             L
Sbjct: 115 NGYYVGFIGKYGVGNYLPQKEFDYWRGLAGQGTYYQKDKDGNPRHL-----------TGL 163

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP---------QYANMFFN----V 123
           I+     FL    ++   KP  L +SF +PH   +  P          Y + FF+     
Sbjct: 164 ISEQIDEFLANRDET---KPFCLSVSFKSPHTESEQDPFPFDKKYSFMYEDEFFDKPETF 220

Query: 124 TSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIV 179
             ++   +  +   D +W  +   + +  +R  TD      ++     +  +D+A+ K+ 
Sbjct: 221 GENYYRLFPESFRKDGKWENEGYVRFK--NRYGTDEKYQSSVKGYYRLIAGIDEAIGKLR 278

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
           K LK++G   NT II+TSD+GY+LG+ GL +GK +  E  IR+P ++  P
Sbjct: 279 KRLKEMGLDKNTLIIFTSDNGYYLGEHGL-EGKWYGHEESIRLPLVIYDP 327


>gi|410474393|ref|YP_006897674.1| sulfatase [Bordetella parapertussis Bpp5]
 gi|408444503|emb|CCJ51254.1| putative sulfatase [Bordetella parapertussis Bpp5]
          Length = 485

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/176 (28%), Positives = 86/176 (48%), Gaps = 9/176 (5%)

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
           GE  Y  Y  ++++   + +LR+     + KP +L +SF +PH P  + P++   ++N  
Sbjct: 154 GESQYTFYDREIVSRAQI-WLREQAPRHADKPWVLFVSFVSPHFPLTAPPEHYYRYYNRD 212

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
                 YD +  PD  +  Q   +    +  + D  + K+ Q         +D+ + K++
Sbjct: 213 LPLPKLYDRSQRPDHPY--QQDYRGSFNYDDYFDPGLVKKAQAGYYGLCSFLDENIGKLL 270

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
             L DL  L++T ++YTSDHG +LG  G+  GKS  FE    VP ++ G  I  G 
Sbjct: 271 GTLDDLDILDSTRVVYTSDHGDNLGARGMW-GKSNMFEEAAAVPLIIAGRDIPSGV 325


>gi|378825232|ref|YP_005187964.1| choline sulfatase [Sinorhizobium fredii HH103]
 gi|365178284|emb|CCE95139.1| choline sulfatase [Sinorhizobium fredii HH103]
          Length = 510

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/165 (27%), Positives = 78/165 (47%), Gaps = 9/165 (5%)

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
            +AN  +  L +      ++P  L +SF  PH P  +  ++ +++ N   H  P  D  P
Sbjct: 168 FLANQKLYQLARENDDEGRRPWCLTVSFTHPHDPYVARRKFWDLYENC-EHLLPEVDALP 226

Query: 136 ----NPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELN 189
               +P  + I+      +       ++  ++R     +  +D+ V ++V  L     L+
Sbjct: 227 FDRQDPHSRRIMHSC-DYKNFELTEENIRRSRRAYFANISYLDEKVGELVDTLTRTRMLD 285

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           NT I++ SDHG  LG+ GL    +F FE   RVP ++ GPG+ PG
Sbjct: 286 NTLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGPGVAPG 329


>gi|196230236|ref|ZP_03129099.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196225833|gb|EDY20340.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 489

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 55/237 (23%), Positives = 104/237 (43%), Gaps = 35/237 (14%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
            + GK+    +      G+  W        Y++ + N+NG   K      + Y  D++ +
Sbjct: 122 AFLGKWHMGEDSDNPRKGFDYWAGFRGQGHYFDDTYNINGEHKK-----IDGYSSDVLTD 176

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
            ++ +L+        KP    + + APH P + AP+    +        P  +   N ++
Sbjct: 177 LALDWLKHR----GDKPFFCELCYKAPHYPFEPAPRNKGRYEKAP---IPYPETMANTEE 229

Query: 140 QWILQ---VTRKMQPVHRQFTDLLMTKRL----------------QTLQSVDDAVEKIVK 180
            ++ Q   V  +   +H    D + T R                 +T+ S+D+ + +++K
Sbjct: 230 NYLTQPRWVRERRFGIHG--VDHMETGRFDHDPVPSFEDLYHRYSETVFSMDENIGRLLK 287

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGTM 236
            L + G  ++T ++Y +D+G+ LG+ G    K   FE  +RVP L+R PG V PGT+
Sbjct: 288 YLDNTGLRDSTIVVYMADNGFELGEHGFYD-KRDAFETSMRVPMLLRAPGAVKPGTV 343


>gi|299145033|ref|ZP_07038101.1| putative sulfatase [Bacteroides sp. 3_1_23]
 gi|298515524|gb|EFI39405.1| putative sulfatase [Bacteroides sp. 3_1_23]
          Length = 517

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 78/162 (48%), Gaps = 7/162 (4%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
            + +P  I N+++ +++Q K++    P  + +SFP PH P      Y +MF    +    
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231

Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
           T   D A   +K  IL Q      P   Q    +    +  ++ +DD ++++++ LK  G
Sbjct: 232 TSRKDLAKKGEKYRILAQQEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 291

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           +  NT  +  SDHG + G++GL++  +   E   R+P +  G
Sbjct: 292 QYENTLFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333


>gi|452126972|ref|ZP_21939555.1| sulfatase [Bordetella holmesii F627]
 gi|452130345|ref|ZP_21942917.1| sulfatase [Bordetella holmesii H558]
 gi|451920270|gb|EMD70416.1| sulfatase [Bordetella holmesii H558]
 gi|451922067|gb|EMD72212.1| sulfatase [Bordetella holmesii F627]
          Length = 487

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 55/181 (30%), Positives = 80/181 (44%), Gaps = 12/181 (6%)

Query: 59  GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYAN 118
            R I  GE  Y  Y  D I   +V +L    +    KP +L  SF +PH P  + P++  
Sbjct: 150 ARSIGPGESSYIRYDRD-ITEKTVNWLNTQGRKSHDKPWVLYSSFISPHSPLIAPPEFYA 208

Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTD-----LLMTKRLQTLQSVDD 173
           M+ +  S   P  D   +P   WI Q   K       F       + +         +D 
Sbjct: 209 MY-DARSIELPRKDVKYHP---WI-QAWNKCYGFDSHFGSDDQRRIAIASYFGLCSFLDS 263

Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            V KI+  L++ G  + T +I+ SDHG +LG   L  GKS  +E    VP ++ GPG+ P
Sbjct: 264 NVGKILYALEETGLAHTTRVIFLSDHGDNLGARALW-GKSTMYEESAGVPMILGGPGVAP 322

Query: 234 G 234
           G
Sbjct: 323 G 323


>gi|430747567|ref|YP_007206696.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430019287|gb|AGA31001.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 541

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 58/261 (22%), Positives = 111/261 (42%), Gaps = 53/261 (20%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GY    + K++    P G+  W  L     YYN  +  NG ++KH       Y  D+I +
Sbjct: 129 GYQTAVVGKWHLVTEPTGFDYWHILPGQGVYYNPPMIDNGTQVKH-----EGYATDIITD 183

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGP------------EDSAPQYANMFFNVTSHH 127
            ++ +L++  +S   KP +++    APH              + + P+ + +F + +   
Sbjct: 184 LTIDWLKKRDKS---KPFLMMSQNKAPHREWAPALRHLGHDNDRAYPEPSTLFDDYSGRG 240

Query: 128 TPAYDYAPNPDKQWI-----LQVTRKMQPVHRQFTDLLMTKR------------------ 164
              +D      K        L+   ++ P  R+  D     R                  
Sbjct: 241 KAEHDQDMTIAKTMTARDLKLEAPPQLTPDQRKVWDAYYEPRNQAFAKADLKGDDLVRWK 300

Query: 165 --------LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
                   L  +++VD+ V +++K L++ G  +NT ++Y +D G++LG+ G    K + F
Sbjct: 301 YNRYMHDYLACIKAVDENVGRLLKYLEEEGLADNTIVVYAADQGFYLGEHGWFD-KRWIF 359

Query: 217 EFDIRVPFLMRGPGIV-PGTM 236
           E  +R P ++R PG+  PG++
Sbjct: 360 EESLRTPLVVRWPGVTKPGSV 380


>gi|340619559|ref|YP_004738012.1| mucin-desulfating sulfatase [Zobellia galactanivorans]
 gi|339734356|emb|CAZ97733.1| Mucin-desulfating sulfatase, family S1-11 [Zobellia
           galactanivorans]
          Length = 495

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 59/239 (24%), Positives = 102/239 (42%), Gaps = 47/239 (19%)

Query: 37  GWREWGALIMNSKYYNYS-INMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKK 95
           G+  W  L    KYYN   I   G       +Y  +Y  D+I + ++ +L   +     K
Sbjct: 135 GFDYWEVLPGQGKYYNPDFITEEG-----DTNYKGEYVTDVITDRALNWLENERDQ--DK 187

Query: 96  PIMLVMSFPAPHGPEDSAPQYANMFFNV----------------TSHHTPAYDYAPNPDK 139
           P M+++   APH   D A +Y + + +V                T+ H    D A + + 
Sbjct: 188 PFMMMVHHKAPHRNWDPAERYMSKYEDVEIPEPDNFFDDYATRTTAAHKQEMDIATSMNT 247

Query: 140 QWILQVTR---------------------KMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
              L+                        K + + +      M   L+ + SVD++V +I
Sbjct: 248 HIDLKAEGDRYANDPRYKERYAKFAAAKLKGEALVKWKYQTYMKDYLRCIWSVDESVGQI 307

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGTM 236
           ++ LK+ G   NT +IY+SD G+++G+ G    K F +E   R P +++ PG V PG++
Sbjct: 308 MQSLKEQGLDENTIVIYSSDQGFYMGEHGWF-DKRFMYEESFRTPLIVKWPGTVKPGSV 365


>gi|397566700|gb|EJK45164.1| hypothetical protein THAOC_36235 [Thalassiosira oceanica]
          Length = 475

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 53/198 (26%), Positives = 86/198 (43%), Gaps = 24/198 (12%)

Query: 53  YSINMNGRKIKHGED---YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
           Y   M G   + G D   +    YPD    D    L + K S SK P  + ++FP PH P
Sbjct: 147 YIACMTGEGPEGGCDATSFPESIYPDDFVRDRAIDLLKRKPSDSK-PFFMQVNFPGPHHP 205

Query: 110 EDSAPQYANMFFNVTSHHTP------AYDYAPNPD-----KQWILQVTR--KMQPVHRQF 156
                  ++M  +V     P        ++AP+ +     +QW L   +  ++ P   + 
Sbjct: 206 ---VVVTSSMARSVMDRLWPPPVNPQQVEFAPSGEAMTCQQQWKLSGKKHVRVDPAKMKL 262

Query: 157 TDLLMTKRLQ---TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
                  R      ++ +D  +  IV++LK  GE+ NT ++   DHG +LG  G   GK 
Sbjct: 263 VHPQKEGRCNYGAEIEHIDRLMSSIVEQLKATGEIANTVVVVAGDHGENLGDNG-TTGKG 321

Query: 214 FPFEFDIRVPFLMRGPGI 231
            P++  + VP L+ GP +
Sbjct: 322 IPWQASVSVPLLISGPSV 339


>gi|410096139|ref|ZP_11291129.1| hypothetical protein HMPREF1076_00307 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409227210|gb|EKN20111.1| hypothetical protein HMPREF1076_00307 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 501

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 81/193 (41%), Gaps = 33/193 (17%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV----------T 124
           D     ++ F+ + K+     P   ++ FP PH      P Y  M+  +           
Sbjct: 198 DYFTTQAIEFINRHKEG----PFCCMLCFPDPHSANLVRPPYDTMYSRIRFKEPATALTD 253

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
           +   PA+ +  N          +  QP         M +    ++ +DD V +I+  LKD
Sbjct: 254 TTGRPAWSHGNN---------LKADQPED-------MAQYFGMIKCIDDNVGRILDALKD 297

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIP 244
            G L+NT II+TSDHG    Q GL+  K  P +   RVPF++  P  +   + + +V+  
Sbjct: 298 AGILDNTIIIFTSDHGDMCRQHGLI-NKGVPLDDSSRVPFIISYPAGMRQGIRIENVM-- 354

Query: 245 QVRKFSSGSLIFI 257
            V  F+   L F 
Sbjct: 355 SVTDFTPSLLSFC 367


>gi|336426381|ref|ZP_08606391.1| hypothetical protein HMPREF0994_02397 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336010816|gb|EGN40796.1| hypothetical protein HMPREF0994_02397 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 454

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 47/167 (28%), Positives = 79/167 (47%), Gaps = 13/167 (7%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
           Y  D +A++++RF+++ K     +P + ++S P PH P      Y  MF  +       +
Sbjct: 153 YTTDWLADETLRFIKEQKDG---RPFLFMVSMPDPHQPFRVRKPYDTMFDPLQMEIPETF 209

Query: 132 DYAPNPDKQWILQVT---RKMQPV----HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
                PD  W  Q     R   P            L  + L  ++ +DD V KI+  L++
Sbjct: 210 YEEELPD--WAEQDEWGRRHYFPTGLFEREDHFRRLKAQYLGAVKCIDDNVGKILDGLRE 267

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
              L  T +I+T+DHG +LG+ GL++ K+  +E    +P L+R PG 
Sbjct: 268 ANLLEETVVIFTTDHGEYLGEHGLME-KNNLYESVYHIPMLLRLPGC 313


>gi|440714555|ref|ZP_20895134.1| choline-sulfatase [Rhodopirellula baltica SWK14]
 gi|436440751|gb|ELP34055.1| choline-sulfatase [Rhodopirellula baltica SWK14]
          Length = 517

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/168 (26%), Positives = 80/168 (47%), Gaps = 11/168 (6%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
           +  D + + +  F+R+     S++P    +S P PHGP      Y  MF N+     P  
Sbjct: 221 FSTDWLCDRAADFIRE----HSQEPFCYHLSLPDPHGPNTVRQPYDTMFENMPVR--PPM 274

Query: 132 DYAPNPDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
            +  + D+  W+    R  Q   ++F   LMT+    ++ +DD V  ++  L +L     
Sbjct: 275 TFQLDGDQPGWLPATNRNSQ---QRFNARLMTQYFGMVRCIDDNVGMLLSLLDELSLTKR 331

Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
           T +++TSDHG    + G +  K  P+E   +VP ++  PG+VP  + +
Sbjct: 332 TVVVFTSDHGDLCYEHGRLN-KGNPYEGSAKVPMIITAPGLVPAGLRI 378


>gi|343085359|ref|YP_004774654.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342353893|gb|AEL26423.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 504

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 50/180 (27%), Positives = 84/180 (46%), Gaps = 14/180 (7%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ +++ ND+  FL  +  S S  P  + ++F A H P  S   + +++      +  + 
Sbjct: 213 HWSEVLRNDAHAFLDSA--SKSNNPFFMYLAFNAVHDPRQSPGSFLDLYAVEDIKIPKNF 270

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIVKELK 183
            P Y Y         L+    + P  R  T   + K LQ     +  +D  + +I  EL+
Sbjct: 271 LPVYPYKDEIGNGPALR-DEALAPYPR--TPYSIKKHLQEYYAIISHMDHQIGEIFNELE 327

Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
             G   NTYII+T+DHG  +G+ GL+ GK   F+  +  PF++ GP I  G     D+ I
Sbjct: 328 RKGLRENTYIIFTADHGLAMGRHGLM-GKQTMFDHSMAAPFIISGPNIPKGQRLDQDIYI 386


>gi|54301819|ref|YP_131812.1| sulfatase [Photobacterium profundum SS9]
 gi|46915239|emb|CAG22012.1| putative sulfatase [Photobacterium profundum SS9]
          Length = 520

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 46/166 (27%), Positives = 74/166 (44%), Gaps = 16/166 (9%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGPEDSAPQ-YANMFFNVTSH 126
           N + P+  A+ ++++LR     +  S KP  LV+S   PH P D  PQ Y + F   TS 
Sbjct: 223 NQWSPEHEADIAIKYLRNEDGKYRDSNKPFTLVVSMNPPHSPYDQVPQKYLDKFDGQTSE 282

Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
                     P+ QW         P +         + +  +  VD    +I+ EL  LG
Sbjct: 283 SLNT-----RPNVQWDKDYQEGYGPAY-------FKEYMAMVHGVDAQFGRIIDELDRLG 330

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
              +T +++ SDHG  +G  G    K+  +E  +R+P + R PG +
Sbjct: 331 LEEDTLVVFFSDHGCCMGSNGKPT-KNVHYEEAMRIPMMFRWPGKI 375


>gi|417302547|ref|ZP_12089645.1| sulfatase [Rhodopirellula baltica WH47]
 gi|327541105|gb|EGF27651.1| sulfatase [Rhodopirellula baltica WH47]
          Length = 517

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/168 (26%), Positives = 80/168 (47%), Gaps = 11/168 (6%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
           +  D + + +  F+R+     S++P    +S P PHGP      Y  MF N+     P  
Sbjct: 221 FSTDWLCDRAADFIRE----HSQEPFCYHLSLPDPHGPNTVRQPYDTMFENMPVR--PPM 274

Query: 132 DYAPNPDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
            +  + D+  W+    R  Q   ++F   LMT+    ++ +DD V  ++  L +L     
Sbjct: 275 TFQLDGDQPGWLPATNRNSQ---QRFNARLMTQYFGMVRCIDDNVGMLLSLLDELSLTKR 331

Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
           T +++TSDHG    + G +  K  P+E   +VP ++  PG+VP  + +
Sbjct: 332 TVVVFTSDHGDLCYEHGRLN-KGNPYEGSAKVPMIITAPGLVPAGLRI 378


>gi|257069392|ref|YP_003155647.1| choline-sulfatase [Brachybacterium faecium DSM 4810]
 gi|256560210|gb|ACU86057.1| choline-sulfatase [Brachybacterium faecium DSM 4810]
          Length = 520

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 53/192 (27%), Positives = 88/192 (45%), Gaps = 20/192 (10%)

Query: 79  NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
           ND VR     + +  + P ++V SF  PH P +   ++ + F +V     P +   P+P 
Sbjct: 176 NDRVR---ADQAAGERVPFLMVASFIHPHDPYEPPQEHWDRFADVDIP-APRHPEVPDPA 231

Query: 139 KQWILQVTRKMQPVHRQFTDLLMTKRLQ-----TLQSVDDAVEKIVKELKDLGELNNTYI 193
           +       R M    ++ T     +R +      +  +DD V +I + L+ LG   +T +
Sbjct: 232 QDPHSHRLRAMSGFDQRETTEEEVRRARRSYYAAVSYIDDHVGRIRERLESLGLWEDTVV 291

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQVRKFSSGS 253
           + TSDHG  LG+ GL   K  P+E   RVP ++ GP          + L+P  R  +  S
Sbjct: 292 VVTSDHGDMLGEKGLWF-KMSPYEESSRVPLILHGP----------EHLVPAGRYANPVS 340

Query: 254 LIFIMSFLINLN 265
           L+ +M  L+ L 
Sbjct: 341 LLDLMPTLLELG 352


>gi|429766630|ref|ZP_19298886.1| arylsulfatase [Clostridium celatum DSM 1785]
 gi|429183835|gb|EKY24874.1| arylsulfatase [Clostridium celatum DSM 1785]
          Length = 483

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/175 (23%), Positives = 84/175 (48%), Gaps = 11/175 (6%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           +  +S+ FLR+       KP  L+ SF  PH P D+   Y +++ N         D+   
Sbjct: 185 VVTESIDFLRRRDPC---KPFFLMTSFVRPHAPFDAPKYYFDLYKNKDLREPVKGDWN-- 239

Query: 137 PDKQWILQVTRKMQ----PVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
            D++ I ++  +      P+ ++    +       +  +D  + ++++ L +   ++NT 
Sbjct: 240 -DEELIKKIGYRYNATTGPLDKELVKEMQIGYYACITHIDHQIGRLIQALVEHQVMDNTI 298

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQVR 247
           II+TSDHG  LG    ++ K+ P++  I++P  + G   + G  Y +D  + ++R
Sbjct: 299 IIFTSDHGEMLGDHNYIQ-KARPYQGSIKIPMFISGNEELLGRKYGVDESLVELR 352


>gi|254461230|ref|ZP_05074646.1| putative sulfatase YidJ [Rhodobacterales bacterium HTCC2083]
 gi|206677819|gb|EDZ42306.1| putative sulfatase YidJ [Rhodobacteraceae bacterium HTCC2083]
          Length = 520

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 52/184 (28%), Positives = 82/184 (44%), Gaps = 26/184 (14%)

Query: 70  NDYYPDL-IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVT 124
            D YP   +A+ ++ +L + K      P    +SFP PH P +   +Y +M+    F+V 
Sbjct: 211 EDKYPTAWVADKAISYLEERKAG--GDPFFAFVSFPDPHHPFNPPGKYWDMYDPNDFDVD 268

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKM-----QPVHRQFTDLLMTKRLQTLQS--------V 171
                 YD   NP     +Q   KM     +P   QF      + ++  ++        +
Sbjct: 269 L----PYDAHQNPTPP--MQHETKMWETGEEPAIPQFAFRASDQHVREAKALTAGMITMI 322

Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           DD + +++  LK  GE +NT II+TSDHG +LG F L+   + P     RVP +   P  
Sbjct: 323 DDHIGRVIDALKASGEYDNTVIIFTSDHGDYLGDFNLMLKGAIPLPSITRVPMIWSDPAT 382

Query: 232 VPGT 235
             G 
Sbjct: 383 RKGA 386


>gi|410644094|ref|ZP_11354578.1| extracellular sulfatase Sulf [Glaciecola agarilytica NO2]
 gi|410136393|dbj|GAC02977.1| extracellular sulfatase Sulf [Glaciecola agarilytica NO2]
          Length = 477

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 87/170 (51%), Gaps = 18/170 (10%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE-DSA------PQYANMFFNVT 124
           YY   +AN+ +R       S + KP +L ++F  PH  + D A      P++   + NV 
Sbjct: 168 YYIQNLANEFIR-----DASNTNKPWVLAVNFWNPHAHDTDKADQFHYPPEFEEYYQNVD 222

Query: 125 ---SHHTPAYDYAPNPD--KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
              + +   +D+   P+  K  + +V  + +  + +    ++ +  + + SVD AV  I 
Sbjct: 223 VPDAKNATKHDFEELPEFLKGSMARVRWEYRFANPKMYQRIIKRHYRAISSVDKAVGAIS 282

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
            +L+ LG  +NT IIYT D+GY L +  L  GK F +E D+RVP +++ P
Sbjct: 283 AQLEKLGVDDNTIIIYTGDNGYSLNERQLA-GKWFGWEEDLRVPLIIKDP 331


>gi|343085846|ref|YP_004775141.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354380|gb|AEL26910.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 529

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 62/271 (22%), Positives = 107/271 (39%), Gaps = 69/271 (25%)

Query: 18  SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           ++GY    + K +   +P G+     L     YYN    +NG K +        Y  D+I
Sbjct: 117 ANGYQTAMIGKIHMDGLPQGFDFSMVLPGQGHYYNPDFLINGEKKR-----IEGYCTDII 171

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT------------- 124
            + ++ +L+  ++    +P +L+    APH     A +Y  ++ +VT             
Sbjct: 172 TDAALNWLKNDRKE--DEPFLLLYHQKAPHRNWKPAQKYLTLYDDVTFDPPANFFDDYEG 229

Query: 125 -------------SHHTPAYDYA-----PNPDKQWILQVTRKMQPVHRQFTDLL------ 160
                         H    +D+       N D  W++   +++ P   Q  D L      
Sbjct: 230 RGTAAKTQEMEIDGHARWGHDFKFQFDPTNGDTTWLVNQLKRLNP--EQLADWLAAYEPK 287

Query: 161 ----------------------MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
                                 +   L+T++SVDD V +++  LK+ G   NT ++YTSD
Sbjct: 288 NKAMREANLSGKELALWKYNRYIKDYLRTIKSVDDGVGELLDYLKETGLDENTIVVYTSD 347

Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
            G++LG+ G    K F +E   R P LM+ P
Sbjct: 348 QGFYLGEHGWFD-KRFMYEESFRTPLLMKYP 377


>gi|304393879|ref|ZP_07375804.1| choline-sulfatase [Ahrensia sp. R2A130]
 gi|303294078|gb|EFL88453.1| choline-sulfatase [Ahrensia sp. R2A130]
          Length = 513

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 47/166 (28%), Positives = 78/166 (46%), Gaps = 32/166 (19%)

Query: 96  PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQ 155
           P  +V SF  PH P ++ P+    F+N+  H   A D   N      +       P+ ++
Sbjct: 184 PFAMVASFTHPHDPYEARPE----FWNLYDHD--AIDMPKN------VLAPEDQDPMSKR 231

Query: 156 FTDLLMTKRLQTLQS---------------VDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
             D +   +++  +                 D  V ++V+ L++ G+L++T +I T+DHG
Sbjct: 232 LMDGIEASKVEVTEEQVRNSRHAYFANISYFDSKVGELVQVLEESGQLDDTVVIITADHG 291

Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY----LLDVL 242
             LG+ GL    +F FE   RVP +M GPGI   T+     L+D+L
Sbjct: 292 DMLGERGLWYKMNF-FEHSARVPLIMAGPGIANTTVADPCSLVDIL 336


>gi|189459671|ref|ZP_03008456.1| hypothetical protein BACCOP_00299 [Bacteroides coprocola DSM 17136]
 gi|189433630|gb|EDV02615.1| arylsulfatase [Bacteroides coprocola DSM 17136]
          Length = 522

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/194 (21%), Positives = 81/194 (41%), Gaps = 40/194 (20%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--------------------------- 107
           D++   ++R+++   +S   +P ++   F A H                           
Sbjct: 179 DIVTEKAIRWMKAQDKS---QPFLMCCHFKATHEPYDFPERMRHLYDGVVFPEPENLLDW 235

Query: 108 GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQV-----TRKMQPV--HRQFTDLL 160
           GPE +   +         H+      + +PDK W   +     T+ M  +   +     L
Sbjct: 236 GPETNGRTFVGQKLETIGHNWEVA--SADPDKWWCRYLELPFTTKGMSRIAARKAIYQKL 293

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           +   L+   +VDD + K++  L ++G  +NT ++Y +D GY LG+ G    + F +E   
Sbjct: 294 IRDYLRCAATVDDNIGKLLDALDEMGIADNTIVVYVADQGYFLGEHGFYDKRMF-YEESA 352

Query: 221 RVPFLMRGPGIVPG 234
           R+PF++R P  +P 
Sbjct: 353 RMPFVIRYPKCIPA 366


>gi|225011395|ref|ZP_03701833.1| sulfatase [Flavobacteria bacterium MS024-2A]
 gi|225003898|gb|EEG41870.1| sulfatase [Flavobacteria bacterium MS024-2A]
          Length = 501

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/121 (34%), Positives = 63/121 (52%), Gaps = 12/121 (9%)

Query: 133 YAPNPDKQWILQVTRK---MQPVHRQFTDLLMTK-------RLQTLQSVDDAVEKIVKEL 182
           ++P   K W+   T K   M+  + +  +L + K        L+T+QSVDD V +I+  L
Sbjct: 248 FSPEQKKAWLNAYTPKNEAMKTQNLEGNELALWKFNRYIKDYLRTIQSVDDGVGEILDYL 307

Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
              G   NT ++YTSD G++LG+ G    K F +E  +R P LMR P  +P     +D L
Sbjct: 308 DQEGLTENTIVVYTSDQGFYLGEHGWFD-KRFMYEESLRTPLLMRYPKEIPANS-KVDAL 365

Query: 243 I 243
           I
Sbjct: 366 I 366


>gi|372210453|ref|ZP_09498255.1| sulfatase yidj [Flavobacteriaceae bacterium S85]
          Length = 512

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 51/185 (27%), Positives = 91/185 (49%), Gaps = 26/185 (14%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH------GPEDSAPQYANM----FF 121
           +  D + + ++ F    K +   +P   V+S+P PH       P D+     N+     F
Sbjct: 201 FATDWLTDRAIEFFDAQKSN--NQPFFQVVSYPDPHTNNTVRAPYDTLYNIKNVKLPKTF 258

Query: 122 NVTSHHT-PAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT--------LQSVD 172
           N+  H   PA+    NP+  + +    K + + +++T     ++L+         ++ +D
Sbjct: 259 NLEYHKDLPAWK---NPELDFAVDKNTKEKKLKKEYTPEKTKRKLKKDITQYFGMVKCID 315

Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP-GI 231
           D + K+++ LK  G+LNNT II+TSDHG  LG++G    K  P E   ++PF++  P  I
Sbjct: 316 DNIGKMIQALKKNGQLNNTIIIFTSDHGDLLGEYGR-DNKGVPQEGSAKIPFIVYYPKAI 374

Query: 232 VPGTM 236
             GT+
Sbjct: 375 KAGTV 379


>gi|118442882|ref|YP_878798.1| sulfatase [Clostridium novyi NT]
 gi|118133338|gb|ABK60382.1| probable sulfatase [Clostridium novyi NT]
          Length = 504

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 47/186 (25%), Positives = 83/186 (44%), Gaps = 28/186 (15%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----------- 120
           +Y + IA  +   L Q K++   +   L  SF  PH P      +A+M+           
Sbjct: 193 HYDNWIAERTNALLEQYKEN--DENFFLWSSFFDPHPPYFVPEPWASMYNPDDITLPHYL 250

Query: 121 ----------FNVTSHHTPAYD--YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
                        T    P +   Y P  DK W+      +Q       D+ +   + ++
Sbjct: 251 GGEHLTTSPILKKTQEENPDFSQYYEPK-DKNWLHGCHSHLQDKEELKKDIAIYYGMISM 309

Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
             +D  + KI+ +L++LG   +T +++TSDHG+  GQ  L+    F +E  I++PF++R 
Sbjct: 310 --MDHYIGKILDKLEELGMAEDTIVVFTSDHGHFFGQHNLIAKGPFHYEDMIKIPFIVRE 367

Query: 229 PGIVPG 234
           PG++P 
Sbjct: 368 PGVIPA 373


>gi|410098956|ref|ZP_11293930.1| hypothetical protein HMPREF1076_03108 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409220087|gb|EKN13044.1| hypothetical protein HMPREF1076_03108 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 481

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 50/177 (28%), Positives = 85/177 (48%), Gaps = 13/177 (7%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY-DY 133
           D+   D+V ++ + K++   KP  L +++ +PH P     +Y   +        P Y D 
Sbjct: 195 DIFTEDAVEYIEKQKEA--GKPFFLSVAYMSPHDPRTMPDEYMRQYDFDKIQLPPNYMDK 252

Query: 134 APNPDKQWIL--QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
            P  + + ++  +V   +  V  +    +M +    +  VD  V  IVK LK+ GE  NT
Sbjct: 253 HPFDNGELVIRDEVLAAIPRVPDEIKKHIM-EYYAMVTHVDRRVGDIVKALKETGEYENT 311

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-VPGT-----MYLLDVL 242
            I++  D+G  +GQ GL+ GK   +E  +RVP +++  G  V G       YL+DV 
Sbjct: 312 IIVFAGDNGLAVGQHGLM-GKQNVYEHSVRVPLMIKQAGKDVKGQKSDKLCYLIDVF 367


>gi|443695661|gb|ELT96527.1| hypothetical protein CAPTEDRAFT_63083, partial [Capitella teleta]
          Length = 490

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 15/113 (13%)

Query: 136 NPDKQWILQVTRKMQPVHRQFTDLLMTKR--------------LQTLQSVDDAVEKIVKE 181
           +PD  WI        P + QF + L                     +  VDD V +I+K+
Sbjct: 225 SPDIAWIKNTELHGAPEYSQFMEGLSRNNSLPYVIVRDLRRAYFSCISYVDDLVGEILKD 284

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           L DLG  N+T + +  DHGYHLG+  ++ GK+  FE     P ++R PG+  G
Sbjct: 285 LNDLGIANDTVVSFIGDHGYHLGEHSMI-GKNTAFEVANNSPMIIRIPGVTDG 336


>gi|168335082|ref|ZP_02693192.1| sulfatase [Epulopiscium sp. 'N.t. morphotype B']
          Length = 541

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 53/181 (29%), Positives = 85/181 (46%), Gaps = 20/181 (11%)

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF-FNVTSHHTP 129
           +Y    + N ++ +L Q+       P  L + +  PH P +   +Y  ++         P
Sbjct: 167 NYETAWVENQTINWLEQN----GDNPFFLFVGYIKPHPPWNIPAKYQGIYQXAAIPEAIP 222

Query: 130 AYDYAP-NPD-KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
              Y P +P+   W        Q   R+  +   T     +  VD++  KI++ L+  G+
Sbjct: 223 KARYYPEDPNHNSWYGDDDSNXQL--RKHQEAYYT----AVSMVDESFGKIIEHLRXTGK 276

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGT-----MYLLDV 241
           L+NT +IYTSDHG  LG  G    KS P+E  +R+P L+R P +  PGT     + LLD+
Sbjct: 277 LDNTLVIYTSDHGEMLGDRGYYS-KSVPYESAVRIPLLVRYPEVFEPGTTNEDFVDLLDI 335

Query: 242 L 242
           L
Sbjct: 336 L 336


>gi|410643845|ref|ZP_11354334.1| extracellular sulfatase Sulf [Glaciecola chathamensis S18K6]
 gi|410136471|dbj|GAC12521.1| extracellular sulfatase Sulf [Glaciecola chathamensis S18K6]
          Length = 477

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 47/165 (28%), Positives = 85/165 (51%), Gaps = 15/165 (9%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA-------PQYANMFFNVT---SH 126
           I N +  F++ +  S   KP +L ++F  PH  +          P++   + +VT   ++
Sbjct: 170 IQNLAKEFIQDA--SSKDKPWVLAVNFWNPHAHDTDKVDQFHYPPEFEEYYQDVTVPDAN 227

Query: 127 HTPAYDYAPNPD--KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
           ++  +D+   P   K  + +V  + +  + +    ++ +  + + SVD AV  I  +LK 
Sbjct: 228 NSTKHDFEELPKFLKDSMARVRWEYRFANPKMYQRIIKRHYRAISSVDKAVGAISAQLKK 287

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
           LG  +NT IIYT D+GY L +  L  GK F +E D+RVP +++ P
Sbjct: 288 LGVDDNTIIIYTGDNGYSLNERQLA-GKWFGWEEDLRVPLIIKDP 331


>gi|73539584|ref|YP_299951.1| sulfatase [Ralstonia eutropha JMP134]
 gi|72122921|gb|AAZ65107.1| Sulfatase [Ralstonia eutropha JMP134]
          Length = 482

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 58/191 (30%), Positives = 88/191 (46%), Gaps = 21/191 (10%)

Query: 57  MNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH----GPEDS 112
           M G  I  GE  Y  Y    +   +V++L+++ Q   +   +L +   APH     PE+ 
Sbjct: 145 MLGEHIGPGESSYTQY-DRAVTQRAVQWLQEAAQR-QEAGFVLYVGLVAPHFPFVVPEEF 202

Query: 113 APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQS-- 170
              Y          H P   Y  +P   W+ +    M    RQF D    +RL+   +  
Sbjct: 203 YSLYPTDGLPEPKLH-PRTGYEQHP---WVREYCDFMA-SERQFAD--ADERLRAFAAYY 255

Query: 171 -----VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
                +D  V +I+  L+D G  + T+I+YTSDHG +LG  G V GKS  +E  ++VP L
Sbjct: 256 GLCTWLDHNVGQILGALRDNGLEDTTHIVYTSDHGDNLGARG-VWGKSTLYEESVKVPML 314

Query: 226 MRGPGIVPGTM 236
           + GP + PG  
Sbjct: 315 LAGPIVTPGVC 325


>gi|418404758|ref|ZP_12978202.1| choline-sulfatase [Sinorhizobium meliloti CCNWSX0020]
 gi|359501269|gb|EHK73887.1| choline-sulfatase [Sinorhizobium meliloti CCNWSX0020]
          Length = 512

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 46/164 (28%), Positives = 75/164 (45%), Gaps = 7/164 (4%)

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
            +AN  +  L +     S++P  L +SF  PH P  +  ++ +++     H TP     P
Sbjct: 170 FLANQKLYQLSRENDDESRRPWCLTVSFTHPHDPYVARRKFWDLYEGC-EHLTPEVGAIP 228

Query: 136 ----NPDKQWI-LQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
               +P  Q I L    +   V  +            +  +D+ V +++  L     L++
Sbjct: 229 LDEQDPHSQRIMLSCDYQNFDVTEENVRRSCRAYFANISYLDEKVGELIDTLTRTRMLDD 288

Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           T I++ SDHG  LG+ GL    +F FE   RVP ++ GPGI PG
Sbjct: 289 TLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGPGIAPG 331


>gi|383115488|ref|ZP_09936244.1| hypothetical protein BSGG_2641 [Bacteroides sp. D2]
 gi|313695106|gb|EFS31941.1| hypothetical protein BSGG_2641 [Bacteroides sp. D2]
          Length = 497

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 84/172 (48%), Gaps = 19/172 (11%)

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF---- 120
           GE++ +  Y D     ++RFL+  KQ   K+P +  ++F +PH P +  P Y   +    
Sbjct: 187 GEEFSSKMYADA----AIRFLQ--KQKGDKQPFLAYVAFTSPHDPRNQLPNYGRKYSPDT 240

Query: 121 FNVTSHHTPAYDYAPNPDK---QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
            +V  +  P + +     +   + +L   R  Q V ++ +D         +  VD  + +
Sbjct: 241 LDVPRNFLPKHPFNNGEMRVRDELLLPAPRTEQQVQKELSDYY-----GMISEVDVQIGR 295

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
           I++ L+  G+  NT +++ SD+G  +G+ GL+ GK   ++  ++VP  +  P
Sbjct: 296 IMEVLRATGQAENTIVVFASDNGLAVGRHGLL-GKQNLYDHSVKVPLTIIAP 346


>gi|56697077|ref|YP_167440.1| choline sulfatase [Ruegeria pomeroyi DSS-3]
 gi|56678814|gb|AAV95480.1| choline sulfatase [Ruegeria pomeroyi DSS-3]
          Length = 498

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 80/164 (48%), Gaps = 18/164 (10%)

Query: 81  SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQ 140
           ++ +LR +  +    P ML +SF  PH P      Y  M+ +      P  DY  + D +
Sbjct: 184 ALEYLRHAPDT----PWMLTVSFTNPHPPYVVPRPYWEMYKDAD---IPLPDYPADMDAR 236

Query: 141 W--ILQVTRKMQPVHRQFTDLLMTKRLQTLQS--------VDDAVEKIVKELKDLGELNN 190
           +       R+   +H++  ++   + L  ++         VDD +  +++ L + G+ + 
Sbjct: 237 YSEFDHALRRWHGLHQRGHEVRDPRNLIAMRRGFAALAHYVDDKIGALLEVLDETGQRDE 296

Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           T II TSDHG  LG+ GL++ +S  +E+  R+P ++  PG  PG
Sbjct: 297 TVIIVTSDHGEMLGEKGLIQKRSL-YEWSARIPLIIDLPGAAPG 339


>gi|171912626|ref|ZP_02928096.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Verrucomicrobium spinosum DSM 4136]
          Length = 468

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 53/232 (22%), Positives = 104/232 (44%), Gaps = 30/232 (12%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNG-RKIKHGEDYYNDYYPDLIA 78
            Y GK+    +     P +  W +     KY++   N+ G RK+  G      YY   + 
Sbjct: 94  AYIGKWHMGEDNDEKRPHFNYWASHKGQGKYFDTEFNIEGERKVLPG------YYTTSVT 147

Query: 79  NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV-TSHHTPAYDYAPNP 137
             +  ++++S      KP  L++   APH       +YA+ F ++   +   A++    P
Sbjct: 148 QLATDYIKKSHD----KPWCLMLGHKAPHSFYTPEEKYAHTFDDIRVPYPKSAFNLKDKP 203

Query: 138 ----DK------------QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
               D+            +W  +         + F +++       + SVDD++  + K+
Sbjct: 204 RWYRDRLNTWHGIYGPLFEWRKEFPNNKPEGVKDFENMIHG-YWGVILSVDDSIGALTKQ 262

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           L++ G+LNNT +++  D+G   G+ G+V  ++   +  IR+P ++R PG+VP
Sbjct: 263 LEESGQLNNTVVVFMGDNGLLNGENGMVDKRAM-HDASIRIPLIVRYPGLVP 313


>gi|91789296|ref|YP_550248.1| sulfatase [Polaromonas sp. JS666]
 gi|91698521|gb|ABE45350.1| sulfatase [Polaromonas sp. JS666]
          Length = 511

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 81/182 (44%), Gaps = 28/182 (15%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPAYD 132
           I + ++ F+R+     +K+   L  SFP PH P D    ++ +      ++ +H T  ++
Sbjct: 205 IGDRTIEFMRKHAGEAAKR-FCLWASFPDPHHPFDCPEPWSRLHHPDEVDLPAHRTTDFE 263

Query: 133 YAPNPDKQWILQVTRKMQPVH-------RQFTDLLMTKRLQTLQS-----------VDDA 174
             P     W  + +   +PV        RQ    + T   Q L++           VD  
Sbjct: 264 RRP-----WWHKASMDSKPVGDAAVQALRQNFSRMPTPAEQQLRNITANYYGMISLVDHQ 318

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           V +I   L+ LG   NT +I+TSDHG  LG  GL+     P+E  +RV  ++ GP +  G
Sbjct: 319 VGRIQTALQQLGLDGNTLVIFTSDHGEWLGDHGLMLKGPIPYEGVLRVGMVVNGPQVQAG 378

Query: 235 TM 236
            +
Sbjct: 379 QV 380


>gi|182625265|ref|ZP_02953040.1| sulfatase family protein [Clostridium perfringens D str. JGS1721]
 gi|177909577|gb|EDT72017.1| sulfatase family protein [Clostridium perfringens D str. JGS1721]
          Length = 481

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 73/155 (47%), Gaps = 5/155 (3%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           + N+S+ FLR+   S   KP  L MSF  PH P D    Y +M+ +         D+A  
Sbjct: 183 VVNESIDFLRRRDTS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239

Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
            D++   +    ++ + +++           ++  +D  + + +  L + GELNNT  ++
Sbjct: 240 EDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLF 299

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
            SDHG  +G     + K  P+E   RVPF +  PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSARVPFFIYDPG 333


>gi|149196982|ref|ZP_01874035.1| N-acetylglucosamine-6-sulfatase precursor [Lentisphaera araneosa
           HTCC2155]
 gi|149140092|gb|EDM28492.1| N-acetylglucosamine-6-sulfatase precursor [Lentisphaera araneosa
           HTCC2155]
          Length = 495

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 50/228 (21%), Positives = 97/228 (42%), Gaps = 36/228 (15%)

Query: 37  GWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKP 96
           G+  W  L  + KY N ++     K+ H       +  D+I N ++ +  + +     KP
Sbjct: 132 GFDHWETLPGHGKYINPALLTAEGKVNH-----EGHSSDVITNRALEWYNEKRDK--DKP 184

Query: 97  IMLVMSFPAPHGPEDSAPQYANMF-------------------------FNVTSHHTPAY 131
            ML++ + APH     AP++   F                           +  H     
Sbjct: 185 FMLMVHYKAPHRAWIPAPRFVEKFKDTVFPEPKTLFDDYKTRDMAVQHSMGIREHMNLEK 244

Query: 132 DYAPN--PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
           D   N    ++++L      + + RQ     M      +  VD+ V +++ +LK  G  +
Sbjct: 245 DLKTNIWEHRKYLLNKDLSKEELTRQKYQAYMRDYFACVAGVDENVGRLLDQLKADGIDD 304

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGTM 236
           NT ++++SD G++LG+ G    K + +E   R P++++ PG+  PG++
Sbjct: 305 NTIVMFSSDQGFYLGEHGWFD-KRWMYEESFRTPYIVKWPGVTKPGSV 351


>gi|431797960|ref|YP_007224864.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
 gi|430788725|gb|AGA78854.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
          Length = 520

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 56/177 (31%), Positives = 81/177 (45%), Gaps = 30/177 (16%)

Query: 69  YNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHT 128
           Y  Y P L+  ++V+F+    ++  KKP  L +  P PH  E  AP+     F       
Sbjct: 229 YGAYSPALMQKEAVKFI----EAHQKKPFFLFLPTPLPHA-ELIAPEEEMALFEGKLQEG 283

Query: 129 PAY---DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
             Y   D  P   K    Q+  + QP H  F  ++        + +DD V +I+K +  L
Sbjct: 284 KPYKGTDEGPGFRKG---QLGSQEQP-HAAFAAMV--------KILDDQVGEIMKTVDSL 331

Query: 186 GELNNTYIIYTSDHGYHL---------GQFGLVKG-KSFPFEFDIRVPFLMRGPGIV 232
           G  NNT II+TSD+G H+            G +KG K   +E  IRVP +MR PG +
Sbjct: 332 GLANNTLIIFTSDNGPHIEGGGDPAFFDSNGPLKGIKRDLYEGGIRVPLIMRWPGRI 388


>gi|116625900|ref|YP_828056.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
 gi|116229062|gb|ABJ87771.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
          Length = 499

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 55/225 (24%), Positives = 99/225 (44%), Gaps = 19/225 (8%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY--NYSINMNGRKIKHGEDYYNDYYPDLI 77
           G+ GK+     G    PG+ +W +      Y      +N++G+ +         Y  D +
Sbjct: 122 GFVGKWHMGREGDDPQPGFDKWVSFRGQGSYLPERNGLNVDGKHVPQ-----KGYITDEL 176

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA-PN 136
            + ++ +LR   +   ++P  L +S  A H     A ++   +   T       D + PN
Sbjct: 177 TDYALDWLRTVPK---EQPYFLYLSHKAVHADFIPADRHKGAYAKETFRPPTTMDESGPN 233

Query: 137 PDKQ--WILQVTRKMQ----PVHRQFTDLLMTKR-LQTLQSVDDAVEKIVKELKDLGELN 189
              +  W+            P H         KR  +TL  VDD+V++++  L++ G+L+
Sbjct: 234 AQHRPMWVQNQRNSWHGVDFPYHSDLDVGEYYKRYAETLLGVDDSVDRMLDALRERGQLD 293

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +T +IY  D+G+  G+ GL+  K   +E  +RVP L R P +  G
Sbjct: 294 STLVIYMGDNGFQFGEHGLID-KRTAYEESMRVPLLARCPEMFSG 337


>gi|171911709|ref|ZP_02927179.1| choline sulfatase [Verrucomicrobium spinosum DSM 4136]
          Length = 469

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 49/177 (27%), Positives = 85/177 (48%), Gaps = 16/177 (9%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           +L A+ +V F+R  KQS   +P +  ++F APH P  + P +     N      P     
Sbjct: 166 ELFADKAVEFVRAQKQS--SQPWLCYVAFNAPHDPRKAPPAWHEQT-NANKPPIPENFLP 222

Query: 135 PNPDKQWILQVT-RKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELN 189
            +P     + V   K+ P  R  T+ ++ + L      +  +D  + +I++ L+  G+  
Sbjct: 223 VHPFNNGEMTVRDEKLAPWPR--TEPVIRQELADYYAAIMFMDSQIGRILESLRATGQDE 280

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLDV 241
            T I+++SDHG  +G  GL+ GK   ++  +  P ++ GPG+  G       YLLDV
Sbjct: 281 KTIIVFSSDHGLAIGSHGLM-GKQSLYDHSMHSPLILAGPGVPKGEKRAALCYLLDV 336


>gi|168333697|ref|ZP_02691950.1| mucin-desulfating sulfatase [Epulopiscium sp. 'N.t. morphotype B']
          Length = 491

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 61/204 (29%), Positives = 91/204 (44%), Gaps = 23/204 (11%)

Query: 41  WGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI-ANDSVRFLRQSKQSFSKKPIML 99
           W     N KY+    N    K    E+ +ND +  +  A+ +  FL   K + S KP ML
Sbjct: 151 WNGSPSNLKYFPDKEN----KFNGYENQWNDDHLTMFNAHQADAFLE--KAADSGKPFML 204

Query: 100 VMSFPAPHGPEDSAPQYANMFFNVT----SHHTPAY---------DYAPNPDKQWI-LQV 145
            +SF APH P  ++P++   + ++T     +  P Y          ++ N D+ W   + 
Sbjct: 205 SVSFKAPHRPHTASPKWVQFYKDMTIKRMDNDKPEYFAVLPEVVRTHSRNADEYWGGHRY 264

Query: 146 TRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQ 205
           TRK       F           +  VD+AV  I  +L +LG   NT IIYTSD+GY  G 
Sbjct: 265 TRKAWNDDATFQHDFRN-YYGLISGVDEAVGSIRHKLDELGLAENTIIIYTSDNGYFCGS 323

Query: 206 FGLVKGKSFPFEFDIRVPFLMRGP 229
             L  GK   +E  I+ P ++  P
Sbjct: 324 KQL-GGKELLYEESIKAPLIIYDP 346


>gi|384208890|ref|YP_005594610.1| sulfatase [Brachyspira intermedia PWS/A]
 gi|343386540|gb|AEM22030.1| sulfatase [Brachyspira intermedia PWS/A]
          Length = 474

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 75/148 (50%), Gaps = 14/148 (9%)

Query: 96  PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQ----- 150
           P+ + +SF  PH P D   +Y +++ ++      +       DK+    +T++ +     
Sbjct: 197 PLFMWVSFVDPHHPFDPIKKYYDIYKDINPKELNS---KLKLDKKRPEHLTKQGERGYWP 253

Query: 151 ----PVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLG 204
                 +    ++   KRL    +  +D  + +I+ +LK+  E +NT I++TSDHG +LG
Sbjct: 254 GGGEEHNYSEEEIKEIKRLYYGMISFIDSQIGRIIDKLKEKNEFDNTIIVFTSDHGEYLG 313

Query: 205 QFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
            +GL+K   F ++  I+VP L  G GIV
Sbjct: 314 DYGLLKKGPFMYDCLIKVPLLFYGKGIV 341


>gi|423288192|ref|ZP_17267043.1| hypothetical protein HMPREF1069_02086 [Bacteroides ovatus
           CL02T12C04]
 gi|392671081|gb|EIY64557.1| hypothetical protein HMPREF1069_02086 [Bacteroides ovatus
           CL02T12C04]
          Length = 529

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 80/173 (46%), Gaps = 11/173 (6%)

Query: 61  KIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF 120
           K K G+D Y +Y  D + N  +  L +       +P  +++S P PH P+     Y + +
Sbjct: 191 KNKEGDDAY-EYSTDFLTNKVIELLEERDN----RPFCMMLSIPDPHSPDLCKEPYISQY 245

Query: 121 FNVT-SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
             +  ++ +   +   N    W   +  K Q    +F    +      ++ +DD V KI+
Sbjct: 246 AKIEITNPSTNTEELLNARPGW--GIGGKAQA--EKFVPKNIRNYFAMVKCIDDNVGKIL 301

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           K L D    +NT +++TSDHG  L     V  K  P++  IRVPF++R P  +
Sbjct: 302 KYLDDNSLTDNTIVVFTSDHGDMLYNHMRVD-KGCPYDDAIRVPFIVRYPSRI 353


>gi|393784374|ref|ZP_10372539.1| hypothetical protein HMPREF1071_03407 [Bacteroides salyersiae
           CL02T12C01]
 gi|392666150|gb|EIY59667.1| hypothetical protein HMPREF1071_03407 [Bacteroides salyersiae
           CL02T12C01]
          Length = 546

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 54/189 (28%), Positives = 83/189 (43%), Gaps = 27/189 (14%)

Query: 66  EDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----- 120
           E Y+ D +   IA   +  L Q K+    KP ML  SF  PH P      +A+M+     
Sbjct: 232 EKYHTDAW---IAERCIALLDQYKKG--DKPFMLWASFFDPHNPYLIPEPWASMYKPEDM 286

Query: 121 -----FNVTSHHTPAY---DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSV- 171
                 N      P +       NPDK  I        PVH         ++LQ  +++ 
Sbjct: 287 DIPETVNDDISDMPLHYRVTREENPDKS-IWAEGNPSYPVHGIAQHRWTKEKLQQNKAIY 345

Query: 172 -------DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPF 224
                  D  + KI+  L++ G+++NT II+T+DHG  +G  GLV    F +E  +++PF
Sbjct: 346 YGMVSMMDHYIGKILDHLEENGQIDNTLIIFTTDHGAFIGHHGLVGKGVFDYEDAVKIPF 405

Query: 225 LMRGPGIVP 233
           +    G +P
Sbjct: 406 IASYKGKMP 414


>gi|404367236|ref|ZP_10972607.1| hypothetical protein FUAG_01415 [Fusobacterium ulcerans ATCC 49185]
 gi|404288764|gb|EFS25900.2| hypothetical protein FUAG_01415 [Fusobacterium ulcerans ATCC 49185]
          Length = 472

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/147 (25%), Positives = 75/147 (51%), Gaps = 8/147 (5%)

Query: 96  PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKM----QP 151
           P+ +  SF  PH P D   ++A+++ N       + ++  N   + + +  +        
Sbjct: 196 PLFMWTSFVDPHHPFDPIEKFASLYENTIIDKPISKNFLCNGRPEHLQKQGKNGYWPGGG 255

Query: 152 VHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG 207
               F+D  + +  +     +  +D  + KI+K L++  +L+NT II+TSDHG ++G +G
Sbjct: 256 EQHNFSDEKIEEFTKYYYAMISFIDQEIGKILKSLEEKKQLDNTIIIFTSDHGEYMGDYG 315

Query: 208 LVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           L++   F ++  I+VP L+ G G+  G
Sbjct: 316 LLQKGPFMYDDLIKVPLLIWGKGVSKG 342


>gi|340620633|ref|YP_004739086.1| sulfatase [Zobellia galactanivorans]
 gi|339735430|emb|CAZ98807.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
          Length = 538

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 54/205 (26%), Positives = 91/205 (44%), Gaps = 27/205 (13%)

Query: 41  WGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLV 100
           W A   N  Y++   N N    K GE+         +A ++V FL+++  + + KP+   
Sbjct: 178 WDAGSPNGGYFSPYNNPNLENRKDGENLSM-----RLAEETVTFLKENDPNKTGKPVFAY 232

Query: 101 MSFPAPHGP-EDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL 159
           +SF A HGP + +  ++A        +      Y              K  P+ RQ  D 
Sbjct: 233 LSFYAVHGPIQTTKEKWAKYRKKAVRNGVADSGYK-----------MAKFLPI-RQVQDN 280

Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG-------YHLGQFGLVKGK 212
            +   L  ++++DDAV +++  L  LG   NT +++TSD+G       +      L  GK
Sbjct: 281 PVYAGL--VETMDDAVGRVLNTLDSLGLDKNTLVVFTSDNGGVSAGDSFSTSNLPLRGGK 338

Query: 213 SFPFEFDIRVPFLMRGPGIVPGTMY 237
            + FE  IR P+ ++ P +V G  Y
Sbjct: 339 GYQFEGGIREPYFIKAPWLVKGGAY 363


>gi|373498627|ref|ZP_09589132.1| hypothetical protein HMPREF0402_03005 [Fusobacterium sp. 12_1B]
 gi|371960785|gb|EHO78430.1| hypothetical protein HMPREF0402_03005 [Fusobacterium sp. 12_1B]
          Length = 472

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 75/148 (50%), Gaps = 8/148 (5%)

Query: 96  PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKM----QP 151
           P+ +  SF  PH P D   ++A+++ N       + ++  N   + + +  +        
Sbjct: 196 PLFMWASFVDPHHPFDPIEKFASLYENTIIDKPISKNFLCNGRPEHLQKQGKNGYWPGGG 255

Query: 152 VHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG 207
               F+D  + +  +     +  +D  + KI+K L++  +L+NT II+TSDHG ++G +G
Sbjct: 256 EQHNFSDEKIEEFTKYYYAMISFIDQEIGKILKSLEEKKQLDNTIIIFTSDHGEYMGDYG 315

Query: 208 LVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           L++   F ++  I+VP L+ G G+  G 
Sbjct: 316 LLQKGPFMYDDLIKVPLLIWGKGVSKGV 343


>gi|149176956|ref|ZP_01855565.1| choline sulfatase [Planctomyces maris DSM 8797]
 gi|148844211|gb|EDL58565.1| choline sulfatase [Planctomyces maris DSM 8797]
          Length = 549

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 44/174 (25%), Positives = 84/174 (48%), Gaps = 15/174 (8%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPAYD 132
           I +D++ FL    Q  + +P  + ++F  PH P  +  +Y + +      + +++ P + 
Sbjct: 162 IVDDAIEFL----QKKNTQPFFMYLAFACPHDPRVADQEYMDHYEREEIPLPANYLPLHP 217

Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
           +  N  +Q +        P  +      +      +  +D  + +++K LK+ GE +NT 
Sbjct: 218 F--NNGEQVVRDELLAGFPRSKAEIRKHLHDYYADITGLDRHIGRLIKALKESGEYDNTV 275

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT----MYLLDVL 242
           II++SDHG  +G  GL+ GK   +E  ++ P +  GPGI  G     +YL D+ 
Sbjct: 276 IIFSSDHGLAVGSHGLM-GKQSLYEHSMKSPLIFSGPGIPHGQSNALVYLYDIF 328


>gi|420237904|ref|ZP_14742349.1| arylsulfatase A family protein [Rhizobium sp. CF080]
 gi|398089566|gb|EJL80081.1| arylsulfatase A family protein [Rhizobium sp. CF080]
          Length = 503

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 47/83 (56%), Gaps = 1/83 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M + L+T+QS+DD V +++  L   G   NT +IYTSD G+ LG+ G    K F +E  +
Sbjct: 286 MKRYLRTVQSIDDNVGRLLDHLDKKGLTENTLVIYTSDQGFFLGEHGWFD-KRFMYEESL 344

Query: 221 RVPFLMRGPGIVPGTMYLLDVLI 243
           ++PFL R P  +P      D+  
Sbjct: 345 QMPFLARYPAAIPAGTTCGDLAC 367


>gi|168210574|ref|ZP_02636199.1| sulfatase family protein [Clostridium perfringens B str. ATCC 3626]
 gi|170711367|gb|EDT23549.1| sulfatase family protein [Clostridium perfringens B str. ATCC 3626]
          Length = 481

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 73/155 (47%), Gaps = 5/155 (3%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           + N+S+ FLR+   S   KP  L MSF  PH P D    Y +M+ +         D+A  
Sbjct: 183 VVNESIDFLRRKDPS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239

Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
            D++   +    ++ + +++           ++  +D  + + +  L + GELNNT  ++
Sbjct: 240 EDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLF 299

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
            SDHG  +G     + K  P+E   RVPF +  PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSSRVPFFIYAPG 333


>gi|344204106|ref|YP_004789249.1| sulfatase [Muricauda ruestringensis DSM 13258]
 gi|343956028|gb|AEM71827.1| sulfatase [Muricauda ruestringensis DSM 13258]
          Length = 542

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 72/289 (24%), Positives = 112/289 (38%), Gaps = 81/289 (28%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGED--YYNDYYPDLI 77
           GY    + K++    P G+  W  L     YYN        +   GED      Y  DLI
Sbjct: 122 GYETAIVGKWHLHGKPTGFDYWDVLNDQGNYYN-------PEFIQGEDTTVVEGYATDLI 174

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG-------------------PEDSAPQYAN 118
            + S+ +L+Q + +  +KP  L++   APH                    P+   P++ N
Sbjct: 175 TDKSLEWLKQREGN--QKPFYLMVHHKAPHRNWMPALRHLNVYDSITFPLPDTYFPEFKN 232

Query: 119 MFFNVTSHHT-----------------PAYDYAPNPDKQWILQVTRKMQPVHRQFTDLL- 160
                    T                  + D A NP   W     R M P  R   D   
Sbjct: 233 QRAAAEQQQTIYKDMYEGHDLKMSDGYGSTDLAHNP---WTKDFDR-MTPEQRNIWDEAY 288

Query: 161 -------------------------MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
                                    + + L T++SVD+ + KI+  L+  G   NT ++Y
Sbjct: 289 LPKNNAFYEADLHGKELAEWKGQRYLHEYLATVKSVDEGIGKILGYLEATGLDENTLVVY 348

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGTMYLLDVLI 243
           TSD G++LG+ G    K F +E  +++P LM+ PG++ PGT   +D L+
Sbjct: 349 TSDQGFYLGEHGWFD-KRFMYEESMKMPLLMQLPGVIEPGTN--IDALV 394


>gi|198274928|ref|ZP_03207460.1| hypothetical protein BACPLE_01087 [Bacteroides plebeius DSM 17135]
 gi|198272375|gb|EDY96644.1| arylsulfatase [Bacteroides plebeius DSM 17135]
          Length = 451

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 59/252 (23%), Positives = 102/252 (40%), Gaps = 47/252 (18%)

Query: 5   VELTSRNLLYFDI--SHGYFGKYLNKY--NGSYIP----------PGWREWG-------- 42
           VEL+   + + D+    GY   Y+ K+  +  Y P            W EW         
Sbjct: 74  VELSEEAVCWSDVLKKEGYRTAYIGKWHLDAPYRPYVDTYNNKGEVAWNEWCPPERRHGF 133

Query: 43  -ALIMNSKYYNYSINMNGRKIKHGEDYY--NDYYPDLIANDSVRFLRQSKQSFSKKPIML 99
              I    Y N+   M        E++Y  N + P   A+ ++ F+R ++ + + +P  +
Sbjct: 134 DKWIAYGTYDNHLKPMYWDTKSGREEFYYVNQWGPAYEADRAIEFIR-TEGTKTDRPFAM 192

Query: 100 VMSFPAPHG-----PEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
           V+S   PH      PE     Y ++     +   P       P+  +  +  R       
Sbjct: 193 VVSMNPPHTGYELVPETYTRLYDSLDVEALARQLPYISEKGTPEGDYFRENIRNY----- 247

Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
                        +  VD+ V +I++ LKD G  +NT I++TSDHG  +G  G V+GK+ 
Sbjct: 248 ----------YACITGVDEHVGRIIQALKDCGVYDNTLIVFTSDHGVCMGGHG-VEGKNV 296

Query: 215 PFEFDIRVPFLM 226
            +E  +R+P + 
Sbjct: 297 FYEESMRIPMIC 308


>gi|372209997|ref|ZP_09497799.1| sulfatase yidj [Flavobacteriaceae bacterium S85]
          Length = 479

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 45/174 (25%), Positives = 81/174 (46%), Gaps = 7/174 (4%)

Query: 57  MNGRKIKHGEDYYNDYYP-DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ 115
           MN +K +   D  N  +  D   + ++  + +  Q    +P  +++S P PH P+ + P 
Sbjct: 179 MNDKKYESAADKSNIVHATDFFVDKTIEIINRDHQ----QPFFVMVSIPDPHTPDYAKPP 234

Query: 116 YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAV 175
           Y N  +N      P    A +   +      +K    H  F    + +    ++ +DD +
Sbjct: 235 Y-NSLYNHLDLQAPETMSATSIKNRPEWANNKKNDVSHEGFKPQKLKQYFGMVKHIDDRI 293

Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
             I+K L  L   +NT +I+TSDHG    + G +  KS P++  ++VPF++R P
Sbjct: 294 GDILKTLDKLQISDNTILIFTSDHGDMFYEHGRM-NKSVPYDAAMKVPFVIRYP 346


>gi|257067681|ref|YP_003153936.1| arylsulfatase A family protein [Brachybacterium faecium DSM 4810]
 gi|256558499|gb|ACU84346.1| arylsulfatase A family protein [Brachybacterium faecium DSM 4810]
          Length = 478

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 67/149 (44%), Gaps = 15/149 (10%)

Query: 95  KPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP-AYDYAPNP--------DKQWILQV 145
           +P  L  +F APH P     ++A ++        P   ++  +P          + +   
Sbjct: 211 EPFFLWAAFTAPHDPRTPPEEFARLYDRTDPAAVPLPENFRTDPVEATNFGERDENLAAA 270

Query: 146 TRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQ 205
            R  + V     D         +  +DD + +I+  L+  G   NT ++YT+DHG  LGQ
Sbjct: 271 PRDPEEVRGHLADYY-----GMISHLDDGIGRILAHLERSGLAENTLVVYTADHGLSLGQ 325

Query: 206 FGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            G++ GK   +E  +RVP L+ GPGI  G
Sbjct: 326 HGMM-GKQSLYEHSLRVPLLLAGPGIEAG 353


>gi|168217566|ref|ZP_02643191.1| sulfatase family protein [Clostridium perfringens NCTC 8239]
 gi|182380378|gb|EDT77857.1| sulfatase family protein [Clostridium perfringens NCTC 8239]
          Length = 481

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 73/155 (47%), Gaps = 5/155 (3%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           + N+S+ FLR+   S   KP  L MSF  PH P D    Y +M+ +         D+A  
Sbjct: 183 VVNESIDFLRRRDPS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239

Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
            D++   +    ++ + +++           ++  +D  + + +  L + GELNNT  ++
Sbjct: 240 EDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLF 299

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
            SDHG  +G     + K  P+E   RVPF +  PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSARVPFFIYDPG 333


>gi|393771603|ref|ZP_10360073.1| putative N-acetylglucosamine-6-sulfatase [Novosphingobium sp. Rr
           2-17]
 gi|392722961|gb|EIZ80356.1| putative N-acetylglucosamine-6-sulfatase [Novosphingobium sp. Rr
           2-17]
          Length = 575

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 58/236 (24%), Positives = 108/236 (45%), Gaps = 33/236 (13%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY---------NYSINMNGRKIKHGEDYYN 70
            +FGK+    +     PG+ +W + I    Y+         + + N++G+ +        
Sbjct: 140 AFFGKWHMGASTDAPRPGFDKWVSFIGQGNYWPSMVNTGQSSPNFNVDGKAVAQ-----K 194

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF------FNVT 124
            Y  D + + ++ +L  +K+   KKP  L +S    H      P+YA+ +         +
Sbjct: 195 GYITDELTDYAMDWL--TKERDPKKPFFLYLSHKGVHSDPLPPPRYAHQYDKTKFTLPAS 252

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF--TDLLMTKRLQ----TLQSVDDAVEKI 178
           + +TP +    N  K   +Q  R        F   D+ MT+ L+    TL  +DD++ +I
Sbjct: 253 AANTPEH----NAGKPLWVQNQRNTWHGIDFFYNADVPMTEYLKYYYATLSGIDDSLGRI 308

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +  L+      +T +++TSD+GY +G+ GL+  K   +E  IRVP ++  PG +P 
Sbjct: 309 MAYLRKNHLEKDTLVVFTSDNGYMIGEHGLID-KRNAYEASIRVPLVVWEPGTIPA 363


>gi|358065466|ref|ZP_09152008.1| choline-sulfatase [Clostridium hathewayi WAL-18680]
 gi|356696358|gb|EHI57975.1| choline-sulfatase [Clostridium hathewayi WAL-18680]
          Length = 491

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 51/212 (24%), Positives = 87/212 (41%), Gaps = 47/212 (22%)

Query: 45  IMNSKYYNYSINMNGRKIKHGEDYYND--YYPDLIANDSVRFLRQSKQSFSKKPIMLVMS 102
           IM    +  +I+ N +  K  E  +ND   Y + +   +  +L +      K P  L +S
Sbjct: 139 IMQPDSWGNAIHYNRQSAKPKE--WNDGLQYDEEVQVKAREYLYKKGMEKEKTPFCLCVS 196

Query: 103 FPAPHGPEDSAPQYANMF------------------------FNVTSHHTPAYDYAPNPD 138
           +  PH P     +Y NM+                         N   H T  +D     D
Sbjct: 197 YHHPHDPFQPPEKYWNMYEGKDIAIPDLSHVKLKGDTLLDCWLNTGFHRTDVFDVTKAED 256

Query: 139 KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
                     +  V R ++ L+          +DD V +++ +L+  G   NT +I+TSD
Sbjct: 257 ----------LYAVRRAYSGLVTY--------IDDKVGQLLDDLEQCGLRENTVVIFTSD 298

Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           HG  LG+ G+V+ + F +E+  R+P ++  PG
Sbjct: 299 HGDMLGERGMVQKRCF-YEWSSRIPLIISTPG 329


>gi|340616348|ref|YP_004734801.1| sulfatase [Zobellia galactanivorans]
 gi|339731145|emb|CAZ94409.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
          Length = 489

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 46/167 (27%), Positives = 75/167 (44%), Gaps = 38/167 (22%)

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
           D+  D +A ++V+F+     S  +KP  L MSF A H P  + P+    +          
Sbjct: 204 DFLTDALAEEAVKFI----HSTDEKPFFLNMSFYAVHTPITAIPEKVEKY---------- 249

Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
                   K+ I    R+  P +              ++ +DDAV  I+K L++ G ++N
Sbjct: 250 --------KRLIGPDARQKNPTYAGL-----------VEHLDDAVGAILKALEEKGIIDN 290

Query: 191 TYIIYTSDHGYHL-----GQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           T I++TSD+G  +       F L  GK   +E   RVP L++ PG+ 
Sbjct: 291 TIIVFTSDNGGEILHGITDNFPLRDGKGSSYEGGTRVPLLVKWPGVT 337


>gi|422347930|ref|ZP_16428839.1| arylsulfatase [Clostridium perfringens WAL-14572]
 gi|373223241|gb|EHP45592.1| arylsulfatase [Clostridium perfringens WAL-14572]
          Length = 482

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 73/155 (47%), Gaps = 5/155 (3%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           + N+S+ FLR+   S   KP  L MSF  PH P D    Y +M+ +         D+A  
Sbjct: 183 VVNESIDFLRRRDPS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239

Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
            D++   +    ++ + +++           ++  +D  + + +  L + GELNNT  ++
Sbjct: 240 EDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLF 299

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
            SDHG  +G     + K  P+E   RVPF +  PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSARVPFFIYDPG 333


>gi|262405803|ref|ZP_06082353.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294648075|ref|ZP_06725620.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294809254|ref|ZP_06767969.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
 gi|336403337|ref|ZP_08584053.1| hypothetical protein HMPREF0127_01366 [Bacteroides sp. 1_1_30]
 gi|345510315|ref|ZP_08789883.1| hypothetical protein BSAG_00519 [Bacteroides sp. D1]
 gi|229443018|gb|EEO48809.1| hypothetical protein BSAG_00519 [Bacteroides sp. D1]
 gi|262356678|gb|EEZ05768.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292636582|gb|EFF55055.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294443535|gb|EFG12287.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
 gi|335946510|gb|EGN08315.1| hypothetical protein HMPREF0127_01366 [Bacteroides sp. 1_1_30]
          Length = 515

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 40/164 (24%), Positives = 79/164 (48%), Gaps = 7/164 (4%)

Query: 68  YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTS 125
           +  + +P  I N+++ ++ + K++    P  + +SFP PH P      Y +MF    +  
Sbjct: 175 FLEEQHPTKIVNEALAWIEKQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPV 230

Query: 126 HHTPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
             T   D A   +K  IL ++     P   Q    +    +  ++ +DD ++++++ LK 
Sbjct: 231 LKTSRKDLAKKGEKYRILAELEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKA 290

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
            G+  NT  +  SDHG + G++GL++  +   E   R+P +  G
Sbjct: 291 SGQYENTIFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 334


>gi|372211260|ref|ZP_09499062.1| arylsulfatase [Flavobacteriaceae bacterium S85]
          Length = 519

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 56/219 (25%), Positives = 94/219 (42%), Gaps = 21/219 (9%)

Query: 38  WREWGALIMNSKY---YNYSINMNGRKIKHGE-------DYYNDYYPDLIANDSVRFLRQ 87
           WR+ G       +   Y+   N   RK+  GE       +    +  ++  +++V F+  
Sbjct: 166 WRKDGKYKKEDAFIYVYDKKGNRTTRKVAKGEKKGPVKTEKDGPHTSEIFGDEAVDFI-- 223

Query: 88  SKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTR 147
           SKQ   KKP  + ++F APH P  +  +Y  M+       TP+Y      D   +     
Sbjct: 224 SKQE-GKKPFFMYLAFHAPHDPRQAPKKYKKMYPEKKMKLTPSYLPQHPFDNGHLYLRDE 282

Query: 148 KMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHL 203
           ++ P  R  T  +  K L      +  VD  + +++K LK+ G+  NT I+ + D G  +
Sbjct: 283 QLAPWPR--TKAVAQKELSDYYAIISHVDAQIGRVIKTLKEKGQYENTIIVLSGDSGLAV 340

Query: 204 GQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
           G  GL+  ++   E  I VP ++ G   V G     D L
Sbjct: 341 GNHGLMGKQNLYDEDGIHVPLIISGK--VKGQGKRFDAL 377


>gi|168206593|ref|ZP_02632598.1| sulfatase family protein [Clostridium perfringens E str. JGS1987]
 gi|170661981|gb|EDT14664.1| sulfatase family protein [Clostridium perfringens E str. JGS1987]
          Length = 481

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 46/159 (28%), Positives = 74/159 (46%), Gaps = 13/159 (8%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           + N+S+ FLR+   S   KP  L MSF  PH P D    Y +M+ +         D+A  
Sbjct: 183 VVNESIDFLRRRDPS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239

Query: 137 PDKQWILQVTRKMQPVHRQFTDLLMTKRLQ-----TLQSVDDAVEKIVKELKDLGELNNT 191
            D++      + +  V +   +    KR +     ++  +D  + + +  L + GELNNT
Sbjct: 240 DDEE---NRGKDINCV-KGIINNKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNT 295

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
             ++ SDHG  +G     + K  P+E   RVPF +  PG
Sbjct: 296 IFLFVSDHGDMMGDHNWFR-KGIPYEGSSRVPFFIYDPG 333


>gi|168214076|ref|ZP_02639701.1| sulfatase family protein [Clostridium perfringens CPE str. F4969]
 gi|170714434|gb|EDT26616.1| sulfatase family protein [Clostridium perfringens CPE str. F4969]
          Length = 481

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 73/155 (47%), Gaps = 5/155 (3%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           + N+S+ FLR+   S   KP  L MSF  PH P D    Y +M+ +         D+A  
Sbjct: 183 VVNESIDFLRRRDPS---KPFFLNMSFVRPHSPLDPPKFYFDMYKDEELPEHLMGDWANK 239

Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
            D++   +    ++ + +++           ++  +D  + + +  L + GELNNT  ++
Sbjct: 240 EDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLF 299

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
            SDHG  +G     + K  P+E   RVPF +  PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSTRVPFFIYDPG 333


>gi|89069431|ref|ZP_01156786.1| probable sulfatase [Oceanicola granulosus HTCC2516]
 gi|89045061|gb|EAR51139.1| probable sulfatase [Oceanicola granulosus HTCC2516]
          Length = 507

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 43/64 (67%)

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           +D  V +++  L+  GEL++T I++T+DHG+ LGQ GLV    F +E  IR+PF++  PG
Sbjct: 314 IDAEVGRLLDLLEARGELDDTLIVFTTDHGHFLGQHGLVAKGPFHYEDLIRIPFIVAWPG 373

Query: 231 IVPG 234
            VP 
Sbjct: 374 EVPA 377


>gi|311104529|ref|YP_003977382.1| sulfatase family protein 3 [Achromobacter xylosoxidans A8]
 gi|310759218|gb|ADP14667.1| sulfatase family protein 3 [Achromobacter xylosoxidans A8]
          Length = 527

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 77/172 (44%), Gaps = 17/172 (9%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF------FNVTSHHTPA 130
           + + ++  LR+   + S KP  +  SFP PH P     +Y +M+         + H + A
Sbjct: 220 VGDQTIAQLRE--HAASGKPFFIQCSFPDPHHPYTPHGKYWDMYQPDDVILPESFHASAA 277

Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ---------TLQSVDDAVEKIVKE 181
             + P P   W+ Q   + + V         T+R           ++  +D  + ++++ 
Sbjct: 278 PGHHPPPHVAWMHQQRDQGKAVKHTPAIFACTEREAREAIALNYGSISHIDATIGRVMQA 337

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           L++ G  +NT +I+TSDHG + G   L+      ++  IR PF+ R P   P
Sbjct: 338 LRENGLDDNTVVIFTSDHGDYFGDHQLLLKGPIHYQSLIRTPFIWRDPQAAP 389


>gi|359408966|ref|ZP_09201434.1| arylsulfatase A family protein [SAR116 cluster alpha
           proteobacterium HIMB100]
 gi|356675719|gb|EHI48072.1| arylsulfatase A family protein [SAR116 cluster alpha
           proteobacterium HIMB100]
          Length = 509

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 3/98 (3%)

Query: 137 PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
           P +Q  +Q T++M         L+       +  +D  + ++  +L  LG   NT II+T
Sbjct: 285 PREQGFMQTTQRMS---EDTLRLIKAYTCGMVSMIDTGIGQLRAQLAQLGLTENTIIIFT 341

Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           SDHG  LG  GL++    P+   ++VP ++ GPGIV G
Sbjct: 342 SDHGELLGDHGLIRKGPSPYLALLKVPLIITGPGIVAG 379


>gi|221134141|ref|ZP_03560446.1| iduronate-2-sulfatase [Glaciecola sp. HTCC2999]
          Length = 1028

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 52/173 (30%), Positives = 84/173 (48%), Gaps = 29/173 (16%)

Query: 70  NDYYPD-LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVT 124
           +D+Y D  +A D+   LR+ +      P ++ + F  PH P ++  +Y +M+    F V 
Sbjct: 188 DDFYKDGKVARDADIALREHQND----PFLMFVGFSKPHLPFNAPKKYWDMYQRDQFTVP 243

Query: 125 SHHTPAYDYAPNPDKQWILQVT-----RKMQPVHRQ-FTDLLMTKRL-----QTLQSVDD 173
           S  TP        D  + L +T     R    + ++ +TD  +TK L      T+  +D 
Sbjct: 244 SRKTP--------DNMFRLALTKWNELRMYGGIPKEGYTDDELTKTLIHAYYATVSYMDA 295

Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
            V K++  L +LG   NT +I+ SDHGY LG++G    K    E D RVP ++
Sbjct: 296 QVGKVLNTLDELGLRENTTVIFMSDHGYKLGEYG-AWNKHTNMELDTRVPLII 347


>gi|154490064|ref|ZP_02030325.1| hypothetical protein PARMER_00293 [Parabacteroides merdae ATCC
           43184]
 gi|423723059|ref|ZP_17697212.1| hypothetical protein HMPREF1078_01272 [Parabacteroides merdae
           CL09T00C40]
 gi|154089213|gb|EDN88257.1| arylsulfatase [Parabacteroides merdae ATCC 43184]
 gi|409241484|gb|EKN34252.1| hypothetical protein HMPREF1078_01272 [Parabacteroides merdae
           CL09T00C40]
          Length = 524

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 51/202 (25%), Positives = 89/202 (44%), Gaps = 38/202 (18%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED------------SAPQYANM--F 120
           DL+ + ++R++R   +    +P M+   F A H P D            + P+ ANM  F
Sbjct: 183 DLVTDKTIRWIRSRDKD---RPFMMCCHFKATHEPWDFPERLKHIYDGVTFPEPANMMEF 239

Query: 121 FNVTSHHT----PAYDYA-------PNPDKQWI--------LQVTRKMQPVHRQFTDLLM 161
               S  T    P  D A        +PD  W         ++   K +   R+    L+
Sbjct: 240 GPEESGRTFPGQPLEDMARRWGIASKDPDSWWCKYPELPFSIEGMDK-EDARRKIYQKLI 298

Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
              L+   ++DD + ++++ L + G   NT ++Y SD GY LG+ G    K   +E  +R
Sbjct: 299 RDYLRCGAAIDDNIGRLLRFLDEEGLSENTIVVYVSDQGYFLGEHGFFD-KRMMYEEPLR 357

Query: 222 VPFLMRGPGIVPGTMYLLDVLI 243
           +PF++R P  +P      D+++
Sbjct: 358 MPFVIRYPKEIPAGTRNSDIIL 379


>gi|110799572|ref|YP_694684.1| arylsulfatase [Clostridium perfringens ATCC 13124]
 gi|123345068|sp|Q0TUK6.1|SULF_CLOP1 RecName: Full=Arylsulfatase; AltName: Full=Cys-type sulfatase
 gi|110674219|gb|ABG83206.1| sulfatase family protein [Clostridium perfringens ATCC 13124]
          Length = 481

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 73/155 (47%), Gaps = 5/155 (3%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           + N+S+ FLR+   S   KP  L MSF  PH P D    Y +M+ +         D+A  
Sbjct: 183 VVNESIDFLRRKDPS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239

Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
            D++   +    ++ + +++           ++  +D  + + +  L + GELNNT  ++
Sbjct: 240 EDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLF 299

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
            SDHG  +G     + K  P+E   RVPF +  PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSSRVPFFIYDPG 333


>gi|449135790|ref|ZP_21771228.1| arylsulfatase A [Rhodopirellula europaea 6C]
 gi|448885592|gb|EMB16025.1| arylsulfatase A [Rhodopirellula europaea 6C]
          Length = 476

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 31/72 (43%), Positives = 47/72 (65%), Gaps = 4/72 (5%)

Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG----LVKGKSFPFEFDIRV 222
           T+Q +D +  +I++ LK+LG  NNT +I+TSD+G + G+ G    L  GKS   E  +RV
Sbjct: 256 TIQELDWSTGEILRSLKELGLDNNTLVIFTSDNGPNTGKGGSAGPLKGGKSSTLEGGVRV 315

Query: 223 PFLMRGPGIVPG 234
           PF+ R PG++P 
Sbjct: 316 PFVARWPGMIPA 327


>gi|269104927|ref|ZP_06157623.1| choline-sulfatase [Photobacterium damselae subsp. damselae CIP
           102761]
 gi|268161567|gb|EEZ40064.1| choline-sulfatase [Photobacterium damselae subsp. damselae CIP
           102761]
          Length = 517

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 45/166 (27%), Positives = 76/166 (45%), Gaps = 16/166 (9%)

Query: 68  YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGPEDSAPQ-YANMFFNVT 124
           + N + P+  A+ ++ +LR     +  ++KP  LV+S   PH P D  PQ Y + F   +
Sbjct: 218 HINQWSPEHEADVAINYLRNENGKYRDAEKPFTLVVSMNPPHSPYDQVPQKYLDRFKGQS 277

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
           S           P+ QW  +      P +         + +  +  VD+   +IV EL  
Sbjct: 278 SKSLNT-----RPNVQWDKEYLEGYGPNY-------FKEYMAMVNGVDEQFGRIVDELDR 325

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           L   N+T +++ SDHG  +G  G    K+  +E  +R+P + R PG
Sbjct: 326 LNLANDTLVVFFSDHGCCMGSNGQPT-KNVHYEEAMRIPMIFRWPG 370


>gi|339502714|ref|YP_004690134.1| choline-sulfatase BetC [Roseobacter litoralis Och 149]
 gi|338756707|gb|AEI93171.1| choline-sulfatase BetC [Roseobacter litoralis Och 149]
          Length = 502

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 54/204 (26%), Positives = 98/204 (48%), Gaps = 18/204 (8%)

Query: 73  YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP--- 129
           Y D +A ++ R L    +    +P  L +SF  PH P  +  +Y +++ +  +H  P   
Sbjct: 162 YDDEVAFNATRKLYDLSRGADARPWCLTVSFTHPHDPYVARKKYWDLYEDC-AHLQPEVA 220

Query: 130 AYDYAP-NPDKQWILQVT--RKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKD 184
           A+DYA  +P  Q I      R      R   D+  ++R     +  +DD + +I++ L+D
Sbjct: 221 AFDYADQDPHSQRIFDANDWRSFDITKR---DIERSRRAYFANISYLDDKIGEILQTLED 277

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG--TMYLLDV- 241
             +     I++ SDHG  LG+ GL    SF FE   RVP ++  P + PG  T  + ++ 
Sbjct: 278 TRQ--EAIILFVSDHGDMLGERGLWFKMSF-FEGSSRVPLMISAPQMEPGLQTTAVSNID 334

Query: 242 LIPQVRKFSSGSLIFIMSFLINLN 265
           + P +   +  S+  +M++   ++
Sbjct: 335 VCPTLCDLAGVSMDEVMAWTTGVS 358


>gi|423346509|ref|ZP_17324197.1| hypothetical protein HMPREF1060_01869 [Parabacteroides merdae
           CL03T12C32]
 gi|409219660|gb|EKN12620.1| hypothetical protein HMPREF1060_01869 [Parabacteroides merdae
           CL03T12C32]
          Length = 524

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 51/202 (25%), Positives = 89/202 (44%), Gaps = 38/202 (18%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED------------SAPQYANM--F 120
           DL+ + ++R++R   +    +P M+   F A H P D            + P+ ANM  F
Sbjct: 183 DLVTDKTIRWIRSRDKD---RPFMMCCHFKATHEPWDFPERLKHIYDGVTFPEPANMMEF 239

Query: 121 FNVTSHHT----PAYDYA-------PNPDKQWI--------LQVTRKMQPVHRQFTDLLM 161
               S  T    P  D A        +PD  W         ++   K +   R+    L+
Sbjct: 240 GPEESGRTFPGQPLEDMARRWGIASKDPDSWWCKYPELPFSIEGMGK-EDARRKIYQKLI 298

Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
              L+   ++DD + ++++ L + G   NT ++Y SD GY LG+ G    K   +E  +R
Sbjct: 299 RDYLRCGAAIDDNIGRLLRFLDEEGLSENTIVVYVSDQGYFLGEHGFFD-KRMMYEEPLR 357

Query: 222 VPFLMRGPGIVPGTMYLLDVLI 243
           +PF++R P  +P      D+++
Sbjct: 358 MPFVIRYPKEIPAGTRNSDIIL 379


>gi|423342537|ref|ZP_17320251.1| hypothetical protein HMPREF1077_01681 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409217454|gb|EKN10430.1| hypothetical protein HMPREF1077_01681 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 524

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 51/202 (25%), Positives = 90/202 (44%), Gaps = 38/202 (18%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED------------SAPQYANM--F 120
           DL+ + +++++R   ++   +P M+   F A H P D            + P+ ANM  F
Sbjct: 183 DLVTDKTIQWIRSRDKN---QPFMMCCHFKATHEPWDFPERLKHLYDGVTFPEPANMMEF 239

Query: 121 FNVTSHHT----PAYDYA-------PNPDKQWI--------LQVTRKMQPVHRQFTDLLM 161
               S  T    P  D A        +PD  W         ++   K +   R+    L+
Sbjct: 240 GPEESGRTFPGQPLEDMARRWGIASKDPDSWWCKYPELPFSIEGMNK-EDARRKIYQKLI 298

Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
              L+   ++DD + ++++ L + G   NT ++Y SD GY LG+ G    K   FE  +R
Sbjct: 299 RDYLRCGAAIDDNIGRLLRFLDEEGLSKNTIVVYVSDQGYFLGEHGFFD-KRMMFEEPLR 357

Query: 222 VPFLMRGPGIVPGTMYLLDVLI 243
           +PF++R P  +P      D+++
Sbjct: 358 MPFVIRYPKEIPAGTRNSDIIL 379


>gi|372210565|ref|ZP_09498367.1| mucin-desulfating sulfatase [Flavobacteriaceae bacterium S85]
          Length = 493

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 66/267 (24%), Positives = 102/267 (38%), Gaps = 63/267 (23%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNS---KYYNYSINMNGRKIKHGEDYYNDYYPDL 76
           GY    + K++    P G+  +  L        YYN      G K+   E  Y+     +
Sbjct: 88  GYKTAVIGKWHLGTAPTGFDYYKVLFNKEGQGSYYNPVFETTGNKLVKEEGKYS---TTV 144

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT------------ 124
           I N+++ +L+  KQ  +K+P ML+  F APH P +  P Y +   +V             
Sbjct: 145 IKNNAISWLK--KQKDAKEPFMLLYQFKAPHRPWEPGPGYEDYLKDVEIPYPTTFNDDYK 202

Query: 125 ------------SHHTPAYDYAPNPDKQWILQV------------------------TRK 148
                         H    D    P K    Q                          RK
Sbjct: 203 GRRAAEEAWMRIDGHMNRKDVKIKPPKSLTEQELIDWNKYGNEDGEFWTPDDSMSDQERK 262

Query: 149 MQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGL 208
                R   D L     + ++ VDDA+ +++  L + G   NT ++YTSD G+++G+ G 
Sbjct: 263 NWKYQRYVKDYL-----RCVKGVDDAIGEMLNYLDESGLAENTIVVYTSDQGFYVGEHGW 317

Query: 209 VKGKSFPFEFDIRVPFLMRGP-GIVPG 234
              K F +E  + +P LMR P  I PG
Sbjct: 318 FD-KRFIYEESLHMPLLMRYPKKIKPG 343


>gi|218258523|ref|ZP_03474879.1| hypothetical protein PRABACTJOHN_00534 [Parabacteroides johnsonii
           DSM 18315]
 gi|218225399|gb|EEC98049.1| hypothetical protein PRABACTJOHN_00534 [Parabacteroides johnsonii
           DSM 18315]
          Length = 524

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 51/202 (25%), Positives = 90/202 (44%), Gaps = 38/202 (18%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED------------SAPQYANM--F 120
           DL+ + +++++R   ++   +P M+   F A H P D            + P+ ANM  F
Sbjct: 183 DLVTDKTIQWIRSRDKN---QPFMMCCHFKATHEPWDFPERLKHLYDGVTFPEPANMMEF 239

Query: 121 FNVTSHHT----PAYDYA-------PNPDKQWI--------LQVTRKMQPVHRQFTDLLM 161
               S  T    P  D A        +PD  W         ++   K +   R+    L+
Sbjct: 240 GPEESGRTFPGQPLEDMARRWGIASKDPDSWWCKYPELPFSIEGMNK-EDARRKIYQKLI 298

Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
              L+   ++DD + ++++ L + G   NT ++Y SD GY LG+ G    K   FE  +R
Sbjct: 299 RDYLRCGAAIDDNIGRLLRFLDEEGLSKNTIVVYVSDQGYFLGEHGFFD-KRMMFEEPLR 357

Query: 222 VPFLMRGPGIVPGTMYLLDVLI 243
           +PF++R P  +P      D+++
Sbjct: 358 MPFVIRYPKEIPAGTRNSDIIL 379


>gi|266621313|ref|ZP_06114248.1| N-acetylglucosamine-6-sulfatase, partial [Clostridium hathewayi DSM
           13479]
 gi|288867025|gb|EFC99323.1| N-acetylglucosamine-6-sulfatase [Clostridium hathewayi DSM 13479]
          Length = 428

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 49/170 (28%), Positives = 76/170 (44%), Gaps = 23/170 (13%)

Query: 91  SFSKKPIMLVMSFPAPHGPEDSAP-QYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKM 149
           S  ++P  L +S+  PH P +  P +Y N + N+        D  PN + +     T + 
Sbjct: 183 SVKEQPFALFLSWNPPHPPFERVPKEYYNRYRNLEP------DLPPNVEGERFDNQTGEP 236

Query: 150 QPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLV 209
               R+     +      +  +D+   +IV  LK++   + T ++ T+DHG HLG  G V
Sbjct: 237 GFGSREELAEAVRCYYAAITGLDEQFGRIVSWLKEMELYDQTIVLLTADHGEHLGAHGYV 296

Query: 210 KGKSFPFEFDIRVPFLMRGP---------------GIVPGTMYLLDVLIP 244
            GK   +E  I +PFLMR P                IVP  + LLD+ IP
Sbjct: 297 -GKHTWYEESINIPFLMRYPEKLPAGRNDISVETVDIVPTLLGLLDIAIP 345


>gi|149197958|ref|ZP_01875006.1| putative sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149138870|gb|EDM27275.1| putative sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 502

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 48/171 (28%), Positives = 73/171 (42%), Gaps = 28/171 (16%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF--------------FN 122
             N S+ FL    Q    +P  L MSF  PH P   AP+  +M+              F 
Sbjct: 178 TGNKSLEFLENRDQD---RPFFLKMSFQRPHDPITPAPEDFDMYNPEDIVLPESISDLFE 234

Query: 123 VTSHHTPAY--DYAPNP-DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
                 P +  DY  NP D    +    K++     +  L+          +D+ + +++
Sbjct: 235 NKFVGKPQFMQDYVANPGDYPMCVADEAKLKRALASYYALIT--------KIDEEIGRVI 286

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
             LK+ GE +NT I YT+DHG   G+ GL       +E   R+PFL++ PG
Sbjct: 287 DHLKETGEYDNTIIFYTADHGDFAGEHGLFLKNLGIYESIHRIPFLLKWPG 337


>gi|383115597|ref|ZP_09936353.1| hypothetical protein BSGG_2533 [Bacteroides sp. D2]
 gi|313694998|gb|EFS31833.1| hypothetical protein BSGG_2533 [Bacteroides sp. D2]
          Length = 515

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 40/162 (24%), Positives = 78/162 (48%), Gaps = 7/162 (4%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
            + +P  I N+++ ++ + K++    P  + +SFP PH P      Y +MF    +    
Sbjct: 177 EEQHPTKIVNEALAWIEKQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 232

Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
           T   D A   +K  IL ++     P   Q    +    +  ++ +DD ++++++ LK  G
Sbjct: 233 TSRKDLAKKGEKYRILAELEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 292

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           +  NT  +  SDHG + G++GL++  +   E   R+P +  G
Sbjct: 293 QYENTIFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 334


>gi|373461994|ref|ZP_09553727.1| hypothetical protein HMPREF9944_01991 [Prevotella maculosa OT 289]
 gi|371950171|gb|EHO68029.1| hypothetical protein HMPREF9944_01991 [Prevotella maculosa OT 289]
          Length = 476

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 61/246 (24%), Positives = 100/246 (40%), Gaps = 35/246 (14%)

Query: 14  YFDISHGYFGKYLNKYNGSYIPPGWREW--GALIMNSKYYNYSINMNGRKIKH------- 64
           Y+ +S+G    ++  +  S+    WR    G  +  ++Y  + + +N    +H       
Sbjct: 123 YYTVSNGKIMHHIEDHAASWSEKPWRSHPDGYDVYWAEYNKWELWLNSESGRHINPKTMR 182

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSK-----KPIMLVMSFPAPHGPEDSAPQYANM 119
           G  Y     PD  A D  R L ++ +   +     KP  L   F  PH P  +  +Y ++
Sbjct: 183 GPFYEWADVPD-SAYDDGRVLGKTIEDLRRLRDNGKPFFLACGFWKPHLPFCAPKKYWDL 241

Query: 120 FFNVTSHHTPAYDYAPNP-----------DKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
           +        PA   A NP           + + I        P  + F  L+ T     +
Sbjct: 242 Y--------PALPMADNPFRPDGLPEEVQNSKEINAYALATAPDDKAFLRLVKTGYYACI 293

Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
             VD    +++  L+ LG  +NT +I   DHG+ LG+ G V GK    +   RVP L+R 
Sbjct: 294 SYVDALFGELMTALETLGLKDNTAVILLGDHGWDLGEHGFV-GKHNLMDITTRVPLLIRV 352

Query: 229 PGIVPG 234
           PG+  G
Sbjct: 353 PGLRGG 358


>gi|410632053|ref|ZP_11342721.1| choline-sulfatase [Glaciecola arctica BSs20135]
 gi|410148339|dbj|GAC19588.1| choline-sulfatase [Glaciecola arctica BSs20135]
          Length = 493

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 48/179 (26%), Positives = 83/179 (46%), Gaps = 19/179 (10%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           ++IA+ +V +L + K   + KP ++ + F APH    S   Y + +         A+   
Sbjct: 193 EIIADAAVEYL-EKKPDVNDKPFLMYVGFLAPHDTRQSPKSYVDKYPQEAIPLPKAFKKQ 251

Query: 135 PNPDK-------QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
              D+       + +L + RK   V +      + +    +  +D  V +I+  L+    
Sbjct: 252 NTVDQGDFWVRDEVLLDIPRKPDEVKK-----FIGEYYAMINHMDTQVGRILDSLEKSPY 306

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYLLDV 241
             NT II+TSDHG  +G+ GL+ GK   ++  IR PF++ G GI     + G +YL  V
Sbjct: 307 AENTIIIFTSDHGLAVGKHGLL-GKQNQYDHSIRAPFIIAGKGIDKGQKIKGNIYLNSV 364


>gi|398350561|ref|YP_006396025.1| choline-sulfatase BetC [Sinorhizobium fredii USDA 257]
 gi|390125887|gb|AFL49268.1| choline-sulfatase BetC [Sinorhizobium fredii USDA 257]
          Length = 511

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 44/165 (26%), Positives = 79/165 (47%), Gaps = 9/165 (5%)

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
            +AN  +  L +      ++P  L +SF  PH P  +  ++ +++ +   H  P     P
Sbjct: 169 FLANQKLYQLSRENDDEGRRPWCLTVSFTHPHDPYVARRKFWDLYEDC-EHLLPEVGALP 227

Query: 136 ----NPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELN 189
               +P  Q I+ ++   +       ++  ++R     +  +D+ V ++V  L     L+
Sbjct: 228 FDRQDPHSQRIM-LSCDYKNFKLTEENIRQSRRAYFANISYLDEKVGELVDTLTRTRMLD 286

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           NT I++ SDHG  LG+ GL    +F FE   RVP ++ GPG+ PG
Sbjct: 287 NTLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGPGVAPG 330


>gi|149196556|ref|ZP_01873610.1| mucin-desulfating sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149140236|gb|EDM28635.1| mucin-desulfating sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 545

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 64/250 (25%), Positives = 107/250 (42%), Gaps = 45/250 (18%)

Query: 1   MAWPVELTSRNLLYFDISHGYFGKYLNKYNGSYIPPGWREWGALIMNS--KYYNYS---- 54
           M++PV L  RN       +GYF  ++ K+        + E+  L ++S  ++Y ++    
Sbjct: 103 MSYPVIL--RN-------NGYFTGFIGKFGFPVNAKNYHEYENLPIDSFDRWYGWTGQGY 153

Query: 55  --INMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS 112
              + N   +K  ++Y   +     A  +  F+ ++++    KP  L +SF A H P   
Sbjct: 154 FDTSKNKYMVKFAKEY--PHVTLATAEAACEFIDEAQKQ--DKPFCLSLSFKASHKPFSP 209

Query: 113 APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP---VHRQFTDL---------- 159
            P Y +++ +        YD              RK+ P   + RQ+  +          
Sbjct: 210 DPAYDDVYKDTVWKKRANYDEGG----------ARKLPPQAKLGRQYLTIDDFAPEKYQE 259

Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
            M K  Q +  +D AV KIV++L   G   NT IIY +D+GY  G  G   GK  P+E  
Sbjct: 260 SMRKYNQLIYGIDQAVGKIVEKLDQTGLSKNTVIIYATDNGYSCGSHGF-GGKVLPYEGP 318

Query: 220 IRVPFLMRGP 229
            R P ++  P
Sbjct: 319 ARGPMIIMDP 328


>gi|150398338|ref|YP_001328805.1| sulfatase [Sinorhizobium medicae WSM419]
 gi|150029853|gb|ABR61970.1| sulfatase [Sinorhizobium medicae WSM419]
          Length = 489

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 44/67 (65%), Gaps = 1/67 (1%)

Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
           +Q  D+ + +++  LK++G +++T I+Y SDHG H+G+ GL    +F FE  ++ P +MR
Sbjct: 255 VQRTDEMIGQVLDALKEIGAMDDTLIVYASDHGDHVGERGLWWKHTF-FEESVKFPLVMR 313

Query: 228 GPGIVPG 234
            PG +P 
Sbjct: 314 LPGAIPA 320


>gi|168187234|ref|ZP_02621869.1| probable sulfatase [Clostridium botulinum C str. Eklund]
 gi|169294856|gb|EDS76989.1| probable sulfatase [Clostridium botulinum C str. Eklund]
          Length = 504

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 46/185 (24%), Positives = 81/185 (43%), Gaps = 26/185 (14%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----------- 120
           +Y + IA  +   L Q K +   +   L  SF  PH P      +A+M+           
Sbjct: 193 HYNNWIAERTNALLEQYKDN--DENFFLWSSFFDPHPPYFVPEPWASMYNPDDITLPHYL 250

Query: 121 ----------FNVTSHHTPAY-DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQ 169
                        T    P + +Y    DK W+      +Q       D+ +   + ++ 
Sbjct: 251 DGEHLTTSPILEKTQEENPDFSEYYEPKDKNWLHGCHSHLQDKEELKKDIAIYYGMVSM- 309

Query: 170 SVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
            +D  + KI+ +L++LG   +T +++TSDHG+  GQ  L+    F +E  I++P ++R P
Sbjct: 310 -MDHYIGKILDKLEELGMAEDTIVVFTSDHGHFFGQHNLIAKGPFHYEDMIKIPLIVREP 368

Query: 230 GIVPG 234
           G+VP 
Sbjct: 369 GVVPA 373


>gi|429757503|ref|ZP_19290038.1| arylsulfatase [Actinomyces sp. oral taxon 181 str. F0379]
 gi|429174942|gb|EKY16401.1| arylsulfatase [Actinomyces sp. oral taxon 181 str. F0379]
          Length = 437

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/146 (28%), Positives = 67/146 (45%), Gaps = 11/146 (7%)

Query: 96  PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP---- 151
           P  L +SF  PH P D  PQ+A   ++      P  D   N   + + +  R   P    
Sbjct: 142 PFFLYLSFHRPHAPLD-PPQWAFDSYDQDELTLPGED---NWSDELMSEERRDWDPTALA 197

Query: 152 VHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLV 209
            H +  D+ + +      +  +D  + + ++ L + G   NT I + SDHG  +G  GL 
Sbjct: 198 AHYRERDVRLARAGYYGHMTHIDSQISRFIQTLGEFGLAENTVIAFVSDHGDMMGDHGLW 257

Query: 210 KGKSFPFEFDIRVPFLMRGPGIVPGT 235
           + K +P+E    VPF++ G GI PG 
Sbjct: 258 R-KGYPYEASSHVPFILYGVGISPGA 282


>gi|428313711|ref|YP_007124688.1| arylsulfatase A family protein [Microcoleus sp. PCC 7113]
 gi|428255323|gb|AFZ21282.1| arylsulfatase A family protein [Microcoleus sp. PCC 7113]
          Length = 478

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 53/185 (28%), Positives = 79/185 (42%), Gaps = 32/185 (17%)

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT--SHHTPAYDYAP 135
           A+++ RFL    QS +++P  L +SF   H P          +F V   + HT +    P
Sbjct: 175 ASEAARFL----QSTAQQPFFLYVSFYETHSP---------FWFPVEFRNRHTASEFTVP 221

Query: 136 NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
               +   Q+    + + R     +        + +D  V  +++ L   G  +NT +I+
Sbjct: 222 ELTAEDFEQIPAVFRNLTRPEKRGIQAAYYTCTEFMDKNVGLVLEALDQSGHADNTLVIF 281

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG----------------IVPGTMYLL 239
           TSDHGY LGQ G  + K   FE  +RV  LMR PG                I+P    L 
Sbjct: 282 TSDHGYLLGQHGRFE-KHCCFEGAVRVALLMRFPGVIVSGQASNALVELIDIIPTIFSLC 340

Query: 240 DVLIP 244
           DV IP
Sbjct: 341 DVPIP 345


>gi|171915403|ref|ZP_02930873.1| choline-sulfatase [Verrucomicrobium spinosum DSM 4136]
          Length = 469

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 50/174 (28%), Positives = 85/174 (48%), Gaps = 5/174 (2%)

Query: 59  GRKIKH-GEDYYNDYYPDLIAND-SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY 116
           G + +H GE    D + D++  D +++FL++ + + + KP  L + F APH P  + P+Y
Sbjct: 149 GEEFEHAGEHADMDEFVDVVRRDHAIKFLKE-RGADNNKPFFLTIGFIAPHYPLVAPPEY 207

Query: 117 ANMFFN-VTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAV 175
              F + V     P       P     L+  RK + V        +      ++ +DD +
Sbjct: 208 LEHFRDKVPFPEVPPGYVDTLPLNYRHLRNDRKFERVPPALAKRALEGYYARVEWIDDQI 267

Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
             +++ LK+     NT +IYTSDHG +LG+ GL   K+  F+   RVP ++  P
Sbjct: 268 GMVLEALKNSRFAENTVVIYTSDHGENLGEHGLWW-KNCMFDSGARVPLIVSWP 320


>gi|171913979|ref|ZP_02929449.1| sulfatase family protein [Verrucomicrobium spinosum DSM 4136]
          Length = 479

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 47/161 (29%), Positives = 80/161 (49%), Gaps = 17/161 (10%)

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF--FNVTSHHTPAYDYAP 135
           A + ++ L   K     KP  L + F  PH P  +  +Y +++    +    TPA  +A 
Sbjct: 209 AAEGIKLLEAHKD----KPFFLAVGFYRPHTPYVAPKKYFDLYPLDQIPLPSTPA-GFAE 263

Query: 136 NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQS----VDDAVEKIVKELKDLGELNNT 191
              K  I   T+K +   ++ TD    + +Q   +    +D  V +++  ++ L    NT
Sbjct: 264 QTPKPAI--TTKKNE---QEMTDQQRQEAIQAYYAATSYMDAQVGRLLDAVERLDLAKNT 318

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
            I++TSDHGYHLG++G+ + +S  FE   RVPF++  PG  
Sbjct: 319 VIVFTSDHGYHLGEYGMWQKQSL-FEQSARVPFIISAPGAA 358


>gi|266619688|ref|ZP_06112623.1| mucin-desulfating sulfatase [Clostridium hathewayi DSM 13479]
 gi|288868740|gb|EFD01039.1| mucin-desulfating sulfatase [Clostridium hathewayi DSM 13479]
          Length = 468

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 46/191 (24%), Positives = 93/191 (48%), Gaps = 23/191 (12%)

Query: 50  YYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
           YYN  +  NG+KI+        Y  D+I ++++ F+ + K    ++P  L + + APH P
Sbjct: 139 YYNAPMFRNGQKIEE-----KGYITDVITDEAISFIDREKNK--EQPFYLSVHYTAPHSP 191

Query: 110 --EDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT 167
                  +Y +++ +      P  +  P    + I    +  + +   F           
Sbjct: 192 WINCHPKKYTDLYEDCPFETCPQGEVHPWAKTEVIAGYQKPRESLIGYFA---------A 242

Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGL-VKGK-SFP---FEFDIRV 222
           + ++DD V +I+K+L++   + +T II++SD+G++ G  G+  KG  +FP   ++  ++V
Sbjct: 243 VTAMDDNVGRILKKLEEENLMEDTLIIFSSDNGFNCGHHGIWGKGNGTFPLNMYDSSVKV 302

Query: 223 PFLMRGPGIVP 233
           P +M   G +P
Sbjct: 303 PLIMCHKGHIP 313


>gi|109897122|ref|YP_660377.1| sulfatase [Pseudoalteromonas atlantica T6c]
 gi|109699403|gb|ABG39323.1| sulfatase [Pseudoalteromonas atlantica T6c]
          Length = 549

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 58/250 (23%), Positives = 98/250 (39%), Gaps = 53/250 (21%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRK-IKHGEDYYNDYYPDLIA 78
           GY    + K++    P  +  +  L     Y++      G K     E  Y  +  D++ 
Sbjct: 136 GYETAIIGKWHLKAEPGAFDYYQVLESQGTYFDPEFRTRGPKPWPENETQYTGHSSDVVT 195

Query: 79  NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
           + S+ +L   +   + KP  L+  F APH     AP+Y +           A +  P PD
Sbjct: 196 DLSIEWL---ENRVANKPFFLMHQFKAPHDMFKYAPRYEDFL---------AAETIPEPD 243

Query: 139 KQWILQVT---------------------------RKM------------QPVHRQFTDL 159
             + +  T                           R M            +   RQ    
Sbjct: 244 DLYAVAKTFGSIATRGKNDTLRADIGTSVSRRNNRRSMGIDLGVDPNLSEEEFTRQAYQK 303

Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
            +   L+ ++ VDD V ++++ L+D G+  NT IIYTSD G  LG+  L + K + F+  
Sbjct: 304 YLKAYLRCVKGVDDNVARLIQTLRDTGQYKNTIIIYTSDQGMMLGEHDL-QDKRWIFDES 362

Query: 220 IRVPFLMRGP 229
           IR+P +++ P
Sbjct: 363 IRMPLIVKHP 372


>gi|334346371|ref|YP_004554923.1| sulfatase [Sphingobium chlorophenolicum L-1]
 gi|334102993|gb|AEG50417.1| sulfatase [Sphingobium chlorophenolicum L-1]
          Length = 570

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 55/235 (23%), Positives = 110/235 (46%), Gaps = 31/235 (13%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY---------NYSINMNGRKIKHGEDYYN 70
            +FGK+    +     PG+ +W +      Y+         +  +N++G+ +        
Sbjct: 138 AFFGKWHMGASTDAPRPGFDKWVSFKGQGNYWPARAATPESSPDLNVDGKAVPQ-----K 192

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV------T 124
            Y  D + + ++ +L  SK+   K+P  L +S    H      P+YA+ + +       +
Sbjct: 193 GYITDELTDYAMNWL--SKERDPKRPFFLYLSHKGVHSDPLPPPRYAHQYDDAKFTLPAS 250

Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF-TDLLMTKRLQ----TLQSVDDAVEKIV 179
           + +TP  + A  P   W+L        +   +  D+ MT+ L+    TL ++DD++ +I+
Sbjct: 251 AANTPENN-AGKP--VWVLNQRNSWHGIDFFYNADVPMTEYLKYYYGTLSAIDDSLGRIM 307

Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
             L+      +T +++TSD+G+ +G+ GL+  K   +E  +RVP ++  PG VP 
Sbjct: 308 AYLRKNHLEKDTLVVFTSDNGFQVGEHGLID-KRNAYEASVRVPLVVWEPGTVPA 361


>gi|336399820|ref|ZP_08580620.1| sulfatase [Prevotella multisaccharivorax DSM 17128]
 gi|336069556|gb|EGN58190.1| sulfatase [Prevotella multisaccharivorax DSM 17128]
          Length = 514

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 66/273 (24%), Positives = 105/273 (38%), Gaps = 72/273 (26%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GY    + K++ + +P G+  W  LI    YYN     NG++I+        Y  +++ +
Sbjct: 117 GYQTAMIGKWHLTSLPTGFDYWDILIGQGDYYNPDFIRNGKRIRRP-----GYVTNIVTD 171

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPH---GPEDS------------------------ 112
            ++ +L  S +    KP  L++   APH    P+                          
Sbjct: 172 VALDWL--SNKRDKSKPFCLLLHNKAPHRVWAPDTCDLDLYNDRKYPMPDNFYDNYEGRL 229

Query: 113 APQYANMFFNVTSHHTPAYD---YAPN---PDKQ---WILQVTRKMQPVHRQFTDL---- 159
           A Q   M  N+     P YD   Y P+    DK+   W      +M P  R   D     
Sbjct: 230 AAQKQKM--NIIKDMNPVYDLKMYDPDVHTNDKELETWGFNNYNRMSPSQRAQWDRHYKP 287

Query: 160 ----------------------LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
                                  M    + + S+D  V ++++ LKD   L+NT I+YTS
Sbjct: 288 IIEKFKRDSLKGKALAEWKYQRYMHDYCRVITSIDRNVGRVIQYLKDNNLLDNTMIVYTS 347

Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           D G+++G+ G    K F +E   R P L+  PG
Sbjct: 348 DQGFYMGEHGFFD-KRFMYEESFRTPLLVYIPG 379


>gi|414344124|ref|YP_006985645.1| hypothetical protein B932_3169 [Gluconobacter oxydans H24]
 gi|411029459|gb|AFW02714.1| hypothetical protein B932_3169 [Gluconobacter oxydans H24]
          Length = 509

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 49/166 (29%), Positives = 75/166 (45%), Gaps = 20/166 (12%)

Query: 76  LIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
            +   ++RFLR  +  F   +KP  L + +   H P    P        V S+   ++  
Sbjct: 187 FLTEQAIRFLRSEQGLFRSDRKPFFLFVGYTDTHSPHSDMPPEL-----VESYAGASFID 241

Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ---TLQSVDDAVEKIVKELKDLGELNN 190
            P      I  V  +  PV R  +     KR+Q      ++D  V +I+ EL  LGE +N
Sbjct: 242 IPKEQFSAIHGVAER--PVDRN-SGTEREKRMQYYAAATAIDREVGRILDELDHLGEADN 298

Query: 191 TYIIYTSDHGYHLGQFGL------VKGKSFPFEFDIRVPFLMRGPG 230
           T+I+YTSDHG + G  G+       + ++F FE  IRVP  +  P 
Sbjct: 299 TFIVYTSDHGLNAGHHGIWEKGNATRPQNF-FEESIRVPCTISWPA 343


>gi|429856689|gb|ELA31586.1| mucin-desulfating sulfatase (n-acetylglucosamine-6-sulfatase)
           [Colletotrichum gloeosporioides Nara gc5]
          Length = 875

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 49/83 (59%), Gaps = 1/83 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M + L+ +QSVDD V +++  L+D G  +NT +IYTSD G+ LG+ G    K F +E   
Sbjct: 287 MQRYLRVIQSVDDNVGRMLNYLEDNGLADNTIVIYTSDQGFFLGEHGWFD-KRFMYEESF 345

Query: 221 RVPFLMRGPGIVPGTMYLLDVLI 243
           ++PFL+R P  +       D++ 
Sbjct: 346 QMPFLIRYPAAIAKGAVCDDIIC 368


>gi|445061849|ref|ZP_21374332.1| sulfatase [Brachyspira hampsonii 30599]
 gi|444506776|gb|ELV07055.1| sulfatase [Brachyspira hampsonii 30599]
          Length = 474

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 74/145 (51%), Gaps = 8/145 (5%)

Query: 96  PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWIL-QVTRKMQPVHR 154
           P+ + +SF  PH P D   +Y +++ ++      +     +   +++  Q  R   P   
Sbjct: 197 PLFMWVSFVDPHHPFDPIKKYYDIYKDINPKELNSKLKLDSRRPEYLTKQGDRGYWPGGG 256

Query: 155 QFTD-----LLMTKRL--QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG 207
           +  D     +   KRL    +  +D  + +I+++LK+  E +NT I++TSDHG +LG +G
Sbjct: 257 EEHDYSEEEIKEIKRLYYAMISFIDSQIGRIIEKLKEKNEWDNTIIVFTSDHGEYLGDYG 316

Query: 208 LVKGKSFPFEFDIRVPFLMRGPGIV 232
           L++   F ++  I+ P L  G GIV
Sbjct: 317 LLRKGPFMYDCLIKTPLLFYGKGIV 341


>gi|110679304|ref|YP_682311.1| choline sulfatase [Roseobacter denitrificans OCh 114]
 gi|109455420|gb|ABG31625.1| choline sulfatase [Roseobacter denitrificans OCh 114]
          Length = 502

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 54/204 (26%), Positives = 97/204 (47%), Gaps = 18/204 (8%)

Query: 73  YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP--- 129
           Y D +A ++ R L    +    +P  L +SF  PH P  +  +Y +++ +  +H  P   
Sbjct: 162 YDDEVAFNATRKLYDLSRGADARPWCLTVSFTHPHDPYVARKKYWDLYEDC-AHLQPEVA 220

Query: 130 AYDYAP-NPDKQWILQVT--RKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKD 184
           A+DYA  +P  Q I      R      R   D+  ++R     +  +DD +  I++ L+D
Sbjct: 221 AFDYADQDPHSQRIFDANDWRSFDITKR---DIERSRRAYFANISYLDDKIGDILQTLED 277

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG--TMYLLDV- 241
             +     I++ SDHG  LG+ GL    SF FE   RVP ++  P + PG  T  + ++ 
Sbjct: 278 TRQ--EAIILFVSDHGDMLGERGLWFKMSF-FEGSSRVPLMISAPQMEPGLQTTAVSNID 334

Query: 242 LIPQVRKFSSGSLIFIMSFLINLN 265
           + P +   +  S+  +M++   ++
Sbjct: 335 VCPTLCDLAGVSMDEVMAWTTGVS 358


>gi|196230533|ref|ZP_03129395.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196225463|gb|EDY19971.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 497

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 71/149 (47%), Gaps = 17/149 (11%)

Query: 95  KPIMLVMSFPAPHGPEDSAPQYANMF--------FNVTSHHTPAYDYAPNPDKQWILQVT 146
           +P  L ++F APH P    P +   +         N  S H   +D+     +  +L  +
Sbjct: 225 EPFFLHVAFTAPHDPRLLPPGWETKYDPKTMPLPKNFRSVH--PFDHGNMGGRDEVLLAS 282

Query: 147 -RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQ 205
            R+   V  +     +      +  +D+ + +IV+ LK  G+L+NT II+TSD G  +G 
Sbjct: 283 PRRPDEVRAE-----LAAYYAAISGMDEQIGRIVEALKSTGQLDNTLIIFTSDQGLAVGS 337

Query: 206 FGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            GL+ GK   +E  + VP +M GPGI  G
Sbjct: 338 HGLI-GKQNLYEHTLGVPLIMSGPGIPKG 365


>gi|254445484|ref|ZP_05058960.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
 gi|198259792|gb|EDY84100.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
          Length = 495

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 44/195 (22%), Positives = 89/195 (45%), Gaps = 41/195 (21%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
           ++ +++A+D+  F+ Q+  S    P  + ++F A H P  S  ++ + +      +  ++
Sbjct: 200 HWSEVVASDAEDFMLQA--SGRTGPFFMYLAFNAAHDPRQSPKEFVDRYSAESIPIPENY 257

Query: 128 TPAYDY---------------APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVD 172
            P Y Y               AP P  ++ ++V R              ++    +  +D
Sbjct: 258 QPIYPYREEMGSGEGLRDEILAPFPRTEYAVRVHR--------------SEYYAVISHLD 303

Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
             + +I+K L+  G  + TYI +T+DHG   G  GL+ GK   +E  ++ P ++ GPG+ 
Sbjct: 304 AQIGRILKALEASGRRDRTYIFFTADHGLACGHHGLL-GKQNMYEHSMKAPLIVLGPGLP 362

Query: 233 PGT-----MYLLDVL 242
            G      +Y+ D++
Sbjct: 363 GGQRRKAPVYIQDIM 377


>gi|395005731|ref|ZP_10389600.1| arylsulfatase A family protein [Acidovorax sp. CF316]
 gi|394316287|gb|EJE53017.1| arylsulfatase A family protein [Acidovorax sp. CF316]
          Length = 487

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 89/195 (45%), Gaps = 27/195 (13%)

Query: 56  NMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ 115
            M G  I  GE +Y  Y  ++    +V ++R +  +   +P +L +   APH P   AP+
Sbjct: 144 RMLGEHIGAGESHYTAYDREVTGR-AVEWIRDAG-TRPDEPFVLYVGLVAPHFPL-VAPE 200

Query: 116 YANMFFNVTSHHT-------PAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
               FF + S          PA  Y  +P   W+ Q          +F      +R Q  
Sbjct: 201 E---FFALYSSEVMPEPKLQPASGYQRHP---WV-QAYADFMDNEGEFKS--PEERRQAF 251

Query: 169 QS-------VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
            +       +D  V +I + L+D G   +T ++YTSDHG +LG  GL  GKS  ++  +R
Sbjct: 252 VAYYALCSFLDHNVGRITEALRDAGLEGDTTLVYTSDHGDNLGTRGLW-GKSTLYQESVR 310

Query: 222 VPFLMRGPGIVPGTM 236
           VP L+ GPG+ PG  
Sbjct: 311 VPMLVAGPGVQPGVC 325


>gi|373252949|ref|ZP_09541067.1| choline-sulfatase [Nesterenkonia sp. F]
          Length = 527

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 52/196 (26%), Positives = 93/196 (47%), Gaps = 22/196 (11%)

Query: 79  NDSVRFLRQSKQSFSK-KPIMLVMSFPAPHGPEDSAPQYANMFF-----NVTSHHTPAYD 132
           ND VR  + ++    + +P ++V SF  PH P +   ++ + F      +  +   P+ +
Sbjct: 188 NDRVRANQWAEAHGEQHRPFLMVTSFIHPHDPYEPPREHWDRFGEGEIPDPKNPQVPSVE 247

Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKELKDLGELNN 190
             P+  +   +    +  P   +  D+   +R     +  +DD V +I   L+++G   N
Sbjct: 248 EDPHSHRLRAMAGFDERDP---EIEDVRNARRAYYAAVSYIDDHVGRIRAHLEEVGLAEN 304

Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQVRKFS 250
           T II T+DHG  LG+ GL   K  P+E   RVP ++ GP          D L+P+ R  +
Sbjct: 305 TVIIVTADHGDMLGEKGLWY-KMSPYERSSRVPMIVNGP----------DHLVPKGRYRT 353

Query: 251 SGSLIFIMSFLINLNT 266
             SL+ +M  L++L +
Sbjct: 354 PVSLLDLMPTLMDLGS 369


>gi|171914267|ref|ZP_02929737.1| choline sulfatase [Verrucomicrobium spinosum DSM 4136]
          Length = 496

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 47/75 (62%), Gaps = 1/75 (1%)

Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
           L+   L ++  +D  V ++++ L+  G+L+NT I+   DHG+HLG+ G + GK+  +E  
Sbjct: 274 LVRAYLASISFMDSQVGRVLEALQASGQLDNTIIVVWGDHGWHLGEKG-ITGKNTLWERS 332

Query: 220 IRVPFLMRGPGIVPG 234
            RVPF+  GPGI  G
Sbjct: 333 TRVPFIWAGPGITRG 347


>gi|171912751|ref|ZP_02928221.1| putative exported sulfatase [Verrucomicrobium spinosum DSM 4136]
          Length = 541

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 57/256 (22%), Positives = 102/256 (39%), Gaps = 52/256 (20%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GY    + K++    P G+  W  L     YYN  +  NG+ +KH       Y  D+I +
Sbjct: 122 GYQTAIVGKWHLMSDPTGFDYWQILPGQGIYYNPPMIDNGKPVKH-----EGYVTDIITD 176

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA------------PQYANMFFNVTSHH 127
            ++ +L+   ++   KP +++    APH   D A            P+   +F +  +  
Sbjct: 177 LTLDWLKNRDKN---KPFLMMCQHKAPHREWDPALRHLGHDGDRKYPEPETLFDDYENRG 233

Query: 128 TPAYDYAPNPDKQWI-----LQVTRKMQPVHRQFTDLLMTKR------------------ 164
               D     +K +      L     M P  +   D     R                  
Sbjct: 234 VAERDQDMTLEKTFTPKDAKLVAPASMTPEQKAKWDAYYGPRNAEFEKSKPTGKDLVRWR 293

Query: 165 --------LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
                   L  +++VD+ V +++K L D G   +T ++Y++D G++LG+ G    K + F
Sbjct: 294 YQRYMHDYLGCVKAVDEGVARLLKYLDDEGLAEDTIVVYSADQGFYLGEHGWFD-KRWIF 352

Query: 217 EFDIRVPFLMRGPGIV 232
           E  +R P ++R PG+ 
Sbjct: 353 EESLRAPLMVRWPGVA 368


>gi|414164695|ref|ZP_11420942.1| choline-sulfatase [Afipia felis ATCC 53690]
 gi|410882475|gb|EKS30315.1| choline-sulfatase [Afipia felis ATCC 53690]
          Length = 487

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 52/177 (29%), Positives = 83/177 (46%), Gaps = 11/177 (6%)

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
           GE  Y  Y  D+ A  +  +L++    ++ KP +L +S   PH P  + P++   ++N  
Sbjct: 153 GESPYTFYDRDITAR-AQTWLQEEAPKYADKPWVLFVSLVCPHFPLTAPPEHFYRYYNDP 211

Query: 125 SHHTPA-YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR------LQTLQSVDDAVEK 177
              +P  Y+    P+  ++         V+ +F D     R      L     +D+ V K
Sbjct: 212 RLASPKLYEDMHRPNHPYLQDYAGSF--VYDEFFDTPDKLRRGLAGYLGLCSFLDENVGK 269

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           ++  L D    + T +IYTSDHG ++G  GL  GKS  +E    VPF+M GP I  G
Sbjct: 270 LLACLTDTNLSDTTRVIYTSDHGDNMGARGLW-GKSTMYEETAAVPFIMSGPDIPRG 325


>gi|294085689|ref|YP_003552449.1| sulfatase [Candidatus Puniceispirillum marinum IMCC1322]
 gi|292665264|gb|ADE40365.1| Sulfatase [Candidatus Puniceispirillum marinum IMCC1322]
          Length = 513

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 75/156 (48%), Gaps = 10/156 (6%)

Query: 95  KPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
           +P  +V SF  PH P  + P++ +++ +      PAY    +    +  ++   ++    
Sbjct: 189 QPFCMVASFIHPHDPYVARPEWWDLY-DQADIDMPAYTLTRDQQDAFSQRLMDGIEASTV 247

Query: 155 QFT-DLLMTKRLQTLQSV---DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVK 210
             T D +   R     +    D  V  IV+ L + G+L+NT I  T+DHG  LG+ GL  
Sbjct: 248 AVTEDEIRNARHAYYANTSYFDSKVGDIVRTLDETGQLDNTIIFITADHGDMLGERGLWY 307

Query: 211 GKSFPFEFDIRVPFLMRGPGIVPGTMY----LLDVL 242
             +F FE   RVP +M GP +  GT+     L+D+L
Sbjct: 308 KMNF-FEHSGRVPLVMAGPNVAKGTVTSPCSLVDIL 342


>gi|149176616|ref|ZP_01855228.1| choline sulfatase [Planctomyces maris DSM 8797]
 gi|148844495|gb|EDL58846.1| choline sulfatase [Planctomyces maris DSM 8797]
          Length = 479

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 47/167 (28%), Positives = 76/167 (45%), Gaps = 19/167 (11%)

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP--------EDSAPQYANMFFNVTSHHTP 129
           A+ ++ F+ +  Q    KP  L + F APH P        ++  P    +  N    H  
Sbjct: 200 ADAAIEFVERKHQ----KPFFLHVCFTAPHDPLLMPIGYEQNYDPDQMPVPANFLPQH-- 253

Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
            +D+     +   L    + + + +    L  +     +  +D  V +IVK LK  GE  
Sbjct: 254 PFDHGNFDGRDEALLPWPRTKEIVKNDLSLYYS----VISHLDAQVGRIVKALKKTGEWE 309

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           NT +I++SDHG  +G  GL +GK   +E  + VP +M GPGI   T+
Sbjct: 310 NTILIFSSDHGLAMGSHGL-RGKQNMYEHTVNVPLIMVGPGIPADTL 355


>gi|212692494|ref|ZP_03300622.1| hypothetical protein BACDOR_01990 [Bacteroides dorei DSM 17855]
 gi|237709186|ref|ZP_04539667.1| arylsulfatase [Bacteroides sp. 9_1_42FAA]
 gi|265752713|ref|ZP_06088282.1| arylsulfatase [Bacteroides sp. 3_1_33FAA]
 gi|294777453|ref|ZP_06742904.1| arylsulfatase [Bacteroides vulgatus PC510]
 gi|345514224|ref|ZP_08793737.1| arylsulfatase [Bacteroides dorei 5_1_36/D4]
 gi|423230606|ref|ZP_17217010.1| hypothetical protein HMPREF1063_02830 [Bacteroides dorei
           CL02T00C15]
 gi|423240767|ref|ZP_17221881.1| hypothetical protein HMPREF1065_02504 [Bacteroides dorei
           CL03T12C01]
 gi|423244315|ref|ZP_17225390.1| hypothetical protein HMPREF1064_01596 [Bacteroides dorei
           CL02T12C06]
 gi|212664969|gb|EEB25541.1| arylsulfatase [Bacteroides dorei DSM 17855]
 gi|229437205|gb|EEO47282.1| arylsulfatase [Bacteroides dorei 5_1_36/D4]
 gi|229456882|gb|EEO62603.1| arylsulfatase [Bacteroides sp. 9_1_42FAA]
 gi|263235899|gb|EEZ21394.1| arylsulfatase [Bacteroides sp. 3_1_33FAA]
 gi|294448521|gb|EFG17070.1| arylsulfatase [Bacteroides vulgatus PC510]
 gi|392630750|gb|EIY24736.1| hypothetical protein HMPREF1063_02830 [Bacteroides dorei
           CL02T00C15]
 gi|392642496|gb|EIY36262.1| hypothetical protein HMPREF1064_01596 [Bacteroides dorei
           CL02T12C06]
 gi|392643729|gb|EIY37478.1| hypothetical protein HMPREF1065_02504 [Bacteroides dorei
           CL03T12C01]
          Length = 508

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 68/257 (26%), Positives = 104/257 (40%), Gaps = 69/257 (26%)

Query: 20  GYFGKYLNKYNGSYIPP---GWREWGALIMNSKYYNYSINMNGRKIKH------------ 64
           G FGK+   Y GS   P   G  E+   I   + + Y  N   R  K             
Sbjct: 142 GMFGKWAGGYEGSVSTPDKRGIDEYYGYICQFQAHLYYPNFLNRYSKSAGDTAVVRVVMD 201

Query: 65  --------GEDYYN--DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG----PE 110
                   G+DY+    Y  D+I  +++++L +      K+P   + ++  PH     PE
Sbjct: 202 ENINYPMFGKDYFKRPQYSADMIHEEAMKWLDKQD---GKQPFFGIFTYTLPHAELAQPE 258

Query: 111 DSA-PQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP---VHRQFTDLLMTKRLQ 166
           DS    Y   FF                DK W  Q   +  P    H QF  ++      
Sbjct: 259 DSILTGYQKKFFE---------------DKTWGGQEGSRYNPSVHTHAQFAGMIT----- 298

Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYH---------LGQFGLVKG-KSFPF 216
               +D  V +++ +LK+ G   NT +I+TSD+G H          G+ G ++G K   +
Sbjct: 299 ---RLDYYVGEVLNKLKEKGLDENTIVIFTSDNGPHEEGGADPTFFGRDGKLRGLKRQCY 355

Query: 217 EFDIRVPFLMRGPGIVP 233
           E  IR+PF++R PG VP
Sbjct: 356 EGGIRIPFIVRWPGKVP 372


>gi|298290556|ref|YP_003692495.1| sulfatase [Starkeya novella DSM 506]
 gi|296927067|gb|ADH87876.1| sulfatase [Starkeya novella DSM 506]
          Length = 480

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 48/176 (27%), Positives = 79/176 (44%), Gaps = 9/176 (5%)

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
           GE  Y  Y  D++A   + +LR +      KP  L +SF APH P  + P++   +++  
Sbjct: 153 GESSYTFYDRDILARAQI-WLRGAAARARAKPWALFVSFVAPHFPLTAPPEHYYRYYDDA 211

Query: 125 SHHTPA-YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKI 178
               P  Y+    P+  ++ +           F      +R        +  +D+ + K+
Sbjct: 212 RLQWPKLYETQARPEHPYLREYAECFA-YDEHFAGPDDVRRALAGYYGLVGFLDENIGKL 270

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +  L   G   +T ++Y SDHG +LG  GL  GKS  +E  + VP +  GPG  PG
Sbjct: 271 LATLDATGLSQDTRVVYVSDHGDNLGARGLW-GKSTMYEEAVSVPLIAAGPGFEPG 325


>gi|103486502|ref|YP_616063.1| sulfatase [Sphingopyxis alaskensis RB2256]
 gi|98976579|gb|ABF52730.1| sulfatase [Sphingopyxis alaskensis RB2256]
          Length = 515

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 48/76 (63%), Gaps = 2/76 (2%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M + L T+ +VD+ V  ++  L+D G   NT ++YTSD G++LG+ G    K F +E  +
Sbjct: 308 MQQYLGTIAAVDEGVGAVLDYLEDSGLDRNTIVVYTSDQGFYLGEHGWFD-KRFIYEESM 366

Query: 221 RVPFLMRGPG-IVPGT 235
           R PFL++ PG I PGT
Sbjct: 367 RTPFLIQYPGHIRPGT 382


>gi|325263385|ref|ZP_08130119.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Clostridium sp. D5]
 gi|324031094|gb|EGB92375.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Clostridium sp. D5]
          Length = 498

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 52/201 (25%), Positives = 91/201 (45%), Gaps = 44/201 (21%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----------- 120
           Y  D+I + + R+L++  +S   KP  ++ +  APH   +   ++ +MF           
Sbjct: 151 YVTDIITDMTTRWLKERDRS---KPFFVMCNHKAPHDFWEYHERHEDMFAEKEIPVPCSL 207

Query: 121 FNVTSHHTPAY-----------------------DYAPNP-DKQWILQVTRKMQPVHRQF 156
           F    H +PA                        DY   P  K   +    K    ++++
Sbjct: 208 FENKVHRSPATRERGSSVTPRSKIRSLYEDFCRPDYVTGPLTKTETMSFEEKGYAAYQKY 267

Query: 157 TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
               +   L+T+ ++D++V  I+ EL++ G L++T +IYTSD G  LG+    + K + F
Sbjct: 268 ----LKDYLRTVAAIDESVGNILNELEEQGILDDTIVIYTSDQGIFLGEHDY-QDKRWSF 322

Query: 217 EFDIRVPFLMRGPG-IVPGTM 236
           E  +R P L+R P  I PGT+
Sbjct: 323 EESLRTPMLVRYPKEIPPGTV 343


>gi|399993702|ref|YP_006573942.1| arylsulfatase [Phaeobacter gallaeciensis DSM 17395 = CIP 105210]
 gi|398658257|gb|AFO92223.1| putative arylsulfatase [Phaeobacter gallaeciensis DSM 17395 = CIP
           105210]
          Length = 508

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 86/192 (44%), Gaps = 35/192 (18%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF-------NVT 124
           +Y + I + +  FL +      ++P  L +S+P PH P      + +M+         +T
Sbjct: 195 HYNNWITDRACGFLEERD---GEQPFFLFVSYPDPHHPFSPPAPWCDMYDPQEVPAPALT 251

Query: 125 SHH---TPAY--------------DYAPNPD---KQWILQVTRKMQPVH-RQFTDLLMTK 163
           +      P+Y              D+  NP    +Q  +Q T++      RQ     +  
Sbjct: 252 ADELAAMPSYILDGDREEAGKSYVDFLRNPGPPREQGFMQTTQRFSEASLRQ----AIAH 307

Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
               +  +D+ + +++ +L+  G   +T II+TSDHG  LG  GL++    P+   + VP
Sbjct: 308 TYGMVSMIDNCIGRLLAQLEAQGLAEDTLIIFTSDHGELLGDHGLIRKGPSPYRPLLHVP 367

Query: 224 FLMRGPGIVPGT 235
            ++ GPG+ PGT
Sbjct: 368 LVIAGPGVAPGT 379


>gi|239624380|ref|ZP_04667411.1| sulfatase [Clostridiales bacterium 1_7_47_FAA]
 gi|239520766|gb|EEQ60632.1| sulfatase [Clostridiales bacterium 1_7_47FAA]
          Length = 511

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 51/190 (26%), Positives = 92/190 (48%), Gaps = 25/190 (13%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP---EDSAPQYANMFFNVTSHHT 128
           Y  D+I +D++RFL  +K+    KP  L + + APH P   ED   +  +++ +      
Sbjct: 197 YVTDIITDDALRFLEDNKE----KPFYLSVHYTAPHDPWLKEDQPREIWDLYNDC----- 247

Query: 129 PAYDYAPNPDKQ-WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
             + Y PNP +  W  Q  R+ QP   +     +      + ++D  + +I+  L     
Sbjct: 248 -EFSYIPNPPRHPW--QEARRTQPDDEKQRREYIQGYYTCVTAMDRNIGRIMDYLIANHL 304

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGK---SFP---FEFDIRVPFLMRGPGIVPGTMYLLDV 241
             NT +I+ +D+G+++G  G V GK   +FP   +E  ++VP ++ G G+  G +   + 
Sbjct: 305 DENTMVIFLADNGFNIGHHG-VWGKGNATFPLNMYESSVKVPCIIWGAGVRGGKVN--EA 361

Query: 242 LIPQVRKFSS 251
           L+ Q   F S
Sbjct: 362 LVSQYDLFPS 371


>gi|357384308|ref|YP_004899032.1| choline-sulfatase [Pelagibacterium halotolerans B2]
 gi|351592945|gb|AEQ51282.1| choline-sulfatase [Pelagibacterium halotolerans B2]
          Length = 477

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 70/157 (44%), Gaps = 31/157 (19%)

Query: 95  KPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
           +P  L ++F  PH P D     A+M  +V     P               +  +   +H 
Sbjct: 229 RPWFLTVNFAGPHEPLDVT---ASMRESVKDRQFP---------------MPGRHDGLH- 269

Query: 155 QFTDLLMTKR---LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
             TDL  T R      ++ +D  V + V+ L+  G L NT I++ SDHG  LG   L   
Sbjct: 270 --TDLHQTVRQNYAAMVELIDGWVGRYVEILERAGALENTIIVFASDHGEMLGDHNLW-A 326

Query: 212 KSFPFEFDIRVPFLMRGPGIVPGT------MYLLDVL 242
           KS PFE  +RVP +M GPG  P        + LLDV+
Sbjct: 327 KSVPFEASVRVPLVMAGPGFEPSAAPVAAPVSLLDVM 363


>gi|330447504|ref|ZP_08311152.1| putative sulfatase yidJ [Photobacterium leiognathi subsp.
           mandapamensis svers.1.1.]
 gi|328491695|dbj|GAA05649.1| putative sulfatase yidJ [Photobacterium leiognathi subsp.
           mandapamensis svers.1.1.]
          Length = 517

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 64/275 (23%), Positives = 106/275 (38%), Gaps = 50/275 (18%)

Query: 5   VELTSRNLLYFDI------SHGYFGK-------------YLNKYNGSYIPPGWREWGALI 45
           +EL    + + D+      S GY GK             Y N   G Y    W +W A  
Sbjct: 129 IELKKSTVTWSDLLKQQGYSMGYIGKWHLDAPETPFVPSYNNPMEGRY----WNDWTAPD 184

Query: 46  MNS------KYYNYSINMN-----GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSF-- 92
                     Y  Y ++MN         +    + N + P+  A+ ++++LR     +  
Sbjct: 185 RRHGFDFWYAYGTYDLHMNPIYWTNETPRDKPLHINQWSPEHEADIAIKYLRNENGQYRD 244

Query: 93  SKKPIMLVMSFPAPHGPEDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP 151
           S +P  LV+S   PH P D  PQ Y + F   TS           P+ QW         P
Sbjct: 245 SSQPFTLVVSMNPPHSPYDQVPQKYLDRFSGKTSQQ-----LNNRPNVQWDKAYQEGYGP 299

Query: 152 VHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
            +         + +  +  VD+   +I+ EL  L    +T +++ SDHG  +G  G    
Sbjct: 300 KY-------FKEYMAMVNGVDEQFGRIIDELDRLNLTEDTLVVFFSDHGCCMGSNGQPT- 351

Query: 212 KSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
           K+  ++  +R+P + R PG +    + L    P +
Sbjct: 352 KNVHYDEAMRIPMIFRWPGKINAQQHDLLFSAPDI 386


>gi|340757398|ref|ZP_08694000.1| sulfatase [Fusobacterium varium ATCC 27725]
 gi|251834661|gb|EES63224.1| sulfatase [Fusobacterium varium ATCC 27725]
          Length = 524

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 45/63 (71%)

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           +D  + KI+ +L++LG   +T I++T+DHG+ +GQ GL++   F +E  I++PF+++ PG
Sbjct: 330 MDKYIGKILDKLEELGLAEDTIIVFTTDHGHFVGQHGLIRKGPFHYEDLIKIPFIVKYPG 389

Query: 231 IVP 233
            VP
Sbjct: 390 FVP 392


>gi|150003729|ref|YP_001298473.1| arylsulfatase [Bacteroides vulgatus ATCC 8482]
 gi|319640049|ref|ZP_07994776.1| arylsulfatase [Bacteroides sp. 3_1_40A]
 gi|345517063|ref|ZP_08796541.1| arylsulfatase [Bacteroides sp. 4_3_47FAA]
 gi|423313130|ref|ZP_17291066.1| hypothetical protein HMPREF1058_01678 [Bacteroides vulgatus
           CL09T03C04]
 gi|149932153|gb|ABR38851.1| arylsulfatase precursor [Bacteroides vulgatus ATCC 8482]
 gi|254833835|gb|EET14144.1| arylsulfatase [Bacteroides sp. 4_3_47FAA]
 gi|317388327|gb|EFV69179.1| arylsulfatase [Bacteroides sp. 3_1_40A]
 gi|392686344|gb|EIY79650.1| hypothetical protein HMPREF1058_01678 [Bacteroides vulgatus
           CL09T03C04]
          Length = 508

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 68/257 (26%), Positives = 104/257 (40%), Gaps = 69/257 (26%)

Query: 20  GYFGKYLNKYNGSYIPP---GWREWGALIMNSKYYNYSINMNGRKIKH------------ 64
           G FGK+   Y GS   P   G  E+   I   + + Y  N   R  K             
Sbjct: 142 GMFGKWAGGYEGSVSTPDKRGIDEYYGYICQFQAHLYYPNFLNRYSKSAGDTAVVRVVMD 201

Query: 65  --------GEDYYN--DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG----PE 110
                   G+DY+    Y  D+I  +++++L +      K+P   + ++  PH     PE
Sbjct: 202 ENINYPMFGKDYFKRPQYSADMIHEEAMKWLDKQD---GKQPFFGIFTYTLPHAELAQPE 258

Query: 111 DSA-PQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP---VHRQFTDLLMTKRLQ 166
           DS    Y   FF                DK W  Q   +  P    H QF  ++      
Sbjct: 259 DSILTGYQKKFFE---------------DKTWGGQEGSRYNPSVHTHAQFAGMIT----- 298

Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYH---------LGQFGLVKG-KSFPF 216
               +D  V +++ +LK+ G   NT +I+TSD+G H          G+ G ++G K   +
Sbjct: 299 ---RLDYYVGEVLNKLKEKGLDENTIVIFTSDNGPHEEGGADPTFFGRDGKLRGLKRQCY 355

Query: 217 EFDIRVPFLMRGPGIVP 233
           E  IR+PF++R PG VP
Sbjct: 356 EGGIRIPFIVRWPGKVP 372


>gi|149199849|ref|ZP_01876878.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
 gi|149137020|gb|EDM25444.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
          Length = 467

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/103 (38%), Positives = 58/103 (56%), Gaps = 14/103 (13%)

Query: 165 LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG------YHLGQFG-LVKGKSFPFE 217
           +  ++ +DD V +I K+LK+LG   NT +IYT+D+G      +H G  G L  GK   FE
Sbjct: 250 INVIEHMDDQVGRITKKLKELGLDKNTIVIYTTDNGPWLPFKHHGGSAGELRDGKGSTFE 309

Query: 218 FDIRVPFLMRGPGIVP-GT-----MYLLDVLIPQVRKFSSGSL 254
              RVP +M  PG++P GT     M  +D L+P +   SS +L
Sbjct: 310 GGQRVPCVMWAPGLIPAGTETNELMTTID-LLPSLAALSSSTL 351


>gi|196232398|ref|ZP_03131251.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196223470|gb|EDY17987.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 517

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 50/77 (64%), Gaps = 2/77 (2%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   L  ++SVD++V +I++ L++ G   NT +IY+SD G++LG+ G    K + FE  +
Sbjct: 299 MHDYLACIKSVDESVGRILQYLEEAGLAENTIVIYSSDQGFYLGEHGWFD-KRWIFEESL 357

Query: 221 RVPFLMRGPGIV-PGTM 236
           R P L+R PG+  PG++
Sbjct: 358 RAPLLIRWPGVTKPGSV 374


>gi|402222605|gb|EJU02671.1| alkaline phosphatase-like protein [Dacryopinax sp. DJM-731 SS1]
          Length = 484

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/87 (42%), Positives = 52/87 (59%), Gaps = 4/87 (4%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M + L+T+QSVDD V +++  L   G   NT +IYTSD G+ LG+ G    K F +E   
Sbjct: 257 MQRYLRTIQSVDDNVGRLLDYLDKEGLTENTIVIYTSDQGFFLGEHGWFD-KRFIYEESF 315

Query: 221 RVPFLMRGP-GIVPGTMYLLDVLIPQV 246
           ++PFL+R P  I PGT  + D +I  V
Sbjct: 316 QMPFLIRYPREIKPGT--VCDDIIQNV 340


>gi|167762012|ref|ZP_02434139.1| hypothetical protein BACSTE_00358 [Bacteroides stercoris ATCC
           43183]
 gi|167700104|gb|EDS16683.1| arylsulfatase [Bacteroides stercoris ATCC 43183]
          Length = 463

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 61/213 (28%), Positives = 90/213 (42%), Gaps = 61/213 (28%)

Query: 30  NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSK 89
           N  Y  P WR       N K  N   N   + IKH  D        L   D+++F+ ++K
Sbjct: 179 NDPYYYPYWR-----FNNEKLENVKENEGDKHIKHNTD--------LSTEDAIKFINRNK 225

Query: 90  QSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKM 149
            +    P  L +++ APH P           +N+    T  YD     D+ W +      
Sbjct: 226 NN----PFFLYLAYDAPHEP-----------YNI--DETTWYD-----DEAWDMN----- 258

Query: 150 QPVHRQFTDLLMTKRLQTLQS-VDDAVEKIVKELKDLGELNNTYIIYTSDHGY------- 201
                       TKR  +L + +D A+ +++ EL  LG   NT +I+ SD+G        
Sbjct: 259 ------------TKRYASLITHMDRAIGRLLAELDRLGLRENTLVIFASDNGAAKQAPLE 306

Query: 202 HLGQFGLVKG-KSFPFEFDIRVPFLMRGPGIVP 233
            LG  G +KG K   +E  IRVPF++  PG VP
Sbjct: 307 ELGCKGSLKGMKGQLYEGGIRVPFIVNQPGKVP 339


>gi|260587234|ref|ZP_05853147.1| arylsulfatase [Blautia hansenii DSM 20583]
 gi|260542429|gb|EEX22998.1| arylsulfatase [Blautia hansenii DSM 20583]
          Length = 514

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 83/173 (47%), Gaps = 26/173 (15%)

Query: 68  YYNDYYP-DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSH 126
           Y   Y+P + + + S+ FLR+      +KP  L+ S+  PH P D AP Y    +N    
Sbjct: 199 YEEKYHPTNWVTDRSIDFLRRKD---PQKPFFLMASYLRPHPPFD-APSYYFDLYNKKEL 254

Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQF------TDLLMTKRLQ-----TLQSVDDAV 175
             PA          W  + T ++Q + R F      +D ++ +  Q      +  +D  +
Sbjct: 255 TPPAV-------GDW--ETTEELQAMGRVFDSKCGPSDAVLIREAQIGYYACITHLDHQI 305

Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
            ++++ L + G  +NT I++TSDHG  L    + + KS P++  IR+P ++ G
Sbjct: 306 GRLIQALVEYGVYDNTLILFTSDHGEELCDHHMFR-KSRPYQGSIRIPMIVSG 357


>gi|18309213|ref|NP_561147.1| arylsulfatase [Clostridium perfringens str. 13]
 gi|81768190|sp|Q8XNV1.1|SULF_CLOPE RecName: Full=Arylsulfatase; AltName: Full=Cys-type sulfatase
 gi|18143889|dbj|BAB79937.1| probable phosphonate monoester hydrolase [Clostridium perfringens
           str. 13]
          Length = 481

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/155 (27%), Positives = 73/155 (47%), Gaps = 5/155 (3%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           + N+S+ FLR+   S   KP  L MSF  PH P D    Y +M+ +         D+A  
Sbjct: 183 VVNESIDFLRRRDPS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239

Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
            D++   +    ++ + +++           ++  +D  + + +  L + G+LNNT  ++
Sbjct: 240 EDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGKLNNTIFLF 299

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
            SDHG  +G     + K  P+E   RVPF +  PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSARVPFFIYDPG 333


>gi|398829781|ref|ZP_10587976.1| choline-sulfatase [Phyllobacterium sp. YR531]
 gi|398216083|gb|EJN02641.1| choline-sulfatase [Phyllobacterium sp. YR531]
          Length = 506

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/159 (29%), Positives = 76/159 (47%), Gaps = 17/159 (10%)

Query: 85  LRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP----NPDKQ 140
           L +     +++P  L +SF  PH P  +  +Y +++ N  + + P+    P    +P  Q
Sbjct: 176 LSRDSDDAARRPWCLTVSFTHPHDPYVARRKYWDLYENSLALN-PSVGAIPFDDQDPHSQ 234

Query: 141 WILQVTRKMQPVHRQFTDLLMTKR------LQTLQSVDDAVEKIVKELKDLGELNNTYII 194
            +LQ        H  F       R      L  +  VDD V ++++ L+    L++T I+
Sbjct: 235 RLLQACD-----HGAFDITPEQVRRARQGYLANITYVDDKVGELLETLETTRMLDDTIIV 289

Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           +TSDHG  LG+ GL    +F FE   RVP ++ G GI P
Sbjct: 290 FTSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGKGITP 327


>gi|325109768|ref|YP_004270836.1| sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324970036|gb|ADY60814.1| sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 561

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 65/274 (23%), Positives = 116/274 (42%), Gaps = 55/274 (20%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKI--KHGEDYYNDYYPDLI 77
           GY    + K++    P  +  +  L    KY+N    + G K   K+   + N +  D I
Sbjct: 114 GYQTAMIGKWHLKVEPADFDYYCVLPGQGKYHNPEFRVRGDKPWGKNLIKFDNMHVTDAI 173

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY---- 133
            + ++ +L++ +     KP  L+  + APH   D+AP+Y +    V     PA D     
Sbjct: 174 TDLTLDWLKEGRDE--DKPFFLMHHYKAPHDYFDNAPRYESYLAGVD---IPAPDSLWKR 228

Query: 134 -------------------------APNPDKQWILQVTRKMQPVH-------RQFTD--- 158
                                    + NP + ++L +   M P         + F+D   
Sbjct: 229 GDEFGSMATRGANDELVPHIGTSIGSRNPRRSYLLDLP-DMYPNEFPEGYDPKYFSDEEN 287

Query: 159 ------LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGK 212
                   + K L+ ++ +DD + ++   L++ GEL+NT IIYT D G+ LG+    + K
Sbjct: 288 KRLAYNAYLRKFLRCVKGIDDNLGRLFAHLEETGELDNTVIIYTGDQGFMLGEHDY-QDK 346

Query: 213 SFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
            + +E   R+PFL+R P  +P     +D ++  V
Sbjct: 347 RWMYEESQRMPFLVRYPKTIPAGQ-KIDAIVENV 379


>gi|423297327|ref|ZP_17275388.1| hypothetical protein HMPREF1070_04053 [Bacteroides ovatus
           CL03T12C18]
 gi|392667504|gb|EIY61012.1| hypothetical protein HMPREF1070_04053 [Bacteroides ovatus
           CL03T12C18]
          Length = 527

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 47/69 (68%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT+++YTSD G+++G+ G    K F +E  +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTFVVYTSDQGFYMGEHGWF-DKRFMYEESM 375

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 376 RTPLIMRLP 384


>gi|87307553|ref|ZP_01089697.1| mucin-desulfating sulfatase [Blastopirellula marina DSM 3645]
 gi|87289723|gb|EAQ81613.1| mucin-desulfating sulfatase [Blastopirellula marina DSM 3645]
          Length = 494

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 58/196 (29%), Positives = 79/196 (40%), Gaps = 18/196 (9%)

Query: 41  WGALIMNSKYY---NYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPI 97
           WG     + Y    N S+     K  H    Y  +  D+I        RQS +   KKP 
Sbjct: 150 WGGSPGQTDYRTANNKSMKKYATKYPHSTLSYGAFAQDVI--------RQSVEQ--KKPF 199

Query: 98  MLVMSFPAPHGPEDSAPQ----YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVH 153
            L +SF APH P    P+    YA   F    +    +     P  +   Q  R ++  +
Sbjct: 200 CLSISFKAPHKPATPDPRFNEVYAGKKFTKPDNFGREFGEHLAPQSKSGRQFPRFIEWKY 259

Query: 154 RQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
               D  M K  Q +  +D A+  I  EL   G  +NT +IYTSD+GY  G  G    K 
Sbjct: 260 DTDYDGEMAKYHQQVYGIDVALGMIRDELAAQGVADNTVVIYTSDNGYICGSHGY-GSKV 318

Query: 214 FPFEFDIRVPFLMRGP 229
            P E   RVP ++  P
Sbjct: 319 LPMEESSRVPLIIYDP 334


>gi|336426668|ref|ZP_08606677.1| hypothetical protein HMPREF0994_02683 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336010684|gb|EGN40666.1| hypothetical protein HMPREF0994_02683 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 483

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/174 (24%), Positives = 81/174 (46%), Gaps = 5/174 (2%)

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY--DY 133
            +A+++ R + ++K+    +P  +  +F  PH P      Y +M+        P+Y  D+
Sbjct: 194 FVAHETNRLIDKAKEK--DQPFFIWSNFWGPHSPSIVPEPYYSMYDPKLIPEHPSYCEDF 251

Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
              P   ++ +    +          +  +       +DD V  IV +LK+ G   +T I
Sbjct: 252 HDKPYGYYLTEKMWGLSDYGWSGFQEISARYFGHCTLIDDCVGMIVDKLKECGLYEDTVI 311

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG-TMYLLDVLIPQV 246
           IY++DHG  +G   L++  +F F+   RVP +++G G     ++ LL  L+P +
Sbjct: 312 IYSADHGDCMGAHRLIEKGAFTFDEIYRVPLVVKGLGTKDNDSLVLLQELMPSI 365


>gi|393784387|ref|ZP_10372552.1| hypothetical protein HMPREF1071_03420 [Bacteroides salyersiae
           CL02T12C01]
 gi|392666163|gb|EIY59680.1| hypothetical protein HMPREF1071_03420 [Bacteroides salyersiae
           CL02T12C01]
          Length = 496

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 58/242 (23%), Positives = 103/242 (42%), Gaps = 29/242 (11%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYN-YSINMNGRK------IKHGEDYYNDY 72
           GY    + K++  Y P G+  +  L    +Y N   I+   RK       K     Y  +
Sbjct: 117 GYQTAIIGKWHLKYEPAGFDYYNVLPGQGRYKNPVLISKEERKGNTCPFDKTPGKIYQGH 176

Query: 73  YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV--------- 123
             D+IA+ ++ ++++ K     +P ML+  F APH   + A +++++  +V         
Sbjct: 177 STDVIASQAINWMKEHKDD--NQPFMLMCHFKAPHRSWEYAERFSDLLKDVEIPEPENMF 234

Query: 124 ---------TSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
                    T   T + +     D +  L          +      M   L+ +  +D+ 
Sbjct: 235 DTYEGRAYYTKLQTMSLEDMNEKDMKCELPANLSRDEFRKWAYQRYMKDYLRCIAGIDEN 294

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVP 233
           V +I+K L + G   NT I+YTSD G++LG+ G    +    E  + +P L+R P  I P
Sbjct: 295 VGRILKFLDENGLAQNTVIVYTSDQGFYLGEHGWFDKRYMQKE-SLNMPLLIRYPNEIKP 353

Query: 234 GT 235
           G+
Sbjct: 354 GS 355


>gi|325107824|ref|YP_004268892.1| sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324968092|gb|ADY58870.1| sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 489

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 57/241 (23%), Positives = 103/241 (42%), Gaps = 28/241 (11%)

Query: 14  YFDISHGYFGK----YLNKYNG--SYIPPGWREWGALIMNSKYYNYSINMNG-------- 59
           Y  I H   G+     L + NG  ++    W    A I+ S  +  ++ M G        
Sbjct: 95  YMQIDHQLTGQTTLGQLLRTNGYDTFATGKWHNGKAAILRSFEHGENVFMGGMCDHTKVP 154

Query: 60  -RKIKHGEDYY-----NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA 113
              I+ GE        N +  +L A+ +V +L        +KP    ++F APH P +  
Sbjct: 155 LSDIRDGELVSRGVKPNSFSSELFADAAVNYLNGRD---GEKPFFAYVAFTAPHDPRNPP 211

Query: 114 PQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ---TLQS 170
            +Y  M+++      PA     +P     L +  ++     +  D++  +  +    +  
Sbjct: 212 EKYREMYYS-NPPSLPANFLTQHPFNNGSLVIRDEVLAPWPRPEDMVQRQLCEYYGLISH 270

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           +DD + +I++ L+      NT I+Y +DHG  +G  GL+ GK   +E  ++ P +  GPG
Sbjct: 271 MDDQIGRILEALEKTPGGENTIIVYAADHGLAMGSHGLL-GKQNLYEHSMKCPLIFSGPG 329

Query: 231 I 231
           I
Sbjct: 330 I 330


>gi|440718008|ref|ZP_20898478.1| arylsulfatase A [Rhodopirellula baltica SWK14]
 gi|436436890|gb|ELP30586.1| arylsulfatase A [Rhodopirellula baltica SWK14]
          Length = 479

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 43/133 (32%), Positives = 70/133 (52%), Gaps = 14/133 (10%)

Query: 136 NPDKQWILQVTRKMQ--PVHRQFTDLLMTKRL---QTLQSVDDAVEKIVKELKDLGELNN 190
           N DK + + +   M   P+H     +  +KR     T+Q +D +  +I++ LK+LG   N
Sbjct: 219 NKDKSFFVYLGHNMPHIPLHASEKFVGSSKRGIYGDTIQELDWSTGEILRALKELGLDQN 278

Query: 191 TYIIYTSDHGYHLGQFG----LVKGKSFPFEFDIRVPFLMRGPGIVP-GTMYLLDV---- 241
           T +I+TSD+G + G+ G    L  GK    E  +RVPF+ R PG +P GT +   +    
Sbjct: 279 TLVIFTSDNGPNTGKGGSAGVLKGGKGSTLEGGVRVPFVARWPGTIPAGTEFDEAITGMD 338

Query: 242 LIPQVRKFSSGSL 254
           L+P + K + G +
Sbjct: 339 LLPTLTKLAGGEV 351


>gi|269120708|ref|YP_003308885.1| sulfatase [Sebaldella termitidis ATCC 33386]
 gi|268614586|gb|ACZ08954.1| sulfatase [Sebaldella termitidis ATCC 33386]
          Length = 480

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/180 (26%), Positives = 85/180 (47%), Gaps = 20/180 (11%)

Query: 72  YYP-DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
           Y+P + + N+ + FLR+  +   +K   L +SF  PH P D    Y NM+ N    +   
Sbjct: 177 YHPTNWVVNEGINFLRRRDK---RKNFFLKLSFIRPHSPLDPPEYYYNMYINREIDN--- 230

Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ-----TLQSVDDAVEKIVKELKDL 185
               P P ++  ++    +     Q +   M KR +     ++  +D  + + +  LK+ 
Sbjct: 231 ----PIPAEEENIKEAYNINAAEGQISKEAM-KRAKVAYYGSITHIDHQIGRFLMVLKEN 285

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG--IVPGTMYLLDVLI 243
             L  + +++ SDHG  +G  GL + KS P++  I VPF++  PG  +  G M   D L+
Sbjct: 286 DLLKESIVLFVSDHGDLMGDHGLFR-KSMPYQGSIHVPFIVYDPGNFLNGGVMREPDELV 344


>gi|305666713|ref|YP_003863000.1| mucin-desulfating sulfatase [Maribacter sp. HTCC2170]
 gi|88708937|gb|EAR01171.1| mucin-desulfating sulfatase [Maribacter sp. HTCC2170]
          Length = 525

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 56/267 (20%), Positives = 118/267 (44%), Gaps = 50/267 (18%)

Query: 18  SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM-----NGRKIKHGEDYYNDY 72
            +GY    + K++    P G+  +  L    +Y+N  +       +G     G++Y   +
Sbjct: 123 ENGYQTAVIGKWHLVGKPEGFDYYNVLPGQGRYWNPILKSEDNWSDGYDSSTGKEY-KGF 181

Query: 73  YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY- 131
             D+IA+ ++  L++  +   +KP M+  +F A H P +   +Y +++ N+      +  
Sbjct: 182 STDVIADLTIEHLQKRDK---EKPFMMFCNFKATHEPFEYPERYNDLYKNIEIPEPESLL 238

Query: 132 DYAPN------------------------PDKQWI-----------LQVTRKMQPVHRQF 156
           D+ PN                        P+  W            L   +K + ++++ 
Sbjct: 239 DFDPNNTGRTFHGQTLERLGARWDKATENPETFWTSYPGLPYPLEGLDSIQKRKKIYQK- 297

Query: 157 TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
              L+   ++   ++DD + K++  L+D G  +NT ++YT+D GY LG+ G    K   +
Sbjct: 298 ---LVKDFMRCGAAIDDNIGKLLDYLEDQGIADNTIVVYTADQGYFLGEHGFFD-KRMIY 353

Query: 217 EFDIRVPFLMRGPGIVPGTMYLLDVLI 243
           E  +R+PF++R P  +     + D+++
Sbjct: 354 EESLRMPFVIRYPKEIREGKRIDDIIL 380


>gi|340619501|ref|YP_004737954.1| sulfatase [Zobellia galactanivorans]
 gi|339734298|emb|CAZ97675.1| Sulfatases, family S1-23 [Zobellia galactanivorans]
          Length = 522

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 52/226 (23%), Positives = 97/226 (42%), Gaps = 47/226 (20%)

Query: 59  GRKIKHGEDY--YNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY 116
           G+ +++GE+    + +Y D++  ++  +L + +     KP +LV+S   PH P     ++
Sbjct: 178 GKFLRNGEEVGETDGWYVDVLTKEATNWLTKVRDK--SKPFLLVVSTHEPHTPIAPPEKF 235

Query: 117 ANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDA 174
           ++++ N T                  L+ T K   V R   D+   K+    T+  +D A
Sbjct: 236 SSLYDNATVDS---------------LEQTIKYGGVDRTLYDISSNKKEYYGTVSQLDHA 280

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHG--------------------YHLGQFGLVKG-KS 213
              ++K ++D    +NT +I+TSD+G                    Y  G  G+ +G K 
Sbjct: 281 FSDLMKTVEDENAYDNTMVIFTSDNGPEHPVNLEESQGTWEEPIRDYCFGTPGIYRGMKR 340

Query: 214 FPFEFDIRVPFLMRGPGIVPG-----TMYLLDVLIPQVRKFSSGSL 254
           +PFE   RVP ++R P  +P       M +    +P V   +   +
Sbjct: 341 YPFEGGHRVPGIVRYPAKIPAGTTSDAMVVATDFLPTVATLAQAEI 386


>gi|420238024|ref|ZP_14742462.1| choline-sulfatase [Rhizobium sp. CF080]
 gi|398089025|gb|EJL79561.1| choline-sulfatase [Rhizobium sp. CF080]
          Length = 507

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/171 (27%), Positives = 76/171 (44%), Gaps = 8/171 (4%)

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
            +AN  +  L + +    ++P  L +SF  PH P  +  ++ +++ +   H  P     P
Sbjct: 168 FLANQKLYHLARERDDADRRPWALTVSFTHPHDPYVARKKFWDLYEDC-EHLLPEVGPIP 226

Query: 136 NPD-----KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
             D     K+ I     K   V               +  +D+ V ++V  L     L++
Sbjct: 227 LEDQDPHSKRIIFSCDYKNFDVTEDNVRRSRRAYFANISYIDEKVGELVDTLTRTRMLDD 286

Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
           T I++ SDHG  LG+ GL    SF FE   RVP ++ GPG VP  ++L  V
Sbjct: 287 TLILFCSDHGDMLGERGLWFKMSF-FEGSARVPLMIAGPG-VPAGLHLAPV 335


>gi|319900823|ref|YP_004160551.1| sulfatase [Bacteroides helcogenes P 36-108]
 gi|319415854|gb|ADV42965.1| sulfatase [Bacteroides helcogenes P 36-108]
          Length = 509

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 68/256 (26%), Positives = 104/256 (40%), Gaps = 69/256 (26%)

Query: 20  GYFGKYLNKYNGSYIPP---GWREWGALIMNSKYYNYSINMNGRKIKH------------ 64
           G FGK+   Y GS   P   G  E+   I   + + Y  N   R  K             
Sbjct: 142 GMFGKWAGGYEGSVSTPDKRGIDEYYGYICQFQAHLYYPNFLNRYSKSLGDTAVVRVVMD 201

Query: 65  --------GEDYYN--DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG----PE 110
                   G+DY+    Y  D+I  +++++L       +++P   + ++  PH     PE
Sbjct: 202 ENIKYPMFGKDYFKRPQYSADMIHKEAMKWLDSQN---AQQPFFGIFTYTLPHAELAQPE 258

Query: 111 DSAP-QYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP---VHRQFTDLLMTKRLQ 166
           DS    Y   FF                D+ W  Q   +  P    H QF  ++      
Sbjct: 259 DSILLGYEKKFFT---------------DRTWGGQEGSRYNPSIHTHAQFAGMIT----- 298

Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYH---------LGQFGLVKG-KSFPF 216
               +D  V +++K+LKD G   NT +I+TSD+G H          G+ G ++G K   +
Sbjct: 299 ---RLDHYVGEVLKKLKDKGLDENTIVIFTSDNGPHEEGGADPAFFGRDGKLRGLKRQCY 355

Query: 217 EFDIRVPFLMRGPGIV 232
           E  IRVPF++R PG V
Sbjct: 356 EGGIRVPFIVRWPGKV 371


>gi|325264228|ref|ZP_08130960.1| putative mucin-desulfating sulfatase
           (N-acetylglucosamine-6-sulfatase) [Clostridium sp. D5]
 gi|324030712|gb|EGB91995.1| putative mucin-desulfating sulfatase
           (N-acetylglucosamine-6-sulfatase) [Clostridium sp. D5]
          Length = 539

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/167 (24%), Positives = 76/167 (45%), Gaps = 13/167 (7%)

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD----- 132
           AN + +F+ Q+ Q   +KP    ++ P PH    +A ++ +++  +     P YD     
Sbjct: 200 ANCANKFIEQADQE--QKPFCFWLTMPRPHQTYAAAKEFWDLYEGIELELPPNYDDKMEH 257

Query: 133 -YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGE 187
             +   D Q   Q+ +       +  D    + L+     +  +DDAV  ++ +L++LG 
Sbjct: 258 RSSAAKDTQRKFQMDKDWIAFGDKDFDAARKRVLRGYYACVSQMDDAVGTVLNKLEELGI 317

Query: 188 LNNTYIIYTSDHGYHLGQFGLV-KGKSFPFEFDIRVPFLMRGPGIVP 233
             NT IIY +DHG   G+ G++ K     F    ++P +    G +P
Sbjct: 318 RENTLIIYMTDHGEFAGEHGMIEKAPGIGFRCVTQIPMIFSWKGRLP 364


>gi|149198280|ref|ZP_01875326.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Lentisphaera araneosa HTCC2155]
 gi|149138576|gb|EDM26983.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Lentisphaera araneosa HTCC2155]
          Length = 518

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 56/231 (24%), Positives = 95/231 (41%), Gaps = 35/231 (15%)

Query: 37  GWREWGALIMNSKYY-------------NY-SINMNGRKIKHGEDYYNDYYPDLIANDSV 82
           G+  W A      YY             NY   N+NG+K+         Y  D + + S+
Sbjct: 141 GFDHWAAFRGQGTYYADGHGASRVVPQNNYDGYNVNGKKVPQ-----LGYITDELTDMSI 195

Query: 83  RFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQ-- 140
            +L   KQ   +KP  + +S    H   D   +  +         TP   YA  P+ +  
Sbjct: 196 DWLNNRKQK--EKPFFMYVSHKGVH--SDFVARDEDRGIYKDKPWTPPSTYANTPENRKN 251

Query: 141 ---WILQVTRKMQPVH-----RQFTDLLMTKRL-QTLQSVDDAVEKIVKELKDLGELNNT 191
              W++                 F      KR  + L  +DDAV + ++ L+ +GEL+NT
Sbjct: 252 KPMWLINQRNSRHGSDYGYNLENFDVTFYFKRYSEALIPIDDAVGRTIEHLRKMGELDNT 311

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
             +Y  D+G+  G+ GL+  ++  +E  +RVP +MR P    G   + +++
Sbjct: 312 LFVYMGDNGFQFGEQGLIDKRT-AYEASMRVPMIMRYPKAFKGGTVVDEIV 361


>gi|332665689|ref|YP_004448477.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
 gi|332334503|gb|AEE51604.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
          Length = 475

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 44/165 (26%), Positives = 81/165 (49%), Gaps = 17/165 (10%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPA 130
           ++ A+ ++ +L Q+K    ++P    ++F +PH P      Y  M+      +  ++ P 
Sbjct: 181 NMYADAAIAYLNQNKGG--QQPFFAYVAFTSPHDPRTPPAPYDKMYDPNDIPLPKNYLPQ 238

Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIVKELKDLG 186
           + +    D   I     ++ P+ R  T  ++ K L      +  VD  + +I++ LK  G
Sbjct: 239 HPF----DNGEIRVRDEQLLPLPR--TPEMVKKDLAAYYGMISEVDAQIGRIIQTLKANG 292

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           + +NT II+  D+G  +G  GLV GK   +E  +R+P +M GP I
Sbjct: 293 QYDNTLIIFAGDNGLAVGSHGLV-GKQSLYEHSMRIPLVMVGPNI 336


>gi|224536908|ref|ZP_03677447.1| hypothetical protein BACCELL_01784 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521463|gb|EEF90568.1| hypothetical protein BACCELL_01784 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 516

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/161 (25%), Positives = 75/161 (46%), Gaps = 7/161 (4%)

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHHT 128
           D  P  I ++++ ++   K +    P  + +SFP PH P      Y +MF    + +  T
Sbjct: 177 DQQPTRIVDEAIEWIDSQKDN----PFFVWVSFPEPHNPYQVCEPYYSMFAPDKIPAVKT 232

Query: 129 PAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
              D     +K  IL ++     P   +    L    +  ++ +DD ++++V+ELK+ G 
Sbjct: 233 SRKDVLKKGEKYQILAELEDASCPDLERDLPRLRGNYMGMIRLIDDQIKRLVEELKEKGL 292

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
              T I+  SDHG + G++GL++      E   R+P +  G
Sbjct: 293 FEKTIIVVLSDHGDYCGEYGLIRKGVGLSESLTRIPMVWAG 333


>gi|399065906|ref|ZP_10748111.1| arylsulfatase A family protein [Novosphingobium sp. AP12]
 gi|398029084|gb|EJL22580.1| arylsulfatase A family protein [Novosphingobium sp. AP12]
          Length = 579

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 25/232 (10%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY---------NYSINMNGRKIKHGEDYYN 70
            +FGK+    +     PG+ +W + I    Y+         + ++N++G+ +        
Sbjct: 139 AFFGKWHMGASTDAPRPGFDKWVSFIGQGNYWPSKVNTGQSSPNLNVDGKTVPQ-----K 193

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
            Y  D + + ++ +L  +K+   KKP  L +S    H      P+Y + +         +
Sbjct: 194 GYITDELTDYAMDWL--TKERDPKKPFFLYLSHKGVHSDPLPPPRYVHQYDKAKFTLPAS 251

Query: 131 YDYAPNPDKQWILQVTRKMQPVHR-QF---TDLLMTKRLQ----TLQSVDDAVEKIVKEL 182
               P  +    L V  +    H   F    D+ MT+ L+    TL ++DD++ +++  L
Sbjct: 252 AANTPQNNAGKPLWVQNQRNTWHGIDFFYNADVPMTEYLKYYYGTLSAIDDSLGRVMDYL 311

Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +      +T +++TSD+GY +G+ GL+  K   +E  IRVP ++  PG VP 
Sbjct: 312 RKNHLEKDTLVVFTSDNGYMIGEHGLID-KRNAYEASIRVPLVVWEPGTVPA 362


>gi|373956839|ref|ZP_09616799.1| sulfatase [Mucilaginibacter paludis DSM 18603]
 gi|373893439|gb|EHQ29336.1| sulfatase [Mucilaginibacter paludis DSM 18603]
          Length = 491

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 66/260 (25%), Positives = 110/260 (42%), Gaps = 41/260 (15%)

Query: 5   VELTSRNLLYFDI------SHGYFGKY-LNKYNGSYIPP-------GWREW---GALIMN 47
           VEL +  + + DI      S+GY GK+ L+     YIP         W EW   G     
Sbjct: 116 VELPADMVCWSDILEANGYSNGYIGKWHLDSPQKPYIPTYNNEGKTAWNEWTEPGRRHGF 175

Query: 48  SKYYNYS--------INMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSF--SKKPI 97
             +Y Y         +  + R ++    Y N++ P   A+ ++ F+     ++    KP 
Sbjct: 176 DYWYAYGTYDRHLHPMYWDTRAVRDSFHYVNEWGPQHEADKAIEFITNPGNTYRDQSKPF 235

Query: 98  MLVMSFPAPHGPEDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF 156
            LV+S   PH    + PQ Y N++ NV     P      +PD   I      M   +R  
Sbjct: 236 ALVVSMNPPHSDYKNVPQQYVNLYKNV-----PLESLLKDPD---IPPAGTPMGEEYRNN 287

Query: 157 TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
                T     +  VD  V +I+  LK  G  ++T +++ +DHG  LG+   +  K+  +
Sbjct: 288 IKYYYT----CIAGVDAQVGRILTCLKQSGLDDHTIVVFMADHGNCLGKHDEIS-KNNIY 342

Query: 217 EFDIRVPFLMRGPGIVPGTM 236
           E  +R+PF+++  G +   M
Sbjct: 343 EESLRIPFIIKWKGHIKSRM 362


>gi|159468109|ref|XP_001692225.1| hypothetical protein CHLREDRAFT_189474 [Chlamydomonas reinhardtii]
 gi|158278411|gb|EDP04175.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 628

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/141 (29%), Positives = 72/141 (51%), Gaps = 12/141 (8%)

Query: 96  PIMLVMSFPAPH-GPEDSAP----QYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKM- 149
           P  L ++   PH GP D  P    ++AN    +     P Y    +P  ++ +  T+ + 
Sbjct: 237 PFFLYLASANPHDGPVDGMPRVEAKFANAMPGLKVPRGPNYGVPTDP--RFGMSKTKPLW 294

Query: 150 -QPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGL 208
             P    +TD     R  ++ ++D+ ++++V +L+ LG LNNTYI ++SD+G+ LGQ  +
Sbjct: 295 WDPA---WTDKHYRMRALSMLTIDEQIDRLVTKLECLGILNNTYIFFSSDNGFKLGQHNI 351

Query: 209 VKGKSFPFEFDIRVPFLMRGP 229
              K   FE D+ +PF    P
Sbjct: 352 PSEKFTYFEEDVALPFFTLHP 372


>gi|386821061|ref|ZP_10108277.1| arylsulfatase A family protein [Joostella marina DSM 19592]
 gi|386426167|gb|EIJ39997.1| arylsulfatase A family protein [Joostella marina DSM 19592]
          Length = 553

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/73 (42%), Positives = 49/73 (67%), Gaps = 2/73 (2%)

Query: 165 LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPF 224
           L T+++VDD V K++  L++ G  +NT ++YTSD G++LG+ G    K F +E   R+PF
Sbjct: 327 LATVKAVDDGVGKVLDYLEENGLSDNTIVVYTSDQGFYLGEKGFFD-KRFMYEESFRMPF 385

Query: 225 LMRGP-GIVPGTM 236
           L++ P  I PGT+
Sbjct: 386 LIQYPKAIKPGTV 398


>gi|372268475|ref|ZP_09504523.1| sulfatase [Alteromonas sp. S89]
          Length = 469

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 56/211 (26%), Positives = 96/211 (45%), Gaps = 28/211 (13%)

Query: 66  EDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY------ANM 119
           E + +++    I + S +F+R+S +S   +P  L +SF APH  +    Q+      A +
Sbjct: 171 ETHRHEHTTARIGDLSAQFVRESAKS--GQPFNLSVSFHAPHADDHDPNQFVYPRDLAFL 228

Query: 120 FFNVTSHHTP-AYDYAPNPDKQWILQVTRKMQPVHR----QFTDLLMTKRLQTLQSVDDA 174
           + +VT    P A D   +   +W+     K +   R    Q    ++    + +  +D  
Sbjct: 229 YEDVTIPEPPLADDRYFDAQPEWVRNSISKDRWYWRFDTPQKYQEMVKGYYRMISGLDRE 288

Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP----- 229
           V KIV+EL+  G  +NT I++  D+GY LG+  L  GK   +E  IRVPF +  P     
Sbjct: 289 VGKIVRELEKAGVADNTIIMFMGDNGYFLGERQLA-GKWLLYENSIRVPFFVYDPRRKGM 347

Query: 230 ---------GIVPGTMYLLDVLIPQVRKFSS 251
                     + P  + L  + IP++ +  S
Sbjct: 348 EVDKMVLNIDVAPTILELAGIDIPELMQGES 378


>gi|59714190|ref|YP_206965.1| sulfatase/phosphatase [Vibrio fischeri ES114]
 gi|59482438|gb|AAW88077.1| predicted sulfatase/phosphatase [Vibrio fischeri ES114]
          Length = 518

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/175 (26%), Positives = 80/175 (45%), Gaps = 16/175 (9%)

Query: 68  YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS 125
           + N + P+  A+ ++++LR    ++  + KP  LV+S   PH P D  PQ     F  +S
Sbjct: 220 HINQWSPEHEADIAIKYLRNENGNYRNNDKPFTLVVSMNPPHSPYDQVPQKYLDRFKESS 279

Query: 126 HHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
               +      P+ QW  +      P +         + +  +  VD+   +IV EL  L
Sbjct: 280 RTLNS-----RPNVQWDKEYLEGYGPEY-------FKEYMAMVNGVDEQFGRIVDELDAL 327

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVPGTMYLL 239
               +T +++ SDHG  +G  G    K+  +E  +R+P + R PG I P +  LL
Sbjct: 328 NLAKDTLVVFFSDHGCCMGSNG-NPTKNVHYEEAMRIPMMFRWPGKIAPKSDDLL 381


>gi|196233888|ref|ZP_03132726.1| iduronate-2-sulfatase [Chthoniobacter flavus Ellin428]
 gi|196222082|gb|EDY16614.1| iduronate-2-sulfatase [Chthoniobacter flavus Ellin428]
          Length = 264

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/65 (43%), Positives = 42/65 (64%), Gaps = 2/65 (3%)

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           +D    +++  L  LG  +NT +++TSDHGYHLG+ GL + +S  FE   RVP ++ GPG
Sbjct: 45  MDAQTGRVLAALDRLGLRDNTIVVFTSDHGYHLGEHGLWQKQSL-FEESTRVPLIVAGPG 103

Query: 231 IV-PG 234
           +  PG
Sbjct: 104 VTKPG 108


>gi|32476327|ref|NP_869321.1| sulfatase yidj [Rhodopirellula baltica SH 1]
 gi|32446872|emb|CAD78778.1| putative sulfatase yidj [Rhodopirellula baltica SH 1]
          Length = 527

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 43/168 (25%), Positives = 79/168 (47%), Gaps = 11/168 (6%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
           +  D + + +  F+R+     S++P    +S P PHGP      Y  MF N+     P  
Sbjct: 231 FSTDWLCDRAADFIRE----HSQEPFCYHLSLPDPHGPNTVRQPYDTMFENMPVR--PPM 284

Query: 132 DYAPNPDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
            +  + D+  W+    R  Q   ++F   LMT+    ++ +DD V  ++  L +L     
Sbjct: 285 TFQLDGDQPGWLPATNRNSQ---QRFNARLMTQYFGMVRCIDDNVGMLLSLLDELSLTKR 341

Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
           T +++TSDHG    + G +  K  P+E   +VP ++  PG++   + +
Sbjct: 342 TVVVFTSDHGDLCYEHGRLN-KGNPYEGSAKVPMIIAAPGLISAGLRI 388


>gi|159463296|ref|XP_001689878.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283866|gb|EDP09616.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 131

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/125 (33%), Positives = 62/125 (49%), Gaps = 14/125 (11%)

Query: 93  SKKPIMLVMSFPAPHG---------PEDSAPQYANMF--FNVTSHHTPAYDYAPNPDKQW 141
           S KP ML ++  APH          P     +YAN++   N+ +  TP +    NP    
Sbjct: 5   SGKPFMLYVAPTAPHRSSTDGATWYPPTPPKRYANLYQGENLQAPRTPNFGVR-NP--TL 61

Query: 142 ILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGY 201
             + T ++      + D L   RL+ L++VDD V  +V  L + G LNNTY+I+TSD+GY
Sbjct: 62  PRKGTPRVDAAFGAYMDELFVARLRALRAVDDLVGNLVARLNESGVLNNTYVIFTSDNGY 121

Query: 202 HLGQF 206
               F
Sbjct: 122 VSADF 126


>gi|160890624|ref|ZP_02071627.1| hypothetical protein BACUNI_03069 [Bacteroides uniformis ATCC 8492]
 gi|317479838|ref|ZP_07938958.1| sulfatase [Bacteroides sp. 4_1_36]
 gi|156859623|gb|EDO53054.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
 gi|316904044|gb|EFV25878.1| sulfatase [Bacteroides sp. 4_1_36]
          Length = 521

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 40/144 (27%), Positives = 69/144 (47%), Gaps = 3/144 (2%)

Query: 91  SFSKKPIMLVMSFPAPHGPEDSAPQYANMF--FNVTSHHTPAYDYAPNPDK-QWILQVTR 147
           S   KP ++  S   PH P      Y +MF   ++    + A D     ++ Q + ++  
Sbjct: 200 SHKDKPFLMWFSIAEPHNPYQVCEPYYSMFPPESLPEMGSSAKDLNTKGEEYQLLAEMMA 259

Query: 148 KMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG 207
           +    +R+    L +     L+ +DD + + V ELK  G  +NT II+ +DHG ++G++G
Sbjct: 260 QGHVGYRENLQRLRSNYHGMLRMIDDQLSRFVGELKKNGVYDNTIIIFVADHGDYVGEYG 319

Query: 208 LVKGKSFPFEFDIRVPFLMRGPGI 231
           L+K      +   R+P    GPGI
Sbjct: 320 LMKKGVGLDDVLTRIPMQWTGPGI 343


>gi|325110426|ref|YP_004271494.1| sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324970694|gb|ADY61472.1| sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 489

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 55/184 (29%), Positives = 82/184 (44%), Gaps = 13/184 (7%)

Query: 52  NYSINMNGRKIKHGEDY-YNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE 110
           +YS   N    K+ E Y ++         D++R     KQ    KP+ L +SF APH P 
Sbjct: 150 SYSTAKNKSMAKYAEKYPHSTLSYGAFGQDAIR--EAVKQD---KPLCLSISFKAPHKPA 204

Query: 111 DSAPQYANMFFNVT-----SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL 165
              P++ +++   T     +    A ++     KQ   Q  R  +  +    D  M K  
Sbjct: 205 TPDPRFKHVYAGKTFTRPDNFGREAGEHLSEQSKQG-RQYPRFTEWKYDSDYDGEMAKYF 263

Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
           Q + ++D A+  I +EL   G  +NT IIYTSD+GY  G  G    K  P E   RVP +
Sbjct: 264 QQVYAIDVALGMIREELDAQGIADNTVIIYTSDNGYICGAHGY-GSKVLPMEESSRVPLI 322

Query: 226 MRGP 229
           +  P
Sbjct: 323 IYDP 326


>gi|149175816|ref|ZP_01854434.1| iduronate-2-sulfatase [Planctomyces maris DSM 8797]
 gi|148845263|gb|EDL59608.1| iduronate-2-sulfatase [Planctomyces maris DSM 8797]
          Length = 492

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 66/274 (24%), Positives = 108/274 (39%), Gaps = 52/274 (18%)

Query: 17  ISHGYFGKYLNKYNGS-----YIPPGW-----------REWGALIMNSKYYNYSINMNGR 60
           ISH   G+   +YNG+      +P  W           R WG       Y N     +G 
Sbjct: 141 ISHTADGRVF-EYNGTGDGRDELPHAWDELATPFGSWKRGWGIFFA---YANGRSREDGS 196

Query: 61  KIKHGEDY---YNDYYPD-LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY 116
            I+   ++    ++  PD L+A  ++  LR+ K+    KP  L + F  PH P  +  Q 
Sbjct: 197 GIRDLMEFKVEQDEELPDGLLARQAIEKLREYKEG--GKPFFLGLGFFKPHLPFVAPKQD 254

Query: 117 ANMFFNVTSHHTPAYDYAPNPDKQWILQVTR---------KMQPVHRQFTDLLMTKRLQT 167
            + F NV     P   +   P+  +  +            K +P+ R+  + +    L  
Sbjct: 255 WDAFENV---EIPPVPHPEKPESAYWHKSGEFYNYDMEFEKTRPLSREARENVRRGYLAC 311

Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
           ++  D  V K++  L +LG   NT +I   DHG+ LG   L   K  P E  ++   ++R
Sbjct: 312 VRYTDRQVGKVLTALDELGMRENTIVIVWGDHGWFLGDSALW-AKHAPLERALKSTLMIR 370

Query: 228 GPGIVPGTM-------------YLLDVLIPQVRK 248
            PG+    +              L+D+  P  RK
Sbjct: 371 APGVAEAGLKSAALVETVDIYPTLIDLCQPAFRK 404


>gi|283781717|ref|YP_003372472.1| sulfatase [Pirellula staleyi DSM 6068]
 gi|283440170|gb|ADB18612.1| sulfatase [Pirellula staleyi DSM 6068]
          Length = 511

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 61/243 (25%), Positives = 106/243 (43%), Gaps = 33/243 (13%)

Query: 10  RNLLYFDISHGYFG--KYLNKYN--GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHG 65
           R L    I HG  G  K L +++  G     G +    LI  +   N+ + M+  K  H 
Sbjct: 136 RTLTTGKIYHGGLGPKKRLEEFDVWGPAGGIGAKPEKKLIPPTPMGNHPL-MDWGKFDHR 194

Query: 66  EDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE-------DSAPQYAN 118
           ++   DY    I + ++  L    Q  ++ P+ L + +  PH P        D  PQ   
Sbjct: 195 DEDKGDYQ---ITSWAIEQLDDQVQHHAETPMFLSVGYFLPHVPCFISPKWYDEVPQGDK 251

Query: 119 MFFNVTS---HHTPAYDY-----APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQS 170
           +   V +      P + +      P P  +W+          HRQ+ +L+ +  L +   
Sbjct: 252 LLPLVAADDRSDIPRFAWYLHWSLPEPRLKWVED--------HRQWENLVRSY-LASTTF 302

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           VD  + +++  L++    ++T ++   DHG+HLG+ G + GK+  +E   RVP +  GPG
Sbjct: 303 VDAQIGRLLTALEERKLADDTIVVVWGDHGWHLGEKG-ITGKNTLWERSTRVPLIFAGPG 361

Query: 231 IVP 233
           I P
Sbjct: 362 ITP 364


>gi|423223239|ref|ZP_17209708.1| hypothetical protein HMPREF1062_01894 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392639340|gb|EIY33165.1| hypothetical protein HMPREF1062_01894 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 516

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 41/161 (25%), Positives = 75/161 (46%), Gaps = 7/161 (4%)

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHHT 128
           D  P  I ++++ ++   K +    P  + +SFP PH P      Y +MF    + +  T
Sbjct: 177 DQQPTRIVDEAIEWIDSQKDN----PFFVWVSFPEPHNPYQVCEPYYSMFAPDKIPAVKT 232

Query: 129 PAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
              D     +K  IL ++     P   +    L    +  ++ +DD ++++V+ELK+ G 
Sbjct: 233 SRKDILKKGEKYRILAELEDASCPDLERDLPRLRGNYMGMIRLIDDQIKRLVEELKEKGL 292

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
              T I+  SDHG + G++GL++      E   R+P +  G
Sbjct: 293 FEKTIIVVLSDHGDYCGEYGLIRKGVGLSESLTRIPMVWAG 333


>gi|294056299|ref|YP_003549957.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293615632|gb|ADE55787.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 563

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/197 (24%), Positives = 89/197 (45%), Gaps = 39/197 (19%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV----------- 123
           D IA+ ++ + ++ +     +P  L + F APH   + AP+Y +   +V           
Sbjct: 186 DCIADSALEWFQELRDP--DQPFFLKLQFKAPHDYFEYAPRYEDYLADVEIPEPASMWNN 243

Query: 124 --------------------TSHHTPAY--DYAPNPDKQWILQVTRKM--QPVHRQFTDL 159
                               TS     Y  +   + +  W  +V R +      +Q   L
Sbjct: 244 ERNGSIATRGVDDELLPYIGTSVGARHYRRNMVDDENAPWAHKVDRSLTGDAAKKQAYQL 303

Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
            +   L+ ++ VDD V++++  ++  G L+NT +IYT D G++LG+   +  K + +E  
Sbjct: 304 YLKAYLRCVKGVDDNVKRVLDYMEREGILDNTVVIYTGDQGFYLGEHDYID-KRWGYEEA 362

Query: 220 IRVPFLMRGP-GIVPGT 235
           +R+PF++R P GI  GT
Sbjct: 363 MRMPFIVRYPKGIPAGT 379


>gi|417300771|ref|ZP_12087962.1| arylsulfatase A [Rhodopirellula baltica WH47]
 gi|327542922|gb|EGF29375.1| arylsulfatase A [Rhodopirellula baltica WH47]
          Length = 309

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 53/97 (54%), Gaps = 9/97 (9%)

Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG----LVKGKSFPFEFDIRV 222
           T+Q +D +  +I+K +KDLG   NT +I+TSD+G + G+ G    L  GK    E  +RV
Sbjct: 88  TIQELDWSTGEILKAVKDLGLDKNTLVIFTSDNGPNTGKGGSAGVLKGGKGSTLEGGVRV 147

Query: 223 PFLMRGPGIVPGTMYLLDV-----LIPQVRKFSSGSL 254
           PF+ R PG +P      +      L+P + K + G +
Sbjct: 148 PFVARWPGTIPAGTESDEAITGMDLLPTLTKLAGGEV 184


>gi|422872814|ref|ZP_16919299.1| arylsulfatase [Clostridium perfringens F262]
 gi|380306241|gb|EIA18513.1| arylsulfatase [Clostridium perfringens F262]
          Length = 481

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 71/155 (45%), Gaps = 5/155 (3%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           + N S+ FLR+   S   KP  L MSF  PH P D    Y +M+ +         D+A  
Sbjct: 183 VVNGSIDFLRRRDPS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239

Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
            D +   +    ++ + +++           ++  +D  + + +  L + GELNNT  ++
Sbjct: 240 EDVENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLF 299

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
            SDHG  +G     + K  P+E   RVPF +  PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSARVPFFIYDPG 333


>gi|315645058|ref|ZP_07898184.1| sulfatase [Paenibacillus vortex V453]
 gi|315279479|gb|EFU42784.1| sulfatase [Paenibacillus vortex V453]
          Length = 512

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 40/64 (62%)

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           +D  + KI+ +L DLG   NT +++T+DHG+  GQ GL     F +E  IR+PF++R P 
Sbjct: 318 MDKYIGKILDKLDDLGLAENTVVVFTTDHGHFFGQHGLQAKGGFHYEDLIRIPFMVRYPD 377

Query: 231 IVPG 234
            VP 
Sbjct: 378 QVPA 381


>gi|430747568|ref|YP_007206697.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430019288|gb|AGA31002.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 491

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 38/156 (24%), Positives = 76/156 (48%), Gaps = 25/156 (16%)

Query: 95  KPIMLVMSFPAPHGPEDSAPQYANMFFNVT----------SHHTPAYDY-----APNPDK 139
           +P ++ + F  PH P  ++ ++ +++ + T              P + +      P P  
Sbjct: 204 QPFLVAVGFRLPHVPCFASQRWFDLYPDETLVMPPVKVDDRDDVPEFSWYLHWKLPEPRL 263

Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
            W+L+  ++ +P+ R +        L ++  +D  V +++  L+  G+ + T ++  SDH
Sbjct: 264 SWLLEA-KQWRPLVRSY--------LASISFMDSQVGRVLDALEVSGQADRTVVVLWSDH 314

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           G+HLG+ G + GK+  +E   RVP +  GPGI  G 
Sbjct: 315 GWHLGEKG-ITGKNTLWERSTRVPLIFAGPGIAAGA 349


>gi|423288210|ref|ZP_17267061.1| hypothetical protein HMPREF1069_02104 [Bacteroides ovatus
           CL02T12C04]
 gi|392671099|gb|EIY64575.1| hypothetical protein HMPREF1069_02104 [Bacteroides ovatus
           CL02T12C04]
          Length = 516

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 69/143 (48%), Gaps = 3/143 (2%)

Query: 95  KPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
           KP  +++S P PH P+ ++ +Y   + N+  +  P      + D++       +      
Sbjct: 209 KPFYMMLSIPDPHSPDIASEEYLQKYINL-DYEAPETMVTNDTDQRPRWARGGQYNVNKD 267

Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
           +F    +      ++ VDD V +I+  L      +NT +++T+DHG  L +   V  K  
Sbjct: 268 KFDKKALANYFAMVECVDDNVGRILDFLDKNNLTDNTIVVFTADHGDMLYEHSRV-NKGL 326

Query: 215 PFEFDIRVPFLMRGP-GIVPGTM 236
           P+E   R+PF++R P  I+PG +
Sbjct: 327 PYESSARIPFVIRYPEKIIPGKI 349


>gi|377813493|ref|YP_005042742.1| choline-sulfatase [Burkholderia sp. YI23]
 gi|357938297|gb|AET91855.1| choline-sulfatase [Burkholderia sp. YI23]
          Length = 511

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 52/179 (29%), Positives = 83/179 (46%), Gaps = 18/179 (10%)

Query: 86  RQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD---YAPNPDKQWI 142
           R+       +P M+V S   PH P     +Y +M+ +      P+Y     A +P  + +
Sbjct: 180 RERHAGKDARPFMMVASLTHPHDPYAIPRKYWDMYRD-EDIDMPSYRDSLEACDPHSKRL 238

Query: 143 LQV--TRKMQPVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
             V    +  P  RQ   +L  +R     +  VDD    I++ L+  G  ++T ++ TSD
Sbjct: 239 RHVYEADRTPPTERQ---ILNARRAYYGAVSYVDDQFASILEALEQAGLADDTIVVVTSD 295

Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-----IVPGTMYLLDVLIPQVRKFSSG 252
           HG  LG+ GL    +F FE   RVP ++  P       VP ++  LDVL P + +F+ G
Sbjct: 296 HGEMLGERGLWYKMTF-FEGGCRVPLIVHAPKRFRAHRVPDSVSHLDVL-PTLVEFARG 352


>gi|32476803|ref|NP_869797.1| iduronate-2-sulfatase [Rhodopirellula baltica SH 1]
 gi|32447349|emb|CAD77175.1| iduronate-2-sulfatase [Rhodopirellula baltica SH 1]
          Length = 571

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 64/250 (25%), Positives = 106/250 (42%), Gaps = 33/250 (13%)

Query: 15  FDISHGYFGK-------YLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGED 67
           F I HG  G        +  +    Y+ P   + G L     Y+   +    + +  G  
Sbjct: 201 FHIGHGNHGDPESFSVPHFKEKVIEYLEPASTDGGQLTREEAYFTNQMLGRIKTLPRGAA 260

Query: 68  YYN------DYYPDLIANDSVRFLRQSKQSFSKK--PIMLVMSFPAPHGPEDSAPQ---- 115
           Y +      DY    +A ++++ L+ +KQ    +  P  +   F  PH P  SAPQ    
Sbjct: 261 YESPDAKDEDYADGRVAAETIQRLQAAKQRQKTEGTPFFIASGFARPHLPF-SAPQKYWD 319

Query: 116 -YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPV----HRQFTDLLMTKRLQ---- 166
            Y      + +H T   D AP    +   +++   +PV    +  F D L    +     
Sbjct: 320 LYDPASLPMPTHETLPVD-APKVAGKRGGEIS-NYKPVPTEPNADFDDELKRNLIHGYYA 377

Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
           ++  VD  + K++KEL  L  L+NT ++   DHG+HLG  G+   K   +E   R+P L+
Sbjct: 378 SVSYVDAQIGKVIKELDRLELLDNTIVVLWGDHGFHLGDLGIWT-KHTNYEQANRIPILI 436

Query: 227 RGPGIV-PGT 235
             PG+  PG+
Sbjct: 437 TAPGVTQPGS 446


>gi|350568607|ref|ZP_08937005.1| arylsulfatase [Propionibacterium avidum ATCC 25577]
 gi|348660850|gb|EGY77546.1| arylsulfatase [Propionibacterium avidum ATCC 25577]
          Length = 491

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 42/161 (26%), Positives = 71/161 (44%), Gaps = 7/161 (4%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           + N + R+  +   +    P  L +SF  PH P D  PQ+A   ++      PA     +
Sbjct: 191 VVNQATRWFYRRDPT---TPFFLYLSFHRPHAPYD-PPQWAFDRYDGAPLRPPAVGDWSD 246

Query: 137 PDKQWILQVTRKMQPVHRQFTDL--LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
               W             +  D+   M      +  +D  + ++ + L + G  +NTYI 
Sbjct: 247 TFSDWRNDADPTSLVARYRPRDISRAMAGYYGHMTHIDTQISRLRQVLGEFGVADNTYIA 306

Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           + SDHG  +G   L + K +P++    +PFL+ GPGI PG+
Sbjct: 307 FISDHGDMMGDHHLWR-KGYPYQGSTHIPFLLAGPGITPGS 346


>gi|270290492|ref|ZP_06196717.1| sulfatase [Pediococcus acidilactici 7_4]
 gi|270281273|gb|EFA27106.1| sulfatase [Pediococcus acidilactici 7_4]
          Length = 485

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/168 (23%), Positives = 77/168 (45%), Gaps = 5/168 (2%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           + ++S++FL++   +    P +L MSF  PH P +    Y +M+      H   +     
Sbjct: 189 VVSESIKFLQRRDPT---APFLLKMSFEKPHAPLNPPKYYFDMYMERLPKHIDLHIGNWE 245

Query: 137 PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
            +   +  +      +       ++   L  +  +D  + + +  LK+ GE  NT I + 
Sbjct: 246 KESNEMPNIYALKGCLKEDDQRRMVAAYLGLITHIDHQISRFLTALKEFGEDKNTIIWFV 305

Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVPGTMYLLDVLI 243
           SDHG  LG+  L + K +P++  IR+P  +  PG ++ G  + +  L+
Sbjct: 306 SDHGDQLGEHYLFR-KGYPYQGSIRIPSFIYDPGNLIAGHKHKIKQLV 352


>gi|297182883|gb|ADI19034.1| arylsulfatase a and related enzymes [uncultured alpha
           proteobacterium HF0070_05I22]
          Length = 512

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 55/178 (30%), Positives = 82/178 (46%), Gaps = 23/178 (12%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY----- 131
           +A+ S+ +L +  +    KP  L +SFP PH P D  P+  ++  N      P +     
Sbjct: 207 VADRSIDWLSKRHKD---KPFCLWVSFPDPHHPFD-CPEPWSLLHNPEEVDLPEFLEKDL 262

Query: 132 DYAP---------NPD-KQWILQVTRK---MQPVHRQFTDLLMTKRLQTLQS-VDDAVEK 177
           D  P          PD K  +L+  RK     P   +     MT     + S +D +V +
Sbjct: 263 DERPWWHKRALEDEPDLKDPVLKRFRKEGSRMPDQTEAQLREMTANYYGMISLIDHSVGR 322

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
           IV  L +   L+ T +IYTSDHG HLG+ GL       ++  I V  ++RGPGI  G+
Sbjct: 323 IVACLNENNILDETIVIYTSDHGDHLGERGLYLKGPMLYDSLINVGMIVRGPGIEAGS 380


>gi|374314953|ref|YP_005061381.1| arylsulfatase A family protein [Sphaerochaeta pleomorpha str.
           Grapes]
 gi|359350597|gb|AEV28371.1| arylsulfatase A family protein [Sphaerochaeta pleomorpha str.
           Grapes]
          Length = 463

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 72/158 (45%), Gaps = 12/158 (7%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           I +D++ +LR  K     K   LV    A H P +  P+    ++      T    +  +
Sbjct: 161 ITDDTIEYLRNRKDE--NKSFFLVSGHYATHSPFEGHPERLVEYYRRKGVDT----FLKH 214

Query: 137 PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
            +  + +Q    + P      + L  +    +  +D++V +++ EL+  G L NT IIYT
Sbjct: 215 ANYPFGIQRNESLYPSRTDRKEAL-CQYYAGISQIDESVGRVLDELERQGILENTIIIYT 273

Query: 197 SDHGYHLGQFGLV--KGKSFPF---EFDIRVPFLMRGP 229
           +DHG + GQ GL      ++P    E DIRVP +   P
Sbjct: 274 ADHGLNCGQHGLFGKANATYPINMVEEDIRVPLIFYAP 311


>gi|374620849|ref|ZP_09693383.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374304076|gb|EHQ58260.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 551

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 52/200 (26%), Positives = 92/200 (46%), Gaps = 21/200 (10%)

Query: 41  WGALIMNSKYYNYSINMNGRKIKHGEDYYND--YYPDLIANDSVRFLRQSKQSFSKKPIM 98
           +G L     +Y    N+ G+  +      +D  Y   L+A++  R++R   +   +KP  
Sbjct: 157 YGHLHTEVGFYPPFANVGGKDFQENGVSIDDEGYETYLLADEVSRYIRDRDE---EKPFF 213

Query: 99  LVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTD 158
           + M F APH P D+  +    + ++ +      D AP    Q     TR+M  +  Q + 
Sbjct: 214 IYMPFIAPHTPLDAPQELQEKYKDIET------DLAPARSNQ--TDSTRRMAKLMMQPSA 265

Query: 159 LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG------LVKGK 212
             M   +  + ++D A+ +++  L + G  +NT +++ SD+G     +G      L  GK
Sbjct: 266 RPMYAAV--VDAMDQAIGQVLDTLDEEGLADNTIVLFFSDNGGAAYSYGGADNAPLRGGK 323

Query: 213 SFPFEFDIRVPFLMRGPGIV 232
              FE  IRV  LMR PG++
Sbjct: 324 GETFEGGIRVVSLMRWPGVL 343


>gi|255505698|ref|ZP_05347616.3| putative arylsulfatase [Bryantella formatexigens DSM 14469]
 gi|255266363|gb|EET59568.1| arylsulfatase [Marvinbryantia formatexigens DSM 14469]
          Length = 408

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 45/180 (25%), Positives = 86/180 (47%), Gaps = 21/180 (11%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP--EDSAPQYANMFFNVTSHHTP 129
           Y  D I +D+V FL    +   ++P  L +++ APH P   +   +Y  ++ +      P
Sbjct: 83  YVTDAITDDAVSFLESRTED--RQPFYLHVAYTAPHAPWLNNHPKEYTELYKDCPFESVP 140

Query: 130 AYDYAP-NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
              + P +PD  ++     K +  +       +      + ++D  + +I+ +L ++G  
Sbjct: 141 ---HLPRHPDSIYLTDEVAKDERGN-------LIGYYAAITAMDAGIGRILGKLDEMGLT 190

Query: 189 NNTYIIYTSDHGYHLGQFGL-VKGKS-FP---FEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
            NT +++TSD+G+  G  G   KG + FP   +E  I+VPF+   PG +P    + D L+
Sbjct: 191 ENTLVVFTSDNGFSCGHHGFWGKGNATFPLNMYEESIKVPFIACHPGHIP-EGSVCDALV 249


>gi|425736217|ref|ZP_18854524.1| choline-sulfatase [Brevibacterium casei S18]
 gi|425478434|gb|EKU45626.1| choline-sulfatase [Brevibacterium casei S18]
          Length = 619

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 73/156 (46%), Gaps = 9/156 (5%)

Query: 86  RQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQV 145
           R ++ +   +P ++V SF  PH P +   ++ + F   T+   PA+   P+P        
Sbjct: 179 RANQAAGEDQPFLMVTSFIHPHDPYEPPREHWDRFAE-TAIPDPAHPEVPDPAVDPHSHR 237

Query: 146 TRKMQPVHRQFTDL-----LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
            R M  +  +   +             +  +DD V  I+  L+DLG   NT +I TSDHG
Sbjct: 238 LRTMSGLDEREPAIEDVRRARRAYYAAVSYIDDHVGAIMARLRDLGLDANTVVIVTSDHG 297

Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPG--IVPG 234
             LG+ GL   K  P+E   RVP ++ GP   ++PG
Sbjct: 298 DMLGEKGLWY-KMSPYEQSSRVPLIVAGPAEVVLPG 332


>gi|357386109|ref|YP_004900833.1| arylsulfatase [Pelagibacterium halotolerans B2]
 gi|351594746|gb|AEQ53083.1| Arylsulfatase [Pelagibacterium halotolerans B2]
          Length = 487

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 56/209 (26%), Positives = 94/209 (44%), Gaps = 29/209 (13%)

Query: 37  GWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL-----IANDSVRFLRQSKQS 91
           G+ +W  L   +  +  ++  NGR+  H       + PDL      A+ ++ FL  SK  
Sbjct: 139 GYAQW--LAETAPAFRNALARNGRRELH-------HPPDLHMSKWAADRTIDFLHASKDD 189

Query: 92  FSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP 151
              +P   +MS   PH P +  P  A +   +     PA    P  D+   ++  ++   
Sbjct: 190 --GRPFFCLMSLFDPHDPYEDYP--AALAGAIDDARIPA-PIPPRADQPRCVKREQEGSY 244

Query: 152 VHRQFTDL-------LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLG 204
           + R F+D        +      ++  +D  V +++  L D G   +T +I+TSDHG  LG
Sbjct: 245 LGR-FSDFSPHEIASIRKGYAASIAFLDRQVGRVLDALDDAGLTEDTLVIFTSDHGDQLG 303

Query: 205 QFGL-VKGKSFPFEFDIRVPFLMRGPGIV 232
             GL VKG +  +E  + VP L+R PG +
Sbjct: 304 DHGLFVKGVAL-YEPTVGVPLLLRWPGKI 331


>gi|168335540|ref|ZP_02693609.1| choline sulfatase [Epulopiscium sp. 'N.t. morphotype B']
          Length = 419

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 50/187 (26%), Positives = 81/187 (43%), Gaps = 25/187 (13%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF-------- 121
           N +  D+  + ++ FL    Q   +KP  L  +  APH P ++  +Y  M+         
Sbjct: 124 NMHSTDIFCDTALEFLENYDQ---EKPFYLYTALVAPHDPRNAPEKYEEMYSPDDVSTPE 180

Query: 122 NVTSHHTPAYDYAPNPDKQWILQV-TRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
           N    HT  +D      +  +L+   R+   + R   D         +  +D     ++K
Sbjct: 181 NFLPQHT--FDNGDLWVRDELLEFFPRREYSIRRHIADYY-----SMISHIDARFGDVMK 233

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-----IVPGT 235
           +LKD G   NT II+  D+G  LGQ GL+ GK   +E  IRVP +++  G     +    
Sbjct: 234 KLKDKGLYENTIIIFAGDNGLALGQHGLM-GKQSVYEHSIRVPLIVKPAGEFVPRMTDAY 292

Query: 236 MYLLDVL 242
            YL D+ 
Sbjct: 293 AYLCDIF 299


>gi|449136380|ref|ZP_21771768.1| sulfatase family protein [Rhodopirellula europaea 6C]
 gi|448885000|gb|EMB15464.1| sulfatase family protein [Rhodopirellula europaea 6C]
          Length = 480

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/169 (25%), Positives = 80/169 (47%), Gaps = 11/169 (6%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D + + +  F+R+     S++P    +S P PHGP      Y  MF ++     P   + 
Sbjct: 187 DWLCDRAADFIRE----HSQEPFCYHLSLPDPHGPNTVRQPYDTMFEDMPVR--PPMTFQ 240

Query: 135 PNPDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
            +  K  W+    R  +   ++F   LMT+    ++ +DD V  ++  L +L     T +
Sbjct: 241 LDGTKPGWLPGTNRNAE---QRFNARLMTQYFGMVRCIDDNVGMLLALLDELSLTERTVV 297

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
           ++TSDHG    + G +  K  P+E   +VP ++  PG+VP  + + + +
Sbjct: 298 VFTSDHGDLCYEHGRLN-KGNPYEGSAKVPMIVAAPGLVPAGLRIDEAM 345


>gi|189459670|ref|ZP_03008455.1| hypothetical protein BACCOP_00298 [Bacteroides coprocola DSM 17136]
 gi|189433629|gb|EDV02614.1| arylsulfatase [Bacteroides coprocola DSM 17136]
          Length = 387

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/160 (26%), Positives = 79/160 (49%), Gaps = 17/160 (10%)

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPAYDY 133
           A+ ++ FL+ ++ +   +P M  +SF +PH P +  P Y   +     +V  +  P + +
Sbjct: 87  ADAAINFLKTTEHN--NQPFMAFVSFTSPHDPRNILPDYGKKYCPAELSVPDNFLPEHPF 144

Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELN 189
               D   +     ++ PV R  T  L+ K L      +  VD  + ++V  L++ G ++
Sbjct: 145 ----DNGELNIRDEQLLPVPR--TPELLKKELSLYYGMVSEVDLQIGRVVDALEESGNMD 198

Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
           NT I++ SD+G  +GQ GL+ GK   +E  + VP  +  P
Sbjct: 199 NTIIVFASDNGLAMGQNGLI-GKQSLYEHSVGVPMAIITP 237


>gi|149195944|ref|ZP_01873000.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Lentisphaera araneosa HTCC2155]
 gi|149140791|gb|EDM29188.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Lentisphaera araneosa HTCC2155]
          Length = 515

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 57/237 (24%), Positives = 108/237 (45%), Gaps = 29/237 (12%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKI--KHGEDYYNDYYPDLI 77
           GY    + K++    P  +  +  L     Y+N   N+ G K   K+     + +  D I
Sbjct: 121 GYETAMIGKWHLKKEPATFDYYCVLPGQGLYHNPIFNIRGSKPWPKNTITKKDQHSSDAI 180

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYAN-----------MFFNVTSH 126
            + S+ +L+  +     KP  L+  F APH   + A +Y +             F+V + 
Sbjct: 181 TDISLHWLKNERDK--SKPFFLMHHFKAPHDMFEYAKRYESYLEDVHIPEPESLFSVPAG 238

Query: 127 HTPAYDYAPNPDKQ---WILQVTRKM-------QPVHRQFT-DLLMTKRLQTLQSVDDAV 175
              + D      K    W  Q+ +K+       +P + + +    +   L+ ++ +DD +
Sbjct: 239 SAGSKDLGSGLSKNHNPW--QLPQKLGVSDDIPEPEYTRLSYQKYLKAYLRCVKGIDDNI 296

Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
            +++  LKD  +L+NT IIYTSD G+ LG+  L+  K + +E  + +PF++  PG++
Sbjct: 297 ARLLSYLKDSNQLDNTIIIYTSDQGFFLGEHNLI-DKRWMYEEAMGMPFIVYAPGMI 352


>gi|293375272|ref|ZP_06621554.1| arylsulfatase [Turicibacter sanguinis PC909]
 gi|325842515|ref|ZP_08167686.1| arylsulfatase [Turicibacter sp. HGF1]
 gi|292646028|gb|EFF64056.1| arylsulfatase [Turicibacter sanguinis PC909]
 gi|325489559|gb|EGC91923.1| arylsulfatase [Turicibacter sp. HGF1]
          Length = 442

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/83 (40%), Positives = 42/83 (50%), Gaps = 7/83 (8%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYH-------LGQFGLVKGKS 213
           M   L    S+D  V +I+  LK+ G   NT IIYTSDHG H       +  FG    K 
Sbjct: 228 MPDYLGCCHSLDQNVGRIISVLKEQGIYENTLIIYTSDHGCHFKTMKKYINPFGADDYKR 287

Query: 214 FPFEFDIRVPFLMRGPGIVPGTM 236
            P E  I +P +MRGPG + G M
Sbjct: 288 SPDENSIHIPLVMRGPGFLGGVM 310


>gi|29348504|ref|NP_812007.1| sulfatase [Bacteroides thetaiotaomicron VPI-5482]
 gi|29340409|gb|AAO78201.1| putative sulfatase [Bacteroides thetaiotaomicron VPI-5482]
          Length = 516

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/162 (24%), Positives = 76/162 (46%), Gaps = 7/162 (4%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
            + +P  I N+++ ++   K++    P  + +SFP PH P      Y +MF    +    
Sbjct: 175 EEQHPAKIVNETLSWIESQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 230

Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
           T   D     +K  IL ++     P   Q    +    +  ++ +DD ++++++ LK  G
Sbjct: 231 TSRKDLEKKGEKYRILAELEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 290

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           +  NT  +  SDHG + G++GL++  +   E   R+P +  G
Sbjct: 291 QFENTIFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 332


>gi|311748195|ref|ZP_07721980.1| sulfatase family protein [Algoriphagus sp. PR1]
 gi|126576685|gb|EAZ80933.1| sulfatase family protein [Algoriphagus sp. PR1]
          Length = 481

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 45/175 (25%), Positives = 81/175 (46%), Gaps = 10/175 (5%)

Query: 75  DLIANDSVRFLRQSK--QSFSKKPIMLVMSFPAPHGPEDSAPQYANM--FFNVTSHHTPA 130
           DL+ +D     + S+  +    +P  L + F  PH P  +   Y +   F  +T+     
Sbjct: 191 DLVHSDGKTAKKASELIREHKDEPFFLAVGFVRPHVPFVAPASYFDPYPFAKITTPKQAP 250

Query: 131 YDYAPNPDK--QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
            D+   P++   ++  V  +M    +Q     +     ++  +D  V K++  LK+ G  
Sbjct: 251 NDWEDIPERGINYVTSVNAEMNQEQKQKA---IAAYYASVSYMDTQVGKVLTTLKEEGLE 307

Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
           NNT I++TSDHG+HLG+       S   E  ++VP +++ PG  P  ++    LI
Sbjct: 308 NNTIIVFTSDHGFHLGEHEFWMKVSL-HEESVKVPMMIKVPGKNPAVVHSFTELI 361


>gi|109900202|ref|YP_663457.1| sulfatase [Pseudoalteromonas atlantica T6c]
 gi|109702483|gb|ABG42403.1| sulfatase [Pseudoalteromonas atlantica T6c]
          Length = 486

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 73/157 (46%), Gaps = 16/157 (10%)

Query: 82  VRFLRQSKQSF----SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
            RFL +    F    + KP  + +++  PH P +  P+       + S + P        
Sbjct: 183 ARFLTEKAIEFIDESTDKPFFININYVEPHFPFEGLPE------RLVSQYRPVARKLLRD 236

Query: 138 DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
                L +  K   V +   + L ++ L  +  +DD V +I+  L+  G L+NT I + S
Sbjct: 237 GGNSSLALASKDTAVPKDHEEKL-SQYLAAISLIDDQVGQIMDALEGRGLLDNTIIAFVS 295

Query: 198 DHGYHLGQFGLV--KGKSFPFEF---DIRVPFLMRGP 229
           DHG  +GQ+GL      SFP+ F    +R+PF++ GP
Sbjct: 296 DHGMLMGQYGLYGKTNASFPYNFYEETVRIPFIIYGP 332


>gi|325263386|ref|ZP_08130120.1| choline sulfatase [Clostridium sp. D5]
 gi|324031095|gb|EGB92376.1| choline sulfatase [Clostridium sp. D5]
          Length = 464

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/181 (26%), Positives = 83/181 (45%), Gaps = 26/181 (14%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF--------FNVTSH 126
           +L+ + +V FL +  +    +P  L  ++ APH P     +Y  ++         N    
Sbjct: 177 ELLTDMAVSFLDEYTED---RPFFLYTAYLAPHDPRTMPDKYKELYQPENMTLPVNFQKQ 233

Query: 127 HTPAYDYAPNPDKQWILQ-VTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
           H   +++  +  +  +L    R  + + RQ  +         +  +DD + KI++ LK  
Sbjct: 234 H---FEFGVSGIRDELLAPYPRTEEEMRRQLAEYYAM-----ITHLDDEIGKIMEALKRN 285

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLD 240
            +  NT II T D+G  +G  GL+ GK   +E  IR+P +  GPGI  G      +YLLD
Sbjct: 286 QQWENTIIILTGDNGLAIGSHGLM-GKQNLYEESIRIPLIFAGPGIPKGQVRDQFVYLLD 344

Query: 241 V 241
           +
Sbjct: 345 I 345


>gi|260429935|ref|ZP_05783910.1| iduronate 2-sulfatase [Citreicella sp. SE45]
 gi|260418858|gb|EEX12113.1| iduronate 2-sulfatase [Citreicella sp. SE45]
          Length = 371

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 52/182 (28%), Positives = 79/182 (43%), Gaps = 19/182 (10%)

Query: 93  SKKPIMLVMSFPAPHGPEDSAPQYAN--MFFNVTSHHT---------PAYDYAPNPDKQW 141
            +KP+ L + FP PH P D  P+Y    M  +V   H          P +     PD ++
Sbjct: 155 GEKPLFLSIGFPGPHPPYDPTPEYPGTYMAHDVPLPHVTQDEIDRLPPPWKEQRRPDSEF 214

Query: 142 I-LQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
               V   ++P       +     +  +  +D  V +I+  L+  G L+N  +I+TSDHG
Sbjct: 215 DHYAVHWNLEPSEADLHKMRAYDYVNVVL-IDREVGRILVALERRGRLDNGVVIFTSDHG 273

Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQVRKFSSGSLIFIMSF 260
             LG  GL + K  P E   RVP ++  PG  P   +  D L   V+ F  G  I  ++ 
Sbjct: 274 DTLGDHGLSQ-KWAPCEQVTRVPMVISQPGRFPARRF--DGL---VQLFDLGPTILRLAG 327

Query: 261 LI 262
           L 
Sbjct: 328 LT 329


>gi|257067331|ref|YP_003153586.1| arylsulfatase A family protein [Brachybacterium faecium DSM 4810]
 gi|256558149|gb|ACU83996.1| arylsulfatase A family protein [Brachybacterium faecium DSM 4810]
          Length = 483

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 49/211 (23%), Positives = 84/211 (39%), Gaps = 46/211 (21%)

Query: 65  GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
           GE+    Y  D++   S+ F+ +      ++P  L++   APH P    P+Y +++   T
Sbjct: 136 GEEQIPGYATDIVTRQSLDFIDRRD---PEQPFCLLVHHKAPHRPWIPHPRYEHLYEAGT 192

Query: 125 SHHTPAYDYAPNPDKQWILQVTRK--MQPVHRQFTDL----------------------- 159
                     P P+  W    TR   ++ V     DL                       
Sbjct: 193 ---------IPEPETMWDDHATRSEVVREVAMNLDDLRPTDYKDELPAELEGETEEARRA 243

Query: 160 --------LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
                    M   L+ +Q+VDD++ +I+  L   G   NT ++YTSD G+ LG  G    
Sbjct: 244 RASWKYQRYMRDYLRCVQAVDDSIGEILDHLDQEGLGENTLVVYTSDQGFFLGDHGWFD- 302

Query: 212 KSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
           K    +  + +P L+R P  +P    + D++
Sbjct: 303 KRLMLDESLTMPMLLRWPAQIPAGSRVSDIV 333


>gi|383124694|ref|ZP_09945357.1| hypothetical protein BSIG_1553 [Bacteroides sp. 1_1_6]
 gi|251841147|gb|EES69228.1| hypothetical protein BSIG_1553 [Bacteroides sp. 1_1_6]
          Length = 518

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/162 (24%), Positives = 76/162 (46%), Gaps = 7/162 (4%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
            + +P  I N+++ ++   K++    P  + +SFP PH P      Y +MF    +    
Sbjct: 177 EEQHPAKIVNETLSWIESQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 232

Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
           T   D     +K  IL ++     P   Q    +    +  ++ +DD ++++++ LK  G
Sbjct: 233 TSRKDLEKKGEKYRILAELEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 292

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
           +  NT  +  SDHG + G++GL++  +   E   R+P +  G
Sbjct: 293 QFENTIFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 334


>gi|381204856|ref|ZP_09911927.1| sulfatase [SAR324 cluster bacterium JCVI-SC AAA005]
          Length = 504

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 30/73 (41%), Positives = 45/73 (61%), Gaps = 1/73 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           + + L+T+ S+D++V KI+  L D G   NT +IYTSD G+ LG+ G    K F +E   
Sbjct: 287 LKRYLRTIHSIDESVGKILDYLDDNGLAENTLVIYTSDQGFFLGEHGWYD-KRFMYEESF 345

Query: 221 RVPFLMRGPGIVP 233
           ++PFL R P  +P
Sbjct: 346 QMPFLARFPAEIP 358


>gi|87310895|ref|ZP_01093021.1| iduronate-2-sulfatase [Blastopirellula marina DSM 3645]
 gi|87286410|gb|EAQ78318.1| iduronate-2-sulfatase [Blastopirellula marina DSM 3645]
          Length = 469

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 63/251 (25%), Positives = 102/251 (40%), Gaps = 52/251 (20%)

Query: 18  SHGYF----GKYLNKYNG-------SYIPPGWREWGA----LIMNSKYYNYSINMNGRKI 62
           +HGYF    GK  + Y         S+  P   +WGA      ++ + +       G  +
Sbjct: 87  AHGYFTQNIGKIYHNYRQTLRNDPQSWSTPAVHDWGAHSNDWFVSGEPFGLKSISKGPAV 146

Query: 63  KHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFN 122
           +  +     Y    IA D+V  +R+  ++  K+P  L + F  PH P           FN
Sbjct: 147 QKVDVADEAYLDGRIAADAVLAIRE--RAAQKQPFFLAVGFWKPHLP-----------FN 193

Query: 123 VTSHHTPAYDYAPNPDKQWILQVTRKMQP---VH-----RQFTDLLMTKRLQTLQSV--- 171
               +   YD  P+  +  + Q+ +   P    H     R +TD+  T  +   Q++   
Sbjct: 194 APKPYWDKYD--PDQIRAHLDQLPKSDAPQIAFHPYGEIRSYTDIPKTGDISAEQNLVLN 251

Query: 172 ----------DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
                     D  + K++ EL+  G   NT ++  SDHG+HLG+  L   K+  FE D R
Sbjct: 252 HGYYAAISFLDAQIGKVLHELQRQGLAENTIVVLWSDHGFHLGEHDLW-CKTSNFELDTR 310

Query: 222 VPFLMRGPGIV 232
           VP L+  P   
Sbjct: 311 VPLLIAPPAAA 321


>gi|358066156|ref|ZP_09152690.1| hypothetical protein HMPREF9473_04753 [Clostridium hathewayi
           WAL-18680]
 gi|356696019|gb|EHI57644.1| hypothetical protein HMPREF9473_04753 [Clostridium hathewayi
           WAL-18680]
          Length = 479

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 41/174 (23%), Positives = 86/174 (49%), Gaps = 23/174 (13%)

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP--EDSAPQYANMFFNVTSHHTP 129
           Y  D+I +D++ F+   K+  +++P  L + + APH P   +   +Y +++        P
Sbjct: 154 YVTDVITDDAISFM---KEYANRQPFYLQVGYTAPHAPWLNNHPKEYTDLYEKCPFESCP 210

Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTK--RLQTLQSVDDAVEKIVKELKDLGE 187
             +   +PD  ++ +   K         DL   +        ++D  + +I++++++LG 
Sbjct: 211 MEE--EHPDSIYLTEEVLK---------DLRSNQIGYYAAATAMDRNIGRILEQIEELGI 259

Query: 188 LNNTYIIYTSDHGYHLGQFGL-VKGK-SFP---FEFDIRVPFLMRGPGIVPGTM 236
              T +I++SD+G+  G  G   KG  +FP   +E  ++VPF+   PG++P  +
Sbjct: 260 REETLVIFSSDNGFSCGHHGFWGKGNGTFPINMYESSVKVPFIASQPGVIPAGV 313


>gi|410665791|ref|YP_006918162.1| sulfatase [Simiduia agarivorans SA1 = DSM 21679]
 gi|409028148|gb|AFV00433.1| sulfatase [Simiduia agarivorans SA1 = DSM 21679]
          Length = 548

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 29/74 (39%), Positives = 46/74 (62%), Gaps = 1/74 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   L T++SVD+ V +++  LK  G   NT I+YTSD+G+ LG+ G    K F ++  +
Sbjct: 328 MQDYLATIESVDENVGRLMDYLKAEGLDKNTIIVYTSDNGFFLGEHGWFD-KRFAYQESL 386

Query: 221 RVPFLMRGPGIVPG 234
           +VP ++R PG+ P 
Sbjct: 387 QVPLIVRWPGVTPA 400


>gi|149198504|ref|ZP_01875549.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149138510|gb|EDM26918.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 466

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 53/192 (27%), Positives = 87/192 (45%), Gaps = 20/192 (10%)

Query: 87  QSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF--FNVTSHHTPAYDYAPNPDKQWILQ 144
           Q+ Q    KP  + + F  PH P    P++ N++   ++    +P  D    P K ++  
Sbjct: 185 QALQEDFDKPFFMSVGFFRPHVPLLVPPKWFNLYDEESIVLAPSPKSDLDDVP-KNFLSI 243

Query: 145 VTRKMQPVHRQF--TDL---LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
               + P H++   TD    L    L ++  VD  V +++  LK+    +NT +I  SDH
Sbjct: 244 NDYAVAPTHKEVLATDSHRKLTHAYLASISFVDACVGRVIDALKNSKYADNTIVILWSDH 303

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQVRKFSSGSLIFIMS 259
           G+HLG+      ++  +E   +VP L+ GPGI  G   L              SLI I  
Sbjct: 304 GFHLGEKEHWAKRTL-WEESTKVPLLVYGPGIESGEACL-----------EPASLIDIYP 351

Query: 260 FLINLNTMRAEK 271
            L++L  ++A K
Sbjct: 352 TLVDLCGVKAPK 363


>gi|393788823|ref|ZP_10376949.1| hypothetical protein HMPREF1068_03229 [Bacteroides nordii
           CL02T12C05]
 gi|392653929|gb|EIY47579.1| hypothetical protein HMPREF1068_03229 [Bacteroides nordii
           CL02T12C05]
          Length = 526

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 316 MRDYMKTVKSLDDNVGRVLDYLKEKGMLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 374

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 375 RTPLIMRLP 383


>gi|333378667|ref|ZP_08470397.1| hypothetical protein HMPREF9456_01992 [Dysgonomonas mossii DSM
           22836]
 gi|332883071|gb|EGK03355.1| hypothetical protein HMPREF9456_01992 [Dysgonomonas mossii DSM
           22836]
          Length = 520

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 71/282 (25%), Positives = 110/282 (39%), Gaps = 74/282 (26%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GY    + K++    P G+  +  L     YYN S    G   K+  +    Y  +LI  
Sbjct: 111 GYQTGIVGKWHMQCEPKGFDYYCVLNDQGDYYNPSFKTKGSHGKYIRE--EGYATNLITT 168

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPH-------------------GPED--------- 111
            S+ FL+   +S   KP  L++   APH                    PE          
Sbjct: 169 HSIDFLKNRDKS---KPFALLVHHKAPHRNWMPEAKYYDLYEDVEFPKPETFYDDYESRC 225

Query: 112 SAPQYANMFFNVTSHHTPAYDYAPNP-------DKQW----ILQVTRKMQPVHRQF---- 156
           SA +  +M  + T   T  YD   N        +K+W    + Q   +M P  R+     
Sbjct: 226 SAARTQDMTIDKTM--TFIYDLKLNELKNTPPYNKEWSDGGLQQAMNQMTPAQREAWEKA 283

Query: 157 --------------TDLL--------MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
                          D L        M   L+ ++S+DD V +++  L+  G ++NT I+
Sbjct: 284 YHPKNMEFINSNLSGDALLDWKFQRYMRDYLRCIKSIDDEVGRLIDYLEKEGLMDNTIIV 343

Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP-GIVPGT 235
           YTSD G+++G+ G    K F +E   R P ++R P  I  GT
Sbjct: 344 YTSDQGFYMGEHGWFD-KRFMYEESFRTPLIIRYPKAIKSGT 384


>gi|169343129|ref|ZP_02864154.1| sulfatase family protein [Clostridium perfringens C str. JGS1495]
 gi|169298767|gb|EDS80842.1| sulfatase family protein [Clostridium perfringens C str. JGS1495]
          Length = 481

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 45/159 (28%), Positives = 73/159 (45%), Gaps = 13/159 (8%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           + N+S+ FLR+   S   K   L MSF  PH P D    Y +M+ +         D+A  
Sbjct: 183 VVNESIDFLRRRDPS---KSFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239

Query: 137 PDKQWILQVTRKMQPVHRQFTDLLMTKRLQ-----TLQSVDDAVEKIVKELKDLGELNNT 191
            D++      + +  V +   +    KR +     ++  +D  + + +  L + GELNNT
Sbjct: 240 DDEE---NRGKDINCV-KGIINNKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNT 295

Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
             ++ SDHG  +G     + K  P+E   RVPF +  PG
Sbjct: 296 IFLFVSDHGDMMGDHNWFR-KGIPYEGSSRVPFFIYDPG 333


>gi|440715395|ref|ZP_20895942.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica SWK14]
 gi|436439739|gb|ELP33153.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
           [Rhodopirellula baltica SWK14]
          Length = 510

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 56/183 (30%), Positives = 77/183 (42%), Gaps = 11/183 (6%)

Query: 52  NYSINMNGRKIKHGEDY-YNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE 110
           NY+   N    K+ E Y ++         D +R    SKQ    KP  L +SF APH P 
Sbjct: 163 NYATAKNKSMRKYAEQYPHSTLSYAAFGKDVIR--EASKQD---KPFCLSISFKAPHKPA 217

Query: 111 DSAPQ----YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ 166
              P+    YA   F    +    Y     P  +   Q  R  +  +    D  M +  Q
Sbjct: 218 TPDPRFNHVYAGKQFTKPLNFGREYSKHLAPQSKLGRQYPRFSEWKYDTDYDGEMARYHQ 277

Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
            + ++D A+  I  ELK  G  +NT +IYTSD+GY  G  G    K  P E   RVP ++
Sbjct: 278 QVYAIDVALGVIRDELKAQGVADNTVVIYTSDNGYICGSHGY-GSKVLPMEESSRVPLMI 336

Query: 227 RGP 229
             P
Sbjct: 337 YDP 339


>gi|343085137|ref|YP_004774432.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342353671|gb|AEL26201.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 542

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 35/91 (38%), Positives = 53/91 (58%), Gaps = 3/91 (3%)

Query: 146 TRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQ 205
            +K Q  H +F +  +   L+T++SVDD V +++  L+  G   NT ++YTSD G++LG+
Sbjct: 312 AKKKQIAHWKF-NRYIKDYLRTIKSVDDGVGEVLDYLEANGLAENTIVVYTSDQGFYLGE 370

Query: 206 FGLVKGKSFPFEFDIRVPFLMRGPG-IVPGT 235
            G    K F +E   R P L+R P  I PGT
Sbjct: 371 HGWFD-KRFMYEQSFRTPLLVRYPKEIKPGT 400


>gi|284036170|ref|YP_003386100.1| sulfatase [Spirosoma linguale DSM 74]
 gi|283815463|gb|ADB37301.1| sulfatase [Spirosoma linguale DSM 74]
          Length = 485

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 50/171 (29%), Positives = 82/171 (47%), Gaps = 22/171 (12%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ--YANMFFNVTSHHT---P 129
           DL+      FL+ +    + KP  L +SF APH  + + P+  YA  F ++    T   P
Sbjct: 181 DLMGQQMDEFLQGNP---AGKPFCLSVSFKAPHAQDAANPEFPYAERFTDLYRDQTLKRP 237

Query: 130 AYD----YAPNPDKQWIL---QVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKI 178
           A      Y   PD  W     Q   +++   R  TD +  +  ++    +  +DD V  +
Sbjct: 238 AAADDKYYRQFPD--WFRHNDQNESRIRWSRRFATDSMFQQTTKSYNRLITGIDDVVGNL 295

Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
            + L++ G  +NT IIYTSD+G++ G++G    K +  E  IRVP ++  P
Sbjct: 296 RRTLQERGLADNTIIIYTSDNGFYEGEYGFAD-KWYGHELSIRVPLIIYDP 345


>gi|149198324|ref|ZP_01875370.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149138620|gb|EDM27027.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 514

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 53/235 (22%), Positives = 98/235 (41%), Gaps = 37/235 (15%)

Query: 37  GWREWGALIMNSKYYNYS-INMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKK 95
           G+  W  L    KY++   +   G+ I  GE     +  D+I   ++ ++   +     K
Sbjct: 132 GFDYWEVLPGQGKYWDPEFVTETGKTIYPGE-----HSSDVITRRALNWMNNERDK--SK 184

Query: 96  PIMLVMSFPAPHG-----------------PE-----------DSAPQYANMFFNVTSHH 127
           P ML++ F APH                  PE            +A +Y +M    + + 
Sbjct: 185 PFMLMVHFKAPHRSWQPTTRWKKKFSTMTFPEPDTLFDDYQGRGTAAKYQDMNIEHSMNM 244

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
                   +P K+++ +     + + +    + M   L  +  VD+ + KI+ +L + G 
Sbjct: 245 VGDLKSNQSPRKEFLKKNALTGKALVKWKYQMYMRDYLACIAGVDENIGKILDQLAESGL 304

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
             NT ++Y+SD G++LG+ G    K F +E   R P L R PG++       D++
Sbjct: 305 DKNTIVMYSSDQGFYLGEHGWFD-KRFMYEESYRTPLLARWPGVIKAKTRNEDLV 358


>gi|444304861|ref|ZP_21140650.1| arylsulfatase A family protein [Arthrobacter sp. SJCon]
 gi|443482831|gb|ELT45737.1| arylsulfatase A family protein [Arthrobacter sp. SJCon]
          Length = 874

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 62/255 (24%), Positives = 107/255 (41%), Gaps = 40/255 (15%)

Query: 6   ELTSRNLLYFDI--SHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYS-INMN 58
           E+  R   + D+  ++GY    FGK+         P G+  W        Y +   I+ +
Sbjct: 448 EMDYRVPTFIDVLRNNGYRTALFGKWHLGEQEVSEPRGFDAWKVFPGQGDYMDPEMIDPS 507

Query: 59  GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP--------- 109
           GR I+ G      Y  D++ + S+ ++ +  +    +P  L++   APH P         
Sbjct: 508 GRTIEKG------YATDIVTDLSLDWIDERGED---EPFCLLVHHKAPHRPWVPDEKHKH 558

Query: 110 ---EDSAPQYANMFFNVTS--------HHTPAYDYAPNPDKQWI---LQVTRKMQPVHRQ 155
              + + P+    F +  S        H T A D   +  K  +   L+     +   R 
Sbjct: 559 LYADGTIPEPETFFDDYESRSKAVRGVHMTIADDMGADDIKMEVPEHLRGEENREERMRW 618

Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
              + M   LQ +QS+DD V +++  L+  G   NT ++YTSD G+ LG  G    K   
Sbjct: 619 KYQIYMRDYLQCIQSIDDNVGRMLDHLEAKGLAENTLVVYTSDQGFFLGDHGWFD-KRLM 677

Query: 216 FEFDIRVPFLMRGPG 230
           F+  +++P L+R P 
Sbjct: 678 FDQSLQMPMLIRWPA 692


>gi|399154474|ref|ZP_10754541.1| sulfatase family protein 7 [gamma proteobacterium SCGC AAA007-O20]
          Length = 522

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 53/188 (28%), Positives = 79/188 (42%), Gaps = 43/188 (22%)

Query: 66  EDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT- 124
           E++Y   Y   +   +  FL +   +    P +LV+SFP PH P     +Y +++     
Sbjct: 219 EEHYPTSY---VGEQACEFLEEQDDT----PFILVVSFPDPHHPFTPPGRYFDLYDPADI 271

Query: 125 ------SHHTPAYDYAPN------------PDKQWILQVTRKMQPVHRQFTDLLMTKRL- 165
                 SH T A    PN            PD+ W         P H   TD   T+R+ 
Sbjct: 272 PLPKSFSHRTSARSDLPNHIQRMYEIGAEKPDEFW---------PFH---TDDKATRRMI 319

Query: 166 ----QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
                T+  +D+ V  +++ LK+ G+  NT IIY SDHG ++G  G V          IR
Sbjct: 320 ALNYGTITMIDEQVGVVMQTLKNTGKSENTNIIYMSDHGDYMGDHGTVLKGGVHSHGLIR 379

Query: 222 VPFLMRGP 229
           VP +   P
Sbjct: 380 VPLIWSDP 387


>gi|333383458|ref|ZP_08475118.1| hypothetical protein HMPREF9455_03284 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332827697|gb|EGK00436.1| hypothetical protein HMPREF9455_03284 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 532

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 68/291 (23%), Positives = 112/291 (38%), Gaps = 72/291 (24%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
           GY    + K++    P G+  +  L     YYN S      + K+  +    Y  +LI  
Sbjct: 123 GYQTGVVGKWHMQCEPKGFNYYCVLNDQGDYYNPSFKTKESQGKYVRE--EGYATNLITT 180

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT--------------- 124
            S+ F+    ++   KP  L++   APH       +Y N++ +V                
Sbjct: 181 HSIEFIENRDKN---KPFALLVHHKAPHRNWMPEAKYYNLYEDVEFPKPETFYDDYETRC 237

Query: 125 -----------SHHTPAYDYAPNP-------DKQW----ILQVTRKMQPVHRQF------ 156
                         T  YD   N        +K+W    + Q   +M P  R+       
Sbjct: 238 SAAKTQDMTIDKTMTFIYDLKLNELRNTTPYNKEWNNGGLQQAMNQMTPAQREAWENAYH 297

Query: 157 -------------TDLL-------MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
                         DLL       M   L+ ++S+DD V +++  L+  G ++NT I+YT
Sbjct: 298 PKNMEFIDSNLSGDDLLNWKYQRYMKDYLRCIKSIDDEVGRLIDYLEKEGLMDNTIIVYT 357

Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP-GIVPGTMYLLDVLIPQV 246
           SD G+++G+ G    K F +E   R P ++R P  I  GT    D L+  +
Sbjct: 358 SDQGFYMGEHGWFD-KRFMYEESFRTPLIIRYPKAIEAGTK--TDALVQNI 405


>gi|237723220|ref|ZP_04553701.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229447742|gb|EEO53533.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
          Length = 549

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 339 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 397

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 398 RTPLIMRLP 406


>gi|423214669|ref|ZP_17201197.1| hypothetical protein HMPREF1074_02729 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692575|gb|EIY85812.1| hypothetical protein HMPREF1074_02729 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 509

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 41/170 (24%), Positives = 82/170 (48%), Gaps = 7/170 (4%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
           N +  +L A+ +++F+  +  + +  P ++ ++F +PH P +  P Y   +        P
Sbjct: 203 NTFSSELYADAAIKFIETN--ASNDNPFLMYVAFTSPHDPRNVLPDYGRKY-GSKEITMP 259

Query: 130 AYDYAPNP-DKQWILQVTRKMQPVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKELKDLG 186
                 +P D   + +   K+ P  R    +L  +      +  VD  + +I+  L+  G
Sbjct: 260 KNFVTQHPFDNGDLNERDEKLLPTPRVPEQVLAERARYYSMVNEVDVQIGRILDMLEKSG 319

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           + +NT I++ +D+G  +G+ GL+ GK   +E  +RVP ++ GP I    M
Sbjct: 320 KADNTIIVFAADNGLCVGEHGLL-GKQNLYEAAVRVPLVICGPNIPQNVM 368


>gi|295135339|ref|YP_003586015.1| sulfatase [Zunongwangia profunda SM-A87]
 gi|294983354|gb|ADF53819.1| sulfatase [Zunongwangia profunda SM-A87]
          Length = 544

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 59/118 (50%), Gaps = 1/118 (0%)

Query: 116 YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAV 175
           + N F  +T+     ++ A  P+     +     + + R      +   + T+ SVD+ V
Sbjct: 271 WTNDFDRMTAEQRKQWNAAYRPENDAFWEQDLHGEALARYKGQRYLKDYMATIASVDEGV 330

Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
            KI+  L+  G   NT ++YTSD G++LG+ G    K F +E   R+P LM+ PG +P
Sbjct: 331 GKILDYLEAQGLDENTLVVYTSDQGFYLGENGWF-DKRFMYEPSFRMPLLMQLPGKIP 387


>gi|161503090|ref|YP_001570202.1| hypothetical protein SARI_01155 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:- str. RSK2980]
 gi|160864437|gb|ABX21060.1| hypothetical protein SARI_01155 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:-]
          Length = 530

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 44/169 (26%), Positives = 76/169 (44%), Gaps = 16/169 (9%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGPEDSAPQ-YANMFFNVTSH 126
           N + P+  A+ ++++LR     +  S +P  LV+S   PH P D  PQ Y + F + TS 
Sbjct: 233 NQWSPEHEADMAIKYLRNEGGKYRNSNEPFALVVSMNPPHSPYDQVPQKYLDRFKDQTSR 292

Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
                     P+  W         P +         + +  +  VD+   +I+ EL+ L 
Sbjct: 293 -----SMNKRPNVVWDKTYQEGYGPEY-------FKEYMAMINGVDEQFGRILTELERLK 340

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
              +T +++ SDHG  +G  G    K+  +E  +R+P + R PG +P  
Sbjct: 341 LDKDTLVVFFSDHGCCMGSNGQPT-KNVHYEEAMRIPMMFRWPGKLPAC 388


>gi|443704610|gb|ELU01589.1| hypothetical protein CAPTEDRAFT_204150 [Capitella teleta]
          Length = 446

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 42/153 (27%), Positives = 73/153 (47%), Gaps = 21/153 (13%)

Query: 93  SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN--PDKQWILQVTRKM- 149
           S +P  + M    PH P D   ++  ++ +      PA    P   P+  W  +  R+  
Sbjct: 140 STQPFFVAMGIKKPHLPFDFPEEFLELYDDA---EMPANPLLPKNVPNIAWNHKHIREQG 196

Query: 150 -------QPVHRQFTDLLMTKRLQ-----TLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
                   P+H    D++  K L+     T+  VD  + +++++L  L   NNT +++ S
Sbjct: 197 SHFNATSDPLHG--LDIVAMKALRKAYYATMSYVDSIIGQVLQKLDSLKLTNNTIVVFLS 254

Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           DH  HLG+ G++ GK   F  + +VPF++R PG
Sbjct: 255 DHAIHLGEHGII-GKKTNFIVNAQVPFMIRVPG 286


>gi|298386258|ref|ZP_06995814.1| N-acetylglucosamine-6-sulfatase [Bacteroides sp. 1_1_14]
 gi|383125360|ref|ZP_09946002.1| hypothetical protein BSIG_5089 [Bacteroides sp. 1_1_6]
 gi|251837805|gb|EES65895.1| hypothetical protein BSIG_5089 [Bacteroides sp. 1_1_6]
 gi|298260635|gb|EFI03503.1| N-acetylglucosamine-6-sulfatase [Bacteroides sp. 1_1_14]
          Length = 527

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 376 RTPLIMRLP 384


>gi|255692838|ref|ZP_05416513.1| mucin-desulfating sulfatase [Bacteroides finegoldii DSM 17565]
 gi|260621400|gb|EEX44271.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
          Length = 527

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 376 RTPLIMRLP 384


>gi|29347038|ref|NP_810541.1| sulfatase yidJ [Bacteroides thetaiotaomicron VPI-5482]
 gi|29338936|gb|AAO76735.1| putative sulfatase yidJ [Bacteroides thetaiotaomicron VPI-5482]
          Length = 579

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 369 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 427

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 428 RTPLIMRLP 436


>gi|374620758|ref|ZP_09693292.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374303985|gb|EHQ58169.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 552

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 57/231 (24%), Positives = 94/231 (40%), Gaps = 24/231 (10%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY--------NYSINMNGRKIKHGEDYYND 71
           G+FGK+    +      G+  W +      YY           IN++G+++         
Sbjct: 121 GFFGKWHMGESNDSPRDGFDRWVSFAGQGSYYPIKRPDGSTNVINVDGQRVPQ-----QG 175

Query: 72  YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT---SHHT 128
           Y  D I   ++ +L + + +   KP  L +S  A H     AP++   + +VT       
Sbjct: 176 YITDEITEYALTWLEEQRDA--TKPFFLYLSHKAVHSDAVPAPRHRGQYDDVTFELPSSV 233

Query: 129 PAYDYAPNPDKQWILQVTRKMQPVHRQF-TDLLMTKRLQ----TLQSVDDAVEKIVKELK 183
              D        W+         +   + +D  MT  L+     L SVDD++  +   L 
Sbjct: 234 EITDAYLEGKPMWVKNQRNSWHGIDFAYASDRKMTDYLKDYFGALSSVDDSIGTLRAYLS 293

Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
           +     NT II+ SD+G+  G  GL+  K   +E  IRVP ++  PG VP 
Sbjct: 294 EADIAENTVIIFYSDNGFLTGDHGLID-KRNAYEGSIRVPMIVAAPGRVPA 343


>gi|299146834|ref|ZP_07039902.1| putative N-acetylglucosamine-6-sulfatase [Bacteroides sp. 3_1_23]
 gi|383113493|ref|ZP_09934265.1| hypothetical protein BSGG_3196 [Bacteroides sp. D2]
 gi|298517325|gb|EFI41206.1| putative N-acetylglucosamine-6-sulfatase [Bacteroides sp. 3_1_23]
 gi|313695661|gb|EFS32496.1| hypothetical protein BSGG_3196 [Bacteroides sp. D2]
          Length = 527

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 376 RTPLIMRLP 384


>gi|380696306|ref|ZP_09861165.1| sulfatase yidJ [Bacteroides faecis MAJ27]
          Length = 527

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 376 RTPLIMRLP 384


>gi|334139744|ref|YP_004532942.1| putative N-acetylglucosamine-6-sulfatase [Novosphingobium sp. PP1Y]
 gi|333937766|emb|CCA91124.1| putative N-acetylglucosamine-6-sulfatase [Novosphingobium sp. PP1Y]
          Length = 569

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 54/234 (23%), Positives = 108/234 (46%), Gaps = 29/234 (12%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY---------NYSINMNGRKIKHGEDYYN 70
            +FGK+    +     PG+ +W + +    Y+         + ++N++G+++        
Sbjct: 137 AFFGKWHMGASTDAPRPGFDKWVSFMGQGNYWPAKQAGPHSSPNLNVDGKEVPQ-----K 191

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
            Y  D + + ++ +L  +K+   KKP  L +S    H      P+YA+ + +  +  T  
Sbjct: 192 GYITDELTDYAMNWL--TKERDPKKPFFLYLSHKGVHSDPLPPPRYAHQYDD--AKFTLP 247

Query: 131 YDYAPNPDKQ-----WILQVTRKMQPVHRQF-TDLLMTKRLQ----TLQSVDDAVEKIVK 180
              A  P+       W+         +   +  D+ MT+ L+    TL ++DD++ +I+ 
Sbjct: 248 ASAADTPENNAGKPVWVQNQRNTWHGIDFFYNADVPMTEYLKYYYGTLSAIDDSLGRIMT 307

Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            L+      +T +++TSD+G+  G+ GL+  K   +E  IRVP ++  PG VP 
Sbjct: 308 YLRKNHLEKDTLVVFTSDNGFMTGEHGLID-KRNAYEASIRVPLVVWEPGTVPA 360


>gi|331083552|ref|ZP_08332663.1| hypothetical protein HMPREF0992_01587 [Lachnospiraceae bacterium
           6_1_63FAA]
 gi|330403763|gb|EGG83315.1| hypothetical protein HMPREF0992_01587 [Lachnospiraceae bacterium
           6_1_63FAA]
          Length = 489

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 82/173 (47%), Gaps = 26/173 (15%)

Query: 68  YYNDYYP-DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSH 126
           Y   Y+P + + + S+ FLR+      +KP  L+ S+  PH P D AP Y    +N    
Sbjct: 174 YEEKYHPTNWVTDRSIDFLRRKD---PQKPFFLMASYLRPHPPFD-APSYYFDLYNKKEL 229

Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQF------TDLLMTKRLQ-----TLQSVDDAV 175
             PA          W  + T ++Q + R F      +D  + +  Q      +  +D  +
Sbjct: 230 TPPAV-------GDW--ETTEELQAMGRVFDSKCGPSDAELIREAQIGYYACITHLDHQI 280

Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
            ++++ L + G  +NT I++TSDHG  L    + + KS P++  IR+P ++ G
Sbjct: 281 GRLIQALVEYGVYDNTLILFTSDHGEELCDHHMFR-KSRPYQGSIRIPMIVSG 332


>gi|325109857|ref|YP_004270925.1| Iduronate-2-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324970125|gb|ADY60903.1| Iduronate-2-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 487

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 82/164 (50%), Gaps = 16/164 (9%)

Query: 76  LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPAY 131
           + A ++V+ L +  +S  ++P  + +    PH P  +  +Y +++      V        
Sbjct: 194 IAAAEAVKQLNEYAKS--EEPFFMAVGLYRPHTPYVAPKKYFDLYPLDTIRVPQIRAGYL 251

Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGE 187
           D  P P +     VTRK + V+    + L  + +Q    ++   D  + +I+  L D G 
Sbjct: 252 DSLPKPAQ---TSVTRKKEQVN--LEESLAKQAIQAYYASITFADAQLGQILDALDDSGL 306

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
            +NT +++TSDHGYH+G+ G  + K+  FE  +RVP ++  PG+
Sbjct: 307 ADNTIVLFTSDHGYHMGEHGHYQ-KTTLFENALRVPLIISSPGM 349


>gi|423298989|ref|ZP_17277014.1| hypothetical protein HMPREF1057_00155 [Bacteroides finegoldii
           CL09T03C10]
 gi|408474338|gb|EKJ92857.1| hypothetical protein HMPREF1057_00155 [Bacteroides finegoldii
           CL09T03C10]
          Length = 527

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 376 RTPLIMRLP 384


>gi|404366924|ref|ZP_10972300.1| hypothetical protein FUAG_02208 [Fusobacterium ulcerans ATCC 49185]
 gi|313689858|gb|EFS26693.1| hypothetical protein FUAG_02208 [Fusobacterium ulcerans ATCC 49185]
          Length = 524

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 45/63 (71%)

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           +D  + KI+ +L++LG   +T I++T+DHG+ +GQ GL++   F +E  I++PF+++ P 
Sbjct: 330 MDKYIGKILDKLEELGLAEDTIIVFTTDHGHFVGQHGLIRKGPFHYEDLIKIPFIVKYPK 389

Query: 231 IVP 233
           +VP
Sbjct: 390 VVP 392


>gi|373496909|ref|ZP_09587452.1| hypothetical protein HMPREF0402_01325 [Fusobacterium sp. 12_1B]
 gi|371964568|gb|EHO82081.1| hypothetical protein HMPREF0402_01325 [Fusobacterium sp. 12_1B]
          Length = 524

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 45/63 (71%)

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           +D  + KI+ +L++LG   +T I++T+DHG+ +GQ GL++   F +E  I++PF+++ P 
Sbjct: 330 MDKYIGKILDKLEELGLAEDTIIVFTTDHGHFVGQHGLIRKGPFHYEDLIKIPFIVKYPE 389

Query: 231 IVP 233
           +VP
Sbjct: 390 VVP 392


>gi|293372797|ref|ZP_06619178.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
 gi|292632306|gb|EFF50903.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
          Length = 527

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 376 RTPLIMRLP 384


>gi|160886350|ref|ZP_02067353.1| hypothetical protein BACOVA_04360 [Bacteroides ovatus ATCC 8483]
 gi|423289518|ref|ZP_17268368.1| hypothetical protein HMPREF1069_03411 [Bacteroides ovatus
           CL02T12C04]
 gi|156108235|gb|EDO09980.1| arylsulfatase [Bacteroides ovatus ATCC 8483]
 gi|392667229|gb|EIY60739.1| hypothetical protein HMPREF1069_03411 [Bacteroides ovatus
           CL02T12C04]
          Length = 527

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 376 RTPLIMRLP 384


>gi|336414624|ref|ZP_08594970.1| hypothetical protein HMPREF1017_02078 [Bacteroides ovatus
           3_8_47FAA]
 gi|335933736|gb|EGM95738.1| hypothetical protein HMPREF1017_02078 [Bacteroides ovatus
           3_8_47FAA]
          Length = 527

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 376 RTPLIMRLP 384


>gi|262409643|ref|ZP_06086183.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294647124|ref|ZP_06724728.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294807889|ref|ZP_06766670.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
 gi|336407138|ref|ZP_08587772.1| hypothetical protein HMPREF0127_05085 [Bacteroides sp. 1_1_30]
 gi|345509515|ref|ZP_08789111.1| hypothetical protein BSAG_04540 [Bacteroides sp. D1]
 gi|229447035|gb|EEO52826.1| hypothetical protein BSAG_04540 [Bacteroides sp. D1]
 gi|262352496|gb|EEZ01596.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292637541|gb|EFF55955.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294444887|gb|EFG13573.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
 gi|295087977|emb|CBK69500.1| Arylsulfatase A and related enzymes [Bacteroides xylanisolvens
           XB1A]
 gi|335948239|gb|EGN09956.1| hypothetical protein HMPREF0127_05085 [Bacteroides sp. 1_1_30]
          Length = 527

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 376 RTPLIMRLP 384


>gi|423216278|ref|ZP_17202803.1| hypothetical protein HMPREF1074_04335 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392691129|gb|EIY84380.1| hypothetical protein HMPREF1074_04335 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 527

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 376 RTPLIMRLP 384


>gi|298482971|ref|ZP_07001153.1| N-acetylglucosamine-6-sulfatase [Bacteroides sp. D22]
 gi|298270943|gb|EFI12522.1| N-acetylglucosamine-6-sulfatase [Bacteroides sp. D22]
          Length = 527

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 376 RTPLIMRLP 384


>gi|423304333|ref|ZP_17282332.1| hypothetical protein HMPREF1072_01272 [Bacteroides uniformis
           CL03T00C23]
 gi|423310553|ref|ZP_17288537.1| hypothetical protein HMPREF1073_03287 [Bacteroides uniformis
           CL03T12C37]
 gi|392681724|gb|EIY75081.1| hypothetical protein HMPREF1073_03287 [Bacteroides uniformis
           CL03T12C37]
 gi|392684919|gb|EIY78239.1| hypothetical protein HMPREF1072_01272 [Bacteroides uniformis
           CL03T00C23]
          Length = 526

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 40/161 (24%), Positives = 75/161 (46%), Gaps = 7/161 (4%)

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHHT 128
           D  P  I ++++ ++   K +    P  + +SFP PH P      Y +MF    + +  T
Sbjct: 177 DQQPTRIVDEAIEWIDSQKDN----PFFVWISFPEPHNPYQVCEPYYSMFAPDKIPAVKT 232

Query: 129 PAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
              D     +K  IL ++     P   +    L    +  ++ +DD ++++V++LK+ G 
Sbjct: 233 SRKDVLKKGEKYQILAELEDASCPNLERDLPRLRGNYMGMIRLIDDQIKRLVEDLKEKGL 292

Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
              T I+  SDHG + G++GL++      E   R+P +  G
Sbjct: 293 FEKTIIVVLSDHGDYCGEYGLIRKGVGLSESLTRIPMVWAG 333


>gi|197337865|ref|YP_002158674.1| N-acetylglucosamine-6-sulfatase [Vibrio fischeri MJ11]
 gi|197315117|gb|ACH64566.1| N-acetylglucosamine-6-sulfatase [Vibrio fischeri MJ11]
          Length = 518

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 80/175 (45%), Gaps = 16/175 (9%)

Query: 68  YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS 125
           + N + P+  A+ ++++LR    ++  + KP  LV+S   PH P D  PQ     F  +S
Sbjct: 220 HINQWSPEHEADIAIKYLRNDGGNYRDNDKPFTLVVSMNPPHSPYDQVPQKYLDRFKDSS 279

Query: 126 HHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
               +      P+ QW  +      P +         + +  +  VD+   +I+ EL  L
Sbjct: 280 RTLNS-----RPNVQWDKEYLEGYGPEY-------FKEYMAMVNGVDEQFGRIIDELDTL 327

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVPGTMYLL 239
               +T +++ SDHG  +G  G    K+  +E  +R+P + R PG I P +  LL
Sbjct: 328 NLTEDTLVVFFSDHGCCMGSNG-NPTKNVHYEEAMRIPMMFRWPGKIAPKSDDLL 381


>gi|440638195|gb|ELR08114.1| hypothetical protein GMDG_02941 [Geomyces destructans 20631-21]
          Length = 511

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 67/279 (24%), Positives = 108/279 (38%), Gaps = 66/279 (23%)

Query: 22  FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDS 81
            GK+       + P G+  W  +     Y++  +   G K +H + Y  D    +I + S
Sbjct: 99  IGKWHLGEGSDHEPTGFDFWSVVSGQGAYHDPEMIEMG-KTRHEKGYATD----IITDKS 153

Query: 82  VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF-NVTSHHTPAYDYA------ 134
           + +L++  Q+   KP  L+    APH   +  P++A+++  +V    T   DY       
Sbjct: 154 LEWLKKLDQA---KPFFLMCHHKAPHRSWECDPKHADLYTEDVKVPETFDDDYKNRARAA 210

Query: 135 ----------------------------------------PNP---------DKQWILQV 145
                                                   PNP         DKQ     
Sbjct: 211 AEAKIRVDTDLTYFDLGLAQPDGGSEVGELVSLTSTLRKIPNPEDVTKLRLIDKQSGEVF 270

Query: 146 TRKMQPVHRQFTDLLMTKR-LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLG 204
           T K     R F      KR L+T+ S+DD V +++K L D    N+T +IYTSD G+ LG
Sbjct: 271 TFKTNAELRHFKYQRYIKRYLRTIASIDDNVGRMLKYLDDENLTNDTVVIYTSDQGFFLG 330

Query: 205 QFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
             G    K F +E   ++PF++R P  +       D++ 
Sbjct: 331 DHGWFD-KRFMYEESFQMPFMIRYPREIAKGSVCDDIVC 368


>gi|358065392|ref|ZP_09151935.1| hypothetical protein HMPREF9473_03998 [Clostridium hathewayi
           WAL-18680]
 gi|356696383|gb|EHI57999.1| hypothetical protein HMPREF9473_03998 [Clostridium hathewayi
           WAL-18680]
          Length = 519

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 75/159 (47%), Gaps = 18/159 (11%)

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP---AYDYA 134
            N ++ FL     S   +P +L + + APH P +   +Y N ++N T    P    +D  
Sbjct: 178 TNRAIDFLDNRDAS---RPFLLFVGYYAPHSPHNPPAEYFNRYYNRTDLDEPWIAPWDVP 234

Query: 135 PNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELNNTY 192
           P  +   + +       +H Q  +L   +      +  +D    ++++       L+NTY
Sbjct: 235 PVNNGNVMSRY------IHLQGEELRAARAGYYGNIAFLDSQTSRLLERAM---RLDNTY 285

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           +I+TSDHG  LG   L + KS P+E  + +PF+M GP I
Sbjct: 286 VIFTSDHGEMLGDHYLYQ-KSRPYEGAVHIPFMMMGPDI 323


>gi|344201753|ref|YP_004786896.1| sulfatase [Muricauda ruestringensis DSM 13258]
 gi|343953675|gb|AEM69474.1| sulfatase [Muricauda ruestringensis DSM 13258]
          Length = 560

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 70/281 (24%), Positives = 109/281 (38%), Gaps = 68/281 (24%)

Query: 15  FDISHGYFGKYLNKYN-----------GSYIPPGWREWGALIMNSKYYNYS-INMNGRKI 62
           FD S   FG+ L +             G     G+     L     YYN + IN  G+  
Sbjct: 121 FDFSQQTFGELLQQAGYKTGVLGKLHLGDTPSKGFDYVDILPGQGSYYNPTFINEEGQYQ 180

Query: 63  KHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFN 122
             G      Y  ++I + +++++   K    ++P ML +   +PH P    P    M+ N
Sbjct: 181 LEG------YTTEIITDKAIKWMDSVKSG--EQPFMLFLGHKSPHRPWQPGPNELGMYEN 232

Query: 123 VT--SHHTPAYDYAPNPD----------------------------------KQWIL--- 143
           V      T   DY+ N +                                  KQW     
Sbjct: 233 VEIPEPETLFDDYSGNREVASLNYMSISEAMKLEQDLKITDQPQNGFTEEQQKQWDAIYG 292

Query: 144 QVTRKMQPVHRQFTDLL-------MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
            +  K +  + Q  DL        M   L ++  VD +V  ++  LKD G   NT +IYT
Sbjct: 293 PINEKFKKDNPQGDDLTRFKYQRYMRDYLASVAGVDKSVGHVLDYLKDAGLDENTIVIYT 352

Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGTM 236
           SD G++LG+ G    K + ++  +R P L++ PG V PGT+
Sbjct: 353 SDQGFYLGEHGWFD-KRWMYKESLRTPLLVKWPGTVKPGTV 392


>gi|374375586|ref|ZP_09633244.1| N-acetylglucosamine-6-sulfatase [Niabella soli DSM 19437]
 gi|373232426|gb|EHP52221.1| N-acetylglucosamine-6-sulfatase [Niabella soli DSM 19437]
          Length = 566

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 57/195 (29%), Positives = 89/195 (45%), Gaps = 26/195 (13%)

Query: 54  SINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA 113
            +N+NG+ +         Y  D + + ++ FL    ++   KP ML +S  A H     A
Sbjct: 219 GLNVNGKHVPQ-----KGYITDELTDYALDFLNHRDKN---KPFMLYLSHKAVHDNFVPA 270

Query: 114 PQYANMFFN-----VTSHHTPAYDYAPNPDKQWILQVTRKMQ-----PVHRQFTDLLMTK 163
            +   MF N       +  T   + AP     W LQ  R        P H         K
Sbjct: 271 ERDKGMFKNDHFNPPLTMQTGKLEGAP----MW-LQNQRNSWHGVDFPYHSNLDIEAYYK 325

Query: 164 R-LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
           R  +TL  VD+++ +I+  LK  G L +T IIY  D+G+  G+ GL+  ++  +E  +RV
Sbjct: 326 RYAETLYGVDESLGRILDYLKQKGLLESTLIIYMGDNGFQFGEHGLIDKRT-AYEASMRV 384

Query: 223 PFLMRGPGIV-PGTM 236
           P +   PGI+ PGT+
Sbjct: 385 PMVAYCPGIIRPGTV 399


>gi|251799577|ref|YP_003014308.1| sulfatase [Paenibacillus sp. JDR-2]
 gi|247547203|gb|ACT04222.1| sulfatase [Paenibacillus sp. JDR-2]
          Length = 511

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/64 (40%), Positives = 42/64 (65%)

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           +D  +  I+ +L++LG  +NT +++T+DHG+  GQ GL     F +E  IR+PF++R PG
Sbjct: 317 MDKYIGTILDKLEELGIADNTVVVFTTDHGHFFGQHGLQAKGGFHYEDLIRLPFIVRYPG 376

Query: 231 IVPG 234
            VP 
Sbjct: 377 QVPA 380


>gi|421613935|ref|ZP_16055004.1| sulfatase family protein [Rhodopirellula baltica SH28]
 gi|408495142|gb|EKJ99731.1| sulfatase family protein [Rhodopirellula baltica SH28]
          Length = 494

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 43/165 (26%), Positives = 78/165 (47%), Gaps = 11/165 (6%)

Query: 75  DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
           D + + +  F+R+     S++P    +S P PHGP      Y  MF ++     P   + 
Sbjct: 201 DWLCDRAADFIRE----HSQEPFCYHLSLPDPHGPNTVRQPYDTMFEDMPVR--PPMTFQ 254

Query: 135 PNPDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
            +  K  W+    R  +   ++F   LMT+    ++ +DD V  ++  L +L     T +
Sbjct: 255 LDGTKPGWLPGTNRNAE---QRFNARLMTQYFGMVRCIDDNVGMLLALLDELSLTERTVV 311

Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
           ++TSDHG    + G +  K  P+E   +VP ++  PG+VP  + +
Sbjct: 312 VFTSDHGDLCYEHGRLN-KGNPYEGSAKVPMVIAAPGLVPAGLRI 355


>gi|373451267|ref|ZP_09543194.1| hypothetical protein HMPREF0984_00236 [Eubacterium sp. 3_1_31]
 gi|371969034|gb|EHO86486.1| hypothetical protein HMPREF0984_00236 [Eubacterium sp. 3_1_31]
          Length = 483

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 65/252 (25%), Positives = 100/252 (39%), Gaps = 35/252 (13%)

Query: 25  YLNKYNGSYIPPGWREWGAL------IMNSKYYNYSINMNGRK----IKHGEDYYNDYYP 74
           YL  Y G+ IP  W            + NS   +  +N +G +    + H   Y    +P
Sbjct: 122 YLGCYRGNDIP-HWMHQEVCDDYLFDLQNSLDRHADVNASGAECNSWVTHPWIYEERLHP 180

Query: 75  -DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
            +  A++S+RFL+   +S   KP  L+ SF  PH P D  P Y  M+ +         D+
Sbjct: 181 TNWCADNSIRFLKTRDRS---KPFFLMSSFVRPHQPLDPPPAYYAMYKDKQLREPAQGDW 237

Query: 134 APNP-DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
                 KQ+                   M     ++  VD  + +I+  L++ G   +T 
Sbjct: 238 DNEAMTKQYGFTSDSIYGCNDAAMRHDAMAGYYASITHVDHQIGRILSALREEGMYEDTI 297

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM------------RGPGIV------PG 234
           I++ SDHG  L    L + K FP+E    +PFLM            R  GI       P 
Sbjct: 298 ILFVSDHGEMLFDHSLWR-KVFPYEGSTHIPFLMHIGKNIADLLPKRIQGICELRDIMPT 356

Query: 235 TMYLLDVLIPQV 246
            + L D+ IP+ 
Sbjct: 357 LLALCDLPIPET 368


>gi|397611619|gb|EJK61409.1| hypothetical protein THAOC_18109 [Thalassiosira oceanica]
          Length = 373

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 49/166 (29%), Positives = 75/166 (45%), Gaps = 28/166 (16%)

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA------NMFFNVTSHH-TPAYD 132
           D++RFLR+  +    KP  L +++ A H  +    QY       +M+ N T       YD
Sbjct: 175 DALRFLREKPKD---KPFFLNVAYYATHAVDTDVRQYMPQRASMSMYLNDTIFEPEDTYD 231

Query: 133 YAP------NPDK---QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
             P      N  +   +W     +K Q         +M    +    VD +V  IV +L+
Sbjct: 232 KMPYFFSQENEGRNRWRWRFDTHKKHQR--------MMKNYYRMASEVDTSVGIIVDQLQ 283

Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
             GELNNT II+T+D+G    + GL   K +P +  IRVP ++  P
Sbjct: 284 KSGELNNTMIIFTTDNGNFHAEHGLAD-KWYPHQESIRVPLIINDP 328


>gi|423688263|ref|ZP_17663066.1| N-acetylglucosamine-6-sulfatase [Vibrio fischeri SR5]
 gi|371492766|gb|EHN68372.1| N-acetylglucosamine-6-sulfatase [Vibrio fischeri SR5]
          Length = 518

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 80/175 (45%), Gaps = 16/175 (9%)

Query: 68  YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS 125
           + N + P+  A+ ++++LR    ++  + KP  LV+S   PH P D  PQ     F  +S
Sbjct: 220 HINQWSPEHEADIAIKYLRNDGGNYRDNDKPFTLVVSMNPPHSPYDQVPQKYLDRFKDSS 279

Query: 126 HHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
               +      P+ QW  +      P +         + +  +  VD+   +I+ EL  L
Sbjct: 280 RTLNS-----RPNVQWDKEYLEGYGPEY-------FKEYMAMVNGVDEQFGRIINELDAL 327

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVPGTMYLL 239
               +T +++ SDHG  +G  G    K+  +E  +R+P + R PG I P +  LL
Sbjct: 328 NLTEDTLVVFFSDHGCCMGSNG-NPTKNVHYEEAMRIPMMFRWPGKIAPKSDDLL 381


>gi|266621952|ref|ZP_06114887.1| arylsulfatase [Clostridium hathewayi DSM 13479]
 gi|288866341|gb|EFC98639.1| arylsulfatase [Clostridium hathewayi DSM 13479]
          Length = 520

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 51/179 (28%), Positives = 81/179 (45%), Gaps = 15/179 (8%)

Query: 68  YYNDYYP-DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSH 126
           Y   Y+P + ++  S+ FLR+   S   KP  L+ S+  PH P D APQY    +     
Sbjct: 205 YEEKYHPTNWVSTRSIDFLRRRDTS---KPFFLMASYLRPHPPFD-APQYYFDLYRDKQL 260

Query: 127 HTPAY----DYAPNPDKQWILQV-TRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
             PA     D     D Q + ++      PV  +            +  +D  + ++++ 
Sbjct: 261 TPPAVGDWEDEDFTGDYQRLGRIYDSATGPVDPELIRQAQIGYYACITHLDHQIGRLIQA 320

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG----IVPGTM 236
           L +   ++NT I++TSDHG  L    L + KS P+E   R+P L+ GP       PGT+
Sbjct: 321 LVEYKLMDNTIILFTSDHGEELCDHHLFR-KSRPYEGSCRIPMLLSGPERLIHAAPGTV 378


>gi|251797269|ref|YP_003012000.1| sulfatase [Paenibacillus sp. JDR-2]
 gi|247544895|gb|ACT01914.1| sulfatase [Paenibacillus sp. JDR-2]
          Length = 480

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 44/65 (67%), Gaps = 2/65 (3%)

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGL-VKGKSFPFEFDIRVPFLMRGP 229
           +DD V +++  L+  G+L+NT +++ SDHG  LG  G+ +KG  F ++  +RVP ++RGP
Sbjct: 279 IDDQVGRMLDSLERSGQLDNTIVVFMSDHGELLGDHGMYLKGPHF-YDCSVRVPLIVRGP 337

Query: 230 GIVPG 234
           GI  G
Sbjct: 338 GIHGG 342


>gi|153808549|ref|ZP_01961217.1| hypothetical protein BACCAC_02844 [Bacteroides caccae ATCC 43185]
 gi|423219400|ref|ZP_17205896.1| hypothetical protein HMPREF1061_02669 [Bacteroides caccae
           CL03T12C61]
 gi|149128871|gb|EDM20088.1| arylsulfatase [Bacteroides caccae ATCC 43185]
 gi|392625450|gb|EIY19516.1| hypothetical protein HMPREF1061_02669 [Bacteroides caccae
           CL03T12C61]
          Length = 527

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 376 RTPLVMRLP 384


>gi|410456452|ref|ZP_11310313.1| sulfatase [Bacillus bataviensis LMG 21833]
 gi|409928121|gb|EKN65244.1| sulfatase [Bacillus bataviensis LMG 21833]
          Length = 503

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 52/94 (55%), Gaps = 5/94 (5%)

Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
           +D  + KI+ +L DLG   NT +++T+DHG+  GQ GL     F +E  +++P+++R PG
Sbjct: 309 LDKYIGKILDKLDDLGLAKNTIVVFTTDHGHFFGQHGLHDKGGFHYEDLLKLPYIVRYPG 368

Query: 231 IVPG-----TMYLLDVLIPQVRKFSSGSLIFIMS 259
            VP      +M  L  L P    F+  S+   M+
Sbjct: 369 KVPAGQICHSMQSLVDLAPTFLSFAGISIPRTMT 402


>gi|389872526|ref|YP_006379945.1| type I phosphodiesterase/nucleotide pyrophosphatase family protein
           2 [Advenella kashmirensis WT001]
 gi|388537775|gb|AFK62963.1| type I phosphodiesterase/nucleotide pyrophosphatase family protein
           2 [Advenella kashmirensis WT001]
          Length = 538

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 49/166 (29%), Positives = 78/166 (46%), Gaps = 17/166 (10%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY---DY 133
           IA+ S+  L  +  + S++P  L +SFP PH P     QY +M+        PA    D+
Sbjct: 223 IADRSIDLL--NDYAGSEQPFFLQVSFPDPHHPFTPPGQYWDMY-KPEDMALPASFYSDH 279

Query: 134 APNPDKQWILQ------VTRKMQPV----HRQFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
            P    QW+ Q       T+K   V     R+  + L      ++ ++D  + +++ EL+
Sbjct: 280 EPPQHVQWLYQRRDSGAATKKGTAVFACDEREAKEALALN-YGSISNIDAQIGRVMAELE 338

Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
            LG  +NT +I+TSDHG  LG   L+      +    RVPF+   P
Sbjct: 339 RLGLQDNTIVIFTSDHGDFLGDHQLMLKGPIHYRGLTRVPFIWSDP 384


>gi|229822378|ref|YP_002883904.1| sulfatase [Beutenbergia cavernae DSM 12333]
 gi|229568291|gb|ACQ82142.1| sulfatase [Beutenbergia cavernae DSM 12333]
          Length = 489

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 42/158 (26%), Positives = 67/158 (42%), Gaps = 27/158 (17%)

Query: 93  SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPV 152
              P    +SFP PH P      Y ++F           D    P++    +        
Sbjct: 177 EDDPFFCWVSFPEPHNPYQVPEPYFSLF-----------DEDEIPERLAGPEALAGKGRT 225

Query: 153 HRQFTDLLMTKR--------------LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
           +R   DL+ TKR                 L+ +DD + ++V  L D     +T I + SD
Sbjct: 226 YRWLGDLIGTKRPGYDEAWRRYAANYCGMLRLIDDQLRRLVGHLGD--RARDTVIAFVSD 283

Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           HG  +G++GL +  +   EF +R+PF + GPG+ PG +
Sbjct: 284 HGDFVGEYGLQRKGAGMSEFLMRIPFQLSGPGVRPGGV 321


>gi|357384297|ref|YP_004899021.1| mucin-desulfating sulfatase MdsA [Pelagibacterium halotolerans B2]
 gi|351592934|gb|AEQ51271.1| mucin-desulfating sulfatase MdsA precursor [Pelagibacterium
           halotolerans B2]
          Length = 519

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 46/72 (63%), Gaps = 1/72 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M + L+T+QS+DD V +++  L + G   NT +IYTSD G+ LG+ G    K F +E  +
Sbjct: 292 MKRYLRTVQSIDDNVGRMLDYLDERGLSENTIVIYTSDQGFFLGEHGWFD-KRFMYEESL 350

Query: 221 RVPFLMRGPGIV 232
           ++PFL+R P  +
Sbjct: 351 QMPFLVRYPAAI 362


>gi|270296054|ref|ZP_06202254.1| mucin-desulfating sulfatase MdsA [Bacteroides sp. D20]
 gi|270273458|gb|EFA19320.1| mucin-desulfating sulfatase MdsA [Bacteroides sp. D20]
          Length = 529

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 64/277 (23%), Positives = 105/277 (37%), Gaps = 72/277 (25%)

Query: 20  GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSI---NMNGRKIKHGEDYYNDYYPDL 76
           GY      K++ +  P G+  +  L    +YYN      + NG  IK        Y   L
Sbjct: 123 GYVTSVFGKWHLNVEPKGFDHYDILFDQGEYYNPKFRRPDTNGEYIKE-----KGYATTL 177

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-------------------GPED------ 111
           I + ++ +L   K    +KP  ++++  APH                    PE       
Sbjct: 178 ITDHAIEWL--EKNISDEKPFCVLVNHKAPHRNWMPDFPNMHLFEDVDFPEPETLFDDYT 235

Query: 112 -SAPQYANMFFNVTSHHTPAYDYA-----PNPDKQWI----LQVTRKMQPVHRQFTDLLM 161
              PQ       +  H   A+D+        P  Q+I    L    +M    R   + + 
Sbjct: 236 TRGPQMKTQELTIDRHMGYAFDFKVRELKNEPTLQYIRDSWLMAMAEMDSKQRATWEAIY 295

Query: 162 TKR--------------------------LQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
            K+                           +T++SVD+ V +++  L++ G L+NT I+Y
Sbjct: 296 DKKNEEFLKNPPVGKDLLRWKYQRYVKEYCRTIRSVDEQVGRLLDYLEEKGVLDNTVIVY 355

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           TSD G+ LG+ GL   K F +E   R P ++  P +V
Sbjct: 356 TSDQGFLLGEHGLY-DKRFMYEESFRTPLIISCPSLV 391


>gi|218780829|ref|YP_002432147.1| sulfatase [Desulfatibacillum alkenivorans AK-01]
 gi|218762213|gb|ACL04679.1| sulfatase [Desulfatibacillum alkenivorans AK-01]
          Length = 468

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 44/69 (63%), Gaps = 1/69 (1%)

Query: 165 LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPF 224
            + + SVD  V +++K L+D G  +NT ++Y  D+GY  G+   +  K + +E  IR+PF
Sbjct: 276 CRVVASVDRQVGRLLKFLEDKGLADNTIVVYAGDNGYFWGEHRKID-KRWAYEESIRIPF 334

Query: 225 LMRGPGIVP 233
           ++R PG+VP
Sbjct: 335 MIRAPGVVP 343


>gi|343084004|ref|YP_004773299.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342352538|gb|AEL25068.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 445

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 58/228 (25%), Positives = 100/228 (43%), Gaps = 45/228 (19%)

Query: 20  GYFGKYLNKYNGSYIPP--GWREWGALIMNSKYYNYSINM-------NGRKIKHGEDYYN 70
           G FGK+   Y+ ++ P   G+ E+   +  +  Y+  I+        +G KIK+ + Y  
Sbjct: 127 GMFGKWHLGYDKAFNPTLQGFDEFVGFVSGNVDYHGHIDQEGYLDWWDGVKIKNEKGYTT 186

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
           D    LI+   V+F+++      + P  L +   APH P                     
Sbjct: 187 D----LISEYGVKFIQEHNPEVKRAPFFLYLPHEAPHSP--------------------- 221

Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQ--FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
             Y    DK     V R++     Q    D + +   + ++ +D+ V +I++ LK+ G+ 
Sbjct: 222 --YQRRIDK-----VLREIGTAGTQEVIQDSISSIYKEMVEVMDEGVGRIMQSLKETGQY 274

Query: 189 NNTYIIYTSDHGY-HLGQFGLVKG-KSFPFEFDIRVPFLMRGPGIVPG 234
            NT +I+ SD+G  H G  G ++G K+ P+E   RVP +   P  V G
Sbjct: 275 ENTIVIFISDNGANHYGDNGGLRGFKAGPYEGGSRVPAIFSFPKEVKG 322


>gi|319641749|ref|ZP_07996431.1| sulfatase [Bacteroides sp. 3_1_40A]
 gi|317386636|gb|EFV67533.1| sulfatase [Bacteroides sp. 3_1_40A]
          Length = 515

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 311 MRDYMKTVKSLDDNVGRVLDYLKENGLLDNTLVVYTSDQGFYMGEHGWFD-KRFMYEESM 369

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 370 RTPLIMRLP 378


>gi|345519869|ref|ZP_08799280.1| hypothetical protein BSFG_02713 [Bacteroides sp. 4_3_47FAA]
 gi|254836257|gb|EET16566.1| hypothetical protein BSFG_02713 [Bacteroides sp. 4_3_47FAA]
          Length = 515

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 311 MRDYMKTVKSLDDNVGRVLDYLKENGLLDNTLVVYTSDQGFYMGEHGWFD-KRFMYEESM 369

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 370 RTPLIMRLP 378


>gi|223938010|ref|ZP_03629909.1| sulfatase [bacterium Ellin514]
 gi|223893411|gb|EEF59873.1| sulfatase [bacterium Ellin514]
          Length = 500

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 45/201 (22%), Positives = 88/201 (43%), Gaps = 36/201 (17%)

Query: 52  NYSINMNGRKIKHGEDYYN-------------DYYPDLIANDSVRFLRQSKQSFSKKPIM 98
            + IN+ G  + H   Y++             +Y  D + +++V+F+  +K +    P +
Sbjct: 166 GFDINIGGCGMGHPSSYFSPYKNPTLKDGPVGEYLADRLTDEAVKFIENTKGT----PFL 221

Query: 99  LVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTD 158
           L +S  + H P  +          +  +        P    +++ +     + V  Q   
Sbjct: 222 LYLSHYSVHTPLQAKKGL------IEKYQKKVMQLPPTKGPEFVTEGNTNARQVQNQPIY 275

Query: 159 LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG-------LVKG 211
             M      +QS+D++V +++ +LK+LG   NT II+TSD+G      G       L  G
Sbjct: 276 AAM------MQSLDESVGRVLDKLKELGLDKNTVIIFTSDNGGLSTAEGAPTSNMPLRAG 329

Query: 212 KSFPFEFDIRVPFLMRGPGIV 232
           K +P+E  +R P +++ PG+ 
Sbjct: 330 KGWPYEGGVREPLVVKWPGVT 350


>gi|372210442|ref|ZP_09498244.1| mucin-desulfating sulfatase [Flavobacteriaceae bacterium S85]
          Length = 506

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 54/241 (22%), Positives = 104/241 (43%), Gaps = 53/241 (21%)

Query: 37  GWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKP 96
           G+  W  L+   +Y N    ++ + IK+ + Y  D    ++ N ++ +L + +    +KP
Sbjct: 136 GFDYWEVLLGQGEYKNPDF-LSSKGIKNEKGYVTD----IVTNKALGWLEKGRNK--EKP 188

Query: 97  IMLVMSFPAPHGP------------EDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQ 144
            ML++   APH P            + + P+  N+F +  +  T A +   N +  +   
Sbjct: 189 FMLMVHQKAPHRPWVPKREHMQLYKDVNIPEPDNLFDDYATRTTAAREQDLNMNTSFRDD 248

Query: 145 VTRKMQP------------VHRQFTD---------------------LLMTKRLQTLQSV 171
              K++P            +  +F D                     L M   L+ + SV
Sbjct: 249 SDLKIKPLTFRPERFIKVGIDDRFADRIEKYKHLKLTGKELTKLKYQLYMKDYLRCIWSV 308

Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           D+ + +++  LK  G   NT ++Y+SD G+++G+ G    K F +E   R P +++ PG+
Sbjct: 309 DENIGRLMDYLKATGLDKNTIVMYSSDQGFYMGEHGWF-DKRFMYEESFRTPLIVKWPGV 367

Query: 232 V 232
           V
Sbjct: 368 V 368


>gi|423293953|ref|ZP_17272080.1| hypothetical protein HMPREF1070_00745 [Bacteroides ovatus
           CL03T12C18]
 gi|392677174|gb|EIY70593.1| hypothetical protein HMPREF1070_00745 [Bacteroides ovatus
           CL03T12C18]
          Length = 509

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 40/165 (24%), Positives = 82/165 (49%), Gaps = 7/165 (4%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
           N +  +L A+ +++F+  +  + +  P ++ ++F +PH P +  P Y   + +      P
Sbjct: 203 NTFSSELYADAAIKFIETN--ASNDNPFLMYVAFTSPHDPRNVLPDYGRKY-DSKEITMP 259

Query: 130 AYDYAPNP-DKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLG 186
                 +P D   + +   K+ P  R    +L  +      +  VD  + +I+  L+  G
Sbjct: 260 KNFITQHPFDNGDLNERDEKLLPTPRVPEQVLAERANYYSMVNEVDVQIGRILDMLEKSG 319

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           + +NT I++ +D+G  +G+ GL+ GK   +E  +RVP ++ GP I
Sbjct: 320 KDDNTIIVFAADNGLCVGEHGLL-GKQNLYEAAVRVPLVICGPNI 363


>gi|149915315|ref|ZP_01903842.1| sulfatase [Roseobacter sp. AzwK-3b]
 gi|149810604|gb|EDM70445.1| sulfatase [Roseobacter sp. AzwK-3b]
          Length = 505

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 45/173 (26%), Positives = 82/173 (47%), Gaps = 13/173 (7%)

Query: 73  YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP--- 129
           Y D +A ++ R +    +    +P  L +SF  PH P  +  +Y +++   + H  P   
Sbjct: 163 YDDEVAFNASRKVYDLGRGHDARPWCLTVSFTHPHDPYVARRKYWDLY-EKSDHLLPTVP 221

Query: 130 --AY-DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKD 184
             AY D+ P+  + +     R          D+  ++R     +  +DD + +++  L  
Sbjct: 222 AMAYEDHDPHSKRIFDANDWRSFDITE---ADIRRSRRAYFANISYLDDKIGEVLAALDG 278

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY 237
             + +NT I++ SDHG  LG+ GL    SF +E   RVP ++  PG+ PG ++
Sbjct: 279 TRQTDNTIILFVSDHGDMLGERGLWFKMSF-YEGASRVPLMIAAPGMAPGLVH 330


>gi|256422335|ref|YP_003122988.1| sulfatase [Chitinophaga pinensis DSM 2588]
 gi|256037243|gb|ACU60787.1| sulfatase [Chitinophaga pinensis DSM 2588]
          Length = 520

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 46/76 (60%), Gaps = 2/76 (2%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   L T +S+D  + +I++ L   G  +NT +IY SD G++LG+ G    K F +E  +
Sbjct: 311 MRDYLATARSLDRNIGRILQYLDSTGLADNTVVIYCSDQGFYLGEHGWFD-KRFIYEQSL 369

Query: 221 RVPFLMRGPGIV-PGT 235
             PF+MR PG++ PGT
Sbjct: 370 HTPFVMRYPGVIKPGT 385


>gi|149196007|ref|ZP_01873063.1| iduronate-2-sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149140854|gb|EDM29251.1| iduronate-2-sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 574

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 42/158 (26%), Positives = 78/158 (49%), Gaps = 20/158 (12%)

Query: 90  QSFSKKPIMLVMSFPAPHGPEDSAPQYANMF-------------FNVTSHHTPAYDYAPN 136
           +S S +   L + F  PH P   A +Y +MF                 +H+T  +D    
Sbjct: 226 RSGSDQAFFLSVGFVRPHTPLHVAQKYFDMFPIDQIQLPEILENDADDTHYTDLFD---- 281

Query: 137 PDKQWIL--QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
            DK+ ++   + +K  P  ++         L ++ +VD+ + +++K L +    +NT +I
Sbjct: 282 ADKKGLMYFDLLKKSYPNWQEGIKAFTQAYLASIAAVDENIGRVIKTLDESRFKDNTIVI 341

Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           +TSDHG+++G+   +  K+ P+E   RVPF++R P I 
Sbjct: 342 FTSDHGWNMGEKDYL-FKNSPWEESGRVPFIVRAPQIA 378


>gi|255690669|ref|ZP_05414344.1| choline sulfatase [Bacteroides finegoldii DSM 17565]
 gi|260623693|gb|EEX46564.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
          Length = 344

 Score = 57.0 bits (136), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 40/165 (24%), Positives = 81/165 (49%), Gaps = 7/165 (4%)

Query: 70  NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
           N +  +L A+ +++F+  +  + +  P ++ ++F +PH P +  P Y   +        P
Sbjct: 38  NTFSSELYADAAIKFIETN--ASNDNPFLMYVAFTSPHDPRNVLPDYGRKY-GSKEITMP 94

Query: 130 AYDYAPNP-DKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLG 186
                 +P D   + +   K+ P  R    +L  +      +  VD  + +I+  L+  G
Sbjct: 95  KNFVTQHPFDNGDLNERDEKLLPTPRVPEQVLAERANYYSMVNEVDVQIGRILDMLEKSG 154

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           + +NT I++ +D+G  +G+ GL+ GK   +E  +RVP ++ GP I
Sbjct: 155 KADNTIIVFAADNGLCVGEHGLL-GKQNLYEAAVRVPLVICGPNI 198


>gi|399994498|ref|YP_006574738.1| choline-sulfatase [Phaeobacter gallaeciensis DSM 17395 = CIP
           105210]
 gi|398659053|gb|AFO93019.1| putative choline-sulfatase [Phaeobacter gallaeciensis DSM 17395 =
           CIP 105210]
          Length = 468

 Score = 57.0 bits (136), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 49/173 (28%), Positives = 75/173 (43%), Gaps = 38/173 (21%)

Query: 107 HGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQ---------VTRKMQPVH---- 153
           H PE + P +A M   VT H      Y     ++WI Q         V  +++P H    
Sbjct: 172 HAPERNEP-WAAMVSLVTPH------YPLTAPQEWIDQIDPEMIAPPVAGRLRPAHPELA 224

Query: 154 --RQFTD---------LLMTKR--LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
             R F D         +++ +R  L  +  +DD + +++  L+  G+   T I+Y SDHG
Sbjct: 225 RIRDFYDHDPWFDDEAVMVARRSYLALVAFMDDCMGQVLSALEQSGQTEETLIVYVSDHG 284

Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM----YLLDVLIPQVRKF 249
             LG  G   GKS   E  + VP ++ GPG+  G       LLD+    +  F
Sbjct: 285 EMLGDHGFW-GKSVMLESSVGVPMIVSGPGVPVGRCKTPTSLLDIATTALATF 336


>gi|333031085|ref|ZP_08459146.1| N-acetylglucosamine-6-sulfatase [Bacteroides coprosuis DSM 18011]
 gi|332741682|gb|EGJ72164.1| N-acetylglucosamine-6-sulfatase [Bacteroides coprosuis DSM 18011]
          Length = 520

 Score = 57.0 bits (136), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 29/74 (39%), Positives = 45/74 (60%), Gaps = 1/74 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   + T  +VD+ + K++  L++ GEL+NT IIYTSD G+ LG+ G    K F +E   
Sbjct: 316 MRDYMSTTLAVDEGIGKLLDYLEETGELDNTLIIYTSDQGFFLGEHGFFD-KRFMYEECQ 374

Query: 221 RVPFLMRGPGIVPG 234
           R+P +MR P ++  
Sbjct: 375 RMPLVMRYPKLIKA 388


>gi|325109948|ref|YP_004271016.1| N-acetylglucosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324970216|gb|ADY60994.1| N-acetylglucosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 534

 Score = 57.0 bits (136), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 61/254 (24%), Positives = 109/254 (42%), Gaps = 62/254 (24%)

Query: 35  PPGWREWGALIMNSKYYNYSI------NMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQS 88
           P G+     L+    YYN  +      + N  +++H       Y  D+I + ++ +L++ 
Sbjct: 130 PTGFDYSEVLVGQGTYYNPVMLKDPNGDGNRERVRH-----TGYTTDIITDLALDWLKEG 184

Query: 89  KQSFSKKPIMLVMSFPAPH-----GPE-------DSAPQYANMFFNVTSHHTPA------ 130
           + S   KP M++    APH     GP+       ++ P+  N+F +     T A      
Sbjct: 185 RDS--DKPFMMMFQHKAPHREWAPGPDHLTMYDGETIPEPDNLFDDYEGRGTAAKTQDMT 242

Query: 131 -----------YDYAP--NPDKQWILQVTRKMQPVHRQFTDLLMTKR------------- 164
                      +   P  N D++ +        P ++ F +  +T               
Sbjct: 243 IAKTMTARDLKFTVPPYMNEDQKKVW--NEAYDPKNKAFEEANLTGDDLIRWKYQRYIKD 300

Query: 165 -LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
            L+ + SVDD V ++++ L+D G   NT +IY+SD G++LG  G    K F +E   R P
Sbjct: 301 YLRCIASVDDNVGRMLQYLEDNGLAENTVVIYSSDQGFYLGDHGWFD-KRFMYEESYRQP 359

Query: 224 FLMRGPGIV-PGTM 236
            ++R P +V PG++
Sbjct: 360 LMVRWPNVVKPGSV 373


>gi|198274022|ref|ZP_03206554.1| hypothetical protein BACPLE_00159 [Bacteroides plebeius DSM 17135]
 gi|198273100|gb|EDY97369.1| arylsulfatase [Bacteroides plebeius DSM 17135]
          Length = 522

 Score = 57.0 bits (136), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 45/69 (65%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  LK+ G L NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 312 MRDYMKTVKSLDDNVGRVLDYLKEKGLLENTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 370

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 371 RTPLIMRLP 379


>gi|421613377|ref|ZP_16054463.1| iduronate-2-sulfatase [Rhodopirellula baltica SH28]
 gi|408495971|gb|EKK00544.1| iduronate-2-sulfatase [Rhodopirellula baltica SH28]
          Length = 534

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 44/174 (25%), Positives = 80/174 (45%), Gaps = 19/174 (10%)

Query: 69  YNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHT 128
           Y D Y   +A  +++ + Q  ++    P  L M +  PH    +  +Y +++    ++  
Sbjct: 197 YVDGYNTAMAIATLKEMTQDNET----PFFLAMGYKLPHLNWCAPSKYWDLY---DANDI 249

Query: 129 PAYDYAPNPDKQWILQV-------TR----KMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
           P  D    P+    + +       TR    K+ P+  + +  L    L ++  VD  + K
Sbjct: 250 PMADETDAPENGAAMGLHASFELRTRAGIPKIGPLSPELSRKLKHAYLASVSYVDAQIGK 309

Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
           ++  L+D G  +NT I+   DHG+HLG  G V GK+  +E   RVP ++  P +
Sbjct: 310 LIAALEDAGVRDNTVIVVWGDHGWHLGDMG-VWGKATNYEIATRVPLMIWAPDM 362


>gi|196233398|ref|ZP_03132242.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196222538|gb|EDY17064.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 478

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 73/266 (27%), Positives = 118/266 (44%), Gaps = 40/266 (15%)

Query: 1   MAWPVELTSRNLLYFDISHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSI----- 55
           M   V L SR +L   +S  +  K + +  G   P    +  +     K   Y+      
Sbjct: 73  MEGAVCLPSRTMLLTGMSMFHAMKAVPRGKGPIAPDDTPDTYSFPRAMKAAGYATLHTGK 132

Query: 56  NMNGRKIKHGE--DYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA 113
           N N  K+   E  + Y+       A+ ++ F+   K++   KP+ L +   APH P D  
Sbjct: 133 NDNSPKLVTAEFDETYDPGESMACADKAIEFI---KRTAGTKPMCLYI---APHEPHD-- 184

Query: 114 PQYANMFFNVTSHHTPA-----YDYAP-----NPDKQWILQVTRKMQ---PVHRQFTDLL 160
           PQYA   F   +H+ P        +AP     N D    ++V  +M    P  ++     
Sbjct: 185 PQYATPDF--YAHYNPEDVPLPKAFAPFHTFDNGD----IKVRDEMTLPWPRTKENLGKK 238

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEF-D 219
           + +   + +  D A+ +IV+ L+D G+  NT  I   D+G  LG+ GL+ GK   +EF  
Sbjct: 239 LARYYASTEYWDAAMGRIVQALRDAGQFENTIFIIAGDNGLSLGEHGLL-GKQNLYEFGG 297

Query: 220 IRVPFLMRGPGIVPG----TMYLLDV 241
           + VP ++ G GI  G    T+YL+DV
Sbjct: 298 MHVPLIVAGKGIPKGETRATVYLMDV 323


>gi|260427447|ref|ZP_05781426.1| choline-sulfatase [Citreicella sp. SE45]
 gi|260421939|gb|EEX15190.1| choline-sulfatase [Citreicella sp. SE45]
          Length = 503

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 77/169 (45%), Gaps = 11/169 (6%)

Query: 73  YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP--- 129
           Y D +A  + R +    +    +P  L +SF  PH P  +  +Y +++ +   H  P   
Sbjct: 162 YDDEVAYHATRKVYDLGRRKDDRPWCLTVSFTHPHDPYVARRKYWDLYEDC-EHLLPEVG 220

Query: 130 --AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
              YD   +P  Q I       +       D+  ++R     +  +D  + +I++ L+  
Sbjct: 221 DLGYDNQ-DPHSQRIFDAN-DWRSFDITEEDIRRSRRAYFANISYLDGKIGEILEALEGT 278

Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            +  NT I++ SDHG  LG+ GL    +F FE   RVP +M  PG+ PG
Sbjct: 279 RQAENTAILFVSDHGDMLGERGLWFKMNF-FEGSSRVPMMMAAPGVAPG 326


>gi|309775806|ref|ZP_07670801.1| arylsulfatase [Erysipelotrichaceae bacterium 3_1_53]
 gi|308916483|gb|EFP62228.1| arylsulfatase [Erysipelotrichaceae bacterium 3_1_53]
          Length = 485

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 48/175 (27%), Positives = 79/175 (45%), Gaps = 13/175 (7%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           +A+ S+RFL+   ++   +P  L+ SF  PH P D    Y +M+ +         D+ P 
Sbjct: 185 VADRSIRFLKTRDRT---RPFFLMSSFVRPHQPLDPPQAYFDMYKDKALRLPADGDWDPQ 241

Query: 137 PDKQWILQVTRKMQPVHRQFTDL-LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
                  +++  +     +   L  M     ++  VD  + +I+  LK+ G  ++T I++
Sbjct: 242 EGCGTAGRISDSIFGCCDESMRLDAMAGYYASITHVDHQLGRILTALKEDGVYDDTIILF 301

Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR--------GPGIVPGTMYLLDVL 242
           TSDHG  L    L + K FP+E    +PFLM          P  V G   L DV+
Sbjct: 302 TSDHGEMLFDHHLWR-KVFPYEGSTHIPFLMHIGKHIADVIPHTVNGITELRDVM 355


>gi|224005180|ref|XP_002296241.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|209586273|gb|ACI64958.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 619

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 48/170 (28%), Positives = 78/170 (45%), Gaps = 22/170 (12%)

Query: 80  DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP-- 137
           D++RFL    +    KP  L ++F A H  +    QY     +++ +   A+D  P P  
Sbjct: 321 DALRFLVTRPRD---KPFFLNVAFYATHAKDGDVRQYMPQNSSMSMY---AHDEIPVPAT 374

Query: 138 --DKQW------ILQVTRKMQPVHRQFTD-----LLMTKRLQTLQSVDDAVEKIVKELKD 184
             D+ W        +        H +F +      +M    +    VD A   I++EL+ 
Sbjct: 375 ATDEAWKKMPYFFTEKNEGRARWHSRFENHTLHQSMMKNYYRMATEVDTACGVILEELRR 434

Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
            GEL+NT II+T+D+G    + GL   K +P +  IRVP ++R P +  G
Sbjct: 435 QGELDNTVIIFTTDNGNFHAEHGLAD-KWYPHQESIRVPLIIRDPRMKGG 483


>gi|421609321|ref|ZP_16050519.1| iduronate-2-sulfatase [Rhodopirellula baltica SH28]
 gi|408499985|gb|EKK04446.1| iduronate-2-sulfatase [Rhodopirellula baltica SH28]
          Length = 503

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 63/250 (25%), Positives = 105/250 (42%), Gaps = 33/250 (13%)

Query: 15  FDISHGYFGK-------YLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGED 67
           F I HG  G        +  +    Y+ P   + G L     Y+   +    + +  G  
Sbjct: 133 FHIGHGNHGDPESFSVPHFKEKVIEYLEPASTDGGQLTREEAYFTNQMLGRIKSLPRGAA 192

Query: 68  YYN------DYYPDLIANDSVRFLRQSKQSFSKK--PIMLVMSFPAPHGPEDSAPQ---- 115
           Y +      DY    +A ++++ L+ +KQ    +  P  +   F  PH P  SAPQ    
Sbjct: 193 YESPDAKDEDYADGRVAAETIQRLQAAKQRQQTEGTPFFIASGFARPHLPF-SAPQKYWD 251

Query: 116 -YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPV----HRQFTDLLMTKRLQ---- 166
            Y      + +H T   D AP    +   +++   +PV    +  F D L    +     
Sbjct: 252 LYDPASLPMPTHETLPVD-APKVAGKRGGEIS-NYKPVPTEPNADFDDELKRNLIHGYYA 309

Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
           ++  VD  + K++KEL  L   +NT ++   DHG+HLG  G+   K   +E   R+P L+
Sbjct: 310 SVSYVDAQIGKVIKELDRLELSDNTIVVLWGDHGFHLGDLGIWT-KHTNYEQANRIPILI 368

Query: 227 RGPGIV-PGT 235
             PG+  PG+
Sbjct: 369 TAPGVTQPGS 378


>gi|402073090|gb|EJT68723.1| hypothetical protein GGTG_13710 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 383

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 37/129 (28%), Positives = 59/129 (45%), Gaps = 6/129 (4%)

Query: 116 YANMFFNVTSHHTPAYDYAPNPDKQ---WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVD 172
           +  +F N T   TP++    N  +    WI +++R M      + D    +R Q L  VD
Sbjct: 201 HEKLFTNATVPRTPSFSPPDNVQRNQVWWIRELSR-MGDAEMDWADAEYRRRAQALAGVD 259

Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHL--GQFGLVKGKSFPFEFDIRVPFLMRGPG 230
             V  ++ +L++ G L+NTY          L      +  GK+ PF  D  +PF++RGPG
Sbjct: 260 KLVSDVLGKLEEKGALDNTYSKRRHSFAVPLPSSNHRMAVGKTTPFTEDTNLPFVVRGPG 319

Query: 231 IVPGTMYLL 239
           +  G    L
Sbjct: 320 VPQGVESTL 328


>gi|423111042|ref|ZP_17098737.1| hypothetical protein HMPREF9687_04288 [Klebsiella oxytoca 10-5243]
 gi|376377522|gb|EHS90291.1| hypothetical protein HMPREF9687_04288 [Klebsiella oxytoca 10-5243]
          Length = 497

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 44/172 (25%), Positives = 76/172 (44%), Gaps = 33/172 (19%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP---------------EDSAPQYANMFF 121
           I+N +V FLRQ  ++   +P ++V+S+  PH P                D  P+  +   
Sbjct: 166 ISNRAVDFLRQPARA--DEPFLMVVSYDEPHHPFTCPVEYLRKYQDFYYDLGPKAHDSLA 223

Query: 122 NVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
           +   HH       P+P    + +  R   P++    D            VDD + +++  
Sbjct: 224 DKPEHHRLWSQAMPSP----VGEDGRYHHPLYFACNDF-----------VDDQIGRVINA 268

Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
           L    + +NT++IYTSDHG  +G   L+   +  ++   R+P +MR P  VP
Sbjct: 269 LTPQ-QRDNTWVIYTSDHGEMMGAHRLISKGAAMYDDITRIPLIMRPPQGVP 319


>gi|223936368|ref|ZP_03628280.1| sulfatase [bacterium Ellin514]
 gi|223894886|gb|EEF61335.1| sulfatase [bacterium Ellin514]
          Length = 510

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 45/175 (25%), Positives = 85/175 (48%), Gaps = 19/175 (10%)

Query: 77  IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
           +A +++R LR+ K     KP  +   F  PH P+ +  +Y    F++           P 
Sbjct: 224 VATETIRLLREHKD----KPFFIAAGFYRPHVPDIATKKY----FDLYPKEKIELPIGPK 275

Query: 137 PDKQWILQVTRKMQPVHRQFTD--LLMTKR--LQTLQSVDDAVEKIVKELKDLGELNNTY 192
              + I ++   ++P++   TD  L + KR    ++  VD  V +++ EL++L   + T 
Sbjct: 276 EHFKDIPEMALTVKPLNYGLTDDQLRLFKRAYFASISFVDAQVGRVLNELENLKLADKTI 335

Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG------IVPGTMYLLDV 241
           ++   DHG+ LG+ G  + +S  F+  + VP ++  PG      + P T+ LLD+
Sbjct: 336 VVCFGDHGWLLGEHGQWQKQSL-FDESVHVPLMVYLPGAKGNGKVSPRTVELLDI 389


>gi|358381505|gb|EHK19180.1| hypothetical protein TRIVIDRAFT_69180 [Trichoderma virens Gv29-8]
          Length = 552

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 76/163 (46%), Gaps = 6/163 (3%)

Query: 71  DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
           DY  +++   +       ++   K+P  L +S   PH P     +Y N + +V     P 
Sbjct: 181 DYDEEVMYRSTQYLWDHVREGPDKRPFALTVSLTHPHDPYTITKKYWNRYEDV-DIDLPK 239

Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLG 186
              A         ++ +       +FTD  + +  +    ++  VDD + K+++ L+D G
Sbjct: 240 VRIAKEKQDTHSKRLLKVCDLWDLEFTDEQIKRAKRAYYGSVSYVDDCIGKLLETLEDAG 299

Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
            ++NT +I++ DHG  LG+ GL    S+ FE  +RVP L+  P
Sbjct: 300 LMDNTIVIFSGDHGDMLGERGLWYKMSY-FESSVRVPMLVHYP 341


>gi|393784525|ref|ZP_10372688.1| hypothetical protein HMPREF1071_03556 [Bacteroides salyersiae
           CL02T12C01]
 gi|392665506|gb|EIY59030.1| hypothetical protein HMPREF1071_03556 [Bacteroides salyersiae
           CL02T12C01]
          Length = 526

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   ++T++S+DD V +++  L++ G L+NT ++YTSD G+++G+ G    K F +E  +
Sbjct: 316 MRDYMKTVKSLDDNVGRVLDYLEEKGMLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 374

Query: 221 RVPFLMRGP 229
           R P +MR P
Sbjct: 375 RTPLIMRLP 383


>gi|430741207|ref|YP_007200336.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430012927|gb|AGA24641.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 451

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 53/236 (22%), Positives = 97/236 (41%), Gaps = 45/236 (19%)

Query: 18  SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
           +HGYF  ++ K++    P            S+++++  + +G       D    +     
Sbjct: 111 AHGYFVGHVGKWHNGTFP------------SEHFDFGRSYSGTHWMTEPDGSKIHVTKKN 158

Query: 78  ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA------NMFFNVT------- 124
            ND++ FLR      + KP  L ++F A H  + +  QY       +++ +VT       
Sbjct: 159 ENDALEFLRDRP---ADKPFCLTLAFFATHAEDGNPKQYLPQPESLSLYQDVTVPVPKTA 215

Query: 125 --SHHTPAYDYAPNPDKQ------WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVE 176
             +H      +  N   +      W      + Q   + +  L           VD    
Sbjct: 216 TEAHFKKLPPFIANEKNEGRNRWHWRFDTPERYQQYMKDYYRLA--------SEVDATCG 267

Query: 177 KIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
           +++ EL+  G L+NT +I+T+D+GY   + GL   K +P E  IRVP ++R P + 
Sbjct: 268 RVLAELEKQGVLDNTLVIFTTDNGYFHAEHGLAD-KWYPHEESIRVPLIVRDPRMA 322


>gi|260906523|ref|ZP_05914845.1| choline-sulfatase [Brevibacterium linens BL2]
          Length = 509

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 45/157 (28%), Positives = 73/157 (46%), Gaps = 9/157 (5%)

Query: 85  LRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT---SHHTPAYDYAPNPDKQW 141
           +R ++ +   +P ++V SF  PH P +   ++ + F  V      H    D A +P    
Sbjct: 178 VRANQAAGEDQPFLMVTSFIHPHDPYEPPREHWDRFAEVDIPDPAHPEVPDIAEDPHSHR 237

Query: 142 ILQVT--RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
           +  ++   K +P                +  +DD + KI + L++L   +NT II TSDH
Sbjct: 238 LRTMSGLDKKEPGTEDIR-RARRAYYAAVSYIDDHIGKIRQRLRELELEDNTVIIVTSDH 296

Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGP--GIVPG 234
           G  LG+ GL   K  P+E   RVP ++ GP   + PG
Sbjct: 297 GDMLGEKGLWY-KMSPYEQSSRVPIIINGPAEAVTPG 332


>gi|383766220|ref|YP_005445201.1| sulfatase [Phycisphaera mikurensis NBRC 102666]
 gi|381386488|dbj|BAM03304.1| sulfatase [Phycisphaera mikurensis NBRC 102666]
          Length = 466

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 53/158 (33%), Positives = 70/158 (44%), Gaps = 19/158 (12%)

Query: 90  QSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKM 149
           Q+    P   V  F +PH P   APQ     ++  S  TP     P PD +  L      
Sbjct: 211 QAAGGTPFFCVAGFYSPHSPL-VAPQRFLDLYDPKSL-TPL----PEPDDRAGLD----- 259

Query: 150 QPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLV 209
            P HR+     +      +  VD  V +I+ EL   G   +T + +T DHG  LG FG  
Sbjct: 260 DPEHRRQA---LHGYYAMISEVDHWVGEILSELARAGHAEDTVVAFTGDHGESLGDFGR- 315

Query: 210 KGKSFPFEFDI-RVPFLMRGPGIVPGTMYLLDVLIPQV 246
            GKSFP E  + RVP L+  PG  PG   + D L+  V
Sbjct: 316 WGKSFPGEDCVSRVPLLVASPGQRPG---VCDALVEAV 350


>gi|253688811|ref|YP_003018001.1| sulfatase [Pectobacterium carotovorum subsp. carotovorum PC1]
 gi|251755389|gb|ACT13465.1| sulfatase [Pectobacterium carotovorum subsp. carotovorum PC1]
          Length = 564

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 60/234 (25%), Positives = 98/234 (41%), Gaps = 25/234 (10%)

Query: 20  GYFGKYL---NKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDY----YNDY 72
           G+FGK+      Y       G+  W  L+    YY   INM G + K   D        Y
Sbjct: 132 GFFGKWHFGGADYTAKAGFAGFDRWVGLLGQGDYY--PINMFGEQAKLNIDGKMVPQKGY 189

Query: 73  YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-----SHH 127
             D + + +V +L    +   KKP M+ +S    H     A ++      VT     ++ 
Sbjct: 190 ITDELTDYAVNWLDGIDK---KKPFMMYLSHKGVHSDFYPAIRHKGSMDKVTFPLPETYA 246

Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ-----TLQSVDDAVEKIVKEL 182
               +Y   P   W+         V   +   +  ++ Q     TL+SVDD+V ++ + L
Sbjct: 247 DTPENYEGKP--MWVKNQRNSWHGVDYPYNKKMDMQQFQRDYYETLRSVDDSVGRVQEWL 304

Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
           K  G   NT ++   D+G+  G+ GL+  +S  +E  +RVP +  GPG   G +
Sbjct: 305 KKNGLDKNTIVMVMGDNGFTFGEHGLIDKRS-AYETSMRVPLIASGPGFGKGDV 357


>gi|440717286|ref|ZP_20897776.1| iduronate-2-sulfatase [Rhodopirellula baltica SWK14]
 gi|436437472|gb|ELP31098.1| iduronate-2-sulfatase [Rhodopirellula baltica SWK14]
          Length = 503

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 63/250 (25%), Positives = 105/250 (42%), Gaps = 33/250 (13%)

Query: 15  FDISHGYFGK-------YLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGED 67
           F I HG  G        +  +    Y+ P   + G L     Y+   +    + +  G  
Sbjct: 133 FHIGHGNHGDPESFSVPHFKEKVIEYLEPASTDGGQLTREEAYFTNQMLGRIKSLPRGAA 192

Query: 68  YYN------DYYPDLIANDSVRFLRQSKQSFSKK--PIMLVMSFPAPHGPEDSAPQ---- 115
           Y +      DY    +A ++++ L+ +KQ    +  P  +   F  PH P  SAPQ    
Sbjct: 193 YESPDAKDEDYADGRVAAETIQRLQAAKQRQQTEGTPFFIASGFARPHLPF-SAPQKYWD 251

Query: 116 -YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPV----HRQFTDLLMTKRLQ---- 166
            Y      + +H T   D AP    +   +++   +PV    +  F D L    +     
Sbjct: 252 LYDPASLPMPTHETLPVD-APKVAGKRGGEIS-NYKPVPTEPNADFDDELKRNLIHGYYA 309

Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
           ++  VD  + K++KEL  L   +NT ++   DHG+HLG  G+   K   +E   R+P L+
Sbjct: 310 SVSYVDAQIGKVIKELDRLELSDNTIVVLWGDHGFHLGDLGIWT-KHTNYEQANRIPILI 368

Query: 227 RGPGIV-PGT 235
             PG+  PG+
Sbjct: 369 TAPGVTQPGS 378


>gi|387790940|ref|YP_006256005.1| arylsulfatase A family protein [Solitalea canadensis DSM 3403]
 gi|379653773|gb|AFD06829.1| arylsulfatase A family protein [Solitalea canadensis DSM 3403]
          Length = 516

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 48/76 (63%), Gaps = 2/76 (2%)

Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
           M   L T++SVD++V +++  LK  G   NT +IYTSD G++LG+ G    K F +E  +
Sbjct: 312 MQDYLATVKSVDESVGELLDYLKREGLDKNTIVIYTSDQGFYLGEHGWF-DKRFMYEESM 370

Query: 221 RVPFLMRGPG-IVPGT 235
            +P LM+ PG I PGT
Sbjct: 371 GMPLLMKYPGHIEPGT 386


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.139    0.431 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,693,552,418
Number of Sequences: 23463169
Number of extensions: 208934889
Number of successful extensions: 511716
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2207
Number of HSP's successfully gapped in prelim test: 3729
Number of HSP's that attempted gapping in prelim test: 505273
Number of HSP's gapped (non-prelim): 7151
length of query: 271
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 131
effective length of database: 9,074,351,707
effective search space: 1188740073617
effective search space used: 1188740073617
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 75 (33.5 bits)