BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy5200
(271 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|380010998|ref|XP_003689601.1| PREDICTED: LOW QUALITY PROTEIN: extracellular sulfatase SULF-1
homolog [Apis florea]
Length = 1346
Score = 421 bits (1082), Expect = e-115, Method: Composition-based stats.
Identities = 184/217 (84%), Positives = 208/217 (95%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNS+YYNYS+NMNG+KIKHG +Y DYYPDLIAN
Sbjct: 112 GYFGKYLNKYNGSYIPPGWREWGGLIMNSRYYNYSVNMNGKKIKHGFEYSKDYYPDLIAN 171
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DSV FLRQSK +F++KP+MLV SFPAPHGPEDSAPQ++++FFNVT+HHTPAYDYAPNPDK
Sbjct: 172 DSVTFLRQSKHNFARKPVMLVASFPAPHGPEDSAPQFSHLFFNVTTHHTPAYDYAPNPDK 231
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVDDAV++I +ELKDLGEL+NTYIIYTSDH
Sbjct: 232 QWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDDAVDRIYQELKDLGELDNTYIIYTSDH 291
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI PGT+
Sbjct: 292 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGIEPGTI 328
>gi|110760118|ref|XP_393715.3| PREDICTED: extracellular sulfatase SULF-1 homolog [Apis mellifera]
Length = 1436
Score = 421 bits (1082), Expect = e-115, Method: Composition-based stats.
Identities = 184/217 (84%), Positives = 208/217 (95%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNS+YYNYS+NMNG+KIKHG +Y DYYPDLIAN
Sbjct: 170 GYFGKYLNKYNGSYIPPGWREWGGLIMNSRYYNYSVNMNGKKIKHGFEYSKDYYPDLIAN 229
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DSV FLRQSK +F++KP+MLV SFPAPHGPEDSAPQ++++FFNVT+HHTPAYDYAPNPDK
Sbjct: 230 DSVTFLRQSKHNFARKPVMLVASFPAPHGPEDSAPQFSHLFFNVTTHHTPAYDYAPNPDK 289
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVDDAV++I +ELKDLGEL+NTYIIYTSDH
Sbjct: 290 QWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDDAVDRIYQELKDLGELDNTYIIYTSDH 349
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI PGT+
Sbjct: 350 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGIEPGTI 386
>gi|350410548|ref|XP_003489072.1| PREDICTED: extracellular sulfatase SULF-1 homolog [Bombus
impatiens]
Length = 1416
Score = 420 bits (1079), Expect = e-115, Method: Composition-based stats.
Identities = 184/217 (84%), Positives = 208/217 (95%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNS+YYNYS+NMNG+KIKHG +Y DYYPDLIAN
Sbjct: 165 GYFGKYLNKYNGSYIPPGWREWGGLIMNSRYYNYSVNMNGKKIKHGFEYSKDYYPDLIAN 224
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DSV FLRQSK +F++KP+MLV SFPAPHGPEDSAPQ++++FFNVT+HHTPAYDYAPNPDK
Sbjct: 225 DSVAFLRQSKHNFARKPVMLVASFPAPHGPEDSAPQFSHLFFNVTTHHTPAYDYAPNPDK 284
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVDDAV++I +ELKDLGEL+NTYIIYTSDH
Sbjct: 285 QWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDDAVDRIYQELKDLGELDNTYIIYTSDH 344
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI PGT+
Sbjct: 345 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGIEPGTI 381
>gi|340719524|ref|XP_003398201.1| PREDICTED: LOW QUALITY PROTEIN: extracellular sulfatase SULF-1
homolog [Bombus terrestris]
Length = 1416
Score = 420 bits (1079), Expect = e-115, Method: Composition-based stats.
Identities = 184/217 (84%), Positives = 208/217 (95%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNS+YYNYS+NMNG+KIKHG +Y DYYPDLIAN
Sbjct: 165 GYFGKYLNKYNGSYIPPGWREWGGLIMNSRYYNYSVNMNGKKIKHGFEYSKDYYPDLIAN 224
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DSV FLRQSK +F++KP+MLV SFPAPHGPEDSAPQ++++FFNVT+HHTPAYDYAPNPDK
Sbjct: 225 DSVAFLRQSKHNFARKPVMLVASFPAPHGPEDSAPQFSHLFFNVTTHHTPAYDYAPNPDK 284
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVDDAV++I +ELKDLGEL+NTYIIYTSDH
Sbjct: 285 QWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDDAVDRIYQELKDLGELDNTYIIYTSDH 344
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI PGT+
Sbjct: 345 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGIEPGTI 381
>gi|383862653|ref|XP_003706798.1| PREDICTED: extracellular sulfatase SULF-1 homolog [Megachile
rotundata]
Length = 1395
Score = 419 bits (1077), Expect = e-115, Method: Composition-based stats.
Identities = 183/217 (84%), Positives = 208/217 (95%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNS+YYNYS+NMNG+KIKHG +Y DYYPDLIAN
Sbjct: 164 GYFGKYLNKYNGSYIPPGWREWGGLIMNSRYYNYSVNMNGKKIKHGFEYSKDYYPDLIAN 223
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DSV FLRQSK +F++KP+MLV SFPAPHGPEDSAPQ++++FFNVT+HHTPAYDYAPNPDK
Sbjct: 224 DSVAFLRQSKHNFARKPVMLVASFPAPHGPEDSAPQFSHLFFNVTTHHTPAYDYAPNPDK 283
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVD+AV++I +ELKDLGEL+NTYIIYTSDH
Sbjct: 284 QWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDNAVDRIYQELKDLGELDNTYIIYTSDH 343
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI PGT+
Sbjct: 344 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGIEPGTI 380
>gi|345495672|ref|XP_001606010.2| PREDICTED: extracellular sulfatase SULF-1 homolog [Nasonia
vitripennis]
Length = 1355
Score = 419 bits (1076), Expect = e-115, Method: Composition-based stats.
Identities = 183/217 (84%), Positives = 206/217 (94%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNS+YYNYS+NMNG+KIKHG +Y DYYPDLIAN
Sbjct: 162 GYFGKYLNKYNGSYIPPGWREWGGLIMNSRYYNYSVNMNGKKIKHGFEYNKDYYPDLIAN 221
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DSV FLRQSK +F++KP+MLV SFPAPHGPEDSAPQY++MFFNVT+HHTPAYDYAPNPDK
Sbjct: 222 DSVAFLRQSKHNFARKPVMLVASFPAPHGPEDSAPQYSDMFFNVTTHHTPAYDYAPNPDK 281
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWILQVT+KMQP+H++FTDLLMTKRLQTLQSVD AVE+I +ELK LGEL+NTYIIYTSDH
Sbjct: 282 QWILQVTQKMQPIHKEFTDLLMTKRLQTLQSVDAAVERIYQELKSLGELDNTYIIYTSDH 341
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI PG++
Sbjct: 342 GYHLGQFGLIKGKSFPFEFDVRVPFLVRGPGIAPGSV 378
>gi|347969800|ref|XP_314283.5| AGAP003374-PA [Anopheles gambiae str. PEST]
gi|333469277|gb|EAA09641.5| AGAP003374-PA [Anopheles gambiae str. PEST]
Length = 1361
Score = 410 bits (1053), Expect = e-112, Method: Composition-based stats.
Identities = 179/217 (82%), Positives = 202/217 (93%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSINMNG+KIKHG DY DYYPDLIAN
Sbjct: 35 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINMNGQKIKHGFDYAKDYYPDLIAN 94
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLRQSK +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTPAYD+APNPDK
Sbjct: 95 DSIAFLRQSKHQNHRKPVLLAMSFPAPHGPEDSAPQYSHLFFNVTTHHTPAYDHAPNPDK 154
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT++M+P+HR+FTDLLMTKRLQTLQSVD AVE++ +ELK LGEL+NTYIIYTSDH
Sbjct: 155 QWILRVTQQMEPIHRKFTDLLMTKRLQTLQSVDVAVERVYQELKALGELDNTYIIYTSDH 214
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYHLGQFGL+KGKSFPFEFD+RVPFLMRGPGI P T+
Sbjct: 215 GYHLGQFGLIKGKSFPFEFDVRVPFLMRGPGIEPATV 251
>gi|270007459|gb|EFA03907.1| hypothetical protein TcasGA2_TC014039 [Tribolium castaneum]
Length = 1049
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 180/217 (82%), Positives = 206/217 (94%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSY+PPGWREWG LIMNSKYYNYSINMNG+KIKHG DY+ DYYPDLIAN
Sbjct: 141 GYFGKYLNKYNGSYVPPGWREWGGLIMNSKYYNYSINMNGKKIKHGFDYHKDYYPDLIAN 200
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLRQSK++ + KP+ML MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YDYAPNPDK
Sbjct: 201 DSIAFLRQSKKNSAHKPVMLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDYAPNPDK 260
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVD AV+ IV+EL+ LGEL+NTYIIYTSDH
Sbjct: 261 QWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDAAVQLIVEELRALGELDNTYIIYTSDH 320
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYHLGQFGL+KGKSFPFEFD+R+PFL+RGPG+ PGT+
Sbjct: 321 GYHLGQFGLIKGKSFPFEFDVRIPFLVRGPGVEPGTV 357
>gi|189237136|ref|XP_973189.2| PREDICTED: similar to Sulfated CG6725-PA [Tribolium castaneum]
Length = 1049
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 180/217 (82%), Positives = 206/217 (94%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSY+PPGWREWG LIMNSKYYNYSINMNG+KIKHG DY+ DYYPDLIAN
Sbjct: 141 GYFGKYLNKYNGSYVPPGWREWGGLIMNSKYYNYSINMNGKKIKHGFDYHKDYYPDLIAN 200
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLRQSK++ + KP+ML MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YDYAPNPDK
Sbjct: 201 DSIAFLRQSKKNSAHKPVMLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDYAPNPDK 260
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVD AV+ IV+EL+ LGEL+NTYIIYTSDH
Sbjct: 261 QWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDAAVQLIVEELRALGELDNTYIIYTSDH 320
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYHLGQFGL+KGKSFPFEFD+R+PFL+RGPG+ PGT+
Sbjct: 321 GYHLGQFGLIKGKSFPFEFDVRIPFLVRGPGVEPGTV 357
>gi|242005522|ref|XP_002423613.1| sulfatase-1, sulf-1, putative [Pediculus humanus corporis]
gi|212506773|gb|EEB10875.1| sulfatase-1, sulf-1, putative [Pediculus humanus corporis]
Length = 1037
Score = 400 bits (1027), Expect = e-109, Method: Compositional matrix adjust.
Identities = 178/217 (82%), Positives = 201/217 (92%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNS+YYNYSIN+NG+KIKHG DY DYYPDLIAN
Sbjct: 176 GYFGKYLNKYNGSYIPPGWREWGGLIMNSRYYNYSINLNGKKIKHGFDYSKDYYPDLIAN 235
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLRQSKQ FS+KP+MLV+SFPAPHGPEDSAPQY+ MFFNVT+HHTP+YD+APN DK
Sbjct: 236 DSIAFLRQSKQYFSRKPVMLVVSFPAPHGPEDSAPQYSKMFFNVTTHHTPSYDFAPNLDK 295
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT KMQ +H+QFTD LMTKRLQTLQSVD AVE+I +ELK++GEL NTYI+YTSDH
Sbjct: 296 QWILRVTNKMQDIHKQFTDTLMTKRLQTLQSVDSAVERIYQELKNIGELENTYILYTSDH 355
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYHLGQFGLVKGKSFPFEFDIRVPFL+RGPG+ G++
Sbjct: 356 GYHLGQFGLVKGKSFPFEFDIRVPFLVRGPGVEAGSI 392
>gi|218506045|gb|ACK77664.1| RE44752p [Drosophila melanogaster]
Length = 1114
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 173/212 (81%), Positives = 197/212 (92%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY DYYPDLIAN
Sbjct: 149 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 208
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLR SKQ +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 209 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 268
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT MQPVH++FT+LLMTKRLQTLQSVD AVE++ ELK+LGEL+NTYI+YTSDH
Sbjct: 269 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 328
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 329 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 360
>gi|195451083|ref|XP_002072760.1| GK13512 [Drosophila willistoni]
gi|194168845|gb|EDW83746.1| GK13512 [Drosophila willistoni]
Length = 1148
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 173/212 (81%), Positives = 196/212 (92%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY DYYPDLIAN
Sbjct: 153 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 212
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLR SKQ +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 213 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 272
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT MQPVH++FT+LLMTKRLQTLQSVD AVE++ ELK LGEL+NTYI+YTSDH
Sbjct: 273 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKALGELDNTYIVYTSDH 332
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 333 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 364
>gi|24647401|ref|NP_524987.1| sulfated, isoform A [Drosophila melanogaster]
gi|442619385|ref|NP_001262626.1| sulfated, isoform B [Drosophila melanogaster]
gi|442619387|ref|NP_001262627.1| sulfated, isoform C [Drosophila melanogaster]
gi|33112451|sp|Q9VEX0.1|SULF1_DROME RecName: Full=Extracellular sulfatase SULF-1 homolog;
Short=DmSulf-1; Flags: Precursor
gi|7300128|gb|AAF55296.1| sulfated, isoform A [Drosophila melanogaster]
gi|270289773|gb|ACZ67866.1| FI12873p [Drosophila melanogaster]
gi|440217485|gb|AGB96006.1| sulfated, isoform B [Drosophila melanogaster]
gi|440217486|gb|AGB96007.1| sulfated, isoform C [Drosophila melanogaster]
Length = 1114
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 173/212 (81%), Positives = 197/212 (92%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY DYYPDLIAN
Sbjct: 149 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 208
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLR SKQ +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 209 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 268
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT MQPVH++FT+LLMTKRLQTLQSVD AVE++ ELK+LGEL+NTYI+YTSDH
Sbjct: 269 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 328
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 329 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 360
>gi|195501026|ref|XP_002097625.1| GE26324 [Drosophila yakuba]
gi|194183726|gb|EDW97337.1| GE26324 [Drosophila yakuba]
Length = 1108
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 173/212 (81%), Positives = 197/212 (92%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY DYYPDLIAN
Sbjct: 149 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 208
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLR SKQ +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 209 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 268
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT MQPVH++FT+LLMTKRLQTLQSVD AVE++ ELK+LGEL+NTYI+YTSDH
Sbjct: 269 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 328
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 329 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 360
>gi|194901252|ref|XP_001980166.1| GG20070 [Drosophila erecta]
gi|190651869|gb|EDV49124.1| GG20070 [Drosophila erecta]
Length = 1115
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 173/212 (81%), Positives = 197/212 (92%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY DYYPDLIAN
Sbjct: 150 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 209
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLR SKQ +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 210 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 269
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT MQPVH++FT+LLMTKRLQTLQSVD AVE++ ELK+LGEL+NTYI+YTSDH
Sbjct: 270 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 329
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 330 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 361
>gi|21429186|gb|AAM50312.1| SD04414p [Drosophila melanogaster]
Length = 1006
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 173/212 (81%), Positives = 197/212 (92%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY DYYPDLIAN
Sbjct: 41 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 100
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLR SKQ +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 101 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 160
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT MQPVH++FT+LLMTKRLQTLQSVD AVE++ ELK+LGEL+NTYI+YTSDH
Sbjct: 161 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 220
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 221 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 252
>gi|195152788|ref|XP_002017318.1| GL22250 [Drosophila persimilis]
gi|194112375|gb|EDW34418.1| GL22250 [Drosophila persimilis]
Length = 1173
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 173/212 (81%), Positives = 197/212 (92%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY DYYPDLIAN
Sbjct: 163 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 222
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLR SKQ +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 223 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 282
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT MQPVH++FT+LLMTKRLQTLQSVD AVE++ EL+DLGEL+NTYI+YTSDH
Sbjct: 283 QWILRVTDPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELRDLGELDNTYIVYTSDH 342
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 343 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 374
>gi|198454076|ref|XP_001359462.2| GA19814 [Drosophila pseudoobscura pseudoobscura]
gi|198132640|gb|EAL28608.2| GA19814 [Drosophila pseudoobscura pseudoobscura]
Length = 1160
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 173/212 (81%), Positives = 197/212 (92%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY DYYPDLIAN
Sbjct: 162 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 221
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLR SKQ +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 222 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 281
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT MQPVH++FT+LLMTKRLQTLQSVD AVE++ EL+DLGEL+NTYI+YTSDH
Sbjct: 282 QWILRVTDPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELRDLGELDNTYIVYTSDH 341
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 342 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 373
>gi|195349414|ref|XP_002041240.1| GM15446 [Drosophila sechellia]
gi|194122845|gb|EDW44888.1| GM15446 [Drosophila sechellia]
Length = 1114
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 173/212 (81%), Positives = 197/212 (92%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY DYYPDLIAN
Sbjct: 149 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 208
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLR SKQ +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 209 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 268
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT MQPVH++FT+LLMTKRLQTLQSVD AVE++ ELK+LGEL+NTYI+YTSDH
Sbjct: 269 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 328
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 329 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 360
>gi|195570418|ref|XP_002103204.1| GD20299 [Drosophila simulans]
gi|194199131|gb|EDX12707.1| GD20299 [Drosophila simulans]
Length = 1114
Score = 390 bits (1001), Expect = e-106, Method: Compositional matrix adjust.
Identities = 173/212 (81%), Positives = 197/212 (92%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY DYYPDLIAN
Sbjct: 149 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 208
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLR SKQ +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 209 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 268
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT MQPVH++FT+LLMTKRLQTLQSVD AVE++ ELK+LGEL+NTYI+YTSDH
Sbjct: 269 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 328
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 329 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 360
>gi|195388950|ref|XP_002053141.1| GJ23505 [Drosophila virilis]
gi|194151227|gb|EDW66661.1| GJ23505 [Drosophila virilis]
Length = 1135
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 171/212 (80%), Positives = 196/212 (92%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY DYYPDLIAN
Sbjct: 164 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 223
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLR SKQ +KP++L MSFPAPHGPEDSAPQY+++F+NVT+HHTP+YD+APNPDK
Sbjct: 224 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFYNVTTHHTPSYDHAPNPDK 283
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT MQPVH++FT++LMTKRLQTLQSVD AVE++ ELK LGEL+NTYI+YTSDH
Sbjct: 284 QWILRVTEPMQPVHKRFTNILMTKRLQTLQSVDVAVERVYNELKALGELDNTYIVYTSDH 343
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 344 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 375
>gi|194744781|ref|XP_001954871.1| GF16526 [Drosophila ananassae]
gi|190627908|gb|EDV43432.1| GF16526 [Drosophila ananassae]
Length = 1145
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 172/212 (81%), Positives = 196/212 (92%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KI+HG DY DYYPDLIAN
Sbjct: 154 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIRHGFDYAKDYYPDLIAN 213
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLR SKQ +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HHTP+YD+APNPDK
Sbjct: 214 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDK 273
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT MQPVH++FT+LLMTKRLQTLQSVD AVE++ ELK LGEL+NTYI+YTSDH
Sbjct: 274 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKSLGELDNTYIVYTSDH 333
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 334 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 365
>gi|328703992|ref|XP_001948094.2| PREDICTED: extracellular sulfatase SULF-1 homolog [Acyrthosiphon
pisum]
Length = 929
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 169/216 (78%), Positives = 195/216 (90%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSY+PPGWR+WG LIMNSKYYNYS+N+NG+KIKHG+DY+ DYYP+L+ N
Sbjct: 170 GYFGKYLNKYNGSYVPPGWRQWGGLIMNSKYYNYSVNLNGKKIKHGDDYHKDYYPNLVTN 229
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FL+ +KQ +P MLVMSFP+PHGPEDSAP+Y++MFFNVTSHHTP YDYAPNPDK
Sbjct: 230 DSINFLQHAKQHHPHRPFMLVMSFPSPHGPEDSAPEYSDMFFNVTSHHTPTYDYAPNPDK 289
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWILQ T +M P+H++FT+LLMTKRLQTLQSVDDAV ++ ELK LGEL NTYIIYTSDH
Sbjct: 290 QWILQATGQMLPIHKKFTNLLMTKRLQTLQSVDDAVSRVYGELKQLGELENTYIIYTSDH 349
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
GYHLGQFGLVKGKSFPFEFD+RVP L+RGPGI PGT
Sbjct: 350 GYHLGQFGLVKGKSFPFEFDVRVPMLVRGPGIEPGT 385
>gi|195110209|ref|XP_001999674.1| GI24652 [Drosophila mojavensis]
gi|193916268|gb|EDW15135.1| GI24652 [Drosophila mojavensis]
Length = 1155
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 171/212 (80%), Positives = 194/212 (91%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY DYYPDLIAN
Sbjct: 164 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 223
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLR SKQ +KP++L MSFPAPHGPEDSAPQY+++F NVT+HHTP+YD+APNPDK
Sbjct: 224 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFHNVTTHHTPSYDHAPNPDK 283
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT MQPVH +FT++LMTKRLQTLQSVD AVE++ ELK LGEL+NTYI+YTSDH
Sbjct: 284 QWILRVTEPMQPVHMRFTNILMTKRLQTLQSVDVAVERVYNELKALGELDNTYIVYTSDH 343
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 344 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 375
>gi|195038641|ref|XP_001990765.1| GH18078 [Drosophila grimshawi]
gi|193894961|gb|EDV93827.1| GH18078 [Drosophila grimshawi]
Length = 1191
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 170/212 (80%), Positives = 195/212 (91%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+KIKHG DY DYYPDLIAN
Sbjct: 176 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIAN 235
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLR SKQ +KP++L MSFPAPHGPEDSAPQY+++F+NVT+HHTP+YD+APNPDK
Sbjct: 236 DSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFYNVTTHHTPSYDHAPNPDK 295
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT M PVH++FT++LMTKRLQTLQSVD AVE++ ELK LGEL+NTYI+YTSDH
Sbjct: 296 QWILRVTDPMLPVHKRFTNVLMTKRLQTLQSVDVAVERVYNELKALGELDNTYIVYTSDH 355
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 356 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 387
>gi|321466795|gb|EFX77788.1| hypothetical protein DAPPUDRAFT_305315 [Daphnia pulex]
Length = 856
Score = 361 bits (927), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 163/217 (75%), Positives = 186/217 (85%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSY+PPGWR W +MNS+YYNY+IN NG+KIKHG DY DYYPDLI N
Sbjct: 76 GYFGKYLNKYNGSYVPPGWRVWAGQVMNSRYYNYTINFNGQKIKHGADYQTDYYPDLITN 135
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ FLR+SKQ + +P LVMSFP+PHGPEDSAPQY++MFFNVT H TP+YD+APNPDK
Sbjct: 136 DSISFLRESKQHYDHQPFHLVMSFPSPHGPEDSAPQYSDMFFNVTRHRTPSYDFAPNPDK 195
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+ KM+PVH +FTDLLMTKRLQTLQSVD+AVEKI EL LG+L NTYI YTSDH
Sbjct: 196 QWILRWLGKMKPVHHRFTDLLMTKRLQTLQSVDEAVEKIFSELVALGQLENTYIFYTSDH 255
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYHLGQFGL+KGKSFPFE DIRVPFL+RGPG+ G +
Sbjct: 256 GYHLGQFGLIKGKSFPFESDIRVPFLVRGPGVGRGRV 292
>gi|307202326|gb|EFN81784.1| Extracellular sulfatase SULF-1-like protein [Harpegnathos saltator]
Length = 1241
Score = 361 bits (927), Expect = 2e-97, Method: Composition-based stats.
Identities = 157/191 (82%), Positives = 181/191 (94%)
Query: 46 MNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPA 105
MNS+YYNYS+NMNG+KIKHG +Y DYYPDLIANDSV FLRQSK +F++KP+MLV SFPA
Sbjct: 1 MNSRYYNYSVNMNGKKIKHGFEYSRDYYPDLIANDSVAFLRQSKHNFARKPVMLVASFPA 60
Query: 106 PHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL 165
PHGPEDSAPQY+++FFNVT+HHTPAYDYAPNPDKQWILQVT+KMQP+H+QFTDLLMTKRL
Sbjct: 61 PHGPEDSAPQYSHLFFNVTTHHTPAYDYAPNPDKQWILQVTQKMQPIHKQFTDLLMTKRL 120
Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
QTLQSVD AV++I +ELKDLGEL+NTYIIYTSDHGYHLGQFGL+KGKSFPFEFD+RVPFL
Sbjct: 121 QTLQSVDAAVDRIYQELKDLGELDNTYIIYTSDHGYHLGQFGLIKGKSFPFEFDVRVPFL 180
Query: 226 MRGPGIVPGTM 236
+RGPGI G++
Sbjct: 181 IRGPGIEAGSV 191
>gi|332031465|gb|EGI70950.1| Extracellular sulfatase SULF-1-like protein [Acromyrmex echinatior]
Length = 1215
Score = 356 bits (913), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 158/191 (82%), Positives = 182/191 (95%)
Query: 46 MNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPA 105
MNS+YYNYS+NMNG+KIKHG DY DYYPDLIANDSV FLRQSK +F++KP+MLV SFPA
Sbjct: 1 MNSRYYNYSVNMNGKKIKHGFDYSRDYYPDLIANDSVAFLRQSKHNFARKPVMLVASFPA 60
Query: 106 PHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL 165
PHGPEDSAPQ++++FFNVT+HHTPAYDYAPNPDKQWILQVT+KMQP+H+QFTDLLMTKRL
Sbjct: 61 PHGPEDSAPQFSHLFFNVTTHHTPAYDYAPNPDKQWILQVTQKMQPIHKQFTDLLMTKRL 120
Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
QTLQSVD AV++I +ELKDLGEL+NTYIIYTSDHGYHLGQFGL+KGKSFPFEFD+RVPFL
Sbjct: 121 QTLQSVDSAVDRIYQELKDLGELDNTYIIYTSDHGYHLGQFGLIKGKSFPFEFDVRVPFL 180
Query: 226 MRGPGIVPGTM 236
+RGPGI PG++
Sbjct: 181 IRGPGIEPGSI 191
>gi|6941989|gb|AAF32278.1| SULF1 [Drosophila melanogaster]
Length = 983
Score = 353 bits (906), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 160/212 (75%), Positives = 185/212 (87%), Gaps = 1/212 (0%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSIN+NG+K KHG DY DYYPDLIAN
Sbjct: 46 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINLNGQKXKHGFDYAKDYYPDLIAN 105
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
S+ FLR SKQ +KP++L MSFPAPHGPEDS P +++ + H+ +YD+APNPDK
Sbjct: 106 XSIAFLRSSKQQNQRKPVLLTMSFPAPHGPEDSLPVLSSLLYCDNPSHS-SYDHAPNPDK 164
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWIL+VT MQPVH++FT+LLMTKRLQTLQSVD AVE++ ELK+LGEL+NTYI+YTSDH
Sbjct: 165 QWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDH 224
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI
Sbjct: 225 GYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGI 256
>gi|443730659|gb|ELU16083.1| hypothetical protein CAPTEDRAFT_174568 [Capitella teleta]
Length = 923
Score = 345 bits (885), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 152/215 (70%), Positives = 186/215 (86%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLN+Y G+YIPPGW EW LI N+++YNYS+N NG+KIKHG++YY DY DLIAN
Sbjct: 152 GYFGKYLNEYTGTYIPPGWTEWVGLIRNTRFYNYSLNFNGKKIKHGDNYYKDYLTDLIAN 211
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DSV FL+QSKQ F +P+ +V+S PAPHGPED+APQY +MF N+T+H TP+Y++APNPDK
Sbjct: 212 DSVTFLKQSKQYFPNRPVAMVLSMPAPHGPEDAAPQYQHMFPNITTHRTPSYNFAPNPDK 271
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWILQ T KM+P+H +FTD+L KRLQTLQSVDDAVEK+ ELK+LGEL+NTY IY++DH
Sbjct: 272 QWILQYTGKMEPIHMEFTDMLHRKRLQTLQSVDDAVEKVFNELKELGELDNTYFIYSADH 331
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
GYHLGQ+GLVKGK+ P++FDIRVPFLMRGP I PG
Sbjct: 332 GYHLGQYGLVKGKAMPYDFDIRVPFLMRGPKIPPG 366
>gi|427780443|gb|JAA55673.1| Putative sulfatase [Rhipicephalus pulchellus]
Length = 474
Score = 343 bits (880), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 159/245 (64%), Positives = 198/245 (80%), Gaps = 5/245 (2%)
Query: 3 WPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMN 58
W E SR+ + + GY FGKYLN+YNG++IP GWREW AL+ NS++YNY++N+N
Sbjct: 171 WQQEFESRSFATYLSAAGYRTAYFGKYLNEYNGNHIPTGWREWAALVRNSRFYNYTVNVN 230
Query: 59 GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYAN 118
G K KHG DY DYYPDL+ NDSV FLRQSKQ F +P+M+V+S+P+PHGPEDSAPQY +
Sbjct: 231 GHKRKHGLDYARDYYPDLVTNDSVAFLRQSKQLFPNRPVMMVVSYPSPHGPEDSAPQYQH 290
Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
+F NVT+H TP+++YAPN DKQW+L+ T +M+P+H +FTD+L TKRLQTLQ+VDDAVEK+
Sbjct: 291 LFHNVTTHRTPSWNYAPNMDKQWLLRYTGQMEPIHIKFTDMLHTKRLQTLQTVDDAVEKL 350
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
EL LGELNNTYI YTSDHGYHLGQFGLVKGKS PFEFDIRVPF +RGP + PGT L
Sbjct: 351 YNELLSLGELNNTYIFYTSDHGYHLGQFGLVKGKSMPFEFDIRVPFYVRGPKVPPGTR-L 409
Query: 239 LDVLI 243
D+++
Sbjct: 410 KDIVL 414
>gi|241153279|ref|XP_002407031.1| sulfatase-1, sulf-1, putative [Ixodes scapularis]
gi|215494013|gb|EEC03654.1| sulfatase-1, sulf-1, putative [Ixodes scapularis]
Length = 630
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 153/237 (64%), Positives = 194/237 (81%), Gaps = 4/237 (1%)
Query: 3 WPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMN 58
W E SR+ + S GY FGKYLN+YNG++IP GWREW AL+ NS++YNY++++N
Sbjct: 60 WQQEFESRSFATYLSSAGYRTAYFGKYLNEYNGNHIPTGWREWAALVRNSRFYNYTVSVN 119
Query: 59 GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYAN 118
G K KHGEDY DYYPDL+AN+S+ FLRQSKQ F +P+M+V+SFP+PHGPEDSAPQY +
Sbjct: 120 GHKRKHGEDYARDYYPDLVANESLAFLRQSKQLFPTRPVMMVVSFPSPHGPEDSAPQYQH 179
Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
+F NVT+H TP++++APN DKQW+L+ T +M+P+H +FTD+L TKRLQTLQ+VDDAVE++
Sbjct: 180 LFHNVTTHRTPSWNHAPNQDKQWLLRYTGQMEPIHIRFTDMLHTKRLQTLQTVDDAVERL 239
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
KEL LGEL++TYI YTSDHGYHLGQFGLVKGKS PFEFDIRVPF +RGP + T
Sbjct: 240 YKELVSLGELDDTYIFYTSDHGYHLGQFGLVKGKSMPFEFDIRVPFFVRGPKVPRAT 296
>gi|357615219|gb|EHJ69539.1| hypothetical protein KGM_19329 [Danaus plexippus]
Length = 1117
Score = 327 bits (838), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 144/180 (80%), Positives = 168/180 (93%)
Query: 57 MNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY 116
MNG++IKHG+DY DYYPDLIANDS+ FLR SK+ FS+KP++LVMSFPAPHGPEDSAPQY
Sbjct: 1 MNGKRIKHGDDYNKDYYPDLIANDSIAFLRASKRRFSRKPVLLVMSFPAPHGPEDSAPQY 60
Query: 117 ANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVE 176
+++FFNVT+HHTP YD APNPDKQWIL+VT KM+P+HRQFTDLLMTKRLQTLQSVD AVE
Sbjct: 61 SHLFFNVTTHHTPTYDMAPNPDKQWILRVTEKMKPIHRQFTDLLMTKRLQTLQSVDVAVE 120
Query: 177 KIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
++ +ELK LGEL+NTY++YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL+RGPG+ PGT+
Sbjct: 121 RVYQELKALGELDNTYLVYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLVRGPGVEPGTV 180
>gi|405952082|gb|EKC19933.1| Extracellular sulfatase Sulf-1 [Crassostrea gigas]
Length = 914
Score = 325 bits (834), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 144/214 (67%), Positives = 181/214 (84%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLN+YNG+YIPPGWREW LI N+++YNY++N NG K+KH ++YY DY+ DLIAN
Sbjct: 90 GYFGKYLNEYNGTYIPPGWREWVGLIKNTRFYNYTVNFNGNKMKHEDNYYADYFTDLIAN 149
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DSV FL+ SKQ F K+P+++V+S PAPHGPEDSAPQY ++F+N T H TP ++ APNPDK
Sbjct: 150 DSVTFLKNSKQYFPKRPVLMVLSVPAPHGPEDSAPQYQHLFYNNTLHRTPTWNMAPNPDK 209
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
Q++L+VT KM+P+ + FTD+L KRLQTLQSVDD VEK+ +EL +L EL NTYIIYTSDH
Sbjct: 210 QFLLKVTGKMEPIEQTFTDVLQQKRLQTLQSVDDLVEKVYEELWNLDELENTYIIYTSDH 269
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
GYHLGQFGLVKGK+ P++FD+RVP +RGPGI P
Sbjct: 270 GYHLGQFGLVKGKALPYDFDVRVPLYVRGPGIKP 303
>gi|391332766|ref|XP_003740800.1| PREDICTED: extracellular sulfatase Sulf-2-like [Metaseiulus
occidentalis]
Length = 761
Score = 323 bits (829), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 145/227 (63%), Positives = 184/227 (81%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLN+YNG+++P GW+EW L+ NSK+YNY++++NG K+KHG DY DY PDLI N
Sbjct: 124 GYFGKYLNEYNGNHVPLGWKEWLGLVRNSKFYNYTVSVNGNKVKHGNDYSKDYLPDLITN 183
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
S+ FLR+SKQ + KP+++VM++P PHGPEDSAPQY NMF N T H TP+++YAPN DK
Sbjct: 184 HSLSFLRRSKQFYPDKPVLMVMAYPGPHGPEDSAPQYQNMFLNETQHRTPSWNYAPNYDK 243
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QWILQ T KMQ +H +FTD+L TKRLQTLQSVDDAV++++ ELK L E++NT+I YTSDH
Sbjct: 244 QWILQHTGKMQDIHMKFTDMLNTKRLQTLQSVDDAVDQLITELKRLDEVDNTFIFYTSDH 303
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYHLGQFGLVKGK+ PFEFD+RVPF +RGP I G + IP++
Sbjct: 304 GYHLGQFGLVKGKAMPFEFDVRVPFYVRGPKISRG------IQIPEI 344
>gi|51467917|ref|NP_001003846.1| extracellular sulfatase Sulf-1 precursor [Danio rerio]
gi|37812238|gb|AAR04055.1| sulfatase FP1c [Danio rerio]
Length = 1099
Score = 309 bits (792), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 141/238 (59%), Positives = 178/238 (74%), Gaps = 4/238 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + R+ + + GY FGKYLN+YNGSYIPPGWREW LI NS++YNY++
Sbjct: 115 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 174
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY+ DLI NDS+ + R SK+ F +P+M+V+S APHGPEDSAPQY+
Sbjct: 175 NGNKEKHGADYAKDYFTDLITNDSINYFRTSKRMFPHRPVMMVISHAAPHGPEDSAPQYS 234
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPN DK WI+Q T M+P+H +FT+ L KRLQTL SVDD+VEK
Sbjct: 235 ELFPNASQHITPSYNYAPNMDKHWIMQYTGPMKPIHMEFTNYLHRKRLQTLMSVDDSVEK 294
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
+ L D GEL+NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG
Sbjct: 295 VYNALVDTGELDNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFVRGPNVEPGA 352
>gi|37812240|gb|AAR04056.1| sulfatase FP1a [Danio rerio]
Length = 874
Score = 309 bits (792), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 141/238 (59%), Positives = 178/238 (74%), Gaps = 4/238 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + R+ + + GY FGKYLN+YNGSYIPPGWREW LI NS++YNY++
Sbjct: 115 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 174
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY+ DLI NDS+ + R SK+ F +P+M+V+S APHGPEDSAPQY+
Sbjct: 175 NGNKEKHGADYAKDYFTDLITNDSINYFRTSKRMFPHRPVMMVISHAAPHGPEDSAPQYS 234
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPN DK WI+Q T M+P+H +FT+ L KRLQTL SVDD+VEK
Sbjct: 235 ELFPNASQHITPSYNYAPNMDKHWIMQYTGPMKPIHMEFTNYLHRKRLQTLMSVDDSVEK 294
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
+ L D GEL+NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG
Sbjct: 295 VYNALVDTGELDNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFVRGPNVEPGA 352
>gi|348511948|ref|XP_003443505.1| PREDICTED: extracellular sulfatase Sulf-1-like [Oreochromis
niloticus]
Length = 1165
Score = 309 bits (792), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 142/239 (59%), Positives = 178/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + R+ + GY FGKYLN+YNGSYIPPGWREW LI NS++YNY++
Sbjct: 117 SWQAQHEPRSFAVYLNETGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY+ DLI NDS+ F R SK+ + +P+M+V+S APHGPEDSAPQYA
Sbjct: 177 NGYKEKHGADYAKDYFTDLITNDSISFFRMSKKMYPHRPVMMVVSHAAPHGPEDSAPQYA 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
F N + H TP+Y+YAPN DK WI+Q T M+P+H +FT+ L KRLQTL SVDD+V+K
Sbjct: 237 EFFPNASQHITPSYNYAPNMDKHWIMQYTGPMKPIHMEFTNFLHRKRLQTLMSVDDSVQK 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L++ GEL+NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF MRGP + PG +
Sbjct: 297 IYDMLEETGELDNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFMRGPNVEPGAV 355
>gi|37812242|gb|AAR04057.1| sulfatase FP1b [Danio rerio]
Length = 892
Score = 309 bits (791), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 141/238 (59%), Positives = 178/238 (74%), Gaps = 4/238 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + R+ + + GY FGKYLN+YNGSYIPPGWREW LI NS++YNY++
Sbjct: 115 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 174
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY+ DLI NDS+ + R SK+ F +P+M+V+S APHGPEDSAPQY+
Sbjct: 175 NGNKEKHGADYAKDYFTDLITNDSINYFRTSKRMFPHRPVMMVISHAAPHGPEDSAPQYS 234
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPN DK WI+Q T M+P+H +FT+ L KRLQTL SVDD+VEK
Sbjct: 235 ELFPNASQHITPSYNYAPNMDKHWIMQYTGPMKPIHMEFTNYLHRKRLQTLMSVDDSVEK 294
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
+ L D GEL+NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG
Sbjct: 295 VYNALVDTGELDNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFVRGPNVEPGA 352
>gi|427794433|gb|JAA62668.1| Putative sulfatase, partial [Rhipicephalus pulchellus]
Length = 603
Score = 308 bits (790), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 147/245 (60%), Positives = 184/245 (75%), Gaps = 21/245 (8%)
Query: 3 WPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMN 58
W E SR+ + + GY FGKYLN+YNG++IP GWREW AL+ NS++YNY++N+N
Sbjct: 152 WQQEFESRSFATYLSAAGYRTAYFGKYLNEYNGNHIPTGWREWAALVRNSRFYNYTVNVN 211
Query: 59 GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYAN 118
G K KHG DY DYYPDL+ NDSV FLRQSKQ F +P+M+V+S+P+PHG
Sbjct: 212 GHKRKHGLDYARDYYPDLVTNDSVAFLRQSKQLFPNRPVMMVVSYPSPHG---------- 261
Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
+H TP+++YAPN DKQW+L+ T +M+P+H +FTD+L TKRLQTLQ+VDDAVEK+
Sbjct: 262 ------THRTPSWNYAPNMDKQWLLRYTGQMEPIHIKFTDMLHTKRLQTLQTVDDAVEKL 315
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
EL LGELNNTYI YTSDHGYHLGQFGLVKGKS PFEFDIRVPF +RGP + PGT L
Sbjct: 316 YNELLSLGELNNTYIFYTSDHGYHLGQFGLVKGKSMPFEFDIRVPFYVRGPKVPPGTR-L 374
Query: 239 LDVLI 243
D+++
Sbjct: 375 KDIVL 379
>gi|432930070|ref|XP_004081305.1| PREDICTED: extracellular sulfatase Sulf-1-like [Oryzias latipes]
Length = 1150
Score = 307 bits (787), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 141/239 (58%), Positives = 178/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + R+ + + GY FGKYLN+YNGSYIPPGWREW L+ NS++YNY++
Sbjct: 117 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLVKNSRFYNYTVCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY+ DLI NDS+ F R SK+ + +P+M+V+S APHGPEDSAPQYA
Sbjct: 177 NGYKEKHGADYAKDYFTDLITNDSISFFRMSKKMYPHRPVMMVISHAAPHGPEDSAPQYA 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
F N + H TP+Y+YAPN DK WI+Q T M+P+H +FT+ L KRLQTL SVDD+V++
Sbjct: 237 EHFPNASQHITPSYNYAPNMDKHWIMQYTGPMKPIHMEFTNFLHRKRLQTLMSVDDSVQR 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L D GEL NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PGT+
Sbjct: 297 IHDMLADTGELQNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFLRGPNVEPGTV 355
>gi|317419889|emb|CBN81925.1| Extracellular sulfatase Sulf-1 [Dicentrarchus labrax]
Length = 843
Score = 307 bits (786), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 140/239 (58%), Positives = 180/239 (75%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + R+ + + GY FGKYLN+YNGSYIPPGWREW LI NS++YNY++
Sbjct: 117 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY+ DLI NDS+ F R SK+ + +P+M+V+S APHGPEDSAPQYA
Sbjct: 177 NGYKEKHGADYSKDYFTDLITNDSINFFRVSKRMYPHRPVMMVISHAAPHGPEDSAPQYA 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
F N + H TP+Y+YAPN DK WI+Q T M+P+H +FT+ L KRLQTL SVDD+V+K
Sbjct: 237 EHFPNASQHITPSYNYAPNMDKHWIMQYTGPMRPIHMEFTNFLHRKRLQTLMSVDDSVQK 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+ + L++ GEL+NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG +
Sbjct: 297 VYEMLEETGELDNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFLRGPNVEPGAV 355
>gi|317419888|emb|CBN81924.1| Extracellular sulfatase Sulf-1 [Dicentrarchus labrax]
Length = 1124
Score = 307 bits (786), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 140/239 (58%), Positives = 180/239 (75%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + R+ + + GY FGKYLN+YNGSYIPPGWREW LI NS++YNY++
Sbjct: 117 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY+ DLI NDS+ F R SK+ + +P+M+V+S APHGPEDSAPQYA
Sbjct: 177 NGYKEKHGADYSKDYFTDLITNDSINFFRVSKRMYPHRPVMMVISHAAPHGPEDSAPQYA 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
F N + H TP+Y+YAPN DK WI+Q T M+P+H +FT+ L KRLQTL SVDD+V+K
Sbjct: 237 EHFPNASQHITPSYNYAPNMDKHWIMQYTGPMRPIHMEFTNFLHRKRLQTLMSVDDSVQK 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+ + L++ GEL+NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG +
Sbjct: 297 VYEMLEETGELDNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFLRGPNVEPGAV 355
>gi|317419890|emb|CBN81926.1| Extracellular sulfatase Sulf-1 [Dicentrarchus labrax]
Length = 832
Score = 306 bits (785), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 140/239 (58%), Positives = 180/239 (75%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + R+ + + GY FGKYLN+YNGSYIPPGWREW LI NS++YNY++
Sbjct: 117 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY+ DLI NDS+ F R SK+ + +P+M+V+S APHGPEDSAPQYA
Sbjct: 177 NGYKEKHGADYSKDYFTDLITNDSINFFRVSKRMYPHRPVMMVISHAAPHGPEDSAPQYA 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
F N + H TP+Y+YAPN DK WI+Q T M+P+H +FT+ L KRLQTL SVDD+V+K
Sbjct: 237 EHFPNASQHITPSYNYAPNMDKHWIMQYTGPMRPIHMEFTNFLHRKRLQTLMSVDDSVQK 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+ + L++ GEL+NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG +
Sbjct: 297 VYEMLEETGELDNTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFLRGPNVEPGAV 355
>gi|312080544|ref|XP_003142644.1| hypothetical protein LOAG_07062 [Loa loa]
gi|307762190|gb|EFO21424.1| hypothetical protein LOAG_07062 [Loa loa]
Length = 827
Score = 306 bits (783), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 136/211 (64%), Positives = 171/211 (81%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
YFGKYLN+Y GSYIPPGW W L+ NS++YNY+IN+NG KIKHG DY DY+ DLIAN
Sbjct: 124 AYFGKYLNEYTGSYIPPGWDHWMGLLRNSRFYNYTINVNGDKIKHGWDYEKDYFTDLIAN 183
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
D++ F+R+ KP M+VMSFPAPHGPED APQY+ F +V +H T A++YAPNPDK
Sbjct: 184 DTIAFIRELHSRDLFKPYMIVMSFPAPHGPEDPAPQYSTWFQDVETHRTEAWNYAPNPDK 243
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QW+LQ T +M+PVH FTD+L +RLQTLQSVD ++++++ EL+DLG+L+NT IYTSDH
Sbjct: 244 QWLLQHTGRMEPVHVIFTDVLHRRRLQTLQSVDSSIQRLINELRDLGDLSNTVFIYTSDH 303
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
GYHLGQFGLVKGK+ P+EFDIRVP+ +RGPG
Sbjct: 304 GYHLGQFGLVKGKNMPYEFDIRVPYFIRGPG 334
>gi|395829467|ref|XP_003787880.1| PREDICTED: extracellular sulfatase Sulf-2 [Otolemur garnettii]
Length = 972
Score = 306 bits (783), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 138/239 (57%), Positives = 180/239 (75%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 219 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 278
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 279 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 338
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H QFT++L KRLQTL SVDD++E+
Sbjct: 339 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMQFTNMLQRKRLQTLMSVDDSMER 398
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 399 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 457
>gi|410909097|ref|XP_003968027.1| PREDICTED: extracellular sulfatase Sulf-1-like [Takifugu rubripes]
Length = 1172
Score = 304 bits (778), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 136/217 (62%), Positives = 171/217 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
G+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG +Y DY+ DLI N
Sbjct: 137 GFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCRNGYKEKHGGEYAKDYFTDLITN 196
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ F R SK+ F +P+M+V+S APHGPEDSAPQYA+ F N + H TP+Y+YAPN DK
Sbjct: 197 DSINFFRISKRMFPHRPVMMVISHAAPHGPEDSAPQYADHFPNASQHITPSYNYAPNMDK 256
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M+P+H +FT+ L KRLQTL SVDD+V+K+ L++ GEL NTYIIYT+DH
Sbjct: 257 HWIMQYTGPMRPIHMEFTNFLHRKRLQTLMSVDDSVQKVYDMLEETGELENTYIIYTADH 316
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P++FDIRVPF +RGP + G +
Sbjct: 317 GYHIGQFGLVKGKSMPYDFDIRVPFFLRGPNVESGAV 353
>gi|297259626|ref|XP_001106412.2| PREDICTED: extracellular sulfatase Sulf-2-like [Macaca mulatta]
Length = 948
Score = 303 bits (777), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 198 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 257
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 258 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 317
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 318 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNILQRKRLQTLMSVDDSMET 377
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 378 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 436
>gi|81158032|tpe|CAI84990.1| TPA: sulfatase 2 [Rattus norvegicus]
Length = 866
Score = 303 bits (777), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGMKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYDMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPSVEAGSL 355
>gi|397511408|ref|XP_003826067.1| PREDICTED: extracellular sulfatase Sulf-2 [Pan paniscus]
gi|410207924|gb|JAA01181.1| sulfatase 2 [Pan troglodytes]
gi|410291484|gb|JAA24342.1| sulfatase 2 [Pan troglodytes]
gi|410345916|gb|JAA40669.1| sulfatase 2 [Pan troglodytes]
Length = 870
Score = 303 bits (777), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|332207736|ref|XP_003252952.1| PREDICTED: extracellular sulfatase Sulf-2 [Nomascus leucogenys]
Length = 869
Score = 303 bits (776), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|119596097|gb|EAW75691.1| sulfatase 2, isoform CRA_b [Homo sapiens]
Length = 852
Score = 303 bits (776), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 178/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G +
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCL 355
>gi|440893777|gb|ELR46430.1| Extracellular sulfatase Sulf-2 [Bos grunniens mutus]
Length = 865
Score = 303 bits (776), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 138/239 (57%), Positives = 180/239 (75%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRASKKMYPHRPVLMVLSHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
++F N + H TP+Y+YAPNPDK WI++ T M+P+H QFT++L KRLQTL SVDD++E
Sbjct: 237 SLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMQFTNMLQRKRLQTLLSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|355563054|gb|EHH19616.1| Extracellular sulfatase Sulf-2 [Macaca mulatta]
gi|380787465|gb|AFE65608.1| extracellular sulfatase Sulf-2 isoform a precursor [Macaca mulatta]
gi|380787469|gb|AFE65610.1| extracellular sulfatase Sulf-2 isoform a precursor [Macaca mulatta]
gi|383409655|gb|AFH28041.1| extracellular sulfatase Sulf-2 isoform a precursor [Macaca mulatta]
gi|384939934|gb|AFI33572.1| extracellular sulfatase Sulf-2 isoform a precursor [Macaca mulatta]
gi|384939936|gb|AFI33573.1| extracellular sulfatase Sulf-2 isoform a precursor [Macaca mulatta]
Length = 870
Score = 303 bits (776), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNILQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|110626131|ref|NP_001030099.1| extracellular sulfatase Sulf-2 precursor [Rattus norvegicus]
gi|58012992|gb|AAW62950.1| sulfatase FP2 [Rattus norvegicus]
gi|149042868|gb|EDL96442.1| sulfatase 2, isoform CRA_b [Rattus norvegicus]
Length = 875
Score = 303 bits (776), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGMKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYDMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPSVEAGSL 355
>gi|240255478|ref|NP_940998.2| extracellular sulfatase Sulf-2 isoform b precursor [Homo sapiens]
gi|37182046|gb|AAQ88826.1| GPPS559 [Homo sapiens]
gi|119596098|gb|EAW75692.1| sulfatase 2, isoform CRA_c [Homo sapiens]
Length = 867
Score = 303 bits (776), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 178/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G +
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCL 355
>gi|183986511|gb|AAI66410.1| Sulf2 protein [Rattus norvegicus]
Length = 875
Score = 303 bits (776), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGMKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYDMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPSVEAGSL 355
>gi|29789100|ref|NP_061325.1| extracellular sulfatase Sulf-2 isoform a precursor [Homo sapiens]
gi|240255483|ref|NP_001155313.1| extracellular sulfatase Sulf-2 isoform a precursor [Homo sapiens]
gi|33112446|sp|Q8IWU5.1|SULF2_HUMAN RecName: Full=Extracellular sulfatase Sulf-2; Short=hSulf-2; Flags:
Precursor
gi|27356934|gb|AAM76861.1| extracellular sulfatase SULF-2 [Homo sapiens]
gi|51476368|emb|CAH18174.1| hypothetical protein [Homo sapiens]
gi|83405533|gb|AAI10540.1| Sulfatase 2 [Homo sapiens]
gi|119596096|gb|EAW75690.1| sulfatase 2, isoform CRA_a [Homo sapiens]
gi|119596099|gb|EAW75693.1| sulfatase 2, isoform CRA_a [Homo sapiens]
gi|119596100|gb|EAW75694.1| sulfatase 2, isoform CRA_a [Homo sapiens]
gi|119596101|gb|EAW75695.1| sulfatase 2, isoform CRA_a [Homo sapiens]
gi|190690055|gb|ACE86802.1| sulfatase 2 protein [synthetic construct]
gi|190691429|gb|ACE87489.1| sulfatase 2 protein [synthetic construct]
gi|190692019|gb|ACE87784.1| sulfatase 2 protein [synthetic construct]
gi|254071383|gb|ACT64451.1| sulfatase 2 protein [synthetic construct]
Length = 870
Score = 303 bits (776), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 178/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G +
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCL 355
>gi|168269766|dbj|BAG10010.1| extracellular sulfatase Sulf-2 precursor [synthetic construct]
Length = 870
Score = 303 bits (776), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 178/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G +
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCL 355
>gi|348563931|ref|XP_003467760.1| PREDICTED: extracellular sulfatase Sulf-2-like [Cavia porcellus]
Length = 872
Score = 303 bits (775), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLLSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|14133245|dbj|BAA86561.2| KIAA1247 protein [Homo sapiens]
Length = 885
Score = 303 bits (775), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 178/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 132 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 191
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 192 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 251
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 252 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 311
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G +
Sbjct: 312 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCL 370
>gi|300793847|ref|NP_001179867.1| extracellular sulfatase Sulf-2 precursor [Bos taurus]
gi|296480968|tpg|DAA23083.1| TPA: sulfatase 2-like [Bos taurus]
Length = 862
Score = 303 bits (775), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 138/239 (57%), Positives = 180/239 (75%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRASKKMYPHRPVLMVLSHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
++F N + H TP+Y+YAPNPDK WI++ T M+P+H QFT++L KRLQTL SVDD++E
Sbjct: 237 SLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMQFTNMLQRKRLQTLLSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|354476744|ref|XP_003500583.1| PREDICTED: extracellular sulfatase Sulf-2 [Cricetulus griseus]
gi|344241558|gb|EGV97661.1| Extracellular sulfatase Sulf-2 [Cricetulus griseus]
Length = 875
Score = 303 bits (775), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 136/239 (56%), Positives = 180/239 (75%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + + GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNNTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+++ +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKTYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYDMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|363730828|ref|XP_003640872.1| PREDICTED: extracellular sulfatase Sulf-1 [Gallus gallus]
Length = 867
Score = 302 bits (774), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 178/227 (78%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW L+ NS++YNY+I+ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWVGLVKNSRFYNYTISRNGNKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + R SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD++E++ + L ++GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSMERLYQMLAEMGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 358
>gi|33112449|sp|Q90XB6.1|SULF1_COTCO RecName: Full=Extracellular sulfatase Sulf-1; Short=qSulf1; Flags:
Precursor
gi|15430244|gb|AAK98515.1|AF410802_1 N-acetylglucosamine-6-sulfatase [Coturnix coturnix]
Length = 867
Score = 302 bits (774), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 178/227 (78%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW L+ NS++YNY+I+ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWVGLVKNSRFYNYTISRNGNKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + R SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD++E++ + L ++GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSMERLYQMLAEMGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 358
>gi|357588455|ref|NP_001239508.1| extracellular sulfatase Sulf-2 isoform 2 [Mus musculus]
Length = 908
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 150 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 209
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 210 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 269
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 270 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 329
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 330 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 388
>gi|350595021|ref|XP_003484028.1| PREDICTED: extracellular sulfatase Sulf-2-like [Sus scrofa]
Length = 527
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|410920477|ref|XP_003973710.1| PREDICTED: extracellular sulfatase Sulf-2-like [Takifugu rubripes]
Length = 883
Score = 302 bits (773), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 131/216 (60%), Positives = 172/216 (79%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
G+FGKYLN+YNGSY+PPGW+EW AL+ NS++YNY++ NG + KH DY DY D+I N
Sbjct: 146 GFFGKYLNEYNGSYVPPGWKEWVALVKNSRFYNYTLCRNGVREKHSSDYPKDYLTDIITN 205
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + R SK+++ +P+M+V+S APHGPEDSAPQY++ F N + H TP+Y+YAPNPDK
Sbjct: 206 ESINYFRTSKRTYPNRPVMMVLSHVAPHGPEDSAPQYSSAFPNASQHITPSYNYAPNPDK 265
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WIL+ T M+PVH QFT++L +R+QTL SVDD+VEK+ L + GEL+NTY+IYTSDH
Sbjct: 266 HWILRYTGAMKPVHMQFTNMLQRRRMQTLLSVDDSVEKVFNMLAETGELDNTYLIYTSDH 325
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G
Sbjct: 326 GYHIGQFGLVKGKSMPYEFDIRVPFFIRGPNVEQGA 361
>gi|390462707|ref|XP_002747675.2| PREDICTED: extracellular sulfatase Sulf-2 [Callithrix jacchus]
Length = 931
Score = 302 bits (773), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 136/239 (56%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + + GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 179 SWQAQHESRTFAVYLNNTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 238
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 239 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 298
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 299 CLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 358
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 359 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 417
>gi|148674529|gb|EDL06476.1| sulfatase 2, isoform CRA_a [Mus musculus]
Length = 943
Score = 302 bits (773), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 185 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 244
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 245 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 304
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 305 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 364
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 365 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 423
>gi|148674531|gb|EDL06478.1| sulfatase 2, isoform CRA_c [Mus musculus]
Length = 908
Score = 302 bits (773), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 150 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 209
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 210 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 269
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 270 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 329
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 330 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 388
>gi|27356938|gb|AAM76862.1| extracellular sulfatase SULF-2 [Mus musculus]
gi|187954897|gb|AAI41087.1| Sulfatase 2 [Mus musculus]
Length = 875
Score = 302 bits (773), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|61656167|ref|NP_082348.2| extracellular sulfatase Sulf-2 isoform 1 precursor [Mus musculus]
gi|357588451|ref|NP_001239507.1| extracellular sulfatase Sulf-2 isoform 1 precursor [Mus musculus]
gi|341942093|sp|Q8CFG0.2|SULF2_MOUSE RecName: Full=Extracellular sulfatase Sulf-2; Short=mSulf-2; Flags:
Precursor
gi|26331608|dbj|BAC29534.1| unnamed protein product [Mus musculus]
gi|74138510|dbj|BAE38065.1| unnamed protein product [Mus musculus]
gi|74197259|dbj|BAC28804.2| unnamed protein product [Mus musculus]
gi|74223521|dbj|BAE21607.1| unnamed protein product [Mus musculus]
Length = 875
Score = 302 bits (773), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|26349279|dbj|BAC38279.1| unnamed protein product [Mus musculus]
Length = 875
Score = 302 bits (773), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYFNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|37360296|dbj|BAC98126.1| mKIAA1247 protein [Mus musculus]
Length = 948
Score = 302 bits (773), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 190 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 249
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 250 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 309
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 310 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 369
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 370 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 428
>gi|281343859|gb|EFB19443.1| hypothetical protein PANDA_016742 [Ailuropoda melanoleuca]
Length = 776
Score = 301 bits (772), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 180/239 (75%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 59 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 118
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 119 NGLKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 178
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
++F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 179 SLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 238
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 239 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 297
>gi|224046351|ref|XP_002199068.1| PREDICTED: extracellular sulfatase Sulf-1 [Taeniopygia guttata]
Length = 867
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 178/227 (78%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW L+ NS++YNY+I+ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWVGLVKNSRFYNYTISRNGNKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + R SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD++E++ + L ++GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSMERLHQMLAEMGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 358
>gi|426392035|ref|XP_004062367.1| PREDICTED: extracellular sulfatase Sulf-2 [Gorilla gorilla gorilla]
Length = 870
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 136/239 (56%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++ +
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMXQ 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|114326190|ref|NP_001041555.1| extracellular sulfatase Sulf-2 precursor [Canis lupus familiaris]
gi|81158076|tpe|CAI85012.1| TPA: sulfatase 2 [Canis lupus familiaris]
Length = 869
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 GLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|355784413|gb|EHH65264.1| Extracellular sulfatase Sulf-2 [Macaca fascicularis]
Length = 870
Score = 301 bits (771), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 136/239 (56%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG +Y DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSNYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNILQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|444706942|gb|ELW48257.1| Extracellular sulfatase Sulf-2 [Tupaia chinensis]
Length = 1080
Score = 301 bits (771), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 136/239 (56%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 221 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 280
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 281 NGVKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 340
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L +RLQTL SVDD++E
Sbjct: 341 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRRRLQTLMSVDDSMET 400
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 401 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 459
>gi|417515840|gb|JAA53728.1| E3 extracellular sulfatase Sulf-2 [Sus scrofa]
Length = 865
Score = 301 bits (771), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|149733333|ref|XP_001503564.1| PREDICTED: extracellular sulfatase Sulf-2 [Equus caballus]
Length = 869
Score = 301 bits (771), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 136/239 (56%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG +Y DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSEYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|403282315|ref|XP_003932598.1| PREDICTED: extracellular sulfatase Sulf-2 [Saimiri boliviensis
boliviensis]
Length = 869
Score = 301 bits (771), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 136/239 (56%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + + GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNNTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 CLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|301783255|ref|XP_002927045.1| PREDICTED: extracellular sulfatase Sulf-2-like [Ailuropoda
melanoleuca]
Length = 828
Score = 301 bits (771), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 180/239 (75%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 75 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 134
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 135 NGLKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 194
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
++F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 195 SLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 254
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 255 IYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 313
>gi|351710483|gb|EHB13402.1| Extracellular sulfatase Sulf-2 [Heterocephalus glaber]
Length = 871
Score = 301 bits (770), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 180/239 (75%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
++F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 HLFPNSSQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLLSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLLETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|410953784|ref|XP_003983550.1| PREDICTED: extracellular sulfatase Sulf-2 [Felis catus]
Length = 854
Score = 301 bits (770), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 137/239 (57%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 GLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLVETGELDNTYIMYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|348588460|ref|XP_003479984.1| PREDICTED: extracellular sulfatase Sulf-1-like [Cavia porcellus]
Length = 1082
Score = 300 bits (768), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 134/227 (59%), Positives = 176/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ +IPQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VIPQI 358
>gi|26324822|dbj|BAC26165.1| unnamed protein product [Mus musculus]
Length = 875
Score = 300 bits (768), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 136/239 (56%), Positives = 178/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGSDYSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRL TL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLHTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|344280062|ref|XP_003411804.1| PREDICTED: extracellular sulfatase Sulf-2 [Loxodonta africana]
Length = 874
Score = 300 bits (768), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 136/239 (56%), Positives = 178/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + R + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHEGRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNVLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|397522689|ref|XP_003831390.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 1 [Pan paniscus]
gi|397522691|ref|XP_003831391.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Pan paniscus]
gi|397522693|ref|XP_003831392.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 3 [Pan paniscus]
gi|397522695|ref|XP_003831393.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 4 [Pan paniscus]
Length = 869
Score = 300 bits (767), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 176/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
G+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 GFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|297683035|ref|XP_002819205.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 1 [Pongo abelii]
gi|297683037|ref|XP_002819206.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Pongo abelii]
gi|297683039|ref|XP_002819207.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 3 [Pongo abelii]
gi|297683041|ref|XP_002819208.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 4 [Pongo abelii]
Length = 868
Score = 299 bits (766), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF MRGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFMRGPSVEPGS------IVPQI 358
>gi|126321290|ref|XP_001378749.1| PREDICTED: extracellular sulfatase Sulf-1 [Monodelphis domestica]
Length = 1134
Score = 299 bits (765), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 177/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ+++++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSDLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|327271782|ref|XP_003220666.1| PREDICTED: extracellular sulfatase Sulf-2-like [Anolis
carolinensis]
Length = 882
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 130/217 (59%), Positives = 172/217 (79%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++ NG K KHG DY DY DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSRDYLTDLITN 198
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ F R SK+ + +P+++V+S APHGPEDSAPQY+++F N + H TP+Y+YAPNPDK
Sbjct: 199 DSISFFRISKKMYPHRPVLMVLSHAAPHGPEDSAPQYSHLFPNASQHITPSYNYAPNPDK 258
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L KRLQTL SVDD++E I L + GEL+NTY+IYT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQQKRLQTLMSVDDSMEAIYNTLVETGELDNTYVIYTADH 318
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEAGSL 355
>gi|395506869|ref|XP_003757752.1| PREDICTED: extracellular sulfatase Sulf-2 [Sarcophilus harrisii]
Length = 875
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 131/217 (60%), Positives = 171/217 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++ NG K KHG DY DY DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSKDYLTDLITN 198
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DSV F R SK+ + +P+++V+S APHGPEDSAPQY+ +F N + H TP+Y+YAPNPDK
Sbjct: 199 DSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSQLFPNASQHITPSYNYAPNPDK 258
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L KRLQTL SVDD++E I L + GEL+NTYI+YT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGELDNTYIVYTADH 318
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEAGSL 355
>gi|126303328|ref|XP_001379302.1| PREDICTED: extracellular sulfatase Sulf-2 [Monodelphis domestica]
Length = 878
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 131/217 (60%), Positives = 171/217 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++ NG K KHG DY DY DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSKDYLTDLITN 198
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DSV F R SK+ + +P+++V+S APHGPEDSAPQY+ +F N + H TP+Y+YAPNPDK
Sbjct: 199 DSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSQLFPNASQHITPSYNYAPNPDK 258
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L KRLQTL SVDD++E I L + GEL+NTYI+YT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGELDNTYIVYTADH 318
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEAGSL 355
>gi|26336991|dbj|BAC32179.1| unnamed protein product [Mus musculus]
Length = 862
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 134/227 (59%), Positives = 176/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVESGELDNTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGS------IVPQI 358
>gi|351705551|gb|EHB08470.1| Extracellular sulfatase Sulf-1 [Heterocephalus glaber]
Length = 866
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 176/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|395511033|ref|XP_003759767.1| PREDICTED: extracellular sulfatase Sulf-1 [Sarcophilus harrisii]
Length = 1130
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 133/223 (59%), Positives = 175/223 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ+++++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSDLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG++ VL
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGSIVSQIVL 360
>gi|291409979|ref|XP_002721269.1| PREDICTED: sulfatase 2 [Oryctolagus cuniculus]
Length = 869
Score = 298 bits (764), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 136/239 (56%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW +L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVSLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K +HG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKERHGFDYAKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
F N + H TP+Y+YAPNPDK WI++ T M+P+H QFT++L KRLQTL SVDD++E
Sbjct: 237 RFFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMQFTNVLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L + GEL+NTY++YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 IYNMLVETGELDNTYLVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|449283961|gb|EMC90544.1| Extracellular sulfatase Sulf-2 [Columba livia]
Length = 875
Score = 298 bits (764), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 131/217 (60%), Positives = 172/217 (79%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++ NG K KHG DY DY DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSRDYLTDLITN 198
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ F R SK+ + +P+++V+S APHGPEDSAPQY+++F N + H TP+Y+YAPNPDK
Sbjct: 199 DSITFFRISKKMYPHRPVLMVISHAAPHGPEDSAPQYSHLFPNASQHITPSYNYAPNPDK 258
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L KRLQTL SVDD++E I L + GEL+NTYIIYT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMEMIYNTLVETGELDNTYIIYTADH 318
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 319 GYHVGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|26986617|ref|NP_758498.1| extracellular sulfatase Sulf-1 precursor [Mus musculus]
gi|310616752|ref|NP_001185494.1| extracellular sulfatase Sulf-1 precursor [Mus musculus]
gi|310616754|ref|NP_001185495.1| extracellular sulfatase Sulf-1 precursor [Mus musculus]
gi|33112448|sp|Q8K007.1|SULF1_MOUSE RecName: Full=Extracellular sulfatase Sulf-1; Short=mSulf-1; Flags:
Precursor
gi|21961489|gb|AAH34547.1| Sulfatase 1 [Mus musculus]
gi|27356936|gb|AAM76863.1| extracellular sulfatase SULF-1 [Mus musculus]
gi|29165845|gb|AAH49276.1| Sulfatase 1 [Mus musculus]
gi|148682368|gb|EDL14315.1| sulfatase 1 [Mus musculus]
Length = 870
Score = 298 bits (764), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 134/227 (59%), Positives = 176/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVESGELDNTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGS------IVPQI 358
>gi|118100629|ref|XP_417386.2| PREDICTED: extracellular sulfatase Sulf-2 [Gallus gallus]
Length = 877
Score = 298 bits (764), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 131/217 (60%), Positives = 172/217 (79%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++ NG K KHG DY DY DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSRDYLTDLITN 198
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ F R SK+ + +P+++V+S APHGPEDSAPQY+++F N + H TP+Y+YAPNPDK
Sbjct: 199 DSITFFRISKKMYPHRPVLMVISHAAPHGPEDSAPQYSHLFPNASQHITPSYNYAPNPDK 258
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L KRLQTL SVDD++E I L + GEL+NTYIIYT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMEMIYNTLVETGELDNTYIIYTADH 318
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|26390193|dbj|BAC25858.1| unnamed protein product [Mus musculus]
Length = 870
Score = 298 bits (764), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 134/227 (59%), Positives = 176/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVESGELDNTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGS------IVPQI 358
>gi|326931861|ref|XP_003212042.1| PREDICTED: extracellular sulfatase Sulf-2-like [Meleagris
gallopavo]
Length = 877
Score = 298 bits (764), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 131/217 (60%), Positives = 172/217 (79%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++ NG K KHG DY DY DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSRDYLTDLITN 198
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ F R SK+ + +P+++V+S APHGPEDSAPQY+++F N + H TP+Y+YAPNPDK
Sbjct: 199 DSITFFRISKKMYPHRPVLMVISHAAPHGPEDSAPQYSHLFPNASQHITPSYNYAPNPDK 258
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L KRLQTL SVDD++E I L + GEL+NTYIIYT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMEMIYNTLVETGELDNTYIIYTADH 318
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|54660730|gb|AAV37455.1| heparan sulfate 6-O endosulfatase 2 [Coturnix coturnix]
Length = 877
Score = 298 bits (764), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 131/217 (60%), Positives = 172/217 (79%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++ NG K KHG DY DY DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSRDYLTDLITN 198
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ F R SK+ + +P+++V+S APHGPEDSAPQY+++F N + H TP+Y+YAPNPDK
Sbjct: 199 DSITFFRISKRMYPHRPVLMVISHAAPHGPEDSAPQYSHLFPNASQHITPSYNYAPNPDK 258
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L KRLQTL SVDD++E I L + GEL+NTYIIYT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMEMIYNTLVETGELDNTYIIYTADH 318
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|354495022|ref|XP_003509631.1| PREDICTED: extracellular sulfatase Sulf-1 [Cricetulus griseus]
Length = 1011
Score = 298 bits (764), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+ +PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGST------VPQI 358
>gi|296226652|ref|XP_002759021.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Callithrix
jacchus]
Length = 869
Score = 298 bits (764), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+SV + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESVNYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|37360220|dbj|BAC98088.1| mKIAA1077 protein [Mus musculus]
Length = 1079
Score = 298 bits (764), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 134/227 (59%), Positives = 176/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 85 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 144
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 145 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 204
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL+NTYIIYT+DH
Sbjct: 205 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVESGELDNTYIIYTADH 264
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+ ++PQ+
Sbjct: 265 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGS------IVPQI 305
>gi|431894471|gb|ELK04271.1| Extracellular sulfatase Sulf-2 [Pteropus alecto]
Length = 814
Score = 298 bits (763), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 133/239 (55%), Positives = 179/239 (74%), Gaps = 4/239 (1%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + R + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHEGRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DL+ NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 177 NGVKEKHGFDYSKDYLTDLVTNDSVNFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
++F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 237 HLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMET 296
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+ L + GEL+NTY++YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 297 LYNMLVETGELDNTYLVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|41055578|ref|NP_957230.1| sulfatase 2 precursor [Danio rerio]
gi|28279577|gb|AAH45403.1| Sulfatase 2 [Danio rerio]
gi|182890734|gb|AAI65233.1| Sulf2 protein [Danio rerio]
Length = 873
Score = 298 bits (763), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 131/217 (60%), Positives = 170/217 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGWREW AL+ NS++YNY++ NG + KHG Y DY D+I N
Sbjct: 141 AFFGKYLNEYNGSYVPPGWREWVALVKNSRFYNYTLCRNGIREKHGTQYPKDYLTDVITN 200
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ F R SK+ + +P+M+V+S APHGPED+APQY++ F N + H TP+Y++APNPDK
Sbjct: 201 DSINFFRMSKRMYPHRPVMMVLSHAAPHGPEDAAPQYSSAFPNASQHITPSYNHAPNPDK 260
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WIL+ T M+PVH QFT++L +RLQTL SVDD+VEK+ L + GEL+NTYIIY SDH
Sbjct: 261 HWILRYTGPMKPVHMQFTNMLQRRRLQTLLSVDDSVEKVYNMLVETGELDNTYIIYMSDH 320
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIR+PF +RGP + G +
Sbjct: 321 GYHIGQFGLVKGKSMPYEFDIRIPFYVRGPNVEAGAI 357
>gi|119607360|gb|EAW86954.1| sulfatase 1, isoform CRA_b [Homo sapiens]
Length = 882
Score = 298 bits (763), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|19705485|ref|NP_599205.1| extracellular sulfatase Sulf-1 precursor [Rattus norvegicus]
gi|33112437|sp|Q8VI60.1|SULF1_RAT RecName: Full=Extracellular sulfatase Sulf-1; AltName:
Full=RSulfFP1; AltName: Full=Sulfatase FP; Flags:
Precursor
gi|18461371|gb|AAL71906.1|AF230072_1 sulfatase FP [Rattus norvegicus]
Length = 870
Score = 298 bits (763), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELGNTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGS------IVPQI 358
>gi|387018856|gb|AFJ51546.1| Extracellular sulfatase Sulf-2-like [Crotalus adamanteus]
Length = 888
Score = 298 bits (762), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 129/217 (59%), Positives = 170/217 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++ NG K KHG DY DY DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGMKEKHGSDYTRDYLTDLITN 198
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ F R SK+ + +P+++V+S APHGPEDSAP Y+++F N + H TP+Y+YAPNPDK
Sbjct: 199 DSISFFRISKKMYPHRPVLMVLSHAAPHGPEDSAPHYSHLFPNASQHITPSYNYAPNPDK 258
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L KRLQTL SVDD++E I L + GEL NTY+IYT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQQKRLQTLMSVDDSMEAIYNTLVETGELENTYVIYTADH 318
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEAGSL 355
>gi|327269757|ref|XP_003219659.1| PREDICTED: extracellular sulfatase Sulf-1-like [Anolis
carolinensis]
Length = 1122
Score = 298 bits (762), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 134/223 (60%), Positives = 173/223 (77%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCRNGLKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 DSIHYFKMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD++E++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLLSVDDSMERLYHMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG++ VL
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGSVVSQIVL 360
>gi|426242121|ref|XP_004014923.1| PREDICTED: extracellular sulfatase Sulf-2 [Ovis aries]
Length = 912
Score = 298 bits (762), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 132/219 (60%), Positives = 173/219 (78%)
Query: 18 SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
+ +FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++ NG K KHG DY DY DLI
Sbjct: 184 ARAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSKDYLTDLI 243
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+++F N + H TP+Y+YAPNP
Sbjct: 244 TNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSSLFPNASQHITPSYNYAPNP 303
Query: 138 DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
DK WI++ T M+P+H QFT++L KRLQTL SVDD++E I L + GEL+NTYI+YT+
Sbjct: 304 DKHWIMRYTGPMKPIHMQFTNMLQRKRLQTLLSVDDSMETIYNMLVETGELDNTYIVYTA 363
Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 364 DHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 402
>gi|344256682|gb|EGW12786.1| Extracellular sulfatase Sulf-1 [Cricetulus griseus]
Length = 787
Score = 298 bits (762), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 76 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 135
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 136 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 195
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL+NTYIIYT+DH
Sbjct: 196 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADH 255
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+ +PQ+
Sbjct: 256 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGST------VPQI 296
>gi|350583123|ref|XP_003481437.1| PREDICTED: LOW QUALITY PROTEIN: extracellular sulfatase Sulf-1-like
[Sus scrofa]
Length = 1119
Score = 298 bits (762), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 198
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 199 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 258
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 259 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 318
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 319 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 359
>gi|344273147|ref|XP_003408388.1| PREDICTED: extracellular sulfatase Sulf-1 [Loxodonta africana]
Length = 869
Score = 298 bits (762), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRLYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|149060925|gb|EDM11535.1| sulfatase 1, isoform CRA_a [Rattus norvegicus]
gi|149060926|gb|EDM11536.1| sulfatase 1, isoform CRA_a [Rattus norvegicus]
Length = 658
Score = 298 bits (762), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELGNTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP I PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSIEPGS------IVPQI 358
>gi|189571636|ref|NP_055985.2| extracellular sulfatase Sulf-1 precursor [Homo sapiens]
gi|189571638|ref|NP_001121676.1| extracellular sulfatase Sulf-1 precursor [Homo sapiens]
gi|189571641|ref|NP_001121677.1| extracellular sulfatase Sulf-1 precursor [Homo sapiens]
gi|189571643|ref|NP_001121678.1| extracellular sulfatase Sulf-1 precursor [Homo sapiens]
gi|33112447|sp|Q8IWU6.1|SULF1_HUMAN RecName: Full=Extracellular sulfatase Sulf-1; Short=hSulf-1; Flags:
Precursor
gi|27356932|gb|AAM76860.1| extracellular sulfatase SULF-1 [Homo sapiens]
gi|119607359|gb|EAW86953.1| sulfatase 1, isoform CRA_a [Homo sapiens]
gi|119607361|gb|EAW86955.1| sulfatase 1, isoform CRA_a [Homo sapiens]
Length = 871
Score = 298 bits (762), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|28191290|gb|AAO33315.1| sulfatase SULF1 precursor [Homo sapiens]
Length = 871
Score = 298 bits (762), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|355779744|gb|EHH64220.1| Extracellular sulfatase Sulf-1 [Macaca fascicularis]
Length = 869
Score = 298 bits (762), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|355698008|gb|EHH28556.1| Extracellular sulfatase Sulf-1 [Macaca mulatta]
gi|380810974|gb|AFE77362.1| extracellular sulfatase Sulf-1 precursor [Macaca mulatta]
Length = 869
Score = 298 bits (762), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|426359865|ref|XP_004047178.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 1 [Gorilla
gorilla gorilla]
gi|426359867|ref|XP_004047179.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Gorilla
gorilla gorilla]
Length = 869
Score = 298 bits (762), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|426235614|ref|XP_004011775.1| PREDICTED: LOW QUALITY PROTEIN: extracellular sulfatase Sulf-1
[Ovis aries]
Length = 867
Score = 298 bits (762), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|403304744|ref|XP_003942951.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403304746|ref|XP_003942952.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Saimiri
boliviensis boliviensis]
Length = 869
Score = 297 bits (761), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|114326202|ref|NP_001041580.1| extracellular sulfatase Sulf-1 precursor [Canis lupus familiaris]
gi|81158074|tpe|CAI85011.1| TPA: sulfatase 1 [Canis lupus familiaris]
Length = 869
Score = 297 bits (761), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 176/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL+NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|33096729|emb|CAE11871.1| hypothetical protein [Homo sapiens]
gi|190689601|gb|ACE86575.1| sulfatase 1 protein [synthetic construct]
gi|190690967|gb|ACE87258.1| sulfatase 1 protein [synthetic construct]
Length = 871
Score = 297 bits (761), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 132/226 (58%), Positives = 175/226 (77%), Gaps = 6/226 (2%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N+
Sbjct: 139 FFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITNE 198
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQ 140
S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 199 SINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKH 258
Query: 141 WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DHG
Sbjct: 259 WIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHG 318
Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
YH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 319 YHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|402878437|ref|XP_003902891.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 1 [Papio anubis]
gi|402878439|ref|XP_003902892.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Papio anubis]
gi|402878441|ref|XP_003902893.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 3 [Papio anubis]
gi|402878443|ref|XP_003902894.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 4 [Papio anubis]
Length = 869
Score = 297 bits (761), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 358
>gi|332826233|ref|XP_003311792.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 1 [Pan
troglodytes]
gi|332826235|ref|XP_003311793.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Pan
troglodytes]
gi|332826237|ref|XP_003311794.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 3 [Pan
troglodytes]
gi|332826239|ref|XP_003311795.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 4 [Pan
troglodytes]
gi|410266244|gb|JAA21088.1| sulfatase 1 [Pan troglodytes]
gi|410266246|gb|JAA21089.1| sulfatase 1 [Pan troglodytes]
gi|410266248|gb|JAA21090.1| sulfatase 1 [Pan troglodytes]
Length = 869
Score = 297 bits (761), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCHNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|47227306|emb|CAF96855.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1239
Score = 297 bits (761), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 130/217 (59%), Positives = 171/217 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGW+EW AL+ NS++YNY++ NG + KH DY DY D+I N
Sbjct: 173 AFFGKYLNEYNGSYVPPGWKEWVALVKNSRFYNYTLCRNGVREKHSSDYPKDYLTDIITN 232
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + R SK+ + +P+M+V+S APHGPEDSAPQY++ F + + H TP+Y+YAPNPDK
Sbjct: 233 ESIHYFRTSKRMYPNRPVMMVLSHVAPHGPEDSAPQYSSAFPSASQHITPSYNYAPNPDK 292
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WIL+ T M+PVH QFT++L +R+QTL SVDD+VEK+ L + GEL+NTY+IYTSDH
Sbjct: 293 HWILRYTGPMKPVHMQFTNMLQRRRMQTLLSVDDSVEKVFNMLVETGELDNTYLIYTSDH 352
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G M
Sbjct: 353 GYHIGQFGLVKGKSMPYEFDIRVPFFIRGPNVEQGAM 389
>gi|410336657|gb|JAA37275.1| sulfatase 1 [Pan troglodytes]
gi|410336659|gb|JAA37276.1| sulfatase 1 [Pan troglodytes]
gi|410336661|gb|JAA37277.1| sulfatase 1 [Pan troglodytes]
Length = 869
Score = 297 bits (761), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCHNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|281351669|gb|EFB27253.1| hypothetical protein PANDA_007018 [Ailuropoda melanoleuca]
Length = 862
Score = 297 bits (761), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 358
>gi|224078734|ref|XP_002189215.1| PREDICTED: extracellular sulfatase Sulf-2 isoform 1 [Taeniopygia
guttata]
Length = 879
Score = 297 bits (761), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 130/217 (59%), Positives = 170/217 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++ NG K KHG DY DY DLI N
Sbjct: 139 AFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGYDYSRDYLTDLITN 198
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ F R SK+ + +P+++V+S APHGPEDSAPQY+ +F N + H TP+Y+YAPNPDK
Sbjct: 199 DSITFFRISKKMYPHRPVLMVISHAAPHGPEDSAPQYSQLFPNASEHITPSYNYAPNPDK 258
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L KRLQTL SVDD++E I L + GEL NTY+IYT+DH
Sbjct: 259 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMEMIYNMLAETGELENTYLIYTADH 318
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 319 GYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 355
>gi|358415387|ref|XP_003583093.1| PREDICTED: extracellular sulfatase Sulf-1-like [Bos taurus]
Length = 868
Score = 297 bits (761), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|444518093|gb|ELV11952.1| Extracellular sulfatase Sulf-1 [Tupaia chinensis]
Length = 1035
Score = 297 bits (761), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYI+YT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIVYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|301766256|ref|XP_002918548.1| PREDICTED: extracellular sulfatase Sulf-1-like [Ailuropoda
melanoleuca]
Length = 867
Score = 297 bits (761), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 358
>gi|5689491|dbj|BAA83029.1| KIAA1077 protein [Homo sapiens]
Length = 818
Score = 297 bits (761), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 85 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 144
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 145 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 204
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 205 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 264
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 265 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 305
>gi|431891827|gb|ELK02361.1| Extracellular sulfatase Sulf-1 [Pteropus alecto]
Length = 958
Score = 297 bits (761), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 196 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 255
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 256 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 315
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 316 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 375
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 376 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 416
>gi|317418991|emb|CBN81029.1| Extracellular sulfatase Sulf-2 [Dicentrarchus labrax]
Length = 885
Score = 297 bits (760), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 130/217 (59%), Positives = 171/217 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGW+EW AL+ NS++YNY++ NG + KH DY DY D+I N
Sbjct: 146 AFFGKYLNEYNGSYVPPGWKEWVALVKNSRFYNYTLCRNGVREKHSSDYSKDYLTDIITN 205
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ + R SK+ + +P+M+V+S APHGPEDSAPQY++ F N + H TP+Y+YAPNPDK
Sbjct: 206 DSINYFRMSKKMYPHRPVMMVLSHVAPHGPEDSAPQYSSAFPNASQHITPSYNYAPNPDK 265
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WIL+ T M+PVH QFT++L +R+QTL SVDD++EK+ L + GEL+NTY+IYTSDH
Sbjct: 266 HWILRYTGPMKPVHMQFTNMLQRRRMQTLLSVDDSMEKMYNMLMETGELDNTYLIYTSDH 325
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G +
Sbjct: 326 GYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEQGAI 362
>gi|410987255|ref|XP_003999920.1| PREDICTED: LOW QUALITY PROTEIN: extracellular sulfatase Sulf-1
[Felis catus]
Length = 865
Score = 297 bits (760), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|432859487|ref|XP_004069132.1| PREDICTED: extracellular sulfatase Sulf-2-like [Oryzias latipes]
Length = 885
Score = 297 bits (760), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 130/212 (61%), Positives = 169/212 (79%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGW+EW AL+ NS++YNY++ NG + KH DY DY D+I N
Sbjct: 146 AFFGKYLNEYNGSYVPPGWKEWVALVKNSRFYNYTLCRNGVREKHSGDYRKDYLTDIITN 205
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ + R SK+ + +P+M+V+S APHGPEDSAPQY++ F N + H TP+Y+YAPNPDK
Sbjct: 206 DSINYFRMSKRMYPHRPVMMVLSHVAPHGPEDSAPQYSSAFPNASQHITPSYNYAPNPDK 265
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WIL+ T M+PVH QFT++L +R+QTL SVDD+VEK+ L + GEL+NTYIIYTSDH
Sbjct: 266 HWILRYTGPMKPVHMQFTNMLQRRRMQTLLSVDDSVEKVYNMLLETGELDNTYIIYTSDH 325
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYH+GQFGLVKGKS P+EFDIR+PF +RGP +
Sbjct: 326 GYHIGQFGLVKGKSMPYEFDIRIPFYVRGPDV 357
>gi|432096980|gb|ELK27479.1| Extracellular sulfatase Sulf-1 [Myotis davidii]
Length = 1137
Score = 297 bits (760), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|395849374|ref|XP_003797303.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 1 [Otolemur
garnettii]
gi|395849376|ref|XP_003797304.1| PREDICTED: extracellular sulfatase Sulf-1 isoform 2 [Otolemur
garnettii]
Length = 863
Score = 296 bits (759), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCHNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PI++V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPILMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|297299552|ref|XP_002808528.1| PREDICTED: LOW QUALITY PROTEIN: extracellular sulfatase Sulf-1-like
[Macaca mulatta]
Length = 1042
Score = 296 bits (759), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|359072436|ref|XP_003586945.1| PREDICTED: extracellular sulfatase Sulf-1-like [Bos taurus]
Length = 826
Score = 296 bits (759), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 175/227 (77%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 96 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 155
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 156 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 215
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 216 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 275
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 276 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 316
>gi|440904303|gb|ELR54836.1| Extracellular sulfatase Sulf-1, partial [Bos grunniens mutus]
Length = 869
Score = 296 bits (759), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 174/227 (76%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ +PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGST------VPQI 358
>gi|51491872|ref|NP_001003833.2| extracellular sulfatase Sulf-2 precursor [Danio rerio]
gi|37812244|gb|AAR04058.1| sulfatase FP2b [Danio rerio]
Length = 885
Score = 296 bits (758), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 129/215 (60%), Positives = 169/215 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNG+YIPPGWREW A++ NS++YNY++ NG + KHG +Y DY DLI N
Sbjct: 141 AFFGKYLNEYNGTYIPPGWREWVAMVKNSRFYNYTLCRNGVREKHGFEYPKDYLTDLITN 200
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ + R SK+ + +P+++V+S APHGPED+APQY F N + H TP+Y+YAPNPDK
Sbjct: 201 DSINYFRMSKKIYPHRPVLMVISHAAPHGPEDAAPQYTTAFPNASQHITPSYNYAPNPDK 260
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L KRLQTL SVDD+VEK+ L D GEL+NTY+IYT+DH
Sbjct: 261 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLLSVDDSVEKVYNMLVDTGELDNTYVIYTADH 320
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G
Sbjct: 321 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEAG 355
>gi|37812246|gb|AAR04059.1| sulfatase FP2a [Danio rerio]
Length = 867
Score = 296 bits (758), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 129/215 (60%), Positives = 169/215 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNG+YIPPGWREW A++ NS++YNY++ NG + KHG +Y DY DLI N
Sbjct: 141 AFFGKYLNEYNGTYIPPGWREWVAMVKNSRFYNYTLCRNGVREKHGFEYPKDYLTDLITN 200
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ + R SK+ + +P+++V+S APHGPED+APQY F N + H TP+Y+YAPNPDK
Sbjct: 201 DSINYFRMSKKIYPHRPVLMVISHAAPHGPEDAAPQYTTAFPNASQHITPSYNYAPNPDK 260
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L KRLQTL SVDD+VEK+ L D GEL+NTY+IYT+DH
Sbjct: 261 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLLSVDDSVEKVYNMLVDTGELDNTYVIYTADH 320
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G
Sbjct: 321 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEAG 355
>gi|260788079|ref|XP_002589078.1| hypothetical protein BRAFLDRAFT_120897 [Branchiostoma floridae]
gi|229274252|gb|EEN45089.1| hypothetical protein BRAFLDRAFT_120897 [Branchiostoma floridae]
Length = 922
Score = 296 bits (758), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 136/223 (60%), Positives = 168/223 (75%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLN+YNGSY+PPGWREW I NS++YNY +N NG K KH Y DY+ DLIAN
Sbjct: 147 GYFGKYLNEYNGSYVPPGWREWVGQIRNSRFYNYKVNRNGLKFKHDSSYERDYFTDLIAN 206
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DSV F + SK++F +P+ +V++FPAPHGPED APQY++MF N T H Y+ APN DK
Sbjct: 207 DSVSFFKLSKKTFPHRPVAMVLAFPAPHGPEDGAPQYSHMFANNTDHRIQGYNMAPNMDK 266
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WIL+ T+ + FTD+L KRLQTLQSVDDAVEK+ L + G+L NTY+IYTSDH
Sbjct: 267 HWILRWTQPLDNTQLAFTDMLQQKRLQTLQSVDDAVEKVYNMLIEQGQLENTYMIYTSDH 326
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
GYHLGQFGLVKGKSFP+++D RVP+L+RGP + P T+ VL
Sbjct: 327 GYHLGQFGLVKGKSFPYDWDTRVPYLIRGPNVQPNTVMSQIVL 369
>gi|148225416|ref|NP_001088414.1| sulfatase 2 precursor [Xenopus laevis]
gi|119850718|gb|AAI27412.1| LOC495271 protein [Xenopus laevis]
Length = 872
Score = 296 bits (757), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 128/217 (58%), Positives = 170/217 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNG+Y+PPGW+EW L+ NS++YNY++ NG K KHG DY DY DLI N
Sbjct: 138 AFFGKYLNEYNGTYVPPGWKEWVGLVKNSRFYNYTLCRNGIKEKHGFDYAQDYLTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ F R SK+ + +P+++V+S APHGPEDSAPQY+ MF N + H TP+Y+YAPNPDK
Sbjct: 198 DSISFFRMSKKIYPHRPVLMVLSHAAPHGPEDSAPQYSQMFQNASQHITPSYNYAPNPDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L +RLQTL SVD+++E I L + GEL NTY+IYT+DH
Sbjct: 258 HWIMRYTGPMKPIHMEFTNMLQRRRLQTLMSVDNSMEMIYNMLVETGELENTYVIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 318 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEAGSL 354
>gi|345306776|ref|XP_001510791.2| PREDICTED: extracellular sulfatase Sulf-1-like [Ornithorhynchus
anatinus]
Length = 815
Score = 295 bits (756), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 129/217 (59%), Positives = 172/217 (79%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGWREW L+ NS++YNY+I NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYVPPGWREWLGLVKNSRFYNYTICRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +P+M+V+S APHGPEDSAPQ+++ F N + H TP+Y+YAPN DK
Sbjct: 198 ESISYFKMSKRMYPHRPVMMVVSHAAPHGPEDSAPQFSDRFPNASQHVTPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WILQ T M P+H +FT++L KRLQTL SVDD++E++ L + GEL+NTY+IYT+DH
Sbjct: 258 HWILQYTGPMLPIHMEFTNMLQRKRLQTLLSVDDSMERLFAMLLETGELDNTYVIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG++
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGSI 354
>gi|348510327|ref|XP_003442697.1| PREDICTED: extracellular sulfatase Sulf-2-like [Oreochromis
niloticus]
Length = 884
Score = 295 bits (756), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 129/217 (59%), Positives = 170/217 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+PPGW+EW AL+ NS++YNY++ NG + KH DY DY D+I N
Sbjct: 146 AFFGKYLNEYNGSYVPPGWKEWVALVKNSRFYNYTLCRNGVREKHSSDYTKDYLTDVITN 205
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + R SK+ + +P+++V+S APHGPEDSAPQY+ F N + H TP+Y+YAPNPDK
Sbjct: 206 NSIDYFRMSKRLYPHRPVLMVLSHVAPHGPEDSAPQYSAAFSNASQHITPSYNYAPNPDK 265
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WIL+ T M+PVH QFT++L +R+QTL SVDD+VEK+ L + GEL+NTY+IYTSDH
Sbjct: 266 HWILRYTGPMKPVHMQFTNMLQRRRMQTLLSVDDSVEKVYNMLVETGELDNTYLIYTSDH 325
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G +
Sbjct: 326 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEQGAI 362
>gi|338728266|ref|XP_003365645.1| PREDICTED: extracellular sulfatase Sulf-1 [Equus caballus]
Length = 1218
Score = 295 bits (754), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 132/227 (58%), Positives = 174/227 (76%), Gaps = 6/227 (2%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + EL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETRELENTYIIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 318 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 358
>gi|147901831|ref|NP_001090848.1| sulfatase 1 precursor [Xenopus (Silurana) tropicalis]
gi|125719330|gb|ABN54443.1| extracellular 6-O-endosulfatase [Xenopus (Silurana) tropicalis]
Length = 884
Score = 295 bits (754), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 131/217 (60%), Positives = 170/217 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW L+ NS++YNY++ NG K KHG +Y DY+ DLI N
Sbjct: 137 AFFGKYLNEYNGSYIPPGWREWLGLVKNSRFYNYTMCRNGFKEKHGFEYEKDYFTDLITN 196
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ F N + H TP+Y+YAPN DK
Sbjct: 197 DSISYFKLSKKLYPHRPIMMVISHAAPHGPEDSAPQFSEFFPNASQHITPSYNYAPNMDK 256
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD++EK+ L D GE+ NTY+IYTSDH
Sbjct: 257 HWIMQYTGAMLPIHMEFTNVLHRKRLQTLLSVDDSMEKLYNMLVDTGEVENTYLIYTSDH 316
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG++
Sbjct: 317 GYHIGQFGLVKGKSMPYDFDIRVPFFVRGPNVEPGSI 353
>gi|115530840|emb|CAL49339.1| sulfatase 1 [Xenopus (Silurana) tropicalis]
Length = 697
Score = 294 bits (753), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 131/217 (60%), Positives = 170/217 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW L+ NS++YNY++ NG K KHG +Y DY+ DLI N
Sbjct: 137 AFFGKYLNEYNGSYIPPGWREWLGLVKNSRFYNYTMCRNGFKEKHGFEYEKDYFTDLITN 196
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ + + SK+ + +PIM+V+S APHGPEDSAPQ++ F N + H TP+Y+YAPN DK
Sbjct: 197 DSISYFKLSKKLYPHRPIMMVISHAAPHGPEDSAPQFSEFFPNASQHITPSYNYAPNMDK 256
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL SVDD++EK+ L D GE+ NTY+IYTSDH
Sbjct: 257 HWIMQYTGAMLPIHMEFTNVLHRKRLQTLLSVDDSMEKLYNMLVDTGEVENTYLIYTSDH 316
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG++
Sbjct: 317 GYHIGQFGLVKGKSMPYDFDIRVPFFVRGPNVEPGSI 353
>gi|170571849|ref|XP_001891890.1| Sulfatase family protein [Brugia malayi]
gi|158603342|gb|EDP39296.1| Sulfatase family protein [Brugia malayi]
Length = 595
Score = 293 bits (751), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 136/224 (60%), Positives = 171/224 (76%), Gaps = 13/224 (5%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
YFGKYLN+Y GSYIPPGW W L+ NS++YNY++N+NG KIKHG DY DY+ DLIAN
Sbjct: 124 AYFGKYLNEYIGSYIPPGWDYWMGLLRNSRFYNYTVNINGNKIKHGWDYGKDYFTDLIAN 183
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
D++ F+RQ + KP M+V+SFPAPHGPED APQY+ F +V +H T A++YAPNPDK
Sbjct: 184 DTIAFIRQLRDKDPFKPYMIVISFPAPHGPEDPAPQYSAWFQDVETHRTEAWNYAPNPDK 243
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVE-------------KIVKELKDLG 186
QW+LQ T +M+PVH FTD+L +RLQTLQSVD ++ K++ EL+DLG
Sbjct: 244 QWLLQHTGRMEPVHVIFTDVLHRRRLQTLQSVDSNIQRVGFISYFYFPLAKLINELRDLG 303
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
+L+NT IYTSDHGYHLGQFGLVKGK+ P+EFDIRVP+ +RGPG
Sbjct: 304 DLSNTVFIYTSDHGYHLGQFGLVKGKNMPYEFDIRVPYFIRGPG 347
>gi|348517056|ref|XP_003446051.1| PREDICTED: extracellular sulfatase Sulf-2-like [Oreochromis
niloticus]
Length = 870
Score = 293 bits (750), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 129/217 (59%), Positives = 168/217 (77%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+P GWREW L+ NS++YNY++ NG + KHG Y DY DLI
Sbjct: 134 AFFGKYLNEYNGSYVPAGWREWLGLVKNSRFYNYTLCRNGVREKHGAQYPQDYLTDLITA 193
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+R+ R SK+ + +P+++V+S APHGPEDSAPQY+ F N + H TP+Y+YAPNPDK
Sbjct: 194 ESIRYFRSSKRVYPHRPVLMVLSHAAPHGPEDSAPQYSTAFPNASQHITPSYNYAPNPDK 253
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L KRLQTL SVDD+VEK+ L + GEL NTYIIYTSDH
Sbjct: 254 HWIMRYTGPMKPIHMEFTNMLQRKRLQTLLSVDDSVEKLYNMLVETGELENTYIIYTSDH 313
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 314 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEQGSI 350
>gi|54020831|ref|NP_001005661.1| sulfatase 2 precursor [Xenopus (Silurana) tropicalis]
gi|49250556|gb|AAH74686.1| sulfatase 1 [Xenopus (Silurana) tropicalis]
Length = 875
Score = 291 bits (746), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 127/217 (58%), Positives = 169/217 (77%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNG+Y+PPGW+EW L+ NS++YNY++ NG K KHG DY DY DLI N
Sbjct: 140 AFFGKYLNEYNGTYVPPGWKEWVGLVKNSRFYNYTLCRNGIKEKHGFDYAQDYLTDLITN 199
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ F R SK+ + +P+++V+S APHGPEDSAPQY+ MF N + H TP+Y+YAPNPDK
Sbjct: 200 NSINFFRTSKKIYPHRPVLMVLSHVAPHGPEDSAPQYSQMFPNASQHITPSYNYAPNPDK 259
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L +RLQTL SVD ++E I L + GE+ NTYIIYT+DH
Sbjct: 260 HWIMRYTGPMKPIHMEFTNMLQRRRLQTLMSVDSSMEMIYNMLVETGEVENTYIIYTADH 319
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 320 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVDAGSL 356
>gi|47939687|gb|AAH72073.1| LOC432214 protein, partial [Xenopus laevis]
Length = 679
Score = 290 bits (742), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 126/217 (58%), Positives = 169/217 (77%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNG+Y+PPGW+EW L+ NS++YNY++ NG K KHG DY DY DLI N
Sbjct: 138 AFFGKYLNEYNGTYVPPGWKEWVGLVKNSRFYNYTLCRNGIKEKHGFDYAQDYLTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ F R SK+ + +P+++V+S APHGPEDSAPQY+ MF N + H TP+Y+YAPN DK
Sbjct: 198 DSISFFRTSKKVYPHRPVLMVLSHVAPHGPEDSAPQYSQMFPNASQHITPSYNYAPNLDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L +RLQTL SVD+++E + L + GEL NTY+IYT+DH
Sbjct: 258 HWIMRYTGPMRPIHMEFTNMLQRRRLQTLMSVDNSMEMVYNMLVETGELENTYVIYTADH 317
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 318 GYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVDAGSL 354
>gi|449265602|gb|EMC76771.1| Extracellular sulfatase Sulf-1, partial [Columba livia]
Length = 856
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 131/227 (57%), Positives = 174/227 (76%), Gaps = 8/227 (3%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW L+ NS++YNY+I+ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWVGLVKNSRFYNYTISRNGNKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + R SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+Q T M P+H +FT++L KRLQTL S AV ++ + L ++GEL NTYIIYT+DH
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLFSY--AVLQLYQMLAEMGELENTYIIYTADH 315
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
GYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 316 GYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 356
>gi|432959714|ref|XP_004086377.1| PREDICTED: extracellular sulfatase Sulf-2-like [Oryzias latipes]
Length = 938
Score = 288 bits (736), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 128/215 (59%), Positives = 166/215 (77%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSY+P GWREW L+ NS++YNY+++ NG + KHG Y DY DLI
Sbjct: 134 AFFGKYLNEYNGSYVPNGWREWLGLVKNSRFYNYTLSRNGVREKHGARYPEDYLTDLITE 193
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+SVR+ R SK+ + +P+++V+S APHGPEDSAPQY+ F N + H TP+Y+YAPN DK
Sbjct: 194 ESVRYFRSSKRVYPHRPVLMVLSHAAPHGPEDSAPQYSEAFPNASHHITPSYNYAPNLDK 253
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ T M+P+H +FT++L KRLQTL SVDD++EK+ L + GEL NTYIIYTSDH
Sbjct: 254 HWIMRYTGPMKPIHMEFTNVLQRKRLQTLLSVDDSMEKLYNMLTETGELENTYIIYTSDH 313
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
GYH+GQFGLVKGKS P+EFDIRVPF +RGP + G
Sbjct: 314 GYHIGQFGLVKGKSMPYEFDIRVPFYIRGPNVEQG 348
>gi|268578729|ref|XP_002644347.1| C. briggsae CBR-SUL-1 protein [Caenorhabditis briggsae]
Length = 681
Score = 288 bits (736), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 129/212 (60%), Positives = 167/212 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
Y GKYLN+Y+GSYIPPGW EW A++ NSK+YNYS+N NG + K G +Y DY+ DL+ N
Sbjct: 101 AYLGKYLNEYDGSYIPPGWDEWHAIVKNSKFYNYSMNSNGEREKFGFEYEKDYFTDLVTN 160
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
S+RF+ + ++ + +P L++S+PAPHGPED APQ+A+MF N SH T ++++APNPDK
Sbjct: 161 RSLRFIEKHIKTRAWQPFALIISYPAPHGPEDPAPQFAHMFENEFSHRTGSWNFAPNPDK 220
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QW+LQ T KM VH FTDLL +RLQTLQSVD+ +E++ L++L +L NTY IYTSDH
Sbjct: 221 QWLLQRTGKMNEVHISFTDLLHRRRLQTLQSVDEGIERLFNLLRELNQLWNTYAIYTSDH 280
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGK+ P+EFDIRVPF MRGPGI
Sbjct: 281 GYHLGQFGLLKGKNMPYEFDIRVPFFMRGPGI 312
>gi|341873966|gb|EGT29901.1| CBN-SUL-1 protein [Caenorhabditis brenneri]
Length = 691
Score = 285 bits (730), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 129/218 (59%), Positives = 168/218 (77%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
Y GKYLN+Y+GSYIPPGW EW A++ NSK+YNY++N NG + K G +Y DY+ DL+ N
Sbjct: 123 AYLGKYLNEYDGSYIPPGWDEWHAIVKNSKFYNYTMNSNGEREKFGFEYEKDYFTDLVTN 182
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
S+RF+ + ++ + +P L++S+PAPHGPED APQ+A+MF N SH T ++++APNPDK
Sbjct: 183 RSLRFIEKHIKTRAWQPFALIISYPAPHGPEDPAPQFAHMFENEFSHRTGSWNFAPNPDK 242
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QW+LQ T KM VH FTDLL +RLQTLQSVDD + ++ L++L +L NTY IYTSDH
Sbjct: 243 QWLLQRTGKMNDVHITFTDLLHRRRLQTLQSVDDGLYRLFNLLRELNQLWNTYAIYTSDH 302
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY 237
GYHLGQFGL+KGK+ P+EFDIRVPF MRGPGI T +
Sbjct: 303 GYHLGQFGLLKGKNMPYEFDIRVPFFMRGPGIPRNTSF 340
>gi|17568795|ref|NP_508560.1| Protein SUL-1 [Caenorhabditis elegans]
gi|33112442|sp|Q21376.1|SULF1_CAEEL RecName: Full=Putative extracellular sulfatase Sulf-1 homolog;
Short=CeSulf-1; Flags: Precursor
gi|351061478|emb|CCD69246.1| Protein SUL-1 [Caenorhabditis elegans]
Length = 709
Score = 285 bits (728), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 127/212 (59%), Positives = 166/212 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
Y GKYLN+Y+GSYIPPGW EW A++ NSK+YNY++N NG + K G +Y DY+ DL+ N
Sbjct: 130 AYLGKYLNEYDGSYIPPGWDEWHAIVKNSKFYNYTMNSNGEREKFGSEYEKDYFTDLVTN 189
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
S++F+ + + + +P L++S+PAPHGPED APQ+A+MF N SH T ++++APNPDK
Sbjct: 190 RSLKFIDKHIKIRAWQPFALIISYPAPHGPEDPAPQFAHMFENEISHRTGSWNFAPNPDK 249
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
QW+LQ T KM VH FTDLL +RLQTLQSVD+ +E++ L++L +L NTY IYTSDH
Sbjct: 250 QWLLQRTGKMNDVHISFTDLLHRRRLQTLQSVDEGIERLFNLLRELNQLWNTYAIYTSDH 309
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQFGL+KGK+ P+EFDIRVPF MRGPGI
Sbjct: 310 GYHLGQFGLLKGKNMPYEFDIRVPFFMRGPGI 341
>gi|198427333|ref|XP_002120498.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 1001
Score = 280 bits (715), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 131/240 (54%), Positives = 171/240 (71%), Gaps = 7/240 (2%)
Query: 1 MAWPVELTSRNLLYF----DISHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSIN 56
+AW V+ SR+ + S YFGKYLN+YNGSYIP GW+ W L+ NS+YYNYS+
Sbjct: 139 IAWQVKHESRSFATYLNKTGYSTAYFGKYLNEYNGSYIPQGWQYWMGLVRNSRYYNYSLR 198
Query: 57 MNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY 116
N K H ++Y +DY+ DLI N S+ + R+ K PI+ V+SFPAPHG ED APQY
Sbjct: 199 HNDVKESHRDNYRDDYFTDLIVNRSMTYFRRKKHEEPDSPILSVLSFPAPHGSEDGAPQY 258
Query: 117 ANMFFNVTSHHTPAYDYAPNPDKQWILQVTRK--MQPVHRQFTDLLMTKRLQTLQSVDDA 174
+++ NVTSH TP++DY PNPDK WI+ +RK M +F+ +L KRLQTL+SVDDA
Sbjct: 259 QHLYANVTSHITPSFDYGPNPDKHWIIS-SRKVPMDETQHRFSSILQQKRLQTLRSVDDA 317
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
V++ V L+D GEL+NTY++YTSDHG+H+GQFGL KGKS P++FD+RVP MRGPGI G
Sbjct: 318 VDRFVSMLQDTGELDNTYLLYTSDHGFHIGQFGLAKGKSMPYDFDVRVPLFMRGPGIQAG 377
>gi|284424942|dbj|BAI67117.1| heparan sulfate 6-O endosulfatase [Hemicentrotus pulcherrimus]
Length = 983
Score = 279 bits (713), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 125/215 (58%), Positives = 163/215 (75%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLN+Y G YIPPGW EW L+ NSK+YNY++ NG+ ++HG +Y+ DY PDLIAN
Sbjct: 158 GYFGKYLNEYRGDYIPPGWVEWVGLVKNSKFYNYTLCRNGQLMRHGFNYHRDYLPDLIAN 217
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
DS+ F + SK+++ + P+ +V+SFP+PHGPED APQY +MF H TPA++Y PN DK
Sbjct: 218 DSMAFFKLSKRAYPRLPVAMVLSFPSPHGPEDGAPQYQDMFEGNKLHRTPAWNYGPNEDK 277
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++ +M + R FTD L KRL TL SVDDA+EK+ +EL+ LGEL+NTYI+ TSDH
Sbjct: 278 HWIIRQASRMTKLERDFTDALQRKRLITLLSVDDAIEKLCRELEALGELDNTYILMTSDH 337
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
GYHLGQF ++KGKS ++ DIRVPF +RGP I G
Sbjct: 338 GYHLGQFNMLKGKSNAYDTDIRVPFYIRGPDIPHG 372
>gi|308512451|ref|XP_003118408.1| CRE-SUL-1 protein [Caenorhabditis remanei]
gi|308239054|gb|EFO83006.1| CRE-SUL-1 protein [Caenorhabditis remanei]
Length = 680
Score = 276 bits (707), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 126/227 (55%), Positives = 167/227 (73%), Gaps = 15/227 (6%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDY----------- 68
Y GKYLN+Y+GSYIPPGW EW A++ NSK+YNY++N NG + + G DY
Sbjct: 124 AYLGKYLNEYDGSYIPPGWDEWHAIVKNSKFYNYTMNSNGERERFGSDYEKVRFRFWKNI 183
Query: 69 ----YNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
+ DY+ DL+ N S++F+ + ++ + +P L++S+PAPHGPED APQ+A+MF N
Sbjct: 184 LILRFQDYFTDLVTNRSLKFIEKHIKTRAWQPFALIISYPAPHGPEDPAPQFAHMFENEL 243
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
SH T ++++APNPDKQW+LQ T KM VH FTDLL +RLQTLQSVDD + ++ L++
Sbjct: 244 SHRTGSWNFAPNPDKQWLLQRTGKMNDVHITFTDLLHRRRLQTLQSVDDGIARLFNLLRE 303
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
L +L NT+ IYTSDHGYHLGQFGL+KGK+ P+EFDIRVPF MRGPGI
Sbjct: 304 LNQLWNTFAIYTSDHGYHLGQFGLLKGKNMPYEFDIRVPFFMRGPGI 350
>gi|47211781|emb|CAF94091.1| unnamed protein product [Tetraodon nigroviridis]
Length = 879
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 131/273 (47%), Positives = 180/273 (65%), Gaps = 37/273 (13%)
Query: 1 MAWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSIN 56
M+W + R + + GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY+++
Sbjct: 111 MSWQRQHEPRTFGVYLNNTGYRTAFFGKYLNEYNGSYVPPGWKEWLGLVKNSRFYNYTLS 170
Query: 57 MNGRKIKHGEDY-------YNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
NG + KHG + + DY DLI +S+R+ R SK+ + +P+++V+S APHGP
Sbjct: 171 RNGFREKHGAECVCMCVCVFQDYLTDLITAESMRYFRYSKRVYPHRPVLMVLSHAAPHGP 230
Query: 110 EDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQ 169
EDSAPQY+ F N + H TP+Y+YAPNPDK WI++ M+P+H +FT++L KRLQTL
Sbjct: 231 EDSAPQYSTAFQNASQHITPSYNYAPNPDKHWIMRYIGPMKPIHMEFTNMLQRKRLQTLL 290
Query: 170 SVDDAVEK--------------------------IVKELKDLGELNNTYIIYTSDHGYHL 203
SVDD++EK + L ++GEL+NTYIIYTSDHGYH+
Sbjct: 291 SVDDSMEKAGPADQDASPPGALLRLLTPSVSVLQLYNMLVEMGELDNTYIIYTSDHGYHI 350
Query: 204 GQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 351 GQFGLVKGKSMPYEFDIRVPFFIRGPNVEQGSI 383
>gi|195997457|ref|XP_002108597.1| hypothetical protein TRIADDRAFT_18386 [Trichoplax adhaerens]
gi|190589373|gb|EDV29395.1| hypothetical protein TRIADDRAFT_18386 [Trichoplax adhaerens]
Length = 655
Score = 249 bits (637), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 116/212 (54%), Positives = 151/212 (71%), Gaps = 1/212 (0%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
G+FGKYLN+Y+GSYIP GW W + NS+YYNY++N NG+ +KHG DY+ DLI
Sbjct: 125 GFFGKYLNEYDGSYIPVGWDHWIGQVKNSRYYNYTLNRNGKLVKHGNRCQQDYFTDLIVR 184
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+RFL+++ + + KKPI++V+S PAPHG EDSAPQY N + TP Y+Y + DK
Sbjct: 185 ESIRFLKKTTEKYPKKPILMVLSMPAPHGTEDSAPQYQNRYARAKVPRTPNYNYH-SFDK 243
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI+++ M V FTDLL KRLQTL SVDDAVEK+ + ++ L TY+ +TSDH
Sbjct: 244 HWIVRIAPPMSSVKMNFTDLLYRKRLQTLLSVDDAVEKLYRSIRKLDLKEKTYLFFTSDH 303
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLG F +VKGKS P+E DIRVPFL+RGP I
Sbjct: 304 GYHLGHFSIVKGKSMPYETDIRVPFLVRGPMI 335
>gi|47213980|emb|CAG00671.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1160
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 125/257 (48%), Positives = 166/257 (64%), Gaps = 22/257 (8%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + R+ + + GY FGKYLN+YNGSYIPPGWREW LI NS++YNY++
Sbjct: 115 SWQAQHEPRSFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWVGLIKNSRFYNYTVCR 174
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG +Y DY+ DLI NDS+ F R SK+ F +P+M+V+S APHGPEDSAPQYA
Sbjct: 175 NGYKEKHGAEYAKDYFTDLITNDSINFFRISKRMFPHRPVMMVISHAAPHGPEDSAPQYA 234
Query: 118 NMFFNVTSHHTPAYDYAPN------PDKQWILQVTRK------------MQPVHRQFTDL 159
+ F N + H P + + P K LQ+ K M+P+ +FT+
Sbjct: 235 DHFPNASQHIPPDWWVKSSSISHSRPRKHAQLQLRSKYGQTLDHAIPGPMRPIPMEFTNF 294
Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
KRLQTL + + K+ L++ GEL+NTYI+YT+DHGYH+GQFGLVKGKS P++FD
Sbjct: 295 PHRKRLQTLMAYHYSAPKVYDMLEETGELDNTYIMYTADHGYHIGQFGLVKGKSMPYDFD 354
Query: 220 IRVPFLMRGPGIVPGTM 236
IRVPF +RGP + PG +
Sbjct: 355 IRVPFFLRGPNVEPGAV 371
>gi|449674493|ref|XP_002155025.2| PREDICTED: extracellular sulfatase Sulf-1-like [Hydra
magnipapillata]
Length = 497
Score = 239 bits (610), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 111/215 (51%), Positives = 146/215 (67%), Gaps = 1/215 (0%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
YFGKYLN YNG +PPGW W L+ NS++YNYS+N+NG+ +K+ DY DY+ D++ N
Sbjct: 134 AYFGKYLNNYNGDRVPPGWNHWSGLVRNSRFYNYSLNVNGKIVKYKNDYKKDYFTDVVTN 193
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
S+ + + K PIM+V+S APHGPEDSAPQY + F ++ + P Y+Y + DK
Sbjct: 194 ASIDYFKNVKSQTLSAPIMMVLSMSAPHGPEDSAPQYQDSFKDIKAPRHPNYNYT-SSDK 252
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
W+++ T + FTD+L KRLQTL SVDDA+EK+ L +G L+ TYII+TSDH
Sbjct: 253 HWVIRQTPPLDETTALFTDILYRKRLQTLLSVDDAIEKVYNMLLQVGLLDETYIIFTSDH 312
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
GYHLGQF VKGKS P+E DIR+P MRGP I G
Sbjct: 313 GYHLGQFNQVKGKSQPYESDIRIPLYMRGPHIPKG 347
>gi|324535141|gb|ADY49408.1| Extracellular sulfatase Sulf-1, partial [Ascaris suum]
Length = 170
Score = 236 bits (603), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 107/169 (63%), Positives = 138/169 (81%)
Query: 43 ALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMS 102
ALI NS++YNY+IN NG KIKHG DY DY+ DLIAND++ F+ + ++ KP + V+S
Sbjct: 2 ALIRNSRFYNYTINFNGDKIKHGFDYEKDYFTDLIANDTIAFIHRHREREPTKPFLAVLS 61
Query: 103 FPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMT 162
FPAPHGPED APQ+A+ F +V +H T A++YAPNPDKQW+LQ T +M+P+H FTD+L
Sbjct: 62 FPAPHGPEDPAPQFADWFEDVETHRTEAWNYAPNPDKQWLLQHTGRMEPIHVVFTDVLHR 121
Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
+RLQTLQSVD +V+++ EL+DLGEL+NT+IIYTSDHGYHLGQFGLVKG
Sbjct: 122 RRLQTLQSVDHSVQRLANELRDLGELSNTFIIYTSDHGYHLGQFGLVKG 170
>gi|156364294|ref|XP_001626284.1| predicted protein [Nematostella vectensis]
gi|156213155|gb|EDO34184.1| predicted protein [Nematostella vectensis]
Length = 369
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 113/212 (53%), Positives = 148/212 (69%), Gaps = 1/212 (0%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLN Y+GSYIP GW W LI NS++YNY + N KH +Y NDY+ ++I +
Sbjct: 89 GYFGKYLNAYDGSYIPYGWHRWAGLIRNSRFYNYVLRHNTFYKKHQNNYENDYFTNVITD 148
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
++ F + SK +PIM+V+S APHGPED APQY + F N T+ TP +++ +PDK
Sbjct: 149 HAISFFKNSKARKPDRPIMMVVSHSAPHGPEDGAPQYQDHFPNATAPRTPNWNFT-SPDK 207
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
WI++VT M F+D+L KRLQTL SVDD +++I LK+ G +TYI ++SDH
Sbjct: 208 HWIVRVTPPMTQQKSDFSDMLQRKRLQTLLSVDDNIQRIWDMLKETGYDRDTYIFFSSDH 267
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GYHLGQ+GLVKGKS P+E DIRVPF +RGPGI
Sbjct: 268 GYHLGQYGLVKGKSMPYESDIRVPFFVRGPGI 299
>gi|402882321|ref|XP_003904694.1| PREDICTED: extracellular sulfatase Sulf-2 [Papio anubis]
Length = 1165
Score = 227 bits (579), Expect = 4e-57, Method: Composition-based stats.
Identities = 99/183 (54%), Positives = 132/183 (72%), Gaps = 4/183 (2%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 448 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 507
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG K KHG DY DY DLI NDSV F R SK+ + +P+++V+S APHGPEDSAPQY+
Sbjct: 508 NGVKEKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYS 567
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F N + H TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E
Sbjct: 568 RLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNILQRKRLQTLMSVDDSMET 627
Query: 178 IVK 180
V+
Sbjct: 628 QVR 630
>gi|307174994|gb|EFN65195.1| Extracellular sulfatase SULF-1-like protein [Camponotus floridanus]
Length = 1148
Score = 213 bits (543), Expect = 5e-53, Method: Composition-based stats.
Identities = 93/109 (85%), Positives = 106/109 (97%)
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
TPAYDYAPNPDKQWILQVT+KMQP+H+QFTDLLMTKRLQTLQSVD AV++I +ELKDLGE
Sbjct: 23 TPAYDYAPNPDKQWILQVTQKMQPIHKQFTDLLMTKRLQTLQSVDAAVDRIYQELKDLGE 82
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
L+NTYIIYTSDHGYHLGQFGL+KGKSFPFEFD+RVPFL+RGPG+ PG++
Sbjct: 83 LDNTYIIYTSDHGYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGVEPGSI 131
>gi|312374901|gb|EFR22368.1| hypothetical protein AND_15355 [Anopheles darlingi]
Length = 1369
Score = 207 bits (526), Expect = 5e-51, Method: Composition-based stats.
Identities = 92/119 (77%), Positives = 105/119 (88%)
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
N F TPAYD+APNPDKQWIL+VT+ M+P+HR+FTDLLMTKRLQTLQSVD AVE+
Sbjct: 38 NGSFFTFQQSTPAYDHAPNPDKQWILRVTQPMEPIHRKFTDLLMTKRLQTLQSVDVAVER 97
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+ +ELK LGEL+NTYIIYTSDHGYHLGQFGL+KGKSFPFEFD+RVPFLMRGPGI P T+
Sbjct: 98 VYQELKALGELDNTYIIYTSDHGYHLGQFGLIKGKSFPFEFDVRVPFLMRGPGIEPATI 156
>gi|170065419|ref|XP_001867931.1| sulfatase-1 [Culex quinquefasciatus]
gi|167882509|gb|EDS45892.1| sulfatase-1 [Culex quinquefasciatus]
Length = 1090
Score = 206 bits (524), Expect = 1e-50, Method: Composition-based stats.
Identities = 88/111 (79%), Positives = 105/111 (94%)
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
TPAYD+APNPDKQWIL+VT+ M+P+HR+FTDLLMTKRLQTLQSVD AVE++ +ELK LGE
Sbjct: 3 TPAYDHAPNPDKQWILRVTKPMEPIHRKFTDLLMTKRLQTLQSVDVAVERVYQELKALGE 62
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
L+NTYIIYTSDHGYHLGQFGL+KGKSFPFEFD+RVPFL+RGPGI P ++++
Sbjct: 63 LDNTYIIYTSDHGYHLGQFGLIKGKSFPFEFDVRVPFLIRGPGIEPASVFV 113
>gi|170065421|ref|XP_001867932.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167882510|gb|EDS45893.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 376
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 95/140 (67%), Positives = 108/140 (77%), Gaps = 1/140 (0%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSINMNG+KIKHG DY DYYPDLIAN
Sbjct: 206 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINMNGQKIKHGFDYGKDYYPDLIAN 265
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHT-PAYDYAPNPD 138
DS+ FLRQSK +KP++L MSFPAPHGPEDSAPQY+++FFNVT+HH Y + N
Sbjct: 266 DSIAFLRQSKHQNHRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHCLTLYRVSDNLS 325
Query: 139 KQWILQVTRKMQPVHRQFTD 158
+ I + Q + TD
Sbjct: 326 RDRIQMTSVTTQQTQGRETD 345
>gi|312374902|gb|EFR22369.1| hypothetical protein AND_15356 [Anopheles darlingi]
Length = 358
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 88/107 (82%), Positives = 97/107 (90%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSINMNG+KIKHG DY DYYPDLIAN
Sbjct: 251 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINMNGQKIKHGFDYAKDYYPDLIAN 310
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSH 126
DS+ FLRQSK +KP++L MSFPAPHGPEDSAPQY+++FFNVT+H
Sbjct: 311 DSIAFLRQSKHQNHRKPVLLAMSFPAPHGPEDSAPQYSHLFFNVTTH 357
>gi|339245873|ref|XP_003374570.1| extracellular sulfatase Sulf-1 [Trichinella spiralis]
gi|316972167|gb|EFV55855.1| extracellular sulfatase Sulf-1 [Trichinella spiralis]
Length = 645
Score = 199 bits (505), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 92/159 (57%), Positives = 119/159 (74%), Gaps = 8/159 (5%)
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
DY+ DLI NDS+ F ++S F +KPI+LV+SFPAPHGPED AP+Y + F N T+H T +
Sbjct: 23 DYFTDLIVNDSINFFKRSTLLFPQKPILLVLSFPAPHGPEDPAPKYNDFFANNTAHRTAS 82
Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV--------KEL 182
++YAPNPDKQW+L+ T KM+PVH FTD L +RLQTL SVDDAV K+ KE
Sbjct: 83 WNYAPNPDKQWLLRHTGKMEPVHVAFTDFLQRRRLQTLLSVDDAVYKVTPVVLYHLYKEF 142
Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
+ G L+NTY +YTSDHGYHLGQ+GL+KGK+ P++ DI+
Sbjct: 143 RQFGLLDNTYFLYTSDHGYHLGQYGLIKGKTLPYDCDIK 181
>gi|291190064|ref|NP_001167172.1| N-acetylglucosamine-6-sulfatase precursor [Salmo salar]
gi|223648450|gb|ACN10983.1| N-acetylglucosamine-6-sulfatase precursor [Salmo salar]
Length = 518
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 109/239 (45%), Positives = 149/239 (62%), Gaps = 16/239 (6%)
Query: 2 AWPVELTSRNLLYFDISHGYFGKYLNKY------NGSYIPPGWREWGALIMNSKYYNYSI 55
A+PV L+ ++ F Y GKYLN+Y N Y+PPGW +W AL+ NS+YYNY++
Sbjct: 127 AFPVYLSKQHYQTF-----YAGKYLNQYGKKETGNVGYVPPGWDQWHALMGNSQYYNYTL 181
Query: 56 NMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ 115
++NG+ +HG++Y DY DLI N S+ FL ++P +++S PAPH P +APQ
Sbjct: 182 SVNGKAEQHGDNYEKDYLTDLILNRSLHFLDDRS---PQRPFFIMLSPPAPHSPWTAAPQ 238
Query: 116 YANMFFNVTSHHTPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
Y F +V + ++D P DK W+L Q + M F D KR QTL SVDD
Sbjct: 239 YEKNFADVKAPRDGSFD-KPGKDKHWLLRQPSNPMPSTSLNFLDNAYRKRWQTLLSVDDM 297
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
VE +VK+L++L ELNNTYI YTSD+GYH GQF L K ++FDIR+P ++RGPGI P
Sbjct: 298 VEVLVKKLEELKELNNTYIFYTSDNGYHTGQFSLPIDKRQLYDFDIRIPLMVRGPGIKP 356
>gi|157118710|ref|XP_001653223.1| sulfatase-1, sulf-1 [Aedes aegypti]
gi|108875645|gb|EAT39870.1| AAEL008372-PA [Aedes aegypti]
Length = 250
Score = 197 bits (500), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 88/107 (82%), Positives = 97/107 (90%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GYFGKYLNKYNGSYIPPGWREWG LIMNSKYYNYSINMNG+KIKHG DY DYYPDLIAN
Sbjct: 143 GYFGKYLNKYNGSYIPPGWREWGGLIMNSKYYNYSINMNGQKIKHGFDYAKDYYPDLIAN 202
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSH 126
DS+ FLRQSK +KP++L MSFPAPHGPEDSAPQY+++FFNVT+H
Sbjct: 203 DSIAFLRQSKHQNHRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTH 249
>gi|157118712|ref|XP_001653224.1| sulfatase-1, sulf-1 [Aedes aegypti]
gi|108875646|gb|EAT39871.1| AAEL008362-PA, partial [Aedes aegypti]
Length = 782
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 88/109 (80%), Positives = 101/109 (92%)
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
TPAYD APNPDKQWIL+VT+ M+P+HR+FTDLLMTKRLQTLQSVD AVE++ +ELK L E
Sbjct: 1 TPAYDLAPNPDKQWILRVTQPMEPIHRKFTDLLMTKRLQTLQSVDVAVERVYQELKALDE 60
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
L+NTYIIYTSDHGYHLGQFGL+KGKSFPFEFD+RVPFLMRGPGI P ++
Sbjct: 61 LDNTYIIYTSDHGYHLGQFGLIKGKSFPFEFDVRVPFLMRGPGIEPASV 109
>gi|410903982|ref|XP_003965472.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Takifugu rubripes]
Length = 520
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 106/229 (46%), Positives = 139/229 (60%), Gaps = 11/229 (4%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y +IPPGW W AL+ NS+YYNYS+++NG++ KHG++Y +DY
Sbjct: 143 FAGKYLNQYGQKDAGHVGHIPPGWDHWHALVGNSQYYNYSLSVNGKEEKHGDNYGDDYLT 202
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
DLI N S+ FL P L++S PAPH P +APQ+ + ++ + ++D
Sbjct: 203 DLITNRSLTFLDNRSPQL---PFFLLLSPPAPHAPWTAAPQHQKDYADIKAPRDGSFD-K 258
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
P DK W+L Q M + D KR QTL SVDD VEK+V++LK + ELNNTYI
Sbjct: 259 PGKDKHWLLRQPANPMTASSLTYLDNAYRKRWQTLASVDDMVEKLVEKLKSIDELNNTYI 318
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
YTSD+GYH GQF L K +EFDIRVP L+RGPGI P VL
Sbjct: 319 FYTSDNGYHTGQFSLPIDKRQLYEFDIRVPLLVRGPGIAPNQTVEAPVL 367
>gi|432884623|ref|XP_004074510.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Oryzias latipes]
Length = 513
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 106/229 (46%), Positives = 142/229 (62%), Gaps = 11/229 (4%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y G ++IPPGW +W AL+ NS+YYNY++++NG++ KHG++Y DY
Sbjct: 136 FAGKYLNQYGGKDAGGVAHIPPGWDQWNALVGNSQYYNYTLSVNGKEEKHGDNYEKDYLT 195
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
DLI N S+ FL + P +++S PAPH P +APQY F NV + ++D
Sbjct: 196 DLILNRSLHFLDDRS---PQHPFFIMLSPPAPHSPWTAAPQYGKEFSNVKAPRDGSFD-K 251
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
P DK W+L Q M + D +R QTL SVDD VEK+VK+L+ + ELN+TYI
Sbjct: 252 PGKDKHWLLRQPPNPMSNSSLVYLDDAYRRRWQTLLSVDDMVEKLVKKLESVKELNDTYI 311
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
YTSD+GYH GQF L K ++FDIRVP L+RGPGI P VL
Sbjct: 312 FYTSDNGYHTGQFSLPIDKRQLYDFDIRVPLLVRGPGIKPNQTLQAPVL 360
>gi|156408455|ref|XP_001641872.1| predicted protein [Nematostella vectensis]
gi|156229012|gb|EDO49809.1| predicted protein [Nematostella vectensis]
Length = 506
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 103/218 (47%), Positives = 135/218 (61%), Gaps = 9/218 (4%)
Query: 21 YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLNKY S Y+PPGW W L+ NSKYYNY++++NG+ KHG DY DY+
Sbjct: 131 YAGKYLNKYGMSATGGTQYVPPGWDSWLGLVGNSKYYNYTVSVNGKAEKHGGDYKEDYFT 190
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+I +V FL + QS SK+P ++++ PA HGP APQY N F T+ T +Y+
Sbjct: 191 DVITRSAVSFLEE--QSSSKQPFFIMLATPACHGPFTPAPQYNNSFNKTTAPRTQSYNKP 248
Query: 135 PNPDKQWILQVT-RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
K W+L+ T M ++D + R +TL SVDD VE++V L L+NTYI
Sbjct: 249 GGKSKHWLLRSTPSPMSASSVAYSDDVFRNRWRTLLSVDDLVEQVVVTLDKHKHLSNTYI 308
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
I+TSD+GYH GQF L K +EFDIRVP L+RGPGI
Sbjct: 309 IFTSDNGYHHGQFSLPIDKRQLYEFDIRVPLLVRGPGI 346
>gi|198427210|ref|XP_002128609.1| PREDICTED: similar to N-acetylglucosamine-6-sulfatase precursor
(G6S) (Glucosamine-6-sulfatase) [Ciona intestinalis]
Length = 545
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 95/218 (43%), Positives = 142/218 (65%), Gaps = 9/218 (4%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y G ++P GW +W L+ NSKYYNY+I+ NG ++HG +Y+ DY
Sbjct: 127 FAGKYLNQYGGKSVGGPQHVPVGWNQWFGLVGNSKYYNYTISDNGVPVQHGANYHEDYLT 186
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
DL+AN SV F+ K +++ P +++S PAPH P DSAPQY+ M+ N T+ HTP+Y+
Sbjct: 187 DLLANRSVDFIHNHKMRYTQ-PFFMMISTPAPHSPWDSAPQYSKMYENNTAPHTPSYN-T 244
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q T M ++D R + L+SVDD VE+++ L + +L+NTY+
Sbjct: 245 KAVNKHWLVRQATHPMTKESMDYSDNAFRSRWRALKSVDDLVERVINALSKMKQLDNTYV 304
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
++SD+GYHLGQF L K +EFD+RVP ++RGPG+
Sbjct: 305 FFSSDNGYHLGQFSLPIDKRQLYEFDLRVPLIIRGPGV 342
>gi|402591933|gb|EJW85862.1| hypothetical protein WUBG_03229 [Wuchereria bancrofti]
Length = 256
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 86/150 (57%), Positives = 112/150 (74%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
YFGKYLN+Y GSYIPPGW W L+ NS++YNY++N+NG KIKHG DY DY+ DLIAN
Sbjct: 106 AYFGKYLNEYTGSYIPPGWDYWMGLLRNSRFYNYTVNVNGNKIKHGWDYEKDYFTDLIAN 165
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
D++ F+RQ + KP M+V+SFPAPHGPED APQY+ F +V +H T A++YAPNPDK
Sbjct: 166 DTIAFIRQLRSKDPFKPYMIVISFPAPHGPEDPAPQYSAWFQDVETHRTEAWNYAPNPDK 225
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQ 169
QW+LQ T +M+PVH FTD+ + + +
Sbjct: 226 QWLLQHTGRMEPVHVIFTDVYIDDDCKHCK 255
>gi|41054147|ref|NP_956135.1| glucosamine (N-acetyl)-6-sulfatase b precursor [Danio rerio]
gi|28277517|gb|AAH45323.1| Glucosamine (N-acetyl)-6-sulfatase (Sanfilippo disease IIID), b
[Danio rerio]
gi|182891244|gb|AAI64157.1| Gnsb protein [Danio rerio]
Length = 347
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 99/218 (45%), Positives = 136/218 (62%), Gaps = 11/218 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN+Y +++PPGW +W AL+ NSKYYNY++++NG++ KHG+ Y DY
Sbjct: 131 YCGKYLNQYGSKDAGGVAHVPPGWDQWHALVGNSKYYNYTLSVNGKEEKHGDSYEKDYLT 190
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
DL+ N S+ FL + S P +++ PAPH P +APQY+ F V + +++
Sbjct: 191 DLVLNRSLHFLEERSPS---HPFFMMLCPPAPHSPWTAAPQYSGSFSGVKAPRNGSFN-K 246
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
P DK W+L Q M + D +R QTL SVDD VE+++K+L + EL+NTYI
Sbjct: 247 PGTDKHWLLRQPANPMPNSSIDYLDNAFRRRWQTLLSVDDLVERLLKKLDSVKELDNTYI 306
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
YTSDHGYH GQF L K +EFDIR+P L+RGPGI
Sbjct: 307 FYTSDHGYHTGQFSLPIDKRQLYEFDIRIPLLVRGPGI 344
>gi|326911518|ref|XP_003202105.1| PREDICTED: n-acetylglucosamine-6-sulfatase-like [Meleagris
gallopavo]
Length = 488
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 143/220 (65%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y S++PPGW W AL NSKYYNY++++NG+ +HGE+Y DY
Sbjct: 80 FAGKYLNEYGAEDAGGVSHVPPGWSFWYALEKNSKYYNYTLSVNGKARRHGENYSVDYLT 139
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL + K +F +P +++S PAPH P +APQY N F NV++ ++
Sbjct: 140 DVLANMSLDFL-EYKSNF--EPFFMMISTPAPHSPWTAAPQYKNSFQNVSAPRNSNFNIH 196
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK+L++ GEL+NTYI
Sbjct: 197 -GKNKHWLIRQAKTPMTNSSIQFLDDAYRKRWQTLLSVDDLVEKLVKKLENHGELDNTYI 255
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+G+H GQF L K +EFDI+VP L+RGPGI P
Sbjct: 256 FYTSDNGFHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 295
>gi|47228544|emb|CAG05364.1| unnamed protein product [Tetraodon nigroviridis]
Length = 473
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 108/239 (45%), Positives = 143/239 (59%), Gaps = 16/239 (6%)
Query: 2 AWPVELTSRNLLYFDISHGYFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSI 55
A+PV L+ F + GKYLN+Y YIPPGW +W AL+ NS+YYNYS+
Sbjct: 84 AFPVLLSKLGYQTF-----FAGKYLNQYGMKEAGGVDYIPPGWDQWHALVGNSQYYNYSL 138
Query: 56 NMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ 115
++NG++ KHG++Y DY DLI N S+ FL P L++S PAPH P +APQ
Sbjct: 139 SVNGKEEKHGDNYSADYLTDLITNRSLNFLDTRS---PVHPFFLMLSPPAPHAPWTAAPQ 195
Query: 116 YANMFFNVTSHHTPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
Y F +V + ++D P DK W+L Q M + D KR QTL SVDD
Sbjct: 196 YQKDFGDVKAPRDGSFD-KPGKDKHWLLRQPANPMTASSLAYLDNAYRKRWQTLVSVDDM 254
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
VE++VK+L + ELN+TYI +TSD+GYH GQF L K +EFDI+VP ++RGPGI P
Sbjct: 255 VEQLVKKLDSINELNSTYIFFTSDNGYHTGQFSLPIDKRQLYEFDIKVPLMVRGPGIAP 313
>gi|313760582|ref|NP_001186488.1| N-acetylglucosamine-6-sulfatase precursor [Gallus gallus]
Length = 538
Score = 189 bits (481), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 142/220 (64%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y S++PPGW W AL NSKYYNY++++NG+ +HGE+Y DY
Sbjct: 130 FAGKYLNEYGAEDAGGVSHVPPGWSFWYALEKNSKYYNYTLSVNGKARRHGENYSVDYLT 189
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL + K +F +P +++S PAPH P +APQY N F NV++ ++
Sbjct: 190 DVLANMSLDFL-EYKSNF--EPFFMMISTPAPHSPWTAAPQYKNDFLNVSAPRNSNFNIH 246
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK+L+ GEL+NTYI
Sbjct: 247 -GKNKHWLIRQAKTPMTNSSIQFLDDAYRKRWQTLLSVDDLVEKLVKKLEIHGELDNTYI 305
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+G+H GQF L K +EFDI+VP L+RGPGI P
Sbjct: 306 FYTSDNGFHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 345
>gi|348504250|ref|XP_003439675.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Oreochromis
niloticus]
Length = 524
Score = 189 bits (481), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 105/229 (45%), Positives = 139/229 (60%), Gaps = 11/229 (4%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y +IPPGW +W AL+ NS+YYNY++++NG++ KHG+ Y +DY
Sbjct: 145 FSGKYLNQYGSKAAGSVGHIPPGWDQWHALVGNSQYYNYTLSVNGKEEKHGDSYEDDYLT 204
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
DLI N S+ FL + P L++S PA H P +APQY F +V + +++
Sbjct: 205 DLILNRSLHFLDDRS---PQHPFFLMLSPPACHSPWTAAPQYQKEFKDVQAPRDGSFN-K 260
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
P DK W+L Q M F D KR QTL SVDD VEK+VK+L + ELNNTYI
Sbjct: 261 PGKDKHWLLRQPLNPMPNSSVTFLDNAYRKRWQTLLSVDDMVEKLVKKLDSIKELNNTYI 320
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
YTSD+GYH GQF L K +EFDIR+P ++RGPGI P + VL
Sbjct: 321 FYTSDNGYHTGQFSLPIDKRQLYEFDIRIPLMVRGPGIKPKQVLQAPVL 369
>gi|348521278|ref|XP_003448153.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Oreochromis
niloticus]
Length = 532
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 107/220 (48%), Positives = 135/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y S ++P GW W L NSKYYNY++++NG+ KHG DY DY
Sbjct: 126 FAGKYLNQYGHSEAGGVEHVPLGWSYWVGLDKNSKYYNYTLSVNGKAQKHGADYSKDYLT 185
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
DL+AN S+ FL Q K S+ +P +++S PAPH P +APQY N F N + P ++
Sbjct: 186 DLLANMSLDFL-QYKTSY--QPFFMMVSTPAPHSPWTAAPQYQNSFKNTKAPRDPNFNIH 242
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
DK W++ Q M QF D R QTL SVDD VEKIVK L+ GELNNTYI
Sbjct: 243 -GKDKHWLIRQAKTPMSNSSIQFLDDAFRGRWQTLLSVDDLVEKIVKRLEVRGELNNTYI 301
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
+TSD+GYH GQF L K +EFDIRVP ++RGP I P
Sbjct: 302 FFTSDNGYHTGQFSLPVDKRQLYEFDIRVPLMVRGPNIKP 341
>gi|345324687|ref|XP_001507725.2| PREDICTED: N-acetylglucosamine-6-sulfatase, partial
[Ornithorhynchus anatinus]
Length = 488
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 103/220 (46%), Positives = 134/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++PPGW W AL NSKYYNY++++NG+ +HGE+Y DY
Sbjct: 82 FAGKYLNEYGAEGAGGLEHVPPGWSYWFALEKNSKYYNYTLSINGKARRHGENYSQDYLT 141
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 142 DVLANVSLDFLESKSNS---EPFFMMISTPAPHSPWIAAPQYEKSFQNVNAPRNSNFNIH 198
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR +TL SVDD VEK+VK L GELNNTYI
Sbjct: 199 -GKNKHWLIRQAKTPMTNSSIQFLDDAFRKRWRTLLSVDDLVEKLVKRLDFRGELNNTYI 257
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 258 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 297
>gi|313230182|emb|CBY07886.1| unnamed protein product [Oikopleura dioica]
Length = 526
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 143/227 (62%), Gaps = 24/227 (10%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNG+Y+P GW +W AL NS+YYNY+ G K + +Y+ D+I
Sbjct: 136 AFFGKYLNEYNGTYVPKGWSKWYALRGNSRYYNYNF---GNKPE-------EYFTDVITK 185
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFN--------VTSHHTPAY 131
+++ + + +P+++V++FPAPHGPED+ P Y + + +T+H +
Sbjct: 186 NALEYFEANADK--PEPLLMVLNFPAPHGPEDAHPDYRHYYDEPSLEQQSILTAHRELVF 243
Query: 132 D---YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
+ + P +K W L + M FTDLL +RLQTL SVD+ +E+I++ +++ +
Sbjct: 244 NRTSWQPG-EKHWFLTSQKDMTEGDGYFTDLLQRRRLQTLFSVDNGIERIIRLVEEKNAI 302
Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
NTY +YTSDHGYHLGQFG+VKGK+ P++FD R+PF MRGP I G+
Sbjct: 303 ENTYFLYTSDHGYHLGQFGVVKGKALPYDFDSRIPFYMRGPSIPAGS 349
>gi|432093890|gb|ELK25743.1| N-acetylglucosamine-6-sulfatase [Myotis davidii]
Length = 476
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 137/220 (62%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y Y+PPGW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 70 FAGKYLNEYGAPAAGGLEYVPPGWSYWVALEKNSKYYNYTLSVNGQARKHGENYSVDYLT 129
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P +++S PAPH P +APQY F NV + ++
Sbjct: 130 DVLANVSLDFL-DYKSNF--EPFFMMISTPAPHSPWIAAPQYQKTFQNVFAPRNKNFNIH 186
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
K W++ Q M QF D KR QTL SVDD VEK++K L+ +GEL+NTYI
Sbjct: 187 -GTSKHWLIRQAKTPMTNSSIQFLDSAFRKRWQTLLSVDDLVEKLIKRLEFIGELDNTYI 245
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 246 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 285
>gi|313218282|emb|CBY41541.1| unnamed protein product [Oikopleura dioica]
Length = 509
Score = 187 bits (475), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 143/227 (62%), Gaps = 24/227 (10%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNG+Y+P GW +W AL NS+YYNY+ G K + +Y+ D+I
Sbjct: 136 AFFGKYLNEYNGTYVPKGWSKWYALRGNSRYYNYNF---GNKPE-------EYFTDVITK 185
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFN--------VTSHHTPAY 131
+++ + + +P+++V++FPAPHGPED+ P Y + + +T+H +
Sbjct: 186 NALEYFEANADK--PEPLLMVLNFPAPHGPEDAHPDYRHYYDEPSLEQQSILTAHRELVF 243
Query: 132 D---YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
+ + P +K W L + M FTDLL +RLQTL SVD+ +E+I++ +++ +
Sbjct: 244 NRTSWQPG-EKHWFLTSQKDMTEGDGYFTDLLQRRRLQTLFSVDNGIERIIRLVEEKNAI 302
Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
NTY +YTSDHGYHLGQFG+VKGK+ P++FD R+PF MRGP I G+
Sbjct: 303 ENTYFLYTSDHGYHLGQFGVVKGKALPYDFDSRIPFYMRGPSIPAGS 349
>gi|166158272|ref|NP_001107314.1| glucosamine (N-acetyl)-6-sulfatase precursor [Xenopus (Silurana)
tropicalis]
gi|161612194|gb|AAI55715.1| LOC100135105 protein [Xenopus (Silurana) tropicalis]
Length = 533
Score = 187 bits (475), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 105/230 (45%), Positives = 142/230 (61%), Gaps = 13/230 (5%)
Query: 13 LYFDISHGYF--GKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKH 64
L F I + F GKYLN+Y +++PPGW W AL NSKYYNY+++ NGR H
Sbjct: 119 LLFSIGYQTFFAGKYLNQYGSEEAGGINHVPPGWSYWFALEKNSKYYNYTLSENGRPKTH 178
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
GE+Y DY D++AN S+ FL K +++ P ++++ PAPH P +APQY F NV+
Sbjct: 179 GENYSQDYLTDVLANVSLDFLNY-KSNYA--PFFMMIATPAPHSPWTAAPQYETAFLNVS 235
Query: 125 SHHTPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
+ ++ DK W++ Q M Q+ D KR QTL SVDD VEK+VK+L+
Sbjct: 236 APRGSNFN-VHGKDKHWLIRQAKSPMSNSSIQYLDNAFQKRWQTLLSVDDLVEKLVKQLE 294
Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
GEL+NT+I +TSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 295 ARGELDNTFIFFTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 344
>gi|417515891|gb|JAA53750.1| N-acetylglucosamine-6-sulfatase precursor [Sus scrofa]
Length = 552
Score = 187 bits (474), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y +++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLAHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY N F NV + ++
Sbjct: 206 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQNTFQNVFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361
>gi|335288215|ref|XP_003355555.1| PREDICTED: N-acetylglucosamine-6-sulfatase, partial [Sus scrofa]
Length = 366
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y +++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLAHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY N F NV + ++
Sbjct: 206 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQNTFQNVFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361
>gi|296487655|tpg|DAA29768.1| TPA: glucosamine (N-acetyl)-6-sulfatase precursor [Bos taurus]
Length = 560
Score = 186 bits (473), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 135/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 154 FAGKYLNEYGAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 213
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY N F NV + ++
Sbjct: 214 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQNAFQNVFAPRNKNFNIH 270
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 271 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 329
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 330 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 369
>gi|115497422|ref|NP_001069030.1| N-acetylglucosamine-6-sulfatase precursor [Bos taurus]
gi|122134311|sp|Q1LZH9.1|GNS_BOVIN RecName: Full=N-acetylglucosamine-6-sulfatase; AltName:
Full=Glucosamine-6-sulfatase; Short=G6S; Flags:
Precursor
gi|94534741|gb|AAI15991.1| Glucosamine (N-acetyl)-6-sulfatase [Bos taurus]
Length = 560
Score = 186 bits (473), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 135/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 154 FAGKYLNEYGAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 213
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY N F NV + ++
Sbjct: 214 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQNAFQNVFAPRNKNFNIH 270
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 271 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 329
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 330 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 369
>gi|167520137|ref|XP_001744408.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777494|gb|EDQ91111.1| predicted protein [Monosiga brevicollis MX1]
Length = 486
Score = 186 bits (472), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 97/254 (38%), Positives = 144/254 (56%), Gaps = 20/254 (7%)
Query: 21 YFGKYLNKYN--------------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGE 66
Y GKYLN Y + IPPGW W L NSKYYNY+++ +G + HG+
Sbjct: 107 YAGKYLNDYALPGSPNCTHETPEACAAIPPGWDRWLGLFGNSKYYNYTVSDDGVPVDHGD 166
Query: 67 DYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSH 126
DY NDY+PDL+AN ++ F+ ++ + +PI+ V ++P PHGP AP A + +
Sbjct: 167 DYTNDYFPDLVANRTIGFIYEAARVAPDRPILAVNAWPTPHGPHPPAPWAAGRYAGYQAP 226
Query: 127 HTPAYDYAPN--PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
TP+Y+ + K + ++ + P Q D+ +R +TL SVD+ VE IV+ L D
Sbjct: 227 RTPSYNASSEYMATKNYFMRQLDMIWPEDAQSIDVNYQQRWETLLSVDNHVESIVQALAD 286
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL-- 242
G LNNT+IIY SDHG+ LGQ L+ K +E DIR+PF++RGPG+ P + VL
Sbjct: 287 TGRLNNTFIIYASDHGFQLGQHRLLGDKRHQYENDIRIPFIVRGPGVPPNSTETRTVLNI 346
Query: 243 --IPQVRKFSSGSL 254
P + + +G++
Sbjct: 347 DIAPTLTELGTGAV 360
>gi|426224775|ref|XP_004006544.1| PREDICTED: N-acetylglucosamine-6-sulfatase [Ovis aries]
Length = 552
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 135/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY N F N+ + ++
Sbjct: 206 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQNAFQNIFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361
>gi|47218297|emb|CAG04129.1| unnamed protein product [Tetraodon nigroviridis]
Length = 532
Score = 185 bits (470), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 102/218 (46%), Positives = 134/218 (61%), Gaps = 11/218 (5%)
Query: 21 YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y + ++PPGW W L NSKYYNY++++NG+ KHG DY DY
Sbjct: 136 FAGKYLNQYGHAEAGGMEHVPPGWNVWVGLEKNSKYYNYTLSVNGKAQKHGSDYNKDYLT 195
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL Q K SF +P +++S PAPH P +APQY N F + P ++
Sbjct: 196 DVLANMSLDFL-QYKSSF--QPFFMMVSTPAPHSPWTAAPQYQNSFNGTKAPRDPNFNVH 252
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
DK W++ Q M Q D R +TL SVDD VEK+VK L+ GEL+NTYI
Sbjct: 253 -GTDKHWLIRQAKTPMSNSSVQLLDDAFRSRWRTLLSVDDLVEKLVKRLEIRGELDNTYI 311
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
I+TSD+GYH GQF L K +EFDIRVP ++RGP I
Sbjct: 312 IFTSDNGYHTGQFSLPLDKRQLYEFDIRVPLMVRGPNI 349
>gi|291389495|ref|XP_002711358.1| PREDICTED: glucosamine (N-acetyl)-6-sulfatase [Oryctolagus
cuniculus]
Length = 554
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 148 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 207
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 208 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 264
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 265 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 323
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 324 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 363
>gi|62898700|dbj|BAD97204.1| glucosamine (N-acetyl)-6-sulfatase precursor variant [Homo sapiens]
Length = 552
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVCAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361
>gi|1707906|sp|P50426.1|GNS_CAPHI RecName: Full=N-acetylglucosamine-6-sulfatase; AltName:
Full=Glucosamine-6-sulfatase; Short=G6S; Flags:
Precursor
gi|841158|gb|AAB01663.1| N-acetylglucosamine 6-sulfatase precursor [Capra hircus]
gi|1580855|prf||2116212A GlcNAc 6-sulfatase
Length = 559
Score = 184 bits (468), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 135/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 153 FAGKYLNEYGAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 212
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY N F NV + ++
Sbjct: 213 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQNAFQNVFAPRNKNFNIH 269
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D +R QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 270 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRERWQTLLSVDDLVEKLVKRLEFNGELNNTYI 328
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 329 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 368
>gi|313227555|emb|CBY22702.1| unnamed protein product [Oikopleura dioica]
Length = 824
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 98/239 (41%), Positives = 141/239 (58%), Gaps = 17/239 (7%)
Query: 15 FDISHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRK------------I 62
F FGKYLN+Y+GSY+P GW W L+ NS++YNY + ++G I
Sbjct: 136 FQYKTAMFGKYLNEYDGSYVPSGWDTWHGLVHNSRFYNYKVRVHGWSENNTDINADNSFI 195
Query: 63 KHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF-F 121
+ +D DY+ D+IAN S +++R + P +LV++FP PHGP D+AP++ F +
Sbjct: 196 ERHKDQPLDYFTDVIANRSSQWVRDQHRLDPDSPKLLVVNFPGPHGPADAAPRHTEAFKY 255
Query: 122 NVTSHHTPAYDY---APNPDKQWILQVTR-KMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
S + + N + W L+ T+ K+ +QF L KRLQTL S+D+AV+K
Sbjct: 256 EDESQLKKSRAFNNVDDNEHRNWFLRATKEKLTSEEQQFGGQLRRKRLQTLLSLDEAVKK 315
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+ KDLG L TY+I+TSDHG+H+GQFG+VKGKS P+EFD+RVP + G I T
Sbjct: 316 VYFTFKDLGLLEKTYMIFTSDHGFHIGQFGMVKGKSTPYEFDVRVPMYVLGKDIKRTTQ 374
>gi|71834556|ref|NP_001025379.1| N-acetylglucosamine-6-sulfatase precursor [Danio rerio]
gi|66910454|gb|AAH97128.1| Zgc:114066 [Danio rerio]
Length = 538
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 97/220 (44%), Positives = 137/220 (62%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NGR +HG++Y DY
Sbjct: 132 FAGKYLNEYGSKKAGGVEHVPLGWDHWFALERNSKYYNYTLSVNGRAQRHGQNYSEDYLT 191
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL +++P +++S PAPH P +APQY + F ++ + P ++
Sbjct: 192 DVLANVSIDFLENKS---NRRPFFMMVSTPAPHSPWTAAPQYDSSFPDLKAPRDPNFNIH 248
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
DK W++ Q M +F D KR +TL SVDD VEK+V++L GEL+NTY+
Sbjct: 249 -GKDKHWLIRQAKTPMSNASVEFLDNAYRKRWRTLLSVDDLVEKLVRKLDIRGELSNTYV 307
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
I+TSD+GYH GQF L K +EFDIRVP L+RGP I P
Sbjct: 308 IFTSDNGYHTGQFSLPMDKRQLYEFDIRVPLLVRGPNIKP 347
>gi|410918631|ref|XP_003972788.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Takifugu rubripes]
Length = 542
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 102/218 (46%), Positives = 134/218 (61%), Gaps = 11/218 (5%)
Query: 21 YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y + +IPPGW W L NSKYYNY++++NG+ KHG DY DY
Sbjct: 136 FAGKYLNQYGHAQAGGVEHIPPGWSFWVGLEKNSKYYNYTLSVNGKAQKHGSDYSKDYLT 195
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL Q K S+ +P +++S PAPH P +APQY N F + P ++
Sbjct: 196 DVLANMSLEFL-QYKSSY--QPFFMMVSTPAPHSPWTAAPQYQNSFNGTKAPRDPNFNVH 252
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
DK W++ Q M QF D KR +TL SVDD VE +VK L+ GEL+NTYI
Sbjct: 253 -GKDKHWLIRQAKTPMSNSSVQFLDDAFRKRWRTLLSVDDLVETLVKRLEIRGELDNTYI 311
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
+TSD+GYH GQF L K +EFDIRVP ++RGP I
Sbjct: 312 FFTSDNGYHTGQFSLPLDKRQLYEFDIRVPLMVRGPKI 349
>gi|432943754|ref|XP_004083269.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Oryzias latipes]
Length = 539
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 137/220 (62%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y + ++PPGW W AL NSKYYNY++++NG+ KHG DY DY
Sbjct: 133 FAGKYLNQYGRADAGGVQHVPPGWSYWFALEKNSKYYNYTMSVNGKAQKHGADYSRDYLT 192
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++ N S+ FL Q K S+ +P +++S PAPH P +APQY + F + P ++
Sbjct: 193 DVLTNVSLDFL-QYKSSY--QPFFMMVSTPAPHSPWTAAPQYQDRFNATKAPREPNFNIH 249
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEKIVK+L+ GEL+NTYI
Sbjct: 250 -GANKHWLIRQAKTPMTNSSIQFLDEAFRKRWQTLLSVDDLVEKIVKKLEVQGELDNTYI 308
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
+TSD+GYH GQF L K +EFDI+VP ++RGP I P
Sbjct: 309 FFTSDNGYHTGQFSLPLDKRQLYEFDIKVPLMVRGPNIKP 348
>gi|410965016|ref|XP_003989048.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 1 [Felis catus]
Length = 553
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 147 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 206
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 207 DVLANISLGFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 263
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 264 -GTNKHWLIRQAKTPMTNSSIQFLDSAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 322
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 323 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 362
>gi|417402596|gb|JAA48140.1| Putative sulfatase [Desmodus rotundus]
Length = 548
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 142 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSMDYLT 201
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 202 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQQTFQNVFAPRNKNFNIH 258
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 259 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 317
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 318 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 357
>gi|81158070|tpe|CAI85009.1| TPA: glucosamine (N-acetyl)-6-sulfatase [Canis lupus familiaris]
Length = 488
Score = 184 bits (466), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 82 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 141
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 142 DVLANISLGFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 198
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 199 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 257
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 258 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 297
>gi|410965018|ref|XP_003989049.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 2 [Felis catus]
Length = 533
Score = 184 bits (466), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 127 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 186
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 187 DVLANISLGFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 243
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 244 -GTNKHWLIRQAKTPMTNSSIQFLDSAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 302
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 303 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 342
>gi|395852131|ref|XP_003798594.1| PREDICTED: N-acetylglucosamine-6-sulfatase [Otolemur garnettii]
Length = 548
Score = 184 bits (466), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 103/220 (46%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHG +Y DY
Sbjct: 142 FAGKYLNEYGAPAAGGLEHVPLGWSYWFALEKNSKYYNYTLSINGKARKHGANYSVDYLT 201
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 202 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 258
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+V+ L+ GELNNTYI
Sbjct: 259 -GTNKHWLIRQAKTPMTNSSIQFLDDAFRKRWQTLLSVDDLVEKLVRRLESTGELNNTYI 317
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
IYTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 318 IYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIRP 357
>gi|281341126|gb|EFB16710.1| hypothetical protein PANDA_008787 [Ailuropoda melanoleuca]
Length = 527
Score = 184 bits (466), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 206 DVLANISLGFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 263 -GANKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361
>gi|338726197|ref|XP_001492201.3| PREDICTED: LOW QUALITY PROTEIN: n-acetylglucosamine-6-sulfatase
[Equus caballus]
Length = 554
Score = 184 bits (466), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 148 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 207
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 208 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 264
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 265 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 323
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 324 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 363
>gi|301769353|ref|XP_002920093.1| PREDICTED: n-acetylglucosamine-6-sulfatase-like [Ailuropoda
melanoleuca]
Length = 552
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 206 DVLANISLGFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 263 -GANKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361
>gi|359320559|ref|XP_531660.3| PREDICTED: N-acetylglucosamine-6-sulfatase [Canis lupus familiaris]
Length = 469
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 63 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 122
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 123 DVLANISLGFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 179
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 180 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 238
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 239 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 278
>gi|297692359|ref|XP_002823525.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 1 [Pongo abelii]
Length = 584
Score = 183 bits (465), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 238 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 294
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 353
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393
>gi|26350659|dbj|BAC38966.1| unnamed protein product [Mus musculus]
Length = 544
Score = 183 bits (465), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 134/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y +IP GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 138 FAGKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 197
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH 254
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M +F D +R QTL SVDD VEK+VK L GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
+YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 314 LYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353
>gi|119617545|gb|EAW97139.1| glucosamine (N-acetyl)-6-sulfatase (Sanfilippo disease IIID),
isoform CRA_b [Homo sapiens]
Length = 496
Score = 183 bits (465), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 90 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 149
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 150 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 206
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 207 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 265
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 266 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 305
>gi|31873233|emb|CAD97608.1| hypothetical protein [Homo sapiens]
Length = 496
Score = 183 bits (465), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 90 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 149
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 150 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 206
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 207 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 265
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 266 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 305
>gi|148234488|ref|NP_001080261.1| glucosamine (N-acetyl)-6-sulfatase [Xenopus laevis]
gi|27503350|gb|AAH42274.1| Gns-prov protein [Xenopus laevis]
Length = 547
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 98/223 (43%), Positives = 137/223 (61%), Gaps = 11/223 (4%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y +++PPGW W AL NSKYYNY+++ NGR HGE+Y DY
Sbjct: 141 FAGKYLNQYGSEEAGGINHVPPGWSYWFALEKNSKYYNYTLSENGRPKTHGENYSQDYLT 200
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+++N S+ FL + +P ++++ PAPH P +APQY N F NV++ ++
Sbjct: 201 DVLSNVSLDFLNYKS---NHEPFFMMIATPAPHSPWTAAPQYENAFLNVSAPRGGNFN-V 256
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
DK W++ Q M Q+ D KR QTL SVDD VE ++K+L+ GEL+NT+I
Sbjct: 257 HGKDKHWLIRQAKSPMSNSSIQYLDHAFRKRWQTLLSVDDLVENLLKQLEVRGELDNTFI 316
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+TSD+GYH GQF L K +EFDI+VP L+RGPGI P +
Sbjct: 317 FFTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKPNQI 359
>gi|410046400|ref|XP_003952183.1| PREDICTED: N-acetylglucosamine-6-sulfatase [Pan troglodytes]
Length = 584
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 238 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 294
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 353
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393
>gi|21321594|gb|AAM47270.1|AF510336_1 N-acetylglucosamine-6-sulfatase [Equus caballus]
Length = 427
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 134/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 147 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 206
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 207 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 263
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 264 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 322
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 323 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 362
>gi|426373316|ref|XP_004053552.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 3 [Gorilla
gorilla gorilla]
Length = 583
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 238 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 294
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 353
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393
>gi|397508844|ref|XP_003824850.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 3 [Pan paniscus]
gi|194385338|dbj|BAG65046.1| unnamed protein product [Homo sapiens]
Length = 584
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 238 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 294
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 353
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393
>gi|332207367|ref|XP_003252768.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 3 [Nomascus
leucogenys]
Length = 584
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 238 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 294
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 353
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393
>gi|297692361|ref|XP_002823526.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 2 [Pongo abelii]
Length = 552
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361
>gi|297262879|ref|XP_001116940.2| PREDICTED: n-acetylglucosamine-6-sulfatase isoform 1 [Macaca
mulatta]
Length = 584
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 238 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 294
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 353
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393
>gi|4504061|ref|NP_002067.1| N-acetylglucosamine-6-sulfatase precursor [Homo sapiens]
gi|55638455|ref|XP_522454.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 5 [Pan
troglodytes]
gi|397508840|ref|XP_003824848.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 1 [Pan paniscus]
gi|232126|sp|P15586.3|GNS_HUMAN RecName: Full=N-acetylglucosamine-6-sulfatase; AltName:
Full=Glucosamine-6-sulfatase; Short=G6S; Flags:
Precursor
gi|31867|emb|CAA78164.1| N-acetylglucosamine-6-sulphatase [Homo sapiens]
gi|15214696|gb|AAH12482.1| Glucosamine (N-acetyl)-6-sulfatase [Homo sapiens]
gi|119617544|gb|EAW97138.1| glucosamine (N-acetyl)-6-sulfatase (Sanfilippo disease IIID),
isoform CRA_a [Homo sapiens]
gi|119617546|gb|EAW97140.1| glucosamine (N-acetyl)-6-sulfatase (Sanfilippo disease IIID),
isoform CRA_a [Homo sapiens]
gi|325463309|gb|ADZ15425.1| glucosamine (N-acetyl)-6-sulfatase [synthetic construct]
gi|410264478|gb|JAA20205.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
gi|410264480|gb|JAA20206.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
gi|410264482|gb|JAA20207.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
gi|410264484|gb|JAA20208.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
gi|410264486|gb|JAA20209.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
gi|410264488|gb|JAA20210.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
gi|410307956|gb|JAA32578.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
gi|410307958|gb|JAA32579.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
gi|410335013|gb|JAA36453.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
gi|410335015|gb|JAA36454.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
gi|410335017|gb|JAA36455.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
gi|410335019|gb|JAA36456.1| glucosamine (N-acetyl)-6-sulfatase [Pan troglodytes]
Length = 552
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361
>gi|355564432|gb|EHH20932.1| hypothetical protein EGK_03884 [Macaca mulatta]
Length = 584
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 238 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 294
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 353
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393
>gi|332207363|ref|XP_003252766.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 1 [Nomascus
leucogenys]
Length = 552
Score = 183 bits (464), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361
>gi|426373312|ref|XP_004053550.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 1 [Gorilla
gorilla gorilla]
Length = 551
Score = 183 bits (464), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361
>gi|194390666|dbj|BAG62092.1| unnamed protein product [Homo sapiens]
Length = 469
Score = 183 bits (464), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 63 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 122
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 123 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 179
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 180 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 238
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 239 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 278
>gi|158256974|dbj|BAF84460.1| unnamed protein product [Homo sapiens]
Length = 552
Score = 183 bits (464), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361
>gi|380787195|gb|AFE65473.1| N-acetylglucosamine-6-sulfatase precursor [Macaca mulatta]
gi|383411757|gb|AFH29092.1| N-acetylglucosamine-6-sulfatase precursor [Macaca mulatta]
gi|384939736|gb|AFI33473.1| N-acetylglucosamine-6-sulfatase precursor [Macaca mulatta]
Length = 552
Score = 183 bits (464), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361
>gi|297692363|ref|XP_002823527.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 3 [Pongo abelii]
Length = 532
Score = 183 bits (464), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 126 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 185
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 186 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 242
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 243 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 301
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 302 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 341
>gi|332207365|ref|XP_003252767.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 2 [Nomascus
leucogenys]
Length = 532
Score = 183 bits (464), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 126 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 185
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 186 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 242
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 243 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 301
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 302 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 341
>gi|395540906|ref|XP_003772391.1| PREDICTED: N-acetylglucosamine-6-sulfatase [Sarcophilus harrisii]
Length = 556
Score = 183 bits (464), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 101/218 (46%), Positives = 134/218 (61%), Gaps = 11/218 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++PPGW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 149 FAGKYLNEYGALDAGGVEHVPPGWSYWYALEKNSKYYNYTLSVNGKARKHGENYSLDYLT 208
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV++ ++
Sbjct: 209 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWIAAPQYQKSFQNVSAPRNSNFNIH 265
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+++ L GELNNT+I
Sbjct: 266 -GKNKHWLIRQAKTPMTNSSIQFLDEAFRKRWQTLLSVDDLVEKLIRRLDFHGELNNTFI 324
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
YTSD+GYH GQF L K +EFDI+VP L+RGPGI
Sbjct: 325 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGI 362
>gi|449481578|ref|XP_004176153.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylglucosamine-6-sulfatase
[Taeniopygia guttata]
Length = 533
Score = 183 bits (464), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 101/220 (45%), Positives = 139/220 (63%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++PPGW W AL NSKYYNY++++NG+ +HGE+Y DY
Sbjct: 128 FAGKYLNEYGAEDAGGVGHVPPGWSFWYALEKNSKYYNYTLSVNGKARRHGENYSVDYLT 187
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL + K +F +P ++++ PAPH P APQY F NV++ ++
Sbjct: 188 DVLANMSLDFL-EYKSNF--EPFFMMIATPAPHSPWTPAPQYTKSFQNVSAPRNSNFNIH 244
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD +EK+VK+L+ GEL+NTYI
Sbjct: 245 -GKNKHWLIRQAKTPMSNSSIQFLDDAYRKRWQTLLSVDDLIEKLVKKLEVNGELDNTYI 303
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+G+H GQF L K +EFDI+VP L+RGPGI P
Sbjct: 304 FYTSDNGFHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 343
>gi|397508842|ref|XP_003824849.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 2 [Pan paniscus]
gi|410046398|ref|XP_003952182.1| PREDICTED: N-acetylglucosamine-6-sulfatase [Pan troglodytes]
gi|194379548|dbj|BAG63740.1| unnamed protein product [Homo sapiens]
Length = 532
Score = 183 bits (464), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 126 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 185
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 186 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 242
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 243 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 301
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 302 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 341
>gi|297262881|ref|XP_002798713.1| PREDICTED: n-acetylglucosamine-6-sulfatase isoform 2 [Macaca
mulatta]
Length = 532
Score = 183 bits (464), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 126 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 185
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 186 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 242
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 243 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 301
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 302 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 341
>gi|426373314|ref|XP_004053551.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 2 [Gorilla
gorilla gorilla]
Length = 531
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 126 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 185
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 186 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 242
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 243 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 301
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 302 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 341
>gi|148692469|gb|EDL24416.1| glucosamine (N-acetyl)-6-sulfatase, isoform CRA_a [Mus musculus]
Length = 614
Score = 182 bits (463), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 133/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y +IP GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 138 FAGKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 197
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH 254
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M +F D +R QTL SVDD VEK+VK L GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 314 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353
>gi|148692470|gb|EDL24417.1| glucosamine (N-acetyl)-6-sulfatase, isoform CRA_b [Mus musculus]
Length = 574
Score = 182 bits (463), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 133/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y +IP GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 168 FAGKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 227
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 228 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH 284
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M +F D +R QTL SVDD VEK+VK L GEL+NTYI
Sbjct: 285 -GTNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 343
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 344 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 383
>gi|74208040|dbj|BAE29132.1| unnamed protein product [Mus musculus]
Length = 404
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 133/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y +IP GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 138 FAGKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 197
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH 254
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M +F D +R QTL SVDD VEK+VK L GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 314 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353
>gi|355786273|gb|EHH66456.1| hypothetical protein EGM_03453, partial [Macaca fascicularis]
Length = 436
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361
>gi|29789239|ref|NP_083640.1| N-acetylglucosamine-6-sulfatase precursor [Mus musculus]
gi|81873637|sp|Q8BFR4.1|GNS_MOUSE RecName: Full=N-acetylglucosamine-6-sulfatase; AltName:
Full=Glucosamine-6-sulfatase; Short=G6S; Flags:
Precursor
gi|26326771|dbj|BAC27129.1| unnamed protein product [Mus musculus]
gi|26339830|dbj|BAC33578.1| unnamed protein product [Mus musculus]
gi|33243973|gb|AAH55328.1| Glucosamine (N-acetyl)-6-sulfatase [Mus musculus]
gi|74138505|dbj|BAE38063.1| unnamed protein product [Mus musculus]
gi|74143704|dbj|BAE41197.1| unnamed protein product [Mus musculus]
gi|74218111|dbj|BAE42031.1| unnamed protein product [Mus musculus]
Length = 544
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 133/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y +IP GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 138 FAGKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 197
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH 254
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M +F D +R QTL SVDD VEK+VK L GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 314 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353
>gi|26343951|dbj|BAC35632.1| unnamed protein product [Mus musculus]
Length = 544
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 133/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y +IP GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 138 FAGKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 197
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH 254
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M +F D +R QTL SVDD VEK+VK L GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 314 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353
>gi|74198008|dbj|BAE35186.1| unnamed protein product [Mus musculus]
Length = 544
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 102/218 (46%), Positives = 132/218 (60%), Gaps = 11/218 (5%)
Query: 23 GKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
GKYLN+Y +IP GW W AL NSKYYNY++++NG+ KHGE+Y DY D+
Sbjct: 140 GKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLTDV 199
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 200 LANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH-G 255
Query: 137 PDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
+K W++ Q M +F D +R QTL SVDD VEK+VK L GEL+NTYI Y
Sbjct: 256 TNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYIFY 315
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
TSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 316 TSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353
>gi|431892015|gb|ELK02462.1| N-acetylglucosamine-6-sulfatase [Pteropus alecto]
Length = 550
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 103/218 (47%), Positives = 133/218 (61%), Gaps = 11/218 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 206 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK+VK L+ GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
YTSD+GYH GQF L K +EFDI+VP L+RGPGI
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGI 359
>gi|74181578|dbj|BAE30054.1| unnamed protein product [Mus musculus]
Length = 451
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 133/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y +IP GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 45 FAGKYLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 104
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 105 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIH 161
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M +F D +R QTL SVDD VEK+VK L GEL+NTYI
Sbjct: 162 -GTNKHWLIRQAKTPMTNSSIRFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 220
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 221 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 260
>gi|225719510|gb|ACO15601.1| N-acetylglucosamine-6-sulfatase precursor [Caligus clemensi]
Length = 475
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 100/233 (42%), Positives = 143/233 (61%), Gaps = 15/233 (6%)
Query: 20 GYFGKYLNKY---NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
YFGKYLN+Y + IP GW W LI NSKYYNY++++NGR +HG+ Y NDY D+
Sbjct: 120 AYFGKYLNQYGRIDPLRIPKGWDYWAGLIGNSKYYNYTLSVNGRLERHGDIYDNDYLTDV 179
Query: 77 IANDSVRFLR---QSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
I S+ FL+ +S + K P+++V++ P+PH P APQY+ F S P+++Y
Sbjct: 180 IKRKSLDFLKDHFKSTEDQKKDPLLMVLATPSPHAPFTPAPQYSRNFSESKSPRIPSFNY 239
Query: 134 APNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
+ K W+ Q +P+ D + R +TL SVDD V+ I++ L L+N
Sbjct: 240 VHDK-KHWLAQ----KEPLSDYLIGDIDEVFRNRWRTLLSVDDLVDDILEFLHKKKVLDN 294
Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
TY+I+TSDHGYHLGQ+GLV K P+E DIR+PF+M GPGI P + D+++
Sbjct: 295 TYVIFTSDHGYHLGQYGLVLDKRMPYETDIRIPFVMYGPGI-PASSESNDIIL 346
>gi|403269060|ref|XP_003926575.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 3 [Saimiri
boliviensis boliviensis]
Length = 584
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 178 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 237
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F +V + ++
Sbjct: 238 DVLANVSLDFL-NYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQSVFAPRNKNFNIH 294
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK++K L+ GELNNTYI
Sbjct: 295 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYI 353
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 354 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 393
>gi|354498717|ref|XP_003511460.1| PREDICTED: N-acetylglucosamine-6-sulfatase [Cricetulus griseus]
Length = 522
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 101/220 (45%), Positives = 132/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ HGE+Y DY
Sbjct: 116 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARTHGENYSVDYLT 175
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 176 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKEFQNVFAPRNKNFNIH 232
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D +R QTL SVDD VEK+VK L GEL+NTYI
Sbjct: 233 -GTNKHWLIRQAKTPMTNSSIQFLDDAFRRRWQTLLSVDDLVEKLVKRLDGTGELDNTYI 291
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 292 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 331
>gi|392349421|ref|XP_003750373.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Rattus norvegicus]
gi|81158028|tpe|CAI84988.1| TPA: glucosamine (N-acetyl)-6-sulfatase [Rattus norvegicus]
Length = 544
Score = 181 bits (458), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 100/220 (45%), Positives = 133/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ +HGE+Y DY
Sbjct: 138 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARRHGENYSVDYLT 197
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFPNVIAPRNKNFNIH 254
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M +F D +R QTL SVDD VEK+VK L GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIKFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 314 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353
>gi|403269056|ref|XP_003926573.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 1 [Saimiri
boliviensis boliviensis]
Length = 552
Score = 181 bits (458), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F +V + ++
Sbjct: 206 DVLANVSLDFLNY-KSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQSVFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK++K L+ GELNNTYI
Sbjct: 263 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361
>gi|58865546|ref|NP_001011989.1| N-acetylglucosamine-6-sulfatase precursor [Rattus norvegicus]
gi|56611129|gb|AAH87741.1| Glucosamine (N-acetyl)-6-sulfatase [Rattus norvegicus]
Length = 519
Score = 181 bits (458), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 100/220 (45%), Positives = 133/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ +HGE+Y DY
Sbjct: 138 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARRHGENYSVDYLT 197
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFPNVIAPRNKNFNIH 254
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M +F D +R QTL SVDD VEK+VK L GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIKFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 314 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 353
>gi|348580785|ref|XP_003476159.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Cavia porcellus]
Length = 553
Score = 180 bits (457), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 137/220 (62%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 147 FAGKYLNEYGSLDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 206
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P +++S PAPH P +APQY F NV + + ++
Sbjct: 207 DVLANVSLDFL-DYKSNF--EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRSKNFNIH 263
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M Q+ D KR QTL SVDD VEK++K L+ GELNNTYI
Sbjct: 264 -GMNKHWLIRQAKTPMTNSSIQYLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYI 322
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
+TSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 323 FFTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 362
>gi|344255848|gb|EGW11952.1| N-acetylglucosamine-6-sulfatase [Cricetulus griseus]
Length = 474
Score = 180 bits (457), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 101/220 (45%), Positives = 132/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ HGE+Y DY
Sbjct: 68 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARTHGENYSVDYLT 127
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 128 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKEFQNVFAPRNKNFNIH 184
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D +R QTL SVDD VEK+VK L GEL+NTYI
Sbjct: 185 -GTNKHWLIRQAKTPMTNSSIQFLDDAFRRRWQTLLSVDDLVEKLVKRLDGTGELDNTYI 243
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 244 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 283
>gi|403269058|ref|XP_003926574.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 2 [Saimiri
boliviensis boliviensis]
Length = 532
Score = 180 bits (457), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 126 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 185
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F +V + ++
Sbjct: 186 DVLANVSLDFLNY-KSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQSVFAPRNKNFNIH 242
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK++K L+ GELNNTYI
Sbjct: 243 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYI 301
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 302 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 341
>gi|296212248|ref|XP_002752750.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 1 [Callithrix
jacchus]
Length = 552
Score = 180 bits (456), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F +V + ++
Sbjct: 206 DVLANVSLDFL-NYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQSVFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK++K L+ GELNNTYI
Sbjct: 263 -GMNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYI 321
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 322 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 361
>gi|351696736|gb|EHA99654.1| N-acetylglucosamine-6-sulfatase [Heterocephalus glaber]
Length = 553
Score = 179 bits (455), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 147 FAGKYLNEYGSLDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 206
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P +++S PAPH P +APQY F NV + ++
Sbjct: 207 DVLANVSLDFL-DYKSNF--EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH 263
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M Q+ D KR QTL SVDD VEK++K L+ GELNNTYI
Sbjct: 264 -GMNKHWLIRQAKTPMTNSSIQYLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYI 322
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
+TSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 323 FFTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 362
>gi|296212250|ref|XP_002752751.1| PREDICTED: N-acetylglucosamine-6-sulfatase isoform 2 [Callithrix
jacchus]
Length = 532
Score = 179 bits (455), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 136/220 (61%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 126 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 185
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F +V + ++
Sbjct: 186 DVLANVSLDFLNY-KSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQSVFAPRNKNFNIH 242
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD VEK++K L+ GELNNTYI
Sbjct: 243 -GMNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYI 301
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 302 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 341
>gi|344267548|ref|XP_003405628.1| PREDICTED: N-acetylglucosamine-6-sulfatase [Loxodonta africana]
Length = 555
Score = 179 bits (455), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 132/220 (60%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 149 FAGKYLNEYGAPGAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 208
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 209 DVLANVSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKSFQNVFAPRNKNFNIH 265
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D R QTL SVDD VEK++K L GELNNTYI
Sbjct: 266 -GMNKHWLIRQAKTPMSNSSIQFLDNAFRGRWQTLLSVDDLVEKLIKRLDFNGELNNTYI 324
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH GQF L K +EFDI+VP L+RGPGI P
Sbjct: 325 FYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 364
>gi|327273075|ref|XP_003221308.1| PREDICTED: n-acetylglucosamine-6-sulfatase-like [Anolis
carolinensis]
Length = 550
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 138/220 (62%), Gaps = 11/220 (5%)
Query: 21 YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++PPGW W AL NSKYYNY++++NG+ +HG +Y DY
Sbjct: 145 FAGKYLNEYGAEDAGGIGHVPPGWSFWYALEKNSKYYNYTLSVNGKPRRHGGNYSVDYLT 204
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL + K +F +P ++++ PAPH P +APQY F N+++ ++
Sbjct: 205 DVLANMSLDFL-EYKSNF--EPFFMMIATPAPHSPWVAAPQYKKNFANISAPRNSNFNIH 261
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
DK W++ Q M QF D KR QTL SVDD +EK++K L+ GEL+NTYI
Sbjct: 262 -GKDKHWLIRQAKTPMTNSSIQFLDDAFRKRWQTLLSVDDLIEKLLKRLEFHGELDNTYI 320
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+G+H GQF L K +EFDI+VP L+RGPG+ P
Sbjct: 321 FYTSDNGFHTGQFSLPIDKRQLYEFDIKVPLLVRGPGVKP 360
>gi|443733552|gb|ELU17869.1| hypothetical protein CAPTEDRAFT_122995, partial [Capitella teleta]
Length = 229
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 83/122 (68%), Positives = 100/122 (81%), Gaps = 4/122 (3%)
Query: 118 NMFFNVT----SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
N F +T TP+Y++APNPDKQWILQ T KM+P+H +FTD+L KRLQTLQSVDD
Sbjct: 5 NAFLIITLLYFPRRTPSYNFAPNPDKQWILQYTGKMEPIHMEFTDMLHRKRLQTLQSVDD 64
Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
AVEK+ ELK+LGEL+NTY IY++DHGYHLGQ+GLVKGK+ P++FDIRVPFLMRGP I P
Sbjct: 65 AVEKVFNELKELGELDNTYFIYSADHGYHLGQYGLVKGKAMPYDFDIRVPFLMRGPKIPP 124
Query: 234 GT 235
G
Sbjct: 125 GV 126
>gi|313224745|emb|CBY20536.1| unnamed protein product [Oikopleura dioica]
Length = 503
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 94/231 (40%), Positives = 135/231 (58%), Gaps = 28/231 (12%)
Query: 21 YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN Y ++P GW W AL+ NS+YYNYSI+ NG++ KHG+++ DY
Sbjct: 117 YAGKYLNMYGFKKAGGVEHVPKGWDSWNALVGNSRYYNYSISRNGKEEKHGDNFTLDYLT 176
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
+L+ANDSV F+ K P ++++ PAPH P DS P+YA+++ N+T A
Sbjct: 177 NLVANDSVNFINAMK---PDSPFFMIVAPPAPHEPWDSDPKYADLYKNLT---------A 224
Query: 135 PNPDKQW----------ILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
P +KQW + V M ++D R + L+SVDD VE I+ L +
Sbjct: 225 PRDNKQWNIHRADAHWVVRNVPNPMSQKSTDWSDNAFRSRWRCLKSVDDLVESIMNALNE 284
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
L++TYI+++SDHGYH GQF L K P+EFD+R+P L+RGPG+ P +
Sbjct: 285 TDLLDDTYIMFSSDHGYHTGQFSLPYDKRQPYEFDLRIPLLIRGPGVKPNS 335
>gi|313241330|emb|CBY33605.1| unnamed protein product [Oikopleura dioica]
Length = 503
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 94/231 (40%), Positives = 135/231 (58%), Gaps = 28/231 (12%)
Query: 21 YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN Y ++P GW W AL+ NS+YYNYSI+ NG++ KHG+++ DY
Sbjct: 117 YAGKYLNMYGFKKAGGVEHVPKGWDSWNALVGNSRYYNYSISRNGKEEKHGDNFTLDYLT 176
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
+L+ANDSV F+ K P ++++ PAPH P DS P+YA+++ N+T A
Sbjct: 177 NLVANDSVNFINAMK---PDAPFFMIVAPPAPHEPWDSDPKYADLYKNLT---------A 224
Query: 135 PNPDKQW----------ILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
P +KQW + V M ++D R + L+SVDD VE I+ L +
Sbjct: 225 PRDNKQWNIHRADAHWVVRNVPNPMSQKSTDWSDNAFRSRWRCLKSVDDLVESIMNALNE 284
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
L++TYI+++SDHGYH GQF L K P+EFD+R+P L+RGPG+ P +
Sbjct: 285 TDLLDDTYIMFSSDHGYHTGQFSLPYDKRQPYEFDLRIPLLIRGPGVKPNS 335
>gi|444721213|gb|ELW61958.1| N-acetylglucosamine-6-sulfatase [Tupaia chinensis]
Length = 514
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 98/202 (48%), Positives = 128/202 (63%), Gaps = 5/202 (2%)
Query: 33 YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSF 92
++P GW W AL NSKYYNY++++NG+ KHGE+Y DY D++AN S+ FL K +F
Sbjct: 53 HVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLTDVLANVSLDFL-DYKSNF 111
Query: 93 SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWIL-QVTRKMQP 151
+P +++S PAPH P +APQY F NV + ++ +K W++ Q M
Sbjct: 112 --EPFFMMISTPAPHSPWTAAPQYQKTFQNVFAPRNKNFNIH-GTNKHWLIRQAKTPMTN 168
Query: 152 VHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
QF D KR QTL SVDD VEK++K L+ GELNNTYI YTSD+GYH GQF L
Sbjct: 169 SSIQFLDNAFRKRWQTLLSVDDLVEKLIKRLEFTGELNNTYIFYTSDNGYHTGQFSLPID 228
Query: 212 KSFPFEFDIRVPFLMRGPGIVP 233
K +EFDI+VP L+RGPGI P
Sbjct: 229 KRQLYEFDIKVPLLVRGPGIKP 250
>gi|260803665|ref|XP_002596710.1| hypothetical protein BRAFLDRAFT_219258 [Branchiostoma floridae]
gi|229281969|gb|EEN52722.1| hypothetical protein BRAFLDRAFT_219258 [Branchiostoma floridae]
Length = 513
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 95/221 (42%), Positives = 135/221 (61%), Gaps = 10/221 (4%)
Query: 21 YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y +IPPGW W LI NS+YY+Y++++NG K +HG+ Y DY
Sbjct: 124 FAGKYLNQYGFPSTGGPGHIPPGWDWWFGLIGNSRYYDYALSVNGTKQQHGKLYSMDYLT 183
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+I N S+ FL S P +++S PA H P DSAPQY F + + +Y+
Sbjct: 184 DVIKNQSLHFL--DTVGASPNPFFMMLSTPAAHSPFDSAPQYEQDFLDWKAPRGGSYNVY 241
Query: 135 PNPDKQWILQVT-RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
DK W+L+ M+ Q+ D + KR++TL SV+D V+ ++ +L+D G L NTY+
Sbjct: 242 -GKDKHWLLRFAPSPMKDTSVQYADSIFRKRMRTLLSVEDLVQAVMTKLEDQGLLYNTYV 300
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
I++SD+GYH GQF L + K +EFDIRVP L+RGPGI G
Sbjct: 301 IFSSDNGYHYGQFSLPEDKRQLYEFDIRVPLLIRGPGIKAG 341
>gi|327286464|ref|XP_003227950.1| PREDICTED: n-acetylglucosamine-6-sulfatase-like [Anolis
carolinensis]
Length = 506
Score = 174 bits (441), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 101/229 (44%), Positives = 138/229 (60%), Gaps = 12/229 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN+Y ++P GW W L+ NSKYYNY+++ NGR+ KHG+ Y DY
Sbjct: 133 YAGKYLNQYGDPKAGGVQHVPLGWNYWLGLVGNSKYYNYTLSSNGREEKHGDLYEEDYLT 192
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
DLI N S+ FL Q S+ P ++V++ PA H P +AP Y + F + + +++
Sbjct: 193 DLITNRSLAFL----QRLSRPPFLMVLAPPASHSPWTAAPHYKSAFSTIKAPRNGSFNVY 248
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
DK W++ Q M +F D + +R QTL SVDD VEK++ +LK LG L++TY+
Sbjct: 249 -GKDKHWLIRQAKSPMLNTSIEFLDNVYRQRWQTLLSVDDMVEKVLLQLKSLGLLDSTYV 307
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
YTSDHGYH GQF L K +EFDIRVP ++RGPGI P YL +L
Sbjct: 308 FYTSDHGYHTGQFSLPIDKRQLYEFDIRVPLMVRGPGIKPNQTYLEPIL 356
>gi|390333874|ref|XP_003723795.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Strongylocentrotus
purpuratus]
Length = 680
Score = 174 bits (440), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 106/266 (39%), Positives = 151/266 (56%), Gaps = 48/266 (18%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++IPPGW +W AL +NS YYNYS+++NG++ HG+DY+ DY
Sbjct: 158 FAGKYLNQYGQKEVGGLAHIPPGWDDWIALKLNSVYYNYSLSVNGKEEVHGDDYHKDYLT 217
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
DLI N S+ FL+ KQ+ S P ++MS PA H P DSAPQY + F + + P+++
Sbjct: 218 DLINNRSMEFLQ--KQTVSS-PFFMMMSTPACHSPFDSAPQYMSHFTDKKAPRGPSFN-- 272
Query: 135 PNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
VH + R +TL SVDD V+ +V L ++NTYII
Sbjct: 273 -----------------VH--------SARWRTLLSVDDMVKNLVTLLTTKKMIDNTYII 307
Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL-----LDVLIPQVRKF 249
++SD+G+HLGQF L K +EFDIRVP ++RGPGI PGT +D++ P + +
Sbjct: 308 FSSDNGFHLGQFSLPNDKRQLYEFDIRVPLIIRGPGISPGTTSKQIAVNIDIM-PTIVEM 366
Query: 250 SSGSL------IFIMSFLINLNTMRA 269
++GS I +M LI N A
Sbjct: 367 ATGSAPSEVDGISLMPLLIARNESMA 392
>gi|323652548|gb|ADX98525.1| N-acetylglucosamine-6-sulfatase [Gallus gallus]
Length = 263
Score = 173 bits (438), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 74/125 (59%), Positives = 98/125 (78%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYIPPGWREW L+ NS++YNY+I+ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWVGLVKNSRFYNYTISRNGNKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
+S+ + R SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N + H TP+Y+YAPN DK
Sbjct: 198 ESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257
Query: 140 QWILQ 144
WI+Q
Sbjct: 258 HWIMQ 262
>gi|157106230|ref|XP_001649229.1| sulfatase [Aedes aegypti]
gi|108879923|gb|EAT44148.1| AAEL004460-PA [Aedes aegypti]
Length = 495
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 92/225 (40%), Positives = 134/225 (59%), Gaps = 9/225 (4%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
Y GKYLN+Y +PPGW +W L NSKYYNY++N NG+ + E+Y D +++N
Sbjct: 125 YAGKYLNEYYSKEVPPGWSDWHGLHGNSKYYNYTLNENGQIVSFTEEYLTD----VLSNR 180
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP-DK 139
+V FL +++Q P +++ PAPH P +A ++ + F +V + T Y+ P +K
Sbjct: 181 TVDFLSKAEQGV---PFFAMVAPPAPHAPYTAATRHEDTFADVNAPRTKNYNLPCGPLEK 237
Query: 140 QWILQVTRKMQPVHRQFT-DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
W+L + P + D++ KR Q+L +VD+ VE IV L+ L +TYIIYTSD
Sbjct: 238 HWLLTMPPSPLPADILASIDIIYRKRWQSLMAVDEMVESIVATLEKRNMLTDTYIIYTSD 297
Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
+GYH+GQFGL K P+E DIRVP LM GP I P T+ V++
Sbjct: 298 NGYHMGQFGLPYDKRQPYETDIRVPLLMTGPKIPPKTLVSAPVVL 342
>gi|149066818|gb|EDM16551.1| rCG48973 [Rattus norvegicus]
Length = 388
Score = 169 bits (427), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/220 (43%), Positives = 130/220 (59%), Gaps = 18/220 (8%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ +HGE+Y DY
Sbjct: 138 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARRHGENYSVDYLT 197
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 198 DVLANLSLDFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKAFPNVIAPRNKNFNIH 254
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M +F D +R QTL SVDD VEK+VK L GEL+NTYI
Sbjct: 255 -GTNKHWLIRQAKTPMTNSSIKFLDDAFRRRWQTLLSVDDLVEKLVKRLDSTGELDNTYI 313
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
YTSD+GYH + L +EFDI+VP L+RGPGI P
Sbjct: 314 FYTSDNGYHTDKRQL-------YEFDIKVPLLVRGPGIKP 346
>gi|195425576|ref|XP_002061073.1| GK10744 [Drosophila willistoni]
gi|194157158|gb|EDW72059.1| GK10744 [Drosophila willistoni]
Length = 505
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 91/234 (38%), Positives = 133/234 (56%), Gaps = 10/234 (4%)
Query: 2 AWPVELTSRNLLYFDISHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRK 61
A PV L R F Y GKYLN++ G ++PPGW E+ L NS+YYNY++ NGR
Sbjct: 118 ALPVLLAERGYRTF-----YAGKYLNEFRGEHVPPGWTEFHGLHGNSRYYNYTLLQNGRN 172
Query: 62 IKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF 121
+ Y + Y DLI++ +V F++Q Q P ++++ PA H P AP++ F
Sbjct: 173 VS----YTDTYLTDLISDLAVDFIQQEVQWKRMNPFFVMLAPPAAHAPYTPAPRHEGTFA 228
Query: 122 NVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
+ TP+++ P DK W++ ++ + D KR ++L +VD+ V +V
Sbjct: 229 KWKALRTPSFN-VPTKDKHWLVGSSKVLPNETIATIDKYFQKRWESLLAVDEMVSTLVAL 287
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
L + L+NTYI+YTSD+GYHLGQF K P+E DIRVP L+RGPGI G+
Sbjct: 288 LNETQLLDNTYIVYTSDNGYHLGQFSQPFDKRQPYETDIRVPLLVRGPGIAKGS 341
>gi|326432134|gb|EGD77704.1| hypothetical protein PTSG_12796 [Salpingoeca sp. ATCC 50818]
Length = 843
Score = 164 bits (415), Expect = 4e-38, Method: Composition-based stats.
Identities = 86/217 (39%), Positives = 122/217 (56%), Gaps = 7/217 (3%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
Y GKYLN Y G PPGW EW L NS+YYNY+I N + +K G+ Y DY D++A
Sbjct: 447 YGGKYLNLYYGPSNPPGWTEWFGLYGNSRYYNYTIKDNDKTVKFGDKYDTDYLTDVLARR 506
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD--YAPNPD 138
+VRF+ +S + + +P L + P+ H APQY N + + TP Y+ YA D
Sbjct: 507 AVRFINES--AAADEPFFLWLGTPSAHASFTPAPQYENTVTGIKAPRTPNYNKVYA---D 561
Query: 139 KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
+ ++ M + +D + KRL+TL+SVDD V I L +L+NT+ +T D
Sbjct: 562 RHRPVRDLSPMTDEQIRQSDEIYRKRLETLRSVDDLVGDIFDTLAATNQLSNTWFFFTGD 621
Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
HG+HLGQF + K ++ DIRVP+ +RGPG T
Sbjct: 622 HGFHLGQFAMGFDKRQLYDTDIRVPYFVRGPGAAANT 658
>gi|443689097|gb|ELT91586.1| hypothetical protein CAPTEDRAFT_122955 [Capitella teleta]
Length = 537
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 85/221 (38%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 21 YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++PPGW +W L+ NS YYNYS+++NG++ KHG+ Y +DY
Sbjct: 122 FAGKYLNQYGYKPSGGVEHVPPGWDDWNGLVGNSVYYNYSLSVNGKEEKHGDKYPDDYLT 181
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+I+ + RFL ++F P +++S PA H P APQ+ + F NVT+ +++
Sbjct: 182 DVISKKAERFLENEHETFGS-PFFMMLSTPACHAPFTPAPQHRDKFANVTAPRDGSFNKY 240
Query: 135 PNPDKQWILQVTRK-MQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+ ++ W+++ + M + D + R +TL SVDD + K+ L+ G L +TY+
Sbjct: 241 -DANRHWLVRNAKHPMGNESMKLIDPVFRNRWRTLLSVDDLITKLFTTLETKGFLEDTYV 299
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
Y +D+GYHLGQ L K +EFDIRVP ++RGPGI G
Sbjct: 300 FYAADNGYHLGQLSLPYDKRQLYEFDIRVPLMVRGPGIKAG 340
>gi|350414629|ref|XP_003490372.1| PREDICTED: LOW QUALITY PROTEIN:
N-acetylglucosamine-6-sulfatase-like [Bombus impatiens]
Length = 598
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 89/226 (39%), Positives = 138/226 (61%), Gaps = 14/226 (6%)
Query: 21 YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN+Y S++P GW W LI NSKYYNYS+++NG + K G + NDY
Sbjct: 119 YGGKYLNQYGEKIVGGSSHVPEGWDWWVGLIGNSKYYNYSLSINGTEKKFGSEP-NDYLT 177
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+I++ ++ F++ + + +P ++V++ PAPH P A ++ + + +V + TP ++
Sbjct: 178 DVISDLAIDFIK--AHNSNSEPFLMVLAPPAPHAPFIPAMRHIDKYKDVIAKRTPNFNTQ 235
Query: 135 PNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
DK W++Q R P+ D + +R +TL +VD+ VE + + LK LNNT
Sbjct: 236 TQMDKHWLVQ--RGPSPLPDNLLPKLDEIYRRRWETLLAVDELVENVYEALKVGNLLNNT 293
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY 237
YII+TSD+GYH+GQF + K P+E DI+VP L+RGPGI P ++
Sbjct: 294 YIIFTSDNGYHIGQFSMPIDKRQPYETDIKVPLLIRGPGITPSKIF 339
>gi|326434077|gb|EGD79647.1| hypothetical protein PTSG_10494 [Salpingoeca sp. ATCC 50818]
Length = 513
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 87/227 (38%), Positives = 125/227 (55%), Gaps = 20/227 (8%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN+Y ++P GW W AL+ NS YY+Y I+ NG HG N Y P
Sbjct: 127 YTGKYLNQYGAPSAGGVQHVPRGWDNWQALVGNSVYYDYVISNNGTPESHG----NTYLP 182
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
+L+ + + FLR + + P +++ P+ H P + AP+YA++ N T+ P Y A
Sbjct: 183 NLLLDKTSDFLRTLEGT--DDPFFIMVGTPSCHDPTEPAPEYAHLLPNATAPRPPNYGKA 240
Query: 135 PNPDKQWILQVTRKMQ-------PVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
DK W + ++ P+ ++ DL +R TL +VDD V + LK +G
Sbjct: 241 FR-DKHWFVAKQGEVAGFGSGYAPIEEKYNDLQYRRRALTLMTVDDIVANVTATLKSMGR 299
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
LNNTY Y SDHGYHLG F ++K K P+E D+R+P ++RGP I PG
Sbjct: 300 LNNTYFFYASDHGYHLGNFAVLKDKRLPYETDLRIPAVVRGPHIKPG 346
>gi|328776879|ref|XP_623263.2| PREDICTED: n-acetylglucosamine-6-sulfatase-like isoform 2 [Apis
mellifera]
Length = 533
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 91/225 (40%), Positives = 134/225 (59%), Gaps = 15/225 (6%)
Query: 21 YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN+Y ++IP GW W LI NSKYYNY +++NG + K G D +DY
Sbjct: 110 YAGKYLNQYGDKIVGGAAHIPIGWDWWAGLIGNSKYYNYILSINGTEKKFGNDS-SDYLT 168
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+I++ + F++ + +P ++V++ PAPH P A ++ N + NV + TP ++
Sbjct: 169 DVISDMATNFIKTHNPN---QPFLMVLAPPAPHAPFIPAQRHINKYKNVKAKRTPNFNTQ 225
Query: 135 PNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
DK W+ V R+ P+ D + R +TL +VD+ V+ I + LK LNNT
Sbjct: 226 TQMDKHWL--VKREPSPLPDNLLPKLDEIYRNRWETLLAVDELVKNIYQVLKLQSFLNNT 283
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
YII+TSD+GYH+GQF + K P+E DIRVP L+RGPGI+P +
Sbjct: 284 YIIFTSDNGYHIGQFSMPIDKRQPYETDIRVPLLIRGPGIMPSKI 328
>gi|441647472|ref|XP_003274834.2| PREDICTED: extracellular sulfatase Sulf-1 [Nomascus leucogenys]
Length = 1023
Score = 160 bits (405), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 72/119 (60%), Positives = 93/119 (78%), Gaps = 6/119 (5%)
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
TP+Y+YAPN DK WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GE
Sbjct: 400 TPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLMETGE 459
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
L NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 460 LENTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 512
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 34/49 (69%), Positives = 40/49 (81%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDY 68
+FGKYLN+YNGSYIPPGWREW LI NS++YNY++ NG K KHG DY
Sbjct: 138 AFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHGFDY 186
>gi|326917734|ref|XP_003205151.1| PREDICTED: extracellular sulfatase Sulf-1-like, partial [Meleagris
gallopavo]
Length = 622
Score = 160 bits (404), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 71/119 (59%), Positives = 95/119 (79%), Gaps = 6/119 (5%)
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
TP+Y+YAPN DK WI+Q T M P+H +FT++L KRLQTL SVDD++E++ + L ++GE
Sbjct: 1 TPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSMERLYQMLAEMGE 60
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
L NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 61 LENTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------VVPQI 113
>gi|340715199|ref|XP_003396106.1| PREDICTED: LOW QUALITY PROTEIN:
n-acetylglucosamine-6-sulfatase-like [Bombus terrestris]
Length = 497
Score = 160 bits (404), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 89/225 (39%), Positives = 136/225 (60%), Gaps = 14/225 (6%)
Query: 21 YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN+Y S++P GW W LI NSKYYNYS+++NG + K G + NDY
Sbjct: 119 YGGKYLNQYGEKIVGGSSHVPEGWDWWVGLIGNSKYYNYSLSINGTEKKFGSEP-NDYLT 177
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+I++ ++ F++ + + +P ++V++ PAPH P A ++ + + V + TP ++
Sbjct: 178 DVISDLAIDFIKA--HNPNSEPFLMVLAPPAPHAPFIPAMRHIDKYKYVIAKRTPNFNTQ 235
Query: 135 PNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
DK W++Q R P+ D + +R +TL +VD+ VE + + LK LNNT
Sbjct: 236 TQMDKHWLVQ--RGPSPLPDNLLPKLDEIYRRRWETLLAVDELVENVYEALKTGNLLNNT 293
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
YII+TSD+GYH+GQF + K P+E DI+VP L+RGPGI P +
Sbjct: 294 YIIFTSDNGYHIGQFSMPIDKRQPYETDIKVPLLIRGPGITPSKI 338
>gi|46249932|gb|AAH68565.1| SULF1 protein [Homo sapiens]
Length = 644
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 72/119 (60%), Positives = 93/119 (78%), Gaps = 6/119 (5%)
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
TP+Y+YAPN DK WI+Q T M P+H +FT++L KRLQTL SVDD+VE++ L + GE
Sbjct: 5 TPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGE 64
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
L NTYIIYT+DHGYH+GQFGLVKGKS P++FDIRVPF +RGP + PG+ ++PQ+
Sbjct: 65 LENTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGS------IVPQI 117
>gi|383853828|ref|XP_003702424.1| PREDICTED: LOW QUALITY PROTEIN:
N-acetylglucosamine-6-sulfatase-like [Megachile
rotundata]
Length = 594
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 89/225 (39%), Positives = 134/225 (59%), Gaps = 14/225 (6%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN+Y +++P GW W L+ NS+YYNYSI++NG + K G + DY
Sbjct: 112 YAGKYLNQYGHQKAGGPAHVPVGWDWWAGLVGNSRYYNYSISINGTRKKFGNEPA-DYLT 170
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+I N ++ F+ ++ + +P ++V++ PAPH P A ++ + F +V + TP ++
Sbjct: 171 DVINNLAMEFI--GTRNANNQPFLMVLAPPAPHAPFTPATRHIDKFKDVKAKRTPNFNTL 228
Query: 135 PNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
DK W++Q + P+ D + +R +TL +VD+ V I + LK LNNT
Sbjct: 229 IQSDKHWLVQ--QGPSPLPNDLLPKLDDIYRRRWETLLAVDELVTNIYQALKFQNLLNNT 286
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
Y+I+TSD+GYH+GQF K P+E DIRVP LMRGPGIVP +
Sbjct: 287 YMIFTSDNGYHIGQFSQPIDKRQPYETDIRVPLLMRGPGIVPSKI 331
>gi|380011321|ref|XP_003689757.1| PREDICTED: N-acetylglucosamine-6-sulfatase-like [Apis florea]
Length = 528
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 89/225 (39%), Positives = 132/225 (58%), Gaps = 14/225 (6%)
Query: 21 YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN+Y +IP GW W LI NSKYYNY +++NG + K G D +DY
Sbjct: 110 YAGKYLNQYGDKIVGGAGHIPTGWDWWAGLIGNSKYYNYILSINGTEKKFGNDS-SDYLT 168
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+I++ + F++ + +P ++V++ PAPH P A ++ N + NV + TP ++
Sbjct: 169 DVISDMATNFIKIYNPN--GQPFLMVLAPPAPHAPFTPAQRHINKYKNVKAKRTPNFNTQ 226
Query: 135 PNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
DK W+ V R P+ D + R +TL +VD+ V+ + + LK LNNT
Sbjct: 227 TQMDKHWL--VERGPSPLPDNLLPKLDEIYRNRWETLLAVDELVKNVYQVLKLQNLLNNT 284
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
YII+TSD+GYH+GQF + K P+E DIRVP L+RGPG++P +
Sbjct: 285 YIIFTSDNGYHIGQFSMPIDKRQPYETDIRVPLLIRGPGVMPSKI 329
>gi|350595017|ref|XP_003484027.1| PREDICTED: extracellular sulfatase Sulf-2-like, partial [Sus
scrofa]
Length = 281
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 69/109 (63%), Positives = 90/109 (82%)
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
TP+Y+YAPNPDK WI++ T M+P+H +FT++L KRLQTL SVDD++E I L + GE
Sbjct: 1 TPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGE 60
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
L+NTYI+YT+DHGYH+GQFGLVKGKS P+EFDIRVPF +RGP + G++
Sbjct: 61 LDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGSL 109
>gi|195151379|ref|XP_002016625.1| GL10403 [Drosophila persimilis]
gi|194110472|gb|EDW32515.1| GL10403 [Drosophila persimilis]
Length = 499
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 124/219 (56%), Gaps = 9/219 (4%)
Query: 18 SHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYY 73
+HGY GKYLN++ G+ +P GW E+ L NS+YYNY++ N + + Y + Y
Sbjct: 115 NHGYRTFFAGKYLNQFRGAKVPRGWNEFYGLHGNSRYYNYTLRENSQNVS----YTSTYL 170
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
DL+ + +V FLR + Q +P +++ PAPH P AP++ +F N + TP+++
Sbjct: 171 TDLLRDRAVHFLRSATQGDVHRPFFAMVAPPAPHAPYTPAPRHRGVFSNTKALRTPSFN- 229
Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
P DK W++ + + D R ++L +VD+ V +V L + L NTYI
Sbjct: 230 NPTEDKHWLVGSSAHLSNETIATIDRYFQGRWESLLAVDELVAALVAVLNETKSLENTYI 289
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
IYTSD+GYH+GQF K P+E DI VP L+RGPGI
Sbjct: 290 IYTSDNGYHMGQFAQPFDKRQPYETDIHVPLLIRGPGIA 328
>gi|198459803|ref|XP_002138742.1| GA24970, partial [Drosophila pseudoobscura pseudoobscura]
gi|198136813|gb|EDY69300.1| GA24970, partial [Drosophila pseudoobscura pseudoobscura]
Length = 450
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 124/219 (56%), Gaps = 9/219 (4%)
Query: 18 SHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYY 73
+HGY GKYLN++ G+ +P GW E+ L NS+YYNY++ N + + Y + Y
Sbjct: 114 NHGYRTFFAGKYLNQFRGAKVPRGWNEFYGLHGNSRYYNYTLRENSQNVS----YTSTYL 169
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
DL+ + +V FLR + Q +P +++ PAPH P AP++ +F N + TP+++
Sbjct: 170 TDLLRDRAVDFLRSATQGDVHRPFFAMVAPPAPHAPYTPAPRHRGVFSNTKALRTPSFN- 228
Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
P DK W++ + + D R ++L +VD+ V +V L + L NTYI
Sbjct: 229 NPTEDKHWLVGSSAHLSNETIATIDRYFQGRWESLLAVDELVAALVAVLNETKSLENTYI 288
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
IYTSD+GYH+GQF K P+E DI VP L+RGPGI
Sbjct: 289 IYTSDNGYHMGQFAQPFDKRQPYETDIHVPLLIRGPGIA 327
>gi|163793311|ref|ZP_02187286.1| sulfatase [alpha proteobacterium BAL199]
gi|159181113|gb|EDP65628.1| sulfatase [alpha proteobacterium BAL199]
Length = 464
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 84/218 (38%), Positives = 130/218 (59%), Gaps = 8/218 (3%)
Query: 20 GYFGKYLNKYN---GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
G GKY+N+Y +Y+PPGW EW A YYNY +N NGR + +G DY D+
Sbjct: 108 GLIGKYMNQYEPNEQTYVPPGWDEWYA--AGHDYYNYVLNENGRHVSYGSKP-EDYLTDV 164
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
IA +V +R++ + KP ++ APHGP +AP++A++F + ++D A
Sbjct: 165 IAGHTVSAIRRAADT--GKPFFFFVTPYAPHGPATAAPRHADLFADTPYPRPASFDEADV 222
Query: 137 PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
DK ++ + + + + +RL++LQ++DD VE IV L+ +G+L NTY+IYT
Sbjct: 223 SDKPAYIRDLPPLSADQIAWIEDMYRRRLRSLQAMDDMVESIVNTLEAVGQLENTYLIYT 282
Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
SD+G+H+G+ GK+ +E DIRVPF+MRGPG+ G
Sbjct: 283 SDNGWHMGEHRQQYGKTTAYEEDIRVPFVMRGPGVPKG 320
>gi|194757651|ref|XP_001961076.1| GF11188 [Drosophila ananassae]
gi|190622374|gb|EDV37898.1| GF11188 [Drosophila ananassae]
Length = 501
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 90/241 (37%), Positives = 131/241 (54%), Gaps = 14/241 (5%)
Query: 3 WPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMN 58
W L R L GY GKYLN+Y+G P GW + L NS+YYNY++ N
Sbjct: 102 WRESLEPRTLAMHLQRRGYTTFFAGKYLNQYSGLETPVGWNSFYGLQGNSRYYNYTLREN 161
Query: 59 GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSK-----KPIMLVMSFPAPHGPEDSA 113
+ + E Y D DL +V FL QS +K +P +++ PA H P A
Sbjct: 162 KNNVHYKEVYLTDKLRDL----TVDFLESVSQSANKSKEVERPFFAMITPPAAHAPFTPA 217
Query: 114 PQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
P++ +F + + TP+++ P DK W+++ + +Q D KR +TL +VD+
Sbjct: 218 PRHDGVFNKIKALRTPSFN-KPVDDKHWLVRSSWPLQDKTIDTIDSYYRKRWETLLAVDE 276
Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
VEK+V +L G L+NTY+I++SD+GYH+GQF K P+E DI +P L+RGPGI P
Sbjct: 277 LVEKVVTKLNQTGVLDNTYVIFSSDNGYHVGQFAQPFDKRQPYETDIHIPLLIRGPGIAP 336
Query: 234 G 234
G
Sbjct: 337 G 337
>gi|195134107|ref|XP_002011479.1| GI14019 [Drosophila mojavensis]
gi|193912102|gb|EDW10969.1| GI14019 [Drosophila mojavensis]
Length = 493
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 84/224 (37%), Positives = 135/224 (60%), Gaps = 12/224 (5%)
Query: 19 HGYF----GKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
HGY GKYLN++ G+ +PPGW ++ L NS+YYNY+I N R + Y + Y
Sbjct: 115 HGYHTFFAGKYLNQFKGAEVPPGWDKFYGLHGNSRYYNYTIRENRRNVS----YADIYLT 170
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++ + +++F+ ++ + P ++S PA H P AP++ +F +V + TP+++ A
Sbjct: 171 DMLRDRTLKFIHNVRKG--QNPFFAMISPPAAHAPFTPAPRHKGVFSHVRALRTPSFN-A 227
Query: 135 PNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
P+ DK W+++ +++ D KR +TL +VD+ + I+ LK L+NTYII
Sbjct: 228 PSKDKHWLVRTAKRLPNSTVLTVDKYFQKRWETLLAVDEMMVAIITLLKKYQCLDNTYII 287
Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
YTSD+GYHLGQF K P++ DI+VP L+RGP VPG++ +
Sbjct: 288 YTSDNGYHLGQFAQPFDKRQPYQTDIKVPLLIRGPD-VPGSILM 330
>gi|405977217|gb|EKC41676.1| N-acetylglucosamine-6-sulfatase [Crassostrea gigas]
Length = 1011
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 88/229 (38%), Positives = 128/229 (55%), Gaps = 10/229 (4%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++PPGW W L+ NSKYYNY +++NG H +D DY
Sbjct: 126 FAGKYLNQYGTPMAGGVEHVPPGWDWWIGLVGNSKYYNYHLSVNGTLEVHKDDPKTDYLT 185
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+I + FL S Q+ P +++S PA HGP AP+Y N F N T+ +Y+
Sbjct: 186 DIINRRATSFL--SLQNSDSAPFFMMLSTPACHGPFTPAPKYQNNFPNKTAPRDGSYNKH 243
Query: 135 PNPDKQWILQVTRKMQPVHR-QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+ DK W++Q P + D + R +TL SVDD +E IV L+ L NTYI
Sbjct: 244 GH-DKHWLIQQAITPLPNDTVKSEDQVFRNRWRTLLSVDDMMENIVNILQTKNLLENTYI 302
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
+++SD+G+HLGQF + K +EFD+RVP ++RGP + G + V+
Sbjct: 303 VFSSDNGFHLGQFSMPSDKRQLYEFDVRVPLMIRGPNVTAGQVVKEQVM 351
>gi|307174678|gb|EFN65061.1| N-acetylglucosamine-6-sulfatase [Camponotus floridanus]
Length = 521
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 87/225 (38%), Positives = 131/225 (58%), Gaps = 15/225 (6%)
Query: 21 YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN+Y +++P GW W LI NSKYYNYS+++NG + K+G + +DY
Sbjct: 132 YAGKYLNEYGNRKVGGAAHVPMGWDWWAGLIGNSKYYNYSLSINGTEAKYG-NSTSDYLT 190
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+I N +V F+ +P ++V++ PAPH P A ++ + + + TP ++
Sbjct: 191 DVINNLAVNFINGHSDD---QPFLMVLAPPAPHAPFTPADRHNDKYNGTKAKRTPNFNVP 247
Query: 135 PNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
+ DK W+ V + P+ D + KR +TL +VD+ V+ + LK+ LN+T
Sbjct: 248 VHQDKHWL--VRKGPTPLPNSVLPKLDEIYRKRWETLLAVDELVKNVHDLLKERNLLNDT 305
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
Y IYTSD+GYH+GQF + K P+E DIRVP L+ GPGI P T+
Sbjct: 306 YFIYTSDNGYHIGQFSMPMDKRQPYETDIRVPLLISGPGINPSTI 350
>gi|195041317|ref|XP_001991230.1| GH12546 [Drosophila grimshawi]
gi|193900988|gb|EDV99854.1| GH12546 [Drosophila grimshawi]
Length = 501
Score = 153 bits (387), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 87/246 (35%), Positives = 138/246 (56%), Gaps = 15/246 (6%)
Query: 18 SHGYF----GKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYY 73
+HGY GKYLN++ G+ +PPGW E+ L NS+YYNY++ N + + Y + Y
Sbjct: 123 NHGYHTFFAGKYLNQFKGAEVPPGWDEFYGLHGNSRYYNYTMRENQQNVS----YTDTYL 178
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
DL+ + +V F+R +KP ++S PA H P AP++ F V + TP ++
Sbjct: 179 TDLLKDRAVTFIRNMASKQYQKPFFAMISPPAAHAPFTPAPRHKGFFSEVHALRTPNFN- 237
Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
AP+ DK W++ + + + D R +TL +VD+ ++ ++ L + L+NTYI
Sbjct: 238 APDTDKHWLVGSAQNLSEATVRTIDEYFQSRWETLLAVDEMMQNLLTVLNETQYLDNTYI 297
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG-----TMYLLDVLIPQVRK 248
+YTSD+GYHLGQF K P+E DI+VP L+ GP + G + LLD L P + +
Sbjct: 298 VYTSDNGYHLGQFAQPFDKRQPYETDIKVPLLISGPDVPAGGSTDIIVSLLD-LAPTILE 356
Query: 249 FSSGSL 254
++ S+
Sbjct: 357 WTGMSV 362
>gi|158298003|ref|XP_318108.3| AGAP004721-PA [Anopheles gambiae str. PEST]
gi|157014599|gb|EAA13233.3| AGAP004721-PA [Anopheles gambiae str. PEST]
Length = 494
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 89/243 (36%), Positives = 139/243 (57%), Gaps = 19/243 (7%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
+ GKYLN+Y + IPPGW +W L NS+YYN+++ NG+ Y N+Y+ D +
Sbjct: 125 FAGKYLNEYFSASIPPGWSQWFGLHGNSRYYNFTVTENGKVAT----YTNEYFTDYLDTK 180
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP-DK 139
+V FL+ + + + +++ PAPH P A ++ ++F NV + TP ++ + +P K
Sbjct: 181 TVNFLKNLR---ADRHFFAMVTPPAPHAPFTPADRHKDLFPNVKAIQTPNFNISSSPLQK 237
Query: 140 QWILQVTRKMQPVHRQFTDLLMT---KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
W+L + P+ +F D + T KR QTL +VD+ VE IV L++ L T+I YT
Sbjct: 238 HWLLTMEPSTLPL--KFLDEMDTIHRKRWQTLMAVDNMVENIVNILREKKLLEKTFIFYT 295
Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP-----GTMYLLDVLIPQVRKFSS 251
SD+G+HLGQF K P+E DIRVP +RGP I P + LLD+ +P + + ++
Sbjct: 296 SDNGFHLGQFSQAYDKRQPYETDIRVPLFVRGPTIKPKAVQESAVALLDI-VPTILQLAN 354
Query: 252 GSL 254
L
Sbjct: 355 LEL 357
>gi|229577317|ref|NP_001153351.1| glucosamine (N-acetyl)-6-sulfatase precursor [Nasonia vitripennis]
Length = 501
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 85/219 (38%), Positives = 133/219 (60%), Gaps = 13/219 (5%)
Query: 21 YFGKYLNKY---NG-SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
Y GKYLNKY NG S +PPGW +W LI NS+YY+YS+++NG + K+G DY D+
Sbjct: 125 YAGKYLNKYGQRNGVSRVPPGWDKWYGLIGNSRYYDYSLSINGAEKKYGSSP-TDYLTDV 183
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
I ++ FL+ K+ +P ++V++ PAPH P AP++ + + + T ++ +
Sbjct: 184 IQGFAIDFLKTQKKD---QPFLMVLAPPAPHEPFIPAPRHNDKYKGTKAKRTKNFNTSSQ 240
Query: 137 PDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
DK W+ V R P+ D + KR ++L +VD+ V+K+ +LK L++TYI
Sbjct: 241 RDKHWL--VRRGPSPLPDSVVARLDDIYRKRWESLLAVDELVQKVYSQLKSQNLLDDTYI 298
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
I+TSD+G+H+GQF L K P+E DI++P ++RGP I
Sbjct: 299 IFTSDNGFHVGQFSLPYDKRQPYETDIKIPLMIRGPNIT 337
>gi|195334065|ref|XP_002033705.1| GM20282 [Drosophila sechellia]
gi|194125675|gb|EDW47718.1| GM20282 [Drosophila sechellia]
Length = 492
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 91/236 (38%), Positives = 129/236 (54%), Gaps = 12/236 (5%)
Query: 3 WPVELTSRNLLYFDISHGY----FGKYLNKYNGSY-IPPGWREWGALIMNSKYYNYSINM 57
W L R L Y HGY GKYLN+Y G+ +P GW + L NS+YYNY++
Sbjct: 100 WRRALEPRALPYILQQHGYNTFFGGKYLNQYWGAGDVPKGWNHFYGLHGNSRYYNYTLRE 159
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
N + Y + Y DL+ + + FLR + QS +P +++ PA H P AP++
Sbjct: 160 NAGNVH----YESTYLTDLLRDRAADFLRNATQS--SEPFFAMVAPPAAHEPFTPAPRHE 213
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F +V + TP+++ DK W+++ R++ D KR +TL +VD+ V
Sbjct: 214 GVFSHVEALRTPSFNQVKQ-DKHWLVRAARRLPNETINTIDKYFQKRWETLLAVDELVVT 272
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
+V L D L NTYIIYTSD+GYH+GQF K P+E DI VP L+RGPGI P
Sbjct: 273 LVGVLNDTESLENTYIIYTSDNGYHVGQFAQPFDKRQPYETDINVPLLIRGPGIAP 328
>gi|195403316|ref|XP_002060238.1| GJ19791 [Drosophila virilis]
gi|194141891|gb|EDW58302.1| GJ19791 [Drosophila virilis]
Length = 500
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 83/238 (34%), Positives = 134/238 (56%), Gaps = 9/238 (3%)
Query: 3 WPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMN 58
W R+L HGY GKYLN++ G+ +P GW E+ L NS+YYNY++ N
Sbjct: 103 WRTMFEPRSLPMLLQKHGYQTFFAGKYLNQFKGAEVPHGWDEFYGLHGNSRYYNYTLREN 162
Query: 59 GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYAN 118
+ + + + Y DL+ + + F+R S ++ KP ++S PA H P AP++
Sbjct: 163 TKNVS----FTDIYLTDLLRDKAAEFIRNSTRNRHSKPFFAMVSPPAAHEPFTPAPRHEG 218
Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
+F V + TP+++ AP+ +K W+++ + + D +R +TL +VD+ + +
Sbjct: 219 VFSQVRALRTPSFN-APDKNKHWLVRSAQHLSNDSVFTIDKYFQRRWETLLAVDEMMVTL 277
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
V L + L+NTYII+TSD+GYHLGQF K P++ DI+VP L+RGPG+ G +
Sbjct: 278 VTVLNETQCLDNTYIIFTSDNGYHLGQFAQPFDKRQPYDTDIKVPLLIRGPGVPAGRL 335
>gi|402886724|ref|XP_003906772.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylglucosamine-6-sulfatase
[Papio anubis]
Length = 551
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 92/220 (41%), Positives = 127/220 (57%), Gaps = 12/220 (5%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 146 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 205
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL K +F +P ++++ PAPH P +APQY F NV + ++
Sbjct: 206 DVLANVSLDFL-DYKSNF--EPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIH 262
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+K W++ Q M QF D KR QTL SVDD K++K+L L ++
Sbjct: 263 -GMNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDXXXKVLKQLP-YTXLQDSTP 320
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
Y SD GY+ GQF L K +EFDI+VP L+RGPGI P
Sbjct: 321 DYASDSGYYTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKP 360
>gi|297565681|ref|YP_003684653.1| sulfatase [Meiothermus silvanus DSM 9946]
gi|296850130|gb|ADH63145.1| sulfatase [Meiothermus silvanus DSM 9946]
Length = 501
Score = 150 bits (380), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 133/230 (57%), Gaps = 21/230 (9%)
Query: 23 GKYLNKY----------------NGSYIPPGWREWGALIMNSKY--YNYSINMNGRKIKH 64
GKYLN Y + +Y+PPGW EW A + N+ Y YNYS+N NG+ +++
Sbjct: 131 GKYLNGYGQDNPRRAQRQGLNLPDPTYVPPGWTEWYAGVGNAPYQDYNYSLNENGQLVRY 190
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
G DY D+IA +V+ + + + KP L ++ PH P + AP++A++F +
Sbjct: 191 GSSP-EDYLTDVIARKAVQAITNATRE--GKPFFLYLAPFTPHAPANFAPRHASLFKDAE 247
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
P +D A DK +L+ ++ L KRL++LQ++DD VE IV+ L+
Sbjct: 248 LPRPPNFDEADVSDKPPLLRRLPRLSERELARMRELYVKRLRSLQAIDDLVESIVQALRQ 307
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
G+L NTYI+YTSD+G+H+G + +GK+ P+E DIRVP ++RGPG+ G
Sbjct: 308 NGQLANTYIVYTSDNGFHMGNHRMPQGKNMPYEEDIRVPLVVRGPGVPAG 357
>gi|195583054|ref|XP_002081339.1| GD25763 [Drosophila simulans]
gi|194193348|gb|EDX06924.1| GD25763 [Drosophila simulans]
Length = 1772
Score = 150 bits (379), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 91/238 (38%), Positives = 130/238 (54%), Gaps = 12/238 (5%)
Query: 3 WPVELTSRNLLYFDISHGY----FGKYLNKYNGS-YIPPGWREWGALIMNSKYYNYSINM 57
W L R L Y HGY GKYLN+Y G+ +P GW + L NS+YYNY++
Sbjct: 60 WRRALEPRALPYILQQHGYNTFFGGKYLNQYWGAGDVPKGWNHFYGLHGNSRYYNYTLRE 119
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
N + Y + Y DL+ + + FLR + QS +P +++ PA H P AP++
Sbjct: 120 NTGNVH----YQSTYLTDLLRDRAADFLRNATQS--SEPFFAMVAPPAAHEPFTPAPRHE 173
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F +V + TP+++ DK W+++ R++ D KR +TL +VD+ V
Sbjct: 174 GVFSHVEALRTPSFNQVKQ-DKHWLVRAARRLPKETINTIDKYFQKRWETLLAVDELVVT 232
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
+V L D L NTYIIYTSD+GYH+GQF K P+E DI VP L+RGPGI P +
Sbjct: 233 LVGVLNDTQSLENTYIIYTSDNGYHVGQFAQPFDKRQPYETDINVPLLIRGPGIAPES 290
>gi|24653364|ref|NP_725289.1| CG18278 [Drosophila melanogaster]
gi|21627222|gb|AAG22274.2| CG18278 [Drosophila melanogaster]
gi|60677693|gb|AAX33353.1| SD21168p [Drosophila melanogaster]
gi|220952048|gb|ACL88567.1| CG18278-PA [synthetic construct]
Length = 492
Score = 150 bits (379), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 89/236 (37%), Positives = 129/236 (54%), Gaps = 12/236 (5%)
Query: 3 WPVELTSRNLLYFDISHGY----FGKYLNKYNGSY-IPPGWREWGALIMNSKYYNYSINM 57
W L R L Y HGY GKYLN+Y G+ +P GW + L NS+YYNY++
Sbjct: 100 WRRALEPRALPYILQQHGYNTFFGGKYLNQYWGAGDVPKGWNHFYGLHGNSRYYNYTLRE 159
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
N + Y + Y DL+ + + FLR + QS +P +++ PA H P AP++
Sbjct: 160 NSGNVH----YESTYLTDLLRDRAADFLRNATQS--SEPFFAMVAPPAAHEPFTPAPRHE 213
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F ++ + TP+++ DK W+++ R++ D KR +TL +VD+ V
Sbjct: 214 GVFSHIEALRTPSFNQVKQ-DKHWLVRAARRLPNETINTIDTYFQKRWETLLAVDELVVT 272
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
++ L D L NTYIIYTSD+GYH+GQF K P+E DI VP L+RGPGI P
Sbjct: 273 LMGVLNDTQSLENTYIIYTSDNGYHVGQFAQPFDKRQPYETDINVPLLIRGPGIAP 328
>gi|321470213|gb|EFX81190.1| hypothetical protein DAPPUDRAFT_50373 [Daphnia pulex]
Length = 506
Score = 150 bits (379), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 81/219 (36%), Positives = 127/219 (57%), Gaps = 13/219 (5%)
Query: 21 YFGKYLNKYNGS------YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN+Y + ++PPG+ W L+ NS+YYNY+++++G+ HG+DY+ DY
Sbjct: 123 YGGKYLNQYGKNETGGVEHVPPGYDRWVGLVGNSRYYNYTLSVDGQPRDHGDDYHLDYLT 182
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++ + L + +P+ V++ PA H P APQ+A F + P+++
Sbjct: 183 DVLVFIATSALNHGS---NNRPLFAVIAPPACHSPFTPAPQFARKFAGRRAPRWPSFNRH 239
Query: 135 PNPDKQWILQVTRKMQPVHRQFTDL---LMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
P DK W+++ P+ ++ D + +R +TL SVD + KIVK L G+L+ T
Sbjct: 240 PGLDKHWLVRQA-PQGPLPKEIIDQVDHIFRQRWRTLLSVDQMIGKIVKLLGWKGQLDRT 298
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
+++ TSDHGYHLGQF L K P+E D+RVP L+ PG
Sbjct: 299 WLVVTSDHGYHLGQFTLPVDKRLPYETDVRVPLLVIAPG 337
>gi|28317151|gb|AAO39590.1| LD10990p, partial [Drosophila melanogaster]
Length = 515
Score = 150 bits (379), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 89/236 (37%), Positives = 129/236 (54%), Gaps = 12/236 (5%)
Query: 3 WPVELTSRNLLYFDISHGY----FGKYLNKYNGSY-IPPGWREWGALIMNSKYYNYSINM 57
W L R L Y HGY GKYLN+Y G+ +P GW + L NS+YYNY++
Sbjct: 123 WRRALEPRALPYILQQHGYNTFFGGKYLNQYWGAGDVPKGWNNFYGLHGNSRYYNYTLRE 182
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
N + Y + Y DL+ + + FLR + QS +P +++ PA H P AP++
Sbjct: 183 NTGNVH----YESTYLSDLLRDRAADFLRNATQS--SEPFFAMVAPPAAHEPFTPAPRHE 236
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F ++ + TP+++ DK W+++ R++ D KR +TL +VD+ V
Sbjct: 237 GVFSHIEALRTPSFNQVKQ-DKHWLVRAARRLPNETINTIDTYFQKRWETLLAVDELVVT 295
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
++ L D L NTYIIYTSD+GYH+GQF K P+E DI VP L+RGPGI P
Sbjct: 296 LMGVLNDTQSLENTYIIYTSDNGYHVGQFAQPFDKRQPYETDINVPLLIRGPGIAP 351
>gi|19922168|ref|NP_610872.1| CG30059 [Drosophila melanogaster]
gi|16769828|gb|AAL29133.1| SD03412p [Drosophila melanogaster]
gi|21627223|gb|AAM68572.1| CG30059 [Drosophila melanogaster]
gi|220946552|gb|ACL85819.1| CG30059-PA [synthetic construct]
Length = 492
Score = 150 bits (378), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 89/236 (37%), Positives = 129/236 (54%), Gaps = 12/236 (5%)
Query: 3 WPVELTSRNLLYFDISHGY----FGKYLNKYNGSY-IPPGWREWGALIMNSKYYNYSINM 57
W L R L Y HGY GKYLN+Y G+ +P GW + L NS+YYNY++
Sbjct: 100 WRRALEPRALPYILQQHGYNTFFGGKYLNQYWGAGDVPKGWNNFYGLHGNSRYYNYTLRE 159
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
N + Y + Y DL+ + + FLR + QS +P +++ PA H P AP++
Sbjct: 160 NTGNVH----YESTYLSDLLRDRAADFLRNATQS--SEPFFAMVAPPAAHEPFTPAPRHE 213
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F ++ + TP+++ DK W+++ R++ D KR +TL +VD+ V
Sbjct: 214 GVFSHIEALRTPSFNQVKQ-DKHWLVRAARRLPNETINTIDTYFQKRWETLLAVDELVVT 272
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
++ L D L NTYIIYTSD+GYH+GQF K P+E DI VP L+RGPGI P
Sbjct: 273 LMGVLNDTQSLENTYIIYTSDNGYHVGQFAQPFDKRQPYETDINVPLLIRGPGIAP 328
>gi|195484992|ref|XP_002090907.1| GE13365 [Drosophila yakuba]
gi|194177008|gb|EDW90619.1| GE13365 [Drosophila yakuba]
Length = 492
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 88/236 (37%), Positives = 129/236 (54%), Gaps = 12/236 (5%)
Query: 3 WPVELTSRNLLYFDISHGY----FGKYLNKYNGSY-IPPGWREWGALIMNSKYYNYSINM 57
W L R L + HGY GKYLN+Y G+ +P GW + L NS+YYNY++
Sbjct: 100 WWRSLEPRALPHILQQHGYNTFFGGKYLNQYWGAGDVPKGWNHFYGLHGNSRYYNYTLRE 159
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
N + Y + Y DL+ + + FLR + QS +P +++ PA H P AP++
Sbjct: 160 NSGNVH----YESTYLTDLLKDRAADFLRNATQS--SEPFFAMVAPPAAHEPFTPAPRHE 213
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F ++ + TP+++ DK W+++ R++ D KR +TL +VD+ V
Sbjct: 214 GVFSHIEALRTPSFNQVKQ-DKHWLVRAARRLPNETINTIDKYFQKRWETLLAVDELVVT 272
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
+V L D L NTYIIYTSD+GYH+GQF K P+E DI +P L+RGPGI P
Sbjct: 273 LVGVLNDTQSLENTYIIYTSDNGYHVGQFAQPFDKRQPYETDINIPLLIRGPGIAP 328
>gi|170028855|ref|XP_001842310.1| N-acetylglucosamine-6-sulfatase [Culex quinquefasciatus]
gi|167877995|gb|EDS41378.1| N-acetylglucosamine-6-sulfatase [Culex quinquefasciatus]
Length = 472
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 82/220 (37%), Positives = 130/220 (59%), Gaps = 12/220 (5%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
Y GKYLN+Y+ +P GW +W L NS+YYNY++N NG +I+ Y ++Y DL+
Sbjct: 79 YAGKYLNEYHSDRVPSGWTQWYGLHGNSRYYNYTLNENG-QIRF---YQDEYLTDLLKTR 134
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP-DK 139
++ F+ + +K P +++ PAPH P +A ++ +F +V + T ++ P +K
Sbjct: 135 TLEFI--TSIDATKTPFFAMVAPPAPHAPFTAADRHQQLFPDVEALKTKNFNLPCGPLEK 192
Query: 140 QWILQVTRKMQPVHRQ---FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
W+L T P+ ++ + D + R ++L +VD+ VE ++ L+ LNNTYI YT
Sbjct: 193 HWLL--TMPPSPLSKKTLTYLDTIYRNRWRSLMAVDELVEAVISLLESKNLLNNTYIFYT 250
Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
SD+GYH+GQF L K P+E DIRVPFL+ GP I ++
Sbjct: 251 SDNGYHIGQFSLPYDKRQPYETDIRVPFLVAGPKIASKSL 290
>gi|271968605|ref|YP_003342801.1| N-acetylglucosamine-6-sulfatase [Streptosporangium roseum DSM
43021]
gi|270511780|gb|ACZ90058.1| N-acetylglucosamine-6-sulfatase [Streptosporangium roseum DSM
43021]
Length = 513
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 81/223 (36%), Positives = 127/223 (56%), Gaps = 16/223 (7%)
Query: 20 GYFGKYLNKYNG-----SYIPPGWREWGALIMNS-KYYNYSINMNGRKIKHGEDYYNDYY 73
G GKYLN Y G +Y+PPGW EW + + Y Y++N NG G DY
Sbjct: 128 GLMGKYLNHYPGGSAEPTYVPPGWDEWDVPVRKLYEEYGYTLNENGVLTTRGS-APEDYL 186
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
D+++ + F+ QS P L ++ APH P + AP+YAN F + + TP++D
Sbjct: 187 TDVLSQKAGAFVSQSPD-----PFFLYLAPTAPHNPANHAPRYANAFADAIAPRTPSFDQ 241
Query: 134 AP-NPDKQWILQVTR-KMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
A + + +W+ R + + + D +RL+++ VDD V ++K L+D G+L++T
Sbjct: 242 ADVSAEPRWLSSRPRFSARTIEK--IDERYRRRLRSMLGVDDMVGALIKTLRDTGKLDDT 299
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
YI + SD+G+HLGQ L +GK+ PF+ I+VP ++RGPG+ PG
Sbjct: 300 YIFFASDNGFHLGQHRLAQGKTTPFDESIKVPLVVRGPGVRPG 342
>gi|296269863|ref|YP_003652495.1| sulfatase [Thermobispora bispora DSM 43833]
gi|296092650|gb|ADG88602.1| sulfatase [Thermobispora bispora DSM 43833]
Length = 493
Score = 146 bits (369), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 84/222 (37%), Positives = 126/222 (56%), Gaps = 14/222 (6%)
Query: 20 GYFGKYLNKY----NGSYIPPGWREWGALIMNS-KYYNYSINMNGRKIKHGEDYYNDYYP 74
G GKYLN+Y + YIPPGW EW + + Y Y++N NGR K+GE DY
Sbjct: 134 GLLGKYLNQYPKGADRRYIPPGWDEWAVPVTRLYQEYGYALNDNGRIRKYGE-APEDYLE 192
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+++ + F+ P L ++ APH P + AP++A+ F + TP++D A
Sbjct: 193 DVLSAKANAFVS------GGGPFFLYLAPVAPHLPANHAPRHASAFAGEIAPRTPSFDQA 246
Query: 135 P-NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+ + +W+ + R R+ D + RL+ + +DD V ++ L+ G L++TYI
Sbjct: 247 DLSAEPRWLRERPRLTDRDIRRI-DRVHRNRLRAMLGIDDMVGALIAALERAGRLDDTYI 305
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
+TSD+G+HLGQ L GK+ PFE DIRVP ++RGPGI PGT
Sbjct: 306 FFTSDNGFHLGQHRLRPGKTTPFEEDIRVPLVVRGPGIAPGT 347
>gi|242014810|ref|XP_002428078.1| N-acetylglucosamine-6-sulfatase precursor, putative [Pediculus
humanus corporis]
gi|212512597|gb|EEB15340.1| N-acetylglucosamine-6-sulfatase precursor, putative [Pediculus
humanus corporis]
Length = 555
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 84/219 (38%), Positives = 131/219 (59%), Gaps = 11/219 (5%)
Query: 21 YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN+Y ++P GW W L+ NS YYNY++++NG K++ G +DY
Sbjct: 157 YAGKYLNRYGYKSSGGVGHVPKGWDWWIGLVGNSAYYNYTLSVNGSKVRRGTQE-SDYLT 215
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+I S++FL + Q+F+ + +M + P+PH P A ++ + + + TP ++
Sbjct: 216 DVIKEYSLKFLNEH-QTFNNEFLMFINP-PSPHEPFTPAKRHLGKYDKIKAPRTPNFNKL 273
Query: 135 PNPD-KQWILQVTRKMQPVHR-QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
+ + K W+L++ P + + D + R +TL SVD+ VE I+ +LK L LN TY
Sbjct: 274 TDKNGKHWLLRLPPNELPSNVIERIDTIYRNRWETLLSVDELVETIIMKLKKLNILNETY 333
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
II+TSD+G+H+GQF L K P+E DIRVP ++RGPGI
Sbjct: 334 IIFTSDNGFHMGQFELPWDKRQPYETDIRVPLMIRGPGI 372
>gi|270008450|gb|EFA04898.1| hypothetical protein TcasGA2_TC014962 [Tribolium castaneum]
Length = 482
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 83/226 (36%), Positives = 132/226 (58%), Gaps = 14/226 (6%)
Query: 21 YFGKYLNKYNGS-----YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
Y GKYLN+Y S ++PPG+ W L NSKYYNY++++NG G + DY D
Sbjct: 116 YAGKYLNQYGKSGKGVKHVPPGYDWWLGLKGNSKYYNYTLSINGS----GHFFEKDYLTD 171
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
I ++ FL Q+ + ++++ PA H P A ++ +F ++ + TP ++ P
Sbjct: 172 KITKYALDFLNQTDEG----NFFMMLAPPACHAPFTPADRHRRLFPDLETLKTPPFNATP 227
Query: 136 NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
+ DK WI+ + P + Q D + R++TLQSVD+ V+ ++ +L+++ L+NTY I
Sbjct: 228 S-DKHWIVAMPPMSLPQNVQILDEIYKNRIRTLQSVDEMVQALITKLQEIRVLDNTYFIV 286
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
TSD+G+H+GQF K P+E DIRVPF++RGP I T+ + V
Sbjct: 287 TSDNGFHIGQFTQPWDKRQPYESDIRVPFMIRGPNIRKKTVSEVSV 332
>gi|91085257|ref|XP_973432.1| PREDICTED: similar to sulfatase [Tribolium castaneum]
Length = 475
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 83/226 (36%), Positives = 132/226 (58%), Gaps = 14/226 (6%)
Query: 21 YFGKYLNKYNGS-----YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
Y GKYLN+Y S ++PPG+ W L NSKYYNY++++NG G + DY D
Sbjct: 116 YAGKYLNQYGKSGKGVKHVPPGYDWWLGLKGNSKYYNYTLSINGS----GHFFEKDYLTD 171
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
I ++ FL Q+ + ++++ PA H P A ++ +F ++ + TP ++ P
Sbjct: 172 KITKYALDFLNQTDEG----NFFMMLAPPACHAPFTPADRHRRLFPDLETLKTPPFNATP 227
Query: 136 NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
+ DK WI+ + P + Q D + R++TLQSVD+ V+ ++ +L+++ L+NTY I
Sbjct: 228 S-DKHWIVAMPPMSLPQNVQILDEIYKNRIRTLQSVDEMVQALITKLQEIRVLDNTYFIV 286
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
TSD+G+H+GQF K P+E DIRVPF++RGP I T+ + V
Sbjct: 287 TSDNGFHIGQFTQPWDKRQPYESDIRVPFMIRGPNIRKKTVSEVSV 332
>gi|167538032|ref|XP_001750682.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770816|gb|EDQ84495.1| predicted protein [Monosiga brevicollis MX1]
Length = 1428
Score = 145 bits (367), Expect = 1e-32, Method: Composition-based stats.
Identities = 78/195 (40%), Positives = 107/195 (54%), Gaps = 10/195 (5%)
Query: 50 YYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
YY+Y+I+ NG+ KHG+D DY P ++ ++ FL S P ++++ P+ H P
Sbjct: 753 YYHYTISNNGQAEKHGDDPEADYLPKVLQRKAMAFLENVTSSDDNAPFFMMVATPSCHDP 812
Query: 110 EDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVH--------RQFTDLLM 161
+ A +YA + N T+ TP Y + DK W + H + DL
Sbjct: 813 TEPAAEYAGLMVNATAPRTPNYGGFAS-DKHW-YAAQQNCAAGHCDGYTKQQEAYNDLQY 870
Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
+R TL SVDD V I L+ L+NTYI Y+SDHGYHLGQF L+K K P+E DIR
Sbjct: 871 RRRALTLMSVDDIVGNITFYLESKRLLDNTYIFYSSDHGYHLGQFALMKDKRNPYESDIR 930
Query: 222 VPFLMRGPGIVPGTM 236
+P L RGPGI PGT+
Sbjct: 931 IPGLARGPGIAPGTI 945
>gi|194883353|ref|XP_001975767.1| GG22495 [Drosophila erecta]
gi|190658954|gb|EDV56167.1| GG22495 [Drosophila erecta]
Length = 492
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 88/236 (37%), Positives = 127/236 (53%), Gaps = 12/236 (5%)
Query: 3 WPVELTSRNLLYFDISHGY----FGKYLNKYNGSY-IPPGWREWGALIMNSKYYNYSINM 57
W L R L HGY GKYLN+Y G+ +P GW + L NS+YYNY++
Sbjct: 100 WRRALEPRALPQVLQQHGYNTFFGGKYLNQYWGAGDVPKGWNHFYGLHGNSRYYNYTLRE 159
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
N + Y + Y DL+ + + FLR + QS +P +++ PA H P AP++
Sbjct: 160 NTGNVH----YESTYLTDLLRDRAADFLRNATQS--AEPFFAMVAPPAAHEPFTPAPRHE 213
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+F +V + TP+++ DK W+++ ++ D KR +TL +VD+ V
Sbjct: 214 GVFSHVEALRTPSFNQVKQ-DKHWLVRAASRLPNETISTIDKYFQKRWETLLAVDELVVT 272
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
+V L D L NTYIIYTSD+GYH+GQF K P+E DI +P L+RGPGI P
Sbjct: 273 LVGVLNDTQSLENTYIIYTSDNGYHVGQFAQPFDKRQPYETDINIPLLIRGPGIAP 328
>gi|307206693|gb|EFN84648.1| N-acetylglucosamine-6-sulfatase [Harpegnathos saltator]
Length = 521
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 81/224 (36%), Positives = 134/224 (59%), Gaps = 12/224 (5%)
Query: 21 YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN+Y +++P GW W L+ NSKYY+YS+++NG +G +DY
Sbjct: 98 YAGKYLNEYGNNKVGGAAHVPVGWDWWAGLLGNSKYYDYSLSINGTGRLYGNGS-SDYLT 156
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+I+N +V F+ +K + + +P ++V++ PAPH P A ++ + + + TP ++
Sbjct: 157 DVISNLAVDFI--NKHTDNTQPFLMVLAPPAPHAPFTPADRHNHKYNGTKAKRTPNFNVP 214
Query: 135 PNPDKQWILQVTRKMQP--VHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
+ +K W+++ P V + D+ ++R +TL +VD+ VE + + LK L++TY
Sbjct: 215 VHENKHWLVRNGPTPLPDDVLPRLDDIY-SRRWETLLAVDELVETVHESLKRRNLLHDTY 273
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
IYTSD+GYH+GQF + K +E DIRVP ++ GPGI P T+
Sbjct: 274 FIYTSDNGYHIGQFSMPMDKRQLYETDIRVPLMISGPGIAPATI 317
>gi|332023429|gb|EGI63672.1| N-acetylglucosamine-6-sulfatase [Acromyrmex echinatior]
Length = 465
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 86/240 (35%), Positives = 135/240 (56%), Gaps = 36/240 (15%)
Query: 21 YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN+Y +++P GW W LI NSKYYNYS+++NG + K+G D NDY
Sbjct: 82 YAGKYLNQYGKKETGGAAHVPLGWDWWAGLIGNSKYYNYSLSINGTERKYG-DSSNDYLT 140
Query: 75 DLIA---------NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS 125
D+I N +V F+ K+ +P +++++ PAPHGP A ++ + + +
Sbjct: 141 DVIVILYIYIYIYNLAVNFI---KEYSGDQPFLMMLAPPAPHGPFIPAVRHKDKYIGTKA 197
Query: 126 HHTPAYDYAPNPDKQWILQ---------VTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVE 176
TP ++ N DK W+++ + K+ ++R +R +TL +VD+ ++
Sbjct: 198 KRTPNFNIPVNQDKHWLVRKGPIPLPDDILPKLDDIYR--------RRWETLLAVDELIK 249
Query: 177 KIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
I L++ L++TY IYTSD+GYH+GQF + K P+E DIRVP L+ GPGI T+
Sbjct: 250 NIHDLLEERNLLDDTYFIYTSDNGYHVGQFSMPIDKRQPYETDIRVPLLISGPGIELSTV 309
>gi|323450161|gb|EGB06044.1| hypothetical protein AURANDRAFT_29931, partial [Aureococcus
anophagefferens]
Length = 410
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 80/222 (36%), Positives = 120/222 (54%), Gaps = 13/222 (5%)
Query: 21 YFGKYLNKY-NG-----SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN Y NG ++PPGW +W L+ NS +Y+Y +++NG + HG DY
Sbjct: 137 YAGKYLNMYGNGKMGGPGHVPPGWDDWHGLLGNSVFYDYDLSVNGVREAHGT-APEDYLT 195
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
DL+A +V FL + P ++ PA H P + P + + TPA++ A
Sbjct: 196 DLVARRAVNFLEAAA-----APWFAILGTPASHDPFEPRPDHFGALDGASRPRTPAWN-A 249
Query: 135 PNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
DK I++ ++ + D +RL+ L SVD+A+ + + GE + TY I
Sbjct: 250 SAADKHAIVRRQGFLKAPVADYIDRTYRRRLECLLSVDEAMNATLAAVAARGEADATYTI 309
Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+TSDHG+HLG+FGL K +EFD RVPF++ GPG+ G +
Sbjct: 310 FTSDHGWHLGEFGLNYDKRQLYEFDTRVPFVLAGPGVQEGVV 351
>gi|390337306|ref|XP_003724525.1| PREDICTED: extracellular sulfatase Sulf-2-like [Strongylocentrotus
purpuratus]
Length = 935
Score = 141 bits (356), Expect = 3e-31, Method: Composition-based stats.
Identities = 64/107 (59%), Positives = 81/107 (75%)
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
TPA++Y PN DK WI++ +M + R FTD L KRL TL SVDDA+EK+ +EL+ LGE
Sbjct: 211 TPAWNYGPNEDKHWIIRQASRMTKLERDFTDALQRKRLITLLSVDDAIEKLCRELEALGE 270
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L+NTYI+ TSDHGYHLGQF ++KGKS ++ DIRVPF +RGP I G
Sbjct: 271 LDNTYILMTSDHGYHLGQFNMLKGKSNAYDTDIRVPFYIRGPDIPHG 317
>gi|322790248|gb|EFZ15247.1| hypothetical protein SINV_08807 [Solenopsis invicta]
Length = 499
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 82/225 (36%), Positives = 126/225 (56%), Gaps = 15/225 (6%)
Query: 21 YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN+Y +++P GW W LI NSKYY+YS+++NG + K+G + +DY
Sbjct: 118 YAGKYLNQYGNEKVGGAAHVPVGWDWWAGLIGNSKYYDYSLSINGTERKYG-NSSSDYLT 176
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+I N +V F+ +P +LV++ PAPH P A ++ + + TP ++
Sbjct: 177 DVINNLAVDFIEGHS---GDQPFLLVLAPPAPHAPFTPADRHDGKYNGTKAKRTPNFNVP 233
Query: 135 PNPDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
+ DK W+ V + P+ D + +R + L +VD+ V+ + L+ L +T
Sbjct: 234 IHQDKHWL--VRKGPTPLPDDILPKLDEIYRRRWEALLAVDELVKNVHDLLEKRNLLKDT 291
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
Y IYTSD+GYH+GQF + K P+E DIRVP L+ GPGI T+
Sbjct: 292 YFIYTSDNGYHIGQFSMPMDKRQPYETDIRVPLLISGPGIESSTI 336
>gi|403731455|ref|ZP_10949313.1| putative sulfatase [Gordonia rhizosphera NBRC 16068]
gi|403202234|dbj|GAB93644.1| putative sulfatase [Gordonia rhizosphera NBRC 16068]
Length = 531
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 84/220 (38%), Positives = 121/220 (55%), Gaps = 9/220 (4%)
Query: 20 GYFGKYLNKYNGS--YIPPGWREWGALIMNSKY--YNYSINMNGRKIKHGEDYYNDYYPD 75
G GKYLN S +IPPGW +W A + S Y Y Y++N NGR + +G +DY D
Sbjct: 161 GMAGKYLNGLENSPAHIPPGWDDWNAGVTLSLYTGYGYTLNENGRMVPYGI-RPDDYETD 219
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
++A SV F+R S + S KP + APH P +AP++ +HH P + A
Sbjct: 220 VVARKSVDFIRAS--AHSGKPFFWYAASTAPHFPLPAAPRHLASSPWTPAHHLPNFQEAD 277
Query: 136 NPDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
DK W++ DL T RL +L+++D+ VE IV EL G+L+NTY++
Sbjct: 278 VSDKPSWLIDTAGVRAATIAATNDLDNTNRLGSLKALDEMVEAIVGELSRTGQLDNTYLV 337
Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+TSD+GY LG L + K P+E IR+P ++ GPG+ P
Sbjct: 338 FTSDNGYSLGAHRLTQ-KMVPYEESIRIPLVITGPGVRPA 376
>gi|357631807|gb|EHJ79275.1| sulfatase [Danaus plexippus]
Length = 496
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 80/220 (36%), Positives = 123/220 (55%), Gaps = 23/220 (10%)
Query: 21 YFGKYLNKYN------GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN+Y +PPGW EW L+ NS YYNY+I+ NG + Y D
Sbjct: 118 YAGKYLNEYGVHASGGPEQVPPGWSEWHGLVGNSVYYNYTISNNGVPTFSTDLYLTDIIR 177
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
DL S+ ++ +S +P ++V++ PAPH P A ++ ++ N T TP ++ A
Sbjct: 178 DL----SLNYIENQTES---RPFLMVLAPPAPHQPATPAERHRGVYDNTTVLKTPNFNIA 230
Query: 135 PNPDKQWILQ-----VTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
+ +K W+++ + K+ P D + R ++L +VD+ V +V+ L G L
Sbjct: 231 DD-NKHWLIRMPPSPLPEKIMPE----LDRVYRSRWESLLAVDEMVADVVESLDSSGLLQ 285
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
NTY+I+TSD+GYH+GQF V K P+E D++VP L+RGP
Sbjct: 286 NTYLIFTSDNGYHIGQFSQVYDKRQPYEADVKVPLLIRGP 325
>gi|108804933|ref|YP_644870.1| sulfatase [Rubrobacter xylanophilus DSM 9941]
gi|108766176|gb|ABG05058.1| sulfatase [Rubrobacter xylanophilus DSM 9941]
Length = 484
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 80/218 (36%), Positives = 124/218 (56%), Gaps = 8/218 (3%)
Query: 21 YFGKYLNKYNGS--YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIA 78
YFGKY+N Y S +PPGW EW A+ N Y + N NGR + YND DLIA
Sbjct: 136 YFGKYMNGYGRSETRVPPGWDEWHAVAGN--YLSSWYNDNGRVRYYSPALYNDT--DLIA 191
Query: 79 NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
+ +LR++ + P +V++ APH P P+YA+ F P++D D
Sbjct: 192 EKATSYLRRT--AGRGAPFFVVLAPRAPHQPAVPPPRYADAFPEAPLPRGPSFDERDVSD 249
Query: 139 KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
K ++ ++ +F L +RL+++ +V+D VE++++ L++ G+L NTYI +TSD
Sbjct: 250 KPRWVRDNPRLGRKKLEFLGSLYRRRLRSMLAVEDLVERLLRTLRESGQLENTYIFFTSD 309
Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+G+H+G L +GK +E DIRVP L+RGPG+ G +
Sbjct: 310 NGFHMGHHRLPEGKWTAYEEDIRVPLLVRGPGVPEGRV 347
>gi|108805892|ref|YP_645829.1| sulfatase [Rubrobacter xylanophilus DSM 9941]
gi|108767135|gb|ABG06017.1| sulfatase [Rubrobacter xylanophilus DSM 9941]
Length = 492
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 77/219 (35%), Positives = 126/219 (57%), Gaps = 8/219 (3%)
Query: 20 GYFGKYLNKY-NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIA 78
G+ GKYLN Y +++PPGW W Y+++++N NGR + + + Y D++
Sbjct: 142 GFVGKYLNGYLRTTHVPPGWDRWYGF-NGGGYHDFTLNENGRNVSYRGP--SSYQTDVLG 198
Query: 79 NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
++ F+R + + +P L +S APHGP + AP++A +F P++D D
Sbjct: 199 RKALGFVRWAARR--DRPFFLHLSPWAPHGPAEPAPRHARLFARTPLPRPPSFDERDVSD 256
Query: 139 K-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
K +W+ R + R+ L RL+TL++VD+ V ++V L++ G+L NTYI +TS
Sbjct: 257 KPRWVRDNPRLGREEVREMGRLYRN-RLRTLRAVDELVGRLVAALRESGQLENTYIFFTS 315
Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
D+G+H+G L +GK +E DIRVP L+RGPG+ G +
Sbjct: 316 DNGFHMGHHRLPEGKWTAYEEDIRVPLLVRGPGVPEGRV 354
>gi|345314989|ref|XP_003429575.1| PREDICTED: extracellular sulfatase Sulf-2-like, partial
[Ornithorhynchus anatinus]
Length = 107
Score = 136 bits (343), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 60/107 (56%), Positives = 84/107 (78%)
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
DY DLI NDSV F R SK+++ +P+++V+S APHGPEDSAPQY+ +F N + H TP+
Sbjct: 1 DYLTDLITNDSVSFFRTSKRTYPHRPVLMVISHAAPHGPEDSAPQYSQLFPNASQHITPS 60
Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
Y+YAP+PDK WI++ T M+P+H +FT++L KRLQTL SVDD+++K
Sbjct: 61 YNYAPSPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMDK 107
>gi|434398901|ref|YP_007132905.1| N-acetylglucosamine-6-sulfatase [Stanieria cyanosphaera PCC 7437]
gi|428269998|gb|AFZ35939.1| N-acetylglucosamine-6-sulfatase [Stanieria cyanosphaera PCC 7437]
Length = 508
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 78/226 (34%), Positives = 126/226 (55%), Gaps = 15/226 (6%)
Query: 20 GYFGKYLNKY----NGSYIPPGWREWGALIMNSKY--YNYSINMNGRKIKHG---EDYYN 70
+ GKYLN Y +Y+PPGW EW + + S Y Y+Y++N NG+ + +G +DY N
Sbjct: 135 AFIGKYLNLYPYTSKMNYVPPGWDEWDSPVHGSAYVQYDYTVNENGKLVHYGNQPQDYGN 194
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
D Y N + +FL Q ++ +P L +S+ APH P AP++ N+F + A
Sbjct: 195 DVY----TNKAKQFLTQVAKT--NQPFFLFLSYYAPHQPATPAPRHQNLFPDAQVPRRQA 248
Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
++ DK L+ + ++ D L KRL++LQ+VD+ + + L+ +L+N
Sbjct: 249 FNEIDVSDKPQYLRELPLLNQEQQEKIDYLHQKRLRSLQAVDEGLATLYNTLQATNQLDN 308
Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
TYI ++SD+G+HLGQ L+ GK +E DI +P L+R P + G +
Sbjct: 309 TYIFFSSDNGFHLGQHRLLPGKETAYEEDIHLPLLVRSPHVPAGKV 354
>gi|345013357|ref|YP_004815711.1| sulfatase [Streptomyces violaceusniger Tu 4113]
gi|344039706|gb|AEM85431.1| sulfatase [Streptomyces violaceusniger Tu 4113]
Length = 506
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 75/226 (33%), Positives = 126/226 (55%), Gaps = 18/226 (7%)
Query: 20 GYFGKYLNKY-----NGS---YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYND 71
G+ GKYLN Y NG+ Y+P GW EW K Y Y +N NG+ + +G+ D
Sbjct: 143 GFMGKYLNGYQPADRNGTSKPYVPSGWDEWDVAGDGYKEYKYDLNENGKVVPYGK-APQD 201
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
Y D+++ + F+ S +KKP ML ++ APHGP AP+ + F + + T AY
Sbjct: 202 YLTDVLSKKATSFIGSSAD--AKKPFMLEVATFAPHGPSTPAPRDKDKFPGLKAPRTDAY 259
Query: 132 DYAPNPDKQWILQVTRKMQPV---HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
D A +W R + P+ ++ TD KR++++Q+VD + ++ K L++ G
Sbjct: 260 DKATRNAPKW----QRALSPLTAKEKEETDQKFAKRVRSVQAVDAMIGQLEKTLEEKGLA 315
Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
++TY+++ SD+G+H+G+ L GK ++ D++VP ++ GPG+ G
Sbjct: 316 DDTYLVFGSDNGFHMGEHRLRPGKQTAYDTDVKVPMMVTGPGVPAG 361
>gi|326917732|ref|XP_003205150.1| PREDICTED: extracellular sulfatase Sulf-1-like, partial [Meleagris
gallopavo]
Length = 242
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 58/105 (55%), Positives = 79/105 (75%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGKYLN+YNGSYI WREW L+ NS++YNY+I+ NG K KHG DY DY+ DLI N
Sbjct: 138 AFFGKYLNEYNGSYIXXXWREWVGLVKNSRFYNYTISRNGNKEKHGFDYAKDYFTDLITN 197
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
+S+ + R SK+ + +PIM+V+S APHGPEDSAPQ++ ++ N +
Sbjct: 198 ESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSELYPNAS 242
>gi|302548207|ref|ZP_07300549.1| arylsulfatase [Streptomyces hygroscopicus ATCC 53653]
gi|302465825|gb|EFL28918.1| arylsulfatase [Streptomyces himastatinicus ATCC 53653]
Length = 507
Score = 133 bits (334), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 72/223 (32%), Positives = 124/223 (55%), Gaps = 12/223 (5%)
Query: 20 GYFGKYLNKY-----NGS---YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYND 71
G+ GKYLN Y NG+ Y+P GW EW + ++Y +N NG+ + +G+D D
Sbjct: 144 GFMGKYLNGYQPGDKNGTSRPYVPSGWDEWNVAGDGYQEFDYDLNENGKVVPYGDD-PED 202
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
Y D+++ + F+ S + +KKP ML ++ APHGP AP+ + F + + T AY
Sbjct: 203 YLTDVLSRKATSFVDSSAK--AKKPFMLEVATFAPHGPSTPAPRDEDTFPGLKAPRTDAY 260
Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
D A +W Q + ++ D +R++++Q+VD + ++ L+ G ++T
Sbjct: 261 DEATRNAPEW-QQALSPLTSKEKKRIDRKFAQRVRSVQAVDKMIGRLKSTLEAKGLADDT 319
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
YI++ SD+G+H+G+ L GK ++ D+RVP ++ GPG+ G
Sbjct: 320 YIMFGSDNGFHMGEHRLRPGKQTAYDTDVRVPMMVTGPGVPSG 362
>gi|167536101|ref|XP_001749723.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771871|gb|EDQ85532.1| predicted protein [Monosiga brevicollis MX1]
Length = 493
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 113/222 (50%), Gaps = 48/222 (21%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN Y ++IP GW+EW AL+ NSKYY+YS++ NGR+ G+DY DYY
Sbjct: 130 YSGKYLNTYGAPSVGGVAHIPDGWKEWHALVGNSKYYDYSVSDNGRQRNFGQDYATDYYT 189
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++ N+SV +L + P ++++ P PH YAN+F ++T+ TP ++ A
Sbjct: 190 DVLKNESVAWL--THNWTQDDPFFMMVATPCPH--------YANLFPDMTAPRTPNWNKA 239
Query: 135 PNPD-----KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
P K W+++ M+P H +D R + L
Sbjct: 240 PAASSGPGAKHWLMRQFGPMEPQHIINSDSTFVDRWRCLH-------------------- 279
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
SDHG HLGQFG+ K +E DIRVP ++RGPGI
Sbjct: 280 -------SDHGQHLGQFGMEFDKRQLYETDIRVPLVVRGPGI 314
>gi|134101323|ref|YP_001106984.1| sulfatase [Saccharopolyspora erythraea NRRL 2338]
gi|291008139|ref|ZP_06566112.1| sulfatase [Saccharopolyspora erythraea NRRL 2338]
gi|133913946|emb|CAM04059.1| sulfatase [Saccharopolyspora erythraea NRRL 2338]
Length = 504
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/223 (30%), Positives = 120/223 (53%), Gaps = 11/223 (4%)
Query: 21 YFGKYLNKY------NGS--YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDY 72
+ GKY+N Y +G+ Y+PPGW W K Y+Y +N NG+ + HG + DY
Sbjct: 143 FMGKYMNGYRPDGTVDGTPNYVPPGWNTWAVAGDGYKQYDYQLNENGQVVDHGHAPH-DY 201
Query: 73 YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD 132
D++ F+R + + + +P L ++ +PHGP AP++ +F +T+ PA++
Sbjct: 202 LTDVLNRKGTDFIRSN--AAANRPFYLQLNTFSPHGPSTPAPRHEGLFPGLTAPRGPAFN 259
Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
A DK L+ + + D R Q++QS++D + I L+ G+ +NTY
Sbjct: 260 EADMSDKPGWLRGMPPLDQKQQAKIDEQYRLRAQSIQSINDMLADIRDTLQQTGQADNTY 319
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
++++SD+G+H+G+ L GK F+ D+ VP + GPG+ G+
Sbjct: 320 VVFSSDNGFHMGEHRLRNGKQTAFQTDVVVPLTITGPGVPAGS 362
>gi|443287713|ref|ZP_21026808.1| Arylsulfatase [Micromonospora lupini str. Lupac 08]
gi|385885974|emb|CCH18154.1| Arylsulfatase [Micromonospora lupini str. Lupac 08]
Length = 497
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 74/225 (32%), Positives = 118/225 (52%), Gaps = 12/225 (5%)
Query: 20 GYFGKYLNKY------NGS--YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYND 71
G+ GKYLN Y G+ Y+PPGW EW YNY +N N + +G D
Sbjct: 134 GFLGKYLNGYLPADTQGGALPYVPPGWSEWYVAGNGYPEYNYDLNENHTVVHYGSQPA-D 192
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
Y ++++ F+++S + + P +L +S APH P A Q + F +T+ T AY
Sbjct: 193 YLTNVLSRKGKDFIQRS--AAANTPFLLEVSTFAPHSPFTPANQDLDDFPGLTAPRTAAY 250
Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
D P W L + + D KR Q++Q+VD + + +L G +NT
Sbjct: 251 DRLPTNPPSW-LAGNGPLTAQEKSTLDTNFRKRAQSVQAVDRLIGNLRAQLTTSGIADNT 309
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
Y++++SD+G+H+G++ L GK F+ D+RVP ++ GPG+VPG+
Sbjct: 310 YVVFSSDNGFHMGEYRLTGGKQTAFDTDVRVPLVVAGPGVVPGSQ 354
>gi|297623492|ref|YP_003704926.1| sulfatase [Truepera radiovictrix DSM 17093]
gi|297164672|gb|ADI14383.1| sulfatase [Truepera radiovictrix DSM 17093]
Length = 522
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/228 (32%), Positives = 124/228 (54%), Gaps = 25/228 (10%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALI------MNSKY--YNYSINMNGRKIKHGEDYYND 71
GY G +N Y+PPGW EW I NS Y Y+Y IN NGR I+H D
Sbjct: 157 GYRGPRVN-----YVPPGWSEWYGFIDVPRDPRNSPYAMYDYKINANGR-IRHYGSRPQD 210
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
Y D++A +V F+R+S + + P L+++ APH P AP++A+ F ++ + P++
Sbjct: 211 YQTDVLAGLAVDFVRRSAR---RGPFFLLLAPTAPHLPAVPAPRHADAFRDLQAPRPPSF 267
Query: 132 DYAPNPDKQWILQVTRKMQPVHRQ---FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
+ P +W+ ++ P+ D + Q L +VD+ + ++ L++ G L
Sbjct: 268 NADPQGKARWL-----ELPPLSDADIAAIDATYRAQAQMLLAVDELLGSLLDALRETGTL 322
Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+NTY+++TSDHG+H G+ L K P++ +R+P ++RGPGI G +
Sbjct: 323 DNTYLVFTSDHGWHSGEHRLFHLKLTPYDASVRLPLVIRGPGIPKGEV 370
>gi|383776154|ref|YP_005460720.1| putative sulfatase [Actinoplanes missouriensis 431]
gi|381369386|dbj|BAL86204.1| putative sulfatase [Actinoplanes missouriensis 431]
Length = 508
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 77/237 (32%), Positives = 120/237 (50%), Gaps = 18/237 (7%)
Query: 20 GYFGKYLNKY--------NGSYIPPGWREWGALIMNSKY--YNYSINMNGRKIKHGEDYY 69
G+FGKYLN+Y Y+PPGW +W + + Y +NY++N NG K+GE
Sbjct: 145 GFFGKYLNEYLPATSFGGGKPYVPPGWDQW--FVGGNAYDNFNYALNENGTVKKYGEKPA 202
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
DY D++A + F+ S + + P ML ++ APH P AP+ AN+F + + T
Sbjct: 203 -DYLTDVLAARTTDFITTSAATGT--PFMLEVATYAPHAPYTPAPRDANLFPGLKAPRTA 259
Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
AYD P W L K+ + D KR Q++ SVD + + L G +
Sbjct: 260 AYDKVPADAPAW-LAPHPKLNAQQKARMDQEFRKRAQSVVSVDRMLGALRSTLTKAGVAD 318
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
T +I++SD+GYH+G+ L GK F+ D+ VP + G G+ G +D ++ +
Sbjct: 319 ETVVIFSSDNGYHMGEHSLPSGKQTAFDTDVNVPLIAAGHGVKAGQT--VDAVVANI 373
>gi|340372047|ref|XP_003384556.1| PREDICTED: extracellular sulfatase Sulf-2-like [Amphimedon
queenslandica]
Length = 497
Score = 123 bits (308), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 78/247 (31%), Positives = 132/247 (53%), Gaps = 17/247 (6%)
Query: 20 GYFGKYLNKYNG--------SYIP-PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYN 70
G FGK LN NG + +P PG+ + + +++Y+ + NG ++ G+D
Sbjct: 124 GQFGKLLNP-NGVAAYCAKTNAVPVPGFDSYLTMCNDNRYFGNTFTSNGTFVEKGKDP-E 181
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
DY +I N S+ F+ S Q+ +P +S APH P AP YA+ F + + TP
Sbjct: 182 DYLTSVIGNHSLDFIEASLQA--GRPFFAYISPHAPHVPATPAPWYADKFSDKQAPRTPN 239
Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
Y+Y+ D ++++ + +D L R ++L SVDD + +V L+ +++N
Sbjct: 240 YNYSA-ADHHYVIRSQPILTSKEGNESDALFRDRWRSLLSVDDITKDVVSLLEKYDQVDN 298
Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI---PQVR 247
TYI++TSDHG+ LGQF L K P++ DIRVPF ++GPG+ G+ + ++ P +
Sbjct: 299 TYILWTSDHGFQLGQFRLPSCKLQPYDHDIRVPFRIKGPGLKQGSHSFVASMVDVAPTII 358
Query: 248 KFSSGSL 254
+ G++
Sbjct: 359 ELGGGTV 365
>gi|284042770|ref|YP_003393110.1| sulfatase [Conexibacter woesei DSM 14684]
gi|283946991|gb|ADB49735.1| sulfatase [Conexibacter woesei DSM 14684]
Length = 498
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/224 (29%), Positives = 120/224 (53%), Gaps = 14/224 (6%)
Query: 22 FGKYLNKYNGS---------YIPPGWREWGALIMNSKYYNYSINM-NGRKIKHGEDYYND 71
GKY+N+Y+ Y+PPGW +W + + Y Y + +G ++ D
Sbjct: 131 MGKYMNQYSPGRVRDALGRPYVPPGWTDW--RVAGNGYPGYGYRLTSGEGVERRGHAAQD 188
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
Y D++ D++ F+ S + +P +L +S APH P AP+ + F N + T ++
Sbjct: 189 YLTDVLRRDAIGFV--SGAVAAGQPFLLQLSTFAPHTPATPAPRDEDRFGNAMAPRTASF 246
Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
D A DK L+ + + D L +R++++Q+VD A+ ++ ++L+ LG +T
Sbjct: 247 DEADLSDKPRWLRGHPPLTAAQQWRIDDLFRERVRSVQAVDRAIGRLREQLRRLGVARST 306
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
Y++++SD+G+H+GQ L GK ++ D+RVP ++ GPG+ G
Sbjct: 307 YVVFSSDNGFHMGQHRLTPGKLTAYDADVRVPLIVAGPGVPAGA 350
>gi|340385547|ref|XP_003391271.1| PREDICTED: extracellular sulfatase Sulf-2-like [Amphimedon
queenslandica]
Length = 491
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 78/253 (30%), Positives = 130/253 (51%), Gaps = 13/253 (5%)
Query: 11 NLLYFDISHGYFGKYLN---KYNGSYIP---PGWREWGALIMNSKYYNYSINMNGRKIKH 64
NL + G FGK LN +Y P PG+ + + ++Y+ + + NG K
Sbjct: 111 NLQSLGYTTGQFGKLLNGMKEYCSEKDPLRLPGFDSYLTMCNYARYFENTFSSNGTLYKT 170
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
G DY +I N S+ F+ S Q+ +P +S APH P AP YA+ F ++
Sbjct: 171 GTKP-EDYLTSVIGNHSLDFIEASLQA--GRPFFAYISPHAPHVPATPAPWYADKFSDMK 227
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
+ TP Y+Y+ D ++++ + +D L R ++L SVDD + +V L+
Sbjct: 228 APRTPNYNYSAT-DHHYVIRSQPILTSDEGDISDALFRDRWRSLLSVDDITKDVVSLLEK 286
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI- 243
+ +NTYI++TSDHGY LGQF L K P++ D++VPF ++GPG+ G+ + ++
Sbjct: 287 YDQADNTYILWTSDHGYQLGQFRLPSCKLQPYDHDLKVPFRIKGPGLKQGSHSFVASIVD 346
Query: 244 --PQVRKFSSGSL 254
P + + G++
Sbjct: 347 VAPTIIELGGGTV 359
>gi|26339842|dbj|BAC33584.1| unnamed protein product [Mus musculus]
Length = 604
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 54/84 (64%), Positives = 69/84 (82%)
Query: 153 HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGK 212
H +FT++L KRLQTL SVDD++E I L + GEL+NTYI+YT+DHGYH+GQFGLVKGK
Sbjct: 1 HMEFTNMLQRKRLQTLMSVDDSMETIYDMLVETGELDNTYILYTADHGYHIGQFGLVKGK 60
Query: 213 SFPFEFDIRVPFLMRGPGIVPGTM 236
S P+EFDIRVPF +RGP + G++
Sbjct: 61 SMPYEFDIRVPFYVRGPNVEAGSL 84
>gi|224613406|gb|ACN60282.1| N-acetylglucosamine-6-sulfatase precursor [Salmo salar]
Length = 291
Score = 120 bits (300), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 65/130 (50%), Positives = 82/130 (63%), Gaps = 2/130 (1%)
Query: 105 APHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTK 163
APH P +APQY F +V + ++D P DK W+L Q + M F D K
Sbjct: 1 APHSPWTAAPQYEKNFADVKAPRDGSFD-KPGKDKHWLLRQPSNPMPSTSLNFLDNAYRK 59
Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
R QTL SVDD VE +VK+L++L ELNNTYI YTSD+GYH GQF L K ++FDIR+P
Sbjct: 60 RWQTLLSVDDMVEVLVKKLEELKELNNTYIFYTSDNGYHTGQFSLPIDKRQLYDFDIRIP 119
Query: 224 FLMRGPGIVP 233
++RGPGI P
Sbjct: 120 LMVRGPGIKP 129
>gi|410931337|ref|XP_003979052.1| PREDICTED: extracellular sulfatase Sulf-2-like, partial [Takifugu
rubripes]
Length = 613
Score = 117 bits (292), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 79/130 (60%), Gaps = 19/130 (14%)
Query: 1 MAWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSIN 56
M+W + R + + GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY+
Sbjct: 111 MSWQRQHEPRTFGVYLNNTGYRTAFFGKYLNEYNGSYVPPGWKEWLGLVKNSRFYNYT-- 168
Query: 57 MNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY 116
DY DLI +S+R+ R SK+ + +P+++V+S APHGPEDSAPQY
Sbjct: 169 -------------QDYLTDLITAESMRYFRYSKRVYPHRPVLMVLSHAAPHGPEDSAPQY 215
Query: 117 ANMFFNVTSH 126
+ F N + H
Sbjct: 216 STAFPNASQH 225
>gi|37520209|ref|NP_923586.1| glucosamine-6-sulfatase [Gloeobacter violaceus PCC 7421]
gi|35211202|dbj|BAC88581.1| gll0640 [Gloeobacter violaceus PCC 7421]
Length = 834
Score = 116 bits (290), Expect = 1e-23, Method: Composition-based stats.
Identities = 74/214 (34%), Positives = 109/214 (50%), Gaps = 7/214 (3%)
Query: 20 GYFGKYLNKYNGS--YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
G FGKY N Y+ S Y PPGW EW + + YYNY IN NG +G N Y D++
Sbjct: 127 GLFGKYFNGYSYSAFYTPPGWDEWQTFQL-AGYYNYRINANGTIEDYGRSESN-YSTDVL 184
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
+V F+ S + S KP L ++ APH P AP++A + ++ P Y+
Sbjct: 185 TQKAVAFITNS--AASDKPFFLFLAPFAPHAPYTPAPRHAGRYADIPPWRPPNYNEQDVL 242
Query: 138 DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
DK +Q R P + D L+ L +VDD VE I++ L+ G+ NT +I+TS
Sbjct: 243 DKPTWVQKLRPASPQTQTDYDKERQAYLEMLLAVDDGVESILQALESTGQRENTLVIFTS 302
Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
D+G G+ + K +E +RVP ++ PG+
Sbjct: 303 DNGLTWGEHRWWE-KGCSYEESLRVPMVVSFPGV 335
>gi|323451516|gb|EGB07393.1| hypothetical protein AURANDRAFT_27956 [Aureococcus anophagefferens]
Length = 447
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 83/244 (34%), Positives = 121/244 (49%), Gaps = 19/244 (7%)
Query: 3 WPVELTSRNLLYFDISHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKI 62
W V R+L Y S GYFGK+LN N PPG W A Y + S ++G ++
Sbjct: 65 WHVATHFRDLGY---SVGYFGKHLNNDNPLAAPPGVDRWVA-NGGGTYLDPSFAVDGVEV 120
Query: 63 KHGEDYYNDYYP-DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH---GP----EDSAP 114
+ Y +I N +V +LR KP + V + APH GP AP
Sbjct: 121 AFDDCAGRPCYSTSIIGNKTVDWLRGD----FGKPFVAVAAVKAPHVEDGPGWPVAVPAP 176
Query: 115 QY--ANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVD 172
Y + +F TP ++ A PD W++ M +D L RL++L SVD
Sbjct: 177 WYNSSALFAGEKPPRTPNWN-ASCPDHHWLVASQPPMTREQAHRSDELFRARLRSLLSVD 235
Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
D V +V L+ L++T++++TSDHG+ GQF + +GK +E D+RVPF +RGPG+
Sbjct: 236 DLVAGVVDALEASQRLDDTFVVFTSDHGFRFGQFRMPEGKWNAYENDLRVPFWIRGPGVK 295
Query: 233 PGTM 236
PG +
Sbjct: 296 PGGV 299
>gi|307105104|gb|EFN53355.1| hypothetical protein CHLNCDRAFT_53914 [Chlorella variabilis]
Length = 818
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 71/222 (31%), Positives = 118/222 (53%), Gaps = 12/222 (5%)
Query: 20 GYFGKYLNKYNGSYIP---PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
G GKYL YN ++IP PGW W + ++++YNY+++ G K K + +DY D+
Sbjct: 455 GMIGKYLTGYNSTFIPHVPPGWDRW-FVFSDNEFYNYTVSDQG-KSKSFANAEDDYSTDV 512
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+ V F++Q K +P L ++ A H P A ++ N + PAY+ +
Sbjct: 513 VTAHGVEFIQQVKDD--SRPFFLYLAPYACHYPHIPANRHLNKLKGLKIPRVPAYNESQQ 570
Query: 137 PDKQWILQVTRKMQPVHRQFT---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
++ + + P+ + D R +TL +VDD +E + L+++GEL+NTY+
Sbjct: 571 HLRKKV-SFINSLPPIDEEMAAGLDFKYQARAETLLAVDDMIEAVFLALEEVGELDNTYV 629
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
+TSD+GY LG L GK +E D+++PF +RGPG+ G
Sbjct: 630 FFTSDNGYKLGHHRL-SGKMSGYETDVKLPFYVRGPGVPRGV 670
>gi|159462408|ref|XP_001689434.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283422|gb|EDP09172.1| predicted protein [Chlamydomonas reinhardtii]
Length = 620
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 75/228 (32%), Positives = 118/228 (51%), Gaps = 25/228 (10%)
Query: 24 KYLNKYNGSYIPPGWREWG-----ALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIA 78
K+LN+Y +Y P G+ G AL +N+ YN + + Y +Y +LI
Sbjct: 132 KFLNQYLETYAPSGYAPAGFTHFEALTVNA--YNMTNSCFSLSGAASTCYPGEYQTELIM 189
Query: 79 NDSVRFLRQSKQSFSKKPIMLVMSFPAPHG---------PEDSAPQYANMFFNVTSHHTP 129
++ ++ + + P +L ++ APH P A QYAN++ + + + P
Sbjct: 190 TKALDYI--GSYAAANAPFLLYVAPTAPHRSSTDGILWYPPTPAKQYANLY-SASENQVP 246
Query: 130 A---YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
+D+ ++ + + F D L RL+ L+SVDD + IV +L+ LG
Sbjct: 247 RGLNWDFR---NRNLPRRGAVAVDATFEAFMDSLYLYRLRALRSVDDLIGSIVGKLQRLG 303
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
LNNTY+I+TSD+GYH+G F L+ GK+ P E D+RVPF +RGPGI G
Sbjct: 304 VLNNTYVIFTSDNGYHMGAFSLLDGKNLPIEEDVRVPFFIRGPGIPAG 351
>gi|355690576|gb|AER99199.1| glucosamine -6-sulfatase [Mustela putorius furo]
Length = 220
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 67/163 (41%), Positives = 91/163 (55%), Gaps = 11/163 (6%)
Query: 21 YFGKYLNKYNG------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
+ GKYLN+Y ++P GW W AL NSKYYNY++++NG+ KHGE+Y DY
Sbjct: 62 FAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLT 121
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D++AN S+ FL S +P +++S PAPH P +APQY F NV + ++
Sbjct: 122 DVLANISLGFLDYKSNS---EPFFMMISTPAPHSPWTAAPQYQKSFQNVFAPRNKNFNIH 178
Query: 135 PNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVE 176
+K W++ Q M QF D KR QTL SVDD VE
Sbjct: 179 -GTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVE 220
>gi|336176655|ref|YP_004582030.1| N-acetylglucosamine-6-sulfatase [Frankia symbiont of Datisca
glomerata]
gi|334857635|gb|AEH08109.1| N-acetylglucosamine-6-sulfatase [Frankia symbiont of Datisca
glomerata]
Length = 540
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 127/221 (57%), Gaps = 12/221 (5%)
Query: 23 GKYLN--KYNGSYIPPGWREWGALIMNSKY--YNYSINMNGRKIKHG--EDYYNDYYPDL 76
GKY+N + + +++PPGW EW N Y YNY++N NG + +G E+ N Y D+
Sbjct: 170 GKYMNGIEADPTHVPPGWSEWYGSADNFFYTGYNYALNENGAIVDYGGPENPAN-YSTDV 228
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA-NMFFNVTSHHTPAYDYAP 135
+A S F+R++ + +P L + APH P AP++A N F + + TP ++
Sbjct: 229 VAAKSTDFIRRA--AAGDEPFFLYAASTAPHLPLPPAPRHADNPFADDAAPRTPNFNEPD 286
Query: 136 NPDKQWILQVTRKMQPVHRQFTD-LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
DK W LQ + + + + R+ +L ++D+ V IV+ L+ +GEL++TY++
Sbjct: 287 VSDKPWWLQQSARERSAQVAVVNNQDYQNRMGSLYALDEMVRDIVETLRSVGELDDTYLV 346
Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
+TSD+GY+LG L+ K P++ +RVP ++ GPGI PGT
Sbjct: 347 FTSDNGYNLGAHRLIN-KQAPYDESMRVPLVVAGPGIAPGT 386
>gi|159468317|ref|XP_001692329.1| hypothetical protein CHLREDRAFT_145838 [Chlamydomonas reinhardtii]
gi|158278515|gb|EDP04279.1| predicted protein [Chlamydomonas reinhardtii]
Length = 651
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 75/234 (32%), Positives = 116/234 (49%), Gaps = 27/234 (11%)
Query: 23 GKYLNKY----------NGSYIPPGWREWGALIMNS-KYYNYSINMNGRKIKHGEDYYND 71
GK+LN Y +G Y P GW + AL + Y+N NG K +
Sbjct: 167 GKFLNAYMTTAAFMGADSGDYYPRGWTIFDALTQGTYAYWNSCFAYNGGSDKC---FPKQ 223
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--------GPEDSAPQYANMFFNV 123
Y D+I + ++ +LR + P + ++ PAPH P A ++AN++ N
Sbjct: 224 YQTDIIRDKALSYLRTATAK-KDTPFFMYIAPPAPHVELTNNGWQPPSPAARHANLYAND 282
Query: 124 TSHHTPAYDYA-PNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
++ PNP+ I M P + +RL++L++VD+ ++ +V +L
Sbjct: 283 KIRMPKGANFGVPNPN---IPMEVDDMTPDFVAQLEFHYLQRLRSLRAVDEMLDAVVNQL 339
Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
G+L+NTY++YTSD+G HLGQF L GK+ E D RVPF +RGPGI G +
Sbjct: 340 SASGQLDNTYVLYTSDNGLHLGQFSLGDGKATAVEEDSRVPFYIRGPGIPAGQV 393
>gi|169786467|ref|XP_001827694.1| arylsulfatase [Aspergillus oryzae RIB40]
gi|83776442|dbj|BAE66561.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|391866120|gb|EIT75392.1| sulfatase [Aspergillus oryzae 3.042]
Length = 576
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 74/233 (31%), Positives = 120/233 (51%), Gaps = 23/233 (9%)
Query: 21 YFGKYLNKYNGSY----IPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
Y GK +N ++ S P G+ +++ Y+Y + R + ++Y Y D+
Sbjct: 131 YTGKLMNAHSLSTYNKPFPKGFNG-SDFLLDPHTYSYYNSTYQRNREPPKNYAGHYTTDV 189
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS---------------APQYANMFF 121
I ++ L ++ +S +P + +S APH D AP++ ++F
Sbjct: 190 ITEKALGLLDEALES--DRPFFVAVSPVAPHSNIDPNTMGTAASIMSEPIPAPRHRHLFQ 247
Query: 122 NVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
V TP+++ W+ + + Q V + D +RL+ LQ VD+ VEK+V
Sbjct: 248 GVKVPRTPSFNPLNRTGVSWVSNLALQNQTV-IDYEDHYYRQRLRALQGVDELVEKLVSR 306
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L++ GE++NTYIIYTSD+G+H+GQ + GK+ FE DIRVPF +RGPGI G
Sbjct: 307 LEESGEIDNTYIIYTSDNGFHIGQHRMPPGKTTGFEEDIRVPFFIRGPGIPEG 359
>gi|238507485|ref|XP_002384944.1| arylsulfatase, putative [Aspergillus flavus NRRL3357]
gi|220689657|gb|EED46008.1| arylsulfatase, putative [Aspergillus flavus NRRL3357]
Length = 581
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 74/233 (31%), Positives = 120/233 (51%), Gaps = 23/233 (9%)
Query: 21 YFGKYLNKYNGSY----IPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
Y GK +N ++ S P G+ +++ Y+Y + R + ++Y Y D+
Sbjct: 131 YTGKLMNAHSLSTYNKPFPKGFNG-SDFLLDPHTYSYYNSTYQRNREPPKNYAGHYTTDV 189
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS---------------APQYANMFF 121
I ++ L ++ +S +P + +S APH D AP++ ++F
Sbjct: 190 ITEKALGLLDEALES--DRPFFVAVSPVAPHSNIDPNTMGTAASIMSEPIPAPRHRHLFQ 247
Query: 122 NVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
V TP+++ W+ + + Q V + D +RL+ LQ VD+ VEK+V
Sbjct: 248 GVKVPRTPSFNPLNRTGVSWVSNLALQNQTV-IDYEDHYYRQRLRALQGVDELVEKLVSR 306
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L++ GE++NTYIIYTSD+G+H+GQ + GK+ FE DIRVPF +RGPGI G
Sbjct: 307 LEESGEIDNTYIIYTSDNGFHIGQHRMPPGKTTGFEEDIRVPFFIRGPGIPEG 359
>gi|453082749|gb|EMF10796.1| Arylsulphatase, partial [Mycosphaerella populorum SO2202]
Length = 538
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 79/267 (29%), Positives = 127/267 (47%), Gaps = 25/267 (9%)
Query: 19 HGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIA 78
H + +N +NGS +++ Y+Y + R Y Y PD++A
Sbjct: 90 HNHRNPPVNGFNGS----------DFLLDPYTYSYWKSQMSRNHADPVSYEGQYSPDVVA 139
Query: 79 NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-------SAPQYANMFFNVTSHHTPAY 131
+ +L ++ S +P +V + APH +D +APQYA ++ +
Sbjct: 140 EKTYGWLDEAA-SHHDRPFFIVAAPIAPHSDQDPEARGVMTAPQYAPRHAHLFQDYQIPR 198
Query: 132 DYAPNPDKQWILQVTRKMQPVHR---QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
NP+ + + + P++ ++ D RL+ LQSVD+ VE+IV+ L G L
Sbjct: 199 TANFNPEAASGVSWVKNLAPLNATVVEYNDEYQRSRLRALQSVDEMVEQIVERLDSHGLL 258
Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI----VPGTMYLLDVLIP 244
+ TYIIYT+D+GYH+ Q L GK PFE DI VP L+RGPG+ V G + L P
Sbjct: 259 DTTYIIYTTDNGYHISQHRLPPGKECPFETDINVPLLIRGPGVPRGQVAGIVSSHTDLAP 318
Query: 245 QVRKFSSGSLIFIMSFLINLNTMRAEK 271
+ K + + + I L++ AE+
Sbjct: 319 TILKLAGQNRPDLDGTPIPLSSTEAEQ 345
>gi|67903834|ref|XP_682173.1| hypothetical protein AN8904.2 [Aspergillus nidulans FGSC A4]
gi|40744962|gb|EAA64118.1| hypothetical protein AN8904.2 [Aspergillus nidulans FGSC A4]
Length = 1264
Score = 108 bits (269), Expect = 3e-21, Method: Composition-based stats.
Identities = 65/198 (32%), Positives = 103/198 (52%), Gaps = 18/198 (9%)
Query: 50 YYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
Y+N S N K Y Y D+ ++ F+ + + ++P L ++ APH
Sbjct: 852 YWNSSYQRNHEAPK---SYAGQYTTDVTEEKALGFVDDALED-KERPFFLTVAPIAPHFE 907
Query: 110 EDS-------------APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF 156
+D AP++A++F + P+++ WI + + Q V +
Sbjct: 908 QDPGHSSDTPPQAPIPAPRHAHLFPDAKVPRVPSFNPLNQTGPSWIRDLKHQNQSV-VDY 966
Query: 157 TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
D +RL+ LQSVD+ V+K++ L+ G+LNNTY+IY+SD+G+H+G L GKS +
Sbjct: 967 EDFFYRQRLRALQSVDEMVDKLLDRLERSGQLNNTYVIYSSDNGFHIGHHRLPPGKSTSY 1026
Query: 217 EFDIRVPFLMRGPGIVPG 234
E DIRVPF +RGPGI G
Sbjct: 1027 EEDIRVPFFIRGPGIKSG 1044
>gi|340372495|ref|XP_003384779.1| PREDICTED: arylsulfatase-like [Amphimedon queenslandica]
Length = 723
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 72/223 (32%), Positives = 115/223 (51%), Gaps = 22/223 (9%)
Query: 22 FGKYLNKYNGSY---IPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIA 78
GKYLN Y+ ++ +PPGW W + ++ YY + NG+ +K ++ Y D I+
Sbjct: 273 IGKYLNGYSDAHADVVPPGWDYWHGM-TDTAYYGPHFSDNGKLLKVPKEVYQ---TDYIS 328
Query: 79 NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
+ L++ + KP + +S APH P A ++ N+F + + T ++ NPD
Sbjct: 329 ETATTLLKKRSPN---KPFFMYLSPFAPHAPSIPAHRHENLFNDKQAPRTVSF----NPD 381
Query: 139 KQ-------WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
+ W+ + K+ D RL+ LQ+VD+ + + + LK GE +NT
Sbjct: 382 DKVQKEKPSWLGHLP-KLTDKQIDSIDNFYRNRLRALQAVDEMLVSLEETLKGTGEWDNT 440
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
Y+ Y D+G HLG + L GK ++ DIRVPFL+RGPGI G
Sbjct: 441 YVFYMGDNGQHLGDYRLPGGKRQAYDTDIRVPFLVRGPGIKGG 483
>gi|355722614|gb|AES07631.1| sulfatase 2 [Mustela putorius furo]
Length = 596
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 50/83 (60%), Positives = 64/83 (77%), Gaps = 5/83 (6%)
Query: 159 LLMTKRLQTLQS-----VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
+L KRLQTL S VDD++E + L + GEL+NTYI+YT+DHGYH+GQFGLVKGKS
Sbjct: 1 MLQRKRLQTLMSXTLMSVDDSMETLYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKS 60
Query: 214 FPFEFDIRVPFLMRGPGIVPGTM 236
P+EFDIRVPF +RGP + G++
Sbjct: 61 MPYEFDIRVPFYVRGPNVEAGSL 83
>gi|429848907|gb|ELA24343.1| arylsulfatase [Colletotrichum gloeosporioides Nara gc5]
Length = 576
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 76/216 (35%), Positives = 110/216 (50%), Gaps = 22/216 (10%)
Query: 35 PPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSK 94
P GW +++ Y Y R + Y +Y DL+A ++ FL + +
Sbjct: 145 PKGWTG-SDFLLDPNTYLYLNATFQRNDSPPQTYPGEYSNDLVAERALGFLDDALKFNGS 203
Query: 95 KPIMLVMSFPAPH----GPEDSA--------PQYANMFFNVTSHHTPAYDYAPNPDK--- 139
KP + ++ PH G E SA ++ N+F N+T T Y NPDK
Sbjct: 204 KPFFVGIAPIGPHNNGLGGETSAGATPPIPAKRHENLFPNITVPRT----YNFNPDKPSM 259
Query: 140 -QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
W L + K+ + D +RLQTLQSVD+ V+ + K L+D G L+NTY+I+TSD
Sbjct: 260 ANWGL-IIPKLNSSEIAYGDNYYRRRLQTLQSVDEMVDAVFKRLEDAGVLDNTYVIFTSD 318
Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+G+H+GQ L GK+ E DI VPFL+RGPG+ G
Sbjct: 319 NGFHIGQHRLKPGKTCAIEEDINVPFLIRGPGVPKG 354
>gi|330916911|ref|XP_003297607.1| hypothetical protein PTT_08067 [Pyrenophora teres f. teres 0-1]
gi|311329644|gb|EFQ94320.1| hypothetical protein PTT_08067 [Pyrenophora teres f. teres 0-1]
Length = 476
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 77/235 (32%), Positives = 118/235 (50%), Gaps = 23/235 (9%)
Query: 21 YFGKYLNKYNGS-YIPP---GWREWGALIMNSKY--YNYSINMNGRKIKHGEDYYNDYYP 74
Y GK +N ++ + Y P GW + LI Y YN S+ N K + +Y
Sbjct: 33 YTGKLMNGHSTTTYNKPRAAGWNQSDFLIDPGTYVFYNTSMTRNNDPYKF---FPGEYST 89
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------PEDSAPQYANMFFN 122
DL++ +V FL + + S++P L ++ APH P A ++ ++F N
Sbjct: 90 DLVSKAAVGFLDDAIAAASERPFFLGVAPVAPHSETITDPRPAKFNPPVPAKRHEHLFPN 149
Query: 123 VTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
VT TP ++ P + R++ + D+ KRLQ+LQSVD+ V+ I+ L
Sbjct: 150 VTVPRTPNFN-PEKPGTASYFKTLRQLNRTELDYNDVWYRKRLQSLQSVDELVDSIMDRL 208
Query: 183 KDLGE-LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
E + NTY+IYT+D+G+H+GQ L GKS E D+ +PF MRGPGI +
Sbjct: 209 GASPEVIENTYMIYTTDNGFHIGQHRLGPGKSCGIEEDVNIPFFMRGPGIAKAAV 263
>gi|86740887|ref|YP_481287.1| sulfatase [Frankia sp. CcI3]
gi|86567749|gb|ABD11558.1| sulfatase [Frankia sp. CcI3]
Length = 524
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 76/225 (33%), Positives = 113/225 (50%), Gaps = 23/225 (10%)
Query: 22 FGKYLNKYNGSYIPP-------GWREWGALIMNSKYY---NYSINMNGRKIKH--GEDYY 69
GKYLN Y I P GW +W + N+ Y N+ N NG ++H G+D
Sbjct: 154 LGKYLNGYGDPTITPTTGPVPRGWSDW--HVSNTTGYAELNFDQNDNG-VVRHYAGQD-- 208
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
+Y D++ D+ F+R+S + KP L ++ APH P AP+ A+ F +T P
Sbjct: 209 -NYGVDVLNADAQAFIRRS----AGKPFALEVATYAPHQPYTPAPRNADDFPGLTEPRDP 263
Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
+++ W+ Q + P D +R Q ++SVD V L L+
Sbjct: 264 SFNTNNTDAPAWLGQRA-PLAPSVLTNLDQAYRERAQAVESVDKLVGDTEATLAAEHLLD 322
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
NTY +++SD+GYHLGQ LV+GK F+ DIRVP ++ GPG+ G
Sbjct: 323 NTYFVFSSDNGYHLGQHRLVRGKQTAFDTDIRVPLIVTGPGVPHG 367
>gi|259486661|tpe|CBF84693.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
Length = 565
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/199 (32%), Positives = 103/199 (51%), Gaps = 18/199 (9%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
Y+N S N K Y Y D+ ++ F+ + + ++P L ++ APH
Sbjct: 156 SYWNSSYQRNHEAPK---SYAGQYTTDVTEEKALGFVDDALED-KERPFFLTVAPIAPHF 211
Query: 109 PEDS-------------APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQ 155
+D AP++A++F + P+++ WI + + Q V
Sbjct: 212 EQDPGHSSDTPPQAPIPAPRHAHLFPDAKVPRVPSFNPLNQTGPSWIRDLKHQNQSV-VD 270
Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
+ D +RL+ LQSVD+ V+K++ L+ G+LNNTY+IY+SD+G+H+G L GKS
Sbjct: 271 YEDFFYRQRLRALQSVDEMVDKLLDRLERSGQLNNTYVIYSSDNGFHIGHHRLPPGKSTS 330
Query: 216 FEFDIRVPFLMRGPGIVPG 234
+E DIRVPF +RGPGI G
Sbjct: 331 YEEDIRVPFFIRGPGIKSG 349
>gi|361131542|gb|EHL03215.1| putative Arylsulfatase [Glarea lozoyensis 74030]
Length = 371
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 81/230 (35%), Positives = 116/230 (50%), Gaps = 22/230 (9%)
Query: 21 YFGKYLNKYN-GSYIPP---GWREWGALIMNS--KYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GKYLN + +Y P G+ L+ S +YYN S++ NG +Y Y
Sbjct: 39 YVGKYLNGHTLDNYNKPLAKGFTGSEFLLDPSTYQYYNASMSRNGGIPV---NYPGQYSN 95
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP------EDSAPQY----ANMFFNVT 124
DL+AN + FL + + KP L ++ PH E SAPQY AN+F +T
Sbjct: 96 DLVANKAYDFLDDAIAA--GKPFFLGIAPIGPHSEFQFATLEASAPQYPPRHANLFNTIT 153
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
TP ++ + WI + + P + D RL++LQS+D+ +E + L
Sbjct: 154 VPRTPNFNPSTPGGTSWISTLP-PLTPEALAYGDTFYRARLRSLQSIDEMIESLTLRLNT 212
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L NTYIIYTSD+G+H+ Q L GK+ FE DI +P L+RGPG+ G
Sbjct: 213 KKLLENTYIIYTSDNGFHISQHRLPPGKNCGFEEDINIPLLIRGPGVSKG 262
>gi|258572460|ref|XP_002544992.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237905262|gb|EEP79663.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 580
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 66/223 (29%), Positives = 114/223 (51%), Gaps = 15/223 (6%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGA--LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIA 78
Y GK++N + + PG + W ++ Y+Y + R DY Y D++A
Sbjct: 126 YVGKFMNGLSTNNYNPGPKGWTGSDFLLGMTTYDYWGAVMSRNRGAPVDYTGQYSGDVVA 185
Query: 79 NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP----------QYANMFFNVTSHHT 128
+ + F+ + Q KP + + APHG P ++A++F + T
Sbjct: 186 DKVLGFMDDAIQD--DKPFFVAAAPVAPHGQVKPRPLWFDKPEYPERHAHLFKDYKIPRT 243
Query: 129 PAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
P ++ W+ + R Q H + D RL+ LQSVD+ V ++V++L++ G L
Sbjct: 244 PNFNPDTPSGAGWVAAMPRLNQ-THIDYHDEYQRCRLRALQSVDEMVGQMVEKLEENGLL 302
Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
+NTY +YT+D+G+H+ Q+ L GK+ F+ DIR+P ++RGPG+
Sbjct: 303 DNTYFMYTTDNGFHISQYRLPPGKTCGFDTDIRIPMIVRGPGV 345
>gi|302886649|ref|XP_003042214.1| hypothetical protein NECHADRAFT_42509 [Nectria haematococca mpVI
77-13-4]
gi|256723123|gb|EEU36501.1| hypothetical protein NECHADRAFT_42509 [Nectria haematococca mpVI
77-13-4]
Length = 591
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 67/197 (34%), Positives = 105/197 (53%), Gaps = 17/197 (8%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
+Y N ++ NG +Y Y PDL A + FL + + ++P LV + APHG
Sbjct: 155 QYLNATMTRNGAPPV---NYAGLYSPDLTAEKAYGFLDDALSN--EEPWFLVHAPIAPHG 209
Query: 109 PED-------SAPQYANMFFNVTSHHTPAYDYAPNPDKQ----WILQVTRKMQPVHRQFT 157
D AP+YA ++ + D + NP+ Q W+ ++ R V +
Sbjct: 210 TVDLTPNFFSDAPKYAERHTHLFKDYIIPRDESFNPEVQGGVSWVKELPRLNDTV-IAYN 268
Query: 158 DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFE 217
D RL+ LQSVD+ VE ++K+++ G+L+NTYIIYT+D+GYH+ Q + GK FE
Sbjct: 269 DEYQRARLRALQSVDEMVEAMIKKVEAAGQLDNTYIIYTTDNGYHISQHRMHPGKECGFE 328
Query: 218 FDIRVPFLMRGPGIVPG 234
D+ +P ++RGPG+ G
Sbjct: 329 TDVHIPLIIRGPGVPAG 345
>gi|67901294|ref|XP_680903.1| hypothetical protein AN7634.2 [Aspergillus nidulans FGSC A4]
gi|40742630|gb|EAA61820.1| hypothetical protein AN7634.2 [Aspergillus nidulans FGSC A4]
gi|259483966|tpe|CBF79787.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
Length = 562
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 60/201 (29%), Positives = 106/201 (52%), Gaps = 18/201 (8%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
+YYN + NG + DY Y D++ ++ FL ++ ++ +P ML ++ APH
Sbjct: 124 RYYNAKMTRNGAEPV---DYSGQYVTDVLTEKAIGFLNETLKN-PDRPWMLTVAPNAPHS 179
Query: 108 ------------GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQ 155
G + AP++AN+F + +++ W+ +++ Q
Sbjct: 180 NGSATADGTHWFGEPEYAPRHANLFKDYKVPRDKSFNTVIEGAVGWVKDISQLSQK-EVD 238
Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
+ D RL+ LQ+VD+ V++++ L + G L+NTYI +++D+GYHLGQ GK+
Sbjct: 239 YIDEFQRCRLRALQAVDEMVDRLISRLDEAGVLDNTYIFFSTDNGYHLGQHAFRPGKNCG 298
Query: 216 FEFDIRVPFLMRGPGIVPGTM 236
+E DI VP ++RGPG+ GT+
Sbjct: 299 YETDINVPLIVRGPGVPKGTV 319
>gi|255957079|ref|XP_002569292.1| Pc21g23250 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591003|emb|CAP97222.1| Pc21g23250 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 571
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 62/189 (32%), Positives = 105/189 (55%), Gaps = 19/189 (10%)
Query: 68 YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--GPEDS------------A 113
Y Y D+IA + F+ ++ + ++P ML ++ APH G DS A
Sbjct: 166 YDGQYSTDVIAEKAEGFIDEAVKH--EQPWMLTVAPNAPHSNGSHDSTRNANWFGEPEFA 223
Query: 114 PQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
P++A++F +V + +++ W+ V ++ H + D RL+ LQ+VD+
Sbjct: 224 PRHADLFKDVRAPRDESFNTLIENAVSWVQNVP-ELNQTHIDYIDEFQRCRLRALQAVDE 282
Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
+ K+V++L+ +GEL+NTYI Y++D+GYHLGQ + GK+ +E DI VP ++RGPG+
Sbjct: 283 MIGKLVEKLEKVGELDNTYIFYSTDNGYHLGQHRMQPGKNCGYETDINVPLIVRGPGM-- 340
Query: 234 GTMYLLDVL 242
G LD +
Sbjct: 341 GKNQTLDAI 349
>gi|452977923|gb|EME77687.1| hypothetical protein MYCFIDRAFT_65580 [Pseudocercospora fijiensis
CIRAD86]
Length = 583
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 77/263 (29%), Positives = 121/263 (46%), Gaps = 30/263 (11%)
Query: 21 YFGKYLNKYN-GSYIPPGWREW--GALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
Y GK N N +Y P W +++ Y Y R Y Y D++
Sbjct: 136 YVGKLFNAQNVDNYASPHAAGWTGSEFLLDPYTYEYLNATFQRNHDPPVSYEGLYSTDVL 195
Query: 78 ANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHG--------------------PEDSAPQ 115
AN S FL ++ S KP L ++ APH P SA +
Sbjct: 196 ANKSFEFLADGLDAYKASGKPFFLTIAPTAPHSNVHMNTAIDGNFSEGSNTQSPPVSAER 255
Query: 116 YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAV 175
+A++F +V TP ++ + WI ++ + + F D +RL++LQ+VD+ V
Sbjct: 256 HAHLFEDVIVPRTPHFNPSNPHGVSWISRLPEQNN-TNLAFNDHFYRQRLRSLQAVDEMV 314
Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
++ K L+D L++TY+ +++D+GYH+GQ L GK PFE DI +PF++RGPGI G
Sbjct: 315 GELFKRLEDARVLDDTYVFFSTDNGYHIGQHRLQPGKQCPFEEDINIPFMVRGPGIGKGV 374
Query: 236 MYLLDV----LIPQVRKFSSGSL 254
+ L P K + G +
Sbjct: 375 TADIVTTHTDLAPTFLKLAGGEI 397
>gi|389750805|gb|EIM91878.1| alkaline phosphatase-like protein [Stereum hirsutum FP-91666 SS1]
Length = 618
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 77/239 (32%), Positives = 114/239 (47%), Gaps = 23/239 (9%)
Query: 21 YFGKYLNKYNGSYI----PPGWREWGALIMNSKY--YNYSINMNGRKIKHGEDYYNDYYP 74
Y GK +N + I GW L+ + Y YN S + NG +Y +Y
Sbjct: 135 YTGKLMNGMKTTNIQELPAAGWTNADFLLDPNTYLFYNASFSANGEA---SINYEGEYQT 191
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED---------SAPQYANMFFNVTS 125
LI S L+Q+ S S+ P L ++ APH +P+YA F +
Sbjct: 192 ALIQQKSTSMLQQALASPSQ-PFFLGIAPTAPHLEVQFNGSFTEPLPSPEYAGSFNDSIV 250
Query: 126 HHTPAYDYAPNPDK---QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
+P Y+ W+ ++ ++ + D + +RL++LQSVDD VE IV+++
Sbjct: 251 PRSPNYNVPVGGPGGGVSWLKELP-ELNDTVVDYIDHVYRQRLRSLQSVDDLVENIVQQV 309
Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
+D G NNTYIIY +D+GY LG K+ PFE D+ +P +RGPGI P T DV
Sbjct: 310 EDAGIANNTYIIYVADNGYALGSHRRQPSKTLPFEEDVLIPLFIRGPGITPNTTNSADV 368
>gi|350633016|gb|EHA21383.1| hypothetical protein ASPNIDRAFT_44607 [Aspergillus niger ATCC 1015]
Length = 588
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 73/234 (31%), Positives = 118/234 (50%), Gaps = 37/234 (15%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
Y ++N +NGS +++ Y+Y R + Y Y D++
Sbjct: 154 YNAPFVNGFNGS----------DFLLDPHTYSYWNATYQRNHEPPRSYEGQYTTDVMKEK 203
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDS----APQYANMFFNVT 124
+ L + S P L ++ APH GP+ + AP++A++F +
Sbjct: 204 ASGLLADALDS--DAPFFLTVAPIAPHTNIDVEGLSGAGGPKMTEPLPAPRHAHLFADAK 261
Query: 125 SHHTPAYDYAPNPDKQ----WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
TP + NPDK WI + + Q V + D L +RL+TLQ+VD+ V+ ++
Sbjct: 262 VPRTPNF----NPDKDSGAGWIQTMELQNQTV-IDYEDHLYRQRLRTLQAVDEMVDALIT 316
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+L++ G+++NTYIIY++D+GYH+G L GK+ +E DIRVPF +RGPGI G
Sbjct: 317 QLEESGQIDNTYIIYSADNGYHIGHHRLPPGKTTGYEEDIRVPFYIRGPGIPEG 370
>gi|37523000|ref|NP_926377.1| sulfatase [Gloeobacter violaceus PCC 7421]
gi|35214003|dbj|BAC91372.1| gll3431 [Gloeobacter violaceus PCC 7421]
Length = 521
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 76/242 (31%), Positives = 111/242 (45%), Gaps = 35/242 (14%)
Query: 20 GYFGKYLNKY-----------NGSYIPPGWREWGALIMNSKY--YNYSINMNGRKIKHGE 66
G+ GKYLN Y + +Y+PPGW W L+ + Y YN+ IN NGR +G
Sbjct: 141 GFLGKYLNGYGANKDKSSPRDDATYVPPGWDVWQGLVDPTTYQVYNFKINENGRVANYGT 200
Query: 67 DYYN---DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-------------GPE 110
D +Y D+++ +V F+ Q P L ++ APH P+
Sbjct: 201 DQAEPPEEYQTDVLSARAVNFVEQYGSG--DAPFFLWVNPIAPHFELRGTRRCTVNPRPQ 258
Query: 111 DS---APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT 167
+S AP++A V PA++ DK +Q M L RL+
Sbjct: 259 NSIRPAPRHAGSAAAVPLPRGPAFNEQDVSDKPTWVQQNPAMSERTIDCLQNLYRNRLEA 318
Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
+++VDD V + L+ G L +T ++ TSD+GY LGQ L K P+E IRVP L+R
Sbjct: 319 MRAVDDLVAALFDALERTGALGDTIVLLTSDNGYLLGQHRLT-AKVLPYEESIRVPLLVR 377
Query: 228 GP 229
P
Sbjct: 378 VP 379
>gi|145235761|ref|XP_001390529.1| arylsulfatase [Aspergillus niger CBS 513.88]
gi|134058218|emb|CAK38410.1| unnamed protein product [Aspergillus niger]
Length = 581
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/234 (31%), Positives = 118/234 (50%), Gaps = 37/234 (15%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
Y ++N +NGS +++ Y+Y R + Y Y D++
Sbjct: 147 YNAPFVNGFNGS----------DFLLDPHTYSYWNATYQRNHEPPRSYEGQYTTDVMKEK 196
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDS----APQYANMFFNVT 124
+ L + S P L ++ APH GP+ + AP++A++F +
Sbjct: 197 ASGLLADALDS--DAPFFLTVAPIAPHTNIDVEGLSGAGGPKMTEPLPAPRHAHLFADAK 254
Query: 125 SHHTPAYDYAPNPDKQ----WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
TP + NPDK WI + + Q V + D L +RL+TLQ+VD+ V+ ++
Sbjct: 255 VPRTPNF----NPDKDSGAGWIQTMELQNQTV-IDYEDHLYRQRLRTLQAVDEMVDALIT 309
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+L++ G+++NTYIIY++D+GYH+G L GK+ +E DIRVPF +RGPGI G
Sbjct: 310 QLEESGQIDNTYIIYSADNGYHIGHHRLPPGKTTGYEEDIRVPFYIRGPGIPEG 363
>gi|269124890|ref|YP_003298260.1| sulfatase [Thermomonospora curvata DSM 43183]
gi|268309848|gb|ACY96222.1| sulfatase [Thermomonospora curvata DSM 43183]
Length = 474
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/240 (30%), Positives = 122/240 (50%), Gaps = 24/240 (10%)
Query: 15 FDISHGYFGKYLNKY--NGSYIPPGWREWGALIMNSKY--YNYSINMNGRKIKHGEDYYN 70
+D H GKYLN Y +GS IP GW++W L+ S Y + Y++N NG+ K+G+
Sbjct: 119 YDTVH--IGKYLNGYGRHGSEIPQGWKDWYGLMGPSTYSMWGYTLNENGKLRKYGKKKGG 176
Query: 71 ---DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-------------GPEDSAP 114
+Y D++A +V +L ++ + K+P + + APH G AP
Sbjct: 177 KPANYQTDVLARKAVAYL--NRPTTGKRPFFMWFAPLAPHLEAPWNKKMGDKWGGPRPAP 234
Query: 115 QYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
++ N + N +P+++ DK ++ ++ RL++L +VD+A
Sbjct: 235 RHRNAYDNAVMWRSPSFNEKDISDKPRHVRRNKRFGKKKIASIQAEYRNRLESLLAVDEA 294
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
V KI + +K G+L+ T ++TSD+GY LGQ K P+E +R+P ++RGPG G
Sbjct: 295 VAKIYRAVKRRGQLDRTLFVFTSDNGYMLGQHRFAHQKIHPYEESVRIPLIVRGPGFPKG 354
>gi|158316689|ref|YP_001509197.1| sulfatase [Frankia sp. EAN1pec]
gi|158112094|gb|ABW14291.1| sulfatase [Frankia sp. EAN1pec]
Length = 534
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/222 (32%), Positives = 114/222 (51%), Gaps = 12/222 (5%)
Query: 22 FGKYLNKYNGS--YIPPGWREWGALIMNSKY--YNYSINMNGRKIKHG--EDYYNDYYPD 75
GKY+N + ++PPGW EW + N Y YNY++N NG + +G D N Y D
Sbjct: 155 VGKYMNGIEDAPDHVPPGWTEWYGSVDNFFYTGYNYALNENGTIVHYGGPSDPAN-YSTD 213
Query: 76 LIANDSVRFLRQSKQSFSKKPIML-VMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
++A SV FL ++ + +P ML S + N F + + +P Y
Sbjct: 214 VVAAKSVDFLERA--AAKDEPFMLYTASTAPHLPLPPAPRDSNNPFTDDLAPRSPNYQEP 271
Query: 135 PNPDKQWILQVTRKMQPVHRQF-TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
DK L+ + ++ D R+ +L ++DD V IV L+D GEL++TY+
Sbjct: 272 DVSDKPAWLRTSAGVRSAQVNLINDNDYRNRMGSLLALDDMVGDIVTTLRDTGELDHTYL 331
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
++TSD+GY+LG L+ K P+E +RVP ++ GPG+ GT
Sbjct: 332 VFTSDNGYNLGAHRLIH-KMAPYEESLRVPLVVAGPGVTRGT 372
>gi|323450570|gb|EGB06451.1| hypothetical protein AURANDRAFT_65638 [Aureococcus anophagefferens]
Length = 515
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/170 (35%), Positives = 94/170 (55%), Gaps = 11/170 (6%)
Query: 73 YPDLIANDSVRFLRQSKQSFSKK----PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHT 128
Y +L+ + F RQ + +++ P + APH P AP YA+ F +V T
Sbjct: 130 YEELLNETAPVFDRQPDAALARRAHTGPFFAYVGPHAPHFPATPAPWYAHAFDDVGIPET 189
Query: 129 PAYDYAPNPDKQWILQVTRKMQPVH---RQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
P Y+ + + DK +Q R+ P+ + + R +L SVDD V + +L+ +
Sbjct: 190 PNYNCS-HEDK---VQHVRQTPPIDDRVKCWQSQHFRDRWASLLSVDDLVFSLYHKLEAM 245
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
G L++TYI+YTSDHGY GQ+ K+ P+E D+R+PF++RGPGI PG+
Sbjct: 246 GVLDDTYIVYTSDHGYKFGQWRQTTSKTHPYETDVRIPFIVRGPGIAPGS 295
>gi|325981531|ref|YP_004293933.1| N-acetylglucosamine-6-sulfatase [Nitrosomonas sp. AL212]
gi|325531050|gb|ADZ25771.1| N-acetylglucosamine-6-sulfatase [Nitrosomonas sp. AL212]
Length = 489
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 71/233 (30%), Positives = 119/233 (51%), Gaps = 20/233 (8%)
Query: 18 SHGYFGKYLNKYNG-------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYN 70
S YFGKY N+Y SYIPPGW W + ++KY+NY +N NG +G
Sbjct: 120 STAYFGKYFNQYGTTTLVDPESYIPPGWNYWYTFV-STKYFNYRLNENGITKSYGS-APE 177
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS---------APQYANMFF 121
DY D+IA + L + + S P ++++ A H +S AP++ N+F
Sbjct: 178 DYGTDVIAKHATNHLSTALSNHS--PFFMMIAPYAVHSSTESTESPFQPIPAPRHKNLFN 235
Query: 122 NVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
VT+ +++ +K ++ + + P + D +R+++LQSVD+ + K++ +
Sbjct: 236 GVTAPRHGSFNQKDVSNKSPSVRNSPLLLPDDIDWIDNNFRERIRSLQSVDEMIGKLIDQ 295
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
LK+ + TYII+TSD+GY G+ L K+ ++ RVP + GP + G
Sbjct: 296 LKNSPKGKETYIIFTSDNGYWFGEHRLKFAKAGLYDQSHRVPLAIAGPKVQRG 348
>gi|189209101|ref|XP_001940883.1| arylsulfatase precursor [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187976976|gb|EDU43602.1| arylsulfatase precursor [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 571
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 72/235 (30%), Positives = 116/235 (49%), Gaps = 23/235 (9%)
Query: 21 YFGKYLNKYNGS-YIPP---GWREWGALIMNSKY--YNYSINMNGRKIKHGEDYYNDYYP 74
Y GK +N ++ + Y P GW + LI Y YN S+ N K + +Y
Sbjct: 128 YTGKLMNGHSTTTYNTPRAAGWNQSDFLIDPGTYVFYNTSMTRNNEMYKF---FPGEYST 184
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------PEDSAPQYANMFFN 122
DLI+ +V FL + + S++P L ++ APH P A ++ ++F N
Sbjct: 185 DLISKAAVGFLDDAIAAASERPFFLGVAPVAPHSETIIDPRPAKFNPPVPAKRHEHLFPN 244
Query: 123 VTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
VT P ++ P + R++ + D KRLQ+LQSV++ ++ ++ L
Sbjct: 245 VTVPRRPNFN-PEKPGTASYFKTLRQLNQTEIDYNDAWYRKRLQSLQSVNELIDSVMDRL 303
Query: 183 KDLGE-LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
E L NTY++YT+D+G+H+GQ L GKS E D+ +PF +RGPG+ +
Sbjct: 304 SASPEVLENTYVLYTTDNGFHIGQHRLGPGKSCGIEEDVNIPFFIRGPGVAKAAV 358
>gi|302835590|ref|XP_002949356.1| hypothetical protein VOLCADRAFT_59221 [Volvox carteri f.
nagariensis]
gi|300265183|gb|EFJ49375.1| hypothetical protein VOLCADRAFT_59221 [Volvox carteri f.
nagariensis]
Length = 593
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 67/238 (28%), Positives = 114/238 (47%), Gaps = 25/238 (10%)
Query: 21 YFGKYLNKY---NGSYIPPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
Y GK+L Y N P GW + AL+ +YY+ + ++NG + Y Y D
Sbjct: 134 YTGKFLVDYTIRNYDPAPSGWSAFDALVHPFTFEYYSPAFSLNGGTPRMYPGMY--YSTD 191
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------------PEDSAPQYA 117
++ ++ LR + + +P L ++ PAPH P A ++
Sbjct: 192 VVGAKALAMLRSAATDPAGRPFYLQVAPPAPHHSMHYDFNSSGAFLARHWYPPIPAARHW 251
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
P+++ K L+ + + F + + RL+ +++VD+ VE
Sbjct: 252 EELSGAALPRPPSFNDPWVAGKPSWLRSLAPLTSANETFLEEIYRLRLRCMRAVDELVEG 311
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
IV+E++ LG TY+++TSD+GYH G + GK+ P+E DIRVP +M GPG++PG+
Sbjct: 312 IVREVESLGLSRRTYVLFTSDNGYHEGVHRMGSGKTTPYEEDIRVPLIMAGPGLIPGS 369
>gi|212538775|ref|XP_002149543.1| arylsulfatase, putative [Talaromyces marneffei ATCC 18224]
gi|210069285|gb|EEA23376.1| arylsulfatase, putative [Talaromyces marneffei ATCC 18224]
Length = 497
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 74/235 (31%), Positives = 114/235 (48%), Gaps = 38/235 (16%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
Y Y+N +NGS +++ Y+Y + R + Y + D+I +
Sbjct: 159 YDKPYVNGFNGS----------DFLLDPFTYSYLNSTYQRNHEPPVSYEGQHTTDVITDK 208
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS-------AP----------QYANMFFNV 123
++ FL +P LV++ +PH D AP ++ N+F +V
Sbjct: 209 ALGFLEDGLAG--DQPFFLVVAPISPHSNVDGGALSGHQAPTVMTEPIPLSRHENLFGDV 266
Query: 124 TSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
+ NPD+ WI + ++ +P ++ D +RL+ LQ VD+ V I+
Sbjct: 267 KVPRMAHF----NPDEPSGVSWIRDLPKQDEPT-MEYNDHFYRQRLRALQGVDELVRDII 321
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L+D +L+NTYIIYTSD+GYH+GQ L GK+ FE DIRVPF +RGPGI G
Sbjct: 322 TRLEDSNQLDNTYIIYTSDNGYHIGQHRLPPGKTCGFEEDIRVPFFIRGPGIAKG 376
>gi|37520215|ref|NP_923592.1| N-acetylglucosamine 6-sulfatase [Gloeobacter violaceus PCC 7421]
gi|35211208|dbj|BAC88587.1| gll0646 [Gloeobacter violaceus PCC 7421]
Length = 515
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 73/242 (30%), Positives = 110/242 (45%), Gaps = 37/242 (15%)
Query: 20 GYFGKYLNKYNG-----------SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDY 68
GY GKYLN Y +Y+PPGW +W ALI + YNY +N NG I +GED
Sbjct: 142 GYVGKYLNGYGSDRNRRSPLDDPTYVPPGWDDWQALITGT-MYNYVLNDNGTLIDYGEDP 200
Query: 69 YN---DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV-- 123
+Y D++A +V F+ + + P L ++ APH +F N
Sbjct: 201 AEPPEEYQTDVLAERAVAFV----DAAATAPFFLTVAPVAPHFEVAPGVSSCRLFPNSLR 256
Query: 124 ----TSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTK-----------RLQTL 168
H + P P + +P Q LL + ++++L
Sbjct: 257 SIRPAPRHQGSAGQVPLPQGTAFNEQDVTDKPTWVQAAPLLSERSIDCLGNIYRDQIESL 316
Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
++VDD ++ L+ G L NT +++TSD+GY GQ L GK+ +E IRVP ++R
Sbjct: 317 RAVDDLTASLIAALERRGVLGNTIVVFTSDNGYLFGQHRLT-GKNVIYEESIRVPLVVRL 375
Query: 229 PG 230
PG
Sbjct: 376 PG 377
>gi|342886504|gb|EGU86314.1| hypothetical protein FOXB_03181 [Fusarium oxysporum Fo5176]
Length = 1194
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 82/246 (33%), Positives = 118/246 (47%), Gaps = 45/246 (18%)
Query: 21 YFGKYLNK-----YNGSYIPPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYY 73
Y GK N YN + P GW L+ Y+N + + E YN
Sbjct: 727 YTGKLFNAHTVDNYNDPH-PAGWTSSDFLLDPFTYSYWNATWQRDSEPPISHEGTYNT-- 783
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG--------------PEDSA------ 113
D +A ++ ++R++ + KP L + APH P+D A
Sbjct: 784 -DDLAEKTINYIREAHAA--GKPFFLTSAPIAPHNEAVVPEMPDIWTSTPDDPALRPRLL 840
Query: 114 -PQYA----NMFFNVTSHHTPAYDYAPNPDK-QWILQVTR--KMQPVHRQFTDLLMTKRL 165
PQ A ++F +V TP + NPDK + V R K+ + D RL
Sbjct: 841 PPQPAKRHEHLFLDVKVPRTPNF----NPDKPSGVNGVARLPKLNQTSLDYNDNYFRLRL 896
Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
Q+LQ+VD+ VE++V EL+ LG ++TYIIYT+D+GYH+GQ L GK+ FE DI VP L
Sbjct: 897 QSLQAVDEMVERVVLELESLGIADDTYIIYTTDNGYHIGQHRLQPGKTCGFETDINVPLL 956
Query: 226 MRGPGI 231
+RGPG+
Sbjct: 957 IRGPGV 962
>gi|358374273|dbj|GAA90866.1| arylsulfatase [Aspergillus kawachii IFO 4308]
Length = 581
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 72/234 (30%), Positives = 117/234 (50%), Gaps = 37/234 (15%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
Y ++N +NGS +++ Y+Y R + Y Y D++
Sbjct: 147 YNAPFVNGFNGS----------DFLLDPHTYSYWNATYQRNHEPPRSYEGQYTTDVMREK 196
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDS----APQYANMFFNVT 124
+ L + S P L ++ APH GP+ + AP++A++F +
Sbjct: 197 ASGLLADALDS--DAPFFLTVAPIAPHTNIDVEGLGGSGGPKMTEPLPAPRHAHLFADAK 254
Query: 125 SHHTPAYDYAPNPDKQ----WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
TP + NPDK WI + + Q V + D L +RL+ LQ+VD+ V+ ++
Sbjct: 255 VPRTPNF----NPDKDSGAGWIQTMELQNQTV-IDYEDHLYRQRLRALQAVDEMVDSLIT 309
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+L++ G+++NTYIIY++D+GYH+G L GK+ +E DIRVPF +RGPGI G
Sbjct: 310 QLEESGQIDNTYIIYSADNGYHIGHHRLPPGKTTGYEEDIRVPFYIRGPGIPEG 363
>gi|159489378|ref|XP_001702674.1| arylsulfatase-like protein [Chlamydomonas reinhardtii]
gi|158280696|gb|EDP06453.1| arylsulfatase-like protein [Chlamydomonas reinhardtii]
Length = 635
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 73/237 (30%), Positives = 118/237 (49%), Gaps = 36/237 (15%)
Query: 21 YFGKYLNKYN-----GSYIPPGWREWGALIMNS--KYYNYSINMNGRKIKHGEDYYNDYY 73
+ GK++N ++ P GW + L + Y NY + + + N Y
Sbjct: 178 FTGKFINLFDVPVGDTKNCPKGWDMFDPLTDKTVYDYINYDFT---PQCARTDSFTNAYQ 234
Query: 74 PDLIANDSVRFLRQSKQSFSK-KPIMLVMSFPA-----PHGPEDS--------APQYANM 119
D ++ + ++ + SK KP L ++ A P G E+ +P+Y +
Sbjct: 235 TDTVSQRASNYI---DDAVSKGKPFYLQVNPAACHTACPKGAEEGGSCVPPVPSPKYDGV 291
Query: 120 FFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
+ + +D P P+ L + ++ V + + +R++T+ SVDD +E +V
Sbjct: 292 WSGLKVPRLANFD-VPLPNA---LGIGNDVKGVDKHYQ-----RRVETMMSVDDMIESLV 342
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+L+ LG L+NTY+IY SD+GYHLG GL KGK+ P+E DIRVPF MRGPGI G +
Sbjct: 343 TKLETLGVLDNTYVIYMSDNGYHLGNHGLAKGKTLPYEEDIRVPFYMRGPGIPAGVV 399
>gi|310795554|gb|EFQ31015.1| sulfatase [Glomerella graminicola M1.001]
Length = 515
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 79/242 (32%), Positives = 111/242 (45%), Gaps = 30/242 (12%)
Query: 21 YFGKYLNKYN-GSY---IPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
Y GK N ++ G++ P GW +++ Y Y R + + +Y DL
Sbjct: 115 YTGKLFNAHDTGNWNNPFPKGW-TGSDFLLDPNTYLYLNATFQRNQSPSQTFPGEYSNDL 173
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH----GPEDS--------APQYANMFFNVT 124
A + FL + KP L ++ PH G E S A ++ N+F NVT
Sbjct: 174 AAERVLGFLDDAFGFNGSKPFFLGVAPIGPHNNGLGGETSPGATPPIPAKRHENLFANVT 233
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
T Y NP K+ + D+ +RLQ LQSVD+ V+ I+ L
Sbjct: 234 VPRT----YNSNP---------VKLNATEVAYGDIYYHRRLQALQSVDEMVDTIINRLDR 280
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIP 244
G LNNTY+I+TSD+G+H GQ L GK+ E DI VPFL+RGPG+ G D P
Sbjct: 281 AGILNNTYVIFTSDNGFHTGQHRLKPGKTCSIEEDINVPFLIRGPGVPKGKTVDFDFDGP 340
Query: 245 QV 246
+
Sbjct: 341 PI 342
>gi|302837253|ref|XP_002950186.1| hypothetical protein VOLCADRAFT_90537 [Volvox carteri f.
nagariensis]
gi|300264659|gb|EFJ48854.1| hypothetical protein VOLCADRAFT_90537 [Volvox carteri f.
nagariensis]
Length = 672
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/75 (58%), Positives = 59/75 (78%)
Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
+RL+T+ SVDD +E +V +L++ G LNNTY+IY SD+GYHLG GL KGK+ P+E DIRV
Sbjct: 355 RRLETMLSVDDMIEDVVNKLEERGVLNNTYVIYVSDNGYHLGNHGLPKGKTLPYEEDIRV 414
Query: 223 PFLMRGPGIVPGTMY 237
PF +RGPGI G ++
Sbjct: 415 PFYIRGPGIPAGVVH 429
>gi|322693417|gb|EFY85278.1| arylsulfatase precursor [Metarhizium acridum CQMa 102]
Length = 614
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 73/223 (32%), Positives = 114/223 (51%), Gaps = 12/223 (5%)
Query: 21 YFGKYLNKYNG-SYIPPGWREW--GALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
Y GK +N+++ +Y P + W +++ Y+Y + Y D+I
Sbjct: 172 YTGKLMNQHSTRTYNQPFAKGWTRSDFLLDPGTYSYLDGIMALDKNEYRSIKGQYSTDVI 231
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-----SAPQYANMFFNVTSHHTPAYD 132
AN S+ FL + + KP L ++ PH A ++ ++F NV+ +P+++
Sbjct: 232 ANRSLEFLDNALAA--GKPFFLGVAPIGPHANTGFTTPIPADRHKHLFPNVSIPRSPSFN 289
Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE-LNNT 191
P L+ ++ Q++D RLQTLQ+VD+ V I+ L+ + E L NT
Sbjct: 290 -PITPGHVNYLKNLPRLNSTQIQYSDNFYRARLQTLQAVDELVGGIIDALEKVPEVLANT 348
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
YIIYTSD+GYH+GQ L GK+ E DI VPF++RGPGI G
Sbjct: 349 YIIYTSDNGYHIGQHRLPPGKTCNIEEDINVPFVIRGPGIAAG 391
>gi|322703651|gb|EFY95257.1| arylsulfatase precursor [Metarhizium anisopliae ARSEF 23]
Length = 602
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 73/223 (32%), Positives = 115/223 (51%), Gaps = 12/223 (5%)
Query: 21 YFGKYLNKYNG-SYIPPGWREW--GALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
Y GK +N+++ +Y P + W +++ Y+Y + Y D+I
Sbjct: 160 YTGKLMNQHSTRTYNQPFAKGWTRSDFLLDPGTYSYLDAIMALDKNEYRSIKGQYSTDVI 219
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-----SAPQYANMFFNVTSHHTPAYD 132
AN S+ FL + + KP L ++ PH A ++ ++F NV+ +P+++
Sbjct: 220 ANRSLEFLDNALAA--GKPFFLGVTPIGPHANTGFTTPIPADRHKHLFPNVSIPRSPSFN 277
Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE-LNNT 191
+ P L+ ++ Q++D RLQTLQ+VD+ V I+ L+ + E L NT
Sbjct: 278 PS-TPGHVNYLKNLPQLTSTQIQYSDNFYRARLQTLQAVDELVGGIIDVLEKVPEVLANT 336
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
YIIYTSD+GYH+GQ L GK+ E DI VPF++RGPGI G
Sbjct: 337 YIIYTSDNGYHIGQHRLPPGKTCNIEEDINVPFIIRGPGIAAG 379
>gi|398401790|ref|XP_003853188.1| hypothetical protein MYCGRDRAFT_40096 [Zymoseptoria tritici IPO323]
gi|339473070|gb|EGP88164.1| hypothetical protein MYCGRDRAFT_40096 [Zymoseptoria tritici IPO323]
Length = 576
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/184 (35%), Positives = 97/184 (52%), Gaps = 24/184 (13%)
Query: 68 YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-----GPEDSA----PQYA- 117
Y Y PD++A + FL ++ QS KP L + APH P+D PQYA
Sbjct: 171 YKGQYSPDVVAEKTYGFLDEAAQS--TKPFFLTAAPVAPHFELTNSPKDGVRLNPPQYAQ 228
Query: 118 ---NMFFNVTSHHTPAYDYAPNPD----KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQS 170
++F + TP + NPD WI + K+ ++ D RL++LQ+
Sbjct: 229 RHAHLFQDYKIPRTPDF----NPDIPSGVSWIKGMP-KLNDTVIEYYDEFQRARLRSLQA 283
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
VD+ VE++V+ L G L+NTYI YT+D+G+H+GQ + GK FE D+ +P +RGPG
Sbjct: 284 VDEMVEQLVQRLDAKGLLDNTYIFYTTDNGFHIGQHRMHPGKECAFESDVHIPLAVRGPG 343
Query: 231 IVPG 234
+ G
Sbjct: 344 VPAG 347
>gi|398390355|ref|XP_003848638.1| hypothetical protein MYCGRDRAFT_76800 [Zymoseptoria tritici IPO323]
gi|339468513|gb|EGP83614.1| hypothetical protein MYCGRDRAFT_76800 [Zymoseptoria tritici IPO323]
Length = 610
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/244 (31%), Positives = 109/244 (44%), Gaps = 37/244 (15%)
Query: 21 YFGKYLNK-----YNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
Y GK N YN GW +++ Y Y R Y Y D
Sbjct: 147 YVGKLFNAQTVDNYNDPEPAAGWTG-SEFLLDPYTYEYLNASFTRNAGPPTSYPGQYSTD 205
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG---------------------PEDSAP 114
LI + S FL S + P L ++ APH P A
Sbjct: 206 LIRSKSFAFL-SSALANPSSPFFLTIAPTAPHSNVHQLQGLIHGNFTEKSNIQSPPVPAA 264
Query: 115 QYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQS 170
++A++F NVT TP + NPD WI ++ ++ Q + F D RL+ LQS
Sbjct: 265 RHAHLFANVTVPRTPHF----NPDSPHGVSWISRLPQQNQ-TNIDFNDHFYRSRLRALQS 319
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
VD+ V+++ L++ G L+ TYI +++D+GYH+GQ L GK FE DI +P ++RGPG
Sbjct: 320 VDEMVDELFTRLEEAGVLDETYIFFSTDNGYHIGQHRLQPGKQCAFEEDINIPLIVRGPG 379
Query: 231 IVPG 234
I G
Sbjct: 380 IPRG 383
>gi|440466456|gb|ELQ35723.1| arylsulfatase [Magnaporthe oryzae Y34]
gi|440488158|gb|ELQ67898.1| arylsulfatase [Magnaporthe oryzae P131]
Length = 640
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 123/242 (50%), Gaps = 28/242 (11%)
Query: 18 SHGYFGKYLNKYN-GSYIPP--GWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
S Y GK+LN +N G+Y PP W E ALI + Y+++ + + +H +Y +
Sbjct: 121 STNYIGKFLNGHNLGNYNPPPKAWTEIDALI-DPYMYDFNRAVFSKNGQHPVNYPGWHQT 179
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH------GPEDSAPQ--------YANMF 120
D++ +V ++Q Q KP +S APH GP D+ PQ + ++
Sbjct: 180 DIVRIKAVERIKQLAQD--DKPFFYWISPTAPHTVPPINGPSDNMPQPLTRHKDLFPDLV 237
Query: 121 FNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF---TDLLMTKRLQTLQSVDDAVEK 177
+ P +YA KQ + V R P++ T++L R+Q+LQ +D+ +E
Sbjct: 238 LPKKGNWNPPDEYA----KQKVNWVGR-TPPLNESMLAETEVLYKTRIQSLQGIDEIIED 292
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY 237
+V L+ G L+NTYIIYTSD+G+ +G + KS ++ + PF++RGPG+ G +
Sbjct: 293 VVATLEQEGILDNTYIIYTSDNGFMIGTHRITAMKSLAYKEAGQTPFIVRGPGVPEGVVS 352
Query: 238 LL 239
L
Sbjct: 353 RL 354
>gi|302889078|ref|XP_003043425.1| hypothetical protein NECHADRAFT_53397 [Nectria haematococca mpVI
77-13-4]
gi|256724341|gb|EEU37712.1| hypothetical protein NECHADRAFT_53397 [Nectria haematococca mpVI
77-13-4]
Length = 601
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 82/275 (29%), Positives = 125/275 (45%), Gaps = 50/275 (18%)
Query: 21 YFGKYLN-----KYNGSYIPPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYY 73
Y GK N YN + P GW + L+ YYN S N + Y +
Sbjct: 142 YTGKLFNAHTVQNYNKPW-PRGWTQSDFLLDPFTYAYYNSSFQHNHDP---PQSYEGQHS 197
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------------------- 108
D++A S FL ++ ++ +P L ++ PAPH
Sbjct: 198 VDVLAEKSYGFLEEAAKA--DRPFFLGIAPPAPHTDVDFNWDTVRDNSALHLGGINATVT 255
Query: 109 --PEDSAPQYANMFFNVTSHHTPAYD-YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL 165
P A ++ ++F + T ++ P+ +W+ Q + P + D +RL
Sbjct: 256 FRPPVPAHRHEHLFQDARIPRTENFNPKTPSDSARWVQQ-RPPVSPSDLELYDHFYRQRL 314
Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
+TLQSVD+ V+ + L+ LG LNNTY+ YT+D+G+H+GQ L GK FE DI VP L
Sbjct: 315 RTLQSVDELVDGLFDRLERLGLLNNTYVFYTTDNGFHIGQHRLPPGKECGFEEDINVPLL 374
Query: 226 MRGPGIVPGTMYLLDV------LIPQVRKFSSGSL 254
+RGPG+ GT DV L+P + K + +L
Sbjct: 375 VRGPGVPAGTR--TDVVTTHTDLVPTILKLAGAAL 407
>gi|389631020|ref|XP_003713163.1| arylsulfatase [Magnaporthe oryzae 70-15]
gi|351645495|gb|EHA53356.1| arylsulfatase [Magnaporthe oryzae 70-15]
Length = 633
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 123/242 (50%), Gaps = 28/242 (11%)
Query: 18 SHGYFGKYLNKYN-GSYIPP--GWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
S Y GK+LN +N G+Y PP W E ALI + Y+++ + + +H +Y +
Sbjct: 121 STNYIGKFLNGHNLGNYNPPPKAWTEIDALI-DPYMYDFNRAVFSKNGQHPVNYPGWHQT 179
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH------GPEDSAPQ--------YANMF 120
D++ +V ++Q Q KP +S APH GP D+ PQ + ++
Sbjct: 180 DIVRIKAVERIKQLAQD--DKPFFYWISPTAPHTVPPINGPSDNMPQPLTRHKDLFPDLV 237
Query: 121 FNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF---TDLLMTKRLQTLQSVDDAVEK 177
+ P +YA KQ + V R P++ T++L R+Q+LQ +D+ +E
Sbjct: 238 LPKKGNWNPPDEYA----KQKVNWVGR-TPPLNESMLAETEVLYKTRIQSLQGIDEIIED 292
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY 237
+V L+ G L+NTYIIYTSD+G+ +G + KS ++ + PF++RGPG+ G +
Sbjct: 293 VVATLEQEGILDNTYIIYTSDNGFMIGTHRITAMKSLAYKEAGQTPFIVRGPGVPEGVVS 352
Query: 238 LL 239
L
Sbjct: 353 RL 354
>gi|383779199|ref|YP_005463765.1| putative sulfatase [Actinoplanes missouriensis 431]
gi|381372431|dbj|BAL89249.1| putative sulfatase [Actinoplanes missouriensis 431]
Length = 488
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 69/235 (29%), Positives = 115/235 (48%), Gaps = 12/235 (5%)
Query: 20 GYFGKYLNKY--NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
+ GKYLN+Y Y+PPGW EW A NY++N NG ++G + +DY D++
Sbjct: 131 AFLGKYLNEYLPESGYVPPGWDEWMAGGNAYDNVNYTLNENGTVKRYGSEP-SDYLTDVL 189
Query: 78 ANDSVRFLRQSK------QSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
+ + ++R +S KP + +S PH P AP+ A F + + AY
Sbjct: 190 SAKAQDWIRTGSHEQGLIKSTQVKPFFIEISTYTPHLPYPPAPRDAQSFPGLIAPRGDAY 249
Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
P W L + D KR+Q++QS+D + + + G NT
Sbjct: 250 GKVPKNAPAW-LAPHSPLTAAQIAHMDRGFRKRVQSIQSIDRMIGDLRATIDRAGIAENT 308
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
I+++SD+GYH+G++ L GK F+ D+ VP ++ G G+ G +++D L+ +
Sbjct: 309 VIVFSSDNGYHMGEYRLNPGKQTAFDVDVNVPLVVVGAGVRRG--HVVDALVENI 361
>gi|452838367|gb|EME40308.1| hypothetical protein DOTSEDRAFT_66113 [Dothistroma septosporum
NZE10]
Length = 580
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 76/245 (31%), Positives = 111/245 (45%), Gaps = 41/245 (16%)
Query: 21 YFGKYLNKYN-GSYIPPGWREW--GALIMNSKYYNYSINMNGRKIKHGED---YYNDYYP 74
Y GK N +Y P W +++ Y Y +N ++GE+ Y Y
Sbjct: 97 YVGKLFNAQTIDNYASPHAAGWTGSEFLLDPYTYEY---LNATFTRNGEEPVSYEGQYTT 153
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG---------------------PEDSA 113
DL+A S FL + KKP L ++ APH P A
Sbjct: 154 DLVAEKSFGFLDDALAE--KKPFFLTIAPTAPHSNVHIGSNIIDGNFSENTNVQSPPVPA 211
Query: 114 PQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQ 169
++ +F +VT TP + NPD WI ++ R+ + + + D RL+ LQ
Sbjct: 212 KRHERLFQDVTVPRTPHF----NPDDAHSVSWIGRLPRQNR-TNIEANDHFYRSRLRALQ 266
Query: 170 SVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
SVD+ + + K L D G L TYI +++D+GYH+GQ L GK FE DI VP ++RGP
Sbjct: 267 SVDEIFDDLTKRLDDAGILEETYIFFSTDNGYHIGQHRLQPGKQCAFEEDINVPLIVRGP 326
Query: 230 GIVPG 234
G+ G
Sbjct: 327 GVAHG 331
>gi|389750799|gb|EIM91872.1| arylsulfatase precursor [Stereum hirsutum FP-91666 SS1]
Length = 586
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 73/239 (30%), Positives = 108/239 (45%), Gaps = 24/239 (10%)
Query: 21 YFGKYLNKYNGSYI----PPGWREWGALIMNSKY--YNYSINMNGRKIKHGEDYYNDYYP 74
Y GK +N I GW L+ + Y YN S + NG + E Y
Sbjct: 137 YTGKLMNGMQVDNIQQLPAAGWTNADFLVDPNTYLFYNSSFSANGEAAINHEGTYQ---M 193
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED---------SAPQYANMFFNVTS 125
DLI S+ +Q+ S +P L ++ APH P+YA +F
Sbjct: 194 DLIVQKSIAMFQQALSS--SQPFFLGIAPTAPHMEVQLNGSFTEPIPPPEYAGLFNETVV 251
Query: 126 HHTPAYDYAPNPDK---QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
P Y+ W+ ++ + V + D + +RL+TL++VDD VE V+++
Sbjct: 252 PRVPNYNVPVGGPGGGVSWLKELEALNETV-LSYVDHVYRQRLRTLRTVDDLVENTVQQV 310
Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
+D G NNTYIIY +D+GY LG K+ PF+ D+ +P +RGPGI P DV
Sbjct: 311 EDAGIANNTYIIYVADNGYALGSHRRQPSKTLPFQEDVLIPLFIRGPGIAPNVTNAADV 369
>gi|323454692|gb|EGB10562.1| hypothetical protein AURANDRAFT_12528, partial [Aureococcus
anophagefferens]
Length = 327
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 70/232 (30%), Positives = 107/232 (46%), Gaps = 25/232 (10%)
Query: 20 GYFGKYLN-----KYNGSYI--------PPGWREWGALIMNSK-----YY--NYSINMNG 59
Y GKYLN +Y + PPGW + + + YY N+ + G
Sbjct: 101 AYVGKYLNPPAMVRYCRNETLGPLEHGWPPGWDAFFGMCDQASTPQGAYYSVNWVDSAAG 160
Query: 60 RKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANM 119
+ G + DY ++ N SV +L+ + + +P L + APH P+ P +A+
Sbjct: 161 AVVFTGSEP-ADYTTSIVGNRSVAYLKLAPKD---RPFFLAAAVRAPHAPQLPPPWHASA 216
Query: 120 FFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
F + AY+ P W+ + +F D + R + L +VDD V +
Sbjct: 217 FPDARVPRDGAYNATPTGKPYWMSRNPPLSDEDAAEF-DAVYADRWRALLAVDDLVAGVF 275
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
EL+ +++TY+ YTSDHGYH+G F L K +EFDIRVP L GPG+
Sbjct: 276 DELRASSLIDDTYVFYTSDHGYHMGHFRLPPLKMHKYEFDIRVPLLATGPGL 327
>gi|452979586|gb|EME79348.1| hypothetical protein MYCFIDRAFT_97143, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 548
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 73/249 (29%), Positives = 117/249 (46%), Gaps = 21/249 (8%)
Query: 21 YFGKYLNKYN-GSYIPPGWREWGA--LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
Y GK N +N +Y P + +++ Y+Y + R + Y Y PD+I
Sbjct: 100 YTGKLFNSHNVDNYNDPPVNGFNGSDFLLDPYTYSYYGSYMSRNFEPPVSYEGLYSPDVI 159
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-----------SAPQYANMFFNVTSH 126
A + FL ++ Q P + ++ APH D SA ++ ++F +
Sbjct: 160 AEKAYGFLEEASQH--ANPFFIGVAPIAPHADADPRKGRQMSIPLSAARHQHLFKDYKIP 217
Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T ++ + W Q+ R V ++ D R++ LQ+VD+ VE IV +L+ G
Sbjct: 218 RTANFNPSRASGVGWTKQLERLNDTV-IEYNDEFQRGRIRALQAVDEMVEGIVSKLEAKG 276
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI----VPGTMYLLDVL 242
L+NTY+IYT+D+GYH+ L GK PFE DI +P +RGPG+ V G + L
Sbjct: 277 LLDNTYVIYTTDNGYHISHHRLPPGKECPFETDIHIPLAIRGPGVPIRHVAGVVSSHTDL 336
Query: 243 IPQVRKFSS 251
P + K +
Sbjct: 337 TPTILKLAG 345
>gi|402086553|gb|EJT81451.1| hypothetical protein GGTG_01430 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 631
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 68/232 (29%), Positives = 115/232 (49%), Gaps = 17/232 (7%)
Query: 21 YFGKYLNKYN---GSYIPPGWREWGALIMNSKYYNYSINM--NGRKIKHGEDYYNDYYPD 75
Y GK+LN N P GW + + S+ M NG + ++ D
Sbjct: 131 YLGKFLNGVNILNWVSQPRGWNHVDVFLEPYQTTPNSVVMSENGNTPIWFKGFHQT---D 187
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP-----QYANMFFNVTSHHTPA 130
+I ++ L + + +P L +S APH +++A ++ +F N T TP+
Sbjct: 188 IIRAKALDRLDKLTATEQDQPFFLFLSPTAPHVDDETATTVPCQRHEGLFLNATVPRTPS 247
Query: 131 YDYAPNPDKQWILQVTRKMQPV---HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
++ P+ ++ + R ++P+ + D +R Q L +D+ V +V +L++ G
Sbjct: 248 FN-PPDAVQRNKVSWLRDLEPMPDADVAWADAEYRRRAQALMGIDEMVADVVAKLEEKGV 306
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLL 239
++NTYIIYTSD+GYHLGQ + GK+ PF D +PF++RGPG+ G L
Sbjct: 307 IDNTYIIYTSDNGYHLGQHRMAVGKTTPFAEDTNLPFVVRGPGVPAGARSTL 358
>gi|374854282|dbj|BAL57168.1| mucin-desulfating sulfatase [uncultured planctomycete]
Length = 490
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 79/240 (32%), Positives = 116/240 (48%), Gaps = 23/240 (9%)
Query: 3 WPVELTS--RNLLYFDISHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNG- 59
+P +L S R L Y GK+ PG+ W + Y++ N+NG
Sbjct: 113 YPRDLPSYPRRLQEIGYETAYIGKWHMGEQEDQPRPGFDFWMSHKGQGNYFDNEFNINGQ 172
Query: 60 RKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG-PEDSAPQYAN 118
R I G YY ++ + +V +LR+ +Q KP L++ APHG P P+Y
Sbjct: 173 RTIIKG------YYTHVVTDYAVNWLRKPRQ----KPFCLILGHKAPHGGPIVPEPKYEQ 222
Query: 119 MFFNVT-SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL-----LMTKRLQTLQSVD 172
+F N T + A DY PD W+ Q + L + L T+ SVD
Sbjct: 223 IFDNQTITRPATADDYTGKPD--WLKQCRFTWHGIGGPLYGLKEYPKFVRYYLATIVSVD 280
Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
D+V +I K L+D G+L+NT IIYTSD+G+ LG+ G V ++ +E IRVP L+R P +
Sbjct: 281 DSVGRIYKVLQDTGQLDNTLIIYTSDNGFALGEHGRVDKRTM-YEESIRVPLLVRYPKLA 339
>gi|159487144|ref|XP_001701595.1| predicted protein [Chlamydomonas reinhardtii]
gi|158271536|gb|EDO97353.1| predicted protein [Chlamydomonas reinhardtii]
Length = 643
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 74/233 (31%), Positives = 115/233 (49%), Gaps = 29/233 (12%)
Query: 22 FGKYLNKYN-----GSYIPPGWREWGALIMNS-KYYNYSINMNGRKIKHGEDYYNDYYPD 75
GK+LN + G P GW + AL + YN + ++N Y D
Sbjct: 104 VGKFLNSFGVPLPAGVVCPRGWDSFEALTTGTYALYNSTFSLNCGPST--PPLSGQYQTD 161
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG--------PEDSAPQYANMFFNVTSHH 127
LI + ++ ++ + + +KP+ L ++ PH P A ++A ++ +
Sbjct: 162 LIRDKALAYIDTAVAA--RKPLYLHLTPATPHKDNNGEGWVPPPPAARHAGLYPGLQLPA 219
Query: 128 TPAYDYAPNPDKQWILQVTR----KMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
P PNP ++ TR K Q V +Q TDL + RL+ L++VD+ +E +V L+
Sbjct: 220 NPTL-RTPNP----LIPDTRIDMTKAQSV-QQMTDLYVA-RLRALRAVDEMLEALVNRLE 272
Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
G LNNTYII+TSD+G H+GQ L+ GK E D RVP +RGPG+ G +
Sbjct: 273 ARGILNNTYIIFTSDNGLHMGQHSLLDGKGTNLEEDNRVPLFIRGPGVEAGVV 325
>gi|171680767|ref|XP_001905328.1| hypothetical protein [Podospora anserina S mat+]
gi|27764278|emb|CAD60558.1| unnamed protein product [Podospora anserina]
gi|170940011|emb|CAP65237.1| unnamed protein product [Podospora anserina S mat+]
Length = 618
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 75/253 (29%), Positives = 119/253 (47%), Gaps = 46/253 (18%)
Query: 21 YFGKYLN-----KYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
Y GK N YN Y P GW +++ YNY + R + Y + D
Sbjct: 143 YTGKLFNAHTVENYNSPY-PAGWN-GSDFLLDPYTYNYLNSSFQRNQDPPKSYEGFHSVD 200
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH---------------------------- 107
++A S+ F+ ++ ++ P L ++ APH
Sbjct: 201 VLAEKSLGFVDEAVRA--DGPFFLGIAPVAPHSNVESNTLGDDMGDDWGNLPDIEEFAKF 258
Query: 108 GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTK 163
GP A ++ ++F +V TP + NPDK W+ ++ +K+ + + D +
Sbjct: 259 GPPVPAKRHEHLFKDVKIPRTPNF----NPDKPTGANWV-RLRKKLNQENIDYNDHFYRQ 313
Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
RL+TLQSVD+ V +V L++ G L+NTYI YT+D+GYH+ Q L GK FE DI +P
Sbjct: 314 RLRTLQSVDELVASVVGRLEEHGVLDNTYIFYTTDNGYHISQHRLNPGKECGFEEDINIP 373
Query: 224 FLMRGPGIVPGTM 236
++RGPG+ G +
Sbjct: 374 LIIRGPGVAKGVV 386
>gi|116200710|ref|XP_001226167.1| hypothetical protein CHGG_10900 [Chaetomium globosum CBS 148.51]
gi|88175614|gb|EAQ83082.1| hypothetical protein CHGG_10900 [Chaetomium globosum CBS 148.51]
Length = 575
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 51/129 (39%), Positives = 77/129 (59%), Gaps = 2/129 (1%)
Query: 109 PEDSAPQYANMFFNVTSHHTPAYDYAPNPD-KQWILQVTRKMQPVHRQFTDLLMTKRLQT 167
P A ++A +F NVT TP ++ A W+ + R+ + + + D +RL+T
Sbjct: 210 PPIPAKRHARLFGNVTVPRTPHFNPAGGASGASWVRGLARQSEE-NVAWNDHYYRQRLRT 268
Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
LQSVD+ VE +V+ L DLG L TY+ YT+D+G+H+GQ L GK FE DI +P ++R
Sbjct: 269 LQSVDELVEAVVRRLGDLGVLEETYVFYTTDNGFHIGQHRLQPGKECGFEEDINIPLIVR 328
Query: 228 GPGIVPGTM 236
GPG+ G +
Sbjct: 329 GPGVPRGEV 337
>gi|320594028|gb|EFX06431.1| arylsulfatase [Grosmannia clavigera kw1407]
Length = 615
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 73/247 (29%), Positives = 118/247 (47%), Gaps = 42/247 (17%)
Query: 21 YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGE---DYYNDYY 73
Y GK N + S P GW +++ Y+Y +N ++GE Y Y
Sbjct: 145 YTGKLFNAHTIDNYDSPRPAGW-TGSDFLLDPYTYSY---LNATYQRNGEPPVSYEGHYT 200
Query: 74 PDLIANDSVRFLRQS-KQSFSKKPIMLVMSFPAPHG----------PEDS---------- 112
D++A ++ FL + +P L ++ APH DS
Sbjct: 201 GDVLAEKALGFLDDATADGVDARPFFLGIAPVAPHSNVVVLEGEPVSNDSSRAVFSAPVA 260
Query: 113 APQYANMFFNVTSHHTPAYDYAPNPDKQ-----WILQVTRKMQPVHRQFTDLLMTKRLQT 167
A ++A++F +V TP + NP +Q W+ Q+ R+ + + D +RL+
Sbjct: 261 ATRHAHLFADVKVPRTPHF----NPARQAAGGSWVRQLARQ-DDANVAYNDYFYRQRLRA 315
Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
LQSVD+ V+++ L LG L +TY+ YT+D+G+H+GQ L GK+ FE DI +P ++R
Sbjct: 316 LQSVDELVDELFARLDRLGLLASTYVFYTTDNGFHIGQHRLQPGKTCGFEEDINIPLIVR 375
Query: 228 GPGIVPG 234
GPG+ G
Sbjct: 376 GPGVAAG 382
>gi|402074765|gb|EJT70274.1| hypothetical protein GGTG_12447 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 614
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/189 (31%), Positives = 97/189 (51%), Gaps = 27/189 (14%)
Query: 68 YYNDYYPDLIANDSVRFLRQSKQSF----SKKPIMLVMSFPAPHGPEDS----------- 112
Y Y PD++A+ ++ F+ + +S +P + ++ APH S
Sbjct: 199 YEGQYSPDVLASKALGFIDDAAESLKPENGGRPFFIGLAPIAPHSDVASDRGNGKAAISA 258
Query: 113 ---APQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRL 165
A ++ N+F +V T + NPD W+ ++ +K+ + + D +RL
Sbjct: 259 PVAAKRHENLFADVKVPRTANF----NPDAPSGVSWVSRL-QKLNDTNVEANDHFYRQRL 313
Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
+ LQSVD+ V+ + L+ G L+NTY+ YTSD+G+H+ Q L GK+ FE DI VP L
Sbjct: 314 RALQSVDEMVDNVFASLEKHGLLDNTYVFYTSDNGFHISQHRLQPGKTCGFEEDINVPLL 373
Query: 226 MRGPGIVPG 234
+RGPGI G
Sbjct: 374 VRGPGIAAG 382
>gi|358394598|gb|EHK43991.1| hypothetical protein TRIATDRAFT_37606 [Trichoderma atroviride IMI
206040]
Length = 596
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/199 (32%), Positives = 97/199 (48%), Gaps = 24/199 (12%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
+YYN S NG + Y D++A + FL + Q K P + ++ APH
Sbjct: 161 QYYNVSTTKNGAPPINP---VGKYSTDIVAESAEEFLNHALQD-DKSPFFITLAPIAPHA 216
Query: 109 -----------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVH 153
P A ++ +F TP + NPD WI ++ + V
Sbjct: 217 LVGEGSTGAFDPPQVAERHEGLFPEYKIPRTPNF----NPDSPSGVGWISKLPKFNDSV- 271
Query: 154 RQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
+ D RL++LQ+VD+ +E +V L + G L+NT+I YTSD+GYH+GQ L GK+
Sbjct: 272 VDYGDEYQRLRLRSLQAVDEIIESVVTRLDEAGVLDNTFIFYTSDNGYHIGQHRLHPGKT 331
Query: 214 FPFEFDIRVPFLMRGPGIV 232
FE DI VP ++RGPG+
Sbjct: 332 CGFETDINVPLVVRGPGVA 350
>gi|242805691|ref|XP_002484584.1| arylsulfatase, putative [Talaromyces stipitatus ATCC 10500]
gi|218715209|gb|EED14631.1| arylsulfatase, putative [Talaromyces stipitatus ATCC 10500]
Length = 586
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 113/245 (46%), Gaps = 38/245 (15%)
Query: 12 LLYFDISHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYND 71
L F Y Y+N + GS +++ Y+Y R + Y
Sbjct: 144 LFNFHTVDNYDSPYVNGFTGS----------DFLLDPFTYSYLNATFQRNREPPVSYKGH 193
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-------SAP---------- 114
+ ++I + ++ FL +P LV++ APH D AP
Sbjct: 194 HTTEVITDKALGFLEDGLAE--DQPFFLVIAPIAPHSNIDVGALDDHGAPTVMTEPIPLS 251
Query: 115 QYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQS 170
++ N+F + T + NPD+ W+ + ++ ++ D +RL+ LQ
Sbjct: 252 RHENLFQDAKVPRTAHF----NPDEPNGVSWVRNLPKQDDQT-IEYNDHFYRQRLRALQG 306
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
VD+ V +V L+D +L+NTYIIYTSD+GYH+GQ L GK+ FE DIRVPFL+RGPG
Sbjct: 307 VDELVGAVVNRLEDSDQLDNTYIIYTSDNGYHIGQHRLPPGKACGFEEDIRVPFLIRGPG 366
Query: 231 IVPGT 235
I G
Sbjct: 367 IAKGA 371
>gi|46117250|ref|XP_384643.1| hypothetical protein FG04467.1 [Gibberella zeae PH-1]
Length = 703
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 74/229 (32%), Positives = 113/229 (49%), Gaps = 28/229 (12%)
Query: 26 LNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYY-NDYYPDLIANDSVRF 84
+N YN Y P GW +++ Y+Y +N + + Y+ ++ DLIA S
Sbjct: 150 VNNYNSPY-PAGWTGTN-FLLDPGTYDY-LNPIYQHNQEAPVYHKGEHTSDLIAKYSHDL 206
Query: 85 LRQSKQSFSKKPIMLVMSFPAPHGPED-------------SAPQYANMFFNVTSHHTPAY 131
L+++ S P + ++ APH A +++++F V T +
Sbjct: 207 LKEAIDS--NNPFFVAIAPIAPHSDISVDRVAGSGMTIPIPAERHSHLFEGVKVPRTENF 264
Query: 132 DYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
NPD WI ++ ++ D RLQ LQ VD+ VE++ ++L+D G
Sbjct: 265 ----NPDSPSGVSWIKEL-KQFNESSVSHLDNYYRARLQALQGVDEIVEQVTQQLEDAGL 319
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
L++TYIIY+SD+GYHLGQ L GK FE DIRVP +RGPGI G +
Sbjct: 320 LDDTYIIYSSDNGYHLGQHRLPPGKECGFEEDIRVPLFIRGPGISSGAV 368
>gi|367032166|ref|XP_003665366.1| hypothetical protein MYCTH_2095444 [Myceliophthora thermophila ATCC
42464]
gi|347012637|gb|AEO60121.1| hypothetical protein MYCTH_2095444 [Myceliophthora thermophila ATCC
42464]
Length = 621
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 71/251 (28%), Positives = 110/251 (43%), Gaps = 40/251 (15%)
Query: 21 YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
Y GK N + S P GW +++ Y Y R Y N + D+
Sbjct: 144 YTGKLFNAHTVDNYDSPYPAGWNG-TEFLLDPYTYMYLNASFQRNRDPPVSYPNQHSVDV 202
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG---------------------------- 108
+ ++ FL ++ QS +P L ++ APH
Sbjct: 203 LTQKALSFLDEASQS--PRPFFLGIAPVAPHSNVAQAPTPDGDGGWDEDGDYDDIESRVT 260
Query: 109 --PEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP---VHRQFTDLLMTK 163
P A ++A++F + TP ++ A W R++ P + F D +
Sbjct: 261 FTPPIPAARHAHLFADAVVPRTPHFNPAEPAVATWGAGWLRQLPPQTAANVAFNDHFYRQ 320
Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
RL+ LQSVD+ V+ +V+ L LG L +TY+ YT+D+G+H+GQ L GK FE DI VP
Sbjct: 321 RLRVLQSVDELVDAVVRRLDQLGLLADTYLFYTTDNGFHIGQHRLQPGKECGFEEDINVP 380
Query: 224 FLMRGPGIVPG 234
++RGPG+ P
Sbjct: 381 LIVRGPGVAPA 391
>gi|238498084|ref|XP_002380277.1| sulfatase-1, sulf-1, putative [Aspergillus flavus NRRL3357]
gi|220693551|gb|EED49896.1| sulfatase-1, sulf-1, putative [Aspergillus flavus NRRL3357]
Length = 564
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 99/195 (50%), Gaps = 15/195 (7%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
+Y N +I N K + Y Y DL+AN S+ ++ + + KKP L ++ PH
Sbjct: 158 QYTNTTIQHNFEKPR---SYPGVYATDLLANKSMAWMDDAAKE--KKPFFLAINPVNPHN 212
Query: 108 ----GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDL 159
G + P AN + NPD+ W+ ++ + + + DL
Sbjct: 213 NYQWGKGWTKPVPANRHEGTFPDAKVPRSVSFNPDRPSGAAWVHELPQ-LSDAAIKDNDL 271
Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
+RLQ LQ+VDD VE + LK L+NTYIIYTSD+G+H+ Q L+ GK P+E D
Sbjct: 272 YYRRRLQALQAVDDLVETTIGTLKRHKMLDNTYIIYTSDNGFHISQHRLMPGKRCPYEED 331
Query: 220 IRVPFLMRGPGIVPG 234
+ VP ++RGPGI G
Sbjct: 332 VNVPMIIRGPGIPKG 346
>gi|425773153|gb|EKV11523.1| hypothetical protein PDIG_49660 [Penicillium digitatum PHI26]
Length = 587
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 116/235 (49%), Gaps = 36/235 (15%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
Y ++N +NGS +++ Y+Y + R + + ++ D+I
Sbjct: 151 YHSPHINGFNGS----------DFLLDPYTYSYLNSTYQRNHEPPVSHEGEHTIDVITGK 200
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-----------SAP----QYANMFFNVTS 125
++ FL + ++P L +S APH D SAP ++ ++F +VT
Sbjct: 201 ALGFLDDALAG--ERPFFLAVSPVAPHSNIDPGNVASGKTYMSAPIPLQRHTHLFQDVTV 258
Query: 126 HHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
T + NPD+ W+ + + Q V + D RL+ LQ VD+ V+ +V
Sbjct: 259 PRTANF----NPDQPSGVSWVHDLPLQNQSV-VDYHDHFYRSRLRALQGVDELVDGLVTR 313
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
L+ G+L++TYIIYTSD+G+H+GQ L GK+ FE DIRVP +RGPG+ G +
Sbjct: 314 LEKSGQLDHTYIIYTSDNGFHIGQHRLPPGKTCGFEEDIRVPLFIRGPGVAKGDV 368
>gi|408392195|gb|EKJ71554.1| hypothetical protein FPSE_08295 [Fusarium pseudograminearum CS3096]
Length = 586
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 74/229 (32%), Positives = 113/229 (49%), Gaps = 28/229 (12%)
Query: 26 LNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYY-NDYYPDLIANDSVRF 84
+N YN Y P GW +++ Y+Y +N + + Y+ ++ DLIA +
Sbjct: 150 VNNYNSPY-PAGWTGTN-FLLDPGTYDY-LNPIYQHNQEAPVYHKGEHTSDLIAKYAHEL 206
Query: 85 LRQSKQSFSKKPIMLVMSFPAPHGPED-------------SAPQYANMFFNVTSHHTPAY 131
L+++ S P + ++ APH D A +++++F V T +
Sbjct: 207 LKEAIDS--NNPFFVGIAPIAPHSNIDVDRVAGSGMTIPIPAERHSHLFEGVKVPRTENF 264
Query: 132 DYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
NPD WI ++ + D RLQ LQ VD+ VE++ ++L+D G
Sbjct: 265 ----NPDSPSGVSWIKEL-EQFNESSVSHLDNYYRARLQALQGVDEIVEQVTQQLEDAGL 319
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
L++TYIIY+SD+GYHLGQ L GK FE DIRVP +RGPGI G +
Sbjct: 320 LDDTYIIYSSDNGYHLGQHRLPPGKECGFEEDIRVPLFIRGPGISSGAV 368
>gi|425776527|gb|EKV14743.1| hypothetical protein PDIP_42520 [Penicillium digitatum Pd1]
Length = 587
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 116/235 (49%), Gaps = 36/235 (15%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
Y ++N +NGS +++ Y+Y + R + + ++ D+I
Sbjct: 151 YHSPHINGFNGS----------DFLLDPYTYSYLNSTYQRNHEPPVSHEGEHTIDVITGK 200
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-----------SAP----QYANMFFNVTS 125
++ FL + ++P L +S APH D SAP ++ ++F +VT
Sbjct: 201 ALGFLDDALAG--ERPFFLAVSPVAPHSNIDPGNVASGKTYMSAPIPLQRHTHLFQDVTV 258
Query: 126 HHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
T + NPD+ W+ + + Q V + D RL+ LQ VD+ V+ +V
Sbjct: 259 PRTANF----NPDQPSGVSWVHDLPLQNQSV-VDYHDHFYRSRLRALQGVDELVDGLVTR 313
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
L+ G+L++TYIIYTSD+G+H+GQ L GK+ FE DIRVP +RGPG+ G +
Sbjct: 314 LEKSGQLDHTYIIYTSDNGFHIGQHRLPPGKTCGFEEDIRVPLFIRGPGVAKGDV 368
>gi|255939408|ref|XP_002560473.1| Pc16g00510 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211585096|emb|CAP92721.1| Pc16g00510 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 623
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 63/187 (33%), Positives = 97/187 (51%), Gaps = 20/187 (10%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED-----------SAP----QYANM 119
D+I ++ FL + ++P L +S APH D SAP ++ ++
Sbjct: 237 DVITGKALGFLDDALAG--ERPFFLAVSPVAPHSNVDPGNITSENFYMSAPIPLERHEHL 294
Query: 120 FFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
F +V T ++ W+ + + Q V + D RL+ LQ VD+ V+ +V
Sbjct: 295 FQDVRIPRTANFNSDQPSGVSWVHDLPLQNQSV-VDYHDHFYRSRLRALQGVDELVDGLV 353
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLL 239
L++ G+L+NTYIIYTSD+G+H+GQ L GK+ FE DIRVP +RGPG+ G Y+
Sbjct: 354 TRLEESGQLDNTYIIYTSDNGFHIGQHRLPPGKTCGFEEDIRVPLFIRGPGVTKG--YVQ 411
Query: 240 DVLIPQV 246
D + V
Sbjct: 412 DAVTTHV 418
>gi|453080905|gb|EMF08955.1| Arylsulphatase [Mycosphaerella populorum SO2202]
Length = 588
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 73/256 (28%), Positives = 117/256 (45%), Gaps = 37/256 (14%)
Query: 4 PVELTSRNLLYFDISHGYFGKYLNKYN-GSYIPPGWREW--GALIMNSKYYNYSINMNGR 60
PV L S N + Y GK N ++ +Y P W +++ Y Y +N
Sbjct: 86 PVWLQSLNYATY-----YVGKLFNAHSVHNYASPRAAGWTGSEFLLDPYTYEY---LNAT 137
Query: 61 KIK-HGE--DYYNDYYPDLIANDSVRFLRQSKQSFSK--KPIMLVMSFPAPHG------- 108
++ HGE Y Y DLIA + FL + + K KP L ++ APH
Sbjct: 138 FVRNHGEPKSYEGQYSTDLIAAKAFGFLDTALDEYQKEGKPFFLTIAPTAPHSNVHMNTQ 197
Query: 109 -------------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQ 155
P A +++++F +V TP ++ WI ++ R+ Q +
Sbjct: 198 IDGNFSEGSNTQSPPIPAARHSHLFEDVVVPRTPHFNPVDAHGVSWISRLPRQNQ-TNID 256
Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
F D RL+ LQ+VD+ + + +L G T++ +++D+GYH+GQ L GK
Sbjct: 257 FNDHFYRSRLRALQAVDEMIHDLFMKLDAAGVTEETFVFFSTDNGYHIGQHRLQPGKQCA 316
Query: 216 FEFDIRVPFLMRGPGI 231
FE D+ +PF++RGPGI
Sbjct: 317 FEEDVNIPFMVRGPGI 332
>gi|380477057|emb|CCF44357.1| sulfatase [Colletotrichum higginsianum]
Length = 459
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 79/263 (30%), Positives = 120/263 (45%), Gaps = 52/263 (19%)
Query: 21 YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
Y GK N ++ + P GW +++ Y Y R + + +Y DL
Sbjct: 128 YTGKLFNSHDVDNWNNPFPKGWTG-SDFLLDPNTYLYLNATFQRNQSPPQSFPGEYSNDL 186
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH----GPEDS--------APQYANMFFNVT 124
+++ ++ FL + + KP L ++ PH G E S A ++ N+F NVT
Sbjct: 187 VSDRALGFLDDALKYNGSKPFFLGVAPIGPHNNGLGGETSPGATPPIPAKRHENLFSNVT 246
Query: 125 SHHTPAYDYAPNP---------------DKQWILQVT----RKMQP-------------- 151
T Y++ P+ D LQV+ ++ QP
Sbjct: 247 VPRT--YNFNPDKLCPASLACLPGPGRIDPSSSLQVSNTAPQQQQPSMANWGLIIPQLNA 304
Query: 152 VHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
+ D +RLQTLQSVD+ V+ I+ L G L+NTY+I+TSD+G+H+GQ L G
Sbjct: 305 TEVAYGDNYYRRRLQTLQSVDEMVDAIIDRLDRAGVLDNTYVIFTSDNGFHIGQHRLKPG 364
Query: 212 KSFPFEFDIRVPFLMRGPGIVPG 234
K+ E DI VPFL+RGPG+ G
Sbjct: 365 KTCAIEEDINVPFLIRGPGVPKG 387
>gi|83766705|dbj|BAE56845.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 615
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 98/195 (50%), Gaps = 15/195 (7%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
+Y N +I N K + Y Y DL+AN S+ ++ + + KKP L ++ PH
Sbjct: 158 QYTNTTIQHNFEKPR---SYPGVYATDLLANKSMAWMDDAAKE--KKPFFLAINPVNPHN 212
Query: 108 ----GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDL 159
G + P A + NPD+ W+ ++ + V + DL
Sbjct: 213 NYQWGKGWTKPVPAKRHEGTFPDAKVPRSVSFNPDRPSGAAWVHELPQLSDAVIKD-NDL 271
Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
+RLQ LQ+VDD VE + LK L+NTYIIYTSD+G+H+ Q L+ GK P+E D
Sbjct: 272 YYRRRLQALQAVDDLVETTIDTLKRHKMLDNTYIIYTSDNGFHISQHRLMPGKRCPYEED 331
Query: 220 IRVPFLMRGPGIVPG 234
+ VP ++RGPGI G
Sbjct: 332 VNVPMIIRGPGIPKG 346
>gi|317141886|ref|XP_001818847.2| arylsulfatase [Aspergillus oryzae RIB40]
gi|391863140|gb|EIT72453.1| sulfatase [Aspergillus oryzae 3.042]
Length = 564
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 98/195 (50%), Gaps = 15/195 (7%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
+Y N +I N K + Y Y DL+AN S+ ++ + + KKP L ++ PH
Sbjct: 158 QYTNTTIQHNFEKPR---SYPGVYATDLLANKSMAWMDDAAKE--KKPFFLAINPVNPHN 212
Query: 108 ----GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDL 159
G + P A + NPD+ W+ ++ + V + DL
Sbjct: 213 NYQWGKGWTKPVPAKRHEGTFPDAKVPRSVSFNPDRPSGAAWVHELPQLSDAVIKD-NDL 271
Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
+RLQ LQ+VDD VE + LK L+NTYIIYTSD+G+H+ Q L+ GK P+E D
Sbjct: 272 YYRRRLQALQAVDDLVETTIDTLKRHKMLDNTYIIYTSDNGFHISQHRLMPGKRCPYEED 331
Query: 220 IRVPFLMRGPGIVPG 234
+ VP ++RGPGI G
Sbjct: 332 VNVPMIIRGPGIPKG 346
>gi|291301372|ref|YP_003512650.1| sulfatase [Stackebrandtia nassauensis DSM 44728]
gi|290570592|gb|ADD43557.1| sulfatase [Stackebrandtia nassauensis DSM 44728]
Length = 522
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 72/259 (27%), Positives = 121/259 (46%), Gaps = 36/259 (13%)
Query: 16 DISHGYFGKYLNKY---NGSYIPPGWREWGALIMNS-KYYNYSINM-NGRKIKHGEDYYN 70
D Y GKYLN Y G +P GW EW Y+Y ++ G K D
Sbjct: 129 DYHTAYLGKYLNGYKMKEGYKVPAGWDEWHVADGGGYNEYDYKLSEYTGGDDKPISDGDG 188
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP--EDSAPQYANM--------- 119
Y DL+A+ +V + +S+ + +K + ++ +PH +D P++
Sbjct: 189 KYLVDLMADRAVESIDRSRDA--EKSFFVQVAPFSPHSGVGKDGGPRFPPAKRDRPGADE 246
Query: 120 ---------FFNVTSHHTPAYDYAPNPDKQWILQ---VTRKMQPVHRQFTDLLMTKRLQT 167
++ P ++ W+ Q ++++ ++R F + R +
Sbjct: 247 KHGDCGKVDCADLDVTKLPGFNEDTKDKPDWVRQKPLTDKEIKELNRDFRN-----RARM 301
Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
+QSVDD VEK+ K L EL+NTYI++ SD+G+HLGQ L++GK+ ++ D+R PFL++
Sbjct: 302 VQSVDDMVEKVTKSLS-QSELDNTYIMFGSDNGFHLGQHRLMRGKTTAYDHDVRTPFLVK 360
Query: 228 GPGIVPGTMYLLDVLIPQV 246
PG G D ++ V
Sbjct: 361 RPGSSGGDSIKSDEIVQNV 379
>gi|346319604|gb|EGX89205.1| arylsulfatase precursor [Cordyceps militaris CM01]
Length = 565
Score = 93.6 bits (231), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 70/224 (31%), Positives = 110/224 (49%), Gaps = 16/224 (7%)
Query: 21 YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
Y GK +N S P GW +++ Y Y+ K + + +Y DL
Sbjct: 123 YTGKLMNNLGVRTYQSPPPSGWNR-SDFLLDPNTYVYNNAYFTLDNKKWQAFPGEYSTDL 181
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP-EDS------APQYANMFFNVTSHHTP 129
IA S+ FL++ + +KP + ++ APH D+ A ++ ++F +
Sbjct: 182 IAKRSLEFLKEGIDT--QKPFFIGVAPIAPHSELTDTFREPVPAARHQDLFADAQVPRNK 239
Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE-L 188
++ P P + ++ Q+ D +RLQ+LQ+VDD V+ I L+D L
Sbjct: 240 NFN-PPQPGTASYFKNLPRLSNSQVQYLDNFHRRRLQSLQAVDDLVDDIFDALRDHPSVL 298
Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
NTYIIYT+D+G+HLGQ L GK+ E D+ VPF++RGPGI
Sbjct: 299 ANTYIIYTADNGFHLGQHRLPAGKTCGIEEDVNVPFIIRGPGIT 342
>gi|347840483|emb|CCD55055.1| similar to arylsulfatase [Botryotinia fuckeliana]
Length = 585
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 79/262 (30%), Positives = 114/262 (43%), Gaps = 40/262 (15%)
Query: 21 YFGKYLNKYNGSYI----PPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GK N + S P GW L+ +YYN + N E +Y
Sbjct: 128 YTGKLFNHHTVSNWNNPHPAGWTSSDFLLDPFTYQYYNSTFQRNEHPPVSHE---GEYST 184
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED----------------SAPQYAN 118
D++A + L + QS KP LV + APH D A ++ +
Sbjct: 185 DVLAQKAYGLLDEGVQS--GKPFFLVAAPIAPHANVDLAAFKPGEGVLMTTPLPAERHKH 242
Query: 119 MFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
+F + NPDK WI + R V + D RLQTLQ+VD+
Sbjct: 243 LFPGAKVPRNANF----NPDKPSGASWIKHLDRLTDEV-IAYNDEFYRARLQTLQAVDEL 297
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+E +V L D G L+NTY I+++D+GYH+ Q L GK FE DI +P ++RGPG+ G
Sbjct: 298 IEGLVDRLTDYGILDNTYFIFSTDNGYHISQHRLAPGKECGFEEDINIPLVIRGPGVPQG 357
Query: 235 TMYLLDV----LIPQVRKFSSG 252
+ L P + K ++G
Sbjct: 358 EETSIVTAHTDLAPTILKIANG 379
>gi|218132019|ref|ZP_03460823.1| hypothetical protein BACEGG_03644 [Bacteroides eggerthii DSM 20697]
gi|217985779|gb|EEC52119.1| arylsulfatase [Bacteroides eggerthii DSM 20697]
Length = 520
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 65/249 (26%), Positives = 114/249 (45%), Gaps = 36/249 (14%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGK+ + PG+ W + Y+N IN+NG K K +D + Y DL+ +
Sbjct: 135 AFFGKWHMGNDTGEPQPGFTHWEGIRGQGVYWNPEINING-KWKEFKD--STYLGDLLTD 191
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV-----TSHHTPAYDYA 134
++ F+R+ K++ KP + +S H P + +Y + N TS P Y
Sbjct: 192 HAIDFIREQKKA--DKPFFVYLSHKGVHDPFQAPKRYEGCYANKKVPLPTSFENPHYGIT 249
Query: 135 PNPDKQ--------------------WILQVTRKMQPVH-----RQFTDLLMTKRLQTLQ 169
P P+K W+ V R+ + + K +TL+
Sbjct: 250 PTPNKSVQTGKPLSGVDYYGEQMKPDWVKMQRESWHGVDFCYNGRRNWEEEVRKYCETLR 309
Query: 170 SVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
+VD+++ +++ L+++G NT +IY D+G+ G+ GL+ + F +E +RVP L+R P
Sbjct: 310 AVDESIGRVIDSLQEMGLDENTVVIYMGDNGFCWGEHGLIDKRQF-YEASVRVPMLIRAP 368
Query: 230 GIVPGTMYL 238
G+ P L
Sbjct: 369 GLFPAGQVL 377
>gi|310801414|gb|EFQ36307.1| sulfatase [Glomerella graminicola M1.001]
Length = 607
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 72/250 (28%), Positives = 118/250 (47%), Gaps = 41/250 (16%)
Query: 21 YFGKYLNKYN-GSYIPPGWREWGA--LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
Y GK N + +Y P + W + +++ Y+Y R ++ +Y DL+
Sbjct: 141 YTGKMFNAHTIWNYDSPHLKGWTSSDFLLDPNTYDYYNAAFQRNHDQPVNHPGEYSTDLV 200
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDSAP----------- 114
A + FL + Q KP L ++ APH GP+DS P
Sbjct: 201 AKKAYGFLDDAVQE--DKPFFLAIAPIAPHSNVNSSVLNPVKGPDDSVPVSDGKFRVSIP 258
Query: 115 ----QYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQ 166
++ ++F + T + NPDK WI + ++ + ++ D RLQ
Sbjct: 259 LAAKRHEHLFKDAKVPRTENF----NPDKPSGADWIRR-QPQLTNENIEYNDRHYQGRLQ 313
Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
LQ+VD+ V+ ++K+L G L+NTYI +T+D+GYH+GQ L GK +E DI +P ++
Sbjct: 314 ALQAVDELVDGVIKKLGKHGILDNTYIFFTTDNGYHIGQHRLQPGKECGYEEDINIPLIV 373
Query: 227 RGPGIVPGTM 236
RGPG+ G +
Sbjct: 374 RGPGVPKGGI 383
>gi|367047595|ref|XP_003654177.1| hypothetical protein THITE_2116964 [Thielavia terrestris NRRL 8126]
gi|347001440|gb|AEO67841.1| hypothetical protein THITE_2116964 [Thielavia terrestris NRRL 8126]
Length = 609
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 69/248 (27%), Positives = 116/248 (46%), Gaps = 38/248 (15%)
Query: 21 YFGKYLNK-----YNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGE-DYYNDYYP 74
Y GK N YN + P GW G+ + Y +N + ++ + Y +
Sbjct: 144 YTGKLFNAHTVENYNAPF-PAGWN--GSDFLLDPYTYMYLNASFQRNREAPVSYEGQHSV 200
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG--------------PED--------- 111
D++A ++ FL ++ + +P L ++ APH P+D
Sbjct: 201 DVLAQKALGFLDEAARG--DRPFFLGIAPVAPHSNVQASLLTNHTFDNPDDINKIVGAFT 258
Query: 112 ---SAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
A ++ ++F + TP ++ W+ ++ R+ + F D +RL+TL
Sbjct: 259 PPIPAERHKHLFADARVPRTPNFNPEAATGANWVRRL-RRQSAENVAFNDHFYRQRLRTL 317
Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
QSVD+ V +V L + G L++TY+ YT+D+GYH+GQ L GK FE DI VP ++RG
Sbjct: 318 QSVDELVAAVVDRLAERGLLDSTYVFYTTDNGYHIGQHRLQPGKECGFEEDINVPLIVRG 377
Query: 229 PGIVPGTM 236
PG+ G +
Sbjct: 378 PGVPRGEV 385
>gi|390605164|gb|EIN14555.1| Arylsulphatase [Punctularia strigosozonata HHB-11173 SS5]
Length = 581
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 55/168 (32%), Positives = 92/168 (54%), Gaps = 11/168 (6%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG--------PEDSAPQYANMFFNV 123
Y DLIA + FL S + KP L ++ APH A ++ +F N
Sbjct: 201 YQTDLIAEKAYGFL--SDAVNAGKPFFLAVAPTAPHANVKDGVFTEPQPARRHEKLFQNA 258
Query: 124 TSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
T +++ + W+L++ ++ + + + D L +RL+ LQSVD+ ++ ++ L+
Sbjct: 259 RVPRTDSFNPEKSSGANWVLELPQQNK-SNVDYNDHLYRQRLRALQSVDEMIDGLITRLE 317
Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
G L++TYIIYT+D+GYH+GQ L GK+ +E DI VP ++RGP +
Sbjct: 318 QRGLLDSTYIIYTADNGYHIGQHRLQPGKTCGYEEDINVPLIVRGPEV 365
>gi|317149839|ref|XP_001823212.2| arylsulfatase precursor [Aspergillus oryzae RIB40]
Length = 776
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/197 (29%), Positives = 99/197 (50%), Gaps = 19/197 (9%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
+YYN + NG Y Y D+I N S+ FL ++ + +P ML ++ APH
Sbjct: 343 RYYNAKMTRNGETPV---SYAGQYSTDVIKNKSLGFLDEALAN-PDRPWMLTIAPNAPHS 398
Query: 108 -GPEDS------------APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
G +D+ AP++A +F +++ + W+ + +
Sbjct: 399 NGSQDAVTGATWFGEPEYAPRHAQLFKEYKIPRDESFNKIIDGAVGWVADLP-TLTEEEI 457
Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
+ D RL+ LQ+VD+ + ++ ++L G LNNTYI +++D+GYH+GQ + GK+
Sbjct: 458 NYIDEFQRCRLRALQAVDEMIGELFEKLDKAGVLNNTYIFFSTDNGYHIGQHAMQPGKNC 517
Query: 215 PFEFDIRVPFLMRGPGI 231
+E DI +P +RGPGI
Sbjct: 518 GYETDINIPLFIRGPGI 534
>gi|350636401|gb|EHA24761.1| hypothetical protein ASPNIDRAFT_183005 [Aspergillus niger ATCC
1015]
Length = 591
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 115/235 (48%), Gaps = 28/235 (11%)
Query: 21 YFGKYLN-----KYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
Y GK N YN + P +++ Y+Y + R + + Y +Y D
Sbjct: 133 YTGKLFNVHTVDNYNAPF--PAGFTGNDFLLDPFTYDYLNSTFQRDHEIPQSYEGEYSTD 190
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE-------DSAPQYA----------- 117
++A + R L ++ + +KP L ++ APH D+ P+++
Sbjct: 191 VLAQKAYRLLNEAVTA--QKPFFLTVAPIAPHCNVFMNGTGLDANPKFSFSAPIPAKRHE 248
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
++F +V TP+++ WI + ++ + + D +RL+TLQ+VD+ V+
Sbjct: 249 HLFPDVKVPRTPSFNPEIPSGANWI-KTLKRQNDTNVDYNDHFYRQRLRTLQAVDELVDG 307
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
+ LK+ L+NTYI+Y+SD+GYH+GQ L GKS +E DI VP ++RGP I
Sbjct: 308 LFTRLKEYNILDNTYIVYSSDNGYHIGQHRLQPGKSCGYEEDINVPLIIRGPDIA 362
>gi|391871485|gb|EIT80645.1| sulfatase [Aspergillus oryzae 3.042]
Length = 644
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/197 (29%), Positives = 100/197 (50%), Gaps = 19/197 (9%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
+YYN + NG Y Y D+I N S+ FL ++ + +P ML ++ APH
Sbjct: 152 RYYNAKMTRNGETPV---SYAGQYSTDVIKNKSLGFLDEALAN-PDRPWMLTIAPNAPHS 207
Query: 108 -GPEDS------------APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
G +D+ AP++A +F + +++ + W+ + +
Sbjct: 208 NGSQDAVTGATWFGEPEYAPRHAQLFKDYKIPRDESFNKIIDGAVGWVADLP-TLTEEEI 266
Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
+ D RL+ LQ+VD+ + ++ ++L G LNNTYI +++D+GYH+GQ + GK+
Sbjct: 267 NYIDEFQRCRLRALQAVDEMIGELFEKLDKAGVLNNTYIFFSTDNGYHIGQHAMQPGKNC 326
Query: 215 PFEFDIRVPFLMRGPGI 231
+E DI +P +RGPGI
Sbjct: 327 GYETDINIPLFIRGPGI 343
>gi|134082572|emb|CAK42487.1| unnamed protein product [Aspergillus niger]
Length = 585
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 115/235 (48%), Gaps = 28/235 (11%)
Query: 21 YFGKYLN-----KYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
Y GK N YN + P +++ Y+Y + R + + Y +Y D
Sbjct: 131 YTGKLFNVHTVDNYNAPF--PAGFTGNDFLLDPFTYDYLNSTFQRDHEIPQSYEGEYSTD 188
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE-------DSAPQYA----------- 117
++A + R L ++ + +KP L ++ APH D+ P+++
Sbjct: 189 VLAQKAYRLLNEAVTA--QKPFFLTVAPIAPHCNVFMNGTGLDANPKFSFSAPIPAKRHE 246
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
++F +V TP+++ WI + ++ + + D +RL+TLQ+VD+ V+
Sbjct: 247 HLFPDVKVPRTPSFNPEIPSGANWI-KTLKRQNDTNVDYNDHFYRQRLRTLQAVDELVDG 305
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
+ LK+ L+NTYI+Y+SD+GYH+GQ L GKS +E DI VP ++RGP I
Sbjct: 306 LFTRLKEYNILDNTYIVYSSDNGYHIGQHRLQPGKSCGYEEDINVPLIIRGPDIA 360
>gi|340518976|gb|EGR49216.1| predicted protein [Trichoderma reesei QM6a]
Length = 593
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 97/198 (48%), Gaps = 24/198 (12%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
+YYN S NG + Y D++A + FL+ + + S P + ++ APH
Sbjct: 161 QYYNVSTTRNGAPPTNP---VGKYSTDILAESAQEFLQHALEDESS-PFFITLAPIAPHA 216
Query: 109 -----------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVH 153
P A ++ +F TP + NPD W+ + + V
Sbjct: 217 LVGEGSVGAFDPPQVAERHQGLFPEYKIPRTPNF----NPDSPSGVNWVSNLPKLDDEV- 271
Query: 154 RQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
+ D RL++LQ+VD+ V +V L++ G L+NT+I YTSD+GYH+GQ L GK+
Sbjct: 272 VDYNDEYQRLRLRSLQAVDEIVGSVVARLEEAGVLDNTFIFYTSDNGYHIGQHRLHPGKT 331
Query: 214 FPFEFDIRVPFLMRGPGI 231
FE DI VP ++RGPGI
Sbjct: 332 CGFETDINVPLIVRGPGI 349
>gi|317035851|ref|XP_001397044.2| arylsulfatase [Aspergillus niger CBS 513.88]
Length = 589
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 115/235 (48%), Gaps = 28/235 (11%)
Query: 21 YFGKYLN-----KYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
Y GK N YN + P +++ Y+Y + R + + Y +Y D
Sbjct: 131 YTGKLFNVHTVDNYNAPF--PAGFTGNDFLLDPFTYDYLNSTFQRDHEIPQSYEGEYSTD 188
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE-------DSAPQYA----------- 117
++A + R L ++ + +KP L ++ APH D+ P+++
Sbjct: 189 VLAQKAYRLLNEAVTA--QKPFFLTVAPIAPHCNVFMNGTGLDANPKFSFSAPIPAKRHE 246
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
++F +V TP+++ WI + ++ + + D +RL+TLQ+VD+ V+
Sbjct: 247 HLFPDVKVPRTPSFNPEIPSGANWI-KTLKRQNDTNVDYNDHFYRQRLRTLQAVDELVDG 305
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
+ LK+ L+NTYI+Y+SD+GYH+GQ L GKS +E DI VP ++RGP I
Sbjct: 306 LFTRLKEYNILDNTYIVYSSDNGYHIGQHRLQPGKSCGYEEDINVPLIIRGPDIA 360
>gi|302839150|ref|XP_002951132.1| periplasmic arylsulfatase, sulfate-repressible [Volvox carteri f.
nagariensis]
gi|300263461|gb|EFJ47661.1| periplasmic arylsulfatase, sulfate-repressible [Volvox carteri f.
nagariensis]
Length = 652
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 70/238 (29%), Positives = 117/238 (49%), Gaps = 33/238 (13%)
Query: 21 YFGKYLNKYNGSY---IPPGWREWGALIM--NSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
Y GK+L Y+ S +P GW + AL+ Y N + NG +Y+ D
Sbjct: 126 YVGKFLVDYSVSNYQNVPAGWTDIDALVTPYTFDYNNPGFSRNGATPNIYPGFYST---D 182
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------------PEDSAPQYA 117
+IA+ +V ++ + + KP +S APH P AP++
Sbjct: 183 VIADKAVAQIKTAVAA--GKPFYAQISPIAPHTSTQIYFDPVANATKTFFYPPIPAPRHW 240
Query: 118 NMFFNVT----SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
+F + T + H Y+ + WI + Q +R + + + RL++L SVD+
Sbjct: 241 ELFSDATLPEGTSHKNLYEADVSDKPAWIRALPLAQQN-NRTYLEEVYRLRLRSLASVDE 299
Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
++++V L++ G L+NTY+IY++D+GYH+G GK ++ D+RVPFL+RGPGI
Sbjct: 300 LIDRVVATLQEAGVLDNTYLIYSADNGYHVGTHRFGAGKVTAYDEDLRVPFLIRGPGI 357
>gi|342874897|gb|EGU76804.1| hypothetical protein FOXB_12701 [Fusarium oxysporum Fo5176]
Length = 601
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 70/232 (30%), Positives = 114/232 (49%), Gaps = 30/232 (12%)
Query: 26 LNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFL 85
+N YN Y P GW +++ Y+Y + + + + DLI+ + L
Sbjct: 162 INNYNSPY-PAGWTGTN-FLLDPGTYDYLNPIYQHNQEPPVQHKGVHTSDLISKYAHELL 219
Query: 86 RQSKQSFSKKPIMLVMSFPAPHGPED--------SAP---------QYANMFFNVTSHHT 128
+++ + + P + ++ APH + SAP +++++F V T
Sbjct: 220 KEAIDA--ENPFFVAIAPVAPHSNVNLRRQPGNPSAPLMTIPIPLERHSHLFEGVKVPRT 277
Query: 129 PAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
+ NPD WI ++ ++ + D RLQ LQ VD+ VE++ K+L+D
Sbjct: 278 ENF----NPDSPSGVSWIHKLA-QLNDSSVSYLDDFYRARLQALQGVDEIVEQVTKQLED 332
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
G L+ TYIIY+SD+GYHLGQ L GK F+ DIRVP +RGPG+ G++
Sbjct: 333 AGILDETYIIYSSDNGYHLGQHRLPPGKECGFDEDIRVPLFIRGPGVSSGSI 384
>gi|1703431|sp|Q10723.1|ARS_VOLCA RecName: Full=Arylsulfatase; Short=AS; AltName: Full=Aryl-sulfate
sulphohydrolase; Flags: Precursor
gi|452339|emb|CAA54426.1| arylsulfatase [Volvox carteri f. nagariensis]
gi|110174603|gb|ABG54272.1| arylsulfatase precursor [Volvox carteri f. nagariensis]
Length = 649
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 70/238 (29%), Positives = 116/238 (48%), Gaps = 33/238 (13%)
Query: 21 YFGKYLNKY---NGSYIPPGWREWGALIM--NSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
Y GK+L Y N +P GW + AL+ Y N + NG +Y+ D
Sbjct: 126 YVGKFLVDYSVSNYQNVPAGWTDIDALVTPYTFDYNNPGFSRNGATPNIYPGFYST---D 182
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------------PEDSAPQYA 117
+IA+ +V ++ + + KP +S APH P AP++
Sbjct: 183 VIADKAVAQIKTAVAA--GKPFYAQISPIAPHTSTQIYFDPVANATKTFFYPPIPAPRHW 240
Query: 118 NMFFNVT----SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
+F + T + H Y+ + WI + Q +R + + + RL++L SVD+
Sbjct: 241 ELFSDATLPEGTSHKNLYEADVSDKPAWIRALPLAQQN-NRTYLEEVYRLRLRSLASVDE 299
Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
++++V L++ G L+NTY+IY++D+GYH+G GK ++ D+RVPFL+RGPGI
Sbjct: 300 LIDRVVATLQEAGVLDNTYLIYSADNGYHVGTHRFGAGKVTAYDEDLRVPFLIRGPGI 357
>gi|58265540|ref|XP_569926.1| Arylsulfatase precursor [Cryptococcus neoformans var. neoformans
JEC21]
gi|134108857|ref|XP_776543.1| hypothetical protein CNBC0370 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50259223|gb|EAL21896.1| hypothetical protein CNBC0370 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57226158|gb|AAW42619.1| Arylsulfatase precursor, putative [Cryptococcus neoformans var.
neoformans JEC21]
Length = 604
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 71/237 (29%), Positives = 109/237 (45%), Gaps = 30/237 (12%)
Query: 21 YFGKYLNKY---NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
Y GK++N + N +P ++++ Y+Y R + + +Y DL+
Sbjct: 127 YTGKFMNDHTDANCESLPVSGFNSSDILVDPYTYDYWTPGFSRDNGPVKVHAGEYSTDLV 186
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED----------------SAPQYANMFF 121
+V +L + Q +P L ++ A H D S P++A +F
Sbjct: 187 HEKAVNYLEDALQE--DRPFYLTVAPVACHSWLDYKQQGDIKKFVTDIPASHPRHARLFP 244
Query: 122 NVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
T + NPD+ W+ ++ K+ + D RL+ LQ+VD+ VE
Sbjct: 245 VEQIERTENW----NPDEPSGVSWVKELP-KLNSTEEAYLDEFFRGRLRALQAVDELVED 299
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
IV L+ GEL+NTYI YT+D+GY LG GK+ FE DI VPF+ RGPGI G
Sbjct: 300 IVDRLEKAGELDNTYIFYTADNGYALGSHRRQPGKTLGFEEDIHVPFIARGPGIKKG 356
>gi|340931779|gb|EGS19312.1| arylsulfatase-like protein [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 630
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 117/248 (47%), Gaps = 46/248 (18%)
Query: 21 YFGKYLN-----KYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
Y GK N YN Y P GW + +++ Y Y R + + Y + D
Sbjct: 165 YTGKLFNAHTVENYNKPY-PAGWNQ-SDFLLDPYTYMYLNATYQRNREPPKSYKGQHTVD 222
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG--------------------------- 108
++A ++ F+ + ++ ++P L ++ APH
Sbjct: 223 VLAEKALGFIDIAAKN--ERPFFLGIAPVAPHSNVQSKLLDDIDESDPWKDIDDEDETLF 280
Query: 109 -PEDSAPQYANMFFNVTSHHTPAYDYAPNPD----KQWILQVTRKMQPVHRQFTDLLMTK 163
P A ++ ++F + P + NPD W+ ++ K+ + +F D +
Sbjct: 281 SPPIPAERHKHLFPDAKVPRNPNF----NPDVPSGANWVRKLP-KLSDENVEFNDHFYRQ 335
Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
RL+ LQSVD+ VE +V+ L++LG L+NTYIIYT+D+GYH+ Q L GK FE DI VP
Sbjct: 336 RLRALQSVDELVEGVVERLEELGILDNTYIIYTTDNGYHISQHRLQPGKECGFEEDINVP 395
Query: 224 FLMRGPGI 231
++RGPG+
Sbjct: 396 LIIRGPGV 403
>gi|302902097|ref|XP_003048580.1| hypothetical protein NECHADRAFT_95823 [Nectria haematococca mpVI
77-13-4]
gi|256729513|gb|EEU42867.1| hypothetical protein NECHADRAFT_95823 [Nectria haematococca mpVI
77-13-4]
Length = 571
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 67/229 (29%), Positives = 112/229 (48%), Gaps = 27/229 (11%)
Query: 26 LNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFL 85
+N YN + P GW +++ Y+Y + R + ++ + DLI+ + L
Sbjct: 134 INNYNSPF-PAGWTGTN-FLLDPGTYSYLNPIYQRNDEPPVQHHGTHTLDLISKYAHDLL 191
Query: 86 RQSKQSFSKKPIMLVMSFPAPHGPED--------------SAPQYANMFFNVTSHHTPAY 131
++ S KP + ++ APH D A +++++F V T +
Sbjct: 192 DEAISS--DKPFFVAIAPVAPHSNIDVNRGSGPPTMTIPIPAERHSHLFEGVKIPRTANF 249
Query: 132 DYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
NPD WI Q++ ++ + D RLQ LQ VD+ VE+++ +L G
Sbjct: 250 ----NPDSPSGVSWIRQLS-QLNETAVSYLDDYYRARLQVLQGVDEIVEQVIHQLDSAGL 304
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
L++TY++Y+SD+G+HLGQ L GK F+ DIRVP +RGPG+ G +
Sbjct: 305 LDDTYVVYSSDNGFHLGQHRLPPGKECGFDEDIRVPLFIRGPGVSAGAV 353
>gi|83771949|dbj|BAE62079.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 644
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/197 (29%), Positives = 99/197 (50%), Gaps = 19/197 (9%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
+YYN + NG Y Y D+I N S+ FL ++ + +P ML ++ APH
Sbjct: 152 RYYNAKMTRNGETPV---SYAGQYSTDVIKNKSLGFLDEALAN-PDRPWMLTIAPNAPHS 207
Query: 108 -GPEDS------------APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
G +D+ AP++A +F +++ + W+ + +
Sbjct: 208 NGSQDAVTGATWFGEPEYAPRHAQLFKEYKIPRDESFNKIIDGAVGWVADLP-TLTEEEI 266
Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
+ D RL+ LQ+VD+ + ++ ++L G LNNTYI +++D+GYH+GQ + GK+
Sbjct: 267 NYIDEFQRCRLRALQAVDEMIGELFEKLDKAGVLNNTYIFFSTDNGYHIGQHAMQPGKNC 326
Query: 215 PFEFDIRVPFLMRGPGI 231
+E DI +P +RGPGI
Sbjct: 327 GYETDINIPLFIRGPGI 343
>gi|358385978|gb|EHK23574.1| hypothetical protein TRIVIDRAFT_190530 [Trichoderma virens Gv29-8]
Length = 576
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 95/198 (47%), Gaps = 24/198 (12%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
+YYN S + NG + Y D++A + FL + P + ++ APH
Sbjct: 148 QYYNVSTSKNGAPPT---NPVGKYSTDIVAESANEFLDHALLD-DGSPFFITLAPIAPHA 203
Query: 109 -----------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVH 153
P A ++ +F TP + NPD W+ ++ K+
Sbjct: 204 LVGEGSLGAFDPPQVAERHEGLFPEYKIPRTPNF----NPDSPSGVNWVSKLP-KLNDTI 258
Query: 154 RQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
+ D KRL++LQ+VD+ VE +V L G L +T+I YTSD+GYH+GQ L GK+
Sbjct: 259 LDYNDEYQRKRLRSLQAVDEIVESVVARLDQAGVLEDTFIFYTSDNGYHIGQHRLHPGKT 318
Query: 214 FPFEFDIRVPFLMRGPGI 231
FE DI VP ++RGPG+
Sbjct: 319 CGFETDINVPLVVRGPGV 336
>gi|159467885|ref|XP_001692122.1| periplasmic arylsulfatase [Chlamydomonas reinhardtii]
gi|158278849|gb|EDP04612.1| periplasmic arylsulfatase [Chlamydomonas reinhardtii]
Length = 654
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 73/238 (30%), Positives = 114/238 (47%), Gaps = 33/238 (13%)
Query: 21 YFGKYLNKY---NGSYIPPGWREWGALIMNSKYYNYSINMNG--RKIKHGEDYYNDYYPD 75
Y GK+L Y N +P GW + AL+ Y + N G R Y +Y D
Sbjct: 132 YVGKFLVDYSVSNYQQVPAGWDDIDALVTP---YTFDYNTPGFSRNGATPNIYPGEYSTD 188
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------------PEDSAPQYA 117
+I + V ++ + + KP +S APH P AP++
Sbjct: 189 VIRDKGVAQIKSAVAA--GKPFYAQISPIAPHTSTQISTNPATGVTRSYFFPPIPAPRHW 246
Query: 118 NMFFNVT-SHHTPA---YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
+F + TP Y+ + WI + Q +R + + + RL++L +VD+
Sbjct: 247 QLFSDANLPGGTPNKNLYEVDVSDKPAWIRALPLAQQ-NNRTYLEEIYRLRLRSLAAVDE 305
Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
+E++VK L + G L+NTYIIY++D+GYH+G GK+ +E D+RVPFL+RGPGI
Sbjct: 306 LIEQVVKTLDEAGVLDNTYIIYSADNGYHVGAHRFGAGKTTGYEEDLRVPFLIRGPGI 363
>gi|154293528|ref|XP_001547281.1| hypothetical protein BC1G_13903 [Botryotinia fuckeliana B05.10]
Length = 585
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 78/262 (29%), Positives = 114/262 (43%), Gaps = 40/262 (15%)
Query: 21 YFGKYLNKYNGSYI----PPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GK N + S P GW L+ +YYN + N E +Y
Sbjct: 128 YTGKLFNHHTVSNWNNPHPAGWTSSDFLLDPFTYQYYNSTFQRNEHPPVSHE---GEYST 184
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED----------------SAPQYAN 118
D++A + L + QS KP LV + APH D A ++ +
Sbjct: 185 DVLAQKAYGLLDEGVQS--GKPFFLVAAPIAPHANVDLAAFKPGEGVLMTTPLPAERHKH 242
Query: 119 MFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
+F + NPDK WI + + V + D RLQTLQ+VD+
Sbjct: 243 LFPGAKVPRNANF----NPDKPSGASWIKHLDQLTDEV-IAYNDEFYRARLQTLQAVDEL 297
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+E +V L D G L+NTY I+++D+GYH+ Q L GK FE DI +P ++RGPG+ G
Sbjct: 298 IEGLVDRLTDYGILDNTYFIFSTDNGYHISQHRLAPGKECGFEEDINIPLVIRGPGVPQG 357
Query: 235 TMYLLDV----LIPQVRKFSSG 252
+ L P + K ++G
Sbjct: 358 EETSIVTAHTDLAPTILKIANG 379
>gi|169779723|ref|XP_001824326.1| arylsulfatase [Aspergillus oryzae RIB40]
gi|83773065|dbj|BAE63193.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 598
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 91/180 (50%), Gaps = 25/180 (13%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED------------SAP----QYAN 118
D++ ++ L + ++P L ++ APH + SAP ++ +
Sbjct: 200 DVLRKKALDLLDDAVAESHERPFFLTIAPIAPHSNFEMTNASDYTTFRFSAPIPLERHKD 259
Query: 119 MFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
+F V T + NPD+ WI + ++ Q D RL+ LQ VD+
Sbjct: 260 LFPEVKVPRTEHF----NPDQPSGVNWISTLPQQNQS-SIDSNDEFYRARLRALQGVDEI 314
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
VE+IV+ L+D G L+NTYI YTSD+GYH+GQ L GK FE DIRVP +RGPGI G
Sbjct: 315 VEQIVQRLEDAGVLDNTYIFYTSDNGYHIGQHRLHPGKECGFEEDIRVPMFIRGPGIPSG 374
>gi|407921643|gb|EKG14784.1| Sulfatase [Macrophomina phaseolina MS6]
Length = 368
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/124 (39%), Positives = 73/124 (58%), Gaps = 3/124 (2%)
Query: 111 DSAPQYANMFFNVTSHHTPAYDYAPN--PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
++A ++ N+FFNVT TP ++ A WI + + + D RL+ L
Sbjct: 25 EAAKRHQNLFFNVTVPRTPHFNPAGGNAGAASWIKDLPL-LNASEVAYGDEFYRLRLEAL 83
Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
Q+VD+ ++ +V L LG LN+TY+ YTSD+G+H+GQ L GKS +E DI VPF +RG
Sbjct: 84 QAVDELIDALVARLGALGVLNDTYVFYTSDNGFHIGQHRLQPGKSCGYEEDILVPFFVRG 143
Query: 229 PGIV 232
PG+
Sbjct: 144 PGVA 147
>gi|358373836|dbj|GAA90432.1| arylsulfatase [Aspergillus kawachii IFO 4308]
Length = 820
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/235 (26%), Positives = 116/235 (49%), Gaps = 28/235 (11%)
Query: 21 YFGKYLN-----KYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
Y GK N YN + P G+ +++ Y+Y + R + + Y +Y D
Sbjct: 130 YTGKLFNVHTVDNYNAPF-PAGFTG-NDFLLDPFTYDYLNSTFQRDREPPQSYEGEYSTD 187
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE-------DSAPQYA----------- 117
++A + R L ++ + +KP L ++ APH D+ P+++
Sbjct: 188 VLAQKAYRLLDEAVAA--QKPFFLTVAPIAPHCNVFMNGTGLDANPKFSFSAPIPAKRHE 245
Query: 118 NMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
++F ++ TP+++ WI + ++ + + D +RL+ LQ++D+ V+
Sbjct: 246 DLFSDIKVPRTPSFNPEKPSGANWI-KTLKRQNDTNIDYNDHFYRQRLRALQAIDELVDG 304
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
+ L+D L+NTY++Y+SD+GYH+GQ L GKS +E DI VP ++RGP +
Sbjct: 305 LFSRLEDYDILDNTYVVYSSDNGYHIGQHRLQPGKSCGYEEDINVPLIVRGPSVA 359
>gi|67902708|ref|XP_681610.1| hypothetical protein AN8341.2 [Aspergillus nidulans FGSC A4]
gi|40747747|gb|EAA66903.1| hypothetical protein AN8341.2 [Aspergillus nidulans FGSC A4]
gi|259484268|tpe|CBF80344.1| TPA: arylsulfatase, putative (AFU_orthologue; AFUA_8G02520)
[Aspergillus nidulans FGSC A4]
Length = 608
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 70/233 (30%), Positives = 110/233 (47%), Gaps = 39/233 (16%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
Y Y+N + GS +++ Y+Y + R Y + D+I
Sbjct: 169 YHSPYVNGFTGS----------DFLLDPFTYSYLNSTYQRNGDEPVSYEGRHTVDVITEK 218
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPH--------GPEDSAP----------QYANMFFN 122
++ FL +P L ++ APH G + AP ++ ++F +
Sbjct: 219 ALGFLDDGLNG--DRPFFLTVAPVAPHSNVDVSALGADRRAPTIMTEPIPLDRHKSLFQD 276
Query: 123 VTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
V T + NPD+ WI + ++ + ++ D +RL+ LQ VD+ V+ I
Sbjct: 277 VKVPRTKHF----NPDEPSGVSWIRDLPQQDEST-IEYNDHFYRQRLRALQGVDELVDLI 331
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
V L+ G+L+NTYIIYTSD+GYH+GQ L GK+ FE DIRVP +RGPG+
Sbjct: 332 VTRLEASGQLDNTYIIYTSDNGYHIGQHRLPPGKACGFEEDIRVPLFIRGPGV 384
>gi|121714837|ref|XP_001275028.1| arylsulfatase, putative [Aspergillus clavatus NRRL 1]
gi|119403184|gb|EAW13602.1| arylsulfatase, putative [Aspergillus clavatus NRRL 1]
Length = 584
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 68/238 (28%), Positives = 114/238 (47%), Gaps = 26/238 (10%)
Query: 21 YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
Y GK +N + S P GW +++ Y+Y + R + Y + ++
Sbjct: 134 YTGKLMNGHRLTNYDSPFPKGWTG-SDFLLDPNTYSYHNSTYQRNQDPPKSYAGHHTSEV 192
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--------------GPEDSAP----QYAN 118
IA ++ FL + S KP L +S APH GP + P ++ +
Sbjct: 193 IAEKAMGFLEDALGS--DKPFFLGVSPVAPHTNIEPIDLGPNAVNGPRMTEPVPLEKHMH 250
Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
+F + TP ++ ++L + Q V + D +RL++LQSVD+ V+ +
Sbjct: 251 LFKDTKVPRTPNFNPTTASGASYVLGLPLLNQTV-IDYQDHFYRQRLRSLQSVDELVDSL 309
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
V+ L++ G +NTYI YT+D+G+H+G L GK+ +E DI VP +RGPG+ G +
Sbjct: 310 VRRLEESGRADNTYIFYTADNGFHIGNHRLPPGKTTGYEEDIHVPLFVRGPGVPKGKV 367
>gi|159467585|ref|XP_001691972.1| periplasmic arylsulfatase [Chlamydomonas reinhardtii]
gi|12698891|gb|AAK01720.1|AF333184_1 arylsulfatase 2 precursor [Chlamydomonas reinhardtii]
gi|20750294|gb|AAM23308.1| arylsuflatase 2 [Chlamydomonas reinhardtii]
gi|158278699|gb|EDP04462.1| periplasmic arylsulfatase [Chlamydomonas reinhardtii]
Length = 649
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 71/238 (29%), Positives = 115/238 (48%), Gaps = 33/238 (13%)
Query: 21 YFGKYLNKYNGS---YIPPGWREWGALIMNSKYYNYSINMNG--RKIKHGEDYYNDYYPD 75
Y GK+L Y+ S +P GW + AL+ Y + N G R Y +Y D
Sbjct: 127 YVGKFLVDYSVSNYQQVPAGWDDIDALVTP---YTFDYNTPGFSRNGATPNIYPGEYSTD 183
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------------PEDSAPQYA 117
+I + + ++ + + KP +S APH P AP++
Sbjct: 184 VIRDKGIAQIKSAVAA--GKPFYAQISPIAPHTSTQISTDPVTGVTRSFFYPPIPAPRHW 241
Query: 118 NMFFNVT-SHHTPA---YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
+F + TP Y+ + W+ + Q +R + + + RL++L +VD+
Sbjct: 242 QLFSDANLPGGTPNKNLYEVDVSDKPAWVRALPLAQQ-NNRTYLEEIYRLRLRSLAAVDE 300
Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
+E++VK L + G L+NTYIIY++D+GYH+G GK+ +E D+RVPFL+RGPGI
Sbjct: 301 LIEQVVKTLDEAGVLDNTYIIYSADNGYHVGAHRFGAGKTTGYEEDLRVPFLIRGPGI 358
>gi|115388119|ref|XP_001211565.1| hypothetical protein ATEG_02387 [Aspergillus terreus NIH2624]
gi|114195649|gb|EAU37349.1| hypothetical protein ATEG_02387 [Aspergillus terreus NIH2624]
Length = 576
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 100/199 (50%), Gaps = 19/199 (9%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
YYN S N K Y Y D+I + FL + +S +P L ++ APH
Sbjct: 167 SYYNSSYQRNHEPPK---SYAGQYTTDVITEKGLGFLDDALES--DRPFFLTVAPIAPHS 221
Query: 109 PEDS-------------APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQ 155
D AP++ ++F +V T ++ I + + + V
Sbjct: 222 NIDPSAAGSGRMTEPIPAPRHRHLFKDVKVPRTANFNPLNRTGVSSIWDLPLQNKTV-VD 280
Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
+ D +RL+ LQSVD+ V+K++ L+ G ++NTY+I+++D+G+H+GQ L GK+
Sbjct: 281 YEDHFYRQRLRALQSVDEMVDKLIDRLEKSGAIDNTYVIFSADNGFHIGQHRLGPGKTTG 340
Query: 216 FEFDIRVPFLMRGPGIVPG 234
++ DIRVPF +RGPGI G
Sbjct: 341 YDEDIRVPFFIRGPGIPEG 359
>gi|452842014|gb|EME43950.1| hypothetical protein DOTSEDRAFT_171886 [Dothistroma septosporum
NZE10]
Length = 591
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 68/231 (29%), Positives = 111/231 (48%), Gaps = 32/231 (13%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGED---YYNDYYPDLI 77
Y ++N +NGS +++ Y Y N ++GE Y Y PD++
Sbjct: 134 YDDPFVNGFNGS----------DFLLDPYTYEY---FNAHMTRNGEGPVAYTGLYSPDVV 180
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG-----------PEDS---APQYANMFFNV 123
A + FL ++ +P + ++ APH +D+ A ++A++F
Sbjct: 181 AQKAYGFLEEAVLH-KDRPFFIGVAPIAPHSNMRIDVENGEFSKDTPFYAERHAHLFKEY 239
Query: 124 TSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
T ++ W+ + + V + + D RL++LQSVD+ VE++V L+
Sbjct: 240 KIPRTANFNPGVPSGVSWVKDLPQLNDTVIK-YHDEFQRARLRSLQSVDEMVEEVVHRLE 298
Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
G L+NTYI YT+D+GYH+GQ L GK PFE DI +P +RGPG+ G
Sbjct: 299 AHGLLDNTYIFYTTDNGYHIGQHRLPPGKECPFEEDIHIPLAVRGPGVPAG 349
>gi|391873570|gb|EIT82595.1| sulfatase [Aspergillus oryzae 3.042]
Length = 598
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 88/176 (50%), Gaps = 17/176 (9%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED--SAPQYANMFFNV---TSHHTP 129
D++ ++ L + ++P L ++ APH + +A Y F+ H
Sbjct: 200 DVLRKKALGLLDDAVTESHERPFFLTIAPIAPHSNFEMTNASDYTTFRFSAPIPLERHKD 259
Query: 130 AYDYAP-------NPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
+ A NPD+ WI + ++ Q D RL+ LQ VD+ VE++
Sbjct: 260 LFPEAKVPRTEHFNPDQPSGVNWISTLPQQNQS-SIDSNDEFYRARLRALQGVDEIVERV 318
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
V+ L+D G L+NTYI YTSD+GYH+GQ L GK FE DIRVP +RGPGI G
Sbjct: 319 VQRLEDAGVLDNTYIFYTSDNGYHIGQHRLHPGKECGFEEDIRVPMFIRGPGIPSG 374
>gi|159479262|ref|XP_001697712.1| hypothetical protein CHLREDRAFT_192731 [Chlamydomonas reinhardtii]
gi|158274080|gb|EDO99864.1| predicted protein [Chlamydomonas reinhardtii]
Length = 686
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 79/245 (32%), Positives = 117/245 (47%), Gaps = 38/245 (15%)
Query: 23 GKYLNKYNGSYI-----PPGWREWGALIMN----SKYY---------NYSINMNGRKIKH 64
GK+LN + + P GW LI + S+Y NY N G
Sbjct: 165 GKFLNGFTYKAVKKFGCPKGWTVADPLIQDFTTDSEYEFSEDYPRAPNYMTNCQGEV--- 221
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH---GP---EDSAPQYAN 118
+ Y+ DY+ ++I + ++ +L + S KP L +S APH GP P Y +
Sbjct: 222 -QTYFGDYHEEVIRSKALTYLDDG--AASGKPFFLYISTVAPHDAKGPGAYPQVPPAYRD 278
Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL--LMTKRLQTLQSVDDAVE 176
+F V + T + P P K P TD+ RLQ+L++VDD ++
Sbjct: 279 LFPGVKAPRTGNWG-KPVPAAVGF----SKQTPNDFNVTDIDGRYRARLQSLRAVDDTID 333
Query: 177 KIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
++K L G L+NT I+YTSD+G+ LG + + K FE D+RVPFLM GPG VP +
Sbjct: 334 AVMKRLACHGLLDNTVIMYTSDNGFKLGNHNMAQEKFTYFEEDVRVPFLMAGPG-VPRGV 392
Query: 237 YLLDV 241
Y+ +V
Sbjct: 393 YVPEV 397
>gi|302905774|ref|XP_003049336.1| hypothetical protein NECHADRAFT_45116 [Nectria haematococca mpVI
77-13-4]
gi|256730271|gb|EEU43623.1| hypothetical protein NECHADRAFT_45116 [Nectria haematococca mpVI
77-13-4]
Length = 580
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 58/200 (29%), Positives = 98/200 (49%), Gaps = 20/200 (10%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
+Y+N I NG + Y Y D+IA ++ +L ++ P L +S APH
Sbjct: 152 QYWNAKITHNGEEPV---SYAGQYSTDVIAKKALAWLDEALAE--PDPFFLTVSPIAPHS 206
Query: 109 --------------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
SAP++ ++F + +++ A + W+ + +
Sbjct: 207 NWVIDTERDLSYLEEPKSAPRHQHLFSDYVIPREKSFNAAIDGAAGWVGDLP-PLNETTV 265
Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
+ D +RL+ LQSVD+ V ++V L+ G+L+NT+I YT+D+GYH+ Q + GK
Sbjct: 266 TYNDHYQRQRLRALQSVDEMVSELVGRLEKAGQLDNTFIFYTTDNGYHISQHRMNPGKEC 325
Query: 215 PFEFDIRVPFLMRGPGIVPG 234
++ DI +PF +RGPGI G
Sbjct: 326 GYDTDIHIPFFVRGPGIPAG 345
>gi|167521003|ref|XP_001744840.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776454|gb|EDQ90073.1| predicted protein [Monosiga brevicollis MX1]
Length = 589
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 64/208 (30%), Positives = 94/208 (45%), Gaps = 40/208 (19%)
Query: 61 KIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH---GP----EDSA 113
I Y +Y +I N S++++ ++ + +P M ++ APH GP A
Sbjct: 225 AITFNNGTYANYSTSVIGNVSLQWIE--SKARANQPFMAYIAPKAPHIQDGPGWPKPIPA 282
Query: 114 PQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFT---------------- 157
P YAN F V++ P ++ + PD W++ M T
Sbjct: 283 PWYANAFAQVSAPRPPNFNVSA-PDHHWLVASQAPMTAEEVSDTKSTTFASSLYSAQSIG 341
Query: 158 --------------DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHL 203
D L R ++L SVDD VE +V L +L+ TYI+YTSDHGY
Sbjct: 342 RLSGLVCVQQEVNMDELYRDRWRSLLSVDDLVESLVNTLAATNQLDRTYILYTSDHGYRF 401
Query: 204 GQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
GQF + GK +E D+R+PF+ RGPGI
Sbjct: 402 GQFRMPMGKWGLYENDVRIPFVARGPGI 429
>gi|238500393|ref|XP_002381431.1| arylsulfatase, putative [Aspergillus flavus NRRL3357]
gi|220693184|gb|EED49530.1| arylsulfatase, putative [Aspergillus flavus NRRL3357]
Length = 598
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 88/176 (50%), Gaps = 17/176 (9%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED--SAPQYANMFFNV---TSHHTP 129
D++ ++ L + ++P L ++ APH + +A Y F+ H
Sbjct: 200 DVLRKKALGLLDDAVTESHERPFFLTIAPIAPHSNFEMTNASDYTTFRFSAPIPLERHKD 259
Query: 130 AYDYAP-------NPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
+ A NPD+ WI + ++ Q D RL+ LQ VD+ VE++
Sbjct: 260 LFPEAKVPRTEHFNPDQPSGVNWISTLPQQNQS-SIDSNDEFYRARLRALQGVDEIVEQV 318
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
V+ L+D G L+NTYI YTSD+GYH+GQ L GK FE DIRVP +RGPGI G
Sbjct: 319 VQRLEDAGVLDNTYIFYTSDNGYHIGQHRLHPGKECGFEEDIRVPMFIRGPGIPSG 374
>gi|380477817|emb|CCF43942.1| sulfatase [Colletotrichum higginsianum]
Length = 607
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 69/248 (27%), Positives = 117/248 (47%), Gaps = 41/248 (16%)
Query: 21 YFGKYLNKYN-GSYIPPGWREWGA--LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
Y GK N + +Y P + W + +++ Y+Y R + ++ +Y DL+
Sbjct: 141 YTGKMFNAHTIWNYDSPHLKGWNSSDFLLDPNTYDYYNATFQRNHEKPVNHLGEYSTDLV 200
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDSAP----------- 114
A + FL + ++ KP L ++ APH GP+D+ P
Sbjct: 201 AEKAYGFLDDAVEA--SKPFFLAIAPIAPHSNVNASILLPVKGPDDNVPVSDDKFRVNFP 258
Query: 115 ----QYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQ 166
++ ++F + T + NPDK WI + K+ + ++ D RL+
Sbjct: 259 LAAKRHEHLFTDAKVPRTDNF----NPDKPSGADWIRR-QPKLTDENVEYNDRHYQGRLR 313
Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
LQ+VD+ V+ +V L+ G L+NTY+ YT+D+GYH+ Q L GK +E DI +P ++
Sbjct: 314 ALQAVDELVDGLVDRLEKHGILDNTYVFYTTDNGYHISQHRLQPGKECGYEEDINIPLIV 373
Query: 227 RGPGIVPG 234
RGPG+ G
Sbjct: 374 RGPGVPKG 381
>gi|405122939|gb|AFR97704.1| arylsulfatase [Cryptococcus neoformans var. grubii H99]
Length = 604
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 71/237 (29%), Positives = 107/237 (45%), Gaps = 30/237 (12%)
Query: 21 YFGKYLNKY---NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
Y GK +N + N +P ++++ Y+Y R E + +Y DL+
Sbjct: 127 YTGKLMNDHTDANCESLPVSGFNSSDILVDPYTYDYWTPGFSRDNGPVEVHAGEYSTDLV 186
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED----------------SAPQYANMFF 121
+V +L + Q +P L ++ A H D S P++A +F
Sbjct: 187 HEKAVNYLEDALQE--DQPFYLTVAPVACHSWLDHKHQVDLHKFVADIPASHPRHARLFP 244
Query: 122 NVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
T + NPD+ W+ ++ K+ + D RL+ LQ+VD+ VE
Sbjct: 245 VEQIERTENW----NPDEPSGVSWVKELP-KLNSTQEAYLDEFFRGRLRALQAVDEIVED 299
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
IV L+ GEL+NTYI YT+D+GY LG GK+ FE DI VP + RGPGI G
Sbjct: 300 IVDRLEKAGELDNTYIFYTADNGYALGSHRRQPGKTLGFEEDIHVPLIARGPGIKKG 356
>gi|258564744|ref|XP_002583117.1| hypothetical protein UREG_07890 [Uncinocarpus reesii 1704]
gi|237908624|gb|EEP83025.1| hypothetical protein UREG_07890 [Uncinocarpus reesii 1704]
Length = 587
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 64/225 (28%), Positives = 106/225 (47%), Gaps = 18/225 (8%)
Query: 21 YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
Y GK +N + + GW +++ Y+Y R Y Y D+
Sbjct: 130 YVGKLMNAHTVVNYNQPVAAGWTG-SDFLLDPTTYDYWGAKMSRNGAAPVSYAGQYSSDV 188
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP----------QYANMFFNVTSH 126
+A + F+ + + KP LV + +PHG P ++A++F +
Sbjct: 189 VAEKVLGFMDDALKG--DKPFFLVAAPVSPHGQIKPPPLWFDKPEYPERHAHLFKDYKIP 246
Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T ++ WI + R Q + D RL++LQSVD+ V ++++LK
Sbjct: 247 RTSNFNPETPSGAGWIAAMPRLNQ-TQIDYHDEYQRCRLRSLQSVDEMVGDMIEKLKQHD 305
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
EL+NTY IYT+D+G+H+ Q+ L GK+ F+ DIR+P + RGPG+
Sbjct: 306 ELDNTYFIYTTDNGFHISQYRLPPGKTCGFDTDIRIPMVARGPGV 350
>gi|119483994|ref|XP_001261900.1| arylsulfatase, putative [Neosartorya fischeri NRRL 181]
gi|119410056|gb|EAW20003.1| arylsulfatase, putative [Neosartorya fischeri NRRL 181]
Length = 583
Score = 90.1 bits (222), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 68/238 (28%), Positives = 115/238 (48%), Gaps = 26/238 (10%)
Query: 21 YFGKYLNKYNGSY----IPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
Y GK +N ++ S P GW +++ Y Y + R + ++Y Y D+
Sbjct: 134 YTGKLMNGHSTSNYNSPFPKGWNG-SDFLLDPYTYAYLNSTYQRNREPPKNYAGQYTTDV 192
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--------------GPEDSAP----QYAN 118
I ++ FL + S +P L ++ APH G +AP ++ +
Sbjct: 193 IREKAMGFLNDALSS--DRPFFLAVAPIAPHSNIEPSALGTGGAKGTLMTAPIPAERHKD 250
Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
+F + T ++ W+ + Q V + D +RL++LQ++D+ V+ +
Sbjct: 251 LFPDAKVPRTANFNPKVPTGGGWVQHLPLANQTV-IDYQDHFYRQRLRSLQALDEMVDAL 309
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+ L++ G+ NTYIIYT+D+G+H+GQ L GKS +E DI VPF +RGPGI G +
Sbjct: 310 ITRLEESGQAENTYIIYTADNGFHIGQHRLPPGKSSGYEEDIHVPFYIRGPGIPKGKI 367
>gi|46126687|ref|XP_387897.1| hypothetical protein FG07721.1 [Gibberella zeae PH-1]
Length = 603
Score = 90.1 bits (222), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 56/192 (29%), Positives = 98/192 (51%), Gaps = 32/192 (16%)
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS------------------ 112
+Y D++A + FL ++ + P L ++ APH D+
Sbjct: 194 EYVTDVLARKTQGFLDEALKE--DAPFFLTIAPNAPHSNVDASFAKINFTDTPRNWTFKM 251
Query: 113 -----APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTR---KMQPVHRQFTDLLMTKR 164
AP++A++F +V T + NPD Q + R K+ + + D R
Sbjct: 252 TTPIPAPRHAHLFKDVVVPRTENF----NPDSQIVPSWIRDQPKLTQENIDWNDDFYRNR 307
Query: 165 LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPF 224
L+ LQ+VD+ ++ + K+L+D G L++TY++YT+D+G+H+ Q L GK +E DI +P
Sbjct: 308 LRALQAVDELIDDVFKKLEDAGVLDHTYVVYTTDNGFHISQHRLQPGKECGYEEDINIPL 367
Query: 225 LMRGPGIVPGTM 236
++RGPGI G++
Sbjct: 368 IIRGPGIPAGSI 379
>gi|346321498|gb|EGX91097.1| arylsulfatase precursor [Cordyceps militaris CM01]
Length = 603
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 74/231 (32%), Positives = 112/231 (48%), Gaps = 29/231 (12%)
Query: 21 YFGKYLN-----KYNGSYIPPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYY 73
Y GK +N YN + P GW L+ +Y N ++++ ++ + Y
Sbjct: 158 YTGKLMNMHGVETYNNPF-PAGWNRSDYLLDPGTYRYLNSVMSLDEQEYR---SMAGTYS 213
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS--------APQYANMFFNVTS 125
D I + +V FL + KP L ++ PHG + A ++ ++F ++
Sbjct: 214 TDNIRDRAVEFLGNGIDA--GKPFFLGVAPIGPHGQDTPKGFDHPIPADRHKDLFPDLQV 271
Query: 126 HHTPAYDYAPNPDKQWILQ---VTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
TP + NPDK + T K+ P ++ D +R+Q LQSVD+ VE ++ +L
Sbjct: 272 PRTPNF----NPDKAGDVHYFASTPKLAPHQVEYLDEYYRRRIQALQSVDELVEALMLKL 327
Query: 183 KDLGE-LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
+ L NTY+ YTSD+GYHLGQ L GK E DI VP L+RGPGI
Sbjct: 328 EQHRHVLANTYVFYTSDNGYHLGQHRLPPGKCTSMEDDINVPLLVRGPGIA 378
>gi|119181412|ref|XP_001241920.1| hypothetical protein CIMG_05816 [Coccidioides immitis RS]
Length = 806
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 63/229 (27%), Positives = 108/229 (47%), Gaps = 18/229 (7%)
Query: 21 YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
Y GK +N + + GW L+ + Y + M+ R Y Y D+
Sbjct: 349 YVGKLMNAHTIVNYNEPVAAGWTGSDFLLDPTTYDYWGAKMS-RNGAAPVSYAGQYSSDV 407
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP----------QYANMFFNVTSH 126
+A + F+ + + KP LV + +PHG P ++A++F +
Sbjct: 408 VAEKVLGFMDDAMKE--DKPFFLVAAPVSPHGQVKPPPLWFDKPEYPARHAHLFKDYKIP 465
Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T ++ W+ + R Q ++ D RL+ LQSVD+ ++ +V++L +
Sbjct: 466 RTSNFNPKDPSGVGWVASMPRLNQ-TQIEYHDEYQRCRLRALQSVDEMIDSMVEKLAEND 524
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
L+NTY IYT+D+G+H+ Q L GK+ F+ DIR+P ++RGPG+ G
Sbjct: 525 LLDNTYFIYTTDNGFHISQHRLPPGKTCGFDTDIRIPMIVRGPGVAKGA 573
>gi|336178163|ref|YP_004583538.1| N-acetylglucosamine-6-sulfatase [Frankia symbiont of Datisca
glomerata]
gi|334859143|gb|AEH09617.1| N-acetylglucosamine-6-sulfatase [Frankia symbiont of Datisca
glomerata]
Length = 452
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 69/218 (31%), Positives = 104/218 (47%), Gaps = 29/218 (13%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
G FGKYLN +N + PP + EW ++ Y + + N+NG N Y ++I N
Sbjct: 143 GLFGKYLNAWNINDNPPHFEEWA--LLQPSYTDGTFNVNGTV-----QTINGYTTNVIRN 195
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
++ FL K + +P ++ APH P A QYA+ P ++ P+
Sbjct: 196 RTLSFL--DKAATDTRPWFAYVAPYAPHEPNTPAAQYADTVV-------PTWNGRPS--- 243
Query: 140 QWILQVTRKMQPVHRQFT-------DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
+ + R +P + Q L ++L+TL SVDDAV+ +L LG+LNNT
Sbjct: 244 --VFETDRSDKPGYIQNATNTLADGQALRQRQLRTLLSVDDAVQAFRDKLVALGQLNNTL 301
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
+IY D+GY G + GKS P+ VPF + PG
Sbjct: 302 VIYIPDNGYLWADHGRL-GKSTPYTPAHEVPFYLSWPG 338
>gi|115437586|ref|XP_001217849.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114188664|gb|EAU30364.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 579
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 101/200 (50%), Gaps = 26/200 (13%)
Query: 49 KYYNYSINMNGRKIKHGE--DYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAP 106
+YYN + NG GE Y Y D++ + FL ++ ++P M+ ++ AP
Sbjct: 152 QYYNSMMTRNG-----GEPVSYAGQYITDVMTEKAHGFLDEALAD--ERPWMMTVAPVAP 204
Query: 107 HG---------------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP 151
H PE AP++A+MF + +++ A + WI ++
Sbjct: 205 HSNGTLEIHSGVFWQSQPE-YAPRHAHMFKDYKIPRDASFNAAIDGGVSWIGKLDALNDS 263
Query: 152 VHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
V ++ D RL++LQ+VD+ VE +V L++ G L+ TYI Y++D+GYH+ Q L G
Sbjct: 264 V-IEYHDEYQRCRLRSLQAVDEMVESLVSRLEEAGVLDTTYIFYSTDNGYHISQHRLAPG 322
Query: 212 KSFPFEFDIRVPFLMRGPGI 231
K F+ DI +P +RGPGI
Sbjct: 323 KECGFDTDIHIPLYVRGPGI 342
>gi|156057213|ref|XP_001594530.1| hypothetical protein SS1G_04337 [Sclerotinia sclerotiorum 1980]
gi|154702123|gb|EDO01862.1| hypothetical protein SS1G_04337 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 492
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 77/258 (29%), Positives = 113/258 (43%), Gaps = 32/258 (12%)
Query: 21 YFGKYLNKYN----GSYIPPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
Y GK N + S P GW L+ +YYN + N E +Y
Sbjct: 34 YTGKLFNHHTVDNWNSPHPAGWTSSDFLLDPFTYQYYNSTFQRNEHPPVSHE---GEYST 90
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH---GPEDSAPQYANMFFNV--TSHHTP 129
D++A + L + QS KP LV + APH GP+ P + H
Sbjct: 91 DVLAQKAYGLLDEGVQS--GKPFFLVAAPIAPHSNVGPDAFKPGEGILMTTPLPAERHKH 148
Query: 130 AYDYAP-------NPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
+ A NPDK W+ + ++ + D RLQTLQ+VD+ +E +
Sbjct: 149 LFPSAKVPRNANFNPDKPSGASWVKHLD-QLTDEAIAYNDEFYRARLQTLQAVDELIEGL 207
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
L D G L+NTY I+++D+GYH+ Q L GK FE DI +P ++RGPG+ G
Sbjct: 208 FHRLTDYGILDNTYFIFSTDNGYHISQHRLAPGKECGFEEDINIPLVIRGPGVPQGEETS 267
Query: 239 LDV----LIPQVRKFSSG 252
+ L P + K ++G
Sbjct: 268 IVTAHTDLAPTILKIANG 285
>gi|303318527|ref|XP_003069263.1| arylsulfatase precursor, putative [Coccidioides posadasii C735
delta SOWgp]
gi|240108949|gb|EER27118.1| arylsulfatase precursor, putative [Coccidioides posadasii C735
delta SOWgp]
Length = 658
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 62/229 (27%), Positives = 107/229 (46%), Gaps = 18/229 (7%)
Query: 21 YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
Y GK +N + + GW +++ Y+Y R Y Y D+
Sbjct: 201 YVGKLMNAHTIVNYNEPVAAGWTG-SDFLLDPTTYDYWGAKMSRNGAAPVSYAGQYSSDV 259
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP----------QYANMFFNVTSH 126
+A + F+ + + KP LV + +PHG P ++A++F +
Sbjct: 260 VAEKVLGFMDDAMKE--DKPFFLVAAPVSPHGQVKPPPLWFDKPEYPARHAHLFKDYKIP 317
Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T ++ W+ + R Q ++ D RL+ LQSVD+ ++ +V++L +
Sbjct: 318 RTSNFNPKDPSGVGWVASMPRLNQ-TQIEYHDEYQRCRLRALQSVDEMIDSMVEKLAEND 376
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
L+NTY IYT+D+G+H+ Q L GK+ F+ DIR+P ++RGPG+ G
Sbjct: 377 LLDNTYFIYTTDNGFHISQHRLPPGKTCGFDTDIRIPMVVRGPGVAKGA 425
>gi|392864834|gb|EAS30557.2| arylsulfatase [Coccidioides immitis RS]
Length = 584
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 62/229 (27%), Positives = 107/229 (46%), Gaps = 18/229 (7%)
Query: 21 YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
Y GK +N + + GW +++ Y+Y R Y Y D+
Sbjct: 127 YVGKLMNAHTIVNYNEPVAAGWTG-SDFLLDPTTYDYWGAKMSRNGAAPVSYAGQYSSDV 185
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP----------QYANMFFNVTSH 126
+A + F+ + + KP LV + +PHG P ++A++F +
Sbjct: 186 VAEKVLGFMDDAMKE--DKPFFLVAAPVSPHGQVKPPPLWFDKPEYPARHAHLFKDYKIP 243
Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T ++ W+ + R Q ++ D RL+ LQSVD+ ++ +V++L +
Sbjct: 244 RTSNFNPKDPSGVGWVASMPRLNQ-TQIEYHDEYQRCRLRALQSVDEMIDSMVEKLAEND 302
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
L+NTY IYT+D+G+H+ Q L GK+ F+ DIR+P ++RGPG+ G
Sbjct: 303 LLDNTYFIYTTDNGFHISQHRLPPGKTCGFDTDIRIPMIVRGPGVAKGA 351
>gi|406858889|gb|EKD11969.1| arylsulfatase [Marssonina brunnea f. sp. 'multigermtubi' MB_m1]
Length = 576
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 97/187 (51%), Gaps = 29/187 (15%)
Query: 68 YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH------------------GP 109
Y Y D++A + L + Q+ ++P L ++ APH G
Sbjct: 193 YEGQYSTDVLAQKAYSLLDHAVQA--EEPFFLTLAPIAPHANIEMNGSALDDGHTLKFGS 250
Query: 110 EDSAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRL 165
SA ++ ++F +V T ++ NPD W+L + ++ + + D +RL
Sbjct: 251 PVSAERHKHLFKDVKVPRTESF----NPDNPSGANWLLGL-KQQNKSNVDYNDHFYRQRL 305
Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
+ LQ+VD+ V+ + + L+ G ++NTYIIY+SD+G+H+GQ L GKS +E DI VP +
Sbjct: 306 RALQAVDELVDGLFERLEAHGIMDNTYIIYSSDNGFHIGQHRLQPGKSCGYEEDINVPLI 365
Query: 226 MRGPGIV 232
+RGPG+
Sbjct: 366 IRGPGVA 372
>gi|320036152|gb|EFW18091.1| arylsulfatase [Coccidioides posadasii str. Silveira]
Length = 584
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/229 (27%), Positives = 107/229 (46%), Gaps = 18/229 (7%)
Query: 21 YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
Y GK +N + + GW +++ Y+Y R Y Y D+
Sbjct: 127 YVGKLMNAHTIVNYNEPVAAGWTG-SDFLLDPTTYDYWGAKMSRNGAAPVSYAGQYSSDV 185
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP----------QYANMFFNVTSH 126
+A + F+ + + KP LV + +PHG P ++A++F +
Sbjct: 186 VAEKVLGFMDDAMKE--DKPFFLVAAPVSPHGQVKPPPLWFDKPEYPARHAHLFKDYKIP 243
Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T ++ W+ + R Q ++ D RL+ LQSVD+ ++ +V++L +
Sbjct: 244 RTSNFNPKDPSGVGWVASMPRLNQ-TQIEYHDEYQRCRLRALQSVDEMIDSMVEKLAEND 302
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
L+NTY IYT+D+G+H+ Q L GK+ F+ DIR+P ++RGPG+ G
Sbjct: 303 LLDNTYFIYTTDNGFHISQHRLPPGKTCGFDTDIRIPMVVRGPGVAKGA 351
>gi|326435643|gb|EGD81213.1| hypothetical protein PTSG_11248 [Salpingoeca sp. ATCC 50818]
Length = 698
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/167 (32%), Positives = 88/167 (52%), Gaps = 4/167 (2%)
Query: 71 DYYPDLIANDSVRFLRQ---SKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHH 127
+Y DL+ + +LR+ + ++ + +P L + PH P + +YA+ NVT+
Sbjct: 348 EYATDLLLEKASEWLRRDNVTGKASNGRPFFLYFAPHCPHIPSTPSDKYADACPNVTAPR 407
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
TP ++Y+ + I + ++ R D L R Q L SVDDA +++ L+DLG
Sbjct: 408 TPNFNYSTPLFHELISKQAPFVEEEARAI-DALARLRCQCLLSVDDAHAQLLSVLEDLGV 466
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
NNTY +SDHGY+LG L K +E +R+P ++ GPG+ G
Sbjct: 467 ANNTYWFVSSDHGYNLGNHRLPSNKFLLYEHSLRIPLVVHGPGVPAG 513
>gi|321252792|ref|XP_003192520.1| arylsulfatase precursor [Cryptococcus gattii WM276]
gi|317458989|gb|ADV20733.1| Arylsulfatase precursor, putative [Cryptococcus gattii WM276]
Length = 604
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 61/187 (32%), Positives = 89/187 (47%), Gaps = 27/187 (14%)
Query: 68 YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED---------------- 111
+ +Y DL+ +V +L + Q +P L ++ A H D
Sbjct: 177 HAGEYSTDLVHEKAVNYLEDALQE--DRPFYLTVAPIACHSWIDHRRMVDVHQFVADVPA 234
Query: 112 SAPQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQT 167
S P++A +F T + NPD+ W+ ++ K+ + D RL+
Sbjct: 235 SHPRHARLFPVEQIERTENW----NPDEPSGVSWVKELP-KLNSTQEAYLDEFFRGRLRA 289
Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
LQ+VD+ VE IV L+ GEL+NTYI YT+D+GY LG GK+ FE DI VP + R
Sbjct: 290 LQAVDEMVEDIVDRLEKAGELDNTYIFYTADNGYALGSHRRQPGKTLGFEEDIHVPLIAR 349
Query: 228 GPGIVPG 234
GPGI G
Sbjct: 350 GPGIKKG 356
>gi|345570135|gb|EGX52960.1| hypothetical protein AOL_s00007g296 [Arthrobotrys oligospora ATCC
24927]
Length = 594
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 96/191 (50%), Gaps = 29/191 (15%)
Query: 68 YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED--------------SA 113
Y+ + D++ + S L + S +KP L ++ APH + S
Sbjct: 197 YHGQHTIDILIDKSNALLDDAIAS--EKPFFLALAPVAPHSNVEGDISDISDIEKVTFSE 254
Query: 114 P----QYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRL 165
P ++ ++F + TP + NPDK WI + ++ D +RL
Sbjct: 255 PIPLKRHEHLFPDAKVPRTPHF----NPDKPSGVNWIAE-RPQLNKTWVDHNDYYYRQRL 309
Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
+ LQ VD+ V+ I K L++ G+L+NTYI YT+D+G+H+ Q L GK FE DIRVPF+
Sbjct: 310 RALQGVDELVDGIFKRLEENGQLDNTYIFYTADNGFHVSQHRLNPGKECGFEDDIRVPFI 369
Query: 226 MRGPGIVPGTM 236
+RGPGI G +
Sbjct: 370 VRGPGIAEGAV 380
>gi|70982853|ref|XP_746954.1| arylsulfatase [Aspergillus fumigatus Af293]
gi|66844579|gb|EAL84916.1| arylsulfatase, putative [Aspergillus fumigatus Af293]
gi|159123838|gb|EDP48957.1| arylsulfatase, putative [Aspergillus fumigatus A1163]
Length = 598
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 68/236 (28%), Positives = 111/236 (47%), Gaps = 26/236 (11%)
Query: 21 YFGKYLNKYNGSY----IPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
Y GK +N + S P GW +++ Y Y + R + ++Y Y D+
Sbjct: 134 YTGKLMNGHTTSNYNSPFPKGWNG-SDFLLDPYTYAYLNSTYQRNREPPKNYAGQYTTDV 192
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--------------GPEDSAP----QYAN 118
I ++ FL + S +P L ++ APH G +AP ++ +
Sbjct: 193 ITEKALGFLNDALSS--DRPFFLAVAPIAPHSNIEPSALGTGGAKGTLMTAPIPAERHKD 250
Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
+F + T ++ W+ + Q V + D +RL+ LQ++D+ V+ +
Sbjct: 251 LFPDAKVPRTANFNPKVPTGGGWVQHLPLANQTV-IDYQDHFYRQRLRALQALDEMVDAL 309
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+ L++ G NTYIIYT+D+G+H+GQ L GKS +E DI VPF +RGPGI G
Sbjct: 310 ITRLEESGHAENTYIIYTADNGFHIGQHRLPPGKSSGYEEDIHVPFYIRGPGIPRG 365
>gi|269839628|ref|YP_003324320.1| sulfatase [Thermobaculum terrenum ATCC BAA-798]
gi|269791358|gb|ACZ43498.1| sulfatase [Thermobaculum terrenum ATCC BAA-798]
Length = 477
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 107/234 (45%), Gaps = 31/234 (13%)
Query: 22 FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDS 81
FGKYLN ++ PP + WG I + N N GR ++ Y D+ +A+D+
Sbjct: 130 FGKYLNYWDVYRAPPYFDRWG--IQEPVHVNALFNEQGR-VERVPGYSTDH----VADDA 182
Query: 82 VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANM---------------------F 120
V F+R+ + + +P L ++ PH P +YA+ F
Sbjct: 183 VEFIRERAAAGNGRPFFLYVAPHVPHAPFQPPERYADAPVPPFRGDPATCERDISDKPPF 242
Query: 121 FNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
Y PD + ++ + + R FT ++ T++ + L + DD V +++
Sbjct: 243 VRRLERTHRVEHYQLPPDDRGRMRWSSAQR--QRYFTQIIPTQQERMLMAADDLVNRVMS 300
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L G NT +IY SD+G+ G+ GLV KS+P+ ++VP +MR PG VP
Sbjct: 301 TLHSTGLDRNTLVIYISDNGFLWGEHGLVS-KSWPYTQSVQVPMMMRWPGHVPA 353
>gi|119488189|ref|XP_001262637.1| arylsulfatase [Neosartorya fischeri NRRL 181]
gi|119410795|gb|EAW20740.1| arylsulfatase [Neosartorya fischeri NRRL 181]
Length = 575
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/184 (31%), Positives = 88/184 (47%), Gaps = 29/184 (15%)
Query: 77 IANDSVRFLRQSKQSF------SKKPIMLVMSFPAPHG-------------------PED 111
+ + LR+ F K+P L ++ APH PED
Sbjct: 180 LGRHTTEVLREKAMGFLEDALDGKRPFFLAVTPIAPHSNMNGTYGGGAGPMWMDEPIPED 239
Query: 112 SAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSV 171
++ +F + TP ++ WI + + Q V + D +RL+ LQ V
Sbjct: 240 ---RHKRLFPDAKVPRTPNFNPKEPTGVSWIHDLPYRNQTV-VDYNDHYYRQRLRALQGV 295
Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
D+ V++++ L+ +L NTYIIYTSD+G+H+GQ L GK+ F+ DIRVPF +RGPGI
Sbjct: 296 DELVDQLITRLEQSDKLENTYIIYTSDNGFHVGQHRLPPGKTCGFDEDIRVPFFIRGPGI 355
Query: 232 VPGT 235
G
Sbjct: 356 PEGA 359
>gi|323455238|gb|EGB11107.1| hypothetical protein AURANDRAFT_62077 [Aureococcus anophagefferens]
Length = 537
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 65/229 (28%), Positives = 104/229 (45%), Gaps = 25/229 (10%)
Query: 22 FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGE--DYYNDYYPDLIAN 79
GKY+N Y+P GW + A + + N I G + Y ++ N
Sbjct: 130 LGKYVNAMPEGYVPQGWSAYLANGGGDYFGPEFVARNAPGIPDGAYAAPKDAYTVSVVGN 189
Query: 80 DSVRFLRQSKQSFSK-KPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
SV FL ++ SK +P++L + A H P + P YA+ H P++ P
Sbjct: 190 ASVAFL---ERHLSKSEPVLLYCAPKAVHEPFNPPPWYAD-------HWDPSWPAKPVRG 239
Query: 139 KQW--ILQVTRK------MQP----VHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
W R+ QP + D + R ++L +VDD VE +V + G
Sbjct: 240 PAWNATFDARRRHAANVPTQPMITAAAAEVVDGVYRNRWRSLMAVDDLVEALVTTVDARG 299
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
+ TY ++TSDHG+ LGQ ++ K +++D RVP +++GPG+ GT
Sbjct: 300 ATDRTYFVFTSDHGFQLGQLNMLMDKRHVYDWDTRVPLVIKGPGVGRGT 348
>gi|159489062|ref|XP_001702516.1| arylsulfatase [Chlamydomonas reinhardtii]
gi|158280538|gb|EDP06295.1| arylsulfatase [Chlamydomonas reinhardtii]
Length = 705
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 72/252 (28%), Positives = 123/252 (48%), Gaps = 53/252 (21%)
Query: 22 FGKYLNKYN---GSYIPPGWREWGALIMNSKYY----NYSINMNGRKIKHGEDYYNDYYP 74
GK+LN + GS P G+ AL+ Y +S N G K+G+ +DY
Sbjct: 163 IGKFLNGFAVGPGSVCPRGFTVLDALVDPYVYSYFGPGFSKNC-GTVQKYGD---SDYST 218
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG-PEDS---------------APQYAN 118
D+I + ++ +LR++ + +P + ++ APH +DS AP++A
Sbjct: 219 DVIRDKAIGYLREAVAA--GQPFYMQVNPIAPHERCKDSEEGGGGRLACDNAIPAPRHAY 276
Query: 119 MFFNVT----------------SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMT 162
+F N + + + D + + +Q + ++ +RQ
Sbjct: 277 LFRNASLPRSANWMTPLPESFRAAVAASTDSVRSGKELGAVQSSLQLNTAYRQ------- 329
Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
RL+ + SVD+ + IV+EL G L++T+II+TSD+GYH+G +GK+ P+E D+RV
Sbjct: 330 -RLRAMASVDEMLGDIVQELSAQGVLDSTHIIFTSDNGYHMGNHNFGQGKTLPYEEDVRV 388
Query: 223 PFLMRGPGIVPG 234
PF +RGPG+ G
Sbjct: 389 PFFIRGPGLPRG 400
>gi|169598524|ref|XP_001792685.1| hypothetical protein SNOG_02066 [Phaeosphaeria nodorum SN15]
gi|160704411|gb|EAT90278.2| hypothetical protein SNOG_02066 [Phaeosphaeria nodorum SN15]
Length = 577
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 69/233 (29%), Positives = 108/233 (46%), Gaps = 29/233 (12%)
Query: 21 YFGKYLNKYNGSY----IPPGWREWGALIMNSKY--YNYSINMNGRKIKHGEDYYNDYYP 74
Y GK +N ++ + +P GW L+ Y +N + + YN
Sbjct: 140 YTGKLMNGHSTTTWNKPLPAGWNGTDFLVDPGTYIYWNATFQKDQAPPAPAPGQYNT--- 196
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS---------APQYANMFFNVTS 125
DL+ + F+ + KP + ++ PH + A ++ +F + +
Sbjct: 197 DLVKEKGLEFIDTVANT--SKPFFIGIAPIGPHAEFNGTGGFTKAVGAKRHQKLFLDAKA 254
Query: 126 HHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
T + NPDK WI ++ K+ D RLQ+LQSVD+ V+++V
Sbjct: 255 PRTSNW----NPDKPSGASWISKL-EKLNGTVIASHDEWHVGRLQSLQSVDELVDEVVNR 309
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L+ L+NTYII+TSD+GYH+GQ L GK+ FE DI VPF +RGP + G
Sbjct: 310 LEKYKLLDNTYIIFTSDNGYHIGQHRLQPGKTCAFEEDINVPFYVRGPNVPKG 362
>gi|171693357|ref|XP_001911603.1| hypothetical protein [Podospora anserina S mat+]
gi|170946627|emb|CAP73430.1| unnamed protein product [Podospora anserina S mat+]
Length = 577
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 92/174 (52%), Gaps = 15/174 (8%)
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-------------GPEDSAPQYANMFFN 122
LI+ ++ FL + S KP + ++ APH GP ++A++F +
Sbjct: 180 LISEKALGFLDDAI-SNPDKPFFIGIAPIAPHSNMYIDENDVRHWGPPIPLDKHAHLFPD 238
Query: 123 VTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
TP ++ W+ + RK ++ D +RL++LQSVD+ V+ ++K L
Sbjct: 239 AQVPRTPNFNPDVESGGSWVRGL-RKQDDDVVEYNDHYYRERLRSLQSVDELVDDVIKRL 297
Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+ L+NTYIIYTSD+G+H+GQ + GK +E DI VP ++RGPG+ G +
Sbjct: 298 EAAKLLDNTYIIYTSDNGFHIGQHRMQPGKMCGYEEDINVPLIIRGPGVPKGEV 351
>gi|241601619|ref|XP_002405037.1| sulfatase, putative [Ixodes scapularis]
gi|215500549|gb|EEC10043.1| sulfatase, putative [Ixodes scapularis]
Length = 145
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 59/159 (37%), Positives = 86/159 (54%), Gaps = 21/159 (13%)
Query: 77 IANDSVRFLR-QSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
++ + FLR ++ QS KP + +S PAPH P A ++ N + V + TP+++ +
Sbjct: 1 MSEKAANFLRPRTNQS---KPFFMFLSPPAPHQPFTPAVRHRNAYNGVQAPRTPSFNVSS 57
Query: 136 NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
N V+R P R+ +TL +VDD V ++V L L +TY+ +
Sbjct: 58 N-------AVSR---PYDRRIP-------WRTLLAVDDLVSEVVGILASKKLLASTYVFF 100
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
TSD+GYHLGQF K K P+E DI VP L+RGPGI G
Sbjct: 101 TSDNGYHLGQFSQPKDKRQPYETDIHVPLLVRGPGIPAG 139
>gi|159489004|ref|XP_001702487.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280509|gb|EDP06266.1| predicted protein [Chlamydomonas reinhardtii]
Length = 825
Score = 87.8 bits (216), Expect = 5e-15, Method: Composition-based stats.
Identities = 66/222 (29%), Positives = 103/222 (46%), Gaps = 36/222 (16%)
Query: 21 YFGKYLNKYNGS----YIPPGWREWGALIMNSKYYNY----SINMNGRKIKHGEDYYNDY 72
+ GK+LN + + P GW W AL+ Y Y S+N ++ N Y
Sbjct: 133 HVGKFLNAMDPTDSRFRCPRGWSTWDALVEPYVYMYYTPAFSLNCGPTQVLE-----NQY 187
Query: 73 YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD 132
D+I+ D ++ FP P A ++ +F + P ++
Sbjct: 188 STDVIS-DKCDYINARGTCV----------FPIP------AERHRTLFSHAVLPMNPNFN 230
Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
AP + + ++T V + F R++ L +VD+ + ++V L +LG L+NTY
Sbjct: 231 VAPPAELGLVNEMTSDTG-VQKHFL-----ARIRALAAVDEMIGRMVNTLTELGVLDNTY 284
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
I+YTSD+G+ LG +GK F +E RVPF MRGPGI PG
Sbjct: 285 ILYTSDNGFQLGNHAQKQGKQFHWEEITRVPFYMRGPGIPPG 326
>gi|407918049|gb|EKG11340.1| Sulfatase, partial [Macrophomina phaseolina MS6]
Length = 431
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 73/234 (31%), Positives = 106/234 (45%), Gaps = 28/234 (11%)
Query: 21 YFGKY---LNKYNGSYIPPGWREWGALIMNS--KYYNYSINMNGRKIKHGEDYYNDYYPD 75
Y GK LN N S P GW LI Y N ++MNG + E Y+
Sbjct: 95 YIGKLFNGLNLANYSPKPKGWDHVDVLIDPYIYSYNNVVMSMNGDRPIWYEGYHQTDV-- 152
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP----QYANMFFNVTSHHTPAY 131
L A R Q + P L ++ +PH P ++A+ F N T
Sbjct: 153 LRAKGLARLEHLISQ---EDPFYLTIAPASPHVEGSDLPVPLQRHAHDFPNATVPKPKGG 209
Query: 132 DYAPNPD-----KQWILQVT----RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
++ PN W+ +T ++ H + RL+ L+ +D+ V +VK+L
Sbjct: 210 NFNPNDSFTEQKVSWLRNLTLLDDEQVNRTHEHYR-----HRLRALKGLDEIVHDVVKKL 264
Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+ G LNNTY+IYT+D+GYHLG + GKS P+ D +P +RGPGI PG +
Sbjct: 265 NETGALNNTYVIYTTDNGYHLGNHRVPAGKSLPYIEDTNIPMAIRGPGIPPGKI 318
>gi|302892713|ref|XP_003045238.1| hypothetical protein NECHADRAFT_94512 [Nectria haematococca mpVI
77-13-4]
gi|256726163|gb|EEU39525.1| hypothetical protein NECHADRAFT_94512 [Nectria haematococca mpVI
77-13-4]
Length = 600
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 70/250 (28%), Positives = 116/250 (46%), Gaps = 41/250 (16%)
Query: 21 YFGKYLNKYN-GSYIPPGWREWGA--LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
Y GK N + +Y P + W + +++ Y+Y R + +++ +Y DL+
Sbjct: 134 YTGKMFNAHTVWNYDSPHLKGWNSSDFLLDPNTYDYWNATFQRNHEKPKNHLGEYSTDLV 193
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS------------------------- 112
A+ + FL + + +P L ++ APH +S
Sbjct: 194 ASKAYGFLDDAVAA--GEPFFLGIAPIAPHSNVNSTILNPVKGPDDDVPVSDDLFRVNFP 251
Query: 113 --APQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQ 166
A ++ ++F + T + NPDK WI + K+ +F D RLQ
Sbjct: 252 KAAKRHEHLFPDAKVPRTENF----NPDKPSGGDWIRR-QPKLTDEMVEFNDRHYRGRLQ 306
Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
LQ+VD+ VE +V+ L+ G L+NTYI YT+D+GYH+ Q L GK +E DI +P ++
Sbjct: 307 ALQAVDELVEGLVERLEKHGILDNTYIFYTTDNGYHISQHRLQPGKECSYEEDINIPLIV 366
Query: 227 RGPGIVPGTM 236
RGPG+ G +
Sbjct: 367 RGPGVPKGGI 376
>gi|336260685|ref|XP_003345136.1| hypothetical protein SMAC_07425 [Sordaria macrospora k-hell]
gi|380096516|emb|CCC06564.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 618
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 74/125 (59%), Gaps = 2/125 (1%)
Query: 113 APQYANMFFNVTSHHTPAYDYAPNPDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSV 171
A ++A++F +V TP ++ + WI Q+ + + F D +RL+TLQSV
Sbjct: 256 AARHAHLFPDVVVPRTPHFNPSSRASGVSWISQLPVQSAE-NVAFNDHFYRQRLRTLQSV 314
Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
D+ V +++ L+ G L +TY+ YT+D+GYH+GQ L GK FE DI VP ++RGPG+
Sbjct: 315 DELVAGVIERLERHGLLEDTYVFYTTDNGYHIGQHRLQPGKECGFEEDINVPLIVRGPGV 374
Query: 232 VPGTM 236
G +
Sbjct: 375 KKGAV 379
>gi|391870792|gb|EIT79965.1| arylsulfatase [Aspergillus oryzae 3.042]
Length = 586
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/81 (50%), Positives = 56/81 (69%)
Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
+ D +RL+ LQ VD+ V+ +V L+ +L+NTYIIYTSD+G+H+GQ L GK+
Sbjct: 290 YNDHYYRQRLRALQGVDELVDSLVTRLEQSDKLDNTYIIYTSDNGFHIGQHRLPPGKTCG 349
Query: 216 FEFDIRVPFLMRGPGIVPGTM 236
FE DIRVPF +RGPGI G +
Sbjct: 350 FEEDIRVPFFIRGPGIPEGAV 370
>gi|350636581|gb|EHA24941.1| hypothetical protein ASPNIDRAFT_131173 [Aspergillus niger ATCC
1015]
Length = 566
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/79 (53%), Positives = 53/79 (67%)
Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
+ D +RL+ LQ VD+ VE+IV L D G L+NTYI YTSD+GYH+GQ L GK
Sbjct: 267 YNDEFYRERLRALQGVDELVEEIVLRLDDAGILDNTYIFYTSDNGYHIGQHRLHPGKECG 326
Query: 216 FEFDIRVPFLMRGPGIVPG 234
F+ DIRVP +RGPG+ G
Sbjct: 327 FDEDIRVPMFVRGPGVPKG 345
>gi|395752419|ref|XP_002830441.2| PREDICTED: LOW QUALITY PROTEIN: extracellular sulfatase Sulf-2
[Pongo abelii]
Length = 974
Score = 86.3 bits (212), Expect = 1e-14, Method: Composition-based stats.
Identities = 37/71 (52%), Positives = 48/71 (67%), Gaps = 4/71 (5%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 273 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 332
Query: 58 NGRKIKHGEDY 68
NG K KHG DY
Sbjct: 333 NGVKEKHGSDY 343
>gi|169776738|ref|XP_001822835.1| arylsulfatase [Aspergillus oryzae RIB40]
gi|83771571|dbj|BAE61702.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 586
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/81 (50%), Positives = 56/81 (69%)
Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
+ D +RL+ LQ VD+ V+ +V L+ +L+NTYIIYTSD+G+H+GQ L GK+
Sbjct: 290 YNDHYYRQRLRALQGVDELVDSLVTRLEQSDKLDNTYIIYTSDNGFHIGQHRLPPGKTCG 349
Query: 216 FEFDIRVPFLMRGPGIVPGTM 236
FE DIRVPF +RGPGI G +
Sbjct: 350 FEEDIRVPFFIRGPGIPEGAV 370
>gi|164425630|ref|XP_959804.2| hypothetical protein NCU06041 [Neurospora crassa OR74A]
gi|157071002|gb|EAA30568.2| hypothetical protein NCU06041 [Neurospora crassa OR74A]
Length = 615
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 71/261 (27%), Positives = 111/261 (42%), Gaps = 48/261 (18%)
Query: 21 YFGKYLNK-----YNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
Y GK N Y+ YI GW +++ Y+Y R Y Y D
Sbjct: 143 YTGKLFNAHTVDNYDAPYIA-GWN-GSDFLLDPYTYSYLNATFQRNRDPPISYEGQYSVD 200
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG--------------------------- 108
++A + FL ++ ++ +P L ++ APH
Sbjct: 201 VLAEKAYGFLDEAAKNVHNRPFFLGIAPIAPHSNVEPRFPSSSSSSSSSDSATLHRRPTN 260
Query: 109 ------------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDK-QWILQVTRKMQPVHRQ 155
P A ++A++F +V TP ++ + WI ++ + +
Sbjct: 261 EHDDIEKSVSFTPPIPAARHAHLFPDVIVPRTPHFNPSSRASGVSWIARLPHQSAE-NVA 319
Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
F D +RL+ LQSVD+ V +V L+ G L NTY+ YT+D+GYH+GQ L GK
Sbjct: 320 FNDHFYRQRLRALQSVDELVAGVVARLERHGLLENTYVFYTADNGYHIGQHRLQPGKECG 379
Query: 216 FEFDIRVPFLMRGPGIVPGTM 236
FE DI VP ++RGPG+ G +
Sbjct: 380 FEEDINVPLIVRGPGVKKGEV 400
>gi|448735634|ref|ZP_21717827.1| N-acetylglucosamine-6-sulfatase [Halococcus salifodinae DSM 8989]
gi|445797667|gb|EMA48128.1| N-acetylglucosamine-6-sulfatase [Halococcus salifodinae DSM 8989]
Length = 515
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 115/233 (49%), Gaps = 27/233 (11%)
Query: 20 GYFGKYLNKY-----NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
GY Y+ K+ N + PG+ W + Y++ ++N+NG+K++ Y
Sbjct: 143 GYETAYIGKWHTYQLNSAEPRPGFDHWVSFEGQGHYFDQTLNVNGKKVER-----KGYIT 197
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY--- 131
D++ + +L+ + KP L +S APH AP++ N + N + +
Sbjct: 198 DILTEYARDWLKNRE---GDKPFFLFLSHKAPHAWFCPAPRHRNQYANASIEYPKTMADT 254
Query: 132 --DYAPNPDKQWILQVTRKMQPVHRQFT-----DLLMTKRLQTLQSVDDAVEKIVKELKD 184
+Y PD W+ ++ V+ F D L + +TL ++D+++ ++ +L +
Sbjct: 255 EENYTGKPD--WVQNQRESVRGVNYIFGGRFDFDELYRRYSETLLALDESIGAVMDQLNE 312
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGTM 236
G ++T ++Y SD+GY LG+ GL+ GK +E IRVP L PG++ PGT+
Sbjct: 313 SGVADSTLMLYMSDNGYSLGEHGLI-GKQTAYEPSIRVPMLAYAPGMIEPGTV 364
>gi|145251529|ref|XP_001397278.1| arylsulfatase [Aspergillus niger CBS 513.88]
gi|134082813|emb|CAK42644.1| unnamed protein product [Aspergillus niger]
Length = 594
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/79 (53%), Positives = 53/79 (67%)
Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
+ D RL++LQ VD+ VE+IV L D G L+NTYI YTSD+GYH+GQ L GK
Sbjct: 295 YNDEFYRDRLRSLQGVDELVEEIVLRLDDAGILDNTYIFYTSDNGYHIGQHRLHPGKECG 354
Query: 216 FEFDIRVPFLMRGPGIVPG 234
F+ DIRVP +RGPG+ G
Sbjct: 355 FDEDIRVPMFVRGPGVPKG 373
>gi|336467297|gb|EGO55461.1| arylsulfatase [Neurospora tetrasperma FGSC 2508]
gi|350288073|gb|EGZ69309.1| arylsulfatase [Neurospora tetrasperma FGSC 2509]
Length = 635
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 67/254 (26%), Positives = 109/254 (42%), Gaps = 49/254 (19%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
Y+ Y+ +NGS +++ Y+Y R Y Y D++A
Sbjct: 156 YYSPYVAGWNGS----------DFLLDPYTYSYLNATFQRNRDPPASYEGQYSVDVLAKK 205
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPHG-------------------------------- 108
+ FL ++ ++ +P L ++ APH
Sbjct: 206 AYGFLDKAAKNVHNRPFFLGIAPIAPHSNVEPRFSSSSSSSSDPATLQRRPTKEHDDIEK 265
Query: 109 -----PEDSAPQYANMFFNVTSHHTPAYDYAPNPDK-QWILQVTRKMQPVHRQFTDLLMT 162
P A ++A++F +V TP ++ + WI ++ + + D
Sbjct: 266 SVSFTPPIPAARHAHLFPDVIVPRTPHFNPSSRASGVSWIARLPHQSAE-NVACNDHFYR 324
Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
+RL+TLQSVD+ V +V L+ G L NTY+ YT+D+GYH+GQ L GK FE DI V
Sbjct: 325 QRLRTLQSVDELVAGVVARLERHGLLENTYVFYTADNGYHIGQHRLQPGKECGFEEDINV 384
Query: 223 PFLMRGPGIVPGTM 236
P ++RGPG+ G +
Sbjct: 385 PLIVRGPGVKRGEV 398
>gi|301311577|ref|ZP_07217504.1| putative N-acetylglucosamine-6-sulfatase [Bacteroides sp. 20_3]
gi|423340655|ref|ZP_17318393.1| hypothetical protein HMPREF1059_04318 [Parabacteroides distasonis
CL09T03C24]
gi|300830663|gb|EFK61306.1| putative N-acetylglucosamine-6-sulfatase [Bacteroides sp. 20_3]
gi|409226897|gb|EKN19800.1| hypothetical protein HMPREF1059_04318 [Parabacteroides distasonis
CL09T03C24]
Length = 515
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 114/244 (46%), Gaps = 35/244 (14%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGK+ + PG+ W +YYN IN NG I++ + + Y DL+ +
Sbjct: 131 AFFGKWHMGNDSGAPQPGFDHWEGFKGQGEYYNPRINTNGEWIQYKD---STYVTDLLTD 187
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-----SHHTPAYDYA 134
+++F+++ KQ+ KP + +S H +A ++ + + N + +TP Y
Sbjct: 188 HAIQFMKEQKQA--DKPFFVYLSHKGVHDNFSAAKRHKDCYKNKELVLPPNFNTPHYGIK 245
Query: 135 --PNPDKQ------------------WILQVTRKMQPV----HRQFTDLLMTKRLQTLQS 170
P DK+ W+ V H + D+ + K +TL+S
Sbjct: 246 ELPTIDKKTGKAAFGKEYYGEKMAPDWVKSQRESWHGVDYSYHGRPWDVQVRKYCETLRS 305
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
VD+++ ++ LK+ G +NT +IY D+G+ G+ GL+ + F +E +RVP L+R P
Sbjct: 306 VDESIGSVLDYLKEAGLDDNTLVIYMGDNGFAWGEHGLIDKRQF-YEESVRVPMLVRCPS 364
Query: 231 IVPG 234
+ G
Sbjct: 365 MFEG 368
>gi|115433238|ref|XP_001216756.1| hypothetical protein ATEG_08135 [Aspergillus terreus NIH2624]
gi|114189608|gb|EAU31308.1| hypothetical protein ATEG_08135 [Aspergillus terreus NIH2624]
Length = 594
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/197 (28%), Positives = 98/197 (49%), Gaps = 19/197 (9%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
+Y++ + NG K Y +Y D++ + FL + + +P ML ++ APH
Sbjct: 153 RYFHAVMTRNGGKPV---SYAGNYSTDVLREKAEGFLDDALAN-PDRPWMLTVAPNAPHS 208
Query: 108 -------------GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
G + AP++A++F + T +++ W+ ++ Q
Sbjct: 209 NGSHDPSRDVTWYGEPEYAPRHAHLFKDYTIPRDKSFNKVIEGAVGWVKDLSELSQK-EV 267
Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
+ D RL+ L++VD+ V +V +L G L+NTYI Y++D+GYH GQ + GK+
Sbjct: 268 DYIDEFQRCRLRALRAVDEMVGALVDKLDKAGVLDNTYIFYSTDNGYHAGQHAMQPGKNC 327
Query: 215 PFEFDIRVPFLMRGPGI 231
++ DI VP ++RGPGI
Sbjct: 328 GYDTDIHVPLIVRGPGI 344
>gi|384496845|gb|EIE87336.1| hypothetical protein RO3G_12047 [Rhizopus delemar RA 99-880]
Length = 378
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 66/234 (28%), Positives = 112/234 (47%), Gaps = 26/234 (11%)
Query: 21 YFGKYLNKY---NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
Y GK +N+Y N ++ P ++ +++ Y Y + + + Y N Y D+I
Sbjct: 58 YIGKLMNQYSVNNYNHPTPKGFDYQDQLVDPYTYLYHTAVFSKDGQDPVYYSNQYQTDVI 117
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-------------GPEDSAPQYANMFFNVT 124
+ L++ + + P L ++ APH GP A ++A++F +V
Sbjct: 118 HAKAKAALKRLENA--TDPFFLWVAPMAPHAQFTIHANGSLSTGPAVPAARHAHLFQDVK 175
Query: 125 SHHTPAYDYAPNPDKQW----ILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
P + NP +Q + ++ P D RL+ LQ+VD+ V +++
Sbjct: 176 IPRHPHF----NPKRQTKTASFWKHLDRLSPDEVDGLDEAYRNRLRALQAVDEMVGSLME 231
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
EL+ G L NT++IY++D+GYHLGQ GK+ E DI VP ++RGPG+ G
Sbjct: 232 ELERQGRLENTFVIYSADNGYHLGQHRSFPGKTSNLEEDINVPLIVRGPGVAQG 285
>gi|2873363|gb|AAC02716.1| arylsulfatase [Neurospora crassa]
Length = 639
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 94/209 (44%), Gaps = 41/209 (19%)
Query: 68 YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG------------------- 108
Y Y D++A + FL ++ ++ +P L ++ APH
Sbjct: 193 YEGQYSVDVLAEKAYGFLDEAAKNVHNRPFFLGIAPIAPHSNVEPGFPSSSSSSSSSDSA 252
Query: 109 --------------------PEDSAPQYANMFFNVTSHHTPAYDYAPNPDK-QWILQVTR 147
P A ++A++F +V TP ++ + WI ++
Sbjct: 253 TLHRRPTNEHDDIEKSVSFTPPIPAARHAHLFPDVIVPRTPHFNPSSRASGVSWIARLPH 312
Query: 148 KMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG 207
+ + F D +RL+ L+SVD+ V +V L+ G L NTY+ YT+D+GYH+GQ
Sbjct: 313 QSAE-NVAFNDHFYRQRLRALESVDELVAGVVARLERHGLLENTYVFYTADNGYHIGQHR 371
Query: 208 LVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
L GK FE DI VP ++RGPG+ G +
Sbjct: 372 LQPGKECGFEEDINVPLIVRGPGVKKGEV 400
>gi|238502181|ref|XP_002382324.1| gloeobacteria sulfatase, putative [Aspergillus flavus NRRL3357]
gi|220691134|gb|EED47482.1| gloeobacteria sulfatase, putative [Aspergillus flavus NRRL3357]
Length = 588
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 40/81 (49%), Positives = 56/81 (69%)
Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
+ D +RL+ LQ VD+ V+ +V L+ +L+NTYIIYTSD+G+H+GQ L GK+
Sbjct: 290 YNDHYYRQRLRALQGVDELVDSLVTRLEQSDKLDNTYIIYTSDNGFHIGQHRLPPGKTCG 349
Query: 216 FEFDIRVPFLMRGPGIVPGTM 236
F+ DIRVPF +RGPGI G +
Sbjct: 350 FDEDIRVPFFIRGPGIPEGAV 370
>gi|6822089|emb|CAB70953.1| Arylsulfatase [Colletotrichum gloeosporioides f. sp. malvae]
Length = 511
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/150 (34%), Positives = 86/150 (57%), Gaps = 13/150 (8%)
Query: 113 APQYANMFFNVTSHHTPAYDYAPNPDKQ----WILQVTRKMQPVHRQFTDLLMTKRLQTL 168
A ++A++F +V T + NPDK+ W+ + R+ + F D RL+ L
Sbjct: 167 AARHAHLFPDVVVPRTENF----NPDKESGANWVREQPRQSDE-NVAFNDHFYRSRLRAL 221
Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
Q+VD+ VE +V +L++ G L +TYI YT+D+G+H+GQ GK +E DI +P ++RG
Sbjct: 222 QAVDELVESVVLKLEEYGILEDTYIFYTTDNGFHIGQHRPQPGKECGYEEDINIPLIVRG 281
Query: 229 PGIVPG-TMYLLDV---LIPQVRKFSSGSL 254
P + G T+ ++ V L+P + +SG L
Sbjct: 282 PRVPEGKTVEIVTVHQDLVPTILGLASGPL 311
>gi|402075556|gb|EJT71027.1| hypothetical protein GGTG_12048 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 620
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 71/224 (31%), Positives = 114/224 (50%), Gaps = 19/224 (8%)
Query: 21 YFGKYLNKYN--GSYI-PPGWREWGALIMN-SKYYNYSI-NMNGRKIKHGEDYYNDYYPD 75
Y GK LN YN ++I P G+ AL+ + YYN + + NG + + + +++
Sbjct: 123 YIGKLLNGYNVVNAHIRPKGFDHIDALLDPYTTYYNVPVMSQNGERPVYYKGFHSTDVVR 182
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED---SAP--QYANMFFNVTSHHTPA 130
+ A D ++FL Q P L ++ +PH + +AP ++ ++F + + P
Sbjct: 183 IKALDRLQFLLSQDQ-----PYYLQIAPYSPHIQNNLNAAAPLARHMDLFPDAKAPRLPN 237
Query: 131 YDYAPN---PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
Y+ A W+ + M +Q TD +R ++LQ VD+ +E +V L G
Sbjct: 238 YNPADEFQAGKGSWVGGLPL-MNETTQQRTDYAYRRRAESLQGVDEIIEDVVAMLAAKGT 296
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
L+ TYIIYTSD+GYHLGQ + GK+ F D +P +RGPGI
Sbjct: 297 LDTTYIIYTSDNGYHLGQHRIPAGKALFFGEDTNLPMAVRGPGI 340
>gi|325110459|ref|YP_004271527.1| sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324970727|gb|ADY61505.1| sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 501
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 93/191 (48%), Gaps = 29/191 (15%)
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
A + +L + +Q + P ++ F PH D P+ + + +V +H P
Sbjct: 182 AEQVLNYLNEREQQADEDPFLIYFGFSHPHDTRDGKPELLDKYGSV--NHKDKSSLPPAN 239
Query: 138 DKQWILQVT-RKMQPVHRQFTDLL--------------------MTKRLQTLQSVDDAVE 176
DKQ L + P H +L + + +++D ++
Sbjct: 240 DKQPELPPNYLEAHPFHHGHPNLRDEVAVSGVWERRDERTIRNELGREFACSENIDIQLD 299
Query: 177 KIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI----- 231
+++K L+++GEL+NTY+ YTSDHG +G+ GL +GK +E RVPF+++GPGI
Sbjct: 300 RVLKRLEEMGELDNTYVFYTSDHGMAIGRHGL-QGKQNLYEHTWRVPFIVKGPGIEAGSR 358
Query: 232 VPGTMYLLDVL 242
V G +YLLDVL
Sbjct: 359 VQGNIYLLDVL 369
>gi|350630052|gb|EHA18425.1| hypothetical protein ASPNIDRAFT_176469 [Aspergillus niger ATCC
1015]
Length = 561
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 60/94 (63%), Gaps = 1/94 (1%)
Query: 141 WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
WI + + Q V + D +RL+ LQ VD+ V+ ++ L+ E++NTYIIYTSD+G
Sbjct: 253 WIHNLPYRNQSV-IDYNDHYYRQRLRALQGVDELVDSLIARLEASSEIDNTYIIYTSDNG 311
Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+H+ Q L GK+ F+ DIRVPF +RGPGI G
Sbjct: 312 FHISQHRLPPGKTCAFDEDIRVPFFIRGPGIPEG 345
>gi|336178164|ref|YP_004583539.1| N-acetylglucosamine-6-sulfatase [Frankia symbiont of Datisca
glomerata]
gi|334859144|gb|AEH09618.1| N-acetylglucosamine-6-sulfatase [Frankia symbiont of Datisca
glomerata]
Length = 475
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 67/227 (29%), Positives = 110/227 (48%), Gaps = 30/227 (13%)
Query: 19 HGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIA 78
+G+FGKYLN +N + PP + ++ AL+ Y + N+NG + Y +I
Sbjct: 137 NGFFGKYLNAWNVADAPPHFEDF-ALLDPVVYNDGQYNVNGTI-----QTISGYSTHVIR 190
Query: 79 NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
+ + F++ K + +P L ++ APHGP PQYAN P +D P+
Sbjct: 191 DFAKGFIQ--KGTTDTRPWFLYIAPKAPHGPNTPEPQYANTVV-------PTWDGRPS-- 239
Query: 139 KQWILQVTRKMQPVHRQFTDLLMT-------KRLQTLQSVDDAVEKIVKELKDLGELNNT 191
+ + + +P + Q + + +L+TL SVDDAV+ + +L G+L+NT
Sbjct: 240 ---VFETDKSDKPDYIQSANATLADGRAIREAQLRTLLSVDDAVQAVHDKLVATGQLDNT 296
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP--GIVPGTM 236
+IY D+G+ G K KS P+ + VPF + P G+ GT
Sbjct: 297 LVIYIGDNGFTWADHGWTK-KSVPYLPSVEVPFYLSWPAGGLGAGTA 342
>gi|255951092|ref|XP_002566313.1| Pc22g24250 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211593330|emb|CAP99713.1| Pc22g24250 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 550
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/91 (45%), Positives = 61/91 (67%), Gaps = 1/91 (1%)
Query: 141 WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
WI + + Q V + D +RL+ LQ VD+ V+ ++ L+ G+++NT+IIYTSD+G
Sbjct: 230 WIHDLPYRNQTV-IDYNDHYYRQRLRALQGVDELVDNLITHLEKSGKIDNTFIIYTSDNG 288
Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
+H+ Q L GK+ F+ DIRVPF+MRGPGI
Sbjct: 289 FHISQHRLPPGKTCGFDEDIRVPFMMRGPGI 319
>gi|159462406|ref|XP_001689433.1| arylsulfatase [Chlamydomonas reinhardtii]
gi|158283421|gb|EDP09171.1| arylsulfatase [Chlamydomonas reinhardtii]
Length = 518
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 67/219 (30%), Positives = 97/219 (44%), Gaps = 53/219 (24%)
Query: 23 GKYLNKYNGS-----YIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
GK+LN Y S Y+P G+ + AL N+ YN + + R Y +DY DLI
Sbjct: 120 GKFLNGYYTSEAPAGYVPKGFTVFDALTTNA--YNMTNSCFSRNGGRSTCYPSDYQTDLI 177
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
+ ++ + ++ KKP ++ APH N+ + H P
Sbjct: 178 GVKAKNYISAAVRA--KKPFFAYVTPTAPH----------RSSTNLVTWHPPT------- 218
Query: 138 DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
P R DL A E ++ L+DL L+NT +IY S
Sbjct: 219 -------------PASRH-KDLY-------------ADENVLGHLQDLEVLDNTIVIYMS 251
Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
D+GYH+G F L+ GK+ P E D+RVPF +RGPGI G +
Sbjct: 252 DNGYHMGAFSLLDGKNLPIEEDVRVPFFIRGPGIPAGQV 290
>gi|134077589|emb|CAK96733.1| unnamed protein product [Aspergillus niger]
Length = 573
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 60/94 (63%), Gaps = 1/94 (1%)
Query: 141 WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
WI + + Q V + D +RL+ LQ VD+ V+ ++ L+ E++NTYIIYTSD+G
Sbjct: 267 WIHNLPYRNQSV-IDYNDHYYRQRLRALQGVDELVDSLIARLEASSEIDNTYIIYTSDNG 325
Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+H+ Q L GK+ F+ DIRVPF +RGPGI G
Sbjct: 326 FHISQHRLPPGKTCAFDEDIRVPFFIRGPGIPEG 359
>gi|317030660|ref|XP_001393065.2| arylsulfatase [Aspergillus niger CBS 513.88]
Length = 573
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 60/94 (63%), Gaps = 1/94 (1%)
Query: 141 WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
WI + + Q V + D +RL+ LQ VD+ V+ ++ L+ E++NTYIIYTSD+G
Sbjct: 267 WIHNLPYRNQSV-IDYNDHYYRQRLRALQGVDELVDSLIARLEASSEIDNTYIIYTSDNG 325
Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+H+ Q L GK+ F+ DIRVPF +RGPGI G
Sbjct: 326 FHISQHRLPPGKTCAFDEDIRVPFFIRGPGIPEG 359
>gi|346978743|gb|EGY22195.1| arylsulfatase [Verticillium dahliae VdLs.17]
Length = 606
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 111/243 (45%), Gaps = 40/243 (16%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
Y +LN +NGS +++ Y Y R ++ Y DL+A
Sbjct: 153 YDSPHLNGWNGS----------DFLLDPNTYAYWNATFQRNKDSPVNHLGQYSTDLVAQK 202
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDSAP----QYANMFFNVT 124
+ FL + + +P L ++ APH GP+++ P ++ F
Sbjct: 203 AYGFLDDAVAA--GEPFFLGIAPIAPHSNVNASTLNPVKGPDENVPVSDDEFRVSFPQPA 260
Query: 125 SHHTPAYDYAP-------NPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
H + A NPDK WI + K+ + + DL RLQ LQ+VD+
Sbjct: 261 KRHEHLFPDAKVPRTENFNPDKPSGADWIRR-QPKLSDENVAYNDLHYRGRLQALQAVDE 319
Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
V+ ++ +L++ G L+NTYI +T+D+G+H+ Q L GK +E DI +P ++RGPG+
Sbjct: 320 LVDGLIDKLEEHGVLDNTYIFFTTDNGFHISQHRLQPGKECGYEEDINIPLIVRGPGVPK 379
Query: 234 GTM 236
G +
Sbjct: 380 GAI 382
>gi|429859197|gb|ELA33986.1| arylsulfatase [Colletotrichum gloeosporioides Nara gc5]
Length = 608
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 67/248 (27%), Positives = 112/248 (45%), Gaps = 41/248 (16%)
Query: 21 YFGKYLNKYN-GSYIPPGWREWGA--LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
Y GK N + +Y P + W +++ Y+Y R ++ +Y DL+
Sbjct: 142 YTGKMFNAHTVWNYDSPHLKGWNGSDFLLDPNTYDYWNATFQRNKDKPVNHLGEYSTDLV 201
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS------------------------- 112
A + FL + ++ KP L ++ APH +S
Sbjct: 202 AQKAYDFLDDAVEA--GKPFFLAVAPIAPHSNVNSSVLLPVKGPDDDVPVSDDKFRTSIP 259
Query: 113 --APQYANMFFNVTSHHTPAYDYAPNPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQ 166
A ++ +F + T + NPDK WI + K+ + ++ D RL+
Sbjct: 260 KAAKRHEKLFPDAKVPRTENF----NPDKPSGADWIRR-QPKLSDENVEYNDEHYRGRLR 314
Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
LQ+VD+ V+ ++ +L+ G L+NTYI YT+D+GYH+ Q L GK +E DI +P ++
Sbjct: 315 ALQAVDELVDGLIDKLEKHGLLDNTYIFYTTDNGYHISQHRLQPGKECGYETDINIPLII 374
Query: 227 RGPGIVPG 234
RGPG+ G
Sbjct: 375 RGPGVPKG 382
>gi|302830105|ref|XP_002946619.1| hypothetical protein VOLCADRAFT_86751 [Volvox carteri f.
nagariensis]
gi|300268365|gb|EFJ52546.1| hypothetical protein VOLCADRAFT_86751 [Volvox carteri f.
nagariensis]
Length = 976
Score = 83.2 bits (204), Expect = 1e-13, Method: Composition-based stats.
Identities = 61/196 (31%), Positives = 98/196 (50%), Gaps = 19/196 (9%)
Query: 51 YNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--- 107
YN S ++N + Y D+I + ++ ++ + + S P L + APH
Sbjct: 225 YNSSFSLN---CGPSQAMMGQYQTDVIRDKALAYIEGAVAAGS--PFYLQLDPAAPHVDN 279
Query: 108 ------GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQ-PVHRQFTDLL 160
P A ++A+++ + P + NP + TR M P + + L
Sbjct: 280 GGGEGWRPPPPAQRHADLYPGLELPDNPTF-LKVNPLNPF---RTRDMNNPENVKDATDL 335
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
R+++L++VD+ V IV +L DLG L+NTY+IY SD+G H+GQ L GK+ P E D
Sbjct: 336 YVGRIRSLRAVDEMVGAIVAKLSDLGILDNTYVIYMSDNGLHMGQHSLDDGKATPIEEDS 395
Query: 221 RVPFLMRGPGIVPGTM 236
RVP +RGPG+ G +
Sbjct: 396 RVPLYLRGPGVPAGVV 411
>gi|159464461|ref|XP_001690460.1| hypothetical protein CHLREDRAFT_144396 [Chlamydomonas reinhardtii]
gi|158279960|gb|EDP05719.1| predicted protein [Chlamydomonas reinhardtii]
Length = 692
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/273 (27%), Positives = 113/273 (41%), Gaps = 69/273 (25%)
Query: 23 GKYLNKY---NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYY--NDYYPDLI 77
GK+LN++ + P GW AL+ + Y Y + R Y DY D+I
Sbjct: 113 GKFLNEFTPAGAAGCPRGWTALDALV-DPWVYTYFGPVFSRDCDPQLQRYADTDYSTDVI 171
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDSAPQYANMFFNVTS 125
+ + ++R++ S +P L ++ PH G AP++A +F
Sbjct: 172 RDKGLAYIREAVSS--GRPFFLELAPVGPHEACTQGESGQWCGDPLPAPRHAGLFPGAAI 229
Query: 126 HHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMT---------------KRLQTLQS 170
T ++ P PD +T K+QP R ++ RLQ+LQ+
Sbjct: 230 PSTRSW-LTPFPDPA----LTGKLQPPQRASNGSVIGPYLPPSVRRLKQRQRARLQSLQA 284
Query: 171 VDDAVEKIV----------KELKDL-------------------GELNNTYIIYTSDHGY 201
VD+ V +V +K G L NTY+IY SD+GY
Sbjct: 285 VDELVRDVVFPQCRQGYGITTMKHCHHRRRLLLLLPAVLELAAQGVLENTYVIYMSDNGY 344
Query: 202 HLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
H+GQ GK P+E D+RVPF MRGPG+ G
Sbjct: 345 HMGQHNANAGKYMPWEEDVRVPFFMRGPGVPSG 377
>gi|302417488|ref|XP_003006575.1| arylsulfatase [Verticillium albo-atrum VaMs.102]
gi|261354177|gb|EEY16605.1| arylsulfatase [Verticillium albo-atrum VaMs.102]
Length = 606
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 66/243 (27%), Positives = 110/243 (45%), Gaps = 40/243 (16%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
Y +LN +NGS +++ Y Y R ++ Y DL+A
Sbjct: 153 YDSPHLNGWNGS----------DFLLDPNTYAYWNATFQRNKDAPVNHLGQYSTDLVAQK 202
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPH------------GPEDSAP----QYANMFFNVT 124
+ FL + + +P L ++ APH GP++ P ++ F
Sbjct: 203 AYGFLDDAVAA--GEPFFLGIAPIAPHSNVNASTLNPVKGPDEDVPVSDGEFRVNFPQPA 260
Query: 125 SHHTPAYDYAP-------NPDK----QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
H + A NPDK W+ + K+ + + DL RLQ LQ+VD+
Sbjct: 261 KRHEHLFPDAKVPRTENFNPDKPSGADWVRR-QPKLSDENVAYNDLHYRGRLQALQAVDE 319
Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
V+ ++ +L++ G L+NTYI +T+D+G+H+ Q L GK +E DI +P ++RGPG+
Sbjct: 320 LVDGLIDKLEEHGVLDNTYIFFTTDNGFHISQHRLQPGKECGYEEDINIPLIVRGPGVPK 379
Query: 234 GTM 236
G +
Sbjct: 380 GAI 382
>gi|358375590|dbj|GAA92170.1| arylsulfatase [Aspergillus kawachii IFO 4308]
Length = 578
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 40/71 (56%), Positives = 51/71 (71%)
Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
+L++LQ VD+ VE+IV L D G L+NTYI YTSD+GYH+GQ L GK F+ DIRVP
Sbjct: 287 QLRSLQGVDELVEEIVLRLDDAGILDNTYIFYTSDNGYHIGQHRLHPGKECGFDEDIRVP 346
Query: 224 FLMRGPGIVPG 234
+RGPG+ G
Sbjct: 347 MFVRGPGVPRG 357
>gi|410055272|ref|XP_514701.4| PREDICTED: extracellular sulfatase Sulf-2 isoform 3 [Pan
troglodytes]
Length = 552
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 58/99 (58%), Gaps = 6/99 (6%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + SR + S GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDYYNDYYPDLIAND--SVRFLRQSKQSFSK 94
NG K KHG DY Y L S+ L + K S+ +
Sbjct: 177 NGVKEKHGSDYSKMMYRHLFTRKQLSLEILLKYKASYVR 215
>gi|357394060|ref|YP_004908901.1| putative sulfatase [Kitasatospora setae KM-6054]
gi|311900537|dbj|BAJ32945.1| putative sulfatase [Kitasatospora setae KM-6054]
Length = 473
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 65/215 (30%), Positives = 101/215 (46%), Gaps = 22/215 (10%)
Query: 20 GYFGKYLNKYNGSYIPPGWREW---GALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
G FGKYLN + + PP + EW ++ N YN + ++ Y +
Sbjct: 135 GLFGKYLNAWQPADNPPHFDEWLLSAPVVYNDGQYNDNGTVH---------TIPGYSTSV 185
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
I + ++ FL K + +P ++ A H P P+YA+ V + P+ D A
Sbjct: 186 IRDHALAFL--DKAATDPRPWFAFVAPKASHEPNTPEPKYADTVVPVWNGR-PSVDEADR 242
Query: 137 PDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
DK +WI T + + L ++L+TL SVDDAV+ + +L LG+L+NT +IY
Sbjct: 243 SDKPEWIQNSTGTLATARQ-----LRARQLRTLLSVDDAVQALHDKLAALGQLDNTLVIY 297
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
D+GY G K KS P+ + VPF + P
Sbjct: 298 LGDNGYTWADHGWSK-KSVPYLPSVEVPFYVSWPA 331
>gi|448726816|ref|ZP_21709206.1| N-acetylglucosamine-6-sulfatase [Halococcus morrhuae DSM 1307]
gi|445793377|gb|EMA43955.1| N-acetylglucosamine-6-sulfatase [Halococcus morrhuae DSM 1307]
Length = 516
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 65/227 (28%), Positives = 109/227 (48%), Gaps = 23/227 (10%)
Query: 21 YFGKYLNKYNGSYIP-PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
Y GK+ S P PG+ W + +Y++ +N++G ++K Y D++
Sbjct: 144 YIGKWHTYQLDSATPRPGFDRWVSFEGQGRYFDPILNVDGNRVKR-----TGYITDILTE 198
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV-----TSHHTPAYDYA 134
+ +L KP L +S A H AP++ + NV + YA
Sbjct: 199 YARDWLDSRD---GDKPFFLFLSHKAAHAWFRPAPRHRGRYSNVPIEYPQTMANTKQAYA 255
Query: 135 PNPDKQWILQVTRKMQPVHRQFT-----DLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
PD W+ + ++ V+ F D L + +TL ++DD++ ++ EL + G +
Sbjct: 256 GKPD--WVRRQRNGVRGVNYLFGGRFDFDELYRRYSETLLALDDSIGAVMDELDESGMAD 313
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGT 235
+T ++YTSD+GY LG+ GLV GK +E +RVP L PG++ PGT
Sbjct: 314 STLLLYTSDNGYTLGEHGLV-GKQTAYESSVRVPLLAYAPGMIDPGT 359
>gi|302851682|ref|XP_002957364.1| hypothetical protein VOLCADRAFT_119669 [Volvox carteri f.
nagariensis]
gi|300257323|gb|EFJ41573.1| hypothetical protein VOLCADRAFT_119669 [Volvox carteri f.
nagariensis]
Length = 1147
Score = 82.4 bits (202), Expect = 2e-13, Method: Composition-based stats.
Identities = 66/238 (27%), Positives = 111/238 (46%), Gaps = 31/238 (13%)
Query: 21 YFGKYLNKYNGSYI-----PPGWREWGALIMNSK------YYNYSINMNGRKIKHGEDYY 69
Y GK+LN + P GW A+++ YY + R+ + E +
Sbjct: 136 YVGKFLNGLTHDSLLELGCPRGWTAMDAMLLGDTGGAEELYYKDTPRFM-RECRTLETFP 194
Query: 70 NDYYPDLIANDSVRFLRQSKQSFS---KKPIMLVMSFPAPH---GPEDSAPQ----YANM 119
+ + ++ +++++ ++ + S KKP L++S APH G D+ P+ Y M
Sbjct: 195 DTFTEIVVRQKALQYIERAAATNSLRGKKPFFLMVSTFAPHDASGGTDAVPEVEVKYRGM 254
Query: 120 FFNVTSHHTPAYDYAPNPDK---QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVE 176
+ +V +P + P P+K W + + + RL L+SVDD V
Sbjct: 255 YGDVELPESPNFGVRP-PNKIGYYWPYDSKDAIPRLTEHYR-----ARLAALRSVDDTVA 308
Query: 177 KIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+V +L LNNT I+YTSD+G+ LG + + K +E D+R+P L+ GPGI G
Sbjct: 309 AVVSKLACRDLLNNTVIVYTSDNGFKLGSHNIAQEKFTQYEEDVRLPLLLAGPGIPRG 366
>gi|269124330|ref|YP_003297700.1| sulfatase [Thermomonospora curvata DSM 43183]
gi|268309288|gb|ACY95662.1| sulfatase [Thermomonospora curvata DSM 43183]
Length = 461
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 70/226 (30%), Positives = 109/226 (48%), Gaps = 16/226 (7%)
Query: 22 FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDS 81
FGKYLN ++ PP + E+ AL+ KY N+NG+ + Y LI + +
Sbjct: 128 FGKYLNGWDVHQAPPHFDEY-ALMHPPKYGETWWNVNGKVSRK-----RAYSTSLIRDHA 181
Query: 82 VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQW 141
VRFLR+ + S +P L ++ APH P +YAN+ + PA + DK +
Sbjct: 182 VRFLRRHRAS--GRPWFLYLTPYAPHAPFTPEARYANLSVP-SWRGNPAVAESDKRDKPF 238
Query: 142 ILQVTRKMQP-VHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
+ R+ P +HR + +L+TL+SVDD + + EL+ L++T II SD+G
Sbjct: 239 YI---RRSDPDLHRARR--IRAGQLRTLRSVDDLLGAVRDELRAQRRLDDTLIIVISDNG 293
Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
Y G G KS P+ +R+P + P G D L+ +
Sbjct: 294 YCWGDHGW-HAKSVPYSPAVRIPLYLSWPAGGLGRGATDDRLVANI 338
>gi|149195728|ref|ZP_01872785.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
gi|149141190|gb|EDM29586.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
Length = 474
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 96/182 (52%), Gaps = 15/182 (8%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
++ +++ N + F+ ++K KP + +++ APH P + ++ +M+ + S P
Sbjct: 169 HWTEIVTNHGIDFIDEAKND--DKPFFMYIAYNAPHDPRQAPKEFVDMY-PLESISVPEN 225
Query: 132 DYAPNPDKQWI----LQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
A PDK+ I LQ ++ P R + + ++ + +D + +++K L D
Sbjct: 226 FLAEYPDKELIGCGPLQRDARLAPFPRTEYAVKVNRQEYFACISYMDAEIGRLLKALADS 285
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYLLD 240
G+ +NT +I+T+DHG +G+ G + GK ++ +RVPFL+ GPGI + +YL D
Sbjct: 286 GQADNTIVIFTADHGLAVGRHGFI-GKQNMYDHSVRVPFLISGPGIKKAQKISSPIYLQD 344
Query: 241 VL 242
+
Sbjct: 345 AM 346
>gi|159479368|ref|XP_001697765.1| hypothetical protein CHLREDRAFT_150903 [Chlamydomonas reinhardtii]
gi|158274133|gb|EDO99917.1| predicted protein [Chlamydomonas reinhardtii]
Length = 483
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 68/241 (28%), Positives = 107/241 (44%), Gaps = 33/241 (13%)
Query: 23 GKYLNKYNGSY----IPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDY------ 72
GK+LN + P GW LI + + G + Y D
Sbjct: 10 GKFLNSFKEPLARKGCPQGWTGISPLIQDVDTED-----GGEEAPRAPTYMEDCNGQLTS 64
Query: 73 YPDLIANDSVRFLRQS---KQSFSKKPIMLVMSFPAPHGPEDSA-------PQYANMFFN 122
YP + +R S + S S P L++S APH + PQY +++
Sbjct: 65 YPGAFQEEVIRDKALSYIDEASGSPLPFFLLISTVAPHDAKGKGSTYPQVLPQYKDLYPG 124
Query: 123 VTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL--LMTKRLQTLQSVDDAVEKIVK 180
+ + T + P + ++ TD+ RLQ+L++VDD +E +V+
Sbjct: 125 LKAPRTANWGRPPPVTIGFAPSTPNRLN-----VTDINERYRARLQSLRAVDDTIEAVVR 179
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLD 240
L G+LNNT ++TSD+G+ LG + + K +E D+RVPFLM GPG VP +Y+ +
Sbjct: 180 RLACHGQLNNTVFMFTSDNGFKLGTHNIPREKFTWYEEDVRVPFLMAGPG-VPRGVYVPE 238
Query: 241 V 241
V
Sbjct: 239 V 239
>gi|345328217|ref|XP_003431251.1| PREDICTED: extracellular sulfatase Sulf-2 [Ornithorhynchus
anatinus]
Length = 828
Score = 80.9 bits (198), Expect = 5e-13, Method: Composition-based stats.
Identities = 35/71 (49%), Positives = 48/71 (67%), Gaps = 4/71 (5%)
Query: 2 AWPVELTSRNLLYFDISHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM 57
+W + R+ + + GY FGKYLN+YNGSY+PPGW+EW L+ NS++YNY++
Sbjct: 117 SWQAQHEVRSFAVYLNNTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCR 176
Query: 58 NGRKIKHGEDY 68
NG K KHG DY
Sbjct: 177 NGVKEKHGFDY 187
>gi|288917454|ref|ZP_06411820.1| sulfatase [Frankia sp. EUN1f]
gi|288351157|gb|EFC85368.1| sulfatase [Frankia sp. EUN1f]
Length = 475
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 103/218 (47%), Gaps = 29/218 (13%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
G FGKYLN ++ + PP + EW ++ Y + + N+NG N Y +++ N
Sbjct: 140 GLFGKYLNGWDINASPPHFEEWA--LLKPSYVDGTYNVNGTV-----QTINGYTTNVLRN 192
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
++ FL + + +P ++ APHGP A QYA+ P ++ P
Sbjct: 193 RTLSFL--DRAATDTRPWFAYVAPYAPHGPNTPAAQYADTVV-------PEWNGRPA--- 240
Query: 140 QWILQVTRKMQPVHRQFTDLLMT-------KRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
+ + + +P + Q + + ++L+TL SVDDAV+ +L LG+L NT
Sbjct: 241 --VHETDKSDKPPYVQNSTSTLADGQAVRERQLRTLLSVDDAVQAFRDKLVALGQLENTL 298
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
+ Y D+G+ G G V KS P+ +PF + PG
Sbjct: 299 VFYVPDNGFLWGDHGRV-AKSVPYTPAHELPFYVSWPG 335
>gi|429848344|gb|ELA23841.1| arylsulfatase [Colletotrichum gloeosporioides Nara gc5]
Length = 578
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 98/199 (49%), Gaps = 18/199 (9%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
+Y++ + NG Y Y D++A+ + L Q+ +P L ++ APH
Sbjct: 150 RYWDTKMTHNGEPPV---SYAGKYNTDVVADKAYALLEQALNR--DEPWFLTVAPIAPHS 204
Query: 109 ----PEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTR------KMQPVHRQ--- 155
E + Y + + H +DY K + Q++ +++ ++
Sbjct: 205 NWVFDERTNTTYLSTPQSAYRHEHLFHDYKIPRGKSFNRQISGAASWPGRLEALNESVLA 264
Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
+ D +RL+ LQ+VD+ V ++V+ L+ GE+ NTYI Y++D+GYH+ Q + GK
Sbjct: 265 YNDHYQRQRLRALQAVDEMVHELVERLEVAGEIENTYIFYSTDNGYHISQHRMHPGKECG 324
Query: 216 FEFDIRVPFLMRGPGIVPG 234
++ DI +P +RGPG+V G
Sbjct: 325 YDTDIHIPMFVRGPGLVAG 343
>gi|400592940|gb|EJP60966.1| arylsulfatase, putative [Beauveria bassiana ARSEF 2860]
Length = 343
Score = 80.1 bits (196), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 41/79 (51%), Positives = 53/79 (67%), Gaps = 1/79 (1%)
Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE-LNNTYIIYTSDHGYHLGQFGLVKGKS 213
Q+ D +RLQ+LQ VDD VE I + L D + L NTYIIYT+D+G+HLGQ L GK+
Sbjct: 26 QYLDNFQRRRLQSLQDVDDLVEGIFETLLDYPDVLANTYIIYTADNGFHLGQHRLPAGKT 85
Query: 214 FPFEFDIRVPFLMRGPGIV 232
E D+ VPF++RGP I
Sbjct: 86 CGIEEDVNVPFIIRGPEIA 104
>gi|319954020|ref|YP_004165287.1| sulfatase [Cellulophaga algicola DSM 14237]
gi|319422680|gb|ADV49789.1| sulfatase [Cellulophaga algicola DSM 14237]
Length = 505
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 110/226 (48%), Gaps = 17/226 (7%)
Query: 22 FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDS 81
K N + S +P G+ L + +N + G K G+ ++ +++ +D+
Sbjct: 169 LSKIPNATSESIMPNGYNR--PLSLTDTTWNPTDTTKGGYWKDGQ-----HWSEVLKDDA 221
Query: 82 VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPAYDYAPNP 137
+RF+ +K S +KP + ++F APH P + +Y +M+ +V P Y Y +
Sbjct: 222 IRFINTAKTS--EKPFFMYLAFNAPHDPRQAPQEYQDMYALKNLSVPKSFLPEYPYRHDI 279
Query: 138 DKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
L+ + P R + K+ ++ VD + I++ LK+ G+++NTYII+
Sbjct: 280 ANGDDLR-DEALAPFPRTELAIKTHKKEYYASITHVDAQIGNIIEALKNSGKMDNTYIIF 338
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
T+DHG +G+ GL+ GK F+ IR P ++ GP I +DV
Sbjct: 339 TADHGIAMGRHGLL-GKQSLFDHSIRPPIIILGPDIPKNKKTNVDV 383
>gi|429852351|gb|ELA27491.1| arylsulfatase precursor [Colletotrichum gloeosporioides Nara gc5]
Length = 474
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 58/200 (29%), Positives = 100/200 (50%), Gaps = 28/200 (14%)
Query: 50 YYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSF---SKKPIMLVMSFPAP 106
+ N ++ NG + +H YP D +R + + +KP L ++ P
Sbjct: 3 FNNIVMSENGGRPRH--------YPGFHQTDIIRAQSLDRLEYLTSQEKPFFLTIAPTVP 54
Query: 107 HGPED-SAP----QYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFT---- 157
H + AP ++ +F NVTS P + NP + +Q + M+ + RQ
Sbjct: 55 HENYNFYAPVPLLRHKGLFPNVTSPKPPNW----NPPDELQMQKSNWMKKL-RQLNETEI 109
Query: 158 ---DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
D L +R+++LQ VD+ +E +V+ L+D + NTYIIYT+D+G+ +G + GK+
Sbjct: 110 AQGDWLFRQRIESLQGVDEIIEDVVQLLEDKSIIENTYIIYTTDNGWTIGANRVAAGKAT 169
Query: 215 PFEFDIRVPFLMRGPGIVPG 234
P+ D +P ++RGPG+ G
Sbjct: 170 PYADDTNLPIVVRGPGVPSG 189
>gi|443897158|dbj|GAC74500.1| sulfatases [Pseudozyma antarctica T-34]
Length = 576
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/198 (26%), Positives = 99/198 (50%), Gaps = 18/198 (9%)
Query: 50 YYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
Y N S ++G+++ Y Y D++ + +++L ++ S +P + ++ PH
Sbjct: 169 YNNASFAIDGKQLVQ---YEGQYSTDVVRDFGLQYLEEAASS--GQPFFVGIAPIGPHAE 223
Query: 110 ED------------SAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFT 157
A ++ N+F + +++ A +P ++ ++ +
Sbjct: 224 TTFINGSAIFSNPVPAKRHKNLFATEVAPRKSSFN-ADDPSGAGFIRDLPRLNSSVVDYI 282
Query: 158 DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFE 217
D RL++LQ+VD+ V ++ + + LG L+NTY IY+SD+GY +G+ GK+ +E
Sbjct: 283 DEWYRDRLRSLQAVDELVGAVMDKAEQLGILDNTYFIYSSDNGYSVGEHRRQPGKTTGYE 342
Query: 218 FDIRVPFLMRGPGIVPGT 235
DIRVP +RGPGI G+
Sbjct: 343 EDIRVPLYIRGPGINSGS 360
>gi|159468309|ref|XP_001692325.1| hypothetical protein CHLREDRAFT_145830 [Chlamydomonas reinhardtii]
gi|158278511|gb|EDP04275.1| predicted protein [Chlamydomonas reinhardtii]
Length = 723
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 60/190 (31%), Positives = 97/190 (51%), Gaps = 14/190 (7%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH- 107
KYY + + RK+ +D + LI DSV S P L ++ APH
Sbjct: 129 KYYYTNCQYHERKVGLPDDNLFERAHSLI-EDSVD---------SGNPFFLKLAVSAPHD 178
Query: 108 GPEDSAPQYANMFFNV-TSHHTP-AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL 165
GP D P+ A+ + + P +Y D ++ + R + +D+ R
Sbjct: 179 GPVDGLPRIADRYLSAFPGLKVPRGPNYGKPIDDRFGMSKQRPLW-WDSADSDIRYRARA 237
Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
Q++ ++D+ ++K+V +L+ LG LNNTYIIYTSD+G+ LGQ + + K +E D+ +PF
Sbjct: 238 QSMLAIDEHLDKLVTKLECLGILNNTYIIYTSDNGFKLGQHNIPQEKWTYYEEDVALPFF 297
Query: 226 MRGPGIVPGT 235
+RGPG+ G
Sbjct: 298 IRGPGVPRGA 307
>gi|340619599|ref|YP_004738052.1| sulfatase [Zobellia galactanivorans]
gi|339734396|emb|CAZ97773.1| Sulfatase, family S1-28 [Zobellia galactanivorans]
Length = 505
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 107/214 (50%), Gaps = 21/214 (9%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ +++ +D++ F+ Q+K S +KP + ++F APH P + ++ +++ +V
Sbjct: 209 HWSEVLKDDALGFIEQTKTS--EKPFFMYLAFNAPHDPRQAPQEFVDLYPIENISVPESF 266
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIVKELK 183
P Y Y L+ + P R T+ + K Q + +D + +I+ L+
Sbjct: 267 LPLYPYKDGMGNGPSLR-DEALAPFPR--TEYAIKKHTQEYYALISHLDFQIGQILDGLE 323
Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYL 238
G+++NTYII+T+DHG +G+ GL+ GK F+ IR PF++ GP I + +YL
Sbjct: 324 KTGKMDNTYIIFTADHGLAMGKHGLI-GKQSQFDHSIRAPFMIVGPKIPENKKIDADIYL 382
Query: 239 LDVLIP--QVRKFSSGSLIFIMSFLINLNTMRAE 270
D ++ ++ +F S + +N R E
Sbjct: 383 QDAMVTSLELAGIEKPEYVFFNSVIDLVNGERKE 416
>gi|389625515|ref|XP_003710411.1| arylsulfatase [Magnaporthe oryzae 70-15]
gi|351649940|gb|EHA57799.1| arylsulfatase [Magnaporthe oryzae 70-15]
Length = 643
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/71 (47%), Positives = 50/71 (70%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M +R + LQ VD+ +E +V L+ G+L+NTY++YTSD+GYHLGQ L GK+ + D
Sbjct: 264 MARRAEALQGVDEIIEDVVAMLEANGQLDNTYVVYTSDNGYHLGQHRLPGGKALFYGEDT 323
Query: 221 RVPFLMRGPGI 231
+PF +RGPG+
Sbjct: 324 NLPFAVRGPGV 334
>gi|149195700|ref|ZP_01872757.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
gi|149141162|gb|EDM29558.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
Length = 482
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 92/181 (50%), Gaps = 13/181 (7%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ ++ A+D++ FL +K +P + + F APH P + +Y +M+ ++ +H
Sbjct: 179 HWTEVTADDAIEFLDDAKDI--DQPFFMYVGFNAPHDPRQAPKEYVDMYPIDKISLPKNH 236
Query: 128 TPAYDYAPNPDKQWILQVTRKMQ-PVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
Y Y L+ M P + T + + +D + +++K LKD G
Sbjct: 237 LGLYPYRDQIGCGKQLRDAAIMPFPRNEYATKINRQEYFAATTHMDHNIGRVLKALKDSG 296
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG-----TMYLLDV 241
+ +NTY+I+TSDHG +G+ GLV GK +E +R PF++ GP I G ++Y+ D
Sbjct: 297 KADNTYVIFTSDHGLAVGEHGLV-GKQNMYEHSMRAPFIIVGPSIPKGQRNATSIYIQDA 355
Query: 242 L 242
+
Sbjct: 356 M 356
>gi|302837023|ref|XP_002950071.1| hypothetical protein VOLCADRAFT_120839 [Volvox carteri f.
nagariensis]
gi|300264544|gb|EFJ48739.1| hypothetical protein VOLCADRAFT_120839 [Volvox carteri f.
nagariensis]
Length = 1230
Score = 78.2 bits (191), Expect = 3e-12, Method: Composition-based stats.
Identities = 34/71 (47%), Positives = 51/71 (71%)
Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
RL+TL++VD+ + ++ L +G+L+NTY+I+TSD+G+ LG GK F +E +RVP
Sbjct: 246 RLRTLRAVDEMIGRLATTLASVGQLSNTYVIFTSDNGFQLGNHAQSSGKQFHWEEVVRVP 305
Query: 224 FLMRGPGIVPG 234
F +RGPGI PG
Sbjct: 306 FYIRGPGIPPG 316
>gi|149197559|ref|ZP_01874610.1| probable sulfatase [Lentisphaera araneosa HTCC2155]
gi|149139577|gb|EDM27979.1| probable sulfatase [Lentisphaera araneosa HTCC2155]
Length = 505
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 37/79 (46%), Positives = 58/79 (73%), Gaps = 6/79 (7%)
Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
+++D +++++K+L+ +GEL+NTYIIYTSDHG +G+ GL +GK +E RVPF+++G
Sbjct: 281 ENIDIQIDRVLKKLESMGELDNTYIIYTSDHGIAIGRHGL-QGKQNLYEHTFRVPFIVKG 339
Query: 229 PGI-----VPGTMYLLDVL 242
PGI V G +YL D+L
Sbjct: 340 PGIPAGKRVKGNIYLTDIL 358
>gi|1703428|sp|P14217.2|ARS_CHLRE RecName: Full=Arylsulfatase; Short=AS; AltName: Full=Aryl-sulfate
sulphohydrolase; Flags: Precursor
Length = 647
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 60/204 (29%), Positives = 99/204 (48%), Gaps = 29/204 (14%)
Query: 53 YSINMNGRKIKHGED---YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG- 108
Y+ + N R ++G Y +Y D+I + V ++ + + KP +S APH
Sbjct: 157 YTFDYNTRLQRNGATPNIYPGEYSTDVIRDKGVAQIKSAVAA--GKPFYAQISPIAPHTS 214
Query: 109 -----------------PEDSAPQYANMFFNVT----SHHTPAYDYAPNPDKQWILQVTR 147
P AP + +F + S + Y+ + WI +
Sbjct: 215 TQISTNPATGVTRSYFFPPIPAPPHWQLFSDANLPGGSXNKNLYEVDVSDKPAWIRALPL 274
Query: 148 KMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG 207
Q +R + + + RL++L D+ +E++VK L + G L+NTYIIY++D+GYH+G
Sbjct: 275 AQQ-NNRTYQEEIYRLRLRSL-GPDELIEQVVKTLDEAGVLDNTYIIYSADNGYHVGAHR 332
Query: 208 LVKGKSFPFEFDIRVPFLMRGPGI 231
GK+ +E D+RVPFL+RGPGI
Sbjct: 333 FGAGKTTGYEEDLRVPFLIRGPGI 356
>gi|340619425|ref|YP_004737878.1| sulfatase [Zobellia galactanivorans]
gi|339734222|emb|CAZ97599.1| Sulfatase, family S1-28 [Zobellia galactanivorans]
Length = 493
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 92/187 (49%), Gaps = 29/187 (15%)
Query: 82 VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQW 141
+ +L +++ + P + F PH D P+ + V +HT P DKQ
Sbjct: 176 LEYLGDREKTKEEDPFFIYFGFSHPHDTRDGTPELLAKYGAV--NHTDKNSLPPANDKQP 233
Query: 142 ILQVT-RKMQPVHRQFTDLLMTKRL---------QTL-----------QSVDDAVEKIVK 180
LQ + P +L +R+ QT+ +++D + K++K
Sbjct: 234 PLQDNYLEAHPFFHGHPELRDEERVSGVWKNRDEQTVRNELGREYACSENIDIQLGKVLK 293
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGT 235
+L+ +GEL+NTYI+YTSDHG +G+ GL+ GK +E RVP +++GPG+ G
Sbjct: 294 KLEAMGELDNTYIVYTSDHGMSIGRHGLM-GKQNLYEHTWRVPMIIKGPGLPSGKRTKGN 352
Query: 236 MYLLDVL 242
+YLLDVL
Sbjct: 353 VYLLDVL 359
>gi|159487921|ref|XP_001701971.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281190|gb|EDP06946.1| predicted protein [Chlamydomonas reinhardtii]
Length = 567
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 67/249 (26%), Positives = 112/249 (44%), Gaps = 44/249 (17%)
Query: 21 YFGKYLNKY---NGSYIPPGWREWGALI--MNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
Y GK++ Y N P GW + AL+ +YY + ++G Y Y D
Sbjct: 125 YTGKFIVDYTIRNHRPPPAGWTAFDALVHPFTFEYYCPAFALDGGPPAM---YPAQYVTD 181
Query: 76 LIANDSVRFLRQS--KQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
++ + LR + + S +P L ++ PAPH + +N T + Y
Sbjct: 182 VVGAKARALLRSAAADAAVSGRPFYLQVAPPAPH-------HSMHYDYNATGGLRARHWY 234
Query: 134 APNPDKQ--------------------------WILQVTRKMQPVHRQFTDLLMTKRLQT 167
P P ++ W+ + R + P + F + + RL++
Sbjct: 235 PPVPAERHWEAFSGAALPRPASFNPRNLWGKPAWVRGLER-LTPGNVTFLEEVFRLRLRS 293
Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
L +VD+ V I+ +L+ LG +TY+++TSD+GYH G GK+ +E DIRVP L+
Sbjct: 294 LLAVDELVAGIMSDLEALGLTGSTYVLFTSDNGYHEGAHRQGSGKTTAYEEDIRVPLLVA 353
Query: 228 GPGIVPGTM 236
GPG+ PG++
Sbjct: 354 GPGLRPGSV 362
>gi|326334165|ref|ZP_08200389.1| arylsulfatase [Nocardioidaceae bacterium Broad-1]
gi|325948051|gb|EGD40167.1| arylsulfatase [Nocardioidaceae bacterium Broad-1]
Length = 509
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 67/245 (27%), Positives = 110/245 (44%), Gaps = 28/245 (11%)
Query: 21 YFGKYLNKYN----GSYIPPGWREWGALIMNSKYYNY---SINMNGRKIKHGEDYYNDYY 73
+ GKYLN Y +PPGW +W A + YN+ IN NG+ + Y Y
Sbjct: 156 FAGKYLNGYGLRGTAKDVPPGWTDWRA-TTDMSTYNFIRPRINHNGKFTTY--HRYTTYV 212
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--GPEDSAPQYANMFFNVTSHHTPAY 131
AND + S ++KP + +++ APH GP + + ++
Sbjct: 213 MRDQANDMI-----SAPERAEKPWYMWLNYVAPHHGGPGEPDDPEGTSTTRPAAEDKHSF 267
Query: 132 DYAPNPDKQWILQVTRKMQPVH--RQFTD--------LLMTKRLQTLQSVDDAVEKIVKE 181
D PD ++ + + R+F D + +R++ +Q++D AV V
Sbjct: 268 DGLELPDLPYMFRTPTDAPRISASRKFFDAEGRAELRIAHEQRVEAVQALDRAVASHVLR 327
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
L++ G+L+NT II++SD+GY G + GK + R+P ++RGPGI G V
Sbjct: 328 LQETGQLDNTIIIFSSDNGYTTGGHN-INGKLMHYRETQRIPLILRGPGIPVGRSLATGV 386
Query: 242 LIPQV 246
P +
Sbjct: 387 GNPDI 391
>gi|429847721|gb|ELA23288.1| arylsulfatase precursor [Colletotrichum gloeosporioides Nara gc5]
Length = 633
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 67/236 (28%), Positives = 102/236 (43%), Gaps = 27/236 (11%)
Query: 20 GYFGKYLNK-YNGSYI------PPGWREWGALIMNSKYYNYSINM--NGRKIKHGEDYYN 70
GY +Y+ K +NG I P GWR L+ + ++ M NG +
Sbjct: 117 GYNAEYVGKLFNGDSIISYNPAPKGWRHIDVLLDPYTNVHNTVVMSENGARPV------- 169
Query: 71 DYYPDLIANDSVRFL---RQSKQSFSKKPIMLVMSFPAPH-----GPEDSAPQYANMFFN 122
+Y D VR R + P L+++ APH P +Y F N
Sbjct: 170 -WYQGFQQTDVVRIKALDRLDNLLKEEDPFFLMIAPTAPHVENLRDPPTPPARYLGSFAN 228
Query: 123 VTSHHTPAYDYAPN--PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
T TP ++ A K L+ + D +RL++L+ VDD ++ +V+
Sbjct: 229 STVPRTPNFNPADEYQHGKPSWLRTLPSLNSTQIAEIDHFYQRRLESLKGVDDIIDDVVE 288
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
L +++TYIIYT+D GYHLG GKS P+ D P ++RGPG+ G +
Sbjct: 289 MLTVNDAIDDTYIIYTTDQGYHLGIHRTGVGKSLPYLEDTNTPLIVRGPGVPKGAV 344
>gi|18119|emb|CAA34302.1| unnamed protein product [Chlamydomonas reinhardtii]
gi|18174|emb|CAA36545.1| unnamed protein product [Chlamydomonas reinhardtii]
Length = 646
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 33/61 (54%), Positives = 47/61 (77%)
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
VD+ +E++VK L + G L+NTYIIY++D+GYH+G GK+ +E D+RVPFL+RGPG
Sbjct: 295 VDELIEQVVKTLDEAGVLDNTYIIYSADNGYHVGAHRFGAGKTTGYEEDLRVPFLIRGPG 354
Query: 231 I 231
I
Sbjct: 355 I 355
>gi|429849912|gb|ELA25239.1| arylsulfatase [Colletotrichum gloeosporioides Nara gc5]
Length = 502
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 84/166 (50%), Gaps = 14/166 (8%)
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
+Y D++A ++ FL + + KP L ++ APH S N+ N + H
Sbjct: 174 EYSTDVLACKALDFLDDAVAA--SKPFFLGIAPVAPHSNVQS-----NVISNGSVHDPHG 226
Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
+ P + P+ R+ T + +RL+ LQ+VD+ V+ +V L++ G L
Sbjct: 227 IVFPPRGTPIF------SPMPLSRERTTSI-PRRLRALQAVDELVDSVVSRLEEYGILEE 279
Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
TYI YT+D+GYH+G L GK +E DI +P ++RGP + G +
Sbjct: 280 TYIFYTTDNGYHIGHHRLQPGKECGYEEDINIPLIIRGPQVPEGKV 325
>gi|254444649|ref|ZP_05058125.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
gi|198258957|gb|EDY83265.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
Length = 499
Score = 76.6 bits (187), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 61/238 (25%), Positives = 109/238 (45%), Gaps = 33/238 (13%)
Query: 18 SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYS-INMNGRKIKHGEDYYNDYYPDL 76
+ GY + K++ P G+ W LI +YY+ + G+ G Y D+
Sbjct: 113 AAGYETAVIGKWHLGSDPTGFDFWKILIGQGQYYDAPFLTAEGQVETEG------YVTDV 166
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-----GPE-------DSAPQYANMFFNVT 124
I + ++ +L + +KP ML+ APH GP+ ++ P+ +F +
Sbjct: 167 ITDLAIDWLNTRE---DEKPFMLMYQHKAPHANFQPGPDYLNWREDETIPEPETLFDDYA 223
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKM----------QPVHRQFTDLLMTKRLQTLQSVDDA 174
+ A+D D LQ ++ R + L+ ++SVDD
Sbjct: 224 TRSPAAWDNEMRIDPTLELQYQGELNLKVPDGLRGHERSRWLYQFYIKNYLRCVKSVDDG 283
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
V ++ ++L+ +GEL+NT +IYTSD G+ LG+ G K F +E +++P L+R P ++
Sbjct: 284 VGRVFEQLEAMGELDNTIVIYTSDQGFFLGEHGYYD-KRFMYEESLQIPLLVRYPKMI 340
>gi|325109858|ref|YP_004270926.1| sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324970126|gb|ADY60904.1| sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 595
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 62/232 (26%), Positives = 111/232 (47%), Gaps = 28/232 (12%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
Y GK+ PG+ + YY+ + N+NG++ K + YY ++ +
Sbjct: 219 AYIGKWHMGEKNDEPRPGFDYFVTHTGQGLYYDTTFNINGKERK----TFPGYYTHVVTD 274
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHH-TPAYDYAPNPD 138
+ +L Q + +P ML++ APH P+YA+ + ++ ++ A+D NPD
Sbjct: 275 MAEEWLDQQQDD---EPFMLMVGQKAPHSFYLPEPKYAHRYDDIDVNYPQSAFDLDDNPD 331
Query: 139 KQWILQ-------------VTRKMQPVHRQ--FTDLLMTKRL--QTLQSVDDAVEKIVKE 181
W+ + RK P + TD R T+ SVDD+V ++ K+
Sbjct: 332 --WMTKRLYTWHGIYGPLFAWRKDHPNDKAEAVTDFANMIRAYWGTINSVDDSVGRLTKK 389
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
L+++GEL++T ++ D+G G+ G+V ++ E IRVP ++R PG+ P
Sbjct: 390 LREMGELDSTVFVFMGDNGLLEGEHGMVDKRTM-HEPSIRVPMIVRYPGLTP 440
>gi|374606778|ref|ZP_09679601.1| sulfatase [Paenibacillus dendritiformis C454]
gi|374387613|gb|EHQ59112.1| sulfatase [Paenibacillus dendritiformis C454]
Length = 472
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 110/243 (45%), Gaps = 35/243 (14%)
Query: 19 HGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYS-INMNGRKIKHGEDYYNDYY 73
HGY GK+ + G P G+ W L +Y+N + NG + G Y
Sbjct: 96 HGYQTAIVGKWHLGHGGHANPTGFDHWSVLDGQGEYFNSEFLEANGSRRIEG------YV 149
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD- 132
D+IA+ S+ +LR+ KP ML+ APH P +++ M+ +V + +D
Sbjct: 150 TDVIADLSLDWLRKRD---PDKPFMLMCHHKAPHRPWQPDERHSRMYEDVEIPYPETFDD 206
Query: 133 -YAPNPDK--QWILQVTRKM---------------QPVHRQFTDLLMTKRLQTLQSVDDA 174
YA + +++ R M + V R + L+ + S+DD
Sbjct: 207 DYANRSQAAVEAKMRIGRDMVTTDTKGDPPEGLTEEEVKRWKYQRYIKDYLRCVASIDDN 266
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP-GIVP 233
V +++ L + G +T +IYTSD G+ LG G K F +E +R+PF++R P I P
Sbjct: 267 VGRLLDYLDEEGIAEDTIVIYTSDQGFFLGDHGWFD-KRFMYEESLRMPFIVRYPRAIAP 325
Query: 234 GTM 236
G++
Sbjct: 326 GSV 328
>gi|325285339|ref|YP_004261129.1| sulfatase [Cellulophaga lytica DSM 7489]
gi|324320793|gb|ADY28258.1| sulfatase [Cellulophaga lytica DSM 7489]
Length = 513
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 90/178 (50%), Gaps = 14/178 (7%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ +++ +D++ F+ +KQ +KP + ++F APH P + Y M+ +
Sbjct: 218 HWSEVLKDDAITFINTAKQK--EKPFFMYLAFNAPHDPRQAPKAYQAMYNVNNITIPKSF 275
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELK 183
P Y Y N L+ + P R T+L + ++ ++ +D + I+ LK
Sbjct: 276 LPEYPYRHNIANGDDLR-DEALAPFPR--TELAIKTHIKEYYASITHLDAQIGDILTALK 332
Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
+ G++NNTYII+TSDHG +G+ GL+ GK F+ IR P ++ GP I DV
Sbjct: 333 NSGKINNTYIIFTSDHGLAMGKHGLL-GKQNLFDHSIRPPMIIVGPDIPKNKKNTADV 389
>gi|326333280|ref|ZP_08199527.1| hypothetical protein NBCG_04717 [Nocardioidaceae bacterium Broad-1]
gi|325948924|gb|EGD41017.1| hypothetical protein NBCG_04717 [Nocardioidaceae bacterium Broad-1]
Length = 499
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 104/243 (42%), Gaps = 44/243 (18%)
Query: 21 YFGKYLNKYNG-----SYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
+ GKYLN Y S PGW +W A + S Y S N Y+
Sbjct: 124 FLGKYLNNYGKGSERYSSGEPGWDDWQATVDPSTYNLLSPTFNSNGTLQSPAIYSSV--- 180
Query: 76 LIANDSVRFLRQSKQSFS-KKPIMLVMSFPAPH--GP------------------EDSAP 114
+ N S+R ++ + + +KP L +++ APH GP E++ P
Sbjct: 181 ALQNRSIRMIKHAGTNVGGRKPWFLWVNYVAPHAGGPVEASDPKKRYPDQPSLWLENTRP 240
Query: 115 QYAN--MFFNVTSHHTPAYDYAP----NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
A+ F + TP YA K + R M+ V++Q R++
Sbjct: 241 MGADKGRFSGLELPKTPDMFYAGPGQVTAGKPYTTAQKRVMRYVYQQ--------RIEAA 292
Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
QSVD AV + + LKD G+ T II++ D+GY G +GK + F +R+P +M G
Sbjct: 293 QSVDRAVARTIAHLKDSGQYRRTVIIFSGDNGYTTGHHNR-QGKLWEFSQSMRIPLIMVG 351
Query: 229 PGI 231
PGI
Sbjct: 352 PGI 354
>gi|223936143|ref|ZP_03628057.1| sulfatase [bacterium Ellin514]
gi|223895365|gb|EEF61812.1| sulfatase [bacterium Ellin514]
Length = 456
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/223 (29%), Positives = 108/223 (48%), Gaps = 38/223 (17%)
Query: 36 PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKK 95
PG+ W + Y N ++N++G K+ E Y D ++ + +V F++Q KK
Sbjct: 127 PGFDHWLSFKGQGVYENPNLNIDG-KVSRVEGYITD----ILNSRAVEFVKQE----HKK 177
Query: 96 PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD--YAPNPDKQWILQVTRKMQ--- 150
P L + A HGP A ++ ++ H P+ D A P +TRK Q
Sbjct: 178 PFCLYVGHKAVHGPFTPAERHKELYTKEQIPHPPSIDDDLAGKP------VLTRKEQQGP 231
Query: 151 -----PVHRQFTD-----------LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
P F D L+ ++L+TL ++D+ V ++++ L++ +L NT II
Sbjct: 232 KDGQKPQKVGFDDEAERPMGKVPERLVRQQLRTLMAIDEGVGQLLRALEESRQLENTVII 291
Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGTM 236
+TSD+GY G+ L K + +E IR P L+R P ++ PGT+
Sbjct: 292 FTSDNGYFWGEHHL-GDKRWAYEESIRDPLLIRYPKLIKPGTV 333
>gi|449132149|ref|ZP_21768311.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula europaea 6C]
gi|448888596|gb|EMB18910.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula europaea 6C]
Length = 516
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/235 (25%), Positives = 106/235 (45%), Gaps = 20/235 (8%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY----NYSINMNGRKIKHGEDYYNDYYPD 75
G+ GK+ + PG+ W + +YY Y+IN NG+++ + Y
Sbjct: 125 GFIGKWHMGHANDDPRPGFDFWFSFKGQGQYYPPGPQYTINDNGKRVPQ-----DGYITP 179
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
L+ ++ FL Q Q+ KP L +S A H P P+Y + + + +
Sbjct: 180 LLTRKAIEFLEQ--QTDRDKPFFLYLSHKAVHDPFTPEPKYKDSLKDQPYEFPASSELLK 237
Query: 136 NPDK---QWILQVTRKMQ----PVHR-QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
+ K +W+L +H Q + + + L+SVDD++ +V++LK +G
Sbjct: 238 DNLKNRPRWLLDQRNSWHGMDFALHTGQSVEGFYKRYCEALRSVDDSIGAVVQQLKMMGI 297
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
T +IY D+G+ G+ GL K +E RVP LM+ P I+ G + +V+
Sbjct: 298 YEETLVIYMGDNGFMFGEHGLFD-KRVAYETSSRVPMLMQCPAIIKGGTVVEEVV 351
>gi|261346415|ref|ZP_05974059.1| N-acetylglucosamine-6-sulfatase [Providencia rustigianii DSM 4541]
gi|282565398|gb|EFB70933.1| N-acetylglucosamine-6-sulfatase [Providencia rustigianii DSM 4541]
Length = 519
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/244 (28%), Positives = 102/244 (41%), Gaps = 36/244 (14%)
Query: 12 LLYFDISHGYFGK---------YLNKYNGSYIPPGWREWGALIMNS------KYYNYSIN 56
L D S GY GK Y+ YN W EW A Y Y ++
Sbjct: 136 LQSLDYSTGYIGKWHLDAPYAPYIESYNNPMEGRYWNEWAAPDRRHGFDFWYSYGTYDLH 195
Query: 57 MN----GRKIKHGEDYYNDYY-PDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGP 109
M G + Y D + P+ A+ +++FLR F KP LV+S PH P
Sbjct: 196 MKPMYWGNNTPREKPIYVDQWGPEHEADMAIKFLRNENGQFRDETKPFALVVSMNPPHSP 255
Query: 110 EDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
D PQ Y +++ TS + P+ QW + P + + L +
Sbjct: 256 YDQVPQKYLDVYKGKTSK-----ELNTRPNVQWDAEYQEGYGPQY-------FKEYLAMV 303
Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
VDD +I+ EL+ G +NT +++ SDHG LG G K+ P+E +RVP + R
Sbjct: 304 NGVDDQFGRIMNELEKQGLDDNTLVVFFSDHGSCLGANGQPT-KNNPYEESMRVPMMFRL 362
Query: 229 PGIV 232
PG +
Sbjct: 363 PGKI 366
>gi|407923746|gb|EKG16811.1| Sulfatase [Macrophomina phaseolina MS6]
Length = 298
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 93/190 (48%), Gaps = 26/190 (13%)
Query: 49 KYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG 108
KYY+ + NG + Y +Y D+I + + FL + ++P +V + APHG
Sbjct: 118 KYYDAVMTRNGEEPVS---YAGNYSTDVIKDKILGFLDEGLSH--ERPWFVVAAPIAPHG 172
Query: 109 PEDS--------------APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
+ AP++A++F + T +++ WI + ++P++
Sbjct: 173 TTTTDPATNGTYFDFAQYAPRHADLFKDYTIPRDSSFNAKVEGGVSWI----KNLEPLND 228
Query: 155 Q---FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
+ D RL++LQ+VD+ +E V++L+D G L+ TYIIY++D+GYH+ Q + G
Sbjct: 229 SVIAYNDEFQRSRLRSLQAVDEMIEAFVEKLEDAGVLDETYIIYSTDNGYHISQHRMTPG 288
Query: 212 KSFPFEFDIR 221
K + R
Sbjct: 289 KECGYGMSFR 298
>gi|410638706|ref|ZP_11349259.1| predicted sulfatase [Glaciecola lipolytica E3]
gi|410141234|dbj|GAC16464.1| predicted sulfatase [Glaciecola lipolytica E3]
Length = 522
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/182 (30%), Positives = 90/182 (49%), Gaps = 19/182 (10%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ-----YANMFFNVTSHHTP 129
++IA+ +V FL K KP M+ + F APH P APQ Y + P
Sbjct: 218 EVIADAAVDFLTSEKS----KPFMMYVGFLAPHDPR-QAPQAYMGKYPGDSIELPKSFAP 272
Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
++ + N +I + P + + ++ D + +I++ LK GEL+
Sbjct: 273 SHPF--NQGDYYIRDEILENNPRTPYVIQKAIGEYYAMIEHTDAQIGRIIEALKASGELD 330
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYLLDVLIP 244
NT I++TSDHG +G+ GL+ GK ++ +R PF+M+GP + VPG M+ L+ + P
Sbjct: 331 NTLIVFTSDHGLAVGRHGLL-GKQNQYDHSVRAPFIMQGPNVPKGKSVPG-MFYLNSVFP 388
Query: 245 QV 246
V
Sbjct: 389 TV 390
>gi|302538002|ref|ZP_07290344.1| predicted protein [Streptomyces sp. C]
gi|302446897|gb|EFL18713.1| predicted protein [Streptomyces sp. C]
Length = 488
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 105/234 (44%), Gaps = 29/234 (12%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
G FGKYLN + + PP + E+ ++ Y + N+NG N Y ++I N
Sbjct: 153 GLFGKYLNSWALADNPPHFEEFA--LLQPGYVDAQWNVNGTV-----QTINGYTTNIIKN 205
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
++ FL K + +P ++ A HGP +YA TP ++ P
Sbjct: 206 RTLSFL--DKAATDTRPWFAYVTPYASHGPRTPEAKYAG---------TPVPHWSGRPS- 253
Query: 140 QWILQVTRKMQPVH-RQFTDLLM------TKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
+ + R +P + + T L ++L+TL SVDDAV+ L+ LG+L+NT
Sbjct: 254 --VPENDRSDKPAYIKNATGTLADGQTIRAEQLRTLLSVDDAVQAFKDRLQALGQLDNTL 311
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
+IY D+G+ G G KS P+ VPF + P G + D ++ +
Sbjct: 312 VIYIGDNGFSWGDHGWT-AKSVPYRQAHEVPFYLSWPAGGRGAGAVDDRIVANI 364
>gi|255036000|ref|YP_003086621.1| sulfatase [Dyadobacter fermentans DSM 18053]
gi|254948756|gb|ACT93456.1| sulfatase [Dyadobacter fermentans DSM 18053]
Length = 506
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/256 (28%), Positives = 115/256 (44%), Gaps = 36/256 (14%)
Query: 18 SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYN--YSINMNGRKIKHGEDYYNDYYPD 75
+ GY + K++ P G+ + L +Y N + R+ + G Y Y D
Sbjct: 113 TAGYQTALIGKWHLVKEPAGFDYYCVLPGQGRYRNPIMMTKEDFREDQKGGKVYEGYSTD 172
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
+I + S+ +L + +S KP L F A H P D +Y N +V + PA D+A
Sbjct: 173 VITDQSIAWLEKRDKS---KPFYLSTHFKATHEPFDYPKRYENYLEDVEIPY-PA-DFAD 227
Query: 136 N-------PDKQWILQV--------TRKMQPVHR---QFTDLLMTKR----------LQT 167
W L + T K P H Q D + ++ ++
Sbjct: 228 RGATGSGRTHDGWPLDLLGTRYEKGTGKEYPGHSFSLQGLDSVAARKKIYQKFVKDYIRC 287
Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
+VDD + ++++ LKD GEL+NT IIYTSD GY LG+ G K F +E IR+PF++
Sbjct: 288 GAAVDDNIGRLIQYLKDAGELDNTIIIYTSDQGYFLGEHGFFD-KRFIYEPSIRMPFVIS 346
Query: 228 GPGIVPGTMYLLDVLI 243
P +P + D+++
Sbjct: 347 YPKEIPKGKRVNDLIL 362
>gi|449134522|ref|ZP_21770018.1| choline sulfatase [Rhodopirellula europaea 6C]
gi|448886797|gb|EMB17190.1| choline sulfatase [Rhodopirellula europaea 6C]
Length = 473
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 95/182 (52%), Gaps = 15/182 (8%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ ++IAN S F + + +P + ++F A H P + +Y + + V +++
Sbjct: 183 HWSEVIANHSDDFFADAAKH--DQPAFMYLAFNATHDPRQAPQEYLDKYPAENILVPANY 240
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
P + YA L+ R + P R + + +R L +D + +I+ ++
Sbjct: 241 QPLHPYAEEIGCGRKLRDER-LAPFPRTEYAVRVHRREYYALLTHMDAMIGRILDSVEAS 299
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLD 240
GE +NT+I +T+DHG +GQ GL+ GK P++ +RVPFL++GPG+ G +YL D
Sbjct: 300 GEADNTWIFFTADHGLAVGQHGLL-GKQNPYDHSVRVPFLVKGPGVKAGVRIEEPIYLQD 358
Query: 241 VL 242
V+
Sbjct: 359 VM 360
>gi|223936731|ref|ZP_03628641.1| sulfatase [bacterium Ellin514]
gi|223894582|gb|EEF61033.1| sulfatase [bacterium Ellin514]
Length = 515
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 66/238 (27%), Positives = 107/238 (44%), Gaps = 31/238 (13%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYS-INMNGRKIKHGEDYYNDYYPDLIA 78
GY+ + K++ P G+ W L KY++ I M RK G Y ++I
Sbjct: 149 GYYTGMVGKWHLESDPTGFDYWNVLPGQGKYHDPDFIEMGNRKKIEG------YATEIIT 202
Query: 79 NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY--DYA-- 134
+ S+ FL+ Q KP L+ APH P + ++A M+ +VT + DY
Sbjct: 203 DLSINFLKNRPQD---KPFFLMCHHKAPHRPWEPDEKHAKMYEDVTIPEPETFNDDYKTR 259
Query: 135 PNPDKQWILQVTRKMQP---------------VHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
+ + +++ R + P + + + L+ + SVDD V +++
Sbjct: 260 SSAATEATMRIDRDLTPKDLKQAPPPGLAGEALKKWKYQRYIKDYLRCIASVDDNVGRLL 319
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVPGTM 236
K L D G NT +IYTSD G+ LG K F +E +R+PF++R P I P T+
Sbjct: 320 KFLDDSGLAENTIVIYTSDQGFFLGDHNWFD-KRFMYEESLRMPFIVRYPNHIKPATV 376
>gi|149196998|ref|ZP_01874051.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Lentisphaera araneosa HTCC2155]
gi|149140108|gb|EDM28508.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Lentisphaera araneosa HTCC2155]
Length = 495
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 106/240 (44%), Gaps = 43/240 (17%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
G+ GK+ PG+ W KY++ + N+NG + K YY + +
Sbjct: 121 GWIGKWHMGEEDDSKRPGFDYWVTHKGQGKYWDTTFNVNGERKK-----VPGYYAHKVTD 175
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
++ FL + +S KP L + APHGP +Y +++ + TP P PD
Sbjct: 176 MAIDFLNKVDKS---KPFALCLGHKAPHGPFIPEAKYDSIY-----NDTPV----PYPDS 223
Query: 140 QWILQ-----VTRKMQPVHRQFTDLLMTKR--------------------LQTLQSVDDA 174
W L + ++ H + L ++ T+ SVDD+
Sbjct: 224 SWKLGDKPKWIVDRLPTWHGIYGPLYGFRKDFPNDKASAIVDFEHFVRSYTATINSVDDS 283
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
V +I L+++G L+NT +I+TSD+G+ LG+ G++ ++ E + +P +R P + G
Sbjct: 284 VGRIYDHLEEMGILDNTILIFTSDNGFLLGEHGMIDKRTM-HEASVSIPLTVRFPKKIKG 342
>gi|126728061|ref|ZP_01743877.1| choline sulfatase [Sagittula stellata E-37]
gi|126711026|gb|EBA10076.1| choline sulfatase [Sagittula stellata E-37]
Length = 506
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 89/185 (48%), Gaps = 13/185 (7%)
Query: 69 YNDYYPDLIANDSVRFL---RQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS 125
Y Y D + + R+L + + + + P +V SF PH P + P++ +++ +
Sbjct: 158 YQTDYDDEVEFAARRWLIDRGRDRAAGDQTPFCMVASFIHPHDPYVAKPEWWDLYSD-DD 216
Query: 126 HHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR----LQTLQSVDDAVEKIVKE 181
P Y AP +V ++ + TD + + L + D V +VK
Sbjct: 217 VDMPEYSLAPEEQDPGSRRVMDGIEASYIPLTDEEVHRARRAYLANVSYFDSKVGALVKT 276
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI----VPGTMY 237
L+D+GEL+NT ++ T+DHG LG+ GL +F FE R+P +M GPG+ P
Sbjct: 277 LEDIGELDNTLLVVTADHGDMLGERGLWFKMNF-FEHSARIPMIMTGPGVAKGAAPNACS 335
Query: 238 LLDVL 242
L+D+L
Sbjct: 336 LIDLL 340
>gi|305665307|ref|YP_003861594.1| choline sulfatase [Maribacter sp. HTCC2170]
gi|88710062|gb|EAR02294.1| choline sulfatase [Maribacter sp. HTCC2170]
Length = 503
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 50/184 (27%), Positives = 95/184 (51%), Gaps = 19/184 (10%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ +++ +D+V F+ Q+K P + ++F APH P + +Y +M+ ++
Sbjct: 209 HWSEVLKDDAVGFIDQAK--VKDNPFFMYLAFNAPHDPRQAPQEYVDMYSLDKISLPKSW 266
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIVKELK 183
P Y Y + L+ + P R T+ K +Q + +D+ + +I+ L+
Sbjct: 267 MPMYPYKDSIGNGPGLR-DEALAPFPR--TEYATKKHIQEYYALISHMDNQIGEILDALE 323
Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYL 238
+ G++ NTY+I+T+DHG +G+ GL+ GK F+ IR PF++ GP I + +YL
Sbjct: 324 NSGKMENTYVIFTADHGLAIGKHGLL-GKQSQFDHSIRPPFMIVGPDIPKDASIDKDIYL 382
Query: 239 LDVL 242
D +
Sbjct: 383 QDAM 386
>gi|440479319|gb|ELQ60092.1| arylsulfatase precursor [Magnaporthe oryzae P131]
Length = 668
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 51/78 (65%), Gaps = 4/78 (5%)
Query: 154 RQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
+ FTD + LQ VD+ +E +V L+ G+L+NTY++YTSD+GYHLGQ L GK+
Sbjct: 286 QNFTDYAA----EALQGVDEIIEDVVAMLEANGQLDNTYVVYTSDNGYHLGQHRLPGGKA 341
Query: 214 FPFEFDIRVPFLMRGPGI 231
+ D +PF +RGPG+
Sbjct: 342 LFYGEDTNLPFAVRGPGV 359
>gi|436835021|ref|YP_007320237.1| sulfatase [Fibrella aestuarina BUZ 2]
gi|384066434|emb|CCG99644.1| sulfatase [Fibrella aestuarina BUZ 2]
Length = 522
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 108/244 (44%), Gaps = 42/244 (17%)
Query: 36 PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQ--SKQSFS 93
PG+ W + YYN ++N+NG+++ + + + Y DL+ + ++ +L Q S
Sbjct: 143 PGFNHWVSFRGQGVYYNPTLNVNGKQVPYTD---SSYTTDLLTDYALNWLNSLNKSQGQS 199
Query: 94 KKPIMLVMSFPAPHGPEDSAPQYANMFFNV--------------TSHHTPAY----DYAP 135
+P L +S A H A ++ M+ N+ +S + PA
Sbjct: 200 SQPFCLYLSHKAVHAEFQPAKRHRGMYRNMPINYPQSMFLTATDSSKNWPALAGNGQTRR 259
Query: 136 NPDK-----------QWILQVTRKMQPV----HRQ--FTDLLMTKRLQTLQSVDDAVEKI 178
NP+ W+ Q V H Q F D + +TL VDD+V ++
Sbjct: 260 NPETGAMKANLADMPNWVKQQRYSWHGVDYMYHGQIGFNDFY-RQYCETLMGVDDSVGRV 318
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
+K L D G+L NT ++Y D+G+ G+ GL+ K +E +RVP L+ P + G L
Sbjct: 319 LKWLADNGQLENTLVVYMGDNGFSFGERGLID-KRHMYEESMRVPLLVHCPALTKGGTKL 377
Query: 239 LDVL 242
V+
Sbjct: 378 EQVV 381
>gi|390958071|ref|YP_006421828.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
gi|390958412|ref|YP_006422169.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
gi|390412989|gb|AFL88493.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
gi|390413330|gb|AFL88834.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
Length = 450
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 85/187 (45%), Gaps = 32/187 (17%)
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF---- 120
G + + ++ A+ ++ FL + + K P + + F APH P + +Y M+
Sbjct: 148 GPEGSTKHSSEVYADSAISFLDRQRAEHDK-PFFMYVGFNAPHDPRQAPEEYQAMYPVDK 206
Query: 121 ------------FNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
FN HT AP P + ++ RK + +
Sbjct: 207 IALPPNYLPQHPFNQGDFHTRDEQLAPFPRTELDVKTHRK--------------EYYAII 252
Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
+D + +++ L G+ +NTY+I T+DHG +G+ GL+ GK +E +R+P ++RG
Sbjct: 253 THMDAQIGRVLDALDRSGQADNTYVIMTADHGLAVGEHGLM-GKQNQYECSMRMPLIVRG 311
Query: 229 PGIVPGT 235
PGI PG+
Sbjct: 312 PGIKPGS 318
>gi|305666556|ref|YP_003862843.1| mucin-desulfating sulfatase [Maribacter sp. HTCC2170]
gi|88708825|gb|EAR01060.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Maribacter sp. HTCC2170]
Length = 502
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 56/226 (24%), Positives = 100/226 (44%), Gaps = 18/226 (7%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
G+FGK+ +G PG+ W + YYN ++N+NG ++ + + + Y DL+
Sbjct: 135 GFFGKWHMGSHGDEPQPGFTHWESFPGQGVYYNPTLNINGERVSYKD---STYITDLLTE 191
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF------FNVTSHHTPAYDY 133
++ +L++ + KP +S A H A ++ + T T Y
Sbjct: 192 HTIDWLKKRDKD---KPFFAYLSHKAVHAGFKPARRHKGKYKGKRIALPATYDQTKTGAY 248
Query: 134 APNPDKQWILQVTRKMQPVHRQFTD-----LLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
QW+ V + D ++ +TL VD++V I+ LK G
Sbjct: 249 RDLKWPQWVADQRISWHGVDYMYHDNRDIHEMVVDYCETLLGVDESVGAIMDYLKKEGLD 308
Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+T +IY D+G+ G+ GL+ + F +E ++VP L+R P + G
Sbjct: 309 ESTMVIYMGDNGFSWGEHGLIDKRHF-YEESVKVPLLVRCPELFDG 353
>gi|329928120|ref|ZP_08282066.1| arylsulfatase [Paenibacillus sp. HGF5]
gi|328937997|gb|EGG34396.1| arylsulfatase [Paenibacillus sp. HGF5]
Length = 497
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 75/157 (47%), Gaps = 9/157 (5%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP- 135
+ S+ FLR+ KP L MSF PH P D Y N++ + P D+A
Sbjct: 192 VVTQSIDFLRRRDPG---KPFFLWMSFVRPHPPFDPPQAYLNLYEDADIPEPPVGDWAET 248
Query: 136 -NPDKQWILQVT-RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+PD +T R + P R M + +D + + ++ L + GEL NT I
Sbjct: 249 EDPDLAGYSPITGRGLVPKRRLRK--AMAAYYALITHIDHQIGRFLQSLNEYGELGNTVI 306
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
++TSDHG LG L + KS P+E RVPF++ PG
Sbjct: 307 LFTSDHGELLGDHHLFR-KSLPYEGSTRVPFIVNDPG 342
>gi|417304251|ref|ZP_12091280.1| choline sulfatase [Rhodopirellula baltica WH47]
gi|327539463|gb|EGF26078.1| choline sulfatase [Rhodopirellula baltica WH47]
Length = 477
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 95/182 (52%), Gaps = 15/182 (8%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ ++IAN S F + + +P + ++F A H P + +Y + + V +++
Sbjct: 187 HWSEVIANHSDDFFADAAKH--DQPTFMYLAFNATHDPRQAPQEYLDKYPAENILVPANY 244
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
P + YA L+ R + P R + + +R L +D + +I+ ++
Sbjct: 245 QPLHPYAEEIGCGRNLRDER-LAPFPRTEYAVRVHRREYYALLTHMDAMIGRILDSVEAS 303
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLD 240
G+ +NT+I +T+DHG +GQ GL+ GK P++ +RVPFL++GPG+ G +YL D
Sbjct: 304 GKADNTWIFFTADHGLAVGQHGLL-GKQNPYDHSVRVPFLVKGPGVKAGGRVEEPIYLQD 362
Query: 241 VL 242
V+
Sbjct: 363 VM 364
>gi|149177416|ref|ZP_01856020.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Planctomyces maris DSM 8797]
gi|148843749|gb|EDL58108.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Planctomyces maris DSM 8797]
Length = 510
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 92/192 (47%), Gaps = 42/192 (21%)
Query: 68 YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHH 127
Y + + D+I + S+++L+ KQ +KP L+ F APH ++A +Y ++ +V
Sbjct: 171 YDSKHSSDVITDLSLKWLKNRKQK--EKPFFLMHHFKAPHDNFENAERYDWLYQDVV--- 225
Query: 128 TPAYDYAPNPDKQW----------------ILQVTRKMQPVHRQFTDLLMT--------- 162
P PD W I + H F D L++
Sbjct: 226 ------IPEPDSLWHRGQHGPLHHALYGTSIGLRNERRNMGHHMFVDPLLSSEQYTREAY 279
Query: 163 -----KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFE 217
K L+ ++ VDD ++++V+ LK GEL+NT IIYT+D G+ LG+ + K + +E
Sbjct: 280 QRYLKKYLRCVRGVDDNIQRLVEYLKQTGELDNTIIIYTADQGFMLGEHDYID-KRWMYE 338
Query: 218 FDIRVPFLMRGP 229
+R+PF++R P
Sbjct: 339 ESLRMPFIVRYP 350
>gi|440716907|ref|ZP_20897409.1| choline sulfatase [Rhodopirellula baltica SWK14]
gi|436437975|gb|ELP31557.1| choline sulfatase [Rhodopirellula baltica SWK14]
Length = 477
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 95/182 (52%), Gaps = 15/182 (8%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ ++IAN S F + + +P + ++F A H P + +Y + + V +++
Sbjct: 187 HWSEVIANHSDDFFADAAKH--DQPTFMYLAFNATHDPRQAPQEYLDKYPAENILVPANY 244
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
P + YA L+ R + P R + + +R L +D + +I+ ++
Sbjct: 245 QPLHPYAEEIGCGRNLRDER-LAPFPRTEYAVRVHRREYYALLTHMDAMIGRILDSVEAS 303
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLD 240
G+ +NT+I +T+DHG +GQ GL+ GK P++ +RVPFL++GPG+ G +YL D
Sbjct: 304 GKADNTWIFFTADHGLAVGQHGLL-GKQNPYDHSVRVPFLVKGPGVEAGGRVEEPVYLQD 362
Query: 241 VL 242
V+
Sbjct: 363 VM 364
>gi|294053768|ref|YP_003547426.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613101|gb|ADE53256.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 444
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 80/177 (45%), Gaps = 20/177 (11%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
+ Y DL N +V F+ +Q+ +P L +F APH P ++ + H+ P
Sbjct: 154 DTYSVDLFCNTAVDFIENYQQN---QPFFLYTAFTAPHDPRTPPKEFLD-------HYPP 203
Query: 130 AYDYAPNP---------DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
PN D + I P + + ++S+D + +++
Sbjct: 204 QTVRVPNNYADEHPFDLDVRDIRDECLAPYPRTPERIQEELAAYYGMIESMDSGIARVLA 263
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY 237
L+ GEL+NT I+YT DHG +GQ GL+ GK +E RVP L+ GPG+ G Y
Sbjct: 264 ALEARGELDNTIIVYTGDHGLAVGQHGLL-GKQNVYEHSARVPLLLSGPGVPKGEAY 319
>gi|90416259|ref|ZP_01224191.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[gamma proteobacterium HTCC2207]
gi|90331984|gb|EAS47198.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[marine gamma proteobacterium HTCC2207]
Length = 360
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 87/186 (46%), Gaps = 31/186 (16%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP--------------- 114
N Y D++ +VRF+ S + +P L + A H PE S
Sbjct: 38 NGYMTDILTQRAVRFIHDS----ASQPFFLYIGHKAVH-PEASQRNDGSADLSVKREFIP 92
Query: 115 ------QYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP-----VHRQFTDLLMTK 163
+Y N N + D++P + I + K ++ F++ +
Sbjct: 93 AKRHQGKYKNAEVNTVERYIATQDFSPLNESLAIRHMLEKSDERWYRWINSGFSEDTIRA 152
Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
R + + SVD+++ +IVK L+++ L+NT II+TSD+GY G+ GL + +P+E I+VP
Sbjct: 153 RAEMMLSVDESLGEIVKTLQNMDLLDNTLIIFTSDNGYFYGEHGLTIERRYPYEEAIKVP 212
Query: 224 FLMRGP 229
++R P
Sbjct: 213 LIIRKP 218
>gi|149199161|ref|ZP_01876200.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
gi|149137758|gb|EDM26172.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
Length = 451
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/185 (27%), Positives = 96/185 (51%), Gaps = 20/185 (10%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ +++AN+S+ F+ ++K KP + ++F A H P + ++ +M+ +V +
Sbjct: 168 HWSEVVANNSIDFIERAKNV--NKPFFMYIAFNATHDPRQAPKEFIDMYPLEDMDVPKNF 225
Query: 128 TPAYDY----APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQS-VDDAVEKIVKEL 182
Y Y +P + L+ R M F + + L + +D + +I+K +
Sbjct: 226 LANYPYDTEIGCDPKR---LRDERLMPTPRTDFAVKVHRQEYFALATHMDQQIGRILKAI 282
Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MY 237
+ G+ +NTYII+T+DHG +G GLV GK +E +RVPF + GP + G+ +Y
Sbjct: 283 ELSGKADNTYIIFTADHGLAVGNHGLV-GKQSMYEHSMRVPFFIAGPKMKKGSVFETPIY 341
Query: 238 LLDVL 242
L D++
Sbjct: 342 LQDIM 346
>gi|406834550|ref|ZP_11094144.1| choline sulfatase [Schlesneria paludicola DSM 18645]
Length = 486
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 91/182 (50%), Gaps = 15/182 (8%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ +++A+D+V FLRQ+ P + ++F APH P S ++ M+ ++ +
Sbjct: 194 HWSEIVADDAVDFLRQAADE--PAPFFMYVAFNAPHDPRQSPREFVEMYPPEKISLPVNF 251
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
P Y + + L+ K+ P R + + + + +D + +I+ LK
Sbjct: 252 LPEYPFKDLIGCEPTLR-DEKLAPFPRTERAIRVHRAEYYAIVSHLDRQIGRILDALKAS 310
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLD 240
G+ NT+I +T+DHG +G+ GLV GK +E +RVPF+ GP G +YL D
Sbjct: 311 GKEQNTWIFFTADHGLAVGRHGLV-GKQNLYEHSVRVPFVAVGPKAAAGERIDAPIYLQD 369
Query: 241 VL 242
V+
Sbjct: 370 VM 371
>gi|260062063|ref|YP_003195143.1| N-acetylglucosamine-6-sulfatase [Robiginitalea biformata HTCC2501]
gi|88783625|gb|EAR14796.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Robiginitalea biformata HTCC2501]
Length = 490
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/242 (26%), Positives = 107/242 (44%), Gaps = 25/242 (10%)
Query: 11 NLLYFDI---SHGYFGKYLNKYNGSYIP----PGWREWGALIMNSKYYNYSINMNGRKIK 63
NL YF GY +L K++ S PG+ W + YYN ++N+NG +I+
Sbjct: 107 NLTYFPQYLQEAGYQTAFLGKWHMSSHTDEPRPGFDHWESFFGQGVYYNPTLNINGERIE 166
Query: 64 HGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV 123
+ + + Y DL+ +V +L + KP L +S A H A ++ +
Sbjct: 167 YKD---STYITDLLTEHAVDWLESRDKD---KPFFLYLSHKAVHAEFQPARRHKGRYAGK 220
Query: 124 TSHHTPAYDYAPNP---DKQWILQVTRKMQPVH------RQFTDL--LMTKRLQTLQSVD 172
P Y+ D +W V + H D+ L+ +TL VD
Sbjct: 221 KIELPPTYEQTKTGAWRDLKWPEWVADQRVSWHGVDYMYHSNIDMQELVQAYCETLLGVD 280
Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
D+V +++ L++ G T +IY D+G+ G+ GL+ + F +E ++VP L+R P +
Sbjct: 281 DSVGAVLEYLEEEGLDEETLVIYMGDNGFSWGEHGLIDKRHF-YEESVKVPLLVRCPELF 339
Query: 233 PG 234
G
Sbjct: 340 EG 341
>gi|440717674|ref|ZP_20898156.1| choline sulfatase [Rhodopirellula baltica SWK14]
gi|436437294|gb|ELP30950.1| choline sulfatase [Rhodopirellula baltica SWK14]
Length = 507
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/174 (27%), Positives = 82/174 (47%), Gaps = 24/174 (13%)
Query: 82 VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP-NPDKQ 140
+ +L Q + + P ++ F PH D P+ + ++HT P NP +
Sbjct: 187 LEYLDQRDAAADEDPFLIYFGFSHPHDTRDGTPELLAKYG--ATNHTDRDTLPPANPKQP 244
Query: 141 WILQVTRKMQPVHRQFTDLL--------------------MTKRLQTLQSVDDAVEKIVK 180
+ + P H +L + + +++D + K+++
Sbjct: 245 PLPPNHLEAHPFHHGHPNLRDEVSVKGVWDRRDERTIRNELGREFACSENIDIQIGKVLR 304
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L+++GEL+NTYI+YTSDHG +G+ GL +GK +E RVPF+++GPGI G
Sbjct: 305 RLEEMGELDNTYIVYTSDHGMAIGRHGL-QGKQNLYEHTWRVPFIIQGPGIPKG 357
>gi|238921150|ref|YP_002934665.1| sulfatase [Edwardsiella ictaluri 93-146]
gi|238870719|gb|ACR70430.1| sulfatase [Edwardsiella ictaluri 93-146]
Length = 510
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/254 (27%), Positives = 107/254 (42%), Gaps = 37/254 (14%)
Query: 5 VELTSRNLLYFDISHGYFGK---------YLNKYNGSYIPPGWREWGALIMNSKY---YN 52
+ S L D S GY GK ++ YN W EW A + Y+
Sbjct: 119 AQCWSDVLKKLDYSTGYIGKWHLDAPHAPFIESYNNPMEGRYWNEWTAPDRRHGFDFWYS 178
Query: 53 YSI-NMNGRKIKHGED-------YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMS 102
Y +++ R + G + Y + + + A+ +++FLR K SF + KP LV+S
Sbjct: 179 YGTYDLHMRPMYWGNNTPRESPVYVDQWSAEHEADMAIKFLRNEKGSFRAADKPFALVVS 238
Query: 103 FPAPHGPEDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLM 161
PH P D PQ Y + F +S D P+ W P +
Sbjct: 239 MNPPHSPYDQVPQKYLDRFPGKSSR-----DLNTRPNVDWQANYQEGYGPQY-------F 286
Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
+ + + VD+ +I+ EL G +NT +++ SDHG LG G K+ +E IR
Sbjct: 287 KEYMAMVNGVDEQFGRILAELDKQGLADNTLVVFFSDHGCCLGANGQPT-KNVAYEEAIR 345
Query: 222 VPFLMRGPG-IVPG 234
+P + R PG I PG
Sbjct: 346 IPMMFRLPGKIAPG 359
>gi|296121821|ref|YP_003629599.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296014161|gb|ADG67400.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 523
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 80/167 (47%), Gaps = 25/167 (14%)
Query: 83 RFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS----------HHTPAYD 132
R ++QS + KP ++ + + PH P P++ M+ + S TP +
Sbjct: 221 RQVQQSSSTGESKPFLMCVGYFLPHVPCYVTPEWLAMYPDDDSILPFIEKDDRKDTPRFS 280
Query: 133 Y-----APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
+ P P +W+ Q H + L+ + L + VD + +++ L+ GE
Sbjct: 281 WYLHWRLPEPRLKWLQQ--------HEHWRSLVRSY-LASTSYVDAQIGRLLAALEATGE 331
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
NNT I+ SDHG+HLG+ G + GK+ +E RVP L GPG++ G
Sbjct: 332 ANNTLIVLWSDHGWHLGEKG-ITGKNTLWERSTRVPLLFAGPGVLAG 377
>gi|340616913|ref|YP_004735366.1| sulfatase [Zobellia galactanivorans]
gi|339731710|emb|CAZ94975.1| Sulfatase, family S1-28 [Zobellia galactanivorans]
Length = 515
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 48/194 (24%), Positives = 96/194 (49%), Gaps = 36/194 (18%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ +++ +D++ F+ ++K+ + P + ++F A H P + ++ +M+ V +
Sbjct: 219 HWSEVVRDDALSFIEKAKKK--EDPFFMYLAFNATHDPRQAPQRFLDMYPVKDIKVPDNF 276
Query: 128 TPAYDY---------------APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVD 172
P Y + AP P ++ ++V HRQ L+T +D
Sbjct: 277 MPEYPWKDDMGNPPSLRDEALAPYPRTEYAVKV-------HRQEYYALLT-------HMD 322
Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
+ + KI+ L+ G+++NTYI + +DHG +G GL+ GK F+ +R+P ++ GPGI
Sbjct: 323 EQIGKILDALEASGKMDNTYIFFGADHGLSVGHHGLI-GKQSMFDHSVRIPMMVVGPGIP 381
Query: 233 PGTMYLLDVLIPQV 246
G + DV + +
Sbjct: 382 KGKLLDQDVYLQDI 395
>gi|171915410|ref|ZP_02930880.1| iduronate-2-sulfatase [Verrucomicrobium spinosum DSM 4136]
Length = 514
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 77/162 (47%), Gaps = 8/162 (4%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D + D R+L + +P + M F PH P + Y +++ P +
Sbjct: 189 DGMVADDARWLLSRCAADKSRPFFVAMGFFRPHTPYVAPKAYFDLYPVEKMPLYPPVEKV 248
Query: 135 PNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELNN 190
P L +K Q Q TD L + +Q ++ +D V ++V+ELK LG +
Sbjct: 249 PAEVPAAALVSYKKEQ---DQLTDDLRQQAVQAYYASISFMDAQVGRVVEELKQLGLAES 305
Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
T I++TSDHGYH+G+ GL + S FE RVP ++ PG+
Sbjct: 306 TIIVFTSDHGYHMGEHGLWQKMSL-FEASARVPLMIVAPGVA 346
>gi|326334100|ref|ZP_08200327.1| arylsulfatase [Nocardioidaceae bacterium Broad-1]
gi|325948076|gb|EGD40189.1| arylsulfatase [Nocardioidaceae bacterium Broad-1]
Length = 511
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/256 (27%), Positives = 112/256 (43%), Gaps = 37/256 (14%)
Query: 21 YFGKYLNKY--NGSY--IPPGWREWGALIMNSKYYNY---SINMNGRKIKHGEDYYNDYY 73
+ GKYLN+Y +G+ IPPGW EW A + + YNY N+NGR +K Y
Sbjct: 137 FTGKYLNEYGEDGTDRDIPPGWDEWRATV-DPSTYNYLGAKFNVNGRLVKP-----KGYS 190
Query: 74 PDLIANDSVRFLR--QSKQSFSKKPIMLVMSFPAPH--GPEDSAPQYANMFFNVTSHHTP 129
+I + L+ + K KP +++ APH GP + + P
Sbjct: 191 TTVITQQAQAALQDARKKSGAEAKPWFQWVNYVAPHIGGPTEKGDPKQRFPGTKGAIGVP 250
Query: 130 --------AYDYAPNPDKQWILQVTRKMQP-----------VHRQFTDLLMTKRLQTLQS 170
AY P P + + RK P V + +L +R+++ +S
Sbjct: 251 VPDQQDRDAYADKPIPQQPNLFPEGRKQFPKGSPSRKNPSQVEKDAYELSYQRRIESARS 310
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
+D + ++ LK GEL T +++TSD+G+ G GK + +E +R+P LM GPG
Sbjct: 311 LDRNIASLLGGLKKSGELARTLVVFTSDNGFSNGYHNF-NGKLWYYEESLRIPVLMSGPG 369
Query: 231 IVPGTMYLLDVLIPQV 246
+ G + P +
Sbjct: 370 VPKGRKVATTLTNPDI 385
>gi|326334096|ref|ZP_08200323.1| arylsulfatase [Nocardioidaceae bacterium Broad-1]
gi|325948072|gb|EGD40185.1| arylsulfatase [Nocardioidaceae bacterium Broad-1]
Length = 514
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 69/249 (27%), Positives = 105/249 (42%), Gaps = 35/249 (14%)
Query: 23 GKYLNKYN-GSY---IPPGWREWGALIMNSKYYNYS--INMNGRKIKHGEDYYNDYYPDL 76
GKYLN Y GS +PPGW +W A + S Y S N+NG IK Y +
Sbjct: 146 GKYLNGYGEGSSRGDVPPGWDQWRATVDPSTYNFRSPKFNVNGEIIKS-----KGYSSTV 200
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-----GPEDSAPQY----ANMFFNVTSHH 127
I + + + + S KP +++ APH GP+D Y A + V
Sbjct: 201 ITEHAKAGI--AAERGSGKPWFTWVNYVAPHHGGPSGPDDPKKIYPGTDAALSVTVPDRQ 258
Query: 128 TPA-YDYAPNPDKQWILQVTRK-----------MQPVHRQFTDLLMTKRLQTLQSVDDAV 175
YD P P + + P + + +RL+ +S+D +
Sbjct: 259 DRGFYDNVPIPARPNLFPEDTSGYAKGSPGRGTFNPRKKDALRIAYQRRLEATRSLDRNI 318
Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
++ LK GEL T +++TSD+G+ G L GK + ++ +R+P LM GPG+ G
Sbjct: 319 ASLIGSLKKSGELKRTLVVFTSDNGFSNGYHNL-NGKLWHYDESLRIPVLMSGPGVPRGR 377
Query: 236 MYLLDVLIP 244
V P
Sbjct: 378 TVRTPVTNP 386
>gi|149177547|ref|ZP_01856150.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Planctomyces maris DSM 8797]
gi|148843697|gb|EDL58057.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Planctomyces maris DSM 8797]
Length = 633
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/234 (23%), Positives = 102/234 (43%), Gaps = 31/234 (13%)
Query: 20 GYFGKYLNKY---NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
GY Y+ K+ N PG+ W ++ ++ ++N+NG +I+ + + D+
Sbjct: 282 GYATAYVGKWHMGNDDTARPGFDHWVSMKGQGTSFDPTLNINGERIQ-----FKGHTTDV 336
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE------DSAPQYANMFFNVTSHHTPA 130
+ F++ +KP L ++ A H PE S + F H
Sbjct: 337 LNQKVNEFVKAQ----GEKPFCLYIAHKALH-PELTQRDDGSITDPSAAKFMPAKRHEKL 391
Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQF------------TDLLMTKRLQTLQSVDDAVEKI 178
Y P + ++ + + R +D ++ RL+ L +D+ V +
Sbjct: 392 YSDDAIPRRLNVVDTLEGKRALKRTVPGLPPLSQKTGTSDEVIRDRLRMLAGIDEGVGSL 451
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
+ L+ G+L++T ++TSDHGY G+ GL + P+E IRVP L+R P ++
Sbjct: 452 CELLESQGKLDDTVFVFTSDHGYWYGEHGLSVERRLPYEEGIRVPLLVRYPPVI 505
>gi|261407393|ref|YP_003243634.1| arylsulfatase [Paenibacillus sp. Y412MC10]
gi|261283856|gb|ACX65827.1| sulfatase [Paenibacillus sp. Y412MC10]
Length = 496
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 76/157 (48%), Gaps = 9/157 (5%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP- 135
+ S+ FLR+ + KP L MSF PH P D Y +++ + P D+A
Sbjct: 191 VVTQSIDFLRRRDPA---KPFFLWMSFVRPHPPFDPPQAYLDLYEDAEIPEPPVGDWAET 247
Query: 136 -NPDKQWILQVT-RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+PD +T R + P R M + +D + + ++ L + GEL NT I
Sbjct: 248 EDPDLSGYSPITGRGLVPKRRLRK--AMAAYYALITHIDHQIGRFLQSLNEYGELGNTVI 305
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
++TSDHG LG L + KS P+E RVPF++ PG
Sbjct: 306 LFTSDHGELLGDHHLFR-KSLPYEGSTRVPFIVSDPG 341
>gi|417301585|ref|ZP_12088736.1| choline sulfatase [Rhodopirellula baltica WH47]
gi|327542177|gb|EGF28670.1| choline sulfatase [Rhodopirellula baltica WH47]
Length = 507
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 82/174 (47%), Gaps = 24/174 (13%)
Query: 82 VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP-NPDKQ 140
+ +L Q + + P ++ F PH D P+ + ++HT P NP +
Sbjct: 187 LEYLDQRDAAADEDPFLIYFGFSHPHDTRDGTPELLAKYG--ATNHTDRDTLPPANPKQP 244
Query: 141 WILQVTRKMQPVHRQFTDLL--------------------MTKRLQTLQSVDDAVEKIVK 180
+ + P H +L + + +++D + K+++
Sbjct: 245 PLPPNHLEAHPFHHGHPNLRDEVSVNGVWDRRDERTIRNELGREFACSENIDIQIGKVLR 304
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L+++GEL+NTYI+YTSDHG +G+ GL +GK +E RVPF+++GPG+ G
Sbjct: 305 RLEEMGELDNTYIVYTSDHGMAIGRHGL-QGKQNLYEHTWRVPFIIQGPGVPKG 357
>gi|253575992|ref|ZP_04853325.1| sulfatase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251844567|gb|EES72582.1| sulfatase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 502
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 96/192 (50%), Gaps = 20/192 (10%)
Query: 72 YYPDLIANDSVRFLR--QSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
+Y IAN SV FL Q ++SF L SFP PH P + Y+ M+ + S P
Sbjct: 195 HYNRWIANRSVAFLENLQPEESF-----YLWCSFPDPHHPFAACRPYSEMY-DPQSLTLP 248
Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL-QT---LQSVDDAVEKIVKELKDL 185
+ D L+ RK+ P H QF + + + L QT + +D + +++++LK+L
Sbjct: 249 E-GWENEADSIDWLKGLRKVHPGHTQFDETTLREVLAQTYGMITHMDSCIGEVLEKLKEL 307
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY------LL 239
G +NT I++ +DHG +LG L+ +P+E + VP++ + P +Y LL
Sbjct: 308 GLYDNTVIVFMADHGEYLGSHRLLTKAEWPWEELLNVPYIWKAPQAAKQGVYRDRVVSLL 367
Query: 240 DVLIPQVRKFSS 251
D +P + F+
Sbjct: 368 D-FVPTILDFAG 378
>gi|149196927|ref|ZP_01873980.1| putative N-acetylglucosamine-6-sulfatase [Lentisphaera araneosa
HTCC2155]
gi|149140037|gb|EDM28437.1| putative N-acetylglucosamine-6-sulfatase [Lentisphaera araneosa
HTCC2155]
Length = 489
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 103/205 (50%), Gaps = 15/205 (7%)
Query: 38 WREWGALIMNSKYYNYS-INMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKP 96
W G Y+N S + NG + K Y D + +L +V ++ KQ+ +KP
Sbjct: 142 WDRGGPKSNKYAYFNNSLVRTNGGERKPLGGYTTDKFTEL----AVDYIH--KQNDKEKP 195
Query: 97 IMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF 156
L + +P HGP A ++ + + +V P+ + P K L+ + + +
Sbjct: 196 WYLWLCYPGVHGPYTPAKRHKDFYKDVEVK-VPSDIFGPRHSKPAHLKNMTRWKK-DKNG 253
Query: 157 TDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGK 212
T + +++ ++S+D+AV ++K L+D G+L NT +I+TSD G+ GQ G K K
Sbjct: 254 TPVGFASQVKKYHNAVKSLDEAVGTLMKALEDSGQLENTIVIFTSDQGFAWGQHG-SKEK 312
Query: 213 SFPFEFDIRVPFLMRGPGIV-PGTM 236
P++ +I P +++ PGI PGT+
Sbjct: 313 WLPYDANIIAPLIIKAPGITQPGTV 337
>gi|269140257|ref|YP_003296958.1| sulfatase [Edwardsiella tarda EIB202]
gi|387868773|ref|YP_005700242.1| Choline-sulfatase [Edwardsiella tarda FL6-60]
gi|267985918|gb|ACY85747.1| putative sulfatase [Edwardsiella tarda EIB202]
gi|304560086|gb|ADM42750.1| Choline-sulfatase [Edwardsiella tarda FL6-60]
Length = 510
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/254 (27%), Positives = 107/254 (42%), Gaps = 37/254 (14%)
Query: 5 VELTSRNLLYFDISHGYFGK---------YLNKYNGSYIPPGWREWGALIMNSKY---YN 52
+ S L D S GY GK ++ YN W +W A + Y+
Sbjct: 119 AQCWSDVLKKLDYSTGYIGKWHLDAPQAPFIESYNNPMEGRYWNDWTAPDRRHGFDFWYS 178
Query: 53 YSI-NMNGRKIKHGED-------YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMS 102
Y +++ R + G + Y + + + A+ +++FLR K SF + KP LV+S
Sbjct: 179 YGTYDLHMRPMYWGNNTPRESPMYVDQWSAEHEADMAIKFLRNEKGSFRAADKPFALVVS 238
Query: 103 FPAPHGPEDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLM 161
PH P D PQ Y + F +S D P+ W P +
Sbjct: 239 MNPPHSPYDQVPQKYLDRFPGKSSR-----DLNTRPNVDWQANYQEGYGPQY-------F 286
Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
+ + + VD+ +I+ EL G NT +++ SDHG LG G K+ P+E +R
Sbjct: 287 KEYMAMVNGVDEQFGRILAELDKQGLAENTLVVFFSDHGCCLGANGQPT-KNVPYEEAMR 345
Query: 222 VPFLMRGPG-IVPG 234
+P + R PG I PG
Sbjct: 346 IPMMFRLPGKIAPG 359
>gi|197285971|ref|YP_002151843.1| exported sulfatase [Proteus mirabilis HI4320]
gi|227356480|ref|ZP_03840868.1| exported sulfatase [Proteus mirabilis ATCC 29906]
gi|425068874|ref|ZP_18471990.1| tat (twin-arginine translocation) pathway signal sequence [Proteus
mirabilis WGLW6]
gi|425071592|ref|ZP_18474698.1| tat (twin-arginine translocation) pathway signal sequence [Proteus
mirabilis WGLW4]
gi|194683458|emb|CAR44241.1| putative exported sulfatase [Proteus mirabilis HI4320]
gi|227163590|gb|EEI48511.1| exported sulfatase [Proteus mirabilis ATCC 29906]
gi|404598450|gb|EKA98920.1| tat (twin-arginine translocation) pathway signal sequence [Proteus
mirabilis WGLW4]
gi|404598774|gb|EKA99242.1| tat (twin-arginine translocation) pathway signal sequence [Proteus
mirabilis WGLW6]
Length = 519
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 67/247 (27%), Positives = 107/247 (43%), Gaps = 37/247 (14%)
Query: 12 LLYFDISHGYFGK---------YLNKYNGSYIPPGWREWGALIMNSKY---YNYSI-NMN 58
L D S GY GK ++ YN W EW A + Y+Y +++
Sbjct: 136 LKSLDYSTGYIGKWHLDAPYEPFIESYNNPMEGRYWNEWAAPDRRHGFDFWYSYGTYDLH 195
Query: 59 GRKIKHGED-------YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGP 109
+ + D Y + + P+ A+ +++FL+ F + KP LV+S PH P
Sbjct: 196 LKPMYWANDTPRDKPIYIDQWGPEHEADMAIKFLKNENNQFRDNSKPFALVVSMNPPHSP 255
Query: 110 EDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
D P+ Y + + TS + P+ QW + P + + + +
Sbjct: 256 YDQVPKKYLDAYQGKTSK-----ELNTRPNVQWDSEYQEGYGPQY-------FKEYMAMV 303
Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
VD+ +IV EL+ G +NT +++ SDHG LG G K+ P+E +RVP + R
Sbjct: 304 NGVDEQFGRIVDELEKQGLADNTLVVFFSDHGSCLGSNGQAT-KNNPYEESMRVPMMFRL 362
Query: 229 PG-IVPG 234
PG I PG
Sbjct: 363 PGKIKPG 369
>gi|253576233|ref|ZP_04853564.1| sulfatase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251844360|gb|EES72377.1| sulfatase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 503
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 101/213 (47%), Gaps = 17/213 (7%)
Query: 22 FGKYLNKYNGSYIPPGWREWGALIMNSKYY-NYSINMNGRKIKHGEDYYNDYYPDLIAND 80
FG YL Y P G ++ L YY + I M+ R I E+Y+ +++ IA
Sbjct: 152 FGDYLRWLEQVY-PGGGKK---LSKEGAYYADDKIPMSWR-IDLPEEYHYNHW---IAER 203
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQ 140
S+ FL Q Q +P L SFP PH P + Y+ M+ + S P +
Sbjct: 204 SIDFLEQMSQQ--DQPFFLWCSFPDPHHPFAACRPYSEMY-DPASLTLPEHWDVEEDGIS 260
Query: 141 WILQVTRKMQPVHRQFTD----LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
W L+ R + P + F + ++ + + VD + +I K+LK+L NT I++
Sbjct: 261 W-LKERRNIHPDYTSFDEHDLREILAQTYGMISHVDKTIGEITKKLKELELDQNTVIVFL 319
Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
+DHG +LG L+ +P+E IRVPF+ + P
Sbjct: 320 ADHGEYLGSHHLITKGEWPWEELIRVPFIWKIP 352
>gi|308512815|gb|ADO33061.1| N-acetylglucosamine-6-sulfatase [Biston betularia]
Length = 198
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 32/64 (50%), Positives = 47/64 (73%)
Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
L +VD+ V +V L++ L+NTY+++TSD+GYH+GQF V K P+E DI+VPFL++
Sbjct: 1 LLAVDEMVADVVGALENSALLDNTYLVFTSDNGYHIGQFSQVYDKRQPYETDIKVPFLIK 60
Query: 228 GPGI 231
GP I
Sbjct: 61 GPKI 64
>gi|449132608|ref|ZP_21768623.1| choline sulfatase [Rhodopirellula europaea 6C]
gi|448888287|gb|EMB18609.1| choline sulfatase [Rhodopirellula europaea 6C]
Length = 492
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 47/174 (27%), Positives = 81/174 (46%), Gaps = 24/174 (13%)
Query: 82 VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP-NPDKQ 140
+ +L Q + + P ++ F PH D P+ + ++HT P NP +
Sbjct: 172 LEYLDQRDAASDEDPFLIYFGFSHPHDTRDGTPELLAKYG--ATNHTDRDTLPPANPKQP 229
Query: 141 WILQVTRKMQPVHRQFTDLL--------------------MTKRLQTLQSVDDAVEKIVK 180
+ P H +L + + +++D + K+++
Sbjct: 230 PLPPNHLDAHPFHHGHPNLRDEVSVKGVWDRRDERTIRNELGREFACSENIDIQIGKVLR 289
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L+++GEL+NTYI+YTSDHG +G+ GL +GK +E RVPF+++GPGI G
Sbjct: 290 RLEEMGELDNTYIVYTSDHGMAIGRHGL-QGKQNLYEHTWRVPFIIQGPGIPKG 342
>gi|363581191|ref|ZP_09314001.1| sulfatase [Flavobacteriaceae bacterium HQM9]
Length = 510
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 92/182 (50%), Gaps = 15/182 (8%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
++ +++A+D++ F+ + S P + ++F APH P + +Y +M+ ++ P
Sbjct: 213 HWSEVLADDAITFMDTA--STMNNPFFMYLAFNAPHDPRQAPQKYQDMY-KLSDISLPKS 269
Query: 132 DYAPNPDKQWILQVT----RKMQPVHRQ--FTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
PD+ I + P R T + + + ++ +D + KI+K LK
Sbjct: 270 WLPEYPDRHQIANGNNLRDEALAPFPRTEYATKVNIKEYYASITHLDAQIGKILKALKTS 329
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYLLD 240
G+ NTYII+T+DHG +G+ GL+ GK ++ IR P ++ GP I + +YL D
Sbjct: 330 GKDKNTYIIFTADHGLAMGKHGLL-GKQNLYDHSIRPPLVITGPDIPKNKKITTDVYLQD 388
Query: 241 VL 242
V+
Sbjct: 389 VM 390
>gi|383115489|ref|ZP_09936245.1| hypothetical protein BSGG_2640 [Bacteroides sp. D2]
gi|313695105|gb|EFS31940.1| hypothetical protein BSGG_2640 [Bacteroides sp. D2]
Length = 526
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 100/229 (43%), Gaps = 43/229 (18%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
DL+ ++R++ KQ K+P ++ F A H P D + +++ VT D+
Sbjct: 185 DLVTQKAIRWM---KQQDGKQPFLMCCHFKATHEPYDYPIRMEHLYDGVTFPEPENLLDW 241
Query: 134 AP------------------------NPDKQWILQ-----VTRKMQ--PVHRQFTDLLMT 162
P +PDK W T MQ R L+
Sbjct: 242 GPETNGRSFIGQKLEELVRRWRTASKDPDKWWCRYPGLPFSTEGMQRTTARRAAYQKLIR 301
Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
++ +VDD + K++K L ++G +NT ++Y SD GY LG+ G + F +E R+
Sbjct: 302 DYMRCGATVDDNIGKLLKALDEMGIADNTIVVYVSDQGYFLGEHGFFDKRMF-YEESARM 360
Query: 223 PFLMRGPGIVPGTMYLLDVLI-----PQVRKFSSGSLIFIM--SFLINL 264
PF++R P +P L D+++ P + +FS + + SF+ NL
Sbjct: 361 PFVIRYPKKIPAGQRLKDLVLNVDFAPTLAEFSGVKMDNVQGRSFVDNL 409
>gi|421611137|ref|ZP_16052288.1| choline sulfatase [Rhodopirellula baltica SH28]
gi|408497951|gb|EKK02459.1| choline sulfatase [Rhodopirellula baltica SH28]
Length = 326
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 95/182 (52%), Gaps = 15/182 (8%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ ++IAN S F + + +P + ++F A H P + +Y + + V +++
Sbjct: 36 HWSEVIANHSDDFFADAAKH--DQPTFMYLAFNATHDPRQAPQEYLDKYPAENILVPANY 93
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
P + YA L+ R + P R + + +R L +D + +I+ ++
Sbjct: 94 QPLHPYAEEIGCGRNLRDER-LAPFPRTEYAVRVHRREYYALLTHMDAMIGRILDSVEAS 152
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG-----TMYLLD 240
G+ +NT+I +T+DHG +GQ GL+ GK P++ +RVPFL++GPG+ G +YL D
Sbjct: 153 GKADNTWIFFTADHGLAVGQHGLL-GKQNPYDHSVRVPFLVKGPGVEAGGRVEEPVYLQD 211
Query: 241 VL 242
V+
Sbjct: 212 VM 213
>gi|238587342|ref|XP_002391445.1| hypothetical protein MPER_09124 [Moniliophthora perniciosa FA553]
gi|215456112|gb|EEB92375.1| hypothetical protein MPER_09124 [Moniliophthora perniciosa FA553]
Length = 184
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 52/84 (61%), Gaps = 10/84 (11%)
Query: 158 DLLMTKRLQTLQSVDDAVEKIVKELKDLGE--LNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
D + +RL+ LQ VDD VE IV+E+++ G NTYIIYTSD+GY LG GKS P
Sbjct: 2 DHVYRQRLRVLQPVDDLVEAIVQEVENAGPEVAKNTYIIYTSDNGYALGSHQRSPGKSLP 61
Query: 216 FE--------FDIRVPFLMRGPGI 231
+E D+ +P ++RGPG+
Sbjct: 62 YEEGNKRFYFADVLIPLIIRGPGV 85
>gi|325285349|ref|YP_004261139.1| sulfatase [Cellulophaga lytica DSM 7489]
gi|324320803|gb|ADY28268.1| sulfatase [Cellulophaga lytica DSM 7489]
Length = 433
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 56/209 (26%), Positives = 95/209 (45%), Gaps = 26/209 (12%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY-DY 133
D+ AN+++ F+ K SKKP ++F APH P + Y +++ P+ D
Sbjct: 153 DITANEAINFI---KNYSSKKPFFAYVAFNAPHVPRQTTQNYYDLYPVDKVKLPPSVVDN 209
Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIVKELKDLGELN 189
P L K Q + + M KR+Q + +D + I+ LK+ G N
Sbjct: 210 TP-------LNTNVKYQYTNNPLSAKTMQKRVQQNNAMVTHMDTRIGDIINTLKEKGIYN 262
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYLLDVL-- 242
NT I++TSDHG + G+ G V GK +E + P +++ P + + +YL DV+
Sbjct: 263 NTIIVFTSDHGINFGENG-VAGKVCLYEPSVTAPLIIKAPTVKQNNKITARVYLQDVVPT 321
Query: 243 ---IPQVRKFSSGSLIFIMSFLINLNTMR 268
+ +++ + +M LIN R
Sbjct: 322 LFSLLKIKDKEATDFKSLMPLLINSKKTR 350
>gi|340619483|ref|YP_004737936.1| sulfatase [Zobellia galactanivorans]
gi|339734280|emb|CAZ97657.1| Sulfatase, family S1-27 [Zobellia galactanivorans]
Length = 490
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 74/250 (29%), Positives = 110/250 (44%), Gaps = 47/250 (18%)
Query: 20 GYFGKYLNKYNGSY--IPPGWREWGALIMNSKY----YNYSINMNGRKIKHGEDYYNDYY 73
GY GK+ +G +PPG R G + YN+S+ + E ++Y
Sbjct: 137 GYIGKWHLDGHGRLQNVPPGARRQGFDFWKANECTHDYNHSVYYDNLD---PEQKIWEHY 193
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ-YANMFFNVTSHHTPAYD 132
D+ Q+K S P M+++S+ PH P +AP+ Y MF PA
Sbjct: 194 DTFDQTDAALNYIQTKNGDSS-PFMMIVSYGTPHAPYHTAPEKYRKMF-------DPA-- 243
Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
+ + V +MQ + TDL + ++DD + KI K+LK+ G+ NT
Sbjct: 244 -----NIKLRENVPEEMQ--EKAKTDL--AGYYAHIAAIDDMIGKITKKLKEEGQWENTL 294
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP-----------------GIVPGT 235
I++TSDHG LG G K K P+E IRVP LM+ P I+P
Sbjct: 295 ILFTSDHGDMLGSQGAYK-KQQPYEESIRVPMLMKLPKDMNLAPAVKKTVINTEDILPTL 353
Query: 236 MYLLDVLIPQ 245
+ L D+ IP+
Sbjct: 354 LGLCDIDIPK 363
>gi|406699829|gb|EKD03024.1| arylsulfatase precursor [Trichosporon asahii var. asahii CBS 8904]
Length = 646
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 65/242 (26%), Positives = 98/242 (40%), Gaps = 35/242 (14%)
Query: 21 YFGKYLNKYN---GSYIPPGWREWGALIMNSKYYNY------SINMNGRKIKHGEDYYND 71
Y GK +N +N + +P G +++ Y+Y +N + +G
Sbjct: 138 YTGKLMNGHNIDNCASLPVSGVTEGEFLVDPWTYDYWSPGFCDVNTGQIAVHNGS----- 192
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFP-APHG------------PEDSAPQYAN 118
Y D++A + L + SK V++ P PH P S PQY
Sbjct: 193 YSSDVVAQ---KVLGHLDTALSKDEPFFVVAAPIGPHSHIPGAKNSNDYIPGMSIPQYRP 249
Query: 119 MFFNVTSHHTPAYDYAPNPD----KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
+ H+ NPD W+ + R Q D RL+ LQ +D
Sbjct: 250 QDAGKFADHSIERHEGFNPDFPTGVSWVKDLPRLNQ-TQIDHLDDWHRARLRALQPIDAL 308
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+ + +L G + TY+IYTSD+GY LG GK+ FE D RVP +RGPG+ G
Sbjct: 309 IGDVFAKLDAAGVTDETYVIYTSDNGYALGSHRRQPGKTLGFEEDTRVPLFVRGPGVEKG 368
Query: 235 TM 236
+
Sbjct: 369 RV 370
>gi|32477511|ref|NP_870505.1| sulfatase [Rhodopirellula baltica SH 1]
gi|32448065|emb|CAD77582.1| probable sulfatase [Rhodopirellula baltica SH 1]
Length = 524
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 82/174 (47%), Gaps = 24/174 (13%)
Query: 82 VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP-NPDKQ 140
+ +L + + + P ++ F PH D P+ + ++HT P NP +
Sbjct: 204 LEYLDKRDAAADEDPFLIYFGFSHPHDTRDGTPELLAKYG--ATNHTDRDTLPPANPKQP 261
Query: 141 WILQVTRKMQPVHRQFTDLL--------------------MTKRLQTLQSVDDAVEKIVK 180
+ + P H +L + + +++D + K+++
Sbjct: 262 PLPPNHLEAHPFHHGHPNLRDEVSVKGVWDRRDERTIRNELGREFACSENIDIQIGKVLR 321
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L+++GEL+NTYI+YTSDHG +G+ GL +GK +E RVPF+++GPGI G
Sbjct: 322 RLEEMGELDNTYIVYTSDHGMAIGRHGL-QGKQNLYEHTWRVPFIIQGPGIPKG 374
>gi|357384309|ref|YP_004899033.1| choline-sulfatase [Pelagibacterium halotolerans B2]
gi|351592946|gb|AEQ51283.1| choline-sulfatase [Pelagibacterium halotolerans B2]
Length = 476
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 58/218 (26%), Positives = 100/218 (45%), Gaps = 34/218 (15%)
Query: 18 SHGYFGKYLNKYNGSYIPPG--WREW--GALIMNSKYYNYSINMNGRKIKHGEDYYNDYY 73
+ G G++L+ Y G + G ++W G+L Y N++ + + +D YND +
Sbjct: 158 ARGLEGRFLDDYAGRGVVRGTLLQQWLDGSLGTPPDAY---ANVDPQDLP--DDAYNDNF 212
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
R L + K P ++++FP PH P D P A + V
Sbjct: 213 V------GQRTLDELKGFDPNSPWFMIVNFPGPHEPMDVTPSMAEAWEGVEF-------- 258
Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
P P + + Q + R++ ++ +++D + + + L++ G+L+NT I
Sbjct: 259 -PLP-RMRNTEDAELQQDIRRRYAAMI--------ENIDRWIGRYIAHLEETGQLDNTVI 308
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
++ SDHG LG L K K PFE +RVP ++ GPGI
Sbjct: 309 VFASDHGEMLGDRNLWK-KQVPFEASVRVPMILAGPGI 345
>gi|334364983|ref|ZP_08513955.1| arylsulfatase [Alistipes sp. HGB5]
gi|313158816|gb|EFR58199.1| arylsulfatase [Alistipes sp. HGB5]
Length = 516
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 56/246 (22%), Positives = 105/246 (42%), Gaps = 41/246 (16%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGK+ PG+ W YYN +N+NG ++K + Y+ D++ +
Sbjct: 132 AFFGKWHMGNQDDMPQPGFDHWEGFRGQGTYYNTVLNINGERVKFDPELYST---DILTD 188
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----------FNVTSHHTP 129
++ F+R +++ P + +S+ + H +P M+ FNV + P
Sbjct: 189 HAIDFVRDNEEG----PFFIYLSYKSVHSGFQPSPSRKGMYKDEKAVYPPSFNVPEYGIP 244
Query: 130 A----------------YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR-----LQTL 168
Y + PD W+ V Q+ L + +T+
Sbjct: 245 RLPGKDADGRPLAGRGWYGESRLPD--WVKNQRESWHGVDYQYHGALPYEEDFRNYCETV 302
Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
S+DDA+ +++ L+ G +T +IY D+G+ G+ GL+ ++F +E +RVP L
Sbjct: 303 TSMDDAIGRLLDFLQAEGLGESTLVIYMGDNGFTWGEHGLIDKRNF-YEPSVRVPMLAYC 361
Query: 229 PGIVPG 234
P ++P
Sbjct: 362 PELIPA 367
>gi|421611911|ref|ZP_16053039.1| choline sulfatase [Rhodopirellula baltica SH28]
gi|408497316|gb|EKK01847.1| choline sulfatase [Rhodopirellula baltica SH28]
Length = 524
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 82/174 (47%), Gaps = 24/174 (13%)
Query: 82 VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP-NPDKQ 140
+ +L + + + P ++ F PH D P+ + ++HT P NP +
Sbjct: 204 LEYLDKRDAAADEDPFLIYFGFSHPHDTRDGTPELLAKYG--ATNHTDRDTLPPANPKQP 261
Query: 141 WILQVTRKMQPVHRQFTDLL--------------------MTKRLQTLQSVDDAVEKIVK 180
+ + P H +L + + +++D + K+++
Sbjct: 262 PLPPNHLEAHPFHHGHPNLRDEVSVKGVWDRRDERTIRNELGREFACSENIDIQIGKVLR 321
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L+++GEL+NTYI+YTSDHG +G+ GL +GK +E RVPF+++GPGI G
Sbjct: 322 RLEEMGELDNTYIVYTSDHGMAIGRHGL-QGKQNLYEHTWRVPFIIQGPGIPKG 374
>gi|159487839|ref|XP_001701930.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281149|gb|EDP06905.1| predicted protein [Chlamydomonas reinhardtii]
Length = 795
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 35/72 (48%), Positives = 44/72 (61%)
Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
RLQ L+SVDD V +V L G LN T IYTSD+G+ LG + + K +E D+R+P
Sbjct: 304 RLQALRSVDDTVSALVSRLACRGLLNRTVFIYTSDNGFKLGNHDITQEKFTQYEEDVRLP 363
Query: 224 FLMRGPGIVPGT 235
LM GPGI G
Sbjct: 364 LLMTGPGIPVGA 375
>gi|440712658|ref|ZP_20893273.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica SWK14]
gi|436442644|gb|ELP35762.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica SWK14]
Length = 534
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 58/223 (26%), Positives = 104/223 (46%), Gaps = 37/223 (16%)
Query: 31 GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQ 90
G+Y+PPG + Y++N+NG ++K Y D + + +V +L++
Sbjct: 177 GNYLPPGPK-------------YTLNVNGERVKQ-----KGYITDELTDYAVDWLKERDD 218
Query: 91 SFSKKPIMLVMSFPAPHG----PEDSAPQYAN--MFFNVTSHHTPAYDYAPNPDKQWILQ 144
KP L +S A H E +YA+ + F T A P +W+
Sbjct: 219 D---KPFFLYLSHKAVHSNFTPAERHQGRYADEDLSFLPTGKELSADKNTP----RWVRD 271
Query: 145 VTRKMQPVHRQF-----TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
+ + D L + +++ +VDD+V +++++LKD+G ++T IIY D+
Sbjct: 272 QKNSWHGIDFSYHSDKGLDYLYRRYCESVLAVDDSVGRVLQQLKDMGIHDDTLIIYMGDN 331
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
G+ G+ GL+ K +E IRVP LM+ P + G + +V+
Sbjct: 332 GFMWGEHGLID-KRVSYEASIRVPMLMQCPNLFDGGQPIENVV 373
>gi|32474463|ref|NP_867457.1| choline sulfatase [Rhodopirellula baltica SH 1]
gi|32445001|emb|CAD75003.1| choline sulfatase [Rhodopirellula baltica SH 1]
Length = 477
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 94/182 (51%), Gaps = 15/182 (8%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ ++IAN S F + + +P + ++F A H P + +Y + + V +++
Sbjct: 187 HWSEVIANHSDDFFADAAKH--DQPTFMYLAFNATHDPRQAPQEYLDKYPTENILVPANY 244
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
P + A L+ R + P R + + +R L +D + +I+ ++
Sbjct: 245 QPLHPCAEEIGCGRNLRDER-LAPFPRTEYAVRVHRREYYALLTHMDAMIGRILDSVEAS 303
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLD 240
G+ +NT+I +T+DHG +GQ GL+ GK P++ +RVPFL++GPG+ G +YL D
Sbjct: 304 GKADNTWIFFTADHGLAVGQHGLL-GKQNPYDHSVRVPFLVKGPGVKAGGRVEEPIYLQD 362
Query: 241 VL 242
V+
Sbjct: 363 VM 364
>gi|408529297|emb|CCK27471.1| sulfatase [Streptomyces davawensis JCM 4913]
Length = 509
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 68/234 (29%), Positives = 102/234 (43%), Gaps = 34/234 (14%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNG--RKIKHGEDYYNDYYPDLI 77
G FGKYLN + PP + EW L +Y + N NG R+I Y +I
Sbjct: 155 GLFGKYLNFWRTGDNPPHFDEW-LLQEPVRYVDGHYNDNGAVRRIP-------GYNTTVI 206
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
+ ++ F+ S+ ++P ++ + H P+YA HT D+ P
Sbjct: 207 KDRALAFMEASRTD--QRPWFAYVATRSAHEVNVPEPKYA---------HTRVPDWKGRP 255
Query: 138 --------DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
DK L+ R L ++L+TL SVDDA+ ++L+ LG+L
Sbjct: 256 SVFETGKDDKPPFLRAAGHPFAAGRD----LRARQLRTLLSVDDAMHDFREKLRALGQLE 311
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
NT +++TSDHG G G ++ KS P+ + VPF P GT D L
Sbjct: 312 NTLVVFTSDHGLCWGDHGWLR-KSVPYRPSLEVPFHASWPAGGLGTSRADDRLT 364
>gi|332665111|ref|YP_004447899.1| N-acetylglucosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
1100]
gi|332333925|gb|AEE51026.1| N-acetylglucosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
1100]
Length = 501
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 54/237 (22%), Positives = 102/237 (43%), Gaps = 37/237 (15%)
Query: 36 PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKK 95
PG+ W + YYN + N+NG+++ H + + Y DL+ + +V +++ +KK
Sbjct: 131 PGFDYWLSFKGQGVYYNPTFNINGQQVPHKD---SAYISDLLTDYAVDWIKAQP---AKK 184
Query: 96 PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQ 155
P L +S H + A ++ M+ N+ P+ +++ ++ P +
Sbjct: 185 PFFLYLSHKGVHALFEPAKRHKGMYKNLRFTPPPSMFMTARDTSKYLELKNSRISPELKP 244
Query: 156 FTD------------------------------LLMTKRLQTLQSVDDAVEKIVKELKDL 185
F + + L+TL +VD++V +++ +
Sbjct: 245 FVENQRDQPNWVYDQRHSWHGVDYMYHGYISFYEFYQQYLETLMAVDESVGRVLDYVSSS 304
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
NNT +IY D+G+ G+ GL+ K +E +RVP LM P +VP L V+
Sbjct: 305 ALANNTMVIYMGDNGFSFGEHGLID-KRHAYEESMRVPMLMYAPKLVPAGSKLTQVI 360
>gi|153807006|ref|ZP_01959674.1| hypothetical protein BACCAC_01283 [Bacteroides caccae ATCC 43185]
gi|149130126|gb|EDM21336.1| arylsulfatase [Bacteroides caccae ATCC 43185]
Length = 529
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 49/205 (23%), Positives = 93/205 (45%), Gaps = 44/205 (21%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY-DY 133
DL+A ++ +++Q S +P ++ F A H P D + +++ VT + D+
Sbjct: 189 DLVAEKAINWIKQQD---SNQPFLMCCHFKATHEPYDYPTRMEHLYDGVTFPEPENFLDW 245
Query: 134 AP------------------------NPDKQWI-----------LQVTRKMQPVHRQFTD 158
P +PDK W +Q T + ++++
Sbjct: 246 GPETNGRSFQGQTLEELGHRWRVASKDPDKWWCRYPGLPFNTDGMQRTTARRAIYQK--- 302
Query: 159 LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEF 218
L+ L+ ++DD + K++K L ++G +NT ++Y SD GY LG+ G + F +E
Sbjct: 303 -LIRDYLRCGATIDDNIGKLLKTLDEMGIADNTIVVYVSDQGYFLGEHGFFDKRMF-YEE 360
Query: 219 DIRVPFLMRGPGIVPGTMYLLDVLI 243
R+PF++R P +P L D+++
Sbjct: 361 TARMPFVIRYPKKIPAGKRLKDLIL 385
>gi|149174110|ref|ZP_01852738.1| N-acetylgalactosamine 6-sulfate sulfatase [Planctomyces maris DSM
8797]
gi|148847090|gb|EDL61425.1| N-acetylgalactosamine 6-sulfate sulfatase [Planctomyces maris DSM
8797]
Length = 469
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 58/196 (29%), Positives = 88/196 (44%), Gaps = 37/196 (18%)
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
PDL+ +D++ F+R SK KP L + F APH P P+ + + P
Sbjct: 180 PDLLVDDAINFIRTSKD----KPFALCLHFRAPHTPYGPVPEQDSAHYEGMKIDVP---I 232
Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
P + I Q ++ ++ SVD + +++KEL L NT +
Sbjct: 233 TPGVIPEQIRQKNKEY---------------YASVSSVDRNIGRLLKELDQLRLAENTLV 277
Query: 194 IYTSDHGYHLGQFGL------------VKGKSFPFEFD--IRVPFLMRGPGIV-PGTMYL 238
I+TSDHGY+ G+ G+ V G P +D IRVP +MR P ++ PGT +
Sbjct: 278 IFTSDHGYNNGRHGVSTKGNGHWIAGGVTGPKRPNMWDTSIRVPLVMRWPAVIKPGTQFD 337
Query: 239 LDVLIPQVRKFSSGSL 254
V + KF G+L
Sbjct: 338 EIVSNIDMFKFVLGAL 353
>gi|423220051|ref|ZP_17206547.1| hypothetical protein HMPREF1061_03320 [Bacteroides caccae
CL03T12C61]
gi|392623883|gb|EIY17981.1| hypothetical protein HMPREF1061_03320 [Bacteroides caccae
CL03T12C61]
Length = 538
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 49/205 (23%), Positives = 93/205 (45%), Gaps = 44/205 (21%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY-DY 133
DL+A ++ +++Q S +P ++ F A H P D + +++ VT + D+
Sbjct: 198 DLVAEKAINWIKQQD---SNQPFLMCCHFKATHEPYDYPTRMEHLYDGVTFPEPENFLDW 254
Query: 134 AP------------------------NPDKQWI-----------LQVTRKMQPVHRQFTD 158
P +PDK W +Q T + ++++
Sbjct: 255 GPETNGRSFKGQTLEELGHRWRVASKDPDKWWCRYPGLPFNTDGMQRTTARRAIYQK--- 311
Query: 159 LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEF 218
L+ L+ ++DD + K++K L ++G +NT ++Y SD GY LG+ G + F +E
Sbjct: 312 -LIRDYLRCGATIDDNIGKLLKTLDEMGIADNTIVVYVSDQGYFLGEHGFFDKRMF-YEE 369
Query: 219 DIRVPFLMRGPGIVPGTMYLLDVLI 243
R+PF++R P +P L D+++
Sbjct: 370 AARMPFVIRYPKKIPAGKRLKDLIL 394
>gi|421613439|ref|ZP_16054521.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica SH28]
gi|408495803|gb|EKK00380.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica SH28]
Length = 534
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/223 (26%), Positives = 104/223 (46%), Gaps = 37/223 (16%)
Query: 31 GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQ 90
G+Y+PPG + Y++N+NG ++K Y D + + +V +L++
Sbjct: 177 GNYLPPGPK-------------YTLNVNGERVKQ-----KGYITDELTDYAVDWLKERDD 218
Query: 91 SFSKKPIMLVMSFPAPHG----PEDSAPQYAN--MFFNVTSHHTPAYDYAPNPDKQWILQ 144
KP L +S A H E +YA+ + F T A P +W+
Sbjct: 219 D---KPFFLYLSHKAVHSNFTPAERHQGRYADEDLSFLPTGKELSADKNTP----RWVRD 271
Query: 145 VTRKMQPVHRQF-----TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
+ + D L + +++ +VDD+V +++++LKD+G ++T IIY D+
Sbjct: 272 QKNSWHGIDFSYHSDKGLDYLYRRYCESVLAVDDSVGRVLQQLKDMGIHDDTLIIYMGDN 331
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
G+ G+ GL+ K +E IRVP LM+ P + G + +V+
Sbjct: 332 GFMWGEHGLID-KRVSYEASIRVPMLMQCPNLFDGGKPIENVV 373
>gi|268592966|ref|ZP_06127187.1| N-acetylglucosamine-6-sulfatase [Providencia rettgeri DSM 1131]
gi|291311438|gb|EFE51891.1| N-acetylglucosamine-6-sulfatase [Providencia rettgeri DSM 1131]
Length = 519
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 66/247 (26%), Positives = 104/247 (42%), Gaps = 37/247 (14%)
Query: 12 LLYFDISHGYFGK---------YLNKYNGSYIPPGWREWGALIMNS------KYYNYSIN 56
L D S GY GK ++ YN W EW A Y Y ++
Sbjct: 136 LKSLDYSTGYIGKWHLDAPYAPFIESYNNPMEGRYWNEWAAPNRRHGFDFWYSYGTYDLH 195
Query: 57 MN-----GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGP 109
M + Y + + P+ A+ +++FL+ + + KP LV+S PH P
Sbjct: 196 MKPMYWANNTPREKPIYVDQWGPEHEADMAIKFLQNKNGQYRDNTKPFALVVSMNPPHSP 255
Query: 110 EDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
D PQ Y + + TS + P+ +W ++ P + + L +
Sbjct: 256 YDQVPQKYLDAYKGKTSQ-----ELNTRPNVKWDVEYQEGYGPQY-------FKEYLAMV 303
Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
VD+ +IV EL+ G +NT +++ SDHG LG G K+ P++ +RVP + R
Sbjct: 304 NGVDEQFGRIVDELEKQGLADNTLVVFFSDHGSCLGANGQPT-KNNPYDESMRVPMMFRL 362
Query: 229 PG-IVPG 234
PG I PG
Sbjct: 363 PGKIQPG 369
>gi|386821832|ref|ZP_10109048.1| arylsulfatase A family protein [Joostella marina DSM 19592]
gi|386426938|gb|EIJ40768.1| arylsulfatase A family protein [Joostella marina DSM 19592]
Length = 434
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 91/184 (49%), Gaps = 35/184 (19%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF--FNVT-------- 124
D+ +++V+F+ + KP +SF +PH P +A Y +++ N+T
Sbjct: 153 DITGSEAVKFINNYN---NDKPFFAYVSFNSPHVPRQTAQNYYDLYPIENITLPPSVKDN 209
Query: 125 SHHTP--AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
S P Y YAPNP LQ+ ++ V + + ++T +D + I+K L
Sbjct: 210 SLLNPNVRYQYAPNP-----LQLKTMLERVQQN--NAMVTH-------MDKQIGDILKSL 255
Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MY 237
KD G +NT I++TSDHG + G+ G V GK +E + P ++R P + GT +Y
Sbjct: 256 KDKGIYDNTIIVFTSDHGINFGENG-VAGKVCLYEPSVTAPLIIRAPNLEAGTTISERVY 314
Query: 238 LLDV 241
L D+
Sbjct: 315 LQDI 318
>gi|160890613|ref|ZP_02071616.1| hypothetical protein BACUNI_03058 [Bacteroides uniformis ATCC 8492]
gi|156859612|gb|EDO53043.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
Length = 510
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 56/195 (28%), Positives = 102/195 (52%), Gaps = 17/195 (8%)
Query: 56 NMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ 115
N NG+ + + N + +L A+ ++RFL ++K S P ++ ++F +PH P + P
Sbjct: 193 NTNGKWV---DASLNTFSSELYADAAIRFL-EAKAS-DDTPFLMYVAFTSPHDPRNVLPH 247
Query: 116 YANMFFNVTSHHTPAYDYAPNP-DKQWILQVTRKMQPVHRQFTDLLMTKRLQ---TLQSV 171
Y + + + T P A +P D + + K+ R + ++T+R + V
Sbjct: 248 YGHKYAS-TEVPMPENFVAQHPFDNGDLNERDEKLLSTPR-ISAQVLTERANYYGMVSEV 305
Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
D + +I+ L+ G+++NT I++TSD+G +G+ GL+ GK +E +RVP ++ GPGI
Sbjct: 306 DFQIGRILDVLESSGKVDNTIIVFTSDNGLCVGEHGLL-GKQNLYEAAVRVPLVICGPGI 364
Query: 232 VPGTM-----YLLDV 241
M YL D+
Sbjct: 365 PQNAMCDAYCYLYDI 379
>gi|449133266|ref|ZP_21768913.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula europaea 6C]
gi|448887952|gb|EMB18296.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula europaea 6C]
Length = 554
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/223 (26%), Positives = 104/223 (46%), Gaps = 37/223 (16%)
Query: 31 GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQ 90
G+Y+PPG + Y++N+NG ++K Y D + + +V +L++
Sbjct: 197 GNYLPPGPK-------------YTLNVNGERVKQ-----KGYITDELTDYAVDWLKERDD 238
Query: 91 SFSKKPIMLVMSFPAPHG----PEDSAPQYAN--MFFNVTSHHTPAYDYAPNPDKQWILQ 144
KP L +S A H E +YA+ + F T A P +W+
Sbjct: 239 D---KPFFLYLSHKAVHSNFTPAERHQGRYADEDLSFLPTGKELSADKNTP----RWVRD 291
Query: 145 VTRKMQPVHRQF-----TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
+ + D L + +++ +VDD+V +++++LKD+G ++T IIY D+
Sbjct: 292 QKNSWHGIDFSYHSDKGLDYLYRRYCESVLAVDDSVGRVLQQLKDMGIHDDTLIIYMGDN 351
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
G+ G+ GL+ K +E IRVP LM+ P + G + +V+
Sbjct: 352 GFMWGEHGLID-KRVSYEASIRVPMLMQCPNLFDGGKPIENVV 393
>gi|417304905|ref|ZP_12091902.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica WH47]
gi|327538823|gb|EGF25470.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica WH47]
Length = 534
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 101/219 (46%), Gaps = 29/219 (13%)
Query: 31 GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQ 90
G+Y+PPG + Y++N+NG ++K Y D + + +V +L++
Sbjct: 177 GNYLPPGPK-------------YTLNVNGERVKQ-----KGYITDELTDYAVDWLKERDD 218
Query: 91 SFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD--YAPNPDKQWILQVTRK 148
KP L +S A H A ++ + + P A +W+
Sbjct: 219 D---KPFFLYLSHKAVHSNFTPAKRHQGRYADEDLSFLPTGKELSADKNTPRWVRDQKNS 275
Query: 149 MQPVHRQF-----TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHL 203
+ + D L + +++ +VDD+V +++++LKD+G ++T IIY D+G+
Sbjct: 276 WHGIDFSYHSDKGLDYLYRRYCESVLAVDDSVGRVLQQLKDMGIHDDTLIIYMGDNGFMW 335
Query: 204 GQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
G+ GL+ K +E IRVP LM+ P + G + +V+
Sbjct: 336 GEHGLID-KRVSYEASIRVPMLMQCPNLFDGGQPIENVV 373
>gi|317479849|ref|ZP_07938969.1| sulfatase [Bacteroides sp. 4_1_36]
gi|316904055|gb|EFV25889.1| sulfatase [Bacteroides sp. 4_1_36]
Length = 491
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 56/195 (28%), Positives = 102/195 (52%), Gaps = 17/195 (8%)
Query: 56 NMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ 115
N NG+ + + N + +L A+ ++RFL ++K S P ++ ++F +PH P + P
Sbjct: 174 NTNGKWV---DASLNTFSSELYADAAIRFL-EAKAS-DDTPFLMYVAFTSPHDPRNVLPH 228
Query: 116 YANMFFNVTSHHTPAYDYAPNP-DKQWILQVTRKMQPVHRQFTDLLMTKRLQ---TLQSV 171
Y + + + T P A +P D + + K+ R + ++T+R + V
Sbjct: 229 YGHKYAS-TEVPMPENFVAQHPFDNGDLNERDEKLLSTPR-ISAQVLTERANYYGMVSEV 286
Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
D + +I+ L+ G+++NT I++TSD+G +G+ GL+ GK +E +RVP ++ GPGI
Sbjct: 287 DFQIGRILDVLESSGKVDNTIIVFTSDNGLCVGEHGLL-GKQNLYEAAVRVPLVICGPGI 345
Query: 232 VPGTM-----YLLDV 241
M YL D+
Sbjct: 346 PQNAMCDAYCYLYDI 360
>gi|340619412|ref|YP_004737865.1| mucin-desulfating sulfatase [Zobellia galactanivorans]
gi|339734209|emb|CAZ97586.1| Mucin-desulfating sulfatase, family S1-11 [Zobellia
galactanivorans]
Length = 505
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/227 (24%), Positives = 100/227 (44%), Gaps = 20/227 (8%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
+FGK+ + PG+ W + YY ++N+NG+ I++ +D Y DL+
Sbjct: 138 FFGKWHMGDHNDSPRPGFDHWESFAGQGDYYAPNLNINGKNIQYDKD---TYVTDLLTEH 194
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD------YA 134
+V +L + KP + +S A H A ++ M+ +Y Y
Sbjct: 195 AVDWLDKRD---VDKPFFMYLSHKAVHAMFKPAKRHEGMYAGKKIEKPDSYTQTIGSAYK 251
Query: 135 PNPDKQWILQVTRKMQPVHRQFT-------DLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
+W+ + V + D L+ + +TL VD++V ++ LK G
Sbjct: 252 KLNWPEWVKKQRDSWHGVDYMYHGRADGTFDDLVQRYCETLMGVDESVGAVMDWLKTNGL 311
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+T +IY D+G+ G+ GL+ + F +E ++VP L+R P I G
Sbjct: 312 DESTMVIYMGDNGFSWGEHGLIDKRHF-YEESVKVPMLVRCPEIFEG 357
>gi|149198031|ref|ZP_01875079.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
gi|149138943|gb|EDM27348.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
Length = 647
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 87/185 (47%), Gaps = 12/185 (6%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ ++A++++ F + K KP + ++F APH P + +Y +M+ + ++
Sbjct: 178 HWTQVVADNTLTFFEKVKND--NKPFFMYLAFNAPHDPRQAPKEYVDMYPLDSIKIPENY 235
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
P Y YA + + P + ++ +D + +++ L+ G+
Sbjct: 236 MPEYPYAAEICGKKLRDEVLAPYPRTTYAVKRNRQEYYASITYMDHHIGRMLDALEASGK 295
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYLLDVL 242
NTYII+T+DHG G GL+ GK +E +R PF++ GPGI + +YL D +
Sbjct: 296 AENTYIIFTADHGLAAGHHGLM-GKQSMYEHSMRPPFIVVGPGIKQNSKIDTPIYLQDAM 354
Query: 243 IPQVR 247
+
Sbjct: 355 ATAIE 359
>gi|372209721|ref|ZP_09497523.1| sulfatase [Flavobacteriaceae bacterium S85]
Length = 505
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 94/184 (51%), Gaps = 19/184 (10%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ +++ +D+++F+ +K S KP + ++F APH P + +Y +++ V
Sbjct: 213 HWSEVVKDDALQFINHAKTS--DKPFFMYLAFNAPHDPRQAPQEYQDLYNVHEIQVPESF 270
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELK 183
P Y Y + L+ + P R T+ + ++ ++ D + I++ L+
Sbjct: 271 LPEYPYRKSIGNGNNLR-DEALAPFPR--TEFAVKTHIKEYYASITHADAQIGMILEALE 327
Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYL 238
G+++NTYII+T+DHG +G+ GL+ GK F+ +R P ++ GP I V +YL
Sbjct: 328 KSGKMDNTYIIFTADHGLAMGRHGLL-GKQNLFDHSLRPPMVIVGPDIPKNKQVATDVYL 386
Query: 239 LDVL 242
D +
Sbjct: 387 QDAM 390
>gi|406660077|ref|ZP_11068212.1| Arylsulfatase [Cecembia lonarensis LW9]
gi|405556211|gb|EKB51161.1| Arylsulfatase [Cecembia lonarensis LW9]
Length = 491
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 84/164 (51%), Gaps = 9/164 (5%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT----SHH 127
Y ++ A+ ++ F+R K++ + P ++F APH P Y + + + T ++
Sbjct: 175 YSTEIFADSAIDFIRSQKEA--ENPFFCYVAFTAPHDPYSPEATYIDYYPDGTIPLPGNY 232
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
P + + D + P + ++++ + VD V +I+K L++ G+
Sbjct: 233 KPFHPF--QFDDLTVRDENLAGWPRKPEVIQMILSDYYALVTHVDHHVGRILKVLEETGQ 290
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
L NT I+YT+D+G +G GL+ GK +E I+VP +++GPGI
Sbjct: 291 LENTIIVYTADNGLAVGSHGLL-GKQSLYEHSIKVPLIIKGPGI 333
>gi|149197403|ref|ZP_01874454.1| putative sulfatase yidj [Lentisphaera araneosa HTCC2155]
gi|149139421|gb|EDM27823.1| putative sulfatase yidj [Lentisphaera araneosa HTCC2155]
Length = 511
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 74/160 (46%), Gaps = 15/160 (9%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
N Y D + N ++ F+ Q K S P ++S+P PHGP+ Y M+ ++
Sbjct: 203 NTYTTDWLTNKTIDFITQHKAS----PFCYMVSYPDPHGPDTVRAPYDTMYTHMNFQKPK 258
Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
+ W T++ Q+ ++ + +DD + +I+ L + G L
Sbjct: 259 TASKKQDDLPSWA--TTKRGAANQSQYYGMI--------KCIDDNIARIMTCLDEQGILE 308
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
NT +++TSDHG G+ G + K P E +VPF++R P
Sbjct: 309 NTIVVFTSDHGDMRGEHGR-QNKGIPLEASAKVPFIVRYP 347
>gi|340751541|ref|ZP_08688352.1| arylsulfatase [Fusobacterium mortiferum ATCC 9817]
gi|340562196|gb|EEO35554.2| arylsulfatase [Fusobacterium mortiferum ATCC 9817]
Length = 473
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/174 (28%), Positives = 90/174 (51%), Gaps = 14/174 (8%)
Query: 66 EDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS 125
E++ Y+ + + +++R K F+K P+ ++ +F PH P D A ++A M+ V
Sbjct: 172 EEFNQTYWIGMKSCEAIR-----KYDFNK-PLFMMTNFVDPHHPFDPAEKFARMYDGVEI 225
Query: 126 HHTPAYDYAPNPDKQWI-LQVTRKMQP---VHRQFTDLLMTKRLQ----TLQSVDDAVEK 177
+ D N +++ Q R P + +D + + + + +D + K
Sbjct: 226 DSPISKDKFCNERPEYLKRQGERGYWPGGGEQHKLSDEKVEEYTRYYYAMITFIDQEIGK 285
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
I KEL+ GEL+NT II+TSDHG ++G +GL++ F ++ I+VP L G G+
Sbjct: 286 IRKELEKKGELDNTIIIFTSDHGEYMGDYGLLQKGPFMYDNLIKVPLLFWGKGV 339
>gi|409730693|ref|ZP_11272254.1| N-acetylglucosamine-6-sulfatase [Halococcus hamelinensis 100A6]
gi|448723708|ref|ZP_21706224.1| N-acetylglucosamine-6-sulfatase [Halococcus hamelinensis 100A6]
gi|445787247|gb|EMA37995.1| N-acetylglucosamine-6-sulfatase [Halococcus hamelinensis 100A6]
Length = 536
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/235 (24%), Positives = 107/235 (45%), Gaps = 34/235 (14%)
Query: 20 GYFGKYLNKY-----NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYP 74
GY ++ K+ N + P + W + Y++ +N++G + H + Y
Sbjct: 142 GYETAFIGKWDTFRINSAEPRPAFDRWVSFEKQGNYFDPRLNVDGDWVDH-----SGYIT 196
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV------TSHHT 128
D++ ++++LR KP L +S APH A ++ + + T ++
Sbjct: 197 DILTEYALQWLRNRS---GDKPFFLYLSHKAPHAWFRPARRHRGRYADAPIEYPKTMANS 253
Query: 129 PAYDYAPNPDKQWILQVTRKMQPVH-----------RQFTDLLMTKRLQTLQSVDDAVEK 177
PA +Y PD W+ + ++ V D L + +TL S+D+++
Sbjct: 254 PA-NYRDKPD--WVREQRDSVRGVDYVFGGNFGFGGPSGFDTLYRRYCETLLSLDESIGA 310
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
++ +L G ++T +Y SD+G+ LG+ GL+ GK +E +RVP L PG+V
Sbjct: 311 VMDQLDTSGVADSTLTLYMSDNGFSLGEHGLI-GKQTAYEPSMRVPLLAWAPGLV 364
>gi|319953999|ref|YP_004165266.1| sulfatase [Cellulophaga algicola DSM 14237]
gi|319422659|gb|ADV49768.1| sulfatase [Cellulophaga algicola DSM 14237]
Length = 434
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 85/179 (47%), Gaps = 23/179 (12%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D+ AN++V FL ++ KP ++F APH P + +Y +M+ PA +
Sbjct: 153 DITANEAVSFLNTYDKT---KPFFAYVAFNAPHVPRQTEQKYYDMY--------PAEEMV 201
Query: 135 --PNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIVKELKDLGEL 188
P+ L K Q + M +R+Q + +D + I++ LKD G
Sbjct: 202 LPPSVINNTPLNTNVKYQYTNDPLQSKTMQQRVQQNNAMVTHMDTGIGAIIQALKDKGIY 261
Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLDVL 242
+NT I++TSDHG + G+ G V GK +E + P +++ P P T +YL DV+
Sbjct: 262 DNTIIVFTSDHGINFGENG-VAGKVCLYEPSVTAPLIIKAPLAKPNTKIASRVYLQDVV 319
>gi|451853738|gb|EMD67031.1| hypothetical protein COCSADRAFT_168273 [Cochliobolus sativus
ND90Pr]
Length = 458
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/229 (26%), Positives = 95/229 (41%), Gaps = 50/229 (21%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKH-----GEDYYNDYYP 74
GY +NG Y+P ++ G + YY +N I++ + Y +Y
Sbjct: 66 GYPKSIREGHNGRYLPVWLQQAGY----NTYYTGKF-LNAHSIRNYNSPFPKGYPGNYST 120
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
DLI++ ++ FL ++ + +K P L + PH
Sbjct: 121 DLISSKAIEFLEEA--AAAKAPFFLGVMPIVPH--------------------------- 151
Query: 135 PNPDKQWILQVTRKMQPV-------HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
Q IL PV + + D RL L SVDD V+ + + L+ G
Sbjct: 152 ----TQTILSTIPGGLPVFEPPVSAYVTYNDEFYRLRLAALASVDDHVDAMFERLESFGL 207
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+NN YIIYTSD+G+ +GQ L S +E D+ VP +RG G+ G +
Sbjct: 208 MNNIYIIYTSDNGFPIGQHRLALENSCAYEEDVNVPMFIRGTGVPKGEV 256
>gi|32473376|ref|NP_866370.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica SH 1]
gi|32398056|emb|CAD78151.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica SH 1]
Length = 534
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/223 (25%), Positives = 104/223 (46%), Gaps = 37/223 (16%)
Query: 31 GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQ 90
G+Y+PPG + Y++N+NG ++K Y D + + +V +L++
Sbjct: 177 GNYLPPGPK-------------YTLNVNGERVKQ-----KGYITDELTDYAVDWLKERDD 218
Query: 91 SFSKKPIMLVMSFPAPHG----PEDSAPQYAN--MFFNVTSHHTPAYDYAPNPDKQWILQ 144
+P L +S A H E +YA+ + F T A P +W+
Sbjct: 219 D---EPFFLYLSHKAVHSNFTPAERHQGRYADEDLSFLPTGKELSADKNTP----RWVRD 271
Query: 145 VTRKMQPVHRQF-----TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
+ + D L + +++ +VDD+V +++++LKD+G ++T IIY D+
Sbjct: 272 QKNSWHGIDFSYHSDKGLDYLYRRYCESVLAVDDSVGRVLQQLKDMGIHDDTLIIYMGDN 331
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
G+ G+ GL+ K +E IRVP LM+ P + G + +V+
Sbjct: 332 GFMWGEHGLID-KRVSYEASIRVPMLMQCPNLFDGGQPIENVV 373
>gi|410030244|ref|ZP_11280074.1| sulfatase [Marinilabilia sp. AK2]
Length = 491
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 83/164 (50%), Gaps = 9/164 (5%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT----SHH 127
Y +L A+ ++ F+R K++ + P ++F APH P Y + + + T ++
Sbjct: 175 YSTELFADSAIDFIRSQKEA--ENPFFCYVAFTAPHDPYSPEAAYVDYYPDGTIPLPGNY 232
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
P + + D + P + ++++ + VD V +I+K L++ G
Sbjct: 233 MPFHPF--QFDDLTVRDENLTGWPRKPEVIQMILSDYYALVTHVDHQVGRILKVLEETGL 290
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
L NT I+YT+D+G +G GL+ GK +E I+VP +++GPGI
Sbjct: 291 LENTIIVYTADNGLAVGSHGLL-GKQSLYEHSIKVPLIIKGPGI 333
>gi|83950157|ref|ZP_00958890.1| Sulfatase [Roseovarius nubinhibens ISM]
gi|83838056|gb|EAP77352.1| Sulfatase [Roseovarius nubinhibens ISM]
Length = 518
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 84/165 (50%), Gaps = 13/165 (7%)
Query: 87 QSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT---SHHTPAYDYAPNPDKQWIL 143
+ + + + P+ +V SF PH P + P++ +++ + P D+ P ++ +
Sbjct: 187 RDRAAGQEAPLCMVASFIHPHDPYVARPEWWDLYSDDEIELPEVLPLADHDPF-SRRLMD 245
Query: 144 QVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGY 201
+ P+ R +++ +R L + D + +VK L + GEL+NT +I T+DHG
Sbjct: 246 GIEASYVPLSRD--EVIRARRAYLANVSYFDSKIGALVKTLDETGELDNTVVIVTADHGD 303
Query: 202 HLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG----TMYLLDVL 242
LG+ GL +F FE RVP +M GPG+V G L+D+L
Sbjct: 304 MLGERGLWYKMNF-FEHSARVPLIMAGPGVVQGAAANACSLIDLL 347
>gi|392967509|ref|ZP_10332927.1| sulfatase [Fibrisoma limi BUZ 3]
gi|387844306|emb|CCH54975.1| sulfatase [Fibrisoma limi BUZ 3]
Length = 510
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 104/222 (46%), Gaps = 24/222 (10%)
Query: 37 GWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKP 96
G+ W + YYN +N++G++++H + + Y D++ + ++ FL + + KP
Sbjct: 156 GFNHWVSFRGQGVYYNPVLNVDGKRVEHRD---SAYIADVLTDYALNFLNRRDKD---KP 209
Query: 97 IMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQ-----------WILQV 145
+ +S + H + N F + PA Y +P Q W+ +
Sbjct: 210 FFMYLSHKSVHHQFQPSRADRNRFAQSPIQY-PASFYQTDPRNQNPETKESGVPHWVRRQ 268
Query: 146 TRKMQPV----HRQFT-DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
V H T D + +TL+S+D ++ +++K L D G NT +IY D+G
Sbjct: 269 RYSWHGVDYLYHGSTTFDSFVRDYCETLRSMDTSIGRVLKYLDDNGLAENTLVIYMGDNG 328
Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
+ G+ GL+ ++ +E +RVPFL+R P ++ + + ++
Sbjct: 329 FMFGEHGLIDKRT-AYEESMRVPFLVRCPALIKAGITVAQMI 369
>gi|430747734|ref|YP_007206863.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430019454|gb|AGA31168.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 475
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/177 (29%), Positives = 79/177 (44%), Gaps = 37/177 (20%)
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
PDL+ + +RF+ ++ +P +L + F APH P P+ + + P
Sbjct: 182 PDLLVGEGIRFVEANRD----RPFLLSIHFRAPHVPYAPVPEQDSAHYETLDPTIPDVPG 237
Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
P RQ L + ++ SVD V +++ L+ LG T +
Sbjct: 238 LP------------------RQRVKTLTRRYYASVSSVDRNVGRLLDRLEALGLTGKTIV 279
Query: 194 IYTSDHGYHLGQFG---------LVKGKS---FPFEFD--IRVPFLMRGPGIV-PGT 235
I+TSDHGY +G G LV+GK P FD IRVP ++R PG+V PGT
Sbjct: 280 IFTSDHGYMIGHHGLWHKGNGTWLVEGKKEERRPNMFDDSIRVPLIVRWPGVVAPGT 336
>gi|354582496|ref|ZP_09001398.1| sulfatase [Paenibacillus lactis 154]
gi|353199895|gb|EHB65357.1| sulfatase [Paenibacillus lactis 154]
Length = 497
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 70/151 (46%), Gaps = 5/151 (3%)
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD-K 139
S+ FLR+ KP L MSF PH P D Y +++ P D+A D
Sbjct: 195 SIDFLRRRDPG---KPFFLWMSFVRPHPPFDPPQAYLDLYEEAEIPEPPVGDWAEKEDVS 251
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
Q L V ++ M + +D + + ++ L + GELNNT I++TSDH
Sbjct: 252 QAGLSPVTGRGIVPKRRLRKAMAAYYALITHLDHQIGRFLQALHEYGELNNTVIVFTSDH 311
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
G LG L + KS P+E VPF++ PG
Sbjct: 312 GELLGDHHLFR-KSLPYEGSAHVPFIVNDPG 341
>gi|390947095|ref|YP_006410855.1| arylsulfatase A family protein [Alistipes finegoldii DSM 17242]
gi|390423664|gb|AFL78170.1| arylsulfatase A family protein [Alistipes finegoldii DSM 17242]
Length = 515
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/246 (22%), Positives = 105/246 (42%), Gaps = 41/246 (16%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+FGK+ PG+ W YY+ +N+NG ++K + Y+ D++ +
Sbjct: 131 AFFGKWHMGNQDDMPQPGFDHWEGFRGQGTYYDTVLNINGERVKFDPELYST---DILTD 187
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----------FNVTSHHTP 129
++ F+R +++ P + +S+ + H +P M+ FNV + P
Sbjct: 188 HAIDFVRDNEEG----PFFIYLSYKSVHSGFQPSPSRKGMYKDEKAVYPPSFNVPEYGIP 243
Query: 130 A----------------YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR-----LQTL 168
Y + PD W+ V Q+ L + +T+
Sbjct: 244 RLPGKDADGRPLAGRGWYGESRLPD--WVKNQRESWHGVDYQYHGALPYEEDFRNYCETV 301
Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
S+DDA+ +++ L+ G +T +IY D+G+ G+ GL+ ++F +E +RVP L
Sbjct: 302 TSMDDAIGRLLDFLQAEGLDESTLVIYMGDNGFTWGEHGLIDKRNF-YEPSVRVPMLAYC 360
Query: 229 PGIVPG 234
P ++P
Sbjct: 361 PELIPA 366
>gi|297180521|gb|ADI16734.1| arylsulfatase a and related enzymes [uncultured Verrucomicrobiales
bacterium HF0010_05E02]
Length = 535
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 103/233 (44%), Gaps = 31/233 (13%)
Query: 51 YNYSINMNGRKIKHGEDYYNDYYPD-LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
Y +++ +G + ++ E Y D PD L A D+VRFL++ KP +L + F PH P
Sbjct: 213 YGWTLFHSGEQWEYREGYNRDLMPDELCAQDAVRFLKEKH----AKPFILTVGFSRPHSP 268
Query: 110 EDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTR---------KMQPVHRQFTDLL 160
+ ++ + F + TP + D IL V + K + + D
Sbjct: 269 WYAPQEFFDRFPLESLELTPLLKGDAD-DVSKILSVEQDIAQPWGWSKYKKIVNNGGDQQ 327
Query: 161 MTKRLQ----TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
+ K Q + VD V ++ L+ NT +I+T DHGYH+G+ + K P+
Sbjct: 328 LLKWTQAYLACVAFVDHLVGVVLGVLESSEYAENTVVIFTGDHGYHMGEKDYLF-KYSPW 386
Query: 217 EFDIRVPFLMRGPGIVPGTMYLLDVLIPQVRKFSSGSLIFIMSFLINLNTMRA 269
E +RVP ++ GP I G + V SL+ + I+L M+A
Sbjct: 387 EESVRVPLILAGPDIAEGAISRTPV-----------SLVDLYPTCIDLAAMQA 428
>gi|374605454|ref|ZP_09678382.1| arylsulfatase [Paenibacillus dendritiformis C454]
gi|374388953|gb|EHQ60347.1| arylsulfatase [Paenibacillus dendritiformis C454]
Length = 488
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 75/153 (49%), Gaps = 9/153 (5%)
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD-- 138
S+ F+R+ KP L MSF PH P D Y +M+ + P D+A D
Sbjct: 193 SIDFVRRRDPG---KPFFLWMSFVRPHPPFDPPQAYLDMYKDEEIPDPPVGDWADTADEA 249
Query: 139 KQWILQVT-RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
++ + +T R + P R M + +D + + ++ L + GEL NT I++TS
Sbjct: 250 REGLSPITSRGIVPKRR--LKQAMAAYYALITHLDHQIGRFLQVLDEYGELRNTVILFTS 307
Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
DHG LG L + KS P+E RVPF++ PG
Sbjct: 308 DHGELLGDHNLFR-KSLPYEGSARVPFIVNDPG 339
>gi|87306335|ref|ZP_01088482.1| choline sulfatase [Blastopirellula marina DSM 3645]
gi|87290514|gb|EAQ82401.1| choline sulfatase [Blastopirellula marina DSM 3645]
Length = 462
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/184 (27%), Positives = 92/184 (50%), Gaps = 19/184 (10%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ +++A+D+V FL + + KP + ++F A H P S Y + + +V +
Sbjct: 170 HWSEVVADDAVDFLDTAAKR--DKPFFMYLAFNAVHDPRQSPQDYVDHYPAEKTDVPADF 227
Query: 128 TPAYDYAPNPDKQWILQVTR--KMQPVHR--QFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
P Y P + + R K+ P R + ++ ++ +D + +I+ L+
Sbjct: 228 QPLY---PEREAMGAGKSLRDEKLAPFPRTPHAIQVHRSEYYAIIEHMDAQIGRILDALQ 284
Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYL 238
G+ +NTYI +T+DHG +G GL+ GK ++ +RVP L+ GPG+ G +YL
Sbjct: 285 KTGKADNTYIFFTADHGLAVGHHGLL-GKQNMYDHSVRVPLLVAGPGLPQGKVCETPVYL 343
Query: 239 LDVL 242
DV+
Sbjct: 344 QDVM 347
>gi|349805009|gb|AEQ17977.1| putative glucosamine (n-acetyl)-6-sulfatase [Hymenochirus curtipes]
Length = 110
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 65/114 (57%), Gaps = 6/114 (5%)
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
DY D++AN S+ FL + +P +++S PAPH P +APQY F NVT+
Sbjct: 1 DYLTDVLANVSLDFLNYKS---NYQPFFMMISTPAPHSPWIAAPQYEKTFPNVTAPRGGN 57
Query: 131 YDYAPNPDKQWILQVTRK-MQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
++ + DK W+++ + M QF D KR QTL SVDD +EK++K+L+
Sbjct: 58 FNV--HKDKHWLIRQDKSPMSNSSIQFLDNAFRKRWQTLLSVDDLIEKLLKQLE 109
>gi|294634825|ref|ZP_06713348.1| N-acetylglucosamine-6-sulfatase [Edwardsiella tarda ATCC 23685]
gi|451966770|ref|ZP_21920021.1| putative sulfatase [Edwardsiella tarda NBRC 105688]
gi|291091761|gb|EFE24322.1| N-acetylglucosamine-6-sulfatase [Edwardsiella tarda ATCC 23685]
gi|451314442|dbj|GAC65383.1| putative sulfatase [Edwardsiella tarda NBRC 105688]
Length = 521
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 67/254 (26%), Positives = 106/254 (41%), Gaps = 37/254 (14%)
Query: 5 VELTSRNLLYFDISHGYFGK---------YLNKYNGSYIPPGWREWGALIMNSKY---YN 52
+ S L D S GY GK Y+ YN W +W A + Y+
Sbjct: 130 AQCWSDVLKKLDYSTGYIGKWHLDAPHAPYIESYNNPMEGRYWNDWTAPDRRHGFDFWYS 189
Query: 53 YSI-NMNGRKIKHGED-------YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMS 102
Y +++ R + G + + + + A+ +++FLR + +F + KP LV+S
Sbjct: 190 YGTYDLHMRPMYWGNTTPREKPMFVDQWSAEHEADMAIKFLRNDEGNFRAADKPFALVVS 249
Query: 103 FPAPHGPEDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLM 161
PH P D PQ Y + F TS D P+ W P +
Sbjct: 250 MNPPHSPYDQVPQKYLDRFAGKTSR-----DLNTRPNVDWQANYQEGYGPQY-------F 297
Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
+ + + VD+ +I+ EL G +NT +++ SDHG LG G K+ +E +R
Sbjct: 298 KEYMAMVNGVDEQFGRILAELDKQGLADNTLVVFFSDHGCCLGANGQPT-KNVAYEEAMR 356
Query: 222 VPFLMRGPG-IVPG 234
+P + R PG I PG
Sbjct: 357 IPMMFRLPGKIAPG 370
>gi|423018509|ref|ZP_17009230.1| putative sulfatase [Achromobacter xylosoxidans AXX-A]
gi|338778394|gb|EGP42868.1| putative sulfatase [Achromobacter xylosoxidans AXX-A]
Length = 485
Score = 67.8 bits (164), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 56/192 (29%), Positives = 91/192 (47%), Gaps = 11/192 (5%)
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
GE Y Y ++ A V +LR++ + S KP +L +S APH P + P++ +FN
Sbjct: 154 GESSYTFYDREITARAQV-WLREAARRESDKPWVLFVSLVAPHFPLTAPPEHYYRYFNQN 212
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
YD PD +++ + + + KR +D+ + KI+
Sbjct: 213 LPLPKLYDRRERPDHPYLVDYRNSFN--YDDYFEGDAVKRAVAGYYGLCSFLDENIGKIL 270
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLL 239
L+D G ++T ++YTSDHG +LG GL GKS FE VP +M G I G ++
Sbjct: 271 NALEDAGLADDTRVMYTSDHGDNLGARGLW-GKSTMFEETAGVPLIMAGDDIPQGR--VV 327
Query: 240 DVLIPQVRKFSS 251
DV + + + +
Sbjct: 328 DVPVTHIDAYPT 339
>gi|402494641|ref|ZP_10841381.1| sulfatase [Aquimarina agarilytica ZC1]
Length = 506
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 91/186 (48%), Gaps = 23/186 (12%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ +++ D+ FL + S + P + ++F APH P + +Y +M+ ++
Sbjct: 213 HWSEVLTQDATTFLDMA--SKKEDPFFMYLAFNAPHDPRQAPKKYQDMYDLDRISLPKSW 270
Query: 128 TPAYDYAPNPDKQWILQ----VTRKMQPVHRQ--FTDLLMTKRLQTLQSVDDAVEKIVKE 181
P Y PD+ I + P R T + + + ++ +D + KI++
Sbjct: 271 LPEY-----PDRHQIANGDHLRDEALAPFPRTEYATKVHIKEYYASITHLDAQIGKILEA 325
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTM 236
LK G+ NTYII+T+DHG +G+ GL+ GK ++ +R P ++ GP I V +
Sbjct: 326 LKASGKDKNTYIIFTADHGLAMGRHGLL-GKQNLYDHSLRPPLIIMGPDIPKHKKVNADV 384
Query: 237 YLLDVL 242
YL DV+
Sbjct: 385 YLQDVM 390
>gi|301309188|ref|ZP_07215132.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Bacteroides sp. 20_3]
gi|423338919|ref|ZP_17316661.1| hypothetical protein HMPREF1059_02586 [Parabacteroides distasonis
CL09T03C24]
gi|300832870|gb|EFK63496.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Bacteroides sp. 20_3]
gi|409233044|gb|EKN25885.1| hypothetical protein HMPREF1059_02586 [Parabacteroides distasonis
CL09T03C24]
Length = 524
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 46/201 (22%), Positives = 91/201 (45%), Gaps = 36/201 (17%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
D++ + ++R+++++ + +P M+ F A H P D + +++ V +D+
Sbjct: 183 DIVTDKAIRWMKEADKD---RPFMMCCHFKATHEPWDFPERMRHLYDGVVFPEPENLFDW 239
Query: 134 AP------------------------NPDKQWIL--QVTRKMQPVHRQFT-----DLLMT 162
P +PDK W ++ + +HR L+
Sbjct: 240 GPETNGRTFSGQPLEELARRWDVASQDPDKWWCRYPELPFTTKGMHRSAARSAAYQKLVC 299
Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
L+ ++DD + K++K L D+G +NT ++Y SD GY LG+ G K +E +R+
Sbjct: 300 DYLRCGATIDDNIGKLLKALDDMGIADNTIVVYVSDQGYFLGEHGFFD-KRMMYEESLRM 358
Query: 223 PFLMRGPGIVPGTMYLLDVLI 243
PF++R P +P D+++
Sbjct: 359 PFVIRYPKEIPAGTRNKDMIL 379
>gi|406833294|ref|ZP_11092888.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 520
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 110/264 (41%), Gaps = 59/264 (22%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GY + K++ P G+ W L YYN + NG ++KH Y D+I +
Sbjct: 122 GYSTAIIGKWHLISDPTGFDHWHILPGQGLYYNPPMIRNGEEVKH-----EGYATDIITD 176
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA------------PQYANMFFNVTSHH 127
S+ +L+ +S KP +L+ APH A P+ +F +
Sbjct: 177 LSIDWLKNRDKS---KPFVLMSQHKAPHREWAPALRDLGWDQDRQYPEPETLFDDYAGRS 233
Query: 128 TPAYDYAPNPDKQWI----------------LQVT-----------RKMQPVHRQFTDLL 160
D+ D+ + LQV RKM P Q DL+
Sbjct: 234 KAVSDHDMGIDRTFTDLDAKLKPPPNMTAEQLQVWNAYYGPRNDAFRKMNP---QGKDLV 290
Query: 161 -------MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
M L +++VDD V +++K L G +T I++TSD G++LG+ G K
Sbjct: 291 RWRYNRYMHDYLGCVKAVDDNVGRLLKFLDQEGLSQDTLIVFTSDQGFYLGEHGWFD-KR 349
Query: 214 FPFEFDIRVPFLMRGPGIV-PGTM 236
+ FE +R P +MR PG+V PG +
Sbjct: 350 WIFEESLRSPLVMRMPGLVKPGQV 373
>gi|422320449|ref|ZP_16401509.1| sulfatase [Achromobacter xylosoxidans C54]
gi|317404779|gb|EFV85161.1| sulfatase [Achromobacter xylosoxidans C54]
Length = 485
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 56/192 (29%), Positives = 91/192 (47%), Gaps = 11/192 (5%)
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
GE Y Y ++ A V +LR++ + S KP +L +S APH P + P++ +FN
Sbjct: 154 GESSYTFYDREITARAQV-WLREAARRESDKPWVLFVSLVAPHFPLTAPPEHYYRYFNQN 212
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
YD PD +++ + + + KR +D+ + KI+
Sbjct: 213 LPLPKLYDRRERPDHPYLVDYRNSFN--YDDYFEGDAVKRAVAGYYGLCSFLDENIGKIL 270
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLL 239
L+D G ++T ++YTSDHG +LG GL GKS FE VP +M G I G ++
Sbjct: 271 NALEDAGLADDTRVMYTSDHGDNLGARGLW-GKSTMFEETAGVPLIMAGDDIPRGR--VV 327
Query: 240 DVLIPQVRKFSS 251
DV + + + +
Sbjct: 328 DVPVTHIDAYPT 339
>gi|71281899|ref|YP_269086.1| N-acetylglucosamine-6-sulfatase [Colwellia psychrerythraea 34H]
gi|71147639|gb|AAZ28112.1| putative N-acetylglucosamine-6-sulfatase [Colwellia psychrerythraea
34H]
Length = 537
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 62/236 (26%), Positives = 105/236 (44%), Gaps = 32/236 (13%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYY--------NYSINMNGRKIKHGEDYYNDY 72
+FGK+ +G PG+ W + YY IN+NG ++ Y
Sbjct: 128 FFGKWHMGGHGDSPQPGFDHWLSFAGQGHYYPKKDKKGRTNKININGERVDQ-----KGY 182
Query: 73 YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-----SHH 127
D + + +V +L + S KP + +S A H D AP++ + + +V S
Sbjct: 183 ITDELTDYAVDWLDKRD---SDKPFFMYLSHKAVHSNFDPAPRHKDQYSDVAIEVPESQA 239
Query: 128 TPAYDYAPNPDKQWILQVTRKMQ----PVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKE 181
+YA P W+ P H + D+ KR + L +VDD++ +++K
Sbjct: 240 DTPENYAGKP--MWVKNQRNSWHGVDFPYHSEM-DVQEYKRQYHRALSAVDDSLGRVLKW 296
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVPGTM 236
LKD N+T ++ D+G+ G+ GL+ ++ +E +RVP L PG PGT+
Sbjct: 297 LKDNNLENDTIVMLMGDNGFMFGEHGLIDKRN-AYEESMRVPLLAYAPGYFKPGTV 351
>gi|298375117|ref|ZP_06985074.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Bacteroides sp. 3_1_19]
gi|298267617|gb|EFI09273.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Bacteroides sp. 3_1_19]
Length = 524
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 46/201 (22%), Positives = 91/201 (45%), Gaps = 36/201 (17%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
D++ + ++R+++++ + +P M+ F A H P D + +++ V +D+
Sbjct: 183 DIVTDKAIRWMKEADKD---RPFMMCCHFKATHEPWDFPERMRHLYDGVVFPEPENLFDW 239
Query: 134 AP------------------------NPDKQWIL--QVTRKMQPVHRQFT-----DLLMT 162
P +PDK W ++ + +HR L+
Sbjct: 240 GPETNGRTFSGQPLEELARRWDVASQDPDKWWCRYPELPFTTKGMHRSVARSAAYQKLVR 299
Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
L+ ++DD + K++K L D+G +NT ++Y SD GY LG+ G K +E +R+
Sbjct: 300 DYLRCGATIDDNIGKLLKALDDMGIADNTIVVYVSDQGYFLGEHGFFD-KRMMYEESLRM 358
Query: 223 PFLMRGPGIVPGTMYLLDVLI 243
PF++R P +P D+++
Sbjct: 359 PFVIRYPKEIPAGTRNKDMIL 379
>gi|223939156|ref|ZP_03631039.1| sulfatase [bacterium Ellin514]
gi|223892205|gb|EEF58683.1| sulfatase [bacterium Ellin514]
Length = 492
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 67/153 (43%), Gaps = 20/153 (13%)
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
+D+ ++LR S +KKP L +S+ PH P +AP +
Sbjct: 201 THDASQYLR--NHSHAKKPFFLYLSWGPPHDPYQTAPATYRSQYQAAK------------ 246
Query: 138 DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
++ + P R + ++D V +++ LKD G NT +I+TS
Sbjct: 247 -----IKTRLNVPPGMRASAQTNLAGYYSHCTAIDSCVGTLLQTLKDTGLETNTLVIFTS 301
Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
DHG L GLVK K PF+ IRVP LMR P
Sbjct: 302 DHGDMLHSHGLVK-KQHPFDESIRVPLLMRWPA 333
>gi|320103099|ref|YP_004178690.1| sulfatase [Isosphaera pallida ATCC 43644]
gi|319750381|gb|ADV62141.1| sulfatase [Isosphaera pallida ATCC 43644]
Length = 952
Score = 67.4 bits (163), Expect = 6e-09, Method: Composition-based stats.
Identities = 51/175 (29%), Positives = 85/175 (48%), Gaps = 10/175 (5%)
Query: 77 IANDSVRFL--RQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
IA+ +V FL R + ++P ++ ++ PH P P + + P
Sbjct: 645 IADAAVEFLEHRAALPPDQRRPFLMHLALECPHDPLAPDPADLAALPPIEAFPLPPSFLP 704
Query: 135 PNP-DKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELNNT 191
+P D +L +++ R + R L + +D V ++++ L GE +NT
Sbjct: 705 FHPFDNGEMLVRDERLEAFPRDPQAIRTRWRDYLAVVAGIDRVVGRLLEALTVAGERDNT 764
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG----TMYLLDVL 242
Y+I+TSD G LG GL+ GK +E IRVPF++ GPG+ PG ++LLD+
Sbjct: 765 YVIFTSDQGLALGDHGLL-GKQNLYEHSIRVPFIVAGPGLRPGRCAAMVWLLDLF 818
>gi|149178829|ref|ZP_01857409.1| putative sulfatase yidJ [Planctomyces maris DSM 8797]
gi|148842304|gb|EDL56687.1| putative sulfatase yidJ [Planctomyces maris DSM 8797]
Length = 394
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 63/246 (25%), Positives = 100/246 (40%), Gaps = 70/246 (28%)
Query: 44 LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSF 103
LI YYN + NG ++KH Y D+I + ++ +L+ + KP ML+
Sbjct: 7 LIGQGPYYNPPMIKNGERVKH-----EGYTTDIITDLALDYLKNDRDP--DKPFMLMFQH 59
Query: 104 PAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK-------------QWILQVTRKMQ 150
APH P+Y +M+ +VT P PD Q + + + M
Sbjct: 60 KAPHRNWQPGPKYLHMYDDVT---------IPEPDNLFDNYEGRGTAAHQQDMTIAKTMT 110
Query: 151 PVHRQFT--------------------------------DLL-------MTKRLQTLQSV 171
P + T DL+ M L+ + SV
Sbjct: 111 PFDLKLTPPTNLTPEQLATWNAAYEPKNEAFRKANLTGKDLVRWKYQRYMKDYLRCVASV 170
Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
D+ V +++ L+ G NT +IY+SD G++LG G K F +E R+P L R PG+
Sbjct: 171 DENVGRMLDYLESSGLAKNTVVIYSSDQGFYLGDHGWF-DKRFMYEESYRMPLLARWPGV 229
Query: 232 V-PGTM 236
+ PG++
Sbjct: 230 IKPGSV 235
>gi|315647334|ref|ZP_07900447.1| arylsulfatase [Paenibacillus vortex V453]
gi|315277536|gb|EFU40865.1| arylsulfatase [Paenibacillus vortex V453]
Length = 490
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 48/157 (30%), Positives = 73/157 (46%), Gaps = 9/157 (5%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+ S+ FLR+ KP L MSF PH P D Y +++ + P D+A
Sbjct: 181 VVTQSIDFLRRRDPG---KPFFLWMSFVRPHPPFDPPQAYLDLYEDADLPEPPVGDWART 237
Query: 137 PDKQ---WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
D + + R + P R M + +D + + ++ L + GEL+NT I
Sbjct: 238 EDPELAGYSPITGRGLVPKRRLRK--AMAAYYALITHLDHQIGRFLQSLNEYGELHNTVI 295
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
++TSDHG LG L + KS P+E VPF++ PG
Sbjct: 296 LFTSDHGELLGDHHLFR-KSLPYEGSTHVPFIVNDPG 331
>gi|390957885|ref|YP_006421642.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
gi|390958226|ref|YP_006421983.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
gi|390412803|gb|AFL88307.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
gi|390413144|gb|AFL88648.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
Length = 526
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 70/266 (26%), Positives = 111/266 (41%), Gaps = 56/266 (21%)
Query: 18 SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
S GY + K++ P G+ W ++ YN + + K G Y D+I
Sbjct: 125 SAGYHTAMIGKWHLGSDPTGFDTWN-ILPGQGVYNDPVFYD----KDGSTQIKGYCTDII 179
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG-----------------PEDSA------- 113
A+ ++R L++ Q KP ++ APH PE +
Sbjct: 180 ADLTIRTLKERPQD---KPFFIMSHHKAPHREWTPDEKHRKQFENLHIPEPATLRDDYAG 236
Query: 114 ------PQYANMFFNVTSHH---TPAYDYAPNPDKQWI----LQVTRKMQPVHRQFT--- 157
Q ++F ++T P +P KQW+ ++V + V + T
Sbjct: 237 RTDALREQKQSVFRDLTRADLKIVPPAGLSPEDQKQWLAAKPMEVEIESDGVKKTITGRE 296
Query: 158 ------DLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
M L +QSVDD V +++ L G NT +IYTSD G+ LG+ GL
Sbjct: 297 LEDWKYQRYMRDYLACVQSVDDNVGRVLDWLDANGLRENTIVIYTSDQGFFLGEHGLFD- 355
Query: 212 KSFPFEFDIRVPFLMRGPG-IVPGTM 236
K F +E +++PF++R PG I PGT+
Sbjct: 356 KRFMYEESLQMPFVVRWPGFIKPGTV 381
>gi|168333887|ref|ZP_02692125.1| choline sulfatase [Epulopiscium sp. 'N.t. morphotype B']
Length = 459
Score = 67.0 bits (162), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 84/187 (44%), Gaps = 28/187 (14%)
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--------N 122
++ D+IA ++FL Q KP SF APH P ++ NM+ N
Sbjct: 170 EHSTDVIAESVIKFLDQKHD----KPFFAYTSFLAPHDPRTMPEEFLNMYNPEDIKLPPN 225
Query: 123 VTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF--TDLLMTKRLQTLQSVDDAVEKIVK 180
S+H Y W + + P R T + + + +D + +I+
Sbjct: 226 FMSYHFIEY-------ANWECR-DETLAPYPRTLANTQKHIAEYYAMITHLDYQIGRILD 277
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT----- 235
+L+++GE +NT I+Y D+G LGQ GL GK ++ +RVP L+ G GI G
Sbjct: 278 KLEEIGEKDNTIIVYAGDNGLALGQHGLF-GKQSLYDHSMRVPLLISGAGIKAGMKTDAL 336
Query: 236 MYLLDVL 242
+YL D+
Sbjct: 337 VYLFDIF 343
>gi|159487449|ref|XP_001701735.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280954|gb|EDP06710.1| predicted protein [Chlamydomonas reinhardtii]
Length = 631
Score = 67.0 bits (162), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 101/240 (42%), Gaps = 35/240 (14%)
Query: 23 GKYLNKYNGSYI-----PPGWREWGALIMNSKYYNYSINMNGRKIKHGED----YYNDYY 73
G++LN +N S + P GW L+ Y + G D Y Y+
Sbjct: 136 GQFLNGFNRSVLQQYGCPQGWTAADVLV---PAYGCA----------GPDGVVTYTGRYH 182
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA-------PQYANMFFNVTSH 126
+LI + R + + +P ++++ AP +A P+Y N+F +
Sbjct: 183 EELIREKVTK--RIDEAAGQDQPFFMMVATVAPRDSGRAATAYPEVMPEYKNLFLGERAP 240
Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
+ + + Q T D RLQ L++VDD + ++ L
Sbjct: 241 RLADWGVPVPAEVGFTSQRTNNFNAAD---IDARFRARLQALRAVDDTLALMLSRLACHD 297
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
L++T I+++SD G+ LG + + KS FE D+RVP L+ GPG+ P +Y +V V
Sbjct: 298 LLDDTVIVFSSDSGFKLGSHNMAQDKSTYFEEDVRVPMLLAGPGM-PVGVYAAEVTAAAV 356
>gi|227495529|ref|ZP_03925845.1| sulfatase [Actinomyces coleocanis DSM 15436]
gi|226831076|gb|EEH63459.1| sulfatase [Actinomyces coleocanis DSM 15436]
Length = 482
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 67/230 (29%), Positives = 107/230 (46%), Gaps = 36/230 (15%)
Query: 19 HGYFGKYLNKYN--GSYIP-PGWREWGA-LIMNSKYYNYSI-NMNGRKIKHGEDYYNDYY 73
+GY+ + K++ S P PG+ W A YYN I + NG + + Y+
Sbjct: 101 NGYYCGMVGKWHVGTSQTPAPGFSYWYAHRYGGGPYYNAPIWDENGNEATEPK-----YF 155
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE-DSAPQ-----YANMFFNVTSHH 127
D IA ++ F++ + +KP L+++F APH P ++ PQ YA+ F
Sbjct: 156 TDAIAENACDFIQSAASVNEEKPFFLMVNFTAPHSPWINNHPQELMDLYADTDFPSIPRE 215
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
P W T+ F D + + R +L VD+AV I+K L++
Sbjct: 216 EP---------HPW----TKYYDDFADAFADPVPSLRGYAASLTGVDNAVGDILKALEEN 262
Query: 186 GELNNTYIIYTSDHGYHLGQFGL-VKGK-SFPFEF---DIRVPFLMRGPG 230
+NT ++Y SD+G+ GQ G+ KG +FP F +RVPF++ PG
Sbjct: 263 AYADNTVVMYMSDNGFSCGQHGIWGKGNGTFPLNFWENSVRVPFIIHLPG 312
>gi|150008305|ref|YP_001303048.1| mucin-desulfating sulfatase [Parabacteroides distasonis ATCC 8503]
gi|255015459|ref|ZP_05287585.1| mucin-desulfating sulfatase [Bacteroides sp. 2_1_7]
gi|256839409|ref|ZP_05544918.1| mucin-desulfating sulfatase [Parabacteroides sp. D13]
gi|410101513|ref|ZP_11296441.1| hypothetical protein HMPREF0999_00213 [Parabacteroides sp. D25]
gi|149936729|gb|ABR43426.1| mucin-desulfating sulfatase [Parabacteroides distasonis ATCC 8503]
gi|256738339|gb|EEU51664.1| mucin-desulfating sulfatase [Parabacteroides sp. D13]
gi|409239311|gb|EKN32095.1| hypothetical protein HMPREF0999_00213 [Parabacteroides sp. D25]
Length = 524
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 46/201 (22%), Positives = 91/201 (45%), Gaps = 36/201 (17%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
D++ + ++R+++++ + +P M+ F A H P D + +++ V +D+
Sbjct: 183 DIVTDKAIRWMKEADKD---RPFMMCCHFKATHEPWDFPERMRHLYDGVVFPEPENLFDW 239
Query: 134 AP------------------------NPDKQWIL--QVTRKMQPVHRQFT-----DLLMT 162
P +PDK W ++ + +HR L+
Sbjct: 240 GPETNGRTFSGQPLEELARRWDVASQDPDKWWCRYPELPFTTKGMHRSAARSAAYQKLVR 299
Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
L+ ++DD + K++K L D+G +NT ++Y SD GY LG+ G K +E +R+
Sbjct: 300 DYLRCGATIDDNIGKLLKALDDMGIADNTIVVYVSDQGYFLGEHGFFD-KRMMYEESLRM 358
Query: 223 PFLMRGPGIVPGTMYLLDVLI 243
PF++R P +P D+++
Sbjct: 359 PFVIRYPKEIPAGTRNKDMIL 379
>gi|149197078|ref|ZP_01874130.1| sulfatase 1 precursor [Lentisphaera araneosa HTCC2155]
gi|149139624|gb|EDM28025.1| sulfatase 1 precursor [Lentisphaera araneosa HTCC2155]
Length = 555
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 96/205 (46%), Gaps = 33/205 (16%)
Query: 40 EWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIML 99
E+ + N ++ IN+ ++ +D Y + +S++FL K+P +
Sbjct: 157 EYEEIAPNGNFHGEYINIKNKESLPEDDPKRIYS---LTKESLKFLDDHA---GKRPFYM 210
Query: 100 VMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN-PDKQWILQVTRKMQPVHRQFTD 158
++S A HGP ++P +V YD PN P+K R+ +R T
Sbjct: 211 MISHYAVHGPHVASP-------DVLKKWQARYDALPNKPEK------GREKDVFYRLHTP 257
Query: 159 LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGY---------HLGQFGLV 209
+ + ++ D + K++ LK+ G L+NTYII+TSD+G H L
Sbjct: 258 VFAS----MIEESDTHLGKLIDSLKEKGVLDNTYIIFTSDNGAEWAPRNEKNHRYNGPLT 313
Query: 210 KGKSFPFEFDIRVPFLMRGPGIVPG 234
+GK FPFE +R+PF++ GPGI G
Sbjct: 314 EGKYFPFEGGLRIPFVVAGPGIPAG 338
>gi|423332426|ref|ZP_17310210.1| hypothetical protein HMPREF1075_02223 [Parabacteroides distasonis
CL03T12C09]
gi|409229175|gb|EKN22055.1| hypothetical protein HMPREF1075_02223 [Parabacteroides distasonis
CL03T12C09]
Length = 524
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 46/201 (22%), Positives = 91/201 (45%), Gaps = 36/201 (17%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
D++ + ++R+++++ + +P M+ F A H P D + +++ V +D+
Sbjct: 183 DIVTDKAIRWMKEADKD---RPFMMCCHFKATHEPWDFPERMRHLYDGVVFPEPENLFDW 239
Query: 134 AP------------------------NPDKQWIL--QVTRKMQPVHRQFT-----DLLMT 162
P +PDK W ++ + +HR L+
Sbjct: 240 GPETNGRTFSGQPLEELARRWDVASQDPDKWWCRYPELPFTTKGMHRSAARSAAYQKLVR 299
Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
L+ ++DD + K++K L D+G +NT ++Y SD GY LG+ G K +E +R+
Sbjct: 300 DYLRCGATIDDNIGKLLKALDDMGIADNTIVVYVSDQGYFLGEHGFFD-KRMMYEESLRM 358
Query: 223 PFLMRGPGIVPGTMYLLDVLI 243
PF++R P +P D+++
Sbjct: 359 PFVIRYPKEIPAGTRNKDMIL 379
>gi|406835434|ref|ZP_11095028.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 523
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 86/201 (42%), Gaps = 26/201 (12%)
Query: 50 YYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
YY+ + NG + Y +D Y A Q K + KP L + + A HGP
Sbjct: 170 YYDQLLTFNGED-RETPGYSSDNYSQWAAE-----YIQGKHRTADKPWYLWLCYGAIHGP 223
Query: 110 EDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQ 169
A ++ + T+ P P PDK L TR P T LM + ++
Sbjct: 224 TTPAERHRGTLADKTAS-LPMDILGPWPDKPAYLNKTRAWMP-GPDGTPALMKRNRGAVE 281
Query: 170 -----------------SVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGK 212
+VD+ V +++ LK+ G+L NT +IYT+D G+ LG+ G + K
Sbjct: 282 PGKSLNAWVQQVNECNLAVDEGVGRVMTALKESGQLENTLVIYTADQGFGLGEHGFSQ-K 340
Query: 213 SFPFEFDIRVPFLMRGPGIVP 233
P++ + P ++ PG VP
Sbjct: 341 VAPYDATLSSPLIISWPGHVP 361
>gi|262384577|ref|ZP_06077711.1| mucin-desulfating sulfatase [Bacteroides sp. 2_1_33B]
gi|262293870|gb|EEY81804.1| mucin-desulfating sulfatase [Bacteroides sp. 2_1_33B]
Length = 524
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 46/201 (22%), Positives = 91/201 (45%), Gaps = 36/201 (17%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
D++ + ++R+++++ + +P M+ F A H P D + +++ V +D+
Sbjct: 183 DIVTDKAIRWMKEADKD---RPFMMCCHFKATHEPWDFPERMRHLYDGVVFPEPENLFDW 239
Query: 134 AP------------------------NPDKQWIL--QVTRKMQPVHRQFT-----DLLMT 162
P +PDK W ++ + +HR L+
Sbjct: 240 GPETNGRTFSGQPLEELARRWDVASQDPDKWWCRYPELPFTTKGMHRSAARSAAYQKLVR 299
Query: 163 KRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
L+ ++DD + K++K L D+G +NT ++Y SD GY LG+ G K +E +R+
Sbjct: 300 DYLRCGATIDDNIGKLLKALDDMGIADNTIVVYVSDQGYFLGEHGFFD-KRMMYEESLRM 358
Query: 223 PFLMRGPGIVPGTMYLLDVLI 243
PF++R P +P D+++
Sbjct: 359 PFVIRYPKEIPAGTRNKDMIL 379
>gi|374602783|ref|ZP_09675771.1| sulfatase [Paenibacillus dendritiformis C454]
gi|374391542|gb|EHQ62876.1| sulfatase [Paenibacillus dendritiformis C454]
Length = 459
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/179 (24%), Positives = 89/179 (49%), Gaps = 21/179 (11%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPA 130
+L +++ F+R+ + + P L +++ +PH P + YA+ + V + P
Sbjct: 169 ELFTDEAESFIRRYDR---EDPFFLYVAYTSPHDPRTAPEPYASHYEGDRVTVPGNFLPE 225
Query: 131 YDYAPNP---DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
+ + + + + R V R D + +D + ++++ L++ G
Sbjct: 226 HPFDNGEMHIRDEGLAGLPRTRDEVRRHIADYYAM-----ITHMDAQMGRLMQALEETGR 280
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYLLDV 241
+++T ++YTSDHG +GQ GL+ GK +E IR+P+LM GPG+ + G +Y +D+
Sbjct: 281 MDDTIVVYTSDHGLAVGQHGLM-GKQNLYEHSIRIPWLMAGPGVPAKGPIRGQVYQMDI 338
>gi|296123653|ref|YP_003631431.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296015993|gb|ADG69232.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 526
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 107/262 (40%), Gaps = 56/262 (21%)
Query: 18 SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
S GY + K++ P G+ W L YYN + NG+K++ Y D+I
Sbjct: 125 SQGYSTALIGKWHLISEPTGFDHWEILPGQGIYYNPPMIANGQKVQR-----EGYVTDII 179
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS-HHTPAYDYAPN 136
+ S+ +L+ +S KP +L+ APH + +P ++ FN PA + +
Sbjct: 180 TDRSIDWLKNRDKS---KPFLLMAQHKAPH--REWSPALRHLGFNKDKPFAEPATLFDQH 234
Query: 137 PDK-------------------------------------QWILQVTRKMQPVHRQFTDL 159
D+ ++ L + H Q DL
Sbjct: 235 KDRAQAVVDHDMGIDRTFTKLDAKLVPPPGINSTQLEEWNKYYLPRNNAFEAAHLQGQDL 294
Query: 160 L-------MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGK 212
+ M L +++VD++V ++++ L + G NT ++ +SD G++LG+ G K
Sbjct: 295 VRWRYQRYMHDYLACVKAVDESVGRLLQTLDEEGLAENTLVVVSSDQGFYLGEHGWFD-K 353
Query: 213 SFPFEFDIRVPFLMRGPGIVPG 234
+ FE +R P L R P +P
Sbjct: 354 RWIFEESLRTPLLARWPAAIPA 375
>gi|407773361|ref|ZP_11120662.1| choline-sulfatase [Thalassospira profundimaris WP0211]
gi|407283825|gb|EKF09353.1| choline-sulfatase [Thalassospira profundimaris WP0211]
Length = 504
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/174 (28%), Positives = 83/174 (47%), Gaps = 21/174 (12%)
Query: 73 YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD 132
Y D +A ++ + + ++P L +SF PH P + ++ +++ + D
Sbjct: 162 YDDEVAYNATHKIYDLSRGNDERPWCLTVSFTHPHDPYVARKKFWDLYEDCPELEP---D 218
Query: 133 YAP------NPDKQWILQVTR------KMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
AP +P Q +++ + K + V R + +DD V +I++
Sbjct: 219 VAPFDFEDQDPHSQRLMKASDYTAFDIKNEDVRRS-----RQAYFANISYIDDKVGQIME 273
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L+ +L+NT II+ SDHG LG+ GL SF FE RVP +M GPGI+PG
Sbjct: 274 VLETTRQLDNTIIIFVSDHGDMLGERGLWFKMSF-FEGSARVPLMMAGPGIMPG 326
>gi|251795725|ref|YP_003010456.1| sulfatase [Paenibacillus sp. JDR-2]
gi|247543351|gb|ACT00370.1| sulfatase [Paenibacillus sp. JDR-2]
Length = 509
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 66/151 (43%), Gaps = 5/151 (3%)
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
D + LRQ Q P +SFP PH P YA+M P P++
Sbjct: 187 DYLAHLRQENQ-----PFFTWLSFPDPHTPYQVPEPYASMIRPEDVPMPPVDSLEGKPER 241
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
+ + M Q + ++ +DD + KI + + L L NT II+TSDH
Sbjct: 242 VKVAHLMDAMDTADEQLIRQVRAIHYGMIRFIDDTLAKIFERMDALSLLENTVIIFTSDH 301
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
G +G G+++ +F ++ VPF+M PG
Sbjct: 302 GDSMGAHGIIQKHNFFYDSFTHVPFIMSLPG 332
>gi|87309127|ref|ZP_01091264.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Blastopirellula marina DSM 3645]
gi|87288118|gb|EAQ80015.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Blastopirellula marina DSM 3645]
Length = 493
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 105/235 (44%), Gaps = 34/235 (14%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
Y GK+ + PG+ + KY++ N NG+ K + YY D+ A
Sbjct: 116 AYIGKWHMGEDNDEPRPGFDYFVTHKGQGKYFDTEFNFNGQGRKVVDGYYTTVVTDM-AE 174
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHG---PEDSAPQYANMFFNVTSHH-TPAYDYAP 135
D + SKQ KP ML++ APH PE+ +Y + F + A+D
Sbjct: 175 DWI-----SKQD-GDKPWMLMLGHKAPHSFYLPEE---KYEHTFDQADIQYPKSAFDLED 225
Query: 136 NPDKQWILQVTRKMQPVH-------RQFTD----------LLMTKRLQTLQSVDDAVEKI 178
P +W + ++ + F D ++ T+ SVDD+V ++
Sbjct: 226 KP--EWFKKRLDTWHGIYGPLFDWRKNFPDESPAGVKDFARMVRAYWGTILSVDDSVGRL 283
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
LK+ GEL+NT II+TSD+G G+ G+V K E IR+P ++R PG+ P
Sbjct: 284 YDFLKERGELDNTLIIFTSDNGLLEGEHGMVD-KRTGHEPSIRIPLVVRYPGLTP 337
>gi|222147868|ref|YP_002548825.1| choline sulfatase [Agrobacterium vitis S4]
gi|221734856|gb|ACM35819.1| choline sulfatase [Agrobacterium vitis S4]
Length = 496
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 80/164 (48%), Gaps = 9/164 (5%)
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
+AN + L + +++P L +SF PH P + QY +++ + ++H P
Sbjct: 159 FLANQKLYHLSRENDDAARRPWCLTVSFTHPHDPYVARKQYWDLYED-SNHLLPDVGALA 217
Query: 136 NPD---KQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELNN 190
+ D K+ I V + D+ ++R + +DD V +++ L L+N
Sbjct: 218 DQDPHSKRLIHACDYDNFNVTEE--DIRRSRRAYFANISYIDDKVGELIDTLTRTRMLDN 275
Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
T I++ SDHG LG+ GL +F FE RVP ++ GPGI PG
Sbjct: 276 TTILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMVAGPGIAPG 318
>gi|343085191|ref|YP_004774486.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342353725|gb|AEL26255.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 478
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 91/184 (49%), Gaps = 29/184 (15%)
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
+Y D + N+S++ ++++ +K+P +L +SF + H P AP+ + F + + +
Sbjct: 194 EYLTDRLTNESLQLIKEN----NKEPFLLYLSFYSVHTPIQPAPKRYDYFLDKKENMIVS 249
Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGEL 188
+ A K +P T L+ + ++D+ V +I++ELK G
Sbjct: 250 EEQA-------------KYKPEGEGQTKLIQDNASYASMVAAMDENVGRILQELKAQGLD 296
Query: 189 NNTYIIYTSDHGYHLGQFG---------LVKGKSFPFEFDIRVPFLMRGPGIV-PGTMYL 238
+NT++I TSD+G +G L GK + +E IRVP L++GPGI PGT
Sbjct: 297 DNTWVILTSDNGGLSTLYGEGAPTSNGPLRAGKGWCYEGGIRVPMLIKGPGIKNPGTAPE 356
Query: 239 LDVL 242
L V+
Sbjct: 357 LPVI 360
>gi|441500082|ref|ZP_20982252.1| Choline-sulfatase [Fulvivirga imtechensis AK7]
gi|441436173|gb|ELR69547.1| Choline-sulfatase [Fulvivirga imtechensis AK7]
Length = 504
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/226 (24%), Positives = 102/226 (45%), Gaps = 26/226 (11%)
Query: 21 YFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAND 80
+FGK+ + + G+ W + YYN ++N++G ++++ + + Y DL+
Sbjct: 138 FFGKWHMGEHHANPRKGFDYWVSFKGQGVYYNPTLNIDGEEVEYTD---STYITDLLTEK 194
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPHGP-----------EDSAPQYANMFFNVTSHHTP 129
++ +L + K+ KP + +S H E + PQY F ++ T
Sbjct: 195 AIEWLGKRKKD---KPFFMYLSHKGVHSEFSPAKRHQDVYEGTKPQYPPTMFPASAD-TT 250
Query: 130 AYDYAPNPDKQWILQVTRKMQPV----HRQFT-DLLMTKRLQTLQSVDDAVEKIVKELKD 184
Y+Y PD W+ + V H Q D +TL S+D+++ ++ L++
Sbjct: 251 KYNYKDVPD--WVKKQRHSWHGVDYMYHGQINFDEFYENYNETLLSIDESIGDVLSYLEE 308
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
L +T + Y D+G+ G+ GL+ K +E IRVP L G G
Sbjct: 309 NDLLESTLVFYMGDNGFSFGEHGLID-KRHAYEESIRVPLLAYGGG 353
>gi|338212322|ref|YP_004656377.1| N-acetylglucosamine-6-sulfatase [Runella slithyformis DSM 19594]
gi|336306143|gb|AEI49245.1| N-acetylglucosamine-6-sulfatase [Runella slithyformis DSM 19594]
Length = 509
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 62/256 (24%), Positives = 114/256 (44%), Gaps = 42/256 (16%)
Query: 20 GYFGKYLNKY---NGSYIP-PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPD 75
GY +L K+ N P PG+ W + YY+ N+NG+++ H + Y D
Sbjct: 124 GYKTAFLGKWHMGNADDAPQPGFDYWLSFKGQGVYYSPIFNINGKQVTHKDGYTTD---- 179
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
L+ + ++ ++ + ++ KP L +S A H A ++A ++ ++ + PA Y
Sbjct: 180 LLTDYAIEWMGKQDKN---KPFFLYLSHKAVHADFQPAKRHAGVYKDMPIQY-PASMYLT 235
Query: 136 NPD--KQWILQV----TRKMQPVHRQFTDLLMTKR-----------------------LQ 166
D K W T +++ R + + +R +
Sbjct: 236 KSDTSKIWGKNTKDPETGEVKSNLRDMPNWVKNQRYSWHGVDYLYHGSISFNDFYRQYCE 295
Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
TL VD++V +++K L+D G NT +IY D+G+ G+ GL+ K +E +RVP L+
Sbjct: 296 TLLGVDESVGRVMKYLEDNGLAENTLVIYMGDNGFSFGERGLID-KRHAYEESMRVPMLV 354
Query: 227 RGPGIVPGTMYLLDVL 242
P ++ L +V+
Sbjct: 355 SCPSVIKPQTKLTNVI 370
>gi|260587313|ref|ZP_05853226.1| mucin-desulfating sulfatase [Blautia hansenii DSM 20583]
gi|260542180|gb|EEX22749.1| mucin-desulfating sulfatase [Blautia hansenii DSM 20583]
Length = 484
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 114/248 (45%), Gaps = 37/248 (14%)
Query: 4 PVELTSRNLLYFDI--SHGYFGKYLNKYN-GSYIPP--GWREWGAL-IMNSKYYNYSINM 57
P+ Y D+ +GY K++ G + P G+ +W + + Y++ I
Sbjct: 100 PISYLEGKTTYTDVLNENGYRCALAGKWHLGDSVCPQHGFSKWYTIGLGGCDYFHPDIVE 159
Query: 58 NGR-KIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP---EDSA 113
NG K+ H Y ++IAN ++ +L + + ++P L + F APH P E
Sbjct: 160 NGNIKVLH-----EQYVTEVIANKAIEYLNEFQHQ--EEPFYLSVHFTAPHSPWGEEQHP 212
Query: 114 PQYANMFFNVTSHHTPAYDYAPNPDKQW--ILQVTRKMQPVHRQFTDLLMTKRLQTLQSV 171
++ + + N P D A +PD + ++ + + F + ++
Sbjct: 213 KKWMDYYENCDFQSIP--DEADHPDLTTGPVFGTEKRKENLRGYFA---------AISAM 261
Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGK---SFPF---EFDIRVPFL 225
D+ + +I+ L+ G NT ++YT+D+G +G G V GK +FPF E ++VPFL
Sbjct: 262 DEQIGRILDTLEANGLRENTLVVYTADNGMSMGHHG-VWGKGNGTFPFNMYETSVKVPFL 320
Query: 226 MRGPGIVP 233
M PG++P
Sbjct: 321 MSLPGVIP 328
>gi|440719037|ref|ZP_20899471.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica SWK14]
gi|436435625|gb|ELP29454.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica SWK14]
Length = 735
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 70/133 (52%), Gaps = 7/133 (5%)
Query: 109 PEDSAPQYAN-MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL----LMTK 163
PE P+ N + + A+D A P+ Q + + + + T +
Sbjct: 264 PEQFLPRLGNGEYRRMNEEQKAAWDAAYEPENQKFIADMKAGKLSSKDITKWKYQRYIKD 323
Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
L T+Q+VDD+V +++ L + G NT +IY+SD G++LG+ G K + FE +R+P
Sbjct: 324 YLGTVQAVDDSVGELLAYLDESGLAENTIVIYSSDQGFYLGEHGWYD-KRWMFEESLRMP 382
Query: 224 FLMRGPGIV-PGT 235
FL+R PG++ PGT
Sbjct: 383 FLIRWPGVIDPGT 395
>gi|283781956|ref|YP_003372711.1| sulfatase [Pirellula staleyi DSM 6068]
gi|283440409|gb|ADB18851.1| sulfatase [Pirellula staleyi DSM 6068]
Length = 506
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/258 (23%), Positives = 109/258 (42%), Gaps = 55/258 (21%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GY + K++ P G+ W L YYN ++ ++H Y D+I +
Sbjct: 123 GYQTAVIGKWHLGSDPTGFDYWHILQGQGPYYNPAMKTPNGIVRH-----TGYTTDIITD 177
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT--------------- 124
+++ +L + + KP L+ APH AP+Y + + +VT
Sbjct: 178 EAIGWLEKGRDK--NKPFFLMYQHKAPHRNWQPAPRYLDKYKDVTIPEPTTLFDDYKGRG 235
Query: 125 -----SHHTPAYDYAP-----------NPDKQWILQVTRKMQPVHRQFTDLLMTKR---- 164
+ A D +P NP+++ I + + ++ F + +T
Sbjct: 236 TAARDQQMSIAKDLSPHDLKLVAQPGLNPEQKEIFE--KAYAEENKAFQEAKLTGDDKTR 293
Query: 165 ----------LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
L+ + +VD+ V ++++ L + GE NT +IYTSD G++LG+ G K +
Sbjct: 294 WMYQRYVKDYLRCVDAVDENVGRMLEWLDNSGEAANTLVIYTSDQGWYLGEHGWYD-KRW 352
Query: 215 PFEFDIRVPFLMRGPGIV 232
+E R P ++R PG V
Sbjct: 353 MYEESFRTPLIVRWPGQV 370
>gi|417306141|ref|ZP_12093068.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica WH47]
gi|327537574|gb|EGF24291.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica WH47]
Length = 747
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 70/133 (52%), Gaps = 7/133 (5%)
Query: 109 PEDSAPQYAN-MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL----LMTK 163
PE P+ N + + A+D A P+ Q + + + + T +
Sbjct: 276 PEQFLPRLGNGEYRRMNEEQKAAWDAAYEPENQKFIADMKAGKLSSKDITKWKYQRYIKD 335
Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
L T+Q+VDD+V +++ L + G NT +IY+SD G++LG+ G K + FE +R+P
Sbjct: 336 YLGTVQAVDDSVGELLAYLDESGLAENTIVIYSSDQGFYLGEHGWYD-KRWMFEESLRMP 394
Query: 224 FLMRGPGIV-PGT 235
FL+R PG++ PGT
Sbjct: 395 FLIRWPGVIDPGT 407
>gi|32471182|ref|NP_864175.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica SH 1]
gi|32396884|emb|CAD71852.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica SH 1]
Length = 747
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 70/133 (52%), Gaps = 7/133 (5%)
Query: 109 PEDSAPQYAN-MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL----LMTK 163
PE P+ N + + A+D A P+ Q + + + + T +
Sbjct: 276 PEQFLPRLGNGEYRRMNEEQKAAWDAAYEPENQKFIADMKAGKLSSKDITKWKYQRYIKD 335
Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
L T+Q+VDD+V +++ L + G NT +IY+SD G++LG+ G K + FE +R+P
Sbjct: 336 YLGTVQAVDDSVGELLAYLDESGLAENTIVIYSSDQGFYLGEHGWYD-KRWMFEESLRMP 394
Query: 224 FLMRGPGIV-PGT 235
FL+R PG++ PGT
Sbjct: 395 FLIRWPGVIEPGT 407
>gi|320106473|ref|YP_004182063.1| sulfatase [Terriglobus saanensis SP1PR4]
gi|319924994|gb|ADV82069.1| sulfatase [Terriglobus saanensis SP1PR4]
Length = 495
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 47/180 (26%), Positives = 81/180 (45%), Gaps = 33/180 (18%)
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF---------- 120
++ ++ A+ ++ L K + P + + F APH P + +Y +++
Sbjct: 200 EHSSEIYADAAIEHL--GKVAKRDAPFFMYVGFNAPHDPRQAPQEYLDLYPADKIAIPPN 257
Query: 121 ------FNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
F+ H T AP P Q +QV R+ + + +D
Sbjct: 258 YLPEHPFDQGEHKTRDELLAPFPRTQHDVQVHRR--------------EYYAIISHMDAQ 303
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+ +I LK G+ +NTYII T+DHG +G+ GL+ GK +E +R+P ++ GPGI G
Sbjct: 304 IGRIFDALKKSGKADNTYIILTADHGLSVGEHGLM-GKQNQYERSMRMPLILSGPGIKAG 362
>gi|237721982|ref|ZP_04552463.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448851|gb|EEO54642.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
Length = 517
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 80/162 (49%), Gaps = 7/162 (4%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
+ +P I N+++ +++Q K++ P + +SFP PH P Y +MF +
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231
Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T D A +K IL Q+ P Q + + ++ +DD ++++++ LK G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 291
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
+ NT + SDHG + G++GL++ + E +R+P + G
Sbjct: 292 QYENTIFVVLSDHGDYWGEYGLIRKGAGLSESLVRIPMVWAG 333
>gi|338210316|ref|YP_004654363.1| sulfatase [Runella slithyformis DSM 19594]
gi|336304129|gb|AEI47231.1| sulfatase [Runella slithyformis DSM 19594]
Length = 479
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 85/162 (52%), Gaps = 13/162 (8%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT--SHHTPAYD 132
D A +++FL KQ+ + KP + ++F APH P P+Y + V + P +
Sbjct: 169 DHFAETAIQFL--DKQT-ANKPFVTYVAFTAPHDPRSPLPEYLKRYGTVPLPPNFMPIHP 225
Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLL---MTKRLQTLQSVDDAVEKIVKELKDLGELN 189
+ D + V +M + + TD++ +T+ + +D+A+ KI+ +LK+ G
Sbjct: 226 FNFGSD----MTVRDEMLAGYPRTTDVIKSQLTEYYAMITHLDEAIGKILAKLKEKGLDK 281
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
NT I++ +D+G +G GL+ GK +E +RVP +M G GI
Sbjct: 282 NTIIVFAADNGLAMGSHGLL-GKQNLYEHSMRVPLIMSGKGI 322
>gi|449136746|ref|ZP_21772114.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula europaea 6C]
gi|448884630|gb|EMB15114.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula europaea 6C]
Length = 747
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 70/133 (52%), Gaps = 7/133 (5%)
Query: 109 PEDSAPQYAN-MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL----LMTK 163
PE P+ N + + A+D A P+ Q + + + + T +
Sbjct: 276 PEQFLPRLGNGEYRRMNDEQKAAWDAAYEPENQKFIADMKAGKLSSKDITKWKYQRYIKD 335
Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
L T+Q+VDD+V +++ L + G NT +IY+SD G++LG+ G K + FE +R+P
Sbjct: 336 YLGTVQAVDDSVGELLAYLDESGLAENTIVIYSSDQGFYLGEHGWYD-KRWMFEESLRMP 394
Query: 224 FLMRGPGIV-PGT 235
FL+R PG++ PGT
Sbjct: 395 FLIRWPGVIAPGT 407
>gi|149195789|ref|ZP_01872846.1| mucin-desulfating sulfatase [Lentisphaera araneosa HTCC2155]
gi|149141251|gb|EDM29647.1| mucin-desulfating sulfatase [Lentisphaera araneosa HTCC2155]
Length = 524
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 65/235 (27%), Positives = 108/235 (45%), Gaps = 19/235 (8%)
Query: 20 GYFGKYLNKYNGSYIPPGWRE-------WGALIMNSKYYNYSINMNGRKIK--HGEDY-Y 69
G FG +LN G + G E W + M Y + N R+ K +G+ +
Sbjct: 119 GKFGFHLNDEEGKALKGGATERVINSFDWWSGWMGQGSYQMNENREAREYKSVYGDKAEH 178
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
Y L+ D F++++K S KP L +SF APH P + P+Y ++ V
Sbjct: 179 TTYALGLMGRD---FIKKAKAS--GKPFCLSISFKAPHTPLATDPRYDALYEGVKFTKPA 233
Query: 130 AYDYAPN-PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
++ + P++ + K + + + D M K + +D A+ I++ELK+
Sbjct: 234 DFNEVDSLPEQAKSGRPMSKGKSWIKSYDDT-MKKYHTMIYGMDKAIAMIMEELKEQALD 292
Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY-LLDVL 242
+NT II+TSD+G+H G GL GK + +E P ++ P GT + +D L
Sbjct: 293 HNTIIIFTSDNGHHNGSKGL-GGKLYAYEVGSLAPAIIYDPRHDTGTQFKTIDAL 346
>gi|340616998|ref|YP_004735451.1| hypothetical protein zobellia_1003 [Zobellia galactanivorans]
gi|339731795|emb|CAZ95060.1| Pseudogene of sulfatase [Zobellia galactanivorans]
Length = 339
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 76/167 (45%), Gaps = 39/167 (23%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
Y D + ++ + F+R++ SKKP M+V+S H P + P F N +
Sbjct: 37 YVADALTDECIDFIRKN----SKKPFMVVLSHYLVHNPIQAKPALVKRFKNKATTDQDNP 92
Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
+YA L SVD+++ +I ELK LG NNT
Sbjct: 93 EYAA-------------------------------MLVSVDESIGRINTELKRLGIENNT 121
Query: 192 YIIYTSDHG---YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PG 234
+I+TSD+G + L+ GKS+ FE +RVP +++ P ++ PG
Sbjct: 122 LVIFTSDNGGLNNSTSNYPLLGGKSYSFEGAMRVPLIVKWPAMIKPG 168
>gi|423288164|ref|ZP_17267015.1| hypothetical protein HMPREF1069_02058 [Bacteroides ovatus
CL02T12C04]
gi|392671053|gb|EIY64529.1| hypothetical protein HMPREF1069_02058 [Bacteroides ovatus
CL02T12C04]
Length = 482
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 79/163 (48%), Gaps = 15/163 (9%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D + ++ L + K KP L++S P PH P+ + P Y +M+ N+ P +
Sbjct: 197 DFFTDKALEILERDKN----KPFCLMLSIPDPHTPDYARPPYHDMYKNMK----PEMPFT 248
Query: 135 PNPD-----KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
P+ W + + + F + + ++ +DD+V +I+K L+D L+
Sbjct: 249 MKPEYASQRPNWGMGGEKDKNEA-KSFDANALKQYFGMVKHIDDSVGRILKFLEDNNLLD 307
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
NT +++TSDHG + + K P+E R+PF++R PG +
Sbjct: 308 NTIVVFTSDHGDMFYEHKR-RNKGVPYEASSRIPFVIRYPGKI 349
>gi|430744285|ref|YP_007203414.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430016005|gb|AGA27719.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 475
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 54/216 (25%), Positives = 100/216 (46%), Gaps = 15/216 (6%)
Query: 18 SHGYFGKY-LNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
+ GY GK+ ++ G PG+ E + I KY + +NG ++ + + +D D
Sbjct: 135 TTGYVGKWHMDGQKGKR--PGFDESVSFIGQGKYVDCPFEVNG-ELTPTQGWVDDRSTDF 191
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
++ FL++++ KP LV+ F A HGP D + + F T PA +
Sbjct: 192 ----AIDFLKKNQ----AKPFALVVGFKAAHGPFDPPERARDRFAGETPR--PAANQGVR 241
Query: 137 PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
P + + + +VDD + +++ L +LG +T ++++
Sbjct: 242 PIFSKAAAAPAVAKKKKGATRPEGNIGYFRCVSAVDDNLGRLMTTLDELGLAEDTVLVFS 301
Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
SD+G++LG+ GL +S +E +RVP L+R P +
Sbjct: 302 SDNGFYLGEHGLGDKRS-AYEASLRVPLLLRYPRLA 336
>gi|149195819|ref|ZP_01872876.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Lentisphaera araneosa HTCC2155]
gi|149141281|gb|EDM29677.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Lentisphaera araneosa HTCC2155]
Length = 519
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 100/236 (42%), Gaps = 66/236 (27%)
Query: 62 IKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF 121
I G + Y D++ ++S+++L Q Q+ KP ++++ F +PH P P+Y NM+
Sbjct: 153 IDKGHEQIMGYSTDVVTDESIKWLDQRDQN---KPFLMMVQFKSPHVPRIPHPRYMNMYT 209
Query: 122 NVTSHHTPAYDYAPN-----------------------PDKQWILQVTRK-------MQP 151
+ YD N P K V +K M P
Sbjct: 210 EDVAEPATLYDNYQNRLKGASTAWMEINGQNEEVLAYFPPKNATEPVNKKQKKHLDRMTP 269
Query: 152 VHRQ---------------------FTDLLMTKRLQ----------TLQSVDDAVEKIVK 180
R+ F D + ++L+ +Q++DD V +++K
Sbjct: 270 EQRKALLDAMDNQNSEYYELKKAGAFKDPVKARKLKYQFFIKNYLRCVQAIDDNVGRLLK 329
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVPGT 235
L+D NT +IY+SD Y +G+ G + K + +E +++PF++R PG I PG+
Sbjct: 330 WLEDNELDENTIVIYSSDQSYFIGEHGWAE-KRWMYEEALKMPFVIRWPGKIKPGS 384
>gi|311748198|ref|ZP_07721983.1| choline sulfatase [Algoriphagus sp. PR1]
gi|311302763|gb|EAZ80936.2| choline sulfatase [Algoriphagus sp. PR1]
Length = 496
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/195 (23%), Positives = 86/195 (44%), Gaps = 21/195 (10%)
Query: 48 SKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH 107
+ +Y+ + G K GE + + A ++ F++ Q + +P ++F APH
Sbjct: 152 ADHYDLPLRDYGADGKLGEPTRKGFSTEQFAQAAIDFIKDHGQRNTDQPFFCYVAFTAPH 211
Query: 108 GPEDSAPQYANMF--------FNVTSHHTPAYDYAPNPDKQ---WILQVTRKMQPVHRQF 156
P Y N + N +H +D+ D+ W P +
Sbjct: 212 DPYSPEANYINHYPDGTLPLPGNYMPYHPFEFDHLTVRDENLTGW---------PRKPEV 262
Query: 157 TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
++++ + +D + KI+ LK+ G+ +NT I+Y +D+G G GL+ GK +
Sbjct: 263 IQMILSDYYALVTHLDTQIAKILNTLKETGQYDNTIIVYAADNGLAAGSHGLL-GKQSLY 321
Query: 217 EFDIRVPFLMRGPGI 231
E +VP +++GPG+
Sbjct: 322 EHSSKVPLIIKGPGV 336
>gi|229819795|ref|YP_002881321.1| arylsulfatase [Beutenbergia cavernae DSM 12333]
gi|229565708|gb|ACQ79559.1| sulfatase [Beutenbergia cavernae DSM 12333]
Length = 486
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 88/199 (44%), Gaps = 15/199 (7%)
Query: 44 LIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSF 103
L ++ ++++ I+ N + + + + + +A++S FLR+ + KP +L SF
Sbjct: 152 LGRDATFFDHGIDSNSWVARPWDKPEHLHPTNFVASESADFLRRRDPT---KPFLLFASF 208
Query: 104 PAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFT---DLL 160
APH P D + P D+A D W P T DLL
Sbjct: 209 NAPHPPFDPPAWAFEQYLETDMPDPPVGDWAEAFDP-W----ANSADPTALVGTIPPDLL 263
Query: 161 MTKRLQ---TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFE 217
R + VD + +++EL G +NT + + +DHG LG L + K FP+E
Sbjct: 264 QRARAGYYGHMTHVDQQINFLLEELSHRGLRDNTLVCFLADHGEMLGDHHLFR-KGFPYE 322
Query: 218 FDIRVPFLMRGPGIVPGTM 236
R+P ++ GPG+ G +
Sbjct: 323 GSARIPMILSGPGVPAGQV 341
>gi|225619372|ref|YP_002720598.1| sulfatase [Brachyspira hyodysenteriae WA1]
gi|225214191|gb|ACN82925.1| sulfatase [Brachyspira hyodysenteriae WA1]
Length = 474
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 76/149 (51%), Gaps = 14/149 (9%)
Query: 95 KPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKM----- 149
KP+ + +SF PH P D +Y +++ ++ + DK+ +T++
Sbjct: 196 KPLFMWVSFVDPHHPFDPIKKYYDIYKDIKPKELNS---KLKLDKKRPEHLTKQGDRGYW 252
Query: 150 ----QPVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHL 203
+ H ++ K+L + +D + +I+ +LK+ E +NT II+TSDHG +L
Sbjct: 253 PGGGEEHHYSQEEIKEIKKLYYGMISFIDSQIGRIIDKLKEKNEFDNTIIIFTSDHGEYL 312
Query: 204 GQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
G +GL+K F ++ I+VP L G GIV
Sbjct: 313 GDYGLLKKGPFMYDCLIKVPLLFYGKGIV 341
>gi|331270234|ref|YP_004396726.1| putative sulfatase [Clostridium botulinum BKT015925]
gi|329126784|gb|AEB76729.1| probable sulfatase [Clostridium botulinum BKT015925]
Length = 504
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 58/102 (56%), Gaps = 3/102 (2%)
Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
Y P DK W+ +Q D+ + + ++ +D + KI+ +L++LG +T
Sbjct: 275 YEPQ-DKNWLHGCHSHLQDKEELKKDIAIYYGMVSM--MDHYIGKILDKLEELGMAEDTI 331
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+++TSDHG+ GQ GL+ F +E I++PF++R PG+VP
Sbjct: 332 VVFTSDHGHFFGQHGLIAKGPFHYEDMIKIPFIVREPGVVPA 373
>gi|149197012|ref|ZP_01874065.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
HTCC2155]
gi|149140122|gb|EDM28522.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
HTCC2155]
Length = 463
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 52/201 (25%), Positives = 86/201 (42%), Gaps = 40/201 (19%)
Query: 37 GWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKP 96
G REWG Y+ + +M K G DY D + + +V F+ + + K P
Sbjct: 165 GGREWGQPKGRGIYF-HPFDMPNMTSKEG-----DYLTDRLTDYAVDFIEKHQ----KDP 214
Query: 97 IMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF 156
+L S+ HGP + P + K+W + PV+
Sbjct: 215 FLLYFSYYTVHGPLQAKPDLVEKY-----------------KKKWQSGKYKHKNPVYAAM 257
Query: 157 TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQF--GLVKGKSF 214
+QS+D++V +++ +L +LG +NT +I+T D+G + GL K+
Sbjct: 258 -----------VQSLDESVGRVLDKLDELGIADNTIVIFTGDNGAVGTNYCGGLKGAKAL 306
Query: 215 PFEFDIRVPFLMRGPGIVPGT 235
E +R PF ++GPGI PG
Sbjct: 307 SHEGGVREPFFIKGPGIEPGV 327
>gi|402075483|gb|EJT70954.1| hypothetical protein GGTG_11977 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 518
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 51/83 (61%), Gaps = 1/83 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M + L+T+QS+DD V +++ L+D G +NT ++YTSD G+ LG+ G K F +E
Sbjct: 295 MQRYLRTIQSIDDNVGRMLDWLEDEGIADNTLVVYTSDQGFFLGEHGWFD-KRFMYEESF 353
Query: 221 RVPFLMRGPGIVPGTMYLLDVLI 243
++PFL R P +V G D++
Sbjct: 354 QMPFLARFPKLVAGGSVCRDIVC 376
>gi|149197555|ref|ZP_01874606.1| N-acetylglucosamine-6-sulfatase precursor [Lentisphaera araneosa
HTCC2155]
gi|149139573|gb|EDM27975.1| N-acetylglucosamine-6-sulfatase precursor [Lentisphaera araneosa
HTCC2155]
Length = 515
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 117/280 (41%), Gaps = 66/280 (23%)
Query: 10 RNLLYFDISH----------GYFGKYLNKYNGSYIPPGWREWGALIM---NSKYYNYSIN 56
+N +YFD GY + K++ + +P G+ + + KYY+ +N
Sbjct: 90 KNDMYFDGRQITFPKLLRQAGYQTAVIGKWHLASLPTGFDHFEVITGYGGQGKYYHPVMN 149
Query: 57 MNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY 116
NG KH Y ++I ++ +L+ Q KP ML+M APH +P+Y
Sbjct: 150 RNGEPTKH-----RGYTTEVITKLNMEWLKN--QRDPNKPFMLMMQHKAPHRAWLPSPKY 202
Query: 117 ANMFFNV-----TSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFT--------DL---- 159
N F + + HT A + KQ ++ + M P + T DL
Sbjct: 203 MNAFKDKKFPKPANLHTDYQGKASHVKKQDMM-IKDSMNPGDLKLTPPKYLDGADLANWH 261
Query: 160 --------------LMTKRLQT-------------LQSVDDAVEKIVKELKDLGELNNTY 192
L K L++ +QS+DD++ +++ L + G NT
Sbjct: 262 KAYDEENAAFAKAKLSGKALRSWNYQRYIRDYVRCVQSIDDSIGEVLNYLDESGLAENTL 321
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
+IY+SD G+ LG+ G K F +E +R P +MR PG +
Sbjct: 322 LIYSSDQGFFLGEHGWFD-KRFMYEEALRTPLVMRWPGKI 360
>gi|325110420|ref|YP_004271488.1| Iduronate-2-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324970688|gb|ADY61466.1| Iduronate-2-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 469
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 82/181 (45%), Gaps = 14/181 (7%)
Query: 58 NGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA 117
NG + + D N YY +A ++ L + S +KP L + F PH P ++ +Y
Sbjct: 171 NGIECRDVPD--NAYYDGRVAEVAIDALDEI--SREEKPFFLAVGFWKPHTPFNAPKKYW 226
Query: 118 NMF----FNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDD 173
+++ V H TP D + L R ++ L L + +D
Sbjct: 227 DLYDRDQIPVPEHVTPPTDV-----PEVALTSARYKGGPDTEYLRELHHGHLAAITYLDA 281
Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
V K++ +L + G NNT +++ SDHG HLG+ GL + K+ FE D VP ++ P
Sbjct: 282 QVGKVLAKLDETGLRNNTIVVFWSDHGLHLGEHGLTR-KTTAFELDAGVPMIIATPEHTA 340
Query: 234 G 234
G
Sbjct: 341 G 341
>gi|149177554|ref|ZP_01856156.1| putative N-acetylglucosamine-6-sulfatase [Planctomyces maris DSM
8797]
gi|148843534|gb|EDL57895.1| putative N-acetylglucosamine-6-sulfatase [Planctomyces maris DSM
8797]
Length = 513
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 108/234 (46%), Gaps = 26/234 (11%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY------NYSINMNGRKIKHGEDYYNDYY 73
G+FGK+ + PG+ +W + YY +S+N++G+ + Y
Sbjct: 126 GFFGKWHMGGHSDDPRPGFDKWISFRGQGHYYPPRHLKKWSLNIDGKSVPQ-----KGYI 180
Query: 74 PDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-----SHHT 128
D + + ++ +L + + S KP + +S HG A ++A + + + +
Sbjct: 181 TDELTDYAIEWLDDTVKP-SGKPFFVYLSHKGVHGMFHPAERHAGRYKDKSMPIPKTMEN 239
Query: 129 PAYDYAPNPDKQWILQVTRKMQPV---HRQFTDLLMTKRL--QTLQSVDDAVEKIVKELK 183
+ +Y P W+ V + Q TD+ RL + L SVD+++ ++ K LK
Sbjct: 240 TSENYFNKP--MWLKNQRNSWHGVDFAYHQDTDIEEHYRLYCEALLSVDESIARVRKWLK 297
Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-VPGTM 236
D G NT ++Y D+G+ G+ GL+ K +E +RVP + PG+ PGT+
Sbjct: 298 DNGLAENTLVMYMGDNGFQWGEHGLID-KRTAYEASMRVPLVGVCPGLWKPGTV 350
>gi|332307886|ref|YP_004435737.1| sulfatase [Glaciecola sp. 4H-3-7+YE-5]
gi|332175215|gb|AEE24469.1| sulfatase [Glaciecola sp. 4H-3-7+YE-5]
Length = 554
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 62/255 (24%), Positives = 100/255 (39%), Gaps = 53/255 (20%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRK-IKHGEDYYNDYYPDLIA 78
GY + K++ P + + L Y++ G K E Y + D++
Sbjct: 141 GYETAIIGKWHLKAEPGAFDYYQVLESQGTYFDPEFRTRGAKPWPENETQYQGHSSDVVT 200
Query: 79 NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
+ S+ +L K S KP L+ F APH AP+Y + A + P PD
Sbjct: 201 DLSIEWL---KNRSSDKPFFLMHQFKAPHDMFQYAPRYEDFL---------AAETIPEPD 248
Query: 139 KQWILQVT---------------------------RKM------------QPVHRQFTDL 159
+ + T R M + RQ
Sbjct: 249 DLYGVAKTFGSIATRGKNDALRAEIGTSVSRRNNRRNMGIDLGVDPNLNEEEFTRQAYQK 308
Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
+ L+ ++ VDD V ++++ LKD G+ NT IIYTSD G LG+ L + K + ++
Sbjct: 309 YLKAYLRCVKGVDDNVARLIQTLKDAGQYENTIIIYTSDQGMMLGEHDL-QDKRWIYDES 367
Query: 220 IRVPFLMRGPGIVPG 234
IR+PF+++ P V
Sbjct: 368 IRMPFIVKHPDAVAA 382
>gi|343516047|ref|ZP_08753093.1| sulfatase [Vibrio sp. N418]
gi|342797271|gb|EGU32926.1| sulfatase [Vibrio sp. N418]
Length = 499
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 57/192 (29%), Positives = 96/192 (50%), Gaps = 37/192 (19%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
NDY ++A++S+++L + + P L + FP H P S +Y +M+ + +
Sbjct: 200 NDYAGGVVADESIQWLDTINK---RDPFFLYVPFPEVHTPIASPKKYLDMYSDYIT---- 252
Query: 130 AYDYAP-NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
D+A NPD + K QP Q + + +D+ V +IV +LK++GE
Sbjct: 253 --DFAKQNPD---LYHWDWKGQPYRGQ------GEYYANISYMDEQVGRIVSKLKEIGEY 301
Query: 189 NNTYIIYTSDHG---------YHL---GQFGLVKG-KSFPFEFDIRVPFLMRGPGIVPGT 235
+NT I+++SD+G + L G+ G ++G K FE IRVP +M+G GI T
Sbjct: 302 DNTIILFSSDNGPVTREARKPWELNMAGETGGLRGRKDNLFEGGIRVPLIMKGSGITQNT 361
Query: 236 -----MYLLDVL 242
+Y LD++
Sbjct: 362 QSHEPVYGLDIV 373
>gi|338212053|ref|YP_004656106.1| sulfatase [Runella slithyformis DSM 19594]
gi|336305872|gb|AEI48974.1| sulfatase [Runella slithyformis DSM 19594]
Length = 528
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 62/238 (26%), Positives = 101/238 (42%), Gaps = 39/238 (16%)
Query: 19 HGYFGKYLNKYN------GSYIPP----GWREWGALIMNSKYYN---YSINMNGRKIKHG 65
HGY ++ K++ +YIPP G+ W AL Y N Y+ + N + + G
Sbjct: 161 HGYTTGFVGKWHIDGHGRNTYIPPTRQQGFGYWKALECTHDYNNSAYYAGDSNKKLVWKG 220
Query: 66 EDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS 125
D +IA + ++ P M+ +S +PH P +AP+ F
Sbjct: 221 YD--------VIAQTDDVCNYMTAKAKEPNPFMVFISIGSPHDPYQTAPEKYRKLF---- 268
Query: 126 HHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
DK ++ ++ + DL + ++DD + K+ + L DL
Sbjct: 269 -----------ADKDIMINDNVPVEKREKVKNDL--RGYYSHIAAIDDCIGKMWQTLIDL 315
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
G +NT I+++SDHG LG G K P+E IRVPFL+ P + T + L+
Sbjct: 316 GIEDNTIIVFSSDHGDLLGAHGSWN-KQQPYEESIRVPFLLHYPKLFGKTGKTSEALL 372
>gi|410647586|ref|ZP_11358010.1| mucin-desulfating sulfatase [Glaciecola agarilytica NO2]
gi|410132848|dbj|GAC06409.1| mucin-desulfating sulfatase [Glaciecola agarilytica NO2]
Length = 554
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 62/255 (24%), Positives = 100/255 (39%), Gaps = 53/255 (20%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRK-IKHGEDYYNDYYPDLIA 78
GY + K++ P + + L Y++ G K E Y + D++
Sbjct: 141 GYETAIIGKWHLKAEPGAFDYYQVLESQGTYFDPEFRTRGAKPWPENETQYQGHSSDVVT 200
Query: 79 NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
+ S+ +L K S KP L+ F APH AP+Y + A + P PD
Sbjct: 201 DLSIEWL---KNRSSDKPFFLMHQFKAPHDMFQYAPRYEDFL---------AAETIPEPD 248
Query: 139 KQWILQVT---------------------------RKM------------QPVHRQFTDL 159
+ + T R M + RQ
Sbjct: 249 DLYGVAKTFGSIATRGKNDALRAEIGTSVSRRNNRRNMGIDLGVDPNLNEEEFTRQAYQK 308
Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
+ L+ ++ VDD V ++++ LKD G+ NT IIYTSD G LG+ L + K + ++
Sbjct: 309 YLKAYLRCVKGVDDNVARLIQTLKDAGQYENTIIIYTSDQGMMLGEHDL-QDKRWIYDES 367
Query: 220 IRVPFLMRGPGIVPG 234
IR+PF+++ P V
Sbjct: 368 IRMPFIVKHPDAVAA 382
>gi|311109197|ref|YP_003982050.1| sulfatase family protein 12 [Achromobacter xylosoxidans A8]
gi|310763886|gb|ADP19335.1| sulfatase family protein 12 [Achromobacter xylosoxidans A8]
Length = 508
Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 76/175 (43%), Gaps = 21/175 (12%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPAYD 132
I + ++ +LR+ + +P L SFP PH P D ++ + ++ H Y+
Sbjct: 205 IGDRTIDYLREHRD----QPFCLWASFPDPHHPFDCPEPWSRLHHPDEVDLPRHRATDYE 260
Query: 133 YAP-------------NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
P N + Q + Q +M + ++ + VD V +I+
Sbjct: 261 RRPWWHRASMESKPMGNAEVQAVRQNFSRMPTQSDEALRRIIANYYGMISLVDHQVGRIM 320
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
K L + G +T +I+TSDHG LG GL+ P+E +RV + GPGI G
Sbjct: 321 KALSEYGLDQDTIVIFTSDHGEWLGDHGLMLKGPMPYEGVLRVAMVASGPGIAAG 375
>gi|298482548|ref|ZP_07000733.1| sulfatase [Bacteroides sp. D22]
gi|298271255|gb|EFI12831.1| sulfatase [Bacteroides sp. D22]
Length = 517
Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 7/162 (4%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
+ +P I N+++ +++Q K++ P + +SFP PH P Y +MF +
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231
Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T D A +K IL Q+ P Q + + ++ +DD ++++++ LK G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDMPRIRANYIGMIRLIDDQIKRLIESLKASG 291
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
+ NT + SDHG + G++GL++ + E R+P + G
Sbjct: 292 QYENTIFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333
>gi|196232752|ref|ZP_03131603.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196223212|gb|EDY17731.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 496
Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 60/234 (25%), Positives = 103/234 (44%), Gaps = 26/234 (11%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
Y GK+ + PG+ + KY++ + N+NG G YY ++ +
Sbjct: 121 AYMGKWHMGEDNDAPRPGFDFFATHKGQGKYWDTAWNINGA----GSKVIPGYYTTIVTD 176
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
++ +L K+ KP L + APH +YA++F NV + P +
Sbjct: 177 MALDWL---KKDHGGKPWALCIGHKAPHSFYTPEEKYAHVFDNVRVPY-PESAFHLEDKP 232
Query: 140 QWILQVT-------------RKMQPVHRQFT----DLLMTKRLQTLQSVDDAVEKIVKEL 182
W+ Q RK P R + ++ T+ SVDD+V +++K L
Sbjct: 233 TWMKQRLYTWHGIYGPLFEWRKKFPDDRPEAVKDFENMVHGYWGTILSVDDSVGRLLKYL 292
Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+D +L+NT I++ D+G G+ G+V K E +R+P L+R PG+ G +
Sbjct: 293 EDTKQLDNTIIVFMGDNGLLEGEHGMVD-KRTAHEPSMRIPMLVRYPGLAKGKV 345
>gi|168334519|ref|ZP_02692680.1| sulfatase [Epulopiscium sp. 'N.t. morphotype B']
Length = 510
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 83/175 (47%), Gaps = 25/175 (14%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT----- 124
DYY + IAN++V+F+ K +P+ +SF PH P D AP + +T
Sbjct: 155 EDYYEEWIANNAVKFIEDHKD----EPLFAWVSFQGPHQPYD-APAENGISTALTPPDPI 209
Query: 125 ------SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
S + P Y A N IL + +R+ + + +Q +D V KI
Sbjct: 210 MLDFEPSCNVPKYRRASNISTPSILGMK-----AYRE----RRARYAEGVQIIDYEVGKI 260
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
+ +LK+LG +T II+++DHG LG + + + + P++ + +P ++ +P
Sbjct: 261 IAKLKELGIYKDTVIIFSTDHGCMLGDYDMYEKGAMPYKAQLEIPMIVVDADALP 315
>gi|423291167|ref|ZP_17270015.1| hypothetical protein HMPREF1069_05058 [Bacteroides ovatus
CL02T12C04]
gi|392664201|gb|EIY57743.1| hypothetical protein HMPREF1069_05058 [Bacteroides ovatus
CL02T12C04]
Length = 517
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 7/162 (4%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
+ +P I N+++ +++Q K++ P + +SFP PH P Y +MF +
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231
Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T D A +K IL Q+ P Q + + ++ +DD ++++++ LK G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 291
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
+ NT + SDHG + G++GL++ + E R+P + G
Sbjct: 292 QYENTLFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333
>gi|293368572|ref|ZP_06615180.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
gi|292636369|gb|EFF54853.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
Length = 517
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 7/162 (4%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
+ +P I N+++ +++Q K++ P + +SFP PH P Y +MF +
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231
Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T D A +K IL Q+ P Q + + ++ +DD ++++++ LK G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDMPRIRANYIGMIRLIDDQIKRLIESLKASG 291
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
+ NT + SDHG + G++GL++ + E R+P + G
Sbjct: 292 QYENTIFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333
>gi|421610218|ref|ZP_16051399.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica SH28]
gi|408499015|gb|EKK03493.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica SH28]
Length = 735
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 69/133 (51%), Gaps = 7/133 (5%)
Query: 109 PEDSAPQYAN-MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL----LMTK 163
PE P+ N + + A+D A P+ Q + + + + T +
Sbjct: 264 PEQFLPRLGNGEYRRMNEEQKAAWDAAYEPENQKFIADMKAGKLSSKDITKWKYQRYIKD 323
Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
L T+Q+VDD+V +++ L + G NT +IY+SD G++LG+ G K + FE +R+P
Sbjct: 324 YLGTVQAVDDSVGELLAYLDESGLAENTIVIYSSDQGFYLGEHGWYD-KRWMFEESLRMP 382
Query: 224 FLMRGPGIV-PGT 235
FL+R PG + PGT
Sbjct: 383 FLIRWPGAIDPGT 395
>gi|423293725|ref|ZP_17271852.1| hypothetical protein HMPREF1070_00517 [Bacteroides ovatus
CL03T12C18]
gi|392677683|gb|EIY71099.1| hypothetical protein HMPREF1070_00517 [Bacteroides ovatus
CL03T12C18]
Length = 517
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 7/162 (4%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
+ +P I N+++ +++Q K++ P + +SFP PH P Y +MF +
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231
Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T D A +K IL Q+ P Q + + ++ +DD ++++++ LK G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 291
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
+ NT + SDHG + G++GL++ + E R+P + G
Sbjct: 292 QYENTLFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333
>gi|160884808|ref|ZP_02065811.1| hypothetical protein BACOVA_02798 [Bacteroides ovatus ATCC 8483]
gi|156109843|gb|EDO11588.1| arylsulfatase [Bacteroides ovatus ATCC 8483]
gi|295085766|emb|CBK67289.1| Arylsulfatase A and related enzymes [Bacteroides xylanisolvens
XB1A]
Length = 517
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 7/162 (4%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
+ +P I N+++ +++Q K++ P + +SFP PH P Y +MF +
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231
Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T D A +K IL Q+ P Q + + ++ +DD ++++++ LK G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 291
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
+ NT + SDHG + G++GL++ + E R+P + G
Sbjct: 292 QYENTLFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333
>gi|423214424|ref|ZP_17200952.1| hypothetical protein HMPREF1074_02484 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692839|gb|EIY86075.1| hypothetical protein HMPREF1074_02484 [Bacteroides xylanisolvens
CL03T12C04]
Length = 517
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 7/162 (4%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
+ +P I N+++ +++Q K++ P + +SFP PH P Y +MF +
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231
Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T D A +K IL Q+ P Q + + ++ +DD ++++++ LK G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 291
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
+ NT + SDHG + G++GL++ + E R+P + G
Sbjct: 292 QYENTIFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333
>gi|336412903|ref|ZP_08593256.1| hypothetical protein HMPREF1017_00364 [Bacteroides ovatus
3_8_47FAA]
gi|335942949|gb|EGN04791.1| hypothetical protein HMPREF1017_00364 [Bacteroides ovatus
3_8_47FAA]
Length = 517
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 7/162 (4%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
+ +P I N+++ +++Q K++ P + +SFP PH P Y +MF +
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231
Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T D A +K IL Q+ P Q + + ++ +DD ++++++ LK G
Sbjct: 232 TSRKDLAKKGEKYRILAQLEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 291
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
+ NT + SDHG + G++GL++ + E R+P + G
Sbjct: 292 QYENTLFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333
>gi|320102664|ref|YP_004178255.1| sulfatase [Isosphaera pallida ATCC 43644]
gi|319749946|gb|ADV61706.1| sulfatase [Isosphaera pallida ATCC 43644]
Length = 1074
Score = 63.9 bits (154), Expect = 8e-08, Method: Composition-based stats.
Identities = 64/262 (24%), Positives = 113/262 (43%), Gaps = 54/262 (20%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYS-INMNGRKIKHGEDYYNDYYPDLIA 78
GY K++ P G+ +W L YYN + G+ K G Y DLI
Sbjct: 212 GYQTALFGKWHLVSEPTGFDDWEILPGQGHYYNPDFLTPTGKITKTG------YVTDLIT 265
Query: 79 NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY--DYAP- 135
++++ QS+ +P + ++ APH + A ++ + P DYA
Sbjct: 266 ERALQWFTQSRDP--SRPFLWMVHHKAPHREWEPALRHLGHDNDRVYPEPPTLFDDYAGR 323
Query: 136 -NPDKQWILQVTRKM-------------------------QPVHRQF-------TDLL-- 160
+++ + + R M +P +R F DL+
Sbjct: 324 GKAEREQDMTIARTMNERDLKLVPPARLNAEQRRVWDAYYEPRNRAFRQAKLEGNDLVRW 383
Query: 161 -----MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
M L +++VD++V+++++ L+ G +NT ++Y SD G++LG+ G K F
Sbjct: 384 KYNRYMHDYLGCVKAVDESVQRLLEALEAAGVADNTIVVYCSDQGFYLGEHGWFD-KRFI 442
Query: 216 FEFDIRVPFLMRGPGIV-PGTM 236
FE +R P L+R PG+V PG++
Sbjct: 443 FEESLRTPLLIRWPGVVAPGSV 464
Score = 55.1 bits (131), Expect = 4e-05, Method: Composition-based stats.
Identities = 42/163 (25%), Positives = 75/163 (46%), Gaps = 7/163 (4%)
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
D+ +A++++R L Q + +P+ L + F PH P + +Y +++ T TP
Sbjct: 796 DHTDGKVASEAIRILNQRQD----QPLFLAVGFYRPHTPYVAPQKYFDLYPFETVSLTPR 851
Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKELKDLGEL 188
+ P TR K+ ++ VD + +++ + LG
Sbjct: 852 PNMIPEDVPAAAFASTRPWPSFGITPEQERRCKQAYWASISFVDAQLGRVLAAIDALGLR 911
Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
+ T +++ SDHGYHLG+ GL +S FE RVP ++ G G+
Sbjct: 912 DKTIVVFLSDHGYHLGEHGLWLKQS-NFEESARVPLVIAGYGV 953
>gi|149176052|ref|ZP_01854669.1| putative sulfatase [Planctomyces maris DSM 8797]
gi|148845206|gb|EDL59552.1| putative sulfatase [Planctomyces maris DSM 8797]
Length = 470
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 65/236 (27%), Positives = 99/236 (41%), Gaps = 42/236 (17%)
Query: 20 GYFGKYLNKYNG--SYIPPGWRE----WGALIMNSKYYN--YSINMNGRKIKHGEDYYND 71
G+ GK+ G ++ PP R+ W AL Y N Y + R+ G Y+
Sbjct: 118 GFIGKWHLDGRGRTAFTPPERRQGFEFWRALECTHNYNNSFYYADSPERRTWEG---YDA 174
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
+ +A +R QS + +P +LVMS+ +PH P +AP P Y
Sbjct: 175 FAQTRVARQFIR-----NQSKAGQPFLLVMSYGSPHNPYHTAP--------------PEY 215
Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIVKELKDLGE 187
P+K K++P + K L ++DD V ++ LK+ G
Sbjct: 216 QSMYEPEKI-------KLRPNVPKDQQAAAKKELAGYYAHCSALDDCVGDLLATLKETGV 268
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
+NT I++TSDHG L G ++ K P++ +RVP L R G LD I
Sbjct: 269 DDNTIIVFTSDHGDMLHSHGQIR-KQKPWDESLRVPMLFRLNGAEHAKGRTLDTPI 323
>gi|54288350|gb|AAV31638.1| probable sulfatase [uncultured alpha proteobacterium EBAC2C11]
Length = 512
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 80/177 (45%), Gaps = 23/177 (12%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+A+ S+ +L ++S P L +SFP PH P D P+ N+ N P +
Sbjct: 207 VADRSIHWLSNRRES---NPFCLWISFPDPHHPFD-CPEPWNLLHNPEDVDLPKFLEKDL 262
Query: 137 PDKQW---------------ILQVTRKM---QPVHRQFTDLLMTKRLQTLQS-VDDAVEK 177
D+ W +L+ RK P + MT + S +D V +
Sbjct: 263 NDRPWWHRRSLESEPDLSDPVLKRFRKQGSRMPDQSEAQLREMTANYYGMISLIDHNVGR 322
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
++ L++ G L+ T IIYTSDHG H+G+ GL ++ I V ++RGPG+ G
Sbjct: 323 VIACLREKGILDETIIIYTSDHGDHMGERGLYLKGPMLYDSLINVGMIVRGPGVAAG 379
>gi|149196160|ref|ZP_01873216.1| sulfatase family protein [Lentisphaera araneosa HTCC2155]
gi|149141007|gb|EDM29404.1| sulfatase family protein [Lentisphaera araneosa HTCC2155]
Length = 504
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 51/167 (30%), Positives = 77/167 (46%), Gaps = 13/167 (7%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
Y +IA + VR +++ Q KKP + + F PH P ++ +Y +++ F + +
Sbjct: 202 YEDGMIAREGVRLIKELGQK--KKPFFIAVGFKKPHLPFNAPKKYWDLYKETDFALEKYQ 259
Query: 128 TPAYDYAPNPDKQ--WILQ---VTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKEL 182
P AP+ Q W V R V F L + VD + K++K L
Sbjct: 260 KPVQG-APHYAYQNSWEFSGYNVPRINGEVLESFQRKLKHAYAACISYVDAQIAKLLKTL 318
Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
KD G NT I++ SDHG+HLG G+ K +E RVPF + P
Sbjct: 319 KDQGLEKNTVIVFWSDHGFHLGDHGMW-CKHSNYEQATRVPFFVYDP 364
>gi|410639714|ref|ZP_11350259.1| mucin-desulfating sulfatase [Glaciecola chathamensis S18K6]
gi|410140595|dbj|GAC08446.1| mucin-desulfating sulfatase [Glaciecola chathamensis S18K6]
Length = 499
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 62/255 (24%), Positives = 100/255 (39%), Gaps = 53/255 (20%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRK-IKHGEDYYNDYYPDLIA 78
GY + K++ P + + L Y++ G K E Y + D++
Sbjct: 86 GYETAIIGKWHLKAEPGAFDYYQVLESQGTYFDPEFRTRGAKPWPENETQYQGHSSDVVT 145
Query: 79 NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
+ S+ +L K S KP L+ F APH AP+Y + A + P PD
Sbjct: 146 DLSIEWL---KNRSSDKPFFLMHQFKAPHDMFQYAPRYEDFL---------AAETIPEPD 193
Query: 139 KQWILQVT---------------------------RKM------------QPVHRQFTDL 159
+ + T R M + RQ
Sbjct: 194 DLYGVAKTFGSIATRGKNDALRAEIGTSVSRRNNRRNMGIDLGVDPNLNEEEFTRQAYQK 253
Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
+ L+ ++ VDD V ++++ LKD G+ NT IIYTSD G LG+ L + K + ++
Sbjct: 254 YLKAYLRCVKGVDDNVARLIQTLKDAGQYENTIIIYTSDQGMMLGEHDL-QDKRWIYDES 312
Query: 220 IRVPFLMRGPGIVPG 234
IR+PF+++ P V
Sbjct: 313 IRMPFIVKHPDAVAA 327
>gi|15964702|ref|NP_385055.1| choline sulfatase [Sinorhizobium meliloti 1021]
gi|334315413|ref|YP_004548032.1| choline-sulfatase [Sinorhizobium meliloti AK83]
gi|384528657|ref|YP_005712745.1| choline-sulfatase [Sinorhizobium meliloti BL225C]
gi|384535063|ref|YP_005719148.1| choline sulfatase [Sinorhizobium meliloti SM11]
gi|433612715|ref|YP_007189513.1| choline-sulfatase [Sinorhizobium meliloti GR4]
gi|7387556|sp|O69787.2|BETC_RHIME RecName: Full=Choline-sulfatase
gi|15073880|emb|CAC45521.1| Choline sulfatase [Sinorhizobium meliloti 1021]
gi|333810833|gb|AEG03502.1| choline-sulfatase [Sinorhizobium meliloti BL225C]
gi|334094407|gb|AEG52418.1| choline-sulfatase [Sinorhizobium meliloti AK83]
gi|336031955|gb|AEH77887.1| choline sulfatase [Sinorhizobium meliloti SM11]
gi|429550905|gb|AGA05914.1| choline-sulfatase [Sinorhizobium meliloti GR4]
Length = 512
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 81/165 (49%), Gaps = 9/165 (5%)
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
+AN + L + S++P L +SF PH P + ++ +++ + H TP P
Sbjct: 170 FLANQKLYQLSRENDDESRRPWCLTVSFTHPHDPYVARRKFWDLYEDC-EHLTPEVGAIP 228
Query: 136 ----NPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELN 189
+P Q I+ ++ Q ++ ++R + +D+ V +++ L L+
Sbjct: 229 LDEQDPHSQRIM-LSCDYQNFDVTEENVRRSRRAYFANISYLDEKVGELIDTLTRTRMLD 287
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+T I++ SDHG LG+ GL +F FE RVP ++ GPGI PG
Sbjct: 288 DTLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGPGIAPG 331
>gi|388567294|ref|ZP_10153729.1| putative sulfatase YidJ [Hydrogenophaga sp. PBC]
gi|388265505|gb|EIK91060.1| putative sulfatase YidJ [Hydrogenophaga sp. PBC]
Length = 517
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 80/194 (41%), Gaps = 32/194 (16%)
Query: 72 YYP-DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF--------FN 122
YYP I + +V FLR ++ +P +SFP PH P +Y ++
Sbjct: 209 YYPTSYIRDRAVDFLRTTRGQ--AQPFFAFVSFPDPHHPFTPPGRYWGLYDPADFPVRLP 266
Query: 123 VTSHHTPAYDYAPNPDKQWI----LQVTRKMQPVHRQF------TDLLMTKRLQTLQSVD 172
+H P P +W L +R Q H F M + +D
Sbjct: 267 FEAHRNPP------PPIRWCRERFLDGSRARQGPHDSFFATEREIREAMALSAGMITMID 320
Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL-----MR 227
DAV IV+ L++ G+ NT I++TSDHG +LG F LV + RVPF+
Sbjct: 321 DAVGDIVRALQESGQAENTVIVFTSDHGDYLGDFNLVLKGALASRGITRVPFIWCDPRQA 380
Query: 228 GPGIVPGTMYLLDV 241
P + P +D+
Sbjct: 381 APAVSPALASTIDI 394
>gi|359408276|ref|ZP_09200747.1| arylsulfatase A family protein [SAR116 cluster alpha
proteobacterium HIMB100]
gi|356676686|gb|EHI49036.1| arylsulfatase A family protein [SAR116 cluster alpha
proteobacterium HIMB100]
Length = 509
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 45/181 (24%), Positives = 83/181 (45%), Gaps = 27/181 (14%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPAYD 132
A+ ++ +L++ Q KP +SFP PH P D+ ++++ ++ H ++
Sbjct: 205 TADRTINWLKEQAQQ--DKPFCAWVSFPDPHHPFDAPEPWSHLHKPENVDLPEHRQRTFE 262
Query: 133 YAPNPDKQWILQV-------TRKMQPVHRQFTDL----------LMTKRLQTLQSVDDAV 175
P W QV T++ + ++ + ++ + VD +
Sbjct: 263 GRP----WWHEQVLTAEPAGTKENAETRKAYSRISHQSDEQLREIIANTYGQIALVDHQL 318
Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
+I+ L++LG NT I+Y SDHG LG GL+ +E +RVP +M+GPG+ G
Sbjct: 319 GRIMASLEELGLAENTLIVYASDHGDWLGDHGLILKGPMHYEGLLRVPLIMKGPGVPAGQ 378
Query: 236 M 236
+
Sbjct: 379 V 379
>gi|3046314|gb|AAC13371.1| choline sulfatase [Sinorhizobium meliloti]
Length = 512
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 81/165 (49%), Gaps = 9/165 (5%)
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
+AN + L + S++P L +SF PH P + ++ +++ + H TP P
Sbjct: 170 FLANQKLYQLSRENDDESRRPWCLTVSFTHPHDPYVARRKFWDLYEDC-EHLTPEVGAIP 228
Query: 136 ----NPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELN 189
+P Q I+ ++ Q ++ ++R + +D+ V +++ L L+
Sbjct: 229 LDEQDPHSQRIM-LSCDYQNFDVTEENVRRSRRAYFANISYLDEKVGELIDTLTRTRMLD 287
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+T I++ SDHG LG+ GL +F FE RVP ++ GPGI PG
Sbjct: 288 DTLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGPGIAPG 331
>gi|149199420|ref|ZP_01876456.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
gi|149137498|gb|EDM25915.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
Length = 476
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 81/173 (46%), Gaps = 23/173 (13%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ +++ N V F+ ++ + KP + ++F A H P + ++ +M+ V +
Sbjct: 172 HWSEVVGNHGVDFINEAAKD--DKPFFMYIAFNATHDPRQAPKEFIDMYPLDSIEVPKSY 229
Query: 128 TPAYDYAPNPDKQWILQVTRKMQP---------VHRQFTDLLMTKRLQTLQSVDDAVEKI 178
Y Y + ++ P VHRQ L T + + + DA+EK
Sbjct: 230 LSNYPYDEEMGCNPVGLRDERLMPSPRTEFAVKVHRQEYFALATHMDREIGRIFDALEKS 289
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
G +NTY+I+T+DHG +G GLV GK ++ ++VPF + GPGI
Sbjct: 290 -------GMKDNTYVIFTADHGLSVGHHGLV-GKQNMYDHSMQVPFFIAGPGI 334
>gi|410456454|ref|ZP_11310315.1| arylsulfatase [Bacillus bataviensis LMG 21833]
gi|409928123|gb|EKN65246.1| arylsulfatase [Bacillus bataviensis LMG 21833]
Length = 486
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 79/162 (48%), Gaps = 19/162 (11%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP- 135
+ +S+ FLR+ + KP L MSF PH P D + +M+ ++ P D+A
Sbjct: 186 VVTESIDFLRRRDPT---KPFFLKMSFVRPHPPFDPPQVFFDMYKDLELPDAPVGDWAER 242
Query: 136 -NPDKQWILQVTRK-MQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIVKELKDLGEL 188
+P K +T K + P +R KR Q + +D + + + +++ G L
Sbjct: 243 EDPKKSGYNPITSKGIVPKNR-------LKRAQAAYYALITHIDYQIGRFLMAMEEHGVL 295
Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
NNT I++ SDHG LG L + KS +E +VPF++ PG
Sbjct: 296 NNTVILFCSDHGELLGDHNLFR-KSLAYEGSAKVPFILADPG 336
>gi|288870307|ref|ZP_06113643.2| arylsulfatase [Clostridium hathewayi DSM 13479]
gi|288867673|gb|EFC99971.1| arylsulfatase [Clostridium hathewayi DSM 13479]
Length = 535
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 81/160 (50%), Gaps = 21/160 (13%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFN-----VTSHHTPAY 131
+A SV L K++ K+P L SF APH P D ++A+++ + TP
Sbjct: 189 VAKRSVYHL---KENGGKRPFFLWSSFIAPHPPFDVPEKWADLYKGKELPPLKESKTPIS 245
Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKELKDLGELN 189
A +W +K + + L + L ++ VD + I+++LKD+GE +
Sbjct: 246 GIA-----EW-----KKYIADYPNESYLRRARELYYASISFVDYNIGTILQQLKDMGEYD 295
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
NT I++TSDHG LG G + K P++ +R+PF+MR P
Sbjct: 296 NTLILFTSDHGEMLGDHGTFQ-KMLPYDGSVRIPFIMRYP 334
>gi|29348460|ref|NP_811963.1| mucin-desulfating sulfatase [Bacteroides thetaiotaomicron VPI-5482]
gi|383124650|ref|ZP_09945313.1| hypothetical protein BSIG_1599 [Bacteroides sp. 1_1_6]
gi|29340364|gb|AAO78157.1| mucin-desulfating sulfatase [Bacteroides thetaiotaomicron VPI-5482]
gi|382983652|gb|EES69274.2| hypothetical protein BSIG_1599 [Bacteroides sp. 1_1_6]
Length = 518
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 54/233 (23%), Positives = 100/233 (42%), Gaps = 51/233 (21%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
DL+ ++R++++ +P ++ F A H P D + +++ VT D+
Sbjct: 177 DLVTEKAIRWMKEQD---GNQPFLMCCHFKATHEPYDYPIRMEHLYDGVTFPEPENLLDW 233
Query: 134 AP------------------------NPDKQWI-----------LQVTRKMQPVHRQFTD 158
P +PDK W +Q T + +++F
Sbjct: 234 GPETNGRSFKGQTLEELERRWRIASQDPDKWWCRYPGLPFSTEGMQRTAARRASYQKF-- 291
Query: 159 LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEF 218
+ L+ +VDD + K++ L ++ +NT +IY SD GY LG+ G + F +E
Sbjct: 292 --IRDYLRCGATVDDNIGKLLNALDEMNIADNTIVIYVSDQGYFLGEHGFFDKRMF-YEE 348
Query: 219 DIRVPFLMRGPGIVPGTMYLLDVLI-----PQVRKFSSGSLIFIM--SFLINL 264
R+PF++R P VP L D+++ P + +F+ + + SF+ NL
Sbjct: 349 SARMPFVIRYPKKVPAGKRLDDLILNVDFAPTLAEFAGVKMENVQGDSFVSNL 401
>gi|254515103|ref|ZP_05127164.1| choline-sulfatase [gamma proteobacterium NOR5-3]
gi|219677346|gb|EED33711.1| choline-sulfatase [gamma proteobacterium NOR5-3]
Length = 464
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/230 (21%), Positives = 103/230 (44%), Gaps = 22/230 (9%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
G+FGK+ + PG+ W + + Y + +N++G + + Y D++ +
Sbjct: 123 GFFGKWHMGHADDSPRPGFDRWVSFVGQGVYDDPEMNIDGVT-----SHASGYMTDILTD 177
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA------PQYA---NMFFNVTSHHTPA 130
+V F+ + ++P + ++ A H PE A P A +++ +P
Sbjct: 178 QAVSFI----EEIDERPFLAFIAHKASH-PEIHANYLRTFPPAAGDEDLYTQEPLQRSPN 232
Query: 131 YDY--APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
+ A P + + P D ++ RL+ L +VD + +++ L+ L
Sbjct: 233 WQASTAGKPALERPVDYNDPRSP-EGGLPDDIILNRLRMLSAVDRGLGRLIDVLETKKLL 291
Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
++T ++ +SD G+ G+FGL + + +E I +P ++R P +VP L
Sbjct: 292 DDTVVVLSSDQGFFYGEFGLAQERRLAYEASIHIPLIIRYPQMVPAGAEL 341
>gi|407719814|ref|YP_006839476.1| choline-sulfatase [Sinorhizobium meliloti Rm41]
gi|407318046|emb|CCM66650.1| Choline-sulfatase [Sinorhizobium meliloti Rm41]
Length = 512
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 80/165 (48%), Gaps = 9/165 (5%)
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
+AN + L + S++P L +SF PH P + ++ +++ H TP P
Sbjct: 170 FLANQKLYQLSRENDDESRRPWCLTVSFTHPHDPYVARRKFWDLYEGC-EHLTPEVGAIP 228
Query: 136 ----NPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELN 189
+P Q I+ ++ Q ++ ++R + +D+ V +++ L L+
Sbjct: 229 LDKQDPHSQRIM-LSCDYQNFDVTEENVRRSRRAYFANISYLDEKVGELIDTLTRTRMLD 287
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+T I++ SDHG LG+ GL +F FE RVP ++ GPGI PG
Sbjct: 288 DTLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMLAGPGIAPG 331
>gi|89069432|ref|ZP_01156787.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Oceanicola granulosus HTCC2516]
gi|89045062|gb|EAR51140.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Oceanicola granulosus HTCC2516]
Length = 517
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 33/75 (44%), Positives = 48/75 (64%), Gaps = 2/75 (2%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M++ L+T+Q++DD V +++ L L NT +IYTSD G+ LG+ G K F +E +
Sbjct: 288 MSRYLRTVQAIDDGVGRMLDRLDALDLAENTLVIYTSDQGFFLGEHGWFD-KRFMYEESL 346
Query: 221 RVPFLMRGP-GIVPG 234
R+PFL R P GI PG
Sbjct: 347 RMPFLARFPAGIAPG 361
>gi|298385837|ref|ZP_06995394.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Bacteroides sp. 1_1_14]
gi|298261065|gb|EFI03932.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Bacteroides sp. 1_1_14]
Length = 527
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/233 (23%), Positives = 100/233 (42%), Gaps = 51/233 (21%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-SHHTPAYDY 133
DL+ ++R++ K+ +P ++ F A H P D + +++ VT D+
Sbjct: 186 DLVTEKAIRWM---KEQDGNQPFLMCCHFKATHEPYDYPIRMEHLYDGVTFPEPENLLDW 242
Query: 134 AP------------------------NPDKQWI-----------LQVTRKMQPVHRQFTD 158
P +PDK W +Q T + +++F
Sbjct: 243 GPETNGRSFKGQTLEELERRWRIASQDPDKWWCRYPGLPFSTEGMQRTAARRASYQKF-- 300
Query: 159 LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEF 218
+ L+ +VDD + K++ L ++ +NT +IY SD GY LG+ G + F +E
Sbjct: 301 --IRDYLRCGATVDDNIGKLLNALDEMNIADNTIVIYVSDQGYFLGEHGFFDKRMF-YEE 357
Query: 219 DIRVPFLMRGPGIVPGTMYLLDVLI-----PQVRKFSSGSLIFIM--SFLINL 264
R+PF++R P VP L D+++ P + +F+ + + SF+ NL
Sbjct: 358 SARMPFVIRYPKKVPAGKRLDDLILNVDFAPTLAEFAGVKMENVQGDSFVSNL 410
>gi|365134097|ref|ZP_09343135.1| hypothetical protein HMPREF1032_00931 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363614613|gb|EHL66099.1| hypothetical protein HMPREF1032_00931 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 477
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/183 (26%), Positives = 85/183 (46%), Gaps = 27/183 (14%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP------EDSAPQYANMFFNVTS 125
Y D I +D+V FL K++ +KP L +++ APH P E+ Y N F
Sbjct: 152 YVTDAITDDAVEFLNSHKEA--EKPFYLHVAYTAPHAPWLNNHPEEYTKLYENCLFESVP 209
Query: 126 HHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
H D + +T ++ R + + ++D + I++ L +L
Sbjct: 210 HLPRHLD---------SIYLTDEVAKDERAN----LIGYFAAVTAMDVGIGHILETLDEL 256
Query: 186 GELNNTYIIYTSDHGYHLGQFGL-VKGK-SFP---FEFDIRVPFLMRGPGIVPGTMYLLD 240
G +T +++TSD+G+ G G KG +FP +E ++VPF+ R PG +P + + D
Sbjct: 257 GLAEDTLVVFTSDNGFSCGHHGFWGKGNGTFPLNMYEESVKVPFIARHPGRIPAGV-VCD 315
Query: 241 VLI 243
L+
Sbjct: 316 SLV 318
>gi|150395788|ref|YP_001326255.1| sulfatase [Sinorhizobium medicae WSM419]
gi|150027303|gb|ABR59420.1| sulfatase [Sinorhizobium medicae WSM419]
Length = 512
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 81/165 (49%), Gaps = 9/165 (5%)
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
+AN + L + S++P L +SF PH P + ++ +++ + H TP + P
Sbjct: 170 FLANQKLYQLSRENDDESRRPWCLTVSFTHPHDPYVARRKFWDLYEDC-EHLTPEVEAIP 228
Query: 136 NPDKQWILQVTRKMQPVHRQF----TDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELN 189
DKQ + +R F ++ ++R + +D+ V +++ L L+
Sbjct: 229 -LDKQDSHSQRIMLSCDYRNFDVTEENVRRSRRAYFANISYLDEKVGELIDTLTRTRMLD 287
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+T I++ SDHG LG+ GL +F FE RVP ++ GPGI PG
Sbjct: 288 DTLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGPGIAPG 331
>gi|393776751|ref|ZP_10365045.1| putative sulfatase [Ralstonia sp. PBA]
gi|392716108|gb|EIZ03688.1| putative sulfatase [Ralstonia sp. PBA]
Length = 485
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 84/175 (48%), Gaps = 9/175 (5%)
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
GE Y +Y IA + +LR+ + KP +L +SF APH P + P++ +++
Sbjct: 154 GESQYT-FYDREIAARAQVWLREQATAEHDKPWVLFVSFVAPHFPLTAPPEHFYKYYDQD 212
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
Y + P + QV + + + D + K+ +D+ + K++
Sbjct: 213 LPLPKLYAHDQRPTHPY--QVDYRKSFNYDDYFDAVKLKKALAGYYGLCSFLDENIGKVL 270
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+ L D G ++T +IYTSDHG +LG G+ GKS FE VP +M GP + G
Sbjct: 271 QALDDAGLTDDTRVIYTSDHGDNLGSRGMW-GKSTLFEEAAGVPLIMAGPDVPAG 324
>gi|296120919|ref|YP_003628697.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296013259|gb|ADG66498.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 535
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 50/81 (61%), Gaps = 5/81 (6%)
Query: 155 QFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVK 210
Q TD + + LQ ++ +D V +++ LK+ G +NT +++TSDHGYHLG+ GL +
Sbjct: 283 QLTDPMRQEILQAYFASISFMDAQVGRVLASLKENGLADNTIVVFTSDHGYHLGEHGLWQ 342
Query: 211 GKSFPFEFDIRVPFLMRGPGI 231
+S FE RVP L+ PGI
Sbjct: 343 KQSL-FEESARVPLLISAPGI 362
>gi|427825191|ref|ZP_18992253.1| putative sulfatase [Bordetella bronchiseptica Bbr77]
gi|410590456|emb|CCN05545.1| putative sulfatase [Bordetella bronchiseptica Bbr77]
Length = 485
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 86/176 (48%), Gaps = 9/176 (5%)
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
GE Y Y ++++ + +LR+ + KP +L +SF +PH P + P++ ++N
Sbjct: 154 GESQYTFYDREIVSRAQI-WLREQAPRHADKPWVLFVSFVSPHFPLTAPPEHYYRYYNRD 212
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
YD + PD + Q + + + D + K+ Q +D+ + K++
Sbjct: 213 LPLPKLYDKSQRPDHPY--QQDYRGSFNYDDYFDPGLVKKAQAGYYGLCSFLDENIGKLL 270
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
L DL L++T ++YTSDHG +LG G+ GKS FE VP ++ G I G
Sbjct: 271 GTLDDLDILDSTRVVYTSDHGDNLGARGMW-GKSNMFEEAAAVPLIIAGKDIPSGV 325
>gi|410421771|ref|YP_006902220.1| sulfatase [Bordetella bronchiseptica MO149]
gi|408449066|emb|CCJ60752.1| putative sulfatase [Bordetella bronchiseptica MO149]
Length = 485
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 86/176 (48%), Gaps = 9/176 (5%)
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
GE Y Y ++++ + +LR+ + KP +L +SF +PH P + P++ ++N
Sbjct: 154 GESQYTFYDREIVSRAQI-WLREQAPRHADKPWVLFVSFVSPHFPLTAPPEHYYRYYNRD 212
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
YD + PD + Q + + + D + K+ Q +D+ + K++
Sbjct: 213 LPLPKLYDKSQRPDHPY--QQDYRGSFNYDDYFDPGLVKKAQAGYYGLCSFLDENIGKLL 270
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
L DL L++T ++YTSDHG +LG G+ GKS FE VP ++ G I G
Sbjct: 271 GTLDDLDILDSTRVVYTSDHGDNLGARGMW-GKSNMFEEAAAVPLIIAGKDIPSGV 325
>gi|427819811|ref|ZP_18986874.1| putative sulfatase [Bordetella bronchiseptica D445]
gi|410570811|emb|CCN19009.1| putative sulfatase [Bordetella bronchiseptica D445]
Length = 485
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 86/176 (48%), Gaps = 9/176 (5%)
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
GE Y Y ++++ + +LR+ + KP +L +SF +PH P + P++ ++N
Sbjct: 154 GESQYTFYDREIVSRAQI-WLREQAPRHADKPWVLFVSFVSPHFPLTAPPEHYYRYYNRD 212
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
YD + PD + Q + + + D + K+ Q +D+ + K++
Sbjct: 213 LPLPKLYDKSQRPDHPY--QQDYRGSFNYDDYFDPGLVKKAQAGYYGLCSFLDENIGKLL 270
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
L DL L++T ++YTSDHG +LG G+ GKS FE VP ++ G I G
Sbjct: 271 GTLDDLDILDSTRVVYTSDHGDNLGARGMW-GKSNMFEEAAAVPLIIAGKDIPSGV 325
>gi|149178047|ref|ZP_01856643.1| sulfatase [Planctomyces maris DSM 8797]
gi|148843104|gb|EDL57471.1| sulfatase [Planctomyces maris DSM 8797]
Length = 406
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 79/158 (50%), Gaps = 13/158 (8%)
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
++ ++ F+ Q Q +P +L ++ PH P +YA MF + + P ++ +
Sbjct: 112 SDRAIEFIEQDHQ----QPWLLNINVYDPHPPFTPPEKYAKMF-DPAAMPGPHFEESDKA 166
Query: 138 DKQWILQVTRKMQPVHRQFTDLLMTKRLQTL-----QSVDDAVEKIVKELKDLGELNNTY 192
++ + +V QPV T++ K++Q L +DD +I+ L G+ +NT
Sbjct: 167 TQELLSRV--DFQPV-TMSTEISELKKVQALYYAMIAQIDDQFARILSVLDSTGQRDNTV 223
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
II+TSDHG LG GLV+ +E IRVP + PG
Sbjct: 224 IIFTSDHGETLGDHGLVQKGCRFYEGLIRVPLIFSWPG 261
>gi|423226172|ref|ZP_17212638.1| hypothetical protein HMPREF1062_04824 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392630283|gb|EIY24277.1| hypothetical protein HMPREF1062_04824 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 491
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 54/224 (24%), Positives = 102/224 (45%), Gaps = 42/224 (18%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ +++A++++ ++ ++K S K+P+ + +F APH P + +Y +M+ +V +
Sbjct: 196 HWSEILADETITYMEENKDS--KEPLFMFCAFNAPHDPRQAPKEYVDMYDVDKISVPQNF 253
Query: 128 TPAY---------------DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVD 172
P + AP P + +Q HRQ L+T +D
Sbjct: 254 LPVHPLGEQMKSGKNLRDEQLAPFPRTHYSVQK-------HRQEYYALITH-------MD 299
Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
V +I++ LK G NT I++ SD+G +G GL+ GK ++ +VP G G+
Sbjct: 300 AQVGRIIEALKRNGLDKNTLIVFASDNGLSVGHHGLI-GKQSMYDHSAKVPLAFCGLGLP 358
Query: 233 PGT-----MYLLDVLIPQVRKFSSGSLIFIMSFLINLNTMRAEK 271
G +YL D L+P V + F+ +N +++ K
Sbjct: 359 TGEVRNQLIYLQD-LVPTVYDLVGIKKPNHLEFISQINALKSSK 401
>gi|33598356|ref|NP_885999.1| sulfatase [Bordetella parapertussis 12822]
gi|33603289|ref|NP_890849.1| sulfatase [Bordetella bronchiseptica RB50]
gi|412341382|ref|YP_006970137.1| sulfatase [Bordetella bronchiseptica 253]
gi|427816299|ref|ZP_18983363.1| putative sulfatase [Bordetella bronchiseptica 1289]
gi|33566914|emb|CAE39130.1| putative sulfatase [Bordetella parapertussis]
gi|33577413|emb|CAE34678.1| putative sulfatase [Bordetella bronchiseptica RB50]
gi|408771216|emb|CCJ56016.1| putative sulfatase [Bordetella bronchiseptica 253]
gi|410567299|emb|CCN24870.1| putative sulfatase [Bordetella bronchiseptica 1289]
Length = 485
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 86/176 (48%), Gaps = 9/176 (5%)
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
GE Y Y ++++ + +LR+ + KP +L +SF +PH P + P++ ++N
Sbjct: 154 GESQYTFYDREIVSRAQI-WLREQAPRHADKPWVLFVSFVSPHFPLTAPPEHYYRYYNRD 212
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
YD + PD + Q + + + D + K+ Q +D+ + K++
Sbjct: 213 LPLPKLYDRSQRPDHPY--QQDYRGSFNYDDYFDPGLVKKAQAGYYGLCSFLDENIGKLL 270
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
L DL L++T ++YTSDHG +LG G+ GKS FE VP ++ G I G
Sbjct: 271 GTLDDLDILDSTRVVYTSDHGDNLGARGMW-GKSNMFEEAAAVPLIIAGRDIPSGV 325
>gi|227821277|ref|YP_002825247.1| choline-sulfatase [Sinorhizobium fredii NGR234]
gi|227340276|gb|ACP24494.1| choline-sulfatase [Sinorhizobium fredii NGR234]
Length = 512
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 78/165 (47%), Gaps = 9/165 (5%)
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
+AN + L + +++P L +SF PH P + ++ +++ N H P P
Sbjct: 170 FLANQKLYQLARDNDDEARRPWCLTVSFTHPHDPYVARRKFWDLYENC-EHLRPEVGALP 228
Query: 136 ----NPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELN 189
+P Q I+ + ++ ++R + +D+ V ++V L L+
Sbjct: 229 FDSQDPHSQRIMHSC-DYKNFELTEVNIRQSRRAYFANISYLDEKVGELVDTLTRTRMLD 287
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
NT I++ SDHG LG+ GL +F FE RVP ++ GPG+ PG
Sbjct: 288 NTLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGPGVAPG 331
>gi|262408667|ref|ZP_06085213.1| sulfatase [Bacteroides sp. 2_1_22]
gi|294646699|ref|ZP_06724324.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294807128|ref|ZP_06765946.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
gi|345511274|ref|ZP_08790819.1| sulfatase [Bacteroides sp. D1]
gi|229444367|gb|EEO50158.1| sulfatase [Bacteroides sp. D1]
gi|262353532|gb|EEZ02626.1| sulfatase [Bacteroides sp. 2_1_22]
gi|292637987|gb|EFF56380.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294445694|gb|EFG14343.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
Length = 464
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 60/230 (26%), Positives = 104/230 (45%), Gaps = 36/230 (15%)
Query: 19 HGYFGKYLNKYN-GSYIP-PGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL 76
+GY+ ++ KY G+Y+P + W L YY + N R + L
Sbjct: 115 NGYYVGFIGKYGVGNYLPQKEFDYWRGLAGQGTYYQKDKDGNPRHL-----------TGL 163
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAP---------QYANMFFN----V 123
I+ FL ++ KP L +SF +PH + P Y + FF+
Sbjct: 164 ISEQIDEFLANRDET---KPFCLSVSFKSPHTESEQDPFPFDKKYSFMYEDEFFDKPETF 220
Query: 124 TSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIV 179
++ + + D +W + + + +R TD ++ + +D+A+ K+
Sbjct: 221 GENYYRLFPESFRKDGKWENEGYVRFK--NRYGTDEKYQSSVKGYYRLIAGIDEAIGKLR 278
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
K LK++G NT II+TSD+GY+LG+ GL +GK + E IR+P ++ P
Sbjct: 279 KRLKEMGLDKNTLIIFTSDNGYYLGEHGL-EGKWYGHEESIRLPLVIYDP 327
>gi|410474393|ref|YP_006897674.1| sulfatase [Bordetella parapertussis Bpp5]
gi|408444503|emb|CCJ51254.1| putative sulfatase [Bordetella parapertussis Bpp5]
Length = 485
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 86/176 (48%), Gaps = 9/176 (5%)
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
GE Y Y ++++ + +LR+ + KP +L +SF +PH P + P++ ++N
Sbjct: 154 GESQYTFYDREIVSRAQI-WLREQAPRHADKPWVLFVSFVSPHFPLTAPPEHYYRYYNRD 212
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKIV 179
YD + PD + Q + + + D + K+ Q +D+ + K++
Sbjct: 213 LPLPKLYDRSQRPDHPY--QQDYRGSFNYDDYFDPGLVKKAQAGYYGLCSFLDENIGKLL 270
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
L DL L++T ++YTSDHG +LG G+ GKS FE VP ++ G I G
Sbjct: 271 GTLDDLDILDSTRVVYTSDHGDNLGARGMW-GKSNMFEEAAAVPLIIAGRDIPSGV 325
>gi|378825232|ref|YP_005187964.1| choline sulfatase [Sinorhizobium fredii HH103]
gi|365178284|emb|CCE95139.1| choline sulfatase [Sinorhizobium fredii HH103]
Length = 510
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 78/165 (47%), Gaps = 9/165 (5%)
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
+AN + L + ++P L +SF PH P + ++ +++ N H P D P
Sbjct: 168 FLANQKLYQLARENDDEGRRPWCLTVSFTHPHDPYVARRKFWDLYENC-EHLLPEVDALP 226
Query: 136 ----NPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELN 189
+P + I+ + ++ ++R + +D+ V ++V L L+
Sbjct: 227 FDRQDPHSRRIMHSC-DYKNFELTEENIRRSRRAYFANISYLDEKVGELVDTLTRTRMLD 285
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
NT I++ SDHG LG+ GL +F FE RVP ++ GPG+ PG
Sbjct: 286 NTLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGPGVAPG 329
>gi|196230236|ref|ZP_03129099.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196225833|gb|EDY20340.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 489
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/237 (23%), Positives = 104/237 (43%), Gaps = 35/237 (14%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
+ GK+ + G+ W Y++ + N+NG K + Y D++ +
Sbjct: 122 AFLGKWHMGEDSDNPRKGFDYWAGFRGQGHYFDDTYNINGEHKK-----IDGYSSDVLTD 176
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDK 139
++ +L+ KP + + APH P + AP+ + P + N ++
Sbjct: 177 LALDWLKHR----GDKPFFCELCYKAPHYPFEPAPRNKGRYEKAP---IPYPETMANTEE 229
Query: 140 QWILQ---VTRKMQPVHRQFTDLLMTKRL----------------QTLQSVDDAVEKIVK 180
++ Q V + +H D + T R +T+ S+D+ + +++K
Sbjct: 230 NYLTQPRWVRERRFGIHG--VDHMETGRFDHDPVPSFEDLYHRYSETVFSMDENIGRLLK 287
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGTM 236
L + G ++T ++Y +D+G+ LG+ G K FE +RVP L+R PG V PGT+
Sbjct: 288 YLDNTGLRDSTIVVYMADNGFELGEHGFYD-KRDAFETSMRVPMLLRAPGAVKPGTV 343
>gi|299145033|ref|ZP_07038101.1| putative sulfatase [Bacteroides sp. 3_1_23]
gi|298515524|gb|EFI39405.1| putative sulfatase [Bacteroides sp. 3_1_23]
Length = 517
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 78/162 (48%), Gaps = 7/162 (4%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
+ +P I N+++ +++Q K++ P + +SFP PH P Y +MF +
Sbjct: 176 EEQHPTKIVNEALAWIKQQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 231
Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T D A +K IL Q P Q + + ++ +DD ++++++ LK G
Sbjct: 232 TSRKDLAKKGEKYRILAQQEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 291
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
+ NT + SDHG + G++GL++ + E R+P + G
Sbjct: 292 QYENTLFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 333
>gi|452126972|ref|ZP_21939555.1| sulfatase [Bordetella holmesii F627]
gi|452130345|ref|ZP_21942917.1| sulfatase [Bordetella holmesii H558]
gi|451920270|gb|EMD70416.1| sulfatase [Bordetella holmesii H558]
gi|451922067|gb|EMD72212.1| sulfatase [Bordetella holmesii F627]
Length = 487
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/181 (30%), Positives = 80/181 (44%), Gaps = 12/181 (6%)
Query: 59 GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYAN 118
R I GE Y Y D I +V +L + KP +L SF +PH P + P++
Sbjct: 150 ARSIGPGESSYIRYDRD-ITEKTVNWLNTQGRKSHDKPWVLYSSFISPHSPLIAPPEFYA 208
Query: 119 MFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTD-----LLMTKRLQTLQSVDD 173
M+ + S P D +P WI Q K F + + +D
Sbjct: 209 MY-DARSIELPRKDVKYHP---WI-QAWNKCYGFDSHFGSDDQRRIAIASYFGLCSFLDS 263
Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
V KI+ L++ G + T +I+ SDHG +LG L GKS +E VP ++ GPG+ P
Sbjct: 264 NVGKILYALEETGLAHTTRVIFLSDHGDNLGARALW-GKSTMYEESAGVPMILGGPGVAP 322
Query: 234 G 234
G
Sbjct: 323 G 323
>gi|430747567|ref|YP_007206696.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430019287|gb|AGA31001.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 541
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 58/261 (22%), Positives = 111/261 (42%), Gaps = 53/261 (20%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GY + K++ P G+ W L YYN + NG ++KH Y D+I +
Sbjct: 129 GYQTAVVGKWHLVTEPTGFDYWHILPGQGVYYNPPMIDNGTQVKH-----EGYATDIITD 183
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGP------------EDSAPQYANMFFNVTSHH 127
++ +L++ +S KP +++ APH + + P+ + +F + +
Sbjct: 184 LTIDWLKKRDKS---KPFLMMSQNKAPHREWAPALRHLGHDNDRAYPEPSTLFDDYSGRG 240
Query: 128 TPAYDYAPNPDKQWI-----LQVTRKMQPVHRQFTDLLMTKR------------------ 164
+D K L+ ++ P R+ D R
Sbjct: 241 KAEHDQDMTIAKTMTARDLKLEAPPQLTPDQRKVWDAYYEPRNQAFAKADLKGDDLVRWK 300
Query: 165 --------LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
L +++VD+ V +++K L++ G +NT ++Y +D G++LG+ G K + F
Sbjct: 301 YNRYMHDYLACIKAVDENVGRLLKYLEEEGLADNTIVVYAADQGFYLGEHGWFD-KRWIF 359
Query: 217 EFDIRVPFLMRGPGIV-PGTM 236
E +R P ++R PG+ PG++
Sbjct: 360 EESLRTPLVVRWPGVTKPGSV 380
>gi|340619559|ref|YP_004738012.1| mucin-desulfating sulfatase [Zobellia galactanivorans]
gi|339734356|emb|CAZ97733.1| Mucin-desulfating sulfatase, family S1-11 [Zobellia
galactanivorans]
Length = 495
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 102/239 (42%), Gaps = 47/239 (19%)
Query: 37 GWREWGALIMNSKYYNYS-INMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKK 95
G+ W L KYYN I G +Y +Y D+I + ++ +L + K
Sbjct: 135 GFDYWEVLPGQGKYYNPDFITEEG-----DTNYKGEYVTDVITDRALNWLENERDQ--DK 187
Query: 96 PIMLVMSFPAPHGPEDSAPQYANMFFNV----------------TSHHTPAYDYAPNPDK 139
P M+++ APH D A +Y + + +V T+ H D A + +
Sbjct: 188 PFMMMVHHKAPHRNWDPAERYMSKYEDVEIPEPDNFFDDYATRTTAAHKQEMDIATSMNT 247
Query: 140 QWILQVTR---------------------KMQPVHRQFTDLLMTKRLQTLQSVDDAVEKI 178
L+ K + + + M L+ + SVD++V +I
Sbjct: 248 HIDLKAEGDRYANDPRYKERYAKFAAAKLKGEALVKWKYQTYMKDYLRCIWSVDESVGQI 307
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGTM 236
++ LK+ G NT +IY+SD G+++G+ G K F +E R P +++ PG V PG++
Sbjct: 308 MQSLKEQGLDENTIVIYSSDQGFYMGEHGWF-DKRFMYEESFRTPLIVKWPGTVKPGSV 365
>gi|397566700|gb|EJK45164.1| hypothetical protein THAOC_36235 [Thalassiosira oceanica]
Length = 475
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 86/198 (43%), Gaps = 24/198 (12%)
Query: 53 YSINMNGRKIKHGED---YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
Y M G + G D + YPD D L + K S SK P + ++FP PH P
Sbjct: 147 YIACMTGEGPEGGCDATSFPESIYPDDFVRDRAIDLLKRKPSDSK-PFFMQVNFPGPHHP 205
Query: 110 EDSAPQYANMFFNVTSHHTP------AYDYAPNPD-----KQWILQVTR--KMQPVHRQF 156
++M +V P ++AP+ + +QW L + ++ P +
Sbjct: 206 ---VVVTSSMARSVMDRLWPPPVNPQQVEFAPSGEAMTCQQQWKLSGKKHVRVDPAKMKL 262
Query: 157 TDLLMTKRLQ---TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
R ++ +D + IV++LK GE+ NT ++ DHG +LG G GK
Sbjct: 263 VHPQKEGRCNYGAEIEHIDRLMSSIVEQLKATGEIANTVVVVAGDHGENLGDNG-TTGKG 321
Query: 214 FPFEFDIRVPFLMRGPGI 231
P++ + VP L+ GP +
Sbjct: 322 IPWQASVSVPLLISGPSV 339
>gi|410096139|ref|ZP_11291129.1| hypothetical protein HMPREF1076_00307 [Parabacteroides goldsteinii
CL02T12C30]
gi|409227210|gb|EKN20111.1| hypothetical protein HMPREF1076_00307 [Parabacteroides goldsteinii
CL02T12C30]
Length = 501
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 81/193 (41%), Gaps = 33/193 (17%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV----------T 124
D ++ F+ + K+ P ++ FP PH P Y M+ +
Sbjct: 198 DYFTTQAIEFINRHKEG----PFCCMLCFPDPHSANLVRPPYDTMYSRIRFKEPATALTD 253
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
+ PA+ + N + QP M + ++ +DD V +I+ LKD
Sbjct: 254 TTGRPAWSHGNN---------LKADQPED-------MAQYFGMIKCIDDNVGRILDALKD 297
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIP 244
G L+NT II+TSDHG Q GL+ K P + RVPF++ P + + + +V+
Sbjct: 298 AGILDNTIIIFTSDHGDMCRQHGLI-NKGVPLDDSSRVPFIISYPAGMRQGIRIENVM-- 354
Query: 245 QVRKFSSGSLIFI 257
V F+ L F
Sbjct: 355 SVTDFTPSLLSFC 367
>gi|336426381|ref|ZP_08606391.1| hypothetical protein HMPREF0994_02397 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336010816|gb|EGN40796.1| hypothetical protein HMPREF0994_02397 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 454
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/167 (28%), Positives = 79/167 (47%), Gaps = 13/167 (7%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
Y D +A++++RF+++ K +P + ++S P PH P Y MF + +
Sbjct: 153 YTTDWLADETLRFIKEQKDG---RPFLFMVSMPDPHQPFRVRKPYDTMFDPLQMEIPETF 209
Query: 132 DYAPNPDKQWILQVT---RKMQPV----HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
PD W Q R P L + L ++ +DD V KI+ L++
Sbjct: 210 YEEELPD--WAEQDEWGRRHYFPTGLFEREDHFRRLKAQYLGAVKCIDDNVGKILDGLRE 267
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
L T +I+T+DHG +LG+ GL++ K+ +E +P L+R PG
Sbjct: 268 ANLLEETVVIFTTDHGEYLGEHGLME-KNNLYESVYHIPMLLRLPGC 313
>gi|440714555|ref|ZP_20895134.1| choline-sulfatase [Rhodopirellula baltica SWK14]
gi|436440751|gb|ELP34055.1| choline-sulfatase [Rhodopirellula baltica SWK14]
Length = 517
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/168 (26%), Positives = 80/168 (47%), Gaps = 11/168 (6%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
+ D + + + F+R+ S++P +S P PHGP Y MF N+ P
Sbjct: 221 FSTDWLCDRAADFIRE----HSQEPFCYHLSLPDPHGPNTVRQPYDTMFENMPVR--PPM 274
Query: 132 DYAPNPDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
+ + D+ W+ R Q ++F LMT+ ++ +DD V ++ L +L
Sbjct: 275 TFQLDGDQPGWLPATNRNSQ---QRFNARLMTQYFGMVRCIDDNVGMLLSLLDELSLTKR 331
Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
T +++TSDHG + G + K P+E +VP ++ PG+VP + +
Sbjct: 332 TVVVFTSDHGDLCYEHGRLN-KGNPYEGSAKVPMIITAPGLVPAGLRI 378
>gi|343085359|ref|YP_004774654.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342353893|gb|AEL26423.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 504
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/180 (27%), Positives = 84/180 (46%), Gaps = 14/180 (7%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ +++ ND+ FL + S S P + ++F A H P S + +++ + +
Sbjct: 213 HWSEVLRNDAHAFLDSA--SKSNNPFFMYLAFNAVHDPRQSPGSFLDLYAVEDIKIPKNF 270
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIVKELK 183
P Y Y L+ + P R T + K LQ + +D + +I EL+
Sbjct: 271 LPVYPYKDEIGNGPALR-DEALAPYPR--TPYSIKKHLQEYYAIISHMDHQIGEIFNELE 327
Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
G NTYII+T+DHG +G+ GL+ GK F+ + PF++ GP I G D+ I
Sbjct: 328 RKGLRENTYIIFTADHGLAMGRHGLM-GKQTMFDHSMAAPFIISGPNIPKGQRLDQDIYI 386
>gi|54301819|ref|YP_131812.1| sulfatase [Photobacterium profundum SS9]
gi|46915239|emb|CAG22012.1| putative sulfatase [Photobacterium profundum SS9]
Length = 520
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 74/166 (44%), Gaps = 16/166 (9%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGPEDSAPQ-YANMFFNVTSH 126
N + P+ A+ ++++LR + S KP LV+S PH P D PQ Y + F TS
Sbjct: 223 NQWSPEHEADIAIKYLRNEDGKYRDSNKPFTLVVSMNPPHSPYDQVPQKYLDKFDGQTSE 282
Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
P+ QW P + + + + VD +I+ EL LG
Sbjct: 283 SLNT-----RPNVQWDKDYQEGYGPAY-------FKEYMAMVHGVDAQFGRIIDELDRLG 330
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
+T +++ SDHG +G G K+ +E +R+P + R PG +
Sbjct: 331 LEEDTLVVFFSDHGCCMGSNGKPT-KNVHYEEAMRIPMMFRWPGKI 375
>gi|417302547|ref|ZP_12089645.1| sulfatase [Rhodopirellula baltica WH47]
gi|327541105|gb|EGF27651.1| sulfatase [Rhodopirellula baltica WH47]
Length = 517
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/168 (26%), Positives = 80/168 (47%), Gaps = 11/168 (6%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
+ D + + + F+R+ S++P +S P PHGP Y MF N+ P
Sbjct: 221 FSTDWLCDRAADFIRE----HSQEPFCYHLSLPDPHGPNTVRQPYDTMFENMPVR--PPM 274
Query: 132 DYAPNPDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
+ + D+ W+ R Q ++F LMT+ ++ +DD V ++ L +L
Sbjct: 275 TFQLDGDQPGWLPATNRNSQ---QRFNARLMTQYFGMVRCIDDNVGMLLSLLDELSLTKR 331
Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
T +++TSDHG + G + K P+E +VP ++ PG+VP + +
Sbjct: 332 TVVVFTSDHGDLCYEHGRLN-KGNPYEGSAKVPMIITAPGLVPAGLRI 378
>gi|257069392|ref|YP_003155647.1| choline-sulfatase [Brachybacterium faecium DSM 4810]
gi|256560210|gb|ACU86057.1| choline-sulfatase [Brachybacterium faecium DSM 4810]
Length = 520
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 88/192 (45%), Gaps = 20/192 (10%)
Query: 79 NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
ND VR + + + P ++V SF PH P + ++ + F +V P + P+P
Sbjct: 176 NDRVR---ADQAAGERVPFLMVASFIHPHDPYEPPQEHWDRFADVDIP-APRHPEVPDPA 231
Query: 139 KQWILQVTRKMQPVHRQFTDLLMTKRLQ-----TLQSVDDAVEKIVKELKDLGELNNTYI 193
+ R M ++ T +R + + +DD V +I + L+ LG +T +
Sbjct: 232 QDPHSHRLRAMSGFDQRETTEEEVRRARRSYYAAVSYIDDHVGRIRERLESLGLWEDTVV 291
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQVRKFSSGS 253
+ TSDHG LG+ GL K P+E RVP ++ GP + L+P R + S
Sbjct: 292 VVTSDHGDMLGEKGLWF-KMSPYEESSRVPLILHGP----------EHLVPAGRYANPVS 340
Query: 254 LIFIMSFLINLN 265
L+ +M L+ L
Sbjct: 341 LLDLMPTLLELG 352
>gi|429766630|ref|ZP_19298886.1| arylsulfatase [Clostridium celatum DSM 1785]
gi|429183835|gb|EKY24874.1| arylsulfatase [Clostridium celatum DSM 1785]
Length = 483
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/175 (23%), Positives = 84/175 (48%), Gaps = 11/175 (6%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+ +S+ FLR+ KP L+ SF PH P D+ Y +++ N D+
Sbjct: 185 VVTESIDFLRRRDPC---KPFFLMTSFVRPHAPFDAPKYYFDLYKNKDLREPVKGDWN-- 239
Query: 137 PDKQWILQVTRKMQ----PVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
D++ I ++ + P+ ++ + + +D + ++++ L + ++NT
Sbjct: 240 -DEELIKKIGYRYNATTGPLDKELVKEMQIGYYACITHIDHQIGRLIQALVEHQVMDNTI 298
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQVR 247
II+TSDHG LG ++ K+ P++ I++P + G + G Y +D + ++R
Sbjct: 299 IIFTSDHGEMLGDHNYIQ-KARPYQGSIKIPMFISGNEELLGRKYGVDESLVELR 352
>gi|254461230|ref|ZP_05074646.1| putative sulfatase YidJ [Rhodobacterales bacterium HTCC2083]
gi|206677819|gb|EDZ42306.1| putative sulfatase YidJ [Rhodobacteraceae bacterium HTCC2083]
Length = 520
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 82/184 (44%), Gaps = 26/184 (14%)
Query: 70 NDYYPDL-IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVT 124
D YP +A+ ++ +L + K P +SFP PH P + +Y +M+ F+V
Sbjct: 211 EDKYPTAWVADKAISYLEERKAG--GDPFFAFVSFPDPHHPFNPPGKYWDMYDPNDFDVD 268
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKM-----QPVHRQFTDLLMTKRLQTLQS--------V 171
YD NP +Q KM +P QF + ++ ++ +
Sbjct: 269 L----PYDAHQNPTPP--MQHETKMWETGEEPAIPQFAFRASDQHVREAKALTAGMITMI 322
Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
DD + +++ LK GE +NT II+TSDHG +LG F L+ + P RVP + P
Sbjct: 323 DDHIGRVIDALKASGEYDNTVIIFTSDHGDYLGDFNLMLKGAIPLPSITRVPMIWSDPAT 382
Query: 232 VPGT 235
G
Sbjct: 383 RKGA 386
>gi|410644094|ref|ZP_11354578.1| extracellular sulfatase Sulf [Glaciecola agarilytica NO2]
gi|410136393|dbj|GAC02977.1| extracellular sulfatase Sulf [Glaciecola agarilytica NO2]
Length = 477
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 87/170 (51%), Gaps = 18/170 (10%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE-DSA------PQYANMFFNVT 124
YY +AN+ +R S + KP +L ++F PH + D A P++ + NV
Sbjct: 168 YYIQNLANEFIR-----DASNTNKPWVLAVNFWNPHAHDTDKADQFHYPPEFEEYYQNVD 222
Query: 125 ---SHHTPAYDYAPNPD--KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
+ + +D+ P+ K + +V + + + + ++ + + + SVD AV I
Sbjct: 223 VPDAKNATKHDFEELPEFLKGSMARVRWEYRFANPKMYQRIIKRHYRAISSVDKAVGAIS 282
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
+L+ LG +NT IIYT D+GY L + L GK F +E D+RVP +++ P
Sbjct: 283 AQLEKLGVDDNTIIIYTGDNGYSLNERQLA-GKWFGWEEDLRVPLIIKDP 331
>gi|343085846|ref|YP_004775141.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354380|gb|AEL26910.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 529
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 62/271 (22%), Positives = 107/271 (39%), Gaps = 69/271 (25%)
Query: 18 SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
++GY + K + +P G+ L YYN +NG K + Y D+I
Sbjct: 117 ANGYQTAMIGKIHMDGLPQGFDFSMVLPGQGHYYNPDFLINGEKKR-----IEGYCTDII 171
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT------------- 124
+ ++ +L+ ++ +P +L+ APH A +Y ++ +VT
Sbjct: 172 TDAALNWLKNDRKE--DEPFLLLYHQKAPHRNWKPAQKYLTLYDDVTFDPPANFFDDYEG 229
Query: 125 -------------SHHTPAYDYA-----PNPDKQWILQVTRKMQPVHRQFTDLL------ 160
H +D+ N D W++ +++ P Q D L
Sbjct: 230 RGTAAKTQEMEIDGHARWGHDFKFQFDPTNGDTTWLVNQLKRLNP--EQLADWLAAYEPK 287
Query: 161 ----------------------MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
+ L+T++SVDD V +++ LK+ G NT ++YTSD
Sbjct: 288 NKAMREANLSGKELALWKYNRYIKDYLRTIKSVDDGVGELLDYLKETGLDENTIVVYTSD 347
Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
G++LG+ G K F +E R P LM+ P
Sbjct: 348 QGFYLGEHGWFD-KRFMYEESFRTPLLMKYP 377
>gi|304393879|ref|ZP_07375804.1| choline-sulfatase [Ahrensia sp. R2A130]
gi|303294078|gb|EFL88453.1| choline-sulfatase [Ahrensia sp. R2A130]
Length = 513
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/166 (28%), Positives = 78/166 (46%), Gaps = 32/166 (19%)
Query: 96 PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQ 155
P +V SF PH P ++ P+ F+N+ H A D N + P+ ++
Sbjct: 184 PFAMVASFTHPHDPYEARPE----FWNLYDHD--AIDMPKN------VLAPEDQDPMSKR 231
Query: 156 FTDLLMTKRLQTLQS---------------VDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
D + +++ + D V ++V+ L++ G+L++T +I T+DHG
Sbjct: 232 LMDGIEASKVEVTEEQVRNSRHAYFANISYFDSKVGELVQVLEESGQLDDTVVIITADHG 291
Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY----LLDVL 242
LG+ GL +F FE RVP +M GPGI T+ L+D+L
Sbjct: 292 DMLGERGLWYKMNF-FEHSARVPLIMAGPGIANTTVADPCSLVDIL 336
>gi|189459671|ref|ZP_03008456.1| hypothetical protein BACCOP_00299 [Bacteroides coprocola DSM 17136]
gi|189433630|gb|EDV02615.1| arylsulfatase [Bacteroides coprocola DSM 17136]
Length = 522
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/194 (21%), Positives = 81/194 (41%), Gaps = 40/194 (20%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH--------------------------- 107
D++ ++R+++ +S +P ++ F A H
Sbjct: 179 DIVTEKAIRWMKAQDKS---QPFLMCCHFKATHEPYDFPERMRHLYDGVVFPEPENLLDW 235
Query: 108 GPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQV-----TRKMQPV--HRQFTDLL 160
GPE + + H+ + +PDK W + T+ M + + L
Sbjct: 236 GPETNGRTFVGQKLETIGHNWEVA--SADPDKWWCRYLELPFTTKGMSRIAARKAIYQKL 293
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
+ L+ +VDD + K++ L ++G +NT ++Y +D GY LG+ G + F +E
Sbjct: 294 IRDYLRCAATVDDNIGKLLDALDEMGIADNTIVVYVADQGYFLGEHGFYDKRMF-YEESA 352
Query: 221 RVPFLMRGPGIVPG 234
R+PF++R P +P
Sbjct: 353 RMPFVIRYPKCIPA 366
>gi|225011395|ref|ZP_03701833.1| sulfatase [Flavobacteria bacterium MS024-2A]
gi|225003898|gb|EEG41870.1| sulfatase [Flavobacteria bacterium MS024-2A]
Length = 501
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 63/121 (52%), Gaps = 12/121 (9%)
Query: 133 YAPNPDKQWILQVTRK---MQPVHRQFTDLLMTK-------RLQTLQSVDDAVEKIVKEL 182
++P K W+ T K M+ + + +L + K L+T+QSVDD V +I+ L
Sbjct: 248 FSPEQKKAWLNAYTPKNEAMKTQNLEGNELALWKFNRYIKDYLRTIQSVDDGVGEILDYL 307
Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
G NT ++YTSD G++LG+ G K F +E +R P LMR P +P +D L
Sbjct: 308 DQEGLTENTIVVYTSDQGFYLGEHGWFD-KRFMYEESLRTPLLMRYPKEIPANS-KVDAL 365
Query: 243 I 243
I
Sbjct: 366 I 366
>gi|372210453|ref|ZP_09498255.1| sulfatase yidj [Flavobacteriaceae bacterium S85]
Length = 512
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 51/185 (27%), Positives = 91/185 (49%), Gaps = 26/185 (14%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH------GPEDSAPQYANM----FF 121
+ D + + ++ F K + +P V+S+P PH P D+ N+ F
Sbjct: 201 FATDWLTDRAIEFFDAQKSN--NQPFFQVVSYPDPHTNNTVRAPYDTLYNIKNVKLPKTF 258
Query: 122 NVTSHHT-PAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT--------LQSVD 172
N+ H PA+ NP+ + + K + + +++T ++L+ ++ +D
Sbjct: 259 NLEYHKDLPAWK---NPELDFAVDKNTKEKKLKKEYTPEKTKRKLKKDITQYFGMVKCID 315
Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP-GI 231
D + K+++ LK G+LNNT II+TSDHG LG++G K P E ++PF++ P I
Sbjct: 316 DNIGKMIQALKKNGQLNNTIIIFTSDHGDLLGEYGR-DNKGVPQEGSAKIPFIVYYPKAI 374
Query: 232 VPGTM 236
GT+
Sbjct: 375 KAGTV 379
>gi|118442882|ref|YP_878798.1| sulfatase [Clostridium novyi NT]
gi|118133338|gb|ABK60382.1| probable sulfatase [Clostridium novyi NT]
Length = 504
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/186 (25%), Positives = 83/186 (44%), Gaps = 28/186 (15%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----------- 120
+Y + IA + L Q K++ + L SF PH P +A+M+
Sbjct: 193 HYDNWIAERTNALLEQYKEN--DENFFLWSSFFDPHPPYFVPEPWASMYNPDDITLPHYL 250
Query: 121 ----------FNVTSHHTPAYD--YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
T P + Y P DK W+ +Q D+ + + ++
Sbjct: 251 GGEHLTTSPILKKTQEENPDFSQYYEPK-DKNWLHGCHSHLQDKEELKKDIAIYYGMISM 309
Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
+D + KI+ +L++LG +T +++TSDHG+ GQ L+ F +E I++PF++R
Sbjct: 310 --MDHYIGKILDKLEELGMAEDTIVVFTSDHGHFFGQHNLIAKGPFHYEDMIKIPFIVRE 367
Query: 229 PGIVPG 234
PG++P
Sbjct: 368 PGVIPA 373
>gi|410098956|ref|ZP_11293930.1| hypothetical protein HMPREF1076_03108 [Parabacteroides goldsteinii
CL02T12C30]
gi|409220087|gb|EKN13044.1| hypothetical protein HMPREF1076_03108 [Parabacteroides goldsteinii
CL02T12C30]
Length = 481
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 85/177 (48%), Gaps = 13/177 (7%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY-DY 133
D+ D+V ++ + K++ KP L +++ +PH P +Y + P Y D
Sbjct: 195 DIFTEDAVEYIEKQKEA--GKPFFLSVAYMSPHDPRTMPDEYMRQYDFDKIQLPPNYMDK 252
Query: 134 APNPDKQWIL--QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNT 191
P + + ++ +V + V + +M + + VD V IVK LK+ GE NT
Sbjct: 253 HPFDNGELVIRDEVLAAIPRVPDEIKKHIM-EYYAMVTHVDRRVGDIVKALKETGEYENT 311
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-VPGT-----MYLLDVL 242
I++ D+G +GQ GL+ GK +E +RVP +++ G V G YL+DV
Sbjct: 312 IIVFAGDNGLAVGQHGLM-GKQNVYEHSVRVPLMIKQAGKDVKGQKSDKLCYLIDVF 367
>gi|443695661|gb|ELT96527.1| hypothetical protein CAPTEDRAFT_63083, partial [Capitella teleta]
Length = 490
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 15/113 (13%)
Query: 136 NPDKQWILQVTRKMQPVHRQFTDLLMTKR--------------LQTLQSVDDAVEKIVKE 181
+PD WI P + QF + L + VDD V +I+K+
Sbjct: 225 SPDIAWIKNTELHGAPEYSQFMEGLSRNNSLPYVIVRDLRRAYFSCISYVDDLVGEILKD 284
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L DLG N+T + + DHGYHLG+ ++ GK+ FE P ++R PG+ G
Sbjct: 285 LNDLGIANDTVVSFIGDHGYHLGEHSMI-GKNTAFEVANNSPMIIRIPGVTDG 336
>gi|168335082|ref|ZP_02693192.1| sulfatase [Epulopiscium sp. 'N.t. morphotype B']
Length = 541
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 53/181 (29%), Positives = 85/181 (46%), Gaps = 20/181 (11%)
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF-FNVTSHHTP 129
+Y + N ++ +L Q+ P L + + PH P + +Y ++ P
Sbjct: 167 NYETAWVENQTINWLEQN----GDNPFFLFVGYIKPHPPWNIPAKYQGIYQXAAIPEAIP 222
Query: 130 AYDYAP-NPD-KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
Y P +P+ W Q R+ + T + VD++ KI++ L+ G+
Sbjct: 223 KARYYPEDPNHNSWYGDDDSNXQL--RKHQEAYYT----AVSMVDESFGKIIEHLRXTGK 276
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGT-----MYLLDV 241
L+NT +IYTSDHG LG G KS P+E +R+P L+R P + PGT + LLD+
Sbjct: 277 LDNTLVIYTSDHGEMLGDRGYYS-KSVPYESAVRIPLLVRYPEVFEPGTTNEDFVDLLDI 335
Query: 242 L 242
L
Sbjct: 336 L 336
>gi|410643845|ref|ZP_11354334.1| extracellular sulfatase Sulf [Glaciecola chathamensis S18K6]
gi|410136471|dbj|GAC12521.1| extracellular sulfatase Sulf [Glaciecola chathamensis S18K6]
Length = 477
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 85/165 (51%), Gaps = 15/165 (9%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA-------PQYANMFFNVT---SH 126
I N + F++ + S KP +L ++F PH + P++ + +VT ++
Sbjct: 170 IQNLAKEFIQDA--SSKDKPWVLAVNFWNPHAHDTDKVDQFHYPPEFEEYYQDVTVPDAN 227
Query: 127 HTPAYDYAPNPD--KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
++ +D+ P K + +V + + + + ++ + + + SVD AV I +LK
Sbjct: 228 NSTKHDFEELPKFLKDSMARVRWEYRFANPKMYQRIIKRHYRAISSVDKAVGAISAQLKK 287
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
LG +NT IIYT D+GY L + L GK F +E D+RVP +++ P
Sbjct: 288 LGVDDNTIIIYTGDNGYSLNERQLA-GKWFGWEEDLRVPLIIKDP 331
>gi|73539584|ref|YP_299951.1| sulfatase [Ralstonia eutropha JMP134]
gi|72122921|gb|AAZ65107.1| Sulfatase [Ralstonia eutropha JMP134]
Length = 482
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 58/191 (30%), Positives = 88/191 (46%), Gaps = 21/191 (10%)
Query: 57 MNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPH----GPEDS 112
M G I GE Y Y + +V++L+++ Q + +L + APH PE+
Sbjct: 145 MLGEHIGPGESSYTQY-DRAVTQRAVQWLQEAAQR-QEAGFVLYVGLVAPHFPFVVPEEF 202
Query: 113 APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQS-- 170
Y H P Y +P W+ + M RQF D +RL+ +
Sbjct: 203 YSLYPTDGLPEPKLH-PRTGYEQHP---WVREYCDFMA-SERQFAD--ADERLRAFAAYY 255
Query: 171 -----VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
+D V +I+ L+D G + T+I+YTSDHG +LG G V GKS +E ++VP L
Sbjct: 256 GLCTWLDHNVGQILGALRDNGLEDTTHIVYTSDHGDNLGARG-VWGKSTLYEESVKVPML 314
Query: 226 MRGPGIVPGTM 236
+ GP + PG
Sbjct: 315 LAGPIVTPGVC 325
>gi|418404758|ref|ZP_12978202.1| choline-sulfatase [Sinorhizobium meliloti CCNWSX0020]
gi|359501269|gb|EHK73887.1| choline-sulfatase [Sinorhizobium meliloti CCNWSX0020]
Length = 512
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 75/164 (45%), Gaps = 7/164 (4%)
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
+AN + L + S++P L +SF PH P + ++ +++ H TP P
Sbjct: 170 FLANQKLYQLSRENDDESRRPWCLTVSFTHPHDPYVARRKFWDLYEGC-EHLTPEVGAIP 228
Query: 136 ----NPDKQWI-LQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
+P Q I L + V + + +D+ V +++ L L++
Sbjct: 229 LDEQDPHSQRIMLSCDYQNFDVTEENVRRSCRAYFANISYLDEKVGELIDTLTRTRMLDD 288
Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
T I++ SDHG LG+ GL +F FE RVP ++ GPGI PG
Sbjct: 289 TLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGPGIAPG 331
>gi|383115488|ref|ZP_09936244.1| hypothetical protein BSGG_2641 [Bacteroides sp. D2]
gi|313695106|gb|EFS31941.1| hypothetical protein BSGG_2641 [Bacteroides sp. D2]
Length = 497
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 84/172 (48%), Gaps = 19/172 (11%)
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF---- 120
GE++ + Y D ++RFL+ KQ K+P + ++F +PH P + P Y +
Sbjct: 187 GEEFSSKMYADA----AIRFLQ--KQKGDKQPFLAYVAFTSPHDPRNQLPNYGRKYSPDT 240
Query: 121 FNVTSHHTPAYDYAPNPDK---QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
+V + P + + + + +L R Q V ++ +D + VD + +
Sbjct: 241 LDVPRNFLPKHPFNNGEMRVRDELLLPAPRTEQQVQKELSDYY-----GMISEVDVQIGR 295
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
I++ L+ G+ NT +++ SD+G +G+ GL+ GK ++ ++VP + P
Sbjct: 296 IMEVLRATGQAENTIVVFASDNGLAVGRHGLL-GKQNLYDHSVKVPLTIIAP 346
>gi|56697077|ref|YP_167440.1| choline sulfatase [Ruegeria pomeroyi DSS-3]
gi|56678814|gb|AAV95480.1| choline sulfatase [Ruegeria pomeroyi DSS-3]
Length = 498
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 80/164 (48%), Gaps = 18/164 (10%)
Query: 81 SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQ 140
++ +LR + + P ML +SF PH P Y M+ + P DY + D +
Sbjct: 184 ALEYLRHAPDT----PWMLTVSFTNPHPPYVVPRPYWEMYKDAD---IPLPDYPADMDAR 236
Query: 141 W--ILQVTRKMQPVHRQFTDLLMTKRLQTLQS--------VDDAVEKIVKELKDLGELNN 190
+ R+ +H++ ++ + L ++ VDD + +++ L + G+ +
Sbjct: 237 YSEFDHALRRWHGLHQRGHEVRDPRNLIAMRRGFAALAHYVDDKIGALLEVLDETGQRDE 296
Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
T II TSDHG LG+ GL++ +S +E+ R+P ++ PG PG
Sbjct: 297 TVIIVTSDHGEMLGEKGLIQKRSL-YEWSARIPLIIDLPGAAPG 339
>gi|171912626|ref|ZP_02928096.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Verrucomicrobium spinosum DSM 4136]
Length = 468
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 53/232 (22%), Positives = 104/232 (44%), Gaps = 30/232 (12%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNG-RKIKHGEDYYNDYYPDLIA 78
Y GK+ + P + W + KY++ N+ G RK+ G YY +
Sbjct: 94 AYIGKWHMGEDNDEKRPHFNYWASHKGQGKYFDTEFNIEGERKVLPG------YYTTSVT 147
Query: 79 NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV-TSHHTPAYDYAPNP 137
+ ++++S KP L++ APH +YA+ F ++ + A++ P
Sbjct: 148 QLATDYIKKSHD----KPWCLMLGHKAPHSFYTPEEKYAHTFDDIRVPYPKSAFNLKDKP 203
Query: 138 ----DK------------QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
D+ +W + + F +++ + SVDD++ + K+
Sbjct: 204 RWYRDRLNTWHGIYGPLFEWRKEFPNNKPEGVKDFENMIHG-YWGVILSVDDSIGALTKQ 262
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
L++ G+LNNT +++ D+G G+ G+V ++ + IR+P ++R PG+VP
Sbjct: 263 LEESGQLNNTVVVFMGDNGLLNGENGMVDKRAM-HDASIRIPLIVRYPGLVP 313
>gi|91789296|ref|YP_550248.1| sulfatase [Polaromonas sp. JS666]
gi|91698521|gb|ABE45350.1| sulfatase [Polaromonas sp. JS666]
Length = 511
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 81/182 (44%), Gaps = 28/182 (15%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPAYD 132
I + ++ F+R+ +K+ L SFP PH P D ++ + ++ +H T ++
Sbjct: 205 IGDRTIEFMRKHAGEAAKR-FCLWASFPDPHHPFDCPEPWSRLHHPDEVDLPAHRTTDFE 263
Query: 133 YAPNPDKQWILQVTRKMQPVH-------RQFTDLLMTKRLQTLQS-----------VDDA 174
P W + + +PV RQ + T Q L++ VD
Sbjct: 264 RRP-----WWHKASMDSKPVGDAAVQALRQNFSRMPTPAEQQLRNITANYYGMISLVDHQ 318
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
V +I L+ LG NT +I+TSDHG LG GL+ P+E +RV ++ GP + G
Sbjct: 319 VGRIQTALQQLGLDGNTLVIFTSDHGEWLGDHGLMLKGPIPYEGVLRVGMVVNGPQVQAG 378
Query: 235 TM 236
+
Sbjct: 379 QV 380
>gi|182625265|ref|ZP_02953040.1| sulfatase family protein [Clostridium perfringens D str. JGS1721]
gi|177909577|gb|EDT72017.1| sulfatase family protein [Clostridium perfringens D str. JGS1721]
Length = 481
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 73/155 (47%), Gaps = 5/155 (3%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+ N+S+ FLR+ S KP L MSF PH P D Y +M+ + D+A
Sbjct: 183 VVNESIDFLRRRDTS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239
Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
D++ + ++ + +++ ++ +D + + + L + GELNNT ++
Sbjct: 240 EDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLF 299
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
SDHG +G + K P+E RVPF + PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSARVPFFIYDPG 333
>gi|149196982|ref|ZP_01874035.1| N-acetylglucosamine-6-sulfatase precursor [Lentisphaera araneosa
HTCC2155]
gi|149140092|gb|EDM28492.1| N-acetylglucosamine-6-sulfatase precursor [Lentisphaera araneosa
HTCC2155]
Length = 495
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 50/228 (21%), Positives = 97/228 (42%), Gaps = 36/228 (15%)
Query: 37 GWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKP 96
G+ W L + KY N ++ K+ H + D+I N ++ + + + KP
Sbjct: 132 GFDHWETLPGHGKYINPALLTAEGKVNH-----EGHSSDVITNRALEWYNEKRDK--DKP 184
Query: 97 IMLVMSFPAPHGPEDSAPQYANMF-------------------------FNVTSHHTPAY 131
ML++ + APH AP++ F + H
Sbjct: 185 FMLMVHYKAPHRAWIPAPRFVEKFKDTVFPEPKTLFDDYKTRDMAVQHSMGIREHMNLEK 244
Query: 132 DYAPN--PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
D N ++++L + + RQ M + VD+ V +++ +LK G +
Sbjct: 245 DLKTNIWEHRKYLLNKDLSKEELTRQKYQAYMRDYFACVAGVDENVGRLLDQLKADGIDD 304
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGTM 236
NT ++++SD G++LG+ G K + +E R P++++ PG+ PG++
Sbjct: 305 NTIVMFSSDQGFYLGEHGWFD-KRWMYEESFRTPYIVKWPGVTKPGSV 351
>gi|431797960|ref|YP_007224864.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
gi|430788725|gb|AGA78854.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
Length = 520
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 81/177 (45%), Gaps = 30/177 (16%)
Query: 69 YNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHT 128
Y Y P L+ ++V+F+ ++ KKP L + P PH E AP+ F
Sbjct: 229 YGAYSPALMQKEAVKFI----EAHQKKPFFLFLPTPLPHA-ELIAPEEEMALFEGKLQEG 283
Query: 129 PAY---DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
Y D P K Q+ + QP H F ++ + +DD V +I+K + L
Sbjct: 284 KPYKGTDEGPGFRKG---QLGSQEQP-HAAFAAMV--------KILDDQVGEIMKTVDSL 331
Query: 186 GELNNTYIIYTSDHGYHL---------GQFGLVKG-KSFPFEFDIRVPFLMRGPGIV 232
G NNT II+TSD+G H+ G +KG K +E IRVP +MR PG +
Sbjct: 332 GLANNTLIIFTSDNGPHIEGGGDPAFFDSNGPLKGIKRDLYEGGIRVPLIMRWPGRI 388
>gi|116625900|ref|YP_828056.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
gi|116229062|gb|ABJ87771.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
Length = 499
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 55/225 (24%), Positives = 99/225 (44%), Gaps = 19/225 (8%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY--NYSINMNGRKIKHGEDYYNDYYPDLI 77
G+ GK+ G PG+ +W + Y +N++G+ + Y D +
Sbjct: 122 GFVGKWHMGREGDDPQPGFDKWVSFRGQGSYLPERNGLNVDGKHVPQ-----KGYITDEL 176
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA-PN 136
+ ++ +LR + ++P L +S A H A ++ + T D + PN
Sbjct: 177 TDYALDWLRTVPK---EQPYFLYLSHKAVHADFIPADRHKGAYAKETFRPPTTMDESGPN 233
Query: 137 PDKQ--WILQVTRKMQ----PVHRQFTDLLMTKR-LQTLQSVDDAVEKIVKELKDLGELN 189
+ W+ P H KR +TL VDD+V++++ L++ G+L+
Sbjct: 234 AQHRPMWVQNQRNSWHGVDFPYHSDLDVGEYYKRYAETLLGVDDSVDRMLDALRERGQLD 293
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+T +IY D+G+ G+ GL+ K +E +RVP L R P + G
Sbjct: 294 STLVIYMGDNGFQFGEHGLID-KRTAYEESMRVPLLARCPEMFSG 337
>gi|171911709|ref|ZP_02927179.1| choline sulfatase [Verrucomicrobium spinosum DSM 4136]
Length = 469
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 49/177 (27%), Positives = 85/177 (48%), Gaps = 16/177 (9%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
+L A+ +V F+R KQS +P + ++F APH P + P + N P
Sbjct: 166 ELFADKAVEFVRAQKQS--SQPWLCYVAFNAPHDPRKAPPAWHEQT-NANKPPIPENFLP 222
Query: 135 PNPDKQWILQVT-RKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELN 189
+P + V K+ P R T+ ++ + L + +D + +I++ L+ G+
Sbjct: 223 VHPFNNGEMTVRDEKLAPWPR--TEPVIRQELADYYAAIMFMDSQIGRILESLRATGQDE 280
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLDV 241
T I+++SDHG +G GL+ GK ++ + P ++ GPG+ G YLLDV
Sbjct: 281 KTIIVFSSDHGLAIGSHGLM-GKQSLYDHSMHSPLILAGPGVPKGEKRAALCYLLDV 336
>gi|168333697|ref|ZP_02691950.1| mucin-desulfating sulfatase [Epulopiscium sp. 'N.t. morphotype B']
Length = 491
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 91/204 (44%), Gaps = 23/204 (11%)
Query: 41 WGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI-ANDSVRFLRQSKQSFSKKPIML 99
W N KY+ N K E+ +ND + + A+ + FL K + S KP ML
Sbjct: 151 WNGSPSNLKYFPDKEN----KFNGYENQWNDDHLTMFNAHQADAFLE--KAADSGKPFML 204
Query: 100 VMSFPAPHGPEDSAPQYANMFFNVT----SHHTPAY---------DYAPNPDKQWI-LQV 145
+SF APH P ++P++ + ++T + P Y ++ N D+ W +
Sbjct: 205 SVSFKAPHRPHTASPKWVQFYKDMTIKRMDNDKPEYFAVLPEVVRTHSRNADEYWGGHRY 264
Query: 146 TRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQ 205
TRK F + VD+AV I +L +LG NT IIYTSD+GY G
Sbjct: 265 TRKAWNDDATFQHDFRN-YYGLISGVDEAVGSIRHKLDELGLAENTIIIYTSDNGYFCGS 323
Query: 206 FGLVKGKSFPFEFDIRVPFLMRGP 229
L GK +E I+ P ++ P
Sbjct: 324 KQL-GGKELLYEESIKAPLIIYDP 346
>gi|384208890|ref|YP_005594610.1| sulfatase [Brachyspira intermedia PWS/A]
gi|343386540|gb|AEM22030.1| sulfatase [Brachyspira intermedia PWS/A]
Length = 474
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 75/148 (50%), Gaps = 14/148 (9%)
Query: 96 PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQ----- 150
P+ + +SF PH P D +Y +++ ++ + DK+ +T++ +
Sbjct: 197 PLFMWVSFVDPHHPFDPIKKYYDIYKDINPKELNS---KLKLDKKRPEHLTKQGERGYWP 253
Query: 151 ----PVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLG 204
+ ++ KRL + +D + +I+ +LK+ E +NT I++TSDHG +LG
Sbjct: 254 GGGEEHNYSEEEIKEIKRLYYGMISFIDSQIGRIIDKLKEKNEFDNTIIVFTSDHGEYLG 313
Query: 205 QFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
+GL+K F ++ I+VP L G GIV
Sbjct: 314 DYGLLKKGPFMYDCLIKVPLLFYGKGIV 341
>gi|423288192|ref|ZP_17267043.1| hypothetical protein HMPREF1069_02086 [Bacteroides ovatus
CL02T12C04]
gi|392671081|gb|EIY64557.1| hypothetical protein HMPREF1069_02086 [Bacteroides ovatus
CL02T12C04]
Length = 529
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 80/173 (46%), Gaps = 11/173 (6%)
Query: 61 KIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF 120
K K G+D Y +Y D + N + L + +P +++S P PH P+ Y + +
Sbjct: 191 KNKEGDDAY-EYSTDFLTNKVIELLEERDN----RPFCMMLSIPDPHSPDLCKEPYISQY 245
Query: 121 FNVT-SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
+ ++ + + N W + K Q +F + ++ +DD V KI+
Sbjct: 246 AKIEITNPSTNTEELLNARPGW--GIGGKAQA--EKFVPKNIRNYFAMVKCIDDNVGKIL 301
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
K L D +NT +++TSDHG L V K P++ IRVPF++R P +
Sbjct: 302 KYLDDNSLTDNTIVVFTSDHGDMLYNHMRVD-KGCPYDDAIRVPFIVRYPSRI 353
>gi|393784374|ref|ZP_10372539.1| hypothetical protein HMPREF1071_03407 [Bacteroides salyersiae
CL02T12C01]
gi|392666150|gb|EIY59667.1| hypothetical protein HMPREF1071_03407 [Bacteroides salyersiae
CL02T12C01]
Length = 546
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 83/189 (43%), Gaps = 27/189 (14%)
Query: 66 EDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----- 120
E Y+ D + IA + L Q K+ KP ML SF PH P +A+M+
Sbjct: 232 EKYHTDAW---IAERCIALLDQYKKG--DKPFMLWASFFDPHNPYLIPEPWASMYKPEDM 286
Query: 121 -----FNVTSHHTPAY---DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSV- 171
N P + NPDK I PVH ++LQ +++
Sbjct: 287 DIPETVNDDISDMPLHYRVTREENPDKS-IWAEGNPSYPVHGIAQHRWTKEKLQQNKAIY 345
Query: 172 -------DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPF 224
D + KI+ L++ G+++NT II+T+DHG +G GLV F +E +++PF
Sbjct: 346 YGMVSMMDHYIGKILDHLEENGQIDNTLIIFTTDHGAFIGHHGLVGKGVFDYEDAVKIPF 405
Query: 225 LMRGPGIVP 233
+ G +P
Sbjct: 406 IASYKGKMP 414
>gi|404367236|ref|ZP_10972607.1| hypothetical protein FUAG_01415 [Fusobacterium ulcerans ATCC 49185]
gi|404288764|gb|EFS25900.2| hypothetical protein FUAG_01415 [Fusobacterium ulcerans ATCC 49185]
Length = 472
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 75/147 (51%), Gaps = 8/147 (5%)
Query: 96 PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKM----QP 151
P+ + SF PH P D ++A+++ N + ++ N + + + +
Sbjct: 196 PLFMWTSFVDPHHPFDPIEKFASLYENTIIDKPISKNFLCNGRPEHLQKQGKNGYWPGGG 255
Query: 152 VHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG 207
F+D + + + + +D + KI+K L++ +L+NT II+TSDHG ++G +G
Sbjct: 256 EQHNFSDEKIEEFTKYYYAMISFIDQEIGKILKSLEEKKQLDNTIIIFTSDHGEYMGDYG 315
Query: 208 LVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L++ F ++ I+VP L+ G G+ G
Sbjct: 316 LLQKGPFMYDDLIKVPLLIWGKGVSKG 342
>gi|340620633|ref|YP_004739086.1| sulfatase [Zobellia galactanivorans]
gi|339735430|emb|CAZ98807.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
Length = 538
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 54/205 (26%), Positives = 91/205 (44%), Gaps = 27/205 (13%)
Query: 41 WGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLV 100
W A N Y++ N N K GE+ +A ++V FL+++ + + KP+
Sbjct: 178 WDAGSPNGGYFSPYNNPNLENRKDGENLSM-----RLAEETVTFLKENDPNKTGKPVFAY 232
Query: 101 MSFPAPHGP-EDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDL 159
+SF A HGP + + ++A + Y K P+ RQ D
Sbjct: 233 LSFYAVHGPIQTTKEKWAKYRKKAVRNGVADSGYK-----------MAKFLPI-RQVQDN 280
Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG-------YHLGQFGLVKGK 212
+ L ++++DDAV +++ L LG NT +++TSD+G + L GK
Sbjct: 281 PVYAGL--VETMDDAVGRVLNTLDSLGLDKNTLVVFTSDNGGVSAGDSFSTSNLPLRGGK 338
Query: 213 SFPFEFDIRVPFLMRGPGIVPGTMY 237
+ FE IR P+ ++ P +V G Y
Sbjct: 339 GYQFEGGIREPYFIKAPWLVKGGAY 363
>gi|373498627|ref|ZP_09589132.1| hypothetical protein HMPREF0402_03005 [Fusobacterium sp. 12_1B]
gi|371960785|gb|EHO78430.1| hypothetical protein HMPREF0402_03005 [Fusobacterium sp. 12_1B]
Length = 472
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 75/148 (50%), Gaps = 8/148 (5%)
Query: 96 PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKM----QP 151
P+ + SF PH P D ++A+++ N + ++ N + + + +
Sbjct: 196 PLFMWASFVDPHHPFDPIEKFASLYENTIIDKPISKNFLCNGRPEHLQKQGKNGYWPGGG 255
Query: 152 VHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG 207
F+D + + + + +D + KI+K L++ +L+NT II+TSDHG ++G +G
Sbjct: 256 EQHNFSDEKIEEFTKYYYAMISFIDQEIGKILKSLEEKKQLDNTIIIFTSDHGEYMGDYG 315
Query: 208 LVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
L++ F ++ I+VP L+ G G+ G
Sbjct: 316 LLQKGPFMYDDLIKVPLLIWGKGVSKGV 343
>gi|149176956|ref|ZP_01855565.1| choline sulfatase [Planctomyces maris DSM 8797]
gi|148844211|gb|EDL58565.1| choline sulfatase [Planctomyces maris DSM 8797]
Length = 549
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 84/174 (48%), Gaps = 15/174 (8%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPAYD 132
I +D++ FL Q + +P + ++F PH P + +Y + + + +++ P +
Sbjct: 162 IVDDAIEFL----QKKNTQPFFMYLAFACPHDPRVADQEYMDHYEREEIPLPANYLPLHP 217
Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
+ N +Q + P + + + +D + +++K LK+ GE +NT
Sbjct: 218 F--NNGEQVVRDELLAGFPRSKAEIRKHLHDYYADITGLDRHIGRLIKALKESGEYDNTV 275
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT----MYLLDVL 242
II++SDHG +G GL+ GK +E ++ P + GPGI G +YL D+
Sbjct: 276 IIFSSDHGLAVGSHGLM-GKQSLYEHSMKSPLIFSGPGIPHGQSNALVYLYDIF 328
>gi|420237904|ref|ZP_14742349.1| arylsulfatase A family protein [Rhizobium sp. CF080]
gi|398089566|gb|EJL80081.1| arylsulfatase A family protein [Rhizobium sp. CF080]
Length = 503
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 47/83 (56%), Gaps = 1/83 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M + L+T+QS+DD V +++ L G NT +IYTSD G+ LG+ G K F +E +
Sbjct: 286 MKRYLRTVQSIDDNVGRLLDHLDKKGLTENTLVIYTSDQGFFLGEHGWFD-KRFMYEESL 344
Query: 221 RVPFLMRGPGIVPGTMYLLDVLI 243
++PFL R P +P D+
Sbjct: 345 QMPFLARYPAAIPAGTTCGDLAC 367
>gi|168210574|ref|ZP_02636199.1| sulfatase family protein [Clostridium perfringens B str. ATCC 3626]
gi|170711367|gb|EDT23549.1| sulfatase family protein [Clostridium perfringens B str. ATCC 3626]
Length = 481
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 73/155 (47%), Gaps = 5/155 (3%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+ N+S+ FLR+ S KP L MSF PH P D Y +M+ + D+A
Sbjct: 183 VVNESIDFLRRKDPS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239
Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
D++ + ++ + +++ ++ +D + + + L + GELNNT ++
Sbjct: 240 EDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLF 299
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
SDHG +G + K P+E RVPF + PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSSRVPFFIYAPG 333
>gi|344204106|ref|YP_004789249.1| sulfatase [Muricauda ruestringensis DSM 13258]
gi|343956028|gb|AEM71827.1| sulfatase [Muricauda ruestringensis DSM 13258]
Length = 542
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 72/289 (24%), Positives = 112/289 (38%), Gaps = 81/289 (28%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGED--YYNDYYPDLI 77
GY + K++ P G+ W L YYN + GED Y DLI
Sbjct: 122 GYETAIVGKWHLHGKPTGFDYWDVLNDQGNYYN-------PEFIQGEDTTVVEGYATDLI 174
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG-------------------PEDSAPQYAN 118
+ S+ +L+Q + + +KP L++ APH P+ P++ N
Sbjct: 175 TDKSLEWLKQREGN--QKPFYLMVHHKAPHRNWMPALRHLNVYDSITFPLPDTYFPEFKN 232
Query: 119 MFFNVTSHHT-----------------PAYDYAPNPDKQWILQVTRKMQPVHRQFTDLL- 160
T + D A NP W R M P R D
Sbjct: 233 QRAAAEQQQTIYKDMYEGHDLKMSDGYGSTDLAHNP---WTKDFDR-MTPEQRNIWDEAY 288
Query: 161 -------------------------MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
+ + L T++SVD+ + KI+ L+ G NT ++Y
Sbjct: 289 LPKNNAFYEADLHGKELAEWKGQRYLHEYLATVKSVDEGIGKILGYLEATGLDENTLVVY 348
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGTMYLLDVLI 243
TSD G++LG+ G K F +E +++P LM+ PG++ PGT +D L+
Sbjct: 349 TSDQGFYLGEHGWFD-KRFMYEESMKMPLLMQLPGVIEPGTN--IDALV 394
>gi|198274928|ref|ZP_03207460.1| hypothetical protein BACPLE_01087 [Bacteroides plebeius DSM 17135]
gi|198272375|gb|EDY96644.1| arylsulfatase [Bacteroides plebeius DSM 17135]
Length = 451
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 59/252 (23%), Positives = 102/252 (40%), Gaps = 47/252 (18%)
Query: 5 VELTSRNLLYFDI--SHGYFGKYLNKY--NGSYIP----------PGWREWG-------- 42
VEL+ + + D+ GY Y+ K+ + Y P W EW
Sbjct: 74 VELSEEAVCWSDVLKKEGYRTAYIGKWHLDAPYRPYVDTYNNKGEVAWNEWCPPERRHGF 133
Query: 43 -ALIMNSKYYNYSINMNGRKIKHGEDYY--NDYYPDLIANDSVRFLRQSKQSFSKKPIML 99
I Y N+ M E++Y N + P A+ ++ F+R ++ + + +P +
Sbjct: 134 DKWIAYGTYDNHLKPMYWDTKSGREEFYYVNQWGPAYEADRAIEFIR-TEGTKTDRPFAM 192
Query: 100 VMSFPAPHG-----PEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
V+S PH PE Y ++ + P P+ + + R
Sbjct: 193 VVSMNPPHTGYELVPETYTRLYDSLDVEALARQLPYISEKGTPEGDYFRENIRNY----- 247
Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
+ VD+ V +I++ LKD G +NT I++TSDHG +G G V+GK+
Sbjct: 248 ----------YACITGVDEHVGRIIQALKDCGVYDNTLIVFTSDHGVCMGGHG-VEGKNV 296
Query: 215 PFEFDIRVPFLM 226
+E +R+P +
Sbjct: 297 FYEESMRIPMIC 308
>gi|372209997|ref|ZP_09497799.1| sulfatase yidj [Flavobacteriaceae bacterium S85]
Length = 479
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 81/174 (46%), Gaps = 7/174 (4%)
Query: 57 MNGRKIKHGEDYYNDYYP-DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ 115
MN +K + D N + D + ++ + + Q +P +++S P PH P+ + P
Sbjct: 179 MNDKKYESAADKSNIVHATDFFVDKTIEIINRDHQ----QPFFVMVSIPDPHTPDYAKPP 234
Query: 116 YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAV 175
Y N +N P A + + +K H F + + ++ +DD +
Sbjct: 235 Y-NSLYNHLDLQAPETMSATSIKNRPEWANNKKNDVSHEGFKPQKLKQYFGMVKHIDDRI 293
Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
I+K L L +NT +I+TSDHG + G + KS P++ ++VPF++R P
Sbjct: 294 GDILKTLDKLQISDNTILIFTSDHGDMFYEHGRM-NKSVPYDAAMKVPFVIRYP 346
>gi|257067681|ref|YP_003153936.1| arylsulfatase A family protein [Brachybacterium faecium DSM 4810]
gi|256558499|gb|ACU84346.1| arylsulfatase A family protein [Brachybacterium faecium DSM 4810]
Length = 478
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 67/149 (44%), Gaps = 15/149 (10%)
Query: 95 KPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP-AYDYAPNP--------DKQWILQV 145
+P L +F APH P ++A ++ P ++ +P + +
Sbjct: 211 EPFFLWAAFTAPHDPRTPPEEFARLYDRTDPAAVPLPENFRTDPVEATNFGERDENLAAA 270
Query: 146 TRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQ 205
R + V D + +DD + +I+ L+ G NT ++YT+DHG LGQ
Sbjct: 271 PRDPEEVRGHLADYY-----GMISHLDDGIGRILAHLERSGLAENTLVVYTADHGLSLGQ 325
Query: 206 FGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
G++ GK +E +RVP L+ GPGI G
Sbjct: 326 HGMM-GKQSLYEHSLRVPLLLAGPGIEAG 353
>gi|168217566|ref|ZP_02643191.1| sulfatase family protein [Clostridium perfringens NCTC 8239]
gi|182380378|gb|EDT77857.1| sulfatase family protein [Clostridium perfringens NCTC 8239]
Length = 481
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 73/155 (47%), Gaps = 5/155 (3%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+ N+S+ FLR+ S KP L MSF PH P D Y +M+ + D+A
Sbjct: 183 VVNESIDFLRRRDPS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239
Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
D++ + ++ + +++ ++ +D + + + L + GELNNT ++
Sbjct: 240 EDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLF 299
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
SDHG +G + K P+E RVPF + PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSARVPFFIYDPG 333
>gi|393771603|ref|ZP_10360073.1| putative N-acetylglucosamine-6-sulfatase [Novosphingobium sp. Rr
2-17]
gi|392722961|gb|EIZ80356.1| putative N-acetylglucosamine-6-sulfatase [Novosphingobium sp. Rr
2-17]
Length = 575
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 58/236 (24%), Positives = 108/236 (45%), Gaps = 33/236 (13%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY---------NYSINMNGRKIKHGEDYYN 70
+FGK+ + PG+ +W + I Y+ + + N++G+ +
Sbjct: 140 AFFGKWHMGASTDAPRPGFDKWVSFIGQGNYWPSMVNTGQSSPNFNVDGKAVAQ-----K 194
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF------FNVT 124
Y D + + ++ +L +K+ KKP L +S H P+YA+ + +
Sbjct: 195 GYITDELTDYAMDWL--TKERDPKKPFFLYLSHKGVHSDPLPPPRYAHQYDKTKFTLPAS 252
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF--TDLLMTKRLQ----TLQSVDDAVEKI 178
+ +TP + N K +Q R F D+ MT+ L+ TL +DD++ +I
Sbjct: 253 AANTPEH----NAGKPLWVQNQRNTWHGIDFFYNADVPMTEYLKYYYATLSGIDDSLGRI 308
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+ L+ +T +++TSD+GY +G+ GL+ K +E IRVP ++ PG +P
Sbjct: 309 MAYLRKNHLEKDTLVVFTSDNGYMIGEHGLID-KRNAYEASIRVPLVVWEPGTIPA 363
>gi|358065466|ref|ZP_09152008.1| choline-sulfatase [Clostridium hathewayi WAL-18680]
gi|356696358|gb|EHI57975.1| choline-sulfatase [Clostridium hathewayi WAL-18680]
Length = 491
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 87/212 (41%), Gaps = 47/212 (22%)
Query: 45 IMNSKYYNYSINMNGRKIKHGEDYYND--YYPDLIANDSVRFLRQSKQSFSKKPIMLVMS 102
IM + +I+ N + K E +ND Y + + + +L + K P L +S
Sbjct: 139 IMQPDSWGNAIHYNRQSAKPKE--WNDGLQYDEEVQVKAREYLYKKGMEKEKTPFCLCVS 196
Query: 103 FPAPHGPEDSAPQYANMF------------------------FNVTSHHTPAYDYAPNPD 138
+ PH P +Y NM+ N H T +D D
Sbjct: 197 YHHPHDPFQPPEKYWNMYEGKDIAIPDLSHVKLKGDTLLDCWLNTGFHRTDVFDVTKAED 256
Query: 139 KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
+ V R ++ L+ +DD V +++ +L+ G NT +I+TSD
Sbjct: 257 ----------LYAVRRAYSGLVTY--------IDDKVGQLLDDLEQCGLRENTVVIFTSD 298
Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
HG LG+ G+V+ + F +E+ R+P ++ PG
Sbjct: 299 HGDMLGERGMVQKRCF-YEWSSRIPLIISTPG 329
>gi|340616348|ref|YP_004734801.1| sulfatase [Zobellia galactanivorans]
gi|339731145|emb|CAZ94409.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
Length = 489
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 46/167 (27%), Positives = 75/167 (44%), Gaps = 38/167 (22%)
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
D+ D +A ++V+F+ S +KP L MSF A H P + P+ +
Sbjct: 204 DFLTDALAEEAVKFI----HSTDEKPFFLNMSFYAVHTPITAIPEKVEKY---------- 249
Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
K+ I R+ P + ++ +DDAV I+K L++ G ++N
Sbjct: 250 --------KRLIGPDARQKNPTYAGL-----------VEHLDDAVGAILKALEEKGIIDN 290
Query: 191 TYIIYTSDHGYHL-----GQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
T I++TSD+G + F L GK +E RVP L++ PG+
Sbjct: 291 TIIVFTSDNGGEILHGITDNFPLRDGKGSSYEGGTRVPLLVKWPGVT 337
>gi|422347930|ref|ZP_16428839.1| arylsulfatase [Clostridium perfringens WAL-14572]
gi|373223241|gb|EHP45592.1| arylsulfatase [Clostridium perfringens WAL-14572]
Length = 482
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 73/155 (47%), Gaps = 5/155 (3%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+ N+S+ FLR+ S KP L MSF PH P D Y +M+ + D+A
Sbjct: 183 VVNESIDFLRRRDPS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239
Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
D++ + ++ + +++ ++ +D + + + L + GELNNT ++
Sbjct: 240 EDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLF 299
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
SDHG +G + K P+E RVPF + PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSARVPFFIYDPG 333
>gi|262405803|ref|ZP_06082353.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294648075|ref|ZP_06725620.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294809254|ref|ZP_06767969.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
gi|336403337|ref|ZP_08584053.1| hypothetical protein HMPREF0127_01366 [Bacteroides sp. 1_1_30]
gi|345510315|ref|ZP_08789883.1| hypothetical protein BSAG_00519 [Bacteroides sp. D1]
gi|229443018|gb|EEO48809.1| hypothetical protein BSAG_00519 [Bacteroides sp. D1]
gi|262356678|gb|EEZ05768.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292636582|gb|EFF55055.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294443535|gb|EFG12287.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
gi|335946510|gb|EGN08315.1| hypothetical protein HMPREF0127_01366 [Bacteroides sp. 1_1_30]
Length = 515
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/164 (24%), Positives = 79/164 (48%), Gaps = 7/164 (4%)
Query: 68 YYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTS 125
+ + +P I N+++ ++ + K++ P + +SFP PH P Y +MF +
Sbjct: 175 FLEEQHPTKIVNEALAWIEKQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPV 230
Query: 126 HHTPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
T D A +K IL ++ P Q + + ++ +DD ++++++ LK
Sbjct: 231 LKTSRKDLAKKGEKYRILAELEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKA 290
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
G+ NT + SDHG + G++GL++ + E R+P + G
Sbjct: 291 SGQYENTIFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 334
>gi|372211260|ref|ZP_09499062.1| arylsulfatase [Flavobacteriaceae bacterium S85]
Length = 519
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 94/219 (42%), Gaps = 21/219 (9%)
Query: 38 WREWGALIMNSKY---YNYSINMNGRKIKHGE-------DYYNDYYPDLIANDSVRFLRQ 87
WR+ G + Y+ N RK+ GE + + ++ +++V F+
Sbjct: 166 WRKDGKYKKEDAFIYVYDKKGNRTTRKVAKGEKKGPVKTEKDGPHTSEIFGDEAVDFI-- 223
Query: 88 SKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTR 147
SKQ KKP + ++F APH P + +Y M+ TP+Y D +
Sbjct: 224 SKQE-GKKPFFMYLAFHAPHDPRQAPKKYKKMYPEKKMKLTPSYLPQHPFDNGHLYLRDE 282
Query: 148 KMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHL 203
++ P R T + K L + VD + +++K LK+ G+ NT I+ + D G +
Sbjct: 283 QLAPWPR--TKAVAQKELSDYYAIISHVDAQIGRVIKTLKEKGQYENTIIVLSGDSGLAV 340
Query: 204 GQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
G GL+ ++ E I VP ++ G V G D L
Sbjct: 341 GNHGLMGKQNLYDEDGIHVPLIISGK--VKGQGKRFDAL 377
>gi|168206593|ref|ZP_02632598.1| sulfatase family protein [Clostridium perfringens E str. JGS1987]
gi|170661981|gb|EDT14664.1| sulfatase family protein [Clostridium perfringens E str. JGS1987]
Length = 481
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 74/159 (46%), Gaps = 13/159 (8%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+ N+S+ FLR+ S KP L MSF PH P D Y +M+ + D+A
Sbjct: 183 VVNESIDFLRRRDPS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239
Query: 137 PDKQWILQVTRKMQPVHRQFTDLLMTKRLQ-----TLQSVDDAVEKIVKELKDLGELNNT 191
D++ + + V + + KR + ++ +D + + + L + GELNNT
Sbjct: 240 DDEE---NRGKDINCV-KGIINNKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNT 295
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
++ SDHG +G + K P+E RVPF + PG
Sbjct: 296 IFLFVSDHGDMMGDHNWFR-KGIPYEGSSRVPFFIYDPG 333
>gi|168214076|ref|ZP_02639701.1| sulfatase family protein [Clostridium perfringens CPE str. F4969]
gi|170714434|gb|EDT26616.1| sulfatase family protein [Clostridium perfringens CPE str. F4969]
Length = 481
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 73/155 (47%), Gaps = 5/155 (3%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+ N+S+ FLR+ S KP L MSF PH P D Y +M+ + D+A
Sbjct: 183 VVNESIDFLRRRDPS---KPFFLNMSFVRPHSPLDPPKFYFDMYKDEELPEHLMGDWANK 239
Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
D++ + ++ + +++ ++ +D + + + L + GELNNT ++
Sbjct: 240 EDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLF 299
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
SDHG +G + K P+E RVPF + PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSTRVPFFIYDPG 333
>gi|89069431|ref|ZP_01156786.1| probable sulfatase [Oceanicola granulosus HTCC2516]
gi|89045061|gb|EAR51139.1| probable sulfatase [Oceanicola granulosus HTCC2516]
Length = 507
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 43/64 (67%)
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
+D V +++ L+ GEL++T I++T+DHG+ LGQ GLV F +E IR+PF++ PG
Sbjct: 314 IDAEVGRLLDLLEARGELDDTLIVFTTDHGHFLGQHGLVAKGPFHYEDLIRIPFIVAWPG 373
Query: 231 IVPG 234
VP
Sbjct: 374 EVPA 377
>gi|311104529|ref|YP_003977382.1| sulfatase family protein 3 [Achromobacter xylosoxidans A8]
gi|310759218|gb|ADP14667.1| sulfatase family protein 3 [Achromobacter xylosoxidans A8]
Length = 527
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 77/172 (44%), Gaps = 17/172 (9%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF------FNVTSHHTPA 130
+ + ++ LR+ + S KP + SFP PH P +Y +M+ + H + A
Sbjct: 220 VGDQTIAQLRE--HAASGKPFFIQCSFPDPHHPYTPHGKYWDMYQPDDVILPESFHASAA 277
Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ---------TLQSVDDAVEKIVKE 181
+ P P W+ Q + + V T+R ++ +D + ++++
Sbjct: 278 PGHHPPPHVAWMHQQRDQGKAVKHTPAIFACTEREAREAIALNYGSISHIDATIGRVMQA 337
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
L++ G +NT +I+TSDHG + G L+ ++ IR PF+ R P P
Sbjct: 338 LRENGLDDNTVVIFTSDHGDYFGDHQLLLKGPIHYQSLIRTPFIWRDPQAAP 389
>gi|359408966|ref|ZP_09201434.1| arylsulfatase A family protein [SAR116 cluster alpha
proteobacterium HIMB100]
gi|356675719|gb|EHI48072.1| arylsulfatase A family protein [SAR116 cluster alpha
proteobacterium HIMB100]
Length = 509
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 3/98 (3%)
Query: 137 PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
P +Q +Q T++M L+ + +D + ++ +L LG NT II+T
Sbjct: 285 PREQGFMQTTQRMS---EDTLRLIKAYTCGMVSMIDTGIGQLRAQLAQLGLTENTIIIFT 341
Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
SDHG LG GL++ P+ ++VP ++ GPGIV G
Sbjct: 342 SDHGELLGDHGLIRKGPSPYLALLKVPLIITGPGIVAG 379
>gi|221134141|ref|ZP_03560446.1| iduronate-2-sulfatase [Glaciecola sp. HTCC2999]
Length = 1028
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 84/173 (48%), Gaps = 29/173 (16%)
Query: 70 NDYYPD-LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVT 124
+D+Y D +A D+ LR+ + P ++ + F PH P ++ +Y +M+ F V
Sbjct: 188 DDFYKDGKVARDADIALREHQND----PFLMFVGFSKPHLPFNAPKKYWDMYQRDQFTVP 243
Query: 125 SHHTPAYDYAPNPDKQWILQVT-----RKMQPVHRQ-FTDLLMTKRL-----QTLQSVDD 173
S TP D + L +T R + ++ +TD +TK L T+ +D
Sbjct: 244 SRKTP--------DNMFRLALTKWNELRMYGGIPKEGYTDDELTKTLIHAYYATVSYMDA 295
Query: 174 AVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
V K++ L +LG NT +I+ SDHGY LG++G K E D RVP ++
Sbjct: 296 QVGKVLNTLDELGLRENTTVIFMSDHGYKLGEYG-AWNKHTNMELDTRVPLII 347
>gi|154490064|ref|ZP_02030325.1| hypothetical protein PARMER_00293 [Parabacteroides merdae ATCC
43184]
gi|423723059|ref|ZP_17697212.1| hypothetical protein HMPREF1078_01272 [Parabacteroides merdae
CL09T00C40]
gi|154089213|gb|EDN88257.1| arylsulfatase [Parabacteroides merdae ATCC 43184]
gi|409241484|gb|EKN34252.1| hypothetical protein HMPREF1078_01272 [Parabacteroides merdae
CL09T00C40]
Length = 524
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 51/202 (25%), Positives = 89/202 (44%), Gaps = 38/202 (18%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED------------SAPQYANM--F 120
DL+ + ++R++R + +P M+ F A H P D + P+ ANM F
Sbjct: 183 DLVTDKTIRWIRSRDKD---RPFMMCCHFKATHEPWDFPERLKHIYDGVTFPEPANMMEF 239
Query: 121 FNVTSHHT----PAYDYA-------PNPDKQWI--------LQVTRKMQPVHRQFTDLLM 161
S T P D A +PD W ++ K + R+ L+
Sbjct: 240 GPEESGRTFPGQPLEDMARRWGIASKDPDSWWCKYPELPFSIEGMDK-EDARRKIYQKLI 298
Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
L+ ++DD + ++++ L + G NT ++Y SD GY LG+ G K +E +R
Sbjct: 299 RDYLRCGAAIDDNIGRLLRFLDEEGLSENTIVVYVSDQGYFLGEHGFFD-KRMMYEEPLR 357
Query: 222 VPFLMRGPGIVPGTMYLLDVLI 243
+PF++R P +P D+++
Sbjct: 358 MPFVIRYPKEIPAGTRNSDIIL 379
>gi|110799572|ref|YP_694684.1| arylsulfatase [Clostridium perfringens ATCC 13124]
gi|123345068|sp|Q0TUK6.1|SULF_CLOP1 RecName: Full=Arylsulfatase; AltName: Full=Cys-type sulfatase
gi|110674219|gb|ABG83206.1| sulfatase family protein [Clostridium perfringens ATCC 13124]
Length = 481
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 73/155 (47%), Gaps = 5/155 (3%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+ N+S+ FLR+ S KP L MSF PH P D Y +M+ + D+A
Sbjct: 183 VVNESIDFLRRKDPS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239
Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
D++ + ++ + +++ ++ +D + + + L + GELNNT ++
Sbjct: 240 EDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLF 299
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
SDHG +G + K P+E RVPF + PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSSRVPFFIYDPG 333
>gi|449135790|ref|ZP_21771228.1| arylsulfatase A [Rhodopirellula europaea 6C]
gi|448885592|gb|EMB16025.1| arylsulfatase A [Rhodopirellula europaea 6C]
Length = 476
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 31/72 (43%), Positives = 47/72 (65%), Gaps = 4/72 (5%)
Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG----LVKGKSFPFEFDIRV 222
T+Q +D + +I++ LK+LG NNT +I+TSD+G + G+ G L GKS E +RV
Sbjct: 256 TIQELDWSTGEILRSLKELGLDNNTLVIFTSDNGPNTGKGGSAGPLKGGKSSTLEGGVRV 315
Query: 223 PFLMRGPGIVPG 234
PF+ R PG++P
Sbjct: 316 PFVARWPGMIPA 327
>gi|269104927|ref|ZP_06157623.1| choline-sulfatase [Photobacterium damselae subsp. damselae CIP
102761]
gi|268161567|gb|EEZ40064.1| choline-sulfatase [Photobacterium damselae subsp. damselae CIP
102761]
Length = 517
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 76/166 (45%), Gaps = 16/166 (9%)
Query: 68 YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGPEDSAPQ-YANMFFNVT 124
+ N + P+ A+ ++ +LR + ++KP LV+S PH P D PQ Y + F +
Sbjct: 218 HINQWSPEHEADVAINYLRNENGKYRDAEKPFTLVVSMNPPHSPYDQVPQKYLDRFKGQS 277
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKD 184
S P+ QW + P + + + + VD+ +IV EL
Sbjct: 278 SKSLNT-----RPNVQWDKEYLEGYGPNY-------FKEYMAMVNGVDEQFGRIVDELDR 325
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
L N+T +++ SDHG +G G K+ +E +R+P + R PG
Sbjct: 326 LNLANDTLVVFFSDHGCCMGSNGQPT-KNVHYEEAMRIPMIFRWPG 370
>gi|339502714|ref|YP_004690134.1| choline-sulfatase BetC [Roseobacter litoralis Och 149]
gi|338756707|gb|AEI93171.1| choline-sulfatase BetC [Roseobacter litoralis Och 149]
Length = 502
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 98/204 (48%), Gaps = 18/204 (8%)
Query: 73 YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP--- 129
Y D +A ++ R L + +P L +SF PH P + +Y +++ + +H P
Sbjct: 162 YDDEVAFNATRKLYDLSRGADARPWCLTVSFTHPHDPYVARKKYWDLYEDC-AHLQPEVA 220
Query: 130 AYDYAP-NPDKQWILQVT--RKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKD 184
A+DYA +P Q I R R D+ ++R + +DD + +I++ L+D
Sbjct: 221 AFDYADQDPHSQRIFDANDWRSFDITKR---DIERSRRAYFANISYLDDKIGEILQTLED 277
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG--TMYLLDV- 241
+ I++ SDHG LG+ GL SF FE RVP ++ P + PG T + ++
Sbjct: 278 TRQ--EAIILFVSDHGDMLGERGLWFKMSF-FEGSSRVPLMISAPQMEPGLQTTAVSNID 334
Query: 242 LIPQVRKFSSGSLIFIMSFLINLN 265
+ P + + S+ +M++ ++
Sbjct: 335 VCPTLCDLAGVSMDEVMAWTTGVS 358
>gi|423346509|ref|ZP_17324197.1| hypothetical protein HMPREF1060_01869 [Parabacteroides merdae
CL03T12C32]
gi|409219660|gb|EKN12620.1| hypothetical protein HMPREF1060_01869 [Parabacteroides merdae
CL03T12C32]
Length = 524
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 51/202 (25%), Positives = 89/202 (44%), Gaps = 38/202 (18%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED------------SAPQYANM--F 120
DL+ + ++R++R + +P M+ F A H P D + P+ ANM F
Sbjct: 183 DLVTDKTIRWIRSRDKD---RPFMMCCHFKATHEPWDFPERLKHIYDGVTFPEPANMMEF 239
Query: 121 FNVTSHHT----PAYDYA-------PNPDKQWI--------LQVTRKMQPVHRQFTDLLM 161
S T P D A +PD W ++ K + R+ L+
Sbjct: 240 GPEESGRTFPGQPLEDMARRWGIASKDPDSWWCKYPELPFSIEGMGK-EDARRKIYQKLI 298
Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
L+ ++DD + ++++ L + G NT ++Y SD GY LG+ G K +E +R
Sbjct: 299 RDYLRCGAAIDDNIGRLLRFLDEEGLSENTIVVYVSDQGYFLGEHGFFD-KRMMYEEPLR 357
Query: 222 VPFLMRGPGIVPGTMYLLDVLI 243
+PF++R P +P D+++
Sbjct: 358 MPFVIRYPKEIPAGTRNSDIIL 379
>gi|423342537|ref|ZP_17320251.1| hypothetical protein HMPREF1077_01681 [Parabacteroides johnsonii
CL02T12C29]
gi|409217454|gb|EKN10430.1| hypothetical protein HMPREF1077_01681 [Parabacteroides johnsonii
CL02T12C29]
Length = 524
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 51/202 (25%), Positives = 90/202 (44%), Gaps = 38/202 (18%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED------------SAPQYANM--F 120
DL+ + +++++R ++ +P M+ F A H P D + P+ ANM F
Sbjct: 183 DLVTDKTIQWIRSRDKN---QPFMMCCHFKATHEPWDFPERLKHLYDGVTFPEPANMMEF 239
Query: 121 FNVTSHHT----PAYDYA-------PNPDKQWI--------LQVTRKMQPVHRQFTDLLM 161
S T P D A +PD W ++ K + R+ L+
Sbjct: 240 GPEESGRTFPGQPLEDMARRWGIASKDPDSWWCKYPELPFSIEGMNK-EDARRKIYQKLI 298
Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
L+ ++DD + ++++ L + G NT ++Y SD GY LG+ G K FE +R
Sbjct: 299 RDYLRCGAAIDDNIGRLLRFLDEEGLSKNTIVVYVSDQGYFLGEHGFFD-KRMMFEEPLR 357
Query: 222 VPFLMRGPGIVPGTMYLLDVLI 243
+PF++R P +P D+++
Sbjct: 358 MPFVIRYPKEIPAGTRNSDIIL 379
>gi|372210565|ref|ZP_09498367.1| mucin-desulfating sulfatase [Flavobacteriaceae bacterium S85]
Length = 493
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 66/267 (24%), Positives = 102/267 (38%), Gaps = 63/267 (23%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNS---KYYNYSINMNGRKIKHGEDYYNDYYPDL 76
GY + K++ P G+ + L YYN G K+ E Y+ +
Sbjct: 88 GYKTAVIGKWHLGTAPTGFDYYKVLFNKEGQGSYYNPVFETTGNKLVKEEGKYS---TTV 144
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT------------ 124
I N+++ +L+ KQ +K+P ML+ F APH P + P Y + +V
Sbjct: 145 IKNNAISWLK--KQKDAKEPFMLLYQFKAPHRPWEPGPGYEDYLKDVEIPYPTTFNDDYK 202
Query: 125 ------------SHHTPAYDYAPNPDKQWILQV------------------------TRK 148
H D P K Q RK
Sbjct: 203 GRRAAEEAWMRIDGHMNRKDVKIKPPKSLTEQELIDWNKYGNEDGEFWTPDDSMSDQERK 262
Query: 149 MQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGL 208
R D L + ++ VDDA+ +++ L + G NT ++YTSD G+++G+ G
Sbjct: 263 NWKYQRYVKDYL-----RCVKGVDDAIGEMLNYLDESGLAENTIVVYTSDQGFYVGEHGW 317
Query: 209 VKGKSFPFEFDIRVPFLMRGP-GIVPG 234
K F +E + +P LMR P I PG
Sbjct: 318 FD-KRFIYEESLHMPLLMRYPKKIKPG 343
>gi|218258523|ref|ZP_03474879.1| hypothetical protein PRABACTJOHN_00534 [Parabacteroides johnsonii
DSM 18315]
gi|218225399|gb|EEC98049.1| hypothetical protein PRABACTJOHN_00534 [Parabacteroides johnsonii
DSM 18315]
Length = 524
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 51/202 (25%), Positives = 90/202 (44%), Gaps = 38/202 (18%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPED------------SAPQYANM--F 120
DL+ + +++++R ++ +P M+ F A H P D + P+ ANM F
Sbjct: 183 DLVTDKTIQWIRSRDKN---QPFMMCCHFKATHEPWDFPERLKHLYDGVTFPEPANMMEF 239
Query: 121 FNVTSHHT----PAYDYA-------PNPDKQWI--------LQVTRKMQPVHRQFTDLLM 161
S T P D A +PD W ++ K + R+ L+
Sbjct: 240 GPEESGRTFPGQPLEDMARRWGIASKDPDSWWCKYPELPFSIEGMNK-EDARRKIYQKLI 298
Query: 162 TKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
L+ ++DD + ++++ L + G NT ++Y SD GY LG+ G K FE +R
Sbjct: 299 RDYLRCGAAIDDNIGRLLRFLDEEGLSKNTIVVYVSDQGYFLGEHGFFD-KRMMFEEPLR 357
Query: 222 VPFLMRGPGIVPGTMYLLDVLI 243
+PF++R P +P D+++
Sbjct: 358 MPFVIRYPKEIPAGTRNSDIIL 379
>gi|266621313|ref|ZP_06114248.1| N-acetylglucosamine-6-sulfatase, partial [Clostridium hathewayi DSM
13479]
gi|288867025|gb|EFC99323.1| N-acetylglucosamine-6-sulfatase [Clostridium hathewayi DSM 13479]
Length = 428
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 76/170 (44%), Gaps = 23/170 (13%)
Query: 91 SFSKKPIMLVMSFPAPHGPEDSAP-QYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKM 149
S ++P L +S+ PH P + P +Y N + N+ D PN + + T +
Sbjct: 183 SVKEQPFALFLSWNPPHPPFERVPKEYYNRYRNLEP------DLPPNVEGERFDNQTGEP 236
Query: 150 QPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLV 209
R+ + + +D+ +IV LK++ + T ++ T+DHG HLG G V
Sbjct: 237 GFGSREELAEAVRCYYAAITGLDEQFGRIVSWLKEMELYDQTIVLLTADHGEHLGAHGYV 296
Query: 210 KGKSFPFEFDIRVPFLMRGP---------------GIVPGTMYLLDVLIP 244
GK +E I +PFLMR P IVP + LLD+ IP
Sbjct: 297 -GKHTWYEESINIPFLMRYPEKLPAGRNDISVETVDIVPTLLGLLDIAIP 345
>gi|149197958|ref|ZP_01875006.1| putative sulfatase [Lentisphaera araneosa HTCC2155]
gi|149138870|gb|EDM27275.1| putative sulfatase [Lentisphaera araneosa HTCC2155]
Length = 502
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 73/171 (42%), Gaps = 28/171 (16%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF--------------FN 122
N S+ FL Q +P L MSF PH P AP+ +M+ F
Sbjct: 178 TGNKSLEFLENRDQD---RPFFLKMSFQRPHDPITPAPEDFDMYNPEDIVLPESISDLFE 234
Query: 123 VTSHHTPAY--DYAPNP-DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIV 179
P + DY NP D + K++ + L+ +D+ + +++
Sbjct: 235 NKFVGKPQFMQDYVANPGDYPMCVADEAKLKRALASYYALIT--------KIDEEIGRVI 286
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
LK+ GE +NT I YT+DHG G+ GL +E R+PFL++ PG
Sbjct: 287 DHLKETGEYDNTIIFYTADHGDFAGEHGLFLKNLGIYESIHRIPFLLKWPG 337
>gi|383115597|ref|ZP_09936353.1| hypothetical protein BSGG_2533 [Bacteroides sp. D2]
gi|313694998|gb|EFS31833.1| hypothetical protein BSGG_2533 [Bacteroides sp. D2]
Length = 515
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 40/162 (24%), Positives = 78/162 (48%), Gaps = 7/162 (4%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
+ +P I N+++ ++ + K++ P + +SFP PH P Y +MF +
Sbjct: 177 EEQHPTKIVNEALAWIEKQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 232
Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T D A +K IL ++ P Q + + ++ +DD ++++++ LK G
Sbjct: 233 TSRKDLAKKGEKYRILAELEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 292
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
+ NT + SDHG + G++GL++ + E R+P + G
Sbjct: 293 QYENTIFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 334
>gi|373461994|ref|ZP_09553727.1| hypothetical protein HMPREF9944_01991 [Prevotella maculosa OT 289]
gi|371950171|gb|EHO68029.1| hypothetical protein HMPREF9944_01991 [Prevotella maculosa OT 289]
Length = 476
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 61/246 (24%), Positives = 100/246 (40%), Gaps = 35/246 (14%)
Query: 14 YFDISHGYFGKYLNKYNGSYIPPGWREW--GALIMNSKYYNYSINMNGRKIKH------- 64
Y+ +S+G ++ + S+ WR G + ++Y + + +N +H
Sbjct: 123 YYTVSNGKIMHHIEDHAASWSEKPWRSHPDGYDVYWAEYNKWELWLNSESGRHINPKTMR 182
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSK-----KPIMLVMSFPAPHGPEDSAPQYANM 119
G Y PD A D R L ++ + + KP L F PH P + +Y ++
Sbjct: 183 GPFYEWADVPD-SAYDDGRVLGKTIEDLRRLRDNGKPFFLACGFWKPHLPFCAPKKYWDL 241
Query: 120 FFNVTSHHTPAYDYAPNP-----------DKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
+ PA A NP + + I P + F L+ T +
Sbjct: 242 Y--------PALPMADNPFRPDGLPEEVQNSKEINAYALATAPDDKAFLRLVKTGYYACI 293
Query: 169 QSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
VD +++ L+ LG +NT +I DHG+ LG+ G V GK + RVP L+R
Sbjct: 294 SYVDALFGELMTALETLGLKDNTAVILLGDHGWDLGEHGFV-GKHNLMDITTRVPLLIRV 352
Query: 229 PGIVPG 234
PG+ G
Sbjct: 353 PGLRGG 358
>gi|410632053|ref|ZP_11342721.1| choline-sulfatase [Glaciecola arctica BSs20135]
gi|410148339|dbj|GAC19588.1| choline-sulfatase [Glaciecola arctica BSs20135]
Length = 493
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 83/179 (46%), Gaps = 19/179 (10%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
++IA+ +V +L + K + KP ++ + F APH S Y + + A+
Sbjct: 193 EIIADAAVEYL-EKKPDVNDKPFLMYVGFLAPHDTRQSPKSYVDKYPQEAIPLPKAFKKQ 251
Query: 135 PNPDK-------QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
D+ + +L + RK V + + + + +D V +I+ L+
Sbjct: 252 NTVDQGDFWVRDEVLLDIPRKPDEVKK-----FIGEYYAMINHMDTQVGRILDSLEKSPY 306
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI-----VPGTMYLLDV 241
NT II+TSDHG +G+ GL+ GK ++ IR PF++ G GI + G +YL V
Sbjct: 307 AENTIIIFTSDHGLAVGKHGLL-GKQNQYDHSIRAPFIIAGKGIDKGQKIKGNIYLNSV 364
>gi|398350561|ref|YP_006396025.1| choline-sulfatase BetC [Sinorhizobium fredii USDA 257]
gi|390125887|gb|AFL49268.1| choline-sulfatase BetC [Sinorhizobium fredii USDA 257]
Length = 511
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 79/165 (47%), Gaps = 9/165 (5%)
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
+AN + L + ++P L +SF PH P + ++ +++ + H P P
Sbjct: 169 FLANQKLYQLSRENDDEGRRPWCLTVSFTHPHDPYVARRKFWDLYEDC-EHLLPEVGALP 227
Query: 136 ----NPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELN 189
+P Q I+ ++ + ++ ++R + +D+ V ++V L L+
Sbjct: 228 FDRQDPHSQRIM-LSCDYKNFKLTEENIRQSRRAYFANISYLDEKVGELVDTLTRTRMLD 286
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
NT I++ SDHG LG+ GL +F FE RVP ++ GPG+ PG
Sbjct: 287 NTLILFCSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGPGVAPG 330
>gi|149196556|ref|ZP_01873610.1| mucin-desulfating sulfatase [Lentisphaera araneosa HTCC2155]
gi|149140236|gb|EDM28635.1| mucin-desulfating sulfatase [Lentisphaera araneosa HTCC2155]
Length = 545
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 64/250 (25%), Positives = 107/250 (42%), Gaps = 45/250 (18%)
Query: 1 MAWPVELTSRNLLYFDISHGYFGKYLNKYNGSYIPPGWREWGALIMNS--KYYNYS---- 54
M++PV L RN +GYF ++ K+ + E+ L ++S ++Y ++
Sbjct: 103 MSYPVIL--RN-------NGYFTGFIGKFGFPVNAKNYHEYENLPIDSFDRWYGWTGQGY 153
Query: 55 --INMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDS 112
+ N +K ++Y + A + F+ ++++ KP L +SF A H P
Sbjct: 154 FDTSKNKYMVKFAKEY--PHVTLATAEAACEFIDEAQKQ--DKPFCLSLSFKASHKPFSP 209
Query: 113 APQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP---VHRQFTDL---------- 159
P Y +++ + YD RK+ P + RQ+ +
Sbjct: 210 DPAYDDVYKDTVWKKRANYDEGG----------ARKLPPQAKLGRQYLTIDDFAPEKYQE 259
Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
M K Q + +D AV KIV++L G NT IIY +D+GY G G GK P+E
Sbjct: 260 SMRKYNQLIYGIDQAVGKIVEKLDQTGLSKNTVIIYATDNGYSCGSHGF-GGKVLPYEGP 318
Query: 220 IRVPFLMRGP 229
R P ++ P
Sbjct: 319 ARGPMIIMDP 328
>gi|150398338|ref|YP_001328805.1| sulfatase [Sinorhizobium medicae WSM419]
gi|150029853|gb|ABR61970.1| sulfatase [Sinorhizobium medicae WSM419]
Length = 489
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 44/67 (65%), Gaps = 1/67 (1%)
Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
+Q D+ + +++ LK++G +++T I+Y SDHG H+G+ GL +F FE ++ P +MR
Sbjct: 255 VQRTDEMIGQVLDALKEIGAMDDTLIVYASDHGDHVGERGLWWKHTF-FEESVKFPLVMR 313
Query: 228 GPGIVPG 234
PG +P
Sbjct: 314 LPGAIPA 320
>gi|168187234|ref|ZP_02621869.1| probable sulfatase [Clostridium botulinum C str. Eklund]
gi|169294856|gb|EDS76989.1| probable sulfatase [Clostridium botulinum C str. Eklund]
Length = 504
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/185 (24%), Positives = 81/185 (43%), Gaps = 26/185 (14%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----------- 120
+Y + IA + L Q K + + L SF PH P +A+M+
Sbjct: 193 HYNNWIAERTNALLEQYKDN--DENFFLWSSFFDPHPPYFVPEPWASMYNPDDITLPHYL 250
Query: 121 ----------FNVTSHHTPAY-DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQ 169
T P + +Y DK W+ +Q D+ + + ++
Sbjct: 251 DGEHLTTSPILEKTQEENPDFSEYYEPKDKNWLHGCHSHLQDKEELKKDIAIYYGMVSM- 309
Query: 170 SVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
+D + KI+ +L++LG +T +++TSDHG+ GQ L+ F +E I++P ++R P
Sbjct: 310 -MDHYIGKILDKLEELGMAEDTIVVFTSDHGHFFGQHNLIAKGPFHYEDMIKIPLIVREP 368
Query: 230 GIVPG 234
G+VP
Sbjct: 369 GVVPA 373
>gi|429757503|ref|ZP_19290038.1| arylsulfatase [Actinomyces sp. oral taxon 181 str. F0379]
gi|429174942|gb|EKY16401.1| arylsulfatase [Actinomyces sp. oral taxon 181 str. F0379]
Length = 437
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 67/146 (45%), Gaps = 11/146 (7%)
Query: 96 PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP---- 151
P L +SF PH P D PQ+A ++ P D N + + + R P
Sbjct: 142 PFFLYLSFHRPHAPLD-PPQWAFDSYDQDELTLPGED---NWSDELMSEERRDWDPTALA 197
Query: 152 VHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLV 209
H + D+ + + + +D + + ++ L + G NT I + SDHG +G GL
Sbjct: 198 AHYRERDVRLARAGYYGHMTHIDSQISRFIQTLGEFGLAENTVIAFVSDHGDMMGDHGLW 257
Query: 210 KGKSFPFEFDIRVPFLMRGPGIVPGT 235
+ K +P+E VPF++ G GI PG
Sbjct: 258 R-KGYPYEASSHVPFILYGVGISPGA 282
>gi|428313711|ref|YP_007124688.1| arylsulfatase A family protein [Microcoleus sp. PCC 7113]
gi|428255323|gb|AFZ21282.1| arylsulfatase A family protein [Microcoleus sp. PCC 7113]
Length = 478
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 79/185 (42%), Gaps = 32/185 (17%)
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT--SHHTPAYDYAP 135
A+++ RFL QS +++P L +SF H P +F V + HT + P
Sbjct: 175 ASEAARFL----QSTAQQPFFLYVSFYETHSP---------FWFPVEFRNRHTASEFTVP 221
Query: 136 NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
+ Q+ + + R + + +D V +++ L G +NT +I+
Sbjct: 222 ELTAEDFEQIPAVFRNLTRPEKRGIQAAYYTCTEFMDKNVGLVLEALDQSGHADNTLVIF 281
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG----------------IVPGTMYLL 239
TSDHGY LGQ G + K FE +RV LMR PG I+P L
Sbjct: 282 TSDHGYLLGQHGRFE-KHCCFEGAVRVALLMRFPGVIVSGQASNALVELIDIIPTIFSLC 340
Query: 240 DVLIP 244
DV IP
Sbjct: 341 DVPIP 345
>gi|171915403|ref|ZP_02930873.1| choline-sulfatase [Verrucomicrobium spinosum DSM 4136]
Length = 469
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/174 (28%), Positives = 85/174 (48%), Gaps = 5/174 (2%)
Query: 59 GRKIKH-GEDYYNDYYPDLIAND-SVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY 116
G + +H GE D + D++ D +++FL++ + + + KP L + F APH P + P+Y
Sbjct: 149 GEEFEHAGEHADMDEFVDVVRRDHAIKFLKE-RGADNNKPFFLTIGFIAPHYPLVAPPEY 207
Query: 117 ANMFFN-VTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAV 175
F + V P P L+ RK + V + ++ +DD +
Sbjct: 208 LEHFRDKVPFPEVPPGYVDTLPLNYRHLRNDRKFERVPPALAKRALEGYYARVEWIDDQI 267
Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
+++ LK+ NT +IYTSDHG +LG+ GL K+ F+ RVP ++ P
Sbjct: 268 GMVLEALKNSRFAENTVVIYTSDHGENLGEHGLWW-KNCMFDSGARVPLIVSWP 320
>gi|171913979|ref|ZP_02929449.1| sulfatase family protein [Verrucomicrobium spinosum DSM 4136]
Length = 479
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 80/161 (49%), Gaps = 17/161 (10%)
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF--FNVTSHHTPAYDYAP 135
A + ++ L K KP L + F PH P + +Y +++ + TPA +A
Sbjct: 209 AAEGIKLLEAHKD----KPFFLAVGFYRPHTPYVAPKKYFDLYPLDQIPLPSTPA-GFAE 263
Query: 136 NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQS----VDDAVEKIVKELKDLGELNNT 191
K I T+K + ++ TD + +Q + +D V +++ ++ L NT
Sbjct: 264 QTPKPAI--TTKKNE---QEMTDQQRQEAIQAYYAATSYMDAQVGRLLDAVERLDLAKNT 318
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
I++TSDHGYHLG++G+ + +S FE RVPF++ PG
Sbjct: 319 VIVFTSDHGYHLGEYGMWQKQSL-FEQSARVPFIISAPGAA 358
>gi|266619688|ref|ZP_06112623.1| mucin-desulfating sulfatase [Clostridium hathewayi DSM 13479]
gi|288868740|gb|EFD01039.1| mucin-desulfating sulfatase [Clostridium hathewayi DSM 13479]
Length = 468
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/191 (24%), Positives = 93/191 (48%), Gaps = 23/191 (12%)
Query: 50 YYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP 109
YYN + NG+KI+ Y D+I ++++ F+ + K ++P L + + APH P
Sbjct: 139 YYNAPMFRNGQKIEE-----KGYITDVITDEAISFIDREKNK--EQPFYLSVHYTAPHSP 191
Query: 110 --EDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT 167
+Y +++ + P + P + I + + + F
Sbjct: 192 WINCHPKKYTDLYEDCPFETCPQGEVHPWAKTEVIAGYQKPRESLIGYFA---------A 242
Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGL-VKGK-SFP---FEFDIRV 222
+ ++DD V +I+K+L++ + +T II++SD+G++ G G+ KG +FP ++ ++V
Sbjct: 243 VTAMDDNVGRILKKLEEENLMEDTLIIFSSDNGFNCGHHGIWGKGNGTFPLNMYDSSVKV 302
Query: 223 PFLMRGPGIVP 233
P +M G +P
Sbjct: 303 PLIMCHKGHIP 313
>gi|109897122|ref|YP_660377.1| sulfatase [Pseudoalteromonas atlantica T6c]
gi|109699403|gb|ABG39323.1| sulfatase [Pseudoalteromonas atlantica T6c]
Length = 549
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 98/250 (39%), Gaps = 53/250 (21%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRK-IKHGEDYYNDYYPDLIA 78
GY + K++ P + + L Y++ G K E Y + D++
Sbjct: 136 GYETAIIGKWHLKAEPGAFDYYQVLESQGTYFDPEFRTRGPKPWPENETQYTGHSSDVVT 195
Query: 79 NDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPD 138
+ S+ +L + + KP L+ F APH AP+Y + A + P PD
Sbjct: 196 DLSIEWL---ENRVANKPFFLMHQFKAPHDMFKYAPRYEDFL---------AAETIPEPD 243
Query: 139 KQWILQVT---------------------------RKM------------QPVHRQFTDL 159
+ + T R M + RQ
Sbjct: 244 DLYAVAKTFGSIATRGKNDTLRADIGTSVSRRNNRRSMGIDLGVDPNLSEEEFTRQAYQK 303
Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
+ L+ ++ VDD V ++++ L+D G+ NT IIYTSD G LG+ L + K + F+
Sbjct: 304 YLKAYLRCVKGVDDNVARLIQTLRDTGQYKNTIIIYTSDQGMMLGEHDL-QDKRWIFDES 362
Query: 220 IRVPFLMRGP 229
IR+P +++ P
Sbjct: 363 IRMPLIVKHP 372
>gi|334346371|ref|YP_004554923.1| sulfatase [Sphingobium chlorophenolicum L-1]
gi|334102993|gb|AEG50417.1| sulfatase [Sphingobium chlorophenolicum L-1]
Length = 570
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/235 (23%), Positives = 110/235 (46%), Gaps = 31/235 (13%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY---------NYSINMNGRKIKHGEDYYN 70
+FGK+ + PG+ +W + Y+ + +N++G+ +
Sbjct: 138 AFFGKWHMGASTDAPRPGFDKWVSFKGQGNYWPARAATPESSPDLNVDGKAVPQ-----K 192
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV------T 124
Y D + + ++ +L SK+ K+P L +S H P+YA+ + + +
Sbjct: 193 GYITDELTDYAMNWL--SKERDPKRPFFLYLSHKGVHSDPLPPPRYAHQYDDAKFTLPAS 250
Query: 125 SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF-TDLLMTKRLQ----TLQSVDDAVEKIV 179
+ +TP + A P W+L + + D+ MT+ L+ TL ++DD++ +I+
Sbjct: 251 AANTPENN-AGKP--VWVLNQRNSWHGIDFFYNADVPMTEYLKYYYGTLSAIDDSLGRIM 307
Query: 180 KELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L+ +T +++TSD+G+ +G+ GL+ K +E +RVP ++ PG VP
Sbjct: 308 AYLRKNHLEKDTLVVFTSDNGFQVGEHGLID-KRNAYEASVRVPLVVWEPGTVPA 361
>gi|336399820|ref|ZP_08580620.1| sulfatase [Prevotella multisaccharivorax DSM 17128]
gi|336069556|gb|EGN58190.1| sulfatase [Prevotella multisaccharivorax DSM 17128]
Length = 514
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 66/273 (24%), Positives = 105/273 (38%), Gaps = 72/273 (26%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GY + K++ + +P G+ W LI YYN NG++I+ Y +++ +
Sbjct: 117 GYQTAMIGKWHLTSLPTGFDYWDILIGQGDYYNPDFIRNGKRIRRP-----GYVTNIVTD 171
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPH---GPEDS------------------------ 112
++ +L S + KP L++ APH P+
Sbjct: 172 VALDWL--SNKRDKSKPFCLLLHNKAPHRVWAPDTCDLDLYNDRKYPMPDNFYDNYEGRL 229
Query: 113 APQYANMFFNVTSHHTPAYD---YAPN---PDKQ---WILQVTRKMQPVHRQFTDL---- 159
A Q M N+ P YD Y P+ DK+ W +M P R D
Sbjct: 230 AAQKQKM--NIIKDMNPVYDLKMYDPDVHTNDKELETWGFNNYNRMSPSQRAQWDRHYKP 287
Query: 160 ----------------------LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
M + + S+D V ++++ LKD L+NT I+YTS
Sbjct: 288 IIEKFKRDSLKGKALAEWKYQRYMHDYCRVITSIDRNVGRVIQYLKDNNLLDNTMIVYTS 347
Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
D G+++G+ G K F +E R P L+ PG
Sbjct: 348 DQGFYMGEHGFFD-KRFMYEESFRTPLLVYIPG 379
>gi|414344124|ref|YP_006985645.1| hypothetical protein B932_3169 [Gluconobacter oxydans H24]
gi|411029459|gb|AFW02714.1| hypothetical protein B932_3169 [Gluconobacter oxydans H24]
Length = 509
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 75/166 (45%), Gaps = 20/166 (12%)
Query: 76 LIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
+ ++RFLR + F +KP L + + H P P V S+ ++
Sbjct: 187 FLTEQAIRFLRSEQGLFRSDRKPFFLFVGYTDTHSPHSDMPPEL-----VESYAGASFID 241
Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ---TLQSVDDAVEKIVKELKDLGELNN 190
P I V + PV R + KR+Q ++D V +I+ EL LGE +N
Sbjct: 242 IPKEQFSAIHGVAER--PVDRN-SGTEREKRMQYYAAATAIDREVGRILDELDHLGEADN 298
Query: 191 TYIIYTSDHGYHLGQFGL------VKGKSFPFEFDIRVPFLMRGPG 230
T+I+YTSDHG + G G+ + ++F FE IRVP + P
Sbjct: 299 TFIVYTSDHGLNAGHHGIWEKGNATRPQNF-FEESIRVPCTISWPA 343
>gi|429856689|gb|ELA31586.1| mucin-desulfating sulfatase (n-acetylglucosamine-6-sulfatase)
[Colletotrichum gloeosporioides Nara gc5]
Length = 875
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 49/83 (59%), Gaps = 1/83 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M + L+ +QSVDD V +++ L+D G +NT +IYTSD G+ LG+ G K F +E
Sbjct: 287 MQRYLRVIQSVDDNVGRMLNYLEDNGLADNTIVIYTSDQGFFLGEHGWFD-KRFMYEESF 345
Query: 221 RVPFLMRGPGIVPGTMYLLDVLI 243
++PFL+R P + D++
Sbjct: 346 QMPFLIRYPAAIAKGAVCDDIIC 368
>gi|445061849|ref|ZP_21374332.1| sulfatase [Brachyspira hampsonii 30599]
gi|444506776|gb|ELV07055.1| sulfatase [Brachyspira hampsonii 30599]
Length = 474
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 74/145 (51%), Gaps = 8/145 (5%)
Query: 96 PIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWIL-QVTRKMQPVHR 154
P+ + +SF PH P D +Y +++ ++ + + +++ Q R P
Sbjct: 197 PLFMWVSFVDPHHPFDPIKKYYDIYKDINPKELNSKLKLDSRRPEYLTKQGDRGYWPGGG 256
Query: 155 QFTD-----LLMTKRL--QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG 207
+ D + KRL + +D + +I+++LK+ E +NT I++TSDHG +LG +G
Sbjct: 257 EEHDYSEEEIKEIKRLYYAMISFIDSQIGRIIEKLKEKNEWDNTIIVFTSDHGEYLGDYG 316
Query: 208 LVKGKSFPFEFDIRVPFLMRGPGIV 232
L++ F ++ I+ P L G GIV
Sbjct: 317 LLRKGPFMYDCLIKTPLLFYGKGIV 341
>gi|110679304|ref|YP_682311.1| choline sulfatase [Roseobacter denitrificans OCh 114]
gi|109455420|gb|ABG31625.1| choline sulfatase [Roseobacter denitrificans OCh 114]
Length = 502
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 97/204 (47%), Gaps = 18/204 (8%)
Query: 73 YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP--- 129
Y D +A ++ R L + +P L +SF PH P + +Y +++ + +H P
Sbjct: 162 YDDEVAFNATRKLYDLSRGADARPWCLTVSFTHPHDPYVARKKYWDLYEDC-AHLQPEVA 220
Query: 130 AYDYAP-NPDKQWILQVT--RKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKD 184
A+DYA +P Q I R R D+ ++R + +DD + I++ L+D
Sbjct: 221 AFDYADQDPHSQRIFDANDWRSFDITKR---DIERSRRAYFANISYLDDKIGDILQTLED 277
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG--TMYLLDV- 241
+ I++ SDHG LG+ GL SF FE RVP ++ P + PG T + ++
Sbjct: 278 TRQ--EAIILFVSDHGDMLGERGLWFKMSF-FEGSSRVPLMISAPQMEPGLQTTAVSNID 334
Query: 242 LIPQVRKFSSGSLIFIMSFLINLN 265
+ P + + S+ +M++ ++
Sbjct: 335 VCPTLCDLAGVSMDEVMAWTTGVS 358
>gi|196230533|ref|ZP_03129395.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196225463|gb|EDY19971.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 497
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 71/149 (47%), Gaps = 17/149 (11%)
Query: 95 KPIMLVMSFPAPHGPEDSAPQYANMF--------FNVTSHHTPAYDYAPNPDKQWILQVT 146
+P L ++F APH P P + + N S H +D+ + +L +
Sbjct: 225 EPFFLHVAFTAPHDPRLLPPGWETKYDPKTMPLPKNFRSVH--PFDHGNMGGRDEVLLAS 282
Query: 147 -RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQ 205
R+ V + + + +D+ + +IV+ LK G+L+NT II+TSD G +G
Sbjct: 283 PRRPDEVRAE-----LAAYYAAISGMDEQIGRIVEALKSTGQLDNTLIIFTSDQGLAVGS 337
Query: 206 FGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
GL+ GK +E + VP +M GPGI G
Sbjct: 338 HGLI-GKQNLYEHTLGVPLIMSGPGIPKG 365
>gi|254445484|ref|ZP_05058960.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
gi|198259792|gb|EDY84100.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
Length = 495
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/195 (22%), Positives = 89/195 (45%), Gaps = 41/195 (21%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHH 127
++ +++A+D+ F+ Q+ S P + ++F A H P S ++ + + + ++
Sbjct: 200 HWSEVVASDAEDFMLQA--SGRTGPFFMYLAFNAAHDPRQSPKEFVDRYSAESIPIPENY 257
Query: 128 TPAYDY---------------APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVD 172
P Y Y AP P ++ ++V R ++ + +D
Sbjct: 258 QPIYPYREEMGSGEGLRDEILAPFPRTEYAVRVHR--------------SEYYAVISHLD 303
Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
+ +I+K L+ G + TYI +T+DHG G GL+ GK +E ++ P ++ GPG+
Sbjct: 304 AQIGRILKALEASGRRDRTYIFFTADHGLACGHHGLL-GKQNMYEHSMKAPLIVLGPGLP 362
Query: 233 PGT-----MYLLDVL 242
G +Y+ D++
Sbjct: 363 GGQRRKAPVYIQDIM 377
>gi|395005731|ref|ZP_10389600.1| arylsulfatase A family protein [Acidovorax sp. CF316]
gi|394316287|gb|EJE53017.1| arylsulfatase A family protein [Acidovorax sp. CF316]
Length = 487
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 89/195 (45%), Gaps = 27/195 (13%)
Query: 56 NMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ 115
M G I GE +Y Y ++ +V ++R + + +P +L + APH P AP+
Sbjct: 144 RMLGEHIGAGESHYTAYDREVTGR-AVEWIRDAG-TRPDEPFVLYVGLVAPHFPL-VAPE 200
Query: 116 YANMFFNVTSHHT-------PAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTL 168
FF + S PA Y +P W+ Q +F +R Q
Sbjct: 201 E---FFALYSSEVMPEPKLQPASGYQRHP---WV-QAYADFMDNEGEFKS--PEERRQAF 251
Query: 169 QS-------VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
+ +D V +I + L+D G +T ++YTSDHG +LG GL GKS ++ +R
Sbjct: 252 VAYYALCSFLDHNVGRITEALRDAGLEGDTTLVYTSDHGDNLGTRGLW-GKSTLYQESVR 310
Query: 222 VPFLMRGPGIVPGTM 236
VP L+ GPG+ PG
Sbjct: 311 VPMLVAGPGVQPGVC 325
>gi|373252949|ref|ZP_09541067.1| choline-sulfatase [Nesterenkonia sp. F]
Length = 527
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 52/196 (26%), Positives = 93/196 (47%), Gaps = 22/196 (11%)
Query: 79 NDSVRFLRQSKQSFSK-KPIMLVMSFPAPHGPEDSAPQYANMFF-----NVTSHHTPAYD 132
ND VR + ++ + +P ++V SF PH P + ++ + F + + P+ +
Sbjct: 188 NDRVRANQWAEAHGEQHRPFLMVTSFIHPHDPYEPPREHWDRFGEGEIPDPKNPQVPSVE 247
Query: 133 YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKELKDLGELNN 190
P+ + + + P + D+ +R + +DD V +I L+++G N
Sbjct: 248 EDPHSHRLRAMAGFDERDP---EIEDVRNARRAYYAAVSYIDDHVGRIRAHLEEVGLAEN 304
Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQVRKFS 250
T II T+DHG LG+ GL K P+E RVP ++ GP D L+P+ R +
Sbjct: 305 TVIIVTADHGDMLGEKGLWY-KMSPYERSSRVPMIVNGP----------DHLVPKGRYRT 353
Query: 251 SGSLIFIMSFLINLNT 266
SL+ +M L++L +
Sbjct: 354 PVSLLDLMPTLMDLGS 369
>gi|171914267|ref|ZP_02929737.1| choline sulfatase [Verrucomicrobium spinosum DSM 4136]
Length = 496
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 47/75 (62%), Gaps = 1/75 (1%)
Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
L+ L ++ +D V ++++ L+ G+L+NT I+ DHG+HLG+ G + GK+ +E
Sbjct: 274 LVRAYLASISFMDSQVGRVLEALQASGQLDNTIIVVWGDHGWHLGEKG-ITGKNTLWERS 332
Query: 220 IRVPFLMRGPGIVPG 234
RVPF+ GPGI G
Sbjct: 333 TRVPFIWAGPGITRG 347
>gi|171912751|ref|ZP_02928221.1| putative exported sulfatase [Verrucomicrobium spinosum DSM 4136]
Length = 541
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 57/256 (22%), Positives = 102/256 (39%), Gaps = 52/256 (20%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GY + K++ P G+ W L YYN + NG+ +KH Y D+I +
Sbjct: 122 GYQTAIVGKWHLMSDPTGFDYWQILPGQGIYYNPPMIDNGKPVKH-----EGYVTDIITD 176
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA------------PQYANMFFNVTSHH 127
++ +L+ ++ KP +++ APH D A P+ +F + +
Sbjct: 177 LTLDWLKNRDKN---KPFLMMCQHKAPHREWDPALRHLGHDGDRKYPEPETLFDDYENRG 233
Query: 128 TPAYDYAPNPDKQWI-----LQVTRKMQPVHRQFTDLLMTKR------------------ 164
D +K + L M P + D R
Sbjct: 234 VAERDQDMTLEKTFTPKDAKLVAPASMTPEQKAKWDAYYGPRNAEFEKSKPTGKDLVRWR 293
Query: 165 --------LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
L +++VD+ V +++K L D G +T ++Y++D G++LG+ G K + F
Sbjct: 294 YQRYMHDYLGCVKAVDEGVARLLKYLDDEGLAEDTIVVYSADQGFYLGEHGWFD-KRWIF 352
Query: 217 EFDIRVPFLMRGPGIV 232
E +R P ++R PG+
Sbjct: 353 EESLRAPLMVRWPGVA 368
>gi|414164695|ref|ZP_11420942.1| choline-sulfatase [Afipia felis ATCC 53690]
gi|410882475|gb|EKS30315.1| choline-sulfatase [Afipia felis ATCC 53690]
Length = 487
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 83/177 (46%), Gaps = 11/177 (6%)
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
GE Y Y D+ A + +L++ ++ KP +L +S PH P + P++ ++N
Sbjct: 153 GESPYTFYDRDITAR-AQTWLQEEAPKYADKPWVLFVSLVCPHFPLTAPPEHFYRYYNDP 211
Query: 125 SHHTPA-YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR------LQTLQSVDDAVEK 177
+P Y+ P+ ++ V+ +F D R L +D+ V K
Sbjct: 212 RLASPKLYEDMHRPNHPYLQDYAGSF--VYDEFFDTPDKLRRGLAGYLGLCSFLDENVGK 269
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
++ L D + T +IYTSDHG ++G GL GKS +E VPF+M GP I G
Sbjct: 270 LLACLTDTNLSDTTRVIYTSDHGDNMGARGLW-GKSTMYEETAAVPFIMSGPDIPRG 325
>gi|294085689|ref|YP_003552449.1| sulfatase [Candidatus Puniceispirillum marinum IMCC1322]
gi|292665264|gb|ADE40365.1| Sulfatase [Candidatus Puniceispirillum marinum IMCC1322]
Length = 513
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 75/156 (48%), Gaps = 10/156 (6%)
Query: 95 KPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
+P +V SF PH P + P++ +++ + PAY + + ++ ++
Sbjct: 189 QPFCMVASFIHPHDPYVARPEWWDLY-DQADIDMPAYTLTRDQQDAFSQRLMDGIEASTV 247
Query: 155 QFT-DLLMTKRLQTLQSV---DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVK 210
T D + R + D V IV+ L + G+L+NT I T+DHG LG+ GL
Sbjct: 248 AVTEDEIRNARHAYYANTSYFDSKVGDIVRTLDETGQLDNTIIFITADHGDMLGERGLWY 307
Query: 211 GKSFPFEFDIRVPFLMRGPGIVPGTMY----LLDVL 242
+F FE RVP +M GP + GT+ L+D+L
Sbjct: 308 KMNF-FEHSGRVPLVMAGPNVAKGTVTSPCSLVDIL 342
>gi|149176616|ref|ZP_01855228.1| choline sulfatase [Planctomyces maris DSM 8797]
gi|148844495|gb|EDL58846.1| choline sulfatase [Planctomyces maris DSM 8797]
Length = 479
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/167 (28%), Positives = 76/167 (45%), Gaps = 19/167 (11%)
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP--------EDSAPQYANMFFNVTSHHTP 129
A+ ++ F+ + Q KP L + F APH P ++ P + N H
Sbjct: 200 ADAAIEFVERKHQ----KPFFLHVCFTAPHDPLLMPIGYEQNYDPDQMPVPANFLPQH-- 253
Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELN 189
+D+ + L + + + + L + + +D V +IVK LK GE
Sbjct: 254 PFDHGNFDGRDEALLPWPRTKEIVKNDLSLYYS----VISHLDAQVGRIVKALKKTGEWE 309
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
NT +I++SDHG +G GL +GK +E + VP +M GPGI T+
Sbjct: 310 NTILIFSSDHGLAMGSHGL-RGKQNMYEHTVNVPLIMVGPGIPADTL 355
>gi|212692494|ref|ZP_03300622.1| hypothetical protein BACDOR_01990 [Bacteroides dorei DSM 17855]
gi|237709186|ref|ZP_04539667.1| arylsulfatase [Bacteroides sp. 9_1_42FAA]
gi|265752713|ref|ZP_06088282.1| arylsulfatase [Bacteroides sp. 3_1_33FAA]
gi|294777453|ref|ZP_06742904.1| arylsulfatase [Bacteroides vulgatus PC510]
gi|345514224|ref|ZP_08793737.1| arylsulfatase [Bacteroides dorei 5_1_36/D4]
gi|423230606|ref|ZP_17217010.1| hypothetical protein HMPREF1063_02830 [Bacteroides dorei
CL02T00C15]
gi|423240767|ref|ZP_17221881.1| hypothetical protein HMPREF1065_02504 [Bacteroides dorei
CL03T12C01]
gi|423244315|ref|ZP_17225390.1| hypothetical protein HMPREF1064_01596 [Bacteroides dorei
CL02T12C06]
gi|212664969|gb|EEB25541.1| arylsulfatase [Bacteroides dorei DSM 17855]
gi|229437205|gb|EEO47282.1| arylsulfatase [Bacteroides dorei 5_1_36/D4]
gi|229456882|gb|EEO62603.1| arylsulfatase [Bacteroides sp. 9_1_42FAA]
gi|263235899|gb|EEZ21394.1| arylsulfatase [Bacteroides sp. 3_1_33FAA]
gi|294448521|gb|EFG17070.1| arylsulfatase [Bacteroides vulgatus PC510]
gi|392630750|gb|EIY24736.1| hypothetical protein HMPREF1063_02830 [Bacteroides dorei
CL02T00C15]
gi|392642496|gb|EIY36262.1| hypothetical protein HMPREF1064_01596 [Bacteroides dorei
CL02T12C06]
gi|392643729|gb|EIY37478.1| hypothetical protein HMPREF1065_02504 [Bacteroides dorei
CL03T12C01]
Length = 508
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 68/257 (26%), Positives = 104/257 (40%), Gaps = 69/257 (26%)
Query: 20 GYFGKYLNKYNGSYIPP---GWREWGALIMNSKYYNYSINMNGRKIKH------------ 64
G FGK+ Y GS P G E+ I + + Y N R K
Sbjct: 142 GMFGKWAGGYEGSVSTPDKRGIDEYYGYICQFQAHLYYPNFLNRYSKSAGDTAVVRVVMD 201
Query: 65 --------GEDYYN--DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG----PE 110
G+DY+ Y D+I +++++L + K+P + ++ PH PE
Sbjct: 202 ENINYPMFGKDYFKRPQYSADMIHEEAMKWLDKQD---GKQPFFGIFTYTLPHAELAQPE 258
Query: 111 DSA-PQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP---VHRQFTDLLMTKRLQ 166
DS Y FF DK W Q + P H QF ++
Sbjct: 259 DSILTGYQKKFFE---------------DKTWGGQEGSRYNPSVHTHAQFAGMIT----- 298
Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYH---------LGQFGLVKG-KSFPF 216
+D V +++ +LK+ G NT +I+TSD+G H G+ G ++G K +
Sbjct: 299 ---RLDYYVGEVLNKLKEKGLDENTIVIFTSDNGPHEEGGADPTFFGRDGKLRGLKRQCY 355
Query: 217 EFDIRVPFLMRGPGIVP 233
E IR+PF++R PG VP
Sbjct: 356 EGGIRIPFIVRWPGKVP 372
>gi|298290556|ref|YP_003692495.1| sulfatase [Starkeya novella DSM 506]
gi|296927067|gb|ADH87876.1| sulfatase [Starkeya novella DSM 506]
Length = 480
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 79/176 (44%), Gaps = 9/176 (5%)
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
GE Y Y D++A + +LR + KP L +SF APH P + P++ +++
Sbjct: 153 GESSYTFYDRDILARAQI-WLRGAAARARAKPWALFVSFVAPHFPLTAPPEHYYRYYDDA 211
Query: 125 SHHTPA-YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT-----LQSVDDAVEKI 178
P Y+ P+ ++ + F +R + +D+ + K+
Sbjct: 212 RLQWPKLYETQARPEHPYLREYAECFA-YDEHFAGPDDVRRALAGYYGLVGFLDENIGKL 270
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+ L G +T ++Y SDHG +LG GL GKS +E + VP + GPG PG
Sbjct: 271 LATLDATGLSQDTRVVYVSDHGDNLGARGLW-GKSTMYEEAVSVPLIAAGPGFEPG 325
>gi|103486502|ref|YP_616063.1| sulfatase [Sphingopyxis alaskensis RB2256]
gi|98976579|gb|ABF52730.1| sulfatase [Sphingopyxis alaskensis RB2256]
Length = 515
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 48/76 (63%), Gaps = 2/76 (2%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M + L T+ +VD+ V ++ L+D G NT ++YTSD G++LG+ G K F +E +
Sbjct: 308 MQQYLGTIAAVDEGVGAVLDYLEDSGLDRNTIVVYTSDQGFYLGEHGWFD-KRFIYEESM 366
Query: 221 RVPFLMRGPG-IVPGT 235
R PFL++ PG I PGT
Sbjct: 367 RTPFLIQYPGHIRPGT 382
>gi|325263385|ref|ZP_08130119.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Clostridium sp. D5]
gi|324031094|gb|EGB92375.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Clostridium sp. D5]
Length = 498
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 52/201 (25%), Positives = 91/201 (45%), Gaps = 44/201 (21%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----------- 120
Y D+I + + R+L++ +S KP ++ + APH + ++ +MF
Sbjct: 151 YVTDIITDMTTRWLKERDRS---KPFFVMCNHKAPHDFWEYHERHEDMFAEKEIPVPCSL 207
Query: 121 FNVTSHHTPAY-----------------------DYAPNP-DKQWILQVTRKMQPVHRQF 156
F H +PA DY P K + K ++++
Sbjct: 208 FENKVHRSPATRERGSSVTPRSKIRSLYEDFCRPDYVTGPLTKTETMSFEEKGYAAYQKY 267
Query: 157 TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
+ L+T+ ++D++V I+ EL++ G L++T +IYTSD G LG+ + K + F
Sbjct: 268 ----LKDYLRTVAAIDESVGNILNELEEQGILDDTIVIYTSDQGIFLGEHDY-QDKRWSF 322
Query: 217 EFDIRVPFLMRGPG-IVPGTM 236
E +R P L+R P I PGT+
Sbjct: 323 EESLRTPMLVRYPKEIPPGTV 343
>gi|399993702|ref|YP_006573942.1| arylsulfatase [Phaeobacter gallaeciensis DSM 17395 = CIP 105210]
gi|398658257|gb|AFO92223.1| putative arylsulfatase [Phaeobacter gallaeciensis DSM 17395 = CIP
105210]
Length = 508
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 86/192 (44%), Gaps = 35/192 (18%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF-------NVT 124
+Y + I + + FL + ++P L +S+P PH P + +M+ +T
Sbjct: 195 HYNNWITDRACGFLEERD---GEQPFFLFVSYPDPHHPFSPPAPWCDMYDPQEVPAPALT 251
Query: 125 SHH---TPAY--------------DYAPNPD---KQWILQVTRKMQPVH-RQFTDLLMTK 163
+ P+Y D+ NP +Q +Q T++ RQ +
Sbjct: 252 ADELAAMPSYILDGDREEAGKSYVDFLRNPGPPREQGFMQTTQRFSEASLRQ----AIAH 307
Query: 164 RLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
+ +D+ + +++ +L+ G +T II+TSDHG LG GL++ P+ + VP
Sbjct: 308 TYGMVSMIDNCIGRLLAQLEAQGLAEDTLIIFTSDHGELLGDHGLIRKGPSPYRPLLHVP 367
Query: 224 FLMRGPGIVPGT 235
++ GPG+ PGT
Sbjct: 368 LVIAGPGVAPGT 379
>gi|239624380|ref|ZP_04667411.1| sulfatase [Clostridiales bacterium 1_7_47_FAA]
gi|239520766|gb|EEQ60632.1| sulfatase [Clostridiales bacterium 1_7_47FAA]
Length = 511
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 92/190 (48%), Gaps = 25/190 (13%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP---EDSAPQYANMFFNVTSHHT 128
Y D+I +D++RFL +K+ KP L + + APH P ED + +++ +
Sbjct: 197 YVTDIITDDALRFLEDNKE----KPFYLSVHYTAPHDPWLKEDQPREIWDLYNDC----- 247
Query: 129 PAYDYAPNPDKQ-WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
+ Y PNP + W Q R+ QP + + + ++D + +I+ L
Sbjct: 248 -EFSYIPNPPRHPW--QEARRTQPDDEKQRREYIQGYYTCVTAMDRNIGRIMDYLIANHL 304
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGK---SFP---FEFDIRVPFLMRGPGIVPGTMYLLDV 241
NT +I+ +D+G+++G G V GK +FP +E ++VP ++ G G+ G + +
Sbjct: 305 DENTMVIFLADNGFNIGHHG-VWGKGNATFPLNMYESSVKVPCIIWGAGVRGGKVN--EA 361
Query: 242 LIPQVRKFSS 251
L+ Q F S
Sbjct: 362 LVSQYDLFPS 371
>gi|357384308|ref|YP_004899032.1| choline-sulfatase [Pelagibacterium halotolerans B2]
gi|351592945|gb|AEQ51282.1| choline-sulfatase [Pelagibacterium halotolerans B2]
Length = 477
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 70/157 (44%), Gaps = 31/157 (19%)
Query: 95 KPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
+P L ++F PH P D A+M +V P + + +H
Sbjct: 229 RPWFLTVNFAGPHEPLDVT---ASMRESVKDRQFP---------------MPGRHDGLH- 269
Query: 155 QFTDLLMTKR---LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
TDL T R ++ +D V + V+ L+ G L NT I++ SDHG LG L
Sbjct: 270 --TDLHQTVRQNYAAMVELIDGWVGRYVEILERAGALENTIIVFASDHGEMLGDHNLW-A 326
Query: 212 KSFPFEFDIRVPFLMRGPGIVPGT------MYLLDVL 242
KS PFE +RVP +M GPG P + LLDV+
Sbjct: 327 KSVPFEASVRVPLVMAGPGFEPSAAPVAAPVSLLDVM 363
>gi|330447504|ref|ZP_08311152.1| putative sulfatase yidJ [Photobacterium leiognathi subsp.
mandapamensis svers.1.1.]
gi|328491695|dbj|GAA05649.1| putative sulfatase yidJ [Photobacterium leiognathi subsp.
mandapamensis svers.1.1.]
Length = 517
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 64/275 (23%), Positives = 106/275 (38%), Gaps = 50/275 (18%)
Query: 5 VELTSRNLLYFDI------SHGYFGK-------------YLNKYNGSYIPPGWREWGALI 45
+EL + + D+ S GY GK Y N G Y W +W A
Sbjct: 129 IELKKSTVTWSDLLKQQGYSMGYIGKWHLDAPETPFVPSYNNPMEGRY----WNDWTAPD 184
Query: 46 MNS------KYYNYSINMN-----GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSF-- 92
Y Y ++MN + + N + P+ A+ ++++LR +
Sbjct: 185 RRHGFDFWYAYGTYDLHMNPIYWTNETPRDKPLHINQWSPEHEADIAIKYLRNENGQYRD 244
Query: 93 SKKPIMLVMSFPAPHGPEDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP 151
S +P LV+S PH P D PQ Y + F TS P+ QW P
Sbjct: 245 SSQPFTLVVSMNPPHSPYDQVPQKYLDRFSGKTSQQ-----LNNRPNVQWDKAYQEGYGP 299
Query: 152 VHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
+ + + + VD+ +I+ EL L +T +++ SDHG +G G
Sbjct: 300 KY-------FKEYMAMVNGVDEQFGRIIDELDRLNLTEDTLVVFFSDHGCCMGSNGQPT- 351
Query: 212 KSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
K+ ++ +R+P + R PG + + L P +
Sbjct: 352 KNVHYDEAMRIPMIFRWPGKINAQQHDLLFSAPDI 386
>gi|340757398|ref|ZP_08694000.1| sulfatase [Fusobacterium varium ATCC 27725]
gi|251834661|gb|EES63224.1| sulfatase [Fusobacterium varium ATCC 27725]
Length = 524
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 45/63 (71%)
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
+D + KI+ +L++LG +T I++T+DHG+ +GQ GL++ F +E I++PF+++ PG
Sbjct: 330 MDKYIGKILDKLEELGLAEDTIIVFTTDHGHFVGQHGLIRKGPFHYEDLIKIPFIVKYPG 389
Query: 231 IVP 233
VP
Sbjct: 390 FVP 392
>gi|150003729|ref|YP_001298473.1| arylsulfatase [Bacteroides vulgatus ATCC 8482]
gi|319640049|ref|ZP_07994776.1| arylsulfatase [Bacteroides sp. 3_1_40A]
gi|345517063|ref|ZP_08796541.1| arylsulfatase [Bacteroides sp. 4_3_47FAA]
gi|423313130|ref|ZP_17291066.1| hypothetical protein HMPREF1058_01678 [Bacteroides vulgatus
CL09T03C04]
gi|149932153|gb|ABR38851.1| arylsulfatase precursor [Bacteroides vulgatus ATCC 8482]
gi|254833835|gb|EET14144.1| arylsulfatase [Bacteroides sp. 4_3_47FAA]
gi|317388327|gb|EFV69179.1| arylsulfatase [Bacteroides sp. 3_1_40A]
gi|392686344|gb|EIY79650.1| hypothetical protein HMPREF1058_01678 [Bacteroides vulgatus
CL09T03C04]
Length = 508
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 68/257 (26%), Positives = 104/257 (40%), Gaps = 69/257 (26%)
Query: 20 GYFGKYLNKYNGSYIPP---GWREWGALIMNSKYYNYSINMNGRKIKH------------ 64
G FGK+ Y GS P G E+ I + + Y N R K
Sbjct: 142 GMFGKWAGGYEGSVSTPDKRGIDEYYGYICQFQAHLYYPNFLNRYSKSAGDTAVVRVVMD 201
Query: 65 --------GEDYYN--DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG----PE 110
G+DY+ Y D+I +++++L + K+P + ++ PH PE
Sbjct: 202 ENINYPMFGKDYFKRPQYSADMIHEEAMKWLDKQD---GKQPFFGIFTYTLPHAELAQPE 258
Query: 111 DSA-PQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP---VHRQFTDLLMTKRLQ 166
DS Y FF DK W Q + P H QF ++
Sbjct: 259 DSILTGYQKKFFE---------------DKTWGGQEGSRYNPSVHTHAQFAGMIT----- 298
Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYH---------LGQFGLVKG-KSFPF 216
+D V +++ +LK+ G NT +I+TSD+G H G+ G ++G K +
Sbjct: 299 ---RLDYYVGEVLNKLKEKGLDENTIVIFTSDNGPHEEGGADPTFFGRDGKLRGLKRQCY 355
Query: 217 EFDIRVPFLMRGPGIVP 233
E IR+PF++R PG VP
Sbjct: 356 EGGIRIPFIVRWPGKVP 372
>gi|149199849|ref|ZP_01876878.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
gi|149137020|gb|EDM25444.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
Length = 467
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 58/103 (56%), Gaps = 14/103 (13%)
Query: 165 LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG------YHLGQFG-LVKGKSFPFE 217
+ ++ +DD V +I K+LK+LG NT +IYT+D+G +H G G L GK FE
Sbjct: 250 INVIEHMDDQVGRITKKLKELGLDKNTIVIYTTDNGPWLPFKHHGGSAGELRDGKGSTFE 309
Query: 218 FDIRVPFLMRGPGIVP-GT-----MYLLDVLIPQVRKFSSGSL 254
RVP +M PG++P GT M +D L+P + SS +L
Sbjct: 310 GGQRVPCVMWAPGLIPAGTETNELMTTID-LLPSLAALSSSTL 351
>gi|196232398|ref|ZP_03131251.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196223470|gb|EDY17987.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 517
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 50/77 (64%), Gaps = 2/77 (2%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M L ++SVD++V +I++ L++ G NT +IY+SD G++LG+ G K + FE +
Sbjct: 299 MHDYLACIKSVDESVGRILQYLEEAGLAENTIVIYSSDQGFYLGEHGWFD-KRWIFEESL 357
Query: 221 RVPFLMRGPGIV-PGTM 236
R P L+R PG+ PG++
Sbjct: 358 RAPLLIRWPGVTKPGSV 374
>gi|402222605|gb|EJU02671.1| alkaline phosphatase-like protein [Dacryopinax sp. DJM-731 SS1]
Length = 484
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/87 (42%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M + L+T+QSVDD V +++ L G NT +IYTSD G+ LG+ G K F +E
Sbjct: 257 MQRYLRTIQSVDDNVGRLLDYLDKEGLTENTIVIYTSDQGFFLGEHGWFD-KRFIYEESF 315
Query: 221 RVPFLMRGP-GIVPGTMYLLDVLIPQV 246
++PFL+R P I PGT + D +I V
Sbjct: 316 QMPFLIRYPREIKPGT--VCDDIIQNV 340
>gi|167762012|ref|ZP_02434139.1| hypothetical protein BACSTE_00358 [Bacteroides stercoris ATCC
43183]
gi|167700104|gb|EDS16683.1| arylsulfatase [Bacteroides stercoris ATCC 43183]
Length = 463
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 90/213 (42%), Gaps = 61/213 (28%)
Query: 30 NGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSK 89
N Y P WR N K N N + IKH D L D+++F+ ++K
Sbjct: 179 NDPYYYPYWR-----FNNEKLENVKENEGDKHIKHNTD--------LSTEDAIKFINRNK 225
Query: 90 QSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKM 149
+ P L +++ APH P +N+ T YD D+ W +
Sbjct: 226 NN----PFFLYLAYDAPHEP-----------YNI--DETTWYD-----DEAWDMN----- 258
Query: 150 QPVHRQFTDLLMTKRLQTLQS-VDDAVEKIVKELKDLGELNNTYIIYTSDHGY------- 201
TKR +L + +D A+ +++ EL LG NT +I+ SD+G
Sbjct: 259 ------------TKRYASLITHMDRAIGRLLAELDRLGLRENTLVIFASDNGAAKQAPLE 306
Query: 202 HLGQFGLVKG-KSFPFEFDIRVPFLMRGPGIVP 233
LG G +KG K +E IRVPF++ PG VP
Sbjct: 307 ELGCKGSLKGMKGQLYEGGIRVPFIVNQPGKVP 339
>gi|260587234|ref|ZP_05853147.1| arylsulfatase [Blautia hansenii DSM 20583]
gi|260542429|gb|EEX22998.1| arylsulfatase [Blautia hansenii DSM 20583]
Length = 514
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 83/173 (47%), Gaps = 26/173 (15%)
Query: 68 YYNDYYP-DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSH 126
Y Y+P + + + S+ FLR+ +KP L+ S+ PH P D AP Y +N
Sbjct: 199 YEEKYHPTNWVTDRSIDFLRRKD---PQKPFFLMASYLRPHPPFD-APSYYFDLYNKKEL 254
Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQF------TDLLMTKRLQ-----TLQSVDDAV 175
PA W + T ++Q + R F +D ++ + Q + +D +
Sbjct: 255 TPPAV-------GDW--ETTEELQAMGRVFDSKCGPSDAVLIREAQIGYYACITHLDHQI 305
Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
++++ L + G +NT I++TSDHG L + + KS P++ IR+P ++ G
Sbjct: 306 GRLIQALVEYGVYDNTLILFTSDHGEELCDHHMFR-KSRPYQGSIRIPMIVSG 357
>gi|18309213|ref|NP_561147.1| arylsulfatase [Clostridium perfringens str. 13]
gi|81768190|sp|Q8XNV1.1|SULF_CLOPE RecName: Full=Arylsulfatase; AltName: Full=Cys-type sulfatase
gi|18143889|dbj|BAB79937.1| probable phosphonate monoester hydrolase [Clostridium perfringens
str. 13]
Length = 481
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 73/155 (47%), Gaps = 5/155 (3%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+ N+S+ FLR+ S KP L MSF PH P D Y +M+ + D+A
Sbjct: 183 VVNESIDFLRRRDPS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239
Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
D++ + ++ + +++ ++ +D + + + L + G+LNNT ++
Sbjct: 240 EDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGKLNNTIFLF 299
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
SDHG +G + K P+E RVPF + PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSARVPFFIYDPG 333
>gi|398829781|ref|ZP_10587976.1| choline-sulfatase [Phyllobacterium sp. YR531]
gi|398216083|gb|EJN02641.1| choline-sulfatase [Phyllobacterium sp. YR531]
Length = 506
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/159 (29%), Positives = 76/159 (47%), Gaps = 17/159 (10%)
Query: 85 LRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP----NPDKQ 140
L + +++P L +SF PH P + +Y +++ N + + P+ P +P Q
Sbjct: 176 LSRDSDDAARRPWCLTVSFTHPHDPYVARRKYWDLYENSLALN-PSVGAIPFDDQDPHSQ 234
Query: 141 WILQVTRKMQPVHRQFTDLLMTKR------LQTLQSVDDAVEKIVKELKDLGELNNTYII 194
+LQ H F R L + VDD V ++++ L+ L++T I+
Sbjct: 235 RLLQACD-----HGAFDITPEQVRRARQGYLANITYVDDKVGELLETLETTRMLDDTIIV 289
Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
+TSDHG LG+ GL +F FE RVP ++ G GI P
Sbjct: 290 FTSDHGDMLGERGLWFKMNF-FEGSARVPLMIAGKGITP 327
>gi|325109768|ref|YP_004270836.1| sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324970036|gb|ADY60814.1| sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 561
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 65/274 (23%), Positives = 116/274 (42%), Gaps = 55/274 (20%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKI--KHGEDYYNDYYPDLI 77
GY + K++ P + + L KY+N + G K K+ + N + D I
Sbjct: 114 GYQTAMIGKWHLKVEPADFDYYCVLPGQGKYHNPEFRVRGDKPWGKNLIKFDNMHVTDAI 173
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY---- 133
+ ++ +L++ + KP L+ + APH D+AP+Y + V PA D
Sbjct: 174 TDLTLDWLKEGRDE--DKPFFLMHHYKAPHDYFDNAPRYESYLAGVD---IPAPDSLWKR 228
Query: 134 -------------------------APNPDKQWILQVTRKMQPVH-------RQFTD--- 158
+ NP + ++L + M P + F+D
Sbjct: 229 GDEFGSMATRGANDELVPHIGTSIGSRNPRRSYLLDLP-DMYPNEFPEGYDPKYFSDEEN 287
Query: 159 ------LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGK 212
+ K L+ ++ +DD + ++ L++ GEL+NT IIYT D G+ LG+ + K
Sbjct: 288 KRLAYNAYLRKFLRCVKGIDDNLGRLFAHLEETGELDNTVIIYTGDQGFMLGEHDY-QDK 346
Query: 213 SFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQV 246
+ +E R+PFL+R P +P +D ++ V
Sbjct: 347 RWMYEESQRMPFLVRYPKTIPAGQ-KIDAIVENV 379
>gi|423297327|ref|ZP_17275388.1| hypothetical protein HMPREF1070_04053 [Bacteroides ovatus
CL03T12C18]
gi|392667504|gb|EIY61012.1| hypothetical protein HMPREF1070_04053 [Bacteroides ovatus
CL03T12C18]
Length = 527
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 47/69 (68%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT+++YTSD G+++G+ G K F +E +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTFVVYTSDQGFYMGEHGWF-DKRFMYEESM 375
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 376 RTPLIMRLP 384
>gi|87307553|ref|ZP_01089697.1| mucin-desulfating sulfatase [Blastopirellula marina DSM 3645]
gi|87289723|gb|EAQ81613.1| mucin-desulfating sulfatase [Blastopirellula marina DSM 3645]
Length = 494
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 58/196 (29%), Positives = 79/196 (40%), Gaps = 18/196 (9%)
Query: 41 WGALIMNSKYY---NYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPI 97
WG + Y N S+ K H Y + D+I RQS + KKP
Sbjct: 150 WGGSPGQTDYRTANNKSMKKYATKYPHSTLSYGAFAQDVI--------RQSVEQ--KKPF 199
Query: 98 MLVMSFPAPHGPEDSAPQ----YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVH 153
L +SF APH P P+ YA F + + P + Q R ++ +
Sbjct: 200 CLSISFKAPHKPATPDPRFNEVYAGKKFTKPDNFGREFGEHLAPQSKSGRQFPRFIEWKY 259
Query: 154 RQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKS 213
D M K Q + +D A+ I EL G +NT +IYTSD+GY G G K
Sbjct: 260 DTDYDGEMAKYHQQVYGIDVALGMIRDELAAQGVADNTVVIYTSDNGYICGSHGY-GSKV 318
Query: 214 FPFEFDIRVPFLMRGP 229
P E RVP ++ P
Sbjct: 319 LPMEESSRVPLIIYDP 334
>gi|336426668|ref|ZP_08606677.1| hypothetical protein HMPREF0994_02683 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336010684|gb|EGN40666.1| hypothetical protein HMPREF0994_02683 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 483
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/174 (24%), Positives = 81/174 (46%), Gaps = 5/174 (2%)
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY--DY 133
+A+++ R + ++K+ +P + +F PH P Y +M+ P+Y D+
Sbjct: 194 FVAHETNRLIDKAKEK--DQPFFIWSNFWGPHSPSIVPEPYYSMYDPKLIPEHPSYCEDF 251
Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
P ++ + + + + +DD V IV +LK+ G +T I
Sbjct: 252 HDKPYGYYLTEKMWGLSDYGWSGFQEISARYFGHCTLIDDCVGMIVDKLKECGLYEDTVI 311
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG-TMYLLDVLIPQV 246
IY++DHG +G L++ +F F+ RVP +++G G ++ LL L+P +
Sbjct: 312 IYSADHGDCMGAHRLIEKGAFTFDEIYRVPLVVKGLGTKDNDSLVLLQELMPSI 365
>gi|393784387|ref|ZP_10372552.1| hypothetical protein HMPREF1071_03420 [Bacteroides salyersiae
CL02T12C01]
gi|392666163|gb|EIY59680.1| hypothetical protein HMPREF1071_03420 [Bacteroides salyersiae
CL02T12C01]
Length = 496
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 103/242 (42%), Gaps = 29/242 (11%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYN-YSINMNGRK------IKHGEDYYNDY 72
GY + K++ Y P G+ + L +Y N I+ RK K Y +
Sbjct: 117 GYQTAIIGKWHLKYEPAGFDYYNVLPGQGRYKNPVLISKEERKGNTCPFDKTPGKIYQGH 176
Query: 73 YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV--------- 123
D+IA+ ++ ++++ K +P ML+ F APH + A +++++ +V
Sbjct: 177 STDVIASQAINWMKEHKDD--NQPFMLMCHFKAPHRSWEYAERFSDLLKDVEIPEPENMF 234
Query: 124 ---------TSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDA 174
T T + + D + L + M L+ + +D+
Sbjct: 235 DTYEGRAYYTKLQTMSLEDMNEKDMKCELPANLSRDEFRKWAYQRYMKDYLRCIAGIDEN 294
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVP 233
V +I+K L + G NT I+YTSD G++LG+ G + E + +P L+R P I P
Sbjct: 295 VGRILKFLDENGLAQNTVIVYTSDQGFYLGEHGWFDKRYMQKE-SLNMPLLIRYPNEIKP 353
Query: 234 GT 235
G+
Sbjct: 354 GS 355
>gi|325107824|ref|YP_004268892.1| sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324968092|gb|ADY58870.1| sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 489
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 103/241 (42%), Gaps = 28/241 (11%)
Query: 14 YFDISHGYFGK----YLNKYNG--SYIPPGWREWGALIMNSKYYNYSINMNG-------- 59
Y I H G+ L + NG ++ W A I+ S + ++ M G
Sbjct: 95 YMQIDHQLTGQTTLGQLLRTNGYDTFATGKWHNGKAAILRSFEHGENVFMGGMCDHTKVP 154
Query: 60 -RKIKHGEDYY-----NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA 113
I+ GE N + +L A+ +V +L +KP ++F APH P +
Sbjct: 155 LSDIRDGELVSRGVKPNSFSSELFADAAVNYLNGRD---GEKPFFAYVAFTAPHDPRNPP 211
Query: 114 PQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ---TLQS 170
+Y M+++ PA +P L + ++ + D++ + + +
Sbjct: 212 EKYREMYYS-NPPSLPANFLTQHPFNNGSLVIRDEVLAPWPRPEDMVQRQLCEYYGLISH 270
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
+DD + +I++ L+ NT I+Y +DHG +G GL+ GK +E ++ P + GPG
Sbjct: 271 MDDQIGRILEALEKTPGGENTIIVYAADHGLAMGSHGLL-GKQNLYEHSMKCPLIFSGPG 329
Query: 231 I 231
I
Sbjct: 330 I 330
>gi|440718008|ref|ZP_20898478.1| arylsulfatase A [Rhodopirellula baltica SWK14]
gi|436436890|gb|ELP30586.1| arylsulfatase A [Rhodopirellula baltica SWK14]
Length = 479
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 70/133 (52%), Gaps = 14/133 (10%)
Query: 136 NPDKQWILQVTRKMQ--PVHRQFTDLLMTKRL---QTLQSVDDAVEKIVKELKDLGELNN 190
N DK + + + M P+H + +KR T+Q +D + +I++ LK+LG N
Sbjct: 219 NKDKSFFVYLGHNMPHIPLHASEKFVGSSKRGIYGDTIQELDWSTGEILRALKELGLDQN 278
Query: 191 TYIIYTSDHGYHLGQFG----LVKGKSFPFEFDIRVPFLMRGPGIVP-GTMYLLDV---- 241
T +I+TSD+G + G+ G L GK E +RVPF+ R PG +P GT + +
Sbjct: 279 TLVIFTSDNGPNTGKGGSAGVLKGGKGSTLEGGVRVPFVARWPGTIPAGTEFDEAITGMD 338
Query: 242 LIPQVRKFSSGSL 254
L+P + K + G +
Sbjct: 339 LLPTLTKLAGGEV 351
>gi|269120708|ref|YP_003308885.1| sulfatase [Sebaldella termitidis ATCC 33386]
gi|268614586|gb|ACZ08954.1| sulfatase [Sebaldella termitidis ATCC 33386]
Length = 480
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/180 (26%), Positives = 85/180 (47%), Gaps = 20/180 (11%)
Query: 72 YYP-DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
Y+P + + N+ + FLR+ + +K L +SF PH P D Y NM+ N +
Sbjct: 177 YHPTNWVVNEGINFLRRRDK---RKNFFLKLSFIRPHSPLDPPEYYYNMYINREIDN--- 230
Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ-----TLQSVDDAVEKIVKELKDL 185
P P ++ ++ + Q + M KR + ++ +D + + + LK+
Sbjct: 231 ----PIPAEEENIKEAYNINAAEGQISKEAM-KRAKVAYYGSITHIDHQIGRFLMVLKEN 285
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG--IVPGTMYLLDVLI 243
L + +++ SDHG +G GL + KS P++ I VPF++ PG + G M D L+
Sbjct: 286 DLLKESIVLFVSDHGDLMGDHGLFR-KSMPYQGSIHVPFIVYDPGNFLNGGVMREPDELV 344
>gi|305666713|ref|YP_003863000.1| mucin-desulfating sulfatase [Maribacter sp. HTCC2170]
gi|88708937|gb|EAR01171.1| mucin-desulfating sulfatase [Maribacter sp. HTCC2170]
Length = 525
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 56/267 (20%), Positives = 118/267 (44%), Gaps = 50/267 (18%)
Query: 18 SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINM-----NGRKIKHGEDYYNDY 72
+GY + K++ P G+ + L +Y+N + +G G++Y +
Sbjct: 123 ENGYQTAVIGKWHLVGKPEGFDYYNVLPGQGRYWNPILKSEDNWSDGYDSSTGKEY-KGF 181
Query: 73 YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY- 131
D+IA+ ++ L++ + +KP M+ +F A H P + +Y +++ N+ +
Sbjct: 182 STDVIADLTIEHLQKRDK---EKPFMMFCNFKATHEPFEYPERYNDLYKNIEIPEPESLL 238
Query: 132 DYAPN------------------------PDKQWI-----------LQVTRKMQPVHRQF 156
D+ PN P+ W L +K + ++++
Sbjct: 239 DFDPNNTGRTFHGQTLERLGARWDKATENPETFWTSYPGLPYPLEGLDSIQKRKKIYQK- 297
Query: 157 TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
L+ ++ ++DD + K++ L+D G +NT ++YT+D GY LG+ G K +
Sbjct: 298 ---LVKDFMRCGAAIDDNIGKLLDYLEDQGIADNTIVVYTADQGYFLGEHGFFD-KRMIY 353
Query: 217 EFDIRVPFLMRGPGIVPGTMYLLDVLI 243
E +R+PF++R P + + D+++
Sbjct: 354 EESLRMPFVIRYPKEIREGKRIDDIIL 380
>gi|340619501|ref|YP_004737954.1| sulfatase [Zobellia galactanivorans]
gi|339734298|emb|CAZ97675.1| Sulfatases, family S1-23 [Zobellia galactanivorans]
Length = 522
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/226 (23%), Positives = 97/226 (42%), Gaps = 47/226 (20%)
Query: 59 GRKIKHGEDY--YNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY 116
G+ +++GE+ + +Y D++ ++ +L + + KP +LV+S PH P ++
Sbjct: 178 GKFLRNGEEVGETDGWYVDVLTKEATNWLTKVRDK--SKPFLLVVSTHEPHTPIAPPEKF 235
Query: 117 ANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDA 174
++++ N T L+ T K V R D+ K+ T+ +D A
Sbjct: 236 SSLYDNATVDS---------------LEQTIKYGGVDRTLYDISSNKKEYYGTVSQLDHA 280
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHG--------------------YHLGQFGLVKG-KS 213
++K ++D +NT +I+TSD+G Y G G+ +G K
Sbjct: 281 FSDLMKTVEDENAYDNTMVIFTSDNGPEHPVNLEESQGTWEEPIRDYCFGTPGIYRGMKR 340
Query: 214 FPFEFDIRVPFLMRGPGIVPG-----TMYLLDVLIPQVRKFSSGSL 254
+PFE RVP ++R P +P M + +P V + +
Sbjct: 341 YPFEGGHRVPGIVRYPAKIPAGTTSDAMVVATDFLPTVATLAQAEI 386
>gi|420238024|ref|ZP_14742462.1| choline-sulfatase [Rhizobium sp. CF080]
gi|398089025|gb|EJL79561.1| choline-sulfatase [Rhizobium sp. CF080]
Length = 507
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 76/171 (44%), Gaps = 8/171 (4%)
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAP 135
+AN + L + + ++P L +SF PH P + ++ +++ + H P P
Sbjct: 168 FLANQKLYHLARERDDADRRPWALTVSFTHPHDPYVARKKFWDLYEDC-EHLLPEVGPIP 226
Query: 136 NPD-----KQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
D K+ I K V + +D+ V ++V L L++
Sbjct: 227 LEDQDPHSKRIIFSCDYKNFDVTEDNVRRSRRAYFANISYIDEKVGELVDTLTRTRMLDD 286
Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDV 241
T I++ SDHG LG+ GL SF FE RVP ++ GPG VP ++L V
Sbjct: 287 TLILFCSDHGDMLGERGLWFKMSF-FEGSARVPLMIAGPG-VPAGLHLAPV 335
>gi|319900823|ref|YP_004160551.1| sulfatase [Bacteroides helcogenes P 36-108]
gi|319415854|gb|ADV42965.1| sulfatase [Bacteroides helcogenes P 36-108]
Length = 509
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 68/256 (26%), Positives = 104/256 (40%), Gaps = 69/256 (26%)
Query: 20 GYFGKYLNKYNGSYIPP---GWREWGALIMNSKYYNYSINMNGRKIKH------------ 64
G FGK+ Y GS P G E+ I + + Y N R K
Sbjct: 142 GMFGKWAGGYEGSVSTPDKRGIDEYYGYICQFQAHLYYPNFLNRYSKSLGDTAVVRVVMD 201
Query: 65 --------GEDYYN--DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHG----PE 110
G+DY+ Y D+I +++++L +++P + ++ PH PE
Sbjct: 202 ENIKYPMFGKDYFKRPQYSADMIHKEAMKWLDSQN---AQQPFFGIFTYTLPHAELAQPE 258
Query: 111 DSAP-QYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP---VHRQFTDLLMTKRLQ 166
DS Y FF D+ W Q + P H QF ++
Sbjct: 259 DSILLGYEKKFFT---------------DRTWGGQEGSRYNPSIHTHAQFAGMIT----- 298
Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYH---------LGQFGLVKG-KSFPF 216
+D V +++K+LKD G NT +I+TSD+G H G+ G ++G K +
Sbjct: 299 ---RLDHYVGEVLKKLKDKGLDENTIVIFTSDNGPHEEGGADPAFFGRDGKLRGLKRQCY 355
Query: 217 EFDIRVPFLMRGPGIV 232
E IRVPF++R PG V
Sbjct: 356 EGGIRVPFIVRWPGKV 371
>gi|325264228|ref|ZP_08130960.1| putative mucin-desulfating sulfatase
(N-acetylglucosamine-6-sulfatase) [Clostridium sp. D5]
gi|324030712|gb|EGB91995.1| putative mucin-desulfating sulfatase
(N-acetylglucosamine-6-sulfatase) [Clostridium sp. D5]
Length = 539
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/167 (24%), Positives = 76/167 (45%), Gaps = 13/167 (7%)
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD----- 132
AN + +F+ Q+ Q +KP ++ P PH +A ++ +++ + P YD
Sbjct: 200 ANCANKFIEQADQE--QKPFCFWLTMPRPHQTYAAAKEFWDLYEGIELELPPNYDDKMEH 257
Query: 133 -YAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGE 187
+ D Q Q+ + + D + L+ + +DDAV ++ +L++LG
Sbjct: 258 RSSAAKDTQRKFQMDKDWIAFGDKDFDAARKRVLRGYYACVSQMDDAVGTVLNKLEELGI 317
Query: 188 LNNTYIIYTSDHGYHLGQFGLV-KGKSFPFEFDIRVPFLMRGPGIVP 233
NT IIY +DHG G+ G++ K F ++P + G +P
Sbjct: 318 RENTLIIYMTDHGEFAGEHGMIEKAPGIGFRCVTQIPMIFSWKGRLP 364
>gi|149198280|ref|ZP_01875326.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Lentisphaera araneosa HTCC2155]
gi|149138576|gb|EDM26983.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Lentisphaera araneosa HTCC2155]
Length = 518
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 56/231 (24%), Positives = 95/231 (41%), Gaps = 35/231 (15%)
Query: 37 GWREWGALIMNSKYY-------------NY-SINMNGRKIKHGEDYYNDYYPDLIANDSV 82
G+ W A YY NY N+NG+K+ Y D + + S+
Sbjct: 141 GFDHWAAFRGQGTYYADGHGASRVVPQNNYDGYNVNGKKVPQ-----LGYITDELTDMSI 195
Query: 83 RFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQ-- 140
+L KQ +KP + +S H D + + TP YA P+ +
Sbjct: 196 DWLNNRKQK--EKPFFMYVSHKGVH--SDFVARDEDRGIYKDKPWTPPSTYANTPENRKN 251
Query: 141 ---WILQVTRKMQPVH-----RQFTDLLMTKRL-QTLQSVDDAVEKIVKELKDLGELNNT 191
W++ F KR + L +DDAV + ++ L+ +GEL+NT
Sbjct: 252 KPMWLINQRNSRHGSDYGYNLENFDVTFYFKRYSEALIPIDDAVGRTIEHLRKMGELDNT 311
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
+Y D+G+ G+ GL+ ++ +E +RVP +MR P G + +++
Sbjct: 312 LFVYMGDNGFQFGEQGLIDKRT-AYEASMRVPMIMRYPKAFKGGTVVDEIV 361
>gi|332665689|ref|YP_004448477.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
gi|332334503|gb|AEE51604.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
Length = 475
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 81/165 (49%), Gaps = 17/165 (10%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPA 130
++ A+ ++ +L Q+K ++P ++F +PH P Y M+ + ++ P
Sbjct: 181 NMYADAAIAYLNQNKGG--QQPFFAYVAFTSPHDPRTPPAPYDKMYDPNDIPLPKNYLPQ 238
Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKIVKELKDLG 186
+ + D I ++ P+ R T ++ K L + VD + +I++ LK G
Sbjct: 239 HPF----DNGEIRVRDEQLLPLPR--TPEMVKKDLAAYYGMISEVDAQIGRIIQTLKANG 292
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
+ +NT II+ D+G +G GLV GK +E +R+P +M GP I
Sbjct: 293 QYDNTLIIFAGDNGLAVGSHGLV-GKQSLYEHSMRIPLVMVGPNI 336
>gi|224536908|ref|ZP_03677447.1| hypothetical protein BACCELL_01784 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521463|gb|EEF90568.1| hypothetical protein BACCELL_01784 [Bacteroides cellulosilyticus
DSM 14838]
Length = 516
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 75/161 (46%), Gaps = 7/161 (4%)
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHHT 128
D P I ++++ ++ K + P + +SFP PH P Y +MF + + T
Sbjct: 177 DQQPTRIVDEAIEWIDSQKDN----PFFVWVSFPEPHNPYQVCEPYYSMFAPDKIPAVKT 232
Query: 129 PAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
D +K IL ++ P + L + ++ +DD ++++V+ELK+ G
Sbjct: 233 SRKDVLKKGEKYQILAELEDASCPDLERDLPRLRGNYMGMIRLIDDQIKRLVEELKEKGL 292
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
T I+ SDHG + G++GL++ E R+P + G
Sbjct: 293 FEKTIIVVLSDHGDYCGEYGLIRKGVGLSESLTRIPMVWAG 333
>gi|399065906|ref|ZP_10748111.1| arylsulfatase A family protein [Novosphingobium sp. AP12]
gi|398029084|gb|EJL22580.1| arylsulfatase A family protein [Novosphingobium sp. AP12]
Length = 579
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 25/232 (10%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY---------NYSINMNGRKIKHGEDYYN 70
+FGK+ + PG+ +W + I Y+ + ++N++G+ +
Sbjct: 139 AFFGKWHMGASTDAPRPGFDKWVSFIGQGNYWPSKVNTGQSSPNLNVDGKTVPQ-----K 193
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
Y D + + ++ +L +K+ KKP L +S H P+Y + + +
Sbjct: 194 GYITDELTDYAMDWL--TKERDPKKPFFLYLSHKGVHSDPLPPPRYVHQYDKAKFTLPAS 251
Query: 131 YDYAPNPDKQWILQVTRKMQPVHR-QF---TDLLMTKRLQ----TLQSVDDAVEKIVKEL 182
P + L V + H F D+ MT+ L+ TL ++DD++ +++ L
Sbjct: 252 AANTPQNNAGKPLWVQNQRNTWHGIDFFYNADVPMTEYLKYYYGTLSAIDDSLGRVMDYL 311
Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+ +T +++TSD+GY +G+ GL+ K +E IRVP ++ PG VP
Sbjct: 312 RKNHLEKDTLVVFTSDNGYMIGEHGLID-KRNAYEASIRVPLVVWEPGTVPA 362
>gi|373956839|ref|ZP_09616799.1| sulfatase [Mucilaginibacter paludis DSM 18603]
gi|373893439|gb|EHQ29336.1| sulfatase [Mucilaginibacter paludis DSM 18603]
Length = 491
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 66/260 (25%), Positives = 110/260 (42%), Gaps = 41/260 (15%)
Query: 5 VELTSRNLLYFDI------SHGYFGKY-LNKYNGSYIPP-------GWREW---GALIMN 47
VEL + + + DI S+GY GK+ L+ YIP W EW G
Sbjct: 116 VELPADMVCWSDILEANGYSNGYIGKWHLDSPQKPYIPTYNNEGKTAWNEWTEPGRRHGF 175
Query: 48 SKYYNYS--------INMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSF--SKKPI 97
+Y Y + + R ++ Y N++ P A+ ++ F+ ++ KP
Sbjct: 176 DYWYAYGTYDRHLHPMYWDTRAVRDSFHYVNEWGPQHEADKAIEFITNPGNTYRDQSKPF 235
Query: 98 MLVMSFPAPHGPEDSAPQ-YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQF 156
LV+S PH + PQ Y N++ NV P +PD I M +R
Sbjct: 236 ALVVSMNPPHSDYKNVPQQYVNLYKNV-----PLESLLKDPD---IPPAGTPMGEEYRNN 287
Query: 157 TDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPF 216
T + VD V +I+ LK G ++T +++ +DHG LG+ + K+ +
Sbjct: 288 IKYYYT----CIAGVDAQVGRILTCLKQSGLDDHTIVVFMADHGNCLGKHDEIS-KNNIY 342
Query: 217 EFDIRVPFLMRGPGIVPGTM 236
E +R+PF+++ G + M
Sbjct: 343 EESLRIPFIIKWKGHIKSRM 362
>gi|159468109|ref|XP_001692225.1| hypothetical protein CHLREDRAFT_189474 [Chlamydomonas reinhardtii]
gi|158278411|gb|EDP04175.1| predicted protein [Chlamydomonas reinhardtii]
Length = 628
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 72/141 (51%), Gaps = 12/141 (8%)
Query: 96 PIMLVMSFPAPH-GPEDSAP----QYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKM- 149
P L ++ PH GP D P ++AN + P Y +P ++ + T+ +
Sbjct: 237 PFFLYLASANPHDGPVDGMPRVEAKFANAMPGLKVPRGPNYGVPTDP--RFGMSKTKPLW 294
Query: 150 -QPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGL 208
P +TD R ++ ++D+ ++++V +L+ LG LNNTYI ++SD+G+ LGQ +
Sbjct: 295 WDPA---WTDKHYRMRALSMLTIDEQIDRLVTKLECLGILNNTYIFFSSDNGFKLGQHNI 351
Query: 209 VKGKSFPFEFDIRVPFLMRGP 229
K FE D+ +PF P
Sbjct: 352 PSEKFTYFEEDVALPFFTLHP 372
>gi|386821061|ref|ZP_10108277.1| arylsulfatase A family protein [Joostella marina DSM 19592]
gi|386426167|gb|EIJ39997.1| arylsulfatase A family protein [Joostella marina DSM 19592]
Length = 553
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 49/73 (67%), Gaps = 2/73 (2%)
Query: 165 LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPF 224
L T+++VDD V K++ L++ G +NT ++YTSD G++LG+ G K F +E R+PF
Sbjct: 327 LATVKAVDDGVGKVLDYLEENGLSDNTIVVYTSDQGFYLGEKGFFD-KRFMYEESFRMPF 385
Query: 225 LMRGP-GIVPGTM 236
L++ P I PGT+
Sbjct: 386 LIQYPKAIKPGTV 398
>gi|372268475|ref|ZP_09504523.1| sulfatase [Alteromonas sp. S89]
Length = 469
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 96/211 (45%), Gaps = 28/211 (13%)
Query: 66 EDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY------ANM 119
E + +++ I + S +F+R+S +S +P L +SF APH + Q+ A +
Sbjct: 171 ETHRHEHTTARIGDLSAQFVRESAKS--GQPFNLSVSFHAPHADDHDPNQFVYPRDLAFL 228
Query: 120 FFNVTSHHTP-AYDYAPNPDKQWILQVTRKMQPVHR----QFTDLLMTKRLQTLQSVDDA 174
+ +VT P A D + +W+ K + R Q ++ + + +D
Sbjct: 229 YEDVTIPEPPLADDRYFDAQPEWVRNSISKDRWYWRFDTPQKYQEMVKGYYRMISGLDRE 288
Query: 175 VEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP----- 229
V KIV+EL+ G +NT I++ D+GY LG+ L GK +E IRVPF + P
Sbjct: 289 VGKIVRELEKAGVADNTIIMFMGDNGYFLGERQLA-GKWLLYENSIRVPFFVYDPRRKGM 347
Query: 230 ---------GIVPGTMYLLDVLIPQVRKFSS 251
+ P + L + IP++ + S
Sbjct: 348 EVDKMVLNIDVAPTILELAGIDIPELMQGES 378
>gi|59714190|ref|YP_206965.1| sulfatase/phosphatase [Vibrio fischeri ES114]
gi|59482438|gb|AAW88077.1| predicted sulfatase/phosphatase [Vibrio fischeri ES114]
Length = 518
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 80/175 (45%), Gaps = 16/175 (9%)
Query: 68 YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS 125
+ N + P+ A+ ++++LR ++ + KP LV+S PH P D PQ F +S
Sbjct: 220 HINQWSPEHEADIAIKYLRNENGNYRNNDKPFTLVVSMNPPHSPYDQVPQKYLDRFKESS 279
Query: 126 HHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
+ P+ QW + P + + + + VD+ +IV EL L
Sbjct: 280 RTLNS-----RPNVQWDKEYLEGYGPEY-------FKEYMAMVNGVDEQFGRIVDELDAL 327
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVPGTMYLL 239
+T +++ SDHG +G G K+ +E +R+P + R PG I P + LL
Sbjct: 328 NLAKDTLVVFFSDHGCCMGSNG-NPTKNVHYEEAMRIPMMFRWPGKIAPKSDDLL 381
>gi|196233888|ref|ZP_03132726.1| iduronate-2-sulfatase [Chthoniobacter flavus Ellin428]
gi|196222082|gb|EDY16614.1| iduronate-2-sulfatase [Chthoniobacter flavus Ellin428]
Length = 264
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 42/65 (64%), Gaps = 2/65 (3%)
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
+D +++ L LG +NT +++TSDHGYHLG+ GL + +S FE RVP ++ GPG
Sbjct: 45 MDAQTGRVLAALDRLGLRDNTIVVFTSDHGYHLGEHGLWQKQSL-FEESTRVPLIVAGPG 103
Query: 231 IV-PG 234
+ PG
Sbjct: 104 VTKPG 108
>gi|32476327|ref|NP_869321.1| sulfatase yidj [Rhodopirellula baltica SH 1]
gi|32446872|emb|CAD78778.1| putative sulfatase yidj [Rhodopirellula baltica SH 1]
Length = 527
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/168 (25%), Positives = 79/168 (47%), Gaps = 11/168 (6%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY 131
+ D + + + F+R+ S++P +S P PHGP Y MF N+ P
Sbjct: 231 FSTDWLCDRAADFIRE----HSQEPFCYHLSLPDPHGPNTVRQPYDTMFENMPVR--PPM 284
Query: 132 DYAPNPDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNN 190
+ + D+ W+ R Q ++F LMT+ ++ +DD V ++ L +L
Sbjct: 285 TFQLDGDQPGWLPATNRNSQ---QRFNARLMTQYFGMVRCIDDNVGMLLSLLDELSLTKR 341
Query: 191 TYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
T +++TSDHG + G + K P+E +VP ++ PG++ + +
Sbjct: 342 TVVVFTSDHGDLCYEHGRLN-KGNPYEGSAKVPMIIAAPGLISAGLRI 388
>gi|159463296|ref|XP_001689878.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283866|gb|EDP09616.1| predicted protein [Chlamydomonas reinhardtii]
Length = 131
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 62/125 (49%), Gaps = 14/125 (11%)
Query: 93 SKKPIMLVMSFPAPHG---------PEDSAPQYANMF--FNVTSHHTPAYDYAPNPDKQW 141
S KP ML ++ APH P +YAN++ N+ + TP + NP
Sbjct: 5 SGKPFMLYVAPTAPHRSSTDGATWYPPTPPKRYANLYQGENLQAPRTPNFGVR-NP--TL 61
Query: 142 ILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGY 201
+ T ++ + D L RL+ L++VDD V +V L + G LNNTY+I+TSD+GY
Sbjct: 62 PRKGTPRVDAAFGAYMDELFVARLRALRAVDDLVGNLVARLNESGVLNNTYVIFTSDNGY 121
Query: 202 HLGQF 206
F
Sbjct: 122 VSADF 126
>gi|160890624|ref|ZP_02071627.1| hypothetical protein BACUNI_03069 [Bacteroides uniformis ATCC 8492]
gi|317479838|ref|ZP_07938958.1| sulfatase [Bacteroides sp. 4_1_36]
gi|156859623|gb|EDO53054.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
gi|316904044|gb|EFV25878.1| sulfatase [Bacteroides sp. 4_1_36]
Length = 521
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 69/144 (47%), Gaps = 3/144 (2%)
Query: 91 SFSKKPIMLVMSFPAPHGPEDSAPQYANMF--FNVTSHHTPAYDYAPNPDK-QWILQVTR 147
S KP ++ S PH P Y +MF ++ + A D ++ Q + ++
Sbjct: 200 SHKDKPFLMWFSIAEPHNPYQVCEPYYSMFPPESLPEMGSSAKDLNTKGEEYQLLAEMMA 259
Query: 148 KMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG 207
+ +R+ L + L+ +DD + + V ELK G +NT II+ +DHG ++G++G
Sbjct: 260 QGHVGYRENLQRLRSNYHGMLRMIDDQLSRFVGELKKNGVYDNTIIIFVADHGDYVGEYG 319
Query: 208 LVKGKSFPFEFDIRVPFLMRGPGI 231
L+K + R+P GPGI
Sbjct: 320 LMKKGVGLDDVLTRIPMQWTGPGI 343
>gi|325110426|ref|YP_004271494.1| sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324970694|gb|ADY61472.1| sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 489
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 82/184 (44%), Gaps = 13/184 (7%)
Query: 52 NYSINMNGRKIKHGEDY-YNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE 110
+YS N K+ E Y ++ D++R KQ KP+ L +SF APH P
Sbjct: 150 SYSTAKNKSMAKYAEKYPHSTLSYGAFGQDAIR--EAVKQD---KPLCLSISFKAPHKPA 204
Query: 111 DSAPQYANMFFNVT-----SHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRL 165
P++ +++ T + A ++ KQ Q R + + D M K
Sbjct: 205 TPDPRFKHVYAGKTFTRPDNFGREAGEHLSEQSKQG-RQYPRFTEWKYDSDYDGEMAKYF 263
Query: 166 QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFL 225
Q + ++D A+ I +EL G +NT IIYTSD+GY G G K P E RVP +
Sbjct: 264 QQVYAIDVALGMIREELDAQGIADNTVIIYTSDNGYICGAHGY-GSKVLPMEESSRVPLI 322
Query: 226 MRGP 229
+ P
Sbjct: 323 IYDP 326
>gi|149175816|ref|ZP_01854434.1| iduronate-2-sulfatase [Planctomyces maris DSM 8797]
gi|148845263|gb|EDL59608.1| iduronate-2-sulfatase [Planctomyces maris DSM 8797]
Length = 492
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 66/274 (24%), Positives = 108/274 (39%), Gaps = 52/274 (18%)
Query: 17 ISHGYFGKYLNKYNGS-----YIPPGW-----------REWGALIMNSKYYNYSINMNGR 60
ISH G+ +YNG+ +P W R WG Y N +G
Sbjct: 141 ISHTADGRVF-EYNGTGDGRDELPHAWDELATPFGSWKRGWGIFFA---YANGRSREDGS 196
Query: 61 KIKHGEDY---YNDYYPD-LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQY 116
I+ ++ ++ PD L+A ++ LR+ K+ KP L + F PH P + Q
Sbjct: 197 GIRDLMEFKVEQDEELPDGLLARQAIEKLREYKEG--GKPFFLGLGFFKPHLPFVAPKQD 254
Query: 117 ANMFFNVTSHHTPAYDYAPNPDKQWILQVTR---------KMQPVHRQFTDLLMTKRLQT 167
+ F NV P + P+ + + K +P+ R+ + + L
Sbjct: 255 WDAFENV---EIPPVPHPEKPESAYWHKSGEFYNYDMEFEKTRPLSREARENVRRGYLAC 311
Query: 168 LQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR 227
++ D V K++ L +LG NT +I DHG+ LG L K P E ++ ++R
Sbjct: 312 VRYTDRQVGKVLTALDELGMRENTIVIVWGDHGWFLGDSALW-AKHAPLERALKSTLMIR 370
Query: 228 GPGIVPGTM-------------YLLDVLIPQVRK 248
PG+ + L+D+ P RK
Sbjct: 371 APGVAEAGLKSAALVETVDIYPTLIDLCQPAFRK 404
>gi|283781717|ref|YP_003372472.1| sulfatase [Pirellula staleyi DSM 6068]
gi|283440170|gb|ADB18612.1| sulfatase [Pirellula staleyi DSM 6068]
Length = 511
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 61/243 (25%), Positives = 106/243 (43%), Gaps = 33/243 (13%)
Query: 10 RNLLYFDISHGYFG--KYLNKYN--GSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHG 65
R L I HG G K L +++ G G + LI + N+ + M+ K H
Sbjct: 136 RTLTTGKIYHGGLGPKKRLEEFDVWGPAGGIGAKPEKKLIPPTPMGNHPL-MDWGKFDHR 194
Query: 66 EDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE-------DSAPQYAN 118
++ DY I + ++ L Q ++ P+ L + + PH P D PQ
Sbjct: 195 DEDKGDYQ---ITSWAIEQLDDQVQHHAETPMFLSVGYFLPHVPCFISPKWYDEVPQGDK 251
Query: 119 MFFNVTS---HHTPAYDY-----APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQS 170
+ V + P + + P P +W+ HRQ+ +L+ + L +
Sbjct: 252 LLPLVAADDRSDIPRFAWYLHWSLPEPRLKWVED--------HRQWENLVRSY-LASTTF 302
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
VD + +++ L++ ++T ++ DHG+HLG+ G + GK+ +E RVP + GPG
Sbjct: 303 VDAQIGRLLTALEERKLADDTIVVVWGDHGWHLGEKG-ITGKNTLWERSTRVPLIFAGPG 361
Query: 231 IVP 233
I P
Sbjct: 362 ITP 364
>gi|423223239|ref|ZP_17209708.1| hypothetical protein HMPREF1062_01894 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392639340|gb|EIY33165.1| hypothetical protein HMPREF1062_01894 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 516
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 75/161 (46%), Gaps = 7/161 (4%)
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHHT 128
D P I ++++ ++ K + P + +SFP PH P Y +MF + + T
Sbjct: 177 DQQPTRIVDEAIEWIDSQKDN----PFFVWVSFPEPHNPYQVCEPYYSMFAPDKIPAVKT 232
Query: 129 PAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
D +K IL ++ P + L + ++ +DD ++++V+ELK+ G
Sbjct: 233 SRKDILKKGEKYRILAELEDASCPDLERDLPRLRGNYMGMIRLIDDQIKRLVEELKEKGL 292
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
T I+ SDHG + G++GL++ E R+P + G
Sbjct: 293 FEKTIIVVLSDHGDYCGEYGLIRKGVGLSESLTRIPMVWAG 333
>gi|294056299|ref|YP_003549957.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293615632|gb|ADE55787.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 563
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/197 (24%), Positives = 89/197 (45%), Gaps = 39/197 (19%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNV----------- 123
D IA+ ++ + ++ + +P L + F APH + AP+Y + +V
Sbjct: 186 DCIADSALEWFQELRDP--DQPFFLKLQFKAPHDYFEYAPRYEDYLADVEIPEPASMWNN 243
Query: 124 --------------------TSHHTPAY--DYAPNPDKQWILQVTRKM--QPVHRQFTDL 159
TS Y + + + W +V R + +Q L
Sbjct: 244 ERNGSIATRGVDDELLPYIGTSVGARHYRRNMVDDENAPWAHKVDRSLTGDAAKKQAYQL 303
Query: 160 LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFD 219
+ L+ ++ VDD V++++ ++ G L+NT +IYT D G++LG+ + K + +E
Sbjct: 304 YLKAYLRCVKGVDDNVKRVLDYMEREGILDNTVVIYTGDQGFYLGEHDYID-KRWGYEEA 362
Query: 220 IRVPFLMRGP-GIVPGT 235
+R+PF++R P GI GT
Sbjct: 363 MRMPFIVRYPKGIPAGT 379
>gi|417300771|ref|ZP_12087962.1| arylsulfatase A [Rhodopirellula baltica WH47]
gi|327542922|gb|EGF29375.1| arylsulfatase A [Rhodopirellula baltica WH47]
Length = 309
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 53/97 (54%), Gaps = 9/97 (9%)
Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG----LVKGKSFPFEFDIRV 222
T+Q +D + +I+K +KDLG NT +I+TSD+G + G+ G L GK E +RV
Sbjct: 88 TIQELDWSTGEILKAVKDLGLDKNTLVIFTSDNGPNTGKGGSAGVLKGGKGSTLEGGVRV 147
Query: 223 PFLMRGPGIVPGTMYLLDV-----LIPQVRKFSSGSL 254
PF+ R PG +P + L+P + K + G +
Sbjct: 148 PFVARWPGTIPAGTESDEAITGMDLLPTLTKLAGGEV 184
>gi|422872814|ref|ZP_16919299.1| arylsulfatase [Clostridium perfringens F262]
gi|380306241|gb|EIA18513.1| arylsulfatase [Clostridium perfringens F262]
Length = 481
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 71/155 (45%), Gaps = 5/155 (3%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+ N S+ FLR+ S KP L MSF PH P D Y +M+ + D+A
Sbjct: 183 VVNGSIDFLRRRDPS---KPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239
Query: 137 PDKQWILQVTRKMQPV-HRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
D + + ++ + +++ ++ +D + + + L + GELNNT ++
Sbjct: 240 EDVENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLF 299
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
SDHG +G + K P+E RVPF + PG
Sbjct: 300 VSDHGDMMGDHNWFR-KGIPYEGSARVPFFIYDPG 333
>gi|315645058|ref|ZP_07898184.1| sulfatase [Paenibacillus vortex V453]
gi|315279479|gb|EFU42784.1| sulfatase [Paenibacillus vortex V453]
Length = 512
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 40/64 (62%)
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
+D + KI+ +L DLG NT +++T+DHG+ GQ GL F +E IR+PF++R P
Sbjct: 318 MDKYIGKILDKLDDLGLAENTVVVFTTDHGHFFGQHGLQAKGGFHYEDLIRIPFMVRYPD 377
Query: 231 IVPG 234
VP
Sbjct: 378 QVPA 381
>gi|430747568|ref|YP_007206697.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430019288|gb|AGA31002.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 491
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 76/156 (48%), Gaps = 25/156 (16%)
Query: 95 KPIMLVMSFPAPHGPEDSAPQYANMFFNVT----------SHHTPAYDY-----APNPDK 139
+P ++ + F PH P ++ ++ +++ + T P + + P P
Sbjct: 204 QPFLVAVGFRLPHVPCFASQRWFDLYPDETLVMPPVKVDDRDDVPEFSWYLHWKLPEPRL 263
Query: 140 QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
W+L+ ++ +P+ R + L ++ +D V +++ L+ G+ + T ++ SDH
Sbjct: 264 SWLLEA-KQWRPLVRSY--------LASISFMDSQVGRVLDALEVSGQADRTVVVLWSDH 314
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
G+HLG+ G + GK+ +E RVP + GPGI G
Sbjct: 315 GWHLGEKG-ITGKNTLWERSTRVPLIFAGPGIAAGA 349
>gi|423288210|ref|ZP_17267061.1| hypothetical protein HMPREF1069_02104 [Bacteroides ovatus
CL02T12C04]
gi|392671099|gb|EIY64575.1| hypothetical protein HMPREF1069_02104 [Bacteroides ovatus
CL02T12C04]
Length = 516
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 69/143 (48%), Gaps = 3/143 (2%)
Query: 95 KPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHR 154
KP +++S P PH P+ ++ +Y + N+ + P + D++ +
Sbjct: 209 KPFYMMLSIPDPHSPDIASEEYLQKYINL-DYEAPETMVTNDTDQRPRWARGGQYNVNKD 267
Query: 155 QFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSF 214
+F + ++ VDD V +I+ L +NT +++T+DHG L + V K
Sbjct: 268 KFDKKALANYFAMVECVDDNVGRILDFLDKNNLTDNTIVVFTADHGDMLYEHSRV-NKGL 326
Query: 215 PFEFDIRVPFLMRGP-GIVPGTM 236
P+E R+PF++R P I+PG +
Sbjct: 327 PYESSARIPFVIRYPEKIIPGKI 349
>gi|377813493|ref|YP_005042742.1| choline-sulfatase [Burkholderia sp. YI23]
gi|357938297|gb|AET91855.1| choline-sulfatase [Burkholderia sp. YI23]
Length = 511
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 83/179 (46%), Gaps = 18/179 (10%)
Query: 86 RQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYD---YAPNPDKQWI 142
R+ +P M+V S PH P +Y +M+ + P+Y A +P + +
Sbjct: 180 RERHAGKDARPFMMVASLTHPHDPYAIPRKYWDMYRD-EDIDMPSYRDSLEACDPHSKRL 238
Query: 143 LQV--TRKMQPVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
V + P RQ +L +R + VDD I++ L+ G ++T ++ TSD
Sbjct: 239 RHVYEADRTPPTERQ---ILNARRAYYGAVSYVDDQFASILEALEQAGLADDTIVVVTSD 295
Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-----IVPGTMYLLDVLIPQVRKFSSG 252
HG LG+ GL +F FE RVP ++ P VP ++ LDVL P + +F+ G
Sbjct: 296 HGEMLGERGLWYKMTF-FEGGCRVPLIVHAPKRFRAHRVPDSVSHLDVL-PTLVEFARG 352
>gi|32476803|ref|NP_869797.1| iduronate-2-sulfatase [Rhodopirellula baltica SH 1]
gi|32447349|emb|CAD77175.1| iduronate-2-sulfatase [Rhodopirellula baltica SH 1]
Length = 571
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 64/250 (25%), Positives = 106/250 (42%), Gaps = 33/250 (13%)
Query: 15 FDISHGYFGK-------YLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGED 67
F I HG G + + Y+ P + G L Y+ + + + G
Sbjct: 201 FHIGHGNHGDPESFSVPHFKEKVIEYLEPASTDGGQLTREEAYFTNQMLGRIKTLPRGAA 260
Query: 68 YYN------DYYPDLIANDSVRFLRQSKQSFSKK--PIMLVMSFPAPHGPEDSAPQ---- 115
Y + DY +A ++++ L+ +KQ + P + F PH P SAPQ
Sbjct: 261 YESPDAKDEDYADGRVAAETIQRLQAAKQRQKTEGTPFFIASGFARPHLPF-SAPQKYWD 319
Query: 116 -YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPV----HRQFTDLLMTKRLQ---- 166
Y + +H T D AP + +++ +PV + F D L +
Sbjct: 320 LYDPASLPMPTHETLPVD-APKVAGKRGGEIS-NYKPVPTEPNADFDDELKRNLIHGYYA 377
Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
++ VD + K++KEL L L+NT ++ DHG+HLG G+ K +E R+P L+
Sbjct: 378 SVSYVDAQIGKVIKELDRLELLDNTIVVLWGDHGFHLGDLGIWT-KHTNYEQANRIPILI 436
Query: 227 RGPGIV-PGT 235
PG+ PG+
Sbjct: 437 TAPGVTQPGS 446
>gi|350568607|ref|ZP_08937005.1| arylsulfatase [Propionibacterium avidum ATCC 25577]
gi|348660850|gb|EGY77546.1| arylsulfatase [Propionibacterium avidum ATCC 25577]
Length = 491
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 71/161 (44%), Gaps = 7/161 (4%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+ N + R+ + + P L +SF PH P D PQ+A ++ PA +
Sbjct: 191 VVNQATRWFYRRDPT---TPFFLYLSFHRPHAPYD-PPQWAFDRYDGAPLRPPAVGDWSD 246
Query: 137 PDKQWILQVTRKMQPVHRQFTDL--LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
W + D+ M + +D + ++ + L + G +NTYI
Sbjct: 247 TFSDWRNDADPTSLVARYRPRDISRAMAGYYGHMTHIDTQISRLRQVLGEFGVADNTYIA 306
Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
+ SDHG +G L + K +P++ +PFL+ GPGI PG+
Sbjct: 307 FISDHGDMMGDHHLWR-KGYPYQGSTHIPFLLAGPGITPGS 346
>gi|270290492|ref|ZP_06196717.1| sulfatase [Pediococcus acidilactici 7_4]
gi|270281273|gb|EFA27106.1| sulfatase [Pediococcus acidilactici 7_4]
Length = 485
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/168 (23%), Positives = 77/168 (45%), Gaps = 5/168 (2%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+ ++S++FL++ + P +L MSF PH P + Y +M+ H +
Sbjct: 189 VVSESIKFLQRRDPT---APFLLKMSFEKPHAPLNPPKYYFDMYMERLPKHIDLHIGNWE 245
Query: 137 PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
+ + + + ++ L + +D + + + LK+ GE NT I +
Sbjct: 246 KESNEMPNIYALKGCLKEDDQRRMVAAYLGLITHIDHQISRFLTALKEFGEDKNTIIWFV 305
Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVPGTMYLLDVLI 243
SDHG LG+ L + K +P++ IR+P + PG ++ G + + L+
Sbjct: 306 SDHGDQLGEHYLFR-KGYPYQGSIRIPSFIYDPGNLIAGHKHKIKQLV 352
>gi|297182883|gb|ADI19034.1| arylsulfatase a and related enzymes [uncultured alpha
proteobacterium HF0070_05I22]
Length = 512
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/178 (30%), Positives = 82/178 (46%), Gaps = 23/178 (12%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY----- 131
+A+ S+ +L + + KP L +SFP PH P D P+ ++ N P +
Sbjct: 207 VADRSIDWLSKRHKD---KPFCLWVSFPDPHHPFD-CPEPWSLLHNPEEVDLPEFLEKDL 262
Query: 132 DYAP---------NPD-KQWILQVTRK---MQPVHRQFTDLLMTKRLQTLQS-VDDAVEK 177
D P PD K +L+ RK P + MT + S +D +V +
Sbjct: 263 DERPWWHKRALEDEPDLKDPVLKRFRKEGSRMPDQTEAQLREMTANYYGMISLIDHSVGR 322
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
IV L + L+ T +IYTSDHG HLG+ GL ++ I V ++RGPGI G+
Sbjct: 323 IVACLNENNILDETIVIYTSDHGDHLGERGLYLKGPMLYDSLINVGMIVRGPGIEAGS 380
>gi|374314953|ref|YP_005061381.1| arylsulfatase A family protein [Sphaerochaeta pleomorpha str.
Grapes]
gi|359350597|gb|AEV28371.1| arylsulfatase A family protein [Sphaerochaeta pleomorpha str.
Grapes]
Length = 463
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 72/158 (45%), Gaps = 12/158 (7%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
I +D++ +LR K K LV A H P + P+ ++ T + +
Sbjct: 161 ITDDTIEYLRNRKDE--NKSFFLVSGHYATHSPFEGHPERLVEYYRRKGVDT----FLKH 214
Query: 137 PDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
+ + +Q + P + L + + +D++V +++ EL+ G L NT IIYT
Sbjct: 215 ANYPFGIQRNESLYPSRTDRKEAL-CQYYAGISQIDESVGRVLDELERQGILENTIIIYT 273
Query: 197 SDHGYHLGQFGLV--KGKSFPF---EFDIRVPFLMRGP 229
+DHG + GQ GL ++P E DIRVP + P
Sbjct: 274 ADHGLNCGQHGLFGKANATYPINMVEEDIRVPLIFYAP 311
>gi|374620849|ref|ZP_09693383.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374304076|gb|EHQ58260.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 551
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 52/200 (26%), Positives = 92/200 (46%), Gaps = 21/200 (10%)
Query: 41 WGALIMNSKYYNYSINMNGRKIKHGEDYYND--YYPDLIANDSVRFLRQSKQSFSKKPIM 98
+G L +Y N+ G+ + +D Y L+A++ R++R + +KP
Sbjct: 157 YGHLHTEVGFYPPFANVGGKDFQENGVSIDDEGYETYLLADEVSRYIRDRDE---EKPFF 213
Query: 99 LVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTD 158
+ M F APH P D+ + + ++ + D AP Q TR+M + Q +
Sbjct: 214 IYMPFIAPHTPLDAPQELQEKYKDIET------DLAPARSNQ--TDSTRRMAKLMMQPSA 265
Query: 159 LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG------LVKGK 212
M + + ++D A+ +++ L + G +NT +++ SD+G +G L GK
Sbjct: 266 RPMYAAV--VDAMDQAIGQVLDTLDEEGLADNTIVLFFSDNGGAAYSYGGADNAPLRGGK 323
Query: 213 SFPFEFDIRVPFLMRGPGIV 232
FE IRV LMR PG++
Sbjct: 324 GETFEGGIRVVSLMRWPGVL 343
>gi|255505698|ref|ZP_05347616.3| putative arylsulfatase [Bryantella formatexigens DSM 14469]
gi|255266363|gb|EET59568.1| arylsulfatase [Marvinbryantia formatexigens DSM 14469]
Length = 408
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 86/180 (47%), Gaps = 21/180 (11%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP--EDSAPQYANMFFNVTSHHTP 129
Y D I +D+V FL + ++P L +++ APH P + +Y ++ + P
Sbjct: 83 YVTDAITDDAVSFLESRTED--RQPFYLHVAYTAPHAPWLNNHPKEYTELYKDCPFESVP 140
Query: 130 AYDYAP-NPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
+ P +PD ++ K + + + + ++D + +I+ +L ++G
Sbjct: 141 ---HLPRHPDSIYLTDEVAKDERGN-------LIGYYAAITAMDAGIGRILGKLDEMGLT 190
Query: 189 NNTYIIYTSDHGYHLGQFGL-VKGKS-FP---FEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
NT +++TSD+G+ G G KG + FP +E I+VPF+ PG +P + D L+
Sbjct: 191 ENTLVVFTSDNGFSCGHHGFWGKGNATFPLNMYEESIKVPFIACHPGHIP-EGSVCDALV 249
>gi|425736217|ref|ZP_18854524.1| choline-sulfatase [Brevibacterium casei S18]
gi|425478434|gb|EKU45626.1| choline-sulfatase [Brevibacterium casei S18]
Length = 619
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 73/156 (46%), Gaps = 9/156 (5%)
Query: 86 RQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQV 145
R ++ + +P ++V SF PH P + ++ + F T+ PA+ P+P
Sbjct: 179 RANQAAGEDQPFLMVTSFIHPHDPYEPPREHWDRFAE-TAIPDPAHPEVPDPAVDPHSHR 237
Query: 146 TRKMQPVHRQFTDL-----LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
R M + + + + +DD V I+ L+DLG NT +I TSDHG
Sbjct: 238 LRTMSGLDEREPAIEDVRRARRAYYAAVSYIDDHVGAIMARLRDLGLDANTVVIVTSDHG 297
Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPG--IVPG 234
LG+ GL K P+E RVP ++ GP ++PG
Sbjct: 298 DMLGEKGLWY-KMSPYEQSSRVPLIVAGPAEVVLPG 332
>gi|357386109|ref|YP_004900833.1| arylsulfatase [Pelagibacterium halotolerans B2]
gi|351594746|gb|AEQ53083.1| Arylsulfatase [Pelagibacterium halotolerans B2]
Length = 487
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 56/209 (26%), Positives = 94/209 (44%), Gaps = 29/209 (13%)
Query: 37 GWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDL-----IANDSVRFLRQSKQS 91
G+ +W L + + ++ NGR+ H + PDL A+ ++ FL SK
Sbjct: 139 GYAQW--LAETAPAFRNALARNGRRELH-------HPPDLHMSKWAADRTIDFLHASKDD 189
Query: 92 FSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQP 151
+P +MS PH P + P A + + PA P D+ ++ ++
Sbjct: 190 --GRPFFCLMSLFDPHDPYEDYP--AALAGAIDDARIPA-PIPPRADQPRCVKREQEGSY 244
Query: 152 VHRQFTDL-------LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLG 204
+ R F+D + ++ +D V +++ L D G +T +I+TSDHG LG
Sbjct: 245 LGR-FSDFSPHEIASIRKGYAASIAFLDRQVGRVLDALDDAGLTEDTLVIFTSDHGDQLG 303
Query: 205 QFGL-VKGKSFPFEFDIRVPFLMRGPGIV 232
GL VKG + +E + VP L+R PG +
Sbjct: 304 DHGLFVKGVAL-YEPTVGVPLLLRWPGKI 331
>gi|168335540|ref|ZP_02693609.1| choline sulfatase [Epulopiscium sp. 'N.t. morphotype B']
Length = 419
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/187 (26%), Positives = 81/187 (43%), Gaps = 25/187 (13%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF-------- 121
N + D+ + ++ FL Q +KP L + APH P ++ +Y M+
Sbjct: 124 NMHSTDIFCDTALEFLENYDQ---EKPFYLYTALVAPHDPRNAPEKYEEMYSPDDVSTPE 180
Query: 122 NVTSHHTPAYDYAPNPDKQWILQV-TRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVK 180
N HT +D + +L+ R+ + R D + +D ++K
Sbjct: 181 NFLPQHT--FDNGDLWVRDELLEFFPRREYSIRRHIADYY-----SMISHIDARFGDVMK 233
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-----IVPGT 235
+LKD G NT II+ D+G LGQ GL+ GK +E IRVP +++ G +
Sbjct: 234 KLKDKGLYENTIIIFAGDNGLALGQHGLM-GKQSVYEHSIRVPLIVKPAGEFVPRMTDAY 292
Query: 236 MYLLDVL 242
YL D+
Sbjct: 293 AYLCDIF 299
>gi|449136380|ref|ZP_21771768.1| sulfatase family protein [Rhodopirellula europaea 6C]
gi|448885000|gb|EMB15464.1| sulfatase family protein [Rhodopirellula europaea 6C]
Length = 480
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/169 (25%), Positives = 80/169 (47%), Gaps = 11/169 (6%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D + + + F+R+ S++P +S P PHGP Y MF ++ P +
Sbjct: 187 DWLCDRAADFIRE----HSQEPFCYHLSLPDPHGPNTVRQPYDTMFEDMPVR--PPMTFQ 240
Query: 135 PNPDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+ K W+ R + ++F LMT+ ++ +DD V ++ L +L T +
Sbjct: 241 LDGTKPGWLPGTNRNAE---QRFNARLMTQYFGMVRCIDDNVGMLLALLDELSLTERTVV 297
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
++TSDHG + G + K P+E +VP ++ PG+VP + + + +
Sbjct: 298 VFTSDHGDLCYEHGRLN-KGNPYEGSAKVPMIVAAPGLVPAGLRIDEAM 345
>gi|189459670|ref|ZP_03008455.1| hypothetical protein BACCOP_00298 [Bacteroides coprocola DSM 17136]
gi|189433629|gb|EDV02614.1| arylsulfatase [Bacteroides coprocola DSM 17136]
Length = 387
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 79/160 (49%), Gaps = 17/160 (10%)
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPAYDY 133
A+ ++ FL+ ++ + +P M +SF +PH P + P Y + +V + P + +
Sbjct: 87 ADAAINFLKTTEHN--NQPFMAFVSFTSPHDPRNILPDYGKKYCPAELSVPDNFLPEHPF 144
Query: 134 APNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGELN 189
D + ++ PV R T L+ K L + VD + ++V L++ G ++
Sbjct: 145 ----DNGELNIRDEQLLPVPR--TPELLKKELSLYYGMVSEVDLQIGRVVDALEESGNMD 198
Query: 190 NTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
NT I++ SD+G +GQ GL+ GK +E + VP + P
Sbjct: 199 NTIIVFASDNGLAMGQNGLI-GKQSLYEHSVGVPMAIITP 237
>gi|149195944|ref|ZP_01873000.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Lentisphaera araneosa HTCC2155]
gi|149140791|gb|EDM29188.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Lentisphaera araneosa HTCC2155]
Length = 515
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 108/237 (45%), Gaps = 29/237 (12%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKI--KHGEDYYNDYYPDLI 77
GY + K++ P + + L Y+N N+ G K K+ + + D I
Sbjct: 121 GYETAMIGKWHLKKEPATFDYYCVLPGQGLYHNPIFNIRGSKPWPKNTITKKDQHSSDAI 180
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYAN-----------MFFNVTSH 126
+ S+ +L+ + KP L+ F APH + A +Y + F+V +
Sbjct: 181 TDISLHWLKNERDK--SKPFFLMHHFKAPHDMFEYAKRYESYLEDVHIPEPESLFSVPAG 238
Query: 127 HTPAYDYAPNPDKQ---WILQVTRKM-------QPVHRQFT-DLLMTKRLQTLQSVDDAV 175
+ D K W Q+ +K+ +P + + + + L+ ++ +DD +
Sbjct: 239 SAGSKDLGSGLSKNHNPW--QLPQKLGVSDDIPEPEYTRLSYQKYLKAYLRCVKGIDDNI 296
Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
+++ LKD +L+NT IIYTSD G+ LG+ L+ K + +E + +PF++ PG++
Sbjct: 297 ARLLSYLKDSNQLDNTIIIYTSDQGFFLGEHNLI-DKRWMYEEAMGMPFIVYAPGMI 352
>gi|293375272|ref|ZP_06621554.1| arylsulfatase [Turicibacter sanguinis PC909]
gi|325842515|ref|ZP_08167686.1| arylsulfatase [Turicibacter sp. HGF1]
gi|292646028|gb|EFF64056.1| arylsulfatase [Turicibacter sanguinis PC909]
gi|325489559|gb|EGC91923.1| arylsulfatase [Turicibacter sp. HGF1]
Length = 442
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 42/83 (50%), Gaps = 7/83 (8%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYH-------LGQFGLVKGKS 213
M L S+D V +I+ LK+ G NT IIYTSDHG H + FG K
Sbjct: 228 MPDYLGCCHSLDQNVGRIISVLKEQGIYENTLIIYTSDHGCHFKTMKKYINPFGADDYKR 287
Query: 214 FPFEFDIRVPFLMRGPGIVPGTM 236
P E I +P +MRGPG + G M
Sbjct: 288 SPDENSIHIPLVMRGPGFLGGVM 310
>gi|29348504|ref|NP_812007.1| sulfatase [Bacteroides thetaiotaomicron VPI-5482]
gi|29340409|gb|AAO78201.1| putative sulfatase [Bacteroides thetaiotaomicron VPI-5482]
Length = 516
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/162 (24%), Positives = 76/162 (46%), Gaps = 7/162 (4%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
+ +P I N+++ ++ K++ P + +SFP PH P Y +MF +
Sbjct: 175 EEQHPAKIVNETLSWIESQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 230
Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T D +K IL ++ P Q + + ++ +DD ++++++ LK G
Sbjct: 231 TSRKDLEKKGEKYRILAELEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 290
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
+ NT + SDHG + G++GL++ + E R+P + G
Sbjct: 291 QFENTIFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 332
>gi|311748195|ref|ZP_07721980.1| sulfatase family protein [Algoriphagus sp. PR1]
gi|126576685|gb|EAZ80933.1| sulfatase family protein [Algoriphagus sp. PR1]
Length = 481
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/175 (25%), Positives = 81/175 (46%), Gaps = 10/175 (5%)
Query: 75 DLIANDSVRFLRQSK--QSFSKKPIMLVMSFPAPHGPEDSAPQYANM--FFNVTSHHTPA 130
DL+ +D + S+ + +P L + F PH P + Y + F +T+
Sbjct: 191 DLVHSDGKTAKKASELIREHKDEPFFLAVGFVRPHVPFVAPASYFDPYPFAKITTPKQAP 250
Query: 131 YDYAPNPDK--QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
D+ P++ ++ V +M +Q + ++ +D V K++ LK+ G
Sbjct: 251 NDWEDIPERGINYVTSVNAEMNQEQKQKA---IAAYYASVSYMDTQVGKVLTTLKEEGLE 307
Query: 189 NNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
NNT I++TSDHG+HLG+ S E ++VP +++ PG P ++ LI
Sbjct: 308 NNTIIVFTSDHGFHLGEHEFWMKVSL-HEESVKVPMMIKVPGKNPAVVHSFTELI 361
>gi|109900202|ref|YP_663457.1| sulfatase [Pseudoalteromonas atlantica T6c]
gi|109702483|gb|ABG42403.1| sulfatase [Pseudoalteromonas atlantica T6c]
Length = 486
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 73/157 (46%), Gaps = 16/157 (10%)
Query: 82 VRFLRQSKQSF----SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP 137
RFL + F + KP + +++ PH P + P+ + S + P
Sbjct: 183 ARFLTEKAIEFIDESTDKPFFININYVEPHFPFEGLPE------RLVSQYRPVARKLLRD 236
Query: 138 DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
L + K V + + L ++ L + +DD V +I+ L+ G L+NT I + S
Sbjct: 237 GGNSSLALASKDTAVPKDHEEKL-SQYLAAISLIDDQVGQIMDALEGRGLLDNTIIAFVS 295
Query: 198 DHGYHLGQFGLV--KGKSFPFEF---DIRVPFLMRGP 229
DHG +GQ+GL SFP+ F +R+PF++ GP
Sbjct: 296 DHGMLMGQYGLYGKTNASFPYNFYEETVRIPFIIYGP 332
>gi|325263386|ref|ZP_08130120.1| choline sulfatase [Clostridium sp. D5]
gi|324031095|gb|EGB92376.1| choline sulfatase [Clostridium sp. D5]
Length = 464
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/181 (26%), Positives = 83/181 (45%), Gaps = 26/181 (14%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF--------FNVTSH 126
+L+ + +V FL + + +P L ++ APH P +Y ++ N
Sbjct: 177 ELLTDMAVSFLDEYTED---RPFFLYTAYLAPHDPRTMPDKYKELYQPENMTLPVNFQKQ 233
Query: 127 HTPAYDYAPNPDKQWILQ-VTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
H +++ + + +L R + + RQ + + +DD + KI++ LK
Sbjct: 234 H---FEFGVSGIRDELLAPYPRTEEEMRRQLAEYYAM-----ITHLDDEIGKIMEALKRN 285
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT-----MYLLD 240
+ NT II T D+G +G GL+ GK +E IR+P + GPGI G +YLLD
Sbjct: 286 QQWENTIIILTGDNGLAIGSHGLM-GKQNLYEESIRIPLIFAGPGIPKGQVRDQFVYLLD 344
Query: 241 V 241
+
Sbjct: 345 I 345
>gi|260429935|ref|ZP_05783910.1| iduronate 2-sulfatase [Citreicella sp. SE45]
gi|260418858|gb|EEX12113.1| iduronate 2-sulfatase [Citreicella sp. SE45]
Length = 371
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 79/182 (43%), Gaps = 19/182 (10%)
Query: 93 SKKPIMLVMSFPAPHGPEDSAPQYAN--MFFNVTSHHT---------PAYDYAPNPDKQW 141
+KP+ L + FP PH P D P+Y M +V H P + PD ++
Sbjct: 155 GEKPLFLSIGFPGPHPPYDPTPEYPGTYMAHDVPLPHVTQDEIDRLPPPWKEQRRPDSEF 214
Query: 142 I-LQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
V ++P + + + +D V +I+ L+ G L+N +I+TSDHG
Sbjct: 215 DHYAVHWNLEPSEADLHKMRAYDYVNVVL-IDREVGRILVALERRGRLDNGVVIFTSDHG 273
Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQVRKFSSGSLIFIMSF 260
LG GL + K P E RVP ++ PG P + D L V+ F G I ++
Sbjct: 274 DTLGDHGLSQ-KWAPCEQVTRVPMVISQPGRFPARRF--DGL---VQLFDLGPTILRLAG 327
Query: 261 LI 262
L
Sbjct: 328 LT 329
>gi|257067331|ref|YP_003153586.1| arylsulfatase A family protein [Brachybacterium faecium DSM 4810]
gi|256558149|gb|ACU83996.1| arylsulfatase A family protein [Brachybacterium faecium DSM 4810]
Length = 483
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/211 (23%), Positives = 84/211 (39%), Gaps = 46/211 (21%)
Query: 65 GEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT 124
GE+ Y D++ S+ F+ + ++P L++ APH P P+Y +++ T
Sbjct: 136 GEEQIPGYATDIVTRQSLDFIDRRD---PEQPFCLLVHHKAPHRPWIPHPRYEHLYEAGT 192
Query: 125 SHHTPAYDYAPNPDKQWILQVTRK--MQPVHRQFTDL----------------------- 159
P P+ W TR ++ V DL
Sbjct: 193 ---------IPEPETMWDDHATRSEVVREVAMNLDDLRPTDYKDELPAELEGETEEARRA 243
Query: 160 --------LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKG 211
M L+ +Q+VDD++ +I+ L G NT ++YTSD G+ LG G
Sbjct: 244 RASWKYQRYMRDYLRCVQAVDDSIGEILDHLDQEGLGENTLVVYTSDQGFFLGDHGWFD- 302
Query: 212 KSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
K + + +P L+R P +P + D++
Sbjct: 303 KRLMLDESLTMPMLLRWPAQIPAGSRVSDIV 333
>gi|383124694|ref|ZP_09945357.1| hypothetical protein BSIG_1553 [Bacteroides sp. 1_1_6]
gi|251841147|gb|EES69228.1| hypothetical protein BSIG_1553 [Bacteroides sp. 1_1_6]
Length = 518
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/162 (24%), Positives = 76/162 (46%), Gaps = 7/162 (4%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHH 127
+ +P I N+++ ++ K++ P + +SFP PH P Y +MF +
Sbjct: 177 EEQHPAKIVNETLSWIESQKEN----PFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLK 232
Query: 128 TPAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
T D +K IL ++ P Q + + ++ +DD ++++++ LK G
Sbjct: 233 TSRKDLEKKGEKYRILAELEDASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASG 292
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
+ NT + SDHG + G++GL++ + E R+P + G
Sbjct: 293 QFENTIFVVLSDHGDYWGEYGLIRKGAGLSESLARIPMVWAG 334
>gi|381204856|ref|ZP_09911927.1| sulfatase [SAR324 cluster bacterium JCVI-SC AAA005]
Length = 504
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 45/73 (61%), Gaps = 1/73 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
+ + L+T+ S+D++V KI+ L D G NT +IYTSD G+ LG+ G K F +E
Sbjct: 287 LKRYLRTIHSIDESVGKILDYLDDNGLAENTLVIYTSDQGFFLGEHGWYD-KRFMYEESF 345
Query: 221 RVPFLMRGPGIVP 233
++PFL R P +P
Sbjct: 346 QMPFLARFPAEIP 358
>gi|87310895|ref|ZP_01093021.1| iduronate-2-sulfatase [Blastopirellula marina DSM 3645]
gi|87286410|gb|EAQ78318.1| iduronate-2-sulfatase [Blastopirellula marina DSM 3645]
Length = 469
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 63/251 (25%), Positives = 102/251 (40%), Gaps = 52/251 (20%)
Query: 18 SHGYF----GKYLNKYNG-------SYIPPGWREWGA----LIMNSKYYNYSINMNGRKI 62
+HGYF GK + Y S+ P +WGA ++ + + G +
Sbjct: 87 AHGYFTQNIGKIYHNYRQTLRNDPQSWSTPAVHDWGAHSNDWFVSGEPFGLKSISKGPAV 146
Query: 63 KHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFN 122
+ + Y IA D+V +R+ ++ K+P L + F PH P FN
Sbjct: 147 QKVDVADEAYLDGRIAADAVLAIRE--RAAQKQPFFLAVGFWKPHLP-----------FN 193
Query: 123 VTSHHTPAYDYAPNPDKQWILQVTRKMQP---VH-----RQFTDLLMTKRLQTLQSV--- 171
+ YD P+ + + Q+ + P H R +TD+ T + Q++
Sbjct: 194 APKPYWDKYD--PDQIRAHLDQLPKSDAPQIAFHPYGEIRSYTDIPKTGDISAEQNLVLN 251
Query: 172 ----------DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
D + K++ EL+ G NT ++ SDHG+HLG+ L K+ FE D R
Sbjct: 252 HGYYAAISFLDAQIGKVLHELQRQGLAENTIVVLWSDHGFHLGEHDLW-CKTSNFELDTR 310
Query: 222 VPFLMRGPGIV 232
VP L+ P
Sbjct: 311 VPLLIAPPAAA 321
>gi|358066156|ref|ZP_09152690.1| hypothetical protein HMPREF9473_04753 [Clostridium hathewayi
WAL-18680]
gi|356696019|gb|EHI57644.1| hypothetical protein HMPREF9473_04753 [Clostridium hathewayi
WAL-18680]
Length = 479
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/174 (23%), Positives = 86/174 (49%), Gaps = 23/174 (13%)
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP--EDSAPQYANMFFNVTSHHTP 129
Y D+I +D++ F+ K+ +++P L + + APH P + +Y +++ P
Sbjct: 154 YVTDVITDDAISFM---KEYANRQPFYLQVGYTAPHAPWLNNHPKEYTDLYEKCPFESCP 210
Query: 130 AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTK--RLQTLQSVDDAVEKIVKELKDLGE 187
+ +PD ++ + K DL + ++D + +I++++++LG
Sbjct: 211 MEE--EHPDSIYLTEEVLK---------DLRSNQIGYYAAATAMDRNIGRILEQIEELGI 259
Query: 188 LNNTYIIYTSDHGYHLGQFGL-VKGK-SFP---FEFDIRVPFLMRGPGIVPGTM 236
T +I++SD+G+ G G KG +FP +E ++VPF+ PG++P +
Sbjct: 260 REETLVIFSSDNGFSCGHHGFWGKGNGTFPINMYESSVKVPFIASQPGVIPAGV 313
>gi|410665791|ref|YP_006918162.1| sulfatase [Simiduia agarivorans SA1 = DSM 21679]
gi|409028148|gb|AFV00433.1| sulfatase [Simiduia agarivorans SA1 = DSM 21679]
Length = 548
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 46/74 (62%), Gaps = 1/74 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M L T++SVD+ V +++ LK G NT I+YTSD+G+ LG+ G K F ++ +
Sbjct: 328 MQDYLATIESVDENVGRLMDYLKAEGLDKNTIIVYTSDNGFFLGEHGWFD-KRFAYQESL 386
Query: 221 RVPFLMRGPGIVPG 234
+VP ++R PG+ P
Sbjct: 387 QVPLIVRWPGVTPA 400
>gi|149198504|ref|ZP_01875549.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
gi|149138510|gb|EDM26918.1| choline sulfatase [Lentisphaera araneosa HTCC2155]
Length = 466
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 87/192 (45%), Gaps = 20/192 (10%)
Query: 87 QSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF--FNVTSHHTPAYDYAPNPDKQWILQ 144
Q+ Q KP + + F PH P P++ N++ ++ +P D P K ++
Sbjct: 185 QALQEDFDKPFFMSVGFFRPHVPLLVPPKWFNLYDEESIVLAPSPKSDLDDVP-KNFLSI 243
Query: 145 VTRKMQPVHRQF--TDL---LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
+ P H++ TD L L ++ VD V +++ LK+ +NT +I SDH
Sbjct: 244 NDYAVAPTHKEVLATDSHRKLTHAYLASISFVDACVGRVIDALKNSKYADNTIVILWSDH 303
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLIPQVRKFSSGSLIFIMS 259
G+HLG+ ++ +E +VP L+ GPGI G L SLI I
Sbjct: 304 GFHLGEKEHWAKRTL-WEESTKVPLLVYGPGIESGEACL-----------EPASLIDIYP 351
Query: 260 FLINLNTMRAEK 271
L++L ++A K
Sbjct: 352 TLVDLCGVKAPK 363
>gi|393788823|ref|ZP_10376949.1| hypothetical protein HMPREF1068_03229 [Bacteroides nordii
CL02T12C05]
gi|392653929|gb|EIY47579.1| hypothetical protein HMPREF1068_03229 [Bacteroides nordii
CL02T12C05]
Length = 526
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 316 MRDYMKTVKSLDDNVGRVLDYLKEKGMLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 374
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 375 RTPLIMRLP 383
>gi|333378667|ref|ZP_08470397.1| hypothetical protein HMPREF9456_01992 [Dysgonomonas mossii DSM
22836]
gi|332883071|gb|EGK03355.1| hypothetical protein HMPREF9456_01992 [Dysgonomonas mossii DSM
22836]
Length = 520
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 71/282 (25%), Positives = 110/282 (39%), Gaps = 74/282 (26%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GY + K++ P G+ + L YYN S G K+ + Y +LI
Sbjct: 111 GYQTGIVGKWHMQCEPKGFDYYCVLNDQGDYYNPSFKTKGSHGKYIRE--EGYATNLITT 168
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPH-------------------GPED--------- 111
S+ FL+ +S KP L++ APH PE
Sbjct: 169 HSIDFLKNRDKS---KPFALLVHHKAPHRNWMPEAKYYDLYEDVEFPKPETFYDDYESRC 225
Query: 112 SAPQYANMFFNVTSHHTPAYDYAPNP-------DKQW----ILQVTRKMQPVHRQF---- 156
SA + +M + T T YD N +K+W + Q +M P R+
Sbjct: 226 SAARTQDMTIDKTM--TFIYDLKLNELKNTPPYNKEWSDGGLQQAMNQMTPAQREAWEKA 283
Query: 157 --------------TDLL--------MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
D L M L+ ++S+DD V +++ L+ G ++NT I+
Sbjct: 284 YHPKNMEFINSNLSGDALLDWKFQRYMRDYLRCIKSIDDEVGRLIDYLEKEGLMDNTIIV 343
Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP-GIVPGT 235
YTSD G+++G+ G K F +E R P ++R P I GT
Sbjct: 344 YTSDQGFYMGEHGWFD-KRFMYEESFRTPLIIRYPKAIKSGT 384
>gi|169343129|ref|ZP_02864154.1| sulfatase family protein [Clostridium perfringens C str. JGS1495]
gi|169298767|gb|EDS80842.1| sulfatase family protein [Clostridium perfringens C str. JGS1495]
Length = 481
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 73/159 (45%), Gaps = 13/159 (8%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+ N+S+ FLR+ S K L MSF PH P D Y +M+ + D+A
Sbjct: 183 VVNESIDFLRRRDPS---KSFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANK 239
Query: 137 PDKQWILQVTRKMQPVHRQFTDLLMTKRLQ-----TLQSVDDAVEKIVKELKDLGELNNT 191
D++ + + V + + KR + ++ +D + + + L + GELNNT
Sbjct: 240 DDEE---NRGKDINCV-KGIINNKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNT 295
Query: 192 YIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
++ SDHG +G + K P+E RVPF + PG
Sbjct: 296 IFLFVSDHGDMMGDHNWFR-KGIPYEGSSRVPFFIYDPG 333
>gi|440715395|ref|ZP_20895942.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica SWK14]
gi|436439739|gb|ELP33153.1| mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase)
[Rhodopirellula baltica SWK14]
Length = 510
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 77/183 (42%), Gaps = 11/183 (6%)
Query: 52 NYSINMNGRKIKHGEDY-YNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPE 110
NY+ N K+ E Y ++ D +R SKQ KP L +SF APH P
Sbjct: 163 NYATAKNKSMRKYAEQYPHSTLSYAAFGKDVIR--EASKQD---KPFCLSISFKAPHKPA 217
Query: 111 DSAPQ----YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ 166
P+ YA F + Y P + Q R + + D M + Q
Sbjct: 218 TPDPRFNHVYAGKQFTKPLNFGREYSKHLAPQSKLGRQYPRFSEWKYDTDYDGEMARYHQ 277
Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
+ ++D A+ I ELK G +NT +IYTSD+GY G G K P E RVP ++
Sbjct: 278 QVYAIDVALGVIRDELKAQGVADNTVVIYTSDNGYICGSHGY-GSKVLPMEESSRVPLMI 336
Query: 227 RGP 229
P
Sbjct: 337 YDP 339
>gi|343085137|ref|YP_004774432.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342353671|gb|AEL26201.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 542
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 53/91 (58%), Gaps = 3/91 (3%)
Query: 146 TRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQ 205
+K Q H +F + + L+T++SVDD V +++ L+ G NT ++YTSD G++LG+
Sbjct: 312 AKKKQIAHWKF-NRYIKDYLRTIKSVDDGVGEVLDYLEANGLAENTIVVYTSDQGFYLGE 370
Query: 206 FGLVKGKSFPFEFDIRVPFLMRGPG-IVPGT 235
G K F +E R P L+R P I PGT
Sbjct: 371 HGWFD-KRFMYEQSFRTPLLVRYPKEIKPGT 400
>gi|284036170|ref|YP_003386100.1| sulfatase [Spirosoma linguale DSM 74]
gi|283815463|gb|ADB37301.1| sulfatase [Spirosoma linguale DSM 74]
Length = 485
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 50/171 (29%), Positives = 82/171 (47%), Gaps = 22/171 (12%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQ--YANMFFNVTSHHT---P 129
DL+ FL+ + + KP L +SF APH + + P+ YA F ++ T P
Sbjct: 181 DLMGQQMDEFLQGNP---AGKPFCLSVSFKAPHAQDAANPEFPYAERFTDLYRDQTLKRP 237
Query: 130 AYD----YAPNPDKQWIL---QVTRKMQPVHRQFTDLLMTKRLQT----LQSVDDAVEKI 178
A Y PD W Q +++ R TD + + ++ + +DD V +
Sbjct: 238 AAADDKYYRQFPD--WFRHNDQNESRIRWSRRFATDSMFQQTTKSYNRLITGIDDVVGNL 295
Query: 179 VKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
+ L++ G +NT IIYTSD+G++ G++G K + E IRVP ++ P
Sbjct: 296 RRTLQERGLADNTIIIYTSDNGFYEGEYGFAD-KWYGHELSIRVPLIIYDP 345
>gi|149198324|ref|ZP_01875370.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149138620|gb|EDM27027.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 514
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/235 (22%), Positives = 98/235 (41%), Gaps = 37/235 (15%)
Query: 37 GWREWGALIMNSKYYNYS-INMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKK 95
G+ W L KY++ + G+ I GE + D+I ++ ++ + K
Sbjct: 132 GFDYWEVLPGQGKYWDPEFVTETGKTIYPGE-----HSSDVITRRALNWMNNERDK--SK 184
Query: 96 PIMLVMSFPAPHG-----------------PE-----------DSAPQYANMFFNVTSHH 127
P ML++ F APH PE +A +Y +M + +
Sbjct: 185 PFMLMVHFKAPHRSWQPTTRWKKKFSTMTFPEPDTLFDDYQGRGTAAKYQDMNIEHSMNM 244
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
+P K+++ + + + + + M L + VD+ + KI+ +L + G
Sbjct: 245 VGDLKSNQSPRKEFLKKNALTGKALVKWKYQMYMRDYLACIAGVDENIGKILDQLAESGL 304
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVL 242
NT ++Y+SD G++LG+ G K F +E R P L R PG++ D++
Sbjct: 305 DKNTIVMYSSDQGFYLGEHGWFD-KRFMYEESYRTPLLARWPGVIKAKTRNEDLV 358
>gi|444304861|ref|ZP_21140650.1| arylsulfatase A family protein [Arthrobacter sp. SJCon]
gi|443482831|gb|ELT45737.1| arylsulfatase A family protein [Arthrobacter sp. SJCon]
Length = 874
Score = 58.2 bits (139), Expect = 4e-06, Method: Composition-based stats.
Identities = 62/255 (24%), Positives = 107/255 (41%), Gaps = 40/255 (15%)
Query: 6 ELTSRNLLYFDI--SHGY----FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYS-INMN 58
E+ R + D+ ++GY FGK+ P G+ W Y + I+ +
Sbjct: 448 EMDYRVPTFIDVLRNNGYRTALFGKWHLGEQEVSEPRGFDAWKVFPGQGDYMDPEMIDPS 507
Query: 59 GRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP--------- 109
GR I+ G Y D++ + S+ ++ + + +P L++ APH P
Sbjct: 508 GRTIEKG------YATDIVTDLSLDWIDERGED---EPFCLLVHHKAPHRPWVPDEKHKH 558
Query: 110 ---EDSAPQYANMFFNVTS--------HHTPAYDYAPNPDKQWI---LQVTRKMQPVHRQ 155
+ + P+ F + S H T A D + K + L+ + R
Sbjct: 559 LYADGTIPEPETFFDDYESRSKAVRGVHMTIADDMGADDIKMEVPEHLRGEENREERMRW 618
Query: 156 FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFP 215
+ M LQ +QS+DD V +++ L+ G NT ++YTSD G+ LG G K
Sbjct: 619 KYQIYMRDYLQCIQSIDDNVGRMLDHLEAKGLAENTLVVYTSDQGFFLGDHGWFD-KRLM 677
Query: 216 FEFDIRVPFLMRGPG 230
F+ +++P L+R P
Sbjct: 678 FDQSLQMPMLIRWPA 692
>gi|399154474|ref|ZP_10754541.1| sulfatase family protein 7 [gamma proteobacterium SCGC AAA007-O20]
Length = 522
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 79/188 (42%), Gaps = 43/188 (22%)
Query: 66 EDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT- 124
E++Y Y + + FL + + P +LV+SFP PH P +Y +++
Sbjct: 219 EEHYPTSY---VGEQACEFLEEQDDT----PFILVVSFPDPHHPFTPPGRYFDLYDPADI 271
Query: 125 ------SHHTPAYDYAPN------------PDKQWILQVTRKMQPVHRQFTDLLMTKRL- 165
SH T A PN PD+ W P H TD T+R+
Sbjct: 272 PLPKSFSHRTSARSDLPNHIQRMYEIGAEKPDEFW---------PFH---TDDKATRRMI 319
Query: 166 ----QTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIR 221
T+ +D+ V +++ LK+ G+ NT IIY SDHG ++G G V IR
Sbjct: 320 ALNYGTITMIDEQVGVVMQTLKNTGKSENTNIIYMSDHGDYMGDHGTVLKGGVHSHGLIR 379
Query: 222 VPFLMRGP 229
VP + P
Sbjct: 380 VPLIWSDP 387
>gi|333383458|ref|ZP_08475118.1| hypothetical protein HMPREF9455_03284 [Dysgonomonas gadei ATCC
BAA-286]
gi|332827697|gb|EGK00436.1| hypothetical protein HMPREF9455_03284 [Dysgonomonas gadei ATCC
BAA-286]
Length = 532
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 112/291 (38%), Gaps = 72/291 (24%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIAN 79
GY + K++ P G+ + L YYN S + K+ + Y +LI
Sbjct: 123 GYQTGVVGKWHMQCEPKGFNYYCVLNDQGDYYNPSFKTKESQGKYVRE--EGYATNLITT 180
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT--------------- 124
S+ F+ ++ KP L++ APH +Y N++ +V
Sbjct: 181 HSIEFIENRDKN---KPFALLVHHKAPHRNWMPEAKYYNLYEDVEFPKPETFYDDYETRC 237
Query: 125 -----------SHHTPAYDYAPNP-------DKQW----ILQVTRKMQPVHRQF------ 156
T YD N +K+W + Q +M P R+
Sbjct: 238 SAAKTQDMTIDKTMTFIYDLKLNELRNTTPYNKEWNNGGLQQAMNQMTPAQREAWENAYH 297
Query: 157 -------------TDLL-------MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
DLL M L+ ++S+DD V +++ L+ G ++NT I+YT
Sbjct: 298 PKNMEFIDSNLSGDDLLNWKYQRYMKDYLRCIKSIDDEVGRLIDYLEKEGLMDNTIIVYT 357
Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP-GIVPGTMYLLDVLIPQV 246
SD G+++G+ G K F +E R P ++R P I GT D L+ +
Sbjct: 358 SDQGFYMGEHGWFD-KRFMYEESFRTPLIIRYPKAIEAGTK--TDALVQNI 405
>gi|237723220|ref|ZP_04553701.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|229447742|gb|EEO53533.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
Length = 549
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 339 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 397
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 398 RTPLIMRLP 406
>gi|423214669|ref|ZP_17201197.1| hypothetical protein HMPREF1074_02729 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692575|gb|EIY85812.1| hypothetical protein HMPREF1074_02729 [Bacteroides xylanisolvens
CL03T12C04]
Length = 509
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 82/170 (48%), Gaps = 7/170 (4%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
N + +L A+ +++F+ + + + P ++ ++F +PH P + P Y + P
Sbjct: 203 NTFSSELYADAAIKFIETN--ASNDNPFLMYVAFTSPHDPRNVLPDYGRKY-GSKEITMP 259
Query: 130 AYDYAPNP-DKQWILQVTRKMQPVHRQFTDLLMTKRL--QTLQSVDDAVEKIVKELKDLG 186
+P D + + K+ P R +L + + VD + +I+ L+ G
Sbjct: 260 KNFVTQHPFDNGDLNERDEKLLPTPRVPEQVLAERARYYSMVNEVDVQIGRILDMLEKSG 319
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
+ +NT I++ +D+G +G+ GL+ GK +E +RVP ++ GP I M
Sbjct: 320 KADNTIIVFAADNGLCVGEHGLL-GKQNLYEAAVRVPLVICGPNIPQNVM 368
>gi|295135339|ref|YP_003586015.1| sulfatase [Zunongwangia profunda SM-A87]
gi|294983354|gb|ADF53819.1| sulfatase [Zunongwangia profunda SM-A87]
Length = 544
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 59/118 (50%), Gaps = 1/118 (0%)
Query: 116 YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAV 175
+ N F +T+ ++ A P+ + + + R + + T+ SVD+ V
Sbjct: 271 WTNDFDRMTAEQRKQWNAAYRPENDAFWEQDLHGEALARYKGQRYLKDYMATIASVDEGV 330
Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
KI+ L+ G NT ++YTSD G++LG+ G K F +E R+P LM+ PG +P
Sbjct: 331 GKILDYLEAQGLDENTLVVYTSDQGFYLGENGWF-DKRFMYEPSFRMPLLMQLPGKIP 387
>gi|161503090|ref|YP_001570202.1| hypothetical protein SARI_01155 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:- str. RSK2980]
gi|160864437|gb|ABX21060.1| hypothetical protein SARI_01155 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:-]
Length = 530
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 76/169 (44%), Gaps = 16/169 (9%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGPEDSAPQ-YANMFFNVTSH 126
N + P+ A+ ++++LR + S +P LV+S PH P D PQ Y + F + TS
Sbjct: 233 NQWSPEHEADMAIKYLRNEGGKYRNSNEPFALVVSMNPPHSPYDQVPQKYLDRFKDQTSR 292
Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLG 186
P+ W P + + + + VD+ +I+ EL+ L
Sbjct: 293 -----SMNKRPNVVWDKTYQEGYGPEY-------FKEYMAMINGVDEQFGRILTELERLK 340
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGT 235
+T +++ SDHG +G G K+ +E +R+P + R PG +P
Sbjct: 341 LDKDTLVVFFSDHGCCMGSNGQPT-KNVHYEEAMRIPMMFRWPGKLPAC 388
>gi|443704610|gb|ELU01589.1| hypothetical protein CAPTEDRAFT_204150 [Capitella teleta]
Length = 446
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 73/153 (47%), Gaps = 21/153 (13%)
Query: 93 SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN--PDKQWILQVTRKM- 149
S +P + M PH P D ++ ++ + PA P P+ W + R+
Sbjct: 140 STQPFFVAMGIKKPHLPFDFPEEFLELYDDA---EMPANPLLPKNVPNIAWNHKHIREQG 196
Query: 150 -------QPVHRQFTDLLMTKRLQ-----TLQSVDDAVEKIVKELKDLGELNNTYIIYTS 197
P+H D++ K L+ T+ VD + +++++L L NNT +++ S
Sbjct: 197 SHFNATSDPLHG--LDIVAMKALRKAYYATMSYVDSIIGQVLQKLDSLKLTNNTIVVFLS 254
Query: 198 DHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
DH HLG+ G++ GK F + +VPF++R PG
Sbjct: 255 DHAIHLGEHGII-GKKTNFIVNAQVPFMIRVPG 286
>gi|298386258|ref|ZP_06995814.1| N-acetylglucosamine-6-sulfatase [Bacteroides sp. 1_1_14]
gi|383125360|ref|ZP_09946002.1| hypothetical protein BSIG_5089 [Bacteroides sp. 1_1_6]
gi|251837805|gb|EES65895.1| hypothetical protein BSIG_5089 [Bacteroides sp. 1_1_6]
gi|298260635|gb|EFI03503.1| N-acetylglucosamine-6-sulfatase [Bacteroides sp. 1_1_14]
Length = 527
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 376 RTPLIMRLP 384
>gi|255692838|ref|ZP_05416513.1| mucin-desulfating sulfatase [Bacteroides finegoldii DSM 17565]
gi|260621400|gb|EEX44271.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
Length = 527
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 376 RTPLIMRLP 384
>gi|29347038|ref|NP_810541.1| sulfatase yidJ [Bacteroides thetaiotaomicron VPI-5482]
gi|29338936|gb|AAO76735.1| putative sulfatase yidJ [Bacteroides thetaiotaomicron VPI-5482]
Length = 579
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 369 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 427
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 428 RTPLIMRLP 436
>gi|374620758|ref|ZP_09693292.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374303985|gb|EHQ58169.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 552
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 57/231 (24%), Positives = 94/231 (40%), Gaps = 24/231 (10%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY--------NYSINMNGRKIKHGEDYYND 71
G+FGK+ + G+ W + YY IN++G+++
Sbjct: 121 GFFGKWHMGESNDSPRDGFDRWVSFAGQGSYYPIKRPDGSTNVINVDGQRVPQ-----QG 175
Query: 72 YYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT---SHHT 128
Y D I ++ +L + + + KP L +S A H AP++ + +VT
Sbjct: 176 YITDEITEYALTWLEEQRDA--TKPFFLYLSHKAVHSDAVPAPRHRGQYDDVTFELPSSV 233
Query: 129 PAYDYAPNPDKQWILQVTRKMQPVHRQF-TDLLMTKRLQ----TLQSVDDAVEKIVKELK 183
D W+ + + +D MT L+ L SVDD++ + L
Sbjct: 234 EITDAYLEGKPMWVKNQRNSWHGIDFAYASDRKMTDYLKDYFGALSSVDDSIGTLRAYLS 293
Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+ NT II+ SD+G+ G GL+ K +E IRVP ++ PG VP
Sbjct: 294 EADIAENTVIIFYSDNGFLTGDHGLID-KRNAYEGSIRVPMIVAAPGRVPA 343
>gi|299146834|ref|ZP_07039902.1| putative N-acetylglucosamine-6-sulfatase [Bacteroides sp. 3_1_23]
gi|383113493|ref|ZP_09934265.1| hypothetical protein BSGG_3196 [Bacteroides sp. D2]
gi|298517325|gb|EFI41206.1| putative N-acetylglucosamine-6-sulfatase [Bacteroides sp. 3_1_23]
gi|313695661|gb|EFS32496.1| hypothetical protein BSGG_3196 [Bacteroides sp. D2]
Length = 527
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 376 RTPLIMRLP 384
>gi|380696306|ref|ZP_09861165.1| sulfatase yidJ [Bacteroides faecis MAJ27]
Length = 527
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 376 RTPLIMRLP 384
>gi|334139744|ref|YP_004532942.1| putative N-acetylglucosamine-6-sulfatase [Novosphingobium sp. PP1Y]
gi|333937766|emb|CCA91124.1| putative N-acetylglucosamine-6-sulfatase [Novosphingobium sp. PP1Y]
Length = 569
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 54/234 (23%), Positives = 108/234 (46%), Gaps = 29/234 (12%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYY---------NYSINMNGRKIKHGEDYYN 70
+FGK+ + PG+ +W + + Y+ + ++N++G+++
Sbjct: 137 AFFGKWHMGASTDAPRPGFDKWVSFMGQGNYWPAKQAGPHSSPNLNVDGKEVPQ-----K 191
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
Y D + + ++ +L +K+ KKP L +S H P+YA+ + + + T
Sbjct: 192 GYITDELTDYAMNWL--TKERDPKKPFFLYLSHKGVHSDPLPPPRYAHQYDD--AKFTLP 247
Query: 131 YDYAPNPDKQ-----WILQVTRKMQPVHRQF-TDLLMTKRLQ----TLQSVDDAVEKIVK 180
A P+ W+ + + D+ MT+ L+ TL ++DD++ +I+
Sbjct: 248 ASAADTPENNAGKPVWVQNQRNTWHGIDFFYNADVPMTEYLKYYYGTLSAIDDSLGRIMT 307
Query: 181 ELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
L+ +T +++TSD+G+ G+ GL+ K +E IRVP ++ PG VP
Sbjct: 308 YLRKNHLEKDTLVVFTSDNGFMTGEHGLID-KRNAYEASIRVPLVVWEPGTVPA 360
>gi|331083552|ref|ZP_08332663.1| hypothetical protein HMPREF0992_01587 [Lachnospiraceae bacterium
6_1_63FAA]
gi|330403763|gb|EGG83315.1| hypothetical protein HMPREF0992_01587 [Lachnospiraceae bacterium
6_1_63FAA]
Length = 489
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 82/173 (47%), Gaps = 26/173 (15%)
Query: 68 YYNDYYP-DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSH 126
Y Y+P + + + S+ FLR+ +KP L+ S+ PH P D AP Y +N
Sbjct: 174 YEEKYHPTNWVTDRSIDFLRRKD---PQKPFFLMASYLRPHPPFD-APSYYFDLYNKKEL 229
Query: 127 HTPAYDYAPNPDKQWILQVTRKMQPVHRQF------TDLLMTKRLQ-----TLQSVDDAV 175
PA W + T ++Q + R F +D + + Q + +D +
Sbjct: 230 TPPAV-------GDW--ETTEELQAMGRVFDSKCGPSDAELIREAQIGYYACITHLDHQI 280
Query: 176 EKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
++++ L + G +NT I++TSDHG L + + KS P++ IR+P ++ G
Sbjct: 281 GRLIQALVEYGVYDNTLILFTSDHGEELCDHHMFR-KSRPYQGSIRIPMIVSG 332
>gi|325109857|ref|YP_004270925.1| Iduronate-2-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324970125|gb|ADY60903.1| Iduronate-2-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 487
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 82/164 (50%), Gaps = 16/164 (9%)
Query: 76 LIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMF----FNVTSHHTPAY 131
+ A ++V+ L + +S ++P + + PH P + +Y +++ V
Sbjct: 194 IAAAEAVKQLNEYAKS--EEPFFMAVGLYRPHTPYVAPKKYFDLYPLDTIRVPQIRAGYL 251
Query: 132 DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLGE 187
D P P + VTRK + V+ + L + +Q ++ D + +I+ L D G
Sbjct: 252 DSLPKPAQ---TSVTRKKEQVN--LEESLAKQAIQAYYASITFADAQLGQILDALDDSGL 306
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
+NT +++TSDHGYH+G+ G + K+ FE +RVP ++ PG+
Sbjct: 307 ADNTIVLFTSDHGYHMGEHGHYQ-KTTLFENALRVPLIISSPGM 349
>gi|423298989|ref|ZP_17277014.1| hypothetical protein HMPREF1057_00155 [Bacteroides finegoldii
CL09T03C10]
gi|408474338|gb|EKJ92857.1| hypothetical protein HMPREF1057_00155 [Bacteroides finegoldii
CL09T03C10]
Length = 527
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 376 RTPLIMRLP 384
>gi|404366924|ref|ZP_10972300.1| hypothetical protein FUAG_02208 [Fusobacterium ulcerans ATCC 49185]
gi|313689858|gb|EFS26693.1| hypothetical protein FUAG_02208 [Fusobacterium ulcerans ATCC 49185]
Length = 524
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 45/63 (71%)
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
+D + KI+ +L++LG +T I++T+DHG+ +GQ GL++ F +E I++PF+++ P
Sbjct: 330 MDKYIGKILDKLEELGLAEDTIIVFTTDHGHFVGQHGLIRKGPFHYEDLIKIPFIVKYPK 389
Query: 231 IVP 233
+VP
Sbjct: 390 VVP 392
>gi|373496909|ref|ZP_09587452.1| hypothetical protein HMPREF0402_01325 [Fusobacterium sp. 12_1B]
gi|371964568|gb|EHO82081.1| hypothetical protein HMPREF0402_01325 [Fusobacterium sp. 12_1B]
Length = 524
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 45/63 (71%)
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
+D + KI+ +L++LG +T I++T+DHG+ +GQ GL++ F +E I++PF+++ P
Sbjct: 330 MDKYIGKILDKLEELGLAEDTIIVFTTDHGHFVGQHGLIRKGPFHYEDLIKIPFIVKYPE 389
Query: 231 IVP 233
+VP
Sbjct: 390 VVP 392
>gi|293372797|ref|ZP_06619178.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
gi|292632306|gb|EFF50903.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
Length = 527
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 376 RTPLIMRLP 384
>gi|160886350|ref|ZP_02067353.1| hypothetical protein BACOVA_04360 [Bacteroides ovatus ATCC 8483]
gi|423289518|ref|ZP_17268368.1| hypothetical protein HMPREF1069_03411 [Bacteroides ovatus
CL02T12C04]
gi|156108235|gb|EDO09980.1| arylsulfatase [Bacteroides ovatus ATCC 8483]
gi|392667229|gb|EIY60739.1| hypothetical protein HMPREF1069_03411 [Bacteroides ovatus
CL02T12C04]
Length = 527
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 376 RTPLIMRLP 384
>gi|336414624|ref|ZP_08594970.1| hypothetical protein HMPREF1017_02078 [Bacteroides ovatus
3_8_47FAA]
gi|335933736|gb|EGM95738.1| hypothetical protein HMPREF1017_02078 [Bacteroides ovatus
3_8_47FAA]
Length = 527
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 376 RTPLIMRLP 384
>gi|262409643|ref|ZP_06086183.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294647124|ref|ZP_06724728.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294807889|ref|ZP_06766670.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
gi|336407138|ref|ZP_08587772.1| hypothetical protein HMPREF0127_05085 [Bacteroides sp. 1_1_30]
gi|345509515|ref|ZP_08789111.1| hypothetical protein BSAG_04540 [Bacteroides sp. D1]
gi|229447035|gb|EEO52826.1| hypothetical protein BSAG_04540 [Bacteroides sp. D1]
gi|262352496|gb|EEZ01596.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292637541|gb|EFF55955.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294444887|gb|EFG13573.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
gi|295087977|emb|CBK69500.1| Arylsulfatase A and related enzymes [Bacteroides xylanisolvens
XB1A]
gi|335948239|gb|EGN09956.1| hypothetical protein HMPREF0127_05085 [Bacteroides sp. 1_1_30]
Length = 527
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 376 RTPLIMRLP 384
>gi|423216278|ref|ZP_17202803.1| hypothetical protein HMPREF1074_04335 [Bacteroides xylanisolvens
CL03T12C04]
gi|392691129|gb|EIY84380.1| hypothetical protein HMPREF1074_04335 [Bacteroides xylanisolvens
CL03T12C04]
Length = 527
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 376 RTPLIMRLP 384
>gi|298482971|ref|ZP_07001153.1| N-acetylglucosamine-6-sulfatase [Bacteroides sp. D22]
gi|298270943|gb|EFI12522.1| N-acetylglucosamine-6-sulfatase [Bacteroides sp. D22]
Length = 527
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 376 RTPLIMRLP 384
>gi|423304333|ref|ZP_17282332.1| hypothetical protein HMPREF1072_01272 [Bacteroides uniformis
CL03T00C23]
gi|423310553|ref|ZP_17288537.1| hypothetical protein HMPREF1073_03287 [Bacteroides uniformis
CL03T12C37]
gi|392681724|gb|EIY75081.1| hypothetical protein HMPREF1073_03287 [Bacteroides uniformis
CL03T12C37]
gi|392684919|gb|EIY78239.1| hypothetical protein HMPREF1072_01272 [Bacteroides uniformis
CL03T00C23]
Length = 526
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 75/161 (46%), Gaps = 7/161 (4%)
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF--NVTSHHT 128
D P I ++++ ++ K + P + +SFP PH P Y +MF + + T
Sbjct: 177 DQQPTRIVDEAIEWIDSQKDN----PFFVWISFPEPHNPYQVCEPYYSMFAPDKIPAVKT 232
Query: 129 PAYDYAPNPDKQWIL-QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGE 187
D +K IL ++ P + L + ++ +DD ++++V++LK+ G
Sbjct: 233 SRKDVLKKGEKYQILAELEDASCPNLERDLPRLRGNYMGMIRLIDDQIKRLVEDLKEKGL 292
Query: 188 LNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRG 228
T I+ SDHG + G++GL++ E R+P + G
Sbjct: 293 FEKTIIVVLSDHGDYCGEYGLIRKGVGLSESLTRIPMVWAG 333
>gi|197337865|ref|YP_002158674.1| N-acetylglucosamine-6-sulfatase [Vibrio fischeri MJ11]
gi|197315117|gb|ACH64566.1| N-acetylglucosamine-6-sulfatase [Vibrio fischeri MJ11]
Length = 518
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 80/175 (45%), Gaps = 16/175 (9%)
Query: 68 YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS 125
+ N + P+ A+ ++++LR ++ + KP LV+S PH P D PQ F +S
Sbjct: 220 HINQWSPEHEADIAIKYLRNDGGNYRDNDKPFTLVVSMNPPHSPYDQVPQKYLDRFKDSS 279
Query: 126 HHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
+ P+ QW + P + + + + VD+ +I+ EL L
Sbjct: 280 RTLNS-----RPNVQWDKEYLEGYGPEY-------FKEYMAMVNGVDEQFGRIIDELDTL 327
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVPGTMYLL 239
+T +++ SDHG +G G K+ +E +R+P + R PG I P + LL
Sbjct: 328 NLTEDTLVVFFSDHGCCMGSNG-NPTKNVHYEEAMRIPMMFRWPGKIAPKSDDLL 381
>gi|440638195|gb|ELR08114.1| hypothetical protein GMDG_02941 [Geomyces destructans 20631-21]
Length = 511
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 67/279 (24%), Positives = 108/279 (38%), Gaps = 66/279 (23%)
Query: 22 FGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDS 81
GK+ + P G+ W + Y++ + G K +H + Y D +I + S
Sbjct: 99 IGKWHLGEGSDHEPTGFDFWSVVSGQGAYHDPEMIEMG-KTRHEKGYATD----IITDKS 153
Query: 82 VRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFF-NVTSHHTPAYDYA------ 134
+ +L++ Q+ KP L+ APH + P++A+++ +V T DY
Sbjct: 154 LEWLKKLDQA---KPFFLMCHHKAPHRSWECDPKHADLYTEDVKVPETFDDDYKNRARAA 210
Query: 135 ----------------------------------------PNP---------DKQWILQV 145
PNP DKQ
Sbjct: 211 AEAKIRVDTDLTYFDLGLAQPDGGSEVGELVSLTSTLRKIPNPEDVTKLRLIDKQSGEVF 270
Query: 146 TRKMQPVHRQFTDLLMTKR-LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLG 204
T K R F KR L+T+ S+DD V +++K L D N+T +IYTSD G+ LG
Sbjct: 271 TFKTNAELRHFKYQRYIKRYLRTIASIDDNVGRMLKYLDDENLTNDTVVIYTSDQGFFLG 330
Query: 205 QFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYLLDVLI 243
G K F +E ++PF++R P + D++
Sbjct: 331 DHGWFD-KRFMYEESFQMPFMIRYPREIAKGSVCDDIVC 368
>gi|358065392|ref|ZP_09151935.1| hypothetical protein HMPREF9473_03998 [Clostridium hathewayi
WAL-18680]
gi|356696383|gb|EHI57999.1| hypothetical protein HMPREF9473_03998 [Clostridium hathewayi
WAL-18680]
Length = 519
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 75/159 (47%), Gaps = 18/159 (11%)
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP---AYDYA 134
N ++ FL S +P +L + + APH P + +Y N ++N T P +D
Sbjct: 178 TNRAIDFLDNRDAS---RPFLLFVGYYAPHSPHNPPAEYFNRYYNRTDLDEPWIAPWDVP 234
Query: 135 PNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLGELNNTY 192
P + + + +H Q +L + + +D ++++ L+NTY
Sbjct: 235 PVNNGNVMSRY------IHLQGEELRAARAGYYGNIAFLDSQTSRLLERAM---RLDNTY 285
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
+I+TSDHG LG L + KS P+E + +PF+M GP I
Sbjct: 286 VIFTSDHGEMLGDHYLYQ-KSRPYEGAVHIPFMMMGPDI 323
>gi|344201753|ref|YP_004786896.1| sulfatase [Muricauda ruestringensis DSM 13258]
gi|343953675|gb|AEM69474.1| sulfatase [Muricauda ruestringensis DSM 13258]
Length = 560
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 70/281 (24%), Positives = 109/281 (38%), Gaps = 68/281 (24%)
Query: 15 FDISHGYFGKYLNKYN-----------GSYIPPGWREWGALIMNSKYYNYS-INMNGRKI 62
FD S FG+ L + G G+ L YYN + IN G+
Sbjct: 121 FDFSQQTFGELLQQAGYKTGVLGKLHLGDTPSKGFDYVDILPGQGSYYNPTFINEEGQYQ 180
Query: 63 KHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFN 122
G Y ++I + +++++ K ++P ML + +PH P P M+ N
Sbjct: 181 LEG------YTTEIITDKAIKWMDSVKSG--EQPFMLFLGHKSPHRPWQPGPNELGMYEN 232
Query: 123 VT--SHHTPAYDYAPNPD----------------------------------KQWIL--- 143
V T DY+ N + KQW
Sbjct: 233 VEIPEPETLFDDYSGNREVASLNYMSISEAMKLEQDLKITDQPQNGFTEEQQKQWDAIYG 292
Query: 144 QVTRKMQPVHRQFTDLL-------MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYT 196
+ K + + Q DL M L ++ VD +V ++ LKD G NT +IYT
Sbjct: 293 PINEKFKKDNPQGDDLTRFKYQRYMRDYLASVAGVDKSVGHVLDYLKDAGLDENTIVIYT 352
Query: 197 SDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV-PGTM 236
SD G++LG+ G K + ++ +R P L++ PG V PGT+
Sbjct: 353 SDQGFYLGEHGWFD-KRWMYKESLRTPLLVKWPGTVKPGTV 392
>gi|374375586|ref|ZP_09633244.1| N-acetylglucosamine-6-sulfatase [Niabella soli DSM 19437]
gi|373232426|gb|EHP52221.1| N-acetylglucosamine-6-sulfatase [Niabella soli DSM 19437]
Length = 566
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 57/195 (29%), Positives = 89/195 (45%), Gaps = 26/195 (13%)
Query: 54 SINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA 113
+N+NG+ + Y D + + ++ FL ++ KP ML +S A H A
Sbjct: 219 GLNVNGKHVPQ-----KGYITDELTDYALDFLNHRDKN---KPFMLYLSHKAVHDNFVPA 270
Query: 114 PQYANMFFN-----VTSHHTPAYDYAPNPDKQWILQVTRKMQ-----PVHRQFTDLLMTK 163
+ MF N + T + AP W LQ R P H K
Sbjct: 271 ERDKGMFKNDHFNPPLTMQTGKLEGAP----MW-LQNQRNSWHGVDFPYHSNLDIEAYYK 325
Query: 164 R-LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRV 222
R +TL VD+++ +I+ LK G L +T IIY D+G+ G+ GL+ ++ +E +RV
Sbjct: 326 RYAETLYGVDESLGRILDYLKQKGLLESTLIIYMGDNGFQFGEHGLIDKRT-AYEASMRV 384
Query: 223 PFLMRGPGIV-PGTM 236
P + PGI+ PGT+
Sbjct: 385 PMVAYCPGIIRPGTV 399
>gi|251799577|ref|YP_003014308.1| sulfatase [Paenibacillus sp. JDR-2]
gi|247547203|gb|ACT04222.1| sulfatase [Paenibacillus sp. JDR-2]
Length = 511
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 42/64 (65%)
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
+D + I+ +L++LG +NT +++T+DHG+ GQ GL F +E IR+PF++R PG
Sbjct: 317 MDKYIGTILDKLEELGIADNTVVVFTTDHGHFFGQHGLQAKGGFHYEDLIRLPFIVRYPG 376
Query: 231 IVPG 234
VP
Sbjct: 377 QVPA 380
>gi|421613935|ref|ZP_16055004.1| sulfatase family protein [Rhodopirellula baltica SH28]
gi|408495142|gb|EKJ99731.1| sulfatase family protein [Rhodopirellula baltica SH28]
Length = 494
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 78/165 (47%), Gaps = 11/165 (6%)
Query: 75 DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYA 134
D + + + F+R+ S++P +S P PHGP Y MF ++ P +
Sbjct: 201 DWLCDRAADFIRE----HSQEPFCYHLSLPDPHGPNTVRQPYDTMFEDMPVR--PPMTFQ 254
Query: 135 PNPDK-QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYI 193
+ K W+ R + ++F LMT+ ++ +DD V ++ L +L T +
Sbjct: 255 LDGTKPGWLPGTNRNAE---QRFNARLMTQYFGMVRCIDDNVGMLLALLDELSLTERTVV 311
Query: 194 IYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMYL 238
++TSDHG + G + K P+E +VP ++ PG+VP + +
Sbjct: 312 VFTSDHGDLCYEHGRLN-KGNPYEGSAKVPMVIAAPGLVPAGLRI 355
>gi|373451267|ref|ZP_09543194.1| hypothetical protein HMPREF0984_00236 [Eubacterium sp. 3_1_31]
gi|371969034|gb|EHO86486.1| hypothetical protein HMPREF0984_00236 [Eubacterium sp. 3_1_31]
Length = 483
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 65/252 (25%), Positives = 100/252 (39%), Gaps = 35/252 (13%)
Query: 25 YLNKYNGSYIPPGWREWGAL------IMNSKYYNYSINMNGRK----IKHGEDYYNDYYP 74
YL Y G+ IP W + NS + +N +G + + H Y +P
Sbjct: 122 YLGCYRGNDIP-HWMHQEVCDDYLFDLQNSLDRHADVNASGAECNSWVTHPWIYEERLHP 180
Query: 75 -DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDY 133
+ A++S+RFL+ +S KP L+ SF PH P D P Y M+ + D+
Sbjct: 181 TNWCADNSIRFLKTRDRS---KPFFLMSSFVRPHQPLDPPPAYYAMYKDKQLREPAQGDW 237
Query: 134 APNP-DKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTY 192
KQ+ M ++ VD + +I+ L++ G +T
Sbjct: 238 DNEAMTKQYGFTSDSIYGCNDAAMRHDAMAGYYASITHVDHQIGRILSALREEGMYEDTI 297
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM------------RGPGIV------PG 234
I++ SDHG L L + K FP+E +PFLM R GI P
Sbjct: 298 ILFVSDHGEMLFDHSLWR-KVFPYEGSTHIPFLMHIGKNIADLLPKRIQGICELRDIMPT 356
Query: 235 TMYLLDVLIPQV 246
+ L D+ IP+
Sbjct: 357 LLALCDLPIPET 368
>gi|397611619|gb|EJK61409.1| hypothetical protein THAOC_18109 [Thalassiosira oceanica]
Length = 373
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 75/166 (45%), Gaps = 28/166 (16%)
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA------NMFFNVTSHH-TPAYD 132
D++RFLR+ + KP L +++ A H + QY +M+ N T YD
Sbjct: 175 DALRFLREKPKD---KPFFLNVAYYATHAVDTDVRQYMPQRASMSMYLNDTIFEPEDTYD 231
Query: 133 YAP------NPDK---QWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
P N + +W +K Q +M + VD +V IV +L+
Sbjct: 232 KMPYFFSQENEGRNRWRWRFDTHKKHQR--------MMKNYYRMASEVDTSVGIIVDQLQ 283
Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
GELNNT II+T+D+G + GL K +P + IRVP ++ P
Sbjct: 284 KSGELNNTMIIFTTDNGNFHAEHGLAD-KWYPHQESIRVPLIINDP 328
>gi|423688263|ref|ZP_17663066.1| N-acetylglucosamine-6-sulfatase [Vibrio fischeri SR5]
gi|371492766|gb|EHN68372.1| N-acetylglucosamine-6-sulfatase [Vibrio fischeri SR5]
Length = 518
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 80/175 (45%), Gaps = 16/175 (9%)
Query: 68 YYNDYYPDLIANDSVRFLRQSKQSF--SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTS 125
+ N + P+ A+ ++++LR ++ + KP LV+S PH P D PQ F +S
Sbjct: 220 HINQWSPEHEADIAIKYLRNDGGNYRDNDKPFTLVVSMNPPHSPYDQVPQKYLDRFKDSS 279
Query: 126 HHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDL 185
+ P+ QW + P + + + + VD+ +I+ EL L
Sbjct: 280 RTLNS-----RPNVQWDKEYLEGYGPEY-------FKEYMAMVNGVDEQFGRIINELDAL 327
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG-IVPGTMYLL 239
+T +++ SDHG +G G K+ +E +R+P + R PG I P + LL
Sbjct: 328 NLTEDTLVVFFSDHGCCMGSNG-NPTKNVHYEEAMRIPMMFRWPGKIAPKSDDLL 381
>gi|266621952|ref|ZP_06114887.1| arylsulfatase [Clostridium hathewayi DSM 13479]
gi|288866341|gb|EFC98639.1| arylsulfatase [Clostridium hathewayi DSM 13479]
Length = 520
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 51/179 (28%), Positives = 81/179 (45%), Gaps = 15/179 (8%)
Query: 68 YYNDYYP-DLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSH 126
Y Y+P + ++ S+ FLR+ S KP L+ S+ PH P D APQY +
Sbjct: 205 YEEKYHPTNWVSTRSIDFLRRRDTS---KPFFLMASYLRPHPPFD-APQYYFDLYRDKQL 260
Query: 127 HTPAY----DYAPNPDKQWILQV-TRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
PA D D Q + ++ PV + + +D + ++++
Sbjct: 261 TPPAVGDWEDEDFTGDYQRLGRIYDSATGPVDPELIRQAQIGYYACITHLDHQIGRLIQA 320
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG----IVPGTM 236
L + ++NT I++TSDHG L L + KS P+E R+P L+ GP PGT+
Sbjct: 321 LVEYKLMDNTIILFTSDHGEELCDHHLFR-KSRPYEGSCRIPMLLSGPERLIHAAPGTV 378
>gi|251797269|ref|YP_003012000.1| sulfatase [Paenibacillus sp. JDR-2]
gi|247544895|gb|ACT01914.1| sulfatase [Paenibacillus sp. JDR-2]
Length = 480
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 44/65 (67%), Gaps = 2/65 (3%)
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGL-VKGKSFPFEFDIRVPFLMRGP 229
+DD V +++ L+ G+L+NT +++ SDHG LG G+ +KG F ++ +RVP ++RGP
Sbjct: 279 IDDQVGRMLDSLERSGQLDNTIVVFMSDHGELLGDHGMYLKGPHF-YDCSVRVPLIVRGP 337
Query: 230 GIVPG 234
GI G
Sbjct: 338 GIHGG 342
>gi|153808549|ref|ZP_01961217.1| hypothetical protein BACCAC_02844 [Bacteroides caccae ATCC 43185]
gi|423219400|ref|ZP_17205896.1| hypothetical protein HMPREF1061_02669 [Bacteroides caccae
CL03T12C61]
gi|149128871|gb|EDM20088.1| arylsulfatase [Bacteroides caccae ATCC 43185]
gi|392625450|gb|EIY19516.1| hypothetical protein HMPREF1061_02669 [Bacteroides caccae
CL03T12C61]
Length = 527
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 317 MRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 375
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 376 RTPLVMRLP 384
>gi|410456452|ref|ZP_11310313.1| sulfatase [Bacillus bataviensis LMG 21833]
gi|409928121|gb|EKN65244.1| sulfatase [Bacillus bataviensis LMG 21833]
Length = 503
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 52/94 (55%), Gaps = 5/94 (5%)
Query: 171 VDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG 230
+D + KI+ +L DLG NT +++T+DHG+ GQ GL F +E +++P+++R PG
Sbjct: 309 LDKYIGKILDKLDDLGLAKNTIVVFTTDHGHFFGQHGLHDKGGFHYEDLLKLPYIVRYPG 368
Query: 231 IVPG-----TMYLLDVLIPQVRKFSSGSLIFIMS 259
VP +M L L P F+ S+ M+
Sbjct: 369 KVPAGQICHSMQSLVDLAPTFLSFAGISIPRTMT 402
>gi|389872526|ref|YP_006379945.1| type I phosphodiesterase/nucleotide pyrophosphatase family protein
2 [Advenella kashmirensis WT001]
gi|388537775|gb|AFK62963.1| type I phosphodiesterase/nucleotide pyrophosphatase family protein
2 [Advenella kashmirensis WT001]
Length = 538
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 78/166 (46%), Gaps = 17/166 (10%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAY---DY 133
IA+ S+ L + + S++P L +SFP PH P QY +M+ PA D+
Sbjct: 223 IADRSIDLL--NDYAGSEQPFFLQVSFPDPHHPFTPPGQYWDMY-KPEDMALPASFYSDH 279
Query: 134 APNPDKQWILQ------VTRKMQPV----HRQFTDLLMTKRLQTLQSVDDAVEKIVKELK 183
P QW+ Q T+K V R+ + L ++ ++D + +++ EL+
Sbjct: 280 EPPQHVQWLYQRRDSGAATKKGTAVFACDEREAKEALALN-YGSISNIDAQIGRVMAELE 338
Query: 184 DLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
LG +NT +I+TSDHG LG L+ + RVPF+ P
Sbjct: 339 RLGLQDNTIVIFTSDHGDFLGDHQLMLKGPIHYRGLTRVPFIWSDP 384
>gi|229822378|ref|YP_002883904.1| sulfatase [Beutenbergia cavernae DSM 12333]
gi|229568291|gb|ACQ82142.1| sulfatase [Beutenbergia cavernae DSM 12333]
Length = 489
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 42/158 (26%), Positives = 67/158 (42%), Gaps = 27/158 (17%)
Query: 93 SKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPV 152
P +SFP PH P Y ++F D P++ +
Sbjct: 177 EDDPFFCWVSFPEPHNPYQVPEPYFSLF-----------DEDEIPERLAGPEALAGKGRT 225
Query: 153 HRQFTDLLMTKR--------------LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSD 198
+R DL+ TKR L+ +DD + ++V L D +T I + SD
Sbjct: 226 YRWLGDLIGTKRPGYDEAWRRYAANYCGMLRLIDDQLRRLVGHLGD--RARDTVIAFVSD 283
Query: 199 HGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
HG +G++GL + + EF +R+PF + GPG+ PG +
Sbjct: 284 HGDFVGEYGLQRKGAGMSEFLMRIPFQLSGPGVRPGGV 321
>gi|357384297|ref|YP_004899021.1| mucin-desulfating sulfatase MdsA [Pelagibacterium halotolerans B2]
gi|351592934|gb|AEQ51271.1| mucin-desulfating sulfatase MdsA precursor [Pelagibacterium
halotolerans B2]
Length = 519
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 46/72 (63%), Gaps = 1/72 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M + L+T+QS+DD V +++ L + G NT +IYTSD G+ LG+ G K F +E +
Sbjct: 292 MKRYLRTVQSIDDNVGRMLDYLDERGLSENTIVIYTSDQGFFLGEHGWFD-KRFMYEESL 350
Query: 221 RVPFLMRGPGIV 232
++PFL+R P +
Sbjct: 351 QMPFLVRYPAAI 362
>gi|270296054|ref|ZP_06202254.1| mucin-desulfating sulfatase MdsA [Bacteroides sp. D20]
gi|270273458|gb|EFA19320.1| mucin-desulfating sulfatase MdsA [Bacteroides sp. D20]
Length = 529
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 64/277 (23%), Positives = 105/277 (37%), Gaps = 72/277 (25%)
Query: 20 GYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSI---NMNGRKIKHGEDYYNDYYPDL 76
GY K++ + P G+ + L +YYN + NG IK Y L
Sbjct: 123 GYVTSVFGKWHLNVEPKGFDHYDILFDQGEYYNPKFRRPDTNGEYIKE-----KGYATTL 177
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPH-------------------GPED------ 111
I + ++ +L K +KP ++++ APH PE
Sbjct: 178 ITDHAIEWL--EKNISDEKPFCVLVNHKAPHRNWMPDFPNMHLFEDVDFPEPETLFDDYT 235
Query: 112 -SAPQYANMFFNVTSHHTPAYDYA-----PNPDKQWI----LQVTRKMQPVHRQFTDLLM 161
PQ + H A+D+ P Q+I L +M R + +
Sbjct: 236 TRGPQMKTQELTIDRHMGYAFDFKVRELKNEPTLQYIRDSWLMAMAEMDSKQRATWEAIY 295
Query: 162 TKR--------------------------LQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
K+ +T++SVD+ V +++ L++ G L+NT I+Y
Sbjct: 296 DKKNEEFLKNPPVGKDLLRWKYQRYVKEYCRTIRSVDEQVGRLLDYLEEKGVLDNTVIVY 355
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
TSD G+ LG+ GL K F +E R P ++ P +V
Sbjct: 356 TSDQGFLLGEHGLY-DKRFMYEESFRTPLIISCPSLV 391
>gi|218780829|ref|YP_002432147.1| sulfatase [Desulfatibacillum alkenivorans AK-01]
gi|218762213|gb|ACL04679.1| sulfatase [Desulfatibacillum alkenivorans AK-01]
Length = 468
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 44/69 (63%), Gaps = 1/69 (1%)
Query: 165 LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPF 224
+ + SVD V +++K L+D G +NT ++Y D+GY G+ + K + +E IR+PF
Sbjct: 276 CRVVASVDRQVGRLLKFLEDKGLADNTIVVYAGDNGYFWGEHRKID-KRWAYEESIRIPF 334
Query: 225 LMRGPGIVP 233
++R PG+VP
Sbjct: 335 MIRAPGVVP 343
>gi|343084004|ref|YP_004773299.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342352538|gb|AEL25068.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 445
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 58/228 (25%), Positives = 100/228 (43%), Gaps = 45/228 (19%)
Query: 20 GYFGKYLNKYNGSYIPP--GWREWGALIMNSKYYNYSINM-------NGRKIKHGEDYYN 70
G FGK+ Y+ ++ P G+ E+ + + Y+ I+ +G KIK+ + Y
Sbjct: 127 GMFGKWHLGYDKAFNPTLQGFDEFVGFVSGNVDYHGHIDQEGYLDWWDGVKIKNEKGYTT 186
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
D LI+ V+F+++ + P L + APH P
Sbjct: 187 D----LISEYGVKFIQEHNPEVKRAPFFLYLPHEAPHSP--------------------- 221
Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQ--FTDLLMTKRLQTLQSVDDAVEKIVKELKDLGEL 188
Y DK V R++ Q D + + + ++ +D+ V +I++ LK+ G+
Sbjct: 222 --YQRRIDK-----VLREIGTAGTQEVIQDSISSIYKEMVEVMDEGVGRIMQSLKETGQY 274
Query: 189 NNTYIIYTSDHGY-HLGQFGLVKG-KSFPFEFDIRVPFLMRGPGIVPG 234
NT +I+ SD+G H G G ++G K+ P+E RVP + P V G
Sbjct: 275 ENTIVIFISDNGANHYGDNGGLRGFKAGPYEGGSRVPAIFSFPKEVKG 322
>gi|319641749|ref|ZP_07996431.1| sulfatase [Bacteroides sp. 3_1_40A]
gi|317386636|gb|EFV67533.1| sulfatase [Bacteroides sp. 3_1_40A]
Length = 515
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 311 MRDYMKTVKSLDDNVGRVLDYLKENGLLDNTLVVYTSDQGFYMGEHGWFD-KRFMYEESM 369
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 370 RTPLIMRLP 378
>gi|345519869|ref|ZP_08799280.1| hypothetical protein BSFG_02713 [Bacteroides sp. 4_3_47FAA]
gi|254836257|gb|EET16566.1| hypothetical protein BSFG_02713 [Bacteroides sp. 4_3_47FAA]
Length = 515
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 311 MRDYMKTVKSLDDNVGRVLDYLKENGLLDNTLVVYTSDQGFYMGEHGWFD-KRFMYEESM 369
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 370 RTPLIMRLP 378
>gi|223938010|ref|ZP_03629909.1| sulfatase [bacterium Ellin514]
gi|223893411|gb|EEF59873.1| sulfatase [bacterium Ellin514]
Length = 500
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 45/201 (22%), Positives = 88/201 (43%), Gaps = 36/201 (17%)
Query: 52 NYSINMNGRKIKHGEDYYN-------------DYYPDLIANDSVRFLRQSKQSFSKKPIM 98
+ IN+ G + H Y++ +Y D + +++V+F+ +K + P +
Sbjct: 166 GFDINIGGCGMGHPSSYFSPYKNPTLKDGPVGEYLADRLTDEAVKFIENTKGT----PFL 221
Query: 99 LVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTD 158
L +S + H P + + + P +++ + + V Q
Sbjct: 222 LYLSHYSVHTPLQAKKGL------IEKYQKKVMQLPPTKGPEFVTEGNTNARQVQNQPIY 275
Query: 159 LLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFG-------LVKG 211
M +QS+D++V +++ +LK+LG NT II+TSD+G G L G
Sbjct: 276 AAM------MQSLDESVGRVLDKLKELGLDKNTVIIFTSDNGGLSTAEGAPTSNMPLRAG 329
Query: 212 KSFPFEFDIRVPFLMRGPGIV 232
K +P+E +R P +++ PG+
Sbjct: 330 KGWPYEGGVREPLVVKWPGVT 350
>gi|372210442|ref|ZP_09498244.1| mucin-desulfating sulfatase [Flavobacteriaceae bacterium S85]
Length = 506
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 54/241 (22%), Positives = 104/241 (43%), Gaps = 53/241 (21%)
Query: 37 GWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQSKQSFSKKP 96
G+ W L+ +Y N ++ + IK+ + Y D ++ N ++ +L + + +KP
Sbjct: 136 GFDYWEVLLGQGEYKNPDF-LSSKGIKNEKGYVTD----IVTNKALGWLEKGRNK--EKP 188
Query: 97 IMLVMSFPAPHGP------------EDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQ 144
ML++ APH P + + P+ N+F + + T A + N + +
Sbjct: 189 FMLMVHQKAPHRPWVPKREHMQLYKDVNIPEPDNLFDDYATRTTAAREQDLNMNTSFRDD 248
Query: 145 VTRKMQP------------VHRQFTD---------------------LLMTKRLQTLQSV 171
K++P + +F D L M L+ + SV
Sbjct: 249 SDLKIKPLTFRPERFIKVGIDDRFADRIEKYKHLKLTGKELTKLKYQLYMKDYLRCIWSV 308
Query: 172 DDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
D+ + +++ LK G NT ++Y+SD G+++G+ G K F +E R P +++ PG+
Sbjct: 309 DENIGRLMDYLKATGLDKNTIVMYSSDQGFYMGEHGWF-DKRFMYEESFRTPLIVKWPGV 367
Query: 232 V 232
V
Sbjct: 368 V 368
>gi|423293953|ref|ZP_17272080.1| hypothetical protein HMPREF1070_00745 [Bacteroides ovatus
CL03T12C18]
gi|392677174|gb|EIY70593.1| hypothetical protein HMPREF1070_00745 [Bacteroides ovatus
CL03T12C18]
Length = 509
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 82/165 (49%), Gaps = 7/165 (4%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
N + +L A+ +++F+ + + + P ++ ++F +PH P + P Y + + P
Sbjct: 203 NTFSSELYADAAIKFIETN--ASNDNPFLMYVAFTSPHDPRNVLPDYGRKY-DSKEITMP 259
Query: 130 AYDYAPNP-DKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLG 186
+P D + + K+ P R +L + + VD + +I+ L+ G
Sbjct: 260 KNFITQHPFDNGDLNERDEKLLPTPRVPEQVLAERANYYSMVNEVDVQIGRILDMLEKSG 319
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
+ +NT I++ +D+G +G+ GL+ GK +E +RVP ++ GP I
Sbjct: 320 KDDNTIIVFAADNGLCVGEHGLL-GKQNLYEAAVRVPLVICGPNI 363
>gi|149915315|ref|ZP_01903842.1| sulfatase [Roseobacter sp. AzwK-3b]
gi|149810604|gb|EDM70445.1| sulfatase [Roseobacter sp. AzwK-3b]
Length = 505
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 45/173 (26%), Positives = 82/173 (47%), Gaps = 13/173 (7%)
Query: 73 YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP--- 129
Y D +A ++ R + + +P L +SF PH P + +Y +++ + H P
Sbjct: 163 YDDEVAFNASRKVYDLGRGHDARPWCLTVSFTHPHDPYVARRKYWDLY-EKSDHLLPTVP 221
Query: 130 --AY-DYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKD 184
AY D+ P+ + + R D+ ++R + +DD + +++ L
Sbjct: 222 AMAYEDHDPHSKRIFDANDWRSFDITE---ADIRRSRRAYFANISYLDDKIGEVLAALDG 278
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTMY 237
+ +NT I++ SDHG LG+ GL SF +E RVP ++ PG+ PG ++
Sbjct: 279 TRQTDNTIILFVSDHGDMLGERGLWFKMSF-YEGASRVPLMIAAPGMAPGLVH 330
>gi|256422335|ref|YP_003122988.1| sulfatase [Chitinophaga pinensis DSM 2588]
gi|256037243|gb|ACU60787.1| sulfatase [Chitinophaga pinensis DSM 2588]
Length = 520
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%), Gaps = 2/76 (2%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M L T +S+D + +I++ L G +NT +IY SD G++LG+ G K F +E +
Sbjct: 311 MRDYLATARSLDRNIGRILQYLDSTGLADNTVVIYCSDQGFYLGEHGWFD-KRFIYEQSL 369
Query: 221 RVPFLMRGPGIV-PGT 235
PF+MR PG++ PGT
Sbjct: 370 HTPFVMRYPGVIKPGT 385
>gi|149196007|ref|ZP_01873063.1| iduronate-2-sulfatase [Lentisphaera araneosa HTCC2155]
gi|149140854|gb|EDM29251.1| iduronate-2-sulfatase [Lentisphaera araneosa HTCC2155]
Length = 574
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 42/158 (26%), Positives = 78/158 (49%), Gaps = 20/158 (12%)
Query: 90 QSFSKKPIMLVMSFPAPHGPEDSAPQYANMF-------------FNVTSHHTPAYDYAPN 136
+S S + L + F PH P A +Y +MF +H+T +D
Sbjct: 226 RSGSDQAFFLSVGFVRPHTPLHVAQKYFDMFPIDQIQLPEILENDADDTHYTDLFD---- 281
Query: 137 PDKQWIL--QVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYII 194
DK+ ++ + +K P ++ L ++ +VD+ + +++K L + +NT +I
Sbjct: 282 ADKKGLMYFDLLKKSYPNWQEGIKAFTQAYLASIAAVDENIGRVIKTLDESRFKDNTIVI 341
Query: 195 YTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
+TSDHG+++G+ + K+ P+E RVPF++R P I
Sbjct: 342 FTSDHGWNMGEKDYL-FKNSPWEESGRVPFIVRAPQIA 378
>gi|255690669|ref|ZP_05414344.1| choline sulfatase [Bacteroides finegoldii DSM 17565]
gi|260623693|gb|EEX46564.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
Length = 344
Score = 57.0 bits (136), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 81/165 (49%), Gaps = 7/165 (4%)
Query: 70 NDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP 129
N + +L A+ +++F+ + + + P ++ ++F +PH P + P Y + P
Sbjct: 38 NTFSSELYADAAIKFIETN--ASNDNPFLMYVAFTSPHDPRNVLPDYGRKY-GSKEITMP 94
Query: 130 AYDYAPNP-DKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDLG 186
+P D + + K+ P R +L + + VD + +I+ L+ G
Sbjct: 95 KNFVTQHPFDNGDLNERDEKLLPTPRVPEQVLAERANYYSMVNEVDVQIGRILDMLEKSG 154
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
+ +NT I++ +D+G +G+ GL+ GK +E +RVP ++ GP I
Sbjct: 155 KADNTIIVFAADNGLCVGEHGLL-GKQNLYEAAVRVPLVICGPNI 198
>gi|399994498|ref|YP_006574738.1| choline-sulfatase [Phaeobacter gallaeciensis DSM 17395 = CIP
105210]
gi|398659053|gb|AFO93019.1| putative choline-sulfatase [Phaeobacter gallaeciensis DSM 17395 =
CIP 105210]
Length = 468
Score = 57.0 bits (136), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 49/173 (28%), Positives = 75/173 (43%), Gaps = 38/173 (21%)
Query: 107 HGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQ---------VTRKMQPVH---- 153
H PE + P +A M VT H Y ++WI Q V +++P H
Sbjct: 172 HAPERNEP-WAAMVSLVTPH------YPLTAPQEWIDQIDPEMIAPPVAGRLRPAHPELA 224
Query: 154 --RQFTD---------LLMTKR--LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHG 200
R F D +++ +R L + +DD + +++ L+ G+ T I+Y SDHG
Sbjct: 225 RIRDFYDHDPWFDDEAVMVARRSYLALVAFMDDCMGQVLSALEQSGQTEETLIVYVSDHG 284
Query: 201 YHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM----YLLDVLIPQVRKF 249
LG G GKS E + VP ++ GPG+ G LLD+ + F
Sbjct: 285 EMLGDHGFW-GKSVMLESSVGVPMIVSGPGVPVGRCKTPTSLLDIATTALATF 336
>gi|333031085|ref|ZP_08459146.1| N-acetylglucosamine-6-sulfatase [Bacteroides coprosuis DSM 18011]
gi|332741682|gb|EGJ72164.1| N-acetylglucosamine-6-sulfatase [Bacteroides coprosuis DSM 18011]
Length = 520
Score = 57.0 bits (136), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 45/74 (60%), Gaps = 1/74 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M + T +VD+ + K++ L++ GEL+NT IIYTSD G+ LG+ G K F +E
Sbjct: 316 MRDYMSTTLAVDEGIGKLLDYLEETGELDNTLIIYTSDQGFFLGEHGFFD-KRFMYEECQ 374
Query: 221 RVPFLMRGPGIVPG 234
R+P +MR P ++
Sbjct: 375 RMPLVMRYPKLIKA 388
>gi|325109948|ref|YP_004271016.1| N-acetylglucosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324970216|gb|ADY60994.1| N-acetylglucosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 534
Score = 57.0 bits (136), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 61/254 (24%), Positives = 109/254 (42%), Gaps = 62/254 (24%)
Query: 35 PPGWREWGALIMNSKYYNYSI------NMNGRKIKHGEDYYNDYYPDLIANDSVRFLRQS 88
P G+ L+ YYN + + N +++H Y D+I + ++ +L++
Sbjct: 130 PTGFDYSEVLVGQGTYYNPVMLKDPNGDGNRERVRH-----TGYTTDIITDLALDWLKEG 184
Query: 89 KQSFSKKPIMLVMSFPAPH-----GPE-------DSAPQYANMFFNVTSHHTPA------ 130
+ S KP M++ APH GP+ ++ P+ N+F + T A
Sbjct: 185 RDS--DKPFMMMFQHKAPHREWAPGPDHLTMYDGETIPEPDNLFDDYEGRGTAAKTQDMT 242
Query: 131 -----------YDYAP--NPDKQWILQVTRKMQPVHRQFTDLLMTKR------------- 164
+ P N D++ + P ++ F + +T
Sbjct: 243 IAKTMTARDLKFTVPPYMNEDQKKVW--NEAYDPKNKAFEEANLTGDDLIRWKYQRYIKD 300
Query: 165 -LQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVP 223
L+ + SVDD V ++++ L+D G NT +IY+SD G++LG G K F +E R P
Sbjct: 301 YLRCIASVDDNVGRMLQYLEDNGLAENTVVIYSSDQGFYLGDHGWFD-KRFMYEESYRQP 359
Query: 224 FLMRGPGIV-PGTM 236
++R P +V PG++
Sbjct: 360 LMVRWPNVVKPGSV 373
>gi|198274022|ref|ZP_03206554.1| hypothetical protein BACPLE_00159 [Bacteroides plebeius DSM 17135]
gi|198273100|gb|EDY97369.1| arylsulfatase [Bacteroides plebeius DSM 17135]
Length = 522
Score = 57.0 bits (136), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 45/69 (65%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ LK+ G L NT ++YTSD G+++G+ G K F +E +
Sbjct: 312 MRDYMKTVKSLDDNVGRVLDYLKEKGLLENTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 370
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 371 RTPLIMRLP 379
>gi|421613377|ref|ZP_16054463.1| iduronate-2-sulfatase [Rhodopirellula baltica SH28]
gi|408495971|gb|EKK00544.1| iduronate-2-sulfatase [Rhodopirellula baltica SH28]
Length = 534
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 80/174 (45%), Gaps = 19/174 (10%)
Query: 69 YNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHT 128
Y D Y +A +++ + Q ++ P L M + PH + +Y +++ ++
Sbjct: 197 YVDGYNTAMAIATLKEMTQDNET----PFFLAMGYKLPHLNWCAPSKYWDLY---DANDI 249
Query: 129 PAYDYAPNPDKQWILQV-------TR----KMQPVHRQFTDLLMTKRLQTLQSVDDAVEK 177
P D P+ + + TR K+ P+ + + L L ++ VD + K
Sbjct: 250 PMADETDAPENGAAMGLHASFELRTRAGIPKIGPLSPELSRKLKHAYLASVSYVDAQIGK 309
Query: 178 IVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGI 231
++ L+D G +NT I+ DHG+HLG G V GK+ +E RVP ++ P +
Sbjct: 310 LIAALEDAGVRDNTVIVVWGDHGWHLGDMG-VWGKATNYEIATRVPLMIWAPDM 362
>gi|196233398|ref|ZP_03132242.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196222538|gb|EDY17064.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 478
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 73/266 (27%), Positives = 118/266 (44%), Gaps = 40/266 (15%)
Query: 1 MAWPVELTSRNLLYFDISHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSI----- 55
M V L SR +L +S + K + + G P + + K Y+
Sbjct: 73 MEGAVCLPSRTMLLTGMSMFHAMKAVPRGKGPIAPDDTPDTYSFPRAMKAAGYATLHTGK 132
Query: 56 NMNGRKIKHGE--DYYNDYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSA 113
N N K+ E + Y+ A+ ++ F+ K++ KP+ L + APH P D
Sbjct: 133 NDNSPKLVTAEFDETYDPGESMACADKAIEFI---KRTAGTKPMCLYI---APHEPHD-- 184
Query: 114 PQYANMFFNVTSHHTPA-----YDYAP-----NPDKQWILQVTRKMQ---PVHRQFTDLL 160
PQYA F +H+ P +AP N D ++V +M P ++
Sbjct: 185 PQYATPDF--YAHYNPEDVPLPKAFAPFHTFDNGD----IKVRDEMTLPWPRTKENLGKK 238
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEF-D 219
+ + + + D A+ +IV+ L+D G+ NT I D+G LG+ GL+ GK +EF
Sbjct: 239 LARYYASTEYWDAAMGRIVQALRDAGQFENTIFIIAGDNGLSLGEHGLL-GKQNLYEFGG 297
Query: 220 IRVPFLMRGPGIVPG----TMYLLDV 241
+ VP ++ G GI G T+YL+DV
Sbjct: 298 MHVPLIVAGKGIPKGETRATVYLMDV 323
>gi|260427447|ref|ZP_05781426.1| choline-sulfatase [Citreicella sp. SE45]
gi|260421939|gb|EEX15190.1| choline-sulfatase [Citreicella sp. SE45]
Length = 503
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 77/169 (45%), Gaps = 11/169 (6%)
Query: 73 YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTP--- 129
Y D +A + R + + +P L +SF PH P + +Y +++ + H P
Sbjct: 162 YDDEVAYHATRKVYDLGRRKDDRPWCLTVSFTHPHDPYVARRKYWDLYEDC-EHLLPEVG 220
Query: 130 --AYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKR--LQTLQSVDDAVEKIVKELKDL 185
YD +P Q I + D+ ++R + +D + +I++ L+
Sbjct: 221 DLGYDNQ-DPHSQRIFDAN-DWRSFDITEEDIRRSRRAYFANISYLDGKIGEILEALEGT 278
Query: 186 GELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
+ NT I++ SDHG LG+ GL +F FE RVP +M PG+ PG
Sbjct: 279 RQAENTAILFVSDHGDMLGERGLWFKMNF-FEGSSRVPMMMAAPGVAPG 326
>gi|309775806|ref|ZP_07670801.1| arylsulfatase [Erysipelotrichaceae bacterium 3_1_53]
gi|308916483|gb|EFP62228.1| arylsulfatase [Erysipelotrichaceae bacterium 3_1_53]
Length = 485
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 48/175 (27%), Positives = 79/175 (45%), Gaps = 13/175 (7%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+A+ S+RFL+ ++ +P L+ SF PH P D Y +M+ + D+ P
Sbjct: 185 VADRSIRFLKTRDRT---RPFFLMSSFVRPHQPLDPPQAYFDMYKDKALRLPADGDWDPQ 241
Query: 137 PDKQWILQVTRKMQPVHRQFTDL-LMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIY 195
+++ + + L M ++ VD + +I+ LK+ G ++T I++
Sbjct: 242 EGCGTAGRISDSIFGCCDESMRLDAMAGYYASITHVDHQLGRILTALKEDGVYDDTIILF 301
Query: 196 TSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMR--------GPGIVPGTMYLLDVL 242
TSDHG L L + K FP+E +PFLM P V G L DV+
Sbjct: 302 TSDHGEMLFDHHLWR-KVFPYEGSTHIPFLMHIGKHIADVIPHTVNGITELRDVM 355
>gi|224005180|ref|XP_002296241.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|209586273|gb|ACI64958.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 619
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 78/170 (45%), Gaps = 22/170 (12%)
Query: 80 DSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNP-- 137
D++RFL + KP L ++F A H + QY +++ + A+D P P
Sbjct: 321 DALRFLVTRPRD---KPFFLNVAFYATHAKDGDVRQYMPQNSSMSMY---AHDEIPVPAT 374
Query: 138 --DKQW------ILQVTRKMQPVHRQFTD-----LLMTKRLQTLQSVDDAVEKIVKELKD 184
D+ W + H +F + +M + VD A I++EL+
Sbjct: 375 ATDEAWKKMPYFFTEKNEGRARWHSRFENHTLHQSMMKNYYRMATEVDTACGVILEELRR 434
Query: 185 LGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPG 234
GEL+NT II+T+D+G + GL K +P + IRVP ++R P + G
Sbjct: 435 QGELDNTVIIFTTDNGNFHAEHGLAD-KWYPHQESIRVPLIIRDPRMKGG 483
>gi|421609321|ref|ZP_16050519.1| iduronate-2-sulfatase [Rhodopirellula baltica SH28]
gi|408499985|gb|EKK04446.1| iduronate-2-sulfatase [Rhodopirellula baltica SH28]
Length = 503
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 105/250 (42%), Gaps = 33/250 (13%)
Query: 15 FDISHGYFGK-------YLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGED 67
F I HG G + + Y+ P + G L Y+ + + + G
Sbjct: 133 FHIGHGNHGDPESFSVPHFKEKVIEYLEPASTDGGQLTREEAYFTNQMLGRIKSLPRGAA 192
Query: 68 YYN------DYYPDLIANDSVRFLRQSKQSFSKK--PIMLVMSFPAPHGPEDSAPQ---- 115
Y + DY +A ++++ L+ +KQ + P + F PH P SAPQ
Sbjct: 193 YESPDAKDEDYADGRVAAETIQRLQAAKQRQQTEGTPFFIASGFARPHLPF-SAPQKYWD 251
Query: 116 -YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPV----HRQFTDLLMTKRLQ---- 166
Y + +H T D AP + +++ +PV + F D L +
Sbjct: 252 LYDPASLPMPTHETLPVD-APKVAGKRGGEIS-NYKPVPTEPNADFDDELKRNLIHGYYA 309
Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
++ VD + K++KEL L +NT ++ DHG+HLG G+ K +E R+P L+
Sbjct: 310 SVSYVDAQIGKVIKELDRLELSDNTIVVLWGDHGFHLGDLGIWT-KHTNYEQANRIPILI 368
Query: 227 RGPGIV-PGT 235
PG+ PG+
Sbjct: 369 TAPGVTQPGS 378
>gi|402073090|gb|EJT68723.1| hypothetical protein GGTG_13710 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 383
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 59/129 (45%), Gaps = 6/129 (4%)
Query: 116 YANMFFNVTSHHTPAYDYAPNPDKQ---WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVD 172
+ +F N T TP++ N + WI +++R M + D +R Q L VD
Sbjct: 201 HEKLFTNATVPRTPSFSPPDNVQRNQVWWIRELSR-MGDAEMDWADAEYRRRAQALAGVD 259
Query: 173 DAVEKIVKELKDLGELNNTYIIYTSDHGYHL--GQFGLVKGKSFPFEFDIRVPFLMRGPG 230
V ++ +L++ G L+NTY L + GK+ PF D +PF++RGPG
Sbjct: 260 KLVSDVLGKLEEKGALDNTYSKRRHSFAVPLPSSNHRMAVGKTTPFTEDTNLPFVVRGPG 319
Query: 231 IVPGTMYLL 239
+ G L
Sbjct: 320 VPQGVESTL 328
>gi|423111042|ref|ZP_17098737.1| hypothetical protein HMPREF9687_04288 [Klebsiella oxytoca 10-5243]
gi|376377522|gb|EHS90291.1| hypothetical protein HMPREF9687_04288 [Klebsiella oxytoca 10-5243]
Length = 497
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 76/172 (44%), Gaps = 33/172 (19%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGP---------------EDSAPQYANMFF 121
I+N +V FLRQ ++ +P ++V+S+ PH P D P+ +
Sbjct: 166 ISNRAVDFLRQPARA--DEPFLMVVSYDEPHHPFTCPVEYLRKYQDFYYDLGPKAHDSLA 223
Query: 122 NVTSHHTPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKE 181
+ HH P+P + + R P++ D VDD + +++
Sbjct: 224 DKPEHHRLWSQAMPSP----VGEDGRYHHPLYFACNDF-----------VDDQIGRVINA 268
Query: 182 LKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVP 233
L + +NT++IYTSDHG +G L+ + ++ R+P +MR P VP
Sbjct: 269 LTPQ-QRDNTWVIYTSDHGEMMGAHRLISKGAAMYDDITRIPLIMRPPQGVP 319
>gi|223936368|ref|ZP_03628280.1| sulfatase [bacterium Ellin514]
gi|223894886|gb|EEF61335.1| sulfatase [bacterium Ellin514]
Length = 510
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 45/175 (25%), Positives = 85/175 (48%), Gaps = 19/175 (10%)
Query: 77 IANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPN 136
+A +++R LR+ K KP + F PH P+ + +Y F++ P
Sbjct: 224 VATETIRLLREHKD----KPFFIAAGFYRPHVPDIATKKY----FDLYPKEKIELPIGPK 275
Query: 137 PDKQWILQVTRKMQPVHRQFTD--LLMTKR--LQTLQSVDDAVEKIVKELKDLGELNNTY 192
+ I ++ ++P++ TD L + KR ++ VD V +++ EL++L + T
Sbjct: 276 EHFKDIPEMALTVKPLNYGLTDDQLRLFKRAYFASISFVDAQVGRVLNELENLKLADKTI 335
Query: 193 IIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPG------IVPGTMYLLDV 241
++ DHG+ LG+ G + +S F+ + VP ++ PG + P T+ LLD+
Sbjct: 336 VVCFGDHGWLLGEHGQWQKQSL-FDESVHVPLMVYLPGAKGNGKVSPRTVELLDI 389
>gi|358381505|gb|EHK19180.1| hypothetical protein TRIVIDRAFT_69180 [Trichoderma virens Gv29-8]
Length = 552
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 76/163 (46%), Gaps = 6/163 (3%)
Query: 71 DYYPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPA 130
DY +++ + ++ K+P L +S PH P +Y N + +V P
Sbjct: 181 DYDEEVMYRSTQYLWDHVREGPDKRPFALTVSLTHPHDPYTITKKYWNRYEDV-DIDLPK 239
Query: 131 YDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ----TLQSVDDAVEKIVKELKDLG 186
A ++ + +FTD + + + ++ VDD + K+++ L+D G
Sbjct: 240 VRIAKEKQDTHSKRLLKVCDLWDLEFTDEQIKRAKRAYYGSVSYVDDCIGKLLETLEDAG 299
Query: 187 ELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGP 229
++NT +I++ DHG LG+ GL S+ FE +RVP L+ P
Sbjct: 300 LMDNTIVIFSGDHGDMLGERGLWYKMSY-FESSVRVPMLVHYP 341
>gi|393784525|ref|ZP_10372688.1| hypothetical protein HMPREF1071_03556 [Bacteroides salyersiae
CL02T12C01]
gi|392665506|gb|EIY59030.1| hypothetical protein HMPREF1071_03556 [Bacteroides salyersiae
CL02T12C01]
Length = 526
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M ++T++S+DD V +++ L++ G L+NT ++YTSD G+++G+ G K F +E +
Sbjct: 316 MRDYMKTVKSLDDNVGRVLDYLEEKGMLDNTLVVYTSDQGFYMGEHGWF-DKRFMYEESM 374
Query: 221 RVPFLMRGP 229
R P +MR P
Sbjct: 375 RTPLIMRLP 383
>gi|430741207|ref|YP_007200336.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430012927|gb|AGA24641.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 451
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 53/236 (22%), Positives = 97/236 (41%), Gaps = 45/236 (19%)
Query: 18 SHGYFGKYLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDYYNDYYPDLI 77
+HGYF ++ K++ P S+++++ + +G D +
Sbjct: 111 AHGYFVGHVGKWHNGTFP------------SEHFDFGRSYSGTHWMTEPDGSKIHVTKKN 158
Query: 78 ANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYA------NMFFNVT------- 124
ND++ FLR + KP L ++F A H + + QY +++ +VT
Sbjct: 159 ENDALEFLRDRP---ADKPFCLTLAFFATHAEDGNPKQYLPQPESLSLYQDVTVPVPKTA 215
Query: 125 --SHHTPAYDYAPNPDKQ------WILQVTRKMQPVHRQFTDLLMTKRLQTLQSVDDAVE 176
+H + N + W + Q + + L VD
Sbjct: 216 TEAHFKKLPPFIANEKNEGRNRWHWRFDTPERYQQYMKDYYRLA--------SEVDATCG 267
Query: 177 KIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIV 232
+++ EL+ G L+NT +I+T+D+GY + GL K +P E IRVP ++R P +
Sbjct: 268 RVLAELEKQGVLDNTLVIFTTDNGYFHAEHGLAD-KWYPHEESIRVPLIVRDPRMA 322
>gi|260906523|ref|ZP_05914845.1| choline-sulfatase [Brevibacterium linens BL2]
Length = 509
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 73/157 (46%), Gaps = 9/157 (5%)
Query: 85 LRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT---SHHTPAYDYAPNPDKQW 141
+R ++ + +P ++V SF PH P + ++ + F V H D A +P
Sbjct: 178 VRANQAAGEDQPFLMVTSFIHPHDPYEPPREHWDRFAEVDIPDPAHPEVPDIAEDPHSHR 237
Query: 142 ILQVT--RKMQPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDH 199
+ ++ K +P + +DD + KI + L++L +NT II TSDH
Sbjct: 238 LRTMSGLDKKEPGTEDIR-RARRAYYAAVSYIDDHIGKIRQRLRELELEDNTVIIVTSDH 296
Query: 200 GYHLGQFGLVKGKSFPFEFDIRVPFLMRGP--GIVPG 234
G LG+ GL K P+E RVP ++ GP + PG
Sbjct: 297 GDMLGEKGLWY-KMSPYEQSSRVPIIINGPAEAVTPG 332
>gi|383766220|ref|YP_005445201.1| sulfatase [Phycisphaera mikurensis NBRC 102666]
gi|381386488|dbj|BAM03304.1| sulfatase [Phycisphaera mikurensis NBRC 102666]
Length = 466
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 70/158 (44%), Gaps = 19/158 (12%)
Query: 90 QSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKM 149
Q+ P V F +PH P APQ ++ S TP P PD + L
Sbjct: 211 QAAGGTPFFCVAGFYSPHSPL-VAPQRFLDLYDPKSL-TPL----PEPDDRAGLD----- 259
Query: 150 QPVHRQFTDLLMTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLV 209
P HR+ + + VD V +I+ EL G +T + +T DHG LG FG
Sbjct: 260 DPEHRRQA---LHGYYAMISEVDHWVGEILSELARAGHAEDTVVAFTGDHGESLGDFGR- 315
Query: 210 KGKSFPFEFDI-RVPFLMRGPGIVPGTMYLLDVLIPQV 246
GKSFP E + RVP L+ PG PG + D L+ V
Sbjct: 316 WGKSFPGEDCVSRVPLLVASPGQRPG---VCDALVEAV 350
>gi|253688811|ref|YP_003018001.1| sulfatase [Pectobacterium carotovorum subsp. carotovorum PC1]
gi|251755389|gb|ACT13465.1| sulfatase [Pectobacterium carotovorum subsp. carotovorum PC1]
Length = 564
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/234 (25%), Positives = 98/234 (41%), Gaps = 25/234 (10%)
Query: 20 GYFGKYL---NKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGEDY----YNDY 72
G+FGK+ Y G+ W L+ YY INM G + K D Y
Sbjct: 132 GFFGKWHFGGADYTAKAGFAGFDRWVGLLGQGDYY--PINMFGEQAKLNIDGKMVPQKGY 189
Query: 73 YPDLIANDSVRFLRQSKQSFSKKPIMLVMSFPAPHGPEDSAPQYANMFFNVT-----SHH 127
D + + +V +L + KKP M+ +S H A ++ VT ++
Sbjct: 190 ITDELTDYAVNWLDGIDK---KKPFMMYLSHKGVHSDFYPAIRHKGSMDKVTFPLPETYA 246
Query: 128 TPAYDYAPNPDKQWILQVTRKMQPVHRQFTDLLMTKRLQ-----TLQSVDDAVEKIVKEL 182
+Y P W+ V + + ++ Q TL+SVDD+V ++ + L
Sbjct: 247 DTPENYEGKP--MWVKNQRNSWHGVDYPYNKKMDMQQFQRDYYETLRSVDDSVGRVQEWL 304
Query: 183 KDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLMRGPGIVPGTM 236
K G NT ++ D+G+ G+ GL+ +S +E +RVP + GPG G +
Sbjct: 305 KKNGLDKNTIVMVMGDNGFTFGEHGLIDKRS-AYETSMRVPLIASGPGFGKGDV 357
>gi|440717286|ref|ZP_20897776.1| iduronate-2-sulfatase [Rhodopirellula baltica SWK14]
gi|436437472|gb|ELP31098.1| iduronate-2-sulfatase [Rhodopirellula baltica SWK14]
Length = 503
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 105/250 (42%), Gaps = 33/250 (13%)
Query: 15 FDISHGYFGK-------YLNKYNGSYIPPGWREWGALIMNSKYYNYSINMNGRKIKHGED 67
F I HG G + + Y+ P + G L Y+ + + + G
Sbjct: 133 FHIGHGNHGDPESFSVPHFKEKVIEYLEPASTDGGQLTREEAYFTNQMLGRIKSLPRGAA 192
Query: 68 YYN------DYYPDLIANDSVRFLRQSKQSFSKK--PIMLVMSFPAPHGPEDSAPQ---- 115
Y + DY +A ++++ L+ +KQ + P + F PH P SAPQ
Sbjct: 193 YESPDAKDEDYADGRVAAETIQRLQAAKQRQQTEGTPFFIASGFARPHLPF-SAPQKYWD 251
Query: 116 -YANMFFNVTSHHTPAYDYAPNPDKQWILQVTRKMQPV----HRQFTDLLMTKRLQ---- 166
Y + +H T D AP + +++ +PV + F D L +
Sbjct: 252 LYDPASLPMPTHETLPVD-APKVAGKRGGEIS-NYKPVPTEPNADFDDELKRNLIHGYYA 309
Query: 167 TLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDIRVPFLM 226
++ VD + K++KEL L +NT ++ DHG+HLG G+ K +E R+P L+
Sbjct: 310 SVSYVDAQIGKVIKELDRLELSDNTIVVLWGDHGFHLGDLGIWT-KHTNYEQANRIPILI 368
Query: 227 RGPGIV-PGT 235
PG+ PG+
Sbjct: 369 TAPGVTQPGS 378
>gi|387790940|ref|YP_006256005.1| arylsulfatase A family protein [Solitalea canadensis DSM 3403]
gi|379653773|gb|AFD06829.1| arylsulfatase A family protein [Solitalea canadensis DSM 3403]
Length = 516
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 48/76 (63%), Gaps = 2/76 (2%)
Query: 161 MTKRLQTLQSVDDAVEKIVKELKDLGELNNTYIIYTSDHGYHLGQFGLVKGKSFPFEFDI 220
M L T++SVD++V +++ LK G NT +IYTSD G++LG+ G K F +E +
Sbjct: 312 MQDYLATVKSVDESVGELLDYLKREGLDKNTIVIYTSDQGFYLGEHGWF-DKRFMYEESM 370
Query: 221 RVPFLMRGPG-IVPGT 235
+P LM+ PG I PGT
Sbjct: 371 GMPLLMKYPGHIEPGT 386
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.139 0.431
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,693,552,418
Number of Sequences: 23463169
Number of extensions: 208934889
Number of successful extensions: 511716
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2207
Number of HSP's successfully gapped in prelim test: 3729
Number of HSP's that attempted gapping in prelim test: 505273
Number of HSP's gapped (non-prelim): 7151
length of query: 271
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 131
effective length of database: 9,074,351,707
effective search space: 1188740073617
effective search space used: 1188740073617
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 75 (33.5 bits)