BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy1088
(905 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|270005303|gb|EFA01751.1| hypothetical protein TcasGA2_TC007349 [Tribolium castaneum]
Length = 543
Score = 513 bits (1322), Expect = e-142, Method: Compositional matrix adjust.
Identities = 246/377 (65%), Positives = 298/377 (79%), Gaps = 13/377 (3%)
Query: 44 FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
L +FV + AS P ++IFILADDLGWNDVGFHG QIPTPNIDALAYSG+IL+NYY
Sbjct: 8 LILFPLFVTIRASERP-NVIFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLILQNYYV 66
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
+CTPSRSA+MTGK+PIHTGMQH VL+G E GLPL+EKILP+YL+ELGY R+VGKWH
Sbjct: 67 TPICTPSRSALMTGKYPIHTGMQHTVLFGAEPRGLPLTEKILPEYLRELGYTNRLVGKWH 126
Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD 223
LG Y KEYTP +RGF+SHLGYWTGHQDY+DH+A E WG DMRR+++ A+DLHG+YSTD
Sbjct: 127 LGSYTKEYTPLYRGFDSHLGYWTGHQDYYDHTAVENPGWGFDMRRNMDLAYDLHGQYSTD 186
Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
VFT EAV II NH+T PLFLYLAH A HSANPY PL APD + +I +KR +FAA
Sbjct: 187 VFTQEAVKIIENHNTTNPLFLYLAHVAVHSANPYNPLPAPDETVEKFSNIPSYKRQRFAA 246
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
+L KLD+SVG VVE+L +R+ML NSII+F +DNGGAAAGFNLNAASN+PLRGVKNT+WEG
Sbjct: 247 MLSKLDDSVGAVVESLSKRKMLKNSIIIFSTDNGGAAAGFNLNAASNFPLRGVKNTVWEG 306
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
GVRGAGL+WSPL++ VA+Q++H+SDWLPTLLSA NS+++NI +
Sbjct: 307 GVRGAGLLWSPLIKQPQRVAQQFMHISDWLPTLLSAVG-------ANSSLKNI-----DG 354
Query: 404 ILRYENGTHEYNSPRIE 420
+ +E+ + NSPR E
Sbjct: 355 LDLWESLSQNKNSPRSE 371
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 82/162 (50%), Gaps = 17/162 (10%)
Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKV------- 621
NS IDG+D+W LS+N+ S R+ ILHNIDD + +ALT G WKLVK
Sbjct: 346 NSSLKNIDGLDLWESLSQNKNSPRSEILHNIDDIYGNAALTVGAWKLVKGTTYNGAWDGW 405
Query: 622 -----MRYQVDLT---GGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQI 673
Y DL+ P V L + + + + KLR+A+ I C + +PC+P
Sbjct: 406 YGPSGRDYTYDLSLVVNSPTGVALKSI-NMSLVPDEITKLRNASRIDCATDQPIPCKPLE 464
Query: 674 APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
CLF++ DPCE NLA + +N T + ++N A P
Sbjct: 465 ETCLFNVIQDPCELRNLAKENPKIVMNLEET-LNKYNNTALP 505
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 29/36 (80%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH VL+G E GLPL+EKILP+YL+ELGY R++
Sbjct: 88 MQHTVLFGAEPRGLPLTEKILPEYLRELGYTNRLVG 123
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 27/42 (64%)
Query: 756 MRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
+ KLR+A+ I C + +PC+P CLF++ DPCE NLA
Sbjct: 441 ITKLRNASRIDCATDQPIPCKPLEETCLFNVIQDPCELRNLA 482
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 31/52 (59%)
Query: 854 MKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGDDLK 905
+ K P ++ +E+ L N TA+ P N P D G+PK +++ W+ FGD +K
Sbjct: 481 LAKENPKIVMNLEETLNKYNNTALPPGNLPLDPRGNPKYWNNTWTNFGDYIK 532
>gi|383853606|ref|XP_003702313.1| PREDICTED: arylsulfatase J-like [Megachile rotundata]
Length = 544
Score = 504 bits (1299), Expect = e-140, Method: Compositional matrix adjust.
Identities = 239/351 (68%), Positives = 281/351 (80%), Gaps = 8/351 (2%)
Query: 31 RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
R + AF ++P F+ V ++ PHI+FILADDLGWNDVGFHG QIPTPNIDA
Sbjct: 5 RAVLWAFLIVP--------FLRHVTAAERPHIVFILADDLGWNDVGFHGSGQIPTPNIDA 56
Query: 91 LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
LAYSG++L YY +CTPSRSA+MTGK+PIHTGMQH VL E GLPL EK+LP+YLK
Sbjct: 57 LAYSGLLLDRYYVSPICTPSRSALMTGKYPIHTGMQHGVLKCAEPRGLPLQEKLLPEYLK 116
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
ELGYRT IVGKWHLGFY K+YTPT+RGF+SH+G+W+GHQDYFDH+A E WGLDMRR +
Sbjct: 117 ELGYRTHIVGKWHLGFYTKQYTPTYRGFDSHIGFWSGHQDYFDHTAVESPYWGLDMRRGM 176
Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
E AWDLHG+YSTDVFT+EAV +I+NH+ +PLFLYLAHAA HS NPY+PL APD +
Sbjct: 177 EAAWDLHGQYSTDVFTSEAVKLINNHNDSKPLFLYLAHAAVHSGNPYDPLPAPDVDVAKF 236
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
+I D+ R +FAA+L KLDESVG+VVEAL+++ ML NSIIVF +DNGG AGFNLNAASN
Sbjct: 237 TNIFDYNRRRFAAMLSKLDESVGEVVEALQRKGMLKNSIIVFSTDNGGPPAGFNLNAASN 296
Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
WPLRG KNTLWEGGVRGAGLIWSP L G V+ Q H+SDWLPTLL+AA
Sbjct: 297 WPLRGTKNTLWEGGVRGAGLIWSPRLIRPGRVSRQLFHISDWLPTLLTAAG 347
Score = 82.8 bits (203), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 74/142 (52%), Gaps = 34/142 (23%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
IDG+D+W LS + S R ++LHNIDD + +S +T G+WK ++ Q D GP
Sbjct: 354 SIDGLDLWHALSEDTESPRQSVLHNIDDIFGVSGITIGEWKYIQ-GSTYEGQWDGWYGP- 411
Query: 634 QVYLSGLSDREWL-------------ALA----------MRKLRDAASIQCGPVKE-VP- 668
S REW+ A+A +R+LR+ A I+C P + +P
Sbjct: 412 -------SGREWVYDIGGVIGSPTGRAVASVGYSLSSENVRRLREQAMIKCRPRNDTLPL 464
Query: 669 CEPQIAPCLFDIKNDPCEKNNL 690
C+P IAPCLF + DPCE NNL
Sbjct: 465 CKPLIAPCLFHVLKDPCEDNNL 486
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 65/158 (41%), Gaps = 60/158 (37%)
Query: 749 YSNEEEGMRKLRDAASIQCGPVKE-VP-CEPQIAPCLFDIKNDPCEKNNLADRSEVQRIN 806
YS E +R+LR+ A I+C P + +P C+P IAPCLF + DPCE NNL IN
Sbjct: 437 YSLSSENVRRLREQAMIKCRPRNDTLPLCKPLIAPCLFHVLKDPCEDNNL--------IN 488
Query: 807 HYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQME 866
Y P++ M
Sbjct: 489 EY--------------------------------------------------PEIARAMR 498
Query: 867 KELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGDDL 904
++L N TA+ P N P+DK +P+ +DH W FGD L
Sbjct: 499 EDLKKWNATALLPGNLPWDKRANPELWDHTWINFGDHL 536
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/36 (66%), Positives = 27/36 (75%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH VL E GLPL EK+LP+YLKELGYRT I+
Sbjct: 91 MQHGVLKCAEPRGLPLQEKLLPEYLKELGYRTHIVG 126
>gi|328789569|ref|XP_624454.2| PREDICTED: arylsulfatase J-like [Apis mellifera]
Length = 546
Score = 502 bits (1292), Expect = e-139, Method: Compositional matrix adjust.
Identities = 232/341 (68%), Positives = 272/341 (79%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
+ L L++ VD V +S PHI+FILADDLGWNDVGFHGL QIPTPNIDALAY+G++L
Sbjct: 8 MTLCICLTVFVVDHVVASARPHIVFILADDLGWNDVGFHGLSQIPTPNIDALAYTGLLLD 67
Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
YY +CTPSRSA+MTGKHPIHTGMQH VL E GLPL EK+LP+YL+ LGY T +V
Sbjct: 68 RYYVSPICTPSRSALMTGKHPIHTGMQHGVLKCAEPRGLPLQEKLLPEYLRNLGYSTHMV 127
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
GKWHLGFY KEYTPT+RGF+SHLG+W+GH DYFDH+A E WGLDMRR LEPAWDLHG+
Sbjct: 128 GKWHLGFYTKEYTPTYRGFDSHLGFWSGHHDYFDHTAVEEPYWGLDMRRGLEPAWDLHGQ 187
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
YSTDVFT EAV +I NH+T P+FLYL+HAA HS NPY PL A DH + I D+ R
Sbjct: 188 YSTDVFTKEAVRLIDNHNTSRPMFLYLSHAAVHSGNPYNPLPAHDHDVAKFPKILDYNRR 247
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
+FA +L KLDESVG VVEAL +R+ML NS+IVF +DNGG AGFNLNAASN+PLRG KNT
Sbjct: 248 RFAGMLSKLDESVGLVVEALRKRKMLENSVIVFSTDNGGPPAGFNLNAASNFPLRGTKNT 307
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
LWEGGVRG GL+WSP L + G ++ Q +H+SDWLPTLL+A
Sbjct: 308 LWEGGVRGTGLVWSPKLINPGRISRQLIHISDWLPTLLTAV 348
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 72/135 (53%), Gaps = 16/135 (11%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG--GP 632
IDG+D+W L+ + S R +LHNIDD + I+ +T G WK ++ D G G
Sbjct: 357 IDGVDMWRALNEDTISPRKMVLHNIDDIYGIAGITYGDWKFIQGSTYNGQWDDWYGPSGR 416
Query: 633 DQVY-----LSGLSDREWLALAM-------RKLRDAASIQCGPVKE-VP-CEPQIAPCLF 678
D Y + + R ++ + KL+ A+I+C E +P C+P APCLF
Sbjct: 417 DWAYDPYAVMESAAGRAAASVGLSLTAGNIEKLQQGAAIKCSTRNENLPLCKPLEAPCLF 476
Query: 679 DIKNDPCEKNNLADR 693
+++ DPCEKNNL ++
Sbjct: 477 NVREDPCEKNNLIEK 491
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 60/151 (39%)
Query: 756 MRKLRDAASIQCGPVKE-VP-CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVG 813
+ KL+ A+I+C E +P C+P APCLF+++
Sbjct: 446 IEKLQQGAAIKCSTRNENLPLCKPLEAPCLFNVRE------------------------- 480
Query: 814 YLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANIN 873
DP ++ N I EK YP ++S ++++L+ +N
Sbjct: 481 --DPCEKNNLI-----EK--------------------------YPSIVSDLKEKLSLLN 507
Query: 874 RTAVAPINKPFDKGGDPKNFDHAWSIFGDDL 904
+A+ P N P+DK G+P +DH W+ FGD L
Sbjct: 508 ASAILPGNLPWDKRGNPDYWDHTWTNFGDYL 538
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH VL E GLPL EK+LP+YL+ LGY T ++
Sbjct: 93 MQHGVLKCAEPRGLPLQEKLLPEYLRNLGYSTHMVG 128
>gi|380026538|ref|XP_003697007.1| PREDICTED: arylsulfatase J-like [Apis florea]
Length = 543
Score = 496 bits (1276), Expect = e-137, Method: Compositional matrix adjust.
Identities = 234/341 (68%), Positives = 269/341 (78%), Gaps = 2/341 (0%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
+ L L++ V VA+ PHI+FILADDLGWNDVGFHG QIPTPNIDALAYSG++L
Sbjct: 7 ITLCLCLTVFVVGHVAAR--PHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLD 64
Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
YY +CTPSRSA+MTGKHPIHTGMQH VL E GLPL EK+LP+Y ++LGY T IV
Sbjct: 65 RYYVSPICTPSRSALMTGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPEYFRDLGYSTHIV 124
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
GKWHLGFY KEYTP +RGF+SH+G+W+GH DYFDHSA E WGLDMRR LEPAWDLHG+
Sbjct: 125 GKWHLGFYTKEYTPMYRGFDSHIGFWSGHHDYFDHSAVEEPYWGLDMRRGLEPAWDLHGQ 184
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
YSTDVFT EAV +I NH+T P+FLYLAHAA HS NPY PL A D + +I ++ R
Sbjct: 185 YSTDVFTKEAVKLIDNHNTSRPMFLYLAHAAVHSGNPYNPLPAHDQDVAKFTNIFNYNRR 244
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
+FA IL KLDESVG VVEAL +R+ML NS+IVF +DNGGA AGFNLNAASNWPLRG KNT
Sbjct: 245 RFAGILSKLDESVGLVVEALRRRKMLENSVIVFSTDNGGAPAGFNLNAASNWPLRGTKNT 304
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
LWEGGVRG GL+WSP L G V Q +H+SDWLPTLL+AA
Sbjct: 305 LWEGGVRGVGLLWSPKLIKPGRVGRQLIHISDWLPTLLTAA 345
Score = 82.4 bits (202), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 75/144 (52%), Gaps = 34/144 (23%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
IDG+D+W LS + S R +LHNIDD + ++ +T G WK ++ G D
Sbjct: 354 IDGMDIWHALSEDTESPRKMVLHNIDDIYGVAGITYGDWKFIQ-------GSTYNGQWDG 406
Query: 635 VYLSGLSDREWL-------------ALA----------MRKLRDAASIQCGPVKE-VP-C 669
Y G S REW+ A+A +RKLR+ A I+C P + +P C
Sbjct: 407 WY--GPSGREWVYDVGGVIGSAAGRAVASVGLSLSADNIRKLREDAMIKCPPRNDNLPLC 464
Query: 670 EPQIAPCLFDIKNDPCEKNNLADR 693
+P APCLF+++ DPCE NNL ++
Sbjct: 465 KPLEAPCLFNVRQDPCEDNNLIEK 488
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 33/49 (67%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
IDG+D+W LS + S R +LHNIDD + ++ +T G WK ++ ++ NG
Sbjct: 354 IDGMDIWHALSEDTESPRKMVLHNIDDIYGVAGITYGDWKFIQGSTYNG 402
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 36/52 (69%), Gaps = 2/52 (3%)
Query: 750 SNEEEGMRKLRDAASIQCGPVKE-VP-CEPQIAPCLFDIKNDPCEKNNLADR 799
S + +RKLR+ A I+C P + +P C+P APCLF+++ DPCE NNL ++
Sbjct: 437 SLSADNIRKLREDAMIKCPPRNDNLPLCKPLEAPCLFNVRQDPCEDNNLIEK 488
Score = 45.8 bits (107), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 30/47 (63%)
Query: 858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGDDL 904
YP V+S +++ L +N +AV P N P+DK +P +DH W FGD L
Sbjct: 489 YPSVVSDLKERLRVLNASAVLPGNLPWDKRANPDYWDHTWINFGDYL 535
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH VL E GLPL EK+LP+Y ++LGY T I+
Sbjct: 90 MQHGVLKCAEPRGLPLHEKLLPEYFRDLGYSTHIVG 125
>gi|340710385|ref|XP_003393772.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
Length = 545
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 228/337 (67%), Positives = 269/337 (79%)
Query: 44 FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
F L++ V VA++ PHI+FILADDLGWNDVGFHG QIPTPNIDALAYSG++L YY
Sbjct: 11 FYLTVFVVCHVAAAVQPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYV 70
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
+CTPSRSA+MTGKHPIHTGMQH VL E GLPL EK+LPQYL+ELGY T IVGKWH
Sbjct: 71 TPICTPSRSALMTGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWH 130
Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD 223
LGFY KEYTP +RGF+SH+G+W+GH DYFDHSA E WGLDMRR L AWDLHG+YSTD
Sbjct: 131 LGFYTKEYTPMYRGFDSHIGFWSGHHDYFDHSAVESPYWGLDMRRGLNSAWDLHGQYSTD 190
Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
+FT EAV +I++H+ P+FLYL+HAA HS N Y PL APD + +I +++R +FA
Sbjct: 191 IFTKEAVKLINDHNASRPMFLYLSHAAVHSGNSYNPLPAPDQDVAKFTNIFNYERRRFAG 250
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
+L KLDESVG+VVEAL ++ ML NS+IVF +DNGG AGFNLNAASNWPLRG KNTLWEG
Sbjct: 251 MLSKLDESVGQVVEALRKKGMLKNSVIVFSTDNGGPPAGFNLNAASNWPLRGTKNTLWEG 310
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
GVRGAGL+WSP L G V+ Q H+SDWLPTL++AA
Sbjct: 311 GVRGAGLLWSPRLVRPGRVSRQLFHISDWLPTLITAA 347
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 75/145 (51%), Gaps = 34/145 (23%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
IDG+D+W LS + S R +LHNIDD + ++ +T G WK ++ Q D GP
Sbjct: 355 SIDGMDLWYALSEDTESPRKMVLHNIDDIFGVAGITYGDWKYIQ-GSTYNGQWDGWYGP- 412
Query: 634 QVYLSGLSDREW--------------------LAL---AMRKLRDAASIQCGPVKE-VP- 668
S REW LAL A+R+LR+ A I+C P + +P
Sbjct: 413 -------SGREWVYDIGGVIGSTAGRAVASVGLALSADAIRRLREDAMIKCPPKNDSLPL 465
Query: 669 CEPQIAPCLFDIKNDPCEKNNLADR 693
C+P APCLF+++ DPCE NNL ++
Sbjct: 466 CKPLEAPCLFNVQQDPCEDNNLINK 490
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 62/155 (40%), Gaps = 60/155 (38%)
Query: 752 EEEGMRKLRDAASIQCGPVKE-VP-CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYT 809
+ +R+LR+ A I+C P + +P C+P APCLF+++ DPCE NNL IN Y
Sbjct: 441 SADAIRRLREDAMIKCPPKNDSLPLCKPLEAPCLFNVQQDPCEDNNL--------INKYP 492
Query: 810 TEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKEL 869
+ V L K R
Sbjct: 493 SLVTELQDKLR------------------------------------------------- 503
Query: 870 ANINRTAVAPINKPFDKGGDPKNFDHAWSIFGDDL 904
+N TA+ P N P+D +P +DH W+ FGD L
Sbjct: 504 -KLNATAILPGNLPWDNKANPDYWDHTWTNFGDYL 537
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 39/70 (55%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSND 568
IDG+D+W LS + S R +LHNIDD + ++ +T G WK ++ ++ NG +
Sbjct: 355 SIDGMDLWYALSEDTESPRKMVLHNIDDIFGVAGITYGDWKYIQGSTYNGQWDGWYGPSG 414
Query: 569 NSYQNEIDGI 578
+ +I G+
Sbjct: 415 REWVYDIGGV 424
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 26/36 (72%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH VL E GLPL EK+LPQYL+ELGY T I+
Sbjct: 92 MQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVG 127
>gi|350415537|ref|XP_003490674.1| PREDICTED: arylsulfatase J-like [Bombus impatiens]
Length = 545
Score = 488 bits (1257), Expect = e-135, Method: Compositional matrix adjust.
Identities = 227/337 (67%), Positives = 267/337 (79%)
Query: 44 FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
F L++ V VA++ PHI+FILADDLGWNDVGFHG QIPTPNIDALAYSG++L YY
Sbjct: 11 FYLTVFVVCHVAAAVQPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYV 70
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
+CTPSRSA+MTGKHPIHTGMQH VL E GLPL EK+LPQYL+ELGY T IVGKWH
Sbjct: 71 TPICTPSRSALMTGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWH 130
Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD 223
LGFY KEYTP +RGF+SH+G+W+GH DYFDHSA E WGLDMRR L AWDLHG+YSTD
Sbjct: 131 LGFYTKEYTPMYRGFDSHIGFWSGHHDYFDHSAVESPYWGLDMRRGLNSAWDLHGQYSTD 190
Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
+FT EAV +I++H+ P+FLYL HAA HS N Y PL PD + +I +++R +FA
Sbjct: 191 IFTKEAVKLINDHNASRPMFLYLPHAAVHSGNSYNPLPVPDQDVAKFTNIFNYERRRFAG 250
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
+L KLDESVG+VVEAL ++ ML NS+IVF +DNGG AGFNLNAASNWPLRG KNTLWEG
Sbjct: 251 MLSKLDESVGQVVEALRKKGMLKNSVIVFSTDNGGPPAGFNLNAASNWPLRGTKNTLWEG 310
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
GVRGAGL+WSP L G V+ Q H+SDWLPTL++AA
Sbjct: 311 GVRGAGLLWSPRLVRPGRVSRQLFHISDWLPTLITAA 347
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 74/145 (51%), Gaps = 34/145 (23%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
IDG+D+W LS + S R +LHNIDD + ++ +T G WK ++ G D
Sbjct: 355 SIDGMDLWYALSEDTESPRKMVLHNIDDIFGVAGITYGDWKYIQ-------GSTYNGQWD 407
Query: 634 QVYLSGLSDREWL-----------------------ALAMRKLRDAASIQCGPVKE-VP- 668
Y G S REW+ A A+R+LR+ A I+C P + +P
Sbjct: 408 GWY--GPSGREWVYDIGGVIGSTAGRAVASVGLTLSADAIRRLREDAMIKCPPKNDSLPL 465
Query: 669 CEPQIAPCLFDIKNDPCEKNNLADR 693
C+P APCLF+++ DPCE NNL ++
Sbjct: 466 CKPLEAPCLFNVQQDPCEDNNLINK 490
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 64/155 (41%), Gaps = 60/155 (38%)
Query: 752 EEEGMRKLRDAASIQCGPVKE-VP-CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYT 809
+ +R+LR+ A I+C P + +P C+P APCLF+++ DPCE NNL IN Y
Sbjct: 441 SADAIRRLREDAMIKCPPKNDSLPLCKPLEAPCLFNVQQDPCEDNNL--------INKYP 492
Query: 810 TEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKEL 869
+ V E + + +K
Sbjct: 493 SLV----------------TELRDRLRK-------------------------------- 504
Query: 870 ANINRTAVAPINKPFDKGGDPKNFDHAWSIFGDDL 904
+N TA+ P N P+D +P +DH W+ FGD L
Sbjct: 505 --LNATAILPGNLPWDNRANPDYWDHTWTNFGDYL 537
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 26/36 (72%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH VL E GLPL EK+LPQYL+ELGY T I+
Sbjct: 92 MQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVG 127
>gi|345495280|ref|XP_001606377.2| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 545
Score = 486 bits (1252), Expect = e-134, Method: Compositional matrix adjust.
Identities = 229/343 (66%), Positives = 269/343 (78%), Gaps = 6/343 (1%)
Query: 50 FVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTP 109
V + PPHIIFI+ADDLGWNDVGFHG QIPTPNIDALAYSG+IL YY +CTP
Sbjct: 18 IVAAAPETTPPHIIFIVADDLGWNDVGFHGSGQIPTPNIDALAYSGLILNRYYVSPICTP 77
Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
SRSA+MTGK+PIHTGMQ VL G E GLPL EK+LP+YL+ELGYRT IVGKWHLGFY K
Sbjct: 78 SRSALMTGKYPIHTGMQRGVLKGAEPRGLPLKEKLLPEYLRELGYRTHIVGKWHLGFYTK 137
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
EYTPT+RGFESHLGYWTGHQDY+DHSA E WG+DMRR++EPAWDLHG+YSTDVFT EA
Sbjct: 138 EYTPTYRGFESHLGYWTGHQDYYDHSAVEEPYWGMDMRRNMEPAWDLHGQYSTDVFTKEA 197
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
V +I +H+ +P+FLYLAHAA HSANPY PL D ++ + I ++KR +FA + +D
Sbjct: 198 VKLIKSHNASQPMFLYLAHAAVHSANPYNPLPVSDEIVDKIK-IPEYKRRRFAGMASAMD 256
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
SVG VVEALE+R ML + +IVF +DNGG A GFNLNAASNWPLRGVK+TL+EGGVRG G
Sbjct: 257 ASVGAVVEALEKRNMLRDCLIVFSTDNGGPAEGFNLNAASNWPLRGVKDTLFEGGVRGVG 316
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNST 392
L+WS L G ++E H++DWLPTLL AA YVN+T
Sbjct: 317 LVWSERLARPGRLSEATFHIADWLPTLLRAAG-----GYVNAT 354
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/162 (32%), Positives = 82/162 (50%), Gaps = 21/162 (12%)
Query: 573 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGP 632
+IDG+D W L+ ++ S R +LHNIDD +A++ +WKL+K R + + GP
Sbjct: 355 GKIDGLDAWEALADDKESPRRVVLHNIDDVLGTAAISVDQWKLIK-GSTYRGRWNQWYGP 413
Query: 633 ---DQVY-LSGLSDREWLALAMRK------------LRDAASIQCGPVKEVP---CEPQI 673
D Y + +++ +A A+++ LR+ A+I CG + P C P
Sbjct: 414 SGRDYPYDVEAVAEGCLVAKALKRLGRGLDRGKLVELRNQATIHCGEQPQQPQKACNPLE 473
Query: 674 APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
A CL D++ DPCE+ NLAD D + E+ R N A P
Sbjct: 474 AACLIDVRADPCERRNLADEQPDV-VRRLEEELARLNASAVP 514
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 50/203 (24%), Positives = 76/203 (37%), Gaps = 68/203 (33%)
Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
+T GR+NQ P + + + K K+ + + K + +LR+
Sbjct: 401 STYRGRWNQWYGPSGRDYPYDVEAVAEGCLVAKALKRLGRGLDRGK-------LVELRNQ 453
Query: 763 ASIQCGPVKEVP---CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQ 819
A+I CG + P C P A CL D++ DP +
Sbjct: 454 ATIHCGEQPQQPQKACNPLEAACLIDVRA---------------------------DPCE 486
Query: 820 RFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAP 879
R N L E+ PDV+ ++E+ELA +N +AV P
Sbjct: 487 RRN----LADEQ---------------------------PDVVRRLEEELARLNASAVPP 515
Query: 880 INKPFDKGGDPKNFDHAWSIFGD 902
N P+D DP +DH W+ FGD
Sbjct: 516 GNLPWDSQADPALWDHTWNNFGD 538
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 27/36 (75%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQ VL G E GLPL EK+LP+YL+ELGYRT I+
Sbjct: 93 MQRGVLKGAEPRGLPLKEKLLPEYLRELGYRTHIVG 128
>gi|307167595|gb|EFN61139.1| Arylsulfatase B [Camponotus floridanus]
Length = 519
Score = 480 bits (1236), Expect = e-132, Method: Compositional matrix adjust.
Identities = 218/310 (70%), Positives = 258/310 (83%)
Query: 71 GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
GWNDVGFHG QIPTPNIDALAYSG++L YY +CTPSRSA+MTGK+PIHTGMQH++L
Sbjct: 10 GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTSICTPSRSALMTGKYPIHTGMQHSIL 69
Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
G E GLPL EKILP+YL+ELGY T IVGKWHLGFYK+EYTPT+RGF++H+GYWTGH D
Sbjct: 70 KGAEPRGLPLHEKILPEYLRELGYSTHIVGKWHLGFYKREYTPTYRGFDTHIGYWTGHHD 129
Query: 191 YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAA 250
Y+DH+A E WGLDMRR ++PAWDLHG+YSTDVFT EAV +I+NH++ P+FLYLAHAA
Sbjct: 130 YYDHTAVENPYWGLDMRRGMKPAWDLHGEYSTDVFTKEAVKLINNHNSSRPMFLYLAHAA 189
Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
HS NPY PL APD + +I D+ R +FA IL KLD+SVG+VVEAL + ML +S+I
Sbjct: 190 VHSGNPYNPLPAPDEEVAKFNNIFDYNRRRFAGILSKLDQSVGQVVEALYKNNMLQDSVI 249
Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
VF +DNGG AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP L G V+ Q +H++
Sbjct: 250 VFSTDNGGPAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPRLVRAGRVSRQMIHIT 309
Query: 371 DWLPTLLSAA 380
DWLPTL++AA
Sbjct: 310 DWLPTLITAA 319
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 77/165 (46%), Gaps = 35/165 (21%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
IDG+++W LS + S R ILHNIDD + +SA+T G WK ++ G D
Sbjct: 328 SIDGVNLWDALSEDTESPRTNILHNIDDIYGVSAITVGDWKFIQ-------GSTYNGVWD 380
Query: 634 QVYLSGLSDREWL-------------ALA----------MRKLRDAASIQCGPVKE-VP- 668
Y G S REW+ A+A + LR+ A I+C P + +P
Sbjct: 381 GWY--GPSGREWIYDADGVISSTAGRAIASVGLSVTTEIISSLRENAMIKCPPRNDSLPI 438
Query: 669 CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIA 713
C P PCLF+I DPCE NNL + + E+ +FN A
Sbjct: 439 CRPLQEPCLFNIYQDPCEDNNLV-KQFPTIVRKLQEELKKFNNTA 482
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 28/36 (77%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH++L G E GLPL EKILP+YL+ELGY T I+
Sbjct: 64 MQHSILKGAEPRGLPLHEKILPEYLRELGYSTHIVG 99
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 39/155 (25%), Positives = 58/155 (37%), Gaps = 60/155 (38%)
Query: 750 SNEEEGMRKLRDAASIQCGPVKE-VP-CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINH 807
S E + LR+ A I+C P + +P C P PCLF+I
Sbjct: 412 SVTTEIISSLRENAMIKCPPRNDSLPICRPLQEPCLFNI--------------------- 450
Query: 808 YTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
Y DP + N + K +P ++ ++++
Sbjct: 451 ------YQDPCEDNNLV-------------------------------KQFPTIVRKLQE 473
Query: 868 ELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
EL N TA+ P N P+D DP +DH W+ FGD
Sbjct: 474 ELKKFNNTAILPGNLPWDSKADPNLWDHTWNNFGD 508
>gi|307207313|gb|EFN85063.1| Arylsulfatase B [Harpegnathos saltator]
Length = 532
Score = 476 bits (1226), Expect = e-131, Method: Compositional matrix adjust.
Identities = 218/311 (70%), Positives = 253/311 (81%)
Query: 71 GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
GWNDVGFHG QIPTPNIDALAYSG++L YY +CTPSRSA+MTGK+PIH GMQH VL
Sbjct: 26 GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKYPIHIGMQHGVL 85
Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
G E GLPL EKILP+YL++LGY T IVGKWHLGFY KEYTPT+RGF SH G+WTGHQD
Sbjct: 86 KGAEPRGLPLHEKILPEYLRDLGYSTHIVGKWHLGFYTKEYTPTYRGFASHTGFWTGHQD 145
Query: 191 YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAA 250
YFDH+A E WGLDMRRD+EPAWDLHG+YSTDVFT EA+ +I H++ PLFLYLAHAA
Sbjct: 146 YFDHTAVESPYWGLDMRRDMEPAWDLHGQYSTDVFTKEALRLIDRHNSSRPLFLYLAHAA 205
Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
HS NPY PL APD + +I D+ R +FA IL KLD+SVG+VVE L ++ ML +S+I
Sbjct: 206 VHSGNPYNPLPAPDEEVAKFDNIFDYNRRRFAGILSKLDKSVGQVVEGLRKKGMLRDSVI 265
Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
+F +DNGG AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP L G V+ Q +H++
Sbjct: 266 IFSTDNGGPAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPKLTRSGRVSRQLLHIT 325
Query: 371 DWLPTLLSAAN 381
DWLPTL++AA
Sbjct: 326 DWLPTLITAAG 336
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/167 (36%), Positives = 82/167 (49%), Gaps = 36/167 (21%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
IDG+D+WS LS + S R ++LHNIDD + +SA+T G WKLV+ G D
Sbjct: 344 IDGMDLWSALSDDTESPRTSVLHNIDDIYGVSAVTVGDWKLVQ-------GSTYNGAWDG 396
Query: 635 VYLSGLSDREWL-----------------------ALAMRKLRDAASIQCGPVKE-VP-C 669
Y G S REW+ A +R LR+ A I+C P + +P C
Sbjct: 397 WY--GPSGREWVYDVGGVIGSEAGRAVASVGLGITAETVRMLREKAMIKCPPKNDSLPLC 454
Query: 670 EPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIA-YP 715
+P PCLF++ DPCE NNL R + E+ +FN A YP
Sbjct: 455 KPLQEPCLFNVYQDPCEDNNLV-RQSPTIVRKLQEELKKFNSSAIYP 500
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 27/36 (75%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH VL G E GLPL EKILP+YL++LGY T I+
Sbjct: 80 MQHGVLKGAEPRGLPLHEKILPEYLRDLGYSTHIVG 115
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 61/154 (39%), Gaps = 60/154 (38%)
Query: 754 EGMRKLRDAASIQCGPVKE-VP-CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTE 811
E +R LR+ A I+C P + +P C+P PCLF++
Sbjct: 431 ETVRMLREKAMIKCPPKNDSLPLCKPLQEPCLFNV------------------------- 465
Query: 812 VGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELAN 871
Y DP + N + + P ++ ++++EL
Sbjct: 466 --YQDPCEDNNLV-------------------------------RQSPTIVRKLQEELKK 492
Query: 872 INRTAVAPINKPFDKGGDPKNFDHAWSIFGDDLK 905
N +A+ P N P+D GDP +DH W+ FGD +K
Sbjct: 493 FNSSAIYPGNLPWDNKGDPSLWDHTWNNFGDYVK 526
>gi|332024600|gb|EGI64798.1| Arylsulfatase J [Acromyrmex echinatior]
Length = 528
Score = 476 bits (1225), Expect = e-131, Method: Compositional matrix adjust.
Identities = 219/319 (68%), Positives = 261/319 (81%), Gaps = 1/319 (0%)
Query: 62 IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
+I +LA GWNDVGFHG QIPTPNIDALAYSG++L YY +CTPSRSA+MTGK+PI
Sbjct: 13 LILLLAVH-GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKYPI 71
Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
HTGMQH VL G E GLPL EK+LP+YL+ELGY T IVGKWHLGFYKKEYTPT+RGF++H
Sbjct: 72 HTGMQHGVLKGAEPRGLPLREKLLPEYLRELGYNTHIVGKWHLGFYKKEYTPTYRGFDTH 131
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEP 241
+G+WTGH DYFDH+A E WGLD+RR ++PAWDLHG+YSTD+FT EAV +I NH++ P
Sbjct: 132 IGFWTGHHDYFDHTAVENPYWGLDIRRGMQPAWDLHGQYSTDIFTKEAVRLIDNHNSSRP 191
Query: 242 LFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQ 301
+FLYLAHAA HS NPY PL APD + +I D+ R +FA +L KLD+SVG+VV+AL +
Sbjct: 192 MFLYLAHAAVHSGNPYNPLPAPDEEVAKFNNIFDYNRRRFAGMLSKLDQSVGQVVDALRK 251
Query: 302 RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGI 361
R ML +SII+F SDNGG AAGFNLNAASNWPLRGVKNTLWEGGVRG GLIWSP L G
Sbjct: 252 RDMLRDSIIIFSSDNGGPAAGFNLNAASNWPLRGVKNTLWEGGVRGTGLIWSPRLVRPGR 311
Query: 362 VAEQYVHVSDWLPTLLSAA 380
V+ Q +H++DWLPTL++AA
Sbjct: 312 VSRQMLHITDWLPTLITAA 330
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 72/143 (50%), Gaps = 36/143 (25%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
+DG+D+W+ LS + S R +LHNIDD + +SA+T G WKL++ G D
Sbjct: 339 VDGMDLWNALSEDTESPRANVLHNIDDIYGVSAITVGDWKLIR-------GSTYKGVWDG 391
Query: 635 VYLSGLSDREWL-------------ALA----------MRKLRDAASIQCGPVK--EVP- 668
Y G S REW+ A+A + LR+ A I+C P++ +P
Sbjct: 392 WY--GPSGREWVYDVDGVISSTTAHAIASVGLGITTDVVLMLRENALIKC-PLRNDSLPI 448
Query: 669 CEPQIAPCLFDIKNDPCEKNNLA 691
C+P CLF+I DPCE NNL
Sbjct: 449 CKPLEEACLFNIYQDPCEDNNLV 471
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 27/36 (75%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH VL G E GLPL EK+LP+YL+ELGY T I+
Sbjct: 75 MQHGVLKGAEPRGLPLREKLLPEYLRELGYNTHIVG 110
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 31/49 (63%)
Query: 854 MKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
+ K +P ++ +++ EL N +A+ P N P+D GDP +DH W+ FGD
Sbjct: 470 LVKQFPTIVRKLQDELKKFNSSALLPGNLPWDSKGDPNLWDHTWNNFGD 518
>gi|170050440|ref|XP_001861313.1| arylsulfatase b [Culex quinquefasciatus]
gi|167872047|gb|EDS35430.1| arylsulfatase b [Culex quinquefasciatus]
Length = 552
Score = 476 bits (1225), Expect = e-131, Method: Compositional matrix adjust.
Identities = 228/347 (65%), Positives = 269/347 (77%), Gaps = 4/347 (1%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
++ AVL LAFT+ +V +S PP+IIFILADDLGWNDVGFHG QIPTPN+DALAY
Sbjct: 3 LVVIAVLILAFTIVLV----TSSKPPPNIIFILADDLGWNDVGFHGSAQIPTPNLDALAY 58
Query: 94 SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
SGIIL YY +CTPSR+A+MTG++PIHTGMQH VLYG E GLPL EK+LP+YL+ELG
Sbjct: 59 SGIILNRYYVTPICTPSRAALMTGRYPIHTGMQHAVLYGMEPRGLPLEEKLLPEYLRELG 118
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y+ IVGKWHLG Y + YTP RGF+SH+G+WTGH FDHSA E + WGLDMRR + A
Sbjct: 119 YKNHIVGKWHLGHYTRRYTPLERGFDSHVGFWTGHHHMFDHSAVETETWGLDMRRGYDVA 178
Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
+DLHGKY+T V EAV I NHS +PLFLY+AHAA HSANPY+ L APD + H+
Sbjct: 179 YDLHGKYTTHVIRDEAVARIGNHSIGDPLFLYVAHAAVHSANPYDFLPAPDVTVAGLEHV 238
Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
E + R KFAA+L +LDESVG +VEAL+ + ML N+IIVF SDNGG A GFN NAASNWPL
Sbjct: 239 EPYPRRKFAAMLSELDESVGAIVEALKIKGMLDNTIIVFSSDNGGPAEGFNSNAASNWPL 298
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
RGVKNTLWEGGVR AG IWSPL+ V++Q +H+SDWLPTLL AA
Sbjct: 299 RGVKNTLWEGGVRAAGFIWSPLIAESRRVSQQMIHISDWLPTLLDAA 345
Score = 73.2 bits (178), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 50/144 (34%), Positives = 68/144 (47%), Gaps = 18/144 (12%)
Query: 567 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQV 626
N + +DGI+VW L ++R ILHNIDD W +ALT WK++K + Q
Sbjct: 348 NVETLPKTLDGINVWKSLLNGNSTERREILHNIDDIWGSAALTVNNWKILKGTN-YKGQW 406
Query: 627 DLTGGPD-------------QVYLSG--LSDREWLAL--AMRKLRDAASIQCGPVKEVPC 669
D GP Q +G LS+ L +R LR A++ C C
Sbjct: 407 DNWYGPPGDRDPEAYNITAVQHCPTGKALSEMNLLPTEATIRNLRLNATVHCSKEVRATC 466
Query: 670 EPQIAPCLFDIKNDPCEKNNLADR 693
P PCL+D+ DPCE +NLA+R
Sbjct: 467 NPLEQPCLYDVLQDPCELDNLAER 490
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 55/153 (35%), Gaps = 58/153 (37%)
Query: 753 EEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEV 812
E +R LR A++ C C P PCL+D+ DPCE +NLA+R
Sbjct: 444 EATIRNLRLNATVHCSKEVRATCNPLEQPCLYDVLQDPCELDNLAERFP----------- 492
Query: 813 GYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANI 872
A LD M+KK L
Sbjct: 493 ------------AILD-----------------------SMLKK------------LDAY 505
Query: 873 NRTAVAPINKPFDKGGDPKNFDHAWSIFGDDLK 905
N TAV P N D GDP+ + + W FGDDL+
Sbjct: 506 NATAVPPGNLALDTRGDPRFWGYTWHNFGDDLE 538
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 28/36 (77%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH VLYG E GLPL EK+LP+YL+ELGY+ I+
Sbjct: 90 MQHAVLYGMEPRGLPLEEKLLPEYLRELGYKNHIVG 125
>gi|321470034|gb|EFX81012.1| hypothetical protein DAPPUDRAFT_303738 [Daphnia pulex]
Length = 557
Score = 456 bits (1174), Expect = e-125, Method: Compositional matrix adjust.
Identities = 210/328 (64%), Positives = 251/328 (76%), Gaps = 2/328 (0%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P++IFILADD GWNDV FHG QIPTPN+DALA+SG+IL+NYY LCTPSRSA+MTGKH
Sbjct: 26 PNLIFILADDFGWNDVSFHGSKQIPTPNLDALAFSGLILQNYYVTPLCTPSRSALMTGKH 85
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTGMQH+VLYG R GLPLSE LP+YLK+LGY+ IVGKWHLG YK YTP FRGF+
Sbjct: 86 PIHTGMQHDVLYGYSRYGLPLSEITLPEYLKDLGYKNHIVGKWHLGHYKSVYTPLFRGFD 145
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
SH GYWTGHQDY+DH+A E WG DMRR+ W +GKY+T + T EA D+I H
Sbjct: 146 SHYGYWTGHQDYYDHTAVEWNAWGYDMRRNHSVDWSAYGKYTTTLLTDEACDVITKHDVS 205
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLY+AH A HSANPY PLQAP+ + + IE+ +R ++AA++H+LD SVGK+V++L
Sbjct: 206 SPLFLYVAHLAVHSANPYSPLQAPEETVEMFSSIENLQRRRYAAMVHELDVSVGKIVKSL 265
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--E 357
ML N++IVF +DNGG A GFN NAASNWPL+GVKNT WEGGVR AGLIWSPL+
Sbjct: 266 GDNNMLENTVIVFSTDNGGPAEGFNQNAASNWPLKGVKNTPWEGGVRAAGLIWSPLIPKS 325
Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKSDI 385
RG V + +SDWLPTL AA +S I
Sbjct: 326 RRGQVMSNLMDISDWLPTLFEAAGQSRI 353
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 37/152 (24%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVK----------------- 617
+DG+ W+ + ++PS RN +LHNIDD+ +A+ K WKLVK
Sbjct: 363 LDGVSHWNTVLYDKPSARNHVLHNIDDQLGYAAIRKENWKLVKGTTYEGSWDGWYGPSGR 422
Query: 618 --------------VVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCG- 662
V + + V + + + G+S + L ++L DA I CG
Sbjct: 423 TNESSTNFINSVDDVSLIYKNAVLDSDAAQYIAMLGMSATQNL---YQQLLDA-EISCGE 478
Query: 663 -PVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 693
PV PC P A CLFDI+ DPCE NNLA++
Sbjct: 479 RPVDAHPCLPLDALCLFDIEQDPCEYNNLAEK 510
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 28/36 (77%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH+VLYG R GLPLSE LP+YLK+LGY+ I+
Sbjct: 91 MQHDVLYGYSRYGLPLSEITLPEYLKDLGYKNHIVG 126
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 30/45 (66%), Gaps = 3/45 (6%)
Query: 757 RKLRDAASIQCG--PVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 799
++L DA I CG PV PC P A CLFDI+ DPCE NNLA++
Sbjct: 467 QQLLDA-EISCGERPVDAHPCLPLDALCLFDIEQDPCEYNNLAEK 510
>gi|357612332|gb|EHJ67925.1| hypothetical protein KGM_21236 [Danaus plexippus]
Length = 563
Score = 454 bits (1169), Expect = e-125, Method: Compositional matrix adjust.
Identities = 208/321 (64%), Positives = 257/321 (80%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+ I+ADDLGWNDVGFHG +QIPTPNID +A+SG+ L NYY +CTPSR+A+MTGK+
Sbjct: 49 PNIVLIIADDLGWNDVGFHGSNQIPTPNIDIMAWSGVSLHNYYVTPICTPSRAALMTGKY 108
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTGMQH V++ E GLPL+EKILPQYLKELGY+T +VGKWHLG YKKEY P RGF+
Sbjct: 109 PIHTGMQHTVIFAAEPRGLPLTEKILPQYLKELGYKTHLVGKWHLGSYKKEYLPLNRGFD 168
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
SHLG+W G D +DH+ +E WG D RRD A DL G+Y+TDV+T EAV II +H+T
Sbjct: 169 SHLGFWNGKIDMYDHTNQEKGYWGFDFRRDFSTAHDLFGQYATDVYTNEAVKIIKSHNTS 228
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFL L+H+A H+ NP EP++AP+ HI+DF+R KFAA+L KLDESVG+VV AL
Sbjct: 229 SPLFLMLSHSAVHTGNPSEPIRAPEKLFVNFTHIQDFQRRKFAAVLTKLDESVGEVVAAL 288
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
+ + +L++SI+VF +DNGGAAAGFN NAASN+PLRGVKNTLWEGGVRGAG +WSP ++ R
Sbjct: 289 KAKGVLNDSIVVFTTDNGGAAAGFNDNAASNYPLRGVKNTLWEGGVRGAGWLWSPFIDKR 348
Query: 360 GIVAEQYVHVSDWLPTLLSAA 380
VA Q +H+ DWLPTLLSAA
Sbjct: 349 SRVATQRMHLVDWLPTLLSAA 369
Score = 86.3 bits (212), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 56/158 (35%), Positives = 80/158 (50%), Gaps = 21/158 (13%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG---- 630
IDG+D W LS++ PS R +++HNIDDE +++T +WK V K Y G
Sbjct: 379 IDGVDQWCALSQDLPSARESLVHNIDDESGSASITYKQWK---VHKGTNYGGSWDGWYGP 435
Query: 631 -GPDQVY-----LSGLSDREWLALAMR-------KLRDAASIQCGPVKEVPCEPQIAPCL 677
G + Y L+ + L + M +LR A+++CG + +PC P APCL
Sbjct: 436 AGREGAYDTTRLLASKAAGALLDIGMLPDTEHILRLRSEATVECGDREALPCRPLEAPCL 495
Query: 678 FDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
F+I DPCE NLAD D + E+ R N+ A P
Sbjct: 496 FNIDEDPCETRNLADIHPDV-LQVMLKELDRVNRTAVP 532
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 32/47 (68%)
Query: 752 EEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLAD 798
+ E + +LR A+++CG + +PC P APCLF+I DPCE NLAD
Sbjct: 464 DTEHILRLRSEATVECGDREALPCRPLEAPCLFNIDEDPCETRNLAD 510
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 29/36 (80%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH V++ E GLPL+EKILPQYLKELGY+T ++
Sbjct: 114 MQHTVIFAAEPRGLPLTEKILPQYLKELGYKTHLVG 149
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 30/45 (66%)
Query: 858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
+PDVL M KEL +NRTAV P N+P GGDPK + + + FGD
Sbjct: 512 HPDVLQVMLKELDRVNRTAVPPNNQPLTPGGDPKYWGYVITNFGD 556
>gi|242024962|ref|XP_002432895.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
gi|212518404|gb|EEB20157.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
Length = 533
Score = 450 bits (1158), Expect = e-123, Method: Compositional matrix adjust.
Identities = 210/326 (64%), Positives = 266/326 (81%), Gaps = 2/326 (0%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
S PPHIIFI+ADDLGWNDVG+HG D+IPTPNIDALAY+GIIL YY + +CTPSRSA+MT
Sbjct: 7 SKPPHIIFIMADDLGWNDVGYHGSDEIPTPNIDALAYNGIILNRYYVLPVCTPSRSALMT 66
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
G+HPIH GMQH VL+G E GLPL+EK+LP+YL++LGY T IVGKWHLGFYKKEYTP +R
Sbjct: 67 GRHPIHNGMQHRVLFGVETRGLPLTEKLLPEYLQKLGYSTHIVGKWHLGFYKKEYTPLYR 126
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
GFESH+G+WTGHQDY+DH+AEE ++WGLDMR ++PAW LHG+YST V+T E+V II N+
Sbjct: 127 GFESHIGFWTGHQDYYDHTAEEERLWGLDMRHGMKPAWYLHGEYSTHVYTRESVKIIKNY 186
Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
++ +PLFLY+AHAA HS N Y PL APD ++ HI+++ R ++AA++ +LD SVG ++
Sbjct: 187 NSTKPLFLYVAHAAVHSGNKYNPLPAPDKTVDKLDHIQNYNRRRYAAMVSELDTSVGNII 246
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
AL+ MLSNSI++F +DNGG A G N N ASNWP+RGVKNTLWEGGVRG G++WS
Sbjct: 247 SALKDANMLSNSIVIFSTDNGGPANGLNYNYASNWPMRGVKNTLWEGGVRGVGVVWSSKF 306
Query: 357 E--SRGIVAEQYVHVSDWLPTLLSAA 380
+ S G V+ + +H+ DWLPTL +AA
Sbjct: 307 KPNSGGRVSNRMMHICDWLPTLYAAA 332
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 71/141 (50%), Gaps = 24/141 (17%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG---- 630
+DG ++W+ L + PS RN ILHNIDD + SALT +WKLVK + G
Sbjct: 343 LDGYNMWNSLLDDLPSPRNEILHNIDDIFGNSALTMSQWKLVKGTTYNGFYDSWFGPTGR 402
Query: 631 GPDQVY-----LSGLSDREWLAL--------AMRKLRDAASIQCGPVKEVP-------CE 670
P Y LS LS++ L ++R++R+ A+I CG + + C
Sbjct: 403 NPGYFYNIGQVLSSLSNKALSTLQTVPLNPESIRRIREEATIDCGNHQLISSESRFSNCN 462
Query: 671 PQIAPCLFDIKNDPCEKNNLA 691
P CLF++ DPCE +N+A
Sbjct: 463 PLKESCLFNVDEDPCEIDNVA 483
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/36 (61%), Positives = 29/36 (80%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH VL+G E GLPL+EK+LP+YL++LGY T I+
Sbjct: 75 MQHRVLFGVETRGLPLTEKLLPEYLQKLGYSTHIVG 110
Score = 39.7 bits (91), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%)
Query: 854 MKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
+ K +P V+ +ME+ L N TAV P D G+PK + + W+ FGD
Sbjct: 482 VAKSFPQVVKEMEERLLYWNSTAVKPAKVINDLRGNPKFWGYVWTNFGD 530
>gi|158300602|ref|XP_552160.3| AGAP012047-PA [Anopheles gambiae str. PEST]
gi|157013239|gb|EAL38777.3| AGAP012047-PA [Anopheles gambiae str. PEST]
Length = 564
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 216/333 (64%), Positives = 253/333 (75%), Gaps = 2/333 (0%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
S P+IIFILADDLGWNDVGFHG QIPTPN+DALAYSGIIL YY +CTPSRSA+MT
Sbjct: 36 SARPNIIFILADDLGWNDVGFHGSAQIPTPNLDALAYSGIILNRYYVNPICTPSRSALMT 95
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
GK+PIHTGMQH VLY E GLPLSEK+LPQYLK+LGY IVGKWHLG Y+ +TP R
Sbjct: 96 GKYPIHTGMQHTVLYAMEPRGLPLSEKLLPQYLKDLGYSNHIVGKWHLGHYQLRFTPMQR 155
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
GF+SH G+WTGH DH+A E WGLDMRR + A+DLHG+Y+T V AEA+ I+ H
Sbjct: 156 GFDSHTGFWTGHHHMNDHTAVEHGHWGLDMRRGYDVAYDLHGQYTTHVLGAEAIAIVQGH 215
Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
+ PLFLY+AHAA HSANPY+ L APD + HIE+++R KFAA++ +LD +VG +V
Sbjct: 216 NKSSPLFLYVAHAAVHSANPYDFLPAPDETVANLGHIENYRRRKFAAMMVELDRTVGSLV 275
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
+AL R ML N+IIVF SDNGG A GFN NAASNWPLRGVKNTLWEGG+R AG IWSPLL
Sbjct: 276 DALHARDMLENTIIVFSSDNGGPADGFNDNAASNWPLRGVKNTLWEGGLRAAGFIWSPLL 335
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA--NKSDIPN 387
++ V+ Q V V DWLPTL AA N S +P
Sbjct: 336 QNVSRVSHQMVQVCDWLPTLYEAAGGNVSALPT 368
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/187 (31%), Positives = 88/187 (47%), Gaps = 31/187 (16%)
Query: 567 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQV 626
N ++ ++DGI VW LS P++R ILHNIDD W +ALT G WKLVK R
Sbjct: 362 NVSALPTDLDGISVWHELSSGAPTRRIEILHNIDDIWGTAALTVGNWKLVKGSHYNR-TW 420
Query: 627 DLTGGPDQVYLSGLSDREWLAL----------------------AMRKLRDAASIQCGPV 664
D GP +G+ D + AL + +LR A++ CG
Sbjct: 421 DGWYGP-----AGIRDEKAYALDKLANSPAGKVMEELSLLPTPERITQLRREATVSCGAG 475
Query: 665 KEVP--CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEE 722
+ C+P PCL+D+++DPCE NNLA+ + + +N A P E++
Sbjct: 476 AHMASECDPLERPCLYDVESDPCEYNNLAEE-HLHILQSLLARLADYNSTAVPPSNLEDD 534
Query: 723 KKKKKKK 729
+ + ++
Sbjct: 535 PRGEPQR 541
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/36 (66%), Positives = 27/36 (75%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH VLY E GLPLSEK+LPQYLK+LGY I+
Sbjct: 104 MQHTVLYAMEPRGLPLSEKLLPQYLKDLGYSNHIVG 139
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 38/76 (50%), Gaps = 2/76 (2%)
Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP--CEPQIAPC 782
+ +K K K ++ E + +LR A++ CG + C+P PC
Sbjct: 430 RDEKAYALDKLANSPAGKVMEELSLLPTPERITQLRREATVSCGAGAHMASECDPLERPC 489
Query: 783 LFDIKNDPCEKNNLAD 798
L+D+++DPCE NNLA+
Sbjct: 490 LYDVESDPCEYNNLAE 505
>gi|281363223|ref|NP_610807.3| CG8646 [Drosophila melanogaster]
gi|17945274|gb|AAL48694.1| RE14504p [Drosophila melanogaster]
gi|272432448|gb|AAF58475.2| CG8646 [Drosophila melanogaster]
Length = 562
Score = 447 bits (1151), Expect = e-122, Method: Compositional matrix adjust.
Identities = 212/326 (65%), Positives = 253/326 (77%), Gaps = 1/326 (0%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
S P+IIFILADDLG+NDVGFHG +IPTPNIDALAYSGIIL YY +CTPSRSA+M
Sbjct: 22 SPAKPNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALM 81
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TGK+PIHTGMQH VLY E GLPL EKILPQYL ELGY + I GKWHLG +K +YTP +
Sbjct: 82 TGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLY 141
Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
RGF SH+G+W+GHQDY DH+A E WGLDMR + A+DLHG Y+TDV T +V +I N
Sbjct: 142 RGFSSHVGFWSGHQDYNDHTAVENNQWGLDMRNGTQVAYDLHGHYTTDVITDHSVKVIAN 201
Query: 236 H-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
H +T PLFLY+AHAA HS+NPY PL PD+ + HI ++KR KFAA++ K+D SVG+
Sbjct: 202 HNATKGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDNSVGQ 261
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
+V+ L + ML NSII+F SDNGG A GFNLN ASN+PL+GVKNTLWEGGVR AGL+WSP
Sbjct: 262 IVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSP 321
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
LL+ V+ Q +H+ DWLPTLL AA
Sbjct: 322 LLKKSQRVSNQTMHIIDWLPTLLEAA 347
Score = 79.7 bits (195), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 99/235 (42%), Gaps = 41/235 (17%)
Query: 515 VWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNE 574
+WS L + N +H ID W + G N S +
Sbjct: 318 MWSPLLKKSQRVSNQTMHIID------------WLPTLLEAAGGQPALSNLSK------Q 359
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD- 633
IDG +W L +++ S R +LHNIDD W +AL+ G WKLVK R D GP
Sbjct: 360 IDGQSIWRALVQDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTN-YRGSWDGWYGPAG 418
Query: 634 ---------QVYLSGLSDREWLALAM-------RKLRDAASIQC--GPVKEVPCEPQI-- 673
Q+ + + AL M +++R AA++ C + C
Sbjct: 419 ERDPRLYDWQLVGRSRAGKALEALKMLPSRADQQRIRAAATVSCPGQSSQGTSCVATAFS 478
Query: 674 APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
APCLF I++DPCE+ NLA + + +N TE+ RFN A P + + + +
Sbjct: 479 APCLFHIRDDPCEQYNLA-KQYPEVVNALMTELERFNATAVPPSNKPADPRADPR 532
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 25/34 (73%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
MQH VLY E GLPL EKILPQYL ELGY + I
Sbjct: 91 MQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHI 124
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 31/49 (63%)
Query: 854 MKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
+ K YP+V++ + EL N TAV P NKP D DP+ +++ W+ FGD
Sbjct: 495 LAKQYPEVVNALMTELERFNATAVPPSNKPADPRADPRFWNYTWTNFGD 543
>gi|242023422|ref|XP_002432133.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212517507|gb|EEB19395.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 514
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 207/326 (63%), Positives = 249/326 (76%), Gaps = 2/326 (0%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHII ILADDLGWNDVGFHG +QIPTPNIDALA++GIIL NYY +CTPSRSA++TGK+
Sbjct: 31 PHIIIILADDLGWNDVGFHGSNQIPTPNIDALAFTGIILNNYYVAPVCTPSRSALLTGKY 90
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTG+QH V++G GL L+EK+LP+YL+ L Y TR VGKWHLG +KK+YTP +RGF+
Sbjct: 91 PIHTGLQHGVIHGSAPYGLNLNEKLLPEYLRSLNYVTRHVGKWHLGSFKKDYTPEYRGFD 150
Query: 180 SHLGYWTGHQDYFDHSA-EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
SH GYWTGHQDY+DH+A E WG DMRR + G Y+TD+FT EAV +I H +
Sbjct: 151 SHYGYWTGHQDYYDHTAIENPGFWGYDMRRGMNVTRSDFGYYTTDLFTNEAVKVIKGHDS 210
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
++PLFLYLAH ATHS N Y PLQAP + +I+D R FA +L KLDESVGKVVEA
Sbjct: 211 NKPLFLYLAHLATHSGNKYSPLQAPAETVAKFNYIKDKNRRLFAGMLSKLDESVGKVVEA 270
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L M++N +I+F +DNGG A GFNLNAASNWPLRGVK+TLWEGGVRG G IWSP L S
Sbjct: 271 LADSNMINNCVILFSTDNGGPAGGFNLNAASNWPLRGVKDTLWEGGVRGVGFIWSPFLPS 330
Query: 359 RGIVAEQYVHVSDWLPTLLSAANKSD 384
V+ +H++DWLPTLLS N S+
Sbjct: 331 SK-VSNAMIHITDWLPTLLSLTNASN 355
Score = 46.2 bits (108), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 27/59 (45%), Positives = 37/59 (62%)
Query: 501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN 559
+N ++ ++IDGI+VW LS+ PS R IL NID E +I+AL WK K N+ N N
Sbjct: 351 TNASNSISDIDGINVWPNLSKGLPSVRKEILLNIDTERKIAALRYKNWKYKKGNANNWN 409
Score = 42.7 bits (99), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 32/49 (65%)
Query: 566 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWK 614
+N ++ ++IDGI+VW LS+ PS R IL NID E +I+AL WK
Sbjct: 351 TNASNSISDIDGINVWPNLSKGLPSVRKEILLNIDTERKIAALRYKNWK 399
Score = 40.0 bits (92), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 24/33 (72%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 33
+QH V++G GL L+EK+LP+YL+ L Y TR
Sbjct: 96 LQHGVIHGSAPYGLNLNEKLLPEYLRSLNYVTR 128
>gi|328783191|ref|XP_396281.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
Length = 713
Score = 421 bits (1081), Expect = e-114, Method: Compositional matrix adjust.
Identities = 197/326 (60%), Positives = 243/326 (74%), Gaps = 5/326 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHII ILADDLGWNDV FHG DQIPTPNIDALAY+GIIL N+Y LCTPSRSA+MTGK+
Sbjct: 72 PHIIIILADDLGWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKN 131
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIH GMQH+VL+ E GLPLSEK+LP+YL+E+GY+T VGKWHLG++KKEYTPT+RGF+
Sbjct: 132 PIHLGMQHSVLFPTEPRGLPLSEKLLPEYLREIGYKTHAVGKWHLGYFKKEYTPTYRGFD 191
Query: 180 SHLGYWTGHQDYFDHSAEE-----MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
SH GYW G QDY+ H +E + G DMRR+L AWD GKYSTD+FT EA+ +I+
Sbjct: 192 SHFGYWNGLQDYYTHITQEPDPAFSEFKGFDMRRNLTVAWDTVGKYSTDLFTNEAIRLIN 251
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
H TD P+FLYLAH A H N + +APD + +I D +R AA++ KLD+SVG
Sbjct: 252 EHDTDRPMFLYLAHLAVHKGNEEQLFRAPDEEIAKFSYILDPERRIQAAVVSKLDQSVGD 311
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V++AL R ML NSI++F+SDNG + G N SN+PLRG+KN+ WEGG RG IWSP
Sbjct: 312 VMDALRNRGMLENSIVLFMSDNGAPSQGILNNKGSNYPLRGIKNSPWEGGTRGVAAIWSP 371
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
L++ V+ Q + +SDWLPTLLSAA
Sbjct: 372 LIKKSKRVSNQMMFISDWLPTLLSAA 397
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/190 (31%), Positives = 90/190 (47%), Gaps = 29/190 (15%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN---GTSENRS 566
IDG D+WS L N+ S R+ ++ NIDD ++ RG +K + + G+ G S + S
Sbjct: 407 IDGFDLWSALVLNKVSSRSEVVLNIDDLSDYASFRRGNFKYIIGRTETGSDWLGASGDPS 466
Query: 567 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQV 626
S Q D + +L SK + I Q L + + + +K+ Y
Sbjct: 467 EGISPQ-----YDPYKILY----SKTGVAISGIITAKQAMELKERRKRSIKI-----YDT 512
Query: 627 DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKE---VPCEPQIAPCLFDIKND 683
L ++ LS + +LR A I+C V+E +PCEP APCLF+I+ D
Sbjct: 513 ALETKFEEKILS--------VENISELRRKAQIKCN-VQEKDRIPCEPMTAPCLFNIEKD 563
Query: 684 PCEKNNLADR 693
PCE NLA+R
Sbjct: 564 PCEMVNLAER 573
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 16/194 (8%)
Query: 615 LVKVVKVMRYQVDLTGGPD---QVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
L+ V R Q+ G D + L+ +S R + L + L D AS + G K +
Sbjct: 393 LLSAAGVNRKQLGNIDGFDLWSALVLNKVSSRSEVVLNIDDLSDYASFRRGNFKYIIGRT 452
Query: 672 QIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKK 731
+ DP E ++ + + +I + T V I E +E++K+ K
Sbjct: 453 ETGSDWLGASGDPSE--GISPQYDPYKILYSKTGVAISGIITAKQAMELKERRKRSIKIY 510
Query: 732 KKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE---VPCEPQIAPCLFDIKN 788
+ K ++K E + +LR A I+C V+E +PCEP APCLF+I+
Sbjct: 511 DTALETKFEEKILSV-------ENISELRRKAQIKCN-VQEKDRIPCEPMTAPCLFNIEK 562
Query: 789 DPCEKNNLADRSEV 802
DPCE NLA+R V
Sbjct: 563 DPCEMVNLAERRPV 576
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/32 (65%), Positives = 28/32 (87%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
MQH+VL+ E GLPLSEK+LP+YL+E+GY+T
Sbjct: 137 MQHSVLFPTEPRGLPLSEKLLPEYLREIGYKT 168
>gi|195148952|ref|XP_002015426.1| GL11077 [Drosophila persimilis]
gi|194109273|gb|EDW31316.1| GL11077 [Drosophila persimilis]
Length = 545
Score = 420 bits (1080), Expect = e-114, Method: Compositional matrix adjust.
Identities = 210/360 (58%), Positives = 252/360 (70%), Gaps = 24/360 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHIIFILADDLG+NDVGFHG QIPTPNIDALAYSGIIL YY +CTPSRSA+MTGK+
Sbjct: 26 PHIIFILADDLGFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKY 85
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTGMQH VLY E GLPL EKILPQYL +LGY + I GKWHLG +K EYTP FRGF
Sbjct: 86 PIHTGMQHTVLYAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFRGFS 145
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
SH +WGLDMR + A++LHG+Y+TDV T +V +I NH +
Sbjct: 146 SH-----------------KNLWGLDMRNGTDVAYNLHGQYTTDVITKHSVSVIANHDAA 188
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
PLFLY+AHAA HS NPY PL PD + HI +KR ++AA++ K+DESVG +V+
Sbjct: 189 KGPLFLYVAHAAGHSGNPYNPLPVPDDDVMKLDHILHYKRRRYAAMVAKMDESVGVIVDQ 248
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L + ML NSII+F SDNGG A GFNLN ASN+PLRGVKNTLWEGGVR AGL+WSPLLE
Sbjct: 249 LRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLRGVKNTLWEGGVRAAGLVWSPLLEK 308
Query: 359 RGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPR 418
R V ++ +H+SDWLPTL+ AA IP N+ + + + + ++ H Y SPR
Sbjct: 309 RQRVVDETMHISDWLPTLIEAAGG--IPALANANLSQSL----DGMSIWQTLVHAYPSPR 362
Score = 76.6 bits (187), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 87/192 (45%), Gaps = 24/192 (12%)
Query: 556 INGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKL 615
I G +N N Q+ +DG+ +W L PS R T+LHNIDD W +ALT G WK+
Sbjct: 327 IEAAGGIPALANANLSQS-LDGMSIWQTLVHAYPSPRKTVLHNIDDIWGSAALTVGDWKM 385
Query: 616 VKVVKVMRYQVDLTGGPD----------QVYLSGLSDREWLALAM-------RKLRDAAS 658
++ + D GP + L S + L M ++LR+ A+
Sbjct: 386 IQGTN-YNGRWDGWYGPGGERDPHLYDWNLVLKSSSGKAMTGLKMLPSRADQQRLRETAT 444
Query: 659 IQCG--PVKEVPCEPQI--APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAY 714
+ C + C APCLF +++DPCE+ NLA++ + ++ E+ N A
Sbjct: 445 VTCQGQSTQGTSCVATAFSAPCLFHVRDDPCEQFNLAEQYP-KVLDLLVAELQHINSTAV 503
Query: 715 PDKEEEEEKKKK 726
P + ++ +
Sbjct: 504 PPSNQPDDPRAS 515
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 25/34 (73%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
MQH VLY E GLPL EKILPQYL +LGY + I
Sbjct: 91 MQHTVLYAAEPRGLPLKEKILPQYLNDLGYTSHI 124
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 58/170 (34%), Gaps = 62/170 (36%)
Query: 737 KKKKKKKKKKKKYSNEEEGMRKLRDAASIQCG--PVKEVPCEPQI--APCLFDIKNDPCE 792
K K K ++LR+ A++ C + C APCLF +++DPC
Sbjct: 417 KSSSGKAMTGLKMLPSRADQQRLRETATVTCQGQSTQGTSCVATAFSAPCLFHVRDDPC- 475
Query: 793 KNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
++FN
Sbjct: 476 --------------------------EQFN------------------------------ 479
Query: 853 MMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
+ + YP VL + EL +IN TAV P N+P D P F++ W+ FGD
Sbjct: 480 -LAEQYPKVLDLLVAELQHINSTAVPPSNQPDDPRASPMYFNYTWTNFGD 528
>gi|198455736|ref|XP_001360091.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
gi|198135374|gb|EAL24665.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
Length = 545
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 210/360 (58%), Positives = 252/360 (70%), Gaps = 24/360 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHIIFILADDLG+NDVGFHG QIPTPNIDALAYSGIIL YY +CTPSRSA+MTGK+
Sbjct: 26 PHIIFILADDLGFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKY 85
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTGMQH VLY E GLPL EKILPQYL +LGY + I GKWHLG +K EYTP FRGF
Sbjct: 86 PIHTGMQHTVLYAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFRGFS 145
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
SH +WGLDMR + A++LHG+Y+TDV T +V +I NH +
Sbjct: 146 SH-----------------KNLWGLDMRNGTDVAYNLHGQYTTDVITKHSVSVIANHDAA 188
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
PLFLY+AHAA HS NPY PL PD + HI +KR ++AA++ K+DESVG +V+
Sbjct: 189 KGPLFLYVAHAAGHSGNPYNPLPVPDDDVMKLDHILHYKRRRYAAMVAKMDESVGVIVDQ 248
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L + ML NSII+F SDNGG A GFNLN ASN+PLRGVKNTLWEGGVR AGL+WSPLLE
Sbjct: 249 LRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLRGVKNTLWEGGVRAAGLVWSPLLEK 308
Query: 359 RGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPR 418
R V ++ +H+SDWLPTL+ AA IP N+ + + + + ++ H Y SPR
Sbjct: 309 RQRVVDETMHISDWLPTLIEAAGG--IPALANANLSQSL----DGMSIWQTLVHAYPSPR 362
Score = 76.3 bits (186), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 87/192 (45%), Gaps = 24/192 (12%)
Query: 556 INGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKL 615
I G +N N Q+ +DG+ +W L PS R T+LHNIDD W +ALT G WK+
Sbjct: 327 IEAAGGIPALANANLSQS-LDGMSIWQTLVHAYPSPRKTVLHNIDDIWGSAALTVGDWKM 385
Query: 616 VKVVKVMRYQVDLTGGPD----------QVYLSGLSDREWLALAM-------RKLRDAAS 658
++ + D GP + L S + L M ++LR+ A+
Sbjct: 386 IQGTN-YNGRWDGWYGPGGERDPHLYDWNLVLKSSSGKAMTGLKMLPSRADQQRLRETAT 444
Query: 659 IQCG--PVKEVPCEPQI--APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAY 714
+ C + C APCLF +++DPCE+ NLA++ + ++ E+ N A
Sbjct: 445 VTCQGQSTQGTSCVATAFSAPCLFHVRDDPCEQFNLAEQYP-KVLDLLMAELQHINSTAV 503
Query: 715 PDKEEEEEKKKK 726
P + ++ +
Sbjct: 504 PPSNQPDDPRAS 515
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 25/34 (73%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
MQH VLY E GLPL EKILPQYL +LGY + I
Sbjct: 91 MQHTVLYAAEPRGLPLKEKILPQYLNDLGYTSHI 124
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 58/170 (34%), Gaps = 62/170 (36%)
Query: 737 KKKKKKKKKKKKYSNEEEGMRKLRDAASIQCG--PVKEVPCEPQI--APCLFDIKNDPCE 792
K K K ++LR+ A++ C + C APCLF +++DPC
Sbjct: 417 KSSSGKAMTGLKMLPSRADQQRLRETATVTCQGQSTQGTSCVATAFSAPCLFHVRDDPC- 475
Query: 793 KNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
++FN
Sbjct: 476 --------------------------EQFN------------------------------ 479
Query: 853 MMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
+ + YP VL + EL +IN TAV P N+P D P F++ W+ FGD
Sbjct: 480 -LAEQYPKVLDLLMAELQHINSTAVPPSNQPDDPRASPMYFNYTWTNFGD 528
>gi|380012883|ref|XP_003690503.1| PREDICTED: arylsulfatase J-like [Apis florea]
Length = 671
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 197/326 (60%), Positives = 244/326 (74%), Gaps = 5/326 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+ ILADDLGWNDV FHG DQIPTPNIDALAY+GIIL N+Y LCTPSRSA+MTGK+
Sbjct: 28 PHIVIILADDLGWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKN 87
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIH GMQH+VL+ E GLPLSEK+LP+YL+E+GY+T VGKWHLG++KKEYTPT+RGF+
Sbjct: 88 PIHLGMQHSVLFPAEPRGLPLSEKLLPEYLREVGYKTHAVGKWHLGYFKKEYTPTYRGFD 147
Query: 180 SHLGYWTGHQDYFDHSAEE-----MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
SH GYW G QDY+ H +E + G DMRR+L AWD GKYSTD+FT EAV +I+
Sbjct: 148 SHFGYWNGLQDYYTHITQEPDPVYSEYKGFDMRRNLTVAWDTVGKYSTDLFTNEAVRLIN 207
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
H+ D+P+FLYLAH A H N + +APD + +I D +R AA++ KLD+SVG
Sbjct: 208 EHNIDQPMFLYLAHLAPHKGNEEQLFRAPDEEIAKFSYILDPERRIQAAVVSKLDQSVGD 267
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V++AL R ML NSI+VF+SDNG + G N SN+PLRG+KN+ WEGG RG IWSP
Sbjct: 268 VMDALRNRGMLENSIVVFMSDNGAPSQGLLSNEGSNYPLRGIKNSPWEGGTRGVAAIWSP 327
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
L++ V+ Q + +SDWLPTLLSAA
Sbjct: 328 LIKKSKRVSNQMMFMSDWLPTLLSAA 353
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 62/190 (32%), Positives = 89/190 (46%), Gaps = 28/190 (14%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN---GTSENRS 566
IDG D+WS L N+ S R+ ++ NIDD +A RG +K + + G+ G S + S
Sbjct: 363 IDGYDLWSALVLNKISSRSEVMLNIDDLSDYAAFRRGDFKYIIGRTETGSAWLGASGDPS 422
Query: 567 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQV 626
S Q + I SK + I Q L + + K + VK+ Y
Sbjct: 423 EGISPQYDPHKILY---------SKTGVAISGIITAKQAMELKEERRK--RSVKI--YDT 469
Query: 627 DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKE---VPCEPQIAPCLFDIKND 683
L ++ LS + +LR A I+C VKE +PCEP APCLF+I+ D
Sbjct: 470 ALKTNFEEKILS--------VENISELRRKAQIKCN-VKEKDRIPCEPMTAPCLFNIEKD 520
Query: 684 PCEKNNLADR 693
PCE NLA++
Sbjct: 521 PCEMVNLAEK 530
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 86/191 (45%), Gaps = 15/191 (7%)
Query: 615 LVKVVKVMRYQVDLTGGPD---QVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
L+ V R Q+ G D + L+ +S R + L + L D A+ + G K +
Sbjct: 349 LLSAAGVDRRQLGNIDGYDLWSALVLNKISSRSEVMLNIDDLSDYAAFRRGDFKYIIGRT 408
Query: 672 QIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKK 731
+ DP E ++ + + +I + T V I E +E+++K+ K
Sbjct: 409 ETGSAWLGASGDPSE--GISPQYDPHKILYSKTGVAISGIITAKQAMELKEERRKRSVKI 466
Query: 732 KKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE---VPCEPQIAPCLFDIKN 788
K ++K E + +LR A I+C VKE +PCEP APCLF+I+
Sbjct: 467 YDTALKTNFEEKILSV------ENISELRRKAQIKCN-VKEKDRIPCEPMTAPCLFNIEK 519
Query: 789 DPCEKNNLADR 799
DPCE NLA++
Sbjct: 520 DPCEMVNLAEK 530
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/32 (65%), Positives = 28/32 (87%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
MQH+VL+ E GLPLSEK+LP+YL+E+GY+T
Sbjct: 93 MQHSVLFPAEPRGLPLSEKLLPEYLREVGYKT 124
>gi|307191747|gb|EFN75189.1| Arylsulfatase B [Harpegnathos saltator]
Length = 583
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 200/330 (60%), Positives = 248/330 (75%), Gaps = 4/330 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHII ILADDLGWNDV FHG +QIPTPNIDALAY G++LKN+Y LCTPSR+A++TGK+
Sbjct: 41 PHIIIILADDLGWNDVSFHGSNQIPTPNIDALAYYGVLLKNHYVAALCTPSRAALLTGKY 100
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIH GMQH ++ E GLPL EK+LPQYLK++ Y T IVGKWHLG+YK EYTP +RGF+
Sbjct: 101 PIHLGMQHEAIFPSEPRGLPLEEKLLPQYLKDMNYVTHIVGKWHLGYYKMEYTPLYRGFD 160
Query: 180 SHLGYWTGHQDYFDH-SAEEMKM-WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
+H GYW G QDY+ H +AE + G+DMRR+ AWD GKYS D++T EAV +I+ H+
Sbjct: 161 THFGYWNGLQDYYSHKTAEPYTLNIGMDMRRNFTVAWDTMGKYSVDLYTDEAVRLINTHN 220
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
TD P+FLYLA A H+ N + QA + +I D KR ++AA++ KLDESVGKVVE
Sbjct: 221 TDNPMFLYLAQIAPHAGNANQLPQALPEEIEKFSYIIDPKRKRYAAVVSKLDESVGKVVE 280
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL+ R MLSNSII+F+SDNG +GF N SN+P RG+K TLWEGGVRGA IWSPL++
Sbjct: 281 ALKNRNMLSNSIILFMSDNGAPTSGFLSNGGSNYPFRGIKKTLWEGGVRGAAAIWSPLIK 340
Query: 358 SRGIVAEQYVHVSDWLPTLLSAA--NKSDI 385
R V+ +HV+DWLPTLLSAA NK +I
Sbjct: 341 KRERVSYHLMHVTDWLPTLLSAAGGNKEEI 370
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 64/141 (45%), Gaps = 27/141 (19%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG---- 630
IDG+D+W + + S R +L NIDD A+ GK+K V+ + D G
Sbjct: 373 IDGLDMWETIVYGDCSPRKEVLINIDDICDSEAIRIGKYKYVR--GESKKNNDWVGDSGR 430
Query: 631 ---------GPDQVY----------LSGLSDREWLALAMRKLRDAASIQCGPVKE--VPC 669
P+ V L G ++ + M +LR A + C +E V C
Sbjct: 431 NSSEQRPPYSPETVLNSKVGIAIASLPGNNNVALTSAKMLELRQQAEVHCNVTEEEKVQC 490
Query: 670 EPQIAPCLFDIKNDPCEKNNL 690
+ Q+APCLFD++ DPCE N+
Sbjct: 491 DTQLAPCLFDLERDPCEMINI 511
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 28/43 (65%), Gaps = 2/43 (4%)
Query: 756 MRKLRDAASIQCGPVKE--VPCEPQIAPCLFDIKNDPCEKNNL 796
M +LR A + C +E V C+ Q+APCLFD++ DPCE N+
Sbjct: 469 MLELRQQAEVHCNVTEEEKVQCDTQLAPCLFDLERDPCEMINI 511
Score = 44.3 bits (103), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH ++ E GLPL EK+LPQYLK++ Y T I+
Sbjct: 106 MQHEAIFPSEPRGLPLEEKLLPQYLKDMNYVTHIVG 141
>gi|194756524|ref|XP_001960527.1| GF13402 [Drosophila ananassae]
gi|190621825|gb|EDV37349.1| GF13402 [Drosophila ananassae]
Length = 541
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 212/345 (61%), Positives = 246/345 (71%), Gaps = 22/345 (6%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
M F L + L V +S P P+IIFILADDLG+NDVGFHG QIPTPNIDALAY
Sbjct: 1 MLFLWLFIFLILPQFLKGEVGNSSPKPNIIFILADDLGFNDVGFHGSAQIPTPNIDALAY 60
Query: 94 SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
SGIIL YY +CTPSRSA+MTGK+PIHTGMQH VLY E GL L EKILPQYL +LG
Sbjct: 61 SGIILNRYYVTPICTPSRSALMTGKYPIHTGMQHAVLYAAEPRGLSLKEKILPQYLNDLG 120
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y + I GKWHLG +K +YTP FRGF SH WGLDMR E A
Sbjct: 121 YTSHIAGKWHLGHWKLKYTPLFRGFSSH--------------------WGLDMRNGTEVA 160
Query: 214 WDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
+DLHG+Y+TDV T AV +I NH +T PLFLY+AHAA HS+NPY PL PD+ + H
Sbjct: 161 YDLHGRYTTDVITDHAVKVIANHNTTSGPLFLYVAHAACHSSNPYNPLPVPDNEVMKLGH 220
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
I +KR KFAA++ K+DESVG +V+ L + ML NSII+F SDNGG A GFNLN ASN+P
Sbjct: 221 IPHYKRRKFAAMVTKMDESVGLIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYP 280
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
L+GVKNTLWEGGVR AGLIWSPLL+SR V+ Q +H+SDWLPTLL
Sbjct: 281 LKGVKNTLWEGGVRAAGLIWSPLLKSRQRVSTQTIHISDWLPTLL 325
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/177 (30%), Positives = 82/177 (46%), Gaps = 27/177 (15%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD- 633
IDG +W L +EPS R ++HNIDD W + L+ G WKLVK + D GP
Sbjct: 341 IDGQSIWQALVHDEPSPRLKVIHNIDDIWGSAGLSVGDWKLVKGTNYYG-KWDGWYGPAG 399
Query: 634 ---------QVYLSGLSDREWLALAM-------RKLRDAASIQCGPVKEVP----CEPQI 673
Q S + L M ++LR AA+++C + + P C+
Sbjct: 400 ERDPHLYDWQQIPKSRSGKALNGLKMLPSQAEQQRLRAAATVKC--LGQSPQGSSCKATA 457
Query: 674 --APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
APCLF+I +DPCE+ NLA++ + + E+ N A P + +++ K
Sbjct: 458 FSAPCLFNIHDDPCEQFNLAEQYP-KVLKALMEELKYQNSTAVPPANKPGDRRASPK 513
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 30/49 (61%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
IDG +W L +EPS R ++HNIDD W + L+ G WKLVK + G
Sbjct: 341 IDGQSIWQALVHDEPSPRLKVIHNIDDIWGSAGLSVGDWKLVKGTNYYG 389
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 22/34 (64%), Positives = 24/34 (70%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
MQH VLY E GL L EKILPQYL +LGY + I
Sbjct: 92 MQHAVLYAAEPRGLSLKEKILPQYLNDLGYTSHI 125
Score = 45.4 bits (106), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 30/49 (61%)
Query: 854 MKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
+ + YP VL + +EL N TAV P NKP D+ PK +++ W+ FGD
Sbjct: 476 LAEQYPKVLKALMEELKYQNSTAVPPANKPGDRRASPKYWNYTWTNFGD 524
>gi|383859596|ref|XP_003705279.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
Length = 689
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 192/326 (58%), Positives = 244/326 (74%), Gaps = 5/326 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHII ILADDLGWNDV FHG DQIPTPNIDA+AY+GIIL ++Y LCTPSR+A+MTGK+
Sbjct: 59 PHIIIILADDLGWNDVSFHGSDQIPTPNIDAIAYNGIILNSHYVAALCTPSRTALMTGKN 118
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIH GMQH+VL E GLPLSEK+LP+YL+E+GYRT VGKWHLG++++EYTPTFRGF+
Sbjct: 119 PIHLGMQHSVLLPSEPRGLPLSEKLLPEYLREVGYRTHAVGKWHLGYFRREYTPTFRGFD 178
Query: 180 SHLGYWTGHQDYFDHSAEE-----MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+H GYW G QDY+ H +E + GLDMRR+L AWD GKYSTD+FT EAV +I+
Sbjct: 179 THFGYWNGLQDYYTHITQEPDPQFGEFMGLDMRRNLTAAWDTQGKYSTDLFTEEAVRLIN 238
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
H D+P+FLYLAH A H NP L+A D + +I D +R AA++ K+D+SVG+
Sbjct: 239 EHDKDDPMFLYLAHLAPHKGNPNRLLRASDEDIARFSYILDPERRIQAAVVSKVDQSVGE 298
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V++AL +R ML NSI++F+SDNG G N SN+PLRG+K++ WEGG RG IWSP
Sbjct: 299 VMDALRKRGMLENSIVLFMSDNGAPTQGSLSNQGSNYPLRGIKDSPWEGGTRGVAAIWSP 358
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
L++ V+ Q + ++DWLPTLLSAA
Sbjct: 359 LIKESQRVSNQMMFIADWLPTLLSAA 384
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 79/168 (47%), Gaps = 43/168 (25%)
Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWK-LVKVVKVMRYQV 626
D+ IDG D+W L ++ S R+ ++ NIDD + +++ +G +K ++ + R V
Sbjct: 387 DSKQLGNIDGFDLWPALRSDKISPRSDVVLNIDDISKYASIRRGDFKYVIGETETGRAWV 446
Query: 627 DLTGGPDQ-----------------VYLSGL------------------SDREWLALAMR 651
TG P + V +SG+ D E L+
Sbjct: 447 GATGAPSEGIPPKYDPDKVLYSKAGVAISGVLTAKQAKELKKKKNTSSKIDFEEKILSAE 506
Query: 652 K---LRDAASIQCGPVKE---VPCEPQIAPCLFDIKNDPCEKNNLADR 693
K LR+ A +QC VKE +PC+P APCLF+I+ DPCE NLAD+
Sbjct: 507 KIFQLRNDARVQCN-VKEENRIPCDPTAAPCLFNIEKDPCEMVNLADK 553
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 35/49 (71%), Gaps = 4/49 (8%)
Query: 754 EGMRKLRDAASIQCGPVKE---VPCEPQIAPCLFDIKNDPCEKNNLADR 799
E + +LR+ A +QC VKE +PC+P APCLF+I+ DPCE NLAD+
Sbjct: 506 EKIFQLRNDARVQCN-VKEENRIPCDPTAAPCLFNIEKDPCEMVNLADK 553
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/32 (68%), Positives = 27/32 (84%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
MQH+VL E GLPLSEK+LP+YL+E+GYRT
Sbjct: 124 MQHSVLLPSEPRGLPLSEKLLPEYLREVGYRT 155
>gi|195333848|ref|XP_002033598.1| GM21416 [Drosophila sechellia]
gi|194125568|gb|EDW47611.1| GM21416 [Drosophila sechellia]
Length = 542
Score = 411 bits (1056), Expect = e-111, Method: Compositional matrix adjust.
Identities = 200/322 (62%), Positives = 239/322 (74%), Gaps = 21/322 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+IIFILADDLG+NDVGFHG +IPTPNIDALAYSGIIL YY +CTPSRSA+MTGK+
Sbjct: 26 PNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKY 85
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTGMQH VLY E GLPL EKILPQYL ELGY + I GKWHLG +K +YTP +RGF
Sbjct: 86 PIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFS 145
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
SH WGLDMR + A+DLHG Y+TDV T ++V +I NH +T
Sbjct: 146 SH--------------------WGLDMRNGTQVAYDLHGHYTTDVITDQSVKVIANHNAT 185
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
PLFLY+AHAA HS+NPY PL PD+ + HI ++KR KFAA++ K+D+SVG++V+
Sbjct: 186 KGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDDSVGQIVDQ 245
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L + ML NSII+F SDNGG A GFNLN ASN+PL+GVKNTLWEGGVR AGL+WSPLL+
Sbjct: 246 LRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKK 305
Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
V+ Q +H+ DWLPTLL AA
Sbjct: 306 SQRVSNQTMHIVDWLPTLLEAA 327
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 54/176 (30%), Positives = 82/176 (46%), Gaps = 23/176 (13%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
+IDG +W L ++ S R +LHNIDD W +AL+ G WKLVK R D GP
Sbjct: 339 QIDGQSIWRALVHDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTN-YRGSWDGWYGPA 397
Query: 634 ----------QVYLSGLSDREWLALAM-------RKLRDAASIQC--GPVKEVPCEPQI- 673
Q+ + + AL M +++R AA++ C + C
Sbjct: 398 GERDPRLYDWQLVGRSRAGKALEALKMLPSRADQQRIRAAATVSCPGQSSQGTSCVATAF 457
Query: 674 -APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
APCLF I++DPCE+ NLA + + ++ E+ RFN A P + + + +
Sbjct: 458 SAPCLFHIRDDPCEQYNLA-KQYPEVVSALMAELERFNATAVPPSNKPADPRADPR 512
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 25/34 (73%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
MQH VLY E GLPL EKILPQYL ELGY + I
Sbjct: 91 MQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHI 124
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%)
Query: 854 MKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
+ K YP+V+S + EL N TAV P NKP D DP+ ++ W+ FGD
Sbjct: 475 LAKQYPEVVSALMAELERFNATAVPPSNKPADPRADPRFWNFTWTNFGD 523
>gi|195582835|ref|XP_002081231.1| GD10911 [Drosophila simulans]
gi|194193240|gb|EDX06816.1| GD10911 [Drosophila simulans]
Length = 633
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 200/322 (62%), Positives = 238/322 (73%), Gaps = 21/322 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+IIFILADDLG+NDVGFHG +IPTPNIDALAYSGIIL YY +CTPSRSA+MTGK+
Sbjct: 26 PNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKY 85
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTGMQH VLY E GLPL EKILPQYL ELGY + I GKWHLG +K +YTP +RGF
Sbjct: 86 PIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFS 145
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
SH WGLDMR + A+DLHG Y+TDV T +V +I NH +T
Sbjct: 146 SH--------------------WGLDMRNGTQVAYDLHGHYTTDVITEHSVKVIANHNAT 185
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
PLFLY+AHAA HS+NPY PL PD+ + HI ++KR KFAA++ K+D+SVG++V+
Sbjct: 186 KGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDDSVGQIVDQ 245
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L + ML NSII+F SDNGG A GFNLN ASN+PL+GVKNTLWEGGVR AGL+WSPLL+
Sbjct: 246 LRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKK 305
Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
V+ Q +H+ DWLPTLL AA
Sbjct: 306 SQRVSNQTMHIVDWLPTLLEAA 327
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 25/34 (73%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
MQH VLY E GLPL EKILPQYL ELGY + I
Sbjct: 91 MQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHI 124
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 484 IPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL 543
+P L + G N S +IDG +W L +++ S R +LHNIDD W +A+
Sbjct: 320 LPTLLEAAGGQPALANLSK------QIDGQSIWRALVQDKASPRLNVLHNIDDIWGSAAI 373
Query: 544 TRGKWKLVKENSINGN 559
+ G WKLVK + G+
Sbjct: 374 SVGDWKLVKGTNYRGS 389
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 30/44 (68%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVK 617
+IDG +W L +++ S R +LHNIDD W +A++ G WKLVK
Sbjct: 339 QIDGQSIWRALVQDKASPRLNVLHNIDDIWGSAAISVGDWKLVK 382
>gi|156552077|ref|XP_001604760.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 710
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/332 (56%), Positives = 244/332 (73%), Gaps = 5/332 (1%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PP+I+ I+ADD+GWNDV FHG DQIPTPNIDALAY+G+IL ++Y LCTPSRSA++TGK
Sbjct: 43 PPNIVMIIADDMGWNDVSFHGSDQIPTPNIDALAYNGVILNSHYVSALCTPSRSALLTGK 102
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+PIHTGMQH V+ E GLPL EKILPQYLKE GY T +GKWH GF+++EYTPT+RGF
Sbjct: 103 YPIHTGMQHLVILEAEPRGLPLHEKILPQYLKEAGYATHAIGKWHQGFHRREYTPTYRGF 162
Query: 179 ESHLGYWTGHQDYFDH----SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+SH GYW G QDY+ H S + G DMRR++ A D +GKYSTD+FT EAV +I
Sbjct: 163 DSHFGYWQGLQDYYTHEVGSSNPKEGFLGFDMRRNMSLARDTYGKYSTDLFTDEAVRLIE 222
Query: 235 NHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
H + P+FLYLAH A HS N EPLQAPD + ++ED +R +AA++ KLD+SVG
Sbjct: 223 EHRPEAGPMFLYLAHLAPHSGNDNEPLQAPDEEVAKFSYVEDPERRIYAAMMSKLDQSVG 282
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
+VV AL ++ ML NSI+VF++DNG A G + N SN+PLRG+K + WEG VRGA +WS
Sbjct: 283 EVVSALRRKNMLQNSIVVFMADNGAATQGIHYNRGSNYPLRGIKASAWEGAVRGAAAVWS 342
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
PL++ + + + ++DWLPTLLSA+ D+
Sbjct: 343 PLIQRPKRIYNELMSIADWLPTLLSASGLRDV 374
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 65/190 (34%), Positives = 87/190 (45%), Gaps = 30/190 (15%)
Query: 510 IDGIDVWSVLSRNEPSK-RNTILHNIDDEWQISALTRGKWKLVKENSINGNG--TSENRS 566
IDG+D W +S PS RN IL NID + SAL RG++K V NG R
Sbjct: 381 IDGVDQWPAISGVAPSPPRNEILVNIDPIFNYSALRRGEFKYVLGTVGNGEEWYGETGRP 440
Query: 567 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS---ALTKGKWKLVKVVKVMR 623
+ + D +VL SK T + + Q+S A+ K + VK
Sbjct: 441 ENQGLEGASPTYDPETVLM----SKAGTAIAGLLTAKQVSEIRAVRKDTYGRVK------ 490
Query: 624 YQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCG-PVKE-VPCEPQIAPCLFDIK 681
D++ S L + L KLR +AS++C P E V C P +PCLF+IK
Sbjct: 491 --------DDEMATSKLLTVDELL----KLRSSASLRCTVPESERVACHPLQSPCLFNIK 538
Query: 682 NDPCEKNNLA 691
DPCE+ NLA
Sbjct: 539 EDPCEQRNLA 548
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 43/77 (55%), Gaps = 3/77 (3%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCG-PVKE-VPCEPQIA 780
K+ + + +K + K + K +E + KLR +AS++C P E V C P +
Sbjct: 473 KQVSEIRAVRKDTYGRVKDDEMATSKLLTVDE-LLKLRSSASLRCTVPESERVACHPLQS 531
Query: 781 PCLFDIKNDPCEKNNLA 797
PCLF+IK DPCE+ NLA
Sbjct: 532 PCLFNIKEDPCEQRNLA 548
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 22/32 (68%), Positives = 23/32 (71%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
MQH V+ E GLPL EKILPQYLKE GY T
Sbjct: 109 MQHLVILEAEPRGLPLHEKILPQYLKEAGYAT 140
>gi|194883566|ref|XP_001975872.1| GG20328 [Drosophila erecta]
gi|190659059|gb|EDV56272.1| GG20328 [Drosophila erecta]
Length = 478
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 200/322 (62%), Positives = 236/322 (73%), Gaps = 21/322 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+IIFILADDLG+NDVGFHG IPTPNIDALAYSGIIL YY +CTPSRSA+MTGK+
Sbjct: 26 PNIIFILADDLGFNDVGFHGSADIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKY 85
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTGMQH VLY E GLPL EKILPQYL ELGY + I GKWHLG +K +YTP +RGF
Sbjct: 86 PIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYASHIAGKWHLGHWKLKYTPLYRGFS 145
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
SH WGLDMR E A+DLHG Y+TDV T +V +I +H +T
Sbjct: 146 SH--------------------WGLDMRNGTEVAYDLHGHYTTDVITDHSVKVIASHNAT 185
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
PLFLY+AHAA HS+NPY PL PD+ + HI +KR KFAA++ K+D+SVG++V+
Sbjct: 186 KGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMAHIPHYKRRKFAAMVSKMDDSVGQIVDQ 245
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L + ML NSII+F SDNGG A GFNLN ASN+PL+GVKNTLWEGGVR AGL+WSPLL+
Sbjct: 246 LRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKK 305
Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
V+ Q +H+ DWLPTLL AA
Sbjct: 306 SQRVSNQTMHIIDWLPTLLEAA 327
Score = 73.2 bits (178), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 61/212 (28%), Positives = 91/212 (42%), Gaps = 59/212 (27%)
Query: 484 IPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL 543
+P L + G N S +IDG +W L +++ S R +LHNIDD W +AL
Sbjct: 320 LPTLLEAAGGPAAVANLSK------QIDGQSIWRALVQDKASPRLNVLHNIDDIWGSAAL 373
Query: 544 TRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEW 603
+ G WKLVK GT+ S D Y +RN L+ +W
Sbjct: 374 SVGDWKLVK-------GTNYRGSWDGWY---------------GPAGERNPRLY----DW 407
Query: 604 QISALTKGKWKLVKVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGP 663
Q+ A ++ K ++ +K++ Q D +++R AA++ C
Sbjct: 408 QMVARSRAG-KALEALKMLPSQAD----------------------QQRIRAAATVSCPA 444
Query: 664 --VKEVPCEPQI--APCLFDIKNDPCEKNNLA 691
K C APCLF I++DPCE+ NLA
Sbjct: 445 QSSKGTSCLATAFSAPCLFHIRDDPCEQYNLA 476
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 25/34 (73%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
MQH VLY E GLPL EKILPQYL ELGY + I
Sbjct: 91 MQHTVLYAAEPRGLPLEEKILPQYLNELGYASHI 124
>gi|195057745|ref|XP_001995315.1| GH22700 [Drosophila grimshawi]
gi|193899521|gb|EDV98387.1| GH22700 [Drosophila grimshawi]
Length = 542
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 204/341 (59%), Positives = 240/341 (70%), Gaps = 24/341 (7%)
Query: 41 PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
P++F L + S P+IIFILADDLG+NDVGF G QIPTPNIDALAYSG+IL
Sbjct: 19 PISFAL------VACQSTQPNIIFILADDLGFNDVGFRGSAQIPTPNIDALAYSGLILNR 72
Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
YY +CTPSRSA+MTGK+PIHTGMQH VLY E GLPL KILPQYL ELGY + I G
Sbjct: 73 YYVNPICTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLDLKILPQYLNELGYTSHIAG 132
Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
KWHLG +K+ YTP +RGF SH GY WGLDMR E A+DLHG+Y
Sbjct: 133 KWHLGHWKRVYTPLYRGFSSHHGY-----------------WGLDMRNGTEIAYDLHGQY 175
Query: 221 STDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
+TDV T ++ +I NH + PLFLY+AHAA HS NPY PL A D + I+ +KR
Sbjct: 176 TTDVITQHSLQVIANHKPAKGPLFLYVAHAAVHSGNPYNPLPASDDAVRRLDKIQHYKRR 235
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
K+AA++ +DESVGK+VE L + RML NSIIVF +DNGG A GFN N ASN+PLRGVKNT
Sbjct: 236 KYAALVTAMDESVGKIVEQLRKSRMLENSIIVFSTDNGGPAEGFNSNFASNYPLRGVKNT 295
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
LWEGGVRGA L+WSP L R AEQ +H+ DWLPTL+ AA
Sbjct: 296 LWEGGVRGAALLWSPQLTKRPRTAEQLMHMVDWLPTLVEAA 336
Score = 80.5 bits (197), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 82/175 (46%), Gaps = 28/175 (16%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
++DG+ +W L RNE S RN+ILHNIDD W +AL+ G WKL V+ Y G
Sbjct: 349 KLDGVSLWQALVRNEASPRNSILHNIDDIWGSAALSVGVWKL---VQGSHYNGSWDGWYG 405
Query: 634 QVYLSGLSDREWLAL-------AMR------------KLRDAASIQC-----GPVKEVPC 669
L D W + AM+ +LR AA++ C P ++ C
Sbjct: 406 PAGLRNPHDYNWQLVTKCAAGQAMQQLKMLPQQADIIRLRKAATVSCKEAKVKPSPDIAC 465
Query: 670 EPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKK 724
P CLF++ +DPCE+ NLA + + + E+ FN A P + ++ +
Sbjct: 466 NPLKEICLFNVLDDPCEQRNLA-KQHPKVLALLLNELQLFNATAIPPANKPDDPR 519
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 24/34 (70%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
MQH VLY E GLPL KILPQYL ELGY + I
Sbjct: 97 MQHTVLYAAEPRGLPLDLKILPQYLNELGYTSHI 130
Score = 43.5 bits (101), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 35/66 (53%), Gaps = 5/66 (7%)
Query: 737 KKKKKKKKKKKKYSNEEEGMRKLRDAASIQCG-----PVKEVPCEPQIAPCLFDIKNDPC 791
K + ++ K ++ + +LR AA++ C P ++ C P CLF++ +DPC
Sbjct: 422 KCAAGQAMQQLKMLPQQADIIRLRKAATVSCKEAKVKPSPDIACNPLKEICLFNVLDDPC 481
Query: 792 EKNNLA 797
E+ NLA
Sbjct: 482 EQRNLA 487
Score = 43.1 bits (100), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 34/53 (64%)
Query: 850 KKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
+++ + K +P VL+ + EL N TA+ P NKP D GDP+ ++H W+ FGD
Sbjct: 482 EQRNLAKQHPKVLALLLNELQLFNATAIPPANKPDDPRGDPRLWNHTWTNFGD 534
>gi|291244830|ref|XP_002742299.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
partial [Saccoglossus kowalevskii]
Length = 559
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 188/336 (55%), Positives = 244/336 (72%), Gaps = 2/336 (0%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+A+S PHII I+ADDLGW+DV FHG DQIPTPNID LAYSG++L NYY +CTP+R+A
Sbjct: 22 LATSSQPHIIVIIADDLGWDDVSFHGSDQIPTPNIDELAYSGVLLHNYYVQPICTPTRAA 81
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+MTG+HPIH G+Q V+ GLPL+E I+PQYLK LGY T IVGKWHLGF+ +YTP
Sbjct: 82 LMTGRHPIHLGLQDGVIVASHPYGLPLNETIMPQYLKPLGYDTHIVGKWHLGFFAWQYTP 141
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
+RGF++H GY+ G + Y+DH+AEE K GLD R + E +G+YST++FT+ A II
Sbjct: 142 LYRGFDTHFGYYNGEEGYYDHTAEEPKYIGLDFRNNTELFKSAYGEYSTELFTSYAEKII 201
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
HNH+ ++PLFLYLAH A HS N Y PL+AP Y + +I+D +R FA ++ LD+SVG
Sbjct: 202 HNHNKNKPLFLYLAHQAVHSGNSYSPLEAPYKYTSRFPYIQDERRRTFAGMVSALDDSVG 261
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
+ AL+ + N II+F +DNGG AAGF+ N ASNWPLRG+K+TLWEGGVRG G I S
Sbjct: 262 NITRALKHSGLYENCIILFSTDNGGPAAGFDANYASNWPLRGIKHTLWEGGVRGDGFIHS 321
Query: 354 PLLESRGIVAEQYVHVSDWLPTL--LSAANKSDIPN 387
PLLE G ++ +HV DWLPT+ ++ N S +P
Sbjct: 322 PLLEKPGRISTDMIHVCDWLPTIYHIAGGNVSSLPT 357
Score = 44.3 bits (103), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 38/75 (50%), Gaps = 9/75 (12%)
Query: 502 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTRGKWKLVKENSINGN 559
N +S +DG +VW LSR PS R ILHNID +AL G++K++
Sbjct: 351 NVSSLPTNLDGFNVWDTLSRGTPSPRTEILHNIDPHKNHNAAALRVGEFKIIL------- 403
Query: 560 GTSENRSNDNSYQNE 574
GT E+ D Y+ E
Sbjct: 404 GTEEHGEWDGWYEPE 418
Score = 43.1 bits (100), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
Query: 567 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTKGKWKLV 616
N +S +DG +VW LSR PS R ILHNID +AL G++K++
Sbjct: 351 NVSSLPTNLDGFNVWDTLSRGTPSPRTEILHNIDPHKNHNAAALRVGEFKII 402
>gi|195485249|ref|XP_002091013.1| GE12487 [Drosophila yakuba]
gi|194177114|gb|EDW90725.1| GE12487 [Drosophila yakuba]
Length = 544
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 199/322 (61%), Positives = 236/322 (73%), Gaps = 21/322 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+IIFILADDLG+NDVGFHG +IPTPNIDALAYSGIIL YY +CTPSRSA+MTGK+
Sbjct: 26 PNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKY 85
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTGMQH VLY E GLPL EKILPQYL ELGY + I GKWHLG +K +YTP RGF
Sbjct: 86 PIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLHRGFS 145
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
SH WGLDMR + A+DLHG Y+TDV T +V +I +H +T
Sbjct: 146 SH--------------------WGLDMRNGTQVAYDLHGHYTTDVITDHSVKVIASHNAT 185
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
PLFLY+AHAA HS+NPY PL PD+ + HI +KR KFAA++ K+D+SVG++V+
Sbjct: 186 KGPLFLYVAHAACHSSNPYNPLPVPDNDVLKMSHIPHYKRRKFAAMVSKMDDSVGQIVDQ 245
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L + ML NSII+F SDNGG A GFNLN ASN+PL+GVKNTLWEGGVR AGL+WSPLL+
Sbjct: 246 LRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKK 305
Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
V+ Q +H+ DWLPTLL AA
Sbjct: 306 SQRVSNQTMHIIDWLPTLLEAA 327
Score = 79.3 bits (194), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 84/177 (47%), Gaps = 25/177 (14%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
+IDG +W L +++ S R +LHNIDD W +AL+ G WKLVK R D GP
Sbjct: 339 QIDGQSIWRALVQDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTN-YRGSWDGWYGPA 397
Query: 634 ----------QVYLSGLSDREWLALAM-------RKLRDAASIQCGPVKE---VPCEPQI 673
Q+ + AL M +++R AA++ C P + C
Sbjct: 398 GERDPRQYDWQLVAKSRVGKALDALKMLPSRADQQRIRAAATVTC-PAQSSQGTSCLATA 456
Query: 674 --APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
APCLF I++DPCE+ NLA + + +N TE+ RFN A P + + + +
Sbjct: 457 FSAPCLFHIRDDPCEQYNLA-KQYPEVVNALMTELERFNATAVPPSNKPADPRADPR 512
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 25/34 (73%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
MQH VLY E GLPL EKILPQYL ELGY + I
Sbjct: 91 MQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHI 124
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 31/49 (63%)
Query: 854 MKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
+ K YP+V++ + EL N TAV P NKP D DP+ +++ W+ FGD
Sbjct: 475 LAKQYPEVVNALMTELERFNATAVPPSNKPADPRADPRFWNYTWTNFGD 523
>gi|242025556|ref|XP_002433190.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212518731|gb|EEB20452.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 570
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 192/343 (55%), Positives = 248/343 (72%), Gaps = 1/343 (0%)
Query: 39 VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
+L + TLS+ F ++V P+II ILADDLGWNDV FHG +QI TPNIDALAY+GIIL
Sbjct: 26 ILLIFITLSIFFQNVVDGIERPNIIIILADDLGWNDVSFHGSNQIQTPNIDALAYNGIIL 85
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
++Y LCTPSR+++MTGK+P GMQH V+ E GLPL+E ++P+Y + GY T
Sbjct: 86 NSHYVPALCTPSRASLMTGKYPTSLGMQHLVILSPEPWGLPLNETLMPEYFNKNGYATHA 145
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
VGKWHLGF+KKEYTP +RGF+SH G+W G QDY+DH+ + G DMRR+ E + G
Sbjct: 146 VGKWHLGFFKKEYTPIYRGFDSHFGHWNGFQDYYDHTTMSDSLKGYDMRRNFEVDYSYQG 205
Query: 219 KYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
Y+TDVFT EA+ II NH++ + PLFLYL+H A HS NP P QAP+ ++ H I D
Sbjct: 206 MYTTDVFTKEAIKIIDNHNSQKGPLFLYLSHLAPHSGNPDNPFQAPEDEISKHECINDPG 265
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
R +AA++ KLDESVG+VV ALE+ +ML+NSII+F+SDNG A G + N SN+PLRG+K
Sbjct: 266 RKIYAAMVTKLDESVGQVVSALEKNKMLNNSIIIFMSDNGAATYGLHSNRGSNYPLRGLK 325
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+ WEGGVRG IWSP L V++Q +H+SDWLPTLL+AA
Sbjct: 326 ESPWEGGVRGTAAIWSPFLNKTKRVSKQLMHMSDWLPTLLTAA 368
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 81/190 (42%), Gaps = 48/190 (25%)
Query: 508 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSN 567
N+IDGID+W+VLS + PS R + +N D+ S+L WK V+ GT++
Sbjct: 379 NKIDGIDMWNVLSNDLPSPRKEVFNNYDEIENYSSLMIDSWKYVE-------GTAQEGKA 431
Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVD 627
D ++ EPS+ N + + +E + R + D
Sbjct: 432 DYWFE---------------EPSRNNCSEYRVSNE-----------------DIFRLRRD 459
Query: 628 LTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAP-CLFDIKNDPCE 686
T D S +L++ + + ++ + C+P + CLF++ +DPCE
Sbjct: 460 STIICDNPTFSS-------SLSITR-NNHTDVKNKTKYVLTCDPLLKRFCLFNLNDDPCE 511
Query: 687 KNNLADRSED 696
+ NLAD D
Sbjct: 512 RLNLADVFPD 521
>gi|195431744|ref|XP_002063888.1| GK15669 [Drosophila willistoni]
gi|194159973|gb|EDW74874.1| GK15669 [Drosophila willistoni]
Length = 556
Score = 400 bits (1029), Expect = e-108, Method: Compositional matrix adjust.
Identities = 199/351 (56%), Positives = 244/351 (69%), Gaps = 20/351 (5%)
Query: 44 FTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY 102
+ L ++ + + + P PHII+ILADDLG+NDVGFHG QIPTPNIDALAYSG+IL YY
Sbjct: 9 WLLPLMVISVSSQQSPKPHIIYILADDLGFNDVGFHGSAQIPTPNIDALAYSGLILNRYY 68
Query: 103 TVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKW 162
+CTPSRS++MTGK+ IHTGMQH VLYG E GLPL EK+LPQYL +LGY + I GKW
Sbjct: 69 VAPICTPSRSSLMTGKYAIHTGMQHTVLYGAEPRGLPLEEKLLPQYLNDLGYTSHIAGKW 128
Query: 163 HLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYST 222
HLG +K YTP RGF SH G+ WGLDMR A++LHG+Y+T
Sbjct: 129 HLGHWKMPYTPLRRGFNSHHGF-----------------WGLDMRNGSRVAYELHGQYTT 171
Query: 223 DVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
DV T ++D+I NH + PLFLY+AHAA HS NPY PL D + IE +KR ++
Sbjct: 172 DVITQHSIDVIANHPVSKGPLFLYVAHAAAHSGNPYNPLPVADDEVIKLGRIEHYKRRRY 231
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
AA++ KLDESVG +V+ L + ML NSIIVF +DNGGAA GFNLN ASN+PLRGVKNTLW
Sbjct: 232 AAMVSKLDESVGIIVDQLRRSNMLENSIIVFATDNGGAAEGFNLNFASNYPLRGVKNTLW 291
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS-AANKSDIPNYVNS 391
EGGVRG GL+WS R VAEQ +H++DWLPT + A K + N+ S
Sbjct: 292 EGGVRGVGLLWSQRYLKRPRVAEQTMHITDWLPTFVEVAGGKEALANFTTS 342
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 86/180 (47%), Gaps = 24/180 (13%)
Query: 571 YQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG 630
+ +DG +W L ++E S R ++LHNIDD W SALT G WKL+K + D
Sbjct: 339 FTTSLDGQSIWQALVQDEASPRKSVLHNIDDIWNSSALTVGDWKLIKGTN-YKGAWDGWY 397
Query: 631 GPD-----QVY--------LSGLSDREWLALAMR----KLRDAASIQC-----GPVKEVP 668
GP Q+Y L+G + + L R +LR AA++ C K
Sbjct: 398 GPAGERDPQLYNWQQVAKSLAGKAMLQLKVLPSRADQQRLRSAATVNCPNQQSNTQKGSV 457
Query: 669 CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
C P PCLF++++DPCE+ NLA + +N +E+ FN A P ++ + K
Sbjct: 458 CNPLQTPCLFNLRDDPCEQYNLATQYP-LVLNSLLSELKHFNSTAIPPSNLPDDPRGNPK 516
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 26/34 (76%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
MQH VLYG E GLPL EK+LPQYL +LGY + I
Sbjct: 91 MQHTVLYGAEPRGLPLEEKLLPQYLNDLGYTSHI 124
Score = 46.2 bits (108), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 38/175 (21%), Positives = 62/175 (35%), Gaps = 63/175 (36%)
Query: 733 KKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQC-----GPVKEVPCEPQIAPCLFDIK 787
++ K K + K ++LR AA++ C K C P PCLF+++
Sbjct: 411 QQVAKSLAGKAMLQLKVLPSRADQQRLRSAATVNCPNQQSNTQKGSVCNPLQTPCLFNLR 470
Query: 788 NDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKK 847
+D P +++N
Sbjct: 471 DD---------------------------PCEQYN------------------------- 478
Query: 848 KKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
+ YP VL+ + EL + N TA+ P N P D G+PK +++ W+ FGD
Sbjct: 479 ------LATQYPLVLNSLLSELKHFNSTAIPPSNLPDDPRGNPKFWNYTWTNFGD 527
>gi|443694453|gb|ELT95582.1| hypothetical protein CAPTEDRAFT_115907 [Capitella teleta]
Length = 561
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/360 (54%), Positives = 257/360 (71%), Gaps = 17/360 (4%)
Query: 30 YRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNID 89
+R R +AF +L L + A+S P+I+FILADDLGWNDVGFHG +Q+ TPN+D
Sbjct: 4 HRFRCVAFLLLQL--------TAVSATSPRPNIVFILADDLGWNDVGFHGSEQVLTPNLD 55
Query: 90 ALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
ALAY G+IL+NYY +CTPSR+A+MTG+HPIHTGMQ NV+Y E GL L+E I PQYL
Sbjct: 56 ALAYDGVILENYYVQPICTPSRAALMTGRHPIHTGMQQNVIYSAEPYGLGLNEIIFPQYL 115
Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH---SAEEMKM---WG 203
K++GY+T IVGKWHLGF+ ++YTP RGF+SH GY+ G +DY+ H +A E+ WG
Sbjct: 116 KQIGYKTHIVGKWHLGFFAEQYTPIERGFDSHYGYYMGAEDYWVHIAGNAHEVSFNASWG 175
Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY---EPL 260
LD RR+ E G+YST++FT EA +II +H+ EPLF+Y+A A HSANPY L
Sbjct: 176 LDFRRNGEVVKTAFGQYSTELFTTEAENIIASHNQSEPLFMYVAQQAVHSANPYTGDAEL 235
Query: 261 QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
+AP Y HI++ KR KFAA++ LD+SVG + +AL + +L N+II F +DNGG A
Sbjct: 236 EAPFKYYEKFPHIKNEKRRKFAAMVSALDDSVGNITKALHAKGILENTIICFSTDNGGPA 295
Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+GF+ N ASNWPLRGVK +LWEGGVRG G +WSPL+++ G + + + D+LPTLL AA
Sbjct: 296 SGFDFNDASNWPLRGVKESLWEGGVRGTGFVWSPLMKTSGYTSPAMMQIFDFLPTLLKAA 355
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 49/169 (28%), Positives = 69/169 (40%), Gaps = 35/169 (20%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
++DGID W ++S + + R +LHNID+ AL + K+V L D
Sbjct: 365 DLDGIDQWEIISERKDAVRTEMLHNIDNTEGTRALRINEMKVVVSQGGSTRSWPLWYSTD 424
Query: 634 QVY---------LSGLSDREW-----------------LALAMRKLRDAAS--IQCGPVK 665
Q Y +S +W L R LR I CGPV
Sbjct: 425 QFYGGYNHTDHSVSADLSAKWDIDSEVAPRVNSDLTSVLRNLGRPLRSGTPLKIHCGPVP 484
Query: 666 EVP---CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQ 711
C+ PCLF+I DPCE +NLA D+R + +T + R +Q
Sbjct: 485 PSARGSCDVGKKPCLFNITADPCEFHNLA----DERPDMLSTMLDRLDQ 529
Score = 45.8 bits (107), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 26/36 (72%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQ NV+Y E GL L+E I PQYLK++GY+T I+
Sbjct: 91 MQQNVIYSAEPYGLGLNEIIFPQYLKQIGYKTHIVG 126
>gi|195380485|ref|XP_002049001.1| GJ21349 [Drosophila virilis]
gi|194143798|gb|EDW60194.1| GJ21349 [Drosophila virilis]
Length = 531
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 196/319 (61%), Positives = 230/319 (72%), Gaps = 18/319 (5%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHIIFILADDLG+NDVGF G QIPTPNIDALAYSG+IL YY +CTPSRSA+MTGK+
Sbjct: 21 PHIIFILADDLGFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTGKY 80
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTGMQH VLY E GLPL KILPQYL +LGY + I GKWHLG +++ YTP +RGF
Sbjct: 81 PIHTGMQHTVLYAAEPRGLPLDLKILPQYLNDLGYTSHIAGKWHLGHWQRVYTPLYRGFS 140
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
SH GY WGLDMR E A+DLHG+YSTDV T ++++I H +T
Sbjct: 141 SHHGY-----------------WGLDMRNGTEVAYDLHGQYSTDVITQHSLNVISKHNAT 183
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
PLFLY+AHAA HS NPY PL D + I+ +KR K+AA++ +DESVGK+VE
Sbjct: 184 KGPLFLYVAHAAVHSGNPYNPLPVKDDAVRRLDTIQHYKRRKYAALVTAMDESVGKIVEQ 243
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L + RML SIIVF +DNGG A GFN N ASN+PLRGVKNTLWEGGVRGA L+WSP L
Sbjct: 244 LRKSRMLETSIIVFSTDNGGPAEGFNSNFASNYPLRGVKNTLWEGGVRGAALLWSPQLTK 303
Query: 359 RGIVAEQYVHVSDWLPTLL 377
R AEQ +H+ DWLPTL+
Sbjct: 304 RPRTAEQTMHIVDWLPTLV 322
Score = 80.5 bits (197), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 49/142 (34%), Positives = 70/142 (49%), Gaps = 27/142 (19%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
++DGI +W L +NE S R ++LHNIDD W +AL+ G WKLVK Y G
Sbjct: 338 KLDGISLWQALLKNETSPRKSVLHNIDDIWGSAALSVGDWKLVKGTN---YNGSWDGWYG 394
Query: 634 QVYLSGLSDREWLALA-------------------MRKLRDAASIQCGPV--KEVP---C 669
+ D +W ++A M +LR+AA++ C K+ P C
Sbjct: 395 PAGMRNPHDYDWQSVAKSATGQAMQQLKMLPKTADMIRLREAANVICKDANPKDAPHIAC 454
Query: 670 EPQIAPCLFDIKNDPCEKNNLA 691
P PCLF+++ DPCE+ N A
Sbjct: 455 NPLSEPCLFNVREDPCEQRNQA 476
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 22/34 (64%), Positives = 24/34 (70%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
MQH VLY E GLPL KILPQYL +LGY + I
Sbjct: 86 MQHTVLYAAEPRGLPLDLKILPQYLNDLGYTSHI 119
Score = 43.5 bits (101), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 35/66 (53%), Gaps = 5/66 (7%)
Query: 737 KKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPV--KEVP---CEPQIAPCLFDIKNDPC 791
K + ++ K + M +LR+AA++ C K+ P C P PCLF+++ DPC
Sbjct: 411 KSATGQAMQQLKMLPKTADMIRLREAANVICKDANPKDAPHIACNPLSEPCLFNVREDPC 470
Query: 792 EKNNLA 797
E+ N A
Sbjct: 471 EQRNQA 476
>gi|328699373|ref|XP_001945817.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 567
Score = 397 bits (1020), Expect = e-107, Method: Compositional matrix adjust.
Identities = 195/365 (53%), Positives = 253/365 (69%), Gaps = 20/365 (5%)
Query: 32 TRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
TR +A+L A L + S P+I+FILADDLGWND+ FHG D+IPTPNIDAL
Sbjct: 4 TRSSLWALLAFATILQLA----RTQSSKPNIVFILADDLGWNDLSFHGSDEIPTPNIDAL 59
Query: 92 AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
A++GI+L N YT +CTPSR A+MTGK+PI GMQ YG E GLPLSEK+LP+YL+E
Sbjct: 60 AFNGIVLNNLYTQPVCTPSRVALMTGKYPIKLGMQGPPTYGAEPNGLPLSEKLLPEYLRE 119
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM-----KMWGLDM 206
LGY TR +GKWHLGFYK+ YTPT RGF+SH GY+TG+ Y+D+ +++ + G DM
Sbjct: 120 LGYTTRAIGKWHLGFYKQAYTPTRRGFDSHFGYYTGYVSYYDYLLQDVYQNFGEFQGFDM 179
Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
RR+ AWD+ GKY+TDVFT EAV +I ++PLF+YLAH A H+ N + L+AP
Sbjct: 180 RRNDTIAWDVVGKYATDVFTDEAVRLIKEQPANQPLFMYLAHVAVHTGNRGKYLEAPQSE 239
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA------ 320
+N HI D R +AA++ KLDESVG+VVEAL +++ML N+IIVF+SDNG +
Sbjct: 240 VNKFNHILDPNRRIYAAMVSKLDESVGRVVEALTEKKMLQNTIIVFMSDNGSPSFDGSGR 299
Query: 321 -----AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
AG N SN+P RG+KNTLWEGGV+ A IW+P + V++Q +H++DWLPT
Sbjct: 300 NFRPEAGVTANWGSNFPYRGIKNTLWEGGVKSASFIWAPYFQENPRVSKQLMHITDWLPT 359
Query: 376 LLSAA 380
L SAA
Sbjct: 360 LYSAA 364
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 80/189 (42%), Gaps = 46/189 (24%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN-----GTSEN 564
+DG+D W+ L+ N PS+RN +L NID++ + + + + WKL+ ++ NG G +
Sbjct: 375 LDGVDQWTSLTLNLPSQRNFVLLNIDEKQRYAGILKDNWKLIVGSTANGTLDGFFGATRP 434
Query: 565 RSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRY 624
R+ N+ +VL P+ R L + D + G V+ + MRY
Sbjct: 435 RTPYNAT----------AVLY--SPAGR--ALTKLSDSLPFATAGFGPMSSVRELLAMRY 480
Query: 625 QVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDP 684
+ + P GP + PC P CLFD+ DP
Sbjct: 481 ESTIRCNPS--------------------------AAGPRTQSPC-PAGEACLFDLVADP 513
Query: 685 CEKNNLADR 693
CE NN+A +
Sbjct: 514 CETNNVAKK 522
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 22/36 (61%), Positives = 26/36 (72%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQ YG E GLPLSEK+LP+YL+ELGY TR +
Sbjct: 93 MQGPPTYGAEPNGLPLSEKLLPEYLRELGYTTRAIG 128
>gi|195124259|ref|XP_002006611.1| GI18487 [Drosophila mojavensis]
gi|193911679|gb|EDW10546.1| GI18487 [Drosophila mojavensis]
Length = 528
Score = 394 bits (1012), Expect = e-106, Method: Compositional matrix adjust.
Identities = 192/317 (60%), Positives = 230/317 (72%), Gaps = 21/317 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHIIFILADDLG+NDVGF G QIPTPNIDALAYSG+IL YY +CTPSRSA+MT K+
Sbjct: 21 PHIIFILADDLGFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTAKY 80
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTGMQH VLY E GLPL+ KILPQYL +LGY + I GKWHLG +K+ YTP +RGF
Sbjct: 81 PIHTGMQHTVLYAAEPRGLPLNLKILPQYLNDLGYTSHIAGKWHLGHWKRVYTPLYRGFS 140
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
SH WGLDMR E A+DLHG+Y+TDV T ++++I H ST
Sbjct: 141 SH--------------------WGLDMRNGTELAYDLHGQYTTDVITQHSLNVIAKHNST 180
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
PLFLY+AHAA HS NPY PL A D + I+D+KR K+AA++ +D+SVG++VE
Sbjct: 181 KGPLFLYVAHAAVHSGNPYNPLPAKDDIVRRLGTIQDYKRRKYAALVTAMDDSVGRIVEQ 240
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L + ML NSII+F +DNGG A GFN N ASN+PLRGVKNTLWEGGVRGAGL+WS L
Sbjct: 241 LRKSHMLDNSIIIFSTDNGGPAEGFNSNFASNYPLRGVKNTLWEGGVRGAGLVWSAQLSK 300
Query: 359 RGIVAEQYVHVSDWLPT 375
R VA+Q +H++DWLPT
Sbjct: 301 RPRVADQTMHIADWLPT 317
Score = 85.9 bits (211), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 52/143 (36%), Positives = 71/143 (49%), Gaps = 27/143 (18%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
++DG+ VW L +NE S R ++LHNIDD W +AL+ G WKLVK Y G
Sbjct: 335 QLDGMSVWQALLKNEVSPRKSVLHNIDDIWGSAALSVGDWKLVKGTN---YNGSWDGWYG 391
Query: 634 QVYLSGLSDREWLALA-------------------MRKLRDAASIQC--GPVKEVP---C 669
L D +W +A M + R+AA++ C +K+ P C
Sbjct: 392 PAGTRNLRDYDWYLVAKSLAGQALQQLKVLPKTADMMRAREAANVICKDARLKDAPNISC 451
Query: 670 EPQIAPCLFDIKNDPCEKNNLAD 692
P APCLF+I+ DPCE+ N AD
Sbjct: 452 NPLKAPCLFNIREDPCEQRNQAD 474
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 58/152 (38%), Gaps = 63/152 (41%)
Query: 756 MRKLRDAASIQC--GPVKEVP---CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTT 810
M + R+AA++ C +K+ P C P APCLF+I+
Sbjct: 427 MMRAREAANVICKDARLKDAPNISCNPLKAPCLFNIRE---------------------- 464
Query: 811 EVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELA 870
DP ++ NQ YP VL M KEL
Sbjct: 465 -----DPCEQRNQ-------------------------------ADVYPKVLELMLKELQ 488
Query: 871 NINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
IN TAV P NKP D G P+ +++ W+ FGD
Sbjct: 489 LINSTAVPPANKPDDPLGYPRFWNYTWTNFGD 520
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 22/34 (64%), Positives = 25/34 (73%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
MQH VLY E GLPL+ KILPQYL +LGY + I
Sbjct: 86 MQHTVLYAAEPRGLPLNLKILPQYLNDLGYTSHI 119
>gi|189236827|ref|XP_972832.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
Length = 646
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 189/344 (54%), Positives = 244/344 (70%), Gaps = 5/344 (1%)
Query: 48 MVFVDLVAS---SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
+ F +VAS + P+II I+ADD+G+NDVGFHG ++IPTPNIDALAY+G+IL ++YT
Sbjct: 8 IFFGSVVASFAQTKKPNIIVIVADDMGFNDVGFHGSNEIPTPNIDALAYNGVILNSHYTQ 67
Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
LCTPSRSA +TGK+PIH GMQH V+ E GLPL+E ILPQYLK GY T +GKWHL
Sbjct: 68 ALCTPSRSAFLTGKYPIHLGMQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIGKWHL 127
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
GF++KEYTPT+RGF+SH GYW G QDY+ H+ G DMRR++ W GKYST +
Sbjct: 128 GFFRKEYTPTYRGFDSHFGYWQGLQDYYKHTVHFTPEHGYDMRRNMTVDWSAQGKYSTTL 187
Query: 225 FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAI 284
FT EAV +I H+T+ P+F+YLAH A HS N +PLQAPD + HI D +R +AA+
Sbjct: 188 FTDEAVRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDEEIAKFGHIADPERRIYAAM 247
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
+ LD+SVG V+ AL + ML NSIIVF+SDNG G + N SN+PLRG KN+ WEG
Sbjct: 248 VSMLDKSVGSVIAALRDKHMLENSIIVFMSDNGAKPDGIHANHGSNYPLRGNKNSAWEGA 307
Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDIP 386
+R IWSPL++ V+ +H+SDWLPT +AA NK+++P
Sbjct: 308 MRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNKTELP 351
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 91/217 (41%), Gaps = 40/217 (18%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG--G 631
++DG+D+W+ +S + S R +LHNID+ + AL G WK + G G
Sbjct: 352 KMDGVDMWASISEGKDSPRTELLHNIDEIYNYGALRVGNWKYLYGSTTNGKSDGWYGSSG 411
Query: 632 PDQVY---------------LSGLSDREWL----------------ALAMRKLRDAASIQ 660
D +Y L+GL+ + + + ++ LR AA ++
Sbjct: 412 RDPLYTYDDSAVLASQTGSTLAGLTTYQQIKEKHQGDTNFTHKLLDSETIKTLRGAAEVK 471
Query: 661 CGPV--KEVP----CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAY 714
C V +E+P C +PCLF+IK DPCE+ NLA +N + RF Q A
Sbjct: 472 CPRVNFEEIPESKKCNAVESPCLFNIKEDPCEQINLAAERPMIVLN-MEMALARFKQTAL 530
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
P + + K + + KK+K Y N
Sbjct: 531 PIRNVPRDPNADPAKWNNTWVNWQDYEDVKKQKIYFN 567
Score = 46.6 bits (109), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 36/61 (59%), Gaps = 6/61 (9%)
Query: 752 EEEGMRKLRDAASIQCGPV--KEVP----CEPQIAPCLFDIKNDPCEKNNLADRSEVQRI 805
+ E ++ LR AA ++C V +E+P C +PCLF+IK DPCE+ NLA + +
Sbjct: 457 DSETIKTLRGAAEVKCPRVNFEEIPESKKCNAVESPCLFNIKEDPCEQINLAAERPMIVL 516
Query: 806 N 806
N
Sbjct: 517 N 517
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 23/36 (63%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH V+ E GLPL+E ILPQYLK GY T +
Sbjct: 88 MQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIG 123
>gi|156408341|ref|XP_001641815.1| predicted protein [Nematostella vectensis]
gi|156228955|gb|EDO49752.1| predicted protein [Nematostella vectensis]
Length = 512
Score = 391 bits (1004), Expect = e-105, Method: Compositional matrix adjust.
Identities = 187/335 (55%), Positives = 240/335 (71%), Gaps = 1/335 (0%)
Query: 46 LSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ 105
+S+ FV L A+ P HI+FI+ADDLGW+DV FHG QIPTPNID LA +G+IL NYY
Sbjct: 11 MSLPFVALTANKKP-HIVFIVADDLGWDDVSFHGSGQIPTPNIDGLAKTGVILNNYYVSP 69
Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
+CTP+RSAIMTGK+PIHTGMQH+V+ + GL L+E ++PQYLK LGY T VGKWHLG
Sbjct: 70 ICTPTRSAIMTGKYPIHTGMQHSVILAAQPYGLGLNETLMPQYLKRLGYATHGVGKWHLG 129
Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
F+K EYTP RGF+S+ GYW G DY+DHS E WGLD+ + W G YS+D+F
Sbjct: 130 FFKYEYTPIQRGFDSYFGYWCGKGDYWDHSNNEKYGWGLDLHDSEQDVWTEWGHYSSDLF 189
Query: 226 TAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
+AV++I H+ PLFLYL A HSAN +PLQAP ++ ++I+D +R FAA++
Sbjct: 190 AEKAVNVISTHNASVPLFLYLPFQAVHSANFIQPLQAPPDLIDKFKNIKDERRRIFAAMV 249
Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
+D ++ KVV++L+ R M +NSIIVF +DNGG A GF+ N ASN+PLRGVK TLWEGG+
Sbjct: 250 SSMDGAIKKVVDSLKARSMYNNSIIVFTTDNGGPANGFDSNMASNFPLRGVKRTLWEGGI 309
Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
RG I SPL+ G V + +HVSDWLPTL + A
Sbjct: 310 RGTAFIHSPLITKPGRVMTELMHVSDWLPTLYTVA 344
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 62/121 (51%), Gaps = 14/121 (11%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
+DG D+W +S + S R ++HNID +A +WK+V V +Y P+
Sbjct: 354 LDGFDLWDSISTDAMSPREEMVHNIDPVNWEAAYRFREWKIV--VNQTKYMSGWYPLPN- 410
Query: 635 VYLSGLSDREWLALAMRKLRDAASIQCGPVKEVP--CEPQIAPCLFDIKNDPCEKNNLAD 692
+ +RE LRDA ++CGP E+P C PCLF+IKNDPCE NLA
Sbjct: 411 -----IEERE---PHPATLRDAV-VKCGPPPEIPVNCTASDGPCLFNIKNDPCEYVNLAK 461
Query: 693 R 693
+
Sbjct: 462 K 462
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/50 (52%), Positives = 34/50 (68%), Gaps = 4/50 (8%)
Query: 759 LRDAASIQCGPVKEVP--CEPQIAPCLFDIKNDPCEKNNLADRSEVQRIN 806
LRDA ++CGP E+P C PCLF+IKNDPCE NLA + E++ +N
Sbjct: 421 LRDAV-VKCGPPPEIPVNCTASDGPCLFNIKNDPCEYVNLA-KKELEILN 468
Score = 40.0 bits (92), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 23/32 (71%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
MQH+V+ + GL L+E ++PQYLK LGY T
Sbjct: 89 MQHSVILAAQPYGLGLNETLMPQYLKRLGYAT 120
>gi|270006267|gb|EFA02715.1| hypothetical protein TcasGA2_TC008439 [Tribolium castaneum]
Length = 648
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 190/346 (54%), Positives = 245/346 (70%), Gaps = 7/346 (2%)
Query: 48 MVFVDLVAS---SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
+ F +VAS + P+II I+ADD+G+NDVGFHG ++IPTPNIDALAY+G+IL ++YT
Sbjct: 8 IFFGSVVASFAQTKKPNIIVIVADDMGFNDVGFHGSNEIPTPNIDALAYNGVILNSHYTQ 67
Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
LCTPSRSA +TGK+PIH GMQH V+ E GLPL+E ILPQYLK GY T +GKWHL
Sbjct: 68 ALCTPSRSAFLTGKYPIHLGMQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIGKWHL 127
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHS--AEEMKMWGLDMRRDLEPAWDLHGKYST 222
GF++KEYTPT+RGF+SH GYW G QDY+ H+ A G DMRR++ W GKYST
Sbjct: 128 GFFRKEYTPTYRGFDSHFGYWQGLQDYYKHTVHATFTPEHGYDMRRNMTVDWSAQGKYST 187
Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
+FT EAV +I H+T+ P+F+YLAH A HS N +PLQAPD + HI D +R +A
Sbjct: 188 TLFTDEAVRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDEEIAKFGHIADPERRIYA 247
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
A++ LD+SVG V+ AL + ML NSIIVF+SDNG G + N SN+PLRG KN+ WE
Sbjct: 248 AMVSMLDKSVGSVIAALRDKHMLENSIIVFMSDNGAKPDGIHANHGSNYPLRGNKNSAWE 307
Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDIP 386
G +R IWSPL++ V+ +H+SDWLPT +AA NK+++P
Sbjct: 308 GAMRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNKTELP 353
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 91/217 (41%), Gaps = 40/217 (18%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG--G 631
++DG+D+W+ +S + S R +LHNID+ + AL G WK + G G
Sbjct: 354 KMDGVDMWASISEGKDSPRTELLHNIDEIYNYGALRVGNWKYLYGSTTNGKSDGWYGSSG 413
Query: 632 PDQVY---------------LSGLSDREWL----------------ALAMRKLRDAASIQ 660
D +Y L+GL+ + + + ++ LR AA ++
Sbjct: 414 RDPLYTYDDSAVLASQTGSTLAGLTTYQQIKEKHQGDTNFTHKLLDSETIKTLRGAAEVK 473
Query: 661 CGPV--KEVP----CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAY 714
C V +E+P C +PCLF+IK DPCE+ NLA +N + RF Q A
Sbjct: 474 CPRVNFEEIPESKKCNAVESPCLFNIKEDPCEQINLAAERPMIVLN-MEMALARFKQTAL 532
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
P + + K + + KK+K Y N
Sbjct: 533 PIRNVPRDPNADPAKWNNTWVNWQDYEDVKKQKIYFN 569
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 36/61 (59%), Gaps = 6/61 (9%)
Query: 752 EEEGMRKLRDAASIQCGPV--KEVP----CEPQIAPCLFDIKNDPCEKNNLADRSEVQRI 805
+ E ++ LR AA ++C V +E+P C +PCLF+IK DPCE+ NLA + +
Sbjct: 459 DSETIKTLRGAAEVKCPRVNFEEIPESKKCNAVESPCLFNIKEDPCEQINLAAERPMIVL 518
Query: 806 N 806
N
Sbjct: 519 N 519
Score = 40.8 bits (94), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 23/36 (63%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH V+ E GLPL+E ILPQYLK GY T +
Sbjct: 88 MQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIG 123
>gi|242025544|ref|XP_002433184.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212518725|gb|EEB20446.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 610
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 181/328 (55%), Positives = 239/328 (72%), Gaps = 8/328 (2%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+IIFILADDLGWND+ FHG DQI TPN+DALAY+G+IL + Y + +CTPSR+A+MTG +
Sbjct: 41 PNIIFILADDLGWNDLSFHGSDQIQTPNLDALAYNGVILNSQYVLPVCTPSRAALMTGMY 100
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIH GMQ L E LP + K+LP YLK+LGY TR+VGKWHLG+Y+KE+TPT+RGF+
Sbjct: 101 PIHNGMQGLPLEASEPRALP-AGKLLPSYLKDLGYTTRMVGKWHLGYYQKEFTPTYRGFD 159
Query: 180 SHLGYWTGHQDYFDHSAEE-------MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
SHLGYW G Y+D+ +E + G DMRR++ PA+DL G+Y+T++FT EAV +
Sbjct: 160 SHLGYWNGIVSYYDYILQEDDNRKPRSSLNGFDMRRNITPAYDLQGRYATEMFTDEAVHL 219
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I +H+ + PLFLY++H A H+ NP + L+AP +N HI D R FA ++ KLDESV
Sbjct: 220 IRSHNKNTPLFLYMSHLAVHAGNPGKFLEAPQEAINKFLHIADPNRRTFAGMVQKLDESV 279
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G+++ ALE+ ML NSIIVF +DNG G N SN+PLRG+K TLWEGG RG IW
Sbjct: 280 GRLIAALEESNMLHNSIIVFSADNGAPTIGGYPNWGSNYPLRGIKETLWEGGTRGVSFIW 339
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPL+++ V+ + +H+SDWLPTL +AA
Sbjct: 340 SPLIKNTPRVSNELIHISDWLPTLYAAA 367
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 86/211 (40%), Gaps = 55/211 (26%)
Query: 515 VWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNE 574
+WS L +N P N ++H D W + GN EN N
Sbjct: 338 IWSPLIKNTPRVSNELIHISD------------WLPTLYAAAGGN--MENVKN------- 376
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL----TKGKWKL-VKVVKVMRYQVDLT 629
+DG++ W +S N PS+RN IL NID+ Q +A+ K WKL V V Y
Sbjct: 377 LDGVNQWPSISENLPSERNEILLNIDEILQTAAIRMNSRKYHWKLVVGTVNNGTYDEYFG 436
Query: 630 GGPD-----------------QVY--LSGLSDREWLALA-----MRKLRDAASIQCGPVK 665
G + QVY L L+ + L M+ LR A+I+C
Sbjct: 437 EGTNKFPENNPPYDFNAVANSQVYRTLERLAQKRLLTYPTSKEKMKVLRRDATIRCDYKI 496
Query: 666 EVP--CEPQIAP---CLFDIKNDPCEKNNLA 691
P C P P CL+DIK DPCE NNL
Sbjct: 497 INPNRCNPSDGPNQVCLYDIKRDPCELNNLT 527
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 35/67 (52%), Gaps = 5/67 (7%)
Query: 736 KKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP--CEPQIAP---CLFDIKNDP 790
+ ++ +K+ Y +E M+ LR A+I+C P C P P CL+DIK DP
Sbjct: 461 RTLERLAQKRLLTYPTSKEKMKVLRRDATIRCDYKIINPNRCNPSDGPNQVCLYDIKRDP 520
Query: 791 CEKNNLA 797
CE NNL
Sbjct: 521 CELNNLT 527
>gi|260795396|ref|XP_002592691.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
gi|229277914|gb|EEN48702.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
Length = 485
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 188/337 (55%), Positives = 234/337 (69%), Gaps = 8/337 (2%)
Query: 71 GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
GWNDV FHG DQIPTPN+D+LAYSG+IL NYY +CTP+RSAIMTG+HPIHTG+QH V+
Sbjct: 3 GWNDVSFHGSDQIPTPNLDSLAYSGVILGNYYVSPICTPTRSAIMTGRHPIHTGLQHGVI 62
Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
G GLPL+E ILPQYLK LGY T IVGKWHLG + E+TPTFRGF+S+ GY TG +
Sbjct: 63 SGATPFGLPLNETILPQYLKPLGYATHIVGKWHLGHHAWEFTPTFRGFDSYFGYLTGKDN 122
Query: 191 YFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFL 244
Y+DH S EE+ GLD+R EP W +G YST++F EA II +H T +PLFL
Sbjct: 123 YYDHTDDESNSPEELGYKGLDLRNGTEPVWTENGTYSTELFATEAERIITSHDTSKPLFL 182
Query: 245 YLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRM 304
YL H A HS NP PLQAP Y++ HI+ R FAA++ LD++VG V +AL R M
Sbjct: 183 YLPHQAVHSGNPDNPLQAPQKYIDKFPHIQHPGRRTFAAMVSALDDAVGNVTKALSARGM 242
Query: 305 LSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAE 364
L NS+I+F +DNGG AAGF+ N ASNWPLRGVKNTLWEGGV G G + SPL++
Sbjct: 243 LENSVIIFTTDNGGPAAGFDQNYASNWPLRGVKNTLWEGGVHGTGFVHSPLIKQPKRTTH 302
Query: 365 QYVHVSDWLPTL--LSAANKSDIPNYVNSTVENIIPR 399
+ +HV D LPT+ L+ + +++ N + V I R
Sbjct: 303 ELLHVCDLLPTIYELAGGDSTELKNLDGTNVWETISR 339
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH V+ G GLPL+E ILPQYLK LGY T I+
Sbjct: 57 LQHGVISGATPFGLPLNETILPQYLKPLGYATHIVG 92
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 46/94 (48%), Gaps = 10/94 (10%)
Query: 498 ENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSIN 557
E D++ +DG +VW +SR S R +LHNID + + +AL G +K++ +
Sbjct: 316 ELAGGDSTELKNLDGTNVWETISRGVQSPRVEVLHNIDPKRKTAALRYGDYKIILGEAYK 375
Query: 558 G--------NGTSENRSNDNSYQN--EIDGIDVW 581
G G + N S + ++++ E + I W
Sbjct: 376 GAWDGWYPPEGVTTNHSEEANHEDPCEFNNIADW 409
Score = 40.0 bits (92), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 31/54 (57%)
Query: 563 ENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
E D++ +DG +VW +SR S R +LHNID + + +AL G +K++
Sbjct: 316 ELAGGDSTELKNLDGTNVWETISRGVQSPRVEVLHNIDPKRKTAALRYGDYKII 369
>gi|156402612|ref|XP_001639684.1| predicted protein [Nematostella vectensis]
gi|156226814|gb|EDO47621.1| predicted protein [Nematostella vectensis]
Length = 380
Score = 380 bits (976), Expect = e-102, Method: Compositional matrix adjust.
Identities = 177/331 (53%), Positives = 238/331 (71%), Gaps = 3/331 (0%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHII I+ADDLGW+DV FHG QIPTPN+D LA G+IL NYY +CTP+R+++MTGK+
Sbjct: 5 PHIIVIVADDLGWDDVSFHGSPQIPTPNLDYLATRGVILNNYYVSPICTPTRASLMTGKY 64
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIH GMQH V+Y + GLPL E LPQYL+ GY+T +GKWHLGF+ KEYTPT+RGF+
Sbjct: 65 PIHLGMQHFVIYAAQPYGLPLGEITLPQYLQIQGYKTAGIGKWHLGFFAKEYTPTYRGFD 124
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
S G W+ DY++H++ E WG DMR ++EP KY+T+VFT EA+ +I NH+
Sbjct: 125 SFYGMWSAKADYWNHTSFENGFWGTDMRNNMEPVTTDKDKYATEVFTREALKVIENHNKS 184
Query: 240 EPLFLYLAHAATHSANPYEPLQAP-DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
EPLFLY+AH A HSANP++PLQAP D I+ +R ++AA++ +D+S+G+V A
Sbjct: 185 EPLFLYIAHQAPHSANPHDPLQAPEDKVKKFSGVIDKIERQQYAAMVTCVDDSIGEVFRA 244
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
LE+ RML+NS+I+F +DNGGA GFN N SN+PLRG K+ +WEGGVRG I+S L++
Sbjct: 245 LEKNRMLNNSVILFTTDNGGAPHGFNRNQGSNYPLRGGKDMMWEGGVRGTAFIYSDLIKH 304
Query: 359 RGIVAEQYVHVSDWLPTLLSAANKSDIPNYV 389
+G V+ + V+DW+PTL A + P Y+
Sbjct: 305 KGRVSTDLIDVTDWVPTLYYLAGGT--PGYL 333
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 22/32 (68%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
MQH V+Y + GLPL E LPQYL+ GY+T
Sbjct: 70 MQHFVIYAAQPYGLPLGEITLPQYLQIQGYKT 101
>gi|118779434|ref|XP_309303.3| AGAP011348-PA [Anopheles gambiae str. PEST]
gi|116131546|gb|EAA05277.3| AGAP011348-PA [Anopheles gambiae str. PEST]
Length = 573
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 197/410 (48%), Positives = 269/410 (65%), Gaps = 15/410 (3%)
Query: 41 PLAFTLSMVFVDLVASSG---PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
P TL ++++ AS+ P+I+ I+ADDLGWNDV FHG +QIPTPNIDALAY GII
Sbjct: 6 PYLVTLWLLWLGGGASTAHQERPNIVIIVADDLGWNDVSFHGSNQIPTPNIDALAYDGII 65
Query: 98 LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
L +Y LCTPSR+++MTGKHP++ GMQ +V+ E GL L +K++PQY +E GYRT
Sbjct: 66 LNRHYVPPLCTPSRASLMTGKHPMNIGMQDHVIISDEPWGLGLDQKLMPQYFREAGYRTH 125
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDLEPAWD 215
+VGKWHLGF+++ YTPT+RGF+SH GY + DY+DHS + E GLDMRR+ +D
Sbjct: 126 LVGKWHLGFFRRAYTPTYRGFDSHFGYLGPYIDYWDHSLQMNETSARGLDMRRNTAVNYD 185
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
+G Y+TD+F EAV +I +H+ +PLFL L H A H+ N +PLQAP + +I+D
Sbjct: 186 ANGTYATDLFNDEAVRLIDSHNRSKPLFLVLTHLAPHTGNEDDPLQAPADEIAKFDYIQD 245
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
KR AA++ ++D VG++ +LE+RRML+N+II+F +DNG G + N+ SN+PLRG
Sbjct: 246 PKRRTLAAMVSRIDTGVGRIYRSLERRRMLNNTIILFYADNGAPTLGIHANSGSNYPLRG 305
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVEN 395
K + WEG VRGA LIWS LL +GIV+ Q++HVSDWLPTL AA PN NS ++
Sbjct: 306 QKESPWEGAVRGAALIWSRLLPRKGIVSNQWLHVSDWLPTLGHAAGIRISPN--NSPIDG 363
Query: 396 IIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGT 445
+N ++ T NS R N + T+ K +Y NGT
Sbjct: 364 -----QN---QWSTLTSSSNSGRTVVMNNAHNEFTYSSYIKRGWKYVNGT 405
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 83/172 (48%), Gaps = 23/172 (13%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM----RYQVDLTG 630
IDG + WS L+ + S R +++N +E+ S+ K WK V R+ L
Sbjct: 361 IDGQNQWSTLTSSSNSGRTVVMNNAHNEFTYSSYIKRGWKYVNGTSFKGAYDRWLGQLKD 420
Query: 631 GPDQV----YLSGLSDREWLALAM-------RKLRDAASIQCGP------VKEVPCEPQI 673
G +Q+ Y L + + +M ++LR A ++C V+ +PCEP +
Sbjct: 421 G-EQISHEEYYKRLIASQTIGSSMQLTRDDVKRLRSKAMVECSSAQGTVYVQAIPCEPLV 479
Query: 674 APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKK 725
PCLF++ DPCE+ NLAD+ + + ++V RF + A P + + + +
Sbjct: 480 KPCLFNVVEDPCERVNLADQYP-ELLKDLQSDVERFKRDALPPRNQPSDSRS 530
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 34/52 (65%), Gaps = 6/52 (11%)
Query: 754 EGMRKLRDAASIQCGP------VKEVPCEPQIAPCLFDIKNDPCEKNNLADR 799
+ +++LR A ++C V+ +PCEP + PCLF++ DPCE+ NLAD+
Sbjct: 448 DDVKRLRSKAMVECSSAQGTVYVQAIPCEPLVKPCLFNVVEDPCERVNLADQ 499
>gi|390364995|ref|XP_798154.3| PREDICTED: arylsulfatase I-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 476
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 183/366 (50%), Positives = 251/366 (68%), Gaps = 5/366 (1%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L F L ++ V ++ PP+IIF++ADDLGWNDV FHG QIPTP+IDALA G+IL NY
Sbjct: 11 LTFFLLGCMLECVYTAQPPNIIFVVADDLGWNDVSFHGSSQIPTPHIDALAQEGVILTNY 70
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y +CTP+RSAIMTGKHPIHTG+Q++V+ E GL +E I+PQYL+ LGYRT +VGK
Sbjct: 71 YVSPICTPTRSAIMTGKHPIHTGLQYSVIIADEPYGLGTNETIMPQYLRSLGYRTHMVGK 130
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
WHLGF+K+ TP+ RGFES+ GY+ G QDYF H + E + G D + + G+YS
Sbjct: 131 WHLGFFKESLTPSHRGFESYYGYYGGMQDYFTHESTEHTLTGFDFHVNGSIYKPVFGQYS 190
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY--EPLQAPDHYLNIHRHIEDFKRS 279
T+++T + +II NH+ EPL++YLAH A HSAN Y + LQAP Y +I + R
Sbjct: 191 TEIYTEKTQEIIRNHNPQEPLYIYLAHQAVHSAN-YNGQRLQAPYKYYERFPNITNENRR 249
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
KFAA++ LD+S+G + + L++ + +N++IVF +DNGG A GF+ N A+NWPLRGVK+T
Sbjct: 250 KFAAMVSALDDSLGNITQTLKESSLYNNTVIVFTTDNGGPAHGFDANYANNWPLRGVKDT 309
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL--LSAANKSDIPNYVNSTVENII 397
WEGG+RGAG +W L+E G ++ +HV DW+PTL L+ N S + + V ++
Sbjct: 310 TWEGGLRGAGFLWGALIEKPGRTSDGMMHVCDWVPTLYGLAGGNTSTLQHLDGIDVWPML 369
Query: 398 PRYENS 403
R E S
Sbjct: 370 SRAEPS 375
Score = 75.9 bits (185), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 49/135 (36%), Positives = 68/135 (50%), Gaps = 26/135 (19%)
Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVD 627
+ S +DGIDVW +LSR EPS R ILHNID +SA+ G +KLV+
Sbjct: 353 NTSTLQHLDGIDVWPMLSRAEPSPREEILHNIDPVRNVSAIRIGDYKLVQ---------- 402
Query: 628 LTGGPDQVYLSGLSD------REWLALAMRKLRDAASIQCGPVK----EVPCEPQIAPCL 677
Q Y SD + + + + +A + C P K C+P+ PCL
Sbjct: 403 -----GQNYNGSWSDWYPPEGESSVDVDSKPVPNAFVVSC-PSKPANASTNCDPKEKPCL 456
Query: 678 FDIKNDPCEKNNLAD 692
F+I++DPCE NN+AD
Sbjct: 457 FNIRHDPCEFNNIAD 471
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 37/57 (64%)
Query: 503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN 559
+ S +DGIDVW +LSR EPS R ILHNID +SA+ G +KLV+ + NG+
Sbjct: 353 NTSTLQHLDGIDVWPMLSRAEPSPREEILHNIDPVRNVSAIRIGDYKLVQGQNYNGS 409
Score = 42.7 bits (99), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 15/24 (62%), Positives = 21/24 (87%)
Query: 775 CEPQIAPCLFDIKNDPCEKNNLAD 798
C+P+ PCLF+I++DPCE NN+AD
Sbjct: 448 CDPKEKPCLFNIRHDPCEFNNIAD 471
>gi|156406805|ref|XP_001641235.1| predicted protein [Nematostella vectensis]
gi|156228373|gb|EDO49172.1| predicted protein [Nematostella vectensis]
Length = 498
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 172/324 (53%), Positives = 239/324 (73%), Gaps = 1/324 (0%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHII ++ADDLGW+D+ FHG QIPTPN+DALA SG+IL NYY + TPSR++ MTGK+
Sbjct: 1 PHIIMVVADDLGWDDISFHGSPQIPTPNLDALANSGVILNNYYVSPMDTPSRASFMTGKY 60
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIH G+QH+ L+ + G+PL+EK LP++L+E+GY+T VGKW LGF+ KEYTPT+RGF+
Sbjct: 61 PIHMGVQHDTLHNRQPFGVPLTEKFLPEFLREMGYQTHAVGKWQLGFFAKEYTPTYRGFD 120
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
S G+WT H+DY++H A + +G+D+RR+L+ + D G Y T++ EA ++I NHS D
Sbjct: 121 SFFGFWTSHEDYYNHVAND-GGYGIDLRRNLDVSNDHTGVYGTELLAREADEVIENHSGD 179
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
+PLFLYLAH A H N EPLQAP +++ ++I D +R FA ++ LDES+ ++V +L
Sbjct: 180 KPLFLYLAHQAVHVGNMDEPLQAPKRHVDKFKYITDERRRTFAGMVSSLDESMRQLVTSL 239
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
+++ + NSII+F +DNGGAA G +++A SN+PLRG KNTLWEGGVRG + SPL++
Sbjct: 240 KRKGLYQNSIIIFTTDNGGAAGGLDMSAGSNFPLRGNKNTLWEGGVRGVAFVHSPLIKRP 299
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
G V + +H SDWLPTL A S
Sbjct: 300 GRVYDGLMHASDWLPTLHLLAGGS 323
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 63/127 (49%), Gaps = 17/127 (13%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDD-EWQISALTKGKWKLVKVVKVMRYQVDLTGGP 632
+IDG D+W +S+ PS R +LH ID + +AL G +K++ + + +++ D P
Sbjct: 329 QIDGYDLWDGISKGTPSPRYEVLHGIDTMQGNRAALRVGDYKVI-LNQDLKFYSDWYKRP 387
Query: 633 DQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVP--------CEPQIAPCLFDIKNDP 684
+ + E + +L A++ C P C PQ PCLF+IK DP
Sbjct: 388 E-------GEHELDYIRKPRLLSNATVDCTIKSPHPLLYTHAPLCNPQKKPCLFNIKWDP 440
Query: 685 CEKNNLA 691
CE +NLA
Sbjct: 441 CEYHNLA 447
Score = 40.0 bits (92), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 27/36 (75%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH+ L+ + G+PL+EK LP++L+E+GY+T +
Sbjct: 66 VQHDTLHNRQPFGVPLTEKFLPEFLREMGYQTHAVG 101
>gi|390364993|ref|XP_003730725.1| PREDICTED: arylsulfatase I-like isoform 1 [Strongylocentrotus
purpuratus]
Length = 479
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 184/369 (49%), Positives = 250/369 (67%), Gaps = 8/369 (2%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L F L ++ V ++ PP+IIF++ADDLGWNDV FHG QIPTP+IDALA G+IL NY
Sbjct: 11 LTFFLLGCMLECVYTAQPPNIIFVVADDLGWNDVSFHGSSQIPTPHIDALAQEGVILTNY 70
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y +CTP+RSAIMTGKHPIHTG+Q++V+ E GL +E I+PQYL+ LGYRT +VGK
Sbjct: 71 YVSPICTPTRSAIMTGKHPIHTGLQYSVIIADEPYGLGTNETIMPQYLRSLGYRTHMVGK 130
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE---EMKMWGLDMRRDLEPAWDLHG 218
WHLGFY KE+TP RGFES GY+ G QDYF H + + + G D + + G
Sbjct: 131 WHLGFYSKEHTPIERGFESTFGYYLGQQDYFTHETQVKRKHTLTGFDFHVNGSIYKPVFG 190
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY--EPLQAPDHYLNIHRHIEDF 276
+YST+++T + +II NH+ EPL++YLAH A HSAN Y + LQAP Y +I +
Sbjct: 191 QYSTEIYTEKTQEIIRNHNPQEPLYIYLAHQAVHSAN-YNGQRLQAPYKYYERFPNITNE 249
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
R KFAA++ LD+S+G + + L++ + +N++IVF +DNGG A GF+ N A+NWPLRGV
Sbjct: 250 NRRKFAAMVSALDDSLGNITQTLKESSLYNNTVIVFTTDNGGPAHGFDANYANNWPLRGV 309
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL--LSAANKSDIPNYVNSTVE 394
K+T WEGG+RGAG +W L+E G ++ +HV DW+PTL L+ N S + + V
Sbjct: 310 KDTTWEGGLRGAGFLWGALIEKPGRTSDGMMHVCDWVPTLYGLAGGNTSTLQHLDGIDVW 369
Query: 395 NIIPRYENS 403
++ R E S
Sbjct: 370 PMLSRAEPS 378
Score = 75.9 bits (185), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 49/135 (36%), Positives = 68/135 (50%), Gaps = 26/135 (19%)
Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVD 627
+ S +DGIDVW +LSR EPS R ILHNID +SA+ G +KLV+
Sbjct: 356 NTSTLQHLDGIDVWPMLSRAEPSPREEILHNIDPVRNVSAIRIGDYKLVQ---------- 405
Query: 628 LTGGPDQVYLSGLSD------REWLALAMRKLRDAASIQCGPVK----EVPCEPQIAPCL 677
Q Y SD + + + + +A + C P K C+P+ PCL
Sbjct: 406 -----GQNYNGSWSDWYPPEGESSVDVDSKPVPNAFVVSC-PSKPANASTNCDPKEKPCL 459
Query: 678 FDIKNDPCEKNNLAD 692
F+I++DPCE NN+AD
Sbjct: 460 FNIRHDPCEFNNIAD 474
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 37/57 (64%)
Query: 503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN 559
+ S +DGIDVW +LSR EPS R ILHNID +SA+ G +KLV+ + NG+
Sbjct: 356 NTSTLQHLDGIDVWPMLSRAEPSPREEILHNIDPVRNVSAIRIGDYKLVQGQNYNGS 412
Score = 42.7 bits (99), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 15/24 (62%), Positives = 21/24 (87%)
Query: 775 CEPQIAPCLFDIKNDPCEKNNLAD 798
C+P+ PCLF+I++DPCE NN+AD
Sbjct: 451 CDPKEKPCLFNIRHDPCEFNNIAD 474
>gi|157108842|ref|XP_001650409.1| arylsulfatase b [Aedes aegypti]
gi|108879187|gb|EAT43412.1| AAEL005134-PA [Aedes aegypti]
Length = 675
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 175/330 (53%), Positives = 231/330 (70%), Gaps = 4/330 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+ I+ADD+GWNDVGFHG +QIPTPNIDALAY GIIL +YT +CTPSR+++MTGK+
Sbjct: 32 PHIVIIMADDMGWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTGKN 91
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PI+ GMQH V+ E GL L +KI+P+Y KE GYRT +VGKWHLGF K+YTPT RGF+
Sbjct: 92 PINIGMQHYVIVSDEPWGLGLDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRGFD 151
Query: 180 SHLGYWTGHQDYFDHS---AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
+H+GY + DY+D++ + G DMR +L +D +G Y+TD FT A II H
Sbjct: 152 THVGYLGPYVDYWDYTLKFSPPKSFQGYDMRNNLNVDYDSNGTYATDHFTKAASSIIERH 211
Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
T +PLFL + H A H+AN +PLQAP+ + +I D +R +AA++ KLD+SVG++
Sbjct: 212 DTKDPLFLVVNHLAPHAANDDDPLQAPEEDIRKFDYISDERRRIYAAMVSKLDDSVGQIF 271
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
+L + ML NSII+F+SDNG A + N SN+PLRG+K+ WE R IWSPLL
Sbjct: 272 NSLRSKNMLDNSIILFMSDNGAPTAALHANTGSNYPLRGIKSVPWEAATRCVAAIWSPLL 331
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ R V+ Q++H+SDWLPTL SAA DIP
Sbjct: 332 QERQRVSNQFIHISDWLPTLASAAGI-DIP 360
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 61/141 (43%), Gaps = 25/141 (17%)
Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
+EIDG D W LS + + R +L+ ID+ + S+ + +K V G
Sbjct: 365 HSEIDGQDQWEALSYDTGNPRRVVLNMIDEIYGYSSYMENGFKFVNGTYSNGSYDGWYGQ 424
Query: 632 P---DQVYLSGLSDREWLALAMRK-----------------LRDAASIQCGPVKEV-PCE 670
P DQ LSD +++ L ++ LR A + C E C
Sbjct: 425 PNTSDQT----LSDDQYIDLVLQTEITRWAGETISRDTIKYLRKHARVNCNHQPEANKCN 480
Query: 671 PQIAPCLFDIKNDPCEKNNLA 691
P PCLFDI NDPCE N+L+
Sbjct: 481 PLKRPCLFDIINDPCELNDLS 501
Score = 42.7 bits (99), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 754 EGMRKLRDAASIQCGPVKEV-PCEPQIAPCLFDIKNDPCEKNNLA 797
+ ++ LR A + C E C P PCLFDI NDPCE N+L+
Sbjct: 457 DTIKYLRKHARVNCNHQPEANKCNPLKRPCLFDIINDPCELNDLS 501
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH V+ E GL L +KI+P+Y KE GYRT ++
Sbjct: 97 MQHYVIVSDEPWGLGLDQKIMPEYFKEAGYRTHLVG 132
>gi|170040779|ref|XP_001848165.1| arylsulfatase B [Culex quinquefasciatus]
gi|167864376|gb|EDS27759.1| arylsulfatase B [Culex quinquefasciatus]
Length = 585
Score = 370 bits (951), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 182/352 (51%), Positives = 246/352 (69%), Gaps = 13/352 (3%)
Query: 42 LAFTLSMVFVDLV---------ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALA 92
L L ++FV ++ SS P+I+ I+ADDLGWNDV FHG QIPTPNIDALA
Sbjct: 4 LPVLLGLLFVAVLNVFGEKSTDRSSNRPNIVVIIADDLGWNDVSFHGSLQIPTPNIDALA 63
Query: 93 YSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKEL 152
YSGIIL +Y LCTPSR+++MTGKHPI+ GMQH+V+ E GL L +K++P+Y +E
Sbjct: 64 YSGIILNRHYAPPLCTPSRASLMTGKHPINIGMQHHVIEVDEPWGLGLDQKLMPEYFREA 123
Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDL 210
GYRTR+VGKWHLGF++K YTPT RGF+SH GY + DY+DHS + GLDMRR+L
Sbjct: 124 GYRTRLVGKWHLGFFRKAYTPTMRGFDSHYGYIGPYIDYWDHSLQMSNTSTRGLDMRRNL 183
Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNH--STDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
+ + G Y+TD+FT EAV +IH+H ++ PLFL + H A H+ N +P+QAP+ +
Sbjct: 184 QVDYSARGTYATDLFTREAVRLIHDHNQTSANPLFLVVTHLAPHTGNEDDPMQAPEEDVE 243
Query: 269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
+ I+D KR AA++ KLD+ VG++V+AL++ ML+N++++F +DNG G + N
Sbjct: 244 LFSFIKDPKRRVLAAMISKLDDGVGQIVQALKKSGMLNNTVVLFYADNGAPTIGKHSNGG 303
Query: 329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SN+PLRG K + WEG VR A IWSPLL + G V+ Q++HVSDWLPTL AA
Sbjct: 304 SNFPLRGQKYSPWEGAVRTAAAIWSPLLNNTGRVSNQWIHVSDWLPTLARAA 355
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 66/149 (44%), Gaps = 35/149 (23%)
Query: 573 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVK------------ 620
+EIDG D W L+ + R T++HNID S+ TK WKLV
Sbjct: 364 SEIDGQDQWEALNNPNLTGRTTVMHNIDQFQHYSSYTKNGWKLVNGTSWEGAFDDWMGSL 423
Query: 621 -----------VMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCG--PVKEV 667
+R + G +V L ++ ++R A++ C P +
Sbjct: 424 DEEDELSEEEYSVRLISSVVGRLTKVDLDDVA----------RIRRDATVHCPYIPGGTI 473
Query: 668 PCEPQIAPCLFDIKNDPCEKNNLADRSED 696
CEPQ +PCLF+I DPCE NN+A++ D
Sbjct: 474 DCEPQKSPCLFNILEDPCETNNIANQRPD 502
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 32/48 (66%), Gaps = 2/48 (4%)
Query: 754 EGMRKLRDAASIQCG--PVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 799
+ + ++R A++ C P + CEPQ +PCLF+I DPCE NN+A++
Sbjct: 452 DDVARIRRDATVHCPYIPGGTIDCEPQKSPCLFNILEDPCETNNIANQ 499
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 28/51 (54%)
Query: 508 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
+EIDG D W L+ + R T++HNID S+ T+ WKLV S G
Sbjct: 364 SEIDGQDQWEALNNPNLTGRTTVMHNIDQFQHYSSYTKNGWKLVNGTSWEG 414
Score = 40.8 bits (94), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 26/36 (72%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH+V+ E GL L +K++P+Y +E GYRTR++
Sbjct: 96 MQHHVIEVDEPWGLGLDQKLMPEYFREAGYRTRLVG 131
>gi|449680619|ref|XP_002157149.2| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
Length = 502
Score = 370 bits (949), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 178/332 (53%), Positives = 247/332 (74%), Gaps = 3/332 (0%)
Query: 50 FVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTP 109
F+ +V + PHII I+ADDLGWND+ FHG ++IPTPNID LA +G+IL NYY + +CTP
Sbjct: 10 FIMIVIHADKPHIIMIVADDLGWNDISFHGSNEIPTPNIDRLANNGVILDNYYVLPICTP 69
Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
SRSAIMTG++PIHTGMQ + ++G G+ L+EK LPQYLK+ GY+T VGKWHLGF+ K
Sbjct: 70 SRSAIMTGRYPIHTGMQQDTIFGPNPYGVGLNEKFLPQYLKQQGYKTHGVGKWHLGFFAK 129
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
+YTPT+RGF+S+ G + G DY++HS E GLD+ + + G YST+++TAEA
Sbjct: 130 QYTPTYRGFDSYYGSYLGKGDYWNHSNTETYS-GLDLHDNENGVFSQDGNYSTEMYTAEA 188
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYE-PLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
+ I+NH++ EPLFLYLA+ A HSAN E PLQAP +++ +I+ +R K+AA+L +
Sbjct: 189 ISCINNHNSSEPLFLYLAYQAVHSANTEEDPLQAPQEWIDKFSYIKHEQRRKYAAMLGYM 248
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
D VG+V +AL +++ML NSII+F +DNGG A GF+ N A+N+PLRGVK TL+EGGVRG
Sbjct: 249 DYGVGRVHDALAKKKMLDNSIIIFTTDNGGPANGFDYNWANNFPLRGVKATLFEGGVRGV 308
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
++S L+ES V+ + +H++DWLPTL++ A
Sbjct: 309 SFVYSKLIESPR-VSHELIHITDWLPTLVNLA 339
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 60/123 (48%), Gaps = 20/123 (16%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDE-WQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
+DG D W+ L + S+RN +L NID++ W+ AL G WK++K G
Sbjct: 348 LDGFDQWATLQNKQSSQRNEVLLNIDEKVWKNEALRVGSWKIIK-----------EGNYW 396
Query: 634 QVYLSGLSDREWLALAMRKLRDAASIQCG---PVKEVPCEPQIAPCLFDIKNDPCEKNNL 690
+ S E + L +++++CG P+ C+ + CLF I DPCE N+L
Sbjct: 397 DGWYPPPSFNEQSNNSFSYL--SSTVKCGHDIPIVINHCD---SYCLFHIDEDPCEINDL 451
Query: 691 ADR 693
+ +
Sbjct: 452 SKK 454
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 30/45 (66%), Gaps = 1/45 (2%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDE-WQISALTRGKWKLVKE 553
+DG D W+ L + S+RN +L NID++ W+ AL G WK++KE
Sbjct: 348 LDGFDQWATLQNKQSSQRNEVLLNIDEKVWKNEALRVGSWKIIKE 392
>gi|157108840|ref|XP_001650408.1| arylsulfatase b [Aedes aegypti]
gi|108879186|gb|EAT43411.1| AAEL005134-PB [Aedes aegypti]
Length = 607
Score = 370 bits (949), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 175/330 (53%), Positives = 231/330 (70%), Gaps = 4/330 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+ I+ADD+GWNDVGFHG +QIPTPNIDALAY GIIL +YT +CTPSR+++MTGK+
Sbjct: 32 PHIVIIMADDMGWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTGKN 91
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PI+ GMQH V+ E GL L +KI+P+Y KE GYRT +VGKWHLGF K+YTPT RGF+
Sbjct: 92 PINIGMQHYVIVSDEPWGLGLDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRGFD 151
Query: 180 SHLGYWTGHQDYFDHS---AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
+H+GY + DY+D++ + G DMR +L +D +G Y+TD FT A II H
Sbjct: 152 THVGYLGPYVDYWDYTLKFSPPKSFQGYDMRNNLNVDYDSNGTYATDHFTKAASSIIERH 211
Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
T +PLFL + H A H+AN +PLQAP+ + +I D +R +AA++ KLD+SVG++
Sbjct: 212 DTKDPLFLVVNHLAPHAANDDDPLQAPEEDIRKFDYISDERRRIYAAMVSKLDDSVGQIF 271
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
+L + ML NSII+F+SDNG A + N SN+PLRG+K+ WE R IWSPLL
Sbjct: 272 NSLRSKNMLDNSIILFMSDNGAPTAALHANTGSNYPLRGIKSVPWEAATRCVAAIWSPLL 331
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ R V+ Q++H+SDWLPTL SAA DIP
Sbjct: 332 QERQRVSNQFIHISDWLPTLASAAGI-DIP 360
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 61/141 (43%), Gaps = 25/141 (17%)
Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
+EIDG D W LS + + R +L+ ID+ + S+ + +K V G
Sbjct: 365 HSEIDGQDQWEALSYDTGNPRRVVLNMIDEIYGYSSYMENGFKFVNGTYSNGSYDGWYGQ 424
Query: 632 P---DQVYLSGLSDREWLALAMRK-----------------LRDAASIQCGPVKEV-PCE 670
P DQ LSD +++ L ++ LR A + C E C
Sbjct: 425 PNTSDQT----LSDDQYIDLVLQTEITRWAGETISRDTIKYLRKHARVNCNHQPEANKCN 480
Query: 671 PQIAPCLFDIKNDPCEKNNLA 691
P PCLFDI NDPCE N+L+
Sbjct: 481 PLKRPCLFDIINDPCELNDLS 501
Score = 42.7 bits (99), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 754 EGMRKLRDAASIQCGPVKEV-PCEPQIAPCLFDIKNDPCEKNNLA 797
+ ++ LR A + C E C P PCLFDI NDPCE N+L+
Sbjct: 457 DTIKYLRKHARVNCNHQPEANKCNPLKRPCLFDIINDPCELNDLS 501
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH V+ E GL L +KI+P+Y KE GYRT ++
Sbjct: 97 MQHYVIVSDEPWGLGLDQKIMPEYFKEAGYRTHLVG 132
>gi|312382061|gb|EFR27642.1| hypothetical protein AND_05535 [Anopheles darlingi]
Length = 881
Score = 366 bits (940), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 179/340 (52%), Positives = 237/340 (69%), Gaps = 6/340 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+FI+ADD+G NDVGFHG +QIPTPNIDALAY GIIL +Y+ +CTPSR+A+MTG+H
Sbjct: 34 PHIVFIMADDMGLNDVGFHGSNQIPTPNIDALAYDGIILNRHYSAPMCTPSRAALMTGRH 93
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P++ GMQH V+ E GL L EK+LPQY +E GYRT ++GKWHLGFY + Y PT RGF+
Sbjct: 94 PMNVGMQHYVIDSDEPWGLGLQEKLLPQYFREAGYRTHLIGKWHLGFYAEPYLPTNRGFD 153
Query: 180 SHLGYWTGHQDYFDHSA--EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
+H+GY + DY+ + + + G D+R++L + +G Y+TD FT AV+II +H+
Sbjct: 154 THIGYLGPYIDYWSYISKMDSATFEGYDLRQNLAVNYKPNGTYATDYFTEAAVEIIRSHN 213
Query: 238 -TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
T E + L L H A H+ N PLQAP+ + +I D R +AA++ KLD+SVG+V
Sbjct: 214 RTGERMLLVLNHLAPHTGNDDAPLQAPEETIEKFAYIRDTDRRTYAAMMSKLDDSVGQVY 273
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
AL+Q ML NSIIVF+SDNGG G + N+ SN+PLRG K++ WEG VR A LIWSPLL
Sbjct: 274 RALQQHGMLENSIIVFISDNGGVTRGMHSNSGSNYPLRGQKHSPWEGAVRTAALIWSPLL 333
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENI 396
+ R V+ Q+ H+SDWLPTL SAA IP Y + +I
Sbjct: 334 KDRQRVSNQWFHISDWLPTLASAAG---IPAYRGDNLSDI 370
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/193 (23%), Positives = 77/193 (39%), Gaps = 37/193 (19%)
Query: 573 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGP 632
++IDGID W L+ + R +++N+D+ + ++ K V + G
Sbjct: 368 SDIDGIDQWEALAYGTGNPRQRVMNNLDEIFGYTSYVDNGLKYVNGTTLDGVNDGWYGAI 427
Query: 633 DQVYLSGLSDREWLALAMRK---------------LRDAASIQCG-PVKEVPCEPQIAPC 676
D + S+ E+L M LR+ A I CG + C P PC
Sbjct: 428 DDTD-TLPSNAEYLRTVMASHVGRLTKPDEKVVSYLREHARIDCGNDIGNRDCNPLRRPC 486
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKK 736
LFDI NDPCE ++L +R YPDK + +++ +K + + K +
Sbjct: 487 LFDIINDPCEMHDLFER--------------------YPDKVRQIQERVEKYRLRAAKPR 526
Query: 737 KKKKKKKKKKKKY 749
+ K+
Sbjct: 527 NRPHDPAADPSKF 539
Score = 46.6 bits (109), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 1/49 (2%)
Query: 752 EEEGMRKLRDAASIQCG-PVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 799
+E+ + LR+ A I CG + C P PCLFDI NDPCE ++L +R
Sbjct: 455 DEKVVSYLREHARIDCGNDIGNRDCNPLRRPCLFDIINDPCEMHDLFER 503
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH V+ E GL L EK+LPQY +E GYRT ++
Sbjct: 99 MQHYVIDSDEPWGLGLQEKLLPQYFREAGYRTHLIG 134
>gi|443734861|gb|ELU18717.1| hypothetical protein CAPTEDRAFT_218441 [Capitella teleta]
Length = 500
Score = 365 bits (937), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 177/327 (54%), Positives = 231/327 (70%), Gaps = 3/327 (0%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
+G P+++FILADDLGW+D+ HG +IPTPNID LA GI+L NYY +CTPSR+A+MT
Sbjct: 22 AGHPNVVFILADDLGWDDISLHGSQEIPTPNIDLLATDGILLNNYYVQPICTPSRAALMT 81
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
G+HP+H G+QH+V+ + GL L+E +LPQYLK LGY T +VGKWHLGFY KE+TPT R
Sbjct: 82 GRHPVHLGLQHDVIVWAQPYGLGLNETLLPQYLKTLGYSTHMVGKWHLGFYDKEHTPTKR 141
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKM-WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
GF+SHLGY+TG +DY+DH+ K WGLD D E G+YST+VFT+EA +I
Sbjct: 142 GFDSHLGYYTGCEDYYDHTWGFTKQDWGLDFWHDREVDRSAFGQYSTEVFTSEAERVIAE 201
Query: 236 HSTDEPLFLYLAHAATHSANPYEP--LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
H +PLFLYLA A HS NP LQAP Y+ I+ +R FA ++ LD+SV
Sbjct: 202 HDVSKPLFLYLAQQAVHSGNPGNKVRLQAPWKYVKNFMGIKSEERRVFAGMVSALDDSVS 261
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
V +AL ++ +L+++II+F +DNGG A + N A NWPLRG K T+WEGGVRG G IWS
Sbjct: 262 NVTKALHEKGILNDTIIIFSTDNGGPANRQDYNDACNWPLRGSKRTMWEGGVRGNGFIWS 321
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA 380
PLLE+ G V+E + + DW+PT+L AA
Sbjct: 322 PLLENSGYVSEHLMQIVDWVPTVLEAA 348
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 63/150 (42%), Gaps = 22/150 (14%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
+DG+ W LSR+E S R +LHNID AL K+V GG +
Sbjct: 359 LDGLSQWKALSRHESSPRKVLLHNIDPINGDGALRTTDMKIV-----------FGGGDTE 407
Query: 635 VYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRS 694
Y E L + K + + C+P + PC+F+I DPCE N+AD
Sbjct: 408 RYPGWYEPEEALIRSPSKRPSSTA----------CKPWLKPCVFNITADPCEYYNIADFH 457
Query: 695 EDQRINHYTTEVGRFNQIAYPDKEEEEEKK 724
D + +V FNQ A P + + K
Sbjct: 458 PDI-LQELLQKVDEFNQTAVPPRNVPSDPK 486
>gi|241619161|ref|XP_002407085.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215500931|gb|EEC10425.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 588
Score = 363 bits (932), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 183/366 (50%), Positives = 241/366 (65%), Gaps = 11/366 (3%)
Query: 31 RTRIMAFAVLPLAFTLSMVFVDLVASS--GP---PHIIFILADDLGWNDVGFHGLDQIPT 85
R R++ A + L + ++ +S P PHI+FILADDLGWNDV ++G QI T
Sbjct: 31 RFRLVNIATIGAVILLFIALAAMLLTSRRSPVRQPHIVFILADDLGWNDVSYNGCPQIRT 90
Query: 86 PNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKIL 145
PNIDALA++GI L+ YYT +CTPSR+A+MTG++PIHTGMQH V+ E GLPL K+L
Sbjct: 91 PNIDALAWNGIRLQRYYTQPMCTPSRAALMTGRYPIHTGMQHFVILQNEPRGLPLKFKLL 150
Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH---SAEEMKMW 202
PQ+L +LGY ++++GKWHLGFYKKEYTPT RGF+ H+G W G DY+ H +
Sbjct: 151 PQWLGDLGYVSQMLGKWHLGFYKKEYTPTMRGFQKHIGSWGGFVDYYSHIRFNKIGFSHS 210
Query: 203 GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA 262
GLD R+ L + G+Y T+ T A +I NH ++PLFLYLAH A H AN ++PLQ
Sbjct: 211 GLDFRQGLSEGREFDGQYYTEFMTEAATRVIENHPLEKPLFLYLAHLAPHGANRHDPLQV 270
Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
P Y + + I + R+ +A ++ LDESVG VVEAL +R MLS++++VF SDNGG G
Sbjct: 271 PKKYSDKYHDIGHWNRTMYAGMVSALDESVGAVVEALGKRGMLSDTVLVFSSDNGGDTNG 330
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES-RGIVAEQYVHVSDWLPTL--LSA 379
N N AS+WP +G K TLWEGG+ G IWSPL RG H+SDWLPTL L+
Sbjct: 331 ENPNYASSWPFKGQKRTLWEGGIHVPGFIWSPLFSGMRGFDYNNIFHISDWLPTLYQLAG 390
Query: 380 ANKSDI 385
+ SD+
Sbjct: 391 GDPSDL 396
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 69/145 (47%), Gaps = 19/145 (13%)
Query: 567 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV---KVMR 623
D S +IDGI LSR + R +L NID +SA+ +G +KLV V V+
Sbjct: 391 GDPSDLGDIDGISHLDSLSRRSETPRKELLINIDPIENVSAIIEGHFKLVSGVVRGGVLD 450
Query: 624 YQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAA-SIQCG--------PVK------EVP 668
+ G Y + E +L R LR+A + CG P+K P
Sbjct: 451 EWFQVPGNITWDYNRARQECE-TSLVARVLRNAGHDVACGSEDGSFPTPIKCGKRDPSKP 509
Query: 669 CEPQIAPCLFDIKNDPCEKNNLADR 693
C P +APCLFD+ DPCE NN+A++
Sbjct: 510 CVPTVAPCLFDLSKDPCEYNNIAEQ 534
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 27/38 (71%), Gaps = 5/38 (13%)
Query: 765 IQCG---PVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 799
I+CG P K PC P +APCLFD+ DPCE NN+A++
Sbjct: 499 IKCGKRDPSK--PCVPTVAPCLFDLSKDPCEYNNIAEQ 534
>gi|195441662|ref|XP_002068622.1| GK20325 [Drosophila willistoni]
gi|194164707|gb|EDW79608.1| GK20325 [Drosophila willistoni]
Length = 550
Score = 359 bits (922), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 183/396 (46%), Positives = 250/396 (63%), Gaps = 18/396 (4%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
S P+II IL DD+G+NDV FHG +QI TPNIDALAY+G++L Y LCTPSR+ ++T
Sbjct: 24 STKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGVLLNKLYVPNLCTPSRATLLT 83
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
GK+PIHTGMQH V+ E GLP E+++P++ ++ GY T+++GKWHLGF++K+ TPT R
Sbjct: 84 GKYPIHTGMQHYVIITDEPWGLPKQERLMPEFFRDAGYSTQLIGKWHLGFWEKDLTPTMR 143
Query: 177 GFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
GF+ H GY+ G+ DY+DH+ + G+D RRDL+P +G Y+TD FTAEA +I
Sbjct: 144 GFDHHYGYYNGYIDYYDHTLHMLTKNYTKGVDFRRDLDPCPQDNGTYATDAFTAEAKRVI 203
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
H PLF+ L+H A H+ N P+QAP+ + HI D KR +A ++ LD+SVG
Sbjct: 204 EQHDKSRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHITDPKRRTYAGMISSLDKSVG 263
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
+ V+AL ML+NSII+ SDNG G + N SN+P RG K + WEGG+R AG IWS
Sbjct: 264 QTVKALSDNGMLNNSIILLYSDNGAPTVGIHSNQGSNYPFRGQKESPWEGGIRSAGAIWS 323
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAAN-------KSDIPNYVNSTVENIIPRYENSILR 406
PLL+ RG V+ Q +H DWLPTL +AAN K D N S NI P+ +L
Sbjct: 324 PLLQQRGYVSNQIIHAIDWLPTLAAAANVELPENLKLDGVNMWPSLSGNIEPQ-PRRLLH 382
Query: 407 YENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYE 442
+ Y+S N +Y NG+ +E +Y+
Sbjct: 383 VMDDVFGYSS--YMRDNLKYVNGS-----SFEGQYD 411
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 77/167 (46%), Gaps = 28/167 (16%)
Query: 574 EIDGIDVWSVLSRN-EPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM----RYQVDL 628
++DG+++W LS N EP R +LH +DD + S+ + K V ++ DL
Sbjct: 359 KLDGVNMWPSLSGNIEPQPRR-LLHVMDDVFGYSSYMRDNLKYVNGSSFEGQYDQWLGDL 417
Query: 629 TGGPDQ----VYLSGLSDREWLALA---------MRKLRDAASIQCGPVKE-------VP 668
G D Y + E A+ + ++R+ A+ +C P+ E
Sbjct: 418 ESGQDDPLSPYYAQQVLSSEVHAVIGSQSVTNERIEQMREEATHRC-PLNEQDHLQEIYL 476
Query: 669 CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
CEP APC F++ DPCE+ NLA+ Q + + EV +F Q A P
Sbjct: 477 CEPLKAPCFFNLAKDPCERFNLANLYPLQ-LQFLSQEVEQFRQNAVP 522
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 33/57 (57%), Gaps = 8/57 (14%)
Query: 754 EGMRKLRDAASIQCGPVKE-------VPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
E + ++R+ A+ +C P+ E CEP APC F++ DPCE+ NLA+ +Q
Sbjct: 450 ERIEQMREEATHRC-PLNEQDHLQEIYLCEPLKAPCFFNLAKDPCERFNLANLYPLQ 505
>gi|157103779|ref|XP_001648126.1| arylsulfatase b [Aedes aegypti]
gi|108880481|gb|EAT44706.1| AAEL003960-PA [Aedes aegypti]
Length = 472
Score = 359 bits (921), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 169/266 (63%), Positives = 205/266 (77%)
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
MTGK+PIHTGMQH VLYG E GLPL+EK+LPQYLKELGY+ I GKWHLG Y +++TP
Sbjct: 1 MTGKYPIHTGMQHAVLYGMEPRGLPLTEKLLPQYLKELGYKNHIYGKWHLGSYTRKHTPL 60
Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
RGF+SH+G+WTGH FDH+A E WGLDMRR + A+DLHG Y+T V E+V I
Sbjct: 61 ERGFDSHVGFWTGHHHMFDHTAVETNAWGLDMRRGFDVAYDLHGYYTTHVIRDESVAAIR 120
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
H+T +P+FLY++HAATHSANPY+ L APD + I ++ R KFAA+L +LD+SVG
Sbjct: 121 AHNTSQPMFLYVSHAATHSANPYDFLPAPDETVERLAGISNYSRRKFAAMLTELDQSVGA 180
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
++ AL +R ML NSIIVF +DNGG A GFN NAASNWPLRG KNTLWEGGVRGAG IWSP
Sbjct: 181 IIVALSERGMLDNSIIVFSTDNGGPAEGFNNNAASNWPLRGTKNTLWEGGVRGAGCIWSP 240
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
L+E + V+ Q +H+SDWLPTLL AA
Sbjct: 241 LIEEKRRVSHQIMHISDWLPTLLDAA 266
Score = 79.3 bits (194), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 70/149 (46%), Gaps = 26/149 (17%)
Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVK-------- 620
N + +DGI VW L + ++R+ ILHNIDD W +ALT WK+VK
Sbjct: 271 NKLPSNLDGISVWPELRNGDVTRRHEILHNIDDIWGSAALTVDDWKVVKGTNYEGKWDAW 330
Query: 621 --------VMRYQV-DLTGGPDQVYLSGLS----DREWLALAMRKLRDAASIQCGPVKEV 667
Y + + G P LS L + E + KLR A++QC
Sbjct: 331 YGPAGDHDTKSYNLSSIFGCPTGKALSKLKMLPPEEE-----IIKLRREATVQCANGTRN 385
Query: 668 PCEPQIAPCLFDIKNDPCEKNNLADRSED 696
C P +PCLFD+ NDPCE NLAD+ D
Sbjct: 386 KCNPLESPCLFDLFNDPCEFENLADQYPD 414
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 28/34 (82%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
MQH VLYG E GLPL+EK+LPQYLKELGY+ I
Sbjct: 11 MQHAVLYGMEPRGLPLTEKLLPQYLKELGYKNHI 44
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/58 (46%), Positives = 33/58 (56%)
Query: 742 KKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 799
K K K EE + KLR A++QC C P +PCLFD+ NDPCE NLAD+
Sbjct: 354 KALSKLKMLPPEEEIIKLRREATVQCANGTRNKCNPLESPCLFDLFNDPCEFENLADQ 411
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 31/47 (65%)
Query: 858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGDDL 904
YPD+L + K+L++ N TAV P N P D GDP+ + + W FGD+L
Sbjct: 412 YPDILEVLLKKLSDYNATAVPPGNMPLDSRGDPRFWGYTWHNFGDEL 458
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 33/55 (60%)
Query: 504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
N + +DGI VW L + ++R+ ILHNIDD W +ALT WK+VK + G
Sbjct: 271 NKLPSNLDGISVWPELRNGDVTRRHEILHNIDDIWGSAALTVDDWKVVKGTNYEG 325
>gi|170040781|ref|XP_001848166.1| arylsulfatase b [Culex quinquefasciatus]
gi|167864377|gb|EDS27760.1| arylsulfatase b [Culex quinquefasciatus]
Length = 657
Score = 358 bits (920), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 178/358 (49%), Positives = 233/358 (65%), Gaps = 20/358 (5%)
Query: 53 LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
L S PHI+ I+ADD+GWNDVGFHG +QIPTPNIDALAY GIIL +YT +CTPSR+
Sbjct: 24 LYESGQKPHIVIIMADDMGWNDVGFHGSNQIPTPNIDALAYGGIILNRHYTAPMCTPSRA 83
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
AIMTG++PI GMQH V+ E GL L +KI+P+Y +E GYRT +VGKWHLGF+ ++YT
Sbjct: 84 AIMTGRNPISVGMQHYVIDSDEPWGLGLDQKIMPEYFREAGYRTHLVGKWHLGFFAQQYT 143
Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
PT RGF+SH Y G DMR +L +D +G Y+TD FT A I
Sbjct: 144 PTMRGFDSHTNY-----------------TGYDMRHNLAVDYDANGTYATDHFTDAASRI 186
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I H+ EPLFL + H A H+ N +PLQAP+ + HI D R +AA++ KLD+SV
Sbjct: 187 IDKHNPSEPLFLMVNHLAPHTGNDNDPLQAPEERIKKFEHISDENRRIYAAMVSKLDDSV 246
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G + ++L + M+ NSII+F+SDNG + G + N+ SN+PLRG KN+ WEG R IW
Sbjct: 247 GAIFKSLRAKNMIQNSIILFISDNGAPSFGLHSNSGSNFPLRGQKNSPWEGATRNVAAIW 306
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENG 410
SPLLE R V+ QY+H+SDWLPT S A +P NS+ + + ++E L Y+ G
Sbjct: 307 SPLLEERQRVSNQYIHISDWLPTFASIAGIR-LPQARNSSEVDGVDQWE--ALSYDTG 361
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 71/167 (42%), Gaps = 36/167 (21%)
Query: 573 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG- 631
+E+DG+D W LS + R IL+ ID+ + S+ + +K V TGG
Sbjct: 345 SEVDGVDQWEALSYDTGYPRRVILNMIDEIFGYSSYMENGFKYVN---------GTTGGG 395
Query: 632 ----------------PDQVYLSGLSDREWLALA--------MRKLRDAASIQC-GPVKE 666
D+ Y+ + + E L+ ++ LR + + C P+
Sbjct: 396 NFDGWYGQLNNEDPHPSDEEYIKTVLETEIAQLSGDVLSSELIKHLRKHSRVVCHKPLHP 455
Query: 667 VPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIA 713
C P CLFDI NDPCE N+++ R E + T+V ++A
Sbjct: 456 TTCNPLKRACLFDIVNDPCELNDISAR-EPFKFRELRTKVETLRRLA 501
Score = 42.7 bits (99), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 1/47 (2%)
Query: 754 EGMRKLRDAASIQC-GPVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 799
E ++ LR + + C P+ C P CLFDI NDPCE N+++ R
Sbjct: 436 ELIKHLRKHSRVVCHKPLHPTTCNPLKRACLFDIVNDPCELNDISAR 482
>gi|443701814|gb|ELU00075.1| hypothetical protein CAPTEDRAFT_177949 [Capitella teleta]
Length = 545
Score = 357 bits (916), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 177/351 (50%), Positives = 237/351 (67%), Gaps = 16/351 (4%)
Query: 44 FTLSMVFVDLVASSGP-----PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
FTL FV +V ++G P+I++ILADDLGW+D+ HG +QIPTPNIDALA GI+L
Sbjct: 7 FTLC-SFVCIVFAAGSVRNQRPNIVYILADDLGWDDISLHGSEQIPTPNIDALAADGILL 65
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
NYY +CTPSR+A++TGKHP+H G+QHN + GL L+E+ILP+YL LGY + +
Sbjct: 66 NNYYVQPICTPSRAALLTGKHPVHLGLQHNTIPAPSAYGLGLNERILPEYLNTLGYDSHM 125
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEP 212
VGKWHLG++ ++TPT+RGF+SH GY G +DY DH S M WGLD D +
Sbjct: 126 VGKWHLGYFTPQHTPTYRGFKSHFGYLNGCEDYLDHTLAYDFSTLGMDGWGLDFWNDTKI 185
Query: 213 AWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANP---YEPLQAPDHYLNI 269
G+YST++FT A ++I + +T EPLFLY++H A HS NP LQAP Y N
Sbjct: 186 HRTSFGQYSTEIFTTRAEELIRS-NTGEPLFLYMSHQAVHSGNPGLNGSKLQAPWKYFNK 244
Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
+I+ +R + A ++ LD+SVG + L + ML+N+IIVF +DNGG A F+ N AS
Sbjct: 245 FNYIQSDERRRLAGMISALDDSVGNITRTLHAKGMLNNTIIVFSTDNGGPADRFDDNCAS 304
Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
NWPLRG K T+WEGG+RG G +WSPLL+ V+E ++ + DWLPTLL AA
Sbjct: 305 NWPLRGGKRTMWEGGLRGNGFVWSPLLKHSNYVSEHFMSIMDWLPTLLDAA 355
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 63/139 (45%), Gaps = 18/139 (12%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVK-----------VVKVM 622
+++G+ W LS N PS R+ +LHNID L G K+V M
Sbjct: 365 DVEGMSQWKSLSENSPSPRDYLLHNIDPLTGEGGLRLGDMKVVFGGGNTKAHRGWYPPEM 424
Query: 623 RYQVDLTGGPDQVYLSGLSD-REWLALAMRKLRDAASIQCGPVK---EVPCEPQIAPCLF 678
R+ + P ++ + L D L + + + ++CGP C+ I PCLF
Sbjct: 425 RWGNETVSAPSHLFETDLWDIFHDLGRPPVQKKISVQVECGPAPPDVTTNCKSWIEPCLF 484
Query: 679 DIKNDPCEKNNLADRSEDQ 697
+I DPCE +NL SEDQ
Sbjct: 485 NISADPCEYHNL---SEDQ 500
>gi|198466304|ref|XP_002135153.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
gi|198150538|gb|EDY73780.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
Length = 577
Score = 356 bits (913), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 178/398 (44%), Positives = 248/398 (62%), Gaps = 18/398 (4%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
+S P+II IL DD+G+NDV FHG +QI TPNIDALAY+GI+L +Y LCTPSR+ ++
Sbjct: 24 ASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSRATLL 83
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TGK+PIHTGMQH V+ E GLP E+++P+ ++ GY T +VGKWHLGF++K+ TPT
Sbjct: 84 TGKYPIHTGMQHFVIVTDEPWGLPRQERLMPELFRDAGYATHLVGKWHLGFWRKDLTPTM 143
Query: 176 RGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
RGF+ H GY+ G+ DY+D + + GLD RRDLEP + G Y+T+ FT EA +
Sbjct: 144 RGFDHHFGYYNGYMDYYDQTVRMLDRNYSTGLDFRRDLEPCREAEGTYATEAFTTEARKV 203
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I H + PLF+ L+H A H+ N P+QAP+ + HI D KR +A ++ LD+SV
Sbjct: 204 IERHDKNRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHIRDPKRRTYAGMISSLDKSV 263
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G+ + AL ML+NSI++ SDNG G + NA SN+P RG K + WEGG+R G +W
Sbjct: 264 GQTMRALADNGMLNNSIVLLYSDNGAPTVGIHSNAGSNYPFRGQKESPWEGGIRSVGALW 323
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS-------IL 405
SPLL+ RG V+ Q +H DWLPTL +AA S +P+ + N+ P S +L
Sbjct: 324 SPLLQQRGYVSNQAIHAIDWLPTLAAAAGVS-LPSDLRLDGLNLWPSLSASAQPQPRNLL 382
Query: 406 RYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYEN 443
+ Y+S +Y NG+ +E RY++
Sbjct: 383 HVLDDVFGYSS--YTQDTLKYVNGS-----SFEGRYDH 413
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 51/155 (32%)
Query: 575 IDGIDVWSVLSRN-EPSKRNTILHNIDDEWQISALTKGKWKLV----------------- 616
+DG+++W LS + +P RN +LH +DD + S+ T+ K V
Sbjct: 361 LDGLNLWPSLSASAQPQPRN-LLHVLDDVFGYSSYTQDTLKYVNGSSFEGRYDHWLGELE 419
Query: 617 ------------KVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPV 664
++V+ Q L ++ DR +R++RD A+ +C P+
Sbjct: 420 DGEEDPNSRDYGQLVRASEVQAVLGS------VAPTDDR------IRQMRDEATQRC-PI 466
Query: 665 KE-------VPCEPQIAPCLFDIKNDPCEKNNLAD 692
+E CEP A CLFD+ NDPCE+ NLA+
Sbjct: 467 QERDPEDPVYLCEPLKAACLFDLANDPCERYNLAN 501
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 47/91 (51%), Gaps = 8/91 (8%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE------- 772
E E ++ + + + + + + ++ +R++RD A+ +C P++E
Sbjct: 417 ELEDGEEDPNSRDYGQLVRASEVQAVLGSVAPTDDRIRQMRDEATQRC-PIQERDPEDPV 475
Query: 773 VPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
CEP A CLFD+ NDPCE+ NLA+ +Q
Sbjct: 476 YLCEPLKAACLFDLANDPCERYNLANLYPLQ 506
>gi|291242646|ref|XP_002741217.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 526
Score = 354 bits (908), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 180/353 (50%), Positives = 235/353 (66%), Gaps = 9/353 (2%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
L L LS++ LV SS P+IIFILADDLGW+DV FHG QIPTPNID LAYSG++L
Sbjct: 7 LILKVVLSLLSWKLVISSKQPNIIFILADDLGWDDVSFHGSHQIPTPNIDELAYSGVLLH 66
Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
NYY +CTP+R A++TG++P+H G+QH V+ E GLPL+E LP YLK+LGY T +V
Sbjct: 67 NYYVQPVCTPTRGALLTGRYPMHLGLQHFVITPNEPVGLPLNETTLPTYLKKLGYSTHMV 126
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
GKWHLGF+ KEYTPT+RGF+SH GY+ GHQDY+ H+A WG D+R +++ G+
Sbjct: 127 GKWHLGFFAKEYTPTYRGFDSHYGYFLGHQDYYTHNALWNNQWGFDLRHNMDLQRSTFGE 186
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANP---YEPLQAPDHYLNIHRHIEDF 276
Y ++FT +A +I++H +PLFLY AH A H N L+AP Y + HI D
Sbjct: 187 YGPELFTTQAEKLIYDHDHKKPLFLYFAHQAVHYGNSGPNGTLLEAPYKYTSRFPHIADH 246
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
+R +A ++ LD+SVG + AL++ + +NSIIVF +DNGG A N PLRG
Sbjct: 247 QRRIYAGMVSALDDSVGNITRALKKSGLYNNSIIVFSTDNGGPHEC----EAFNTPLRGT 302
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL--LSAANKSDIPN 387
KNTLWEGG+RGA I S LLE V E +HV DWLPT+ ++ N SD+ N
Sbjct: 303 KNTLWEGGIRGAAFIHSVLLEKPKRVCEGMMHVVDWLPTMYHVAGGNLSDLHN 355
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 56/125 (44%), Gaps = 10/125 (8%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
+DG ++W LS S R ILHNID Q +AL G K++ + V P Q
Sbjct: 356 LDGFNMWDTLSSGVSSPRTEILHNIDPITQTAALRVGNLKVIVGENKSDWFV-----PPQ 410
Query: 635 VYLSGLSDREWLALAMRKLRDAASIQCGPVKEVP---CEPQIAPCLFDIKNDPCEKNNLA 691
+ + L + C V V C+P+ PCL++IKNDPCE NLA
Sbjct: 411 ITFG--KNGTNLTKQFGSDHRGIIVHCSSVPPVAYKNCDPKQKPCLYNIKNDPCEYYNLA 468
Query: 692 DRSED 696
D D
Sbjct: 469 DEYPD 473
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 32/55 (58%), Gaps = 9/55 (16%)
Query: 765 IQCGPVKEVP---CEPQIAPCLFDIKNDPCEKNNLAD------RSEVQRINHYTT 810
+ C V V C+P+ PCL++IKNDPCE NLAD +S + ++N Y +
Sbjct: 433 VHCSSVPPVAYKNCDPKQKPCLYNIKNDPCEYYNLADEYPDVLQSLLYKLNDYNS 487
>gi|24666175|ref|NP_649023.1| CG7402 [Drosophila melanogaster]
gi|7293925|gb|AAF49287.1| CG7402 [Drosophila melanogaster]
Length = 579
Score = 353 bits (907), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 172/356 (48%), Positives = 234/356 (65%), Gaps = 8/356 (2%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
+L ++ LS+ + S P+I+ IL DD+G NDV FHG +QI TPNIDALAY+GI
Sbjct: 7 LVLLVVSSILSLAYGS--GYSTKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGI 64
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
+L +Y LCTPSR+ ++TGK+PIHTGMQH V+ E GLP E+++P+ ++ GY T
Sbjct: 65 LLNKHYVPNLCTPSRATLLTGKYPIHTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYST 124
Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPA 213
+VGKWHLGF++K+ TPT RGF+ H GY+ G+ DY+DH + GLD RRDLEP
Sbjct: 125 HLVGKWHLGFWRKDLTPTMRGFDHHFGYYNGYIDYYDHQVRMLDRNYSAGLDFRRDLEPC 184
Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
+ +G Y+T+ FT+EA II H +PLF+ L+H A H+ N P+QAP+ + HI
Sbjct: 185 PEANGTYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHI 244
Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
D KR +A ++ LD+SV + + AL+ ML+NSII+ SDNG G + NA SN+P
Sbjct: 245 RDPKRRTYAGMISSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPY 304
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS---DIP 386
RG K + WEGG+R AG +WSPLL+ RG V+ Q +H DWLPTL AA S D+P
Sbjct: 305 RGQKESPWEGGIRSAGALWSPLLKERGYVSNQAIHAVDWLPTLAGAAGVSLPQDLP 360
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 82/214 (38%), Gaps = 58/214 (27%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN-----GTSEN 564
+DGI++W +LS NE K T++H +D+ + S+ R K V +S G G E
Sbjct: 361 LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMRDTLKYVNGSSFKGRYDQWLGELET 420
Query: 565 RSND---NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKV 621
+D SY+ + DV S+L LTK + ++
Sbjct: 421 NEDDPLGESYEQHVLASDVQSLLGNR-------------------GLTKDR------IRQ 455
Query: 622 MRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIK 681
MR + T P + G + E CEP APC FD+
Sbjct: 456 MRSEATETCPP----IEGQNPLE--------------------SHFKCEPLKAPCFFDLA 491
Query: 682 NDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
DPCE+ NLA Q + E+ + + A P
Sbjct: 492 KDPCERYNLAQMYPLQ-LQQLADELEQIRKTAIP 524
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 30/55 (54%), Gaps = 7/55 (12%)
Query: 756 MRKLRDAASIQCGPVK-------EVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
+R++R A+ C P++ CEP APC FD+ DPCE+ NLA +Q
Sbjct: 453 IRQMRSEATETCPPIEGQNPLESHFKCEPLKAPCFFDLAKDPCERYNLAQMYPLQ 507
>gi|195494692|ref|XP_002094947.1| GE19935 [Drosophila yakuba]
gi|194181048|gb|EDW94659.1| GE19935 [Drosophila yakuba]
Length = 577
Score = 353 bits (907), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 169/338 (50%), Positives = 227/338 (67%), Gaps = 6/338 (1%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A S P+I+ IL DD+G NDV FHG +QI TPNIDALAY+GI+L +Y LCTPSR+ +
Sbjct: 23 AYSAKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATL 82
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TGK+PIHTGMQH V+ E GLP E+++P+ ++ GY T +VGKWHLGF++K+ TPT
Sbjct: 83 LTGKYPIHTGMQHFVIITDEPWGLPSRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPT 142
Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
RGF+ H GY+ G+ DY+DH + GLD RRDLEP + +G Y+T+ FT+EA
Sbjct: 143 MRGFDHHFGYYNGYIDYYDHQVRMLDRNYSHGLDFRRDLEPCPEANGTYATEAFTSEAKR 202
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
II H +PLF+ ++H A H+ N P+QAP+ + HI D KR +A ++ LD+S
Sbjct: 203 IIEQHDKSKPLFMVMSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKS 262
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
V + + AL+ ML+NSII+ SDNG G + NA SN+P RG K + WEGG+R AG +
Sbjct: 263 VAQTIGALKDNDMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEGGIRSAGAL 322
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS---DIP 386
WSPLL+ RG V+ Q +H DWLPTL AA S D+P
Sbjct: 323 WSPLLKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLP 360
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 25/165 (15%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM-RYQV------- 626
+DGI++W +LS NE K T++H +D+ + S+ K K + RY
Sbjct: 361 LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMKDTLKYINGSSFQGRYDHWLGELKT 420
Query: 627 ----DLTGGPDQVYLSG-----LSDREWLALAMRKLRDAASIQCGPVK-------EVPCE 670
L+ +Q L+ L +R +R++R A+ C PV+ + CE
Sbjct: 421 NEDHPLSESYEQHVLASDVQSILGNRWLTEDRIRQMRSEATEMCPPVEGLNPLEPQFKCE 480
Query: 671 PQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
P APC FD+ DPCE+ NLA+ Q + EV + + A P
Sbjct: 481 PLKAPCFFDLAKDPCERYNLAETYPLQ-LQQLADEVEQIRKTAIP 524
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 34/58 (58%), Gaps = 7/58 (12%)
Query: 753 EEGMRKLRDAASIQCGPVK-------EVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
E+ +R++R A+ C PV+ + CEP APC FD+ DPCE+ NLA+ +Q
Sbjct: 450 EDRIRQMRSEATEMCPPVEGLNPLEPQFKCEPLKAPCFFDLAKDPCERYNLAETYPLQ 507
>gi|194871664|ref|XP_001972882.1| GG13640 [Drosophila erecta]
gi|190654665|gb|EDV51908.1| GG13640 [Drosophila erecta]
Length = 578
Score = 353 bits (905), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 172/338 (50%), Positives = 226/338 (66%), Gaps = 6/338 (1%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A S P+I+ IL DD+G NDV FHG +QI TPNIDALAY+GI+L +Y LCTPSR+ +
Sbjct: 23 AYSTKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATL 82
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TGK+PIHTGMQH V+ E GLP E+++P+ +E GY T +VGKWHLGF+ K+ TPT
Sbjct: 83 LTGKYPIHTGMQHFVIITDEPWGLPQRERLMPEIFREAGYSTHLVGKWHLGFWHKDLTPT 142
Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
RGF+ H GY+ G+ DY+DH + GLD RRDLEP + +G Y+T+ FTAEA
Sbjct: 143 RRGFDHHFGYYNGYIDYYDHQVRMLDRNYSAGLDFRRDLEPCPEANGTYATEAFTAEAKR 202
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
II H +PLF+ ++H A H+ N P+QAP+ + HI D KR +A ++ LD+S
Sbjct: 203 IIEQHDKSKPLFMVMSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKS 262
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
V + V AL+ ML+NSII+ SDNG G + NA SN+P RG K + WEGG+R AG +
Sbjct: 263 VAQTVGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEGGIRSAGAL 322
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS---DIP 386
WSPLL+ RG V+ Q +H DWLPTL AA S D+P
Sbjct: 323 WSPLLKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLP 360
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 74/156 (47%), Gaps = 25/156 (16%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV----------KVVKVMRY 624
+DGI++W +LS NE K T++H +D+ + S+ K K V + + +
Sbjct: 361 LDGINLWPLLSGNEQPKPRTMIHVLDEVFGYSSYMKDTLKYVNGSSFQGRYDQWLGELET 420
Query: 625 QVD--LTGGPDQVYLSG-----LSDREWLALAMRKLRDAASIQCGPVK-------EVPCE 670
D L+ +Q L+ L +R +R++R A+ C PV+ CE
Sbjct: 421 SADDPLSESYEQHVLASDVQSILGNRGLTEDRIRQMRSEATELCPPVEGQNPLEPHFKCE 480
Query: 671 PQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEV 706
P APC FD+ NDPCE+ NLA R+ ++ EV
Sbjct: 481 PLKAPCFFDLANDPCERYNLA-RTYPLQLQQLADEV 515
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 7/58 (12%)
Query: 753 EEGMRKLRDAASIQCGPVK-------EVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
E+ +R++R A+ C PV+ CEP APC FD+ NDPCE+ NLA +Q
Sbjct: 450 EDRIRQMRSEATELCPPVEGQNPLEPHFKCEPLKAPCFFDLANDPCERYNLARTYPLQ 507
>gi|195591209|ref|XP_002085335.1| GD14734 [Drosophila simulans]
gi|194197344|gb|EDX10920.1| GD14734 [Drosophila simulans]
Length = 579
Score = 352 bits (904), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 172/351 (49%), Positives = 231/351 (65%), Gaps = 8/351 (2%)
Query: 44 FTLSMVFVDLVAS--SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
+S +F S S P+I+ IL DD+G NDV FHG +QI TPNIDALAY+GI+L +
Sbjct: 10 LVVSTIFSLAFGSGYSTKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKH 69
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y LCTPSR+ ++TGK+PIHTGMQH V+ E GLP E+++P+ ++ GY T +VGK
Sbjct: 70 YVPNLCTPSRATLLTGKYPIHTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGK 129
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHG 218
WHLGF++K+ TPT RGF+ H GY+ G+ DY+DH + GLD RRDLEP + +G
Sbjct: 130 WHLGFWRKDLTPTMRGFDHHFGYYNGYIDYYDHQVRLLDRNYSAGLDFRRDLEPCPEANG 189
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
Y+T+ FT+EA II H +PLF+ L+H A H+ N P+QAP+ + HI D KR
Sbjct: 190 TYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKR 249
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
+A ++ LD+SV + + AL+ ML+NSII+ SDNG G + NA SN+P RG K
Sbjct: 250 RTYAGMVSSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKE 309
Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS---DIP 386
+ WEGG+R AG +WSPLL+ RG V+ Q +H DWLPTL AA S D+P
Sbjct: 310 SPWEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLP 360
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 73/165 (44%), Gaps = 25/165 (15%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKV-MRYQV------- 626
+DGI++W +LS NE K T++H +D+ + S+ + K V RY
Sbjct: 361 LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMRDTLKYVNGSSFEGRYDQWLGELET 420
Query: 627 ----DLTGGPDQVYLSG-----LSDREWLALAMRKLRDAASIQCGPVK-------EVPCE 670
L+ +Q L+ L +R +R++R A+ C PV+ CE
Sbjct: 421 NEDDPLSESYEQHVLASDVQSLLGNRGLTEDRIRQMRSEATETCPPVEGQNPLEPHFKCE 480
Query: 671 PQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
P APC FD+ DPCE+ NLA Q + EV + + A P
Sbjct: 481 PLKAPCFFDLAKDPCERYNLAQIYPLQ-LQQLADEVEQIRKTAIP 524
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 32/58 (55%), Gaps = 7/58 (12%)
Query: 753 EEGMRKLRDAASIQCGPVK-------EVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
E+ +R++R A+ C PV+ CEP APC FD+ DPCE+ NLA +Q
Sbjct: 450 EDRIRQMRSEATETCPPVEGQNPLEPHFKCEPLKAPCFFDLAKDPCERYNLAQIYPLQ 507
>gi|427793479|gb|JAA62191.1| Putative arylsulfatase b, partial [Rhipicephalus pulchellus]
Length = 512
Score = 351 bits (900), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 178/342 (52%), Positives = 231/342 (67%), Gaps = 13/342 (3%)
Query: 71 GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
GWNDV +HG QI TPNIDALA++GI L+ YY LCTPSR+A++TG++PI+ G+QH+V+
Sbjct: 1 GWNDVSYHGCPQIRTPNIDALAWNGIRLRRYYAQPLCTPSRAALLTGRYPINMGLQHSVI 60
Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
Y E GLPLS+ +LPQ+L +LGY T +GKWH+GF+KKEYTPT RGFE H+G+W + D
Sbjct: 61 YNEEPRGLPLSDTLLPQWLADLGYVTHHLGKWHIGFFKKEYTPTMRGFERHVGFWGAYID 120
Query: 191 YFDHSAEEM---KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLA 247
Y+ H + + GLDMRR+L A + G+Y T + T EA+++I NH D+PLFLYLA
Sbjct: 121 YYKHEKAYLGPTRSPGLDMRRNLFLARNDTGRYVTQLLTKEALEVIENHPVDKPLFLYLA 180
Query: 248 HAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSN 307
H A HSA P +PLQ PD Y+ +R I +R K+A ++ LDESVG V EAL + MLSN
Sbjct: 181 HLAPHSAGPQDPLQVPDKYIEQYRDIGSTERMKYAGMVSALDESVGLVFEALAKNGMLSN 240
Query: 308 SIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR---GIVAE 364
++ VF SDNG AAG N N AS+WPLRG K T WEGGVR A L+WS L R G E
Sbjct: 241 TLFVFTSDNGADAAGPNANYASSWPLRGQKYTPWEGGVRAAALLWSHALGPRVRAGGDYE 300
Query: 365 QYVHVSDWLPTLLSAANKS-------DIPNYVNSTVENIIPR 399
H++DWLPTL A D ++VN+ V + P+
Sbjct: 301 HLFHITDWLPTLYQLAGGDPAHLGDIDGVSHVNALVGDASPQ 342
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/174 (28%), Positives = 78/174 (44%), Gaps = 29/174 (16%)
Query: 566 SNDNSYQNEIDGIDVWSVL---SRNEPSK--RNTILHNIDDEWQISALTKGKWKLVKVVK 620
D ++ +IDG+ + L + +P + R +L NID SA+ G++K V V
Sbjct: 317 GGDPAHLGDIDGVSHVNALVGDASPQPPQPPRTEVLVNIDPIDNYSAIIDGRFKFVAGVA 376
Query: 621 ---VMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRD------------AASIQCG--- 662
+ + G D L E +A R L+ AA I CG
Sbjct: 377 ERGIYEQWYPILGNVDWNSTEPLEQCEASPVA-RVLKSFGPEPACGPSGGAAPIDCGVRD 435
Query: 663 PVKEVPCEPQIAPCLFDIKNDPCEKNNL-ADRSEDQRINHYTTEVGRFNQIAYP 715
P K C+P +APCLFD++ DPCE N++ A+R E + T++ R+ + P
Sbjct: 436 PAKA--CQPTVAPCLFDLREDPCEYNDVSAERPE--LVKSLLTKLRRYEDASMP 485
Score = 48.5 bits (114), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 32/47 (68%), Gaps = 6/47 (12%)
Query: 762 AASIQCG---PVKEVPCEPQIAPCLFDIKNDPCEKNNL-ADRSEVQR 804
AA I CG P K C+P +APCLFD++ DPCE N++ A+R E+ +
Sbjct: 426 AAPIDCGVRDPAKA--CQPTVAPCLFDLREDPCEYNDVSAERPELVK 470
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 25/32 (78%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
+QH+V+Y E GLPLS+ +LPQ+L +LGY T
Sbjct: 55 LQHSVIYNEEPRGLPLSDTLLPQWLADLGYVT 86
>gi|195328507|ref|XP_002030956.1| GM25726 [Drosophila sechellia]
gi|194119899|gb|EDW41942.1| GM25726 [Drosophila sechellia]
Length = 579
Score = 351 bits (900), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 171/351 (48%), Positives = 231/351 (65%), Gaps = 8/351 (2%)
Query: 44 FTLSMVFVDLVAS--SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
+S +F S S P+I+ IL DD+G NDV FHG +QI TPNIDALAY+GI+L +
Sbjct: 10 LVVSTIFSLAFGSGYSTKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKH 69
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y LCTPSR+ ++TGK+PI+TGMQH V+ E GLP E+++P+ ++ GY T +VGK
Sbjct: 70 YVPNLCTPSRATLLTGKYPIYTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGK 129
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHG 218
WHLGF++K+ TPT RGF+ H GY+ G+ DY+DH + GLD RRDLEP + +G
Sbjct: 130 WHLGFWRKDLTPTMRGFDHHFGYYNGYIDYYDHQVRLLDRNYSAGLDFRRDLEPCPEANG 189
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
Y+T+ FT+EA II H +PLF+ L+H A H+ N P+QAP+ + HI D KR
Sbjct: 190 TYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKR 249
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
+A ++ LD+SV + + AL+ ML+NSII+ SDNG G + NA SN+P RG K
Sbjct: 250 RTYAGMVSSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKE 309
Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS---DIP 386
+ WEGG+R AG +WSPLL+ RG V+ Q +H DWLPTL AA S D+P
Sbjct: 310 SPWEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLP 360
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 72/165 (43%), Gaps = 25/165 (15%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM-RYQVDL----T 629
+DGI++W +LS NE K T++H +D+ + S+ K K V RY L T
Sbjct: 361 LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMKDTLKYVNGSSFEGRYDQWLGELET 420
Query: 630 GGPDQV------------YLSGLSDREWLALAMRKLRDAASIQCGPVK-------EVPCE 670
D + S L +R +R++R A+ C PV+ CE
Sbjct: 421 NEDDPLSESYEKHVLASDVQSLLGNRGLTEDRIRQMRSEATETCPPVEGQNPLEPHFKCE 480
Query: 671 PQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
P APC FD+ DPCE+ NLA Q + EV + + A P
Sbjct: 481 PLKAPCFFDLAKDPCERYNLAQIYPLQ-LQELADEVEQIRKTAIP 524
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 32/58 (55%), Gaps = 7/58 (12%)
Query: 753 EEGMRKLRDAASIQCGPVK-------EVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
E+ +R++R A+ C PV+ CEP APC FD+ DPCE+ NLA +Q
Sbjct: 450 EDRIRQMRSEATETCPPVEGQNPLEPHFKCEPLKAPCFFDLAKDPCERYNLAQIYPLQ 507
>gi|241619159|ref|XP_002407084.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215500930|gb|EEC10424.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 502
Score = 350 bits (897), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 172/327 (52%), Positives = 220/327 (67%), Gaps = 5/327 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+FILADDLGWNDV +HG QI TPNIDALA++GI L YYT LCTPSRSA +TG +
Sbjct: 26 PHIVFILADDLGWNDVSYHGSPQILTPNIDALAWNGIRLNRYYTQPLCTPSRSAFLTGCY 85
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P++TGMQH+V+ E GLPL K+LPQ+L + GY +R++GKWHLG+YK+EYTPT RGF+
Sbjct: 86 PMNTGMQHSVILTTEPRGLPLHYKLLPQWLGDFGYVSRMLGKWHLGYYKEEYTPTMRGFQ 145
Query: 180 SHLGYWTGHQDYFDH----SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
SH+G W G DY+ H S + + G D RRD++ + + G+Y T V T EA+ II +
Sbjct: 146 SHVGSWEGFSDYYSHIMDFSWQTWSISGHDFRRDMQKSKEDDGRYYTHVMTEEALKIIKD 205
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
H ++PLFLY+AH A HS N EPL+AP Y + + I R+ +A +L LDESVG V
Sbjct: 206 HPNEKPLFLYIAHLAVHSGNQPEPLKAPTKYTDPYMDIGHPSRTLYAGMLSVLDESVGAV 265
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
EAL +R +L+ +I+V +DNG + S WP RG K T WEGGVR ++WSPL
Sbjct: 266 FEALGRRGILNETIVVLTTDNGAGITTQFSSFGSGWPFRGQKGTAWEGGVRVPAVVWSPL 325
Query: 356 L-ESRGIVAEQYVHVSDWLPTLLSAAN 381
E RG V E HVSDWLPT A
Sbjct: 326 FSEHRGAVVESLFHVSDWLPTFYELAG 352
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 63/148 (42%), Gaps = 21/148 (14%)
Query: 563 ENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVK----- 617
E D S EIDGI L E R I+ NID +SA+ +KL++
Sbjct: 349 ELAGGDPSNLGEIDGISQLDTLRHREKLPRTEIVLNIDPIENVSAIIVYPFKLMQGDDQG 408
Query: 618 ---------VVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRK--LRDAASIQCG---P 663
+ V R Q D G + + + + + + A S++CG P
Sbjct: 409 GHYDDWYPFMGNVTRTQKDAREGCEASVVYRVMKHSGFVVTCGQDPVTYATSVKCGEKDP 468
Query: 664 VKEVPCEPQIAPCLFDIKNDPCEKNNLA 691
K C+P + PCLFD+ DPCE N+++
Sbjct: 469 TKA--CKPIVKPCLFDLSKDPCEYNDIS 494
Score = 43.9 bits (102), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 28/46 (60%), Gaps = 5/46 (10%)
Query: 755 GMRKLRDAASIQCG---PVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
G + A S++CG P K C+P + PCLFD+ DPCE N+++
Sbjct: 451 GQDPVTYATSVKCGEKDPTKA--CKPIVKPCLFDLSKDPCEYNDIS 494
>gi|390361962|ref|XP_789345.3| PREDICTED: arylsulfatase I-like, partial [Strongylocentrotus
purpuratus]
Length = 514
Score = 348 bits (893), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 173/324 (53%), Positives = 215/324 (66%), Gaps = 2/324 (0%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHIIFI+ADDLGW+DV HG QIPTPNID LA G+ L NYY LCTPSRSAIMTG+
Sbjct: 10 PPHIIFIVADDLGWDDVSLHGSSQIPTPNIDTLAQDGVTLTNYYVSPLCTPSRSAIMTGR 69
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
HPIHTG+Q V+ GL L EK + QYLK LGY T VGKWHLG++ KEYTPT+RGF
Sbjct: 70 HPIHTGLQFGVISPEAPYGLGLEEKTMAQYLKTLGYSTHAVGKWHLGYFAKEYTPTWRGF 129
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+S G++ G DY+ H + ++ G D+ ++ + G+YSTD+F EA II H+
Sbjct: 130 DSFFGFYNGRGDYYTHEEVQSEVSGYDLHKNGKVYRPAFGQYSTDIFNQEAEQIIKAHNA 189
Query: 239 DEPLFLYLAHAATHSA-NPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+PLFLYLAH A H+ P LQAPD Y HIE R +AA++ LD+SVG + +
Sbjct: 190 SQPLFLYLAHQAVHAGVYPDRLLQAPDKYYQRFPHIETEGRRMYAAMVSALDDSVGSISQ 249
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAG-FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
L + NSIIVF +DNGGA F+ ASNWPLRG K+TLWEGGVRG + SPL+
Sbjct: 250 TLRDAGLYDNSIIVFTTDNGGATYDFFDGTHASNWPLRGGKHTLWEGGVRGTAFVNSPLI 309
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA 380
+ ++Q +HV DWLPTL S A
Sbjct: 310 KKPRRFSDQMMHVCDWLPTLHSVA 333
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 62/127 (48%), Gaps = 15/127 (11%)
Query: 576 DGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQV 635
DG D W LS N PS RN ILHNI+ + +A+ G +KL + G
Sbjct: 344 DGFDQWDALSNNAPSPRNEILHNIEPIEKYAAIRVGDFKL--------HYGAFGGKIGSK 395
Query: 636 YLSGLSDREWLALAMRK-LRDAASIQCGPVKEVPCEPQI-----APCLFDIKNDPCEKNN 689
Y SG E M + + ++ ++C P K+ PCL++I++DPCE NN
Sbjct: 396 YFSGWYPPEGATSNMAEYVPNSFRVRC-PSKQANASTNCDYGDRKPCLYNIRDDPCEYNN 454
Query: 690 LADRSED 696
+AD ++D
Sbjct: 455 IADWNQD 461
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 25/40 (62%)
Query: 511 DGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKL 550
DG D W LS N PS RN ILHNI+ + +A+ G +KL
Sbjct: 344 DGFDQWDALSNNAPSPRNEILHNIEPIEKYAAIRVGDFKL 383
>gi|443732842|gb|ELU17406.1| hypothetical protein CAPTEDRAFT_127365 [Capitella teleta]
Length = 502
Score = 346 bits (887), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 175/348 (50%), Positives = 223/348 (64%), Gaps = 10/348 (2%)
Query: 39 VLPLAFTLSMVFVDLVASSG----PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
+L AF L++ F + SG PP+I+FILADDLGW+DV FHG Q+PTPNIDALA
Sbjct: 1 MLRCAFVLTLSF--FLECSGNSEIPPNIVFILADDLGWDDVSFHGSRQVPTPNIDALASD 58
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
GIIL NYY LC+PSR A+MTGKHPI G+Q V+ + GL L EK+LP+YL LGY
Sbjct: 59 GIILDNYYVHTLCSPSRGALMTGKHPIQIGLQRGVIMPAQPSGLGLKEKLLPEYLNTLGY 118
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
++ +VGKWHLG +EYTP RGFESH G++ G + Y H GLD + EP
Sbjct: 119 KSHMVGKWHLGMCAEEYTPMHRGFESHFGFYQGCESYTTHMCGNS---GLDFWLNEEPDH 175
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
G+YST +FTA+A ++ H T P+FLYLAH A H N + APD Y + I
Sbjct: 176 SAGGQYSTSLFTAKAEQLLAEHDTASPMFLYLAHQAVHVGNQDQKFYAPDKYTDKLSFIS 235
Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG-FNLNAASNWPL 333
D +R + A +L LDESVGKV +AL + ML N+IIVF SDNG G + SN+PL
Sbjct: 236 DDRRRQMAGMLTALDESVGKVTKALHAKGMLENTIIVFSSDNGAPHIGNSDHTVGSNFPL 295
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
RG K ++WEGG R +WSPL + G +++Q +H+SDWLPTLL AA
Sbjct: 296 RGGKPSIWEGGSRVPAFVWSPLFKKSGYISKQLMHISDWLPTLLQAAG 343
>gi|427781895|gb|JAA56399.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
Length = 554
Score = 344 bits (883), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 169/322 (52%), Positives = 208/322 (64%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+FILADDLGW+DV FHG QIPTPN+D LA G+IL NYY CTPSR+A+MTG +
Sbjct: 36 PHIVFILADDLGWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALMTGLY 95
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PI TGMQ + E GLP +ILPQYLKE GY T +VGKWHLG YK+ TPT RGF+
Sbjct: 96 PIRTGMQGMPIDVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTCRGFD 155
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
S GY+ G DYF H+ GLD + +P W G YST VFT A II N +
Sbjct: 156 SFYGYYYGESDYFAHTISYENHTGLDFWLNKKPVWSEIGTYSTSVFTKRAQYIIENRTKS 215
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
+PL L + H ATH A E LQAP ++ +I + R+ +A ++ LD+SVG+VV+AL
Sbjct: 216 KPLLLVITHQATHCALERERLQAPQENIDKFPYIGEKNRTIYAGMVDALDQSVGQVVKAL 275
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
ML N+IIVF SDNGG G + + NWPLRG K T+WEGG R A +WSPLL R
Sbjct: 276 SDAGMLENTIIVFSSDNGGGPWGEHNSRGFNWPLRGAKGTVWEGGTRAAAFVWSPLLARR 335
Query: 360 GIVAEQYVHVSDWLPTLLSAAN 381
V+ Q +H++DWLPT S A
Sbjct: 336 RRVSHQLMHITDWLPTFYSIAG 357
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 61/139 (43%), Gaps = 22/139 (15%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-ALTKGKWKLV---KVVKVMRYQVDLT 629
E+DG D+W LS S R +L+N D ++ S AL ++KLV RY+V
Sbjct: 365 ELDGHDMWRPLSHGRRSPRVELLYNYDYKYTKSAALRNHRYKLVLDETGRFADRYKVPGG 424
Query: 630 GGP--DQVYLSGLSDREWLALAMRKL------------RDAASIQCGPVKEVPCE-PQIA 674
P D YL+ S A +R R A + CG K++ +
Sbjct: 425 SRPRHDLDYLAAKSTA---ASVLRDFYKKKQLNFPKSWRRKARLTCGKQKKINFSWSATS 481
Query: 675 PCLFDIKNDPCEKNNLADR 693
LFD+ DPCE NNLA +
Sbjct: 482 VYLFDVVTDPCELNNLASK 500
>gi|158287209|ref|XP_564139.3| AGAP011347-PA [Anopheles gambiae str. PEST]
gi|157019541|gb|EAL41524.3| AGAP011347-PA [Anopheles gambiae str. PEST]
Length = 634
Score = 342 bits (878), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 163/324 (50%), Positives = 222/324 (68%), Gaps = 3/324 (0%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+ I ADD+GWNDVGFHG +QI TP+IDALAY G+IL +Y+ +CTPSR+A+MTG+H
Sbjct: 34 PHIVLITADDMGWNDVGFHGSNQIATPHIDALAYDGVILNRHYSAPMCTPSRAALMTGRH 93
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PI+ GMQH V+ E GL L ++I+PQY + GYRT ++GKWHLGF+ + Y PT RGF+
Sbjct: 94 PINVGMQHYVIDSDEPWGLGLDQRIMPQYFRAAGYRTHMIGKWHLGFFTEHYIPTNRGFD 153
Query: 180 SHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
+H+GY + DY+ + ++ G DMR++ + +G Y+TD FT+ A DII H
Sbjct: 154 THIGYLGPYVDYWSYVSKMNSGTFEGYDMRQNQFVNYAANGTYATDYFTSAARDIIAQHG 213
Query: 238 -TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
+ +P+ L + H A H+ N +PLQAP ++ +I + R +AA++ KLD+SVG V
Sbjct: 214 KSGQPMLLVMNHLAPHAGNDDDPLQAPQETIDRFAYIGNRDRRTYAAMMSKLDDSVGAVY 273
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
E+L ML NSIIVF+SDNGG G + N SN+P RG K++ WEG VR A LIWSPLL
Sbjct: 274 ESLRANGMLDNSIIVFLSDNGGVTRGMHSNTGSNYPFRGQKHSPWEGAVRTAALIWSPLL 333
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA 380
+ V+ Q+ H+SDWLPTL SAA
Sbjct: 334 KDTQRVSNQWFHISDWLPTLASAA 357
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 68/140 (48%), Gaps = 17/140 (12%)
Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDL 628
NS ++IDGID W L+ + R +++N+D+ + ++ + +K + +
Sbjct: 363 NSSFSDIDGIDQWEALAYGTGNPRQRLMNNLDEIFGYTSYMENGFKYINGTTLNGVNDAW 422
Query: 629 TGGPDQV--------YLSGLSDREWLALAMR-------KLRDAASIQCG-PVKEVPCEPQ 672
G D + YLS + +A A R LR A++ CG + PC P
Sbjct: 423 YGETDPLDQLPDDEQYLSSVMATR-IARAGRLDEKLISYLRRHATVNCGNSIATKPCNPL 481
Query: 673 IAPCLFDIKNDPCEKNNLAD 692
+ PCLFDI NDPCE ++++D
Sbjct: 482 VKPCLFDIINDPCEMHDISD 501
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 32/48 (66%), Gaps = 1/48 (2%)
Query: 752 EEEGMRKLRDAASIQCG-PVKEVPCEPQIAPCLFDIKNDPCEKNNLAD 798
+E+ + LR A++ CG + PC P + PCLFDI NDPCE ++++D
Sbjct: 454 DEKLISYLRRHATVNCGNSIATKPCNPLVKPCLFDIINDPCEMHDISD 501
>gi|270005853|gb|EFA02301.1| hypothetical protein TcasGA2_TC007966 [Tribolium castaneum]
Length = 558
Score = 342 bits (876), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 169/382 (44%), Positives = 240/382 (62%), Gaps = 24/382 (6%)
Query: 50 FVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTP 109
F + ++G P+++FI+ADDLGWNDVGFHG +QIPTPNIDALAY+GIIL ++Y+ TP
Sbjct: 20 FEKIHCATGQPNVVFIVADDLGWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTP 79
Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
SR+A++TGK+P+ G+Q + E LP KI+ +Y K++GY T +VGKWHLG +
Sbjct: 80 SRAALLTGKYPMKLGLQGPSITPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRW 138
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAE----EMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
TPTFRGF+ G++ G Y+D+ + + + G D+RRD P+W+ GKY+TD+F
Sbjct: 139 NDTPTFRGFDHFFGFYNGFTSYYDYVSNWKINDKEYSGFDLRRDTVPSWNDAGKYATDLF 198
Query: 226 TAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
AVD+I H+ + PLF+ +AH A H N + L+AP +N +HI D R +AA++
Sbjct: 199 AEHAVDVIQKHNVNTPLFMMIAHLAVHVGNEGKWLEAPQETVNKFKHIRDPNRRTYAAMV 258
Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
KLD+S+G V EALE + ML N+I+VF+SDNG G + N SN+PLRG+K+TL+EGGV
Sbjct: 259 SKLDDSIGAVFEALEAKNMLQNTIVVFISDNGAPTVGPHHNWGSNYPLRGIKDTLFEGGV 318
Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS-------------------DIP 386
R IWSPLL V+ +H++DWLPTL +A D+P
Sbjct: 319 RTVACIWSPLLVQSSRVSTDLIHITDWLPTLFTAVGGDLSVLDPDLDGIDQWSSLVYDLP 378
Query: 387 NYVNSTVENIIPRYENSILRYE 408
+ N NI + N+ LR+
Sbjct: 379 SARNDIPLNIDEKTRNAALRFS 400
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 86/187 (45%), Gaps = 51/187 (27%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSND 568
++DGID WS L + PS RN I NID++ + +AL WKL+ GTS N S +
Sbjct: 363 DLDGIDQWSSLVYDLPSARNDIPLNIDEKTRNAALRFSYWKLIV-------GTSGNGSYN 415
Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDD-EWQISALTKGKWKLVKVVKVMRYQVD 627
+ ++ NI++ ++ SA+ + ++ K + Y
Sbjct: 416 GYFGAPLN--------------------ENIEEQQYNTSAINDS--PVGRIAKKINY--- 450
Query: 628 LTGGPDQVYLSGLSDREWLALAMRKLRDAASIQC--GPVKEVPCEPQI-APCLFDIKNDP 684
+ LS+ ++ L R A+++C K PC+P A CL++I NDP
Sbjct: 451 ----------NPLSETDFDGL-----RRVATLKCLDAKAKRNPCDPASGAVCLYNIPNDP 495
Query: 685 CEKNNLA 691
CE+N+LA
Sbjct: 496 CEENDLA 502
Score = 42.7 bits (99), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 34/59 (57%), Gaps = 5/59 (8%)
Query: 752 EEEGMRKLRDAASIQC--GPVKEVPCEPQI-APCLFDIKNDPCEKNNLADR--SEVQRI 805
E LR A+++C K PC+P A CL++I NDPCE+N+LA S V+RI
Sbjct: 454 SETDFDGLRRVATLKCLDAKAKRNPCDPASGAVCLYNIPNDPCEENDLAKFFPSVVRRI 512
>gi|340727298|ref|XP_003401983.1| PREDICTED: arylsulfatase B-like [Bombus terrestris]
Length = 563
Score = 341 bits (875), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 172/357 (48%), Positives = 240/357 (67%), Gaps = 12/357 (3%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGP----PHIIFILADDLGWNDVGFHGLDQIPTPNID 89
IM+ +V+ L +V L P+II I+ADDLGWNDV FHG DQIPTPNID
Sbjct: 9 IMSMSVMIATGCLFYFYVVLAQQKAQAQQRPNIIVIMADDLGWNDVSFHGADQIPTPNID 68
Query: 90 ALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
ALAY+G+IL+ +Y + +CTPSR+A +TG++PI TGMQ + L E +PL+ +LP+YL
Sbjct: 69 ALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGHPLDPGEVRAIPLNNTLLPEYL 128
Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD 209
++LGY T +VGKWH+G+Y +TP +RGF++ LGY++G YF+H+ E+ G D+ D
Sbjct: 129 RKLGYATHLVGKWHVGYYSDYHTPAYRGFDTFLGYYSGFMTYFNHTIEQNHHVGYDLHYD 188
Query: 210 LEPAWDLHGKYS----TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
+ A +L KYS TD+ T A DII NH+ +PL+L L+H A HS+N + ++ D
Sbjct: 189 V--AGNLSVKYSHEYMTDLITERAEDIILNHNHSKPLYLQLSHIAAHSSNINKTVEVRDE 246
Query: 266 YLN--IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
+IEDF R K A ++ LDESVG+VV+AL+Q ML NSII+F+SDNG G
Sbjct: 247 EETNATLGYIEDFDRRKLAGVVTALDESVGRVVQALKQANMLENSIILFMSDNGAQTVGL 306
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
N+ SN+PLRG+K TL+EGGVRGA ++SP +++ ++ Q +HV+DWLPT SAA
Sbjct: 307 LDNSGSNYPLRGMKFTLFEGGVRGAACVYSPAIKNPSRISNQLIHVTDWLPTFYSAA 363
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 63/137 (45%), Gaps = 20/137 (14%)
Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
+ +DG+D W+ + E ++R IL NID++ +S G++KL+ K+ G
Sbjct: 371 EENLDGVDQWATIVSEENTRRKNILLNIDEKEDLSGALMGRYKLINGAKLQYGDYYGDSG 430
Query: 632 PDQVY----------------LSGLSDREWLALAMRKLRDAASIQCGPVKEVP-CEPQIA 674
+ Y ++ +S+ + + +LR AA + C P C +
Sbjct: 431 MSESYPEYNVSNVLHSAAGFAVASVSNVTLNSERIIELRRAAMVVCDNFTSYPKC---LD 487
Query: 675 PCLFDIKNDPCEKNNLA 691
CLFD+ NDPCE +++
Sbjct: 488 RCLFDVYNDPCETTDVS 504
>gi|307187653|gb|EFN72625.1| Arylsulfatase B [Camponotus floridanus]
Length = 525
Score = 341 bits (874), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 169/335 (50%), Positives = 231/335 (68%), Gaps = 8/335 (2%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PP+I+ I+ADDLGWNDV FHG D+IPTPNIDALAY+G+IL +Y + LCTPSR+A +TGK
Sbjct: 13 PPNIVVIMADDLGWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPLCTPSRTAFLTGK 72
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+PI TGMQ VL E G+PL++ +LP+YL++LGY T +VGKWH+G++ K YTPT RGF
Sbjct: 73 YPIRTGMQGYVLQPAEPRGIPLNDTLLPEYLRKLGYATHLVGKWHVGYHTKNYTPTRRGF 132
Query: 179 ESHLGYWTGHQDYFDHSA--EEMKMWGLDMRRDLEPAWDLHGKYS--TDVFTAEAVDIIH 234
++ LGY+ G+ YF+H+ EE K G D R + + +Y TD+ T E +II
Sbjct: 133 DTFLGYYNGYIHYFNHTILDEEQKYLGYDFHRIVGENRTIEYRYDYITDIITDEVENIIF 192
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPD--HYLNIHRHIEDFKRSKFAAILHKLDESV 292
+H+ +PL+L ++H A HS +Q D +IED R K+A+++ LDESV
Sbjct: 193 SHNPAKPLYLQVSHDAAHSGGIGIEMQVRDWKETNATLGYIEDINRRKYASVVATLDESV 252
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G++++AL + ML NSIIVF+SDNG GF N SN+PLRG+K +L+EGGVRGA I+
Sbjct: 253 GRIIDALRKTDMLKNSIIVFISDNGAQTEGFLQNYGSNYPLRGLKFSLFEGGVRGAAYIY 312
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDI 385
SPL++ V+ Q H++DWLPTL SAA N SD+
Sbjct: 313 SPLIDRLSRVSTQLFHITDWLPTLYSAAGGNSSDL 347
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 64/136 (47%), Gaps = 13/136 (9%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDD----EWQISALTKGKWKLVKVVKVMRYQVDLT 629
++DG D WS + E SKR +IL NID+ E+Q G L +++
Sbjct: 349 QLDGFDQWSAIKSAEDSKRKSILINIDERDTSEYQKYYNYSGNNALYPKYNA----INIL 404
Query: 630 GGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVP-CEPQIAPCLFDIKNDPCEKN 688
P ++ +S A + +LR A+I C + C + CLFD+K DPCE
Sbjct: 405 ASPVASAIASISTSMLNANKIMQLRKEATIVCKNFMDFSNCTNR--SCLFDVKEDPCETT 462
Query: 689 NLADR--SEDQRINHY 702
+L+ + E +R+N +
Sbjct: 463 DLSAKYPKEVERLNMF 478
>gi|189236319|ref|XP_975218.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
Length = 536
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 167/372 (44%), Positives = 235/372 (63%), Gaps = 24/372 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+++FI+ADDLGWNDVGFHG +QIPTPNIDALAY+GIIL ++Y+ TPSR+A++TGK+
Sbjct: 8 PNVVFIVADDLGWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTPSRAALLTGKY 67
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P+ G+Q + E LP KI+ +Y K++GY T +VGKWHLG + TPTFRGF+
Sbjct: 68 PMKLGLQGPSITPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTFRGFD 126
Query: 180 SHLGYWTGHQDYFDHSAE----EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
G++ G Y+D+ + + + G D+RRD P+W+ GKY+TD+F AVD+I
Sbjct: 127 HFFGFYNGFTSYYDYVSNWKINDKEYSGFDLRRDTVPSWNDAGKYATDLFAEHAVDVIQK 186
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
H+ + PLF+ +AH A H N + L+AP +N +HI D R +AA++ KLD+S+G V
Sbjct: 187 HNVNTPLFMMIAHLAVHVGNEGKWLEAPQETVNKFKHIRDPNRRTYAAMVSKLDDSIGAV 246
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
EALE + ML N+I+VF+SDNG G + N SN+PLRG+K+TL+EGGVR IWSPL
Sbjct: 247 FEALEAKNMLQNTIVVFISDNGAPTVGPHHNWGSNYPLRGIKDTLFEGGVRTVACIWSPL 306
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAANKS-------------------DIPNYVNSTVENI 396
L V+ +H++DWLPTL +A D+P+ N NI
Sbjct: 307 LVQSSRVSTDLIHITDWLPTLFTAVGGDLSVLDPDLDGIDQWSSLVYDLPSARNDIPLNI 366
Query: 397 IPRYENSILRYE 408
+ N+ LR+
Sbjct: 367 DEKTRNAALRFS 378
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 86/187 (45%), Gaps = 51/187 (27%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSND 568
++DGID WS L + PS RN I NID++ + +AL WKL+ GTS N S +
Sbjct: 341 DLDGIDQWSSLVYDLPSARNDIPLNIDEKTRNAALRFSYWKLIV-------GTSGNGSYN 393
Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDD-EWQISALTKGKWKLVKVVKVMRYQVD 627
+ ++ NI++ ++ SA+ + ++ K + Y
Sbjct: 394 GYFGAPLN--------------------ENIEEQQYNTSAINDS--PVGRIAKKINY--- 428
Query: 628 LTGGPDQVYLSGLSDREWLALAMRKLRDAASIQC--GPVKEVPCEPQI-APCLFDIKNDP 684
+ LS+ ++ L R A+++C K PC+P A CL++I NDP
Sbjct: 429 ----------NPLSETDFDGL-----RRVATLKCLDAKAKRNPCDPASGAVCLYNIPNDP 473
Query: 685 CEKNNLA 691
CE+N+LA
Sbjct: 474 CEENDLA 480
Score = 43.1 bits (100), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 34/59 (57%), Gaps = 5/59 (8%)
Query: 752 EEEGMRKLRDAASIQC--GPVKEVPCEPQI-APCLFDIKNDPCEKNNLADR--SEVQRI 805
E LR A+++C K PC+P A CL++I NDPCE+N+LA S V+RI
Sbjct: 432 SETDFDGLRRVATLKCLDAKAKRNPCDPASGAVCLYNIPNDPCEENDLAKFFPSVVRRI 490
>gi|443724925|gb|ELU12719.1| hypothetical protein CAPTEDRAFT_140387 [Capitella teleta]
Length = 542
Score = 340 bits (871), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 169/351 (48%), Positives = 225/351 (64%), Gaps = 9/351 (2%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + S PPHIIFILADD GW+DVGFHG +IPTPN+DALA GIIL N+
Sbjct: 6 LVLALSFLTGSFGNSHRPPHIIFILADDFGWDDVGFHGSRKIPTPNLDALASDGIILSNH 65
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y+ LCTPSR +++TGKHPI G+Q V+Y + GL L EK+LP+YLK LGY++ +VGK
Sbjct: 66 YSQPLCTPSRGSLLTGKHPIQIGLQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGK 125
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
WHLGF+ EYTP RGF+SH G++ +DY H M GLD + +P G YS
Sbjct: 126 WHLGFFADEYTPMRRGFDSHYGFYGASEDYMTHIG---GMGGLDFWLNGQPDRSGQGHYS 182
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP---LQAPDHYLNIHRHIEDFKR 278
T +FT +A ++ H+ EP+FLY +H A H+++P E L AP Y + I D +
Sbjct: 183 TTLFTTKAEQLLAEHNQTEPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVEC 242
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
A ++ LDESVG + ++L ML N+II+F SDNG ++ +SN+PLRG K
Sbjct: 243 QHMAGMISALDESVGNLTKSLHTNGMLENTIIIFSSDNGAPHENTDV-CSSNYPLRGAKT 301
Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDIPN 387
++WEGG R +WSPLL+ G V++Q +H+SDWLPTLL AA N S +P
Sbjct: 302 SVWEGGTRVPAFVWSPLLKKSGYVSKQMMHISDWLPTLLEAAGYNMSALPG 352
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 47/195 (24%), Positives = 78/195 (40%), Gaps = 51/195 (26%)
Query: 502 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-ALTRGKWKLVKENSINGNG 560
N ++ ++ G+ W L N PS RN++L+N D + Q S AL G KL++
Sbjct: 346 NMSALPGDLYGVSQWQALQENGPSARNSMLYNADHKKQGSWALRVGDMKLLQ-------- 397
Query: 561 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVK 620
+E + +S + I++ S L NEP+ N + +++ Q++++
Sbjct: 398 -AEGATQLHSGWQQPFAINLQSQLPSNEPNNTNYVQ---EEQSQVASIL----------- 442
Query: 621 VMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCG----PVKEVPCEPQIAPC 676
E L MR + CG V C PC
Sbjct: 443 -----------------------EELGRGMRNSTTKYFVNCGFISSGVTSHSCIDTGRPC 479
Query: 677 LFDIKNDPCEKNNLA 691
L+++ DPCE +NLA
Sbjct: 480 LYNVTADPCEYHNLA 494
>gi|350422910|ref|XP_003493325.1| PREDICTED: arylsulfatase I-like [Bombus impatiens]
Length = 563
Score = 339 bits (869), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 164/327 (50%), Positives = 225/327 (68%), Gaps = 8/327 (2%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II I+ADDLGWNDV FHG DQIPTPNIDALAY+G+IL+ +Y + +CTPSR+A +TG++
Sbjct: 39 PNIIVIMADDLGWNDVSFHGADQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRY 98
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PI TGMQ L E +PL+ +LP+YL++LGY T +VGKWH+G+Y +TP +RGF+
Sbjct: 99 PIRTGMQGYPLKAGEERAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPAYRGFD 158
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS----TDVFTAEAVDIIHN 235
+ LGY++G+ YF H+ E+ G D+ D+ A +L KYS TD+ T A DII N
Sbjct: 159 TFLGYYSGYITYFKHTIEQNLHVGYDLHYDV--AGNLSVKYSHEYMTDLITERAEDIIFN 216
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN--IHRHIEDFKRSKFAAILHKLDESVG 293
H+ +PL+L L+H A HS++ ++ D +IEDF R K A ++ +DESVG
Sbjct: 217 HNRSKPLYLQLSHVAAHSSDAKANMEVRDEEETNATLGYIEDFDRRKLAGVVTAMDESVG 276
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
+VV+AL Q ML NSIIVF+SDNG G N SN+PLRG+K TL+EGGVRG ++S
Sbjct: 277 RVVQALRQTNMLENSIIVFMSDNGAQTVGLLNNYGSNYPLRGLKFTLFEGGVRGVACVYS 336
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA 380
P +++ ++ Q +HV+DWLPT SAA
Sbjct: 337 PAIKNPSRISNQLIHVTDWLPTFYSAA 363
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/137 (24%), Positives = 63/137 (45%), Gaps = 20/137 (14%)
Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
+ +DG+D W+ + + ++R +L NID++ +S G++KL+ V G
Sbjct: 371 EENLDGVDQWATIVSEKKTRRKNVLLNIDEKQALSGALMGRYKLINGANVSYGDYYGDSG 430
Query: 632 PDQVY----------------LSGLSDREWLALAMRKLRDAASIQCGPVKEVP-CEPQIA 674
+ Y L+ +S+ + + + +LR AA++ C P C +
Sbjct: 431 TSESYPEYNVSNVLHSAAGSALASVSNATFNSERIIELRRAATVVCDNFTSYPRC---LD 487
Query: 675 PCLFDIKNDPCEKNNLA 691
CLFD+ DPCE +++
Sbjct: 488 RCLFDVYKDPCETTDVS 504
>gi|241844558|ref|XP_002415497.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215509709|gb|EEC19162.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 529
Score = 338 bits (868), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 166/326 (50%), Positives = 218/326 (66%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
+S+ PP+IIFILADDLGW DV F G QIPTPN+D LA GIIL NYY LC PSR A+
Sbjct: 15 SSTVPPNIIFILADDLGWADVSFRGDPQIPTPNLDVLASQGIILNNYYVQPLCAPSRGAL 74
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
M+G +PIHTG+QH V E GLP + I+P+YLK LGY T ++GKWHLG++K+ YTPT
Sbjct: 75 MSGLYPIHTGLQHLVPGPGEPWGLPTNLTIMPEYLKNLGYATHMIGKWHLGYHKESYTPT 134
Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
RGF+S GY G +DY+DH+ GLD + P + YST++FT +A +I
Sbjct: 135 RRGFDSFYGYLNGGEDYYDHTILWSNASGLDFWENTTPVRNEGNHYSTELFTKKAQSLIK 194
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
+H +P+FLY +H A H + L+AP + +I++ RS A +++LD+SVG
Sbjct: 195 HHDPAKPMFLYFSHQAVHCGDYKVELEAPALAIAHFPYIKELNRSIHAGAVYELDKSVGL 254
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V+EAL +R MLSNSI++F +DNGG G N+ NWPLRG K T WEGG RGA +WSP
Sbjct: 255 VMEALNKRGMLSNSIVIFSTDNGGLPWGVEPNSGYNWPLRGSKETNWEGGARGAAFVWSP 314
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
LL G ++ Q +H++DWLPTL SAA
Sbjct: 315 LLFKSGRLSNQMMHITDWLPTLYSAA 340
Score = 43.9 bits (102), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 54/209 (25%), Positives = 78/209 (37%), Gaps = 59/209 (28%)
Query: 484 IPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL 543
+P L ++ GN ++ IDG D+W LS + S R +L NID SAL
Sbjct: 333 LPTLYSAAGGNVST---------LGNIDGKDMWKALSEDLESPRQEVLINIDPIENSSAL 383
Query: 544 TRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEW 603
G+ K+V S N G+ + R +DG+D + SR + D +
Sbjct: 384 IVGRHKVVL-GSFN-EGSHDMRMKAPGGSRPVDGLDQMMLSSRTGKV--------LKDFY 433
Query: 604 QISALT-KGKWKLVKVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCG 662
+ LT + W+ VV+ RY
Sbjct: 434 NVRQLTVRPNWRNEAVVRCDRY------------------------------------AP 457
Query: 663 PVKEVPCEPQIAPCLFDIKNDPCEKNNLA 691
P V P P FD+++DPCE NNLA
Sbjct: 458 PNNFVAASP---PYYFDLEHDPCELNNLA 483
>gi|340727296|ref|XP_003401982.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
Length = 579
Score = 337 bits (864), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 166/348 (47%), Positives = 236/348 (67%), Gaps = 10/348 (2%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
++ ++LPL + F L +S PHII +ADDLGWNDV FHG +QIPTPNIDAL Y+
Sbjct: 9 ISTSLLPLILFIDCAF-GLYETS--PHIIVFMADDLGWNDVSFHGSNQIPTPNIDALGYN 65
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
GIIL +Y + TPSR+A TG++PI GMQ + G E GLPL+ KILP++L+ LGY
Sbjct: 66 GIILNRHYVLPSSTPSRTAFFTGQYPIRIGMQGADIRGGEPRGLPLNIKILPEHLRGLGY 125
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
T+++GKWH+G+Y +YTP RGF++ LG++ + Y+D++ M G DM R +PA+
Sbjct: 126 TTKLIGKWHMGYYTPQYTPLHRGFDTFLGFYNSYISYYDYNYSNQNMSGYDMHRGDDPAY 185
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR--H 272
++ +Y+TD+FT+EA++II NH + PL+L L+H A HS PL+ P + N H
Sbjct: 186 GMNREYATDMFTSEAINIIENHELNRPLYLQLSHLAVHS-----PLEQPANVYNDREPIH 240
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
I + R K+A ++ KLDESVG++ AL ++ ML +S+I+F++DNG A+ G N SN+P
Sbjct: 241 IREPNRRKYAKMVSKLDESVGRIFHALGEKGMLKDSLILFLTDNGAASVGRYRNYGSNYP 300
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
LRG K TL+EGGVRG +WS L+ V+ + VH++DWLPTL SAA
Sbjct: 301 LRGTKYTLYEGGVRGVAALWSSRLQRAARVSNELVHITDWLPTLYSAA 348
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 60/145 (41%), Gaps = 28/145 (19%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGP- 632
EIDGID W +LS R+ +L NID+ + ++KL++ Y G
Sbjct: 357 EIDGIDQWRLLSEGRGRGRDKLLLNIDEVLKTEGAIYSRFKLLRGTIESGYYDGYYGDSG 416
Query: 633 ----------DQVYLSGLSDREWLALA--------MRKLRDAASIQCGP-------VKEV 667
D V S +S+ L M +LR A++QC
Sbjct: 417 KTLETLPTYTDIVMKSTVSESLTYHLGGPVTQPSTMVRLRQDATVQCNANFTYYRRQMLT 476
Query: 668 PCEPQIAPCLFDIKNDPCEKNNLAD 692
PC + CLFDI NDPCE N+AD
Sbjct: 477 PC--NVTECLFDIVNDPCETRNIAD 499
Score = 43.5 bits (101), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 26/50 (52%), Gaps = 9/50 (18%)
Query: 756 MRKLRDAASIQCGP-------VKEVPCEPQIAPCLFDIKNDPCEKNNLAD 798
M +LR A++QC PC + CLFDI NDPCE N+AD
Sbjct: 452 MVRLRQDATVQCNANFTYYRRQMLTPC--NVTECLFDIVNDPCETRNIAD 499
>gi|410929555|ref|XP_003978165.1| PREDICTED: arylsulfatase B-like [Takifugu rubripes]
Length = 516
Score = 337 bits (864), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 168/355 (47%), Positives = 230/355 (64%), Gaps = 17/355 (4%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
MA A + L + F +A+ P HI+FILADD GW DVG+H L +I TP +D L+
Sbjct: 1 MALASFLIITGLFLSFPGFLAAKQP-HIVFILADDFGWYDVGYH-LSEIRTPILDKLSSG 58
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
G+ L+NYY LCTPSR+ +MTG++ IHTGMQH +++ C+ +PL EK+LPQ +KE GY
Sbjct: 59 GVRLENYYVQPLCTPSRNQLMTGRYQIHTGMQHQIIWPCQPYCVPLDEKLLPQLMKEAGY 118
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRR 208
T +VGKWHLG YKK+ PT RGF+S+LGY TG +DY+ H SA + LD+R
Sbjct: 119 ATHMVGKWHLGMYKKDCLPTRRGFDSYLGYLTGSEDYYTHIRCHPISALNLTRCALDLRE 178
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
A G YST++F+ AV II H++ EPLFLY+A A H+ PLQ P+ Y+
Sbjct: 179 AEAVARSYKGTYSTELFSQRAVSIIEKHTSTEPLFLYVAFQAVHA-----PLQVPERYVA 233
Query: 269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
+ I+D R +A ++ +DE+VG + AL++R + N+++VF +DNGG L+
Sbjct: 234 PYSFIQDHSRRSYAGMVSAMDEAVGNITLALQKRGLWENTVLVFSTDNGGQ----TLSGG 289
Query: 329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
SNWPLRG K +LWEGGVRG G + SPLLE G ++ Q +H+SDWLPTL+ A S
Sbjct: 290 SNWPLRGRKWSLWEGGVRGVGFVASPLLEQPGTISRQLIHISDWLPTLVGLAGGS 344
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH +++ C+ +PL EK+LPQ +KE GY T ++
Sbjct: 89 MQHQIIWPCQPYCVPLDEKLLPQLMKEAGYATHMVG 124
>gi|443722750|gb|ELU11510.1| hypothetical protein CAPTEDRAFT_23094, partial [Capitella teleta]
Length = 549
Score = 336 bits (862), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 164/333 (49%), Positives = 219/333 (65%), Gaps = 9/333 (2%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHIIFILADD GW+DVGFHG +IPTPN+DALA GIIL N+Y+ LCTPSR +++TGKH
Sbjct: 1 PHIIFILADDFGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKH 60
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PI G+Q V+Y + GL L EK+LP+YLK LGY++ +VGKWHLGF+ EYTP RGF+
Sbjct: 61 PIQIGLQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFD 120
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
SH G++ +DY H M GLD + +P G YST +FT +A ++ H+
Sbjct: 121 SHYGFYGASEDYMTHIG---GMGGLDFWLNGQPDRSGQGHYSTTLFTTKAEQLLAEHNQT 177
Query: 240 EPLFLYLAHAATHSANPYEP---LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
EP+FLY +H A H+++P E L AP Y + I D + A ++ LDESVG +
Sbjct: 178 EPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQHMAGMISALDESVGNLT 237
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
++L ML N+II+F SDNG ++ +SN+PLRG K ++WEGG R +WSPLL
Sbjct: 238 KSLHTNGMLENTIIIFSSDNGAPHENTDV-CSSNYPLRGAKTSVWEGGTRVPAFVWSPLL 296
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA--NKSDIPN 387
+ G V++Q +H+SDWLPTLL AA N S +P
Sbjct: 297 KKSGYVSKQMMHISDWLPTLLEAAGYNMSALPG 329
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 53/215 (24%), Positives = 90/215 (41%), Gaps = 58/215 (26%)
Query: 502 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-ALTRGKWKLVKENSINGNG 560
N ++ ++ G+ W L N PS RN++L+N D + Q S AL G KL++
Sbjct: 323 NMSALPGDLYGVSQWQALQENGPSARNSMLYNADHKKQGSWALRVGDMKLLQ-------- 374
Query: 561 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVK 620
+E + +S + I++ S L NEP+ N + +++ Q++++ +
Sbjct: 375 -AEGATQLHSGWQQPFAINLQSQLPSNEPNNTNYVQ---EEQSQVASILE---------- 420
Query: 621 VMRYQVDLTGGPDQVYLSGLSDRE---WLALAM-RKLRDAASIQ---------------C 661
+ L+G E WLA + ++L + A IQ C
Sbjct: 421 ------------EHETLAGKRRHEATFWLASTLCKQLAEPAVIQQARQNKRRSGRTVPNC 468
Query: 662 G----PVKEVPCEPQIAPCLFDIKNDPCEKNNLAD 692
G V C PCL+++ DPCE NLA+
Sbjct: 469 GFASTGVAPRSCVNTGRPCLYNVTADPCEYYNLAE 503
Score = 39.3 bits (90), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+Q V+Y + GL L EK+LP+YLK LGY++ ++
Sbjct: 66 LQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVG 101
>gi|307187654|gb|EFN72626.1| Arylsulfatase B [Camponotus floridanus]
Length = 525
Score = 336 bits (862), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 163/333 (48%), Positives = 233/333 (69%), Gaps = 6/333 (1%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PP+I+ I+ADDLGWNDV FHG D+IPTPNIDALAY+G+IL +Y + +CTPSR+A +TGK
Sbjct: 3 PPNIVVIIADDLGWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPICTPSRTAFLTGK 62
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+PI TGMQ L G E G+ L+ +LP+YL++LGY T +VGKWH+G++ + Y PT RGF
Sbjct: 63 YPIRTGMQGYPLQGAEPRGILLNNILLPEYLQKLGYATHLVGKWHVGYHTRNYGPTHRGF 122
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--TDVFTAEAVDIIHNH 236
++ GY+ G+ YF+H+ E + G D+ R + + +Y TD+ T EA +II +H
Sbjct: 123 DTFAGYYNGYIQYFNHTLYESEQLGYDLHRIIGDDHKIEYRYDYMTDLITDEAENIISSH 182
Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPD--HYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
+ +PL+L +AH A HS++ E ++ + +IED R K+A+++ LDESVG+
Sbjct: 183 NPAKPLYLQVAHLAAHSSDAEEEMEVRNWKETNATLGYIEDINRRKYASVVATLDESVGR 242
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V++AL++ ML+NSIIVF+SDNG GF N SN+PLRG+K +L+EGG+RGA I+SP
Sbjct: 243 VIDALKKTDMLNNSIIVFISDNGAQTEGFLQNYGSNYPLRGLKFSLFEGGIRGAACIYSP 302
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA--NKSDI 385
L++ V+ Q H++DWLPTL SAA N SD+
Sbjct: 303 LIDRPSRVSNQLFHITDWLPTLYSAAGGNSSDL 335
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 65/147 (44%), Gaps = 20/147 (13%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG-- 631
++DG D WS + + SKR++IL NID++ A G +KLV + + +G
Sbjct: 337 QLDGFDQWSAIKSAKNSKRSSILINIDEKENSEAALIGHYKLVTDTSEYQKYYNYSGNNA 396
Query: 632 --PDQVYLSGLSDREWL-----------ALAMRKLRDAASIQCGPVKEVP-CEPQIAPCL 677
P + L+ A + +LR A+I C + C + CL
Sbjct: 397 LYPKYNVTTILASPAASAIASISISTLNASKIMQLRKEATIICKNFTDFSNCTNR--SCL 454
Query: 678 FDIKNDPCEKNNLADR--SEDQRINHY 702
FD+ DPCE +L + + +R+N +
Sbjct: 455 FDLNKDPCETTDLFSKHPKDVKRLNMF 481
>gi|194748066|ref|XP_001956470.1| GF24578 [Drosophila ananassae]
gi|190623752|gb|EDV39276.1| GF24578 [Drosophila ananassae]
Length = 542
Score = 336 bits (862), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 160/332 (48%), Positives = 221/332 (66%), Gaps = 4/332 (1%)
Query: 70 LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNV 129
+G NDV FHG +QI TPNIDALAY+G++L +Y LCTPSR+ ++TGK+PIHTGMQH V
Sbjct: 1 MGMNDVSFHGSNQILTPNIDALAYNGVLLNKHYVPNLCTPSRATLLTGKYPIHTGMQHWV 60
Query: 130 LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQ 189
+ E GLP E+++P+ +E GY T +VGKWHLGF++++ TPT RGF+ H GY+ G+
Sbjct: 61 IITDEPWGLPKKERLMPELFREAGYSTHLVGKWHLGFWRQDLTPTMRGFDHHYGYYNGYI 120
Query: 190 DYFDHSAEEMKM---WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYL 246
DY+DH + GLD RRD EP +G Y+T+ FT+EA II H +PLF+ L
Sbjct: 121 DYYDHQVRLLGTNYSAGLDFRRDFEPNPKANGTYATEAFTSEAKRIIEEHDKSKPLFMVL 180
Query: 247 AHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLS 306
+H A H+ N P+QAP+ + HI+D KR +A ++ LD+SVG+ + AL+ ML+
Sbjct: 181 SHLAVHTGNEDNPMQAPEEEVAKFSHIKDPKRRTYAGMISSLDKSVGQTMRALKDNDMLN 240
Query: 307 NSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
N+I++ SDNG G + NA SN+P RG K + WEGG+R G +WSPLLE +G V+ Q
Sbjct: 241 NTIVLLYSDNGAPTVGIHSNAGSNYPYRGQKESPWEGGIRSVGALWSPLLEKQGYVSNQA 300
Query: 367 VHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
+H DWLPTL +AA S +P + EN+ P
Sbjct: 301 IHAIDWLPTLAAAAGVS-LPQDLELDGENLWP 331
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 70/167 (41%), Gaps = 27/167 (16%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV----KVMRYQVDLT 629
E+DG ++W LS K +H +D+ + S+ K V + + DL
Sbjct: 323 ELDGENLWPSLSEKAEPKPRKFIHVLDEVFGYSSYMSDTLKYVNGSSFDGQYDYWLGDLE 382
Query: 630 GGPDQVYLSGLSDREWLALAMR--------------KLRDAASIQCGPVK-------EVP 668
D LSG+ + LA ++ ++R A+ +C ++
Sbjct: 383 ANEDDP-LSGIYGEQVLASEVQSVLGSHGLTEDRILQMRSEATQKCPAIEGQDPLQPHYR 441
Query: 669 CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
CEP +APC FD+ DPCE+ NLA R ++ EV F Q A P
Sbjct: 442 CEPLVAPCFFDLAQDPCERYNLA-RLYPLQLEKLRQEVEEFRQTAIP 487
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 32/58 (55%), Gaps = 7/58 (12%)
Query: 753 EEGMRKLRDAASIQCGPVK-------EVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
E+ + ++R A+ +C ++ CEP +APC FD+ DPCE+ NLA +Q
Sbjct: 413 EDRILQMRSEATQKCPAIEGQDPLQPHYRCEPLVAPCFFDLAQDPCERYNLARLYPLQ 470
>gi|390356459|ref|XP_003728793.1| PREDICTED: arylsulfatase I-like [Strongylocentrotus purpuratus]
Length = 613
Score = 336 bits (861), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 178/388 (45%), Positives = 238/388 (61%), Gaps = 37/388 (9%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
+M F V LAF + + V + PP+IIFILADDLGWNDV FHG QIPTP+IDALA
Sbjct: 3 MMTFKVAVLAFFMG-CMLQRVYTDQPPNIIFILADDLGWNDVSFHGSSQIPTPHIDALAQ 61
Query: 94 SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
G+IL NYY +CTP+RSAIMTGKHPIHTG++H V+ GL L EK++PQYL+ELG
Sbjct: 62 EGVILTNYYVSPICTPTRSAIMTGKHPIHTGLEHGVIGVSHPYGLGLEEKLMPQYLRELG 121
Query: 154 YRTRIVGK-------------WHLGFY-------------------KKEYTPTFRGFESH 181
YRT +VGK + LG K+ TP+ RGFES
Sbjct: 122 YRTHMVGKVSLEHGVIGVSHPYGLGLEEKLMHQYLRELGYRTHMVGKESLTPSHRGFESF 181
Query: 182 LGYWTGHQDYFDHS-AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE 240
GY+ G DY+ H + M G D D + G+YST++FT +II H+ E
Sbjct: 182 YGYYAGMGDYYTHEITSDGNMTGFDFHMDGSVHKPVFGQYSTEIFTERTQEIILKHNPKE 241
Query: 241 PLFLYLAHAATHSAN-PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PL++YLAH A HSAN + LQAP Y +I R K+AA++ LD+S+G + ++L
Sbjct: 242 PLYIYLAHQAVHSANYDGQRLQAPHEYYKRFPNITHENRRKYAAMVAALDDSIGNITQSL 301
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ + +N++IVF +DNGG A GF+ N ASNWPLRG+K+T WEGG+RG G +W L+E
Sbjct: 302 KESSLYNNTLIVFTTDNGGPANGFDFNYASNWPLRGMKHTTWEGGLRGVGFLWGALIEKP 361
Query: 360 GIVAEQYVHVSDWLPTL--LSAANKSDI 385
G +++ +HV DW+PTL L+ N S +
Sbjct: 362 GRMSDGMMHVCDWVPTLYGLAGGNTSTL 389
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 104/235 (44%), Gaps = 40/235 (17%)
Query: 501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNG 560
+ S +DGIDVW +LSR + S R ILH ID +SA+ G +KLV+ G
Sbjct: 383 GGNTSTLQHLDGIDVWPMLSRGKQSPRQEILHTIDPLLDLSAIRIGDYKLVQGQ----RG 438
Query: 561 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVK 620
T N Y E G + S R ILH ID +SA+ G +KLV+
Sbjct: 439 TGRN----GWYPPEGVG----------KQSPRQEILHTIDPLLDLSAIRIGDYKLVQ--- 481
Query: 621 VMRYQVDLTGGPDQVYLSGLSDREWLALA---MRKLRDAASIQCGPVK----EVPCEPQI 673
G +G E ++LA + + +A + C P K C P
Sbjct: 482 ----------GQRGTGRNGWYPPEGVSLADLDSKPVPNAFVVSC-PYKPANASTNCNPYK 530
Query: 674 APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
PCLF+I++DPCE NN+A+ ++D + T + + P + +E + + K
Sbjct: 531 KPCLFNIRHDPCEFNNIANWNQDI-VELLLTRIEEYRATMVPIRNKEPDPRSYPK 584
Score = 39.7 bits (91), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
++H V+ GL L EK++PQYL+ELGYRT ++
Sbjct: 93 LEHGVIGVSHPYGLGLEEKLMPQYLRELGYRTHMVG 128
>gi|296483766|tpg|DAA25881.1| TPA: arylsulfatase B [Bos taurus]
Length = 429
Score = 336 bits (861), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 164/336 (48%), Positives = 218/336 (64%), Gaps = 16/336 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG I TP +DALA G++L NYYT LCTPSRS
Sbjct: 39 AGASRPPHLVFVLADDLGWNDVGFHG-SAIRTPRLDALAAGGVLLDNYYTQPLCTPSRSQ 97
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ IHTG+QH ++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 98 LLTGRYQIHTGLQHQIILPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 157
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST+VFT
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTE 217
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A +
Sbjct: 218 RATTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASV 272
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V ALE+R + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALERRGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
G + SPLL+ +G+ + +H+SDWLPTL+ A S
Sbjct: 329 VGFVASPLLKRKGVKTRELIHISDWLPTLVKLAGGS 364
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH ++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 109 LQHQIILPCQPSCIPLDEKLLPQLLKEAGYATHMVG 144
>gi|391327192|ref|XP_003738089.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 594
Score = 335 bits (859), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 165/320 (51%), Positives = 218/320 (68%), Gaps = 4/320 (1%)
Query: 62 IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
++ ILADD+GWNDV F G QIPTPN+DALA +G+IL+N+Y CTPSR+A++TG +PI
Sbjct: 85 VVVILADDMGWNDVSFTGSGQIPTPNLDALASAGVILQNHYVQPFCTPSRAALLTGMYPI 144
Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
H+GMQH V+ E GLPL K+LPQ+LK+LGYRT ++GKWHLG +KKE+ PT RGF+SH
Sbjct: 145 HSGMQHYVIRSREPWGLPLDFKLLPQHLKDLGYRTHLIGKWHLGQFKKEFLPTRRGFDSH 204
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG-KYSTDVFTAEAVDIIHNHSTDE 240
LGY+ G+ DYF H+ + LD +D P H +Y+T +FT A +II +H D
Sbjct: 205 LGYYNGYIDYFTHNHTYKRDSALDFFKDEVP---YHSEEYATRLFTDRAEEIIRDHDVDN 261
Query: 241 PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALE 300
PLFLY AH A H A +P QAP ++ ++ D R+ FAA+L +LD SVG+VVEAL
Sbjct: 262 PLFLYFAHLAVHRATDRDPFQAPQETIDKFSYVGDRNRTTFAAMLAELDVSVGRVVEALA 321
Query: 301 QRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRG 360
++ +L N+II+F SDNGG A N +N+PLRG K TL+EGG R +WSPL+
Sbjct: 322 KKGILDNTIILFSSDNGGQATAPMENTGTNFPLRGQKRTLFEGGTRVPAFVWSPLIRRPR 381
Query: 361 IVAEQYVHVSDWLPTLLSAA 380
V VH+ DWLPT+ S A
Sbjct: 382 RVFYDMVHIVDWLPTIYSLA 401
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 59/139 (42%), Gaps = 23/139 (16%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV--KVVKVMR-YQVDLTGG 631
IDG++VW +S S R IL+NID + A+ G++KL+ + + Y G
Sbjct: 411 IDGMNVWGAVSDGMASPRTEILYNIDPVQRDLAIRVGRYKLLYGPMYNLTEWYDNRGFGY 470
Query: 632 PDQVYLSGLSDREWLALAMRKL-------------------RDAASIQCG-PVKEVPCEP 671
L D+ +A +R L R+ + CG P C P
Sbjct: 471 ASHSVLERYMDKSAVARVLRDLGYWWKGREEDYSRRPTWQWREETRVHCGNPSSATACNP 530
Query: 672 QIAPCLFDIKNDPCEKNNL 690
PCLFD+ DPCE NNL
Sbjct: 531 SEQPCLFDLATDPCEYNNL 549
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 760 RDAASIQCG-PVKEVPCEPQIAPCLFDIKNDPCEKNNL 796
R+ + CG P C P PCLFD+ DPCE NNL
Sbjct: 512 REETRVHCGNPSSATACNPSEQPCLFDLATDPCEYNNL 549
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 26/36 (72%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH V+ E GLPL K+LPQ+LK+LGYRT ++
Sbjct: 148 MQHYVIRSREPWGLPLDFKLLPQHLKDLGYRTHLIG 183
>gi|301621823|ref|XP_002940244.1| PREDICTED: arylsulfatase B-like [Xenopus (Silurana) tropicalis]
Length = 502
Score = 334 bits (857), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 162/342 (47%), Positives = 226/342 (66%), Gaps = 16/342 (4%)
Query: 48 MVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLC 107
M A++ PPHI+F+LADDLGWNDVGFHG +I TP +D L+ G+ L YYT LC
Sbjct: 1 MALSQPAAATSPPHIVFVLADDLGWNDVGFHG-SEILTPTLDFLSGQGVRLAGYYTQPLC 59
Query: 108 TPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
TPSRS +++G++ IHTG+QH +++ C+ PL +K+LP+ LKE GY T +VGKWHLG Y
Sbjct: 60 TPSRSQLLSGRYQIHTGLQHQIIWPCQPHCHPLEDKLLPELLKERGYVTHMVGKWHLGMY 119
Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYS 221
K + PT RGF+S+ GYWTG +DY+ H + + LD R PA D KYS
Sbjct: 120 KTDCLPTRRGFDSYFGYWTGGEDYYSHERCYLITTLNITRCALDFRDGEVPATDYQMKYS 179
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T +FT AVD+I NH+ ++PLFLYLA+ A HS PLQ PD Y+ + I D R +
Sbjct: 180 THLFTDRAVDLITNHNPEKPLFLYLAYQAVHS-----PLQVPDQYIEPYTSIHDKNRRLY 234
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
A ++ LDE+V V AL+++ + +N++ +F +DNGG L+ +NWPLRG K TLW
Sbjct: 235 AGMVSVLDEAVANVTNALKKQGLWNNTVFIFSTDNGGQ----TLSGGNNWPLRGRKWTLW 290
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
EGG+RG G + S LL+ +G+ +++ +H+SDWLPTL++ A S
Sbjct: 291 EGGIRGVGFVTSSLLKQKGVTSKELIHISDWLPTLVNLAGGS 332
>gi|155372077|ref|NP_001094645.1| arylsulfatase B precursor [Bos taurus]
gi|151554899|gb|AAI48140.1| ARSB protein [Bos taurus]
Length = 533
Score = 334 bits (857), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 164/336 (48%), Positives = 218/336 (64%), Gaps = 16/336 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG I TP +DALA G++L NYYT LCTPSRS
Sbjct: 39 AGASRPPHLVFVLADDLGWNDVGFHG-SAIRTPRLDALAAGGVLLDNYYTQPLCTPSRSQ 97
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ IHTG+QH ++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 98 LLTGRYQIHTGLQHQIILPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 157
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST+VFT
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTE 217
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A +
Sbjct: 218 RATTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASV 272
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V ALE+R + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALERRGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
G + SPLL+ +G+ + +H+SDWLPTL+ A S
Sbjct: 329 VGFVASPLLKRKGVKTRELIHISDWLPTLVKLAGGS 364
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH ++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 109 LQHQIILPCQPSCIPLDEKLLPQLLKEAGYATHMVG 144
>gi|241378410|ref|XP_002409154.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215497456|gb|EEC06950.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 511
Score = 333 bits (855), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 172/359 (47%), Positives = 223/359 (62%), Gaps = 22/359 (6%)
Query: 71 GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
GW+DV FHG QIPTPN+D LA GIIL NYY CTPSR+A+MTG +PIHTGMQH VL
Sbjct: 2 GWDDVSFHGSAQIPTPNMDTLAADGIILNNYYVQPACTPSRAALMTGLYPIHTGMQHGVL 61
Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
E GLPLS I+PQYLK LGY T IVGKW+LG YK YTPTFRGF+S GY++ +D
Sbjct: 62 SPAEPYGLPLSVSIMPQYLKNLGYETHIVGKWNLGNYKLSYTPTFRGFDSFYGYYSAVED 121
Query: 191 YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAA 250
Y++H+ GLD + +P ++ G YST ++T +I N +P FLYL + A
Sbjct: 122 YYNHTVLWDNQTGLDFWLNTQPLRNVSGIYSTQLYTERTKFLIENRDVSKPFFLYLPYQA 181
Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
H N +PLQAP ++ +I + R+ FA + LDES+G+V AL ML N+II
Sbjct: 182 VHCGNFDDPLQAPQENIDKFPYIGEENRTIFAGMTDALDESIGEVFRALFDADMLDNTII 241
Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
V +DNGG GF N N+PLRG K TLWEGG R + +WSPLL+ +G+V++Q +H++
Sbjct: 242 VLSADNGGLPFGFESNRGYNFPLRGGKGTLWEGGCRASAFVWSPLLKRKGVVSDQMMHIT 301
Query: 371 DWLPTLLSAA----------NKSDIPNYVNSTVENIIPRYE----------NSILRYEN 409
DW+PTL SAA + +D+ +V + PRYE NS LRY N
Sbjct: 302 DWVPTLYSAAGGNPQTLGKLDGTDM--WVPLAYGSASPRYEILYNIDPIDRNSALRYTN 358
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 63/138 (45%), Gaps = 22/138 (15%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV-KVVKVMRY-QVDLTGG 631
++DG D+W L+ S R IL+NID + SAL +KLV + Y Q LT G
Sbjct: 320 KLDGTDMWVPLAYGSASPRYEILYNIDPIDRNSALRYTNYKLVLGIANGGAYDQRYLTTG 379
Query: 632 PDQVY-----LSGLSD-----REWLALAMRKL----RDAASIQCG---PVKEVPCEPQIA 674
+ Y L S+ RE+ K R A++ CG P V P
Sbjct: 380 GSRPYGDLDLLMAQSEAAQVLREYYGTGYLKFPDQWRQRATLTCGNKVPKNFVSLSP--- 436
Query: 675 PCLFDIKNDPCEKNNLAD 692
P LFD+ DPCE NN+A+
Sbjct: 437 PYLFDLTADPCEINNIAN 454
Score = 46.2 bits (108), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 22/36 (61%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH VL E GLPLS I+PQYLK LGY T I+
Sbjct: 56 MQHGVLSPAEPYGLPLSVSIMPQYLKNLGYETHIVG 91
>gi|443705024|gb|ELU01769.1| hypothetical protein CAPTEDRAFT_23096, partial [Capitella teleta]
Length = 354
Score = 333 bits (853), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 163/325 (50%), Positives = 216/325 (66%), Gaps = 7/325 (2%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHIIFILADD GW+DVGFHG +IPTPN+DALA GIIL N+Y+ LCTPSR +++TGKH
Sbjct: 1 PHIIFILADDFGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKH 60
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PI G+Q V+Y + GL L EK+LP+YLK LGY++ +VGKWHLGF+ EYTP RGF+
Sbjct: 61 PIQIGLQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFD 120
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
SH G++ +DY H M GLD + +P G YST +FT +A ++ H+
Sbjct: 121 SHYGFYGASEDYMTHIG---GMGGLDFWLNGQPDRSGQGHYSTTLFTTKAEQLLAEHNQT 177
Query: 240 EPLFLYLAHAATHSANPYEP---LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
EP+FLY +H A H+++P E L AP Y + I D + A ++ LDESVG +
Sbjct: 178 EPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQHMAGMISALDESVGNLT 237
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
++L ML N+II+F SDN GA G +SN+PLRG K+++WEGG R +WSPLL
Sbjct: 238 KSLHANGMLENTIIIFSSDN-GAPHGNTDVCSSNYPLRGAKSSVWEGGTRVPAFVWSPLL 296
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAAN 381
+ G V++Q +H+SDWLPTLL AA
Sbjct: 297 KKSGYVSKQMMHISDWLPTLLKAAG 321
Score = 39.3 bits (90), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+Q V+Y + GL L EK+LP+YLK LGY++ ++
Sbjct: 66 LQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVG 101
>gi|241638976|ref|XP_002410783.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215503545|gb|EEC13039.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 527
Score = 332 bits (851), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 162/326 (49%), Positives = 219/326 (67%), Gaps = 6/326 (1%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
AS PP+IIF+L D LGW+DV FHG QIPTPN+DALA GIIL NYY +CTPSR+A+
Sbjct: 18 ASKLPPNIIFMLVDGLGWDDVSFHGSPQIPTPNMDALAADGIILNNYYVQPVCTPSRAAL 77
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
MTG +PIHTG+QH VL E GLPL KI+P+Y K+LGY T ++GKW+LG+Y KEYTPT
Sbjct: 78 MTGMYPIHTGLQHGVLLAAEPNGLPLEFKIMPEYFKDLGYETHLIGKWNLGYYMKEYTPT 137
Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+RGF+S G++ +DYF H+ E + + R ++ ++ S D D++
Sbjct: 138 YRGFDSFYGFYNYEEDYFTHNLEFVNQSNAMVWR--PSSFCVYLTLSPDT----GFDLLS 191
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
P FLYL+H + H A+ +PLQAP+ + +I D +R+K+A ++ LDES G
Sbjct: 192 ASIERGPFFLYLSHQSVHGASGNDPLQAPEENIAKFPYIGDERRTKYAGMVDALDESAGD 251
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V+EAL + ML+N+IIV S NGG ++G NA SN+PLRG K LWEGG R + IWSP
Sbjct: 252 VLEALYEAGMLANTIIVMSSANGGLSSGVESNAGSNFPLRGGKGALWEGGTRASAFIWSP 311
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
LL + V++Q +H++DWLPTL +AA
Sbjct: 312 LLYQKNRVSDQMMHITDWLPTLYAAA 337
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH VL E GLPL KI+P+Y K+LGY T ++
Sbjct: 88 LQHGVLLAAEPNGLPLEFKIMPEYFKDLGYETHLIG 123
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 68/182 (37%), Gaps = 48/182 (26%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDN 569
+DG ++W+ LS N PS R +L+NID Q SA+ +KLV S +G+ +E+R
Sbjct: 347 LDGQNLWNHLSYNLPSPRYELLYNIDPIDQTSAIRFLNYKLVLGES-SGSEENEHRLKTP 405
Query: 570 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLT 629
+D+ S+ RN HN + + M
Sbjct: 406 GNSRPYRDLDIAMAQSKAARVLRN--FHN------------------RGILFM------- 438
Query: 630 GGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNN 689
D W R A++ CG + P P LFD+ DPCE N
Sbjct: 439 ------------DSNW--------RQKAALNCGDDERENVVPGAPPYLFDLAKDPCEMRN 478
Query: 690 LA 691
+
Sbjct: 479 IV 480
>gi|307187655|gb|EFN72627.1| Arylsulfatase B [Camponotus floridanus]
Length = 591
Score = 331 bits (849), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 163/341 (47%), Positives = 225/341 (65%), Gaps = 7/341 (2%)
Query: 45 TLSMVFVDLVASS-----GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
TL F+ L ++ PHII I+ADDLGW+DVGFHG DQI TPNIDAL Y+GIIL
Sbjct: 10 TLVSYFIVLFGTTLGLYESAPHIIVIMADDLGWDDVGFHGSDQIRTPNIDALGYNGIILN 69
Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
+Y + TPSR+A TG++PI GMQ + G E G+PL+ +ILP++L++LGY T+++
Sbjct: 70 RHYVLPSSTPSRTAFFTGQYPIRMGMQGEDIQGGEPRGIPLNVRILPEFLRDLGYMTKLI 129
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
GKWHLG+Y ++TP RGF+S LG++ H Y+++ M G DM R PA+ K
Sbjct: 130 GKWHLGYYTPQHTPLRRGFDSFLGFYNSHVSYYNYKYSFQNMSGYDMHRGDAPAYGSTDK 189
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
Y TD FT EA+ II + PL+L ++H A H+ P E Q +HY + HI + R
Sbjct: 190 YVTDFFTDEAIKIIEYYDPSRPLYLQISHLAVHA--PLEGPQDYNHYDSQFLHIREINRR 247
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
K+A ++ +LD SVG++V+AL + ML NS+I+F++DNG A G N SN+PLRG+K T
Sbjct: 248 KYAKMVLRLDNSVGRIVQALGNKGMLKNSLILFLTDNGAAPIGKFRNYGSNYPLRGMKYT 307
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
L+EGGVRG L+WSP L V+++ +HV+DWLPTL S A
Sbjct: 308 LYEGGVRGVALLWSPRLRKTARVSDELMHVTDWLPTLYSIA 348
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/155 (25%), Positives = 63/155 (40%), Gaps = 35/155 (22%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVK---------------- 617
EIDGID W +L+ + PS R+ +L NID+ + ++KLV+
Sbjct: 357 EIDGIDQWCMLNGSLPSARDRLLLNIDEVSKTEGAIYKQFKLVRGSIEGGLYDGYFYDHI 416
Query: 618 -----VVKVM--------------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAAS 658
++ +M + L + S L D + +LR A+
Sbjct: 417 ERNVPIMLIMPSDPKTIVQEDVPPYTETILRSVVSETITSFLGDPVTQPSTIIQLRREAT 476
Query: 659 IQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 693
+ C + CLFD+ NDPCE NN+A++
Sbjct: 477 VNCRSRNRSFLTCNVTECLFDVNNDPCETNNIAEQ 511
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 758 KLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADR-SEVQR 804
+LR A++ C + CLFD+ NDPCE NN+A++ S + R
Sbjct: 470 QLRREATVNCRSRNRSFLTCNVTECLFDVNNDPCETNNIAEQYSRISR 517
>gi|328788250|ref|XP_624148.3| PREDICTED: arylsulfatase B-like isoform 2 [Apis mellifera]
Length = 564
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 167/351 (47%), Positives = 228/351 (64%), Gaps = 15/351 (4%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
M ++LPL T+ L PPHII +ADDLGWNDVGFHG +QIPTPNIDALAY+
Sbjct: 9 MIISLLPLILTIDCA---LGLYETPPHIIVFMADDLGWNDVGFHGSNQIPTPNIDALAYN 65
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
GIIL +Y + TPSR A TG +PI GMQ + + G E GLPL KILP++L+ LGY
Sbjct: 66 GIILNRHYVLPSSTPSRIAFFTGLYPIRIGMQGDGIRGGEPRGLPLHIKILPEHLRGLGY 125
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
T+++GKWH+GF+ +YTP RGF++ G++ H Y+D+ M G DM +PA+
Sbjct: 126 VTKLIGKWHMGFHTLQYTPLHRGFDTFFGFYNSHITYYDYEYSNQNMTGYDMHCGDDPAY 185
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR--- 271
+ +Y+TD+FT EA+ II NH PL+L ++H A H+ P++ PD + R
Sbjct: 186 GMKREYATDLFTNEAIKIIENHELPRPLYLQISHLAVHA-----PIEQPD---DSSRDEI 237
Query: 272 -HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
I + R K+A ++ KLDESVG+VV AL ++ ML +S+I+F++DNG A+ G N SN
Sbjct: 238 VQIREPNRRKYAKMVSKLDESVGRVVHALGEKGMLRDSLILFLTDNGAASIGRYRNYGSN 297
Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+PLRG K TL+EGGVRG +WS LE V ++ +H++DWLPTL SAA
Sbjct: 298 YPLRGTKYTLYEGGVRGVAALWSSRLEKGARVFKKLIHITDWLPTLYSAAG 348
>gi|327263080|ref|XP_003216349.1| PREDICTED: arylsulfatase B-like [Anolis carolinensis]
Length = 521
Score = 330 bits (845), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 160/331 (48%), Positives = 224/331 (67%), Gaps = 16/331 (4%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPH++F+LADDLGW DVG+HG QI TP +DAL+ +G+ L+ YY LCTPSRS ++TG+
Sbjct: 32 PPHLVFVLADDLGWQDVGWHG-SQIRTPVLDALSAAGVRLERYYIQPLCTPSRSQLLTGR 90
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH +++ C+ +PL EK+LP+ LKE GY T +VGKWHLG Y+ E PT RGF
Sbjct: 91 YQIHTGLQHEIIWPCQPSCVPLDEKLLPELLKEAGYVTHMVGKWHLGMYRNECLPTRRGF 150
Query: 179 ESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
+++ GY G +DY+ H ++ + LD+R + A YST+VFT A D+
Sbjct: 151 DTYFGYLLGSEDYYSHEHCVPIVSKNVTRCALDLRDGEKIADGFKNMYSTNVFTQRAQDL 210
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I NH ++PLFLYLA + H EPLQ P+ Y+ + I+D KR K+A ++ LDE+V
Sbjct: 211 IANHQPEKPLFLYLALQSVH-----EPLQVPEKYVEPYSFIKDEKRRKYAGMVSILDEAV 265
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G + EAL+ R + +N++++F +DNGG L +NWPLRG K TLWEGGVRGAG +
Sbjct: 266 GNITEALKTRGLWNNTVLIFSTDNGGQ----TLAGGNNWPLRGRKWTLWEGGVRGAGFVA 321
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
SPLL+ RG+ + + +H+SDWLPTL+ A S
Sbjct: 322 SPLLKQRGVESHELIHISDWLPTLVKLAGGS 352
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LP+ LKE GY T ++
Sbjct: 97 LQHEIIWPCQPSCVPLDEKLLPELLKEAGYVTHMVG 132
Score = 39.3 bits (90), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 61/151 (40%), Gaps = 39/151 (25%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKV--VKVMRYQVD----- 627
+DG DVW LS EPS R +LHNID + + KG + V ++Y +
Sbjct: 359 LDGFDVWKTLSEGEPSPRKELLHNIDPIFVDPSPCKGNRSCLAVDYAAALKYSIFNISIH 418
Query: 628 ----------LTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCL 677
LTG P G S W ++ ++ P+ + L
Sbjct: 419 AAIRQGKWKLLTGNP------GCS--HWFP-PPSLFNESKTMSSDPLTK-------KLWL 462
Query: 678 FDIKNDPCEKNNLADRSED------QRINHY 702
FDI +D E+N+L+D+ D R+ HY
Sbjct: 463 FDIVHDAEERNDLSDQYPDIVKKLLIRLQHY 493
Score = 39.3 bits (90), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 16/26 (61%), Positives = 18/26 (69%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
+DG DVW LS EPS R +LHNID
Sbjct: 359 LDGFDVWKTLSEGEPSPRKELLHNID 384
>gi|156547173|ref|XP_001603910.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 578
Score = 329 bits (844), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 162/346 (46%), Positives = 223/346 (64%), Gaps = 13/346 (3%)
Query: 44 FTLSMVFVDLVASS-----GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
F + L+ SS G PHII +ADDLGWNDVGFHG QIPTPNIDAL Y+GIIL
Sbjct: 4 FIALLTIAGLLGSSCADFEGRPHIIVFMADDLGWNDVGFHGATQIPTPNIDALGYNGIIL 63
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
+Y + C+P+R+A +TGK+PI GMQ + G E GLP+ + LP+YL+ LGY T +
Sbjct: 64 NKHYVLPSCSPTRAAFLTGKYPIRMGMQGAGIAGGEPRGLPVHVQTLPEYLQGLGYETNL 123
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
+GKWH+G++ ++ P RGF+ G++ H Y+D+ + M G DM + E A+ G
Sbjct: 124 IGKWHVGYHTPKHLPNRRGFDYFYGFYNSHIGYYDYRYSQGNMSGFDMHINGETAYGTDG 183
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP---DHYLNIHRHIED 275
Y+TD FT A+D+I+ H + P++L ++H A P+ P+ P + Y + RHI +
Sbjct: 184 VYATDRFTQAAIDVIYRHDLESPMYLQVSHLA-----PHAPMDVPFEDNPYDDEFRHISE 238
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
KR +A ++ +LD+S+G++V +L R ML NS+IVF+SDNGGA G N ASNWPLRG
Sbjct: 239 PKRRAYAKMVARLDDSLGRIVSSLGDRGMLKNSVIVFMSDNGGATIGKFRNWASNWPLRG 298
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K TL+EGGVR +WSP L S+G V+EQ HV+DWLPTL A
Sbjct: 299 TKYTLFEGGVRSVAAMWSPKLLSKGRVSEQLFHVTDWLPTLFQVAG 344
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 29/145 (20%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG---- 630
IDG+ W VLS + R+ IL NID+ Q A ++K+V+ Y ++ G
Sbjct: 353 IDGVSQWDVLSIGTGNVRDKILLNIDEYTQTEAAIDKRFKIVRGSLYHGYYDNVEGEIGR 412
Query: 631 -------GPDQVYLSGLSD--REWLAL------AMRKLRDAASIQCGPVK---------E 666
++ S +SD RE L + M +L+ A++ C P
Sbjct: 413 GHKNPMYNTSRILKSAVSDAIREHLGVPVTQESVMWELQRQATVLCRPNMTNMQYTSRYT 472
Query: 667 VPCEPQIAPCLFDIKNDPCEKNNLA 691
+P Q A CLFD+ DPCE NN+A
Sbjct: 473 LPSCNQTA-CLFDLGIDPCETNNVA 496
>gi|328788246|ref|XP_395125.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
Length = 562
Score = 329 bits (844), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 154/325 (47%), Positives = 225/325 (69%), Gaps = 4/325 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II I+ADDLGWNDV FHG +QIPTPNIDALAY+G+IL+ +Y + +CTPSR+A +TG++
Sbjct: 39 PNIIVIMADDLGWNDVSFHGANQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRY 98
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PI TGMQ L E +PL+ +LP+YL++LGY T +VGKWH+G+Y +TPT RGF+
Sbjct: 99 PIRTGMQGYPLKAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFD 158
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP--AWDLHGKYSTDVFTAEAVDIIHNHS 237
+ GY++G+ YF+H+ ++ G D+ D + D + +Y+TD+ T A +II NH
Sbjct: 159 TFFGYYSGYISYFNHTIKQDDHIGYDLHYDNSKNLSIDYNFEYTTDLITERAENIIKNHD 218
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLN--IHRHIEDFKRSKFAAILHKLDESVGKV 295
+PL+L L H A HS++ E ++ D ++IED+ R K+A ++ +DESVG+V
Sbjct: 219 RRKPLYLQLCHLAAHSSDAKEVMEVRDEQETNATLKYIEDYNRRKYAGVVTAMDESVGRV 278
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
++AL Q ML NSIIVF+SDNG G N SN+PLRG+K TL+EGG+RG ++S L
Sbjct: 279 IKALGQSSMLENSIIVFISDNGAQTEGLLENYGSNYPLRGLKFTLFEGGIRGVACVYSRL 338
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
+++ ++ + +H++DWLPT SAA
Sbjct: 339 IQNSSRISNELMHITDWLPTFYSAA 363
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 72/151 (47%), Gaps = 23/151 (15%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRY---------- 624
+DG+D W + + SKR ++L NID+ +S+ GK+KL+ K ++Y
Sbjct: 374 MDGVDQWDTIVSGKESKRESVLLNIDEVEDVSSALIGKYKLIINGKNIQYNDYYGDNGTS 433
Query: 625 ----QVDLTGGPDQVYLSGLSDREWLALAMR---KLRDAASIQCGPVKEVP-CEPQIAPC 676
+ +++ + S + L ++ KLR+ A + C C + C
Sbjct: 434 VSYPEYNVSSVLHSIAGSTIFHATNFTLNIKDVIKLRNKARVVCNNFTSYSKC---VDKC 490
Query: 677 LFDIKNDPCEKNNLADRSED--QRINHYTTE 705
LFDI NDPCE +L++ + +++N++ E
Sbjct: 491 LFDIYNDPCETTDLSNEHPEIVKKLNYFIAE 521
>gi|296194262|ref|XP_002744878.1| PREDICTED: arylsulfatase B [Callithrix jacchus]
Length = 534
Score = 329 bits (843), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 160/331 (48%), Positives = 219/331 (66%), Gaps = 16/331 (4%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS ++
Sbjct: 42 ASRPPHLVFVLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TG++ IHTG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT
Sbjct: 101 TGRYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160
Query: 176 RGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
RGF+++ GY G +DY+ H A + LD R E A YST++FT A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
+I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++ +D
Sbjct: 221 TTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMD 275
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
E+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG G
Sbjct: 276 EAVGNVTAALKSHGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVG 331
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+ SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLA 362
Score = 42.7 bits (99), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145
>gi|432885639|ref|XP_004074694.1| PREDICTED: arylsulfatase B-like [Oryzias latipes]
Length = 520
Score = 329 bits (843), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 224/347 (64%), Gaps = 17/347 (4%)
Query: 44 FTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY 102
+ + +V + V + P PHI+FI+ADD GWNDVG+H +I TPN+D L+ G+ L+NYY
Sbjct: 12 WMMIVVMILPVTLAAPRPHIVFIMADDFGWNDVGYHN-SEIKTPNLDLLSAKGVRLQNYY 70
Query: 103 TVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKW 162
LC+PSR+ +MTG++ IHTGMQH +++ C+ +PL EK+LPQ LKE GY T +VGKW
Sbjct: 71 VQPLCSPSRNQLMTGRYQIHTGMQHQIIWPCQPYCVPLDEKLLPQLLKEAGYATHMVGKW 130
Query: 163 HLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDL 216
HLG YKK+ PT RGF+S+ GY+ G +DY+ H +A + LD+R E A
Sbjct: 131 HLGMYKKDCLPTHRGFDSYFGYYLGSEDYYTHTRCYPITALNLTRCALDLRDGEEVATAY 190
Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
G YST++F+ AV +I H+ +PLFLY+A A H EPLQ P+ Y+ + I+D
Sbjct: 191 KGAYSTELFSQRAVSVIAKHNASQPLFLYVAMQAVH-----EPLQVPERYVTPYSFIKDV 245
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
R K+A ++ +DE+VG + AL Q + N+++VF +DNGG SNWPLRG
Sbjct: 246 SRRKYAGMVSAMDEAVGNISLALRQAGLWDNTVLVFSTDNGGQTRA----GGSNWPLRGR 301
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
K LWEGG+RG G + SPLLE+ G V + +H+SDWLPTL+ A S
Sbjct: 302 KWALWEGGIRGVGFVSSPLLENPGTVNNELMHISDWLPTLVRLAGGS 348
Score = 42.7 bits (99), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 93 MQHQIIWPCQPYCVPLDEKLLPQLLKEAGYATHMVG 128
>gi|427779723|gb|JAA55313.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
Length = 593
Score = 328 bits (841), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 170/361 (47%), Positives = 211/361 (58%), Gaps = 39/361 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+FILADDLGW+DV FHG QIPTPN+D LA G+IL NYY CTPSR+A+MTG +
Sbjct: 36 PHIVFILADDLGWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALMTGLY 95
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PI TGMQ + E GLP +ILPQYLKE GY T +VGKWHLG YK+ TPT RGF+
Sbjct: 96 PIRTGMQGMPIDVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTCRGFD 155
Query: 180 SHLGYWTGHQDYFDHSAEEMK--MW----------------------------------- 202
S GY+ G DYF H+ ++ W
Sbjct: 156 SFYGYYYGESDYFAHTISYVRHLSWAFFLTRKCXCRGFDSFYGYYYGESDYFAHTISYEN 215
Query: 203 --GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPL 260
GLD + +P W G YST VFT A II N + +PL L + H ATH A E L
Sbjct: 216 HTGLDFWLNKKPVWSEIGTYSTSVFTKRAQYIIENRTKSKPLLLVITHQATHCALERERL 275
Query: 261 QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
QAP ++ +I + R+ +A ++ LD+SVG+VV+AL ML N+IIVF SDNGG
Sbjct: 276 QAPQENIDKFPYIGEKNRTIYAGMVDALDQSVGQVVKALSDAGMLENTIIVFSSDNGGGP 335
Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + + NWPLRG K T+WEGG R A +WSPLL R V+ Q +H++DWLPT S A
Sbjct: 336 WGEHNSRGFNWPLRGAKGTVWEGGTRAAAFVWSPLLARRRRVSHQLMHITDWLPTFYSIA 395
Query: 381 N 381
Sbjct: 396 G 396
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 61/139 (43%), Gaps = 22/139 (15%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-ALTKGKWKLV---KVVKVMRYQVDLT 629
E+DG D+W LS S R +L+N D ++ S AL ++KLV RY+V
Sbjct: 404 ELDGHDMWRPLSHGRRSPRVELLYNYDYKYTKSAALRNHRYKLVLDETGRFADRYKVPGG 463
Query: 630 GGP--DQVYLSGLSDREWLALAMRKL------------RDAASIQCGPVKEVPCE-PQIA 674
P D YL+ S A +R R A + CG K++ +
Sbjct: 464 SRPRHDLDYLAAKSTA---ASVLRDFYKKKQLNFPKSWRRKARLTCGKQKKINFSWSATS 520
Query: 675 PCLFDIKNDPCEKNNLADR 693
LFD+ DPCE NNLA +
Sbjct: 521 VYLFDVVTDPCELNNLASK 539
>gi|187956367|gb|AAI50662.1| Arsb protein [Mus musculus]
Length = 431
Score = 328 bits (841), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 156/335 (46%), Positives = 223/335 (66%), Gaps = 16/335 (4%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
++ PPH++F+LADDLGWND+GFHG I TP++DALA G++L NYY LCTPSRS +
Sbjct: 41 GATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQL 99
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TG++ IH G+QH ++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT
Sbjct: 100 LTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPT 159
Query: 175 FRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
RGF+++ GY G +DY+ H A E + LD+R EPA + + YST++FT
Sbjct: 160 RRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKR 219
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
A +I NH ++PLFLYLA + H +PLQ P+ Y+ + I+D R +A ++ +
Sbjct: 220 ATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLM 274
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
DE+VG V +AL+ + +N++ +F +DNGG + +NWPLRG K TLWEGG+RG
Sbjct: 275 DEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGT 330
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
G + SPLL+ +G+ + + +H+SDWLPTL+ A S
Sbjct: 331 GFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGS 365
Score = 39.7 bits (91), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH ++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVG 145
>gi|380025784|ref|XP_003696648.1| PREDICTED: arylsulfatase B-like [Apis florea]
Length = 579
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 164/347 (47%), Positives = 226/347 (65%), Gaps = 9/347 (2%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
+ ++LPL T+ V L PHII +ADDLGWNDVGFHG +QIPTPNIDALAY+
Sbjct: 9 ITISLLPLILTIDCV---LGLYETAPHIIVFMADDLGWNDVGFHGSNQIPTPNIDALAYN 65
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
GIIL +Y + TPSR A TG +PI GMQ + + G E GLPL KILP++L+ LGY
Sbjct: 66 GIILNRHYVLPSSTPSRIAFFTGLYPIRIGMQGDGIRGGEPRGLPLHIKILPEHLRGLGY 125
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
T+++GKWH+G++ +YTP RGF++ G++ H Y+D+ M G DM R +PA
Sbjct: 126 TTKLIGKWHMGYHTPQYTPLHRGFDTFFGFYNSHITYYDYEYSNQNMTGYDMHRGDDPAH 185
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH-RHI 273
+ +Y TD+FT EA+ II NH PL+L ++H A H+ P++ PD + I
Sbjct: 186 GIKREYVTDLFTKEAIKIIENHELPRPLYLQISHLAVHA-----PIEQPDDSSSDEIIQI 240
Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
+ R K+A ++ KLDESVG++V AL ++ ML +S+I+F++DNG A+ G N SN+PL
Sbjct: 241 REPNRRKYAKMVSKLDESVGRIVHALGEKGMLRDSLILFLTDNGAASIGRYRNYGSNYPL 300
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
RG K TL+EGGVRG +WS LE V ++ +H++DWLPTL SAA
Sbjct: 301 RGTKYTLYEGGVRGVAALWSSRLEKAARVFKKLIHITDWLPTLYSAA 347
>gi|825628|emb|CAA51272.1| arylsulfatase [Homo sapiens]
gi|189067435|dbj|BAG37417.1| unnamed protein product [Homo sapiens]
Length = 533
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 39 AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 98 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 217
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 218 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 272
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 329 VGFVASPLLKQKGVKNRELIHISDWLPTLMKLA 361
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 109 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 144
>gi|26350439|dbj|BAC38859.1| unnamed protein product [Mus musculus]
Length = 431
Score = 327 bits (839), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 155/335 (46%), Positives = 223/335 (66%), Gaps = 16/335 (4%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
++ PPH++F+LADDLGWND+GFHG I TP++DALA G++L NYY LCTPSRS +
Sbjct: 41 GATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQL 99
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TG++ IH G+QH ++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT
Sbjct: 100 LTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPT 159
Query: 175 FRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
RGF+++ GY G +DY+ H A E + LD+R EPA + + YST++FT
Sbjct: 160 RRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKR 219
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
A +I NH ++PLFLYLA + H +PLQ P+ Y+ + I+D R +A ++ +
Sbjct: 220 ATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLM 274
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
DE+VG V +AL+ + +N++ +F +DNGG + +NWPLRG K TLWEGG+RG
Sbjct: 275 DEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGT 330
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
G + SPLL+ +G+ + + +H++DWLPTL+ A S
Sbjct: 331 GFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGS 365
Score = 39.7 bits (91), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH ++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVG 145
>gi|332224806|ref|XP_003261559.1| PREDICTED: arylsulfatase B isoform 1 [Nomascus leucogenys]
Length = 535
Score = 327 bits (839), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 41 AGASRPPHLVFVLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 219
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 220 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 274
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 275 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 330
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 331 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 363
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146
>gi|344272682|ref|XP_003408160.1| PREDICTED: arylsulfatase B [Loxodonta africana]
Length = 532
Score = 327 bits (839), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 161/329 (48%), Positives = 215/329 (65%), Gaps = 16/329 (4%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPH++F+LADDLGWNDVGFHG I TP++DALA G+ L NYY LCTPSRS +++G+
Sbjct: 43 PPHLVFLLADDLGWNDVGFHG-SSIRTPHLDALAAGGVRLDNYYVQPLCTPSRSQLLSGR 101
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT RGF
Sbjct: 102 YQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGF 161
Query: 179 ESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
+++ GY G +DY+ H A + LD R E A YST+VFT A +
Sbjct: 162 DTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTERATAL 221
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I NH ++PLFLYLA + H EPLQ P+ YL + IED R +A ++ +DE+V
Sbjct: 222 IANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIEDKNRRHYAGMVSLMDEAV 276
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G V AL++R + N++ +F +DNGG L +NWPLRG K TLWEGGVRG G +
Sbjct: 277 GNVTAALKKRGLWDNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWTLWEGGVRGVGFVA 332
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 333 SPLLKQKGVKNRELIHISDWLPTLVKLAG 361
Score = 42.7 bits (99), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 108 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 143
>gi|441598315|ref|XP_004087449.1| PREDICTED: arylsulfatase B isoform 2 [Nomascus leucogenys]
Length = 415
Score = 327 bits (838), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 41 AGASRPPHLVFVLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 219
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 220 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 274
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 275 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 330
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 331 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 363
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146
>gi|179030|gb|AAA51779.1| arylsulfatase B precursor [Homo sapiens]
Length = 533
Score = 327 bits (838), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 39 AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 98 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 217
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 218 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 272
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 329 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 361
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 109 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 144
>gi|77993374|ref|NP_254278.1| arylsulfatase B precursor [Rattus norvegicus]
gi|148887336|sp|P50430.2|ARSB_RAT RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S
gi|81158016|tpe|CAI84982.1| TPA: arylsulfatase B [Rattus norvegicus]
gi|195539740|gb|AAI68241.1| Arylsulfatase B [Rattus norvegicus]
Length = 528
Score = 327 bits (838), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 156/334 (46%), Positives = 224/334 (67%), Gaps = 16/334 (4%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
++ PPH++F+LADDLGWND+GFHG I TP++DALA G++L NYY LCTPSRS ++
Sbjct: 36 AAPPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLL 94
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TG++ IH G+QH ++ C+ +PL EK+LPQ LK+ GY T +VGKWHLG Y+KE PT
Sbjct: 95 TGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTR 154
Query: 176 RGFESHLGYWTGHQDYFDHSAEEM------KMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
RGF+++ GY G +DY+ H A LD+R EPA + YST++FT A
Sbjct: 155 RGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRA 214
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
+I NH ++PLFLYLA + H +PLQ P+ Y+ + I+D R +A ++ LD
Sbjct: 215 TTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLD 269
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
E+VG V +AL+ R + +N++++F +DNGG + +NWPLRG K TLWEGG+RGAG
Sbjct: 270 EAVGNVTKALKSRGLWNNTVLIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGAG 325
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+ SPLL+ +G+ + + +H++DWLPTL++ A S
Sbjct: 326 FVASPLLKQKGVKSRELMHITDWLPTLVNLAGGS 359
>gi|38569405|ref|NP_000037.2| arylsulfatase B isoform 1 precursor [Homo sapiens]
gi|114223|sp|P15848.1|ARSB_HUMAN RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
Flags: Precursor
gi|179077|gb|AAA51784.1| arylsulfatase B precursor (EC 3.1.6.1) [Homo sapiens]
gi|119616227|gb|EAW95821.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
gi|119616229|gb|EAW95823.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
Length = 533
Score = 327 bits (838), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 39 AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 98 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 217
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 218 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 272
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 329 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 361
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 109 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 144
>gi|410226860|gb|JAA10649.1| arylsulfatase B [Pan troglodytes]
gi|410292332|gb|JAA24766.1| arylsulfatase B [Pan troglodytes]
Length = 535
Score = 327 bits (838), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 41 AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 219
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 220 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 274
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 275 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 330
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 331 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 363
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146
>gi|410226852|gb|JAA10645.1| arylsulfatase B [Pan troglodytes]
gi|410226856|gb|JAA10647.1| arylsulfatase B [Pan troglodytes]
gi|410226858|gb|JAA10648.1| arylsulfatase B [Pan troglodytes]
gi|410292328|gb|JAA24764.1| arylsulfatase B [Pan troglodytes]
Length = 534
Score = 327 bits (838), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 40 AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 99 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 218
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 219 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 273
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 274 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 329
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 330 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 362
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145
>gi|74140818|dbj|BAE34455.1| unnamed protein product [Mus musculus]
Length = 458
Score = 327 bits (838), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 156/335 (46%), Positives = 223/335 (66%), Gaps = 16/335 (4%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
++ PPH++F+LADDLGWND+GFHG I TP++DALA G++L NYY LCTPSRS +
Sbjct: 41 GATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQL 99
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TG++ IH G+QH ++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT
Sbjct: 100 LTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPT 159
Query: 175 FRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
RGF+++ GY G +DY+ H A E + LD+R EPA + + YST++FT
Sbjct: 160 RRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKR 219
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
A +I NH ++PLFLYLA + H +PLQ P+ Y+ + I+D R +A ++ +
Sbjct: 220 ATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLM 274
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
DE+VG V +AL+ + +N++ +F +DNGG + +NWPLRG K TLWEGG+RG
Sbjct: 275 DEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGT 330
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
G + SPLL+ +G+ + + +H+SDWLPTL+ A S
Sbjct: 331 GFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGS 365
Score = 39.7 bits (91), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH ++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVG 145
>gi|426384277|ref|XP_004058697.1| PREDICTED: arylsulfatase B-like [Gorilla gorilla gorilla]
Length = 408
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 41 AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 219
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 220 RAIALITNHPREKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 274
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 275 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 330
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 331 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 363
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146
>gi|380025315|ref|XP_003696421.1| PREDICTED: arylsulfatase B-like [Apis florea]
Length = 546
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 154/327 (47%), Positives = 225/327 (68%), Gaps = 8/327 (2%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II I+ADDLGWNDV FHG +QIPTPNIDALAY+G+IL+ +Y + +CTPSR+A +TG++
Sbjct: 24 PNIIVIMADDLGWNDVSFHGANQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRY 83
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PI TGMQ L E +PL+ +LP+YL++LGY T +VGKWH+G+Y +TPT RGF+
Sbjct: 84 PIRTGMQGYPLKAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFD 143
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMR----RDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
+ GY+ G+ YF+H+ ++ G D+ ++L A++ +Y TD+ T A +II N
Sbjct: 144 TFFGYYNGYISYFNHTIKQNNHVGYDLHYHNSKNLSVAYNF--EYITDLITERAENIIKN 201
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN--IHRHIEDFKRSKFAAILHKLDESVG 293
H +PL+L L+H A HS++ E ++ D +IED+ R K+A ++ +DESVG
Sbjct: 202 HDRRKPLYLQLSHLAVHSSDAKEVMEVRDEQETNATLEYIEDYNRRKYAGVVTAMDESVG 261
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
+V++AL Q ML NSIIVF+SDNG G N SN+PLRG+K TL+EGG+RG ++S
Sbjct: 262 RVIKALGQSSMLENSIIVFMSDNGAQTEGLLENYGSNYPLRGLKFTLFEGGIRGVACVYS 321
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA 380
L+++ ++ + +H++DWLPT SAA
Sbjct: 322 RLIQNSSRISNELIHITDWLPTFYSAA 348
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 70/150 (46%), Gaps = 22/150 (14%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV--KVVKVMRYQVD----- 627
+DG+D W + SKR ++L NID+ +S+ GK+KL+ K ++ Y D
Sbjct: 359 MDGVDQWDTIVSGRESKRGSVLLNIDEVENVSSALLGKYKLINGKNIQYSDYYGDNGTSV 418
Query: 628 ------LTGGPDQVYLSGLSDREWLALAMR---KLRDAASIQCGPVKEVP-CEPQIAPCL 677
++ S + L + KLR+ A + C P C + CL
Sbjct: 419 SYPEYNVSSVLHSFVGSTIFHTTNFTLNIEDVIKLRNKARVVCNNFTSYPKC---VDKCL 475
Query: 678 FDIKNDPCEKNNLADRSED--QRINHYTTE 705
FD+ NDPCE +L+D + +++N++ TE
Sbjct: 476 FDVYNDPCETTDLSDEYPEIVKKLNYFITE 505
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 35/61 (57%), Gaps = 6/61 (9%)
Query: 754 EGMRKLRDAASIQCGPVKEVP-CEPQIAPCLFDIKNDPCEKNNLADRSE--VQRINHYTT 810
E + KLR+ A + C P C + CLFD+ NDPCE +L+D V+++N++ T
Sbjct: 448 EDVIKLRNKARVVCNNFTSYPKC---VDKCLFDVYNDPCETTDLSDEYPEIVKKLNYFIT 504
Query: 811 E 811
E
Sbjct: 505 E 505
>gi|158255166|dbj|BAF83554.1| unnamed protein product [Homo sapiens]
Length = 413
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 39 AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 98 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 217
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 218 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 272
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 329 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 361
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 109 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 144
>gi|410226854|gb|JAA10646.1| arylsulfatase B [Pan troglodytes]
gi|410292330|gb|JAA24765.1| arylsulfatase B [Pan troglodytes]
Length = 414
Score = 327 bits (837), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 40 AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 99 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 218
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 219 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 273
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 274 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 329
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 330 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 362
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145
>gi|38569407|ref|NP_942002.1| arylsulfatase B isoform 2 precursor [Homo sapiens]
gi|20809799|gb|AAH29051.1| Arylsulfatase B [Homo sapiens]
gi|119616228|gb|EAW95822.1| arylsulfatase B, isoform CRA_b [Homo sapiens]
Length = 413
Score = 327 bits (837), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 39 AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 98 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 217
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 218 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 272
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 329 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 361
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 109 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 144
>gi|410260404|gb|JAA18168.1| arylsulfatase B [Pan troglodytes]
gi|410260406|gb|JAA18169.1| arylsulfatase B [Pan troglodytes]
gi|410260408|gb|JAA18170.1| arylsulfatase B [Pan troglodytes]
gi|410341765|gb|JAA39829.1| arylsulfatase B [Pan troglodytes]
gi|410341769|gb|JAA39831.1| arylsulfatase B [Pan troglodytes]
Length = 534
Score = 326 bits (836), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 40 AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 99 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 218
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 219 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 273
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 274 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 329
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 330 VGFVASPLLKQKGMKNRELIHISDWLPTLVKLA 362
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145
>gi|410260412|gb|JAA18172.1| arylsulfatase B [Pan troglodytes]
gi|410341771|gb|JAA39832.1| arylsulfatase B [Pan troglodytes]
Length = 535
Score = 326 bits (836), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 41 AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 219
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 220 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 274
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 275 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 330
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 331 VGFVASPLLKQKGMKNRELIHISDWLPTLVKLA 363
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146
>gi|242008416|ref|XP_002425002.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212508631|gb|EEB12264.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 532
Score = 326 bits (836), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 170/350 (48%), Positives = 230/350 (65%), Gaps = 12/350 (3%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L++ V + PPHII IL DDLGW D GFHG DQI TPN+DALAYSG+IL +
Sbjct: 10 FIIVLNLAVVCKSTAQQPPHIITILIDDLGWTDTGFHGSDQIKTPNMDALAYSGMILNRH 69
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y + CTPSRSA++TG +PI TGMQ L G + LPLS K+ P++LK LGYRT +VGK
Sbjct: 70 YVLPSCTPSRSALLTGLYPIRTGMQGMPLKGGDVRNLPLSFKLKPEFLKNLGYRTHLVGK 129
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH------SAEEMK-MWGLDMRRDLEPAW 214
WHLG+ + P RGF+S GY+ G+ DYF + E+++ +G D+ R+ E
Sbjct: 130 WHLGYRTINHLPNQRGFDSFFGYYNGYVDYFKFGHNQTVAGEKIEYFYGYDLHRNGEIYQ 189
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP-DHYLN-IHRH 272
Y+T +FT EA II NH+ EPL+LY +H ATH+ + ++ P D +N + H
Sbjct: 190 TDKDTYATRLFTREAEKIIKNHNESEPLYLYFSHLATHTGDDDIGMEVPEDADVNKTYGH 249
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL--NAASN 330
I+ + R FA L +LD+SVG+V+EAL+++ ML NSII+ +SDNGG +L N +SN
Sbjct: 250 IKHYGRRAFAGCLEELDKSVGEVMEALKEKNMLDNSIILIMSDNGGHTVSVDLPPNWSSN 309
Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
WPL G K TL+EGGVR LIWSPLL +G++ + ++H++DWLPTL SAA
Sbjct: 310 WPLGGTKFTLFEGGVRSVALIWSPLL-PKGVINDDFIHITDWLPTLYSAA 358
>gi|109077718|ref|XP_001108389.1| PREDICTED: arylsulfatase B isoform 6 [Macaca mulatta]
Length = 534
Score = 326 bits (836), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 159/332 (47%), Positives = 218/332 (65%), Gaps = 16/332 (4%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS ++
Sbjct: 42 ASRPPHLVFVLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160
Query: 176 RGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
RGF+++ GY G +DY+ H A + LD R E A YST++FT A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
+I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++ +D
Sbjct: 221 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
E+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG G
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVG 331
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLAG 363
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145
>gi|297675538|ref|XP_002815731.1| PREDICTED: arylsulfatase B [Pongo abelii]
Length = 534
Score = 326 bits (836), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 158/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 40 AGASQPPHLVFLLADDLGWNDVGFHG-SRIHTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 99 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 218
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 219 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 273
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGG+RG
Sbjct: 274 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGIRG 329
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 330 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 362
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145
>gi|410260410|gb|JAA18171.1| arylsulfatase B [Pan troglodytes]
gi|410341767|gb|JAA39830.1| arylsulfatase B [Pan troglodytes]
Length = 414
Score = 326 bits (835), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 40 AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 99 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 218
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 219 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 273
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 274 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 329
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 330 VGFVASPLLKQKGMKNRELIHISDWLPTLVKLA 362
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145
>gi|114599506|ref|XP_001140908.1| PREDICTED: arylsulfatase B isoform 2 [Pan troglodytes]
Length = 415
Score = 326 bits (835), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 41 AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 219
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 220 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 274
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 275 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 330
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 331 VGFVASPLLKQKGMKNRELIHISDWLPTLVKLA 363
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146
>gi|109077724|ref|XP_001108177.1| PREDICTED: arylsulfatase B isoform 2 [Macaca mulatta]
Length = 414
Score = 325 bits (834), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 159/332 (47%), Positives = 218/332 (65%), Gaps = 16/332 (4%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS ++
Sbjct: 42 ASRPPHLVFVLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160
Query: 176 RGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
RGF+++ GY G +DY+ H A + LD R E A YST++FT A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
+I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++ +D
Sbjct: 221 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
E+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG G
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVG 331
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLAG 363
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145
>gi|348557289|ref|XP_003464452.1| PREDICTED: arylsulfatase B-like [Cavia porcellus]
Length = 520
Score = 325 bits (834), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 158/331 (47%), Positives = 215/331 (64%), Gaps = 16/331 (4%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPH++ +LADDLGWNDVGFHG ++ TP++DALA G+ L NYYT LCTPSRS ++TG+
Sbjct: 31 PPHLVLVLADDLGWNDVGFHG-SRLRTPHLDALAAGGVRLDNYYTQPLCTPSRSQLLTGR 89
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT RGF
Sbjct: 90 YQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGF 149
Query: 179 ESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
+++ GY G +DYF H A + LD R E A + YS ++F A+ +
Sbjct: 150 DTYFGYLLGSEDYFSHEHCVFIKALNVTRCALDFRDGEEVATEYKNMYSANIFANRAISL 209
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I NH ++PLFLYLA + H EPLQ P+ YL + I D R +A ++ +DE+V
Sbjct: 210 IANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIRDKNRRLYAGMVSLMDEAV 264
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G V ALE + +N++ +F +DNGG L +NWPLRG K TLWEGG+RG G +
Sbjct: 265 GNVTAALESHGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWTLWEGGIRGVGFVA 320
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
SPLL+ +G+ + +H+SDWLPTL+ A S
Sbjct: 321 SPLLKQKGVRTRELIHISDWLPTLVKLAGGS 351
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 96 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYATHMVG 131
>gi|125656171|ref|NP_033842.3| arylsulfatase B precursor [Mus musculus]
gi|81158036|tpe|CAI84992.1| TPA: arylsulfatase B [Mus musculus]
gi|195934801|gb|AAI68412.1| Arylsulfatase B [synthetic construct]
Length = 534
Score = 325 bits (834), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 155/335 (46%), Positives = 223/335 (66%), Gaps = 16/335 (4%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
++ PPH++F+LADDLGWND+GFHG I TP++DALA G++L NYY LCTPSRS +
Sbjct: 41 GATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQL 99
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TG++ IH G+QH ++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT
Sbjct: 100 LTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPT 159
Query: 175 FRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
RGF+++ GY G +DY+ H A E + LD+R EPA + + YST++FT
Sbjct: 160 RRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKR 219
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
A +I NH ++PLFLYLA + H +PLQ P+ Y+ + I+D R +A ++ +
Sbjct: 220 ATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLM 274
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
DE+VG V +AL+ + +N++ +F +DNGG + +NWPLRG K TLWEGG+RG
Sbjct: 275 DEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGT 330
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
G + SPLL+ +G+ + + +H++DWLPTL+ A S
Sbjct: 331 GFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGS 365
Score = 40.0 bits (92), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH ++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVG 145
>gi|122065132|sp|P50429.3|ARSB_MOUSE RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
Flags: Precursor
gi|74152170|dbj|BAE32375.1| unnamed protein product [Mus musculus]
Length = 534
Score = 325 bits (833), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 155/335 (46%), Positives = 223/335 (66%), Gaps = 16/335 (4%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
++ PPH++F+LADDLGWND+GFHG I TP++DALA G++L NYY LCTPSRS +
Sbjct: 41 GATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQL 99
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TG++ IH G+QH ++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT
Sbjct: 100 LTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPT 159
Query: 175 FRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
RGF+++ GY G +DY+ H A E + LD+R EPA + + YST++FT
Sbjct: 160 RRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKR 219
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
A +I NH ++PLFLYLA + H +PLQ P+ Y+ + I+D R +A ++ +
Sbjct: 220 ATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLM 274
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
DE+VG V +AL+ + +N++ +F +DNGG + +NWPLRG K TLWEGG+RG
Sbjct: 275 DEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGT 330
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
G + SPLL+ +G+ + + +H++DWLPTL+ A S
Sbjct: 331 GFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGS 365
Score = 40.0 bits (92), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH ++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVG 145
>gi|383847821|ref|XP_003699551.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
Length = 575
Score = 325 bits (833), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 158/323 (48%), Positives = 221/323 (68%), Gaps = 7/323 (2%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHII I+ADDLGWNDVGFHG +QIPTPNIDAL Y+GIIL +Y + TPSRSA +TG +
Sbjct: 27 PHIIVIMADDLGWNDVGFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRSAFLTGLY 86
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PI GMQ + + G E GLPL KILP++L++LGY T+++GKWH+G++ +YTP RGF+
Sbjct: 87 PIRIGMQGDGIRGGEPRGLPLDIKILPEHLRDLGYTTKLIGKWHMGYHTPQYTPLHRGFD 146
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG++ H Y+D+ M G D+ R +PA ++ +Y+TD+FT EAV +I H
Sbjct: 147 FFLGFYNSHISYYDYHYSNQNMSGYDLHRGDDPAHGINREYATDLFTKEAVRMIETHELP 206
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHR--HIEDFKRSKFAAILHKLDESVGKVVE 297
PL+L ++H A H+ PL+ P N R +I + R K+A ++ +LD+SVG++V
Sbjct: 207 RPLYLQISHLAVHA-----PLEQPRDEYNDGRFSYIREPNRRKYARMVSRLDDSVGRIVH 261
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL ++ ML +S+I+F++DNG A+ G N SN+PLRG K TL+EGGVR +WSP L+
Sbjct: 262 ALGEKGMLRDSLILFLTDNGAASVGRYRNWGSNYPLRGTKYTLYEGGVRSVAALWSPRLQ 321
Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
V+ Q +H++DWLPTL SAA
Sbjct: 322 KVARVSNQLIHITDWLPTLYSAA 344
>gi|386771363|ref|NP_730304.2| CG32191 [Drosophila melanogaster]
gi|229368437|gb|ACQ59088.1| MIP05773p [Drosophila melanogaster]
gi|383291992|gb|AAN11683.2| CG32191 [Drosophila melanogaster]
Length = 564
Score = 324 bits (831), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 163/335 (48%), Positives = 221/335 (65%), Gaps = 6/335 (1%)
Query: 52 DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
D A++ P+II I+ADD+G++DV F G + TPNIDALAY G +L Y +CTPSR
Sbjct: 19 DESAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSR 78
Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
A+++G++PIHTG QH V+ E L L+ ++P+ KE GY T +VGKWHLGF + EY
Sbjct: 79 GALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEY 138
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKM----WGLDMRRDLEPAWDLHGKYSTDVFTA 227
TPT RGF+ H GYW + DYF + +M + G D RR++E G Y TD+ TA
Sbjct: 139 TPTRRGFDYHFGYWGAYIDYFQRRS-KMPVANYSLGYDFRRNMELECRDRGVYVTDLLTA 197
Query: 228 EAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
EA +I +H+ E PLFL L+H A H+AN +PLQAP+ + +I+D R K+AA++
Sbjct: 198 EAERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMIS 257
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
KLD+SVG+++ AL L NSI++F SDNG + G N SN+PLRG KNT WEGGVR
Sbjct: 258 KLDQSVGRIITALSSTDQLENSIVIFYSDNGAPSVGMFSNTGSNFPLRGQKNTPWEGGVR 317
Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
AG IWS L++RG + Q ++V+DWLPTL AA+
Sbjct: 318 VAGAIWSSGLQARGSIFRQPLYVADWLPTLSRAAD 352
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 44/140 (31%), Positives = 63/140 (45%), Gaps = 23/140 (16%)
Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM--------- 622
++DGID+W LS + P ILH +DD W++SAL G+WK V
Sbjct: 360 KLDGIDLWPELSGSADAPHVPREILHILDDVWRLSALQMGQWKYVNGTTASGRYDSVLTY 419
Query: 623 -----------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
RY V + LS R + R A+++CG ++ C P
Sbjct: 420 RELDDLDPRDSRYAVTVRNSATSRALSRYDLRRLTQQRISLTRRLAAVRCGDLQR-SCNP 478
Query: 672 QIAPCLFDIKNDPCEKNNLA 691
+ CL+DI +DPCE+NNL
Sbjct: 479 LLEECLYDILSDPCEQNNLV 498
Score = 47.0 bits (110), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 21/95 (22%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
TT GR++ + + ++ + + + + + + ++ + + + R
Sbjct: 407 TTASGRYDSVLTYRELDDLDPRDSRYAVTVRNSATSRALSRYDLRRLTQQRISL--TRRL 464
Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
A+++CG ++ C P + CL+DI +DPCE+NNL
Sbjct: 465 AAVRCGDLQR-SCNPLLEECLYDILSDPCEQNNLV 498
>gi|380795845|gb|AFE69798.1| arylsulfatase B isoform 1 precursor, partial [Macaca mulatta]
Length = 506
Score = 324 bits (831), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 159/332 (47%), Positives = 217/332 (65%), Gaps = 16/332 (4%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
+S PPH++F+LADDLGWNDVGFHG I TP++DALA G++L NYYT LCTPSRS ++
Sbjct: 14 ASRPPHLVFVLADDLGWNDVGFHG-SCIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 72
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT
Sbjct: 73 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 132
Query: 176 RGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
RGF+++ GY G +DY+ H A + LD R E A YST++FT A
Sbjct: 133 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 192
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
+I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++ +D
Sbjct: 193 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 247
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
E+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG G
Sbjct: 248 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVG 303
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 304 FVASPLLKQKGVKNRELIHISDWLPTLVKLAG 335
Score = 42.7 bits (99), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 82 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 117
>gi|260794559|ref|XP_002592276.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
gi|229277492|gb|EEN48287.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
Length = 520
Score = 324 bits (830), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 160/360 (44%), Positives = 235/360 (65%), Gaps = 19/360 (5%)
Query: 31 RTR-IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNID 89
R+R + F +L + F ++SG P+I+FILADD GWND+G+HG I TPN+D
Sbjct: 14 RSRPFLPFLLLVCVLSGGKTFAQ--SASGKPNIVFILADDYGWNDIGYHG-SVIRTPNLD 70
Query: 90 ALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
LA G+ L+NYY +CTPSRS +MTG++ IH G+QH++++ + GLPL E LPQ L
Sbjct: 71 RLAAEGVKLENYYIQPICTPSRSQLMTGRYQIHFGLQHSIIWPPQPSGLPLDEVTLPQRL 130
Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMW-----G 203
KE GY T IVGKWHLGFYK+EYTP RGF++ G+ TG ++++ H ++ M + G
Sbjct: 131 KEGGYSTHIVGKWHLGFYKEEYTPLHRGFDTFYGFLTGSENHYSHRNSGGMPGFRPGWNG 190
Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
LD+R P D +G YST +F +A+++I +P+FLYL A H+ PLQAP
Sbjct: 191 LDLRDQDRPVTDQNGTYSTHLFAKKAIEVIAQQDKSKPMFLYLPFQAVHA-----PLQAP 245
Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
Y++++RHI D+ R +AA+ +DE+VG + +AL+Q + N+++VF +DNGG
Sbjct: 246 QKYISMYRHINDYNRRMYAAMTTAMDEAVGNITDALKQYGLWDNTVLVFSTDNGGQV--- 302
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+ +NWPLRG K +LWEGG+RG G + SPL++++G ++ +H+SDW PTL+ A S
Sbjct: 303 -MAGGNNWPLRGWKGSLWEGGIRGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLAGGS 361
>gi|351697185|gb|EHB00104.1| Arylsulfatase B, partial [Heterocephalus glaber]
Length = 503
Score = 323 bits (828), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 158/329 (48%), Positives = 216/329 (65%), Gaps = 16/329 (4%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPH++F+LADDLGWNDVGFHG ++ TP++DALA G+ L NYYT LCTPSRS ++TG+
Sbjct: 15 PPHLVFVLADDLGWNDVGFHG-SRLRTPHLDALAAGGVQLDNYYTQPLCTPSRSQLLTGR 73
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH +++ C+ +PL EK+LPQ L+E GY T +VGKWHLG Y+KE PT RGF
Sbjct: 74 YQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLQEAGYATHMVGKWHLGMYQKECLPTRRGF 133
Query: 179 ESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
+++ GY G +DY+ H A + LD R E A YST++FT A +
Sbjct: 134 DTYFGYLLGSEDYYTHEHCVFIKALNVTRCALDFRDGEEVATQYKNLYSTNIFTNRATSL 193
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++ +DE+V
Sbjct: 194 IANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHLYAGMVSLMDEAV 248
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G V AL+ + +N++ +F +DNGG L +NWPLRG K TLWEGGVRG G +
Sbjct: 249 GNVTAALKSHGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWTLWEGGVRGVGFVA 304
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 305 SPLLKQKGVKNRELIHISDWLPTLVKLAG 333
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ L+E GY T ++
Sbjct: 80 LQHQIIWPCQPSCVPLDEKLLPQLLQEAGYATHMVG 115
>gi|405956212|gb|EKC22964.1| Arylsulfatase B [Crassostrea gigas]
Length = 491
Score = 323 bits (828), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 155/325 (47%), Positives = 221/325 (68%), Gaps = 13/325 (4%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+ I+ADD G+ND+G+HG +I TPN+D LA G+ L+NYY +CTP+RS +M+G++
Sbjct: 24 PHILLIIADDFGYNDIGYHG-SEIKTPNLDKLAGEGVKLENYYVQPICTPTRSQLMSGRY 82
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH V+ + GLPL ILPQ LKE+GY T VGKWHLGFYK+EY PT RGF+
Sbjct: 83 QIHTGLQHGVIRPPQPNGLPLDSAILPQKLKEVGYSTHAVGKWHLGFYKEEYLPTNRGFD 142
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
SH GY TG +DYF H M G D+R ++ PA + G YST +F +A ++++NH+TD
Sbjct: 143 SHFGYLTGAEDYFKHDRCFGAMCGTDLRDNMNPA-NYTGVYSTHLFAQKAAEVVNNHNTD 201
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
+PLFLYL + H+ PLQ P+ Y + HI+D +R +A ++ +DE++G + +
Sbjct: 202 KPLFLYLPFQSVHA-----PLQVPEQYTKPYMHIQDKQRRTYAGMVSAMDEAIGNLTDVF 256
Query: 300 EQRRMLSNSIIVFVSDNGGAA-AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
++R M +N+++VF +DNGG AG N N+PLRG K++LWEGGV G GL+ +L+
Sbjct: 257 KKRGMWNNTLMVFSTDNGGQIHAGGN-----NFPLRGWKDSLWEGGVHGVGLVHGQMLKR 311
Query: 359 RGIVAEQYVHVSDWLPTLLSAANKS 383
+G V+ +HVSDW PTL+S +
Sbjct: 312 KGSVSRDLIHVSDWFPTLISMVGGT 336
>gi|350422929|ref|XP_003493332.1| PREDICTED: arylsulfatase B-like [Bombus impatiens]
Length = 581
Score = 323 bits (827), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 166/348 (47%), Positives = 235/348 (67%), Gaps = 10/348 (2%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
++ ++LPL + F L +S PHII +ADDLGWNDV FHG +QIPTPNIDAL Y+
Sbjct: 9 ISTSLLPLILFIDCAF-GLYETS--PHIIVFMADDLGWNDVSFHGSNQIPTPNIDALGYN 65
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
GIIL +Y + TPSR+A TG++PI GMQ + G E GLPL+ KILP++L+ LGY
Sbjct: 66 GIILNRHYVLPSSTPSRTAFFTGQYPIRIGMQGADIRGGEPRGLPLNIKILPEHLRGLGY 125
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
T+++GKWH+G+Y +YTP RGF++ LG++ + Y+D+S M G DM R +PA+
Sbjct: 126 TTKLIGKWHMGYYTPQYTPLHRGFDTFLGFYNSYISYYDYSYSNQNMSGYDMHRGDDPAY 185
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR--H 272
++ +Y+TD+FT EA++II NH + PL+L L+H A H+ PL+ P + N H
Sbjct: 186 GMNREYATDMFTREAINIIENHELNRPLYLQLSHLAVHA-----PLEQPMNVYNDREPIH 240
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
I + R K+A ++ KLDESVG++ AL ++ ML +S+I+F++DNG A+ G N SN+P
Sbjct: 241 IREPNRRKYAKMVSKLDESVGRIFHALGEKGMLKDSLILFLTDNGAASVGRYRNYGSNYP 300
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
LRG K TL+EGGVRG +WS L+ V+ + VH++DWLPTL SAA
Sbjct: 301 LRGTKYTLYEGGVRGVAALWSSRLQKAARVSNELVHITDWLPTLYSAA 348
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 37/73 (50%), Gaps = 11/73 (15%)
Query: 756 MRKLRDAASIQCGP-------VKEVPCEPQIAPCLFDIKNDPCEKNNLADR-SEVQR-IN 806
M +LR A++QC PC + CLFDI NDPCE N+A+ + R +N
Sbjct: 454 MVRLRQDATVQCNANFTYYRRQMLTPC--NVTECLFDIVNDPCETRNIAETYVRIARDLN 511
Query: 807 HYTTEVGYLDPKQ 819
++ E ++ KQ
Sbjct: 512 YFLEEYSHILVKQ 524
>gi|114326200|ref|NP_001041598.1| arylsulfatase B precursor [Canis lupus familiaris]
gi|81158050|tpe|CAI84999.1| TPA: arylsulfatase B [Canis lupus familiaris]
Length = 535
Score = 322 bits (826), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 159/329 (48%), Positives = 217/329 (65%), Gaps = 16/329 (4%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
H++F+LADDLGW+DVGFHG +I TP++DALA +G++L NYYT LCTPSRS ++TG++
Sbjct: 48 HLVFVLADDLGWHDVGFHG-SRIRTPHLDALAAAGVLLDNYYTQPLCTPSRSQLLTGRYQ 106
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT RGF++
Sbjct: 107 IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 166
Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+ GY G +DY+ H A + LD R E A YST++FT A +I
Sbjct: 167 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIS 226
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
NH ++PLFLYLA + H EPLQ P+ YL + I D R +A ++ +DE+VG
Sbjct: 227 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIHDKNRRYYAGMVSLMDEAVGN 281
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V AL+ + +N++ VF +DNGG L +NWPLRG K +LWEGGVRG G + SP
Sbjct: 282 VTAALKSHGLWNNTVFVFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASP 337
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKS 383
LL+ +G+ + + VH+SDWLPTL+ A S
Sbjct: 338 LLKRKGVKSRELVHISDWLPTLVGLAGGS 366
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146
>gi|195494733|ref|XP_002094965.1| GE22117 [Drosophila yakuba]
gi|194181066|gb|EDW94677.1| GE22117 [Drosophila yakuba]
Length = 565
Score = 322 bits (826), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 163/337 (48%), Positives = 218/337 (64%), Gaps = 10/337 (2%)
Query: 52 DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
D A++ P+II I+ADD+G++DV F G + TPNIDALAY G +L Y +CTPSR
Sbjct: 19 DESAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSR 78
Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
A+++G++PIHTG QH V+ E L L+ ++P+ KE GY T +VGKWHLGF + EY
Sbjct: 79 GALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEY 138
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKM------WGLDMRRDLEPAWDLHGKYSTDVF 225
TPT RGF+ H GYW + DYF + KM G D RR++E G Y TD+
Sbjct: 139 TPTRRGFDYHFGYWGAYIDYFQRRS---KMPVANYSLGYDFRRNMELECRDRGVYVTDLL 195
Query: 226 TAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAI 284
TAEA +I H+ E PLFL L+H A H+AN +PLQAP+ + +I+D R K+AA+
Sbjct: 196 TAEAERLIKGHAGKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAM 255
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
+ KLD+SVG+++ +L L NSI++F SDNG + G N SN+PLRG KNT WEGG
Sbjct: 256 ISKLDQSVGRIMSSLAATDQLENSIVIFYSDNGAPSVGMFANTGSNFPLRGQKNTPWEGG 315
Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
VR AG IWS L++RG + Q ++V+DWLPTL AA
Sbjct: 316 VRVAGAIWSSKLQARGSIFSQPLYVADWLPTLCRAAG 352
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 63/140 (45%), Gaps = 23/140 (16%)
Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM--------- 622
++DGID+W+ LS + P ILH +DD W+++AL G+WK V
Sbjct: 360 KLDGIDLWAELSGSADAPHVPREILHILDDVWRLAALQLGQWKYVNGTTAAGRYDSLLTY 419
Query: 623 -----------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
RY V + LS R + R A+++CG ++ C P
Sbjct: 420 RELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISITRRLAAVRCGDLQR-SCNP 478
Query: 672 QIAPCLFDIKNDPCEKNNLA 691
+ CL+DI DPCE+NNL
Sbjct: 479 LLEECLYDISTDPCEQNNLV 498
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 21/95 (22%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
TT GR++ + + ++ + ++ + + + + ++ + + + R
Sbjct: 407 TTAAGRYDSLLTYRELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISI--TRRL 464
Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
A+++CG ++ C P + CL+DI DPCE+NNL
Sbjct: 465 AAVRCGDLQR-SCNPLLEECLYDISTDPCEQNNLV 498
>gi|198466274|ref|XP_001353949.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
gi|198150525|gb|EAL29685.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
Length = 575
Score = 322 bits (825), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 159/329 (48%), Positives = 216/329 (65%), Gaps = 4/329 (1%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
++G P+II ILADD+G++DV F G + TPNIDALA+ G IL Y +CTPSR A++
Sbjct: 33 TAGRPNIIIILADDMGFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALL 92
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
+G++PIHTG QH V+ E GL L+ ++P+ ++ GY T ++GKWHLGF + EYTPT
Sbjct: 93 SGRYPIHTGTQHFVISNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTR 152
Query: 176 RGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
RGF+ H GYW + DY+ ++ G D RR++E G Y TD+ T EA +
Sbjct: 153 RGFDYHYGYWGAYIDYYQRRSKMPARNYSLGYDFRRNMELECRDRGVYVTDLLTNEAERV 212
Query: 233 IHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I +EPLFL L+H ATH+AN +PLQAP+ + +I+D R K+AA++ KLD+S
Sbjct: 213 IREREGQEEPLFLVLSHLATHTANEDDPLQAPEEEIRKFAYIKDPNRRKYAAMVSKLDQS 272
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
VG++V AL L NSI++F SDNG + G N SNWPLRG KNT WEGGVR AG I
Sbjct: 273 VGRIVSALNSTGQLENSIVIFYSDNGAPSVGMFANTGSNWPLRGQKNTPWEGGVRVAGAI 332
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
WS L++RG + Q ++V+DWLP+L AA
Sbjct: 333 WSAQLQARGNIFTQPIYVADWLPSLAHAA 361
Score = 72.8 bits (177), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 66/140 (47%), Gaps = 23/140 (16%)
Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV------------ 619
E+DGID+W L+ + P ILH +DDEW++++ G WK +
Sbjct: 370 ELDGIDLWPQLAGPVDAPHVPREILHILDDEWRVTSFQMGHWKYINGTTSAGQYDQVLTY 429
Query: 620 --------KVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
+ RY V + P L+ R + LR +A+++CG ++ C P
Sbjct: 430 RELDDLDPRESRYVVTVRNSPTSRALARHDIRRMTQQRVMTLRRSAAVRCGDLQR-GCNP 488
Query: 672 QIAPCLFDIKNDPCEKNNLA 691
+ C++DI +DPCEKNNL
Sbjct: 489 LVEECVYDIWDDPCEKNNLV 508
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/95 (23%), Positives = 51/95 (53%), Gaps = 3/95 (3%)
Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
TT G+++Q+ + ++ + ++ + + + + ++ + + + LR +
Sbjct: 417 TTSAGQYDQVLTYRELDDLDPRESRYVVTVRNSPTSRALARHDIRRMTQQR--VMTLRRS 474
Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
A+++CG ++ C P + C++DI +DPCEKNNL
Sbjct: 475 AAVRCGDLQR-GCNPLVEECVYDIWDDPCEKNNLV 508
>gi|395825538|ref|XP_003785985.1| PREDICTED: arylsulfatase B [Otolemur garnettii]
Length = 532
Score = 322 bits (824), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 156/329 (47%), Positives = 215/329 (65%), Gaps = 16/329 (4%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPH++F+LADDLGWNDVGFHG I TP++DALA G++L NYYT LCTPSRS ++TG+
Sbjct: 43 PPHLVFVLADDLGWNDVGFHG-SSIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGR 101
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ I G+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT RGF
Sbjct: 102 YQIRMGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGF 161
Query: 179 ESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
+++ GY G +DY+ H +A + LD R E A YST++FT A +
Sbjct: 162 DTYFGYLLGSEDYYSHERCTLINALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATAL 221
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++ +DE++
Sbjct: 222 ITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAI 276
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG G +
Sbjct: 277 GNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVA 332
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 333 SPLLKQKGVKNRELIHISDWLPTLVKLAG 361
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 108 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 143
>gi|194871740|ref|XP_001972898.1| GG15780 [Drosophila erecta]
gi|190654681|gb|EDV51924.1| GG15780 [Drosophila erecta]
Length = 565
Score = 322 bits (824), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 162/335 (48%), Positives = 219/335 (65%), Gaps = 6/335 (1%)
Query: 52 DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
D A++ P+II I+ADD+G++DV F G + TPNIDALAY G +L +Y +CTPSR
Sbjct: 19 DESAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSR 78
Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
A+++G++PIHTG QH V+ E L L+ ++P+ K GY T +VGKWHLGF + EY
Sbjct: 79 GALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEY 138
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKM----WGLDMRRDLEPAWDLHGKYSTDVFTA 227
TPT RGF+ H GYW + DYF + +M + G D RR++E G Y TD+ TA
Sbjct: 139 TPTRRGFDYHFGYWGAYIDYFQRRS-KMPVANYSLGYDFRRNMELECRDRGAYVTDLLTA 197
Query: 228 EAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
EA +I +H+ E PLFL L+H A H+AN +PLQAP+ + +I+D R K+AAI+
Sbjct: 198 EAERLIKDHADKEQPLFLMLSHLAAHTANKDDPLQAPEEEIQKFAYIKDPNRRKYAAIIS 257
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
KLD+SVG+++ L L NSI++F SDNG + G N SN+PLRG KNT WEGGVR
Sbjct: 258 KLDQSVGRIMSTLAATDQLENSIVIFYSDNGAPSVGMFANTGSNFPLRGQKNTPWEGGVR 317
Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
AG IWS L++RG + Q ++V+DWLPTL AA
Sbjct: 318 VAGAIWSSKLQARGSIFSQPLYVADWLPTLSHAAG 352
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 62/139 (44%), Gaps = 23/139 (16%)
Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM--------- 622
++DGID+W LS + P ILH +DD W+++AL G+WK V
Sbjct: 360 KLDGIDLWPELSGSADAPHVPREILHILDDIWRVAALQLGQWKYVNGTTAAGRYDSLLTY 419
Query: 623 -----------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
RY V + LS R + R A+++CG ++ C P
Sbjct: 420 RELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISITRRLAAVRCGDLQR-SCNP 478
Query: 672 QIAPCLFDIKNDPCEKNNL 690
+ CL+DI DPCE+NNL
Sbjct: 479 LLEECLYDISTDPCEQNNL 497
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 21/94 (22%), Positives = 47/94 (50%), Gaps = 3/94 (3%)
Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
TT GR++ + + ++ + ++ + + + + ++ + + + R
Sbjct: 407 TTAAGRYDSLLTYRELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISI--TRRL 464
Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 796
A+++CG ++ C P + CL+DI DPCE+NNL
Sbjct: 465 AAVRCGDLQR-SCNPLLEECLYDISTDPCEQNNL 497
>gi|348535399|ref|XP_003455188.1| PREDICTED: arylsulfatase B [Oreochromis niloticus]
Length = 519
Score = 322 bits (824), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 154/330 (46%), Positives = 218/330 (66%), Gaps = 16/330 (4%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PH++FILADD GW D+G+HG +I TPN+D L+ G+ L+NYY LCTPSR+ +MTG++
Sbjct: 30 PHVVFILADDFGWYDIGYHG-SEIRTPNLDKLSAGGVRLENYYVQPLCTPSRNQLMTGRY 88
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTGMQH +++ C+ +PL EK+LPQ +KE GY T +VGKWHLG YKK+ PT RGF+
Sbjct: 89 QIHTGMQHQIIWPCQPYCVPLDEKLLPQLMKEAGYATHMVGKWHLGMYKKDCLPTRRGFD 148
Query: 180 SHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
++LGY TG +DYF H + + LD+R E A G YST++ + A+ II
Sbjct: 149 TYLGYLTGSEDYFTHFRCYQSPSLNLSRCALDLRDGEEVATGYKGVYSTELLSQRAISII 208
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
H + +PLF+Y+A A H+ PLQ P+ Y+ + I+D R +A ++ +DE+VG
Sbjct: 209 ERHISQKPLFMYVALQAVHA-----PLQVPERYVTPYSFIKDTNRRLYAGMVSAMDEAVG 263
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
+ AL++ + +N+++VF +DNGG L+ SNWPLRG K +LWEGGVRG G + S
Sbjct: 264 NITMALQKEGLWNNTVLVFSTDNGGQ----TLSGGSNWPLRGRKWSLWEGGVRGVGFVAS 319
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
PLL+ G V + +H+SDWLPTL+ A S
Sbjct: 320 PLLKQPGTVNRELIHISDWLPTLVGLAGGS 349
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH +++ C+ +PL EK+LPQ +KE GY T ++
Sbjct: 94 MQHQIIWPCQPYCVPLDEKLLPQLMKEAGYATHMVG 129
>gi|291190498|ref|NP_001167123.1| Arylsulfatase B precursor [Salmo salar]
gi|223648254|gb|ACN10885.1| Arylsulfatase B precursor [Salmo salar]
Length = 528
Score = 322 bits (824), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 156/331 (47%), Positives = 214/331 (64%), Gaps = 16/331 (4%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHI+FILADD GWNDVG+HG +I TPN+D L+ G+ L+NYY LCTPSR+ +MTG+
Sbjct: 28 PPHIVFILADDFGWNDVGYHG-SEIKTPNLDKLSAKGVRLENYYVQPLCTPSRNQLMTGR 86
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ I TGMQH +++ C+ +PL EK+LPQ ++E GY T +VGKWHLG Y+K+ PT RGF
Sbjct: 87 YQIRTGMQHQIIWPCQPYCVPLDEKLLPQLMREAGYATHMVGKWHLGMYRKDCLPTRRGF 146
Query: 179 ESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
+S+ GY TG +DYF H + +D+R E A G YST +FT I
Sbjct: 147 DSYFGYLTGSEDYFSHQRCSYVPPLNVTRCAVDLREGEEVATGYTGTYSTQLFTDRVTSI 206
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I ++ +PLFLY+A A H+ PLQ P+ Y+ + I+D R +A ++ +DE+V
Sbjct: 207 IAKQNSKKPLFLYVALQAVHA-----PLQVPERYVAPYSFIKDPNRRLYAGMVSAMDEAV 261
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G + AL++ + N+++VF +DNGG L SNWPLRG K +LWEGGVRG G +
Sbjct: 262 GNITLALQETGLWDNTVLVFSTDNGGQ----TLAGGSNWPLRGRKWSLWEGGVRGVGFVT 317
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
SPLL+ G V+ Q +H+SDWLPTL+ A S
Sbjct: 318 SPLLKQPGTVSRQLIHISDWLPTLVGLAGGS 348
Score = 40.8 bits (94), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH +++ C+ +PL EK+LPQ ++E GY T ++
Sbjct: 93 MQHQIIWPCQPYCVPLDEKLLPQLMREAGYATHMVG 128
>gi|270008609|gb|EFA05057.1| hypothetical protein TcasGA2_TC015152 [Tribolium castaneum]
Length = 563
Score = 321 bits (823), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 163/368 (44%), Positives = 227/368 (61%), Gaps = 14/368 (3%)
Query: 25 LKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGP-----PHIIFILADDLGWNDVGFHG 79
+K L I+ V+ L + F+ + SS P P+II I+ADDLG+NDV FHG
Sbjct: 1 MKRLKKSQNIVIATVMVCVLGLLLYFI--LTSSKPSTAKKPNIIIIIADDLGYNDVSFHG 58
Query: 80 LDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLP 139
QIPTPN+ +A GIIL +YT CTPSR+A++TG++PI +GMQ L E LP
Sbjct: 59 SSQIPTPNLAKMATRGIILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPLKAGENRSLP 118
Query: 140 LSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEE 198
L+ +P + + LGY+T +VGKWHLG KE TP +GF+SH GYW G YFD+ S +
Sbjct: 119 LNMPTMPLHFQNLGYKTHLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVGYFDYVSFSK 178
Query: 199 MK----MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA 254
M + GLD+ EP W G+Y+T++FT ++D+I H PLFL ++H A H+
Sbjct: 179 MDNGTLVKGLDLHDQFEPVWGSQGRYATELFTERSLDVIEGHDVRVPLFLVVSHLAAHTG 238
Query: 255 NPYEPLQAPDHYLNIHR--HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVF 312
L PD H +I+D +R +A ++ LD S+G+++ L++++ML NSI++F
Sbjct: 239 QNGSELGVPDVDQTNHEFSYIQDPRRRLYAGVVSHLDASIGRIMAKLDEKQMLDNSIVLF 298
Query: 313 VSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDW 372
SDNG G N+ SNWPLRGVK + +EGGVR A I+SPL +G V+E +H+SDW
Sbjct: 299 FSDNGAQTVGMYENSGSNWPLRGVKFSDFEGGVRVAATIYSPLFHKKGYVSEHLIHISDW 358
Query: 373 LPTLLSAA 380
LPTL SAA
Sbjct: 359 LPTLYSAA 366
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 73/169 (43%), Gaps = 14/169 (8%)
Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVD 627
D ++ +IDGID W L+ N PS R IL NID+ + A+ + K+KL++ YQ
Sbjct: 369 DVAHLGQIDGIDQWDALTNNNPSNRTEILINIDEVDENFAIIRDKFKLIQGTPNYYYQQT 428
Query: 628 LT----------GGPDQVYL-SGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPC 676
L G DQ Y SG D + C + P C
Sbjct: 429 LLFNCKSGSYHEGTFDQYYGDSGRGPENPTPNPNHTTTDLS--WCRAPDQTPILNCTKGC 486
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKK 725
LFD+ DPCE N+ + SE + N ++ +F + P + ++ + K
Sbjct: 487 LFDLDKDPCETTNIIE-SEPEIANQLYEKIAQFWKELVPQRNKDTDPKS 534
Score = 45.4 bits (106), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 32/50 (64%)
Query: 503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVK 552
D ++ +IDGID W L+ N PS R IL NID+ + A+ R K+KL++
Sbjct: 369 DVAHLGQIDGIDQWDALTNNNPSNRTEILINIDEVDENFAIIRDKFKLIQ 418
>gi|157831133|pdb|1FSU|A Chain A, 4-Sulfatase (Human)
Length = 492
Score = 321 bits (823), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 158/330 (47%), Positives = 217/330 (65%), Gaps = 16/330 (4%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT L TPSRS ++T
Sbjct: 1 SRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLXTPSRSQLLT 59
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
G++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT R
Sbjct: 60 GRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRR 119
Query: 177 GFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
GF+++ GY G +DY+ H A + LD R E A YST++FT A+
Sbjct: 120 GFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAI 179
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
+I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++ +DE
Sbjct: 180 ALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDE 234
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG G
Sbjct: 235 AVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGF 290
Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+ SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 291 VASPLLKQKGVKNRELIHISDWLPTLVKLA 320
Score = 42.7 bits (99), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 68 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 103
>gi|194919176|ref|XP_001983034.1| GG19815 [Drosophila erecta]
gi|190647645|gb|EDV45033.1| GG19815 [Drosophila erecta]
Length = 565
Score = 321 bits (822), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 162/335 (48%), Positives = 219/335 (65%), Gaps = 6/335 (1%)
Query: 52 DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
D A++ P+II I+ADD+G++DV F G + TPNIDALAY G +L +Y +CTPSR
Sbjct: 19 DESAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSR 78
Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
A+++G++PIHTG QH V+ E L L+ ++P+ K GY T +VGKWHLGF + EY
Sbjct: 79 GALLSGRYPIHTGTQHFVISNKEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEY 138
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKM----WGLDMRRDLEPAWDLHGKYSTDVFTA 227
TPT RGF+ H GYW + DYF + +M + G D RR++E G Y TD+ TA
Sbjct: 139 TPTRRGFDYHFGYWGAYIDYFQRRS-KMPVANYSLGYDFRRNMELECRDRGAYVTDLLTA 197
Query: 228 EAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
EA +I +H+ E PLFL L+H A H+AN +PLQAP+ + +I+D R K+AAI+
Sbjct: 198 EAERLIKDHADKEQPLFLMLSHLAAHTANKDDPLQAPEEEIQKFAYIKDPNRRKYAAIIS 257
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
KLD+SVG+++ L L NSI++F SDNG + G N SN+PLRG KNT WEGGVR
Sbjct: 258 KLDQSVGRIMSTLAATDQLENSIVIFYSDNGAPSLGMFANTGSNFPLRGQKNTPWEGGVR 317
Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
AG IWS L++RG + Q ++V+DWLPTL AA
Sbjct: 318 VAGAIWSSKLQARGSIFSQPLYVADWLPTLSHAAG 352
Score = 69.7 bits (169), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 62/139 (44%), Gaps = 23/139 (16%)
Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM--------- 622
++DGID+W LS + P ILH +DD W+++AL G+WK V
Sbjct: 360 KLDGIDLWPELSGSADAPHVPREILHILDDIWRVAALQLGQWKYVNGTTAAGRYDSLLTY 419
Query: 623 -----------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
RY V + LS R + R A+++CG ++ C P
Sbjct: 420 RELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISITRRLAAVRCGDLQR-SCNP 478
Query: 672 QIAPCLFDIKNDPCEKNNL 690
+ CL+DI DPCE+NNL
Sbjct: 479 LLEECLYDISTDPCEQNNL 497
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 21/94 (22%), Positives = 47/94 (50%), Gaps = 3/94 (3%)
Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
TT GR++ + + ++ + ++ + + + + ++ + + + R
Sbjct: 407 TTAAGRYDSLLTYRELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISI--TRRL 464
Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 796
A+++CG ++ C P + CL+DI DPCE+NNL
Sbjct: 465 AAVRCGDLQR-SCNPLLEECLYDISTDPCEQNNL 497
>gi|195328473|ref|XP_002030939.1| GM24306 [Drosophila sechellia]
gi|194119882|gb|EDW41925.1| GM24306 [Drosophila sechellia]
Length = 554
Score = 321 bits (822), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 160/334 (47%), Positives = 218/334 (65%), Gaps = 6/334 (1%)
Query: 52 DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
D A++ P+II I+ADD+G++DV F G + TPNIDALAY G +L Y +CTPSR
Sbjct: 19 DQSAAARRPNIIIIMADDMGFDDVSFRGGREFVTPNIDALAYHGRLLDRLYAPAMCTPSR 78
Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
A+++G++PIHTG QH V+ E L L+ ++P+ KE GY T +VGKWHLGF + EY
Sbjct: 79 GALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEY 138
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKM----WGLDMRRDLEPAWDLHGKYSTDVFTA 227
TPT RGF+ H GYW + DYF + +M + G D RR+++ G Y TD+ T
Sbjct: 139 TPTRRGFDYHFGYWGAYIDYFQRRS-KMPVANYSLGYDFRRNMDLECRDRGVYVTDLLTT 197
Query: 228 EAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
EA +I +H+ E PLFL L+H A H+AN +PLQAP+ + +I+D R K+AA++
Sbjct: 198 EAERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMIS 257
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
KLD+SVG+++ L L NSI++F SDNG + G N SN+PLRG KNT WEGGVR
Sbjct: 258 KLDQSVGRIISTLSSTDQLDNSIVIFYSDNGAPSVGMFSNTGSNFPLRGQKNTPWEGGVR 317
Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
AG IWS L++RG + Q ++V+DWLPTL AA
Sbjct: 318 VAGAIWSSKLQARGSIFSQPLYVADWLPTLSRAA 351
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/140 (31%), Positives = 62/140 (44%), Gaps = 23/140 (16%)
Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKV---------- 621
++DGID+W LS + P ILH +DD W++SAL G+WK V
Sbjct: 360 KLDGIDLWPELSGSADAPHVPREILHILDDVWRLSALQMGQWKYVNGTTADGRYDSVLTY 419
Query: 622 ----------MRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
RY V + LS R + R A+++CG ++ C P
Sbjct: 420 RELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISLTRRLAAVRCGDLQR-SCNP 478
Query: 672 QIAPCLFDIKNDPCEKNNLA 691
+ CL+DI DPCE+NNL
Sbjct: 479 LLEECLYDISTDPCEQNNLV 498
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 21/95 (22%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
TT GR++ + + ++ + ++ + + + + ++ + + + R
Sbjct: 407 TTADGRYDSVLTYRELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISL--TRRL 464
Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
A+++CG ++ C P + CL+DI DPCE+NNL
Sbjct: 465 AAVRCGDLQR-SCNPLLEECLYDISTDPCEQNNLV 498
>gi|156547171|ref|XP_001603886.1| PREDICTED: arylsulfatase B [Nasonia vitripennis]
Length = 581
Score = 320 bits (821), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 163/380 (42%), Positives = 241/380 (63%), Gaps = 23/380 (6%)
Query: 40 LPLAFTLSMVFVDLVASSG------------PPHIIFILADDLGWNDVGFHGLDQIPTPN 87
LP F + + V +V G PPHI+ ILADD+GWNDV FHG ++IPTPN
Sbjct: 3 LPYKFMMILGGVAIVCFLGYGTLAWGKAIPLPPHIVIILADDMGWNDVSFHGANEIPTPN 62
Query: 88 IDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQ 147
IDALAY+G+IL YYT+ +CTPSRSA+MTG++PI GMQ + E G+PL+ ++P+
Sbjct: 63 IDALAYNGVILNKYYTMPICTPSRSALMTGRYPIRDGMQGTPMRPAEPRGIPLNVSLMPE 122
Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA---EEMKMWGL 204
++ LGY TR+VGKWHLG+ ++YTP RGF++ GY+ G Y+D+ + ++ G
Sbjct: 123 QMRRLGYETRLVGKWHLGYTTEDYTPVRRGFDTFFGYYNGFISYYDYWIGWNDTNEVTGY 182
Query: 205 DMRRDLEPAWDL--HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY--EPL 260
D+ RD +++L +Y TD+ T EA II N+ +PLFL ++H A H+ + +PL
Sbjct: 183 DLHRDESDSFELAHSSEYFTDLITDEAEKIIRNNKNAKPLFLEISHLAVHAGSKVHDDPL 242
Query: 261 QA--PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
+ D +IED++ K+A ++ LDESVG+VV+AL++ ML NSII+F+SDNG
Sbjct: 243 EVRRTDDVNASFPYIEDYQHRKYAGMMAALDESVGRVVKALKEAEMLENSIIIFMSDNGA 302
Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
G N SN+P+RG+K ++EG R A I+SPL+++ V+E+ +H+ DWLPTL +
Sbjct: 303 PTVGLYNNTGSNYPMRGIKGGMFEGAARAAACIFSPLIKAHSRVSEELMHIVDWLPTLYT 362
Query: 379 AANKSDIPNYVNSTVENIIP 398
AA + P + S + +P
Sbjct: 363 AAGGN--PMDLQSQFDGALP 380
Score = 44.3 bits (103), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 52/211 (24%), Positives = 73/211 (34%), Gaps = 55/211 (26%)
Query: 484 IPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL 543
+P L + GN D + +DG+ WS + PS R ++L NID+ A
Sbjct: 357 LPTLYTAAGGNPMDLQSQFDGAL--PLDGVSQWSSIVAGGPSSRQSLLVNIDEAQGFEAA 414
Query: 544 TRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEW 603
G+ KLVK + D Y N S N+PS
Sbjct: 415 IIGRHKLVKGMT----------KEDGYYGN-----------SGNDPS------------- 440
Query: 604 QISALTKGKWKLVKVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGP 663
+ V KV+ T G L+G + A LR + I C P
Sbjct: 441 ---------FPAYNVKKVLSS----TAGASIGKLAGFASPS--ARRALWLRQKSVITCKP 485
Query: 664 VKEVP-CEPQIAPCLFDIKNDPCEKNNLADR 693
C CLFD+ DPCE +L+ +
Sbjct: 486 FTSAANCS---GTCLFDLSKDPCETRDLSSK 513
>gi|363744029|ref|XP_003642960.1| PREDICTED: arylsulfatase B [Gallus gallus]
Length = 514
Score = 320 bits (821), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 153/334 (45%), Positives = 221/334 (66%), Gaps = 16/334 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
A+ PPH++ +LADDLGW DVG+HG I TP +DAL G+ L+ YYT LCTPSRS
Sbjct: 18 AAARPPPHLVLVLADDLGWGDVGWHG-SAIRTPRLDALGAGGVRLERYYTQPLCTPSRSQ 76
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+++G++ IHTG+QH +++ C+ LPL EK+LP+ LK+ GY T +VGKWHLG Y+KE P
Sbjct: 77 LLSGRYQIHTGLQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLP 136
Query: 174 TFRGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A+ + LD R E A YST++FT
Sbjct: 137 TRRGFDTYFGYLLGSEDYYSHDHCVLIKAKNVTRCALDFRDGEEVATGFKNMYSTNLFTE 196
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+D+I NH T++PLFLYLA + H EPL+ Y+ + I+D KR ++A ++
Sbjct: 197 RAIDLIANHKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRRRYAGMVSL 251
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG + +AL++ + +N+++VF +DNGG + +NWPLRG K TLWEGGVRG
Sbjct: 252 MDEAVGNLTDALKEYGLWNNTVLVFSTDNGGQ----TMAGGNNWPLRGRKWTLWEGGVRG 307
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
G + SPLL+ +G+ + + +H+SDWLPTL+ A
Sbjct: 308 VGFVASPLLKQKGVESHELIHISDWLPTLVHLAG 341
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ LPL EK+LP+ LK+ GY T ++
Sbjct: 88 LQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVG 123
>gi|307215080|gb|EFN89887.1| Arylsulfatase B [Harpegnathos saltator]
Length = 593
Score = 320 bits (821), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 160/355 (45%), Positives = 227/355 (63%), Gaps = 16/355 (4%)
Query: 31 RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
R+ ++ + L F ++ F D PHII + DDLGWNDV FHG +QIPTPNIDA
Sbjct: 8 RSNLLVLCCI-LLFGDALGFYDTA-----PHIIVFMVDDLGWNDVSFHGSNQIPTPNIDA 61
Query: 91 LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
L Y+GIIL +Y + TPSR+A TG +PI GMQ + G E GLPL+ +ILP+YL+
Sbjct: 62 LGYNGIILNRHYVLPSSTPSRAAFFTGLYPIRIGMQGEGIQGGEPRGLPLNIRILPEYLR 121
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
LGY T+++GKWH+G++ ++TP RGF++ LG++ H Y+D+ M G DM R
Sbjct: 122 GLGYTTKLIGKWHVGYHTPQHTPLHRGFDAFLGFYNSHVSYYDYRYSYQNMSGYDMHRGD 181
Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP-DHYLNI 269
PA+ L+ +Y+TD+FT EA+ II H PL+L ++H A H+ P+++P D + N
Sbjct: 182 NPAYGLNAEYATDLFTDEAMKIIQRHEPPRPLYLQISHLAVHA-----PIESPDDDHRNS 236
Query: 270 HR----HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
+R HI + R +A+++ +LDESVG++V L R ML +S+I+F++DNG G
Sbjct: 237 NRERFKHIPEVNRRNYASMVTRLDESVGRIVHTLGNRGMLRDSLILFLTDNGAPTIGKFR 296
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
N SN PLRG+K TL+EGGVRG ++WSP L V + VH++DWLPT SAA
Sbjct: 297 NWGSNLPLRGMKYTLYEGGVRGVAVLWSPRLRKAARVCNELVHITDWLPTFYSAA 351
>gi|195021979|ref|XP_001985494.1| GH14469 [Drosophila grimshawi]
gi|193898976|gb|EDV97842.1| GH14469 [Drosophila grimshawi]
Length = 560
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 155/329 (47%), Positives = 217/329 (65%), Gaps = 4/329 (1%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
+S P+II ++ADD+G++D+ G + TPNIDALAY G +L Y +CTPSR A++
Sbjct: 29 ASKLPNIIIVMADDMGFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGALL 88
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
+GK+PIHTG QH+V+ E GL L+ ++P+ ++ GY T +VGKWHLGF + EYTPT+
Sbjct: 89 SGKYPIHTGTQHSVILNEEPWGLALNATLMPEIFRDAGYSTNLVGKWHLGFVRPEYTPTY 148
Query: 176 RGFESHLGYWTGHQDYFDHSAE---EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
RGF+ H GYW G+ DY+ ++ + G D RR+++ G Y TD+ T EA I
Sbjct: 149 RGFDYHYGYWGGYIDYYQRRSQMPSDNYSMGYDFRRNMQVECTDRGVYVTDLLTNEAERI 208
Query: 233 IHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I + ++PLFL L+H A H+ N +PLQAP+ L HI D R +AA++ KLDES
Sbjct: 209 IREQPAKEQPLFLILSHLAPHTGNEIDPLQAPEEELQKFVHINDPNRRLYAAMVSKLDES 268
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
VG+V+ AL + L NSI++F SDNG + G N SNWPL+G K++ WEGG+R AG I
Sbjct: 269 VGRVITALARTDQLENSIVIFYSDNGAPSVGIFANTGSNWPLKGQKDSPWEGGLRVAGAI 328
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
WSPLL++RG + Q ++V D+LPTL AA
Sbjct: 329 WSPLLKARGNLFTQPIYVGDFLPTLAHAA 357
>gi|241598569|ref|XP_002404905.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215502397|gb|EEC11891.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 533
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 165/397 (41%), Positives = 234/397 (58%), Gaps = 30/397 (7%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
+ F L + + + AS PP+IIF+L DDLGW+DV FHG QIPTPN+D LA G+IL NY
Sbjct: 8 MVFLLIFLALGVSASKKPPNIIFMLVDDLGWDDVSFHGSPQIPTPNMDVLAGDGVILNNY 67
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y CTPSR+A+MTG +PIH G+Q V+ + GLPL K++P++ K++GY T ++GK
Sbjct: 68 YVQHFCTPSRAALMTGLYPIHNGLQDFVIDVAQPYGLPLYLKVMPEFFKDMGYETHMIGK 127
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
WHLG+++KEYTPT+RGF+S GY+ G +DY++HS ++ ++R+ +
Sbjct: 128 WHLGYFRKEYTPTYRGFDSFYGYYNGAEDYYNHSITKVISQSYNIRQGSVTKKRIENYIK 187
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
++ + +PLFLYLA+ + H PL+AP+ + +I + R+ F
Sbjct: 188 NTELVLLSLTFYLSILFSQPLFLYLAYQSVHG-----PLEAPEENIMKFPYIGEENRTIF 242
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
A + LD+SVG+V+E L + ML N+IIV +DNGG G N N+PLRG K TLW
Sbjct: 243 AGAVDALDQSVGEVLETLYEADMLDNTIIVLSTDNGGLPFGVQSNRGYNFPLRGAKGTLW 302
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN-------------------- 381
EGG RG+ IWSPLL V++Q +H++DWLPTL SAA
Sbjct: 303 EGGTRGSAFIWSPLLNQNRRVSDQMLHITDWLPTLYSAAGMRGNPVSLGKVDGYDMWTQL 362
Query: 382 KSDIPNYVNSTVENIIPRYENSILRYEN-----GTHE 413
++P+ + NI P NS LRY N GT+E
Sbjct: 363 SYNLPSSRYEVLYNIDPTDLNSALRYTNYKLVLGTYE 399
Score = 46.2 bits (108), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 75/207 (36%), Gaps = 62/207 (29%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSND 568
++DG D+W+ LS N PS R +L+NID SAL +KLV GT E+ D
Sbjct: 352 KVDGYDMWTQLSYNLPSSRYEVLYNIDPTDLNSALRYTNYKLVL-------GTYEDGEFD 404
Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDL 628
+ R R + + + K +V+K
Sbjct: 405 QRF--------------RTTGGSR---------PYGDLEIAMAQSKAARVLK-------- 433
Query: 629 TGGPDQVYLSGLSDREWLALAMRKLRDAASIQC----GPVKEVPCEPQIAPCLFDIKNDP 684
G +R L R R A++ C GP V P P LFD+ DP
Sbjct: 434 ----------GFYNRGNFLLP-RNWRQKATVNCVSNNGPKNFVSQAP---PYLFDLTKDP 479
Query: 685 CEKNNLADRSED------QRINHYTTE 705
CE NN+A QR+ Y +
Sbjct: 480 CELNNIASAEAGLVDFLKQRLQSYASS 506
>gi|390360370|ref|XP_791935.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 374
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 157/312 (50%), Positives = 204/312 (65%), Gaps = 1/312 (0%)
Query: 71 GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
GW+DV HG QI TPNID LA G+ L NYY +CTP+RSAIMTGKHPIHTGMQH+ +
Sbjct: 43 GWDDVSLHGSSQILTPNIDTLAQEGVTLTNYYVSPICTPTRSAIMTGKHPIHTGMQHDTI 102
Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
E GL L EK + Q+LK LGY T VGKWHLG++ ++Y PT RGF+S GY+ G D
Sbjct: 103 GADEPWGLGLDEKTMAQHLKSLGYSTHAVGKWHLGYFAEDYIPTRRGFDSFFGYYNGRGD 162
Query: 191 YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAA 250
Y+ H E G D+ ++ E W G+YST++FT EA II H+ +PLFLYLAH A
Sbjct: 163 YYTHEDTEGGFGGYDLHKNGEVHWPDFGQYSTEIFTEEAQQIIKTHNASQPLFLYLAHQA 222
Query: 251 THSANPYEPL-QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSI 309
H+ + L +AP Y + +I+ R FAA++ LD+SVG + + L+ + +NSI
Sbjct: 223 VHAGVYGKDLVEAPHKYYQMFPNIKTEGRRMFAAMVAGLDDSVGNISQTLQDAGLYNNSI 282
Query: 310 IVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHV 369
IVF +DNGG G++ N ASNWPLRG K+TLWEGGVRG + SPL+E +++ +HV
Sbjct: 283 IVFTTDNGGPTNGYDGNHASNWPLRGCKHTLWEGGVRGTAFVNSPLIEKPRRFSDRMMHV 342
Query: 370 SDWLPTLLSAAN 381
DWLPTL A
Sbjct: 343 CDWLPTLYGVAG 354
>gi|193641058|ref|XP_001942872.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 575
Score = 320 bits (820), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 170/364 (46%), Positives = 228/364 (62%), Gaps = 22/364 (6%)
Query: 32 TRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
+RI +L + L+ +V L PHI+ I+ADDLGWNDVGFHG QIPTPNIDAL
Sbjct: 6 SRITGIFILIVNLQLTD-YVLLRELKKRPHIVVIVADDLGWNDVGFHGSIQIPTPNIDAL 64
Query: 92 AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
AY+G IL +Y CTPSR+A++TGK+PI G+Q + + LP +EKILPQYLKE
Sbjct: 65 AYNGAILNRHYVQPTCTPSRAALLTGKYPIRYGLQGPPIASGKASALPTNEKILPQYLKE 124
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
LGY T +VGKWHLG Y+K +TPT RGF+SH GYW G+ Y + + M G+D RR E
Sbjct: 125 LGYSTHLVGKWHLGHYQKRFTPTKRGFDSHFGYWNGYISYRNSTHATRTMSGIDARRGFE 184
Query: 212 PAWDL--HGKYSTDVFTAEAVDIIHNHSTDEP-LFLYLAHAATHSANPYEPLQAPDHYLN 268
A + +Y+TDVFT EA II + + +FL ++H A HS N P+H
Sbjct: 185 RAGNEMDRDRYATDVFTEEARKIIESSKRENTEMFLMVSHLAVHSGN-----SGPNHLEV 239
Query: 269 IHR--------HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
+++ +IE+ R +A ++ LD+SVGK++E+L + ML NSI+VF+SDNG
Sbjct: 240 LNKTYNDEAFGYIENENRRLYAGMVTSLDDSVGKIIESLHEHEMLENSIVVFISDNGAPT 299
Query: 321 A----GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
G + N SNWPLRG K ++ EGGVRG IWSP L+ + V+ H++DWLPTL
Sbjct: 300 DDPLWGHD-NFGSNWPLRGKKASVLEGGVRGVAAIWSPWLKEKNTVSNNLFHITDWLPTL 358
Query: 377 LSAA 380
+AA
Sbjct: 359 YTAA 362
>gi|195591175|ref|XP_002085318.1| GD12374 [Drosophila simulans]
gi|194197327|gb|EDX10903.1| GD12374 [Drosophila simulans]
Length = 554
Score = 320 bits (819), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 160/336 (47%), Positives = 217/336 (64%), Gaps = 10/336 (2%)
Query: 52 DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
D A++ P+II I+ADD+G++DV F G + TPNIDALAY G +L Y +CTPSR
Sbjct: 19 DQSAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSR 78
Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
A+++G++PIHTG QH V+ E L L+ ++P+ KE GY T +VGKWHLGF + EY
Sbjct: 79 GALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEY 138
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKM------WGLDMRRDLEPAWDLHGKYSTDVF 225
TPT RGF+ H GYW + DYF + KM G D RR+++ G Y TD+
Sbjct: 139 TPTRRGFDYHFGYWGAYIDYFQRRS---KMPVANYSLGYDFRRNMDLECRDRGVYVTDLL 195
Query: 226 TAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAI 284
T EA +I +H+ E PLFL L+H A H+AN +PLQAP+ + +I+D R K+AA+
Sbjct: 196 TTEAERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDSNRRKYAAM 255
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
+ KLD+SVG+++ L L NSI++F SDNG + G N SN+PLRG KNT WEGG
Sbjct: 256 ISKLDQSVGRIISTLSSTDQLENSIVIFYSDNGAPSVGMFSNTGSNFPLRGQKNTPWEGG 315
Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
VR AG IW+ L++RG + Q ++V+DWLPTL AA
Sbjct: 316 VRVAGAIWNSKLQARGSIFSQPLYVADWLPTLSRAA 351
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/140 (31%), Positives = 62/140 (44%), Gaps = 23/140 (16%)
Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM--------- 622
++DGID+W LS + P ILH +DD W++SAL G+WK V
Sbjct: 360 KLDGIDLWPELSGTADAPHVPREILHILDDVWRLSALQMGQWKYVNGTTAAGRYDSVLTY 419
Query: 623 -----------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
RY V + LS R + R A+++CG ++ C P
Sbjct: 420 RELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISLTRRLAAVRCGDLQR-SCNP 478
Query: 672 QIAPCLFDIKNDPCEKNNLA 691
+ CL+DI DPCE+NNL
Sbjct: 479 LLEECLYDISTDPCEQNNLV 498
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 21/95 (22%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
TT GR++ + + ++ + ++ + + + + ++ + + + R
Sbjct: 407 TTAAGRYDSVLTYRELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISL--TRRL 464
Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
A+++CG ++ C P + CL+DI DPCE+NNL
Sbjct: 465 AAVRCGDLQR-SCNPLLEECLYDISTDPCEQNNLV 498
>gi|260794561|ref|XP_002592277.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
gi|229277493|gb|EEN48288.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
Length = 598
Score = 320 bits (819), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 161/363 (44%), Positives = 236/363 (65%), Gaps = 20/363 (5%)
Query: 31 RTRIMAFAVLPLAFTL---SMVFVDLV-ASSGPPHIIFILADDLGWNDVGFHGLDQIPTP 86
R R++A +V ++ +V+ D+ +SSG P+I+FILADD GWND+G+HG I TP
Sbjct: 89 RCRVVAVSVSYFQKSVRRQDVVYSDIQESSSGKPNIVFILADDYGWNDIGYHG-SVIRTP 147
Query: 87 NIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILP 146
N+D LA G+ L+NYY LC+PSR +MTG++ I G+QH++++ + GLPL E LP
Sbjct: 148 NLDRLAAEGVKLENYYVQPLCSPSRCQLMTGRYQIRYGLQHSLIWPPQPSGLPLDEVTLP 207
Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE-----EMKM 201
Q LKE GY T IVGKWHLGFYK++YTPT RGF++ GY TG +DY+ H + + +
Sbjct: 208 QRLKEGGYSTHIVGKWHLGFYKQDYTPTHRGFDTFYGYLTGAEDYWTHRQKGGLPGQPQT 267
Query: 202 W-GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPL 260
W GLD+R P D +G YST +F +A++II ++P+FL+L+ A H +PL
Sbjct: 268 WSGLDLRDQNRPVTDQNGTYSTHLFANKAIEIIAQQDKNKPMFLFLSFQAVH-----DPL 322
Query: 261 QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
QAP+ ++ + HI D R +AA+ +D++VG V AL+Q + N++++F +DNGG
Sbjct: 323 QAPEEDISRYSHISDTNRRVYAAMTTIMDQAVGNVTRALKQYGLWDNTVLIFSTDNGGRV 382
Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
NWPLRG K +LWEGGVRG G + SPL++++G ++ +H+SDW PTL+ A
Sbjct: 383 D----RGGINWPLRGWKGSLWEGGVRGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLA 438
Query: 381 NKS 383
+ S
Sbjct: 439 SGS 441
>gi|156537546|ref|XP_001607560.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 571
Score = 320 bits (819), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 160/350 (45%), Positives = 227/350 (64%), Gaps = 20/350 (5%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
+F L+ +F + G PHI+ I+ADD+GWNDVGFHG ++IPTPNIDALAY G+IL +
Sbjct: 11 FSFILTPIFGPIY---GQPHIVVIIADDMGWNDVGFHGSNEIPTPNIDALAYGGVILNRH 67
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y + CTPSR+A +TG+HPI G+Q + E G+PL E++LP+YL+ELGY TR+VGK
Sbjct: 68 YALPTCTPSRTAFLTGRHPIRMGLQGIPMNVAEPRGVPLHERLLPEYLRELGYVTRLVGK 127
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-- 219
WHLG+Y ++TPT RGF+S +GY+ G YF+H+ + K G+D WD GK
Sbjct: 128 WHLGYYTDKHTPTRRGFDSFVGYYGGVITYFNHTVTKDKHTGIDYH------WDTSGKIE 181
Query: 220 ------YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD--HYLNIHR 271
Y TD + +A +I NH +PLFL LAH A H++ +P++ + +
Sbjct: 182 PFDNDQYVTDFISDQAEAVIKNHDRKKPLFLQLAHVAAHASENRDPIEVRNMTEVNDTLS 241
Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA-GFNLNAASN 330
+I D R K+A ++ +D+SVG+VV+AL+ MLSNSII+F+SDNG A N SN
Sbjct: 242 YIPDINRRKYAGVVTAMDDSVGRVVKALKDANMLSNSIIIFMSDNGSPTAEAPYTNYGSN 301
Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+PLRG+K T++EGGVR ++SP L+ R V+++ H++DW PTL A
Sbjct: 302 YPLRGIKATVFEGGVRVPACVFSPRLKDRFRVSDELFHITDWFPTLYKLA 351
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 71/156 (45%), Gaps = 34/156 (21%)
Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQ-- 625
D S ++DG+D WS +S ++ S R ++L NID+ A G +KL++ + RY
Sbjct: 354 DLSKIQDLDGVDQWSSISGSQKSNRESLLVNIDEVSNPEAAISGYYKLIRGIN--RYDDY 411
Query: 626 ------------VDLTGGPDQVYLSGLSDREWLALA---------MRKLRDAASIQCGPV 664
D+TG LS L+ R +L + +LR+ A+++C
Sbjct: 412 YGKDGNDYSPKTYDVTG-----VLSSLAGRAIASLGNQYLPPQKRITELRNKATLRCEKK 466
Query: 665 KEVP-CEPQIAPCLFDIKNDPCEKNNLADRSEDQRI 699
+ P C CLFDI DPCE N+ D + I
Sbjct: 467 DDRPSCRDT---CLFDIVKDPCETTNIIDEHPEMEI 499
>gi|194748074|ref|XP_001956474.1| GF24576 [Drosophila ananassae]
gi|190623756|gb|EDV39280.1| GF24576 [Drosophila ananassae]
Length = 583
Score = 319 bits (818), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 166/355 (46%), Positives = 231/355 (65%), Gaps = 5/355 (1%)
Query: 38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
A++ L T +V + SG P+II I+ADDLG++DV F G + TPNIDALAYSG+I
Sbjct: 13 AIILLLTTGLVVVLAGTEISGKPNIIIIMADDLGFDDVSFRGSNNFLTPNIDALAYSGVI 72
Query: 98 LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
L N YT +CTPSR+A++TGK+PI+TGMQH V+ + GLPL+E + + GYRT
Sbjct: 73 LNNLYTAPMCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPLNETTMADIFRGNGYRTS 132
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK--MWGLDMRRDLEPAWD 215
++GKWHLG ++ YTPT RGF+ HLGY + DY++ S E++ G D R +L+P +
Sbjct: 133 LIGKWHLGMSQRNYTPTLRGFDYHLGYLGAYVDYYNQSYEQVSKGYRGHDFRENLKPNHE 192
Query: 216 LHGKYSTDVFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
KY TD+ T AV I +H+ +PLFL L H A H+AN +P+QAP L+ +
Sbjct: 193 HVDKYVTDILTDAAVREIDDHAAKNNSKPLFLLLNHLAPHAANDADPMQAPADELSGFEY 252
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
I D +AA++ +LD+SVG VV+AL ++ ML NSI++F+SDNGG G + ASN+P
Sbjct: 253 IRDETHRYYAAMVSRLDKSVGHVVDALARQDMLQNSILLFLSDNGGPTLGEHATTASNYP 312
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
LRG KN+ WEGG+R + IWS E G V +Q +++ D LPTL +AA S P+
Sbjct: 313 LRGQKNSPWEGGIRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAGISPDPD 367
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 53/162 (32%), Positives = 75/162 (46%), Gaps = 30/162 (18%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS--ALTKGKWKLVK-VVKVMRY------Q 625
+DG+++WS L S I+H ID+E A T+GKWK+V RY +
Sbjct: 370 LDGLNLWSALKYGYESVEREIVHVIDEEAPEPHLAYTRGKWKVVSGTTNSGRYDGWLGER 429
Query: 626 VDLTGGP-DQVYLSGLSDRE-WLALA--------MRKLRDAASIQCGPVKE---VPCEPQ 672
V P Q Y + + + WL L + LR A I+C E PC P
Sbjct: 430 VTTEVDPRSQEYEALIQNTSVWLQLQKVSGGERNISGLRQQARIECPATTEDEHTPCLPL 489
Query: 673 IAPCLFDIKNDPCEKNNLADRSED--------QRINHYTTEV 706
APCLFDI+ DPCE+NNL + ++ QR+ +T +
Sbjct: 490 EAPCLFDIEADPCEQNNLYEEYKNTTVFQDLWQRVQQFTAQA 531
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 39/77 (50%), Gaps = 11/77 (14%)
Query: 747 KKYSNEEEGMRKLRDAASIQCGPVKE---VPCEPQIAPCLFDIKNDPCEKNNLAD----- 798
+K S E + LR A I+C E PC P APCLFDI+ DPCE+NNL +
Sbjct: 455 QKVSGGERNISGLRQQARIECPATTEDEHTPCLPLEAPCLFDIEADPCEQNNLYEEYKNT 514
Query: 799 ---RSEVQRINHYTTEV 812
+ QR+ +T +
Sbjct: 515 TVFQDLWQRVQQFTAQA 531
>gi|91084737|ref|XP_970917.1| PREDICTED: similar to arylsulfatase B [Tribolium castaneum]
Length = 531
Score = 319 bits (817), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 158/351 (45%), Positives = 221/351 (62%), Gaps = 12/351 (3%)
Query: 42 LAFTLSMVFVDLVASSGP-----PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
+ L ++ ++ SS P P+II I+ADDLG+NDV FHG QIPTPN+ +A GI
Sbjct: 1 MVCVLGLLLYFILTSSKPSTAKKPNIIIIIADDLGYNDVSFHGSSQIPTPNLAKMATRGI 60
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
IL +YT CTPSR+A++TG++PI +GMQ L E LPL+ +P + + LGY+T
Sbjct: 61 ILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPLKAGENRSLPLNMPTMPLHFQNLGYKT 120
Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMK----MWGLDMRRDLE 211
+VGKWHLG KE TP +GF+SH GYW G YFD+ S +M + GLD+ E
Sbjct: 121 HLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVGYFDYVSFSKMDNGTLVKGLDLHDQFE 180
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
P W G+Y+T++FT ++D+I H PLFL ++H A H+ L PD H
Sbjct: 181 PVWGSQGRYATELFTERSLDVIEGHDVRVPLFLVVSHLAAHTGQNGSELGVPDVDQTNHE 240
Query: 272 --HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
+I+D +R +A ++ LD S+G+++ L++++ML NSI++F SDNG G N+ S
Sbjct: 241 FSYIQDPRRRLYAGVVSHLDASIGRIMAKLDEKQMLDNSIVLFFSDNGAQTVGMYENSGS 300
Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
NWPLRGVK + +EGGVR A I+SPL +G V+E +H+SDWLPTL SAA
Sbjct: 301 NWPLRGVKFSDFEGGVRVAATIYSPLFHKKGYVSEHLIHISDWLPTLYSAA 351
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 72/165 (43%), Gaps = 23/165 (13%)
Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV----KVMR 623
D ++ +IDGID W L+ N PS R IL NID+ + A+ + K+KL++ +
Sbjct: 354 DVAHLGQIDGIDQWDALTNNNPSNRTEILINIDEVDENFAIIRDKFKLIQGSYHEGTFDQ 413
Query: 624 YQVDLTGGPDQVYLS---GLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDI 680
Y D GP+ + +D W C + P CLFD+
Sbjct: 414 YYGDSGRGPENPTPNPNHTTTDLSW---------------CRAPDQTPILNCTKGCLFDL 458
Query: 681 KNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKK 725
DPCE N+ + SE + N ++ +F + P + ++ + K
Sbjct: 459 DKDPCETTNIIE-SEPEIANQLYEKIAQFWKELVPQRNKDTDPKS 502
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 32/50 (64%)
Query: 503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVK 552
D ++ +IDGID W L+ N PS R IL NID+ + A+ R K+KL++
Sbjct: 354 DVAHLGQIDGIDQWDALTNNNPSNRTEILINIDEVDENFAIIRDKFKLIQ 403
>gi|195166525|ref|XP_002024085.1| GL22750 [Drosophila persimilis]
gi|194107440|gb|EDW29483.1| GL22750 [Drosophila persimilis]
Length = 575
Score = 319 bits (817), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 159/327 (48%), Positives = 213/327 (65%), Gaps = 4/327 (1%)
Query: 58 GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTG 117
G P+II ILADD+G++DV F G + TPNIDALA+ G IL Y +CTPSR A+++G
Sbjct: 35 GRPNIIIILADDMGFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALLSG 94
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
++PIHTG QH V+ E GL L+ ++P+ ++ GY T ++GKWHLGF + EYTPT RG
Sbjct: 95 RYPIHTGTQHFVISNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTRRG 154
Query: 178 FESHLGYWTGHQDYFDHSAEEMKM---WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
F+ H GYW + DY+ ++ G D RR++E G Y TD+ T EA +I
Sbjct: 155 FDYHYGYWGAYIDYYQRRSKMPARNYSLGYDFRRNMELECRDRGVYVTDLLTNEAERVIR 214
Query: 235 NHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
E PLFL L+H ATH+AN +PLQAP+ + +I+D R K+AA++ KLD+SVG
Sbjct: 215 EREGQEQPLFLVLSHLATHTANEDDPLQAPEEEIRKFAYIKDPNRRKYAAMVSKLDQSVG 274
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
++V AL L NSI++F SDNG + G N SNWPLRG KNT WEGGVR AG IWS
Sbjct: 275 RIVSALNSTGQLENSIVIFYSDNGAPSVGMFANTGSNWPLRGQKNTPWEGGVRVAGAIWS 334
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA 380
L++RG + Q ++V+DWLP+L AA
Sbjct: 335 AQLQARGNIFTQPIYVADWLPSLAHAA 361
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 66/140 (47%), Gaps = 23/140 (16%)
Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV------------ 619
E+DGID+W L+ + P ILH +DDEW++++ G WK +
Sbjct: 370 ELDGIDLWPQLAGPVDAPHVPREILHILDDEWRVTSFQMGHWKYINGTTSAGQYDQVLTY 429
Query: 620 --------KVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
+ RY V + P L+ R + LR +A+++CG ++ C P
Sbjct: 430 RELDDLDPRESRYVVTVRNSPTSRALARHDIRRMTQQRVMTLRRSAAVRCGDLQR-GCNP 488
Query: 672 QIAPCLFDIKNDPCEKNNLA 691
+ C++DI +DPCEKNNL
Sbjct: 489 LVEECVYDIWDDPCEKNNLV 508
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/95 (23%), Positives = 51/95 (53%), Gaps = 3/95 (3%)
Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
TT G+++Q+ + ++ + ++ + + + + ++ + + + LR +
Sbjct: 417 TTSAGQYDQVLTYRELDDLDPRESRYVVTVRNSPTSRALARHDIRRMTQQR--VMTLRRS 474
Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
A+++CG ++ C P + C++DI +DPCEKNNL
Sbjct: 475 AAVRCGDLQR-GCNPLVEECVYDIWDDPCEKNNLV 508
>gi|326677480|ref|XP_003200848.1| PREDICTED: arylsulfatase B-like, partial [Danio rerio]
Length = 358
Score = 318 bits (816), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 153/336 (45%), Positives = 215/336 (63%), Gaps = 16/336 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PPH++ ILADDLGWNDVGFHG +I TP++D LA G+ L NYY LCTPSR+
Sbjct: 17 CVSARPPHVVLILADDLGWNDVGFHG-SEIKTPHLDRLAAQGVRLDNYYVQPLCTPSRNQ 75
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+MTG++ I TG+QH +++ C+ +PL EK+LPQ L+E GY T +VGKWHLG ++K+ P
Sbjct: 76 LMTGRYQIRTGLQHQIIWPCQPYCVPLDEKLLPQVLRERGYHTHMVGKWHLGMFQKDCLP 135
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+S GY TG +DY+ H + + LD+R A + G+YST++ T
Sbjct: 136 THRGFQSFFGYLTGSEDYYTHKRCSPIAPLNVTRCALDLRDGDAVALNYSGRYSTELLTE 195
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A II H+ D+PLFLY+A A H+ PLQ PDHY+ + I+D R ++A ++
Sbjct: 196 RATHIITQHTPDQPLFLYVALQAVHA-----PLQVPDHYIAPYSFIQDPHRRRYAGMVSA 250
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG + L++ + N++++F +DNGG L +NWPLRG K TLWEGGVRG
Sbjct: 251 MDEAVGNITHTLQETGLWDNTVLIFSTDNGGQ----TLYGGNNWPLRGRKWTLWEGGVRG 306
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+ PL+E G V + +H+SDWLPT+ A S
Sbjct: 307 VAFVSGPLIEKPGAVNRELIHISDWLPTIAGLAGAS 342
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ L+E GY T ++
Sbjct: 87 LQHQIIWPCQPYCVPLDEKLLPQVLRERGYHTHMVG 122
>gi|214010121|ref|NP_001135731.1| arylsulfatase B precursor [Felis catus]
gi|461542|sp|P33727.1|ARSB_FELCA RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
Flags: Precursor
gi|258856|gb|AAB23941.1| arylsulfatase B [Felis catus]
Length = 535
Score = 318 bits (815), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 154/331 (46%), Positives = 215/331 (64%), Gaps = 16/331 (4%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPH++F+LADDLGWNDV FHG I TP++D LA G++L NYYT LCTPSRS ++TG+
Sbjct: 46 PPHLVFVLADDLGWNDVSFHG-SNIRTPHLDELAAGGVLLDNYYTQPLCTPSRSQLLTGR 104
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT RGF
Sbjct: 105 YQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGF 164
Query: 179 ESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
+++ GY G +DY+ H + + LD R + A YST++FT A +
Sbjct: 165 DTYFGYLLGSEDYYSHERCALIDSLNVTRCALDFRDGEQVATGYKNMYSTNIFTERATAL 224
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I +H ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++ +DE+V
Sbjct: 225 ITSHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMDEAV 279
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGG+RG G +
Sbjct: 280 GNVTAALKSHGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGIRGVGFVA 335
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
SPLL+ +G+ + +H+SDWLPTL+ A S
Sbjct: 336 SPLLKQKGVKNRELIHISDWLPTLVKLARGS 366
Score = 42.7 bits (99), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146
>gi|241654408|ref|XP_002411325.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215503955|gb|EEC13449.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 510
Score = 318 bits (814), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 156/322 (48%), Positives = 203/322 (63%), Gaps = 6/322 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+FIL DDLGW DV FHG QIPTPNID LA G+IL NYY + CTPSR+A+MTG +
Sbjct: 3 PNIVFILGDDLGWGDVSFHGSTQIPTPNIDVLAGDGVILNNYYVLPTCTPSRAALMTGLY 62
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTGMQ +++ GLPL KILPQ+ ++LGY ++GKWHLGF+K Y P RGF+
Sbjct: 63 PIHTGMQSDIIEPAAPWGLPLENKILPQHFRDLGYDVNMIGKWHLGFFKTPYVPIKRGFD 122
Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+ G++TG DY++H S E G+ + L AW + + T +++
Sbjct: 123 TFFGFYTGSNDYYNHTSGSENNEQGVSLVATLTAAWGRIAQGVEPLLTDSPLNVY----- 177
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
+P F Y +H A HSA EP QAP + +I + R+ +A +L LDESVG+VVEA
Sbjct: 178 PQPFFCYFSHHAVHSALMAEPFQAPARNVLKFPYIGESNRTIYAGMLDALDESVGRVVEA 237
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L+ ML ++IIVF SDNGGA G N NWPLRG K TLWEG VR +WSP
Sbjct: 238 LDNAGMLEDTIIVFSSDNGGAPCGEASNQGFNWPLRGAKFTLWEGSVRVPAFVWSPKFLK 297
Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
+ V+ Q +H+SDWLPTL +AA
Sbjct: 298 QSRVSNQLMHISDWLPTLYTAA 319
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/181 (25%), Positives = 77/181 (42%), Gaps = 31/181 (17%)
Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVD 627
D S IDG+D+W L+ + SKR IL NID + IS L ++K K+V + +
Sbjct: 322 DVSALGVIDGVDMWHSLTHSSASKRKEILLNIDASFNISGL---RYKQYKLVVGGGFDGE 378
Query: 628 LTGGPDQVYLSG----------LSDREWLALAMRKLRDAASIQCGPVK-----EVPCE-- 670
L D Y +G L + +A ++K + P+ V C
Sbjct: 379 LN---DHYYFTGGTRPTNNIDELRNESTMARVLKKFYEKQRRSWSPLTWSENVAVDCRRD 435
Query: 671 -------PQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEK 723
P P LF+I+ DPCE NNLA +++ + ++ +N P + ++
Sbjct: 436 RLTENFVPMQPPYLFNIEQDPCELNNLA-KTKVHTLRFLMKKLDAYNSTVVPSLFQPDDP 494
Query: 724 K 724
+
Sbjct: 495 R 495
>gi|449514410|ref|XP_002188440.2| PREDICTED: arylsulfatase B, partial [Taeniopygia guttata]
Length = 491
Score = 317 bits (813), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 153/328 (46%), Positives = 216/328 (65%), Gaps = 16/328 (4%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PH++ +LADDLGW DVG+HG I TP +DAL G+ L+ YYT LCTPSRS +++G++
Sbjct: 1 PHLVLVLADDLGWGDVGWHG-SAIRTPRLDALGAGGVRLERYYTQPLCTPSRSQLLSGRY 59
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH +++ C+ LPL EK+LP+ L+E GY T +VGKWHLG Y+KE PT RGF+
Sbjct: 60 QIHTGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMYRKECLPTHRGFD 119
Query: 180 SHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
++ GY G +DY+ H A+ + LD R E A YST++FT A+D+I
Sbjct: 120 TYFGYLLGSEDYYTHDRCVFIKAKNVTRCALDFRDGEEVATGFKNVYSTNLFTERAIDVI 179
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
NH T++PLFLYLA + H EPL+ P+ Y+ + I+D KR +A ++ +DE+VG
Sbjct: 180 ANHKTEKPLFLYLAFQSVH-----EPLEVPEKYVKPYSSIKDVKRRHYAGMVTLMDEAVG 234
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
+ +AL+ + N++ VF +DNGG L +NWPLRG K TLWEGGVRG G + S
Sbjct: 235 NLTDALKTYGLWDNTVFVFSTDNGGQ----TLAGGNNWPLRGRKWTLWEGGVRGVGFVAS 290
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
PLL+ +G+ + + +H+SDWLPTL+ A
Sbjct: 291 PLLKRKGVESHELIHISDWLPTLVHLAG 318
Score = 40.8 bits (94), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ LPL EK+LP+ L+E GY T ++
Sbjct: 65 LQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVG 100
>gi|195021983|ref|XP_001985495.1| GH14468 [Drosophila grimshawi]
gi|193898977|gb|EDV97843.1| GH14468 [Drosophila grimshawi]
Length = 619
Score = 317 bits (812), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 156/326 (47%), Positives = 215/326 (65%), Gaps = 6/326 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II ++ADD+G++D+ G + TPNIDALAY G +L Y +CTPSR A+++GK+
Sbjct: 33 PNIIIVMADDMGFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGALLSGKY 92
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTG QH V+ E GL L+ ++P+ +E GY T +VGKWHLGF + EYTPT RGF+
Sbjct: 93 PIHTGTQHYVIGNEEPWGLALNTTLMPEIFREAGYSTNLVGKWHLGFSRPEYTPTHRGFD 152
Query: 180 SHLGYWTGHQDYFDHSAEEMKMW----GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
+H GYW + DY+ + +M + G D RR+++ G Y TD+ T EA II
Sbjct: 153 NHYGYWGAYIDYYQRRS-QMPLGNYSVGYDFRRNMQVECTDRGVYVTDLLTNEAERIIRE 211
Query: 236 H-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
+ ++PLFL L+H A H+ N +PLQAP+ L HI+D R +AA++ KLDESVG+
Sbjct: 212 QPAKEQPLFLILSHLAPHTGNTNKPLQAPEEELRKFTHIKDPNRRLYAAMVSKLDESVGR 271
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V+ AL + L NSI++F SDNG + G N SNWPL+G KN+ WEGG+R AG IWSP
Sbjct: 272 VITALARTDQLENSIVIFYSDNGAPSVGMFSNTGSNWPLKGQKNSPWEGGLRVAGAIWSP 331
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
LL++RG + Q ++V D+LPTL AA
Sbjct: 332 LLKARGNLFTQPIYVGDFLPTLAHAA 357
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 64/139 (46%), Gaps = 23/139 (16%)
Query: 575 IDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV------------- 619
+DGID+W LS ++ P ILH +DD W++ +L +G++K V
Sbjct: 367 LDGIDLWPQLSGPKDAPHVPREILHQLDDVWRVQSLFQGQYKYVNGTTLAGQFDQVLTYR 426
Query: 620 -------KVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQ 672
+ RY V++ P L+ + + + +A++ C ++ C
Sbjct: 427 ELDDLDPRASRYVVEVRNSPASRALARYDLQRITQARINTISRSAAVICSDMQR-GCNAL 485
Query: 673 IAPCLFDIKNDPCEKNNLA 691
+ CL+D+ DPCE+NNLA
Sbjct: 486 LEECLYDLSTDPCEQNNLA 504
>gi|195436072|ref|XP_002066002.1| GK11604 [Drosophila willistoni]
gi|194162087|gb|EDW76988.1| GK11604 [Drosophila willistoni]
Length = 567
Score = 316 bits (809), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 159/331 (48%), Positives = 214/331 (64%), Gaps = 5/331 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II ILADD+G++DV F G + TPNIDALAY G IL N Y +CTPSR A+++G++
Sbjct: 28 PNIIIILADDMGFDDVSFRGGREFLTPNIDALAYHGRILDNLYAPAMCTPSRGALLSGRY 87
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P HTG QH V+ E GL L+ ++P+ KE GY T ++GKWHLGF EYTPT RGF+
Sbjct: 88 PAHTGTQHFVISNEEPWGLTLNATLMPEIFKEAGYSTNLIGKWHLGFASPEYTPTRRGFD 147
Query: 180 SHLGYWTGHQDYFDHSAEEMKM---WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN- 235
H GYW + DY+ ++ G D RR+++ G Y TD+ T EA +I
Sbjct: 148 YHYGYWGAYIDYYQRRSQMPVANYSMGYDFRRNMDLECQNRGVYITDLLTQEAEHVIREK 207
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
+ +E FL L+H ATH+AN +PLQAP+ + +I+D +R K+AA++ KLD+SVG++
Sbjct: 208 AAANETFFLMLSHLATHTANDNDPLQAPEEEIRKFAYIKDPRRRKYAAMVAKLDQSVGRI 267
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
+ L Q L NSI++F SDNG + G N SNWPLRG KNT WEGG+R AG IWS
Sbjct: 268 ITTLAQTEQLENSIVIFYSDNGAPSVGQFANTGSNWPLRGQKNTPWEGGIRVAGAIWSTQ 327
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
L++RG Q ++V+DWLPTL AA D+P
Sbjct: 328 LQARGNKFTQPIYVADWLPTLAHAAGI-DLP 357
Score = 69.3 bits (168), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 64/139 (46%), Gaps = 23/139 (16%)
Query: 574 EIDGIDVWSVL--SRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM--------- 622
++DG+D+W L S++ ILH +DD W +S++ G WK +
Sbjct: 362 QLDGLDLWPQLAGSKDAAHVPREILHALDDVWHVSSMLLGHWKYINGTTASGTYDQVLTY 421
Query: 623 -----------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
RY V++ P L+ + + LR +AS++CG ++ C P
Sbjct: 422 RELDDLDPRNSRYVVEVRNSPASRALAHFDLQRLGQQKITSLRHSASVRCGNLQR-GCNP 480
Query: 672 QIAPCLFDIKNDPCEKNNL 690
+ CL+DI DPCE+NNL
Sbjct: 481 LVEECLYDIAADPCEQNNL 499
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 23/94 (24%), Positives = 48/94 (51%), Gaps = 3/94 (3%)
Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
TT G ++Q+ + ++ + + + + + + ++ ++ + LR +
Sbjct: 409 TTASGTYDQVLTYRELDDLDPRNSRYVVEVRNSPASRALAHFDLQRLGQQK--ITSLRHS 466
Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 796
AS++CG ++ C P + CL+DI DPCE+NNL
Sbjct: 467 ASVRCGNLQR-GCNPLVEECLYDIAADPCEQNNL 499
>gi|194871676|ref|XP_001972885.1| GG13639 [Drosophila erecta]
gi|190654668|gb|EDV51911.1| GG13639 [Drosophila erecta]
Length = 584
Score = 315 bits (808), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 165/371 (44%), Positives = 237/371 (63%), Gaps = 10/371 (2%)
Query: 21 LPQYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGL 80
+ +L + T ++A VL +A + ++A+S P+II I+ADDLG++DV F G
Sbjct: 1 MSTHLDKFSSATWLLAGFVLCIALSNG-----IIAASDRPNIIIIMADDLGFDDVSFRGS 55
Query: 81 DQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL 140
+ TPNIDALAYSG+IL N Y +CTPSR+A++TGK+PI+TGMQH V+ + GLPL
Sbjct: 56 NNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPL 115
Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
+E + + +E GYRT ++GKWHLGF ++ +TPT RGF+ HLGY + DY+ S E+
Sbjct: 116 NETTMAEIFRENGYRTSLLGKWHLGFSQRNFTPTQRGFDRHLGYLGAYVDYYTQSYEQQS 175
Query: 201 MW--GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS---TDEPLFLYLAHAATHSAN 255
G D R +L+ + D G Y TDV T AV I +H+ + +PLFL L H A H+AN
Sbjct: 176 KGYNGHDFRDNLKSSHDQVGHYITDVLTDAAVKEIEDHASKNSSQPLFLLLNHLAPHAAN 235
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
P+QAP ++ +I + +AA++ KLD+SVG V++AL ++ ML NSI++F+SD
Sbjct: 236 DDNPMQAPAEEVSRFEYIRNKTHRYYAAMVSKLDQSVGSVIDALARQEMLQNSIVLFLSD 295
Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
NGG G + ASN+PLRG K++ WEG +R + IWS E G V +Q +++ D LPT
Sbjct: 296 NGGPTQGQHSTTASNYPLRGQKDSPWEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPT 355
Query: 376 LLSAANKSDIP 386
L +AA S P
Sbjct: 356 LAAAAGISPDP 366
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 69/149 (46%), Gaps = 45/149 (30%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL--TKGKWKLVKVVKVMRYQVDLTGGP 632
+DG+++WS L S I+H ID++ + L T+GKWK++ +G
Sbjct: 370 LDGLNLWSALKYGYESVEREIVHVIDEDVAVPHLSYTRGKWKVI------------SGTT 417
Query: 633 DQ-VYLSGLSDRE--------------------WLALA--------MRKLRDAASIQC-G 662
+Q +Y L RE W L + +LRD ASI+C
Sbjct: 418 NQGLYDGWLGHRETSEVDPRAVEYEELVRNTSVWQQLQQVTSGERNISELRDQASIECPN 477
Query: 663 PVKEV-PCEPQIAPCLFDIKNDPCEKNNL 690
P V PC P APCLFDI+ DPCE++NL
Sbjct: 478 PATGVRPCLPLEAPCLFDIEADPCERSNL 506
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 56/121 (46%), Gaps = 10/121 (8%)
Query: 707 GRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
G NQ Y E + + + ++ + ++ ++ ++ E + +LRD ASI+
Sbjct: 415 GTTNQGLYDGWLGHRETSEVDPRAVEYEELVRNTSVWQQLQQVTSGERNISELRDQASIE 474
Query: 767 C-GPVKEV-PCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQI 824
C P V PC P APCLFDI+ DPCE++NL Y +LD R Q
Sbjct: 475 CPNPATGVRPCLPLEAPCLFDIEADPCERSNL--------YAEYKNSTTFLDLWARIQQF 526
Query: 825 A 825
A
Sbjct: 527 A 527
>gi|91084739|ref|XP_970972.1| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
gi|270008608|gb|EFA05056.1| hypothetical protein TcasGA2_TC015151 [Tribolium castaneum]
Length = 558
Score = 315 bits (808), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 154/325 (47%), Positives = 207/325 (63%), Gaps = 5/325 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHII I+ DD+G ND+G +QIPTPNIDAL Y+G++L YY CTPSR+A +TG +
Sbjct: 27 PHIIVIMGDDMGHNDIGLR-TNQIPTPNIDALGYNGVVLDRYYVQNACTPSRAAFLTGNY 85
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PI + MQ + E LPL+ ++PQ+LK LGYRT IVGKWHLG + TPT +GF+
Sbjct: 86 PIRSAMQGLPIVAGENRSLPLNMPLMPQHLKNLGYRTHIVGKWHLGSAYRSSTPTEKGFD 145
Query: 180 SHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
SH GYW G Y+D+ + + G D+ E G+Y+T VFT A+DII H+
Sbjct: 146 SHFGYWNGFTGYYDYFTDFNSTAIEGFDLHDRFETERGYQGQYATRVFTERALDIIEGHN 205
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHY--LNIHRHIEDFKRSKFAAILHKLDESVGKV 295
T PLFL + H A H+ L P+ + +I+D +R +A I+ +LD S+G+V
Sbjct: 206 TTRPLFLLMTHLAAHAGRDGTELGVPNEVEAQRTYSYIQDPRRRLYAEIVAELDRSIGQV 265
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
V L +R+ML NSII+F SDNG G N+ SNWPLRG+K T +EGG+RG I+SPL
Sbjct: 266 VRKLSERQMLENSIILFFSDNGAPTVGPYTNSGSNWPLRGIKLTNFEGGIRGTATIFSPL 325
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
L+ RG V ++ +HVSDWLPT +AA
Sbjct: 326 LKKRGYVNKELIHVSDWLPTFYAAA 350
>gi|126317548|ref|XP_001381590.1| PREDICTED: arylsulfatase B [Monodelphis domestica]
Length = 522
Score = 315 bits (808), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 153/332 (46%), Positives = 220/332 (66%), Gaps = 16/332 (4%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
++G PHI+F+LADDLGWNDVG+H I TP++DAL+ G+ L+NYYT LCTPSRS ++
Sbjct: 30 ATGQPHIVFVLADDLGWNDVGYHD-SNIFTPHLDALSAQGVRLENYYTQPLCTPSRSQLL 88
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TG++ IHTG+QH +++ C+ +PL EK+LP+ L+E GY T +VGKWHLG ++KE PT
Sbjct: 89 TGRYQIHTGLQHQIIWPCQPSCIPLDEKLLPELLREAGYVTHMVGKWHLGMFRKECLPTR 148
Query: 176 RGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
RGF++ GY G +DY+ H A ++ LD R + A YST+VFT A
Sbjct: 149 RGFDTFFGYLLGSEDYYTHKRCVHIDALKVTRCALDFRDGEDIAAGYENMYSTNVFTERA 208
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
V++I NH +PLFLYLA + H EPLQ P+ YL + I++ R +A ++ +D
Sbjct: 209 VNLIANHPAQKPLFLYLALQSVH-----EPLQVPEEYLQPYDFIQNKNRQHYAGMVTLMD 263
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
E+VG V AL++ + +N++ +F +DNGG L +NWPLRG K TLWEGG+RG G
Sbjct: 264 EAVGNVTRALQKYGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWTLWEGGIRGVG 319
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ SPLL+ + + + + +H+SDWLPTL++ A
Sbjct: 320 FVSSPLLKRKRVKSSELIHISDWLPTLVNLAG 351
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 14/36 (38%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LP+ L+E GY T ++
Sbjct: 98 LQHQIIWPCQPSCIPLDEKLLPELLREAGYVTHMVG 133
>gi|195166561|ref|XP_002024103.1| GL22855 [Drosophila persimilis]
gi|194107458|gb|EDW29501.1| GL22855 [Drosophila persimilis]
Length = 559
Score = 315 bits (807), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 166/398 (41%), Positives = 231/398 (58%), Gaps = 40/398 (10%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
+S P+II IL DD+G+NDV FHG +QI TPNIDALAY+GI+L +Y LCTPSR+ ++
Sbjct: 24 ASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSRATLL 83
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TGK+PIHTGMQH V+ E GLP E+++P+ ++ GY T +VGKWHLGF++K+ TPT
Sbjct: 84 TGKYPIHTGMQHFVIVTDEPWGLPRQERLMPELFRDAGYATHLVGKWHLGFWRKDLTPTM 143
Query: 176 RGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
RGF+ H GY+ G+ DY+D + + GLD RRDLEP + G Y+T+ FT EA +
Sbjct: 144 RGFDHHFGYYNGYMDYYDQTVRMLDRNYSTGLDFRRDLEPCREAEGTYATEAFTTEARKV 203
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I H PLF+ L+H A H+ N P+QAP+ + HI D KR +A ++ LD+SV
Sbjct: 204 IERHDKSRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHIRDPKRRTYAGMISSLDKSV 263
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G+ + AL ML+NSI++ SDNG G + NA SN+P RG
Sbjct: 264 GQTMRALADNGMLNNSIVLLYSDNGAPTVGIHSNAGSNYPFRG----------------- 306
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS-------IL 405
RG V+ Q +H DWLPTL +AA S +P+ + N+ P S +L
Sbjct: 307 -----QRGYVSNQAIHAIDWLPTLAAAAGVS-LPSDLRLDGLNLWPSLSASAQPQPRNLL 360
Query: 406 RYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYEN 443
+ Y+S +Y NG+ +E RY++
Sbjct: 361 HVLDDVFGYSS--YTQDTLKYVNGS-----SFEGRYDH 391
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 51/155 (32%)
Query: 575 IDGIDVWSVLSRN-EPSKRNTILHNIDDEWQISALTKGKWKLV----------------- 616
+DG+++W LS + +P RN +LH +DD + S+ T+ K V
Sbjct: 339 LDGLNLWPSLSASAQPQPRN-LLHVLDDVFGYSSYTQDTLKYVNGSSFEGRYDHWLGELE 397
Query: 617 ------------KVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPV 664
++V+ Q L ++ DR +R++RD A+ +C P+
Sbjct: 398 DGEEDPNSKDYGQLVRASEVQAVLGS------VAPTDDR------IRQMRDEATQRC-PI 444
Query: 665 KE-------VPCEPQIAPCLFDIKNDPCEKNNLAD 692
+E CEP A CLFD+ NDPCE+ NLA+
Sbjct: 445 QERDPEDPVYLCEPLKAACLFDLANDPCERYNLAN 479
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 47/91 (51%), Gaps = 8/91 (8%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE------- 772
E E ++ K + + + + + ++ +R++RD A+ +C P++E
Sbjct: 395 ELEDGEEDPNSKDYGQLVRASEVQAVLGSVAPTDDRIRQMRDEATQRC-PIQERDPEDPV 453
Query: 773 VPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
CEP A CLFD+ NDPCE+ NLA+ +Q
Sbjct: 454 YLCEPLKAACLFDLANDPCERYNLANLYPLQ 484
>gi|260794113|ref|XP_002592054.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
gi|229277268|gb|EEN48065.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
Length = 478
Score = 315 bits (806), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 153/330 (46%), Positives = 219/330 (66%), Gaps = 17/330 (5%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+FILADD GWND+G+HG I TPN+D LA G+ L+NYY +C+PSR +MTG++
Sbjct: 3 PHIVFILADDYGWNDIGYHG-SFIKTPNLDRLASEGVKLENYYVQPICSPSREQLMTGRY 61
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IH G+QH+V+ GLPL E LPQ LKE+GY T +VGKWHLGF+++EY P RGF+
Sbjct: 62 QIHYGLQHSVITHDRPHGLPLDEVTLPQKLKEIGYSTHLVGKWHLGFFRQEYLPLRRGFD 121
Query: 180 SHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
+ G+ TG +DY+ H S + + GLD+R EP D +G YST +F +A+DII
Sbjct: 122 TFYGFLTGGEDYWSHRRPNVYSTDASEYHGLDLRDQDEPVLDQNGTYSTHLFQRKAIDII 181
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
+H ++P+FLYL+ A H+ PLQAP+ Y+ ++ +++ K+AA++ +DE+VG
Sbjct: 182 AHHDRNKPMFLYLSFQAVHA-----PLQAPEGYIEKYKSVQNILIRKYAAMVTAMDEAVG 236
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
V +AL+ + N++++F +DNG + A SNWPLRG KNTLWEGG+RG + S
Sbjct: 237 NVTDALKGSGLWDNTVLIFSTDNGARRS-----AGSNWPLRGWKNTLWEGGIRGVCFVTS 291
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
LLE +G ++ +H+SDW PTL+ A S
Sbjct: 292 NLLERKGTKSDALIHISDWFPTLIHLAQGS 321
>gi|194748096|ref|XP_001956485.1| GF25237 [Drosophila ananassae]
gi|190623767|gb|EDV39291.1| GF25237 [Drosophila ananassae]
Length = 570
Score = 315 bits (806), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 160/354 (45%), Positives = 219/354 (61%), Gaps = 16/354 (4%)
Query: 44 FTLSMVFVDLVASSGP------------PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
++LS F+ L+ G P+II I+ADD+G++DV F G + TPNIDAL
Sbjct: 2 WSLSATFLLLLCLRGSKANGAEMEATRRPNIIIIMADDMGFDDVSFRGGREFITPNIDAL 61
Query: 92 AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
AY G +L Y +CTPSR A+++G++PIHTG QH V+ E L + ++P+ +
Sbjct: 62 AYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVISNEEPWALDSNATLMPEIFQR 121
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKM---WGLDMRR 208
GY T +VGKWHLGF + EYTPT RGF+ H GYW + DY+ ++ G D RR
Sbjct: 122 AGYSTNLVGKWHLGFSRPEYTPTHRGFDYHYGYWGAYIDYYQRRSKMPVANYSLGYDFRR 181
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
++E G Y TD+ T EA +I + D+PLFL L+H ATH+AN +PLQAP+ +
Sbjct: 182 NMELECRDRGTYVTDLLTTEAERLIKEQAGKDKPLFLMLSHLATHTANEDDPLQAPEEEI 241
Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
+I+D R K+AA++ KLD+SVG+++ L L NSI++F SDNG + G N
Sbjct: 242 RKFSYIKDPNRRKYAAMVSKLDQSVGRIISTLASTDQLENSIVIFYSDNGAPSVGMFANT 301
Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
SNWPLRG KNT WEGGVR AG +WS LE+RG + Q ++V DWLPTL AA+
Sbjct: 302 GSNWPLRGQKNTPWEGGVRVAGAVWSSKLEARGSIFTQPIYVGDWLPTLAHAAD 355
Score = 77.0 bits (188), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 64/139 (46%), Gaps = 23/139 (16%)
Query: 575 IDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV------------- 619
+DGID+W LS + P ILH++DD W++ +L G WK V
Sbjct: 366 LDGIDLWPQLSGAADAPHVPREILHSLDDVWKVGSLLMGHWKYVNGTTSAGQYDSVLTYR 425
Query: 620 -------KVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQ 672
+ RY V + P L+ L R + LR +A++ CG + C P
Sbjct: 426 EFDDIDPRESRYAVTIRNSPASRALTRLDLRRLTQQRIAGLRRSAAVHCGDFQR-SCNPL 484
Query: 673 IAPCLFDIKNDPCEKNNLA 691
+ CLFDI DPCE+NNLA
Sbjct: 485 LEDCLFDISADPCEQNNLA 503
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/95 (24%), Positives = 48/95 (50%), Gaps = 3/95 (3%)
Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
TT G+++ + + ++ + ++ + + + + ++ + + + LR +
Sbjct: 412 TTSAGQYDSVLTYREFDDIDPRESRYAVTIRNSPASRALTRLDLRRLTQQR--IAGLRRS 469
Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
A++ CG + C P + CLFDI DPCE+NNLA
Sbjct: 470 AAVHCGDFQR-SCNPLLEDCLFDISADPCEQNNLA 503
>gi|195441668|ref|XP_002068625.1| GK20324 [Drosophila willistoni]
gi|194164710|gb|EDW79611.1| GK20324 [Drosophila willistoni]
Length = 525
Score = 313 bits (803), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 158/326 (48%), Positives = 222/326 (68%), Gaps = 4/326 (1%)
Query: 66 LADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGM 125
+ADDLG++DV F G + TPNIDALAYSG+IL N YT +CTPSRSA++TGK+PI TGM
Sbjct: 1 MADDLGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTPAMCTPSRSALLTGKYPISTGM 60
Query: 126 QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYW 185
QH V+ + GLPL+E + + ++ GY T ++GKWHLG ++ +TPT RGF++HLGY
Sbjct: 61 QHYVIVNDQPWGLPLNETTMAEIFQQNGYYTSLLGKWHLGMSQRNFTPTKRGFDTHLGYL 120
Query: 186 TGHQDYFDHS---AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS-TDEP 241
+ DY+D + + + G D R +LE + D G Y TD+ + AV++I H+ T +P
Sbjct: 121 GAYIDYYDQTYLQSSQNYSRGHDFRDNLEASHDKVGHYVTDILSDAAVELIEKHNVTAKP 180
Query: 242 LFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQ 301
LFL L+H A H+AN +PLQAP L+ +I++ +AA++ +LD+SVG+V+EAL
Sbjct: 181 LFLLLSHLAPHAANDNDPLQAPMEELSQFEYIQNKSHRYYAAMVSRLDKSVGRVIEALAN 240
Query: 302 RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGI 361
R ML NSI++F+SDNGG + G + N ASN+PLRG K+T WEGG+R + IWS E G
Sbjct: 241 REMLRNSIVLFLSDNGGPSVGEHANTASNYPLRGQKHTPWEGGIRSSAAIWSTEFERLGS 300
Query: 362 VAEQYVHVSDWLPTLLSAANKSDIPN 387
V +Q +++ D LPTL SAA S P+
Sbjct: 301 VWKQRIYIGDLLPTLCSAAGISLDPS 326
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 64/148 (43%), Gaps = 23/148 (15%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID--DEWQISALTKGKWKLVKVV------------- 619
+DG+++WS L S I+H+ID D A +GKWK++
Sbjct: 329 LDGLNLWSALKYGYESVEREIVHSIDAVDPQPHLAYMRGKWKIINGTTNSGQYDGWLGQR 388
Query: 620 -------KVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQC-GPVKEVPCEP 671
+ + Y + L L+D+ + +LR A I C P PC P
Sbjct: 389 ITDEVDPRSVHYDQLIRNTTVWQQLQLLADKPQQERNITELRQQAQILCPTPANAKPCMP 448
Query: 672 QIAPCLFDIKNDPCEKNNLADRSEDQRI 699
IAPC FDI+ DPCE+NNL D + I
Sbjct: 449 LIAPCFFDIEVDPCEQNNLYDEYRNTSI 476
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 39/75 (52%), Gaps = 9/75 (12%)
Query: 752 EEEGMRKLRDAASIQC-GPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTT 810
+E + +LR A I C P PC P IAPC FDI+ DPCE+NNL D Y
Sbjct: 422 QERNITELRQQAQILCPTPANAKPCMPLIAPCFFDIEVDPCEQNNLYDE--------YRN 473
Query: 811 EVGYLDPKQRFNQIA 825
+LD +R +Q A
Sbjct: 474 TSIFLDLWERIDQYA 488
>gi|270008947|gb|EFA05395.1| hypothetical protein TcasGA2_TC015567 [Tribolium castaneum]
Length = 513
Score = 313 bits (803), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 155/322 (48%), Positives = 216/322 (67%), Gaps = 7/322 (2%)
Query: 66 LADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGM 125
+ADD+G NDVGF+G QIPTP+IDALAY+GIIL +YT CTPSR+A++TG++PI GM
Sbjct: 1 MADDVGRNDVGFYGSGQIPTPSIDALAYNGIILDRFYTQCSCTPSRAALLTGQYPIRLGM 60
Query: 126 QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYW 185
Q + E LPL +PQYLK LGY+T +VGKWHLG+ E TP RGF+SH GYW
Sbjct: 61 QGLPIRAGENKSLPLDVVTMPQYLKRLGYKTHLVGKWHLGYAHIEDTPLQRGFDSHFGYW 120
Query: 186 TGHQDYFDHS-----AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE 240
G YF+++ A + + G D+ + PAW GKY+TD+FT +A+ II H++++
Sbjct: 121 NGFVGYFNYTAVYELANDTMVKGFDLFDGVVPAWQERGKYATDLFTHKAMKIIDEHNSEK 180
Query: 241 PLFLYLAHAATHSANPYEPLQAPDHYLNIHR--HIEDFKRSKFAAILHKLDESVGKVVEA 298
PLFL LAH A H+ L PD R I++ KR +A I+ +LD+SVG++VE
Sbjct: 181 PLFLVLAHLAGHTGEDGVELGVPDVAQAETRFSFIKNPKRRLYADIVSRLDDSVGQIVEK 240
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L+++ +L NSII+F+SDNG G N+ SNWPLRG+K T +EGG+R I+SPL
Sbjct: 241 LDEKNLLDNSIILFLSDNGAQTIGVYENSGSNWPLRGLKLTEFEGGIRAPAAIYSPLFHQ 300
Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
+G V+++ +H++D+LPT +AA
Sbjct: 301 KGYVSKELIHITDFLPTFYAAA 322
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 71/157 (45%), Gaps = 11/157 (7%)
Query: 571 YQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG 630
Y +IDG++ W VLS+ +P KR+ IL N ++ +SA+ G++KLV+ D G
Sbjct: 328 YLGKIDGVNQWEVLSQQKPGKRDEILVNFNEIDNVSAIIMGQYKLVQGRLAFPEYGDYHG 387
Query: 631 -------GPDQVYLSGLSD--REWLALAMRKLRDAASIQ-CGPVKEVPCEPQIAPCLFDI 680
P + + LS + + LR+ + C K P CLF++
Sbjct: 388 DSGRNPEDPPYNFTAVLSKNINQQNRNKIESLRNQTDLSWCRGEKNQPLIDCTEGCLFNL 447
Query: 681 KNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDK 717
++DPCE N+ + E + F+QI P +
Sbjct: 448 ESDPCETTNII-QEEKIIAGKLLGRLFEFSQILVPQQ 483
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 33/47 (70%)
Query: 506 YQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVK 552
Y +IDG++ W VLS+ +P KR+ IL N ++ +SA+ G++KLV+
Sbjct: 328 YLGKIDGVNQWEVLSQQKPGKRDEILVNFNEIDNVSAIIMGQYKLVQ 374
>gi|193641124|ref|XP_001950120.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 599
Score = 313 bits (803), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 169/361 (46%), Positives = 229/361 (63%), Gaps = 18/361 (4%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGP----PHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
++ ++ L +S + D GP PHII I+ADDLGWNDVGFHG QIPTPNIDA
Sbjct: 5 ISCSIRILIILVSSLLSDCSFLRGPNKKQPHIILIVADDLGWNDVGFHGSIQIPTPNIDA 64
Query: 91 LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
LAY+G+IL +Y CTPSR+A++TGK+PI G+Q + LPL+EKILPQYLK
Sbjct: 65 LAYNGVILNRHYVQPTCTPSRAALLTGKYPIRYGLQGFPIIAGVPLALPLNEKILPQYLK 124
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
+LGY T +VGKWHLG K ++TP RGF+SH GYW G Y + + M G D RR
Sbjct: 125 DLGYSTHLVGKWHLGANKNQHTPIKRGFDSHFGYWNGFISYRNSTHSTGLMVGKDARRGF 184
Query: 211 EPAWD-LHGKYSTDVFTAEA---VDIIHNHSTDEPLFLYLAHAATHSANPY-EPLQAPDH 265
E A D + +Y+TD+FT EA + + NH D+P+FL ++H A H+ P L+ +
Sbjct: 185 ERAGDEMVDRYATDIFTDEANKVIKLCKNH--DKPMFLMVSHLAVHTGVPGPNILEVSNK 242
Query: 266 YLNIHR--HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA-- 321
N R +IE+ +R +A +L LDESVG ++E+L+ ML +SI++F+SDNG A
Sbjct: 243 THNDIRFDYIENKERRLYAGMLTSLDESVGSIIESLDNNGMLEDSIVLFISDNGAPADDP 302
Query: 322 --GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
G+ N+ SNWPLRG K + +GGVRG IWSP L+ + + E H++DWLPTL +A
Sbjct: 303 IWGYG-NSGSNWPLRGEKGAVLDGGVRGVAAIWSPWLKKKHRIFENLFHITDWLPTLYTA 361
Query: 380 A 380
A
Sbjct: 362 A 362
>gi|391326893|ref|XP_003737944.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 528
Score = 313 bits (802), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 157/353 (44%), Positives = 216/353 (61%), Gaps = 10/353 (2%)
Query: 43 AFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY 102
F + + + ASS P+I+ I+ DDLGW+D+ HG DQIPTPNID LA G++L+NYY
Sbjct: 6 CFVIVLRIANCAASS--PNIVLIVIDDLGWDDISLHGSDQIPTPNIDKLAAEGVLLENYY 63
Query: 103 TVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKW 162
T +CTPSR A+MTGK+PIH G+Q++V+ G + GLP KI+PQYL Y++ I+GKW
Sbjct: 64 TQAICTPSRGALMTGKYPIHLGLQYDVIQGAQPYGLPTDFKIMPQYLSGTCYKSHIIGKW 123
Query: 163 HLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM-----KMW-GLDMRRDLEPAWDL 216
HLG + E PT RGF SH G+ GH DYF+H EE +M+ GLD+ + P
Sbjct: 124 HLGHSRSELLPTRRGFHSHFGFRLGHSDYFNHWGEESSPVKNEMYAGLDLWSNEVPIKKY 183
Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL-NIHRHIED 275
HG Y+ D+FT A+ I+ H+ PLFLYLAH A H + PLQAP ++ I D
Sbjct: 184 HGTYANDLFTKRAISILETHNKTTPLFLYLAHQAVHVGDGENPLQAPLSFVKKFEGKIYD 243
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
KR +AA++ +D S+G+++ + N+II +DNGG G + +SN PLRG
Sbjct: 244 EKRKHYAAMVSAMDASIGELMHGISTNGFAENTIIFLTNDNGGPTNGMASSGSSNHPLRG 303
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
K TLWEGG RG+ L W P G + H++D LPTLL+A+ + N+
Sbjct: 304 CKYTLWEGGTRGSALFWYPKKLGAGTY-KGMAHITDVLPTLLTASGNAVAGNF 355
>gi|156378148|ref|XP_001631006.1| predicted protein [Nematostella vectensis]
gi|156218038|gb|EDO38943.1| predicted protein [Nematostella vectensis]
Length = 584
Score = 313 bits (801), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 156/362 (43%), Positives = 216/362 (59%), Gaps = 52/362 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHII I+ADDLGW+DV FHG QIPTP ID A G+IL NYY +CTPSR+++MTGK+
Sbjct: 27 PHIIIIVADDLGWDDVSFHGSPQIPTPYIDFYANRGVILNNYYVSPMCTPSRASMMTGKY 86
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PI+ GM WHLGF+ KEYTP +RGF+
Sbjct: 87 PINLGM------------------------------------WHLGFFTKEYTPVYRGFD 110
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
S G+W DY++HS+ E WG+D+R ++EP G Y T++FT EAV +I H T
Sbjct: 111 SFYGFWNAKTDYWNHSSYENNFWGVDLRDNMEPVQSEDGTYGTELFTREAVKVIEAHDTS 170
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH---------IEDFKRSKFAAILHKLDE 290
PLFLY+AH A H+ANP EPLQAP +++ I+D +R +AA++ LD+
Sbjct: 171 TPLFLYVAHQAVHTANPNEPLQAPQDKIDVSLKQRQQRFKGTIDDDQRQVYAAMVTSLDQ 230
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
SVG + AL +R ML +S+++F +DNGGA G N N SN+PLRG K+ LWEGGV+G
Sbjct: 231 SVGDIFAALSKRHMLRDSVVIFTTDNGGAPYGLNWNRGSNFPLRGGKDMLWEGGVKGVAF 290
Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD---IPNY----VNSTVENIIPRYENS 403
++S L++ +G V+++ + V+DW+PT+ A + +PN V ST+ P +
Sbjct: 291 VYSDLIKQKGRVSKELIDVTDWVPTIYHLAGGTAEFLVPNMDGKNVWSTISEGAPSPRDE 350
Query: 404 IL 405
IL
Sbjct: 351 IL 352
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 71/149 (47%), Gaps = 22/149 (14%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
+DG +VWS +S PS R+ ILHNID + + L KGK+K+V+ G D
Sbjct: 331 MDGKNVWSTISEGAPSPRDEILHNIDPWRKFAGLRKGKYKIVQ-------------GMDD 377
Query: 635 VYLS-GLSDR----EWLALAMRKLRDAASIQCGPV--KEVPCEPQIAP-CLFDIKNDPCE 686
Y G DR ++ +L A I C +E C+ CLFD++ DPCE
Sbjct: 378 TYKGVGWYDRYPGHALSSMKQPELLPGAVIDCKKTFDEERKCDSSDGKFCLFDMEEDPCE 437
Query: 687 KNNLADRSEDQRINHYTTEVGRFNQIAYP 715
++L+++ + + T + + IA P
Sbjct: 438 YHDLSNQLP-EVLAEMKTRLEYYKNIALP 465
Score = 44.7 bits (104), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 30/43 (69%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVK 552
+DG +VWS +S PS R+ ILHNID + + L +GK+K+V+
Sbjct: 331 MDGKNVWSTISEGAPSPRDEILHNIDPWRKFAGLRKGKYKIVQ 373
>gi|195328499|ref|XP_002030952.1| GM25725 [Drosophila sechellia]
gi|194119895|gb|EDW41938.1| GM25725 [Drosophila sechellia]
Length = 585
Score = 312 bits (800), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 164/371 (44%), Positives = 234/371 (63%), Gaps = 10/371 (2%)
Query: 21 LPQYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGL 80
+ +L + T ++ VL +A + ++A+S P+II I+ADDLG++DV F G
Sbjct: 1 MSTHLDKFSSATSLLTGFVLFIALSNG-----IIATSDKPNIIIIMADDLGFDDVSFRGS 55
Query: 81 DQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL 140
D TPNIDALAYSG+IL N Y +CTPSR+A++TGK+PI+TGMQH V+ + GLPL
Sbjct: 56 DNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPL 115
Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
+E + + +E GYRT ++GKWHLG ++ +TPT RGF+ H GY + DY+ S E+
Sbjct: 116 NETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHFGYLGAYVDYYTQSYEQQN 175
Query: 201 MW--GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH---STDEPLFLYLAHAATHSAN 255
G D R +L+ D G Y TDV T AV I +H ++ +PLFL L H A H+AN
Sbjct: 176 KGYNGHDFRDNLKSTHDHVGHYVTDVLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAAN 235
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
+P+QAP ++ +I + +AA++ +LD+SVG V++AL ++ ML NSII+F+SD
Sbjct: 236 DDDPMQAPAEEVSRFEYISNKAHRYYAAMVSRLDKSVGSVIDALARQEMLQNSIILFLSD 295
Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
NGG G + ASN+PLRG KN+ WEG +R + IWS E G V +Q +++ D LPT
Sbjct: 296 NGGPTEGQHSTTASNYPLRGQKNSPWEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPT 355
Query: 376 LLSAANKSDIP 386
L +AA S P
Sbjct: 356 LAAAAGISPDP 366
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 69/146 (47%), Gaps = 21/146 (14%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL--TKGKWKLVKVVK-------VMRYQ 625
+DG+++WS L S I+H ID++ + L T+GKWK++ + ++
Sbjct: 370 LDGLNLWSALKYGYESVEREIVHVIDEDVAVPHLSYTRGKWKVISGTTNEGLYDGWLGHR 429
Query: 626 VDLTGGPDQVYLSGL--SDREWLALA--------MRKLRDAASIQC-GPVKEV-PCEPQI 673
P V L + WL L + +LRD + I+C P V PC P
Sbjct: 430 ETSEVDPGAVEYEELVRNTSVWLQLQQVTSGERNISELRDKSRIECPDPATGVKPCLPLE 489
Query: 674 APCLFDIKNDPCEKNNLADRSEDQRI 699
PCLFDI+ DPCE++NL ++ I
Sbjct: 490 GPCLFDIEADPCERSNLYAEYQNSTI 515
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 42/81 (51%), Gaps = 10/81 (12%)
Query: 747 KKYSNEEEGMRKLRDAASIQC-GPVKEV-PCEPQIAPCLFDIKNDPCEKNNLADRSEVQR 804
++ ++ E + +LRD + I+C P V PC P PCLFDI+ DPCE++NL
Sbjct: 455 QQVTSGERNISELRDKSRIECPDPATGVKPCLPLEGPCLFDIEADPCERSNL-------- 506
Query: 805 INHYTTEVGYLDPKQRFNQIA 825
Y +LD +R Q A
Sbjct: 507 YAEYQNSTIFLDLWERIQQFA 527
>gi|346464549|gb|AEO32119.1| hypothetical protein [Amblyomma maculatum]
Length = 531
Score = 312 bits (800), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 161/343 (46%), Positives = 210/343 (61%), Gaps = 11/343 (3%)
Query: 48 MVFVDLVASSG---------PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
++F+ + +G PPHI+FIL DDLGW D HG QIPTPN+DALA +G++L
Sbjct: 5 LLFISVACVAGSSVKTPFRKPPHIVFILVDDLGWADTSLHGSAQIPTPNLDALASTGVLL 64
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
NYY LC+PSR A+MTG +P H G++ L G + GLPL KILP++LK+LGY T I
Sbjct: 65 NNYYVQPLCSPSRGALMTGLYPAHNGIRMP-LVGAQVAGLPLQFKILPEHLKDLGYETHI 123
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
VGKWHLG++ YTPT+RGF++ G+ G DY+ E+ GLD
Sbjct: 124 VGKWHLGYFNLNYTPTYRGFDTFFGFHNGPIDYYRGIMEQEGHVGLDFWNGTSALPLKER 183
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
Y+T A II N +T +PLFLYLAH A HS E LQAP +I D R
Sbjct: 184 TYATARLQNHAKSIIANRNTSKPLFLYLAHQAVHSVYSPEFLQAPVENTKKFPYIRDSSR 243
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGFNLNAASNWPLRGVK 337
A ++ LD+SVG++VE LE+ +L ++I+VF +DNGG + N NWPLRG+K
Sbjct: 244 KILAGMMDALDQSVGELVEELEKAGILEDTILVFSTDNGGHPYTRADPNRGFNWPLRGIK 303
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
T+WEGG+RG IWSP LE V+ Q +H+SDWLPTL SAA
Sbjct: 304 GTVWEGGIRGTAFIWSPRLEVSRRVSTQLMHISDWLPTLYSAA 346
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 45/177 (25%), Positives = 73/177 (41%), Gaps = 37/177 (20%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV------------KVVKVM 622
+DGID+W LS PS R +L ++E +AL K+V +V+ +
Sbjct: 356 LDGIDMWQSLSTGGPSPRIEVLLEFNNETDRAALRYLNHKIVLGTFSDEVSKRYEVIGGV 415
Query: 623 RYQVDLTG---------------GPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEV 667
R +DL G Q++L G D+ W R A+++C
Sbjct: 416 RPGIDLDALQRNSKAGRVLKRFYGNPQLFLHGF-DKSW--------RRRATVRCNEHGRT 466
Query: 668 PCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKK 724
LFD+++DPCE NLAD S + ++G + + P+ +E + K
Sbjct: 467 NFNSSSKVFLFDLESDPCELRNLAD-SNVTLLTDLFVKLGAYLRTLRPEINKEIDPK 522
>gi|24666163|ref|NP_649020.1| CG7408, isoform B [Drosophila melanogaster]
gi|281366395|ref|NP_001163462.1| CG7408, isoform C [Drosophila melanogaster]
gi|281366397|ref|NP_001163463.1| CG7408, isoform D [Drosophila melanogaster]
gi|23093214|gb|AAF49290.2| CG7408, isoform B [Drosophila melanogaster]
gi|272455230|gb|ACZ94733.1| CG7408, isoform C [Drosophila melanogaster]
gi|272455231|gb|ACZ94734.1| CG7408, isoform D [Drosophila melanogaster]
Length = 585
Score = 312 bits (799), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 164/371 (44%), Positives = 234/371 (63%), Gaps = 10/371 (2%)
Query: 21 LPQYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGL 80
+ +L + T ++ VL +A + +VA+S P+II I+ADDLG++DV F G
Sbjct: 1 MSTHLDKFSSATSLLTGFVLCIALSNG-----IVATSDKPNIIIIMADDLGFDDVSFRGS 55
Query: 81 DQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL 140
+ TPNIDALAYSG+IL N Y +CTPSR+A++TGK+PI+TGMQH V+ + GLPL
Sbjct: 56 NNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPL 115
Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
+E + + +E GYRT ++GKWHLG ++ +TPT RGF+ HLGY + DY+ S E+
Sbjct: 116 NETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHLGYLGAYVDYYTQSYEQQN 175
Query: 201 MW--GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH---STDEPLFLYLAHAATHSAN 255
G D R L+ D G Y TD+ T AV I +H ++ +PLFL L H A H+AN
Sbjct: 176 KGYNGHDFRDSLKSTHDHVGHYVTDLLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAAN 235
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
+P+QAP ++ +I + +AA++ +LD+SVG V++AL ++ ML NSII+F+SD
Sbjct: 236 DDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVGSVIDALARQEMLQNSIILFLSD 295
Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
NGG G + ASN+PLRG KN+ WEG +R + IWS E G V +Q +++ D LPT
Sbjct: 296 NGGPTQGQHSTTASNYPLRGQKNSPWEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPT 355
Query: 376 LLSAANKSDIP 386
L +AA S P
Sbjct: 356 LAAAAGISPDP 366
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 70/158 (44%), Gaps = 45/158 (28%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL--TKGKWKLVKVVKVMRYQVDLTGGP 632
+DG+++WS L S I+H ID++ L T+GKWK++ +G
Sbjct: 370 LDGLNLWSALKYGYESVEREIVHVIDEDVAEPHLSYTRGKWKVI------------SGTT 417
Query: 633 DQ-VYLSGLSDRE--------------------WLALA--------MRKLRDAASIQC-G 662
+Q +Y L RE WL L + +LRD + I+C
Sbjct: 418 NQGLYDGWLGHRETSEVDPRAVEYEELVRNTSVWLQLQQVSFGERNISELRDQSRIECPD 477
Query: 663 PVKEV-PCEPQIAPCLFDIKNDPCEKNNLADRSEDQRI 699
P V PC P PCLFDI+ DPCE++NL ++ I
Sbjct: 478 PATGVKPCLPLEGPCLFDIEADPCERSNLYAEYQNSTI 515
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 52/121 (42%), Gaps = 10/121 (8%)
Query: 707 GRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
G NQ Y E + + + ++ + + ++ S E + +LRD + I+
Sbjct: 415 GTTNQGLYDGWLGHRETSEVDPRAVEYEELVRNTSVWLQLQQVSFGERNISELRDQSRIE 474
Query: 767 C-GPVKEV-PCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQI 824
C P V PC P PCLFDI+ DPCE++NL Y +LD R Q
Sbjct: 475 CPDPATGVKPCLPLEGPCLFDIEADPCERSNL--------YAEYQNSTIFLDLWSRIQQF 526
Query: 825 A 825
A
Sbjct: 527 A 527
>gi|195591201|ref|XP_002085331.1| GD14733 [Drosophila simulans]
gi|194197340|gb|EDX10916.1| GD14733 [Drosophila simulans]
Length = 585
Score = 312 bits (799), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 163/371 (43%), Positives = 234/371 (63%), Gaps = 10/371 (2%)
Query: 21 LPQYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGL 80
+ +L + T ++ VL +A + ++A+S P+II I+ADDLG++DV F G
Sbjct: 1 MSTHLDKFSSATSLLTGFVLFIALSNG-----IIATSDKPNIIIIMADDLGFDDVSFRGS 55
Query: 81 DQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL 140
D TPNIDALAYSG+IL N Y +CTPSR+A++TGK+PI+TGMQH V+ + GLP+
Sbjct: 56 DNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPI 115
Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
+E + + +E GYRT ++GKWHLG ++ +TPT RGF+ H GY + DY+ S E+
Sbjct: 116 NETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHFGYLGAYVDYYTQSYEQQN 175
Query: 201 MW--GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH---STDEPLFLYLAHAATHSAN 255
G D R +L+ D G Y TDV T AV I +H ++ +PLFL L H A H+AN
Sbjct: 176 KGYNGHDFRDNLKSTHDYVGHYVTDVLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAAN 235
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
+P+QAP ++ +I + +AA++ +LD+SVG V++AL ++ ML NSII+F+SD
Sbjct: 236 DDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVGSVIDALARQEMLQNSIILFLSD 295
Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
NGG G + ASN+PLRG KN+ WEG +R + IWS E G V +Q +++ D LPT
Sbjct: 296 NGGPTEGQHSTTASNYPLRGQKNSPWEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPT 355
Query: 376 LLSAANKSDIP 386
L +AA S P
Sbjct: 356 LAAAAGISPDP 366
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/202 (25%), Positives = 74/202 (36%), Gaps = 68/202 (33%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL--TRGKWKLVKENSING--------N 559
+DG+++WS L S I+H ID++ + L TRGKWK++ + G
Sbjct: 370 LDGLNLWSALKYGYESVEREIVHVIDEDVAVPHLSYTRGKWKVISGTTNEGLYDGWLGHR 429
Query: 560 GTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV 619
TSE Y+ I VW L + +RN +
Sbjct: 430 ETSEVDPRAVEYEELIRNTSVWLQLQQVSSGERN-------------------------I 464
Query: 620 KVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQC-GPVKEV-PCEPQIAPCL 677
+RYQ + I+C P V PC P PCL
Sbjct: 465 SELRYQ-------------------------------SRIECPDPATGVKPCLPLEGPCL 493
Query: 678 FDIKNDPCEKNNLADRSEDQRI 699
FDI+ DPCE++NL ++ I
Sbjct: 494 FDIEADPCERSNLYAEYQNSTI 515
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 40/81 (49%), Gaps = 10/81 (12%)
Query: 747 KKYSNEEEGMRKLRDAASIQC-GPVKEV-PCEPQIAPCLFDIKNDPCEKNNLADRSEVQR 804
++ S+ E + +LR + I+C P V PC P PCLFDI+ DPCE++NL
Sbjct: 455 QQVSSGERNISELRYQSRIECPDPATGVKPCLPLEGPCLFDIEADPCERSNL-------- 506
Query: 805 INHYTTEVGYLDPKQRFNQIA 825
Y +LD R Q A
Sbjct: 507 YAEYQNSTIFLDLWARIQQFA 527
>gi|126697478|gb|ABO26696.1| sulfatase 1B precursor [Haliotis discus discus]
Length = 382
Score = 311 bits (798), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 156/342 (45%), Positives = 213/342 (62%), Gaps = 12/342 (3%)
Query: 42 LAFTLSMVFVDLVAS-SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
L T +++FV L S + PH++FI+ADDLGWNDVGF Q+ TP++D LA +G+IL +
Sbjct: 6 LVATFTLIFVYLYGSQAAKPHVVFIVADDLGWNDVGFRN-PQVLTPHLDKLAKAGVILNS 64
Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
Y LC+PSR+ M+G P HTG+Q V+ G +P+ LPQ LKELGY T VG
Sbjct: 65 SYVQPLCSPSRNCFMSGYFPYHTGLQDGVIRPASPGFVPIKFTFLPQKLKELGYSTHAVG 124
Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
KWHLGF +YTPT+RGF++ +GY+ G +DY+ H+ E K G D+R + + GKY
Sbjct: 125 KWHLGFCNLKYTPTYRGFDTFVGYYIGAEDYYKHTREYDKFSGYDLRFNTSVYTEAKGKY 184
Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
ST VF AVDII +H TD PL+LYL A H+ PL+ P Y N+++HI D R
Sbjct: 185 STRVFAERAVDIIKSHDTDTPLYLYLPFQAVHA-----PLEVPPEYENLYKHIHDLPRRT 239
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
+ ++ LDE+V + ALE+ ++ N ++VF +DNGG G AA+N PLRG K TL
Sbjct: 240 YCGMISALDEAVANITNALEETGLIDNLLLVFTTDNGGPFNG----AANNLPLRGCKATL 295
Query: 341 WEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
WEGG +GAG I+S LL+ G + +H DW PT + A
Sbjct: 296 WEGGTKGAGFIYSKTLLKKTGYLNTGMMHAVDWYPTFVELAG 337
>gi|241156195|ref|XP_002407716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215494207|gb|EEC03848.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 548
Score = 311 bits (796), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 166/347 (47%), Positives = 213/347 (61%), Gaps = 20/347 (5%)
Query: 47 SMVFVDLVASS--GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
S+VF+ VA + PPHI+FILADDLGW+DV FHG QIPTPNID LA GI L NYY
Sbjct: 22 SLVFLATVALAKKMPPHIVFILADDLGWDDVSFHGSSQIPTPNIDVLAADGITLHNYYVQ 81
Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
+CTPSR+A+MTG +PI TGMQH V+ E GLPL K++P++LK+LGY T +VGK
Sbjct: 82 PMCTPSRAALMTGLYPIRTGMQHWVIRSPEPWGLPLELKLMPEHLKDLGYSTHLVGKVLF 141
Query: 165 GFYK----------KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
K E T F S + YF + + GLD R EP
Sbjct: 142 DLKKFIVSVNRLCINESTEVCHTFVSAVTLCI----YFVYKSHA----GLDFRNGEEPFH 193
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
+ G+Y+T +FT A+ II H+ +PLFLYL+H A H A EPLQAPD + +I
Sbjct: 194 NDTGQYATTLFTDRAISIIEQHNQTKPLFLYLSHLAPHGATHDEPLQAPDENVEKFDYIG 253
Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLR 334
+ R+ +A ++ LD SVG+VVEAL + ML NSI+ F SDNG GF N NWPLR
Sbjct: 254 EEDRTIYAGMVDALDVSVGRVVEALSRAGMLENSIVAFSSDNGAVPFGFRSNRGFNWPLR 313
Query: 335 GVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
G+K ++WEGGVR G +WSPLL ++ Q +H++DWLPTL +AA
Sbjct: 314 GIKASVWEGGVRVPGFVWSPLLRKSARLSTQMMHITDWLPTLYAAAG 360
Score = 43.1 bits (100), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 47/176 (26%), Positives = 71/176 (40%), Gaps = 23/176 (13%)
Query: 573 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV--KVVKVMRYQVDLTG 630
++DG ++W LSR S R IL NID + L GK KL+ V V TG
Sbjct: 367 GQLDGHNMWEALSRGWRSPRTEILLNIDPITGSAGLRFGKHKLLIPGVNGTTDAHVPTTG 426
Query: 631 GP------DQVYLSGLS----------DREWLALAMRKLRDAASIQCGPVK-EVPCEPQI 673
P D + L+ D W R ++A + CGP + +
Sbjct: 427 YPRPTEDMDTMMQQSLAAKVLRRFYNVDGIWFR---RGWENSAEVDCGPAEWQSNFVSGR 483
Query: 674 APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKK 729
LFDI+ DPCE NLA + + ++ + P + +EE+ + +K
Sbjct: 484 VYNLFDIERDPCEMINLA-ATRRGTVRMMMRKLQSYAAAMIPPRNKEEDPRSFPEK 538
>gi|403182690|gb|EJY57566.1| AAEL017192-PA [Aedes aegypti]
Length = 1007
Score = 310 bits (793), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 161/380 (42%), Positives = 221/380 (58%), Gaps = 24/380 (6%)
Query: 31 RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
R+++ A L L + M+ PP++I I+ADDLGWNDV FH QI TPNID
Sbjct: 427 RSQLAARQALLLIWLSLMIGKIKGVDRSPPNVIVIVADDLGWNDVSFHSSKQIFTPNIDV 486
Query: 91 LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
LAY G+IL +Y T S+ A+MTG HP+ G Q + L K++P+Y +
Sbjct: 487 LAYHGVILNRHYCAPFGTASQVALMTGSHPLSVGTQSASNEPDQPWTLDPELKLMPEYFR 546
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE--EMKMWGLDMRR 208
+ GY T ++GKW LGF +K+YTPT RGF+SH G+ + DY+DHS GLDMRR
Sbjct: 547 DAGYATHLIGKWGLGFSRKDYTPTQRGFDSHFGFLGPYIDYWDHSLRLRNTSTRGLDMRR 606
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
+L+ + ++G Y+TD+F EAV +I H +PL L L H A H+ N +P+QAP +
Sbjct: 607 NLDVDYSVNGSYATDLFNGEAVRLIREHDQKKPLLLVLTHLAPHTGNEDDPMQAPAEEVE 666
Query: 269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
+I D KR AA++ K+DE VG++++ L +R ML NSI++F +DNG G + N+
Sbjct: 667 KFDYIRDEKRRVLAAMISKVDEGVGQIIQTLAERDMLDNSIVLFYADNGAPTVGMHSNSG 726
Query: 329 SNWPLRGV---------------------KNTLWEGGVRGAGLIWSPLLE-SRGIVAEQY 366
SN+PLRGV K + WEG VR IWSPLL + G V+ Q+
Sbjct: 727 SNFPLRGVCIDNWSTLHIRCTNGCFSFQQKYSPWEGAVRTVAAIWSPLLNLTAGRVSNQW 786
Query: 367 VHVSDWLPTLLSAANKSDIP 386
+HVSDWLPTL AA IP
Sbjct: 787 IHVSDWLPTLAHAAGIEGIP 806
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 69/135 (51%), Gaps = 18/135 (13%)
Query: 573 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV-------KVVKVMRYQ 625
+EIDG + W L + RN +++NID+ S+ ++ WK V K M
Sbjct: 809 SEIDGRNQWEALKNPAITVRNVVMNNIDELHHYSSYSRNGWKYVNGTSWEGKFDNWMGEL 868
Query: 626 VDLTGGPDQVY---LSGLSDREWLALAMR---KLRDAASIQC---GPVKEVPCEPQIAPC 676
D ++ Y L+G + L + +LR AA+++C G VK C+P +PC
Sbjct: 869 EDEDELSEEEYVVRLAGSVVGRMMPLDLEHVARLRRAATVECEVEGQVKA--CQPMESPC 926
Query: 677 LFDIKNDPCEKNNLA 691
LF++ +DPCE+NN+A
Sbjct: 927 LFNLMDDPCERNNVA 941
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 34/47 (72%), Gaps = 5/47 (10%)
Query: 754 EGMRKLRDAASIQC---GPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
E + +LR AA+++C G VK C+P +PCLF++ +DPCE+NN+A
Sbjct: 897 EHVARLRRAATVECEVEGQVKA--CQPMESPCLFNLMDDPCERNNVA 941
>gi|307215079|gb|EFN89886.1| Arylsulfatase B [Harpegnathos saltator]
Length = 557
Score = 310 bits (793), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 158/355 (44%), Positives = 229/355 (64%), Gaps = 16/355 (4%)
Query: 39 VLPLAFTLSMVFVDL-----VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
VL F ++ ++ S PP+I+ I+ADDLGWNDV F+G D+IPTPNID+LAY
Sbjct: 11 VLLFIFVCDVILKNISRNSQATSKQPPNIVVIMADDLGWNDVSFNGGDEIPTPNIDSLAY 70
Query: 94 SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
+G+IL +Y + +CTPSR+A TG++PI +GMQ L G E +PL+ +LPQYL++LG
Sbjct: 71 NGVILNRHYVLPICTPSRTAFFTGQYPIRSGMQGYPLQGAEPRSIPLNNILLPQYLRKLG 130
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y T +VGKWH+G+ +TPT RGF++ LGY++G+ +YF H+ E G D+ R +
Sbjct: 131 YATHLVGKWHVGYQTNNHTPTNRGFDTFLGYYSGYIEYFSHNLVENGQSGYDIHRSVGDN 190
Query: 214 WDLHGKYS--TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQ-----APDHY 266
+ +Y TD+ T EA +II +H+ +PL+L LAH A H++ + ++ A +
Sbjct: 191 HTIEYRYDYMTDLITDEAENIISSHNPAKPLYLQLAHLAPHASTVDDVIEVRSWKATNDT 250
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL- 325
L +I D R KFA+ + +DES+G+VV+AL + ML NSIIVF+SDNG L
Sbjct: 251 LG---YIRDINRRKFASAVVTMDESIGRVVDALRRADMLKNSIIVFMSDNGAPTKDQILY 307
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
N SN+PLRG+K + +EG VRG I+SPL++ V+ Q H++DWLPTL +AA
Sbjct: 308 NFGSNYPLRGMKQSFFEGAVRGVACIYSPLIDFPSRVSTQLFHITDWLPTLYAAA 362
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 64/141 (45%), Gaps = 17/141 (12%)
Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV----------- 616
D S +DGID WS++ + +R +L NI+++ A G++KL+
Sbjct: 365 DASDLKALDGIDQWSMIKNADNGRRQFLLVNINEKTDSKAALIGRYKLIRDQSEYTKYYG 424
Query: 617 ---KVVKVMRYQV-DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQ 672
K Y V ++ P ++ +S +++LR+ + + C + C +
Sbjct: 425 STGKDPSYCEYNVTNVLASPVASAIADISRSSLDIDKIKQLREKSRVVCRVSRFSRCTKR 484
Query: 673 IAPCLFDIKNDPCEKNNLADR 693
CLFDIK DPCE N++ +
Sbjct: 485 T--CLFDIKKDPCETTNVSSK 503
>gi|195494699|ref|XP_002094950.1| GE19934 [Drosophila yakuba]
gi|194181051|gb|EDW94662.1| GE19934 [Drosophila yakuba]
Length = 591
Score = 310 bits (793), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 160/353 (45%), Positives = 227/353 (64%), Gaps = 6/353 (1%)
Query: 40 LPLAFTLSMVFVD-LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
L F L + + ++A+S P+II I+ADDLG++DV F G + TPNIDALAYSG+IL
Sbjct: 14 LITGFVLCIALSNGIIATSDKPNIIIIMADDLGFDDVSFRGSNNFLTPNIDALAYSGVIL 73
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
N Y +CTPSR+A++TGK+PI+TGMQH V+ + GLPL+E + + +E GYRT +
Sbjct: 74 NNLYVAPMCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSL 133
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW--GLDMRRDLEPAWDL 216
+GKWHLGF ++ +TPT RGF+ H GY + DY+ S E+ G D R +L+ D
Sbjct: 134 LGKWHLGFSQRNFTPTQRGFDRHFGYLGAYVDYYTQSYEQQNKGYNGHDFRDNLKSTHDH 193
Query: 217 HGKYSTDVFTAEAVDIIHNH---STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
G+Y TDV T AV I +H ++ +PLFL L H A H+AN P+QAP ++ +I
Sbjct: 194 VGRYITDVLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDNPMQAPAEEVSRFEYI 253
Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
+ +AA++ KLD+SVG V++AL ++ ML NSI++F+SDNGG G + ASN+PL
Sbjct: 254 GNKTHRYYAAMVSKLDQSVGYVIDALARQEMLQNSIVLFLSDNGGPTQGQHSTTASNYPL 313
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
RG K++ WEG +R + IWS E G V +Q +++ D LPTL + A S P
Sbjct: 314 RGQKDSPWEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAGAGISPDP 366
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 67/149 (44%), Gaps = 45/149 (30%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL--TKGKWKLVKVVKVMRYQVDLTGGP 632
+DG+++WS L S I+H ID++ + L T+GKWK++ +G
Sbjct: 370 LDGLNLWSALKYGYESVEREIVHVIDEDVAMPHLSYTRGKWKVI------------SGTT 417
Query: 633 DQ-VYLSGLSDRE--------------------WLALA--------MRKLRDAASIQCGP 663
+Q +Y L RE W L + +LRD ASI+C
Sbjct: 418 NQGLYDGWLGHRETSEVDPRAAVYEELVRNTSVWRQLQQVSFGERNISELRDQASIECPD 477
Query: 664 VKE--VPCEPQIAPCLFDIKNDPCEKNNL 690
PC P APCLFDI+ DPCE++NL
Sbjct: 478 TANGVRPCLPLEAPCLFDIEADPCERSNL 506
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 52/121 (42%), Gaps = 10/121 (8%)
Query: 707 GRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
G NQ Y E + + ++ + ++ ++ S E + +LRD ASI+
Sbjct: 415 GTTNQGLYDGWLGHRETSEVDPRAAVYEELVRNTSVWRQLQQVSFGERNISELRDQASIE 474
Query: 767 CGPVKE--VPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQI 824
C PC P APCLFDI+ DPCE++NL Y +LD R Q
Sbjct: 475 CPDTANGVRPCLPLEAPCLFDIEADPCERSNL--------YAEYKNSTTFLDLWARIQQF 526
Query: 825 A 825
A
Sbjct: 527 A 527
>gi|390360193|ref|XP_788463.2| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 537
Score = 309 bits (791), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 161/368 (43%), Positives = 226/368 (61%), Gaps = 23/368 (6%)
Query: 25 LKELGYRTRIMAFAVLPLAFTLSMVFVDLV--ASSGPPHIIFILADDLGWNDVGFHGLDQ 82
LK L + AF V L T+ ++ L+ A+ PPHI+FI+ADD GW DVG+H
Sbjct: 4 LKMLKVKRCSCAFLVEALTVTV-LICTGLIKGATGKPPHIVFIVADDYGWFDVGYHN-ST 61
Query: 83 IPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSE 142
I TPN+D LA G+ L+NYY +C+PSRS +MTG++ IHTG+QH V+ + LPL+E
Sbjct: 62 IKTPNLDLLASRGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHFVIIAPQPNCLPLNE 121
Query: 143 KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-------S 195
LPQ LKE GY T +VGKWHLGFYK E P RGF+S GY +G QDY+ H
Sbjct: 122 TTLPQKLKESGYATHLVGKWHLGFYKNECMPLQRGFDSSFGYLSGMQDYWTHFRSGSFPG 181
Query: 196 AEEMKMW-GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA 254
E W G+D + AW+ G YS VFT A +I H+ ++PLFLYL + H
Sbjct: 182 FPEGNHWLGIDFWDNNRVAWEYTGNYSQFVFTERAQRVIQQHNPNQPLFLYLPLQSVHG- 240
Query: 255 NPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
PLQ P+ Y+ + H +D R +A ++ +DE+VGKVV++L++ + +++++VF +
Sbjct: 241 ----PLQVPEKYMKPYAHFQDVGRQTYAGMVATMDEAVGKVVDSLQEAGLWNDTVLVFTT 296
Query: 315 DNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDW 372
DNGG + +NWPLRG KNTLWEGGV G G I P++ + +G V++ ++H+SDW
Sbjct: 297 DNGGTPG----KSGNNWPLRGTKNTLWEGGVHGVGFITGPMIPAGVQGTVSKHFMHISDW 352
Query: 373 LPTLLSAA 380
PTL+
Sbjct: 353 FPTLIEGV 360
>gi|346465011|gb|AEO32350.1| hypothetical protein [Amblyomma maculatum]
Length = 500
Score = 308 bits (790), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 159/343 (46%), Positives = 211/343 (61%), Gaps = 11/343 (3%)
Query: 48 MVFVDLVASSG---------PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
++F+ + +G PPHI+FIL DDLGW D HG QIPTPN+DALA +G++L
Sbjct: 5 LLFISVACVTGTSAKTPYPTPPHIVFILVDDLGWADTSLHGSAQIPTPNLDALASTGVLL 64
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
NYY LC+PSR+A+MTG +P H G++ ++ G + GLPL KILP++LK+LGY T +
Sbjct: 65 NNYYVQPLCSPSRAALMTGLYPAHNGIRMPLM-GAQVAGLPLQFKILPEHLKDLGYETHM 123
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
VGKWHLG YTPT+RGF++ G++ G DY+ E+ GLD
Sbjct: 124 VGKWHLGHSSLNYTPTYRGFDTFFGFYNGPIDYYHGIMEQEGHIGLDFWNGTRALPLEER 183
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
Y+T F A II N + +PLFLYLAH A HSA E LQAP + D R
Sbjct: 184 IYATTRFQDHANYIIANRNASKPLFLYLAHQAVHSAYEPEFLQAPGENTKKFPFLGDASR 243
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGFNLNAASNWPLRGVK 337
A ++ LD+SVG++VE LE+ +L ++I+VF +DNGG + N NWPLRG+K
Sbjct: 244 KSLAGMVDALDQSVGELVEELEKAGILEDTILVFSTDNGGHPYTRSDPNRGFNWPLRGIK 303
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
T+WEGG+RG IWSP LE V+ Q +H+SDWLPTL SAA
Sbjct: 304 GTVWEGGIRGTAFIWSPRLELSSRVSTQLMHISDWLPTLYSAA 346
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 61/146 (41%), Gaps = 38/146 (26%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV------------KVVKVM 622
+DGID+W LS PS R +L ++E +AL K+V +VV +
Sbjct: 356 LDGIDMWQSLSTGGPSPRTEVLLEFNNETDRAALRYLNHKIVLGIYSNEVSERYEVVGGV 415
Query: 623 RYQVDLT----------------GGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKE 666
R +DL G P Q++L G D+ W R A+++C
Sbjct: 416 RPGIDLDVLQRNSKAGRVLARFYGNP-QLFLRGF-DKSW--------RSRATVRCNENGP 465
Query: 667 VPCEPQIAPCLFDIKNDPCEKNNLAD 692
LFD++ DPCE +NLAD
Sbjct: 466 TNFNSSSQVFLFDLETDPCELHNLAD 491
>gi|195379278|ref|XP_002048407.1| GJ13952 [Drosophila virilis]
gi|194155565|gb|EDW70749.1| GJ13952 [Drosophila virilis]
Length = 574
Score = 308 bits (788), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 152/325 (46%), Positives = 210/325 (64%), Gaps = 4/325 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P++I I+ADD+G++D+ G + TPNIDAL + G +L Y +CTPSR A+++GK+
Sbjct: 35 PNVIIIMADDMGFDDISLRGGREFLTPNIDALGFHGRLLDRLYAPAMCTPSRGALLSGKY 94
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTG QH V+ E L L+ ++P+ + GY T +VGKWHLGF + EYTPT RGF+
Sbjct: 95 PIHTGTQHFVISNEEPWSLMLNTTLMPEIFRSAGYSTNLVGKWHLGFARPEYTPTHRGFD 154
Query: 180 SHLGYWTGHQDYFDHSA---EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
H GYW + DY+ + E+ + G D RR++E G Y TD+ T EA II
Sbjct: 155 YHYGYWGAYIDYYQRRSQMPEKTYIVGYDFRRNMEVECTDRGVYVTDLLTNEAERIIQET 214
Query: 237 STDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
+ + PLFL + H ATH+AN +PLQAP+ + HI+D K+AA++ KLD+SVG+V
Sbjct: 215 AAKQKPLFLMINHLATHTANDNDPLQAPEEEVQKFLHIKDPNHRKYAAMVSKLDQSVGRV 274
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
+ AL + L NSI++F SDNG + G N SNWPL+G KN+ WEGGVR AG+IWS L
Sbjct: 275 ITALARAEQLENSIVIFYSDNGAPSVGMFANTGSNWPLKGQKNSPWEGGVRVAGVIWSQL 334
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
L +RG + Q ++V D+LPTL AA
Sbjct: 335 LTARGHLFTQPIYVGDFLPTLAHAA 359
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 64/140 (45%), Gaps = 23/140 (16%)
Query: 574 EIDGIDVWSVL--SRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV------------ 619
++DGID+W L S++ P ILH++DD S+L G+WK V
Sbjct: 368 QLDGIDLWPQLAGSKDAPHVPREILHSLDDVSHFSSLLLGQWKYVNGTTLAGQFDQQLTH 427
Query: 620 --------KVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
+ RY V++ P L+ + M LR A+++CG ++ C
Sbjct: 428 RELDELDPRARRYVVEVRNSPASKALARYDLQRLTQHRMHTLRRLATVRCGDMQR-GCNA 486
Query: 672 QIAPCLFDIKNDPCEKNNLA 691
+ CL+D+ DPCE NNLA
Sbjct: 487 LLEECLYDLSVDPCELNNLA 506
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 25/95 (26%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
TT G+F+Q + +E + + ++ + + K + ++ + M LR
Sbjct: 415 TTLAGQFDQQLTHRELDELDPRARRYVVEVRNSPASKALARYDLQRLTQHR--MHTLRRL 472
Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
A+++CG ++ C + CL+D+ DPCE NNLA
Sbjct: 473 ATVRCGDMQR-GCNALLEECLYDLSVDPCELNNLA 506
>gi|156359506|ref|XP_001624809.1| predicted protein [Nematostella vectensis]
gi|156211610|gb|EDO32709.1| predicted protein [Nematostella vectensis]
Length = 488
Score = 308 bits (788), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 161/345 (46%), Positives = 215/345 (62%), Gaps = 20/345 (5%)
Query: 43 AFTLSMVFVDLV--ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
A +L ++ ++ A++ PPHI+FIL DDLGW D+G+HG I TPNI+ LA GIIL N
Sbjct: 6 ALSLVLLSAQILSEANAIPPHIVFILVDDLGWFDLGYHG-SVIRTPNINQLAGDGIILDN 64
Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
YY LCTP+RSA+MTGK+PIH G QH V+ + GLPL LP+ LK+ GY T IVG
Sbjct: 65 YYVQPLCTPTRSALMTGKYPIHLGTQHGVILPGQPMGLPLDSSTLPEQLKQQGYATHIVG 124
Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
KWHLGFYK+++ PT RGF+S GY+ G +D+F H+ LD R + D G Y
Sbjct: 125 KWHLGFYKEDFVPTKRGFDSFYGYYCGAEDHFTHNVLGF----LDFRDNDLIVKDQKGTY 180
Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
T FT AVD IH H++ PLFLYL H P+QAP Y++ + I+D R
Sbjct: 181 GTRAFTKRAVDTIHRHNSSSPLFLYLPFQNVHG-----PVQAPPEYIDKYSFIKDKTRRT 235
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG--AAAGFNLNAASNWPLRGVKN 338
AA++ +DE++G V AL+ + N+++VF +DNGG A G+ N+PLRG KN
Sbjct: 236 HAAMVDIMDEAIGNVTSALKSAGLWENTLLVFSTDNGGIHTAGGY------NYPLRGEKN 289
Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
TLWEGGVRGAG + P+ G++ +HV+DW PTL+ A S
Sbjct: 290 TLWEGGVRGAGFVSGPMAPRHGMIYNGLMHVTDWYPTLVHLAGGS 334
>gi|260803290|ref|XP_002596523.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
gi|229281781|gb|EEN52535.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
Length = 492
Score = 307 bits (787), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 151/332 (45%), Positives = 208/332 (62%), Gaps = 22/332 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+FILADD GWND+G+H I TPN+D LA G+ L+NYY +C+PSR+ +MTG++
Sbjct: 16 PHIVFILADDYGWNDIGYHS-SLIQTPNLDRLAQEGVKLENYYIQPICSPSRAQLMTGRY 74
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
I GMQH+VL GLPL E LPQ LKE GY T IVGKWHLG +KKEY PT+RGF+
Sbjct: 75 QIRYGMQHSVLMSDRPHGLPLGEVTLPQVLKESGYATHIVGKWHLGHFKKEYLPTWRGFD 134
Query: 180 SHLGYWTGHQDYFDH--------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
+ G+ G +DYF H + E + D +P +G YST VF +++D
Sbjct: 135 TFFGFLGGGEDYFTHRIPNEIVETPETYR--AFDFWDGSKPCLSENGSYSTHVFARKSID 192
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
+I H D+P+FLYL A H+ PL+AP+ ++N + HI +A + +DE+
Sbjct: 193 LISRHDKDKPMFLYLPFQAVHA-----PLEAPEEFINKYTHIRSKNMRTYAGVTTAMDEA 247
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
VG + AL+ M N++++F +DNG + AG SNWPLRG KNTLWEGGVRG G +
Sbjct: 248 VGNITRALKDHGMWENTVLIFSTDNGASKAG------SNWPLRGFKNTLWEGGVRGVGFV 301
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
SPLL+ + +++ +H+SDW PT++ A S
Sbjct: 302 SSPLLKVKQRISDALLHISDWFPTIVRIAGGS 333
Score = 40.8 bits (94), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 22/36 (61%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH+VL GLPL E LPQ LKE GY T I+
Sbjct: 80 MQHSVLMSDRPHGLPLGEVTLPQVLKESGYATHIVG 115
Score = 40.0 bits (92), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 48/108 (44%), Gaps = 24/108 (22%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID--------DEWQI-----------SALTRGKWKL 550
+DG DVW +S PS R ILHNID D +I +A+ WKL
Sbjct: 340 LDGYDVWDTISEGTPSPRTEILHNIDPLIAATNSDASKIHGHSTFNTSVKAAIRVRDWKL 399
Query: 551 VKENSINGNGTSENRSNDNSYQNEIDGI---DVWSVLSRNEPSKRNTI 595
+ N GNG + ++ D + + G+ +W R++P +R +
Sbjct: 400 LTGNP--GNGDWQLQTFDGKIKEKSTGVRGKQLWLFNIRSDPQERKDV 445
>gi|449684458|ref|XP_002164438.2| PREDICTED: arylsulfatase I-like, partial [Hydra magnipapillata]
Length = 784
Score = 306 bits (785), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 153/311 (49%), Positives = 209/311 (67%), Gaps = 2/311 (0%)
Query: 71 GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
GW+DV FHG QIPTPNID+LA SG+IL NYY +++ MTGK+ H G QH VL
Sbjct: 1 GWDDVSFHGSPQIPTPNIDSLAKSGVILNNYYVSPSSFATKTEFMTGKYATHLGTQHGVL 60
Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH-Q 189
+ + GLP +EKILPQYLKE GY VGKW LG+YK+E P RGF+ G T +
Sbjct: 61 HNKQPFGLPHTEKILPQYLKEAGYNNYAVGKWALGYYKEEMLPWKRGFDFFYGGLTSSGK 120
Query: 190 DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHA 249
DY+ HSA + + +GLD+RR+ E + G Y T+V+T EAV+II N++ ++PLFLY+AH
Sbjct: 121 DYYTHSAFD-ENYGLDLRRNNEVIHNETGNYITEVYTREAVNIIKNYNDNKPLFLYVAHQ 179
Query: 250 ATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSI 309
A H+ N +PLQAP+ YL HI++ KR FA ++ LDESV + AL Q+ +L N++
Sbjct: 180 AVHTGNADDPLQAPESYLKKLNHIKNIKRKLFAGMVLALDESVLNITIALAQKGLLDNTV 239
Query: 310 IVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHV 369
IVF +D GGA G L++ASN+PLRG K T+WEGG+R ++S L++++G V+ + H
Sbjct: 240 IVFTTDAGGAVGGQELSSASNFPLRGSKLTVWEGGIRAVAFVYSDLIKNKGRVSLEMYHS 299
Query: 370 SDWLPTLLSAA 380
+DWL T L A
Sbjct: 300 TDWLLTTLGLA 310
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 62/128 (48%), Gaps = 17/128 (13%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQ-ISALTKGKWKLVKVVKVMRYQVDLTGGP 632
++DG DVW +S E S R ILH D + + +L G +KLV + + + + D P
Sbjct: 318 KVDGFDVWKSISEGEVSPRFEILHQFDPLTKNLCSLRVGDYKLV-INQDIGFYGDWYPRP 376
Query: 633 DQVYLSGLSDREWLALAMRKLRDAASIQCG-----PVKEV---PCEPQIAPCLFDIKNDP 684
++ E L + A + C P E+ PC+P PC+F+I+ DP
Sbjct: 377 AEI-------GELRYLKRIETLPNAKVTCDKNYPHPFLEMHAPPCDPMKKPCMFNIQWDP 429
Query: 685 CEKNNLAD 692
CE +NLA+
Sbjct: 430 CEFHNLAE 437
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 19/29 (65%), Positives = 22/29 (75%)
Query: 2 QHNVLYGCERGGLPLSEKILPQYLKELGY 30
QH VL+ + GLP +EKILPQYLKE GY
Sbjct: 56 QHGVLHNKQPFGLPHTEKILPQYLKEAGY 84
>gi|22450123|emb|CAD33828.1| glucosinolate sulphatase [Plutella xylostella]
Length = 547
Score = 306 bits (785), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 151/335 (45%), Positives = 214/335 (63%), Gaps = 10/335 (2%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
V+++ PH+I I+ADD+GW+D HG + TPN+D L SG+ L YYT LC+P+R+A
Sbjct: 17 VSAATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTA 76
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TGK+ GMQ L E G+PL E+++ QYL++ GYRT++VGKWH+G E P
Sbjct: 77 VLTGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLP 136
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEM----KMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
T+RGFE+H G G DY++++A+E + GL + DL+P W G Y TDV+T ++
Sbjct: 137 TYRGFENHFGVRGGFIDYYEYNAQEQLDGRPVTGLCLFDDLQPDWTTEG-YITDVYTEKS 195
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
II NH+ EPL+L L H A H+ N LQAP + RH+E R FAA++ KLD
Sbjct: 196 TTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAAMVKKLD 255
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
+S+G++V LE++ ML N+II F +DNG G N+ SN+PLRGVK + WEGG+RG
Sbjct: 256 DSIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANSGSNYPLRGVKKSPWEGGIRGNA 315
Query: 350 LIW-----SPLLESRGIVAEQYVHVSDWLPTLLSA 379
+IW +P RG V + +H +DW+PTLL A
Sbjct: 316 MIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEA 350
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 57/183 (31%), Positives = 79/183 (43%), Gaps = 30/183 (16%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV----KVMRYQVDLTG 630
+DGI +WS + N+PS R I IDD + S++T G+ KLVK Y DL G
Sbjct: 359 LDGIPMWSHIIENKPSPRTEIF-EIDDYFNHSSVTLGRHKLVKGTIDESLSKHYGEDLRG 417
Query: 631 --GPDQVYLSGLSD-REWLALAMRKL---------RDAASIQCGPVKEVPCEPQIAP-CL 677
G Y L D + W +L + RD A + CG V PC P CL
Sbjct: 418 IIGTPPDYKQKLRDSKAWESLETIGIPLDADVMADRDEAIVTCGNVVPKPCSPSAESWCL 477
Query: 678 FDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKK 737
+DI DPCE +L++ E+ + QI E+EE K ++ + K
Sbjct: 478 YDIIEDPCELRDLSE------------ELPQLAQILLYRLEQEEAKIIPREGQYVADPKS 525
Query: 738 KKK 740
K
Sbjct: 526 APK 528
Score = 43.5 bits (101), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 51/126 (40%), Gaps = 9/126 (7%)
Query: 680 IKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKK 739
I+N P + + + D NH + +GR + E + + + K
Sbjct: 369 IENKPSPRTEIFEI--DDYFNHSSVTLGRHKLVKGTIDESLSKHYGEDLRGIIGTPPDYK 426
Query: 740 KKKKKKKKKYSNEEEGM------RKLRDAASIQCGPVKEVPCEPQIAP-CLFDIKNDPCE 792
+K + K S E G+ RD A + CG V PC P CL+DI DPCE
Sbjct: 427 QKLRDSKAWESLETIGIPLDADVMADRDEAIVTCGNVVPKPCSPSAESWCLYDIIEDPCE 486
Query: 793 KNNLAD 798
+L++
Sbjct: 487 LRDLSE 492
>gi|149059062|gb|EDM10069.1| arylsulfatase B [Rattus norvegicus]
Length = 517
Score = 306 bits (785), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 148/334 (44%), Positives = 217/334 (64%), Gaps = 25/334 (7%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
++ PPH++F+LADDLGWND+GFHG I TP++DALA G++L NYY LCTPSRS ++
Sbjct: 36 AAPPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLL 94
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TG++ IH G+QH ++ C+ +PL EK+LPQ LK+ GY T +VGKWHLG Y+KE PT
Sbjct: 95 TGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTR 154
Query: 176 RGFESHLGYWTGHQDYFDHSAEE------MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
RGF+++ GY G +DY+ H A LD+R EPA + YST++FT A
Sbjct: 155 RGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRA 214
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
+I NH ++ + ++PLQ P+ Y+ + I+D R +A ++ LD
Sbjct: 215 TTLIANHPPEKSV--------------HDPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLD 260
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
E+VG V +AL+ R + +N++++F +DNGG + +NWPLRG K TLWEGG+RGAG
Sbjct: 261 EAVGNVTKALKSRGLWNNTVLIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGAG 316
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+ SPLL+ +G+ + + +H++DWLPTL++ A S
Sbjct: 317 FVASPLLKQKGVKSRELMHITDWLPTLVNLAGGS 350
>gi|22450115|emb|CAC86338.1| glucosinolate sulfatase [Plutella xylostella]
Length = 532
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 151/335 (45%), Positives = 214/335 (63%), Gaps = 10/335 (2%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
V+++ PH+I I+ADD+GW+D HG + TPN+D L SG+ L YYT LC+P+R+A
Sbjct: 17 VSAATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTA 76
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TGK+ GMQ L E G+PL E+++ QYL++ GYRT++VGKWH+G E P
Sbjct: 77 VLTGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLP 136
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEM----KMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
T+RGFE+H G G DY++++A+E + GL + DL+P W G Y TDV+T ++
Sbjct: 137 TYRGFENHFGVRGGFIDYYEYNAQEQLDGRPVTGLCLFDDLQPDWTTEG-YITDVYTEKS 195
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
II NH+ EPL+L L H A H+ N LQAP + RH+E R FAA++ KLD
Sbjct: 196 TTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAAMVKKLD 255
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
+S+G++V LE++ ML N+II F +DNG G N+ SN+PLRGVK + WEGG+RG
Sbjct: 256 DSIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANSGSNYPLRGVKKSPWEGGIRGNA 315
Query: 350 LIW-----SPLLESRGIVAEQYVHVSDWLPTLLSA 379
+IW +P RG V + +H +DW+PTLL A
Sbjct: 316 MIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEA 350
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 57/183 (31%), Positives = 79/183 (43%), Gaps = 30/183 (16%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV----KVMRYQVDLTG 630
+DGI +WS + N+PS R I IDD + S++T G+ KLVK Y DL G
Sbjct: 359 LDGIPMWSHIIENKPSPRTEIF-EIDDYFNHSSVTLGRHKLVKGTIDESLSKHYGEDLRG 417
Query: 631 --GPDQVYLSGLSD-REWLALAMRKL---------RDAASIQCGPVKEVPCEPQIAP-CL 677
G Y L D + W +L + RD A + CG V PC P CL
Sbjct: 418 IIGTPPDYKQKLRDSKAWESLETIGIPLDADVMADRDEAIVTCGNVVPKPCSPSAESWCL 477
Query: 678 FDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKK 737
+DI DPCE +L++ E+ + QI E+EE K ++ + K
Sbjct: 478 YDIIEDPCELRDLSE------------ELPQLAQILLYRLEQEEAKIIPREGQYVSDPKS 525
Query: 738 KKK 740
K
Sbjct: 526 APK 528
Score = 43.1 bits (100), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 51/126 (40%), Gaps = 9/126 (7%)
Query: 680 IKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKK 739
I+N P + + + D NH + +GR + E + + + K
Sbjct: 369 IENKPSPRTEIFEI--DDYFNHSSVTLGRHKLVKGTIDESLSKHYGEDLRGIIGTPPDYK 426
Query: 740 KKKKKKKKKYSNEEEGM------RKLRDAASIQCGPVKEVPCEPQIAP-CLFDIKNDPCE 792
+K + K S E G+ RD A + CG V PC P CL+DI DPCE
Sbjct: 427 QKLRDSKAWESLETIGIPLDADVMADRDEAIVTCGNVVPKPCSPSAESWCLYDIIEDPCE 486
Query: 793 KNNLAD 798
+L++
Sbjct: 487 LRDLSE 492
>gi|22450117|emb|CAC86342.1| glucosinolate sulfatase [Plutella xylostella]
gi|22450119|emb|CAC86343.1| glucosinolate sulfatase [Plutella xylostella]
Length = 532
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 151/335 (45%), Positives = 214/335 (63%), Gaps = 10/335 (2%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
V+++ PH+I I+ADD+GW+D HG + TPN+D L SG+ L YYT LC+P+R+A
Sbjct: 17 VSAATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTA 76
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TGK+ GMQ L E G+PL E+++ QYL++ GYRT++VGKWH+G E P
Sbjct: 77 VLTGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLP 136
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEM----KMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
T+RGFE+H G G DY++++A+E + GL + DL+P W G Y TDV+T ++
Sbjct: 137 TYRGFENHFGVRGGFIDYYEYNAQEQLDGRPVTGLCLFDDLQPDWTTEG-YITDVYTEKS 195
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
II NH+ EPL+L L H A H+ N LQAP + RH+E R FAA++ KLD
Sbjct: 196 TTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAAMVKKLD 255
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
+S+G++V LE++ ML N+II F +DNG G N+ SN+PLRGVK + WEGG+RG
Sbjct: 256 DSIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANSGSNYPLRGVKKSPWEGGIRGNA 315
Query: 350 LIW-----SPLLESRGIVAEQYVHVSDWLPTLLSA 379
+IW +P RG V + +H +DW+PTLL A
Sbjct: 316 MIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEA 350
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 57/183 (31%), Positives = 79/183 (43%), Gaps = 30/183 (16%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV----KVMRYQVDLTG 630
+DGI +WS + N+PS R I IDD + S++T G+ KLVK Y DL G
Sbjct: 359 LDGIPMWSHIIENKPSPRTEIF-EIDDYFNHSSVTLGRHKLVKGTIDESLSKHYGEDLRG 417
Query: 631 --GPDQVYLSGLSD-REWLALAMRKL---------RDAASIQCGPVKEVPCEPQIAP-CL 677
G Y L D + W +L + RD A + CG V PC P CL
Sbjct: 418 IIGTPPDYKQKLRDSKAWESLETIGIPLDADVMADRDEAIVTCGNVVPKPCSPSAESWCL 477
Query: 678 FDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKK 737
+DI DPCE +L++ E+ + QI E+EE K ++ + K
Sbjct: 478 YDIIEDPCELRDLSE------------ELPQLAQILLYRLEQEEAKIIPREGQYVADPKS 525
Query: 738 KKK 740
K
Sbjct: 526 APK 528
Score = 43.1 bits (100), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 51/126 (40%), Gaps = 9/126 (7%)
Query: 680 IKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKK 739
I+N P + + + D NH + +GR + E + + + K
Sbjct: 369 IENKPSPRTEIFEI--DDYFNHSSVTLGRHKLVKGTIDESLSKHYGEDLRGIIGTPPDYK 426
Query: 740 KKKKKKKKKYSNEEEGM------RKLRDAASIQCGPVKEVPCEPQIAP-CLFDIKNDPCE 792
+K + K S E G+ RD A + CG V PC P CL+DI DPCE
Sbjct: 427 QKLRDSKAWESLETIGIPLDADVMADRDEAIVTCGNVVPKPCSPSAESWCLYDIIEDPCE 486
Query: 793 KNNLAD 798
+L++
Sbjct: 487 LRDLSE 492
>gi|391345592|ref|XP_003747069.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 557
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 150/328 (45%), Positives = 212/328 (64%), Gaps = 5/328 (1%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
P +++ ILADD+GWND FHG +IPTPN+DALA SG+IL+++Y +CTP+R+A++TG
Sbjct: 79 PTNVVMILADDMGWNDASFHGSAEIPTPNLDALASSGVILQSHYAQPMCTPTRAALLTGL 138
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+P HTGMQ+ V+ E GLPL KILP YL E Y + +VGKWHLG + + PT R F
Sbjct: 139 YPFHTGMQNFVIRTGEPWGLPLDYKILPHYLDEAYYHSHLVGKWHLGMHNPAFLPTARHF 198
Query: 179 ESHLGYWTGHQDYFDH---SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
+H+GY+ G DYF H S + GLD + + Y+T +FT AV++I N
Sbjct: 199 NTHVGYYNGFIDYFTHEHISPGNDSLIGLDWH--INEENENEEGYATHLFTKRAVNLIEN 256
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
H + EPLF+ L+H A H+ +P QAP + HI+D R +AA++ +LD+SVG+V
Sbjct: 257 HKSTEPLFILLSHLAPHAGCKRDPFQAPRESIEKFAHIKDQNRKVYAAMVDELDQSVGQV 316
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
VEAL ++R+L +++IVF+SDNGG G N SN+P RG K TL+EGG R + +WS
Sbjct: 317 VEALYRKRLLDSTMIVFLSDNGGQTTGVMNNTGSNYPFRGQKRTLFEGGTRVSAFVWSTD 376
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAANKS 383
+ + + +HV+DWLPT+L A S
Sbjct: 377 IVKKPRIESGLMHVTDWLPTILKRAGLS 404
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 71/148 (47%), Gaps = 30/148 (20%)
Query: 570 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLT 629
SY E+DG D WS+LS PS R +L+N++ Q +AL ++KL+ R V L
Sbjct: 404 SYPRELDGRDQWSMLSEGAPSARAEVLYNLNYVDQSAALRNDRFKLIVGP---RNGVSLL 460
Query: 630 GGPDQVY--------LSGLSDREWLALAMRKL-------------------RDAASIQCG 662
DQ++ L G+ D A +++ RD A I+CG
Sbjct: 461 NDWDQLHKGKHSRAQLDGMMDESLTARTLKEAGLWWKMRHGDTLSADHSTWRDKAEIKCG 520
Query: 663 PVKEVPCEPQIAPCLFDIKNDPCEKNNL 690
+ C+ +PCLFD++NDPCE NN+
Sbjct: 521 SHDKTLCDVHRSPCLFDLENDPCELNNV 548
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 26/37 (70%)
Query: 760 RDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 796
RD A I+CG + C+ +PCLFD++NDPCE NN+
Sbjct: 512 RDKAEIKCGSHDKTLCDVHRSPCLFDLENDPCELNNV 548
>gi|443709644|gb|ELU04236.1| hypothetical protein CAPTEDRAFT_53259, partial [Capitella teleta]
Length = 476
Score = 306 bits (783), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 158/337 (46%), Positives = 217/337 (64%), Gaps = 21/337 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+ I+ADDLG +DVG+HG I TPNID LAY+G+ L+NYY +CTPSRS +MTG++
Sbjct: 1 PNILLIVADDLGHHDVGYHG-SVIKTPNIDHLAYTGVRLENYYVQPICTPSRSQLMTGRY 59
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QHN++ + +PL +LP+ LK+ GY T +VGKWHLGFYK E P RGF+
Sbjct: 60 QIHTGLQHNIINPFQPNAVPLDLPMLPEVLKQNGYSTHMVGKWHLGFYKDEVLPMNRGFD 119
Query: 180 SHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
S+ GY TG +DYF H + G+D+R D E W+ GKYST +F +A D++
Sbjct: 120 SYYGYLTGSEDYFTHRRCGALPGANKTVCGIDLRNDFEVDWNQTGKYSTQLFAEKAEDVV 179
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
H+ D+PLFLY+A A H+ N Q P+ YL + I+D KR A ++ +DE
Sbjct: 180 RKHAVHQPDQPLFLYVAFQAVHAPN-----QVPNEYLKPYD-IDDPKRRLLAGMVTCMDE 233
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
+VG + A + + N+++VF +DNGG N N+ +NWPLRG K++LWEGG+RG G
Sbjct: 234 AVGNITAAFKDTGLWDNTVVVFTTDNGGDV---NENSGNNWPLRGWKHSLWEGGMRGVGF 290
Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTL--LSAANKSDI 385
+ SPLL+S G +HVSDW PTL L+ AN S I
Sbjct: 291 VNSPLLKSFGYPVNGLMHVSDWFPTLLNLAGANTSSI 327
>gi|148668607|gb|EDL00926.1| arylsulfatase B [Mus musculus]
Length = 556
Score = 305 bits (781), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 147/335 (43%), Positives = 216/335 (64%), Gaps = 25/335 (7%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
++ PPH++F+LADDLGWND+GFHG I TP++DALA G++L NYY LCTPSRS +
Sbjct: 72 GATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQL 130
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TG++ IH G+QH ++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT
Sbjct: 131 LTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPT 190
Query: 175 FRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
RGF+++ GY G +DY+ H A E + LD+R EPA + + YST++FT
Sbjct: 191 RRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKR 250
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
A +I NH ++ + ++PLQ P+ Y+ + I+D R +A ++ +
Sbjct: 251 ATTVIANHPPEKSV--------------HDPLQVPEEYMEPYGFIQDKHRRIYAGMVSLM 296
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
DE+VG V +AL+ + +N++ +F +DNGG + +NWPLRG K TLWEGG+RG
Sbjct: 297 DEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGT 352
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
G + SPLL+ +G+ + + +H++DWLPTL+ A S
Sbjct: 353 GFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGS 387
Score = 40.0 bits (92), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH ++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 141 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVG 176
>gi|391325967|ref|XP_003737498.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 513
Score = 305 bits (780), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 152/359 (42%), Positives = 219/359 (61%), Gaps = 12/359 (3%)
Query: 39 VLPLAFTL-SMVFVDLVASSG---PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
+L F L S D+ SG P+++ I+ DDLGW+D+G HG QIPTPNID LA
Sbjct: 4 ILFFLFCLRSFGTCDISGPSGETKAPNVVLIVVDDLGWDDIGLHGSSQIPTPNIDKLAEE 63
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
G++L NYYT +CTPSR+++MTGK+P+ G+QH+V+ GLP + KI+PQYL + Y
Sbjct: 64 GVVLDNYYTQPICTPSRASLMTGKYPVRLGLQHDVISAATPFGLPSNFKIMPQYLHDKNY 123
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK------MWGLDMRR 208
IVGKWHLG + E+ PT RGF+ H GY G D++ H + +GLD+
Sbjct: 124 DCHIVGKWHLGHSRSEFLPTRRGFKDHFGYRLGSSDHYSHYGADDSDVPGSLFYGLDLWH 183
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
+ PA + +GKYSTD++T + DI+ H+ PLFLYLA+ A H+ NP + LQAP ++
Sbjct: 184 NEVPAKEFNGKYSTDIYTHRSTDILRMHNKSRPLFLYLAYQAVHAGNPDQALQAPQSIVD 243
Query: 269 -IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
I + +R ++AA++ +D ++G V+ A+ N+++ F +DNGG + +
Sbjct: 244 RFSSSIRNDRRRRYAAMVSAVDTAIGNVMGAIRANGFAGNTLVFFTNDNGGPINANDRSP 303
Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
SN+PLR K TLWEGGVRG G+ W+P + G H+ D LPT+LSAA S +P
Sbjct: 304 GSNYPLRAGKFTLWEGGVRGTGIFWAPQVLKPGKFG-GLSHIVDVLPTILSAAGMSSVP 361
>gi|395851353|ref|XP_003798225.1| PREDICTED: arylsulfatase J [Otolemur garnettii]
Length = 661
Score = 305 bits (780), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 154/331 (46%), Positives = 211/331 (63%), Gaps = 12/331 (3%)
Query: 52 DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
+ A++ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSR
Sbjct: 130 SITAATSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSR 188
Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
S +TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE
Sbjct: 189 SQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKEC 248
Query: 172 TPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEA 229
PT RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 249 MPTKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRV 308
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LD
Sbjct: 309 QQILASHNPTKPIFLYIAYQAVHS-----PLQAPVRYFEHYRSIVNINRRRYAAMLSCLD 363
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
E++ V AL+ +NSII++ SDNGG A SNWPLRG K T WEGG+R G
Sbjct: 364 EAIHNVTLALKAYGFYNNSIIIYSSDNGGQPAA----GGSNWPLRGSKGTYWEGGIRAIG 419
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+ SPLL+++G V + VH++DW PTL+S A
Sbjct: 420 FVHSPLLKNKGTVCRELVHITDWYPTLISLA 450
>gi|100801472|emb|CAJ18095.1| arylsulfatase J [Homo sapiens]
Length = 596
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 70 TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388
>gi|58477551|gb|AAH89445.1| ARSJ protein [Homo sapiens]
Length = 578
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 70 TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388
>gi|109389362|ref|NP_078866.3| arylsulfatase J precursor [Homo sapiens]
gi|74722580|sp|Q5FYB0.1|ARSJ_HUMAN RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
gi|58201086|gb|AAW66666.1| arylsulfatase J [Homo sapiens]
gi|124376924|gb|AAI32880.1| ARSJ protein [Homo sapiens]
gi|124376926|gb|AAI32882.1| ARSJ protein [Homo sapiens]
gi|219521550|gb|AAI44266.1| ARSJ protein [Homo sapiens]
Length = 599
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 70 TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388
>gi|403275516|ref|XP_003929486.1| PREDICTED: arylsulfatase J [Saimiri boliviensis boliviensis]
Length = 601
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 155/329 (47%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 72 TTSASQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 130
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 131 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 190
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 191 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDSDNGIYSTQMYTQRVQQ 250
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +PLFLYLA+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 251 ILASHNPTKPLFLYLAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 305
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 306 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 361
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 362 HSPLLKNKGTVCKELVHITDWYPTLISLA 390
>gi|402870284|ref|XP_003899162.1| PREDICTED: arylsulfatase J [Papio anubis]
Length = 597
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 70 TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 189 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388
>gi|426345299|ref|XP_004040357.1| PREDICTED: arylsulfatase J [Gorilla gorilla gorilla]
Length = 599
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 70 TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388
>gi|410038636|ref|XP_526667.3| PREDICTED: arylsulfatase J isoform 2 [Pan troglodytes]
gi|410210212|gb|JAA02325.1| arylsulfatase family, member J [Pan troglodytes]
gi|410253696|gb|JAA14815.1| arylsulfatase family, member J [Pan troglodytes]
gi|410298378|gb|JAA27789.1| arylsulfatase family, member J [Pan troglodytes]
gi|410351985|gb|JAA42596.1| arylsulfatase family, member J [Pan troglodytes]
Length = 598
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 70 TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388
>gi|397519903|ref|XP_003830091.1| PREDICTED: arylsulfatase J [Pan paniscus]
Length = 596
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 70 TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388
>gi|149698442|ref|XP_001503367.1| PREDICTED: arylsulfatase J [Equus caballus]
Length = 598
Score = 304 bits (778), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 154/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
AS+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 70 TASTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 189 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 249 ILASHDPRKPIFLYIAYQAVHS-----PLQAPSRYFEHYRSIVNINRRRYAAMLSCLDEA 303
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388
>gi|291231206|ref|XP_002735556.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 516
Score = 304 bits (778), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 151/347 (43%), Positives = 221/347 (63%), Gaps = 17/347 (4%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L FT +V + S P HI+FILADD G++D+G+H I TPN+D LA G+ L+NY
Sbjct: 22 LFFTTLIVLLLRPVSPEPTHIVFILADDYGFHDIGYHD-SIIKTPNLDRLASEGVKLENY 80
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y +CTP+RS +M+G++ IHTG+QH +++ C+ LP++E +PQ LKE GY T IVGK
Sbjct: 81 YVQPICTPTRSQLMSGRYQIHTGLQHGIIWPCQPSCLPINEVTIPQKLKESGYATHIVGK 140
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL---EPAWDLHG 218
WHLG YKKE PT RGF++ GY TG +DY+ H+ K G+D R ++ +P ++ G
Sbjct: 141 WHLGMYKKECLPTERGFDTFFGYLTGSEDYYTHNRSYDKFHGMDFRENMQIVQPQYN--G 198
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
+YST VF +A +II +H PLFLYL A H PLQ PD Y + +I + +R
Sbjct: 199 QYSTHVFAEKAKNIIKSHDPQIPLFLYLPFQAVHG-----PLQVPDQYEKPYANITNKQR 253
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
+A ++ +DE+VG + + L+ + NS+I+F +DNGG + +NWPLRG K
Sbjct: 254 RTYAGMVSCMDEAVGNITQTLKDTGLYDNSVIIFSTDNGGQV----IAGGNNWPLRGWKG 309
Query: 339 TLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAANKS 383
+LWEGG+ G G + SPL+ ++RG +++ +HV+DW PT++ A S
Sbjct: 310 SLWEGGMHGVGFVNSPLINHQARGTTSKELIHVTDWFPTIVHLAGGS 356
>gi|109075466|ref|XP_001096903.1| PREDICTED: arylsulfatase J [Macaca mulatta]
Length = 596
Score = 304 bits (778), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 70 TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 189 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388
>gi|355687554|gb|EHH26138.1| hypothetical protein EGK_16035 [Macaca mulatta]
Length = 599
Score = 304 bits (778), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 70 TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 189 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388
>gi|291232535|ref|XP_002736216.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 784
Score = 304 bits (778), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 161/345 (46%), Positives = 218/345 (63%), Gaps = 19/345 (5%)
Query: 42 LAFTLSMVFVDL-VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
L + L + D+ V PHIIFILADDLGW+DVG+HG I TPNIDALA G+ L N
Sbjct: 23 LVYVLILEVTDIHVNGRQKPHIIFILADDLGWSDVGYHG-SVIKTPNIDALASEGVKLDN 81
Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
YY LCTPSR +++G++ IHTG+QH + + LP+ E ILPQ LKE GY T +VG
Sbjct: 82 YYMSLLCTPSRGQLLSGRYEIHTGLQHRTIDMMQPLCLPIDETILPQKLKERGYATHMVG 141
Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP-AWDLHGK 219
KWHLGFY+KE P +RGF++ +G++ G DY+ H+ G D RR+ + A G+
Sbjct: 142 KWHLGFYRKECLPNYRGFDTFMGFYQGMADYYYHNISTGIYHGWDFRRNNDVIAQKYAGQ 201
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI--EDFK 277
YST VF EA II H+ + PLFL+L+ A H PLQ P Y ++++ + +
Sbjct: 202 YSTHVFADEAQIIIMKHNPEVPLFLFLSFQAIHL-----PLQVPSRYADMYKTLIPNNAD 256
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
R K+A ++ +DE+VG+VV AL+ + NS++VF +DNGG G NWPLRG K
Sbjct: 257 RQKYAGMVTCMDEAVGRVVTALKHTGLWENSVLVFSTDNGGILGG-------NWPLRGSK 309
Query: 338 NTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAA 380
+L+EGGVRG G + SPLL S RG V + +H++DWLPTL+ A
Sbjct: 310 ASLYEGGVRGVGFVTSPLLPSRVRGTVNRELLHMTDWLPTLVRLA 354
>gi|441658369|ref|XP_003269374.2| PREDICTED: arylsulfatase J [Nomascus leucogenys]
Length = 597
Score = 304 bits (778), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 69 TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 127
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 128 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 187
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 188 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 247
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 248 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 302
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 303 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 358
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 359 HSPLLKNKGTVCKELVHITDWYPTLISLA 387
>gi|355749520|gb|EHH53919.1| hypothetical protein EGM_14634 [Macaca fascicularis]
Length = 599
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 70 TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 189 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388
>gi|198415046|ref|XP_002127641.1| PREDICTED: similar to Arylsulfatase B precursor (ASB)
(N-acetylgalactosamine-4-sulfatase) (G4S) [Ciona
intestinalis]
Length = 522
Score = 303 bits (776), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 155/343 (45%), Positives = 221/343 (64%), Gaps = 20/343 (5%)
Query: 48 MVFVDLVASSG--PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ 105
+VFV ++ SG P+I+ I+ADDLG+NDVG+H D I TPNI+ LA G+IL++YY
Sbjct: 13 LVFVQVLYVSGHQKPNILLIVADDLGFNDVGYHNPD-IYTPNINKLAKDGVILESYYVQP 71
Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
+CTPSRS +MTG++ IHTG+QH+V++ + LP+ E ILPQ LKE GY T VGKWHLG
Sbjct: 72 ICTPSRSQLMTGRYQIHTGLQHSVIFAPQPNCLPVDEIILPQKLKEAGYTTHAVGKWHLG 131
Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDY--------FDHSAEEMKMWGLDMRRDLEPAWDLH 217
FYKKE PT RGF++ GY+ G +DY F + ++ G D + W+ +
Sbjct: 132 FYKKECLPTSRGFDTFYGYYCGAEDYYTKQVHANFHFGNKTRRVSGFDFHDNSRTEWEAN 191
Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
G YS+ ++ AV II +H++ PLF+YL + H PLQ P Y+ +RHI+D K
Sbjct: 192 GTYSSYLYRDRAVRIIKSHNSSIPLFMYLPFQSVHF-----PLQVPAKYIKRYRHIKDRK 246
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
R F+A++ +DE++G VV+AL+ + + ++++VF +DNGG L +NWPLRG K
Sbjct: 247 RRTFSAMVTAMDEAIGSVVDALKWKGIWQDTLVVFTTDNGGQT----LFGGNNWPLRGRK 302
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+LWEGGVRG GL+ + +G + + VH+SDW PTLL A
Sbjct: 303 ASLWEGGVRGVGLVRGYGIRDKGRSSNELVHISDWFPTLLYIA 345
>gi|291401248|ref|XP_002717219.1| PREDICTED: arylsulfatase J [Oryctolagus cuniculus]
Length = 601
Score = 303 bits (776), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 210/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
A++ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 72 AAATSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 130
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 131 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 190
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 191 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDSAAWDHDNGIYSTQMYTQRVQQ 250
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 251 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 305
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
V V AL+ +NSI+++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 306 VSNVTLALKTYGFYNNSILIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 361
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 362 HSPLLKNKGTVCKELVHITDWYPTLISLA 390
>gi|296195717|ref|XP_002745502.1| PREDICTED: arylsulfatase J [Callithrix jacchus]
Length = 605
Score = 303 bits (775), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 76 TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 134
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 135 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 194
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 195 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQK 254
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 255 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 309
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 310 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 365
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 366 HSPLLKNKGTVCKELVHITDWYPTLISLA 394
>gi|195128415|ref|XP_002008659.1| GI13615 [Drosophila mojavensis]
gi|193920268|gb|EDW19135.1| GI13615 [Drosophila mojavensis]
Length = 576
Score = 303 bits (775), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 148/325 (45%), Positives = 208/325 (64%), Gaps = 4/325 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II I+ADD+G++D+ G + TPNIDALA+ G +L Y +CTPSR A+++GK+
Sbjct: 35 PNIIIIMADDMGFDDLSIRGGREFLTPNIDALAFHGRLLDRLYAPAMCTPSRGALLSGKY 94
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTG QH V+ E L L+ ++P+ + GY T +VGKWHLG+ + E+TPT RGF+
Sbjct: 95 PIHTGTQHFVISNQEPWSLKLNTTLMPEIFRAAGYSTNLVGKWHLGYARPEFTPTHRGFD 154
Query: 180 SHLGYWTGHQDYFDHSAE---EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
H GYW + DY+ ++ + + G D RR++E G Y TD+ T EA +I
Sbjct: 155 YHYGYWGAYIDYYQRRSQMPDKTYIMGYDFRRNMEVECADRGVYMTDLLTNEAERVIQET 214
Query: 237 ST-DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
+ +PLFL + H A H+ N +PLQ P+ L HI+D K+AA++ KLD+SVG+V
Sbjct: 215 AAKQQPLFLMINHLAVHTGNDNDPLQVPEEELQKFTHIKDPNHRKYAAMVSKLDKSVGRV 274
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
+ AL + L NSI++F SDNG + G N SNWPL+G KN+ WEGG+R AG IWSPL
Sbjct: 275 IGALARAEQLENSIVIFYSDNGAPSVGMFANTGSNWPLKGQKNSPWEGGLRVAGAIWSPL 334
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
L++R V Q ++V D+LPTL AA
Sbjct: 335 LKARSSVFTQPIYVGDFLPTLAHAA 359
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 62/140 (44%), Gaps = 23/140 (16%)
Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM--------- 622
++DGID+W LS + P ILH++DD W +S+L G WK V +
Sbjct: 368 QLDGIDLWPQLSGPKEAPHVPREILHSLDDVWNVSSLLLGHWKYVNGTTLAGQYDQQLVQ 427
Query: 623 -----------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
RY V++ L+ + M+ + A + CG ++ C
Sbjct: 428 RELDDLDPRANRYVVEVRNSLASRALARFDQQRLTQHRMKNIMIDAKVDCGDMQR-GCNA 486
Query: 672 QIAPCLFDIKNDPCEKNNLA 691
+ CL+D+ DPCE+NNLA
Sbjct: 487 LLEECLYDMSVDPCERNNLA 506
>gi|338713661|ref|XP_003362935.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Equus
caballus]
Length = 523
Score = 302 bits (774), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 150/328 (45%), Positives = 208/328 (63%), Gaps = 16/328 (4%)
Query: 62 IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
++++L DD+G +DVG H + TP +DALA G++L NYYT LC PSRS +++G++ I
Sbjct: 37 LVYMLTDDMGXHDVGLHE-SRFSTPRLDALAAGGLLLDNYYTQPLCXPSRSQLLSGRYQI 95
Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
HTG+QH +++ C+ LPL EK+LPQ LKE GY T +VGKWHLG Y+KE PT RGF+++
Sbjct: 96 HTGLQHQIIWPCQPSCLPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTY 155
Query: 182 LGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
GY G +DY+ H A + LD R E A YST VFT A +I N
Sbjct: 156 FGYLLGSEDYYSHERCTFIDALNVTRCALDFRDGEEVATGYKNMYSTSVFTERATALITN 215
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
H ++PLFLYLA + H +PLQ P+ YL + I+D R +A ++ LDE+VG V
Sbjct: 216 HPPEKPLFLYLALQSVH-----QPLQVPEEYLKPYDFIQDKNRYHYAGMVSLLDEAVGNV 270
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
AL+ R + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG G + SPL
Sbjct: 271 TAALKSRGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASPL 326
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAANKS 383
L+ +G+ + +H+SDWLPTL+ A S
Sbjct: 327 LKQKGVKNRELIHISDWLPTLVKLAGGS 354
Score = 43.9 bits (102), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ LPL EK+LPQ LKE GY T ++
Sbjct: 99 LQHQIIWPCQPSCLPLDEKLLPQLLKEAGYTTHMVG 134
>gi|355669614|gb|AER94587.1| arylsulfatase family, member J [Mustela putorius furo]
Length = 600
Score = 302 bits (773), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 154/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
A++ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 70 TAATSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDRLAAEGVKLENYYVQPICTPSRSQ 128
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 189 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 249 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSIIV+ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 304 INNVTLALKTYGFYNNSIIVYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388
>gi|344239533|gb|EGV95636.1| Arylsulfatase J [Cricetulus griseus]
Length = 571
Score = 302 bits (773), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 152/323 (47%), Positives = 208/323 (64%), Gaps = 12/323 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS +TGK+
Sbjct: 48 PHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKY 106
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE PT RGF+
Sbjct: 107 QIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFD 166
Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHS 237
+ G G DY+ H + M G D+ + AWD +G YST ++T I+ +H
Sbjct: 167 TFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHD 226
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++ V
Sbjct: 227 PTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEAIHNVTL 281
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ +NSII++ SDNGG + SNWPLRG K T WEGG+R G + SPLL+
Sbjct: 282 ALKRYGFYNNSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLK 337
Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
++G V ++ VH++DW PTL+S A
Sbjct: 338 NKGTVCKELVHITDWYPTLISLA 360
>gi|354502405|ref|XP_003513277.1| PREDICTED: arylsulfatase J [Cricetulus griseus]
Length = 597
Score = 302 bits (773), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 152/323 (47%), Positives = 208/323 (64%), Gaps = 12/323 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS +TGK+
Sbjct: 74 PHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKY 132
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE PT RGF+
Sbjct: 133 QIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFD 192
Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHS 237
+ G G DY+ H + M G D+ + AWD +G YST ++T I+ +H
Sbjct: 193 TFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHD 252
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++ V
Sbjct: 253 PTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEAIHNVTL 307
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ +NSII++ SDNGG + SNWPLRG K T WEGG+R G + SPLL+
Sbjct: 308 ALKRYGFYNNSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLK 363
Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
++G V ++ VH++DW PTL+S A
Sbjct: 364 NKGTVCKELVHITDWYPTLISLA 386
>gi|426231237|ref|XP_004009646.1| PREDICTED: arylsulfatase J [Ovis aries]
Length = 599
Score = 301 bits (772), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
V + PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 71 VTAVSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 129
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 130 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 189
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 190 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 249
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 250 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEA 304
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 305 INNVTLALKMYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 360
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 361 HSPLLKNKGTVCKELVHITDWYPTLISLA 389
>gi|344277505|ref|XP_003410541.1| PREDICTED: arylsulfatase J [Loxodonta africana]
Length = 599
Score = 301 bits (771), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 152/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
++ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 70 TTATSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 189 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDRAAWDYDNGIYSTQMYTQRVQQ 248
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 249 ILASHNPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLACLDEA 303
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V ALE +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 304 INNVTLALETYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL++ A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLITLA 388
>gi|119893510|ref|XP_611819.3| PREDICTED: arylsulfatase J [Bos taurus]
gi|297475606|ref|XP_002688145.1| PREDICTED: arylsulfatase J [Bos taurus]
gi|296486797|tpg|DAA28910.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase-like [Bos taurus]
Length = 599
Score = 301 bits (771), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
V + PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 71 VTALSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 129
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 130 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 189
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 190 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGVYSTQMYTQRVQQ 249
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 250 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEA 304
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 305 INNVTLALKMYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAIGFV 360
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 361 HSPLLKNKGTVCKELVHITDWYPTLISLA 389
>gi|291231208|ref|XP_002735557.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 490
Score = 301 bits (771), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 149/333 (44%), Positives = 214/333 (64%), Gaps = 17/333 (5%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
SS P HI+FILADD G++D+G+H I TPN+D LA G+ L+NYY CTP+RS +M
Sbjct: 10 SSSPTHIVFILADDYGFHDIGYHD-SIIKTPNLDRLASEGVKLENYYVQPKCTPTRSQLM 68
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
+G++ IHTG+QH +++ C+ LP++E +PQ LKE GY T IVGKWHLG YKKE PT
Sbjct: 69 SGRYQIHTGLQHGIIWPCQPSCLPINEVTIPQKLKESGYATHIVGKWHLGMYKKECLPTE 128
Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL---EPAWDLHGKYSTDVFTAEAVDI 232
RGF++ GY TG +DY+ H+ K G+D R ++ +P ++ G+YST VF +A +I
Sbjct: 129 RGFDTFFGYLTGSEDYYTHNRSYDKFHGMDFRENMQIVQPQYN--GQYSTHVFAEKAKNI 186
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I +H PLFLYL A H PLQ PD Y + +I + +R +A ++ +DE+V
Sbjct: 187 IKSHDPQIPLFLYLPLHAVHG-----PLQVPDQYEKPYTNITNKQRRTYAGMVSCMDEAV 241
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G + + LE + NS+I+F +DNGG +NWPLRG K +LWEGG+ G G +
Sbjct: 242 GNITQTLEDAGLYDNSVIIFSTDNGGQVTA----GGNNWPLRGWKGSLWEGGMHGVGFVN 297
Query: 353 SPLL--ESRGIVAEQYVHVSDWLPTLLSAANKS 383
SPL+ ++RG +++ +HV+DW PT++ A S
Sbjct: 298 SPLINHQARGTTSKELIHVTDWFPTIVHLAGGS 330
>gi|281339106|gb|EFB14690.1| hypothetical protein PANDA_011975 [Ailuropoda melanoleuca]
Length = 595
Score = 301 bits (771), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 153/323 (47%), Positives = 206/323 (63%), Gaps = 12/323 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS +TGK+
Sbjct: 76 PHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKY 134
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE PT RGF+
Sbjct: 135 QIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFD 194
Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHS 237
+ G G DY+ H + M G D+ + AWD +G YST ++T I+ +H
Sbjct: 195 TFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHD 254
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+PLFLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++ V
Sbjct: 255 PRKPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTL 309
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G + SPLL+
Sbjct: 310 ALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLK 365
Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
++G V ++ VH++DW PTL+S A
Sbjct: 366 NKGTVCKELVHITDWYPTLISLA 388
>gi|301775027|ref|XP_002922933.1| PREDICTED: arylsulfatase J-like [Ailuropoda melanoleuca]
Length = 600
Score = 300 bits (769), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 153/323 (47%), Positives = 206/323 (63%), Gaps = 12/323 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS +TGK+
Sbjct: 76 PHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKY 134
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE PT RGF+
Sbjct: 135 QIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFD 194
Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHS 237
+ G G DY+ H + M G D+ + AWD +G YST ++T I+ +H
Sbjct: 195 TFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHD 254
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+PLFLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++ V
Sbjct: 255 PRKPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTL 309
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G + SPLL+
Sbjct: 310 ALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLK 365
Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
++G V ++ VH++DW PTL+S A
Sbjct: 366 NKGTVCKELVHITDWYPTLISLA 388
>gi|198466297|ref|XP_002135151.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
gi|198150535|gb|EDY73778.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
Length = 548
Score = 300 bits (769), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 150/328 (45%), Positives = 215/328 (65%), Gaps = 6/328 (1%)
Query: 66 LADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGM 125
+ADD+G++DV F G + TPNIDALAYSG+IL N YT +CTPSR+A++TGK+PI+TGM
Sbjct: 1 MADDMGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGM 60
Query: 126 QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYW 185
QH V+ + GLP EK + + +E GY T ++GKWHLG ++ +TPT RGF+ HLGY
Sbjct: 61 QHYVIVNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYL 120
Query: 186 TGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS---TD 239
+ DY+D + ++ G D R +L D G Y TDV + AV++I HS +
Sbjct: 121 GAYVDYYDQTYQQNGKNYARGHDFRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSS 180
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
+PLFL L+H A H+AN +P+QAP L +I + +AA++ +LD SVG+V++AL
Sbjct: 181 QPLFLLLSHLAPHAANADDPMQAPAEELAKFEYIRNETHRHYAAMVSRLDRSVGRVIDAL 240
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
+++ML +SI++F+SDNGG G + ASN+PLRG KN+ WEGG+R + IWS E
Sbjct: 241 ARQQMLQDSIVLFLSDNGGPTVGEHSTTASNYPLRGQKNSPWEGGLRSSAAIWSTEFERL 300
Query: 360 GIVAEQYVHVSDWLPTLLSAANKSDIPN 387
G V +Q ++ D LPTL +AA + P+
Sbjct: 301 GTVWKQQTYIGDLLPTLAAAAGIAPDPD 328
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 67/143 (46%), Gaps = 29/143 (20%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS--ALTKGKWKLVKVV------------ 619
++DG+++WS L S I+HNID + A T+GKWK+V
Sbjct: 330 KLDGMNLWSALKYGYESLEREIVHNIDMDPSAPYLAYTRGKWKVVNGTTNAALFDGWLGE 389
Query: 620 KVMRYQVDLTGGPDQVYLSGLSDREWLALAM--------RKLRDAASIQC----GPVKEV 667
+V R ++D + + S + L L M LR A I+C G K
Sbjct: 390 RVTR-EMDPRSANYEELIRNTSVWQHLQLLMGSQNERNITSLRQQAQIECPEPSGATK-- 446
Query: 668 PCEPQIAPCLFDIKNDPCEKNNL 690
PC P +APCLFDI DPCE+NNL
Sbjct: 447 PCLPLLAPCLFDIDADPCERNNL 469
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 39/80 (48%), Gaps = 14/80 (17%)
Query: 750 SNEEEGMRKLRDAASIQC----GPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRI 805
S E + LR A I+C G K PC P +APCLFDI DPCE+NNL +
Sbjct: 421 SQNERNITSLRQQAQIECPEPSGATK--PCLPLLAPCLFDIDADPCERNNLYE------- 471
Query: 806 NHYTTEVGYLDPKQRFNQIA 825
Y +LD QR Q A
Sbjct: 472 -DYRNTTIFLDLLQRIEQFA 490
>gi|114326206|ref|NP_001041581.1| arylsulfatase J [Canis lupus familiaris]
gi|81158066|tpe|CAI85007.1| TPA: arylsulfatase J [Canis lupus familiaris]
Length = 598
Score = 300 bits (769), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 152/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
++ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 68 TTATSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 126
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 127 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 186
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 187 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 246
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 247 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 301
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 302 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 357
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 358 HSPLLKNKGTVCKELVHITDWYPTLISLA 386
>gi|195166553|ref|XP_002024099.1| GL22854 [Drosophila persimilis]
gi|194107454|gb|EDW29497.1| GL22854 [Drosophila persimilis]
Length = 548
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 150/328 (45%), Positives = 216/328 (65%), Gaps = 6/328 (1%)
Query: 66 LADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGM 125
+ADD+G++DV F G + TPNIDALAYSG+IL N YT +CTPSR+A++TGK+PI+TGM
Sbjct: 1 MADDMGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGM 60
Query: 126 QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYW 185
QH V+ + GLP EK + + +E GY T ++GKWHLG ++ +TPT RGF+ HLGY
Sbjct: 61 QHYVIVNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYL 120
Query: 186 TGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS---TD 239
+ DY+D + ++ G D R +L D G Y TDV + AV++I HS +
Sbjct: 121 GAYVDYYDQTYQQNGKNYARGHDFRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSS 180
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
+PLFL L+H A H+AN +P+QAP L +I + +AA++ +LD SVG+V++AL
Sbjct: 181 QPLFLLLSHLAPHAANADDPMQAPAEELAKFEYIRNETHRYYAAMVSRLDRSVGRVIDAL 240
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
+++ML +SI++F+SDNGG G + + ASN+PLRG KN+ WEGG+R + IWS E
Sbjct: 241 ARQQMLQDSIVLFLSDNGGPTVGEHSSTASNYPLRGQKNSPWEGGLRSSAAIWSTEFERL 300
Query: 360 GIVAEQYVHVSDWLPTLLSAANKSDIPN 387
G V +Q ++ D LPTL +AA + P+
Sbjct: 301 GTVWKQQTYIGDLLPTLAAAAGIAPDPD 328
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 66/141 (46%), Gaps = 25/141 (17%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS--ALTKGKWKLVKVV------------ 619
++DG+++WS L S I+HNID + A T+GKWK+V
Sbjct: 330 KLDGMNLWSALKYGYESLEREIVHNIDMDPSAPYLAYTRGKWKVVNGTTNAALFDGWLGE 389
Query: 620 KVMRYQVDLTGGPDQVYLSGLSDREWLALAM--------RKLRDAASIQCG--PVKEVPC 669
+V R +VD + + S + L L M LR A I+C P PC
Sbjct: 390 RVTR-EVDPRSANYEELIRNTSVWQHLQLLMGSQNERNITSLRQQAQIECPEPPGATKPC 448
Query: 670 EPQIAPCLFDIKNDPCEKNNL 690
P +APCLFDI DPCE+NNL
Sbjct: 449 LPLLAPCLFDIDADPCERNNL 469
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 38/78 (48%), Gaps = 10/78 (12%)
Query: 750 SNEEEGMRKLRDAASIQCG--PVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINH 807
S E + LR A I+C P PC P +APCLFDI DPCE+NNL +
Sbjct: 421 SQNERNITSLRQQAQIECPEPPGATKPCLPLLAPCLFDIDADPCERNNLYE--------D 472
Query: 808 YTTEVGYLDPKQRFNQIA 825
Y +LD QR Q A
Sbjct: 473 YRNTTIFLDLLQRIEQFA 490
>gi|68437903|ref|XP_692213.1| PREDICTED: arylsulfatase I-like [Danio rerio]
Length = 562
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 206/324 (63%), Gaps = 11/324 (3%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHI+F++ DD G+ND+G+HG +I TP +D LA G+ L+NYY +C+PSRS +MTG+
Sbjct: 29 PPHIVFVMVDDQGYNDIGYHG-SEIQTPVLDQLAGEGVKLENYYVQPICSPSRSQLMTGR 87
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LP LP+ L+E GY T +VGKWHLGF E PT RGF
Sbjct: 88 YQIHTGLQHSIIRARQPLCLPPDTPTLPERLQEAGYSTHMVGKWHLGFCHPECLPTSRGF 147
Query: 179 ESHLGYWTGHQDYFD-HSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
+S LG TG D+F S + + G D+ PAW+L G YST ++T DI+ H
Sbjct: 148 QSFLGSLTGSGDHFSFQSCDGTEACGFDLHDGDRPAWELRGNYSTRLYTERVKDILRRHD 207
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+PLFLY+A A H+ PLQAP H L ++ + + R +AA++ +DESVG++V
Sbjct: 208 HRKPLFLYVALQAVHT-----PLQAPGHLLRRYQALGNRPRRHYAAMVSGVDESVGEIVS 262
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
L +R NS++++ SDNGG L+ NWPLRG K + WEGGVR G + SPLL+
Sbjct: 263 ELRERGYYDNSVLIYSSDNGGQP----LSGGCNWPLRGGKGSYWEGGVRAVGFVHSPLLK 318
Query: 358 SRGIVAEQYVHVSDWLPTLLSAAN 381
+G+V+ +H+SDW PTLLS A
Sbjct: 319 RKGVVSHALIHISDWYPTLLSLAG 342
>gi|114145538|ref|NP_001041352.1| arylsulfatase J [Rattus norvegicus]
gi|81158024|tpe|CAI84986.1| TPA: arylsulfatase J [Rattus norvegicus]
gi|149025900|gb|EDL82143.1| arylsulfatase J [Rattus norvegicus]
Length = 597
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 152/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
A + PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 68 TAVTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 126
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+K+ P
Sbjct: 127 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMP 186
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + + G D+ + AWD +G YST ++T
Sbjct: 187 TKRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 246
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H +PLFLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 247 ILASHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 301
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL++ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 302 IHNVTLALKRYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 357
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 358 HSPLLKNKGTVCKELVHITDWYPTLISLA 386
>gi|37182416|gb|AAQ89010.1| APRG372 [Homo sapiens]
Length = 515
Score = 300 bits (768), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 152/329 (46%), Positives = 207/329 (62%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 70 TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGF +KE P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFNRKECMP 188
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 249 ILASHNPTKPIFLYTAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388
>gi|148680337|gb|EDL12284.1| arylsulfatase J [Mus musculus]
Length = 572
Score = 300 bits (768), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 152/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
A + PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 42 TAGTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 100
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+K+ P
Sbjct: 101 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMP 160
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + + G D+ + AWD +G YST ++T
Sbjct: 161 TKRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 220
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ H +PLFLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 221 ILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 275
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL++ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 276 IHNVTLALKRYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 331
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 332 HSPLLKNKGTVCKELVHITDWYPTLISLA 360
>gi|301609482|ref|XP_002934299.1| PREDICTED: arylsulfatase J-like [Xenopus (Silurana) tropicalis]
Length = 564
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 147/323 (45%), Positives = 213/323 (65%), Gaps = 12/323 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+FILADD G+ D+G+HG +I TP +D LA G+ L+NYY +C+PSRS +TGK+
Sbjct: 47 PHIVFILADDQGYRDIGYHG-SEIRTPTLDKLASEGVRLENYYVQPICSPSRSQFITGKY 105
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH+++ + LPL LPQ LK+ GY+T +VGKWHLGFYKKE PT RGF+
Sbjct: 106 QIHTGLQHSIIRPSQPNCLPLDNMTLPQKLKKAGYQTHMVGKWHLGFYKKECMPTQRGFD 165
Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHS 237
S G G DY++H + + G D+ + AWD +G YST+++T + I+ +H+
Sbjct: 166 SFFGSLLGSGDYYNHYKCDSPGICGYDLYENNNAAWDQDNGIYSTEMYTQRVLQILSSHN 225
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
++PLFLY+A+ A HS PLQAP +YL+ + I + R ++AA+L LD++V V
Sbjct: 226 PNKPLFLYIAYQAVHS-----PLQAPGNYLDNYNSILNENRRRYAAMLSCLDDAVNNVTT 280
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ NSII++ SDNGG + +NWPLRG K T WEGG+R G + SP+L+
Sbjct: 281 ALKKYGFYDNSIIIYSSDNGGQP----MAGGNNWPLRGSKGTYWEGGIRAVGFVHSPILK 336
Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
+G + ++ VH++DW PTL++ A
Sbjct: 337 MKGYICKELVHITDWFPTLVTLA 359
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 70/166 (42%), Gaps = 28/166 (16%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-----ALTKGKW--KLVKVVKVMRYQV 626
++DG D+W +S + S R ILHNID + + A G W + ++V +++
Sbjct: 368 QLDGYDIWETISEGKRSPRQEILHNIDPMYTKARNGSWAAGYGIWNTAIQSAIRVNHWKL 427
Query: 627 DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVK----EVPCEPQIAPCLFDIKN 682
LTG P G D W+ + GP + V + LF+I
Sbjct: 428 -LTGNP------GYGD--WIP-------PQSFSNVGPNRWHNERVTLASGKSVWLFNITA 471
Query: 683 DPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
DP E+ +L+DR D + + +FN+ A P + + + K
Sbjct: 472 DPYERIDLSDRYPD-IVKQLLRRLAQFNKTAVPVRYPPRDPRSNPK 516
>gi|26330047|dbj|BAC28762.1| unnamed protein product [Mus musculus]
Length = 614
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 152/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
A + PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 68 TAGTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 126
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+K+ P
Sbjct: 127 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMP 186
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + + G D+ + AWD +G YST ++T
Sbjct: 187 TKRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 246
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ H +PLFLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 247 ILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 301
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL++ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 302 IHNVTLALKRYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 357
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 358 HSPLLKNKGTVCKELVHITDWYPTLISLA 386
>gi|126331176|ref|XP_001365999.1| PREDICTED: arylsulfatase J [Monodelphis domestica]
Length = 607
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 150/323 (46%), Positives = 208/323 (64%), Gaps = 12/323 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+FILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS +TGK+
Sbjct: 84 PHIVFILADDQGFRDVGYHG-SEIKTPTLDKLAAQGVKLENYYVQPICTPSRSQFITGKY 142
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE PT RGF+
Sbjct: 143 QIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFD 202
Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHS 237
+ G G DY+ H + M G D+ + AWD +G YST ++T + I+ +H
Sbjct: 203 TFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHD 262
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++ V
Sbjct: 263 PRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAINNVTL 317
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ NSII++ SDNGG + SNWPLRG K T WEGG+R G + SPLL+
Sbjct: 318 ALKRYGFYDNSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLK 373
Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
++G + ++ VH++DW PTL++ A
Sbjct: 374 NKGTICKELVHITDWYPTLITLA 396
>gi|350400025|ref|XP_003485710.1| PREDICTED: arylsulfatase B-like, partial [Bombus impatiens]
Length = 301
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 142/224 (63%), Positives = 172/224 (76%), Gaps = 5/224 (2%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHII ILADDLGWNDV FHG DQIPTPNIDALAY+GIIL N+Y LCTPSRSA+MTGK+
Sbjct: 74 PHIIIILADDLGWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKN 133
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIH GMQH+VLY E GLPLSEK+LPQYL+E+GY+T VGKWHLG++KK+YTPT+RGF+
Sbjct: 134 PIHLGMQHSVLYPTEPRGLPLSEKLLPQYLQEIGYKTHAVGKWHLGYFKKQYTPTYRGFD 193
Query: 180 SHLGYWTGHQDYFDHSAEEM-----KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
SH GYW G +DY+ H A+E + G DMRR+L AWD GKY+TD+FT EAV +I+
Sbjct: 194 SHFGYWNGLEDYYTHIAQEPDSQYNEYKGFDMRRNLTVAWDTAGKYATDLFTNEAVRLIN 253
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
H T+ P+FLYLAH A H N + L+APD + +I D +R
Sbjct: 254 EHDTERPMFLYLAHLAVHKGNENQLLRAPDEEIAKFSYILDPER 297
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 28/32 (87%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
MQH+VLY E GLPLSEK+LPQYL+E+GY+T
Sbjct: 139 MQHSVLYPTEPRGLPLSEKLLPQYLQEIGYKT 170
>gi|27734088|ref|NP_775627.1| arylsulfatase J precursor [Mus musculus]
gi|77416378|sp|Q8BM89.1|ARSJ_MOUSE RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
gi|26329953|dbj|BAC28715.1| unnamed protein product [Mus musculus]
gi|81158042|tpe|CAI84995.1| TPA: arylsulfatase J [Mus musculus]
gi|109734872|gb|AAI17814.1| Arylsulfatase J [Mus musculus]
Length = 598
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 152/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
A + PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 68 TAGTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 126
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+K+ P
Sbjct: 127 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMP 186
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + + G D+ + AWD +G YST ++T
Sbjct: 187 TKRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 246
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ H +PLFLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 247 ILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 301
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL++ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 302 IHNVTLALKRYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 357
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 358 HSPLLKNKGTVCKELVHITDWYPTLISLA 386
>gi|260788446|ref|XP_002589261.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
gi|229274436|gb|EEN45272.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
Length = 470
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 153/324 (47%), Positives = 207/324 (63%), Gaps = 12/324 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+IIFILADD GW+D+G+H I TPN+D LA G+ L+NYY +C+PSR +MTG++
Sbjct: 1 PNIIFILADDYGWDDIGYHN-HFIHTPNLDRLASEGVKLENYYVQPVCSPSREQLMTGRY 59
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IH G+QH V+ GLPL E LPQ LK+ GYRT +VGKWHLGF KKEYTP +RGF+
Sbjct: 60 QIHYGLQHGVIRNDRPHGLPLDEVTLPQRLKDNGYRTYMVGKWHLGFCKKEYTPLYRGFD 119
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
G+ TG +DY+ H + + GLD+R EP D +G YST +F +A D+I H +
Sbjct: 120 KFYGFLTGSEDYWTHRRYK-GVRGLDLRDQDEPVLDENGTYSTHLFARKATDMILKHDQN 178
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
+P+FLYL A H PLQ P+ YL + HI +AA + +DE+VG V AL
Sbjct: 179 QPMFLYLPFQAVHG-----PLQVPEKYLQEYMHINFTVDRIYAAQVTAMDEAVGNVTNAL 233
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
+ + N++++F +DNG + SNWPLRG KNTLWEGGVRG G + S LL+ +
Sbjct: 234 KNSGLWENTVLIFSTDNGARR-----HTGSNWPLRGWKNTLWEGGVRGVGFVNSNLLKRK 288
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
G ++ +H+SDW PTLL ++ S
Sbjct: 289 GKTSDALIHISDWFPTLLRISDGS 312
Score = 40.8 bits (94), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 23/107 (21%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEW-------------------QISALTRGKWKL 550
+DG DVW +S+ + S R +LHNID + SAL G WKL
Sbjct: 319 LDGFDVWDAISKGKTSPRKELLHNIDPFYDDQTGSYSPLSYDNIFNTSSYSALRSGDWKL 378
Query: 551 VKENSINGNGTSENRSNDNSYQNEIDGID--VWSVLSRNEPSKRNTI 595
+ G G + R ++ ++ D + +W R +P +RN +
Sbjct: 379 L--TGYQGAGWNRRRPYNHEFEESPDPPNKRLWLFNVREDPGERNDL 423
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 34/141 (24%), Positives = 58/141 (41%), Gaps = 12/141 (8%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
+DG DVW +S+ + S R +LHNID + + + Y +G D
Sbjct: 319 LDGFDVWDAISKGKTSPRKELLHNIDPFYDDQTGSYSPLSYDNIFNTSSYSALRSG--DW 376
Query: 635 VYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRS 694
L+G W R+ + +E P P LF+++ DP E+N+L+
Sbjct: 377 KLLTGYQGAGW---NRRRPYNHE------FEESPDPPNKRLWLFNVREDPGERNDLSKMH 427
Query: 695 EDQRINHYTTEVGRFNQIAYP 715
D + ++ +N+ A P
Sbjct: 428 PDI-VQDLLEKLAVYNKTAVP 447
>gi|351698063|gb|EHB00982.1| Arylsulfatase J [Heterocephalus glaber]
Length = 593
Score = 300 bits (767), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 159/374 (42%), Positives = 222/374 (59%), Gaps = 32/374 (8%)
Query: 29 GYRTRIMAFAVLPLAFTLSMVFVDLVASS--------------------GPPHIIFILAD 68
G+ R++A A L + LS++ ++ PH+IFILAD
Sbjct: 19 GWPGRVVAMAALAGFWVLSLIAYGYLSFGQRLEKEEGSWRALASERLEPSQPHLIFILAD 78
Query: 69 DLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHN 128
D G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS +TGK+ IHTG+QH+
Sbjct: 79 DQGFRDVGYHG-SEIRTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHS 137
Query: 129 VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH 188
++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE PT RGF++ G G
Sbjct: 138 IIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGS 197
Query: 189 QDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHSTDEPLFLYL 246
DY+ H + M G D+ + AWD +G YST ++T I+ +H+ +P+FLY+
Sbjct: 198 GDYYTHYKCDSPGMCGYDLYENDNAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFLYI 257
Query: 247 AHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLS 306
A+ A HS PLQAP Y +R I + R ++AA+L LDE++ V AL+ +
Sbjct: 258 AYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKAYGFYN 312
Query: 307 NSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
NSII++ SDNGG SNWPLRG K T WEGG+R G + SPLL+++G V ++
Sbjct: 313 NSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKEL 368
Query: 367 VHVSDWLPTLLSAA 380
VH++DW PTL+S A
Sbjct: 369 VHITDWYPTLISLA 382
>gi|156362330|ref|XP_001625732.1| predicted protein [Nematostella vectensis]
gi|156212578|gb|EDO33632.1| predicted protein [Nematostella vectensis]
Length = 491
Score = 300 bits (767), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 147/344 (42%), Positives = 222/344 (64%), Gaps = 16/344 (4%)
Query: 42 LAFTLSMVFVDLVA--SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
L+F + +++V SS PH++F+LADDLGW+DVGFHG +I TPNID LA +G+IL
Sbjct: 5 LSFHCFFLCLNVVVLQSSAKPHLLFVLADDLGWSDVGFHG-SKIQTPNIDRLAANGVILD 63
Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
NYY +CTP+R+++MTGK+PIHTG+QH +++ GLPL+ +LPQ L++ GY T ++
Sbjct: 64 NYYVQPVCTPTRASLMTGKYPIHTGLQHGIIHNGRPYGLPLNLTLLPQKLRKAGYSTHML 123
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
GKWHLGFY E TPT+RGF++ G+++G ++++ H + LD+R + E D +G
Sbjct: 124 GKWHLGFYNWESTPTYRGFDTFYGFYSGAENHYTHVQDHY----LDLRDNEEIVRDQNGT 179
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
YS +FT A I+ H PLF+Y+A HS P+QAP Y++ + I+D R
Sbjct: 180 YSAHLFTKRAEQIVRAHDPSTPLFMYMAFQNVHS-----PVQAPKEYIDRYSFIKDPLRR 234
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
+AA++ +D+++G + A ++ + N+I++F +DNGG N ++PLRG K+T
Sbjct: 235 TYAAMVTIMDDALGNLTRAFDKAGLWENTILIFSTDNGGVPK----NGGYDYPLRGRKDT 290
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
LWEGGVRG + LE G+ + +HV+DW PTL+S A S
Sbjct: 291 LWEGGVRGVAFVHGVALEQSGVKCKALMHVTDWYPTLVSLAGGS 334
>gi|26343103|dbj|BAC35208.1| unnamed protein product [Mus musculus]
Length = 555
Score = 299 bits (766), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 152/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
A + PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 68 TAGTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 126
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+K+ P
Sbjct: 127 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMP 186
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + + G D+ + AWD +G YST ++T
Sbjct: 187 TKRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 246
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ H +PLFLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 247 ILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 301
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL++ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 302 IHNVTLALKRYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 357
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 358 HSPLLKNKGTVCKELVHITDWYPTLISLA 386
>gi|26338057|dbj|BAC32714.1| unnamed protein product [Mus musculus]
Length = 570
Score = 299 bits (765), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 152/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
A + PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 68 TAGTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 126
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+K+ P
Sbjct: 127 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMP 186
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + + G D+ + AWD +G YST ++T
Sbjct: 187 TKRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 246
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ H +PLFLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 247 ILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 301
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL++ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 302 IHNVTLALKRYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 357
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 358 HSPLLKNKGTVCKELVHITDWYPTLISLA 386
>gi|426231093|ref|XP_004009578.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Ovis aries]
Length = 597
Score = 298 bits (764), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 150/326 (46%), Positives = 212/326 (65%), Gaps = 11/326 (3%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG+
Sbjct: 77 PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGR 135
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LPL + LPQ L+ELGY T +VGKWHLGFY+KE PT RGF
Sbjct: 136 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGF 195
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ I+ +HS
Sbjct: 196 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHS 255
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V +
Sbjct: 256 PRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 310
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 311 ALKRHGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 366
Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKS 383
+ + VH++DW PTL++ A +
Sbjct: 367 RKRRTSRALVHITDWYPTLVALAGGT 392
>gi|363733898|ref|XP_420639.3| PREDICTED: arylsulfatase J [Gallus gallus]
Length = 573
Score = 298 bits (763), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 151/328 (46%), Positives = 211/328 (64%), Gaps = 12/328 (3%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A S PHI+FILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 45 APSSQPHIVFILADDQGFRDVGYHG-SEIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQF 103
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY++E PT
Sbjct: 104 ITGKYQIHTGLQHSIIRPTQPNCLPLDNITLPQKLKEVGYSTHMVGKWHLGFYRRECMPT 163
Query: 175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDI 232
RGF++ G G DY+ H + + G D+ + AWD +G YST ++T + I
Sbjct: 164 QRGFDTFFGSLLGSGDYYTHFKCDSPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQI 223
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++
Sbjct: 224 LASHNPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEAI 278
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
V AL++ NSII++ SDNGG + SNWPLRG K T WEGG+R G +
Sbjct: 279 NNVTLALKKYGYYDNSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVH 334
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL++ A
Sbjct: 335 SPLLKNKGSVCKELVHITDWFPTLITLA 362
>gi|260813923|ref|XP_002601665.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
gi|229286967|gb|EEN57677.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
Length = 478
Score = 298 bits (762), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 148/324 (45%), Positives = 208/324 (64%), Gaps = 17/324 (5%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+FILADD GW+D+G+HG I TP +D LA G+ L+NYY +C+PSR +MTG++
Sbjct: 1 PNIVFILADDYGWDDIGYHG-SFIQTPKLDRLAKEGVKLENYYVQPICSPSRCQLMTGRY 59
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
I G+QH+V+ GLPL E LPQ LKE GY T +VGKWHLGF++KE+ P RGF+
Sbjct: 60 QIRYGLQHSVITSDRPHGLPLDEVTLPQKLKENGYSTYVVGKWHLGFFRKEHMPLQRGFD 119
Query: 180 SHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
GY TG +DY+ H + + + GLD+R +P D +G YST +F +A++II
Sbjct: 120 KFYGYLTGGEDYWTHRRPNLYAKDPLAFHGLDLRDQDKPVLDQNGTYSTHLFAKKAIEII 179
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
NH +P+FLYL A HS PL+AP Y+ ++ +++ +AA++ +DE+VG
Sbjct: 180 QNHERSKPMFLYLPFQAVHS-----PLEAPTEYVEKYKDVKNPLVKTYAAMVSAMDEAVG 234
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
V +AL++ R+ N++++F +DNG NA SNWPLRG KNTLWEGGVRG G + S
Sbjct: 235 NVTDALKRTRLWDNTVLIFSTDNGARR-----NAGSNWPLRGWKNTLWEGGVRGVGFVTS 289
Query: 354 PLLESRGIVAEQYVHVSDWLPTLL 377
LL + +H+SDW PTLL
Sbjct: 290 KLLRKGKRKCDALIHISDWFPTLL 313
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 62/160 (38%), Gaps = 49/160 (30%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID-------------------DEWQISALTKGKWKL 615
+DG D+W +S PS R ILHNID + +A+ G WKL
Sbjct: 326 LDGFDIWDTISSGTPSPRKEILHNIDPLHPKTDLSSLDSLHDHIFNPSVYAAIRSGDWKL 385
Query: 616 VKVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAP 675
LTG DQ GL ++ + K DA P
Sbjct: 386 ------------LTG--DQ----GLGGWKYSGTSGNKYEDAVE-----------PPNKHL 416
Query: 676 CLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
LF+I++DP E+N+L+ + D + + +N+ A P
Sbjct: 417 WLFNIRSDPQERNDLSVQYPDV-VQELLERLAAYNKTALP 455
Score = 40.8 bits (94), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 49/107 (45%), Gaps = 22/107 (20%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID-------------------DEWQISALTRGKWKL 550
+DG D+W +S PS R ILHNID + +A+ G WKL
Sbjct: 326 LDGFDIWDTISSGTPSPRKEILHNIDPLHPKTDLSSLDSLHDHIFNPSVYAAIRSGDWKL 385
Query: 551 VKENSINGNGTSENRSNDNSYQNEIDGID--VWSVLSRNEPSKRNTI 595
+ + G G + ++ N Y++ ++ + +W R++P +RN +
Sbjct: 386 LTGDQGLG-GWKYSGTSGNKYEDAVEPPNKHLWLFNIRSDPQERNDL 431
>gi|432879612|ref|XP_004073512.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
Length = 673
Score = 298 bits (762), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 148/333 (44%), Positives = 211/333 (63%), Gaps = 12/333 (3%)
Query: 50 FVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTP 109
F D + PPHIIFI+ DD G+ND+G+H D I TP +D LA G+ L+NYY +CTP
Sbjct: 38 FNDRASGVQPPHIIFIMTDDQGFNDIGYHSSD-IKTPTLDKLAAKGVKLENYYIQPICTP 96
Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
SRS +TG++ IHTG+QH+++ + LP + LPQ L++LGY T +VGKWHLGFYKK
Sbjct: 97 SRSQFITGRYQIHTGLQHSIIRPRQPNCLPFDQVTLPQRLQQLGYSTHMVGKWHLGFYKK 156
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSA-EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
E PT RGF+++ G TG +Y+ +S+ + ++ G D+ AWD GKYST +FT
Sbjct: 157 ECLPTRRGFDTYFGSLTGSVNYYTYSSCDGPELCGFDLHEGESVAWDQGGKYSTHLFTQR 216
Query: 229 AVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
I+ H +PLF++L+ A HS PLQ+P Y+ + +E+ R K+AA++
Sbjct: 217 VRKILARHDPQSQPLFIFLSFQAVHS-----PLQSPKEYICPYSGLENVARRKYAAMVST 271
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+V + AL + NS+++F +DNGG L+ SNWPLRG K T WEGG+RG
Sbjct: 272 VDEAVRNITYALRKYGYYQNSVLIFSTDNGGQP----LSGGSNWPLRGRKGTYWEGGIRG 327
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ R V++ VH++DW PTL+ A
Sbjct: 328 LGFVHSPLLKKRKRVSKALVHITDWYPTLVGLA 360
>gi|326919013|ref|XP_003205778.1| PREDICTED: arylsulfatase J-like [Meleagris gallopavo]
Length = 573
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 150/326 (46%), Positives = 210/326 (64%), Gaps = 12/326 (3%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
S PHI+FILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS +T
Sbjct: 47 SSQPHIVFILADDQGFRDVGYHG-SEIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQFIT 105
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
GK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY++E PT R
Sbjct: 106 GKYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQR 165
Query: 177 GFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIH 234
GF++ G G DY+ H + + G D+ + AWD +G YST ++T + I+
Sbjct: 166 GFDTFFGSLLGSGDYYTHFKCDSPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILA 225
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
+H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++
Sbjct: 226 SHNPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEAINN 280
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V AL++ NSII++ SDNGG + SNWPLRG K T WEGG+R G + SP
Sbjct: 281 VTLALKKYGYYDNSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVHSP 336
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
LL+++G V ++ VH++DW PTL++ A
Sbjct: 337 LLKNKGSVCKELVHITDWFPTLITLA 362
>gi|440901665|gb|ELR52564.1| Arylsulfatase I, partial [Bos grunniens mutus]
Length = 565
Score = 296 bits (759), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 150/326 (46%), Positives = 212/326 (65%), Gaps = 11/326 (3%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG+
Sbjct: 40 PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGR 98
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LPL + LPQ L+ELGY T +VGKWHLGFY+KE PT RGF
Sbjct: 99 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGF 158
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ I+ +HS
Sbjct: 159 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHS 218
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V +
Sbjct: 219 PRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 273
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 274 ALKRHGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 329
Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKS 383
+ + VH++DW PTL++ A +
Sbjct: 330 RKRRTSRALVHITDWYPTLVALAGGT 355
>gi|449267146|gb|EMC78112.1| Arylsulfatase I [Columba livia]
Length = 573
Score = 296 bits (758), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 153/336 (45%), Positives = 217/336 (64%), Gaps = 13/336 (3%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A + PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS +
Sbjct: 45 AFARPPHIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQL 103
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TG++ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFYKKE PT
Sbjct: 104 ITGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPT 163
Query: 175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
RGF++ LG TG+ DY+ + + + + G D+ + AWD GKYST ++ I+
Sbjct: 164 RRGFDTFLGSLTGNVDYYTYDNCDGPGVCGYDLHEGEDVAWDQSGKYSTFLYAQRVSKIL 223
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
+HS EP+F+Y+A A H+ PLQ+P Y+ +R + + R K+AA++ +DE+V
Sbjct: 224 ASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVK 278
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
+ AL++ NS+IVF +DNGG + SNWPLRG K T WEGGVRG G + S
Sbjct: 279 NITWALKKYGYYDNSVIVFSTDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGIGFVHS 334
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDIPN 387
PL++ + + VH++DW PTL+S A N S++P
Sbjct: 335 PLIKRKRRTSWALVHITDWYPTLVSLARGNLSNVPG 370
>gi|61876881|ref|XP_593725.1| PREDICTED: arylsulfatase I [Bos taurus]
gi|297477411|ref|XP_002689338.1| PREDICTED: arylsulfatase I [Bos taurus]
gi|296485188|tpg|DAA27303.1| TPA: arylsulfatase I-like [Bos taurus]
Length = 574
Score = 296 bits (758), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 150/326 (46%), Positives = 212/326 (65%), Gaps = 11/326 (3%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG+
Sbjct: 49 PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGR 107
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LPL + LPQ L+ELGY T +VGKWHLGFY+KE PT RGF
Sbjct: 108 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGF 167
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ I+ +HS
Sbjct: 168 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHS 227
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V +
Sbjct: 228 PRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 282
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 283 ALKRHGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 338
Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKS 383
+ + VH++DW PTL++ A +
Sbjct: 339 RKRRTSRALVHITDWYPTLVALAGGT 364
>gi|296193239|ref|XP_002744413.1| PREDICTED: arylsulfatase I [Callithrix jacchus]
Length = 569
Score = 296 bits (758), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 150/326 (46%), Positives = 210/326 (64%), Gaps = 11/326 (3%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG+
Sbjct: 46 PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGR 104
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF
Sbjct: 105 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGF 164
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ NHS
Sbjct: 165 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILANHS 224
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V +
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335
Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKS 383
+ + +H++DW PTL+ A +
Sbjct: 336 RKRRTSRALMHITDWYPTLVGLAGGT 361
>gi|326928585|ref|XP_003210457.1| PREDICTED: arylsulfatase I-like [Meleagris gallopavo]
Length = 574
Score = 296 bits (757), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 153/336 (45%), Positives = 216/336 (64%), Gaps = 13/336 (3%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A + PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS +
Sbjct: 45 AFTRPPHIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQL 103
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TG++ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFYKKE PT
Sbjct: 104 ITGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPT 163
Query: 175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
RGF++ LG TG+ DY+ + + + + G D+ AWD GKYST ++ I+
Sbjct: 164 RRGFDTFLGSLTGNVDYYTYDNCDGPGVCGYDLHEGENVAWDQSGKYSTFLYAQRVSKIL 223
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
+HS EP+F+Y+A A H+ PLQ+P Y+ +R + + R K+AA++ +DE+V
Sbjct: 224 ASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVK 278
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
+ AL++ NS+IVF +DNGG + SNWPLRG K T WEGGVRG G + S
Sbjct: 279 NITWALKKYGYYDNSVIVFSTDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGIGFVHS 334
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDIPN 387
PL++ + + VH++DW PTL+S A N S++P
Sbjct: 335 PLIKRKRRTSWALVHITDWYPTLVSLARGNLSNVPG 370
>gi|50755099|ref|XP_425212.1| PREDICTED: arylsulfatase I [Gallus gallus]
Length = 574
Score = 295 bits (756), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 153/336 (45%), Positives = 216/336 (64%), Gaps = 13/336 (3%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A + PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS +
Sbjct: 45 AFARPPHIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQL 103
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TG++ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFYKKE PT
Sbjct: 104 ITGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPT 163
Query: 175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
RGF++ LG TG+ DY+ + + + + G D+ AWD GKYST ++ I+
Sbjct: 164 RRGFDTFLGSLTGNVDYYTYDNCDGPGVCGYDLHEGENVAWDQSGKYSTFLYAQRVSKIL 223
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
+HS EP+F+Y+A A H+ PLQ+P Y+ +R + + R K+AA++ +DE+V
Sbjct: 224 ASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVK 278
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
+ AL++ NS+IVF +DNGG + SNWPLRG K T WEGGVRG G + S
Sbjct: 279 NITWALKKYGYYDNSVIVFSTDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGIGFVHS 334
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDIPN 387
PL++ + + VH++DW PTL+S A N S++P
Sbjct: 335 PLIKRKRRTSWALVHITDWYPTLVSLARGNLSNVPG 370
>gi|241676246|ref|XP_002411524.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215504222|gb|EEC13716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 490
Score = 295 bits (756), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 154/350 (44%), Positives = 206/350 (58%), Gaps = 13/350 (3%)
Query: 44 FTLSMVFVDLVASSGP--------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
F L + +D V G P+I+FIL DDLGW DV FHG QIPTPNID LA G
Sbjct: 2 FVLCRILLDAVFLLGTILQCHAKRPNIVFILGDDLGWGDVSFHGSTQIPTPNIDVLAGDG 61
Query: 96 IILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYR 155
+IL NYY + LCTPSR+A+MTG +PI GMQ + GLPL KILPQ+ K+LGY
Sbjct: 62 VILNNYYVLPLCTPSRAALMTGLYPIRNGMQLTSIQAAGPWGLPLENKILPQHFKDLGYD 121
Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
++GKWHLGF+K Y P RGF++ G++TG DY++H++ L + ++
Sbjct: 122 VNMIGKWHLGFFKTPYVPIKRGFDTFFGFYTGSNDYYNHTSGSSHRKILAVTSSVQVNTL 181
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTD-----EPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
G+ S +V +H + +P F Y++H A H A E QAP +
Sbjct: 182 EKGRLSLWGPRELSVCFLHQIYSPLNFYLQPFFCYISHQAVHHALNAEMFQAPARNVLKF 241
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
+I + R+ +A +L LDES+G+VVEAL +L ++IIVF SDNGG+ G N N
Sbjct: 242 SYIGEPNRTIYAGMLDALDESMGRVVEALSSAGILEDTIIVFSSDNGGSPYGLESNRGYN 301
Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
WPLRG K TLWEG VR +W+P L + V+ Q +H+SDWLPTL +AA
Sbjct: 302 WPLRGAKFTLWEGSVRVPAFVWNPKLLKKSRVSNQLMHISDWLPTLYTAA 351
Score = 46.2 bits (108), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 58/136 (42%), Gaps = 30/136 (22%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV------------------ 616
IDG+D+W L+ SKR IL NID + IS L ++KLV
Sbjct: 361 IDGVDMWHSLTHGSASKRKEILLNIDSLYNISGLRYKQYKLVVGGGFDGELDDHYYFPGG 420
Query: 617 ----KVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKE--VPCE 670
+ +R Q + ++Y DR W LA K A + + E VP +
Sbjct: 421 TRPTNNIDQLRNQSTMARVLKELYEK--QDRSWSPLAWSK-NVAVDCRRNRLTENFVPMQ 477
Query: 671 PQIAPCLFDIKNDPCE 686
P P LF+I+ DPCE
Sbjct: 478 P---PYLFNIEEDPCE 490
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 36/75 (48%), Gaps = 9/75 (12%)
Query: 484 IPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL 543
+P L + GN ++ IDG+D+W L+ SKR IL NID + IS L
Sbjct: 344 LPTLYTAAGGNASA---------LGGIDGVDMWHSLTHGSASKRKEILLNIDSLYNISGL 394
Query: 544 TRGKWKLVKENSING 558
++KLV +G
Sbjct: 395 RYKQYKLVVGGGFDG 409
>gi|390369306|ref|XP_003731620.1| PREDICTED: uncharacterized protein LOC763377 [Strongylocentrotus
purpuratus]
Length = 784
Score = 295 bits (756), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 153/355 (43%), Positives = 218/355 (61%), Gaps = 27/355 (7%)
Query: 46 LSMVFVDLVASSG----------PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
L++ F++L+ S P+I+FIL DDLG+NDVG+HG +I TPNID LA G
Sbjct: 3 LALFFINLLIISNVCSKKFSQLTKPNIVFILIDDLGYNDVGYHG-SEIYTPNIDKLAREG 61
Query: 96 IILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYR 155
+ L+NYY +CTP+RS +++G++ IHTG+QH+ + + LP + L+E GY
Sbjct: 62 VRLENYYVQPICTPTRSQLLSGRYQIHTGLQHSYIRPAQPLCLPTNLPTFADKLREAGYA 121
Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
T VGKWHLGFYKKE PT RGF+S+ GY TG +DY+ H K L +R + AW+
Sbjct: 122 THAVGKWHLGFYKKECLPTQRGFDSYFGYLTGGEDYWTH---HRKRPXLALRHVDKVAWE 178
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
G YS VFT + I+ H ++P LYL + HS PLQ P Y +++I++
Sbjct: 179 YGGYYSAFVFTEKIQQIVAQHPVEQPFLLYLPFQSVHS-----PLQVPSSYEERYKNIKN 233
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGFNLNAASNWPLR 334
R +A ++ LDE+VGK+V +L+Q + N++++F +DNGG AAG N NWPLR
Sbjct: 234 TNRRIYAGMMTCLDEAVGKIVHSLQQAGLWDNTVLIFSTDNGGEVAAGGN-----NWPLR 288
Query: 335 GVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
G K ++WEGG+RG G + SPLL + +G V +Q +HVSDW PTL+ S + N
Sbjct: 289 GWKRSIWEGGMRGVGFVNSPLLPASVQGTVNKQLIHVSDWFPTLVQGVAGSSLDN 343
>gi|291227280|ref|XP_002733615.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 499
Score = 295 bits (755), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 157/332 (47%), Positives = 211/332 (63%), Gaps = 16/332 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+ SS PHIIFILADDLGW+DVG+HG I TP+IDALA G+ L NYYT LCTPSRS
Sbjct: 16 LVSSSRPHIIFILADDLGWSDVGYHG-SVIKTPHIDALASEGVKLDNYYTSLLCTPSRSQ 74
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+MTG++ IHTG+QH + + LP+ E ILPQ LK+ GY T +VGKWHLGFY++E P
Sbjct: 75 LMTGRYEIHTGLQHRTIDMMQPLCLPIDETILPQKLKDRGYATHMVGKWHLGFYRQECLP 134
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH-GKYSTDVFTAEAVDI 232
RGF++ +G++ DY+ H+ K G D RRD + + + G+YST VF EA DI
Sbjct: 135 NNRGFDTFMGFYQAMGDYYYHNVSTGKFNGWDFRRDNDVIAERYAGQYSTHVFADEARDI 194
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI--EDFKRSKFAAILHKLDE 290
I H+ D PLFL+L+ A H PLQ P Y +I+ + R +A ++ +DE
Sbjct: 195 ISKHNPDVPLFLFLSFQAIHF-----PLQVPSRYADIYNTLIPNSADRRTYAGMVTCMDE 249
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
+VG VV L+ + NS++VF +DNGG + NWPLRG K +L+EGGVRG
Sbjct: 250 AVGGVVATLKHSGLWENSVLVFSTDNGGIHS-----LGGNWPLRGGKASLYEGGVRGVAF 304
Query: 351 IWSPLLES--RGIVAEQYVHVSDWLPTLLSAA 380
+ SPLL S RG + ++ +H++DW PTL+ A
Sbjct: 305 VTSPLLPSRVRGTINKELLHMTDWFPTLVRLA 336
>gi|156364432|ref|XP_001626352.1| predicted protein [Nematostella vectensis]
gi|156213225|gb|EDO34252.1| predicted protein [Nematostella vectensis]
Length = 270
Score = 295 bits (755), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 138/268 (51%), Positives = 186/268 (69%), Gaps = 4/268 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+ I+ADDLGW+DV FHG QIPTP ID LA G+IL +YY +CTP+R+++MTGKH
Sbjct: 3 PNIVMIVADDLGWDDVSFHGSSQIPTPTIDKLASEGVILNSYYVSPICTPTRASLMTGKH 62
Query: 120 PIHTGM---QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
P++ GM H ++G + GLPL E PQY+K LGY T +GKWHLGF++KEYTPT+R
Sbjct: 63 PMNLGMLIHTHATVFGTQPYGLPLGETTTPQYMKSLGYVTHGIGKWHLGFFEKEYTPTYR 122
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
GF+S G+W G +DY+DHS++E +WG D+R + +P + G Y T++F A IIH H
Sbjct: 123 GFDSFYGFWNGKEDYWDHSSQE-DVWGTDLRDNEKPVRNESGHYGTELFAERAAQIIHLH 181
Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
+ +PL+LYLA HSAN EPLQAP + HI KR +AA++ LDESV V
Sbjct: 182 NQTKPLYLYLAQQGVHSANGNEPLQAPKRLIKKFSHISSPKRRIYAAMVSSLDESVETVH 241
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFN 324
+AL + ML+N+++VF +DNGGA GFN
Sbjct: 242 KALSETGMLNNTVLVFTTDNGGAPRGFN 269
>gi|84370328|ref|NP_001033588.1| arylsulfatase I precursor [Mus musculus]
gi|123779975|sp|Q32KI9.1|ARSI_MOUSE RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
gi|81158040|tpe|CAI84994.1| TPA: arylsulfatase I [Mus musculus]
gi|148677850|gb|EDL09797.1| mCG6034 [Mus musculus]
gi|187954139|gb|AAI38971.1| Arylsulfatase i [Mus musculus]
gi|187954429|gb|AAI41170.1| Arylsulfatase i [Mus musculus]
Length = 573
Score = 295 bits (754), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 151/331 (45%), Positives = 212/331 (64%), Gaps = 11/331 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
VA PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 41 VAPPQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQ 99
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE P
Sbjct: 100 LLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLP 159
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
T RGF++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I
Sbjct: 160 TRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHI 219
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
+ +H+ PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V
Sbjct: 220 LASHNPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAV 274
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
+ AL++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G +
Sbjct: 275 RNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVH 330
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
SPLL+ + + VH++DW PTL+ A +
Sbjct: 331 SPLLKKKRRTSRALVHITDWYPTLVGLAGGT 361
>gi|432098813|gb|ELK28308.1| Arylsulfatase I [Myotis davidii]
Length = 571
Score = 295 bits (754), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 151/331 (45%), Positives = 212/331 (64%), Gaps = 11/331 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
VA PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 41 VAPPQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQ 99
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE P
Sbjct: 100 LLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLP 159
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
T RGF++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ I
Sbjct: 160 TRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHI 219
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
+ +HS +PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V
Sbjct: 220 LASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRSMGNVARRKYAAMVTCMDEAV 274
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
+ AL++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G +
Sbjct: 275 RNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVH 330
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
SPLL+ + + VH++DW PTL+ A +
Sbjct: 331 SPLLKRKRRTSRALVHITDWYPTLVGLAGGT 361
>gi|327274122|ref|XP_003221827.1| PREDICTED: arylsulfatase J-like [Anolis carolinensis]
Length = 564
Score = 295 bits (754), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 208/322 (64%), Gaps = 12/322 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
H+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS +TG++
Sbjct: 44 HLIFILADDQGFRDVGYHG-SEIRTPTLDRLAAEGVKLENYYVQPMCTPSRSQFITGRYQ 102
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+V+ + LPL LPQ LKE GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 103 IHTGLQHSVIRPTQPNCLPLDNATLPQKLKEAGYSTHMVGKWHLGFYRKECMPTQRGFDT 162
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHST 238
G G DY+ H + +M G D+ + AWD +G YST ++T + I+ +H+
Sbjct: 163 FFGSLLGSGDYYTHYKCDSPRMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNP 222
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
+P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++ V A
Sbjct: 223 RKPIFLYIAYQAVHS-----PLQAPGMYYERYRSINNINRRRYAAMLACLDEAINNVTLA 277
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L++ +NS+I++ SDNGG +NWPLRG K T WEGG+R G + SPLL++
Sbjct: 278 LKRYGYYNNSVIIYSSDNGGQPTA----GGNNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 333
Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
+G V ++ +H++DW PTL++ A
Sbjct: 334 KGCVCKELIHITDWFPTLITLA 355
>gi|301765544|ref|XP_002918191.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Ailuropoda
melanoleuca]
Length = 573
Score = 295 bits (754), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 150/328 (45%), Positives = 210/328 (64%), Gaps = 11/328 (3%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
S PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++T
Sbjct: 45 SQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 103
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
G++ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT R
Sbjct: 104 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRR 163
Query: 177 GFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
GF++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ I+ +
Sbjct: 164 GFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILAS 223
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
HS PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V +
Sbjct: 224 HSPRRPLFLYVAFQAVHT-----PLQSPREYLYRYRAMGNVARRKYAAMVTCMDEAVRNI 278
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
AL++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPL
Sbjct: 279 TWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPL 334
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAANKS 383
L+ + + VH++DW PTL+ A +
Sbjct: 335 LKRKRRTSRALVHITDWYPTLVGLAGGT 362
>gi|126697310|gb|ABO26612.1| arylsulfatase [Haliotis discus discus]
Length = 481
Score = 294 bits (753), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 149/334 (44%), Positives = 208/334 (62%), Gaps = 14/334 (4%)
Query: 52 DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
D+ A+ P HI+FI+ADDLGWND+GFH D I TPNID LA G++L ++Y LC+PSR
Sbjct: 18 DVSAAGRPRHIVFIVADDLGWNDIGFHNPDII-TPNIDKLAREGLLLNHHYVQPLCSPSR 76
Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
+A M+G +P TG+QH+V+ + LPL+ ILPQ LKELGY T IVGKWH GF
Sbjct: 77 AAFMSGYYPFKTGLQHSVILENQPVCLPLNITILPQKLKELGYATHIVGKWHNGFCSWNC 136
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
TPT+RGF+S GY+ +DY+ H LD R + P W +G YST FT A D
Sbjct: 137 TPTYRGFDSFFGYYGAMEDYYTHVIRGF----LDYRNNTTPVWTDNGTYSTLRFTDVATD 192
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
II H+ +PLFLYLA+ A Y P++ P Y ++ +I+ R KF+ ++ LDE+
Sbjct: 193 IIERHNQSQPLFLYLAYQAV-----YGPIEVPAKYEAMYPNIKSENRRKFSGMVSALDEA 247
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
VG V + L QR ++ +++I+F +DNGG + +N+PLRG K T++EGG R G +
Sbjct: 248 VGNVTKTLRQRGLMDDTLILFTADNGGGVD----ESGNNYPLRGSKFTVYEGGTRAVGFM 303
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
+ L+ G V + +H DWLPTL +AA + +
Sbjct: 304 YGSGLQKTGTVFDGMIHAVDWLPTLTAAAGGTPV 337
>gi|355750329|gb|EHH54667.1| hypothetical protein EGM_15550 [Macaca fascicularis]
Length = 569
Score = 294 bits (753), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 149/326 (45%), Positives = 210/326 (64%), Gaps = 11/326 (3%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG+
Sbjct: 46 PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGR 104
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF
Sbjct: 105 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGF 164
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS
Sbjct: 165 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHS 224
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V +
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335
Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKS 383
+ + +H++DW PTL+ A +
Sbjct: 336 RKQRTSRALMHITDWYPTLVGLAGGT 361
>gi|109079349|ref|XP_001108178.1| PREDICTED: arylsulfatase I-like [Macaca mulatta]
gi|402873074|ref|XP_003900411.1| PREDICTED: arylsulfatase I [Papio anubis]
gi|355691752|gb|EHH26937.1| hypothetical protein EGK_17023 [Macaca mulatta]
Length = 569
Score = 294 bits (752), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 149/326 (45%), Positives = 210/326 (64%), Gaps = 11/326 (3%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG+
Sbjct: 46 PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGR 104
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF
Sbjct: 105 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGF 164
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS
Sbjct: 165 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHS 224
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V +
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335
Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKS 383
+ + +H++DW PTL+ A +
Sbjct: 336 RKQRTSRALMHITDWYPTLVGLAGGT 361
>gi|6863176|gb|AAF30402.1|AF109924_1 sulfatase 1 precursor [Helix pomatia]
Length = 503
Score = 294 bits (752), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 142/325 (43%), Positives = 216/325 (66%), Gaps = 12/325 (3%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
+G P+I+F+LADD G++DVG+HG +I TP +DAL+ SG+ L+NYY +CTP+RS +M+
Sbjct: 31 AGQPNIVFVLADDFGFHDVGYHG-SEIHTPTLDALSASGVRLENYYVQPICTPTRSQLMS 89
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
G++ IHTG+QH ++ C+ LP L LKE GY T +VGKWHLGFYK+EY P R
Sbjct: 90 GRYQIHTGLQHGIINSCQPNALPNDSPTLADKLKESGYATHMVGKWHLGFYKQEYLPWNR 149
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
GF+++ GY +DYF+H+ ++ LD+R + P + G+YS +FT +A+D++ +H
Sbjct: 150 GFDTYFGYLNAAEDYFNHNVPWRQVRYLDLRDNNGPVRNETGQYSAHLFTGKAIDVVQSH 209
Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
+T +PLFLYLA+ + H+ PL+ P+ Y + +R+I D R FA ++ LDE V +
Sbjct: 210 NTSKPLFLYLAYQSVHA-----PLEVPEKYEHKYRNITDKNRRTFAGMVSALDEGVANLT 264
Query: 297 EALEQRRMLSNSIIVFVSDNGGAA-AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
+AL+ + + +N++++F +DNGG AG N N+PLRG K +LWEGG G G +
Sbjct: 265 QALKDKGLWNNTVLIFSTDNGGQIHAGGN-----NYPLRGWKASLWEGGFHGVGFVSGGA 319
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
L+ G V++ +HVSDW PTL++ A
Sbjct: 320 LKRSGAVSKGLIHVSDWFPTLVTLA 344
>gi|332822312|ref|XP_527073.3| PREDICTED: arylsulfatase I isoform 2 [Pan troglodytes]
Length = 569
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 149/332 (44%), Positives = 211/332 (63%), Gaps = 11/332 (3%)
Query: 53 LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
+ S PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 40 FIYPSRPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRS 98
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
++TG++ IHTG+QH+++ + LPL + LP L+E GY T +VGKWHLGFY+KE
Sbjct: 99 QLLTGRYQIHTGLQHSIIRPQQPNCLPLDQVTLPHKLQEAGYSTHMVGKWHLGFYRKECL 158
Query: 173 PTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
PT RGF++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ A
Sbjct: 159 PTRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASH 218
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +HS PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+
Sbjct: 219 ILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEA 273
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
V + AL++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G +
Sbjct: 274 VRNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFV 329
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
SPLL+ + + +H++DW PTL+ A +
Sbjct: 330 HSPLLKRKQRTSRALMHITDWYPTLVGLAGGT 361
>gi|189521775|ref|XP_688265.2| PREDICTED: hypothetical protein LOC559800 [Danio rerio]
Length = 1542
Score = 293 bits (750), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 144/320 (45%), Positives = 201/320 (62%), Gaps = 12/320 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHIIFI+ DD G+ DVG+HG +I TP +D LA +G+ L+NYY LC+PSRS +MTG++
Sbjct: 1027 PHIIFIMVDDQGFRDVGYHG-SEIKTPTLDRLAAAGVKLENYYVQPLCSPSRSQLMTGRY 1085
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH+++ + LPL LPQ LK GY T +VGKWHLGFYK+ PT RGF+
Sbjct: 1086 QIHTGLQHSIIRPTQPNCLPLENITLPQKLKNAGYSTHMVGKWHLGFYKRACMPTQRGFD 1145
Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHS 237
+ G G DY+ H + + G D+ E AW+ G YST ++T +AV+I+ +H+
Sbjct: 1146 TFFGSLLGSGDYYSHYKCDSPGLCGYDLHEGEEAAWEQDRGVYSTIMYTQKAVNILASHN 1205
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
P+FLYLA A HS PLQ P YL ++ I + R K+AA++ LDE+V +
Sbjct: 1206 PKRPIFLYLAFQAVHS-----PLQVPARYLERYKSIPNLHRRKYAAMVSCLDEAVRNLTL 1260
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL+Q N ++V+ SDNGG + SNWPLRG K + WEGG+R G + SPLL
Sbjct: 1261 ALKQYGYYDNMVMVYSSDNGGQP----MAGGSNWPLRGSKGSYWEGGIRAVGFVHSPLLV 1316
Query: 358 SRGIVAEQYVHVSDWLPTLL 377
+G ++ +H++DW PTL+
Sbjct: 1317 KKGTKSKALIHITDWYPTLV 1336
>gi|348542810|ref|XP_003458877.1| PREDICTED: arylsulfatase J [Oreochromis niloticus]
Length = 551
Score = 293 bits (750), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 154/326 (47%), Positives = 204/326 (62%), Gaps = 13/326 (3%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
SS PHIIFIL DD G+ DVG+HG D I TP +D LA G+ L+NYY LC+PSRS +M
Sbjct: 20 SSSQPHIIFILVDDQGFRDVGYHGSD-IKTPTLDRLAAEGVKLENYYVQPLCSPSRSQLM 78
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TG++ IHTG+QH+V+ + LPL LPQ LK GY T +VGKWHLGFYK+ PT
Sbjct: 79 TGRYQIHTGLQHSVIRAAQPNCLPLENVTLPQKLKNAGYSTHMVGKWHLGFYKRGCLPTQ 138
Query: 176 RGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDII 233
RGF++ G G DY+ H + M G D+ E AW+ G YST++FT +AV I+
Sbjct: 139 RGFDTFFGSLLGSGDYYSHYKCQGPSMCGYDLYEGEEAAWEQDRGLYSTEMFTQKAVSIL 198
Query: 234 HNHST-DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
NH+ +PLFLYLA+ A HS PLQ P YL ++ I + R K+AA++ LDE++
Sbjct: 199 ANHNPRKQPLFLYLAYQAVHS-----PLQVPARYLERYKGIPNPYRRKYAAMVSCLDEAI 253
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
+ AL+ N++IV+ SDNGG L SNWPLRG K T WEGG+R G +
Sbjct: 254 HNLTLALKHYGYYDNTVIVYSSDNGGQP----LAGGSNWPLRGSKATYWEGGIRTVGFVH 309
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLS 378
SPLL ++G +H++DW PTL++
Sbjct: 310 SPLLMNKGTKCRSLIHITDWFPTLVT 335
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 42/159 (26%), Positives = 63/159 (39%), Gaps = 41/159 (25%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID--------DEWQI----------SALTKGKWKLV 616
+DG DVW +S PS R+ ILHNID W+ +AL G WKL+
Sbjct: 347 LDGYDVWETISEGRPSPRHDILHNIDPIYIKAKNGSWKAGYGLWNTAIQAALRVGHWKLL 406
Query: 617 KVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPC 676
V V P Q +L+ + L + R VP + +
Sbjct: 407 TGVPGYSDWV-----PPQTFLN-----QRLTTRLHNER------------VPWDRGKSIW 444
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
LF+I DP E+ +L+ R + + ++N+ A P
Sbjct: 445 LFNITADPYERVDLSQRYP-HIVKKMLMRLAQYNKTAVP 482
>gi|224067708|ref|XP_002198824.1| PREDICTED: arylsulfatase I [Taeniopygia guttata]
Length = 575
Score = 293 bits (749), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 147/323 (45%), Positives = 210/323 (65%), Gaps = 11/323 (3%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHIIFIL DD G++D+G+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG+
Sbjct: 49 PPHIIFILTDDQGYHDIGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGR 107
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFYKKE PT RGF
Sbjct: 108 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGF 167
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
++ LG TG+ DY+ + + + + G D+ + AWD GKYST ++ I+ +HS
Sbjct: 168 DTFLGSLTGNVDYYTYDNCDGPDVCGYDLHEGEDVAWDQSGKYSTFLYAQRVSKILASHS 227
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
EP+F+Y+A A H+ PLQ+P Y+ +R + + R K+AA++ +DE+V +
Sbjct: 228 PKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITW 282
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ NS++VF +DNGG + SNWPLRG K T WEGGVRG G + SPL++
Sbjct: 283 ALKKYGYYDNSVMVFSTDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIK 338
Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
+ + VH++DW PTL+S A
Sbjct: 339 RKRRTSWALVHITDWYPTLVSLA 361
>gi|444723687|gb|ELW64328.1| Arylsulfatase I [Tupaia chinensis]
Length = 613
Score = 292 bits (748), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 148/322 (45%), Positives = 208/322 (64%), Gaps = 11/322 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 93 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 151
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 152 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 211
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS
Sbjct: 212 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPR 271
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
+PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 272 QPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 326
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 327 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 382
Query: 360 GIVAEQYVHVSDWLPTLLSAAN 381
+ VH++DW PTL+ A
Sbjct: 383 RRTSRALVHITDWYPTLVGLAG 404
>gi|391330458|ref|XP_003739677.1| PREDICTED: arylsulfatase J-like [Metaseiulus occidentalis]
Length = 633
Score = 292 bits (747), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 160/427 (37%), Positives = 239/427 (55%), Gaps = 16/427 (3%)
Query: 32 TRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
T + F V+ +A +L VF L P II + DD+GW+DV FH QIPTPNIDA+
Sbjct: 31 TVLAIFGVIIIAISLLTVF--LWKRQRAPDIILFIVDDMGWSDVSFHANGQIPTPNIDAM 88
Query: 92 AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
++L ++Y CTPSR A++TGK+PI G+Q V+ + L L ++LPQYL++
Sbjct: 89 CSDAVLLNSHYVQASCTPSRGALLTGKYPIKIGLQEYVIQPGRQEALHLKHRLLPQYLRD 148
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK-MWGLDMRRDL 210
LGY T +VGKWHLGFY ++Y P RGF+S G++ G Y++HSA + G D +
Sbjct: 149 LGYATHLVGKWHLGFYAEDYLPENRGFDSFYGFYNGAGTYYNHSASDADGRIGYDWHLNK 208
Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA-NPYEPLQAPDHYLNI 269
E D HGKY+TD+ T +I + ++P+FL ++H A H N E + ++
Sbjct: 209 ESDPDAHGKYATDIITQRVKHLIQSRDPEKPMFLMISHMAPHGGDNEDELFEVDRQWIED 268
Query: 270 HR--HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL-- 325
HI R+K+A ++ LD++ G+ V AL++ ML NS+I+F SDNGG +
Sbjct: 269 PEIAHIMVESRTKYAGMIRALDDAFGQTVVALKEAEMLQNSVILFTSDNGGPSIVDESYK 328
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL--LSAANKS 383
ASNWPLRG K ++WEGGVR + +WSPLL+S+ + + H++DWLPT L+
Sbjct: 329 TGASNWPLRGQKFSMWEGGVRVSSFLWSPLLKSQRYINKHLYHITDWLPTFYKLAGGELR 388
Query: 384 DIP------NYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKY 437
D+P + +S N +N ++ Y+ G E + R N + E +P Y
Sbjct: 389 DLPIDIDGHDIWDSIDRNTDSPRKNLLINYDTGGTENFAVRRGNFKLMHGKPPPEVDPWY 448
Query: 438 ENRYENG 444
++ Y
Sbjct: 449 DSWYATA 455
>gi|241596950|ref|XP_002404637.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215500440|gb|EEC09934.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 406
Score = 292 bits (747), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 150/343 (43%), Positives = 207/343 (60%), Gaps = 5/343 (1%)
Query: 44 FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
FT+++ ASS PP+++ ILADDLGW+DV +HG DQIPTPNID LA GIIL + Y
Sbjct: 13 FTVALFLEATSASSQPPNLVMILADDLGWHDVSYHGSDQIPTPNIDVLAMDGIILFHNYV 72
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-- 161
L TP+R+A++TG +PIHTG Q + + GL +LPQ L +G
Sbjct: 73 QPLSTPTRAALLTGLYPIHTGTQRLDIGSADPIGLSADFTLLPQLSVTLADNFTSLGARS 132
Query: 162 -WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE--MKMWGLDMRRDLEPAWDLHG 218
WHLGF K E+ PT RGF++ G + G DY+ H A + + + G ++ + + G
Sbjct: 133 GWHLGFCKDEFKPTKRGFDTFYGIYNGDSDYWTHFARDNNIDVSGHALKDEKRALVEEAG 192
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
+Y T + +AV +IHN ++P FLY A A H LQAP Y++ ++ D+ R
Sbjct: 193 RYLTSLLANQAVQLIHNRPKNKPFFLYFAPTAVHCGGSNGSLQAPKEYISKFGYLADYDR 252
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
FA L +LD+SVG +VEAL L+N++I F +DNGGA GF+ N + NWPLRG K
Sbjct: 253 QLFAGSLAELDKSVGLIVEALYVSGQLNNTVIAFSTDNGGAPVGFSANTSPNWPLRGTKG 312
Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
T+ EGGVRG G +WS L +RG V +Q HV+DW+PT +AA
Sbjct: 313 TVAEGGVRGPGFLWSSSLTTRGRVTQQLFHVTDWMPTFYTAAG 355
>gi|348583281|ref|XP_003477401.1| PREDICTED: arylsulfatase I [Cavia porcellus]
Length = 572
Score = 292 bits (747), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 151/332 (45%), Positives = 211/332 (63%), Gaps = 11/332 (3%)
Query: 53 LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
L A PHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 41 LAAPPQRPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRS 99
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
++TG++ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE
Sbjct: 100 QLLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECL 159
Query: 173 PTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
PT RGF++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ A
Sbjct: 160 PTRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASH 219
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+
Sbjct: 220 ILASHNPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEA 274
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
V + AL++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G I
Sbjct: 275 VRNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFI 330
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
SPLL+ + + VH++DW PTL+ A +
Sbjct: 331 HSPLLKRKRWTSHALVHITDWYPTLVGLAGGT 362
>gi|291239530|ref|XP_002739676.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 507
Score = 292 bits (747), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 161/355 (45%), Positives = 216/355 (60%), Gaps = 20/355 (5%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
++F + +A L+++ + P+I+FILADDLGW+DVG+H I TPNID LA
Sbjct: 4 VSFYIAFIAGILTILPIAGAPFGTRPNIVFILADDLGWHDVGYHD-SIIRTPNIDKLAAE 62
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
G+ L+NYY LCTP+R+ +MTG++ IHTGMQH VL E LP E ++PQ LKE GY
Sbjct: 63 GVKLENYYVTPLCTPTRAVLMTGRYQIHTGMQHGVLMAQEPRCLPTDEVLMPQKLKESGY 122
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS----AEEMKMWGLDMRR-D 209
T +VGKWHLGFYK TP RGF++ G + DYF+H+ + W D+R D
Sbjct: 123 TTHMVGKWHLGFYKWACTPNHRGFDTFFGMYLAGGDYFNHTRLCHGRRLAAW--DLRDGD 180
Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
A + GKYST VF +A +II H PLFLYL+ A H+ PLQ P+ Y+N+
Sbjct: 181 QVVAPEYVGKYSTIVFAEKAQEIIKKHDPTNPLFLYLSFQAVHA-----PLQVPERYINM 235
Query: 270 HR-HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
++ I D R +A + +DE++G V L +R +L N+IIVF SDNGGA L A
Sbjct: 236 YKDDIRDESRRIYAGMTTCMDEAIGNVTRTLARRGLLDNTIIVFSSDNGGA----KLYGA 291
Query: 329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAAN 381
SN+PLRG K TL+EGGVR ++SPLL + RG + ++V DW PT L A
Sbjct: 292 SNYPLRGQKGTLYEGGVRAPAFVYSPLLSNRVRGTATHELMYVGDWFPTFLHLAG 346
>gi|195403369|ref|XP_002060263.1| GJ19825 [Drosophila virilis]
gi|194140907|gb|EDW57358.1| GJ19825 [Drosophila virilis]
Length = 324
Score = 291 bits (746), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 138/281 (49%), Positives = 188/281 (66%), Gaps = 3/281 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II IL DD+G+NDV FHG +QI TPNIDA AY+G+IL YY LCTPSR+A++TGK+
Sbjct: 28 PNIIIILIDDMGFNDVSFHGSNQILTPNIDAFAYNGVILNRYYVPNLCTPSRAALLTGKY 87
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIH GMQH V E GLPL E+++PQ+ ++ GY T++VGKWHLGF+++++TP RGF+
Sbjct: 88 PIHNGMQHFVQIPDEPWGLPLGERLMPQFFRDAGYSTQLVGKWHLGFWRQDHTPIMRGFD 147
Query: 180 SHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
H GY+ G+ DY+DH+ + G D RRDL+ +G Y+T+ FT EA II H
Sbjct: 148 HHFGYYNGYIDYYDHTHYMLDRNYTAGADFRRDLQRCHSDNGTYATEAFTKEARRIIEQH 207
Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
PLF+ L+H A H+ N +P+QAP + HI D KR +A ++ LD+SVG+ +
Sbjct: 208 DLSRPLFMVLSHLAVHTGNENQPMQAPYEEVAKFVHISDPKRRTYAGMVSSLDKSVGQTI 267
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
AL R ML+NSI++ SDNG G + N+ SN P RGV
Sbjct: 268 RALSDRGMLNNSIVLLYSDNGAPTLGIHSNSGSNHPFRGVS 308
>gi|348514291|ref|XP_003444674.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
Length = 570
Score = 291 bits (746), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 146/334 (43%), Positives = 206/334 (61%), Gaps = 12/334 (3%)
Query: 50 FVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTP 109
F+ PPHIIFI+ DD G+ND+G+H D I TP +D LA G+ L+NYY +CTP
Sbjct: 38 FIGRFPVPQPPHIIFIMTDDQGFNDIGYHSSD-IRTPVLDKLAADGVKLENYYIQPICTP 96
Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
SRS +TG++ IHTG+QH+++ C+ LP + LPQ L+ELGY T +VGKWHLGFYKK
Sbjct: 97 SRSQFITGRYQIHTGLQHSIIRPCQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKK 156
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
E PT RGF+++ G TG +Y+ + + + G D+ AW GKYST ++T
Sbjct: 157 ECLPTRRGFDTYFGSLTGSVNYYTYDGCDGAGLCGFDLHEGESVAWGQSGKYSTHLYTQR 216
Query: 229 AVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
I+ H +PLF++L+ A H+ PLQ PD Y+ + +E+ R K+AA++
Sbjct: 217 VRKILATHDPQSQPLFIFLSFQAVHT-----PLQYPDEYIYPYLGLENVARRKYAAMVSA 271
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+V + AL + NS+I+F +DNGG L+ SNWPLRG K T WEGG+RG
Sbjct: 272 VDEAVRNITYALRKYGYYQNSVIIFSTDNGGQP----LSGGSNWPLRGRKGTYWEGGIRG 327
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
G + SPLL + V++ VH++DW PTL+ A
Sbjct: 328 LGFVHSPLLRKKRRVSKALVHITDWYPTLVGLAG 361
>gi|241595184|ref|XP_002404450.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215502341|gb|EEC11835.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 311
Score = 291 bits (745), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 199/310 (64%)
Query: 71 GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
GW+D HG QIPTPN+DA+A GIIL +Y LCTPSR+A+MTG++P H GMQH+V+
Sbjct: 1 GWDDTSIHGSSQIPTPNMDAIAADGIILNQHYVQPLCTPSRAALMTGRYPFHVGMQHSVI 60
Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
E LPL+ ++P+Y + LGY+T +VGKWHLG+Y ++Y P RGF++ LG++ D
Sbjct: 61 KPAEPWALPLNYTLMPEYFRCLGYKTHMVGKWHLGYYDRQYVPIKRGFDTFLGFYNPSLD 120
Query: 191 YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAA 250
Y++ + G D R + W +Y+T +T + V+II H+ P+FL+L+H A
Sbjct: 121 YYNQNFTGNNHTGHDFRCGDQNYWAEEKEYATYYYTNKTVEIIRRHNKSAPMFLFLSHQA 180
Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
H + LQ P H + +I + R+ FA ++ LD+SVG VV+ L++ ML+++II
Sbjct: 181 PHVSGGRPLLQVPTHGVRNFSYIGENNRTLFAGMVDALDQSVGIVVDTLQKAGMLNDTII 240
Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
VF SDNG GF N NWPLRG K TL+EGGVRG +WS L++ V+ Q +H+S
Sbjct: 241 VFCSDNGALPWGFKSNRGYNWPLRGGKFTLYEGGVRGTAFLWSRRLKNTKRVSNQLMHIS 300
Query: 371 DWLPTLLSAA 380
DWLPTL SAA
Sbjct: 301 DWLPTLYSAA 310
>gi|348501876|ref|XP_003438495.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
Length = 571
Score = 291 bits (745), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 145/324 (44%), Positives = 201/324 (62%), Gaps = 11/324 (3%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPH IFI+ DD G+ D+G+HG D I TP +D LA G+ L+NYY +C+PSRS +MTG+
Sbjct: 45 PPHFIFIMVDDQGYGDIGYHGSD-IHTPVLDRLAAEGVKLENYYVQPICSPSRSQLMTGR 103
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LP LP+ L E GY T +VGKWHLGF + PT RGF
Sbjct: 104 YQIHTGLQHSIIRPRQPLCLPPDSPTLPERLAEAGYATHMVGKWHLGFCRPSCLPTGRGF 163
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
+S LG TG D+F + S + + G D+ PAW++ G YST ++ I+ H
Sbjct: 164 QSFLGTLTGSGDHFSYQSCDGAEACGFDLHDGDRPAWEMAGNYSTLLYIDRVKQILKRHD 223
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
PLFLYL+ A H+ PLQ PDH+L+++ + R +AA+L LD V +VV+
Sbjct: 224 PHTPLFLYLSLQAAHT-----PLQVPDHFLHLYDSHRNRLRRHYAAMLSCLDSGVAQVVQ 278
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
L+ + + NS++++ SDNGG L+ SNWPLRG K T WEGG+R G + SPLL+
Sbjct: 279 ELKTQGLYENSVLIYSSDNGGQP----LSGGSNWPLRGGKGTYWEGGIRAVGFVHSPLLK 334
Query: 358 SRGIVAEQYVHVSDWLPTLLSAAN 381
+GIV+ +HVSDW PTLL A
Sbjct: 335 KKGIVSRALIHVSDWYPTLLGLAG 358
>gi|156380740|ref|XP_001631925.1| predicted protein [Nematostella vectensis]
gi|156218974|gb|EDO39862.1| predicted protein [Nematostella vectensis]
Length = 540
Score = 290 bits (743), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 148/339 (43%), Positives = 216/339 (63%), Gaps = 19/339 (5%)
Query: 53 LVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGIILKNYYTVQLCTPSR 111
L++ +GPPHI+FIL DDLGW+DVG+H + + TPNID LA G+ L +YY+ +CTPSR
Sbjct: 28 LLSMAGPPHIMFILMDDLGWSDVGYHNISHAVKTPNIDKLASQGVKLMSYYSQPMCTPSR 87
Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
A+MTGK+PIH GMQH V+ G+P +PQ L+ LGYRT ++GKWHLGF+ +Y
Sbjct: 88 GALMTGKYPIHLGMQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGKWHLGFFDWDY 147
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
TP RGF+S LG++ G QD++ HS KM LD RRD EPA + G++STDVFT EA++
Sbjct: 148 TPLRRGFDSFLGFFAGEQDHWRHS----KMGFLDFRRDEEPANEYGGQHSTDVFTQEAIN 203
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I H+ +PLFL L++AA H+ PLQA + +N + D R + ++ D S
Sbjct: 204 IAMRHNASQPLFLLLSYAAVHT-----PLQAHPNDVNKIGGVSDKDRQNYLGMMGAADWS 258
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNG---GAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
+G++++ ++ + +N+++++ SDNG G G+ NWPLRG K++L+EGGVR
Sbjct: 259 IGRLIDVYKRNGLWNNTLMIWASDNGAQPGKGGGY------NWPLRGYKSSLFEGGVRVP 312
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
+ +L+ +G HV+DW PTL+ A P+
Sbjct: 313 AFVHGEMLQRKGGTVNDLFHVTDWYPTLVKLAGGEVEPD 351
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 56/133 (42%), Gaps = 30/133 (22%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
+IDG+D W LS +PSKR ILHNID + Q + P
Sbjct: 351 DIDGVDQWPTLSEGKPSKREEILHNID--------------------IPANQEEERMAPR 390
Query: 634 QV-YLSGLSDREWLALAMRKLRDAASIQC------GPVKEVPCE---PQIAPCLFDIKND 683
Y SG + R + K+ DA Q GPV E + P + L++I D
Sbjct: 391 GFNYYSGAALRRGHMKLVYKMGDAGWYQLPENGHRGPVVEEMVKDRLPIVELALYNITAD 450
Query: 684 PCEKNNLADRSED 696
P E+N+L+ + D
Sbjct: 451 PEERNDLSKLNPD 463
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 20/27 (74%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNID 535
+IDG+D W LS +PSKR ILHNID
Sbjct: 351 DIDGVDQWPTLSEGKPSKREEILHNID 377
>gi|301621596|ref|XP_002940132.1| PREDICTED: arylsulfatase I-like [Xenopus (Silurana) tropicalis]
Length = 575
Score = 290 bits (743), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 147/326 (45%), Positives = 209/326 (64%), Gaps = 11/326 (3%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
S PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++T
Sbjct: 46 SRPPHIIFILTDDQGFHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLIT 104
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
G++ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFYKKE PT R
Sbjct: 105 GRYQIHTGLQHSIIRPRQPNCLPLHQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRR 164
Query: 177 GFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
GF++ LG TG+ DY+ + + + + G D+ AWD GKYST ++ I+ +
Sbjct: 165 GFDTFLGSLTGNVDYYSYDNCDGPGVCGFDLHEGENVAWDQAGKYSTLLYAQRVNQILAS 224
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
H+ +P+F+Y+A A H+ PLQ+P Y+ +R + + R K+AA++ +D +V +
Sbjct: 225 HNPQQPIFIYVAFQAVHT-----PLQSPREYIYPYRGMGNVARRKYAAMVTCMDAAVKNI 279
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
AL++ SNS+IVF SDNGG + SNWPLRG K T WEGGVRG G + SPL
Sbjct: 280 TRALKKYGYYSNSVIVFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPL 335
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAAN 381
++ + + +H++DW PTL+ A
Sbjct: 336 IKKKRRTSRALMHITDWYPTLVKLAG 361
>gi|291387626|ref|XP_002710353.1| PREDICTED: arylsulfatase I-like [Oryctolagus cuniculus]
Length = 571
Score = 290 bits (742), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 149/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 48 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPR 226
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 227 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 337
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ VH++DW PTL+ A S
Sbjct: 338 RRTSRALVHITDWYPTLVGLAGGS 361
>gi|351713085|gb|EHB16004.1| Arylsulfatase I [Heterocephalus glaber]
Length = 573
Score = 290 bits (742), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 149/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 49 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 107
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T ++GKWHLGFY+KE PT RGF++
Sbjct: 108 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMLGKWHLGFYRKECLPTRRGFDT 167
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS
Sbjct: 168 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQ 227
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 228 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMSNVSRRKYAAMVTCMDEAVRNITWAL 282
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NSII+F SDNGG + SNWPLRG K T WEGGVRG G I SPLL+ +
Sbjct: 283 KRYGFYNNSIIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFIHSPLLKRK 338
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ VH++DW PTL+ A +
Sbjct: 339 RRTSHALVHITDWYPTLVGLARGT 362
>gi|403285505|ref|XP_003934063.1| PREDICTED: arylsulfatase I [Saimiri boliviensis boliviensis]
Length = 572
Score = 290 bits (742), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 148/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 48 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ NHS
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILANHSPQ 226
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 227 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 337
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ +H++DW PTL+ A +
Sbjct: 338 RRTSRALMHITDWYPTLVGLAGGT 361
>gi|397517762|ref|XP_003829075.1| PREDICTED: arylsulfatase I [Pan paniscus]
gi|410214522|gb|JAA04480.1| arylsulfatase family, member I [Pan troglodytes]
gi|410261150|gb|JAA18541.1| arylsulfatase family, member I [Pan troglodytes]
gi|410300016|gb|JAA28608.1| arylsulfatase family, member I [Pan troglodytes]
gi|410336277|gb|JAA37085.1| arylsulfatase family, member I [Pan troglodytes]
Length = 569
Score = 290 bits (741), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 48 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQ 226
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 227 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 337
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ +H++DW PTL+ A +
Sbjct: 338 QRTSRALMHITDWYPTLVGLAGGT 361
>gi|395817250|ref|XP_003782086.1| PREDICTED: arylsulfatase I [Otolemur garnettii]
Length = 572
Score = 290 bits (741), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 148/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 48 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQ 226
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 227 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 337
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ VH++DW PTL+ A +
Sbjct: 338 RRTSRALVHITDWYPTLVGLAGGT 361
>gi|114145559|ref|NP_001041346.1| arylsulfatase I precursor [Rattus norvegicus]
gi|123779983|sp|Q32KJ8.1|ARSI_RAT RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
gi|81158022|tpe|CAI84985.1| TPA: arylsulfatase I [Rattus norvegicus]
gi|149064375|gb|EDM14578.1| similar to RIKEN cDNA 9330196J05 [Rattus norvegicus]
Length = 573
Score = 289 bits (740), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 148/324 (45%), Positives = 209/324 (64%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 48 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQ 226
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
+PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 227 KPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKKK 337
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ VH++DW PTL+ A +
Sbjct: 338 RRTSRALVHITDWYPTLVGLAGGT 361
>gi|344265150|ref|XP_003404649.1| PREDICTED: arylsulfatase I-like [Loxodonta africana]
Length = 573
Score = 289 bits (740), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 149/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 50 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 108
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 109 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 168
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS
Sbjct: 169 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTLLYAQRASHILASHSPR 228
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 229 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 283
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NSII+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 284 KRYGFYNNSIIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 339
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ VH++DW PTL+ A +
Sbjct: 340 RRTSRALVHITDWYPTLVGLAGGT 363
>gi|311250496|ref|XP_003124150.1| PREDICTED: arylsulfatase I-like [Sus scrofa]
Length = 573
Score = 289 bits (740), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 147/325 (45%), Positives = 207/325 (63%), Gaps = 11/325 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 49 PHIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRY 107
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH+++ + LPL + LPQ L++LGY T +VGKWHLGFY+KE PT RGF+
Sbjct: 108 QIHTGLQHSIIRPRQPNCLPLDQVTLPQRLQQLGYATHMVGKWHLGFYRKECLPTRRGFD 167
Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+ LG TG+ DY+ + + + + G D+ AW L G+YST ++ I+ HS
Sbjct: 168 TFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTLLYAQRVSRILAGHSP 227
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + A
Sbjct: 228 RRPLFLYVAFQAVHT-----PLQSPREYLYRYRGMGNVARRKYAAMVTCMDEAVRNITGA 282
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 283 LKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKR 338
Query: 359 RGIVAEQYVHVSDWLPTLLSAANKS 383
+ +H++DW PTL+ A +
Sbjct: 339 TRRTSRALLHITDWYPTLVGLAGGT 363
>gi|59797060|ref|NP_001012301.1| arylsulfatase I precursor [Homo sapiens]
gi|74722581|sp|Q5FYB1.1|ARSI_HUMAN RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
gi|58201084|gb|AAW66665.1| arylsulfatase I [Homo sapiens]
gi|120538357|gb|AAI29997.1| Arylsulfatase family, member I [Homo sapiens]
gi|120538621|gb|AAI29996.1| Arylsulfatase family, member I [Homo sapiens]
gi|220983388|dbj|BAH11166.1| arylsulfatase I [Homo sapiens]
Length = 569
Score = 289 bits (739), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 48 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQ 226
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 227 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 337
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ +H++DW PTL+ A +
Sbjct: 338 QRTSRALMHITDWYPTLVGLAGGT 361
>gi|426350600|ref|XP_004042858.1| PREDICTED: arylsulfatase I [Gorilla gorilla gorilla]
Length = 569
Score = 289 bits (739), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 48 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQ 226
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 227 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 337
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ +H++DW PTL+ A +
Sbjct: 338 QRTSRALMHITDWYPTLVGLAGGT 361
>gi|126291233|ref|XP_001378869.1| PREDICTED: arylsulfatase I [Monodelphis domestica]
Length = 584
Score = 289 bits (739), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 209/324 (64%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 55 HIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 113
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E+GY T +VGKWHLGFYKK PT RGF++
Sbjct: 114 IHTGLQHSIIRPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDT 173
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW+ G+YST ++ A I+ +HS
Sbjct: 174 FLGSLTGNVDYYTYDNCDGPGVCGYDLHEGENVAWEQSGQYSTLLYAQRASQILASHSPH 233
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
+PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 234 QPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 288
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ NS+I+F +DNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 289 KRYGYYDNSVIIFSTDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGIGFVHSPLLKRK 344
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ VH++DW PTL++ A S
Sbjct: 345 RRTSRALVHITDWYPTLVTLAGGS 368
>gi|114326196|ref|NP_001041583.1| arylsulfatase I precursor [Canis lupus familiaris]
gi|81158064|tpe|CAI85006.1| TPA: arylsulfatase I [Canis lupus familiaris]
Length = 575
Score = 288 bits (738), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 207/324 (63%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 51 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 109
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 110 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 169
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ I+ +HS
Sbjct: 170 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPR 229
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 230 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITSAL 284
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 285 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 340
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ VH++DW PTL+ A +
Sbjct: 341 RRTSRALVHITDWYPTLVGLAGGT 364
>gi|218563492|sp|Q32KH7.2|ARSI_CANFA RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
Length = 573
Score = 288 bits (738), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 207/324 (63%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 49 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 107
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 108 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 167
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ I+ +HS
Sbjct: 168 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPR 227
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 228 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITSAL 282
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 283 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 338
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ VH++DW PTL+ A +
Sbjct: 339 RRTSRALVHITDWYPTLVGLAGGT 362
>gi|410949653|ref|XP_003981535.1| PREDICTED: arylsulfatase I, partial [Felis catus]
Length = 570
Score = 288 bits (736), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 207/324 (63%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 46 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 104
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 105 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 164
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ I+ +HS
Sbjct: 165 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASHSPR 224
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 225 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 279
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 280 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 335
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ VH++DW PTL+ A +
Sbjct: 336 RRTSRALVHITDWYPTLVGLAGGT 359
>gi|47215546|emb|CAG06276.1| unnamed protein product [Tetraodon nigroviridis]
Length = 527
Score = 288 bits (736), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 146/325 (44%), Positives = 204/325 (62%), Gaps = 12/325 (3%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHIIFI+ DD G+ND+G+H D I TP +D LA G+ L+NYY +CTPSRS ++TG+
Sbjct: 47 PPHIIFIMTDDQGFNDIGYHSSD-IKTPVLDKLAADGVKLENYYIQPICTPSRSQLITGR 105
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LP + LPQ L+ELGY T +VGKWHLGFYKKE PT RGF
Sbjct: 106 YQIHTGLQHSIIRPRQPNCLPFDQITLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGF 165
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
+++ G TG +Y+ + S + + G D+ AW GKYST ++T I+ H
Sbjct: 166 DTYFGSLTGSVNYYTYDSCDGPGVCGFDLHEGESVAWSQRGKYSTHLYTQRVRKILATHD 225
Query: 238 -TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
+PLF++L+ A H+ PLQ P Y+ +R +E+ R K+AA++ +DE+V +
Sbjct: 226 PQSQPLFIFLSFQAVHT-----PLQCPQEYIYPYRGLENIARRKYAAMVSAVDEAVRNIT 280
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
L + NSII+F +DNGG L+ SNWPLRG K T WEGGVRG G + SPLL
Sbjct: 281 YGLRKYGYYQNSIIIFSTDNGGQP----LSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLL 336
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAAN 381
+ V++ VH++DW PTL+ A
Sbjct: 337 RKKRRVSKALVHITDWYPTLVGLAG 361
>gi|410913855|ref|XP_003970404.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
Length = 570
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 146/324 (45%), Positives = 203/324 (62%), Gaps = 12/324 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHIIFI+ DD G+ND+G+H D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 48 PHIIFIMTDDQGFNDIGYHSSD-IKTPVLDKLAADGVKLENYYIQPICTPSRSQLITGRY 106
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH+++ + LP + LPQ L+ELGY T +VGKWHLGFYKKE PT RGF+
Sbjct: 107 QIHTGLQHSIIRPRQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFD 166
Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
++ G TG +Y+ + S + M G D+ AW GKYST ++T I+ H
Sbjct: 167 TYFGSLTGSVNYYTYDSCDGPGMCGFDLHEGESVAWSQKGKYSTHLYTQRVRKILATHDP 226
Query: 239 -DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+PLF++L+ A H+ PLQ P Y+ +R +E+ R K+AA++ +DE+V +
Sbjct: 227 RSQPLFIFLSFQAVHT-----PLQCPREYIYPYRGLENIARRKYAAMVSAVDEAVRNITY 281
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
L + NSI++F +DNGG L+ SNWPLRG K T WEGGVRG G I SPLL
Sbjct: 282 GLRKYGYYENSIMIFSTDNGGQP----LSGGSNWPLRGRKGTYWEGGVRGLGFIHSPLLR 337
Query: 358 SRGIVAEQYVHVSDWLPTLLSAAN 381
+ V++ VH++DW PTL+ A
Sbjct: 338 KKKRVSKALVHITDWYPTLVGLAG 361
>gi|443690889|gb|ELT92899.1| hypothetical protein CAPTEDRAFT_165852 [Capitella teleta]
Length = 484
Score = 286 bits (731), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 144/325 (44%), Positives = 202/325 (62%), Gaps = 13/325 (4%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+ ILADDLGWNDVG++ +I TPN+D LA +G+IL Y + C+PSR+A++TG++
Sbjct: 28 PNILLILADDLGWNDVGWNN-PEIKTPNLDRLASNGVILNASYALSTCSPSRTALLTGRY 86
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P G+QH V+ + GLPL+ +LPQ LK LGY T +GKWHLGF + EYTPTFRGF+
Sbjct: 87 PFKLGLQHGVVKKGKPYGLPLNITLLPQKLKHLGYSTHAIGKWHLGFCRWEYTPTFRGFD 146
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
S G+++G +DY+ ++ G D R + + KYST + AV II H
Sbjct: 147 SFYGFYSGSEDYYKRKTAAIR--GYDFRMNTKVFKPKIKKYSTLDYGRRAVKIIQAHKRT 204
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
EPLFLY+ A HS PLQ P + +R+I D R F+ ++ +D ++G ++AL
Sbjct: 205 EPLFLYMPFQAVHS-----PLQVPKSFEFKYRNIVDRNRRIFSGMVSAMDSAIGMTIKAL 259
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP-LLES 358
+QR ML+N++IVF+SDNGG +NWPLRG K TLWEGG R +W LL+
Sbjct: 260 KQRNMLTNTMIVFLSDNGGTP----FFGGNNWPLRGSKATLWEGGTRVPSFVWGKMLLQK 315
Query: 359 RGIVAEQYVHVSDWLPTLLSAANKS 383
G + + +H DW PTL+S A +
Sbjct: 316 AGYTSNEMIHAVDWFPTLVSLAGGT 340
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 38/160 (23%), Positives = 71/160 (44%), Gaps = 35/160 (21%)
Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
+ ++DG+D W +LS+ +PS+R ++ I+ +A+ +GK+KL ++ T G
Sbjct: 342 EKDVDGVDQWQMLSQRQPSRRKEFVYGINSITGRAAIRRGKYKL----------IEGTAG 391
Query: 632 PDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 691
+ S R + M++++ + L++I+ DP E N+L+
Sbjct: 392 HYSGWYPAPSSR---VVKMKRMKQLNKRHVN-----------STLLYNIQVDPIESNDLS 437
Query: 692 DRSED------QRINHY-TTEVGRFNQIAYPDKEEEEEKK 724
+ +R+N Y EV FN P KE + K
Sbjct: 438 AKYPHLTQFMLRRLNRYKAQEVEAFN----PRKERYSQPK 473
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 33/49 (67%)
Query: 507 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENS 555
+ ++DG+D W +LS+ +PS+R ++ I+ +A+ RGK+KL++ +
Sbjct: 342 EKDVDGVDQWQMLSQRQPSRRKEFVYGINSITGRAAIRRGKYKLIEGTA 390
>gi|241025894|ref|XP_002406215.1| arylsulfatase J, putative [Ixodes scapularis]
gi|215491896|gb|EEC01537.1| arylsulfatase J, putative [Ixodes scapularis]
Length = 437
Score = 285 bits (730), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 150/324 (46%), Positives = 199/324 (61%), Gaps = 5/324 (1%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+ + DDLGW DVGFHG QIP PNIDALA G+IL YY SR ++TG +
Sbjct: 22 PNIVIVTVDDLGWADVGFHGSRQIPVPNIDALAADGVILNKYYAQPWPLSSRIGLLTGIY 81
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P TG+ V+ C+ LP +ILP + K LGYRT VG W+LGFYKK +TP RGF+
Sbjct: 82 PYRTGVG-RVMLPCQPVALPSVFRILPTFFKSLGYRTHFVGVWNLGFYKKRFTPVNRGFD 140
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT--AEAVDIIHNHS 237
+ WTG DY+ H + KM G D+R + + W+ G YST +FT A+ +
Sbjct: 141 TAYAKWTGPGDYWTHDMQT-KMQGFDLRLNDDLMWNQTGVYSTRLFTERADPTRKLCFFV 199
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+PL L L+H A H+AN + LQ P +L+ I D KR FA L ++D+S+G++ E
Sbjct: 200 LHQPLLLILSHQALHTANYHGSLQCPLEHLDSFGFIRDRKRRIFAGALTEVDQSLGQIFE 259
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAG-FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
AL RR+L+N+I+VFVS +GG G F N + N+PLRG K T +EGGVR +WSPLL
Sbjct: 260 ALHNRRLLNNTILVFVSASGGMPVGDFTNNLSFNFPLRGTKGTCFEGGVRVPAFVWSPLL 319
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA 380
+ Q +HV+DWLPTL SAA
Sbjct: 320 NKTRRTSSQLMHVTDWLPTLFSAA 343
>gi|432904444|ref|XP_004077334.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
Length = 572
Score = 285 bits (729), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 140/323 (43%), Positives = 203/323 (62%), Gaps = 11/323 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PH+IFI+ DD G+ D+G+HG D + TP +D LA G+ L+NYY +C+PSRS +MTG++
Sbjct: 44 PHLIFIMVDDQGYADIGYHGSD-VHTPVLDQLAAEGVKLENYYVQPICSPSRSQLMTGRY 102
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH+++ + LP LP+ L + GY T +VGKWHLGF + PT RGF+
Sbjct: 103 QIHTGLQHSIIRPRQPLCLPPDIPTLPECLLKAGYHTHMVGKWHLGFCRPSCLPTRRGFQ 162
Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
S G TG D+F + S + + G D+ PAW++ G YST ++ I+ NH
Sbjct: 163 SFFGTLTGSGDHFSYQSCDGAEACGFDLHDGSRPAWEMRGNYSTLLYIERVKQILRNHDP 222
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
+ PLFLYL+ A H+ PLQ P H+LN ++ + R +AA+L LD+ V +VV+
Sbjct: 223 NTPLFLYLSLQAAHT-----PLQVPAHFLNPYKSQNNRHRRHYAAMLSCLDDGVAQVVKE 277
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L+ + NSI+++ SDNGG L+ SNWPLRG K + WEGG+R G + SPLL+
Sbjct: 278 LKTTGIYDNSILIYSSDNGGQP----LSGGSNWPLRGGKGSYWEGGIRAVGFVHSPLLKK 333
Query: 359 RGIVAEQYVHVSDWLPTLLSAAN 381
+G+++++ +HVSDW PTLL A
Sbjct: 334 KGVISKEMIHVSDWYPTLLGLAG 356
>gi|1089794|dbj|BAA08412.1| Arylsulfatase B [Rattus norvegicus]
Length = 473
Score = 285 bits (729), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 138/307 (44%), Positives = 200/307 (65%), Gaps = 15/307 (4%)
Query: 83 IPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSE 142
I TP++DALA G++L NYY LCTPSRS ++TG++ IH G+QH ++ C+ +PL E
Sbjct: 7 IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHMGLQHYLIMTCQPNCVPLDE 66
Query: 143 KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM--- 199
K+LPQ LK+ G T +VGKWHLG Y+KE PT RGF+++ GY G +DY+ H A
Sbjct: 67 KLLPQLLKDAGSSTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIEC 126
Query: 200 ---KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANP 256
LD+R EPA + YST++FT A +I NH ++PLFLYLA + H
Sbjct: 127 LNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH---- 182
Query: 257 YEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
+PLQ P+ Y+ + I+D R +A ++ LDE+VG V +AL+ R + +N++++F +DN
Sbjct: 183 -DPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDN 241
Query: 317 GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
GG + +NWPLRG K TLWEGG+RGAG + SPLL+ +G+ + + +H++DWLPTL
Sbjct: 242 GGQTR----SGGNNWPLRGRKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTL 297
Query: 377 LSAANKS 383
++ A S
Sbjct: 298 VNLAGGS 304
>gi|397466741|ref|XP_003805104.1| PREDICTED: arylsulfatase B [Pan paniscus]
Length = 513
Score = 285 bits (729), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 139/302 (46%), Positives = 194/302 (64%), Gaps = 15/302 (4%)
Query: 85 TPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKI 144
TP++DALA G++L NYYT LCTPSRS ++TG++ I TG+QH +++ C+ +PL EK+
Sbjct: 49 TPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKL 108
Query: 145 LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH------SAEE 198
LPQ LKE GY T +VGKWHLG Y+KE PT RGF+++ GY G +DY+ H A
Sbjct: 109 LPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLIDALN 168
Query: 199 MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
+ LD R E A YST++FT A+ +I NH ++PLFLYLA + H E
Sbjct: 169 VTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----E 223
Query: 259 PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
PLQ P+ YL + I+D R +A ++ +DE+VG V AL+ + +N++ +F +DNGG
Sbjct: 224 PLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGG 283
Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
L +NWPLRG K +LWEGGVRG G + SPLL+ +G+ + +H+SDWLPTL+
Sbjct: 284 Q----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVK 339
Query: 379 AA 380
A
Sbjct: 340 LA 341
Score = 42.7 bits (99), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 89 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 124
>gi|125820285|ref|XP_692237.2| PREDICTED: arylsulfatase I-like [Danio rerio]
Length = 568
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 140/325 (43%), Positives = 202/325 (62%), Gaps = 12/325 (3%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHIIFIL DD G+ND+G+H +I +P +D LA G+ L+NYY LCTPSRS ++TG+
Sbjct: 43 PPHIIFILTDDQGFNDIGYHS-GEIRSPTLDKLASEGVRLENYYVQPLCTPSRSQLITGR 101
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LPL LPQ L+E+GY T +VGKWHLGFY+K+ PT RGF
Sbjct: 102 YQIHTGLQHSIIRPRQPNCLPLDVVTLPQRLQEIGYSTHMVGKWHLGFYRKDCLPTRRGF 161
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
++ G TG DY+ + S + + G D+ AW GKYST ++T I+ H
Sbjct: 162 HTYFGSLTGSVDYYTYGSCDGKSLCGFDLHEGESVAWGRAGKYSTHLYTQRVRKILATHD 221
Query: 238 -TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
T +PLF++L+ A H+ PL+ P Y+ +R+I + R K+AA++ +DE+V +
Sbjct: 222 PTSQPLFIFLSLQAVHT-----PLKPPKEYIYPYRNIGNVPRRKYAAMVSIVDEAVRNIT 276
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
AL + NS+++F +DNG L SNWPLRG K T WEGG+RG G + SPL+
Sbjct: 277 YALRKYGFYRNSVVIFSTDNGAQP----LTGGSNWPLRGCKGTYWEGGIRGVGFVHSPLI 332
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAAN 381
R ++ +H++DW PTL+ A
Sbjct: 333 RHRRRISRDLIHITDWYPTLVGLAG 357
>gi|72159051|ref|XP_791089.1| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 545
Score = 283 bits (723), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 153/331 (46%), Positives = 205/331 (61%), Gaps = 23/331 (6%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHI+FILADD G+ND+G+ + TPN+D LA GI L NYY +CTPSR+ +M+GK
Sbjct: 58 PPHIVFILADDYGFNDIGYRN-PAMRTPNLDYLAAEGIKLDNYYVQPICTPSRAQLMSGK 116
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH++++ + LPL LPQ LKE GY T + GKWHLGFYKKE PT RGF
Sbjct: 117 YQIHTGLQHSIIWPPQPNCLPLDLPTLPQKLKEAGYATHMAGKWHLGFYKKECWPTNRGF 176
Query: 179 ESHLGYWTGHQDYFDHSAEEM-----KMW---GLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
+S LG G D+F H+ E W GLD R L+ G YST V
Sbjct: 177 DSFLGILLGKGDHFLHTEEGGGGPYPSTWPWEGLDFRDGLQSTNAYSGIYSTHVIAERVE 236
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDFKRSKFAAILHKLD 289
+II H D+PLFLY++ A H+ PLQ P+ YL I+D KR +A + + +D
Sbjct: 237 NIIEKHDKDKPLFLYVSFQAVHT-----PLQVPESYLQPFESSIQDEKRRIYAGMTYCMD 291
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN-AASNWPLRGVKNTLWEGGVRGA 348
E+VG + + L+++ + ++++VF SDNGG N++ ASNWPLRG K TLWEGGVR
Sbjct: 292 EAVGNITKKLKKQGLWDDTVLVFSSDNGG-----NIDQGASNWPLRGSKTTLWEGGVRAV 346
Query: 349 GLIWSPLLESR--GIVAEQYVHVSDWLPTLL 377
G + SPLL R G V+ + + +SDW PTL+
Sbjct: 347 GFVTSPLLSERMKGTVSRELIDISDWYPTLI 377
>gi|410906623|ref|XP_003966791.1| PREDICTED: arylsulfatase J-like [Takifugu rubripes]
Length = 560
Score = 283 bits (723), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 152/359 (42%), Positives = 211/359 (58%), Gaps = 25/359 (6%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSG------------PPHIIFILADDLGWNDVGFHGLDQ 82
M +PL+F L + V SS PH++FIL DD G+ DVG+HG +
Sbjct: 1 MLILWVPLSFLLGFIISHSVQSSWENWRNGGGEFDKQPHVVFILVDDQGFRDVGYHG-SE 59
Query: 83 IPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSE 142
I TP +D LA G+ L+NYY LC+PSRS +MTG++ IHTG+QH+++ + LPL
Sbjct: 60 IKTPTLDRLAAQGVKLENYYVQPLCSPSRSQLMTGRYQIHTGLQHSIIRATQPNCLPLEN 119
Query: 143 KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKM 201
LP LK+ GY T +VGKWHLGFYK+ PT RGF++ G G D++ H E M
Sbjct: 120 VTLPLKLKQAGYATHMVGKWHLGFYKRGCLPTQRGFDTFFGSLLGSGDHYSHYKCEAPGM 179
Query: 202 WGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEP 259
G D+ E AW+ G YST +FT +A+ I+ H +PLFLYLA+ A HS P
Sbjct: 180 CGYDLYEGEEAAWEQDRGLYSTVMFTQKAISILAKHDPHRKPLFLYLAYQAVHS-----P 234
Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
LQ P YL ++ I + R K+AA++ LDE++ + AL++ N+++V+ SDNGG
Sbjct: 235 LQVPSRYLERYKGISNVHRRKYAAMVSCLDEAIRNLTLALKRYGYYDNTVLVYSSDNGGQ 294
Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
L SNWPLRG K + WEGG+R G + SPLL ++G +H++DW PTL+S
Sbjct: 295 P----LLGGSNWPLRGSKASYWEGGIRAVGFVHSPLLRNKGTKCRSLIHITDWFPTLVS 349
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 53/217 (24%), Positives = 82/217 (37%), Gaps = 62/217 (28%)
Query: 517 SVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEID 576
S L RN+ +K +++H D W + ++ G+ GT E N +D
Sbjct: 324 SPLLRNKGTKCRSLIHITD--WFPTLVSLGE------------GTLEEHLN-------LD 362
Query: 577 GIDVWSVLSRNEPSKRNTILHNID--------DEWQI----------SALTKGKWKLVKV 618
G DVW +S PS R+ ILHNID W+ +AL G WKL+
Sbjct: 363 GYDVWEAISEGRPSPRHDILHNIDPIYIKAKNGSWKAGYGIWNTAIQAALRVGHWKLLTG 422
Query: 619 VKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLF 678
+ V P Q + + W +R R + LF
Sbjct: 423 IPGYSDWV-----PPQTFSNQQMGSRWHNEHIRFDRGKSL-----------------WLF 460
Query: 679 DIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
+I DP E+ +L+ R Q + + ++N+ A P
Sbjct: 461 NITADPYERVDLSQRYP-QIVKKMLIRLAQYNKTAVP 496
>gi|115947271|ref|XP_790151.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 500
Score = 282 bits (722), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 149/358 (41%), Positives = 218/358 (60%), Gaps = 24/358 (6%)
Query: 44 FTLSMVFVDLVAS-----SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
F ++++ + V S S P+I+FIL DDLG+NDVG+HG +I TPNID LA G+ L
Sbjct: 6 FFINLIIISNVCSKKIIQSTKPNIVFILIDDLGYNDVGYHG-SEIYTPNIDKLAREGVRL 64
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
+NYY +CTP+RS +++G++ IHTG+QH+ + + LP + L+E GY T
Sbjct: 65 ENYYVQPICTPTRSQLLSGRYQIHTGLQHSYIRPAQPLCLPTNLPTFADKLREAGYATHA 124
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKM------W-GLDMRRDLE 211
VGKWHLGFYKKE PT RGF+S+ GY TG +DY+ H + W G+D+ + +
Sbjct: 125 VGKWHLGFYKKECLPTQRGFDSYFGYLTGGEDYWTHHRAGDGLLPNSNHWLGMDLWDNEK 184
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
AW+ G YS VFT + I+ H ++P LYL + HS PLQ P Y ++
Sbjct: 185 VAWEYVGNYSAFVFTEKIQQIVAQHPVEQPFLLYLPFQSVHS-----PLQVPSSYEERYK 239
Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
+I++ R +A ++ LDE+VGK+V +L+Q + N++++F +DNGG A +NW
Sbjct: 240 NIKNTNRRIYAGMMTCLDEAVGKIVHSLQQAGVWDNTVLIFSTDNGGEVAA----GGNNW 295
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
PLRG K ++WEGG+RG G + S LL + +G V +Q +HVSDW PTL+ S + N
Sbjct: 296 PLRGWKRSIWEGGMRGVGFVNSRLLPASVQGTVNKQLIHVSDWFPTLVQGVAGSSLDN 353
>gi|126697470|gb|ABO26692.1| sulfatase 1A precursor [Haliotis discus discus]
Length = 477
Score = 281 bits (719), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 149/344 (43%), Positives = 204/344 (59%), Gaps = 19/344 (5%)
Query: 46 LSMVFVDLVA-------SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
LS+V +VA ++ PHI+FI+ DDLGWNDVG+ D I TP +D LA SG+IL
Sbjct: 3 LSVVLAIIVALCLKISTAADKPHIVFIVVDDLGWNDVGWINPD-IKTPTLDRLARSGVIL 61
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
+ Y LC+PSR+ M+GK P HTG+Q V++ + +P + +PQ LK LGY T +
Sbjct: 62 NSSYVQPLCSPSRNCFMSGKFPYHTGLQDKVVFIEQPKYMPANLTTIPQRLKTLGYDTHM 121
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW-GLDMRRDLEPAWDLH 217
VGKWHLGF +YTPT+RGF++ +GY+ G +DYF H +E+ + G D R +
Sbjct: 122 VGKWHLGFCNWKYTPTYRGFDTFMGYYAGMEDYFTHVRDEVADYNGYDFRNMTDVYKGAQ 181
Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
G+YST VF A+DII NH +P++LYL A H PLQ P Y + + HI D
Sbjct: 182 GEYSTYVFANRAIDIIMNHDKSKPMYLYLPFQAVHV-----PLQVPTKYSDRYSHIHDLN 236
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
R ++ ++ LD++V + LE + N ++VF +DNGG A+NWPLRG K
Sbjct: 237 RKVYSGMVSALDDAVYNITTVLEDLGFMDNLLLVFTTDNGGPPT----RGANNWPLRGGK 292
Query: 338 NTLWEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAA 380
NTLWEGG RGA ++S LL+ +G H DW PTLL A
Sbjct: 293 NTLWEGGTRGAAFVYSKTLLKKKGYTHPGLFHAVDWYPTLLDIA 336
>gi|395736371|ref|XP_003780537.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Pongo abelii]
Length = 481
Score = 280 bits (717), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 143/321 (44%), Positives = 205/321 (63%), Gaps = 11/321 (3%)
Query: 64 FILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHT 123
FI+ DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++ IHT
Sbjct: 39 FIVTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHT 97
Query: 124 GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG 183
G+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++ LG
Sbjct: 98 GLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLG 157
Query: 184 YWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPL 242
TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS PL
Sbjct: 158 SLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPL 217
Query: 243 FLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQR 302
FLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL++
Sbjct: 218 FLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRY 272
Query: 303 RMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIV 362
+NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 273 GFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQRT 328
Query: 363 AEQYVHVSDWLPTLLSAANKS 383
+ +H++DW PTL+ A +
Sbjct: 329 SRALMHITDWYPTLVGLAGGT 349
>gi|260794509|ref|XP_002592251.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
gi|229277467|gb|EEN48262.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
Length = 487
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 139/318 (43%), Positives = 194/318 (61%), Gaps = 14/318 (4%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+FI+ADDLGWNDVG+H D + TP +D LA+ G+IL Y +CTPSR+A MTG +
Sbjct: 29 PHILFIVADDLGWNDVGWHNPD-VKTPVLDKLAHEGVILNQSYVNYVCTPSRTAFMTGYY 87
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P H G QH V + G+P + LP+ LK+LGY T +VGKWHLGF +YTPT+RGF+
Sbjct: 88 PYHAGSQHLVFLPQQAQGIPYNFTFLPEKLKDLGYATHMVGKWHLGFCNWKYTPTYRGFD 147
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
S GY+ +DY+ H GLD+R D E +G+Y T FT VDII H D
Sbjct: 148 SFYGYYNADEDYYTHVVAG----GLDLRDDKEVVNTKNGQYGTYFFTDRMVDIIEKHPAD 203
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLYL H EPL+ P+ + NI+ ++++ R K ++ LDE+VG V A+
Sbjct: 204 TPLFLYLPFQNVH-----EPLEVPERFENIYMNVQNENRRKLLGMVSALDEAVGNVTMAM 258
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
+ + N++++F +DNG G+ + + +N+PLRG K TLWEGG RG + +L+
Sbjct: 259 KNAGLWDNTLVIFTTDNG----GWIIASGNNYPLRGGKVTLWEGGTRGVAFVHGKMLQKT 314
Query: 360 GIVAEQYVHVSDWLPTLL 377
G + +H DW PT+L
Sbjct: 315 GYTNNEMIHAVDWFPTIL 332
>gi|443696989|gb|ELT97571.1| hypothetical protein CAPTEDRAFT_178894 [Capitella teleta]
Length = 503
Score = 278 bits (712), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 146/347 (42%), Positives = 215/347 (61%), Gaps = 23/347 (6%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L TLS ++ +S P+I+FILADD G++DVG+HG I TPNID LA+ G+ L+NY
Sbjct: 16 LCITLSQ---PVLGASKQPNILFILADDYGYHDVGYHG-SAIRTPNIDRLAFEGVRLENY 71
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y +CTP+RS +++G++ IHTG+QH++++ + LP L L+E GY IVGK
Sbjct: 72 YVQPICTPTRSQLLSGRYQIHTGLQHSIIWAAQPNALPKELPTLADKLREEGYANHIVGK 131
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS-----AEEMKMWGLDMRRDLEPAWDL 216
WHLGFYK+EY PT RGF+S GY TG + Y++ + + GLD R + +
Sbjct: 132 WHLGFYKEEYVPTNRGFDSFYGYLTGSEFYYNKTYCLAQINRSDVCGLDFRENDRSINE- 190
Query: 217 HGKYSTDVFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
+YST ++ ++ +H++ D+PLFLYLA + H PL+ P Y ++HI
Sbjct: 191 -SEYSTHLYAERTKQLVADHTSAHPDQPLFLYLALQSVHG-----PLEVPAQYRTPYKHI 244
Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
+D R +A ++ +DE+V V +A ++ + N+I+VF +DNGG A +NWPL
Sbjct: 245 KDENRQIYAGMVSCMDEAVKNVTDAFKKYGLWDNTILVFSTDNGGQVA----EGGNNWPL 300
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
RG K +LW+GG+RG G + S LL+ +G VA Q +HVSDW PTLL+ A
Sbjct: 301 RGWKGSLWDGGMRGVGFVHSTLLKQKGAVAHQLMHVSDWFPTLLNQA 347
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 65/169 (38%), Gaps = 44/169 (26%)
Query: 561 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNID--------------DEWQIS 606
T N++ N+ ++DG D W+ +S R ILHNID D +
Sbjct: 342 TLLNQAGGNASSVKLDGFDQWAAISDGAEGPRTEILHNIDPLNGKRGKRTEEVFDNTVRA 401
Query: 607 ALTKGKWKLVKVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKE 666
AL KWKL LTG P + W+ R+L P K
Sbjct: 402 ALRMNKWKL------------LTGEP--------GNGSWVPPPEREL---------PTKL 432
Query: 667 VPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
P + LFDI+ DP E +N+A + + + ++ +N A P
Sbjct: 433 DPSSKKQNIWLFDIEADPYEISNVAAENP-EVVKTMLLKLAAYNATAVP 480
>gi|291239534|ref|XP_002739678.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 648
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 191/565 (33%), Positives = 280/565 (49%), Gaps = 66/565 (11%)
Query: 39 VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
V+ L +L VAS+ PPHI+F LADDLGW+DVG+H I TPNID LA G+ L
Sbjct: 7 VIILLISLISFVSSTVASTKPPHIVFFLADDLGWHDVGYHD-SIIRTPNIDKLAAEGVKL 65
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
+NYY +CTP+R+ +MTG++ IHT MQH VL E+ LP E ++PQ LKE GY T +
Sbjct: 66 ENYYVTPICTPTRAVLMTGRYQIHTTMQHGVLMAQEQRCLPTDEVLMPQKLKESGYSTHM 125
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS----AEEMKMWGL-DMRRDLEPA 213
VGKWHLGFYK + TP RGF++ G++ +YF H+ + W L D + + P
Sbjct: 126 VGKWHLGFYKWDCTPNHRGFDTFFGFYLAGGEYFTHTRKCHGHRLDAWDLRDGDKMVGPE 185
Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH-RH 272
+ G+YST ++ +A ++I + P+FLY++ A H+ PL+ PD Y + + +
Sbjct: 186 YT--GEYSTMLYARKAQEVIRKQDPNVPMFLYVSFQAVHA-----PLEVPDSYADAYGKD 238
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
I D R +A + +DE++G V + L+ R M N+++VF +DNGG A GF ASN+P
Sbjct: 239 IYDQSRKLYAGMTSCMDEAIGNVTQTLKDRGMWDNTVVVFSTDNGG-ATGF---GASNYP 294
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKSDIPNYVN 390
LRG K +EGGVRG + SPLL+ RG + ++V DWLPT + A +
Sbjct: 295 LRGQKCGHYEGGVRGPAFVSSPLLKPHVRGTKNTELMYVGDWLPTFVHLAGGTTHGTKPL 354
Query: 391 STVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNP 450
V E R + H + R ++ + E NP ++ R +
Sbjct: 355 DGVNQWEAISEGLPSRRKEILHNIDPIRAKSPTLKTEPWM--LNPYFDIRVMAAIRVGHW 412
Query: 451 KYENRYENGTHEYNGPK--NENTNPRYENGTHEYNI---PRLENSING------------ 493
K + + P+ N TNP N Y+I P N I+
Sbjct: 413 KLLTGQKGNANWQEPPELGNRLTNPYTNNILQLYHIKEDPEERNDISATRPDVVYRILLR 472
Query: 494 ----NGTS---ENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNID----------- 535
N T+ + + S + + WS + PS+R ILHNID
Sbjct: 473 LQALNSTAVLPTHVDPEFSLSDPVKHDMAWSPWAEGAPSRRTEILHNIDQLRGKTPVFLN 532
Query: 536 DEWQ---------ISALTRGKWKLV 551
+ W ++A+ G WKL+
Sbjct: 533 EAWMNNPYFDIRVMAAIRVGHWKLL 557
>gi|198420473|ref|XP_002123848.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 517
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 142/352 (40%), Positives = 213/352 (60%), Gaps = 17/352 (4%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ---IPTPNIDAL 91
M + L F L ++ V A P+I+FILADD G+ND+G+H ++ + TP +D+L
Sbjct: 1 MYLLRVVLLFGLHVLCVP--AELTKPNIVFILADDYGFNDIGYHAVEHHSDMKTPFLDSL 58
Query: 92 AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
A +G+ L+NYY +C+PSRS +M+G++ IHTG+QH V+ +R GLPL ILP+ L +
Sbjct: 59 AMAGVRLENYYIQPICSPSRSVLMSGRYQIHTGLQHYVISPQQRNGLPLDNIILPEQLHK 118
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
GY T +VGKWHLGFYK EY P RGF S+ GY TG +DY+ + K+ G DM +
Sbjct: 119 CGYDTHMVGKWHLGFYKDEYLPWKRGFNSYFGYLTGGEDYYTKWRCDGKLCGYDMTSEKG 178
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
P +G+YS ++F +A + I H +PLFLY+A + HS P++ P+ Y
Sbjct: 179 PTNATYGQYSANLFANKANEAIDKHDKTKPLFLYVAFQSVHS-----PMEVPESYAKPFD 233
Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
+I++ R + ++ +DE+V + E L+ + N+I+VF +DNGG L+ +NW
Sbjct: 234 YIKNHNRKMYGGMVAAMDEAVKNITEHLQAAGLWDNTILVFSADNGGQT----LSGGNNW 289
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRG---IVAEQYVHVSDWLPTLLSAA 380
PLRG K TLWEGG++G G + +L IV + +H+SDW PT++ A
Sbjct: 290 PLRGRKLTLWEGGIKGVGFVHGKILNVPNPNYIVNNEMIHISDWFPTIMEAT 341
>gi|115644393|ref|XP_781330.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 588
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 146/331 (44%), Positives = 203/331 (61%), Gaps = 17/331 (5%)
Query: 59 PPHIIFILADDLGWNDVGFH---GLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
PPH+I I+ADDLG+NDVG+H G I TPNID +AYSG+ L+NYY +CTP+RS ++
Sbjct: 93 PPHVIMIIADDLGYNDVGYHAKYGRSMIRTPNIDEMAYSGVRLENYYVQPVCTPTRSQLI 152
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TG++ IHTGMQH L+ LPL E L Q LK+ GY T VGKWHLG+ K+ P+
Sbjct: 153 TGRYQIHTGMQHLNLFPGRPCCLPLDETTLAQALKKQGYSTHAVGKWHLGYAWKDCLPSR 212
Query: 176 RGFESHLGYWTGHQDYFDHSAEEM----KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
RGFES G G D++ H+ + + G M + W G +ST ++T A
Sbjct: 213 RGFESFFGNIMGSADHWSHNKTALFGDKLVMGKSMYYNERIYWKHEGTFSTTLYTNRARQ 272
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFAAILHKLDE 290
+I ++PLFLYL++ A H+ PL P+ Y + I + KR ++A +++ LDE
Sbjct: 273 LIRKQPRNKPLFLYLSYEAVHT-----PLNVPEQYAKPYEGIIHNSKRRRYAGLVNILDE 327
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
+V V EAL+ + NS+I+F +DNGG ++ +NWPLRG K+TLWEGG+RG G
Sbjct: 328 AVRNVTEALKYNGLYDNSVIIFTTDNGGRPKPRSV--GNNWPLRGGKSTLWEGGIRGVGF 385
Query: 351 IWSPLL--ESRGIVAEQYVHVSDWLPTLLSA 379
+ SPL+ E RG V Q +HVSDW PT++
Sbjct: 386 VHSPLIPWELRGTVNRQLIHVSDWFPTIVXG 416
>gi|291222022|ref|XP_002731018.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 1410
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 196/321 (61%), Gaps = 16/321 (4%)
Query: 62 IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
I+FI+ DD+GWNDVG+HG I TP++D LA G L+NYY LC+PSR +TGKH I
Sbjct: 29 IVFIMIDDVGWNDVGYHG-SSISTPHMDTLAKEGTKLENYYVAHLCSPSRGMFLTGKHMI 87
Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
H GM+ ++ ER LP++E + Q LK Y T +GKWH+G+YKK P RGF++
Sbjct: 88 HLGMEGGIIMPFERKCLPVNEATIAQELKLKNYSTHAIGKWHVGYYKKACLPNNRGFDTF 147
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEP 241
G G D++ H + W L R ++ A + G YSTD++ EA ++I NH +P
Sbjct: 148 FGIIGGCADHYTH--KNTHWWEL-YRNNISIAQEYQGHYSTDLYAREATNVIRNHDASKP 204
Query: 242 LFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQ 301
+F+YLA A H PLQAP Y++++ +IED R +AA++ +D+++G VV+ L +
Sbjct: 205 MFMYLAFQAAHL-----PLQAPRKYIDMYSNIEDPDRRVYAAMVTCMDDAIGNVVDQLRE 259
Query: 302 RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES--R 359
+ +N++++ +DNGGA N NWPL+G K +LWEGGVRG + SP L R
Sbjct: 260 AGLWNNTVLIVSTDNGGAKISGN-----NWPLKGSKASLWEGGVRGVAFVTSPFLADHVR 314
Query: 360 GIVAEQYVHVSDWLPTLLSAA 380
G Q +HV+DW PT+L A
Sbjct: 315 GTSNHQLMHVTDWFPTMLHVA 335
>gi|443704600|gb|ELU01579.1| hypothetical protein CAPTEDRAFT_176799 [Capitella teleta]
Length = 476
Score = 276 bits (706), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 140/325 (43%), Positives = 207/325 (63%), Gaps = 15/325 (4%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PP+I+FI++DD GW+D+G+HG +I TP +D LAY+GI L+NYY +C+P+RS M+G
Sbjct: 7 PPNILFIMSDDYGWHDIGYHG-SKIRTPVLDDLAYNGIRLENYYVQPICSPTRSQFMSGV 65
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QHNV++ + GLPL + ++E GY T + GKWHLG+YK+EY P RGF
Sbjct: 66 YQIHTGLQHNVIWPAQANGLPLEFPTIADKMREAGYATHMAGKWHLGYYKEEYLPHNRGF 125
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-- 236
+++ GY G +DY+D S G D R + + W+L YST ++ + +I+ NH
Sbjct: 126 DTYYGYLNGCEDYYDKSYCHPYC-GYDFRLNDDIQWNL-TDYSTYLYVSRVNEILLNHKI 183
Query: 237 -STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
S D+PLFLYL + H EPL+ P Y + + HI+D R +A ++ +DE+VGK+
Sbjct: 184 YSPDKPLFLYLPLQSVH-----EPLEVPKEYSDKYSHIKDNNRRTYAGMVAAMDEAVGKI 238
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
+ ++ M +++ F +DNGG +NWPLRG K +LWEGG+RG G + SPL
Sbjct: 239 RDLFKKYEMWDETVLAFSTDNGGQIH----RGGNNWPLRGWKISLWEGGMRGVGFVHSPL 294
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
L + G ++ +HVSDWLPTL+ A
Sbjct: 295 LPNSGGKSKGLMHVSDWLPTLVHVA 319
>gi|348520018|ref|XP_003447526.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
Length = 732
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 194/323 (60%), Gaps = 12/323 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+FIL DD G+ND+G+H I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 49 PHIVFILTDDQGFNDIGYHN-PTIKTPTLDKLAAEGVKLENYYVQPICTPSRSQLITGRY 107
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH+++ + LP LP+ L+E GY T +VGKWHLGFY+K PT +GF+
Sbjct: 108 QIHTGLQHSIIRPRQPSCLPSHMDTLPERLREAGYTTHMVGKWHLGFYRKACLPTRKGFD 167
Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+ G TG DY+ + S + + G D+ D AW GKYST +FT A I+ +H
Sbjct: 168 TFFGSLTGSVDYYSYESCDGKDLCGYDLHDDEGVAWGQEGKYSTTLFTQRARKILESHDP 227
Query: 239 DE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
E PLFL L+ A H+ PL P+ Y+ R + + R FA ++ +DE+V V
Sbjct: 228 AERPLFLLLSFQAVHT-----PLNPPESYIYPFRDMVNVARRNFAGMVSTVDEAVRNVTY 282
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL + NS+I++ +DNG +NWPLRG K T WEGGVRG G + SPLL
Sbjct: 283 ALRKYGYYRNSVIIYSTDNGAQP----FTGGNNWPLRGRKGTYWEGGVRGVGFVHSPLLR 338
Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
R V++ +H++DW PTL+ A
Sbjct: 339 RRRRVSKALMHITDWFPTLVGLA 361
>gi|327265410|ref|XP_003217501.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Anolis
carolinensis]
Length = 580
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 138/321 (42%), Positives = 205/321 (63%), Gaps = 11/321 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
H+IFIL DD G++D G+HG D I TP +D LA G+ L+NYY +CT SRS ++TG++
Sbjct: 59 HLIFILTDDQGFHDXGYHGSD-IXTPTLDRLAAEGVKLENYYIRPICTLSRSQLITGRYQ 117
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LP ++ LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 118 IHTGLQHSIIRPQQPNCLPFNQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 177
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW+ GKYST ++ I+ H+
Sbjct: 178 FLGSLTGNVDYYTYDNCDGPGVCGYDLHDGENVAWEQSGKYSTFLYAQRVNKILAAHNPK 237
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
EP+F+Y+A A H+ PLQ+P Y+ +R + + R K+AA++ +DE+V + AL
Sbjct: 238 EPIFIYIAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITWAL 292
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ NS+IVF +DNGG + SNWPLRG K T WEGGVRG G + SPL++ +
Sbjct: 293 KKYGYYDNSVIVFSTDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIKHK 348
Query: 360 GIVAEQYVHVSDWLPTLLSAA 380
++ +H++DW PTL++ A
Sbjct: 349 RRISRALIHITDWYPTLVTLA 369
>gi|291226838|ref|XP_002733395.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 498
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 146/359 (40%), Positives = 222/359 (61%), Gaps = 20/359 (5%)
Query: 30 YRTR---IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTP 86
Y TR ++AFA++ LA ++S+V + + S PH++FI+ADDLGWND+G++ I TP
Sbjct: 4 YNTRSQSMLAFALVVLA-SMSLV---ICSGSRKPHVVFIVADDLGWNDIGYNN-PSIFTP 58
Query: 87 NIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILP 146
+D LA G+IL Y + +CTP R+++M+G + G+QH VL E GLPL+ ++P
Sbjct: 59 TLDKLAREGVILNQSYVLPMCTPDRASLMSGYYAYRVGLQHKVLDHAEPAGLPLNFTLIP 118
Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDM 206
Q +KE GY T ++GKWHLGF K EYTPT+RGF+ G++ +DYF+H+ + LD+
Sbjct: 119 QRMKEHGYTTYMLGKWHLGFCKWEYTPTYRGFDHFYGFYNAAEDYFNHTTSKY----LDL 174
Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
R E W +G YST ++ +A + I H+ P+++YL + H ++AP Y
Sbjct: 175 RNGKEVDWSKNGTYSTYMYAEKATEYIATHNKSTPMYMYLPFQSVHGV-----IEAPQKY 229
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN 326
L+++ I D R +A++ +D++V V+ ALE+ + N+++VF +DN N
Sbjct: 230 LDMYTFIHDTNRRIKSAMVTAMDDAVKVVIAALERYELWDNTLLVFTTDN--GGPANPNN 287
Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS-AANKSD 384
A +NWPLRG K TLWEGG RG G + +LE +G V + +HV+DW PTLL A K+D
Sbjct: 288 AGNNWPLRGSKLTLWEGGTRGVGFVHGKMLEKKGYVNNEMMHVTDWYPTLLHIAGGKAD 346
>gi|410924964|ref|XP_003975951.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
Length = 574
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 138/324 (42%), Positives = 199/324 (61%), Gaps = 12/324 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+FIL DD G+ND+G+H I TP +D LA G+ L+NYY +CTPSRS +MTG++
Sbjct: 46 PHIVFILIDDQGFNDIGYHN-PTIKTPTLDKLAAEGVRLENYYVQPICTPSRSQLMTGRY 104
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH+++ + LP LP+ L++ GY T +VGKWHLGFY+K PT +GF+
Sbjct: 105 QIHTGLQHSIIRPSQPSCLPSHMDTLPERLRQAGYSTHLVGKWHLGFYRKACLPTRKGFD 164
Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS- 237
+ G TG D++++ S + + G D+ AW GKYST +FT A I+ +H+
Sbjct: 165 TFFGSLTGSVDHYNYLSCDGPGVCGYDLHDGEGVAWGQEGKYSTTLFTQRARKILESHNP 224
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
T++PLFL L+ A H+ PLQ P Y+ +R + + R K AA++ +DE+V V
Sbjct: 225 TEKPLFLLLSLQAVHT-----PLQTPKSYIYPYRDMANIARRKLAAMVSTVDEAVRNVTY 279
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL + NS+I++ +DNG SNWPLRG K T WEGG+RG + SPLL+
Sbjct: 280 ALRKYGFYRNSVIIYSTDNGAQP----FTGGSNWPLRGRKGTYWEGGIRGVAFVHSPLLK 335
Query: 358 SRGIVAEQYVHVSDWLPTLLSAAN 381
R V++ +H++DW PTL+ A
Sbjct: 336 RRRRVSKALLHITDWFPTLVGLAG 359
Score = 46.6 bits (109), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 47/153 (30%), Positives = 60/153 (39%), Gaps = 31/153 (20%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID---------DEWQIS---ALTKGKWKLVKVVKVM 622
+DG DVW +S S R ILHNID W S A+ G WKL+
Sbjct: 368 LDGFDVWPTISEGRDSPRQEILHNIDPLHKHVTSESLWDTSIQAAIRVGDWKLL-----T 422
Query: 623 RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKN 682
Y P QV L L R W D P K + LF+I
Sbjct: 423 GYPGHGDWVPPQV-LPTLPGRWW---------DLERNILSPYKNSTYK---NIWLFNITA 469
Query: 683 DPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
DPCE+++LAD+ D + + +NQ A P
Sbjct: 470 DPCERHDLADQRPDV-VLQLLARLAYYNQTAVP 501
>gi|291232045|ref|XP_002735970.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 500
Score = 273 bits (699), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 152/340 (44%), Positives = 201/340 (59%), Gaps = 14/340 (4%)
Query: 44 FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
F L +V +++ PPHIIFI+ DDLGWNDVG++ TP ID LA SG+ L NYY
Sbjct: 7 FLLLLVSGNVLGREKPPHIIFIVVDDLGWNDVGYNN-PVFKTPTIDRLAGSGVKLLNYYV 65
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
C PSR+ +MTG+H I G+QH+ YG LPL+E + + LK +GY T IVGKWH
Sbjct: 66 ASHCLPSRNMLMTGRHAIQLGIQHDD-YGFHPRSLPLNETTIAEPLKHVGYSTHIVGKWH 124
Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD 223
GFY P RGF++ G+ D++ HS M L D A GKYST
Sbjct: 125 CGFYSDNCLPHNRGFDTFFGFVGAGIDHYTHSDHFNHMHNLRKNDDC-IAKKYIGKYSTT 183
Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFA 282
+F E DII+ H ++PLFLYL+ +A H+ PL+ P YL + I D R +A
Sbjct: 184 IFANEGKDIINAHDQNKPLFLYLSFSAVHA-----PLEVPSSYLKQYESTIHDEDRRTYA 238
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
A+ +DE++ + EALE++ ML NSIIVF SDNGGA +A +NWPLRG K+T WE
Sbjct: 239 AMTSCVDEAIANITEALEEKNMLKNSIIVFTSDNGGALVN---SAGNNWPLRGGKHTSWE 295
Query: 343 GGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
GGVR G ++ LL SRG +H++DW PTL++ A
Sbjct: 296 GGVRAIGFVYGDLLPRHSRGTENTGLMHITDWFPTLITLA 335
>gi|292620475|ref|XP_002664306.1| PREDICTED: arylsulfatase I-like [Danio rerio]
Length = 558
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 141/324 (43%), Positives = 200/324 (61%), Gaps = 12/324 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHIIFI+ DD G+ND+G+H D I TP +D LA +G+ L+NYY +CTPSRS +TG++
Sbjct: 38 PHIIFIMTDDQGFNDIGYHNTD-IHTPTLDRLAAAGVKLENYYIQPICTPSRSQFITGRY 96
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH+++ + LP + LPQ L+E GY T +VGKWHLGFYK++ PT RGF
Sbjct: 97 QIHTGLQHSIIRSRQPSCLPFGLRTLPQRLQEAGYATHMVGKWHLGFYKRDCLPTRRGFN 156
Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS- 237
++ G TG DY+ + S + K+ G D+ AW G+YST ++T I+ H
Sbjct: 157 TYFGSLTGSVDYYTYKSCDGPKVCGFDLHDGERVAWGQGGRYSTHLYTQRVRKILAAHDP 216
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+ +PLF++L+ A H+ PLQ+P Y+ +R + + R K+A ++ +DE+V V
Sbjct: 217 SSQPLFIFLSFQAVHT-----PLQSPKEYVYPYRRMGNMFRRKYAGMVSAVDEAVRNVTY 271
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL + N++I F +DNGG L SNWPLRG K T WEGGVRG G + SPLL
Sbjct: 272 ALRKYGYYKNTVIFFSTDNGGQP----LFGGSNWPLRGRKGTYWEGGVRGIGFVHSPLLR 327
Query: 358 SRGIVAEQYVHVSDWLPTLLSAAN 381
R V+ +H++DW PTL+ A
Sbjct: 328 RRRRVSRALIHITDWYPTLMRLAG 351
>gi|432911274|ref|XP_004078601.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
Length = 572
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 143/333 (42%), Positives = 198/333 (59%), Gaps = 14/333 (4%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHI+FIL DD G+ND+G+H I TP +D LA G+ L+NYY +CTPSRS ++TG+
Sbjct: 43 PPHIVFILIDDQGFNDIGYHN-PTIKTPTLDKLAAEGVKLENYYVQPICTPSRSQLLTGR 101
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LP LP+ L+ GY T +VGKWHLGFY+K PT +GF
Sbjct: 102 YQIHTGLQHSIIRSRQPSCLPRHMDTLPETLRRAGYSTHMVGKWHLGFYRKSCLPTRKGF 161
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
++ G TG DY+ + S + G D+ + AW GKYST +FT A I+ +H
Sbjct: 162 DTFFGSLTGSVDYYSYGSCNGPGVCGYDLHDNERVAWGHEGKYSTTLFTQRAHKILKSHD 221
Query: 238 -TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
D PLFL L+ A H PLQ P ++ ++R + + R KFAA++ +DE+V V
Sbjct: 222 PADRPLFLLLSLQAVHG-----PLQPPKSFVYLYRDMANVDRRKFAAMVSTVDEAVRNVT 276
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
AL + NS+I++ +DNG SNWPLRG K T WEGGVRG + SPLL
Sbjct: 277 YALRKYGYYKNSVIIYSTDNGAQP----YVGGSNWPLRGRKGTYWEGGVRGVAFVHSPLL 332
Query: 357 ESRGIVAEQYVHVSDWLPTL--LSAANKSDIPN 387
+ R V+ +H++DW PTL L+ N S P
Sbjct: 333 KHRRRVSTALLHITDWFPTLVGLAGGNISQSPG 365
>gi|198428954|ref|XP_002125106.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 562
Score = 273 bits (698), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 137/355 (38%), Positives = 212/355 (59%), Gaps = 18/355 (5%)
Query: 36 AFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ---IPTPNIDALA 92
A +L F LS V PH++F+LADD G+ND+G+H + + TP +D+LA
Sbjct: 10 AVRLLAFVFWLSAVACGKKQRGQRPHVVFVLADDFGFNDIGYHAREHYSDMYTPFLDSLA 69
Query: 93 YSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKEL 152
G+IL+NYY +C+P+R ++TG++ IHTG+ H ++ + GLPL +L Q L++
Sbjct: 70 AKGVILENYYVQPICSPTRGQLLTGRYQIHTGLAHGIIRAAQPYGLPLDNILLSQQLRQC 129
Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK----MWGLDM-R 207
GY+T +VGKWHLGF+++EY P RGF++ G+ G ++F E K G DM
Sbjct: 130 GYKTNMVGKWHLGFFREEYLPWNRGFQNFFGFLNGGVNHFTRYHCEPKKTRRFCGYDMID 189
Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
P +G+YST++F ++ ++I H+ +P+FLYL+ A H PLQ P+ YL
Sbjct: 190 SRYGPTNATYGEYSTNLFIRKSKEMIDKHNKQKPMFLYLSLQAVHG-----PLQVPNQYL 244
Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
+HI D R +A +++ +D + ++V+ L++ RM N+I +F +DNGG
Sbjct: 245 KRFKHIRDKNRRIYAGMVYAMDRGIRQLVKHLKRARMWKNTIFIFSTDNGGQTT----RG 300
Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+NWPLRG K TLWEGG+RG G + PL + V ++ +HVSDW PT++SA +
Sbjct: 301 GNNWPLRGKKGTLWEGGIRGVGFVHGKPLQVTTPRVNKELLHVSDWYPTIMSATH 355
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 53/219 (24%), Positives = 84/219 (38%), Gaps = 62/219 (28%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID---------------DEWQISALTKGKWKLVKVV 619
IDG D W L N+ SKR ILHNID D +A+ G WKL
Sbjct: 365 IDGYDQWETLRSNKTSKRTEILHNIDPLHKKLTESEFREGFDVSVRAAIRSGDWKL---- 420
Query: 620 KVMRYQVDLTGGPDQVYLSGLSDREWLA--------LAMRKLRDAASIQCGPVKEVPCEP 671
LTG +G++ EW+ + + ++RD + K V
Sbjct: 421 --------LTG------YAGMN--EWVVPPECTHGNITIGQIRDDFHVYPNSTKNVR--- 461
Query: 672 QIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKK- 730
LF I DP E+ ++D+ + +N + +N + P +KK K+
Sbjct: 462 -----LFKITEDPHEEREVSDQYPEV-VNLLLARLKEYNTSSLPVDYPHNDKKSDPKRHG 515
Query: 731 ---------KKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
+K K+KK + + +K K + +KL+
Sbjct: 516 GFWLPWRGVGRKYKRKKFRPHRARKPKLFRRKMKFKKLQ 554
>gi|328705055|ref|XP_001946210.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 470
Score = 273 bits (697), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 133/264 (50%), Positives = 167/264 (63%), Gaps = 2/264 (0%)
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
P+ QHNV+ E GLPL+E +LP++L +LGY + VGKWHLG++KK YTPT+RGF
Sbjct: 4 QPLLARTQHNVILEPEPWGLPLNEILLPEHLNKLGYTSHAVGKWHLGYFKKAYTPTYRGF 63
Query: 179 ESHLGYWTGHQDYFDH--SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
+S G+W G+QDY+ H A G DMRRDL P W GKYST +FT EA DII H
Sbjct: 64 KSFYGFWNGYQDYYTHMVQATFASFEGFDMRRDLNPDWSSVGKYSTHLFTKEATDIITKH 123
Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
+ PLFLYLAH A H+ PLQAP +N + I+D R K+A ++ LD SV +V
Sbjct: 124 NNSVPLFLYLAHLAPHAGTYENPLQAPQEDINSFQSIKDKYRRKYAGMMKNLDNSVAEVF 183
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
AL ML N+I VFVSDNG G + N SNWPL+G K T WEGG+R A +WS LL
Sbjct: 184 NALHTNNMLDNTIFVFVSDNGAPTNGIHRNYGSNWPLKGEKATPWEGGIRTAAFVWSKLL 243
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA 380
+ + +H+SDWLPTL AA
Sbjct: 244 SKKKTLPNPLMHISDWLPTLYQAA 267
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 77/163 (47%), Gaps = 24/163 (14%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVK---------------VV 619
IDGI +W S N S R ILHNIDD +A+ +K +K
Sbjct: 277 IDGISMWESFSNNIKSPRKQILHNIDDITGYAAIRDTNFKYIKGSTFLGYLDYWSGSLSP 336
Query: 620 KVMRYQVD-LTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEV-PCEPQIAPCL 677
+RY VD + LS ++ + LR++++++C K++ PC+P I PCL
Sbjct: 337 STIRYNVDAILNSTTAGILSQINGDSLTPSLIESLRNSSTVRCDFEKDMKPCKPFIKPCL 396
Query: 678 FDIKNDPCEKNNL-------ADRSEDQRINHYTTEVGRFNQIA 713
F I DPCE NL + + +I+++ T +G+F + A
Sbjct: 397 FHIVKDPCELVNLNYKPNSKMRKFVEAKIDYFETSLGKFRESA 439
Score = 46.6 bits (109), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 39/66 (59%), Gaps = 8/66 (12%)
Query: 756 MRKLRDAASIQCGPVKEV-PCEPQIAPCLFDIKNDPCEKNNLADRSEVQ-------RINH 807
+ LR++++++C K++ PC+P I PCLF I DPCE NL + + +I++
Sbjct: 368 IESLRNSSTVRCDFEKDMKPCKPFIKPCLFHIVKDPCELVNLNYKPNSKMRKFVEAKIDY 427
Query: 808 YTTEVG 813
+ T +G
Sbjct: 428 FETSLG 433
>gi|291220870|ref|XP_002730451.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 519
Score = 273 bits (697), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 147/353 (41%), Positives = 216/353 (61%), Gaps = 21/353 (5%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
M FA+L + + + V S P+IIFIL+DD GWND+G+H I +PNI+AL
Sbjct: 1 MIFALLEILLCFHITVI--VGSQ--PNIIFILSDDHGWNDIGYHS-HIIRSPNINALCND 55
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
G+ L+NYY CTPSRS +M+G++ IHTG+QH+V+ + LPL E L + LKE+GY
Sbjct: 56 GVRLENYYIQPGCTPSRSQLMSGRYQIHTGLQHSVIRNDQPNCLPLDEVTLAEKLKEVGY 115
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR-DLEPA 213
T +VGKWHLGFY PT RGF+S GY G +DY+ H + G D++R + + +
Sbjct: 116 ATHLVGKWHLGFYTPSCLPTRRGFDSFFGYLIGQEDYYKHIHDG----GYDLKRHETDVS 171
Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH- 272
G Y+T VFT+EA +II +H PLFLY+++ + H+ LQ P+HY N++
Sbjct: 172 KQYQGDYTTHVFTSEAQNIIKSHDPSTPLFLYMSYQSVHANY----LQVPEHYSNMYNGV 227
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
I+D R A ++ +DE +G + + L++ + +N+II+F SDNGG + +NWP
Sbjct: 228 IDDEDRRIVAGMVTCMDEGIGNITQTLKETELWNNTIIIFSSDNGGDPN----DGGNNWP 283
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAANKS 383
LRG K T WEG + G G ++S + + RG V + +HVSDW PT+++ A S
Sbjct: 284 LRGEKGTHWEGAIHGLGFVYSSDIAKDVRGTVNTEMIHVSDWFPTIVNLAGGS 336
>gi|260832084|ref|XP_002610988.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
gi|229296357|gb|EEN66998.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
Length = 494
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/349 (39%), Positives = 204/349 (58%), Gaps = 18/349 (5%)
Query: 46 LSMVFVDLVASSGP-------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
LS++F+ P PHI+FI+ADDLGWNDVG+H D + TP +D LA+ G+IL
Sbjct: 2 LSLLFLATSGLQQPGGNDAAKPHIVFIVADDLGWNDVGWHNPD-VRTPVLDQLAHEGVIL 60
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
Y +CTPSR+A MTG P H G QH V + + + LP+ LK LGY T +
Sbjct: 61 NQSYVNYVCTPSRTAFMTGYFPYHVGSQHLVFRPDQPSAILSNFTFLPEKLKSLGYATHM 120
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLH 217
VGKWHLGF ++TPTFRGF+S+ GY++G +DYF H + G+D + + D +
Sbjct: 121 VGKWHLGFCNWKFTPTFRGFDSYYGYYSGAEDYFTHFRSIRNGTGGIDFHDNKDVVTDQN 180
Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
G YS +F+ A DI++ H + PLFLYL H+ PL+ P + +++ +++D
Sbjct: 181 GTYSAYLFSQRAADIVNKHDPNTPLFLYLPFQNVHA-----PLEVPKRFEDMYANVQDEN 235
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
R K ++ LDE+VG V A+++ + N++++F +DNGG + + +N+PLRG K
Sbjct: 236 RRKLLGMVSALDEAVGNVTMAMKKTGLWDNTLLIFSTDNGGMV----IQSGNNYPLRGGK 291
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
TLWEGG RG +L+ G + +H DW PTLL+AA + P
Sbjct: 292 TTLWEGGTRGVAFAHGKMLQKTGYTNNEMIHAVDWFPTLLAAAGGTADP 340
>gi|291236588|ref|XP_002738221.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 504
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 150/351 (42%), Positives = 212/351 (60%), Gaps = 26/351 (7%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
I +VL + F+ S + +D + SS HIIFI+ DDLGWNDVG+H L I TP ID LA
Sbjct: 3 ISHISVLLVVFSSSCLLIDGL-SSDQAHIIFIVVDDLGWNDVGYHNL-YIKTPTIDRLAN 60
Query: 94 SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
+G+ L NYY LC PSR+ +M+G+H H V+ G RG LPL+E + LKE G
Sbjct: 61 NGVKLNNYYAANLCVPSRNMLMSGRH------VHGVIMGYPRG-LPLNETTIANKLKEAG 113
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR-DLEP 212
Y T +VGKW+ GFY KE+ P RGF++ G+ +D++ H ++ D+RR DL
Sbjct: 114 YSTHLVGKWNCGFYSKEFLPHNRGFDTFFGFVDSKEDHYTHMVHDIS----DLRRNDLCV 169
Query: 213 AWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR- 271
A +G YST ++ E II NH T++PLFL+++ +A H EPLQ P Y +
Sbjct: 170 ADKYYGNYSTIMYGNEGTTIIDNHDTNKPLFLFMSFSAVH-----EPLQVPSVYEKEYIP 224
Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
I+D R +A ++ +DE++ ++E LE + ML NSIIVF SDNGG L + +NW
Sbjct: 225 TIDDTDRRIYAGMVSCVDEAIANIIETLENKDMLKNSIIVFTSDNGGDP----LYSGNNW 280
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
PLRG K + WEGGVR G ++S +L +RG + +H++DW PTL+ +
Sbjct: 281 PLRGWKASNWEGGVRALGFVYSEILPTNARGTDNNELMHITDWFPTLVDIS 331
>gi|405975641|gb|EKC40195.1| Arylsulfatase B [Crassostrea gigas]
Length = 684
Score = 270 bits (690), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 193/321 (60%), Gaps = 14/321 (4%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+FI+ADDLGWNDVGFH + TPNID LA G+IL Y LC+PSR A+MTG +
Sbjct: 226 PHILFIVADDLGWNDVGFHN-PAMKTPNIDKLAREGLILNQTYLQPLCSPSRHALMTGYY 284
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P H G+QH V+ + PL K LPQ LK++GY T +VGKWHLGF TPT+RGF+
Sbjct: 285 PYHAGLQHLVILPWQPVCSPLKMKFLPQRLKDIGYATHMVGKWHLGFCSWNCTPTYRGFD 344
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
S GY+ D++ H+ LD R + +P +L+G YST F + A DII H++
Sbjct: 345 SFFGYYNAQGDHYSHTWYNY----LDYRDNEKPVKNLNGTYSTFTFVSRAQDIIKKHNSS 400
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
+PLFLY+A H +P+Q P Y +++ +I+ R +F+ ++ LDE++G + + L
Sbjct: 401 QPLFLYMAFQNVH-----DPIQVPKQYEDMYPNIKTRGRRQFSGMVSALDEAIGNITDQL 455
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
+ M +++I+F SDNGG + +N+PLRG K T++EGG R + L+
Sbjct: 456 RESGMYDDTLIIFTSDNGGWPKYY----GNNYPLRGGKFTVYEGGTRVVSFVHGAGLQKT 511
Query: 360 GIVAEQYVHVSDWLPTLLSAA 380
G E +H DW TL+ AA
Sbjct: 512 GEAYESIMHAVDWSDTLVEAA 532
>gi|390364061|ref|XP_792027.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 524
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 139/331 (41%), Positives = 201/331 (60%), Gaps = 24/331 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQ---IPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
P+++FILADD G+NDVG+HG I TPN+D LA G+ L+NYY +C+P+RS +M+
Sbjct: 29 PNVVFILADDYGFNDVGYHGRSHGSAILTPNLDMLAGEGVKLENYYVQPICSPTRSQLMS 88
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
G++ IHTG+QH V+ + LPL E LPQ LKE GY T +VGKWH+GFY PT R
Sbjct: 89 GRYQIHTGLQHGVIRPPQPNCLPLDEVTLPQKLKENGYATNMVGKWHIGFYLDACLPTER 148
Query: 177 GFESHLGYWTGHQDYFD------HSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
GF+S+ + +D+F + P + G+YST +FT + +
Sbjct: 149 GFDSYFAW----EDHFSCLPXXXXXXXXXXXXXXXXXANKTPVFQYEGQYSTHLFTNKTI 204
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
D+I H +PLF+YLA+ A H PL+ PD Y++ + +I D R +A ++ +DE
Sbjct: 205 DVIERHDKTKPLFIYLAYQAVHF-----PLEVPDSYMDPYMNITDKNRRTYAGMVSCMDE 259
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
VG V AL++ + N+II+F +DNGG + A+NWPLRG K +LWEGG+ G G
Sbjct: 260 GVGNVTRALKEAGLYDNTIIIFSTDNGGPIQ----SGANNWPLRGSKGSLWEGGIHGVGF 315
Query: 351 IWSPLLES--RGIVAEQYVHVSDWLPTLLSA 379
+ SPLL + +G V + +HVSDWLPT+++
Sbjct: 316 VHSPLLPTSVKGTVNHELIHVSDWLPTIVAG 346
>gi|313232584|emb|CBY19254.1| unnamed protein product [Oikopleura dioica]
Length = 506
Score = 266 bits (680), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 130/327 (39%), Positives = 199/327 (60%), Gaps = 18/327 (5%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
VA + P+++ ++ADDLGWND+ H + TPN+D L + L++YY +CTP+RS
Sbjct: 13 VAIAKKPNVLILIADDLGWNDISLHN-SYLSTPNVDGLIQESLHLQSYYVNPICTPTRSV 71
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+M+G++ IHTG+QH V+ G + GLPL++ + P+ K+ GYRT +VGKWHLGFY ++ P
Sbjct: 72 LMSGRYQIHTGLQHAVILGAQPNGLPLTDPVQPEIFKDCGYRTHMVGKWHLGFYDEKMVP 131
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
RGFESH GY G + +++HS G+D R G+YS D+F D++
Sbjct: 132 ENRGFESHYGYLIGAEGHYNHSQFMQGQNGVDFRDGGASTNSSWGQYSADLFAKRVEDLV 191
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
H +E L++Y+ H PL+AP HY++ I+D R +AA+ +D++VG
Sbjct: 192 EAHDVEESLYMYVGLQNVHY-----PLEAPQHYVDQFSWIKDRDRRVYAAMTKSMDDTVG 246
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
+++ + + + + ++I+ F +DNGG+ L +NWPLRG+KNTLWEGG+RG G I
Sbjct: 247 RIINSFKNKGIWEDTIVYFTTDNGGST----LYGGNNWPLRGLKNTLWEGGIRGIGSIKI 302
Query: 354 PLLESRGIVA---EQYVHVSDWLPTLL 377
P G+ A +Q +HV D PTLL
Sbjct: 303 P-----GVSADKRDQLMHVVDMFPTLL 324
>gi|313233524|emb|CBY09696.1| unnamed protein product [Oikopleura dioica]
Length = 609
Score = 266 bits (680), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 135/326 (41%), Positives = 201/326 (61%), Gaps = 26/326 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PH +FILADD+G++D+G+H +I TP +D LA +G+ L+ YY +CTP+R +MTG++
Sbjct: 98 PHFLFILADDMGYHDIGYHQA-EILTPFMDKLATTGVRLEQYYVQPVCTPTRVQLMTGRY 156
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
I GMQH V+ + G+PL EK+LP+ L++ GY T ++GKWHLG + ++Y P RGF+
Sbjct: 157 QIRYGMQHGVVRPPQPDGVPLDEKLLPEALRKCGYNTEMIGKWHLGMFTEDYLPQNRGFD 216
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL--EPA---WDLHGKYSTDVFTAEAVDIIH 234
+G++TG QD++ H+ M G D R +P WDL+ YST VF E +
Sbjct: 217 HFMGFYTGSQDFYSHNKCFSGMCGYDFREATAGQPEVIRWDLNNTYSTGVFADELEKRLS 276
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
+ EP F YL+ A HS PLQAP +++ +R++++ KR + A++ +D ++ K
Sbjct: 277 KMNPSEPSFTYLSFQAVHS-----PLQAPRKFIDQYRNLKNRKRRNYNAMVTAMDTAIYK 331
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
VV+A ++ + ++I++F SDNGG G ASNWPLRG K +L+EGGVR G + SP
Sbjct: 332 VVKAYKKFKFWDDTILIFSSDNGGNLKG----GASNWPLRGAKGSLFEGGVRTIGFVHSP 387
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
LL V+DW PTLL A
Sbjct: 388 LLP-----------VTDWFPTLLQLA 402
>gi|167515556|ref|XP_001742119.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778743|gb|EDQ92357.1| predicted protein [Monosiga brevicollis MX1]
Length = 339
Score = 266 bits (679), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 195/318 (61%), Gaps = 11/318 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+FI+ADDLGWNDV HG QIPTP+IDA+A+SG+ L NY+ +CTP+RS ++G+H
Sbjct: 33 PNIVFIVADDLGWNDVSLHGSPQIPTPHIDAIAHSGVHLTNYHVQPVCTPTRSTFLSGRH 92
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+ G L LS +LP YLK+LGYRT VGKWHLG ++ PT RGF+
Sbjct: 93 VIHTGIYMPFAQGTAL-RLNLSYTLLPAYLKKLGYRTAAVGKWHLGQNVEKALPTGRGFD 151
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
+LGYW+G +DY+ H G D + E A + YST +F AV+ I +
Sbjct: 152 EYLGYWSGAEDYYTHDTHG----GYDFQDGTECAIKYNNTYSTYIFAERAVNTILEADPE 207
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
+PLFLY A H PL+AP Y+ HI + +R AA+ LD++VG + +AL
Sbjct: 208 QPLFLYTAFQNVH-----WPLEAPAEYVARFSHIPNSERQYVAAMTSILDDAVGNITDAL 262
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ R+ N+I++F SDNGG N ++N+PLRG KNTLW GG + G+I +E+
Sbjct: 263 KRSRIADNTILIFTSDNGGPVHDEN-TESNNYPLRGGKNTLWNGGTQVVGMIAGKGIENP 321
Query: 360 GIVAEQYVHVSDWLPTLL 377
G +H SDWLP+L+
Sbjct: 322 GTDCHGMMHASDWLPSLV 339
>gi|410956991|ref|XP_003985119.1| PREDICTED: arylsulfatase J [Felis catus]
Length = 621
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 134/295 (45%), Positives = 184/295 (62%), Gaps = 11/295 (3%)
Query: 88 IDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQ 147
+D LA G+ L+NYY +CTPSRS +TGK+ IHTG+QH+++ + LPL LPQ
Sbjct: 80 LDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQ 139
Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDM 206
LKE+GY T +VGKWHLGFY+KE PT RGF++ G G DY+ H + M G D+
Sbjct: 140 KLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDL 199
Query: 207 RRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
+ AWD +G YST ++T I+ +H +P+FLY+A+ A HS PLQAP
Sbjct: 200 YENDNAAWDYDNGLYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPGR 254
Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
Y +R I + R ++AA+L LDE++ V AL+ +NSII++ SDNGG
Sbjct: 255 YFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTA--- 311
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SNWPLRG K T WEGG+R G + SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 312 -GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA 365
>gi|405975640|gb|EKC40194.1| Arylsulfatase B [Crassostrea gigas]
Length = 484
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 141/346 (40%), Positives = 206/346 (59%), Gaps = 21/346 (6%)
Query: 42 LAFTLSMVFVDLVA-------SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
+ TL ++F L+A + PHI+FI+ADDLGWNDVG+H + TPNID LA
Sbjct: 1 MGVTLLVLFGGLLALLQRKQVEAKSPHILFIVADDLGWNDVGYHN-PAMKTPNIDKLARE 59
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
G+IL Y LC+PSR A MTG +P G+QH V+ + PL+ K LPQ LK+LGY
Sbjct: 60 GLILNQTYFQPLCSPSRHAFMTGYYPYRAGLQHLVIMPWQPVCSPLNMKFLPQRLKDLGY 119
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
T +VGKWHLG + TPT+RGF+S G++ DY+ H + + LD R + +P
Sbjct: 120 ATHMVGKWHLGMCNWDCTPTYRGFDSFFGFYHAKADYYSHISYKY----LDYRDNEKPVK 175
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
+L+G YST FT+ A DII H++ +PLFLY+A P+EPLQ P Y +++ +
Sbjct: 176 NLNGTYSTFTFTSRAQDIIKKHNSSQPLFLYMAFPI-----PHEPLQVPQQYEDMYPNTM 230
Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLR 334
R +++ ++ LDE++G + + L + M +++IVF SDNGG+ ++ +N+PLR
Sbjct: 231 SKGRRQYSGMVSALDEAIGNITDQLRESGMYDDTLIVFTSDNGGSP----MDYGNNYPLR 286
Query: 335 GVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G K T++EGG R + L+ G E +H DW TL++AA
Sbjct: 287 GAKFTVYEGGTRVVSFVHGAGLQKTGEAYESMMHAVDWSDTLVAAA 332
>gi|291236973|ref|XP_002738412.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 843
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 143/332 (43%), Positives = 196/332 (59%), Gaps = 14/332 (4%)
Query: 52 DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
+++ PPHIIFI+ DDLGWNDVG++ TP ID LA SG+ L NYY C PSR
Sbjct: 358 NVLGQEKPPHIIFIVVDDLGWNDVGYNN-PVFKTPTIDRLAGSGVKLLNYYVASHCLPSR 416
Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
+ +MTG+H I G+Q + +G LPL E + Q LK++GY T I+GKWH GFY
Sbjct: 417 NMLMTGRHAIQLGIQRHG-FGYHPRSLPLDETTIAQPLKQVGYSTHIIGKWHCGFYSDNC 475
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
P RGF++ G+ +++ HS M L D A GKYST ++ E +
Sbjct: 476 LPHNRGFDTFFGFVGAGIEHYTHSDHFNHMHNLRKNDDC-IAKQYIGKYSTTIYANEGKN 534
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFAAILHKLDE 290
II+ H ++P FLYL+ +A H+ PL+ P YL + I D R +AA+ +DE
Sbjct: 535 IINAHDQNKPFFLYLSFSAVHT-----PLEVPSSYLKQYESTIYDEDRRTYAAMTSCVDE 589
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
++ + EALE++ ML NSIIVF SDNGGA +A SNWPLRG K+T+WEGGV+ G
Sbjct: 590 AIANITEALEEKDMLKNSIIVFTSDNGGAV---ERSAGSNWPLRGGKHTIWEGGVKAVGF 646
Query: 351 IWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
++ LL SRG +H++DW PTL++ A
Sbjct: 647 VYGELLPHHSRGTENTGLMHITDWFPTLVTLA 678
>gi|405977794|gb|EKC42228.1| Arylsulfatase I [Crassostrea gigas]
Length = 545
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 135/345 (39%), Positives = 203/345 (58%), Gaps = 21/345 (6%)
Query: 43 AFTLSMVFVDLVASSG-----PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
F + ++ + + SS PPHI++I+ADDLGWNDVG+H D I TPN+D +A G+I
Sbjct: 6 TFGVVLIGLCFLVSSAEGAKRPPHILYIVADDLGWNDVGWHNPD-IKTPNLDRMAGGGVI 64
Query: 98 LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
L + Y +CTPSR++ +TG +P G+ + + + L LP+ LK+LGY T
Sbjct: 65 LNSSYVHPICTPSRNSFLTGVYPFRVGLSGTAITPHQARFMSLKTPTLPEKLKKLGYSTH 124
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH 217
++GKWHLGF + YTPT RGF+S LG++TG QDY+ H+ + G D R + +
Sbjct: 125 MIGKWHLGFCNERYTPTRRGFDSFLGFYTGTQDYYKHTTAK----GYDFRFNQTVFYPPK 180
Query: 218 GKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
+YST + AVDII H + PLFLYLA + H+ PLQ P Y + +I++
Sbjct: 181 KQYSTKTYAKRAVDIITEHKRKKNPLFLYLAFQSVHT-----PLQVPKKYEKQYNNIKNK 235
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
R ++ ++ +D+++G V+ A + + + N I+VF +DNGGAA +N+PLRG
Sbjct: 236 DRRVYSGMVTAMDDAIGAVMAAFRKHKFMGNLIVVFTTDNGGAAH----IVGNNFPLRGS 291
Query: 337 KNTLWEGGVRGAGLIWSP-LLESRGIVAEQYVHVSDWLPTLLSAA 380
K T+WEGG R ++S LL+ G + H +DW PT+L+ A
Sbjct: 292 KTTVWEGGTRAVSFVYSKNLLKKTGYTHDGLFHATDWFPTILAVA 336
Score = 40.0 bits (92), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 15/48 (31%), Positives = 30/48 (62%)
Query: 507 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKEN 554
+ ++DG++ W ++ +PS+R ++ ID +SA+ GK+KL+ N
Sbjct: 343 EKQLDGVNQWPMIRSGKPSRRTEFVYEIDKSRPVSAIRMGKYKLIVGN 390
Score = 39.7 bits (91), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 14/45 (31%), Positives = 29/45 (64%)
Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
+ ++DG++ W ++ +PS+R ++ ID +SA+ GK+KL+
Sbjct: 343 EKQLDGVNQWPMIRSGKPSRRTEFVYEIDKSRPVSAIRMGKYKLI 387
>gi|405964467|gb|EKC29949.1| Arylsulfatase B [Crassostrea gigas]
Length = 482
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 134/343 (39%), Positives = 205/343 (59%), Gaps = 17/343 (4%)
Query: 43 AFTLSMVFVD--LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
+F +FV L S+ PPHI+FI+ADDLGWNDVGF D + TPNID LA SG+IL +
Sbjct: 6 SFITCSIFVHFLLCVSAKPPHILFIVADDLGWNDVGFRNPDVL-TPNIDKLARSGMILNS 64
Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
Y + +CTPSR++ MTG + +G+QH + + PL+ LPQ LKELGY T +G
Sbjct: 65 SYVMPVCTPSRNSFMTGHYAFKSGLQHLAINPQQATCAPLNYTFLPQKLKELGYATHAIG 124
Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
KWHLGF K E TPT+RGF++ G++ G +DY+ S G D R + P + G+Y
Sbjct: 125 KWHLGFCKWECTPTYRGFDTFFGFYNGQEDYYTLSVAG----GKDFRDNKVPV-NATGEY 179
Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
ST +++ A II H +P+++YL + H+ PLQ P+ Y +++ ++ R
Sbjct: 180 STFLYSRRAESIIKEHDASKPIYMYLPFQSVHA-----PLQVPNRYTDMYTNVHTKSRRT 234
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
+ ++ +DE++G + +AL+ + + ++++I+F +DNGGA A +N+PLRG K TL
Sbjct: 235 YMGMVTAMDEAIGNITKALKTKGIFNDTLIIFTTDNGGAVK----FAGNNYPLRGGKATL 290
Query: 341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+EGG R + ++ +H DW+PT+LSAA +
Sbjct: 291 FEGGTRATAFVTGAGIQKFNYEYSGLIHAVDWMPTVLSAAGGT 333
>gi|428202415|ref|YP_007081004.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
gi|427979847|gb|AFY77447.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
Length = 538
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 148/350 (42%), Positives = 209/350 (59%), Gaps = 12/350 (3%)
Query: 34 IMAFAVLPLAFTLSMVFVD--LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
I+ F + +A S FV + A++ P+I+ I+ADDLGWNDVG+H +I TPN+D L
Sbjct: 15 IITFISVAIAAIYSNFFVSDGVRAAAQKPNIVIIMADDLGWNDVGYHN-SEIKTPNLDKL 73
Query: 92 AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
A S L +Y CTP+R+A+MTG+HP GM V++ ++ GLPL EK + Q LKE
Sbjct: 74 AESSTRLDRFYVTSSCTPTRAALMTGRHPSRYGMSSGVIWPWDKVGLPLEEKTIAQTLKE 133
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
GY T IVGKWHLG YK+EY PT RGF+ H G++ G DYF H + GLD R+ +
Sbjct: 134 AGYYTAIVGKWHLGHYKEEYLPTRRGFDYHYGHYCGSIDYFTHQLDAGIQGGLDWHRNEQ 193
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
P + Y+TD+ EAV +I + ++ PLFLY++ A P+ PLQA + + +
Sbjct: 194 PVEE--EGYATDLLAQEAVKLIRDCDYNKSPLFLYVSFNA-----PHAPLQAKEKDIKNY 246
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
+I+D R FAA + +DE+VG++V AL+++++ N+++ F SDNGG + N
Sbjct: 247 ANIQDEGRRIFAAQVQSMDEAVGRIVAALKEKQVWDNTLLFFTSDNGGGSDQ-PWTRGDN 305
Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
PLRG K TL++GGVR +I P G V EQ V D PTL A
Sbjct: 306 RPLRGQKGTLYDGGVRVPTIISFPAQLKGGQVIEQVFSVVDLYPTLAKLA 355
>gi|291235057|ref|XP_002737462.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
partial [Saccoglossus kowalevskii]
Length = 355
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 148/342 (43%), Positives = 198/342 (57%), Gaps = 14/342 (4%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
++F L +V + V PPHIIFI+ DDLGWNDVG++ TP ID LA +G+ L NY
Sbjct: 5 VSFLLLLVSGNAVGQEKPPHIIFIVVDDLGWNDVGYNN-PVFKTPTIDRLAGNGVKLLNY 63
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y C PSR+ +MTG+H I G+ + +G LPL E + + LK GY T IVGK
Sbjct: 64 YVASHCLPSRNMLMTGRHAIQLGIPQDG-FGYHPRSLPLDETTIAEPLKHAGYSTHIVGK 122
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
WH G+Y P RGF++ G+ D++ HS M L D A GKYS
Sbjct: 123 WHCGYYADNCLPHNRGFDTFFGFVGAGIDHYTHSDHFNHMHNLRKNDDCI-AKKYIGKYS 181
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSK 280
T +F E DII+ H ++PLFLYL+ +A H+ PL+ P YL + I D R
Sbjct: 182 TTIFANEGKDIINAHDQNKPLFLYLSFSAVHA-----PLEVPSSYLKQYESTIHDEDRRT 236
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
+AA+ +DE++ + EALE++ ML NSIIVF SDNGGA +A +NWPLRG K+T
Sbjct: 237 YAAMTSCVDEAIANITEALEEKNMLKNSIIVFTSDNGGALVN---SAGNNWPLRGGKHTS 293
Query: 341 WEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
WEGGVR G ++ LL SRG +H++DW PTL++ A
Sbjct: 294 WEGGVRAVGFVYGDLLPRHSRGTENTGLMHITDWFPTLITLA 335
>gi|332016484|gb|EGI57377.1| Arylsulfatase B [Acromyrmex echinatior]
Length = 438
Score = 262 bits (669), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 131/291 (45%), Positives = 195/291 (67%), Gaps = 15/291 (5%)
Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
+CTPSR A ++G+HP+ TGMQ L E GL L++ +LP+YL++LGY T ++GKWH+G
Sbjct: 3 VCTPSRVAFLSGRHPLRTGMQGYPLKAAEPRGLHLNDTLLPEYLRKLGYTTHLLGKWHVG 62
Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR------DLEPAWDLHGK 219
+ + Y PT RGF++ LGY+ G YF+H+ EE + G D+ R +E +D
Sbjct: 63 YLTRNYVPTRRGFDTFLGYFNGVIQYFNHTIEENEQVGYDLHRIVGDNHTVEYRYD---- 118
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD--HYLNIHRHIEDFK 277
Y T++ T EA +II +H+T++P++L LAH A+H++N E ++ D +I+D
Sbjct: 119 YMTNLITEEAENIISSHNTEKPMYLQLAHLASHASNAEEAMEVYDWEETNATLGYIQDVN 178
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG-FNLNAASNWPLRGV 336
R KFA+++ LD+S+G+VV+AL ++ ML N+II+F+SDNG G N SN+PLRG+
Sbjct: 179 RRKFASVVTTLDKSIGRVVDALNKKDMLKNTIILFISDNGAQTEGPLYQNFGSNYPLRGL 238
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDI 385
K +L+EGGVRGA I+SPL+E+ V+ Q H++DWLPTL SAA N SD+
Sbjct: 239 KFSLYEGGVRGAACIYSPLIENSSKVSTQLFHITDWLPTLYSAAGGNSSDL 289
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 70/144 (48%), Gaps = 19/144 (13%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM----------- 622
++DGID WS + E SKR +++ NI+D SA G++KL++
Sbjct: 291 QLDGIDQWSTIKTQEASKRKSVVINIED-VDYSAALMGRYKLIQDKSDFQKHYTNYKGND 349
Query: 623 ----RYQV-DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCL 677
+Y V ++ P ++ +S+ A + +LR A++ C + CE + CL
Sbjct: 350 PSYPKYNVTNILESPAASAIANISNHIVTANKIEELRKEATVVCENLMTSDCESR--TCL 407
Query: 678 FDIKNDPCEKNNLADRSEDQRINH 701
FDI DPCE +++ + + R+N
Sbjct: 408 FDIYEDPCEVTDISSQYPEVRLNQ 431
>gi|291238558|ref|XP_002739195.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 495
Score = 262 bits (669), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 149/344 (43%), Positives = 199/344 (57%), Gaps = 21/344 (6%)
Query: 47 SMVFVDLVASSG-------PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
++VF+ L+ +SG PPHIIFI+ DDLGWNDVG++ TP ID LA +G+ L
Sbjct: 3 TVVFLLLLMASGNVLGHEKPPHIIFIVVDDLGWNDVGYNN-PVFKTPTIDRLAGNGVKLL 61
Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
NYY C PSR+ +MTG+H I G+ N +G LPL E + + LK GY IV
Sbjct: 62 NYYVASHCLPSRNMLMTGRHAIQLGIP-NDGFGYHPRSLPLDETTIAEPLKHAGYSNHIV 120
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
GKWH G+Y P RGF++ G+ D++ HS M L D A GK
Sbjct: 121 GKWHCGYYADNCLPHNRGFDTFFGFVGAGIDHYTHSDHFNHMHNLRKNDDC-IAKKYIGK 179
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKR 278
YST +F E DII+ H ++PLFLYL+ +A H+ PL+ P YL + I D R
Sbjct: 180 YSTTIFANEGKDIINAHDQNKPLFLYLSFSAVHA-----PLEVPSSYLKQYESTIHDEDR 234
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
+AA+ +DE++ + EALE++ ML NSIIVF SDNGGA +A +NWPLRG K+
Sbjct: 235 RTYAAMTSCVDEAIANITEALEEKNMLKNSIIVFTSDNGGALVN---SAGNNWPLRGGKH 291
Query: 339 TLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
T WEGGVR G ++ LL SRG +H++DW PTL++ A
Sbjct: 292 TSWEGGVRAVGFVYGDLLPRHSRGTENTGLMHITDWFPTLITLA 335
>gi|405964464|gb|EKC29946.1| Arylsulfatase B [Crassostrea gigas]
Length = 482
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 132/346 (38%), Positives = 203/346 (58%), Gaps = 16/346 (4%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
F++ + F+ V S+ PPHI+FI+ADD GWNDVGF D + TPNID LA+SG++L +
Sbjct: 8 FVFSIFIHFLPYV-SAKPPHILFIVADDYGWNDVGFRNPDVL-TPNIDKLAHSGMVLNSS 65
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y + +CTPSR++ MTG + +G+QH + + PL+ LPQ LKELGY T +GK
Sbjct: 66 YVMPVCTPSRNSFMTGHYAFKSGLQHLAILPKQAACAPLNYTFLPQKLKELGYATHAIGK 125
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
WHLGF K E TPT+RGF++ G++ G +DY+ S G D R + P + G+YS
Sbjct: 126 WHLGFCKWECTPTYRGFDTFFGFYNGQEDYYTLSVAG----GKDFRDNRTPV-NATGEYS 180
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T ++ A II H +P+++YL + H EPLQ P Y +++ + R +
Sbjct: 181 TFLYARRAESIIKEHDASKPMYMYLPFQSVH-----EPLQVPQKYSDMYSKVHTESRRTY 235
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
++ +DE++G + +AL+ + M ++++I+F +DNGG A +N+PLRG K T++
Sbjct: 236 LGMVTAMDEAIGNITKALKTKGMFNDTLIIFTADNGGWIT----FAGNNYPLRGGKATVF 291
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
EGG R + ++ V +H DW+PT+L AA + P
Sbjct: 292 EGGTRATAFVSGAGIQKSNYVYPGMIHAVDWMPTVLRAAGGTPDPT 337
>gi|291236278|ref|XP_002738066.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 508
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 140/341 (41%), Positives = 202/341 (59%), Gaps = 23/341 (6%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
VA S P+I+F+LAD+LGW+D+G+HG ++ TP++D LA GI L+NYY +CTP+RS
Sbjct: 17 VARSSQPNIVFVLADELGWHDIGYHG-SRVQTPHLDKLASEGIKLENYYVQPMCTPTRSQ 75
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+M+G++ IHTG+QH V+ +R LPL E + Q LKE GY T +VGKWHLG Y K P
Sbjct: 76 LMSGRYQIHTGLQHTVINPDQRSCLPLDEVTIAQKLKEAGYSTHMVGKWHLGHYTKGCLP 135
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEE---------MKMWGLDMRRDLEP--AWDLHGKYST 222
T RGF+S G++ DY+ + + ++M G D+ R+ E A G Y T
Sbjct: 136 TKRGFDSFFGFYNCAVDYYTYEKGKFCKFENETVLRMRGTDLWRNDEEHVAPYYQGHYQT 195
Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
V EA D+I H+ +PLFLYLA A H PL+ P Y +++ ++D R
Sbjct: 196 HVLAKEAEDVIRKHNPSKPLFLYLAFGAVHV-----PLEVPKVYEDMYADVKDNSRRILL 250
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
A+ +D+S+ V AL+ + N+I +F +DNG A A +WPL+G K +L+E
Sbjct: 251 AMATCMDDSIADVTGALKDTGLWDNTIFIFSTDNGAATA----YGGCSWPLKGGKTSLFE 306
Query: 343 GGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAAN 381
GG+R G + SPLL + +G + +HVSDW PTL+ A
Sbjct: 307 GGIRAVGFVTSPLLPPDMQGTENMELMHVSDWFPTLVHVAG 347
>gi|291241933|ref|XP_002740864.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 496
Score = 259 bits (661), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 141/347 (40%), Positives = 201/347 (57%), Gaps = 22/347 (6%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
M+ A + + F+ +F SG PHI+F + DDLGWNDVG+H I TP ID LA S
Sbjct: 3 MSVANVVIIFS---IFSSAEPYSGQPHIVFFVVDDLGWNDVGYHN-SYIKTPTIDMLAKS 58
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
G+ L NYY C PSR+ +++G+H I G+QH + G GLPL E + LKE+GY
Sbjct: 59 GVRLNNYYVASHCVPSRNMLISGRHVIDIGLQHGEI-GYYPRGLPLDEFTIADKLKEIGY 117
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE-PA 213
T ++GKWH G Y P RGF++ GY D++ H + D+R + E
Sbjct: 118 ATHLIGKWHCGCYSNHSLPHNRGFDTFFGYLGSSDDHYTHIIMSNGL--ADLRLNDECVG 175
Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH- 272
+ G YST ++ EA +II H ++PLFL LA +A H +P+Q PD Y+ ++
Sbjct: 176 YKYFGDYSTIMYANEAKNIIAQHDENKPLFLMLAFSAVH-----KPIQVPDVYMQYYKST 230
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
I D +R +A + +DE++ + EALE + ML NS+IV +DNGG + G +NWP
Sbjct: 231 IHDNERRTYAGMASCVDEAIANITEALENKGMLKNSVIVLTTDNGGGSVG------NNWP 284
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLL 377
LRG KN+ WEGGVRG ++S LL + RG + +H++DW PTL+
Sbjct: 285 LRGRKNSHWEGGVRGVAFVYSDLLPMDVRGTENSELMHITDWFPTLV 331
>gi|323449751|gb|EGB05637.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 533
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 138/315 (43%), Positives = 188/315 (59%), Gaps = 13/315 (4%)
Query: 62 IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
I+F++ADDLG+NDVGFHG QIPTP +DALA G+ L NY+T +C+PSR+++++G+H I
Sbjct: 58 ILFLVADDLGFNDVGFHGSKQIPTPRLDALAADGVDLLNYHTHPVCSPSRASMLSGRHAI 117
Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
H G+ G L L ++LP+ L+ LGY T VGKWHLG + PT RGF+S
Sbjct: 118 HHGIYMPFAQGTAY-HLSLEYELLPEALRRLGYETHAVGKWHLGQNTRAALPTGRGFDSF 176
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEP 241
LGYW+G +DYF H D + AW G YS FT AVD++ + ST P
Sbjct: 177 LGYWSGAEDYFAHDCAG----AYDFANNETTAWAYDGVYSAYSFTDRAVDVVASAST--P 230
Query: 242 LFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQ 301
FLY+A H PL+ P YL H+ R + ++ LDE+VG V AL+
Sbjct: 231 YFLYVAWQNVH-----WPLEVPARYLEPFGHVPAGPRRNVSGMVSALDEAVGNVTGALKA 285
Query: 302 RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGI 361
RR ++++VFVSDNGG G A++NWPLRG KNTL+EGG R GL+ +P + ++
Sbjct: 286 RRAYDDAVVVFVSDNGGPTNGDEGTASNNWPLRGGKNTLYEGGTRVVGLLKAPGV-AKSS 344
Query: 362 VAEQYVHVSDWLPTL 376
+ +H DW PTL
Sbjct: 345 ASRAKLHAVDWFPTL 359
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 52/127 (40%), Gaps = 28/127 (22%)
Query: 576 DGIDVWSVLSRNEPSKRNTILH----NIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
DG+D + VL+R P + +L +D AL G WK +++ V
Sbjct: 378 DGVDSYDVLARGAPGRDELLLEAHAPGSNDTVHGEALIVGDWKFLRIGTVR--------- 428
Query: 632 PDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQI-------APCLFDIKNDP 684
PD S+ W ++ + + CG E P + APCLFD+ DP
Sbjct: 429 PD-------SEAGWTRPPGQEAHPRSVVDCGAAPE-PADANATRRQCVAAPCLFDVAADP 480
Query: 685 CEKNNLA 691
CE+ + A
Sbjct: 481 CERVDRA 487
>gi|443321855|ref|ZP_21050894.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
gi|442788399|gb|ELR98093.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
Length = 469
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 148/358 (41%), Positives = 211/358 (58%), Gaps = 20/358 (5%)
Query: 33 RIMAFAVLPLAFTLSMVFV-DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
+I+ F L L T +L A P+II ILADDLGWNDVGFHG +I TPN+D L
Sbjct: 7 KIVIFICLTLIITYYFRGANELKAEINKPNIIVILADDLGWNDVGFHG-SEIKTPNLDKL 65
Query: 92 AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
A SG+ L+ +Y +CTP+R+A +TG+HP GM V+ ++ GLPL EK + + LKE
Sbjct: 66 AASGVRLERFYVKSVCTPTRAAFLTGRHPFRYGMSTGVIKPWDKVGLPLEEKTIAETLKE 125
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
GY T I+GKWHLG Y++ + PT RGF+ H G++ G DYF H+ + + LD R+
Sbjct: 126 AGYYTAILGKWHLGHYQESFLPTSRGFDYHYGHYLGGIDYFTHNDDFLG--ALDWHRN-- 181
Query: 212 PAWDLHGK---YSTDVFTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYL 267
+H K Y+TD+ EAV +I+NH+ + +PLFLY+A A H+ PL A +
Sbjct: 182 ---RIHLKEEGYATDLIGQEAVKLINNHNYEQQPLFLYIAFNAPHT-----PLHAKTEDI 233
Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
+ I+D KR FAA + +DE++GK++++LE +++ N+ + FVSDNGG+ N
Sbjct: 234 EDYLTIDDEKRRVFAAQVQSMDEAIGKIIQSLENQQVCDNTFVFFVSDNGGSV--MRANR 291
Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
N PLRG KN+L+EGGVR ++ P S Q + D PTL A +
Sbjct: 292 GDNRPLRGGKNSLYEGGVRVPAIVSYPPKLSANQEINQIFSIVDLYPTLAKLAGVESV 349
>gi|291230656|ref|XP_002735281.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 522
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 150/373 (40%), Positives = 204/373 (54%), Gaps = 56/373 (15%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L+ L + AS+ PHIIFILADDLGW+DVG+HG + TP IDALA G+ L+NY
Sbjct: 10 LSILLGVSLSLATASATRPHIIFILADDLGWDDVGYHG-SVMKTPYIDALAAEGVTLENY 68
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y LCTPSRS ++TG++ IHTG+QH + + LPL E LPQ LKE GY T +VGK
Sbjct: 69 YMPSLCTPSRSVLLTGRYEIHTGLQHGTILMMQPLCLPLDEITLPQKLKEEGYDTHMVGK 128
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH--SAEEMKMWGLDMRRDLEPAWDLH-G 218
WHLGFY+KE P RGF+S LG++ D+F H SA G D RR+ + D + G
Sbjct: 129 WHLGFYRKECLPNNRGFDSFLGFYQAMGDHFYHNISASPGHFNGFDFRRNNDVVADQYAG 188
Query: 219 KYSTDVFTAEAVD--------------------------IIHNHSTDEPLFLYLAHAATH 252
KYST +F ++ I +++ +PLFLYL+ A H
Sbjct: 189 KYSTHIFXXXFINTQTLSFVCVNNVKGVYFRGYLSSFTPITSSYNPQQPLFLYLSFQAVH 248
Query: 253 SANPYEPLQAPDHYLNIHRHI--EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
+ PLQ P Y ++ + D R +A ++ +DE+VG V L+Q + NS++
Sbjct: 249 T-----PLQVPSRYAELYNDLIPNDEDRRIYAGMVTCMDEAVGYVTTVLKQSALWENSVV 303
Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVH 368
+F + LRG K TL+EGGVRG G + SPLL + RG + + +H
Sbjct: 304 IFSTX-----------------LRGGKGTLYEGGVRGVGFVTSPLLHEQVRGTINHELIH 346
Query: 369 VSDWLPTLLSAAN 381
+SDW PTL A
Sbjct: 347 ISDWFPTLTRLAG 359
>gi|405964468|gb|EKC29950.1| Arylsulfatase B [Crassostrea gigas]
Length = 483
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 137/354 (38%), Positives = 208/354 (58%), Gaps = 17/354 (4%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
M +V F + + F+ V S+ PPHI+FI+ADD GWNDVGF + TPNID LA S
Sbjct: 1 METSVSVFVFCIFIHFLPYV-SAKPPHIVFIVADDYGWNDVGFRN-PSVLTPNIDKLARS 58
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
G+IL + Y + +CTPSR++ MTG++ +G+QH V+ + PL+ LPQ LKELGY
Sbjct: 59 GMILNSSYVMPVCTPSRNSFMTGQYAFKSGLQHIVILPQQATCAPLNNTFLPQKLKELGY 118
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
T +GKWHLGF K E TPT+RGF++ GY+ G +DY++ S G D R + P
Sbjct: 119 ATHAIGKWHLGFCKWECTPTYRGFDTFYGYYNGAEDYYNLSIAG----GKDFRDNRTPV- 173
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP-DHYLNIHRHI 273
+ G+YST ++ A II +H +P+++YL + H EPLQ + +I+ +
Sbjct: 174 NATGEYSTILYARRAESIIKDHDASKPMYMYLPFQSVH-----EPLQVSLESSSDIYSKV 228
Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
R + ++ +DE++G + +AL+ + M ++++IVF +DNG G+ A +N+PL
Sbjct: 229 HTESRRTYLGMVTAMDEAIGNITKALKTKGMFNDTLIVFTADNG----GWITYAGNNYPL 284
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
RG K T++EGG R + ++ V +H DW+PT+LSAA + P
Sbjct: 285 RGGKTTVFEGGTRATAFVSGAGIQKSNYVYPGMIHAVDWMPTVLSAAGGTPDPT 338
>gi|402871949|ref|XP_003899908.1| PREDICTED: arylsulfatase B-like, partial [Papio anubis]
Length = 316
Score = 256 bits (654), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 124/263 (47%), Positives = 172/263 (65%), Gaps = 12/263 (4%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS ++
Sbjct: 42 ASRPPHLVFVLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160
Query: 176 RGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
RGF+++ GY G +DY+ H A + LD R E A YST++FT A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
+I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++ +D
Sbjct: 221 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275
Query: 290 ESVGKVVEALEQRRMLSNSIIVF 312
E+VG V AL+ + +N++ +F
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIF 298
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145
>gi|340384741|ref|XP_003390869.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 490
Score = 255 bits (651), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 142/344 (41%), Positives = 197/344 (57%), Gaps = 23/344 (6%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
+ + LAF+ LV+S+ P+II + DDLGWND + G D I TPNID LA
Sbjct: 4 FSLVLFLLAFS-----APLVSSANSPNIILFVIDDLGWNDTSYQGSD-IQTPNIDKLAEE 57
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
GI LK YY LC+PSRSA++ GK+P H G+ H V+ GL L+E + +LK+ GY
Sbjct: 58 GIRLKQYYVQPLCSPSRSALLAGKYPYHLGLAHGVITNGHPYGLGLNETTIADHLKKGGY 117
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWG-LDMRRDLEPA 213
T VGKW LG +K E+TPT+RGF++ GY+ +DY+ H K+ G LD R + +P
Sbjct: 118 STHAVGKWDLGMHKWEFTPTYRGFDTFYGYYDADEDYYTH-----KVGGYLDFRNNTDPV 172
Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
D G YST +FT D I N +D P F+Y A+ + HS PL+APD YL H
Sbjct: 173 KDEDGTYSTFLFTKAIEDAI-NAKSDSPFFIYGAYQSVHS-----PLEAPDTYLE-KCHS 225
Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
R F ++ LDE + + L+ + +L ++II+ +DNGG A +SNWPL
Sbjct: 226 PYPNRKIFCGMMLALDEGISNITSLLQTKGLLDDTIIILTTDNGGQTA----LGSSNWPL 281
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
RG K T++EGGVRG +WS L +H++DW PT++
Sbjct: 282 RGNKATVFEGGVRGISFVWSTKLRKSNYDNNAMMHITDWYPTII 325
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 57/125 (45%), Gaps = 27/125 (21%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNI-----DDEWQI---SALTKGKWKLVKVVKVMRYQV 626
+DG ++W ++ + PS R ILH + D+ Q +AL +G WKL+ + +
Sbjct: 338 LDGFNMWDTINDDTPSPRTEILHQLDPPKYDERTQFIGQAALRQGDWKLI----IGQPNC 393
Query: 627 DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCE 686
GP VY L W+ L G ++E P + LF++K+DP E
Sbjct: 394 SEKLGPHTVY--DLCPIGWIHLN------------GTIEEPEVNPSLT-WLFNVKDDPNE 438
Query: 687 KNNLA 691
+N L+
Sbjct: 439 RNELS 443
>gi|443321854|ref|ZP_21050893.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
gi|442788398|gb|ELR98092.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
Length = 476
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 148/353 (41%), Positives = 205/353 (58%), Gaps = 17/353 (4%)
Query: 33 RIMAFAVLPLAFTLSMVFVD-LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
RI+ F L L T + L A P+II ILADDLGWNDVGFHG +I T N+D L
Sbjct: 7 RIVIFICLTLTITYYFRGANNLKAEINKPNIIVILADDLGWNDVGFHG-SEIKTTNLDKL 65
Query: 92 AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
A SG+ L+ +Y +CTP+R+A +TG+HP GM + GLPL EK + + LKE
Sbjct: 66 AVSGVRLERFYVKAMCTPTRAAFLTGRHPFRYGMSAINVTPWSETGLPLEEKTIAETLKE 125
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
GY T I+GKWHLG Y++ Y PT RGF+ H G++ DYF H + + GLD R+
Sbjct: 126 AGYYTAILGKWHLGHYQESYLPTSRGFDYHYGHYLAGIDYFTHKSGD----GLDWHRNNN 181
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
P + + G YSTD+ +AV +I+NH + PLFLY+A A P+ PLQA L +
Sbjct: 182 PVY-IEG-YSTDLIAQDAVQLINNHDYHKNPLFLYIAFNA-----PHIPLQAKAEDLEDY 234
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL---NA 327
IED +R FAA + +DE++G ++++L+ +++ +N+++VF SDNGG ++
Sbjct: 235 LTIEDEQRRLFAAQVQSMDEAIGSIIQSLQAKQVWNNTLLVFTSDNGGEIVANDIRFTGR 294
Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
N PLRG K L+EGGVR +I P S G EQ + D PT A
Sbjct: 295 GDNRPLRGGKRNLYEGGVRVPTIISYPGHLSSGKTVEQMFSIVDLYPTFAKLA 347
>gi|340373449|ref|XP_003385254.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 491
Score = 253 bits (647), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 136/326 (41%), Positives = 188/326 (57%), Gaps = 16/326 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
V+S+ P+II + DDLGWND + G D I TPNID LA GI LK YY LC+PSRSA
Sbjct: 18 VSSANNPNIILFVIDDLGWNDTSYQGSD-IQTPNIDKLAEEGIRLKQYYVQPLCSPSRSA 76
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++ GK+P H G+ H V+ GL L+E + +LK+ GY T VGKW LG +K E+TP
Sbjct: 77 LLAGKYPYHLGLAHGVITNGHPYGLGLNETTIADHLKKGGYSTHAVGKWDLGMHKWEFTP 136
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
T+RGF++ GY+ +DY+ H LD R + +P D G YST +FT D I
Sbjct: 137 TYRGFDTFYGYYDADEDYYTHKVGGY----LDFRNNTDPVKDEDGTYSTFLFTKAIEDAI 192
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
N +D P F+Y A+ + H PL+APD YLN +I R F ++ LDE +
Sbjct: 193 -NAKSDSPFFIYGAYQSVHG-----PLEAPDIYLN-KCNIPYPNRKIFCGMMLALDEGIS 245
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
+ L+ + +L ++II+ +DNGG A +SNWPLRG K T++EGGVRG +WS
Sbjct: 246 NITSLLQTKGLLDDTIIILTTDNGGQTA----LGSSNWPLRGNKATVFEGGVRGISFVWS 301
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSA 379
L +H++DW PT++
Sbjct: 302 TKLRKSNYDNNAMMHITDWYPTIIEG 327
>gi|430746415|ref|YP_007205544.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430018135|gb|AGA29849.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 474
Score = 252 bits (643), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 144/352 (40%), Positives = 208/352 (59%), Gaps = 24/352 (6%)
Query: 37 FAVLPLAFTLSMVFVDLVAS------SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
F L ++ L++V +V + S P+++ ILADD+GW DVG+H +I TP++D
Sbjct: 8 FGCLSTSYVLALVIPSMVWAADQPLKSAKPNVVLILADDMGWGDVGWHD-SEIKTPHLDK 66
Query: 91 LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
LA SG L+ +Y +C+P+R+A+MTG++P+ G+Q V+ + GLPL+E+ LPQ LK
Sbjct: 67 LAASGTRLEQFYVQPVCSPTRAALMTGRYPMRHGLQVGVVRPWAQYGLPLNERTLPQALK 126
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
E+GY T I GKWHLG ++ EY PT RGF+ G++ G DYF H ++ G D RD
Sbjct: 127 EVGYETAICGKWHLGHFQPEYLPTHRGFDHQYGHYNGALDYFTH----IRDGGFDWHRDD 182
Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
D YST + EA II +H T +PLFLY+ A H+ P Q P+ Y +
Sbjct: 183 RVNRD--EGYSTHLIGREATRIIGHHDTSKPLFLYVPFNAVHA-----PHQVPEKYKEPY 235
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
+++ +R+ +A +L +DE+VG++ A++ + M SN+I +F SDNGG A G SN
Sbjct: 236 SKLKEPRRT-YAGMLAAMDEAVGQIASAIDGKGMRSNTIFLFSSDNGGPAPG---QVTSN 291
Query: 331 WPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
PLRG K TL+EGGVR I W L G V +H+ D PTLL+ A
Sbjct: 292 GPLRGQKGTLYEGGVRVPAFISWEGHLRP-GTVVNAPLHIVDLFPTLLTLAG 342
>gi|260788430|ref|XP_002589253.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
gi|229274428|gb|EEN45264.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
Length = 449
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 128/293 (43%), Positives = 181/293 (61%), Gaps = 16/293 (5%)
Query: 91 LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
LA G+ L+NYY +C+PSR +MTG++ I G+QH+V+ GLPL E LPQ L+
Sbjct: 2 LASEGVKLENYYIQPICSPSRCQLMTGRYQIRYGLQHSVITSDRPHGLPLDEVTLPQKLR 61
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH------SAEEMKMWGL 204
E GYR+ IVGKWHLGF++KEY P RGF+ GY TG +DY+ H + + GL
Sbjct: 62 ENGYRSYIVGKWHLGFFRKEYMPLQRGFDRFYGYLTGGEDYWTHRRPNGYARDPSAFHGL 121
Query: 205 DMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD 264
D+R +P D +G YST +F +A++ I +H +P+FLYL A HS PL+AP
Sbjct: 122 DLRDQDKPVLDQNGTYSTHLFAQKAIEFILSHERSKPMFLYLPFQAVHS-----PLEAPT 176
Query: 265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN 324
Y+ ++ + +AA++ +DE+VG V +AL++ + N++++F +DNG
Sbjct: 177 KYVEKYKDVTHPLIKIYAAMVTAMDEAVGNVTDALKRTGLWDNTVLIFSTDNGARE---- 232
Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
NA SNWPLRG KNTLWEGGVRG G + S LL ++ +H+SDW PTLL
Sbjct: 233 -NAGSNWPLRGWKNTLWEGGVRGVGFVNSKLLRKGNRKSDALIHISDWFPTLL 284
Score = 43.1 bits (100), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 51/107 (47%), Gaps = 22/107 (20%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNI----------------DDEWQIS---ALTRGKWKL 550
+DG D+W +S PS R ILHNI DD + IS A+ G WKL
Sbjct: 297 LDGFDIWDTISTGTPSPRKEILHNIDPLHRKTASSSFNLLYDDIFNISVYAAIRSGDWKL 356
Query: 551 VKENSINGNGTSENRSNDNSYQNEIDGID--VWSVLSRNEPSKRNTI 595
+ + +G G ++ ++ + ++ +D +W RN+P +R+ +
Sbjct: 357 LTGDQGHG-GWKDSDTSGSKFEYAVDPPTKHLWLFNIRNDPQERSDL 402
>gi|260816811|ref|XP_002603281.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
gi|229288599|gb|EEN59292.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
Length = 357
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 136/326 (41%), Positives = 189/326 (57%), Gaps = 15/326 (4%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+ I+ADDLGW+DV ++ + + PN+ LA +G+I Y+ +LCTPSR+A++TGK
Sbjct: 8 PHILLIVADDLGWSDVSWNNPNVV-MPNLHTLATTGVIFNQTYSQRLCTPSRTALLTGKF 66
Query: 120 PIHTGMQ-HNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
P GMQ ++ GLPL E++LPQ LK+LGY T +VGKWHLG K EYTPT RGF
Sbjct: 67 PYRLGMQVQKSMFEKNSHGLPLDEELLPQKLKKLGYATHMVGKWHLGSCKWEYTPTERGF 126
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+S G+ G +DY+ H +E GLD D +G YST+ F A +II H
Sbjct: 127 DSFYGFHHGGEDYYTHMSER----GLDFWDGRTSVSDRNGVYSTESFARRAENIISQHDP 182
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
+ PLFLYL + H+ P Q P YL I+D R + LD+++ +V +A
Sbjct: 183 NTPLFLYLPFQSVHT-----PHQVPSSYLQTFSTIQDDNRKSILGMATALDDAIKRVTDA 237
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L+ + + N++ +F+SDNGG + SNWPLRG K TLWEGG R + +LE
Sbjct: 238 LKGKGLWDNTLTIFMSDNGGNY----VQGQSNWPLRGAKGTLWEGGTRVPAFVHGNMLER 293
Query: 359 RGIVAEQYVHVSDWLPTLLSAANKSD 384
G +H D LPTL+S A ++
Sbjct: 294 TGYTYHGMMHGVDILPTLVSVAGGTE 319
>gi|260786699|ref|XP_002588394.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
gi|229273556|gb|EEN44405.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
Length = 353
Score = 251 bits (640), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 136/325 (41%), Positives = 190/325 (58%), Gaps = 15/325 (4%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+ I+ADDLGW+DV ++ + + PN+ LA +G+I Y +LCTPSR+A++TGK
Sbjct: 1 PHILLIVADDLGWSDVSWNNPNVV-MPNLHTLATTGVIFNQTYCQRLCTPSRTALLTGKF 59
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P GMQ + + + GLPL E++LPQ LK+LGY T ++GKWHLG K EYTPT RGF+
Sbjct: 60 PYRLGMQRPIRHK-KAHGLPLDEELLPQKLKKLGYATHMIGKWHLGCCKWEYTPTERGFD 118
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
S GY G QDY+ H ++ GLD D +G YST+ F A +II H +
Sbjct: 119 SFYGYHRGSQDYYTHMSDG----GLDFWEGKTAISDQNGVYSTESFATRAENIISQHDPN 174
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLYL H+ P Q P YL I+D R ++ LD+++ +V +AL
Sbjct: 175 TPLFLYLPLQPVHT-----PHQVPSSYLQTFSTIQDHNRKSILGMVAALDDAIKRVTDAL 229
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
+ + + +N++ +F+SDNGG L+ SNWPLRG K T+WEGG R +LE
Sbjct: 230 QGKGLWNNTLTIFMSDNGGDY----LDGQSNWPLRGAKGTVWEGGTRVPAFAHGNMLERT 285
Query: 360 GIVAEQYVHVSDWLPTLLSAANKSD 384
G +H D LPTL+S A ++
Sbjct: 286 GYTYHGMMHGVDILPTLVSVAGGTE 310
>gi|291233195|ref|XP_002736539.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 513
Score = 249 bits (635), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 136/334 (40%), Positives = 203/334 (60%), Gaps = 23/334 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+FILADD GW+D+G+HG + +P +D LA G+ L+NYY +C+P+R+ +M+G++
Sbjct: 26 PNIVFILADDFGWHDIGYHG-SIVRSPYMDFLASEGVKLENYYVQPMCSPTRAQLMSGRY 84
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
I G+QH V+ +R LP E + Q +KE GY T +VGKWHLGFY+KE P RGF+
Sbjct: 85 QIRYGLQHLVIQPDQRACLPPDEVTIAQKMKEAGYATHMVGKWHLGFYRKECLPINRGFD 144
Query: 180 SHLGYWTG---HQDY---FDHSAE---EMKMWGLDMRRDLEPAWDLH-GKYSTDVFTAEA 229
+ G+ H Y + H+ E + M+G D+ R+ + H G+YST +F EA
Sbjct: 145 TFFGFLNCLIYHYTYDFGYWHTPESGNKTIMFGWDLFRNHDCVAKEHKGEYSTILFAEEA 204
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFAAILHKL 288
+I NH + P+FLYL AA H+ PL P Y ++ IED +R K AA++ +
Sbjct: 205 QRVIWNHDQETPMFLYLPFAAVHN-----PLDVPREYEAMYEGIIEDEQRRKKAAMVTCM 259
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
DE++ + +AL M N++++F +DNGGA + +NWPLRG K +++EGG+RG
Sbjct: 260 DEAIWNITKALMDTGMWDNTVLIFSTDNGGATS----VGGNNWPLRGGKRSMFEGGIRGV 315
Query: 349 GLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ RG Q +HV+DW PT + A
Sbjct: 316 GFVTSPLLDEAVRGTENNQLIHVTDWFPTFVHLA 349
>gi|291227815|ref|XP_002733878.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 508
Score = 249 bits (635), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 142/357 (39%), Positives = 198/357 (55%), Gaps = 22/357 (6%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
M V+ L M DL ++ PHI+FILADDLG+ DVG+ + TPNID LA
Sbjct: 1 MDLFVIILTLISLMTNFDLARTAQRPHIVFILADDLGYFDVGYRNGSIVKTPNIDKLAAE 60
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL--SEKILPQYLKEL 152
G+ L+ +Y C PSRS +M G++ IHTG + G +R L + +P LKE
Sbjct: 61 GVKLERHYAQPSCMPSRSCLMMGRYQIHTGFDYRCKDG-KRSQLCMHPDTITMPMKLKEN 119
Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH----SAEEMKMWGLDMRR 208
GY T ++GKWHLG + E P +GF++ GY + +DYF H A W R
Sbjct: 120 GYATHMIGKWHLGNIRWECLPNAKGFDTFFGYLSAIEDYFTHYTPAGANCHDFW----RN 175
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
E A D G+YST +FT EA DII NH ++P+F+YL++ A H PLQ P HY +
Sbjct: 176 HDEVADDYKGQYSTHLFTKEAQDIIKNHDINQPMFMYLSYQAVHG-----PLQVPPHYYD 230
Query: 269 IHRHIEDFK--RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN 326
++ R +AA+ +DE +G V L+QR M N++++FVSDNG A F+
Sbjct: 231 MYNDGPRVSTTRQTYAAMATCMDEGIGNVTSTLKQRGMWDNTVLIFVSDNG---AQFH-T 286
Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
NWPLRG K +++EGG+R + SPL+E ++E VH+ DW PTL A +
Sbjct: 287 GGRNWPLRGGKGSVFEGGIRVVSFVTSPLIERPRRISEDIVHLMDWFPTLTHLAGGT 343
>gi|220906870|ref|YP_002482181.1| sulfatase [Cyanothece sp. PCC 7425]
gi|219863481|gb|ACL43820.1| sulfatase [Cyanothece sp. PCC 7425]
Length = 495
Score = 248 bits (634), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 152/380 (40%), Positives = 204/380 (53%), Gaps = 33/380 (8%)
Query: 25 LKELGYRTRIMAFAVLPLAFTLSMVFVDLVA------------SSGPPHIIFILADDLGW 72
+K + ++R F L FT + + L SS PPHI+FI++DD GW
Sbjct: 1 MKNIFRKSRRFLFGFLFAVFTCCITWKLLTLNQQDLPVAVAQQSSQPPHILFIMSDDQGW 60
Query: 73 NDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG 132
DVGFHG D I TPN+D LA +G L+ YY+ +CTPSR+A++TG++P G+Q V+
Sbjct: 61 KDVGFHGSD-IRTPNLDQLAKTGARLEQYYSQPMCTPSRAALLTGRYPHRYGLQTLVIPS 119
Query: 133 CERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF 192
+ GLP E +LPQ LKE GY T IVGKWHLG +Y P RGF+ G G DYF
Sbjct: 120 AGKYGLPTDEYLLPQALKEAGYETAIVGKWHLGHADPKYWPRQRGFDYQYGPLLGEIDYF 179
Query: 193 DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATH 252
HSA W + + E Y T + +AV +I H+ PLFLYLA A H
Sbjct: 180 THSAHGKVDWYRNNQLIKEEG------YVTTLLGQDAVKLIEKHNPKTPLFLYLAFTAPH 233
Query: 253 SANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVF 312
+ P QAP YL+ ++ I D R +AA++ +D+ +G+VV ALE+R M +N++IVF
Sbjct: 234 A-----PYQAPQKYLDQYKTIADPNRRAYAAMITAMDDQIGQVVAALEKRGMRNNTLIVF 288
Query: 313 VSDNGGA-AAGFNLNA--------ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVA 363
SDNGG +A F A N P R K +L+EGG R L P G V
Sbjct: 289 QSDNGGPRSAQFTGEVDTSGGTIPADNGPYRDGKASLYEGGTRVVALANWPGKIQPGTVV 348
Query: 364 EQYVHVSDWLPTLLSAANKS 383
+H+ D PTL A+ S
Sbjct: 349 NHPIHIVDMYPTLTGLASVS 368
>gi|395510440|ref|XP_003759483.1| PREDICTED: arylsulfatase B [Sarcophilus harrisii]
Length = 659
Score = 248 bits (633), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 137/332 (41%), Positives = 192/332 (57%), Gaps = 27/332 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+F+LADDLGWNDVG+H I TP++DALA G+ L+NYYT LCTPSRS ++TG++
Sbjct: 174 PHIVFVLADDLGWNDVGYHD-SNIFTPHLDALAAGGVRLENYYTQPLCTPSRSQLLTGRY 232
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
IHTG+QH +++ C+ LPL EK+LP+ L+E GY T +VGKWHLG ++KE PT RGF+
Sbjct: 233 QIHTGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMFRKECLPTRRGFD 292
Query: 180 SHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
+ GY G +DY+ H A + LD R + A + YST++FT +A+D+I
Sbjct: 293 TFFGYLLGSEDYYSHKHCVHIDALNVTRCALDFRDGEDVAEGYNNTYSTNIFTEKAIDLI 352
Query: 234 HNHSTD---EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
H E L + L + L + + ++ + + I+H
Sbjct: 353 AKHPAQKGTEDLIVILDSRGVPT------LVSSEEQVSFLCKDQSSDHTSL-QIIH---- 401
Query: 291 SVGKVVEALEQRRMLSNSIIVF-VSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
G + R NS F DNGG L +NWPLRG K TLWEGG+RG G
Sbjct: 402 -FGSIKNVQTPRASSENSSSAFPFPDNGGQ----TLAGGNNWPLRGRKWTLWEGGIRGVG 456
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ SPLL+ G+ + + +H+SDWLPTL++ A
Sbjct: 457 FVASPLLKQTGVQSRELIHISDWLPTLVNLAG 488
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ LPL EK+LP+ L+E GY T ++
Sbjct: 238 LQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVG 273
>gi|21430588|gb|AAM50972.1| RE13542p [Drosophila melanogaster]
Length = 300
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 128/286 (44%), Positives = 182/286 (63%), Gaps = 5/286 (1%)
Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
+CTPSR+A++TGK+PI+TGMQH V+ + GLPL+E + + +E GYRT ++GKWHLG
Sbjct: 1 MCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLG 60
Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW--GLDMRRDLEPAWDLHGKYSTD 223
++ +TPT RGF+ HLGY + DY+ S E+ G D R L+ D G Y TD
Sbjct: 61 LSQRNFTPTERGFDRHLGYLGAYVDYYTQSYEQQNKGYNGHDFRDSLKSTHDHVGHYVTD 120
Query: 224 VFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
+ T AV I +H + +PLFL L H A H+AN +P+QAP ++ +I +
Sbjct: 121 LLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRY 180
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
+AA++ +LD+SVG V++AL ++ ML NSII+F+SDNGG G + ASN+PLRG KN+
Sbjct: 181 YAAMVSRLDKSVGSVIDALARQEMLQNSIILFLSDNGGPTQGQHSTTASNYPLRGQKNSP 240
Query: 341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
WEG +R + IWS E G V +Q +++ D LPTL +AA S P
Sbjct: 241 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAGISPDP 286
>gi|198417507|ref|XP_002121051.1| PREDICTED: similar to arylsulfatase B [Ciona intestinalis]
Length = 518
Score = 247 bits (631), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 133/352 (37%), Positives = 205/352 (58%), Gaps = 12/352 (3%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
+A A L L + + ++ ++ PP+I+ +LADDLGWNDV +H + PN+ LA
Sbjct: 4 FVATAWLALQWLMPLIEGTNGQTTNPPNIVMVLADDLGWNDVSWHN-SIVQMPNLQDLAE 62
Query: 94 SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
G+IL++ Y + CTPSR+A +TG++PI+TGMQ V+ + GLP+ K+LP YLK+ G
Sbjct: 63 RGVILEHAYAQEKCTPSRAAFLTGRYPINTGMQEEVVVATQMSGLPIEFKLLPSYLKDQG 122
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y T ++GKWH+G+ + YTPT RGF+SH G++ Y ++S+ E G D R DL
Sbjct: 123 YATHMIGKWHVGYCDEAYTPTRRGFDSHYGFYNSGISYSNYSSTEGTDVGYDYRDDLALN 182
Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR-H 272
GKY+T FT +A +I NH P+FLY+A+ A H+ P + + Y +I+ +
Sbjct: 183 LAAEGKYTTTDFTDQAKTLIDNHDQTNPMFLYMAYNAPHT-----PFEVEESYRDIYDGN 237
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
+ D R + ++ LDE VG++V+ L++ M SN++ VF SDNGG + +N+P
Sbjct: 238 LRDGNRKTYLGMISALDEQVGQLVDKLKEVGMWSNTVFVFYSDNGGTQPQSG-QSGNNFP 296
Query: 333 LRGVKNTLWEGGVRGAGLIWSPL----LESRGIVAEQYVHVSDWLPTLLSAA 380
LRG K +L+EGG R + + LE + H+SD T ++ A
Sbjct: 297 LRGKKGSLFEGGYRVPAFLIARTRAGNLELIASTSSTLFHISDMFATFIALA 348
>gi|449278684|gb|EMC86475.1| Arylsulfatase B, partial [Columba livia]
Length = 431
Score = 247 bits (631), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 120/267 (44%), Positives = 174/267 (65%), Gaps = 15/267 (5%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH +++ C+ LPL EK+LP+ L+E GY T +VGKWHLG YKKE PT RGF++
Sbjct: 1 IHTGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMYKKECLPTHRGFDT 60
Query: 181 HLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+ GY G +DY+ H A+ + LD R E A YST++FT A+D+I
Sbjct: 61 YFGYLLGSEDYYSHDRCVLIKAKNITRCALDFRDGEEVATGFKNMYSTNLFTERAIDLIA 120
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
+H T++PLFLYLA + H EPL+ P+ Y+ + I+D KR +A ++ +DE+VG
Sbjct: 121 HHKTEKPLFLYLAFQSVH-----EPLEVPEEYMKPYSSIKDAKRRHYAGMVTLMDEAVGN 175
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
+ +AL++ + +N+++VF +DNGG L +NWPLRG K TLWEGGVRG G + SP
Sbjct: 176 LTDALKRYGLWNNTVLVFSTDNGGQ----TLAGGNNWPLRGRKWTLWEGGVRGVGFVASP 231
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAAN 381
LL+ +G+ + + +H+SDWLPTL+ A
Sbjct: 232 LLKQKGVESHELIHISDWLPTLVHLAG 258
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ LPL EK+LP+ L+E GY T ++
Sbjct: 5 LQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVG 40
>gi|345316675|ref|XP_001517879.2| PREDICTED: arylsulfatase B-like [Ornithorhynchus anatinus]
Length = 782
Score = 247 bits (631), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 129/300 (43%), Positives = 181/300 (60%), Gaps = 20/300 (6%)
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
KNY V + + T + IHTG+QH +++ C+ +PL EK+LP+ LKE GY T +
Sbjct: 332 KNYGYVPSYLNATKSPHTKREQIHTGLQHQIIWACQPNCVPLDEKLLPELLKEAGYATHM 391
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEP 212
VGKWHLG Y+KE PT RGF+S+ GY G +DY+ H + + LD R E
Sbjct: 392 VGKWHLGMYRKECLPTRRGFDSYFGYLLGSEDYYSHERCVLIRSLNVTRCALDFRDGEEV 451
Query: 213 AWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
A YST+VF AVD+I NH D+PLFLYLA + H EPLQ P+ Y+ +
Sbjct: 452 AVGYKNMYSTNVFAKRAVDLIANHPPDKPLFLYLAFQSVH-----EPLQVPEEYVKPYSF 506
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
I++ KR +A ++ +DE+VG V AL++ + N++++F +DNGG L +NWP
Sbjct: 507 IQNKKRRNYAGMVSIMDEAVGNVTAALKRHGLWDNTVLIFSTDNGGQ----TLAGGNNWP 562
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNST 392
LRG K TLWEGGVRG G + SPLL+ +G+ + +H+SDWLPTL++ A +VN T
Sbjct: 563 LRGRKWTLWEGGVRGIGFVASPLLKQKGVKNRELIHISDWLPTLVNLAG-----GHVNGT 617
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LP+ LKE GY T ++
Sbjct: 358 LQHQIIWACQPNCVPLDEKLLPELLKEAGYATHMVG 393
>gi|158424485|ref|YP_001525777.1| twin-arginine translocation pathway signal [Azorhizobium
caulinodans ORS 571]
gi|158331374|dbj|BAF88859.1| twin-arginine translocation pathway signal precursor [Azorhizobium
caulinodans ORS 571]
Length = 490
Score = 246 bits (629), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 141/332 (42%), Positives = 196/332 (59%), Gaps = 23/332 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI++ILADDLG+ DVGFHG D I TPN+D LA +G L +YT +CTP+R+A+MTG++
Sbjct: 47 PHILYILADDLGFADVGFHGSD-IKTPNLDKLAATGATLGQFYTQPMCTPTRAALMTGRY 105
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P+ G+Q V+ GL E +LPQ LK +GY T ++GKWHLG K+++ P RGF+
Sbjct: 106 PLRYGLQTGVIPSGASYGLATDEFLLPQALKSVGYSTALIGKWHLGHAKQDFWPRQRGFD 165
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
G G D++ H A + W D ++ +E +D T++F +A +I H
Sbjct: 166 YFYGPLVGEIDHYKHEAHGVVDWYRDNKQVVEEGYD------TELFGTDAARLIGAHDPK 219
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLYLA A H+ P QAP Y++ + +I D R +AA++ +D+ +GKVV AL
Sbjct: 220 TPLFLYLAFTAPHT-----PFQAPQAYVDRYPNITDPARRLYAAMISAMDDQIGKVVAAL 274
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLN--------AASNWPLRGVKNTLWEGGVRGAGLI 351
EQR M N++IVF SDNGG + + ASN P R K TL+EGG R A L
Sbjct: 275 EQRGMRENTLIVFHSDNGGTRSKMFVGEGAFHGELPASNAPYRDGKGTLYEGGTRVAALA 334
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTL--LSAAN 381
P + G A+ +HV D +PTL L+ AN
Sbjct: 335 NWPGRITPG-AADGVMHVVDMMPTLARLAGAN 365
>gi|357023853|ref|ZP_09086021.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
gi|355544286|gb|EHH13394.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
Length = 501
Score = 246 bits (629), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 142/331 (42%), Positives = 194/331 (58%), Gaps = 23/331 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I++I+ADD+G+ D GF+G D IPTPN+D LA G L+ +Y + +CTP+R+A+MTG++
Sbjct: 60 PNIVYIVADDMGFGDAGFNGSD-IPTPNLDKLAAEGARLEQFYALPMCTPTRAALMTGRY 118
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P+ G+Q V+ LP+ E +LPQ LK+ GY T +VGKWHLG K E+ P RGF+
Sbjct: 119 PLRYGLQVGVIPAAGTYSLPVDEYLLPQALKDTGYTTAMVGKWHLGHAKPEFWPRQRGFD 178
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
G G D+F HS+ +K W + + E +D +F EAV ++ H
Sbjct: 179 YFYGALVGEIDHFKHSSHGVKDWYRNNKPLNETGFD------NTLFGNEAVRVVERHEGK 232
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLYLA A H+ P QAP YL+ + I D R K+AA++ +D+ VGKVV AL
Sbjct: 233 SPLFLYLAFTAPHT-----PFQAPQDYLDRFKDIADDNRRKYAAMISVMDDGVGKVVAAL 287
Query: 300 EQRRMLSNSIIVFVSDNGG---------AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
E++ M N+IIVF SDNGG + G L ASN P R K TL+EGG R A L
Sbjct: 288 EKKGMRDNTIIVFHSDNGGVKNSLFAGDSKVGGEL-PASNGPYRDGKGTLYEGGTRVAAL 346
Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
P G+V + VHV D PTL + A+
Sbjct: 347 ANWPGKIKPGVV-DSMVHVVDMYPTLAAVAD 376
Score = 42.7 bits (99), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 39/66 (59%), Gaps = 2/66 (3%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDN 569
+DG++VW LS +PS R+ +++N+D A+ +G WKLV + S+ N ++D
Sbjct: 385 LDGMNVWDALSEGKPSPRSEVVYNVDP--LAGAVRKGDWKLVWKASLPPKVELFNLADDK 442
Query: 570 SYQNEI 575
S +++
Sbjct: 443 SEAHDL 448
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 28/42 (66%), Gaps = 2/42 (4%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
+DG++VW LS +PS R+ +++N+D A+ KG WKLV
Sbjct: 385 LDGMNVWDALSEGKPSPRSEVVYNVDP--LAGAVRKGDWKLV 424
>gi|354488427|ref|XP_003506371.1| PREDICTED: arylsulfatase I [Cricetulus griseus]
Length = 560
Score = 246 bits (628), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 126/281 (44%), Positives = 180/281 (64%), Gaps = 11/281 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
A S PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 96 AAPSQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQ 154
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGF++KE P
Sbjct: 155 LLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKECLP 214
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
+ RGF++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I
Sbjct: 215 SCRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGETVAWGLSGQYSTMLYAQRASHI 274
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
+ +HS PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V
Sbjct: 275 LASHSPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAV 329
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
+ AL++ +NS+I+F SDNGG + SNWPL
Sbjct: 330 RNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPL 366
>gi|291227811|ref|XP_002733876.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 539
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 134/345 (38%), Positives = 195/345 (56%), Gaps = 18/345 (5%)
Query: 44 FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
L ++ ++ ++ PHI+FILADDLG+ DVG+ + TPNID LA G+ L+ +Y
Sbjct: 45 LILLTSYLTVIQATQRPHIVFILADDLGYFDVGYRNGSIVKTPNIDKLAAEGVKLERHYA 104
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKI-LPQYLKELGYRTRIVGKW 162
C PSRS +M G++ IHTG + G + I +P LKE GY T +VGKW
Sbjct: 105 QPSCMPSRSCLMMGRYQIHTGFNYKCTDGSGSQLCMHPDTITIPMKLKENGYATHMVGKW 164
Query: 163 HLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA----EEMKMWGLDMRRDLEPAWDLHG 218
HLG + E P +GF++ GY +DY+ H + E +W R + A + +G
Sbjct: 165 HLGNIRWECLPNAKGFDTFFGYHGASEDYYTHFSPAGRECRDLW----RNRDDVAQEYYG 220
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
+YST +FT EA++II NH +P+FLYL + A H PLQ P+ Y N++ D R
Sbjct: 221 QYSTHIFTNEALNIIENHDVSKPMFLYLPYQAVHG-----PLQVPEKYYNMYERPLDDIR 275
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
K+AA+ +DE +G V AL+QR M N++ +FVSDNG NWPLRG K
Sbjct: 276 RKYAAMATCMDEGIGNVTNALKQRGMWDNTVFIFVSDNGAQFP----TGGRNWPLRGGKG 331
Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+++EGG+R + SPL+E + + +H++DWLPTL A +
Sbjct: 332 SVFEGGIRVVSFVTSPLIERPQRSSNEMIHLTDWLPTLTHLAGGT 376
>gi|440902784|gb|ELR53530.1| Arylsulfatase B, partial [Bos grunniens mutus]
Length = 431
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 123/269 (45%), Positives = 167/269 (62%), Gaps = 15/269 (5%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT RGF++
Sbjct: 3 IHTGLQHQIIWPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDT 62
Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+ GY G +DY+ H A + LD R E A YST+VFT A +I
Sbjct: 63 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTERATTLIT 122
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A + +DE+VG
Sbjct: 123 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASVMDEAVGN 177
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V ALE+R + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG G + SP
Sbjct: 178 VTAALERRGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASP 233
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKS 383
LL+ +G+ + +H+SDWLPTL+ A S
Sbjct: 234 LLKRKGVKTRELIHISDWLPTLVKLAGGS 262
Score = 42.7 bits (99), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 7 LQHQIIWPCQPSCIPLDEKLLPQLLKEAGYATHMVG 42
>gi|154248610|ref|YP_001419568.1| sulfatase [Xanthobacter autotrophicus Py2]
gi|154162695|gb|ABS69911.1| sulfatase [Xanthobacter autotrophicus Py2]
Length = 491
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 149/347 (42%), Positives = 197/347 (56%), Gaps = 22/347 (6%)
Query: 46 LSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
LS++ A+ P PHI++ILADDLG+ DVGFHG D I TPN+D LA G L +YT
Sbjct: 34 LSLLSSGARAADAPRPHIVYILADDLGFADVGFHGSD-IKTPNLDHLAAQGARLGQFYTQ 92
Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
CTP+R+A +TG++P+H G+Q + + GL E +LPQ LK++GYRT +VGKWHL
Sbjct: 93 PFCTPTRAAFLTGRYPLHYGLQVGAIPSGAKYGLATDEFLLPQALKDVGYRTALVGKWHL 152
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
G +++ P RGF+S G G D+F H A + W D + E +D T++
Sbjct: 153 GHADQKFWPRQRGFDSFYGPLVGEIDHFKHEAHGVTDWYHDNTQVKEEGYD------TEL 206
Query: 225 FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAI 284
F EAV +I H PLFLYLA A P+ P QAP YL+ + HI +R +AA+
Sbjct: 207 FGKEAVRLIAAHDPKTPLFLYLAFTA-----PHTPFQAPQSYLDQYAHIAAPQRRAYAAM 261
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA----AAGFNLNA----ASNWPLRGV 336
+ +D+ +G VV AL R M N++IVF SDNGG AG A ASN P R
Sbjct: 262 ITAMDDQIGHVVAALTSRGMRENTLIVFHSDNGGTRSKMFAGEGAVAGDLPASNAPYRDG 321
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
K +L+EGG R L P + G AE +HV D LPTL A S
Sbjct: 322 KGSLYEGGTRVVALANWPGRIAPG-AAEGVMHVVDMLPTLAKLAGAS 367
>gi|298712440|emb|CBJ33216.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 726
Score = 245 bits (626), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 138/368 (37%), Positives = 206/368 (55%), Gaps = 51/368 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQI-PTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
P++ FI+ DD+GWND+G+H D TPN+D L+ SG+ + YY++ +CTP+R+A+MTG+
Sbjct: 112 PNVFFIMIDDMGWNDIGYHSTDLANVTPNLDRLSASGVKVSQYYSMSICTPARAALMTGR 171
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+P+ G+Q+NV+ GLPL+EK+LP+Y+ E GY + +VGKWHLG Y +TP RGF
Sbjct: 172 YPVRYGLQYNVIQPGAPWGLPLTEKLLPEYMNEAGYESHMVGKWHLGSYTHAHTPHRRGF 231
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWG--------------LDMRRDLEP------------ 212
E++ GY + Y+ H + G D+ +P
Sbjct: 232 ETYFGYLNDEEMYWTHQTWTATINGRKFFDFGFGNATGFYDVIERFDPPPGDDDLVSTGP 291
Query: 213 -------AWDLHGKYSTDVFTAEAVDIIHNHS--TDEPLFLYLAHAATHSANPYEPLQA- 262
+ ++ G YST++FT A++I+ + + PLFLYL+H A H P A
Sbjct: 292 TSSVYSSSLEIKGDYSTEIFTDRALEILSQKTPHDENPLFLYLSHQAVHDPLGVPPADAF 351
Query: 263 -PDHYLNIHRHIEDFK-----RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
P+ + + +ED R +FA +L LD+S+G++V+ LE L NSIIV SDN
Sbjct: 352 SPEEWA-VLTALEDESSDASLRVRFAKVLMYLDKSIGRLVDYLETEGWLENSIIVVASDN 410
Query: 317 GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL---ESRGIVAEQYVHVSDWL 373
GG + SN+PLRG+K++ WEGGV+ ++SP E G +HV+DWL
Sbjct: 411 GGCPS----TGGSNYPLRGLKHSNWEGGVKVPAFVYSPSHIPEEHWGTEYRGLMHVTDWL 466
Query: 374 PTLLSAAN 381
PTL AAN
Sbjct: 467 PTLADAAN 474
>gi|291227809|ref|XP_002733875.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 505
Score = 245 bits (626), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 141/350 (40%), Positives = 198/350 (56%), Gaps = 15/350 (4%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
AVL F+ ++ + V P+IIFILADDLG+ DVG+ + TPNID LA G+
Sbjct: 5 IAVLLAIFSFALQWAHGVTEQ--PNIIFILADDLGYFDVGYREGSIVKTPNIDKLAAEGV 62
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG-CERGGLPLSEKILPQYLKELGYR 155
L+ YY C PSRS +M G++ IHTG + L G R + LP L++ GY
Sbjct: 63 KLERYYAQSSCMPSRSCLMMGRYQIHTGFDYRCLDGQLTRLCMAPDTVTLPMKLRQYGYA 122
Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE-PAW 214
T ++GKWHLG +KE PT +GF++ GY ++Y+ H+A D+ R++E A
Sbjct: 123 THMIGKWHLGHERKECVPTHKGFDTFFGYHGAAENYYTHTALGRPR-CHDLWRNMENVAE 181
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY-LNIHRHI 273
D G+YST +T EA DII NH +P F+YL++ A H PL+ P +Y + +
Sbjct: 182 DYDGQYSTLFYTKEAQDIIKNHDKKKPFFMYLSYQAVHG-----PLEVPSNYRMPYYTSG 236
Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
D R +AA+ +DE +G + E L+ ML+N+I+VF SDNG A F+ NWPL
Sbjct: 237 VDESRQIYAAMATCMDEGIGNITETLKASGMLNNTILVFSSDNG---AQFH-TGGRNWPL 292
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
RG K T+WEGGVR + SPLL+ V + +H SDW PT + A +
Sbjct: 293 RGGKGTMWEGGVRVLAFVNSPLLKKSRTVNHEMIHTSDWFPTFVRLAGGT 342
>gi|391330456|ref|XP_003739676.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 631
Score = 245 bits (625), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 134/330 (40%), Positives = 189/330 (57%), Gaps = 18/330 (5%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II +LADDLG++DV FHG +QIPTPN+DA+A G+IL +Y TPSR A TGK
Sbjct: 42 PNIIVVLADDLGYDDVSFHGNEQIPTPNLDAMAADGVILNRHYAAMSGTPSRGAFFTGKL 101
Query: 120 PIHTGMQHN-VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
P+ G+ +L G GL L ++LP Y+++LGY T ++G+W LGFYK+ PT RGF
Sbjct: 102 PLRIGLNEGPILKGVYGTGLSLEHEVLPFYMRDLGYETHLIGRWGLGFYKESLLPTNRGF 161
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMW-------GLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
++H G ++ Y H + E W D+ RD +P + G Y TD++
Sbjct: 162 DTHYGPYSDSASYSSHLSRE-DAWKSLSVPPAYDLHRDGKPDFSGFGSYVTDLYKGRFER 220
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN---IHRHIEDFKRSKFAAILHKL 288
II +PLF+ L+H H A+ + PL P N HI+D R +A + L
Sbjct: 221 IIEQRR--KPLFIVLSHQTPHGAS-FGPLHQPPPRTNRASQFLHIKDRSRRSYARTVESL 277
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA--GFNLNAASNWPLRGVKNTLWEGGVR 346
D+S +V+ L++ +L+ S+I+F SDNGG + ASNWPLRG KNT WEGGVR
Sbjct: 278 DDSFAHMVDVLQRNNILNESVILFSSDNGGTTFKDPYFKTGASNWPLRGQKNTQWEGGVR 337
Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+ +WS + + V+E+ H DWLPT
Sbjct: 338 VSSFVWSSQI-TEPYVSEELYHFVDWLPTF 366
>gi|326437895|gb|EGD83465.1| hypothetical protein PTSG_04073 [Salpingoeca sp. ATCC 50818]
Length = 562
Score = 244 bits (622), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 141/337 (41%), Positives = 193/337 (57%), Gaps = 44/337 (13%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
SS P+I+FI+ DDLG++DVGF QI TPNID +++I+
Sbjct: 31 SSSQPNIVFIVIDDLGFDDVGFKS-HQIKTPNID---------------------QASIL 68
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
+G++ +H G+ N + + GLPL+ LPQ LK GY T +GKWHLGFYK +YTPTF
Sbjct: 69 SGRYAMHHGIV-NWIPPKDSYGLPLNHTTLPQLLKNGGYDTHAIGKWHLGFYKWDYTPTF 127
Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP---------AWDLHGKYSTDVFT 226
RGF S LGY++G ++YF H DM RD P A+DL G+YST +F+
Sbjct: 128 RGFNSFLGYYSGGENYFTHKNGP----AYDMHRDPLPSCGQNCSQIAFDLQGQYSTTIFS 183
Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFAAIL 285
EAV +I +H +PLFLYLA+ A H EP QAP Y++ + I D +R FA +L
Sbjct: 184 DEAVRVIDDHIGPKPLFLYLAYQAVH-----EPAQAPQSYIDPYTDLIPDAQRRTFAGML 238
Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
LDE +G V AL++ +N++IVF +DNGG G + A N+P+RG K+ ++EGGV
Sbjct: 239 SCLDEGIGNVTRALKRNGFYNNTLIVFTTDNGGPILGGDSVGARNYPMRGGKHAVYEGGV 298
Query: 346 RGAGLIWSP--LLESRGIVAEQYVHVSDWLPTLLSAA 380
RG + P G +H +DWLPTL AA
Sbjct: 299 RGIAFVHPPQNTFVETGAGYHNLMHAADWLPTLCDAA 335
>gi|332016485|gb|EGI57378.1| Arylsulfatase I [Acromyrmex echinatior]
Length = 502
Score = 244 bits (622), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 121/258 (46%), Positives = 165/258 (63%), Gaps = 2/258 (0%)
Query: 124 GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG 183
GMQ + G E GLPLS +ILP++L+ LGY T+++GKWHLG++ ++TP RGF+S LG
Sbjct: 2 GMQGESIQGGEPRGLPLSVRILPEHLRGLGYTTKMIGKWHLGYHTPQHTPLHRGFDSFLG 61
Query: 184 YWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLF 243
++ H Y+D+ M G DM R PA+ L KY TD+FT EAV II H PL+
Sbjct: 62 FYNSHVSYYDYKYSYQNMSGYDMHRGDAPAYGLTDKYVTDLFTDEAVRIIQTHDPSRPLY 121
Query: 244 LYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRR 303
L ++H A H+ P E Q DHY HI + R K+A ++ KLD+SVG +V L R
Sbjct: 122 LQISHLAVHA--PLENPQDYDHYDKRFMHIVEQNRRKYAKMVSKLDDSVGLIVNTLGSRG 179
Query: 304 MLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVA 363
ML +S+I+F++DNG A G N SN+PLRG+K TL+EGGVRG ++WSP L V
Sbjct: 180 MLRDSVILFLTDNGAAPVGKFRNYGSNFPLRGMKYTLYEGGVRGVAVLWSPRLSKAARVC 239
Query: 364 EQYVHVSDWLPTLLSAAN 381
+H++DWLPTL +AA+
Sbjct: 240 NNLMHITDWLPTLYAAAD 257
>gi|291235506|ref|XP_002737685.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 658
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 141/334 (42%), Positives = 194/334 (58%), Gaps = 28/334 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+F+LADDLGW+DVG+HG I TPNID LA G+ L+NYY C PSR ++TG++
Sbjct: 204 PNIVFVLADDLGWHDVGYHG-SIIDTPNIDHLAAEGVKLENYYVSSWCAPSRVNLLTGRY 262
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL-GFYKKEYTPTFRGF 178
I TG+ +V C+ G + E L L E GY T +VGKWHL GF +E P RGF
Sbjct: 263 RIRTGLYGDV---CDFMG--IHEITLADKLYEAGYYTAMVGKWHLSGFQHRECYPAHRGF 317
Query: 179 ESHLGYWTGHQDYFDH----SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
++ LG+ G Q+YF H S E W D + G+YST V+ EA II
Sbjct: 318 QTFLGFHGGSQNYFTHRRGGSNSEYDFWA----NDTSIGREYDGRYSTMVYAEEAQRIIR 373
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFAAILHKLDESVG 293
+H T++PLFLYL+ A HS PL P Y + +R IED KR +AA+ +DE++G
Sbjct: 374 HHRTEQPLFLYLSFQAVHS-----PLLVPSAYEDKYRTGIEDDKRRVYAAMTSCMDEAIG 428
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
V + L++ + + ++ +F SDNGG SNWPLRG K +EGG+R G + S
Sbjct: 429 NVTDTLKETGLWNKTVFIFSSDNGGTPT----FGGSNWPLRGRKGLWYEGGIRAVGFVNS 484
Query: 354 PLLESR--GIVAEQYVHVSDWLPTLLSAANKSDI 385
LL + G + +H++DW PTLL A K+D+
Sbjct: 485 YLLSNHVTGSTNHELIHITDWFPTLLRLA-KTDL 517
>gi|403256717|ref|XP_003921000.1| PREDICTED: arylsulfatase B [Saimiri boliviensis boliviensis]
Length = 551
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 119/267 (44%), Positives = 166/267 (62%), Gaps = 15/267 (5%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
I TG+QH +++ C+ +PL EK+LPQ+LKE GY T +VGKWHLG Y+KE PT RGF++
Sbjct: 123 IRTGLQHQIIWPCQPSCVPLDEKLLPQFLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 182
Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+ GY G +DY+ H A + LD R E A YST++FT A +I
Sbjct: 183 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAATLIT 242
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++ +DE+VG
Sbjct: 243 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGN 297
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V AL+ R + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG G + SP
Sbjct: 298 VTAALKSRGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASP 353
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAAN 381
LL+ +G+ + +H+SDWLPTL+ A
Sbjct: 354 LLKQKGVKNRELIHISDWLPTLVKLAG 380
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ+LKE GY T ++
Sbjct: 127 LQHQIIWPCQPSCVPLDEKLLPQFLKEAGYTTHMVG 162
>gi|291224485|ref|XP_002732234.1| PREDICTED: jumonji domain containing 2c [Saccoglossus kowalevskii]
Length = 1941
Score = 241 bits (616), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 134/359 (37%), Positives = 201/359 (55%), Gaps = 26/359 (7%)
Query: 31 RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
R R++ + +A + + D + PPHI+ ILADDLGWND+G++ L QI TP +D
Sbjct: 16 RYRLVTAFLFAVAAIMFITLYD--KKTNPPHIVLILADDLGWNDIGWNNL-QIKTPVLDK 72
Query: 91 LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
LAY G+I Y LCTP+R+A+MTG +P GMQH ++ + GLPL KILPQ LK
Sbjct: 73 LAYEGVIFNQTYVQPLCTPTRAALMTGYYPFRIGMQHQMVLPFQPSGLPLHLKILPQKLK 132
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDY---------FDHSAEEMKM 201
+ GY IVGKWHLG+ EYTP RGF+S G ++ ++ DHS K
Sbjct: 133 QAGYINHIVGKWHLGYCNWEYTPLNRGFDSFYGSFSNSVNHNNKISQLPISDHS----KY 188
Query: 202 WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQ 261
G D R + + G+ T +FT VDII NH D P+F+Y P + +
Sbjct: 189 KGYDFRDNTGVVQN-DGQPLTKLFTQRVVDIISNHHKDYPMFMYFP-----INQPSKGSE 242
Query: 262 APDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA 321
+ Y ++ ++ D + ++ A L +D++VG V++AL+ R M ++++F+SDNG
Sbjct: 243 PDEKYTALYPNVTDDQVRQWYATLSLMDDAVGHVIDALKSRGMWEETLLIFISDNG---- 298
Query: 322 GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
F+ +N PLRG TL+EGG+R + +++ G + + H++DW T+LS A
Sbjct: 299 AFDFMEGTNLPLRGAGTTLFEGGMRVPAIAHGKMIKKTGYINNELNHITDWHSTILSLA 357
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 60/124 (48%), Gaps = 24/124 (19%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNID--DEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
++DGI++W +S + S RN I++NID DE + +AL G WK++ TG
Sbjct: 364 DLDGINIWGTVSEGKASARNEIVYNIDEIDECRGAALRLGDWKVI------------TGN 411
Query: 632 PDQV--YLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNN 689
P + +L WL + R + A + V LF++K DP E++N
Sbjct: 412 PGLMCRFLDLNKQDGWLQ-SGRHSDNQAITDFQSISNV-------SYLFNLKEDPEERHN 463
Query: 690 LADR 693
+A++
Sbjct: 464 VAEK 467
Score = 43.9 bits (102), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 32/48 (66%), Gaps = 2/48 (4%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNID--DEWQISALTRGKWKLVKEN 554
++DGI++W +S + S RN I++NID DE + +AL G WK++ N
Sbjct: 364 DLDGINIWGTVSEGKASARNEIVYNIDEIDECRGAALRLGDWKVITGN 411
>gi|301769831|ref|XP_002920339.1| PREDICTED: arylsulfatase B-like [Ailuropoda melanoleuca]
Length = 519
Score = 241 bits (616), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 119/269 (44%), Positives = 166/269 (61%), Gaps = 15/269 (5%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT RGF++
Sbjct: 91 IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 150
Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+ GY G +DY+ H A + LD R E A YST++FT A +I
Sbjct: 151 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIT 210
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++ +DE+VG
Sbjct: 211 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRHYYAGMVSLMDEAVGN 265
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGG+RG G + SP
Sbjct: 266 VTAALKSHGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGIRGVGFVASP 321
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKS 383
LL+ +G+ + +H+SDWLPTL+ A S
Sbjct: 322 LLKQKGVKNRELIHISDWLPTLVKLARGS 350
Score = 43.1 bits (100), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 95 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 130
>gi|291227813|ref|XP_002733877.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 490
Score = 241 bits (615), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 128/326 (39%), Positives = 189/326 (57%), Gaps = 12/326 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHIIFILADDLG+ DVG+ I TPNID LA G+ L+ +Y C PSR+ +M G++
Sbjct: 12 PHIIFILADDLGYFDVGYRNRSVIKTPNIDKLAAEGVKLERHYAQPSCLPSRACLMMGRY 71
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKI--LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
IHTG + + R ++ I LP +K+ GY T ++GKWHLG + P +G
Sbjct: 72 QIHTGYRDECMND-TRYQRCMNHDIVTLPMKMKQNGYVTHMIGKWHLGNNNWDCLPNAKG 130
Query: 178 FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
F+++ GY +DY+ H + R ++ A G+YST +FT EAV+II NH
Sbjct: 131 FDTYFGYNAAAEDYYTHMLSGRQNCSDLWRDRMDVADKYIGQYSTRIFTEEAVNIIENHD 190
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+P+F+YLAH A H+ PLQ P Y +++ + R K+AA++ +DE +G V E
Sbjct: 191 ISQPMFMYLAHQAVHA-----PLQVPQEYYSLYETTLNDVRRKYAAMVTCMDEGIGNVTE 245
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ M N+++VFVSDNGG + NWPLRG K +++EGG+R + S L+E
Sbjct: 246 ALKESGMWDNTVLVFVSDNGGELSA----GGRNWPLRGGKGSVFEGGIRTVSFVTSSLIE 301
Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKS 383
++ +H++DW PTL A +
Sbjct: 302 RPQRISNAMIHITDWFPTLTHLAGGT 327
>gi|198434445|ref|XP_002131042.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 512
Score = 240 bits (613), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 128/349 (36%), Positives = 199/349 (57%), Gaps = 19/349 (5%)
Query: 43 AFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ---IPTPNIDALAYSGIILK 99
+F L V + ++ PH++ ILADDLG+NDVG+ G + TP +D+LA +G+ L+
Sbjct: 12 SFLLICVVGTVAEATKKPHVVLILADDLGYNDVGYWGQNHGSAAKTPFLDSLAENGVRLE 71
Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
NYYT +C+P+R A+MTG++ I G+ H +++ + GLPL +LP+ L GY T+++
Sbjct: 72 NYYTHSVCSPTRGALMTGRNRIDIGLAHGIIHTTQIEGLPLDNVLLPEQLSNCGYNTQMI 131
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH---SAEEMKMWGLDMRRDLE-PAWD 215
GKWHLGF +Y P RGF G+ G ++Y+ A + G+D P +
Sbjct: 132 GKWHLGFSSSKYAPWNRGFHGFYGFLAGSENYWSKWLPMARHSNIGGVDFTDSTTGPTNE 191
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
G+YS V+ + A +I +H +PLFLYL P+ PL AP HY + IED
Sbjct: 192 TWGQYSAHVYASRARYVIQHHDQSKPLFLYLP-----LQTPHTPLGAPSHYYEPFKDIED 246
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
R K+ +++ LDE+V V L+ M +++++F +DNGG + +NWP RG
Sbjct: 247 DDRMKYLSMVSVLDETVRNVTNYLKDAGMWEDTLLIFSTDNGGEVK----HGGNNWPYRG 302
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGI---VAEQYVHVSDWLPTLLSAAN 381
KNTL+EGGV+ G + +L V + +HV+DW PT+++AA+
Sbjct: 303 TKNTLYEGGVKAVGFVHGKILGDLSPAQRVNRELIHVTDWYPTIMAAAD 351
>gi|281353470|gb|EFB29054.1| hypothetical protein PANDA_009046 [Ailuropoda melanoleuca]
Length = 431
Score = 240 bits (612), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 119/269 (44%), Positives = 166/269 (61%), Gaps = 15/269 (5%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT RGF++
Sbjct: 3 IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 62
Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+ GY G +DY+ H A + LD R E A YST++FT A +I
Sbjct: 63 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIT 122
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++ +DE+VG
Sbjct: 123 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRHYYAGMVSLMDEAVGN 177
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGG+RG G + SP
Sbjct: 178 VTAALKSHGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGIRGVGFVASP 233
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKS 383
LL+ +G+ + +H+SDWLPTL+ A S
Sbjct: 234 LLKQKGVKNRELIHISDWLPTLVKLARGS 262
Score = 42.7 bits (99), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 7 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 42
>gi|156340112|ref|XP_001620356.1| hypothetical protein NEMVEDRAFT_v1g148421 [Nematostella vectensis]
gi|156205165|gb|EDO28256.1| predicted protein [Nematostella vectensis]
Length = 260
Score = 239 bits (611), Expect = 4e-60, Method: Composition-based stats.
Identities = 117/254 (46%), Positives = 167/254 (65%), Gaps = 10/254 (3%)
Query: 49 VFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGIILKNYYTVQLC 107
V V +++ PPHI+FIL DDLGW+DVG+H + + TPNID LA G+ L +YY+ +C
Sbjct: 14 VMVARTSATKPPHILFILMDDLGWSDVGYHNISHAVKTPNIDKLASQGVKLMSYYSQPMC 73
Query: 108 TPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
TPSR A+MTGK+PIH GMQH V+ G+P +PQ L+ LGYRT ++GKWHLGF+
Sbjct: 74 TPSRGALMTGKYPIHLGMQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGKWHLGFF 133
Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
+YTP RGF+S LG++ G QD++ HS KM LD RRD EPA + G++STDVFT
Sbjct: 134 DWDYTPLRRGFDSFLGFFAGEQDHWRHS----KMGFLDFRRDEEPANEYGGQHSTDVFTQ 189
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
EA++I H+ +PLFL L++AA H+ PLQA + +N + D R + ++
Sbjct: 190 EAINIAMRHNASQPLFLLLSYAAVHT-----PLQAHPNDVNKIGGVSDKDRQNYLGMMGA 244
Query: 288 LDESVGKVVEALEQ 301
D S+G++++ ++
Sbjct: 245 ADWSIGRLIDVYKR 258
>gi|291221493|ref|XP_002730757.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 585
Score = 239 bits (611), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 133/331 (40%), Positives = 195/331 (58%), Gaps = 17/331 (5%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHI+ +LADD+GWNDVG++ D +PTP ++ LA +G+IL N Y+ C+PSR+A++TGK
Sbjct: 107 PPHIVLVLADDVGWNDVGWNN-DFMPTPILNELASNGVILNNTYSQPACSPSRAALLTGK 165
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+P + G+QH V+ LPL +L LKELGY +GKWHLGF +YTP +RGF
Sbjct: 166 YPANAGIQHLVVQEQHPYYLPLHNTLLSTKLKELGYMNHAIGKWHLGFCNWKYTPLWRGF 225
Query: 179 ESHLGYWTGH-QDYFDHSAE-----EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
+S G + G+ DY H E GLD+R + +G + T +FT A I
Sbjct: 226 DSFYGIFNGYLSDYSTHIVHSPFIGEPGASGLDLRDNTGVVAHENGTHVTYLFTERAERI 285
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I NH+ PLFLY A P+ PL AP + N + D +R ++A ++ +DE+V
Sbjct: 286 IRNHNPAAPLFLYTALLV-----PHGPLDAPPGFEN-SVEVNDTRRREYAGMISAMDEAV 339
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G + AL+ + M ++++ +F+SDNGG A +N+P RG K +LWEG V+ G I+
Sbjct: 340 GNITNALKDKGMWNDTLFIFLSDNGGDM----FYAGNNYPFRGNKASLWEGSVKIPGFIY 395
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+LE +G + +H +D TLLSAA +
Sbjct: 396 GNMLEKKGYSNNELIHFTDLFATLLSAAGGT 426
>gi|190891646|ref|YP_001978188.1| sulfatase [Rhizobium etli CIAT 652]
gi|190696925|gb|ACE91010.1| putative sulfatase protein [Rhizobium etli CIAT 652]
Length = 498
Score = 239 bits (611), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 143/340 (42%), Positives = 188/340 (55%), Gaps = 24/340 (7%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+A S P+I+FI+ADDLGW DVGFHG D I TPNID LA G L+ +Y +CTP+R+A
Sbjct: 50 LAQSTKPNIVFIVADDLGWKDVGFHGSD-IKTPNIDQLAEKGGRLEQFYAQPMCTPTRAA 108
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+MTG++P GMQ V+ GL L + +LP+ LKE GY T GKWHLG + P
Sbjct: 109 LMTGRYPFRYGMQTAVIPQGGTYGLALDDYLLPEMLKEAGYATAASGKWHLGHADTAFWP 168
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
RGF+S G G D+F H SA W + E +D +F EAV +
Sbjct: 169 RQRGFDSFYGALLGEIDHFTHKSANGNADWYRNNEAIEEAGFD------NILFATEAVRV 222
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I+ H +PLFLYLA +P+ P QAP YL+ + HI D R +AA++ +D+ +
Sbjct: 223 INEHDQSKPLFLYLAF-----TSPHTPFQAPKEYLDRNSHIADESRRAYAAMISVMDDGI 277
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN--------AASNWPLRGVKNTLWEGG 344
G+VVEALE R M N++I+F SDNGG F A N P R K T +EGG
Sbjct: 278 GRVVEALETRGMRENTLILFHSDNGGVVDAFFAGESDVKGKLPADNGPYRQGKGTTYEGG 337
Query: 345 VRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
R GLI W +E A+ +HV D PTL + A +
Sbjct: 338 TRVVGLINWPGKIEPGS--ADGMMHVVDMYPTLAALAGAT 375
Score = 43.1 bits (100), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 47/112 (41%), Gaps = 27/112 (24%)
Query: 464 NGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQ---------------- 507
NGP + YE GT R+ IN G E S D
Sbjct: 323 NGPYRQGKGTTYEGGT------RVVGLINWPGKIEPGSADGMMHVVDMYPTLAALAGATP 376
Query: 508 ---NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSI 556
+DG+D+W +L +PS R I++N+ E +A+ +G WKLV + ++
Sbjct: 377 GKNKPLDGMDMWPMLGEGKPSPRTEIVYNV--EPMTAAVRQGDWKLVWKTTL 426
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 28/42 (66%), Gaps = 2/42 (4%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
+DG+D+W +L +PS R I++N+ E +A+ +G WKLV
Sbjct: 382 LDGMDMWPMLGEGKPSPRTEIVYNV--EPMTAAVRQGDWKLV 421
>gi|429206655|ref|ZP_19197919.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
AKP1]
gi|428190241|gb|EKX58789.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
AKP1]
Length = 498
Score = 239 bits (609), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 139/334 (41%), Positives = 194/334 (58%), Gaps = 29/334 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI++IL DDLG+ DVG+HG D + TPN+D LA G L +YT LCTP+R+A+MTG +
Sbjct: 53 PHILYILVDDLGYADVGYHGSD-VKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGCY 111
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P+ G+Q V+ R GL +E +LPQ LKE GY+T +VGKWHLG ++Y P RGF+
Sbjct: 112 PMRYGLQTGVIPSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGFD 171
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
G G D+F H A + W D EP +D T++F A+A+ +I H +
Sbjct: 172 YFYGPLVGEIDHFKHEAHGITDWYRDNEMVKEPGYD------TELFGADAIRLIEEHDSA 225
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PL++YL+ A H+ P QAPD Y +++ I D R+ +AA++ +D+ VG V++AL
Sbjct: 226 TPLYMYLSFTAPHT-----PYQAPDKYKDLYPDIADEGRNAYAAMISCMDDQVGPVLQAL 280
Query: 300 EQRRMLSNSIIVFVSDNGGA----AAGFNLNAAS----NWPLRGVKNTLWEGGVRGAGLI 351
E+R M +++++F SDNGG AG A N PLR K TL+EGG R L
Sbjct: 281 ERRGMREDTLVIFHSDNGGTRSKMFAGEGAVAGELPPRNDPLREGKGTLYEGGTRVVALA 340
Query: 352 -WS---PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
W P E+ G+ +HV D LPTL A
Sbjct: 341 NWPGRIPAGETHGM-----MHVVDMLPTLAGLAQ 369
Score = 39.7 bits (91), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 570 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKL 615
++ ++DG+DVW +S + S R +++NI E AL GKWKL
Sbjct: 373 AHAGQLDGMDVWQAISAGKASPREEVVYNI--EPTQGALRDGKWKL 416
>gi|426233825|ref|XP_004023235.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Ovis aries]
Length = 475
Score = 239 bits (609), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 119/262 (45%), Positives = 161/262 (61%), Gaps = 15/262 (5%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT RGF++
Sbjct: 52 IHTGLQHQIIWPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDT 111
Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+ GY G +DY+ H A + LD R E A YST+VFT A +I
Sbjct: 112 YFGYLLGSEDYYSHERCTVIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTERATTLIT 171
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
H ++PLFLYLA + H EPLQ P+ YL + I+D R +A + +DE+VG
Sbjct: 172 THPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRRHYAGMASLMDEAVGN 226
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V ALE R + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG G + SP
Sbjct: 227 VTAALESRGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASP 282
Query: 355 LLESRGIVAEQYVHVSDWLPTL 376
LL+ +G+ + +H+SDWLP L
Sbjct: 283 LLKRKGVKTRELIHISDWLPPL 304
Score = 43.1 bits (100), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 56 LQHQIIWPCQPSCIPLDEKLLPQLLKEAGYATHMVG 91
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 49/110 (44%), Gaps = 17/110 (15%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEW-QIS-------ALTRGKWKLVKENSINGNGT 561
+DG DVW+ +S PS R +LHNID ++ IS AL + L++ ++ N +
Sbjct: 313 LDGFDVWNTISEGSPSPRMELLHNIDPDFVDISLCPGNSMALAKDDSSLLEYSAFNTSIH 372
Query: 562 SENRSNDNSYQNEIDGIDVW---------SVLSRNEPSKRNTILHNIDDE 602
+ R + G W SV+ ++P + L +ID +
Sbjct: 373 AAIRHQNWKLLTGYPGCGHWFPPPSQHNVSVIPSSDPPAKTLWLFDIDQD 422
>gi|390341601|ref|XP_796347.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 497
Score = 238 bits (608), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 117/277 (42%), Positives = 174/277 (62%), Gaps = 12/277 (4%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQ---IPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+G P+++FILADD G+NDVG+HG + + TPN+D LA G+ L+ YY +C+P+RS
Sbjct: 32 TGRPNVVFILADDFGYNDVGYHGREHGSMVLTPNLDGLAGEGVKLEKYYVQPICSPTRSQ 91
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+++G++ IHTG+Q+ V+ + LPL E LPQ LKE Y T +VGKWH+GFYK TP
Sbjct: 92 LLSGRYQIHTGLQYGVIRPAQPHCLPLDEVTLPQKLKERDYATHMVGKWHIGFYKDACTP 151
Query: 174 TFRGFESHLGYWTGHQDYFDHSAE----EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
T RGF+S+ GY +G +DY+ HS + GLD+ + PA+ G+YST +FT++A
Sbjct: 152 TERGFDSYFGYLSGAEDYYSHSRSFQIGSKTLKGLDLMANKTPAFQYKGQYSTHLFTSKA 211
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
+D+I+NH +PLFLYLA+ A HS PLQ P Y + +I R +A ++ +D
Sbjct: 212 IDVINNHERSKPLFLYLAYQAVHS-----PLQVPSKYEEPYANITSSARRAYAGMVSCMD 266
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN 326
E +G V AL + +N+II+F + + G N+
Sbjct: 267 EGIGNVTRALVDAGLYNNTIIIFSTGMSSSLVGKNMT 303
>gi|221640917|ref|YP_002527179.1| twin-arginine translocation pathway signal protein [Rhodobacter
sphaeroides KD131]
gi|221161698|gb|ACM02678.1| Twin-arginine translocation pathway signal [Rhodobacter sphaeroides
KD131]
Length = 509
Score = 238 bits (606), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 138/334 (41%), Positives = 193/334 (57%), Gaps = 29/334 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI++IL DDLG+ DVG+HG D + TPN+D LA G L +YT LCTP+R+A+MTG++
Sbjct: 64 PHILYILVDDLGYADVGYHGSD-VKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGRY 122
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P+ G+Q V+ R GL +E +LPQ LKE GY+T +VGKWHLG ++Y P RG +
Sbjct: 123 PMRYGLQTGVIPSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGVD 182
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
G G D+F H A + W D EP +D T++F A+A+ +I H +
Sbjct: 183 YFYGPLVGEIDHFKHEAHGITDWYRDNEMVKEPGYD------TELFGADAIRLIEEHDSA 236
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PL++YL+ A H+ P QAPD Y +++ I D R +AA++ +D+ VG V++AL
Sbjct: 237 TPLYMYLSFTAPHT-----PYQAPDKYKDLYPDIADEGRKAYAAMISCMDDQVGLVLQAL 291
Query: 300 EQRRMLSNSIIVFVSDNGGA----AAGFNLNAAS----NWPLRGVKNTLWEGGVRGAGLI 351
E+R M +++++F SDNGG AG A N PLR K TL+EGG R L
Sbjct: 292 ERRGMREDTLVIFHSDNGGTRSKMFAGEGAVAGELPPRNDPLREGKGTLYEGGTRVVALA 351
Query: 352 -WS---PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
W P E+ G+ +HV D LPTL A
Sbjct: 352 NWPGRIPAGETHGM-----MHVVDMLPTLAGLAQ 380
Score = 39.7 bits (91), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 570 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKL 615
++ ++DG+DVW +S + S R +++NI E AL GKWKL
Sbjct: 384 AHAGQLDGMDVWQAISAGKASPREEVVYNI--EPTQGALRDGKWKL 427
>gi|355750020|gb|EHH54358.1| Arylsulfatase B, partial [Macaca fascicularis]
Length = 430
Score = 237 bits (604), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 118/267 (44%), Positives = 164/267 (61%), Gaps = 15/267 (5%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT RGF++
Sbjct: 2 IRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 61
Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+ GY G +DY+ H A + LD R E A YST++FT A +I
Sbjct: 62 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALIT 121
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++ +DE+VG
Sbjct: 122 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGN 176
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG G + SP
Sbjct: 177 VTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASP 232
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAAN 381
LL+ +G+ + +H+SDWLPTL+ A
Sbjct: 233 LLKQKGVKNRELIHISDWLPTLVKLAG 259
Score = 42.7 bits (99), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 6 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 41
>gi|355691424|gb|EHH26609.1| Arylsulfatase B, partial [Macaca mulatta]
Length = 430
Score = 237 bits (604), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 118/267 (44%), Positives = 164/267 (61%), Gaps = 15/267 (5%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT RGF++
Sbjct: 2 IRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 61
Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+ GY G +DY+ H A + LD R E A YST++FT A +I
Sbjct: 62 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALIT 121
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++ +DE+VG
Sbjct: 122 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGN 176
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG G + SP
Sbjct: 177 VTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASP 232
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAAN 381
LL+ +G+ + +H+SDWLPTL+ A
Sbjct: 233 LLKQKGVKNRELIHISDWLPTLVKLAG 259
Score = 42.7 bits (99), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 6 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 41
>gi|219110117|ref|XP_002176810.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411345|gb|EEC51273.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 564
Score = 237 bits (604), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 141/345 (40%), Positives = 190/345 (55%), Gaps = 37/345 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+ I+ DDLG +D+G H I TP+ D LA G+ L YY + C+P+R+++++G++
Sbjct: 64 PHILMIIMDDLGSHDLGIHENSGIQTPHADQLARDGLYLDQYYVLPYCSPTRASLLSGRY 123
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P+HTG H ++ E GLPL E+ LPQ L+ GY+ VGKWH+G + TPTFRGF+
Sbjct: 124 PLHTGC-HTIVNDWETQGLPLDEETLPQVLRRAGYQAHAVGKWHVGHSRWTQTPTFRGFQ 182
Query: 180 SHLGYWTGHQDYFDH-------SAEEMKMWGLDMR----RDLEPAWDLHGKYSTDVFTAE 228
S G++ G QDY H +A EM D R RD D G YST VFT E
Sbjct: 183 SFFGFYLGAQDYNTHIKQGERGNAYEMHW---DARGKCGRDCSRLVDERGNYSTHVFTRE 239
Query: 229 AVDIIHNHST--DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE----DFKRSKFA 282
A+ +I NH EPLFLYLAH A H P Q P+ Y + +R +A
Sbjct: 240 AIRVIENHPQRPHEPLFLYLAHQAVH-----WPDQVPETYRKFYEGATYSNWTDQRKTYA 294
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
+L DES+G V +AL+ M N+++VF +DNGG A +SN+P RG K T++E
Sbjct: 295 GMLSAADESIGNVTKALQDAGMWENTLVVFTTDNGGPTAVCAAQGSSNYPKRGGKCTVYE 354
Query: 343 GGVRGAGLI----WSPLLESR----GIVAEQY---VHVSDWLPTL 376
GG G G + W+ + SR E Y HV DWLPTL
Sbjct: 355 GGTTGDGFVSGPAWNKVARSRKKEYSETLELYSKVFHVVDWLPTL 399
>gi|241834402|ref|XP_002414990.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215509202|gb|EEC18655.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 425
Score = 236 bits (603), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 122/279 (43%), Positives = 163/279 (58%), Gaps = 9/279 (3%)
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
+ LCTPSRSA+MTG +PIH GMQ V+ GLPL KILPQ+ K+LGY ++GKWH
Sbjct: 1 MPLCTPSRSALMTGLYPIHNGMQSTVILPAAPWGLPLENKILPQHFKDLGYDVNMIGKWH 60
Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM--KMWGLDMRRDLEPAWDLHGKYS 221
LGF++ Y P RGF++ G +TG DY++H++ + K + L HG
Sbjct: 61 LGFFQTPYVPMKRGFDTFFGLYTGSNDYYNHTSGSVSKKQEQITGALCLTVGGITHG--V 118
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
+ T ++ +P F Y +H A H A E QAP + +IE+ R+ +
Sbjct: 119 EPLLTDSPLNFY-----PQPFFCYFSHQAVHRALDSETFQAPARNVLKFPYIEESNRTIY 173
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
A +L LDES+G+VVEAL ML ++IIVF SDNGG G N NWPLRG K T+W
Sbjct: 174 AGMLDALDESMGRVVEALSSAGMLEDTIIVFSSDNGGGPYGIESNRGFNWPLRGAKFTVW 233
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
EGGVR +WSP + V+ Q +H+SDWLPTL +AA
Sbjct: 234 EGGVRVPAFVWSPKFLKKSRVSNQLMHISDWLPTLYTAA 272
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 61/148 (41%), Gaps = 30/148 (20%)
Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVD 627
D S IDG+D+W+ L+ SKR IL N + + IS L ++KLV V ++D
Sbjct: 275 DASALGTIDGVDMWNSLTHRSASKRKEILLNSNSLYNISGLRYKQYKLV-VGGGFDGELD 333
Query: 628 LTGGPDQVYLSG----------LSDREWLALAMRKLRDAASIQCGPVK-----EVPCE-- 670
D Y G L + A +RK + P+ V C
Sbjct: 334 -----DHYYFPGGTRPTNDIDQLRNESTTARVLRKFYEKQDRSWSPLAWSNNVAVDCRRN 388
Query: 671 -------PQIAPCLFDIKNDPCEKNNLA 691
P P LF+I+ DPCE NNLA
Sbjct: 389 KLTENFVPMQPPYLFNIEKDPCELNNLA 416
Score = 40.4 bits (93), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 30/56 (53%)
Query: 503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
D S IDG+D+W+ L+ SKR IL N + + IS L ++KLV +G
Sbjct: 275 DASALGTIDGVDMWNSLTHRSASKRKEILLNSNSLYNISGLRYKQYKLVVGGGFDG 330
>gi|405379584|ref|ZP_11033433.1| arylsulfatase A family protein [Rhizobium sp. CF142]
gi|397323967|gb|EJJ28356.1| arylsulfatase A family protein [Rhizobium sp. CF142]
Length = 502
Score = 236 bits (603), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 143/332 (43%), Positives = 184/332 (55%), Gaps = 24/332 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I++ILADDLGW DVGFHG D I TPNID LA G L+ YY +CTP+R+A MTG++
Sbjct: 60 PNIVYILADDLGWKDVGFHGSD-IKTPNIDELAEKGARLEQYYVQPMCTPTRAAFMTGRY 118
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P GMQ V+ GL L + +LP+ LKE GY T GKWHLG K + P RGF+
Sbjct: 119 PFRYGMQTAVIPQGGTYGLALDDHLLPELLKEAGYATAASGKWHLGHAKTAFWPRQRGFD 178
Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
S G G D+F H SA W + E +D +F EAV +I+ H
Sbjct: 179 SFYGALLGEIDHFTHKSANGNADWYRNNDALEEEGFD------NVLFADEAVRVINEHDQ 232
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
+PLFLYLA +P+ P QAP +L + HI D R +AA++ +D+ +G+VVEA
Sbjct: 233 AKPLFLYLAF-----TSPHTPFQAPKEFLERNSHIADESRRNYAAMISVMDDGIGRVVEA 287
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGF-----NLNA---ASNWPLRGVKNTLWEGGVRGAGL 350
LE R M N++IVF SDNGG F N+ A N P R K T +EGG R GL
Sbjct: 288 LEMRGMRENTLIVFHSDNGGVVDAFFAGESNVKGKLPADNGPYREGKGTTYEGGTRVVGL 347
Query: 351 I-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ W +E A +HV D PTL + A
Sbjct: 348 VNWPGKIEPG--TANGMMHVVDMYPTLAALAG 377
Score = 43.1 bits (100), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 15/106 (14%)
Query: 464 NGPKNENTNPRYENGTHEYNIPRLENSI---NGNG---------TSENRSNDNSYQNE-I 510
NGP E YE GT + I NG T + +N+ +
Sbjct: 327 NGPYREGKGTTYEGGTRVVGLVNWPGKIEPGTANGMMHVVDMYPTLAALAGAELGKNKPL 386
Query: 511 DGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSI 556
DG+D+W VL +PS R I++NI E +A+ +G WKLV + ++
Sbjct: 387 DGMDMWPVLGEGKPSPRTEIVYNI--EPMTAAVRQGDWKLVWKTTL 430
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 44/94 (46%), Gaps = 23/94 (24%)
Query: 523 EPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWS 582
EP N ++H +D ++AL G +N+ +DG+D+W
Sbjct: 355 EPGTANGMMHVVDMYPTLAALA-------------GAELGKNKP--------LDGMDMWP 393
Query: 583 VLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
VL +PS R I++NI E +A+ +G WKLV
Sbjct: 394 VLGEGKPSPRTEIVYNI--EPMTAAVRQGDWKLV 425
>gi|313236221|emb|CBY11544.1| unnamed protein product [Oikopleura dioica]
Length = 511
Score = 236 bits (602), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 128/336 (38%), Positives = 189/336 (56%), Gaps = 22/336 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+ I+ADDLG +DV HG D I TPN+D LA G++L NYY +C+P+R ++MTG++
Sbjct: 20 PNIVVIVADDLGKHDVSMHGAD-IYTPNLDMLARDGVLLNNYYVQPVCSPTRGSLMTGRY 78
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P G+QH L G GLPL E I+PQY+KE GY+T +VGKW LGF+K Y P RGF+
Sbjct: 79 PYRLGLQHENLVGYRPAGLPLDEYIMPQYMKECGYKTYMVGKWQLGFFKDNYLPWKRGFD 138
Query: 180 SHLGYWTGHQDYFDHSA-----EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
G G QDY+ + G D+R + P D KY ++ +A +
Sbjct: 139 EFFGQLLGGQDYYSRRKCLKLRNYGNLCGYDLRTEQGPVRDTSMKYQPFLYADKAREKFF 198
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK--FAAILHKLDESV 292
H+ +PL++Y+A + H PLQAP Y++++ + FA ++ LD +V
Sbjct: 199 AHNKTDPLYMYVAFQSVH-----RPLQAPQQYVSLYNDNVQMSNGQRVFAGMVTALDAAV 253
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G++V+ L+ + +N++I F +DNG S++ L+G KN W+GG+ G G
Sbjct: 254 GEIVDGLKDSGLWNNTLIYFSNDNG-------REDDSDFVLKGGKNDFWQGGITGYGFFS 306
Query: 353 SPLL--ESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
P + + RG HVSD LPT+LSAA +P
Sbjct: 307 GPAIPRKMRGKKMNDLTHVSDVLPTMLSAAKCHTLP 342
Score = 42.7 bits (99), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH L G GLPL E I+PQY+KE GY+T ++
Sbjct: 84 LQHENLVGYRPAGLPLDEYIMPQYMKECGYKTYMVG 119
>gi|323452769|gb|EGB08642.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 1517
Score = 236 bits (602), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 140/353 (39%), Positives = 194/353 (54%), Gaps = 44/353 (12%)
Query: 60 PHIIFILADDLGWNDVGFHGLD-QIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PH++F L DDLG++DVG+ D + TP+IDALA G+ L YY+ CTP+R A++TG
Sbjct: 71 PHVVFALMDDLGFDDVGYGNADGAVATPHIDALAKEGVTLSRYYSAFSCTPARGALLTGL 130
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
P G+QH ++ + GLP ILPQ+L +LGYR+ +VGKWHLG + E PT RGF
Sbjct: 131 SPHRLGLQHGQVFPEQPWGLPSKFSILPQHLAKLGYRSHLVGKWHLGHFSAERLPTARGF 190
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWG-------------------------LDMRRDLEP 212
+S G G Q Y H A + K+ G D+R + +
Sbjct: 191 DSFFGGLDGAQYYATHIDAMDCKLPGDVLYRGFEVGDYDSLKAVTAEHGCYFDLRENNDR 250
Query: 213 AWDLHGKYSTDVFTAEAVDIIHNHS-----TDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
DL G YST +F +A ++I HS ++PLFL L+ A H+ P+ AP+
Sbjct: 251 VEDLFGSYSTQLFGRKAEELIDAHSKRADAAEKPLFLLLSFNAVHA-----PVWAPEDTY 305
Query: 268 NIH---RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN 324
H ++ + R KFAA L +D++VG VV ALE+ RM +NSI++F SDNG
Sbjct: 306 ETHPDLLNVTNGNRRKFAAALRLVDDAVGNVVAALEEARMDANSILIFASDNGANPE--- 362
Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
+ SN PLRG K L+EGGVR + +P RG + HV+DW+PTLL
Sbjct: 363 -HGGSNAPLRGSKGYLFEGGVRVPAFVRAPKYLPRGATYDHPFHVTDWVPTLL 414
>gi|313242955|emb|CBY39683.1| unnamed protein product [Oikopleura dioica]
Length = 581
Score = 236 bits (601), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 131/336 (38%), Positives = 194/336 (57%), Gaps = 21/336 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PH+IF+L DDLG++D+G+ D I +PNIDALA + LK +Y CTPSR+A +TG++
Sbjct: 24 PHVIFVLVDDLGFDDLGYVNRDVI-SPNIDALAKDALHLKKHYVQPSCTPSRAAFLTGRY 82
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
I GMQ V+ E G+PL E +L + K+ GYRT + GKWHLGFY ++ P RGF+
Sbjct: 83 NIRMGMQSGVIRATEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQIRGFD 142
Query: 180 SHLGYWTGHQDYFDHSAEEMKMW--GLDMRRDLEPAWDLH--GKYSTDVFTAEAVDIIHN 235
G++ G QD++ H + +K + D+ D DLH G YST +F + ++ +
Sbjct: 143 RFYGFYLGSQDFYFHDSGRLKAYPGNGDVENDTILD-DLHTNGTYSTKLFVDDFINDLAK 201
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED---FKRSKFAAILHKLDESV 292
H PLF Y++ H PLQ + + + + D +R + + LD +
Sbjct: 202 HDPAVPLFNYVSFQDVHG-----PLQTVNKFKKHYENKTDTWTHERILISTKITTLDHHI 256
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
GK+V AL+++ +N+++VF SDNGG ASNWPLRG K T+++GG++ I
Sbjct: 257 GKMVTALKEKNYWNNTVLVFTSDNGGQPK----EGASNWPLRGSKGTIYDGGLKSRAFIA 312
Query: 353 SPLLESR--GIVAEQYVHVSDWLPTLLSAANKSDIP 386
SPLL++R G HVSDW PTLL+ + D+P
Sbjct: 313 SPLLQNRMKGQNYNYLFHVSDWFPTLLTLSG-CDVP 347
>gi|260788444|ref|XP_002589260.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
gi|229274435|gb|EEN45271.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
Length = 455
Score = 236 bits (601), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 122/260 (46%), Positives = 169/260 (65%), Gaps = 14/260 (5%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+FILADD GWND+G+H I TPN+D LA G+ L+NYY +C+PSR +MTG++
Sbjct: 1 PNIVFILADDYGWNDIGYHN-SFIRTPNLDRLASEGVKLENYYVQPICSPSREQLMTGRY 59
Query: 120 PIHTGMQHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
IH G+QH+V+ C+R GLPL E LPQ LKE GY T +VGKWHLGF++KEY P RGF
Sbjct: 60 QIHYGLQHSVIM-CDRPHGLPLDEVTLPQRLKENGYSTYMVGKWHLGFFRKEYMPLQRGF 118
Query: 179 ESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
E GY TG +DY+ H S + + GLD+R +P D +G YST +F +A+++
Sbjct: 119 ERFFGYLTGGEDYWTHRKPSQFSKDPSEFHGLDLRDQDKPVLDQNGTYSTHLFARKAIEM 178
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I NH +P+FLYL A H PL+AP+ Y I+ I++ +AA++ +DE+V
Sbjct: 179 ILNHDQSKPMFLYLPFQAVHG-----PLEAPEEYKRIYEDIDNSLVRTYAAMVTVMDEAV 233
Query: 293 GKVVEALEQRRMLSNSIIVF 312
G V AL+ R M N+I++F
Sbjct: 234 GNVTGALKHRGMWRNTILIF 253
>gi|168701793|ref|ZP_02734070.1| twin-arginine translocation pathway signal precursor [Gemmata
obscuriglobus UQM 2246]
Length = 459
Score = 236 bits (601), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 131/336 (38%), Positives = 189/336 (56%), Gaps = 15/336 (4%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L F L+ V + P ++F+LADDLG D GF G +I TPNID +A +G L +
Sbjct: 7 LLFLLAPASVARAEAPAQPSVVFLLADDLGREDCGFMGGKEIKTPNIDKIAAAGATLDAF 66
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y +C+P+R+A MTG++P+ G+Q V+ + GLPL E+ + Q LK+ GY T ++GK
Sbjct: 67 YAQPVCSPTRAAFMTGRYPMRHGLQVGVVRPWAQYGLPLDERTVAQGLKDAGYTTAVIGK 126
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
WHLG + EY PT RGF+ G++ G DYF H ++ G D RD + D YS
Sbjct: 127 WHLGHFAPEYLPTKRGFDHQYGHYNGALDYFTH----IRDGGFDWHRDDKVNSDE--GYS 180
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T + +AV + H+ +P FLY+ A H+ P Q P+ YL + H++ +R K+
Sbjct: 181 THLVAKDAVQFVQTHAGKKPFFLYVPFNAVHA-----PHQVPEEYLKPYGHLKG-ERQKY 234
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
A +L +DE+VG++V A+E+ + N++ VF+SDNGG G SN R K TL+
Sbjct: 235 AGMLAAMDEAVGQIVAAVEKAGVRKNTLFVFLSDNGGPQPGV---VTSNGKFRAGKGTLY 291
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
EGGVR A G + +HV DW PTLL
Sbjct: 292 EGGVRVAACAAFDGHIRPGTTITEPLHVVDWYPTLL 327
>gi|313215020|emb|CBY41206.1| unnamed protein product [Oikopleura dioica]
Length = 427
Score = 235 bits (600), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 128/336 (38%), Positives = 189/336 (56%), Gaps = 22/336 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+ I+ADDLG +DV HG D I TPN+D LA G++L NYY +C+P+R ++MTG++
Sbjct: 20 PNIVVIVADDLGKHDVSMHGAD-IYTPNLDMLARDGVLLNNYYVQPVCSPTRGSLMTGRY 78
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P G+QH L G GLPL E I+PQY+KE GY+T +VGKW LGF+K Y P RGF+
Sbjct: 79 PYRLGLQHENLVGYRPAGLPLDEYIMPQYMKECGYKTYMVGKWQLGFFKDNYLPWKRGFD 138
Query: 180 SHLGYWTGHQDYFDHSA-----EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
G G QDY+ + G D+R + P D KY ++ +A +
Sbjct: 139 EFFGQLLGGQDYYSRRKCLKLRNYGNLCGYDLRTEQGPVRDTSMKYQPFLYADKAREKFF 198
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK--FAAILHKLDESV 292
H+ +PL++Y+A + H PLQAP Y++++ + FA ++ LD +V
Sbjct: 199 AHNKTDPLYMYVAFQSVH-----RPLQAPQQYVSLYNDNVQMSNGQRVFAGMVTALDAAV 253
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G++V+ L+ + +N++I F +DNG S++ L+G KN W+GG+ G G
Sbjct: 254 GEIVDGLKDSGLWNNTLIYFSNDNG-------REDDSDFVLKGGKNDFWQGGITGYGFFS 306
Query: 353 SPLL--ESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
P + + RG HVSD LPT+LSAA +P
Sbjct: 307 GPAIPRKMRGKKMNDLTHVSDVLPTMLSAAKCHTLP 342
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH L G GLPL E I+PQY+KE GY+T ++
Sbjct: 84 LQHENLVGYRPAGLPLDEYIMPQYMKECGYKTYMVG 119
>gi|291236518|ref|XP_002738186.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 473
Score = 235 bits (600), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 137/354 (38%), Positives = 199/354 (56%), Gaps = 43/354 (12%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
M V +S+V DL + PHI+FILADDLGW+DVG+H +I TPNID LA
Sbjct: 1 MYVCVFLGILAISVVHADL--GTLRPHILFILADDLGWHDVGYHD-SEIQTPNIDMLAAE 57
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
G+ L+NYY LCTPSR+ +MTG++ IH+GMQH VL LP E +LPQ LK+ GY
Sbjct: 58 GVKLENYYVTPLCTPSRAVLMTGRYLIHSGMQHGVLVAQNPRCLPTDEILLPQMLKDSGY 117
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
T +VGKWHLGF K + TP RGF++ G+ ++A + + RR
Sbjct: 118 STHMVGKWHLGFCKFQCTPNHRGFDTFFGW---------YNAAVISPFISCRRR------ 162
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-I 273
+ + +H+T P+F+YL A H+ PLQ PD Y ++++ I
Sbjct: 163 ------------SAGCEFRVSHTT-VPMFIYLPFQAVHA-----PLQVPDKYSDMYKDTI 204
Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
D R +A + +DE++G + L+ + + +N++IVF +DNGGA ASN+PL
Sbjct: 205 HDESRRTYAGMTSCMDEAIGNITRTLKDKGIWNNTVIVFSTDNGGART----FGASNYPL 260
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKSDI 385
RG K + +EGG+RG + SPLL+ RG + + +++ DW PT ++ A I
Sbjct: 261 RGQKASNFEGGIRGPAFVSSPLLDPLVRGTINNELMYIGDWFPTFMNLAGGHVI 314
>gi|421593685|ref|ZP_16038213.1| sulfatase [Rhizobium sp. Pop5]
gi|403700318|gb|EJZ17522.1| sulfatase [Rhizobium sp. Pop5]
Length = 497
Score = 235 bits (599), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 142/336 (42%), Positives = 183/336 (54%), Gaps = 32/336 (9%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A P+I++I++DD GW DVG+HG D I TPNID LA G L+ +Y +CTPSR+A
Sbjct: 50 AQGAAPNIVYIISDDSGWKDVGYHGSD-IRTPNIDRLAAEGARLEQFYVQPMCTPSRAAF 108
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
MTG++P G+Q V+ GL L E LPQ LK+ GY T + GKWHLG K Y P
Sbjct: 109 MTGRYPFRYGLQTAVIPQSGTYGLALDEYPLPQVLKDAGYYTAMSGKWHLGHSKTAYWPR 168
Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK------YSTDVFTAE 228
RGF+S G G D+F H A + P W + K Y + AE
Sbjct: 169 QRGFDSFYGALLGEIDHFTHKAA-----------NGNPDWYRNNKALKEEGYDNILIGAE 217
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
AV +I+ H +PLFLYLA A H+ P QAP YL+ + HI D R K+AA++ +
Sbjct: 218 AVRVINKHDQQKPLFLYLAFTAPHT-----PYQAPKEYLDRNSHIADESRRKYAAMISVM 272
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN--------AASNWPLRGVKNTL 340
D+ +GKVV ALE+R M N++IVF SDNGG F A N P R K T
Sbjct: 273 DDEIGKVVAALEKRGMRDNTLIVFHSDNGGVVNAFFAGESEVKGKLPADNGPYREGKGTT 332
Query: 341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+EGG R AGL+ P G + + +HV D PTL
Sbjct: 333 YEGGTRVAGLVNWPGKIKPGTI-DGPMHVVDMYPTL 367
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 44/107 (41%), Gaps = 27/107 (25%)
Query: 464 NGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQ---------------- 507
NGP E YE GT R+ +N G + + D
Sbjct: 322 NGPYREGKGTTYEGGT------RVAGLVNWPGKIKPGTIDGPMHVVDMYPTLAGLVGATL 375
Query: 508 ---NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLV 551
+DG+D+W L+ +PS R I++NI E +A+ +G WKLV
Sbjct: 376 GKNKPLDGMDMWPTLAEGKPSPRTEIVYNI--EPMTAAVRQGDWKLV 420
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 28/42 (66%), Gaps = 2/42 (4%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
+DG+D+W L+ +PS R I++NI E +A+ +G WKLV
Sbjct: 381 LDGMDMWPTLAEGKPSPRTEIVYNI--EPMTAAVRQGDWKLV 420
>gi|354473078|ref|XP_003498763.1| PREDICTED: arylsulfatase B [Cricetulus griseus]
Length = 460
Score = 234 bits (597), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 116/269 (43%), Positives = 167/269 (62%), Gaps = 15/269 (5%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH ++ C+ +PL EK+LPQ LKE GY T +VGKWHLG +KE PT RGF++
Sbjct: 32 IHTGLQHLLIRVCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMCQKECLPTRRGFDT 91
Query: 181 HLGYWTGHQDYFDHSAEEM------KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+ GY G ++Y+ H + LD R EPA + YST +F+ A +I
Sbjct: 92 YFGYLLGSENYYTHEVCALIEPLNVTRCALDFRDGEEPATKYNNTYSTHIFSQRATTLIA 151
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
NH ++PLFLYLA + H +PLQ P+ Y+ + I+D R +A ++ +DE+VG
Sbjct: 152 NHPPEKPLFLYLALQSVH-----DPLQVPEEYMKPYDFIQDKHRRIYAGMVSLMDEAVGN 206
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
V AL+ + +N++ VF +DNGG L+ +NWPLRG K TLWEGG+RG G + SP
Sbjct: 207 VTAALKSHGLWNNTVFVFSTDNGGQ----TLSGGNNWPLRGRKWTLWEGGIRGVGFVASP 262
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKS 383
LL+ +G+ + + +H+SDWLPTL++ A S
Sbjct: 263 LLKQKGVKSRELIHISDWLPTLVNLAGGS 291
>gi|443706067|gb|ELU02328.1| hypothetical protein CAPTEDRAFT_179702 [Capitella teleta]
Length = 501
Score = 234 bits (596), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 129/346 (37%), Positives = 199/346 (57%), Gaps = 19/346 (5%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
LP+ L + F + +S P+ ++I ADDLG++D+G D + TP +D LA G+ K
Sbjct: 3 LPILIAL-VGFFQGIHTSKQPNFVWIFADDLGYHDIGLRNPD-LHTPTLDKLATKGVQFK 60
Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
N Y + C+PSR + M+G++P + MQ +V++ PL K LP+YLKELGY T V
Sbjct: 61 NNYVMHACSPSRHSFMSGRYPFTSQMQKDVIFPVSPDCSPLKLKFLPEYLKELGYGTHAV 120
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
GKWHLG+ +++ PT RGF+S G G DY H + W + D + G+
Sbjct: 121 GKWHLGYCREDCMPTSRGFDSFYGTLDGEGDYLTHMSAGFYDWHTNGTVDRSKS----GQ 176
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA--PDHYLNIHRHIEDFK 277
+S D+ TA DII + ++P FLY A + NP+ PLQA D ++++ D
Sbjct: 177 HSQDLHTAALADIIERQTEEKPFFLYFA-----AQNPHTPLQAKKKDMKKYKNQNLSDTL 231
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
+ K+ ++ LD V + + LE++ +L+N+ I+F++DNG ++ SN P RG K
Sbjct: 232 K-KYYGLITGLDGMVKTIKDTLEKKNLLNNTYIIFLADNGA-----DVGIGSNEPFRGTK 285
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+TLWEGGV+ I+SPLL+ G + +H++DW+PTL+S A S
Sbjct: 286 STLWEGGVKSLSFIYSPLLKKTGYENDGLIHITDWMPTLVSLAGGS 331
>gi|344250866|gb|EGW06970.1| Arylsulfatase I [Cricetulus griseus]
Length = 484
Score = 234 bits (596), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 119/264 (45%), Positives = 172/264 (65%), Gaps = 7/264 (2%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
A S PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 43 AAPSQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQ 101
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGF++KE P
Sbjct: 102 LLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKECLP 161
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
+ RGF++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I
Sbjct: 162 SCRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGETVAWGLSGQYSTMLYAQRASHI 221
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
+ +HS PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V
Sbjct: 222 LASHSPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAV 276
Query: 293 GKVVEALEQRRMLSNSIIVFVSDN 316
+ AL++ +NS+I+F SDN
Sbjct: 277 RNITWALKRYGFYNNSVIIFSSDN 300
>gi|323454531|gb|EGB10401.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 530
Score = 234 bits (596), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 132/347 (38%), Positives = 193/347 (55%), Gaps = 29/347 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIP-TPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PHI+++L DD+GWND+G+ D TP + LA +G+ L YYT CT SR+A+++G
Sbjct: 27 PHIVYVLFDDMGWNDIGYQSTDMAALTPVLSDLAENGVKLTQYYTQSTCTVSRAALLSGV 86
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
P+H G+ H + GLPL K+LPQYL+E GYRT +VGKW +G + +EY P RGF
Sbjct: 87 LPMHNGISHGTIVMDSPIGLPLKYKLLPQYLQESGYRTYMVGKWDIGHFNEEYLPHNRGF 146
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWG----LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+ G++ YF H + D+R + E + +Y+TD+F AV +
Sbjct: 147 DHFFGFYGADITYFSHISSRGYCANPNCFPDLRNEDETMANASMRYTTDLFRERAVGFVE 206
Query: 235 ----NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL-NIHRHIEDF---KRSKFAAILH 286
NH+TD PLFLYL+ A P+ P AP ++ N + F +R FAA+++
Sbjct: 207 GHAANHATD-PLFLYLSFNA-----PHYPTSAPQEFMRNEAELLAPFTNRERRVFAAVVN 260
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL-------NAASNWPLRGVKNT 339
D SVG+VVEAL+ ++S+I+F SDNG N N SNWPLRG+K T
Sbjct: 261 YADRSVGRVVEALKATGAYNDSVILFASDNGAVPTTCNQSHCTSESNTGSNWPLRGMKAT 320
Query: 340 LWEGGVRGAGLIWSPLL---ESRGIVAEQYVHVSDWLPTLLSAANKS 383
WEGG R + +P + G + + VH++DW+PT++ +S
Sbjct: 321 YWEGGCRVPAFVHAPKYLGDRASGSLYQGIVHITDWIPTIVRGVLRS 367
>gi|313219945|emb|CBY43646.1| unnamed protein product [Oikopleura dioica]
Length = 1683
Score = 233 bits (595), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 113/283 (39%), Positives = 172/283 (60%), Gaps = 17/283 (6%)
Query: 98 LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
L++YY +CTP+RS +M+G++ IHTG+QH V+ G + GLPL++ + P+ K+ GYRT
Sbjct: 1233 LQSYYVNPICTPTRSVLMSGRYQIHTGLQHAVILGAQPNGLPLTDPVQPEIFKDCGYRTH 1292
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH 217
+VGKWHLGFY ++ P RGFESH GY G + +++HS G+D R
Sbjct: 1293 MVGKWHLGFYDEKMVPENRGFESHYGYLIGAEGHYNHSQFMQGQHGVDFRDGGASTNSSW 1352
Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
G+YS D+F D++ H +E L++Y+ H PL+AP HY++ I+D
Sbjct: 1353 GQYSADLFVKRVEDLVEAHDVEESLYMYVGLQNVHY-----PLEAPQHYVDQFSWIKDRD 1407
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
R +AA+ +D++VG+++ + + + + ++I+ F +DNGG+ L +NWPLRG+K
Sbjct: 1408 RRVYAAMTKSMDDTVGRIINSFKNKGIWEDTIVYFTTDNGGST----LYGGNNWPLRGLK 1463
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVA---EQYVHVSDWLPTLL 377
NTLWEGG+RG G I P G+ A +Q +HV D PTLL
Sbjct: 1464 NTLWEGGIRGIGSIKIP-----GVSADKRDQLMHVVDMFPTLL 1501
>gi|322778941|gb|EFZ09355.1| hypothetical protein SINV_05168 [Solenopsis invicta]
Length = 775
Score = 233 bits (594), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 118/272 (43%), Positives = 179/272 (65%), Gaps = 14/272 (5%)
Query: 124 GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG 183
GMQ L G ER G+PL+ +LP+YL+ LGY T +VGKWH+G++ K + PT RGF++ LG
Sbjct: 300 GMQGYPLRGAERRGIPLNNTLLPEYLRRLGYTTHLVGKWHVGYHTKNFGPTRRGFDTFLG 359
Query: 184 YWTGHQDYFDHSAEEMKMWGLDMRR------DLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
Y+TG YF+H+ E G D+ R +E +D Y TD+ T E II +H+
Sbjct: 360 YYTGMIQYFNHTLYESGQLGYDLHRIVGENHTVEYRYD----YMTDLLTDEVESIISSHN 415
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPD--HYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
T++P++L L+H A H+++ E ++ D + +I+D R K+A+++ LD+SVG+V
Sbjct: 416 TEKPMYLQLSHLAPHASDAEEVMEVRDWKETNDTFGYIKDLNRRKYASVVATLDKSVGRV 475
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
V+AL+++ +L+N+II+F++DNG G+ N SN+PLRGVK TL+EG VRG I+SPL
Sbjct: 476 VDALKRKDILNNTIIIFMADNGAQTEGYLANYGSNYPLRGVKFTLFEGAVRGVACIYSPL 535
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA--NKSDI 385
++ V+ + H++DWLPTL SAA N SD+
Sbjct: 536 IDRSSRVSTKLFHITDWLPTLYSAAGGNPSDL 567
Score = 219 bits (557), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 113/257 (43%), Positives = 159/257 (61%), Gaps = 9/257 (3%)
Query: 125 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY 184
MQ + G E GLPL+ KILP++L+ LGY T ++GKWHLG++ ++TP++RGF+ G+
Sbjct: 1 MQGEDIQGGEPRGLPLNVKILPEHLQGLGYTTNLIGKWHLGYHTLQHTPSYRGFDYFCGF 60
Query: 185 WTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFL 244
+ H Y D+ M G DM PA+ L+ KY TD+FT +AV II NH + PL+L
Sbjct: 61 YNSHVSYHDYKYSYQNMSGYDMHCGDAPAYGLNDKYVTDLFTDKAVKIIENHDSFRPLYL 120
Query: 245 YLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRM 304
++H A H+ P E Q DH HI + R K+A ++ KLD+SVG++V L R M
Sbjct: 121 QISHLAVHA--PLENPQDYDHSDRRFIHIREQHRRKYARMVSKLDDSVGRIVHTLGNRGM 178
Query: 305 LSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAE 364
L +S+I+F++DNG A G N SN+PLRG GGVRG ++WSP L V +
Sbjct: 179 LRDSLILFLTDNGAAPIGKFRNFGSNYPLRG-------GGVRGVSVLWSPRLNKPARVCD 231
Query: 365 QYVHVSDWLPTLLSAAN 381
+H+SDWLPTL +AA+
Sbjct: 232 NLIHISDWLPTLYAAAD 248
Score = 43.1 bits (100), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 66/138 (47%), Gaps = 19/138 (13%)
Query: 573 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGP 632
+++DGI+ WS + + + +R +IL NI ++ + + + GK+KLV+ + D G
Sbjct: 568 DQLDGINQWSTIKKVKIDERRSILVNIQEDNEAALM--GKYKLVRDRSDNQKHYDYYSGN 625
Query: 633 DQVY----------------LSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPC 676
+ Y ++ +S A ++ LR A+I C + P + C
Sbjct: 626 NVSYPKYNVTNVLTSPAASAIASVSAHVLTAAKIKALRKKATIVCKNLTN-PSSCKNRTC 684
Query: 677 LFDIKNDPCEKNNLADRS 694
LF+I DPCE +L+ ++
Sbjct: 685 LFNIYKDPCEFTDLSSKN 702
Score = 40.0 bits (92), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQ L G ER G+PL+ +LP+YL+ LGY T ++
Sbjct: 301 MQGYPLRGAERRGIPLNNTLLPEYLRRLGYTTHLVG 336
>gi|298706923|emb|CBJ29750.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 706
Score = 233 bits (594), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 136/382 (35%), Positives = 208/382 (54%), Gaps = 61/382 (15%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLD-QIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
A + P++ FI+ DD+GWND+G+ +D Q TP++D LA G+ + NYYT+ +CTP+R++
Sbjct: 126 AETAKPNVFFIMVDDMGWNDIGYQSVDLQGVTPHLDRLAAGGVKMTNYYTMSICTPARAS 185
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+MTG++ + G+Q++V+ GLPL+EKI P+Y+K+ GY T ++GKWH+G Y + P
Sbjct: 186 LMTGRYVMRYGLQYSVIQPGAPWGLPLTEKIFPEYMKDAGYETHMIGKWHIGSYTSRHIP 245
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGL----------------------DMRRD-- 209
+ RGF+++LGY + Y+ H + K G D RD
Sbjct: 246 SQRGFDTYLGYLNDEEMYWTHQSWTAKYNGRKFFDFGFGNATGYYDVIDREVPDQLRDGT 305
Query: 210 -----------------LEPAWDLHGKYSTDVFTAEAVDIIHNHS--TDEPLFLYLAHAA 250
G+YST VF A++IIH + DEPL++YLAH A
Sbjct: 306 ISDGDDDETMLSATSSYSSSGGFFKGRYSTQVFHNRALEIIHEKTPYDDEPLYMYLAHQA 365
Query: 251 THSANPYEPLQ--APDHYLNIHRHIE-------DFKRSKFAAILHKLDESVGKVVEALEQ 301
H P + PD L++ IE + R++FA +L LD ++G++VE LE
Sbjct: 366 VHDPLGLPPAEHFTPDQ-LDLLERIETGSEEGTEALRARFAKVLMYLDYTIGELVEYLET 424
Query: 302 RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP--LLESR 359
+ NSII+ SDNGG + SN+PLRG+K++ WEGG + ++S + E R
Sbjct: 425 NGWMENSIIIVASDNGGCPS----CGGSNYPLRGIKHSYWEGGTKVPSFVYSKSHIPEER 480
Query: 360 -GIVAEQYVHVSDWLPTLLSAA 380
G + +HV+DWLPT+ AA
Sbjct: 481 WGTEYDGLMHVTDWLPTIAGAA 502
>gi|291241212|ref|XP_002740506.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 534
Score = 233 bits (593), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 142/345 (41%), Positives = 189/345 (54%), Gaps = 39/345 (11%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+F+LADDLGW+DVG+H I TPNID LA G+ L+NYY C PSR +MTG++
Sbjct: 69 PNIVFVLADDLGWHDVGYHD-SVIKTPNIDQLAAEGVKLENYYVSSWCAPSRVNLMTGRY 127
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL-GFYKKEYTPTFRGF 178
I TG+ +V C+ G + E L L E GY T +VGKWHL GF E PT RGF
Sbjct: 128 RIRTGLYGDV---CDFMG--IHETTLADKLYEAGYYTAMVGKWHLSGFEHAECYPTHRGF 182
Query: 179 ESHLGYWTGHQDYFDHSA----EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
++ LGY G Q+YF H W D A G+YST +F EA II
Sbjct: 183 QTFLGYHGGSQNYFTHRRGGPHAPYDFWA----NDTSIAVKYEGQYSTMIFADEAQRIIR 238
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFAAILHKLDESVG 293
H+T +PLFLYL+ A H PL P Y + +R IED KR +AA+ +DE++G
Sbjct: 239 QHNTKQPLFLYLSFQAVHV-----PLLVPPSYEDQYRSLIEDDKRRVYAAMASCMDEAIG 293
Query: 294 KVVEALEQRRMLSNSIIVFVS-----------DNGGAAAGFNLNAASNWPLRGVKNTLWE 342
V L+++ + +N++ +F S DNG + SNWPLRG K +E
Sbjct: 294 NVTATLKEKGLWNNTVFIFSSARLNFKHDFCLDNGATPT----DGGSNWPLRGRKGLWYE 349
Query: 343 GGVRGAGLIWSPLLESR--GIVAEQYVHVSDWLPTLLSAANKSDI 385
GG+R G + S LL G + +HV+DW PTLL A K+D+
Sbjct: 350 GGIRAVGFVNSHLLSDHVTGTTNHELIHVTDWFPTLLRLA-KADL 393
>gi|440894324|gb|ELR46807.1| Arylsulfatase J, partial [Bos grunniens mutus]
Length = 466
Score = 232 bits (592), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 117/265 (44%), Positives = 163/265 (61%), Gaps = 11/265 (4%)
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
++ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE PT RG
Sbjct: 1 RYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRG 60
Query: 178 FESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHN 235
F++ G G DY+ H + M G D+ + AWD +G YST ++T I+ +
Sbjct: 61 FDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGVYSTQMYTQRVQQILAS 120
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
H +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++ V
Sbjct: 121 HDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEAINNV 175
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G + SPL
Sbjct: 176 TLALKMYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAIGFVHSPL 231
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
L+++G V ++ VH++DW PTL+S A
Sbjct: 232 LKNKGTVCKELVHITDWYPTLISLA 256
>gi|346472067|gb|AEO35878.1| hypothetical protein [Amblyomma maculatum]
Length = 514
Score = 232 bits (592), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 122/315 (38%), Positives = 176/315 (55%), Gaps = 14/315 (4%)
Query: 71 GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
GWNDV +H +++ +P ++ LA G+IL +Y + CTP+R+A+MTG++P G+Q + +
Sbjct: 36 GWNDVSWHN-ERMESPILEQLAKEGVILDQHYALPTCTPTRAALMTGRYPYKLGIQSHGI 94
Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
E GLPL L + LK GY T GKWHLG+ + TPT RGF++ G++ G QD
Sbjct: 95 RTLEPNGLPLGVTTLAEELKRTGYTTHAFGKWHLGYCNQSLTPTRRGFDTFRGFYVGGQD 154
Query: 191 YFDHSAEEMKMW----GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYL 246
YF H+ K G D R E + G Y+T + + I D+P+FLY+
Sbjct: 155 YFSHTLSGGKTSATAKGYDYRNGDEVDYSAKGVYTTTLIANHVLSAIEESQPDKPMFLYV 214
Query: 247 AHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLS 306
A A H+ PLQ P Y + + KR +L +D +VG +V+ L+ + M
Sbjct: 215 AFQAVHA-----PLQVPTQYRKMCSIYRNPKRKLLCEMLAVMDHAVGLIVKKLKMKDMWR 269
Query: 307 NSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
N+++VF+SDNGG L +NWPLRG KNTL+EGG R + PL+ + G +
Sbjct: 270 NTLLVFISDNGGQI----LYGGNNWPLRGNKNTLFEGGTRVPAFVAGPLIRNGGRNSSSI 325
Query: 367 VHVSDWLPTLLSAAN 381
+HV DW PTLLSAA
Sbjct: 326 IHVVDWFPTLLSAAG 340
>gi|350587859|ref|XP_003129285.2| PREDICTED: arylsulfatase J [Sus scrofa]
Length = 539
Score = 231 bits (590), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 117/262 (44%), Positives = 160/262 (61%), Gaps = 11/262 (4%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 77 IHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDT 136
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHST 238
G G DY+ H + M G D+ + AWD +G YST ++T I+ +H
Sbjct: 137 FFGSLLGSGDYYTHYKCDSPGMCGYDLYENENAAWDYDNGIYSTQMYTQRVQQILASHDP 196
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++ V A
Sbjct: 197 KRPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLA 251
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L+ +NSII++ SDNGG SNWPLRG K T WEGG+R G + SPLL++
Sbjct: 252 LKMYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 307
Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
+G V ++ VH++DW PTL+S A
Sbjct: 308 KGTVCKELVHITDWYPTLISLA 329
>gi|443705385|gb|ELU01963.1| hypothetical protein CAPTEDRAFT_143986, partial [Capitella teleta]
Length = 345
Score = 231 bits (590), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 130/324 (40%), Positives = 185/324 (57%), Gaps = 20/324 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+ +FI+ADDLG++D+G D I TPN+DALA G+IL N Y LCTPSR A+M+G++
Sbjct: 26 PNFVFIMADDLGYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCTPSRHALMSGRY 84
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P + MQ V+ E L K LPQYLKELGY+ +VGKWHLG+ + E PT RGF+
Sbjct: 85 PSASAMQSMVIQPMEARCAGLEYKFLPQYLKELGYKNHMVGKWHLGYCRDECLPTSRGFD 144
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD-IIHNHST 238
+ G + G DY+ H W ++ E +G +S D+ E +D + H +
Sbjct: 145 TFYGLYAGAGDYWSHEIFGKYDWHINGDIHFEA----NGTHSQDL-EMEGLDKVFDEHDS 199
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH--IEDFKRSKFAAILHKLDESVGKVV 296
+PLFLY A NP+ P Q D +LN++ +D +R K+ + LD VGK+V
Sbjct: 200 KDPLFLYFA-----PQNPHTPSQPTDEFLNLYPEDKFDDIRR-KYLGLTSGLDAMVGKIV 253
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
E LE ++ N+ ++FVSDNG N P RG K +L+EGG + ++SPLL
Sbjct: 254 EKLENNGLMDNTYVIFVSDNGADPP-----EGLNTPFRGGKGSLFEGGTKSNSFVYSPLL 308
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA 380
G + +H++DWLPTL+ A
Sbjct: 309 NKTGYENDGLMHITDWLPTLVKLA 332
>gi|87306948|ref|ZP_01089094.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
gi|87290321|gb|EAQ82209.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
Length = 455
Score = 231 bits (589), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 133/348 (38%), Positives = 195/348 (56%), Gaps = 22/348 (6%)
Query: 39 VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
+L +A TL+ V + P+I+F+LADDLG DV + G I TP +DALA SG L
Sbjct: 8 ILTIALTLASVATTFATDAPRPNIVFLLADDLGGADVSWRG-SPIKTPQLDALANSGAKL 66
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
+ +Y +C+P+RSA++TG++P+ G+Q V+ GLPL E+ L + L++ GY T I
Sbjct: 67 EQFYVQPVCSPTRSALLTGRYPMRYGLQVGVVRPWADYGLPLDERTLAEALQDAGYETAI 126
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD--MRRDLEPAWDL 216
VGKWHLG Y P RGF+ G++ G DYF H + W D + RD
Sbjct: 127 VGKWHLGHVSPAYLPMARGFDHQYGHYNGALDYFTHDRDGGHDWHKDDHVNRD------- 179
Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
Y+T + EAV +I + +PLFLY+ A HS PLQ P+ Y + ++
Sbjct: 180 -EGYATHLIAQEAVRVIQDRDKKKPLFLYVPFNAVHS-----PLQVPESYAAPYGDMKK- 232
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
+R +A ++ LDE+VG++V+ ++++ ML N++ +F SDNGG G N PLRG
Sbjct: 233 RRQAYAGMVAALDEAVGQIVDEIQRQEMLDNTLFIFSSDNGGPEPG---KLTDNGPLRGG 289
Query: 337 KNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
K+TL+EGGVR W + + G E +H+ DW PTL+ A S
Sbjct: 290 KHTLYEGGVRVCAFASWKGRI-APGSKVEAPLHIVDWYPTLIELAGGS 336
>gi|410446533|ref|ZP_11300636.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
gi|409980205|gb|EKO36956.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
Length = 517
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 139/348 (39%), Positives = 201/348 (57%), Gaps = 29/348 (8%)
Query: 48 MVFVDLVASSG---PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
++F+ L S+ P++I IL DDLGW DV +HG IPTPNIDALA +G+ L +Y
Sbjct: 4 LIFLILFVSASIYSKPNVIIILTDDLGWGDVSYHG-GHIPTPNIDALAKNGVELNRFYAS 62
Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLY-GCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
+C+P+R++++TG H + G+ + E+ GLP+ KI+PQ+ KE GY+T + GKWH
Sbjct: 63 PVCSPTRASLLTGLHIFNHGIIRPLANPTAEQYGLPVDLKIMPQFFKEAGYQTALSGKWH 122
Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD 223
LG + +EY PT RGF+ G+ G YFDH LD R+ EP ++ YST+
Sbjct: 123 LGMHLEEYWPTNRGFDQSYGHMLGGIGYFDHVHSSR----LDWHRNEEPLFE--DGYSTE 176
Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
+ EAV II + PLFLY+A A H+ P+QAPD + + +IED +AA
Sbjct: 177 LIANEAVRIIETKDPNRPLFLYVAFNAPHT-----PIQAPDKNIELFSYIEDPLDRAYAA 231
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA----------AAGFNLNAASNWPL 333
++ LD +GK++ A+E + +L +IIVF SDNG A G S L
Sbjct: 232 NVNALDAEIGKIIYAIEDQGILEETIIVFFSDNGPVFDINPIIAVLAPGLVEARGSTAGL 291
Query: 334 RGVKNTLWEGGVR-GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G K + EGG+R A ++W +L++ ++QY+ V D LPTLLSAA
Sbjct: 292 PGSKTSAREGGIRVPASIMWKGVLDNSK--SDQYIFVQDLLPTLLSAA 337
>gi|327291280|ref|XP_003230349.1| PREDICTED: arylsulfatase J-like, partial [Anolis carolinensis]
Length = 471
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 114/270 (42%), Positives = 169/270 (62%), Gaps = 11/270 (4%)
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
+++ ++ IHTG+QH+V+ + LPL LPQ LKE GY T +VGKWHLGFY+KE
Sbjct: 2 VLVSSRYQIHTGLQHSVIRPTQPNCLPLDNATLPQKLKEAGYSTHMVGKWHLGFYRKECM 61
Query: 173 PTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAV 230
PT RGF++ G G DY+ H + +M G D+ + AWD +G YST ++T +
Sbjct: 62 PTQRGFDTFFGSLLGSGDYYTHYKCDSPRMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQ 121
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE
Sbjct: 122 QILASHNPRKPIFLYIAYQAVHS-----PLQAPGTYYERYRSINNINRRRYAAMLACLDE 176
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
++ V AL++ +NS+I++ SDNGG +NWPLRG K T WEGG+R G
Sbjct: 177 AINNVTLALKRYGYYNNSVIIYSSDNGGQPTA----GGNNWPLRGSKGTYWEGGIRAVGF 232
Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+ SPLL+++G V ++ +H++DW PTL++ A
Sbjct: 233 VHSPLLKNKGCVCKELIHITDWFPTLITLA 262
>gi|348564635|ref|XP_003468110.1| PREDICTED: arylsulfatase J-like [Cavia porcellus]
Length = 543
Score = 230 bits (587), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 125/295 (42%), Positives = 180/295 (61%), Gaps = 15/295 (5%)
Query: 91 LAYSGIIL-KNYYTVQLCTPS--RSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQ 147
+A +GI L +++ ++ PS R +++ + IHTG+QH+++ + LPL LPQ
Sbjct: 48 MASAGITLFRSWLSLASSHPSDVRCGLLS-LYQIHTGLQHSIIRPTQPNCLPLDNATLPQ 106
Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDM 206
LKE+GY T +VGKWHLGFYKKE PT RGF++ G G DY+ H + M G D+
Sbjct: 107 KLKEVGYSTHMVGKWHLGFYKKECMPTKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDL 166
Query: 207 RRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
+ AWD +G YST ++T I+ +H+ +P+FLY+A+ A HS PLQAP
Sbjct: 167 YENDNAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGR 221
Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
Y +R I + R ++AA+L LDE++ V AL+ +NSII++ SDNGG
Sbjct: 222 YFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKAYGFYNNSIIIYSSDNGGQPTA--- 278
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SNWPLRG K T WEGG+R G + SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 279 -GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA 332
>gi|345326826|ref|XP_003431088.1| PREDICTED: arylsulfatase J [Ornithorhynchus anatinus]
Length = 504
Score = 230 bits (587), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 117/276 (42%), Positives = 165/276 (59%), Gaps = 11/276 (3%)
Query: 107 CTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
C + + K+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGF
Sbjct: 24 CPQNEGSCFQQKYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGF 83
Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDV 224
Y+KE PT RGF++ G G DY+ H + M G D+ + + AW+ +G YST +
Sbjct: 84 YRKECMPTQRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDDAAWEYDNGIYSTQM 143
Query: 225 FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAI 284
+T I+ +H PLFLY+A+ A HS PLQAP Y +R I + R ++AA+
Sbjct: 144 YTQRVQQILASHDPRRPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAM 198
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
L LDE++ V AL++ NSI+++ SDNGG SNWPLRG K T WEGG
Sbjct: 199 LSCLDEAIHNVTLALKRYGFYDNSILIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGG 254
Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+R G + SPLL+++G V + VH++DW PTL++ A
Sbjct: 255 IRAVGFVHSPLLKNKGTVCRELVHITDWYPTLITLA 290
>gi|313212736|emb|CBY36668.1| unnamed protein product [Oikopleura dioica]
Length = 602
Score = 230 bits (587), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 125/335 (37%), Positives = 194/335 (57%), Gaps = 19/335 (5%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PH+IF+L DDLG++D+G+ D I +PNIDALA + LK +Y CTPSR+A +TG++
Sbjct: 45 PHVIFVLVDDLGFDDLGYVNRDVI-SPNIDALAKDALHLKKHYVQPSCTPSRAAFLTGRY 103
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
I GMQ V+ E G+PL E +L + K+ GYRT + GKWHLGFY ++ P RGF+
Sbjct: 104 NIRMGMQSGVIRAPEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQNRGFD 163
Query: 180 SHLGYWTGHQDYFDHSAEEMKMW--GLDMRRD-LEPAWDLHGKYSTDVFTAEAVDIIHNH 236
G++ G QD++ H + ++ + D+ D + + +G YST +F + ++ + H
Sbjct: 164 RFYGFYLGSQDFYFHDSGRLEAYPGNGDVENDTILDDFHTNGTYSTKLFVDDFINDLAKH 223
Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED---FKRSKFAAILHKLDESVG 293
PLF Y++ H PLQ + + + + D +R + + LD +G
Sbjct: 224 DPAVPLFNYVSFQDVHG-----PLQTVNKFKKHYENKTDTWTHERILISTKITTLDHHIG 278
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
++V AL+++ +N+++VF SDNGG ASNWPLRG K T+++GG++ + S
Sbjct: 279 RMVTALKEKNYWNNTVLVFTSDNGGQPN----EGASNWPLRGSKGTIYDGGLKSRAFVAS 334
Query: 354 PLLESR--GIVAEQYVHVSDWLPTLLSAANKSDIP 386
PLL++R G HVSDW PTLL+ + D+P
Sbjct: 335 PLLQNRMKGQNYNYLFHVSDWFPTLLTLSG-CDVP 368
>gi|241789348|ref|XP_002400616.1| sulfatase, putative [Ixodes scapularis]
gi|215510801|gb|EEC20254.1| sulfatase, putative [Ixodes scapularis]
Length = 224
Score = 230 bits (586), Expect = 3e-57, Method: Composition-based stats.
Identities = 111/225 (49%), Positives = 149/225 (66%), Gaps = 1/225 (0%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PP+IIF+LADDLGW DV F G QIPTPN+D LA GIIL NYY + LCTPSR A+M+G
Sbjct: 1 PPNIIFLLADDLGWADVSFRGDPQIPTPNLDVLASQGIILNNYYVLHLCTPSRGALMSGL 60
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+PIHTG+QH V E GLPL+ I+P++LK LGY T ++GKW+LG+YK+ YTPT RGF
Sbjct: 61 YPIHTGLQHYVQLPAEPHGLPLNVTIMPEHLKNLGYTTHMIGKWNLGYYKESYTPTRRGF 120
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+S G+ G +DY+DH+ + GLD P + YSTD+FT +A+ +I +H
Sbjct: 121 DSFYGFLNGGEDYYDHTILFVST-GLDFWDGTTPVRNQSHHYSTDLFTKKALALIKDHDQ 179
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
+P+FLY +H A HS + L+AP + +I++ RS A
Sbjct: 180 AKPMFLYFSHQAVHSGDFKVELEAPASAIAHFPYIKEQNRSIIAG 224
>gi|431918040|gb|ELK17268.1| Arylsulfatase I [Pteropus alecto]
Length = 547
Score = 229 bits (585), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 126/324 (38%), Positives = 185/324 (57%), Gaps = 36/324 (11%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 50 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 108
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + + RGF++
Sbjct: 109 IHTGLQHSIIRPRQPNCLPLDQSXXXXXXR-------------------------RGFDT 143
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ I+ +HS
Sbjct: 144 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASHSPR 203
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
+PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 204 QPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 258
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 259 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 314
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ VH++DW PTL+ A +
Sbjct: 315 RRTSRALVHITDWYPTLVGLAGGT 338
>gi|390336076|ref|XP_784530.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 545
Score = 229 bits (584), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 140/393 (35%), Positives = 201/393 (51%), Gaps = 63/393 (16%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPP 60
+QH V+ + LP+ E LPQ LKE GY T ++ G
Sbjct: 42 LQHGVIRHAQPNCLPMDEVTLPQKLKENGYTTNMV----------------------GKW 79
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HI F L D + GF + I L + L LC +
Sbjct: 80 HIGFYL-DACTPTERGFDSFFDV----IMVLWSRELCLVYLLRASLC-----------YW 123
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH V+ + LP+ E LPQ LKE GY T +VGKWH+GFY TPT RGF+S
Sbjct: 124 IHTGLQHGVIRHAQPNCLPVDEVTLPQKLKENGYTTNMVGKWHIGFYLDACTPTERGFDS 183
Query: 181 HLGYWTGHQDYF------------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
G+ G +D++ D S + + G D+ + P +D G+YST +FT +
Sbjct: 184 FFGFLIGAEDHYTRMLSCEAYNANDGSTKTLS--GHDLHANKTPVFDYEGQYSTHLFTNK 241
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
VD+I H +PLF+YLA+ A H+ PLQ PD Y++ +++I D R +A ++ +
Sbjct: 242 TVDVIERHDKTKPLFIYLAYQAVHA-----PLQVPDSYMDPYKNIADKTRRTYAGMVSCM 296
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
DE +G V +AL+ + N+II+F +DNGG +NWPLRG K ++WEGG+ G
Sbjct: 297 DEGIGNVTQALKDAGLYDNTIIIFSTDNGGHID----RGGNNWPLRGSKGSMWEGGIHGV 352
Query: 349 GLIWSPLLES--RGIVAEQYVHVSDWLPTLLSA 379
G + SPLL +G V + +HVSDWLPT+++
Sbjct: 353 GFVHSPLLPDAVKGTVNHELMHVSDWLPTIVAG 385
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 49/82 (59%), Gaps = 3/82 (3%)
Query: 104 VQLCTPSRSAIMT---GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
Q C + + + T G + IHTG+QH V+ + LP+ E LPQ LKE GY T +VG
Sbjct: 18 AQRCILAAAIVGTPDFGHNMIHTGLQHGVIRHAQPNCLPMDEVTLPQKLKENGYTTNMVG 77
Query: 161 KWHLGFYKKEYTPTFRGFESHL 182
KWH+GFY TPT RGF+S
Sbjct: 78 KWHIGFYLDACTPTERGFDSFF 99
>gi|308512479|ref|XP_003118422.1| CRE-SUL-3 protein [Caenorhabditis remanei]
gi|308239068|gb|EFO83020.1| CRE-SUL-3 protein [Caenorhabditis remanei]
Length = 500
Score = 229 bits (584), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 135/375 (36%), Positives = 208/375 (55%), Gaps = 36/375 (9%)
Query: 31 RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
RT + F +L L + VD + P+++FI+ADDLG+ND+ + + TPN+
Sbjct: 3 RTTLPFFLLLVLQ-HYGIADVDGQTLTQKPNVLFIMADDLGFNDLDWKD-STLHTPNLRN 60
Query: 91 LAY--SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQY 148
LA+ + +L N Y QLCTP+RSA MTG +P G Q+ V E G+P L +
Sbjct: 61 LAFHKNTALLTNSYVNQLCTPTRSAFMTGYYPFRVGTQNGVFLHMEPAGVPTMFPFLSEN 120
Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE----EMK--MW 202
+++L Y T +VGKWHLG+ KKE+ PT RGF+ G++ YF+HSA+ E+K +
Sbjct: 121 MRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQTGYFNHSADQYHRELKRVVK 180
Query: 203 GLDMRRDL-----EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS---- 253
GLD+ ++ P + +G YSTD+FT A+ ++ NH+T +P F++L++ A H
Sbjct: 181 GLDLFEEVGNGKSVPDFSQNGVYSTDLFTDVAMSVLDNHNTTKPFFMFLSYQAVHPPLQI 240
Query: 254 ----------ANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRR 303
A +E ++ D Y + ++D R + +L +D ++G++VE L+
Sbjct: 241 KSEKKLTPKIATHFEVPRSFDKYCS---RVKDRYRRIYCGMLTAMDFAIGRLVEYLKASN 297
Query: 304 MLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVA 363
+ N++IVF SDNGG + ASN PLRG K+T+WEGG + + SP+ G
Sbjct: 298 LYENTVIVFTSDNGGTSN----FGASNAPLRGEKDTIWEGGTKTTTFVHSPMYIEEGGTR 353
Query: 364 EQYVHVSDWLPTLLS 378
E HV DW T+LS
Sbjct: 354 EMMFHVVDWHATILS 368
>gi|395545568|ref|XP_003774672.1| PREDICTED: arylsulfatase J-like, partial [Sarcophilus harrisii]
Length = 466
Score = 229 bits (584), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 115/262 (43%), Positives = 164/262 (62%), Gaps = 11/262 (4%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 3 IHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDT 62
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHST 238
G G DY+ H + M G D+ + AWD +G YST ++T + I+ +H+
Sbjct: 63 FFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNP 122
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
+P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++ V A
Sbjct: 123 RKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAINNVTLA 177
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L++ NSII++ SDNGG + SNWPLRG K T WEGG+R G + SPLL++
Sbjct: 178 LKRYGFYDNSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 233
Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
+G + ++ VH++DW PTL++ A
Sbjct: 234 KGTICKELVHITDWYPTLITLA 255
>gi|323452295|gb|EGB08169.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 614
Score = 229 bits (583), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 132/340 (38%), Positives = 187/340 (55%), Gaps = 20/340 (5%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIP-TPNIDALAYSGIILKNYYTVQLCTPSRS 112
A+ PH++F++ DD+G+NDVG+ D TP +D L G+ + Y Q+CTPSR+
Sbjct: 57 AAAETRPHVVFVMLDDVGFNDVGYASSDLGEMTPFLDGLMADGVRVDRLYGQQVCTPSRA 116
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
A++TGK PIH +QH + E GLP E L QYLK LGY T +VGKWHLG Y T
Sbjct: 117 AMLTGKLPIHLELQHWQVAPSEPWGLPTREATLAQYLKALGYSTHMVGKWHLGHYNNAST 176
Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGL--DMRRDLEPAWDLHGKYSTDVFTAEAV 230
P RGF+S G+++G DY H +W D+ D P D HG++ T + A+
Sbjct: 177 PLNRGFDSFYGFYSGGVDYLTHDPSTGYVWRCYRDLWDDERPVTDAHGQHQTSLMNERAI 236
Query: 231 DIIHNHSTD---EPLFLYLAHAATHSANPYEPLQAPDHYLN----IHRHIEDFKRSKFAA 283
++ H+ + EP+F Y+++ N + PLQ P L I + R FAA
Sbjct: 237 AVLERHAVEKKSEPVFAYVSY-----PNAHLPLQPPTELLERRNATLLDIPNHDRKNFAA 291
Query: 284 ILHKLDESVGKVV-EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
++ DES+ + L+ + ++IV SDNGG + + SN+PLRG K L+E
Sbjct: 292 LMMNADESLANLTNRGLKATGLYDTAVIVIASDNGGQVSA--MGGGSNYPLRGEKKYLFE 349
Query: 343 GGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
GGVR +I SPLL +RG + +H+SDWLPTLL A
Sbjct: 350 GGVRAHAVIHSPLLPRAARGSSYGKLMHMSDWLPTLLQGA 389
>gi|323452003|gb|EGB07878.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 1818
Score = 229 bits (583), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 136/356 (38%), Positives = 189/356 (53%), Gaps = 51/356 (14%)
Query: 59 PPHIIFILADDLGWNDVGFHG----LDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
PPH++F LADD+G+ DVG++G +++ TP IDALA +G+ L YYT CTPSR+A+
Sbjct: 43 PPHVVFFLADDVGFGDVGYNGDPTLTNRVSTPVIDALADAGVKLSRYYTQPDCTPSRAAL 102
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
++GK+P TG H VL GLPL +LP+ L YR+ VGKW +G + P
Sbjct: 103 LSGKYPATTGTYHGVLNPQSTWGLPLEHALLPEALPG-AYRSHAVGKWDVGHSSAKRLPE 161
Query: 175 FRGFESHLGYWTG-HQDYFDHSAEEM---------------------------KMWGLDM 206
RGF+S LG++ + D+S E+ GL
Sbjct: 162 ARGFDSFLGFYLCFYGPMIDYSTHEIHDHDLACAGDACAAALAKCQVRGSTVADFSGLGG 221
Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
+R D G Y+TDVF AVD+I + D PLFLY+A A H PL+A +
Sbjct: 222 QRR-----DYDGMYTTDVFADRAVDLIEAEAADHPLFLYVAFNAVHG-----PLEADEAD 271
Query: 267 LNIHRHI----EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
+ D KR+ FAA+ +D ++G+VV+AL+ N+++VF SDNG
Sbjct: 272 VAAFEATFLPESDSKRATFAAVALAMDRAIGRVVDALQATGAYENALVVFASDNGAIPG- 330
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTL 376
+ SNWPLRG K + WEGGVR + SPLL + RG V + HV+DWLPT+
Sbjct: 331 -QMGGGSNWPLRGGKFSAWEGGVRVPAFVHSPLLPAHMRGAVYDGLFHVADWLPTI 385
Score = 219 bits (559), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 143/399 (35%), Positives = 207/399 (51%), Gaps = 54/399 (13%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHG----LDQIPTPNIDALAYSGIILKNYYTVQLCTPS 110
A+S PPH++ LADD G++DVG++ +Q+ TP +D+LA G+ L YYT CTPS
Sbjct: 614 AASNPPHLLLFLADDAGFDDVGYNSDPSKTNQVQTPFLDSLAAGGVKLARYYTQPDCTPS 673
Query: 111 RSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
R+A+++G +P +GM H ++ GL L +++PQ L YR+ VGKW +G Y
Sbjct: 674 RAALLSGMYPASSGMYHKMITAQSNWGLDLDLELIPQRLPA-AYRSHAVGKWDVGHYTWS 732
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSA-------EEMKMWGLDMRR-------------DL 210
+ P FRGF S LG+++ DY+ H EEM++ + R D
Sbjct: 733 HVPQFRGFRSFLGFYSPIIDYYTHETFDTLQCLEEMELTECEARLASECSSSIKDFNFDG 792
Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-- 268
+P G YSTDVF A A D+I + PLFLY+A A H+ PL+A + +
Sbjct: 793 DPLPLADGTYSTDVFAARARDLIRKEAPKHPLFLYVAFNAVHA-----PLEADEDVIESF 847
Query: 269 IHRHIEDFK--RSKFAAILHKLDESVGKVVEALEQ-RRMLSNSIIVFVSDNGGAAAGFNL 325
+E+ + R+ FAA+ +D ++ VV+ L+ NS+I+F SDNGG +
Sbjct: 848 ASSFLEEARDARASFAAVALGMDRAMEAVVDELKAVDGAYENSVIMFASDNGGLPG--QI 905
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKS 383
SNWPLRG K + +EGGVR + SPLL + RG HV+DWLPTL+ A +
Sbjct: 906 AGGSNWPLRGQKFSPFEGGVRVPAFVHSPLLPAARRGGAYGGLFHVTDWLPTLVRLAGGA 965
Query: 384 DIPNYVNSTVENIIPRYENSILRYEN--GTHEYNSPRIE 420
PR + + +YE G H PR E
Sbjct: 966 P-------------PRNVDGVDQYEAIFGEHPAEHPRTE 991
>gi|395504882|ref|XP_003756775.1| PREDICTED: arylsulfatase I [Sarcophilus harrisii]
Length = 598
Score = 228 bits (582), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 119/285 (41%), Positives = 180/285 (63%), Gaps = 14/285 (4%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 215 HIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 273
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E+GY T +VGKWHLGFYKK PT RGF++
Sbjct: 274 IHTGLQHSIIRPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDT 333
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW+ G+YST ++ A I+ +H+
Sbjct: 334 FLGSLTGNVDYYTYDNCDGPGVCGYDLHEGESVAWEQSGQYSTLLYAQRASQILASHNPR 393
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
+PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 394 QPLFLYVAFQAVHT-----PLQSPREYLYRYRSMGNVARRKYAAMVTCMDEAVRNITWAL 448
Query: 300 EQRRMLSNSIIVFVSDNGGA---AAGFNLNAASNWPLRGVKNTLW 341
++ NS+I+F +DN A + +NL ++ G + ++W
Sbjct: 449 KRYGYYDNSVIIFSTDNXLANFPGSWWNLERLAD----GTRQSVW 489
>gi|449265842|gb|EMC76972.1| Arylsulfatase J, partial [Columba livia]
Length = 467
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 114/265 (43%), Positives = 166/265 (62%), Gaps = 11/265 (4%)
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
++ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY++E PT RG
Sbjct: 1 RYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRG 60
Query: 178 FESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHN 235
F++ G G DY+ H + + G D+ + AWD +G YST ++T + I+ +
Sbjct: 61 FDTFFGSLLGSGDYYTHFKCDSPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILAS 120
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++ V
Sbjct: 121 HNPSKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEAINNV 175
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
AL++ NSII++ SDNGG + SNWPLRG K T WEGG+R G + SPL
Sbjct: 176 TLALKKYGYYDNSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVHSPL 231
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
L+++G V ++ VH++DW PTL++ A
Sbjct: 232 LKNKGSVCKELVHITDWFPTLITLA 256
>gi|443698985|gb|ELT98690.1| hypothetical protein CAPTEDRAFT_103525, partial [Capitella teleta]
gi|443734460|gb|ELU18442.1| hypothetical protein CAPTEDRAFT_129771, partial [Capitella teleta]
Length = 333
Score = 227 bits (579), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 129/331 (38%), Positives = 184/331 (55%), Gaps = 23/331 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+ +FILADDLG++D+G D I TPN+DALA G+IL N Y LCTPSR A+MTG++
Sbjct: 3 PNFVFILADDLGYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCTPSRHALMTGRY 61
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P + MQ +V+ L K+LPQYLK+LGY+ +VGKWHLG+ + E PT RGF+
Sbjct: 62 PSASAMQTSVILPMRAKCTGLEYKLLPQYLKDLGYKNHMVGKWHLGYCRDECLPTSRGFD 121
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD-------- 231
+ G + G DY+ H+ W + D E A H + + D
Sbjct: 122 TFYGLYAGTGDYWSHTFFGKYDWHTNADIDFE-ANSTHSQVRSSYMNFVLQDLEMERLDK 180
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH--IEDFKRSKFAAILHKLD 289
+ H + +PLFLY A NP+ P Q D +LN++ D +R K+ + LD
Sbjct: 181 VFDEHDSKDPLFLYFA-----PQNPHTPSQPTDEFLNLYPEDKFNDIRR-KYLGLTSGLD 234
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
VGK+VE L + M++N+ ++FVSDNG N P RG K++L+EGG +
Sbjct: 235 AMVGKIVEKLIENGMMNNTYVIFVSDNGADPP-----EGLNTPFRGGKSSLFEGGTKSNS 289
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
I+SPLL+ + +H++DWLPTL+ A
Sbjct: 290 FIYSPLLKKTEYENDGLMHITDWLPTLVRLA 320
>gi|291225017|ref|XP_002732505.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 497
Score = 227 bits (579), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 132/341 (38%), Positives = 187/341 (54%), Gaps = 19/341 (5%)
Query: 48 MVFVDLVASSGP-------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
VF+ L S P PH+I I DD+GWNDV +H D I PN+ LA G+I +
Sbjct: 8 FVFLALAVVSKPSIADEDQPHVILIFVDDMGWNDVHWHNPD-IAMPNLMDLADDGVIFEQ 66
Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
YT CTPSR+A+MTG +P TG QH +++ G+PL+ K+LP+ LKE+GY T +VG
Sbjct: 67 SYTHPTCTPSRAAMMTGLYPFKTGNQHQMVFNLHPSGVPLNFKLLPEKLKEVGYATHMVG 126
Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWT-GHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
KWHLGF K EY PT RGF+SH G WT G DY + G D R +LE G
Sbjct: 127 KWHLGFCKDEYLPTNRGFDSHYGLWTLGVGDYDKLNGVLSPSAGYDFRDNLEVVPKSDG- 185
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
Y + A +I++NHS + PLFL P + L+ P+ Y ++ IED +
Sbjct: 186 YLALMLGDRAEEIVNNHSPETPLFLVFTLDI-----PAKHLEIPEEYEELYADIEDDRTR 240
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
+F L +D+ +G VV+AL+ R M +S+ +F+ DNG ++ A SN+P RG+ T
Sbjct: 241 QFYGKLTMMDDIIGGVVDALKDRDMYDDSLFIFIGDNGALSS----QAGSNYPFRGIAGT 296
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
L+EG R ++ +E G + + + D T+L A
Sbjct: 297 LFEGATRVPSIVSGKGIEKTGYKSNELYSIVDIHRTILDVA 337
Score = 42.7 bits (99), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 37/133 (27%), Positives = 63/133 (47%), Gaps = 31/133 (23%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTKGKWKLVKVVKVMRYQVDLTGG 631
E+DG+ +W S +EPS RN I++NIDD+ +A+ G +KL+ TG
Sbjct: 344 ELDGLSMWPTWSEDEPSPRNEIIYNIDDDPVSPGAAIRIGDYKLI------------TGH 391
Query: 632 PDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPC--------EPQIAPCLFDIKND 683
PD +Y R+ +R + + P +P PQ LF+I +D
Sbjct: 392 PDLLY----PYRD-----IRLTDNYYNYGDPPASGLPTTDQEANAPAPQNVEYLFNIVDD 442
Query: 684 PCEKNNLADRSED 696
P E+++++ ++D
Sbjct: 443 PEERHDISADNQD 455
Score = 40.0 bits (92), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 30/45 (66%), Gaps = 2/45 (4%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTRGKWKLV 551
E+DG+ +W S +EPS RN I++NIDD+ +A+ G +KL+
Sbjct: 344 ELDGLSMWPTWSEDEPSPRNEIIYNIDDDPVSPGAAIRIGDYKLI 388
>gi|313235834|emb|CBY19819.1| unnamed protein product [Oikopleura dioica]
Length = 518
Score = 227 bits (578), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 124/334 (37%), Positives = 195/334 (58%), Gaps = 20/334 (5%)
Query: 62 IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
+++++ DD GW D G HG ++ TPN+DA+A GI+L+ YYT Q+C+P+RS+++TG++PI
Sbjct: 20 VVYLMIDDFGWGDFGVHG-SKLETPNLDAIARDGILLEKYYTQQVCSPTRSSLLTGRYPI 78
Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
GMQHNV+ + G+P +LPQ LK G+ T +VGKWH G + P RGF+S+
Sbjct: 79 RYGMQHNVILAGQTTGIPKEYALLPQDLKSCGFATHMVGKWHCGHSHEYMLPFNRGFDSY 138
Query: 182 LGYWTGHQDYFDHSAEEMKMW-GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE 240
GY G +D++ + K W G+D + P G Y T++++A+ ++ S +E
Sbjct: 139 YGYLQGAEDHYSRIQCQAKEWCGVDFCTENGPTNSTWGTYGTEIYSAQVAQVLDKVSKEE 198
Query: 241 -PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD-ESVGKVVEA 298
P FLY A H +PLQAP+HY IED R +AA++ +D E+VG +
Sbjct: 199 KPFFLYYAMQNVH-----DPLQAPEHYKIKFDWIEDPDRRTYAAMVSIMDEETVGDFMTM 253
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L +R + +++I+ +DNGG + +N+PLR K +L+EGGV ++ L++
Sbjct: 254 LHERNLYDDTLIIITADNGGETR----SGGNNFPLRSQKWSLYEGGVHVNAMLSGGLIKD 309
Query: 359 RG-IVAEQYVHVSDWLPTLLSA------ANKSDI 385
+ + VHVSDW T+L A A+KSD+
Sbjct: 310 QSKSTFDGLVHVSDWRETILEAMECKPPADKSDL 343
>gi|443693750|gb|ELT95037.1| hypothetical protein CAPTEDRAFT_126817, partial [Capitella teleta]
Length = 318
Score = 227 bits (578), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 126/323 (39%), Positives = 185/323 (57%), Gaps = 18/323 (5%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+ +FI+ADDLG++D+G D I TPN+DALA G+IL N Y LC+PSR A+MTG++
Sbjct: 3 PNFVFIMADDLGYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCSPSRHALMTGRY 61
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P + MQ V+ E L K+LPQYLKELGY ++GKWHLG+ ++E PT RGF+
Sbjct: 62 PYKSAMQSFVVLPFEAKCTGLEYKLLPQYLKELGYENHLIGKWHLGYCREECLPTSRGFD 121
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
S G G DY++H+ + W L+ E + +G +S D+ + H
Sbjct: 122 SFYGLLDGAGDYWEHTTSGVYDWHLND----EVFHEAYGNHSQDLELDRLDKLFAEHDNK 177
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK--RSKFAAILHKLDESVGKVVE 297
+PLFLY A NP+ P Q + YL+++ +DF R ++ A+ LD VGK+V+
Sbjct: 178 DPLFLYFA-----PQNPHVPSQPTEEYLSMYS-ADDFSDVRRRYLALTSALDSMVGKIVQ 231
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
L+ M+ N+ ++F+SDNG N P RG K +L+EGG + I+S LL+
Sbjct: 232 HLKDNCMMDNTYLIFMSDNGADPG-----EGQNTPFRGGKTSLFEGGTKSNSFIYSQLLK 286
Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
+ +H++DWLPTL+ A
Sbjct: 287 KTEYENDGLMHITDWLPTLVKLA 309
>gi|443702858|gb|ELU00682.1| hypothetical protein CAPTEDRAFT_125641 [Capitella teleta]
Length = 370
Score = 227 bits (578), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 134/326 (41%), Positives = 190/326 (58%), Gaps = 20/326 (6%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PP+IIFILADDLG+NDVGF D I TPNID LA G+++ N Y+ +CTPSR A+MTG+
Sbjct: 10 PPNIIFILADDLGYNDVGFRNPDII-TPNIDKLARKGVVMTNSYSTHVCTPSRHALMTGR 68
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+P TGMQ+ V+ G L K LPQYLK LGY T VGKWHLG + E PT RGF
Sbjct: 69 YPYKTGMQNFVIPGDAPVCSGLEYKFLPQYLKSLGYNTHAVGKWHLGDCRDECLPTERGF 128
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+S G G Y++H+ + +DL+ + +G +S D+ + +H+
Sbjct: 129 DSFYGLLLGGGGYWNHTYTLFGAYDWFNNKDLDLS--ANGTHSQDLMVDRLSAVFASHNR 186
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK----RSKFAAILHKLDESVGK 294
+EP+FLY A NP+ P +A +++L RH R ++ ++ LD VG+
Sbjct: 187 EEPMFLYFA-----PQNPHTPSEATENFL---RHYPSSMYTEIRRQYLGLVSGLDWMVGE 238
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
++E + ML N+ I+F SDNGG A A N P RG K +L+EGG + I+SP
Sbjct: 239 LLELVIANNMLDNTYIIFQSDNGGDAV-----EALNGPYRGGKGSLFEGGSKVVNFIYSP 293
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
LL+ G ++H++DW+PT++S A
Sbjct: 294 LLKKVGYENNGWMHITDWMPTIISLA 319
>gi|449499600|ref|XP_002193905.2| PREDICTED: arylsulfatase J [Taeniopygia guttata]
Length = 488
Score = 226 bits (577), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 115/262 (43%), Positives = 163/262 (62%), Gaps = 11/262 (4%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY++E PT RGF+S
Sbjct: 25 IHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGFDS 84
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHST 238
G G DY+ H + + G D+ + AWD +G YST ++T + I+ +H+
Sbjct: 85 FFGSLLGSGDYYTHFKCDSPGICGYDLYENNNAAWDHDNGIYSTQMYTQKVQQILASHNP 144
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
+P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++ V A
Sbjct: 145 RKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSINNINRRRYAAMLACLDEAINNVTLA 199
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L + NSII++ SDNGG + SNWPLRG K T WEGG+R G + SPLL++
Sbjct: 200 LRKYGYYENSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 255
Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
+G V ++ VH++DW PTL++ A
Sbjct: 256 KGSVCKELVHITDWFPTLITLA 277
>gi|355669602|gb|AER94582.1| arylsulfatase B [Mustela putorius furo]
Length = 418
Score = 226 bits (576), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 113/260 (43%), Positives = 160/260 (61%), Gaps = 15/260 (5%)
Query: 130 LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQ 189
++ C+ +PL EK+LPQ LKE GY T +VGKWHLG ++KE PT RGF+++ GY G +
Sbjct: 1 IWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMFRKECLPTRRGFDTYFGYLLGSE 60
Query: 190 DYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLF 243
DY+ H A + LD R E A YST++FT A +I NH ++PLF
Sbjct: 61 DYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIANHPPEKPLF 120
Query: 244 LYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRR 303
LYLA + H EPLQ P+ YL ++ I+D R +A ++ +DE+VG V AL+
Sbjct: 121 LYLALQSVH-----EPLQVPEEYLKPYKFIQDKNRHHYAGMVSLMDEAVGNVTAALKSHG 175
Query: 304 MLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVA 363
+ +N++ +F +DNGG L +NWPLRG K +LWEGG+RG G + SPLL+ +GI
Sbjct: 176 LWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGIRGVGFVASPLLKQKGIKN 231
Query: 364 EQYVHVSDWLPTLLSAANKS 383
+ +H+SDWLPTL++ A S
Sbjct: 232 RELIHISDWLPTLVTLAGGS 251
>gi|323454261|gb|EGB10131.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 635
Score = 226 bits (576), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 136/352 (38%), Positives = 188/352 (53%), Gaps = 37/352 (10%)
Query: 59 PPHIIFILADDLGWNDVGF----HGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
PPH++ ILADD+G+ND+G+ H +Q+ TP +D LA G+ L YYT C+PSR A+
Sbjct: 117 PPHLVLILADDMGYNDIGYNRAPHQTNQVSTPFLDELASEGVTLTRYYTQCDCSPSRGAL 176
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TG +P TG+ H V+ GLPL ++PQ+L YR+ +GKW +G Y + PT
Sbjct: 177 LTGLYPASTGLYHGVIVTQSHWGLPLEYHLIPQFLPSR-YRSHAIGKWDVGHYTWNHVPT 235
Query: 175 FRGFESHLGYWTGHQDYFDH----------SAEEMKMWGLDMRRDL-----EPAWDLHGK 219
RGF S++G++ DY+ H + +K D DL + + +
Sbjct: 236 GRGFHSYVGFYGTDIDYYTHEIGAGCNSYNCSSAIKRCMNDSITDLNYDGAATGDEYYNR 295
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF--- 276
YSTD+FT AV+++ S PLFLY+A A H PL+A D I R F
Sbjct: 296 YSTDIFTDRAVELLRTESARNPLFLYVAFNAVHG-----PLEADD--ATIDRFNASFLYE 348
Query: 277 ---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
+R+ FAA +D +V +V+ + NSI++F SDNG A SNWPL
Sbjct: 349 ARTERATFAAAALLMDRAVESIVKTMHAVGAYDNSIVLFASDNGATLA--QTGGGSNWPL 406
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKS 383
RG K T +EGGVR + SPLL S RGI HV+D LPTL+ AA S
Sbjct: 407 RGSKFTPYEGGVRVPAFLHSPLLGSGRRGITHAGLFHVTDILPTLVHAAGGS 458
>gi|171910063|ref|ZP_02925533.1| twin-arginine translocation pathway signal precursor
[Verrucomicrobium spinosum DSM 4136]
Length = 496
Score = 225 bits (574), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 125/324 (38%), Positives = 179/324 (55%), Gaps = 16/324 (4%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+FILADDLG +DVG+ G +I TP++D LA +G L +Y +C+P+R+A++TG++
Sbjct: 53 PNIVFILADDLGSHDVGWRG-SEIKTPHLDELARAGATLDQFYVQPVCSPTRAALLTGRY 111
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P G Q V+ GLPL E+ LPQ LKE GY T I GKWHLG ++ Y PT RGF+
Sbjct: 112 PFRYGFQTGVVRPWAEYGLPLEERTLPQALKEAGYETAITGKWHLGHFQPAYLPTKRGFD 171
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
G++ G DY+ H ++ G D R+ + D YST++ EA +
Sbjct: 172 HQYGHYNGMLDYYTH----IRHGGFDWHRNDQENHDE--GYSTELVGKEAARRVRERDKS 225
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLY+ HS P Q P Y + E KR +A ++ LD +VG++V+A+
Sbjct: 226 RPLFLYVPFNGVHS-----PHQVPVEYETPYTQFEG-KRKTYAGMVTALDAAVGRIVQAV 279
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
+++ N++ +F SDNGG SN PLR K T++EGGVR P
Sbjct: 280 RDEKLVENTLFIFSSDNGGPNPK---QLTSNGPLRAGKGTVYEGGVRVCAFATWPGKIKP 336
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
G + +H++DW PTLL A S
Sbjct: 337 GTSVQAPIHIADWFPTLLGLAGAS 360
>gi|390362801|ref|XP_793927.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 739
Score = 225 bits (573), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 113/271 (41%), Positives = 160/271 (59%), Gaps = 21/271 (7%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH V+ + LPL E LPQ LKE GY T +VGKWH+GFY TPT RGF+S
Sbjct: 318 IHTGLQHGVIRHAQPNCLPLDEVTLPQKLKESGYATNMVGKWHIGFYLDACTPTERGFDS 377
Query: 181 HLGYWTGHQDYFDHS----------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
G+ G +D++ HS + G D+ + P +D G+YST +FT + +
Sbjct: 378 FFGFLVGGEDHYTHSGHCEVYNADDGSTKTLSGHDLHANKTPVFDYEGQYSTHLFTNKTI 437
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
D I H +PLF+YLA+ A H PLQ PD Y++ +++I D R +A ++ +DE
Sbjct: 438 DAIERHDKTKPLFMYLAYQAVHL-----PLQVPDRYMDPYKNIADENRRTYAGMVSCMDE 492
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
+G V AL+ + N+II+F +DNGG + A+NWPLRG K +WEGG+ G G
Sbjct: 493 GIGNVTRALKDAGLYDNTIIIFSTDNGGPIQ----SGANNWPLRGSKGNMWEGGIHGVGF 548
Query: 351 IWSPLLES--RGIVAEQYVHVSDWLPTLLSA 379
+ SPLL + +G V +HVSDWLPT+++
Sbjct: 549 VHSPLLPTSVKGTVNHDLMHVSDWLPTIVAG 579
>gi|291221683|ref|XP_002730830.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 499
Score = 225 bits (573), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 136/351 (38%), Positives = 190/351 (54%), Gaps = 29/351 (8%)
Query: 44 FTLSMVFVDL---------VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
+ L VFV L +A + PHI+ I DDLGWNDV +H D I P + LA
Sbjct: 2 YCLYFVFVVLALPYISKPCIAKAEQPHIVLIFVDDLGWNDVEWHNPD-IKMPVLSKLAAD 60
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
G+I YT CTPSR+A+MTG +P TG QH +++ G+PL K+LP+ LKE+GY
Sbjct: 61 GVIFNQSYTHPTCTPSRAAMMTGMYPFRTGNQHQMIFNLHPSGVPLEFKLLPEKLKEVGY 120
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT---GHQDYFDHSAEEMKMWGLDMRRDL- 210
T +VGKWHLGF K+EY PT RGF+SH G WT GH D + + G D R ++
Sbjct: 121 FTHMVGKWHLGFCKEEYLPTSRGFDSHYGLWTLGVGHYDKMNGVLSPSE--GYDFRDNIG 178
Query: 211 -EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
P D +Y T + A II+ H PLFL P + L+ PD + +
Sbjct: 179 VVPKSD---EYLTLMLAERAEHIINGHYNKHPLFLQFTMDI-----PAKHLEIPDTFEAL 230
Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
+ IED + KF L +D +G+VV+AL+ R M +++++F+ DNG A+ A S
Sbjct: 231 YCDIEDERTRKFYGKLSLMDHVIGRVVDALKAREMWDDTVLMFIGDNGALAS----QAGS 286
Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
N+P RG+ TL+EG R LI +L G V + ++ D T+L+ A
Sbjct: 287 NYPFRGIAGTLFEGATRVPSLIHGSMLRETGYVNNELWNLVDLHRTILALA 337
>gi|432104048|gb|ELK30879.1| Arylsulfatase J [Myotis davidii]
Length = 496
Score = 225 bits (573), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 117/262 (44%), Positives = 156/262 (59%), Gaps = 11/262 (4%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+V+ + LPL LPQ L+E GY T +VGKWHLGFY++E PT RGF++
Sbjct: 33 IHTGLQHSVIRPAQPNCLPLGHATLPQKLREAGYATHMVGKWHLGFYRRECMPTHRGFDT 92
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHST 238
G G DY+ H + G D+ + AWD G YST ++T I+ H
Sbjct: 93 FFGSLLGSGDYYTHYKCDGPGACGYDLYENDRAAWDHDDGVYSTHMYTRRVQQILAAHDP 152
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
+PLFLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+VG V A
Sbjct: 153 GKPLFLYIAYQAVHS-----PLQAPSRYFRHYRSIANTHRRRYAAMLSCLDEAVGNVTLA 207
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L + +NSI+++ SDNGG + SNWPLRG K T WEGG+R G + SPLL
Sbjct: 208 LRAHGLYNNSILIYSSDNGGQPSA----GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLRQ 263
Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
RG V + VH++DW PTL++ A
Sbjct: 264 RGTVCRELVHITDWYPTLVALA 285
>gi|171909641|ref|ZP_02925111.1| twin-arginine translocation pathway signal precursor
[Verrucomicrobium spinosum DSM 4136]
Length = 486
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 134/356 (37%), Positives = 194/356 (54%), Gaps = 27/356 (7%)
Query: 42 LAFTLSMVFVD---LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
LA TL+ + L A+ P+II I+ADDLGW DVGF+G I TP++DALA G
Sbjct: 9 LAGTLAASLLPQLPLAAAPSAPNIIHIVADDLGWQDVGFNGCKDIQTPHLDALAKGGARF 68
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
+Y +CTP+R+A+MTG++P G+Q V+ GL E +LPQ L++ GY T I
Sbjct: 69 TQFYVQPMCTPTRAALMTGRYPFRYGLQTAVIPSVSTYGLDTGEYLLPQCLQDAGYTTAI 128
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
+GKWHLG K++ P RGFE G G DY+ HS + LD RD EP +
Sbjct: 129 IGKWHLGHADKKFWPKQRGFEYQYGAMIGELDYYTHSEHGV----LDWFRDNEPVHE--E 182
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
Y+T++ A+AV + D P +LYLA A H+ P QAP Y++ + HI D R
Sbjct: 183 GYTTNLLGADAVKYLEKQKADRPFYLYLAFNAPHT-----PYQAPQEYIDRYTHIADPTR 237
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA----AAGFNLNAA------ 328
+A ++ LD+++G+VV AL+++ + N++I+F SDNGG AG + +
Sbjct: 238 RTYAGMVACLDDNIGRVVAALDKKGLRENTLILFHSDNGGTHNPMFAGVMADVSKIKIPC 297
Query: 329 SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
N P R K +L+EGGVR L W ++ + + +H +D T A S
Sbjct: 298 DNSPYRDGKGSLFEGGVRVCALANWPGKIKPATV--DGLIHATDIFTTFTKLAGAS 351
>gi|298706912|emb|CBJ29739.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 781
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 132/348 (37%), Positives = 187/348 (53%), Gaps = 53/348 (15%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIP-TPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
S+ P++ F L DD+G+ D+G+ D TPN+DALA G+ L NYYT+ LCTP+R++I
Sbjct: 213 STSKPNVFFFLIDDMGFGDMGYQSTDLSEITPNLDALAAGGVKLSNYYTMTLCTPARASI 272
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
MTG++P+ GMQ++V+ GLP SEKILP+Y+ E GY + +VGKWHLG Y+ E P+
Sbjct: 273 MTGRYPVRYGMQYSVIMPGSPWGLPTSEKILPEYMNEAGYESHMVGKWHLGSYRDESLPS 332
Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKM--------------------------------- 201
RGF++ LGY G + Y+ H E +
Sbjct: 333 QRGFKTFLGYLNGIETYYSHKNPEASVDGQYFFDFGYGNATGYHDVTLQNHDENVGGPCT 392
Query: 202 -----WGLDMRRDLEPA-WDLHGKYSTDVFTAEAVDIIHNHST--DEPLFLYLAHAATHS 253
WG D+ + +PA G YSTD F A I+ + + ++PLF+Y+AH + HS
Sbjct: 393 DGGPRWG-DVMENEDPADVCFTGTYSTDAFVGRAKQIVKSKAPFDEDPLFMYIAHQSVHS 451
Query: 254 AN---PYEPLQAPDHYL--NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNS 308
PYE + L + E KR+ FA +L LD+ +G+ V+ LEQ L NS
Sbjct: 452 PTGPAPYEEFSPEELALLEGVKDATESEKRTAFAGVLLYLDKRIGEFVDMLEQEGWLENS 511
Query: 309 IIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
+IV SDNG + SN+PLRG K +++EGG R G SP L
Sbjct: 512 VIVVSSDNGACPD----DGGSNYPLRGSKQSVFEGGSR-TGYQPSPAL 554
>gi|443692244|gb|ELT93884.1| hypothetical protein CAPTEDRAFT_107177, partial [Capitella teleta]
Length = 328
Score = 224 bits (570), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 129/331 (38%), Positives = 184/331 (55%), Gaps = 22/331 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+ +FILADDLG+ND+GF D + TPN+D LA G+IL N Y +CTPSR A+MTG++
Sbjct: 1 PNFVFILADDLGYNDIGFRNPD-VQTPNLDYLANKGVILTNNYVQAVCTPSRHALMTGRY 59
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P + MQ+ V+ + L K LPQYLKELGY+ ++GKW+LG+ ++E PT RGF+
Sbjct: 60 PYRSAMQNFVINPDQAKCTALEYKFLPQYLKELGYQNHLIGKWNLGYCREECLPTSRGFD 119
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS----TDVFTAEAVD---- 231
S G G DY++H+ + + + K S F +D
Sbjct: 120 SFFGLLDGAGDYWEHTTYGLYDCTGQSLAGMACLCEFTQKISILMPVICFQDLELDRLDK 179
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK--RSKFAAILHKLD 289
I H +PLFLY A NP+ P Q D +LN + +DF+ R ++ A+ LD
Sbjct: 180 IFTEHDNKQPLFLYFA-----PQNPHTPSQPTDEFLNKYS-ADDFEDVRRRYLALTSGLD 233
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
VGKVVE L M+ N+ ++F+SDNG A N P RG K +L+EGG + +
Sbjct: 234 SMVGKVVELLNDNHMMDNTYLIFLSDNGADPA-----EGLNAPFRGGKGSLFEGGTKSSS 288
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
I+SPLL+ + +H++DWLPTL+ A
Sbjct: 289 FIYSPLLKKTEYENDGLMHITDWLPTLVKLA 319
>gi|443692243|gb|ELT93883.1| hypothetical protein CAPTEDRAFT_107171, partial [Capitella teleta]
Length = 330
Score = 224 bits (570), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 125/329 (37%), Positives = 184/329 (55%), Gaps = 21/329 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+ +FILADDLG+ND+G D I TPN+DALA G+IL N Y +C+PSR A+MTG++
Sbjct: 3 PNFVFILADDLGYNDLGLRDPDVI-TPNMDALASKGVILTNNYVQAVCSPSRHALMTGRY 61
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P + MQ V+ E L K LPQYLK+LGY+ ++GKWHLG+ ++E PT RGF+
Sbjct: 62 PSASAMQSIVIQPMEAKCSGLKYKFLPQYLKDLGYKNHMIGKWHLGYCREECLPTSRGFD 121
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYST------DVFTAEAVDII 233
+ G + DY++H M W + D E A H + + D+ +
Sbjct: 122 TFYGLYASSGDYWEHGIMGMYDWHTEAGVDFE-ARGTHAQVGSRYWHIYDLEMERLDKVF 180
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH--IEDFKRSKFAAILHKLDES 291
H +PLFLY A +H+ P Q D ++N++ D +R K+ + LD
Sbjct: 181 DEHDNKDPLFLYFAPQNSHT-----PHQPTDEFINLYSEDKFSDIRR-KYLGLTSGLDAM 234
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
VGK+V+ L + M++N+ ++FVSDNG N P RG K +L+EGG + + I
Sbjct: 235 VGKIVDKLTENDMMNNTYVIFVSDNGADPI-----EGRNAPFRGGKGSLFEGGTKSSSFI 289
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+SPLL+ + +H++DWLPTL+ A
Sbjct: 290 YSPLLKKTEYENDGLMHITDWLPTLVKLA 318
>gi|291237236|ref|XP_002738543.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 514
Score = 223 bits (568), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 128/329 (38%), Positives = 183/329 (55%), Gaps = 21/329 (6%)
Query: 62 IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
II + GWNDVG+H D +PTPN++ LA G+IL N Y+ +CTPSR A+MTGK+P
Sbjct: 41 IIILTISASGWNDVGWHN-DFMPTPNLNTLAREGVILDNMYSQPICTPSRVALMTGKYPA 99
Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
GMQH V+ LP + L + LKE GY IVGKWHLG +YTP +RGF+SH
Sbjct: 100 KVGMQHFVVLPMRPYYLPGNYATLAEKLKEQGYTNHIVGKWHLGSCDWKYTPMWRGFDSH 159
Query: 182 LGYWTGHQDYFDHSAEEMKMW-------GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
G G F+ M W G D+R + +G ++T +F+ A I+
Sbjct: 160 YGCHEGVTSNFE---THMLTWPPVVGVSGRDLRDNTGLVTHENGTHNTMLFSERAERIVK 216
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
NH+ + PLFLY+ + A P+ PLQAP + ++D R FA ++ +D+ VG
Sbjct: 217 NHNPESPLFLYVPYMA-----PHFPLQAPQGFEEA-VQLDDTDRRTFAGMMAAMDQGVGN 270
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
+ +AL+ + M +++ VF+SDNGG + A SN+P RG K T+WEGGV+ I
Sbjct: 271 ITQALKDKGMWDDTLFVFLSDNGGD----KMFAGSNYPYRGNKATMWEGGVKVPAFIRGS 326
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+L++ G V + H +D TLL A +
Sbjct: 327 MLKTSGTVNNELYHFTDIFATLLKVAGGT 355
>gi|115533416|ref|NP_001041231.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
gi|351060347|emb|CCD68015.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
Length = 488
Score = 223 bits (568), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 128/345 (37%), Positives = 194/345 (56%), Gaps = 27/345 (7%)
Query: 51 VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY--SGIILKNYYTVQLCT 108
VD ++ P+++FI+ADDLG++DV + + TPN+ LA+ + +L N Y QLCT
Sbjct: 22 VDGQTATQKPNVLFIMADDLGFSDVDWKD-STLHTPNLRHLAFHKNTALLSNSYVNQLCT 80
Query: 109 PSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
P+RSA MTG +P G Q+ V E G+P L + +++L Y T +VGKWHLG+ K
Sbjct: 81 PTRSAFMTGYYPFRVGTQNGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCK 140
Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAE----EMK--MWGLDMRRDLE-----PAWDLH 217
KE+ PT RGF+ G++ YF+HSA+ E+K + GLD+ ++ P + +
Sbjct: 141 KEFLPTNRGFDYFYGFYGPQTGYFNHSADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQN 200
Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF- 276
G YSTD+FT A+ ++ NH+ +P F++L++ A H PLQ I + E
Sbjct: 201 GVYSTDLFTDVAMSVLDNHNNSKPFFMFLSYQAVH-----PPLQVSQQSKTIGQGKEATF 255
Query: 277 ---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
+ +L +D ++G++VE L+ + N++IVF SDNGG A ASN PL
Sbjct: 256 ILRSHAHSTRMLTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGGTAN----FGASNAPL 311
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
RG K+T+WEGG + + SP+ G + HV DW T+LS
Sbjct: 312 RGEKDTIWEGGTKTTTFVHSPMYIEEGGTRDMMFHVVDWHATILS 356
>gi|291233691|ref|XP_002736785.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 499
Score = 223 bits (567), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 124/322 (38%), Positives = 178/322 (55%), Gaps = 11/322 (3%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHI+ ILADDLGWNDVG+H ++ P ++ LA G+I Y CTP+R+A+M+G
Sbjct: 30 PPHIVLILADDLGWNDVGWHN-PEVKMPVLNQLAADGVIFNQAYVQPTCTPTRAALMSGY 88
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+P TG QH +L GGLPL K LPQ LK++GY T IVGKWHLGF K+ + PT RGF
Sbjct: 89 YPFKTGNQHQLLLNLHPGGLPLRFKTLPQRLKDVGYLTHIVGKWHLGFCKEAFLPTNRGF 148
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+S G T +F + G D D Y + AV II+ H
Sbjct: 149 DSFYGGLTLGTSHFSKMNGILSTPGYDF-YDNSGVVPQTNDYLAFMLADRAVKIINGHYQ 207
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
+ PLF+Y + P + + P Y ++ +I D + +F L +D++VG VV+A
Sbjct: 208 EYPLFMYFSMDV-----PAKSPEVPPEYEALYANITDDRTRRFYGKLSVMDDAVGSVVDA 262
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L++R M +++++F+ DNG A+ ++ SNWPLRG+ TL+EG R I +L+
Sbjct: 263 LKKRDMWDDTLLIFIGDNGALAS----SSGSNWPLRGIAATLFEGATRVPAFIHGNMLKQ 318
Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
G + H+ D T+LS A
Sbjct: 319 TGYENNELWHIVDLHKTILSLA 340
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 61/124 (49%), Gaps = 22/124 (17%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS---ALTKGKWKLVKVVKVMRYQVDLTG 630
EIDG+D+W S+NE S RN I++NIDD+ IS A+ G +KL+ G
Sbjct: 347 EIDGMDMWDTFSKNESSPRNEIVYNIDDD-PISPGAAIRVGDYKLI------------AG 393
Query: 631 GPDQVYLSGLSDRE--WLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKN 688
PD Y L ++ W + + P + P P + LF++ +DP E+N
Sbjct: 394 NPDLFYPLRLINQSDGWFNYGDVP---TSGLPVLPTGDEPPPPNVT-YLFNVIDDPEERN 449
Query: 689 NLAD 692
N+AD
Sbjct: 450 NIAD 453
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 33/49 (67%), Gaps = 4/49 (8%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS---ALTRGKWKLVKEN 554
EIDG+D+W S+NE S RN I++NIDD+ IS A+ G +KL+ N
Sbjct: 347 EIDGMDMWDTFSKNESSPRNEIVYNIDDD-PISPGAAIRVGDYKLIAGN 394
>gi|403049780|ref|ZP_10904264.1| sulfatase [SAR86 cluster bacterium SAR86D]
Length = 515
Score = 222 bits (566), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 134/361 (37%), Positives = 197/361 (54%), Gaps = 40/361 (11%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
M F L L FTLS+ P++I IL DDLGW DV +H IPTPNID+ +
Sbjct: 1 MKFYSLVLFFTLSL--------CSQPNVIIILTDDLGWGDVSYHN-GFIPTPNIDSFVSN 51
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLY-GCERGGLPLSEKILPQYLKELG 153
GI L +Y C+P+R++++TG H + G+ + E+ GLP KI+P+Y KE G
Sbjct: 52 GIELNRFYANPTCSPTRASLLTGLHIFNHGVIRPFMNPSAEQTGLPEHLKIMPEYFKEAG 111
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW---GLDMRRDL 210
Y+T + GKWHLG +K+EY PT RGF+S G+ G Y+DH W G+ + D
Sbjct: 112 YQTALSGKWHLGMHKEEYLPTNRGFDSSYGHMLGGIGYYDHVHTNRMDWHRDGVSLNED- 170
Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
YST++ EA++II N D PLFLY+A A H+ P++AP+ +N
Sbjct: 171 --------GYSTELIADEAINIIKNKDDDRPLFLYVAFNAPHT-----PIEAPEEDVNNF 217
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA----------A 320
+IED +AA + KLD +G+++ +++ +L +II+F+SDNG A
Sbjct: 218 LYIEDELDRNYAANISKLDIEIGRIINSIKDLGILEETIIIFLSDNGPVFDINPIVKTIA 277
Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVR-GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
G S LRG K + +GG+R A + W +LE ++Q++ + D LPTLL+A
Sbjct: 278 PGLTKAKGSTAGLRGSKTSALDGGIRVPAAIWWKGVLEKSK--SDQFIFIQDLLPTLLTA 335
Query: 380 A 380
Sbjct: 336 T 336
>gi|390361328|ref|XP_780209.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 469
Score = 222 bits (566), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 124/332 (37%), Positives = 179/332 (53%), Gaps = 55/332 (16%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHG-LDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
AS PHI+F++ DDLG+NDVG+H I T NIDALA G+ L++YY LCTPSRS
Sbjct: 24 ASPPTPHIVFVMVDDLGYNDVGYHSDGSAIETDNIDALAAGGLKLESYYVAPLCTPSRSQ 83
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++GK+ IH GMQH V+ LPL + + L + GY T +VGKWHLGFYK+E P
Sbjct: 84 FLSGKYLIHNGMQHLVIDPRVPRCLPLGDDTMANKLTDAGYATHLVGKWHLGFYKQECWP 143
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
RGF+S G L G+ + D
Sbjct: 144 LNRGFQSFFGM-------------------------------LLGQSTVD---------- 162
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR-HIEDFKRSKFAAILHKLDESV 292
+P+F+YL++ A P+ P + PD Y +R I + R +A ++ LDES+
Sbjct: 163 --QGGTKPMFMYLSYQA-----PHLPFEVPDEYFVSYRGKINNRNRRTYAGMVTMLDESI 215
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
GK+ + L++ + ++++ +F +DNGG NA +NWPLRGVK +EGG+RG G +
Sbjct: 216 GKLTDTLKEEGLWNDTVFIFSTDNGGVG---KKNAGNNWPLRGVKGNYFEGGIRGVGFVA 272
Query: 353 SPLLES--RGIVAEQYVHVSDWLPTLLSAANK 382
PLL + +G ++ +H+SDW PTL+ K
Sbjct: 273 GPLLSTNVQGTISTDLMHISDWYPTLVEGVAK 304
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 28/127 (22%), Positives = 58/127 (45%), Gaps = 11/127 (8%)
Query: 575 IDGIDVWSVLSRNEPSKRN-TILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
+DG+++W V+S+ E + I++NID + Q A K+ + + + G
Sbjct: 314 LDGVNMWDVISQGESGDPDREIVYNIDPDMQRPASIDSKYDSYSTIFDISIHASIRRGDM 373
Query: 634 QVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 693
++ +D +WL A ++ G + A L++I DP E+++L+D
Sbjct: 374 KLVTGYQADTDWL----------APVESGFANVYATDAGKAIWLYNITADPTEQDDLSDT 423
Query: 694 SEDQRIN 700
D+ ++
Sbjct: 424 MADEVVD 430
>gi|424863174|ref|ZP_18287087.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
gi|400757795|gb|EJP72006.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
Length = 519
Score = 222 bits (565), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 138/355 (38%), Positives = 202/355 (56%), Gaps = 32/355 (9%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L + L +VF + SS P+II I+ DDLGW DV ++G I TPNID LA G+ + +
Sbjct: 2 LKYILILVFSISLLSSERPNIIIIITDDLGWGDVSYNG-GPINTPNIDKLADDGLQMNRF 60
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLY-GCERGGLPLSEKILPQYLKELGYRTRIVG 160
Y+ C+P+R+A+ TG + + G+ + ER GLPL KILP+YLKE+GY+T + G
Sbjct: 61 YSAPTCSPTRAALFTGINSLKNGIIRPLNNPTAERYGLPLKHKILPEYLKEIGYQTALSG 120
Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
KWHLG + EY P RGFES G+ G YFDH+ LD R+ E ++ Y
Sbjct: 121 KWHLGMFSDEYLPRNRGFESTYGHLGGGIGYFDHALSGR----LDWHRNGEILYE--DGY 174
Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
ST + EA+ II N + + PLFLY+A A H+ P+QA + +N I D K
Sbjct: 175 STTLIADEAIRIIENKNNETPLFLYVAFNAPHT-----PIQAEEKIINNLSDISDKKERV 229
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN--AASNWP------ 332
+AA + LD +GK+++A+E + +L+ +II+F SDNG F+++ AA+ P
Sbjct: 230 YAANIITLDREIGKIIDAIESKGILNETIIIFFSDNGPV---FDIDPIAATIAPDILDSK 286
Query: 333 -----LRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
L+G K + +EGG+R +I + +LE ++ Q+ V D LPTL A N
Sbjct: 287 GNTLGLKGSKGSAYEGGIRVPAVIYYKGILEKS--LSNQFFFVDDILPTLFGALN 339
Score = 40.4 bits (93), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 17/26 (65%), Positives = 21/26 (80%)
Query: 9 CERGGLPLSEKILPQYLKELGYRTRI 34
ER GLPL KILP+YLKE+GY+T +
Sbjct: 93 AERYGLPLKHKILPEYLKEIGYQTAL 118
>gi|149726732|ref|XP_001501521.1| PREDICTED: arylsulfatase I-like [Equus caballus]
Length = 476
Score = 221 bits (563), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 113/267 (42%), Positives = 165/267 (61%), Gaps = 10/267 (3%)
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
++ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RG
Sbjct: 8 RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRG 67
Query: 178 FESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
F++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ I+ +H
Sbjct: 68 FDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASH 127
Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
S +PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V +
Sbjct: 128 SPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNIT 182
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
AL++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL
Sbjct: 183 WALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLL 238
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAANKS 383
+ + + VH++DW PTL+ A +
Sbjct: 239 KRKRRTSRALVHITDWYPTLVGLAGGT 265
>gi|241680395|ref|XP_002412684.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215506486|gb|EEC15980.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 461
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 130/329 (39%), Positives = 181/329 (55%), Gaps = 15/329 (4%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+ I+ADDLGW DV FHG QIPTPNID LA G+ L +YY +RSA+MTG +
Sbjct: 9 PHIVVIVADDLGWGDVSFHGSRQIPTPNIDTLAGDGLALHHYYVQSSGAATRSALMTGLY 68
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTG++H +PLS I+ ++ +V H+GF + +
Sbjct: 69 PIHTGIKHASHICISARLIPLSGGIVRCHIVLTIKYAFMVNIPHIGFLFEHISKGAHAVN 128
Query: 180 SHL---GYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
+ GYW+ F H+ + L + + E + ++ + I H
Sbjct: 129 ASWWTNGYWSKR---FIHA------FFLQLFMNAE-IFFFFNQWPCREMRSPKWKIAPLH 178
Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
+ PL LYLAH A H N Y PL+AP Y+ R+I+D +R FA ++ LD+SVG V
Sbjct: 179 NCSVPLLLYLAHQAVHVGNAYHPLEAPSSYVKKFRYIKDLRRRIFAGMVAALDQSVGDVF 238
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
AL+++ +L ++I++ SD+GGAA G SNWPLRG K TLWEGGVR GL+WS L
Sbjct: 239 RALQRKGILEDTIVLLTSDSGGAAGGVEQGVGSNWPLRGTKKTLWEGGVRAVGLLWSAKL 298
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
+RG VA Q +H S + P L +A + DI
Sbjct: 299 -ARGRVASQLIHRSAYFP-LTNADSCGDI 325
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 35/56 (62%)
Query: 561 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
T+ + D + IDG D+W+VL + PS R +L+NID W+++A+ G +KLV
Sbjct: 317 TNADSCGDITDLGSIDGKDLWTVLVDDRPSPRTELLYNIDYRWKLAAIRVGPYKLV 372
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 7/77 (9%)
Query: 496 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENS 555
T+ + D + IDG D+W+VL + PS R +L+NID W+++A+ G +KLV
Sbjct: 317 TNADSCGDITDLGSIDGKDLWTVLVDDRPSPRTELLYNIDYRWKLAAIRVGPYKLVL--- 373
Query: 556 INGNGTSENRSNDNSYQ 572
G S + D+ Y+
Sbjct: 374 ----GASSSDQFDDWYE 386
>gi|313213139|emb|CBY36997.1| unnamed protein product [Oikopleura dioica]
Length = 532
Score = 220 bits (560), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 132/346 (38%), Positives = 189/346 (54%), Gaps = 24/346 (6%)
Query: 46 LSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ 105
L V ++ H++ I+ADD GW DV FH I TPNID L G+ L NYYT
Sbjct: 9 LFWYLFSFVYATQAKHVLVIIADDFGWADVSFHNTGGIQTPNIDRLVGGGLELTNYYTQH 68
Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR-IVGKWHL 164
+CTP+RSA++TG++PIHTG+Q NV+ + GL E +LP+YL+ + R +VGKWH+
Sbjct: 69 ICTPTRSALLTGRYPIHTGLQTNVIAISQSSGLQRDEMLLPEYLESCDIKQRHMVGKWHV 128
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYF--DHSAEEMKMWGLDMRRDLE----PAWDLHG 218
G TP RGFE+ GY G +D++ + E+ G+D P G
Sbjct: 129 GHGHSWMTPWKRGFETFSGYLAGAEDHYTREWCMEQTDWCGVDYSEHSASLSGPTNSSWG 188
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
KYS D++ + +I+ + + F+Y A H PLQA + +L + I+D R
Sbjct: 189 KYSGDLYLQKMSEIVKSIDPTKDSFIYFAPQHVHY-----PLQALERHLVKYSWIKDTNR 243
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
+AA++ LDE VG VV+ + R+L N+I FV+DNGG + +NWPL G K
Sbjct: 244 RNYAAMVASLDEIVGDVVDLYHKNRLLKNTIGFFVADNGGETR----DGGNNWPLSGQKW 299
Query: 339 TLWEGGVRGAGLIWS---PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
TL EGGV+ G ++ PL + G+ HVSD LPT++ A N
Sbjct: 300 TLLEGGVKATGFMFGGLVPLGKYNGL-----FHVSDILPTVMEAFN 340
>gi|313234414|emb|CBY24613.1| unnamed protein product [Oikopleura dioica]
Length = 532
Score = 220 bits (560), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 131/346 (37%), Positives = 189/346 (54%), Gaps = 24/346 (6%)
Query: 46 LSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ 105
L V ++ H++ I+ADD GW DV FH I TPNID L G+ L NYYT
Sbjct: 9 LFWYLFSFVYATQAKHVLVIIADDFGWADVSFHNTGGIQTPNIDRLVGGGLELTNYYTQH 68
Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR-IVGKWHL 164
+CTP+RSA++TG++PIHTG+Q NV+ + GL E +LP+YL+ + R +VGKWH+
Sbjct: 69 ICTPTRSALLTGRYPIHTGLQTNVIAISQSSGLQRDEMLLPEYLESCDIKQRHMVGKWHV 128
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYF--DHSAEEMKMWGLDMRRDLE----PAWDLHG 218
G TP RGFE+ GY G +D++ + E+ G+D P G
Sbjct: 129 GHGHSWMTPWKRGFETFSGYLAGAEDHYTREWCMEQTDWCGVDYSEHSASLSGPTNSSWG 188
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
KYS D++ + +I+ + + F+Y A H PLQA + +L + I+D R
Sbjct: 189 KYSGDLYLQKMSEIVKSIDPTKDSFIYFAPQHVHY-----PLQALERHLVKYSWIKDTNR 243
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
+AA++ LDE VG V++ + R+L N+I FV+DNGG + +NWPL G K
Sbjct: 244 RNYAAMVASLDEIVGDVIDLYHKNRLLKNTIGFFVADNGGETR----DGGNNWPLSGQKW 299
Query: 339 TLWEGGVRGAGLIWS---PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
TL EGGV+ G ++ PL + G+ HVSD LPT++ A N
Sbjct: 300 TLLEGGVKATGFMFGGLVPLGKYNGL-----FHVSDILPTVMEAFN 340
>gi|432963754|ref|XP_004086820.1| PREDICTED: uncharacterized protein LOC101172478 [Oryzias latipes]
Length = 873
Score = 219 bits (559), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 116/269 (43%), Positives = 160/269 (59%), Gaps = 12/269 (4%)
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
++ + IHTG+QH+V+ + LPL LPQ L++ GY T +VGKWHLGFYK+
Sbjct: 415 SVFHSGYQIHTGLQHSVIRATQPNCLPLENVTLPQKLRDAGYSTHMVGKWHLGFYKRGCL 474
Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMK-MWGLDMRRDLEPAWDL-HGKYSTDVFTAEAV 230
PT RGF+S G G DY+ H E+ M G D+ E AW+ G YST +FT +AV
Sbjct: 475 PTQRGFDSFFGSLLGSGDYYSHYKCEVPGMCGYDLYEGEEAAWEQDRGLYSTVMFTRKAV 534
Query: 231 DII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
I+ H++ PLFLYLA+ A HS PLQ P YL ++ I + R K+AA++ LD
Sbjct: 535 SILAHHNPRAHPLFLYLAYQAVHS-----PLQVPARYLERYKGISNLSRRKYAAMVSCLD 589
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
E++ + AL++ N++IV+ SDNGG L SNWPLRG K T WEGG+R G
Sbjct: 590 EAIHNLTLALKRYGYYDNTVIVYSSDNGGQP----LAGGSNWPLRGSKATYWEGGIRAVG 645
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
+ SPLL +G +H++DW PTL++
Sbjct: 646 FVHSPLLMKKGTKCRSLIHITDWFPTLVT 674
>gi|443703066|gb|ELU00815.1| hypothetical protein CAPTEDRAFT_95989 [Capitella teleta]
Length = 382
Score = 219 bits (559), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 122/328 (37%), Positives = 184/328 (56%), Gaps = 16/328 (4%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PP+II+I DD G+NDVGF + I +PN+DALA SGIIL N YT C+PSR + ++G+
Sbjct: 20 PPNIIWIFGDDYGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFLSGR 78
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ + MQH V+ + L L ILP YLKELGY T GKWHLG+ + E TPT RGF
Sbjct: 79 YSYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGF 138
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+S G ++G ++++H+ + D E +D GK+S D+ + +
Sbjct: 139 DSFSGGFSGEGEFYEHTTATGGYY--DWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEYDQ 196
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE---DFKRSKFAAILHKLDESVGKV 295
PLF+Y+A HS PL +L ++ ++ D+ F+ ++ +D +G +
Sbjct: 197 SSPLFMYVAFHNVHS-----PLDPKPEFLALYDDLDASTDYSGFFFSGLVSGMDYIIGGI 251
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
V+ +++ + N+ I+F SDNGG ++ N P RG K+TLWEGG + SPL
Sbjct: 252 VDKIKEIGIYENTYILFSSDNGG-----DVGEGDNSPHRGGKSTLWEGGCKANSWTHSPL 306
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAANKS 383
L + G+ +HV+DWLPT++ A S
Sbjct: 307 LGASGVENNGLMHVTDWLPTIVDLAGGS 334
>gi|291243527|ref|XP_002741646.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 506
Score = 219 bits (559), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 134/366 (36%), Positives = 191/366 (52%), Gaps = 33/366 (9%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
L + +++V V +V S PHI+ ILA+DLGWNDVG+H D + P ++ LA G+I
Sbjct: 3 LACSILVALVCVSVVKSDKQPHIVLILAEDLGWNDVGWHNPD-LKMPILNQLAADGVIFN 61
Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
Y CTPSRSA+ TG +P QH +L E GL L K LP+ LK++GY T +V
Sbjct: 62 QSYVQPACTPSRSALFTGYYPFKIKRQHQMLLNLEADGLSLDLKTLPEMLKDVGYLTHLV 121
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
GKWHLGF K+EY P RGF+S G+ T D + + E + G D R + +
Sbjct: 122 GKWHLGFCKEEYLPNKRGFDSFYGWLTLGTDLY--TKENVLAPGYDFRDNTGVVQE-SDT 178
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
Y + AVDI+ H + PLFL + A P + L+ P Y +++ I+D +
Sbjct: 179 YLPFMLAERAVDIVMGHYKEYPLFLEFSMAL-----PGKFLEVPQDYEDLYSDIDDDRTR 233
Query: 280 KFAAI-----------------LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
KF I L +D S+G +VEAL+ R M +++ +F SDNG
Sbjct: 234 KFYGICNLNIVVSKNTKLPNRKLSAMDNSIGLLVEALKTRGMWDDTLFIFTSDNG----A 289
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL---SA 379
F + SNWPLRG TL+EGG R + +L+ G V + +H+ D T++ A
Sbjct: 290 FATESGSNWPLRGNIATLFEGGTRVPTFFYGSMLKKAGYVNNELIHIVDLHKTIIELAGA 349
Query: 380 ANKSDI 385
+SDI
Sbjct: 350 TTESDI 355
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 65/123 (52%), Gaps = 16/123 (13%)
Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
+++IDG+D+W +S+ EPS R ++NIDD + L+ G ++V Y++ +TG
Sbjct: 352 ESDIDGMDMWKTISKGEPSPRTEFVYNIDD----NELSPGA-----AIRVGDYKL-ITGH 401
Query: 632 PDQVYLSGLSDR--EWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNN 689
PD +Y L +R +W + + I P P P LF+I +DP E+NN
Sbjct: 402 PDLMYPFRLLNRSDDWYNYGDEPI---SGILLPPTGNEP-PPANVTYLFNIIDDPEERNN 457
Query: 690 LAD 692
LAD
Sbjct: 458 LAD 460
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 54/102 (52%), Gaps = 8/102 (7%)
Query: 507 QNEIDGIDVWSVLSRNEPSKRNTILHNIDD-EWQISALTR-GKWKLVKEN-SINGNGTSE 563
+++IDG+D+W +S+ EPS R ++NIDD E A R G +KL+ + +
Sbjct: 352 ESDIDGMDMWKTISKGEPSPRTEFVYNIDDNELSPGAAIRVGDYKLITGHPDLMYPFRLL 411
Query: 564 NRSND--NSYQNEIDGIDVWSVLSRNEPSKRN-TILHNIDDE 602
NRS+D N I GI + + NEP N T L NI D+
Sbjct: 412 NRSDDWYNYGDEPISGILLPP--TGNEPPPANVTYLFNIIDD 451
>gi|283778949|ref|YP_003369704.1| sulfatase [Pirellula staleyi DSM 6068]
gi|283437402|gb|ADB15844.1| sulfatase [Pirellula staleyi DSM 6068]
Length = 486
Score = 219 bits (559), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 136/358 (37%), Positives = 195/358 (54%), Gaps = 30/358 (8%)
Query: 38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
AVL +AFT V A+ P+I+ I+ADDLGW DVGF+G +I TPNIDALA G
Sbjct: 11 AVLLVAFTA----VASQAADRQPNIVHIVADDLGWKDVGFNGCTEIKTPNIDALAKGGAK 66
Query: 98 LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
+Y +CTP+R+ +MTG+ P G+Q V+ GL SE ++PQ L + GY+T
Sbjct: 67 FSQFYVQNMCTPTRACLMTGRFPYRYGLQTIVIPTAAGYGLDTSEYLMPQCLGDAGYKTA 126
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH 217
I+GKWHLG ++Y P RGF+ G G DYF H + LD RD +P +H
Sbjct: 127 IIGKWHLGHADQKYWPKQRGFDYQYGAMIGELDYFTHDEHGV----LDWFRDNKP---VH 179
Query: 218 GK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
+ Y+T + +AV IH +P +LYL A H+ P QAP Y+ + +I +
Sbjct: 180 EQGYTTTLIGDDAVKYIHGQDGKKPFYLYLTFNAPHT-----PYQAPKEYITKYLNIAEP 234
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA----AGFNLNAA---- 328
R +AA++ LDE++GKVV AL+Q+ + N++I F SDNGG AG + +
Sbjct: 235 TRRTYAAMVDCLDENIGKVVAALDQKGLRENTLIFFHSDNGGTKDKMFAGQMADMSKVVL 294
Query: 329 --SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
N P R K +L+EGG R L W ++++ + + +H D PT + A S
Sbjct: 295 PCDNGPYRNGKGSLFEGGSRVCALANWPGKIKAQTV--DGMIHAVDLYPTFAALAGAS 350
>gi|291239589|ref|XP_002739705.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 489
Score = 219 bits (558), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 127/335 (37%), Positives = 184/335 (54%), Gaps = 13/335 (3%)
Query: 46 LSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ 105
+++V V +V S PHI+ ILA+DLGWNDVG+H D I P ++ LA G+I Y
Sbjct: 9 VALVCVIVVESGKQPHIVLILAEDLGWNDVGWHNAD-IKMPILNQLAADGVIFNQSYVQP 67
Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
CTPSR+A++TG +P QH +L E GL L K LP+ LK++GY T +VGKWHLG
Sbjct: 68 ACTPSRAALLTGYYPFKIQRQHQMLLNLEADGLSLDLKTLPEMLKDVGYLTHLVGKWHLG 127
Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
F K+EY P RGF+S G+ T + S E + G D R + + Y +
Sbjct: 128 FCKEEYLPNKRGFDSFYGWLTLGTTLY--SKENIIAPGYDFRDNTGVVQE-SDTYLPFML 184
Query: 226 TAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
AVDI+ H + PLFL + A + L+ P Y +++ IED ++ KF L
Sbjct: 185 AERAVDIVMGHYKEYPLFLEFSMALSGKF-----LEVPQEYEDLYSDIEDDRQRKFYGKL 239
Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
+D S+G +V+AL+ R M +++ +F SDNG A+ + SNWPLRG TL+EG
Sbjct: 240 SVMDNSIGLLVDALKTRGMWEDTLFIFTSDNGALAS----ESGSNWPLRGNIATLFEGAT 295
Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
R +L++ G V + +H+ D T++ A
Sbjct: 296 RVPTFFHGSMLKNTGYVNNELMHIVDLHKTIIELA 330
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 64/123 (52%), Gaps = 16/123 (13%)
Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
+ +IDG+D+W +S+ +PS R ++NIDD + L+ G ++V Y++ +TG
Sbjct: 335 ECDIDGMDLWKTISKGKPSPRTEFVYNIDD----NELSPGA-----AIRVGDYKL-ITGH 384
Query: 632 PDQVYLSGLSDR--EWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNN 689
PD +Y + +R +W + + I P P P LF+I +DP E+NN
Sbjct: 385 PDLLYPFRMLNRSDDWYNYGDEPI---SGIPIPPTGNEP-PPSNVTYLFNIIDDPEERNN 440
Query: 690 LAD 692
LA+
Sbjct: 441 LAE 443
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 53/102 (51%), Gaps = 8/102 (7%)
Query: 507 QNEIDGIDVWSVLSRNEPSKRNTILHNIDD-EWQISALTR-GKWKLVKEN-SINGNGTSE 563
+ +IDG+D+W +S+ +PS R ++NIDD E A R G +KL+ + +
Sbjct: 335 ECDIDGMDLWKTISKGKPSPRTEFVYNIDDNELSPGAAIRVGDYKLITGHPDLLYPFRML 394
Query: 564 NRSND--NSYQNEIDGIDVWSVLSRNEPSKRN-TILHNIDDE 602
NRS+D N I GI + + NEP N T L NI D+
Sbjct: 395 NRSDDWYNYGDEPISGIPIPP--TGNEPPPSNVTYLFNIIDD 434
>gi|390356461|ref|XP_793612.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 181
Score = 219 bits (558), Expect = 6e-54, Method: Composition-based stats.
Identities = 94/159 (59%), Positives = 122/159 (76%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L F L ++ V ++ PP++IF++ADDLGWNDV FHG QIPTP+IDALA G+IL NY
Sbjct: 11 LTFFLLGCMLECVYTAQPPNVIFVVADDLGWNDVSFHGSSQIPTPHIDALAQEGVILTNY 70
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y +CTP+RSAIMTGKHPIHTG+Q++V+ E GL +E I+PQYL+ LGYRT +VGK
Sbjct: 71 YVSPICTPTRSAIMTGKHPIHTGLQYSVIIADEPYGLGTNETIMPQYLRSLGYRTHMVGK 130
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
WHLGF+K+ TP+ RGFES+ GY+ G QDYF H + E++
Sbjct: 131 WHLGFFKESLTPSHRGFESYYGYYGGMQDYFTHESTEVR 169
>gi|410932865|ref|XP_003979813.1| PREDICTED: arylsulfatase I-like, partial [Takifugu rubripes]
Length = 316
Score = 218 bits (556), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 113/263 (42%), Positives = 158/263 (60%), Gaps = 11/263 (4%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LP + LPQ L+ELGY T +VGKWHLGFYKKE PT RGF++
Sbjct: 2 IHTGLQHSIIRPRQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDT 61
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS-T 238
+ G TG +Y+ + S + M G D+ AW GKYST ++T I+ H
Sbjct: 62 YFGSLTGSVNYYTYDSCDGPGMCGFDLHEGESVAWSQKGKYSTHLYTQRVRKILATHDPR 121
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
+PLF++L+ A H+ PLQ P Y+ +R +E+ R K+AA++ +DE+V +
Sbjct: 122 SQPLFIFLSFQAVHT-----PLQCPREYIYPYRGLENIARRKYAAMVSAVDEAVRNITYG 176
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L + NSI++F +DNGG L+ SNWPLRG K T WEGGVRG G I SPLL
Sbjct: 177 LRKYGYYENSIMIFSTDNGGQP----LSGGSNWPLRGRKGTYWEGGVRGLGFIHSPLLRK 232
Query: 359 RGIVAEQYVHVSDWLPTLLSAAN 381
+ V++ VH++DW PTL+ A
Sbjct: 233 KKRVSKALVHITDWYPTLVGLAG 255
>gi|443705042|gb|ELU01787.1| hypothetical protein CAPTEDRAFT_153777 [Capitella teleta]
Length = 551
Score = 218 bits (556), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 122/325 (37%), Positives = 182/325 (56%), Gaps = 14/325 (4%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II+I DD G+NDVGF + I +PN+DALA SGIIL N YT C+PSR + M+G++
Sbjct: 26 PNIIWIFGDDYGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFMSGRY 84
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
+ MQH V+ + L L ILP YLKELGY T GKWHLG+ + E TPT RGF+
Sbjct: 85 SYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFD 144
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
S G ++G ++++H+ + D E +D GK+S D+ + +
Sbjct: 145 SFSGGFSGEGEFYEHTTATGGYY--DWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEYDQS 202
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE-DFKRSKFAAILHKLDESVGKVVEA 298
PLF+Y+A HS PL +L ++ ++ R K+ ++ +D +G +V+
Sbjct: 203 SPLFMYVAFHNVHS-----PLDPKPEFLALYDDLDVTDDRKKYLGLVSGMDYIIGGIVDK 257
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
+++ + N+ I+F SDNGG ++ N P RG K+TLWEGG + SPLL +
Sbjct: 258 IKEIGIYENTYILFSSDNGG-----DVGEGDNSPHRGGKSTLWEGGCKANSWTHSPLLGA 312
Query: 359 RGIVAEQYVHVSDWLPTLLSAANKS 383
G+ +HV+DWLPT++ A S
Sbjct: 313 TGVENNGLMHVTDWLPTIVDLAGGS 337
>gi|291225021|ref|XP_002732506.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 497
Score = 218 bits (554), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 133/343 (38%), Positives = 182/343 (53%), Gaps = 19/343 (5%)
Query: 46 LSMVFVDLVASSGP-------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
L ++F+ L S P PHII I DDLGWNDV +H D I PN+ LA G+I
Sbjct: 6 LIVIFIALAVVSKPCIADAHQPHIILIFVDDLGWNDVHWHNPD-IAMPNLMDLAADGVIF 64
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
YT CTPSR+A+MTG +P TG QH +L+ GLPL K+LP+ LKE+GY T +
Sbjct: 65 NQSYTHPTCTPSRAAMMTGLYPFKTGNQHQMLFNLHPSGLPLEFKLLPEKLKEIGYSTHM 124
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWT-GHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH 217
VGKWHLGF K EY PT RGF+SH G WT G DY + G D R + +
Sbjct: 125 VGKWHLGFCKDEYLPTNRGFDSHYGIWTLGVGDYDKMNGVLSPSKGYDFRDNTGVVQKSN 184
Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
G Y + A I++ H PLFL A P + L P+ Y N + IED +
Sbjct: 185 G-YLALMLGDRAEHIVNTHYPGTPLFL-----AFTLDIPAKHLAIPEEYENKYADIEDSR 238
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
F L +D+ +G VV+AL+ R + +S+I+F+ DNG ++ A SN+P RG+
Sbjct: 239 TRHFYGKLTMMDDIIGGVVDALKNRGIYDDSLIIFIGDNGALSS----QAGSNYPFRGIA 294
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
TL+EG R ++ ++ G + + + D T+L A
Sbjct: 295 GTLFEGATRVPSIVSGKGIKKTGYESNELYSIVDIHRTILDVA 337
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 37/133 (27%), Positives = 61/133 (45%), Gaps = 31/133 (23%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTKGKWKLVKVVKVMRYQVDLTGG 631
E+DG+ +W S+ +PS RN I++NIDD+ +A+ G +KL+ TG
Sbjct: 344 ELDGLSMWPTWSKGKPSPRNEIIYNIDDDPVSPGAAIRIGDYKLI------------TGH 391
Query: 632 PDQVYLS---GLSDREW-----LALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKND 683
PD +Y G++D + LA + AA + P LF++ D
Sbjct: 392 PDLLYPYRDIGITDNYYNYGDPLASGLPTADQAA---------MAPPPLDVEYLFNVIVD 442
Query: 684 PCEKNNLADRSED 696
P E+N+++ D
Sbjct: 443 PEERNDISADHPD 455
>gi|410642189|ref|ZP_11352707.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
gi|410648635|ref|ZP_11359039.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
gi|410131832|dbj|GAC07438.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
gi|410138506|dbj|GAC10894.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
Length = 473
Score = 218 bits (554), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 132/338 (39%), Positives = 203/338 (60%), Gaps = 32/338 (9%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
++G P+I+FIL DDLG+ DVG+ G QI TPNID LA G+ LK+ Y +C+P+R+A+M
Sbjct: 39 ATGAPNILFILLDDLGYGDVGYLG-SQIQTPNIDNLASQGVTLKHGYAYPICSPTRAALM 97
Query: 116 TGKHPIHTG----MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
TGK+P++ G M+++ + LP +P+ +E GY+T +VGKWHLG K+
Sbjct: 98 TGKNPLNFGIDGPMENDAM-------LPEDLTTMPERFQEAGYQTWMVGKWHLGMAKRSA 150
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
P RGF+ G+ G DY+ H + GLD + + + + G + T++ TA+A+D
Sbjct: 151 MPHNRGFDDFYGFLGGFVDYYTH----VYFGGLDWQNN-DTSLREEG-FVTELLTAKAID 204
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I + D+P F+YL+++A H+ PLQ + +N + E R FA + +D+
Sbjct: 205 KITHFKGDKPFFMYLSYSAPHT-----PLQYVPNAINDYSGFESKDRQVFADMTSDVDQD 259
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA-ASNWPLRGVKNTLWEGGVRGAGL 350
+G+V+ AL++R + N+I+VF+SDNGG NL A A+N L+G K++++EGGVR L
Sbjct: 260 IGEVLAALDKRGIAENTIVVFMSDNGG-----NLEAGANNGSLQGGKSSVYEGGVRVPVL 314
Query: 351 I-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
I W L+S+ + Q + DW PTLL AAN PN
Sbjct: 315 IRWPAQLQSKTMT--QPMFTQDWAPTLLEAANIDYQPN 350
>gi|443691100|gb|ELT93060.1| hypothetical protein CAPTEDRAFT_21969 [Capitella teleta]
Length = 529
Score = 218 bits (554), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 121/325 (37%), Positives = 182/325 (56%), Gaps = 14/325 (4%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II+I DD G+NDVGF + I +PN+DALA SGIIL N YT C+PSR + ++G++
Sbjct: 4 PNIIWIFGDDYGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFLSGRY 62
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
+ MQH V+ + L L ILP YLKELGY T GKWHLG+ + E TPT RGF+
Sbjct: 63 SYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFD 122
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
S G ++G ++++H+ + D E +D GK+S D+ + +
Sbjct: 123 SFSGGFSGEGEFYEHTTATGGYY--DWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEYDQS 180
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE-DFKRSKFAAILHKLDESVGKVVEA 298
PLF+Y+A HS PL +L ++ ++ R K+ ++ +D +G +V+
Sbjct: 181 SPLFMYVAFHNVHS-----PLDPKPEFLALYDDLDVTDDRKKYLGLVSGMDYIIGGIVDK 235
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
+++ + N+ I+F SDNGG ++ N P RG K+TLWEGG + SPLL +
Sbjct: 236 IKEIGIYENTYILFSSDNGG-----DVGEGDNSPHRGGKSTLWEGGCKANSWTHSPLLGA 290
Query: 359 RGIVAEQYVHVSDWLPTLLSAANKS 383
G+ +HV+DWLPT++ A S
Sbjct: 291 TGVENNGLMHVTDWLPTIVDLAGGS 315
>gi|402821074|ref|ZP_10870630.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
IMCC14465]
gi|402510105|gb|EJW20378.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
IMCC14465]
Length = 526
Score = 217 bits (553), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 132/359 (36%), Positives = 195/359 (54%), Gaps = 32/359 (8%)
Query: 40 LPLAFTLSMVF--VDLVASSGP-----------PHIIFILADDLGWNDVGFHGLDQIPTP 86
L ++ +L MVF L A+ P+I+ ILADDLGW DVG+HG D I TP
Sbjct: 18 LSVSLSLLMVFWIAGLTAAQASEMASNNKSDERPNIVIILADDLGWGDVGYHGSD-IQTP 76
Query: 87 NIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILP 146
+ID LA G L +Y C+P+R+A+MTG+ P+ G+ ++VL E GG+ L E LP
Sbjct: 77 HIDRLAKEGAKLNRFYATPFCSPTRAALMTGRDPLKLGVAYSVLMPWENGGVSLDEHFLP 136
Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDM 206
Q + GY T +VGKWHLG +++TP RGF+ G+ YFDH G D
Sbjct: 137 QSFQAAGYNTAMVGKWHLGHTIEQHTPNARGFDLFYGHMHTQVSYFDHQIAN----GHDF 192
Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
+ + +P D +G+Y+TDV A+A I + +P LY+ A HS P++AP+
Sbjct: 193 QENGKPV-DHNGEYATDVHGAQAARFITDLRDKTKPFLLYVPFLAPHS-----PMEAPEA 246
Query: 266 YLNIHRHIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
+ + D R +AA++ +D+++GKV+ AL++ + N+I+ F SDNG G
Sbjct: 247 LKDKYSSRLDLPGSTRKTYAAMVDSMDQAIGKVLTALDEEGVADNTIVFFFSDNG----G 302
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
F + N P RG K ++EGG+R ++ P + G ++ V V D LPTL AA
Sbjct: 303 FENYGSDNGPYRGGKLEVYEGGIRVTAVMRWPEKLAAGSEVDEIVSVMDLLPTLTHAAG 361
>gi|313232487|emb|CBY24155.1| unnamed protein product [Oikopleura dioica]
Length = 481
Score = 217 bits (552), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 126/328 (38%), Positives = 184/328 (56%), Gaps = 22/328 (6%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
S PHIIF++ DDLG++D+G+ D + +PNID LA + + ++NYY CTPSR+A MT
Sbjct: 21 SKKPHIIFVMIDDLGFDDLGYVN-DDVISPNIDFLAKNALHIENYYNQPSCTPSRAAFMT 79
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
G++ I GMQ V+ E +PLSE +LPQ K+ GY T + GKWHLGFY +++ P R
Sbjct: 80 GRYNIRYGMQSGVIKPDEPEAIPLSETLLPQAFKKCGYNTSMHGKWHLGFYTEKHCPQNR 139
Query: 177 GFESHLGYWTGHQDYFDH-SAEEMKMW---GLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
GF+ G++ G QDYF H S ++ G + R DL+G YST + +
Sbjct: 140 GFDRFFGFYLGSQDYFYHDSGNNCYLYEPNGTEKVR-----LDLNGTYSTKAIAEDFIAK 194
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQ-APDHYLNIHRHIEDFKRSKFAAILHKLDES 291
+ + + PLF +L+ H PLQ P L + E ++R + + +D
Sbjct: 195 LDEYDPETPLFEFLSFQEVHG-----PLQWLPAEVLKMAGGWEGWERKMLSRKIVIVDHF 249
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+G+VV+ L+++ ++I+V SDNGG ASNWPLRG K ++EGG+R I
Sbjct: 250 IGEVVKKLKEKGFWKDTILVITSDNGGQTR----EGASNWPLRGRKGDVFEGGIRSRAFI 305
Query: 352 WSPLLES--RGIVAEQYVHVSDWLPTLL 377
SP L +G + HV+DW PTLL
Sbjct: 306 HSPKLPDSLKGSSFQHVFHVTDWFPTLL 333
>gi|300433302|gb|ADK13094.1| arylsulfatase [Dicathais orbita]
Length = 571
Score = 217 bits (552), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 126/339 (37%), Positives = 190/339 (56%), Gaps = 33/339 (9%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PP+ ++I+ADDLG+NDV +H Q+ TPN+ +A +G+IL Y+ CTPSR++ MTG
Sbjct: 31 PPNFLYIMADDLGYNDVSWHN-PQVLTPNLGKMAKNGVILTESYSQAACTPSRASYMTGY 89
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+P G+Q++V+ +PL LP+ LKE GY + +VGKWHLG +++ TP RGF
Sbjct: 90 YPFRIGVQNSVVREGMEDYVPLDVDFLPKRLKEAGYVSHLVGKWHLGHCRRDVTPPGRGF 149
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP-----AWDLHGK----------YSTD 223
++ LG G+ DY+ K+ + D +P +D Y+TD
Sbjct: 150 DTFLGLLNGYNDYY-----TKKIRAIASHEDFDPNAPGTIYDFFSNYTLQPSPETDYTTD 204
Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA-PDHYLNIHRHIEDFKRSKFA 282
+FT A+++I S D P FL A H P+ PLQ P + ++D R +
Sbjct: 205 IFTNRAIELIQ-QSKDTPFFL-----ALHYTAPHWPLQKHPGFSDKDYPDVKDLNRRIYL 258
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
+ + +D +GKVV+AL+Q +L N++IVF SDNGG +A+N PLR K TL+E
Sbjct: 259 SSITAMDTGIGKVVDALKQHNLLDNTLIVFQSDNGGDTHF----SANNGPLRDRKTTLYE 314
Query: 343 GGVRGAGLIWSP-LLESRGIVAEQYVHVSDWLPTLLSAA 380
GGV+ + + P LL + ++ H+SDW T+LSAA
Sbjct: 315 GGVKVPSMAYGPGLLTNTPRTSDDLFHISDWFTTILSAA 353
>gi|221119831|ref|XP_002168522.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
Length = 223
Score = 217 bits (552), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 105/223 (47%), Positives = 148/223 (66%), Gaps = 5/223 (2%)
Query: 125 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY 184
MQ + ++ G+ L EK LPQYLK +GY+T +GKWHLGF+ KEYTPT+RGF+S GY
Sbjct: 1 MQSDTIFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTPTYRGFDSFYGY 60
Query: 185 WTGHQDYFDHSAEEMKMWGLDMRRDLEPA----WDLHGKYSTDVFTAEAVDIIHNHSTDE 240
+ G DY+DHS WGLD+ D + ++ G YST +++ EA+D I NH++ +
Sbjct: 61 YGGQADYWDHSLASNGWWGLDLHYDTPSSSKNIFNQWGNYSTAMYSMEAIDRIRNHNSTQ 120
Query: 241 PLFLYLAHAATHSANPYE-PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
P+FLYLA+ A HSAN E PLQAP +++ HI+ R +AA+L +D +G++ L
Sbjct: 121 PMFLYLAYQAVHSANLREYPLQAPQEWVDKFSHIKHKGRQNYAAMLGVMDHWIGQITSEL 180
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
++ ML +SII+F SDNGG A G N N A+N+PLRG K T++E
Sbjct: 181 LKQNMLDDSIIIFTSDNGGPANGLNNNWATNYPLRGTKTTVYE 223
>gi|406832341|ref|ZP_11091935.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 490
Score = 216 bits (550), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 138/377 (36%), Positives = 195/377 (51%), Gaps = 40/377 (10%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
M F L ++ + +++ P+I+ I+ADDLG D+G+ G +I TP+IDALA
Sbjct: 1 MTFIKWFLPIFVTTLSFGAASAAERPNIVVIVADDLGNADLGYRG-SKIKTPHIDALAKG 59
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
G+ L++YY + LCTP+R+A+MTG++P+ G+Q V++ R GLP EK LPQ LKE+GY
Sbjct: 60 GVRLESYYGLPLCTPARAALMTGRYPMRQGLQTLVIFPSHRYGLPTDEKTLPQALKEVGY 119
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW---GLDMRRDLE 211
T +VGKWHLG K++ P RGF+ G G DYF + W G +R D
Sbjct: 120 HTAMVGKWHLGHADKKFWPQNRGFDHFYGNVVGEVDYFTRERGGVVDWQRNGEFLRED-- 177
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
Y D+ EAV +I H +PLFLY A A H+ P QAP ++ +
Sbjct: 178 -------GYYVDLIGTEAVKLIAGHDKAKPLFLYFASLAPHA-----PYQAPKADIDAYN 225
Query: 272 HIEDFKRSK-FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA-- 328
I D + + +A ++ LD VG+VV ALE M N+II F +DNGGA + A
Sbjct: 226 DIFDNEMHRTYAGMISNLDRQVGRVVAALETNGMRENTIIFFTTDNGGATSALFATGARA 285
Query: 329 ------------------SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
SN P G K +L +GGV ++ P +V E HV
Sbjct: 286 PEEREASGGVALGSKPPCSNLPFSGGKGSLKDGGVHLPAIVNWPGRLKPSVVNEPLHHV- 344
Query: 371 DWLPTLLSAANKSDIPN 387
D +PTLL+ A P+
Sbjct: 345 DLMPTLLALAGGKGDPS 361
>gi|313219878|emb|CBY30794.1| unnamed protein product [Oikopleura dioica]
Length = 481
Score = 216 bits (549), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 128/343 (37%), Positives = 187/343 (54%), Gaps = 19/343 (5%)
Query: 42 LAFTLSMVFVDLVAS---SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
L L VF + S PHIIF++ DDLG++D+G+ D + +PNID LA + + +
Sbjct: 3 LIIFLLFVFRECFGHGRFSKKPHIIFVMIDDLGFDDLGYVN-DDVISPNIDFLAKNALHI 61
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
+NYY CTPSR+A MTG++ I GMQ V+ E +PLSE +LPQ LK+ GY T +
Sbjct: 62 ENYYNQPSCTPSRAAFMTGRYNIRYGMQSGVIKPDEPEAIPLSETLLPQALKKCGYNTSM 121
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLH 217
GKWHLGFY +++ P RGF+ G++ G QDYF H S ++ R + DL+
Sbjct: 122 HGKWHLGFYTEKHCPQNRGFDRFFGFYLGSQDYFYHDSGNNCYLYEPCYREKVR--LDLN 179
Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE-DF 276
G YST + + + + + PLF +L+ H PLQ Y + + +
Sbjct: 180 GTYSTKAIAEDFIAKLDEYDPETPLFEFLSFQEVHG-----PLQWLSEYSTLFKEKDWTH 234
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
R + + +D +G+VV+ L+++ ++I+V SDNGG ASNWPLRG
Sbjct: 235 YRKMLSRKIVIVDHFIGEVVKKLKEKGFWKDTILVITSDNGGQTR----EGASNWPLRGR 290
Query: 337 KNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLL 377
K ++EGG+R I SP L + +G HV+DW PTLL
Sbjct: 291 KGDVFEGGIRSRAFIHSPKLPNSLKGSSFPHVFHVTDWFPTLL 333
>gi|326431091|gb|EGD76661.1| hypothetical protein PTSG_08011 [Salpingoeca sp. ATCC 50818]
Length = 511
Score = 214 bits (546), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 126/343 (36%), Positives = 185/343 (53%), Gaps = 38/343 (11%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+F++ DD GWND GF G ++ TP ID L GI L +Y ++C+P+R+A+MTG++
Sbjct: 14 PHILFVVVDDHGWNDCGFAGT-RVKTPTIDTLRSEGIALNQHYVQKVCSPTRAALMTGRY 72
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P G+Q G L +E +LPQY+K GY TR VGKWHLGF + ++TPTFRGF+
Sbjct: 73 PHRYGLQFPFCGGAAMA-LNSNETLLPQYMKSAGYTTRAVGKWHLGFTEWQFTPTFRGFD 131
Query: 180 SHLGYWTGHQDYFDHS-----AEEMKMWGLDMRRDLEP---------AWDLHGK----YS 221
S G+++ +DYF H + + LD D P A++ G YS
Sbjct: 132 SFYGFYSCAEDYFFHGLGFKNSSGAHVKSLDFHDDARPSCGADCSKAAFEAVGTDWQHYS 191
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDFKRSK 280
T +F VDI+ H +PLFLY A TH+ P + P Y + + + D R K
Sbjct: 192 TTLFAGRIVDIVDGHDPSQPLFLYFASQDTHN-----PGEVPQCYKDRYNTTVPDQVRRK 246
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG---AAAGFNLN------AASNW 331
AA L +D ++ + EA ++ M N +IV+ +DNGG +G N +SN+
Sbjct: 247 LAAKLTTVDSALANITEAYKRNGMYDNLLIVYTADNGGPITVQSGPNKGKQEDSIGSSNY 306
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY---VHVSD 371
PLRG K+ +EGGVR + P L++ + +Y +H+ D
Sbjct: 307 PLRGGKHNAYEGGVRSTAFLSGPALKTLARLVRKYQYKLHIGD 349
>gi|431796835|ref|YP_007223739.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
gi|430787600|gb|AGA77729.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
Length = 470
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 135/362 (37%), Positives = 203/362 (56%), Gaps = 24/362 (6%)
Query: 30 YRTRIMAFAVLPLAFTLSMVFVDLVASSGP---PHIIFILADDLGWNDVGFHGLDQIPTP 86
Y+ F L F+LS + ++SG P+ + I+ADDLG+ D+ F G Q+ TP
Sbjct: 2 YKIGCFLFLQSLLIFSLSAQIISTPSTSGRNSHPNFVLIVADDLGYGDLSFTGSTQVKTP 61
Query: 87 NIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERG------GLP 139
+ID LA SG+ + Y + +C+PSR+ ++TG++ + G +N L + G GLP
Sbjct: 62 HIDELAASGVFFPEGYVSSAVCSPSRAGLLTGRNQVSFGYDNN-LANSQPGFDPAFLGLP 120
Query: 140 LSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM 199
++ K + +LK+LGY T +VGKWHLG Y+ +++P RGF+ GY G DYF+ S E
Sbjct: 121 VNVKTVGDHLKKLGYVTGLVGKWHLG-YEDQFSPLNRGFDEFWGYLGGGHDYFEAS-EAK 178
Query: 200 KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
+ + ++ + + ++ Y TD E ++ I H DEP FLY + A H+ P
Sbjct: 179 RGYKAKIKCNYKTPQEI--TYITDDKGDECINFIQRHK-DEPFFLYASFNAPHT-----P 230
Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
+QA L I++HIED KR +AA++H+LD +VG++VEA+E+ +L N+I VF+SDNGG
Sbjct: 231 MQATAEDLAIYQHIEDRKRRTYAAMVHRLDINVGRIVEAVEEADLLENTIFVFISDNGGP 290
Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
N + N P G K TL EGG+R I P G V E V D PT ++
Sbjct: 291 VM---TNGSINAPYNGKKGTLLEGGIRVPFFISWPGHLEPGKVFEHPVSSLDLTPTFVAL 347
Query: 380 AN 381
A
Sbjct: 348 AG 349
>gi|291225029|ref|XP_002732496.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 497
Score = 213 bits (543), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 125/342 (36%), Positives = 184/342 (53%), Gaps = 12/342 (3%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
+ + L++V +A + PH+I I DD+GWNDV +H D I PN+ LA G+I
Sbjct: 7 IAVFLALAVVSKPSIAEADQPHVILIFVDDMGWNDVHWHNPD-IAMPNLMDLAADGVIFN 65
Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
YT CTPSR+A+MTG +P TG QH +++ G+PL K+LP+ LKE+GY T +V
Sbjct: 66 QSYTHPTCTPSRAAMMTGLYPFKTGNQHQMVFNLHPSGVPLEFKLLPEKLKEVGYSTHMV 125
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWT-GHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
GKWHLGF K EY PT RGF+SH G WT G DY + G D R ++ G
Sbjct: 126 GKWHLGFCKDEYLPTNRGFDSHYGLWTLGVGDYDKMNGVLSPSAGYDFRDNMGVVPKSDG 185
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
Y + A I++ H PLFL A P + L+ P+ Y + IED +
Sbjct: 186 -YLALMLGDRAEHIVNTHYPGTPLFL-----AFTLDIPAKHLEIPEEYEEKYSDIEDDRT 239
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
+F L +D+ +G VV+AL+ R M +S+I+F+ DNG ++ + SN+P RG+
Sbjct: 240 RQFYGKLTMMDDIIGGVVDALKNRGMYDDSLIIFIGDNGALSS----QSGSNYPFRGIAG 295
Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
T++EG R ++ ++ G + + + D T+L A
Sbjct: 296 TMFEGATRVPSIVSGKGIKKTGYESNELYSIVDIHRTILDVA 337
Score = 40.8 bits (94), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 36/65 (55%), Gaps = 14/65 (21%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTKGKWKLVKVVKVMRYQVDLTGG 631
E+DG+ +W S ++PS RN IL+NIDD+ +A+ G +KL+ TG
Sbjct: 344 ELDGLSMWPTWSEDKPSPRNEILYNIDDDPVSPGAAIRIGDYKLI------------TGH 391
Query: 632 PDQVY 636
PD +Y
Sbjct: 392 PDLLY 396
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 30/45 (66%), Gaps = 2/45 (4%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTRGKWKLV 551
E+DG+ +W S ++PS RN IL+NIDD+ +A+ G +KL+
Sbjct: 344 ELDGLSMWPTWSEDKPSPRNEILYNIDDDPVSPGAAIRIGDYKLI 388
>gi|115533418|ref|NP_001041232.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
gi|351060348|emb|CCD68016.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
Length = 452
Score = 212 bits (540), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 125/334 (37%), Positives = 186/334 (55%), Gaps = 35/334 (10%)
Query: 66 LADDLGWNDVGFHGLDQIPTPNIDALAY--SGIILKNYYTVQLCTPSRSAIMTGKHPIHT 123
+ADDLG++DV + + TPN+ LA+ + +L N Y QLCTP+RSA MTG +P
Sbjct: 1 MADDLGFSDVDWKD-STLHTPNLRHLAFHKNTALLSNSYVNQLCTPTRSAFMTGYYPFRV 59
Query: 124 GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG 183
G Q+ V E G+P L + +++L Y T +VGKWHLG+ KKE+ PT RGF+ G
Sbjct: 60 GTQNGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYG 119
Query: 184 YWTGHQDYFDHSAE----EMK--MWGLDMRRDLE-----PAWDLHGKYSTDVFTAEAVDI 232
++ YF+HSA+ E+K + GLD+ ++ P + +G YSTD+FT A+ +
Sbjct: 120 FYGPQTGYFNHSADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSV 179
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF--------AAI 284
+ NH+ +P F++L++ A H PLQ + I K + F +
Sbjct: 180 LDNHNNSKPFFMFLSYQAVH-----PPLQVSQQ----SKTIGQGKEATFILRSHAHSTRM 230
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
L +D ++G++VE L+ + N++IVF SDNGG A N ASN PLRG K+T+WEGG
Sbjct: 231 LTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGGTA---NF-GASNAPLRGEKDTIWEGG 286
Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
+ + SP+ G + HV DW T+LS
Sbjct: 287 TKTTTFVHSPMYIEEGGTRDMMFHVVDWHATILS 320
>gi|405964717|gb|EKC30169.1| Arylsulfatase B [Crassostrea gigas]
Length = 898
Score = 212 bits (539), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 100/222 (45%), Positives = 148/222 (66%), Gaps = 10/222 (4%)
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
VGKWH+GFYKKEY PT RGF+S+ GY TG +DY+ H KM G+D+R + PA + G
Sbjct: 526 VGKWHVGFYKKEYLPTSRGFDSYFGYLTGSEDYYTHHRCYGKMCGMDLRENAVPA-NYSG 584
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
YST +F +A DI+ H +D+PLFLYL + HS PLQ P+ Y+ + HI+D KR
Sbjct: 585 IYSTYLFAQKAADIVQKHDSDQPLFLYLPFQSVHS-----PLQVPEKYIQPYLHIKDKKR 639
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
K+A ++ +DE++G + E +Q+ + +N+++VF +DNGG L +N+PLRG K
Sbjct: 640 RKYAGMVAAMDEAIGNLTEVFKQKGIWNNTLMVFSTDNGGQI----LEGGNNYPLRGWKA 695
Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+LWEGG+ G G + +L+ +G+V++ +HV+DW PTL+S A
Sbjct: 696 SLWEGGMHGVGFVHGQMLKRKGMVSKALMHVTDWFPTLVSLA 737
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 56/82 (68%), Gaps = 3/82 (3%)
Query: 42 LAFTLSMVF--VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
L F + + F + S PHI+F+LADD G+ND+G+HG +I TPN+D LA GI L+
Sbjct: 7 LTFLIVLFFNTFPVQGKSSQPHILFVLADDFGYNDIGYHG-SEIKTPNLDKLAGDGIRLE 65
Query: 100 NYYTVQLCTPSRSAIMTGKHPI 121
NYY +CTP+RS +++G++ +
Sbjct: 66 NYYVQPICTPTRSQLLSGRYQV 87
>gi|325109241|ref|YP_004270309.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969509|gb|ADY60287.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 485
Score = 211 bits (537), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 140/376 (37%), Positives = 200/376 (53%), Gaps = 51/376 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK-NYYTVQLCTPSRSAIMTGK 118
P+I+ +LADDLG+ ++G G QIPTP+ID+LA +G+ + Y T C+PSR+ ++TG+
Sbjct: 42 PNIVLLLADDLGYGELGCQGNPQIPTPHIDSLAANGVRFRCGYVTAAYCSPSRAGLLTGR 101
Query: 119 HPIHTGMQHNVLYGCERG---GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
+ G + N GLPL EK L + L + GY T +VGKWHLG + + P
Sbjct: 102 YQSRFGYEQNPTGARNEDPELGLPLEEKTLARRLHDAGYATGLVGKWHLGGTAR-FHPLR 160
Query: 176 RGFESHLGY-----------WTGHQDYFDHSA----------EEMKMWGLDMRRDLEPAW 214
RGF+ G+ + G + A +E M M D EPA+
Sbjct: 161 RGFDEFFGFLHEGHFFVPPPYEGVSTFLRRRALPNGKTGRWGDERLMLSTHMGHD-EPAY 219
Query: 215 D-----LHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA 262
D L G Y TD FT EA D I + D P FLYLA+ A HS PLQ
Sbjct: 220 DANNPILRGGQPVEEAAYLTDAFTREACDFIARNQ-DRPFFLYLAYNAVHS-----PLQG 273
Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
D Y+ HI D +R FAA+L LD+SVG+V++ LEQ ++ SN+I++F+SDNGG
Sbjct: 274 ADAYMQQFAHIADQQRRIFAAMLANLDDSVGEVLQTLEQHQLTSNTIVIFLSDNGGPTRE 333
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
+SN PLRG K ++EGG+R L+ P G V E+ + D PT L+AA +
Sbjct: 334 LT---SSNAPLRGEKGDMYEGGIRVPVLMQWPGQIPAGRVEERPIISLDLFPTCLAAA-E 389
Query: 383 SDIPNYVNSTVENIIP 398
+++P ++ N++P
Sbjct: 390 AELPTELDGV--NLLP 403
>gi|340619607|ref|YP_004738060.1| sulfatase [Zobellia galactanivorans]
gi|339734404|emb|CAZ97781.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
Length = 463
Score = 209 bits (531), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 135/344 (39%), Positives = 183/344 (53%), Gaps = 21/344 (6%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
LA L F L A P+II +L DD GW DVGF+G IPTPN+D LA GI+ N
Sbjct: 9 LALALMTAFSSL-AQEKKPNIIVVLTDDQGWADVGFNGATDIPTPNLDRLASEGIVFDNA 67
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERG---GLPLSEKILPQYLKELGYRTR 157
Y C+PSR+ ++TG++ G N+ Y E G PLSEK++P+ LKE GYRT
Sbjct: 68 YVSHPYCSPSRAGLLTGRYQARFGHDCNMPYDSENDDTVGTPLSEKMIPEALKEHGYRTS 127
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH 217
+GKWHLG + + P +GF+ G+ G +Y+ +K + R+ EP
Sbjct: 128 AIGKWHLGDHPSLH-PIHQGFDHWFGFAGGGMNYWGIPDGPIKT----IVRNGEPVPQNE 182
Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
+Y TD FT EA+D I D+P F+YLA+ A P+ P QA YL +HIE
Sbjct: 183 LRYLTDDFTDEAIDFI-TKKDDKPFFMYLAYNA-----PHAPDQATKQYLEQTKHIEYGG 236
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
RS +AA+++ +D +VG++ L M N+I+VF+SDNGG A N P RG K
Sbjct: 237 RSIYAAMVNAVDANVGRIDSTLVANGMKENTILVFLSDNGGRG-----EHADNRPYRGHK 291
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
L+EGG++ I P G E+ V D PT L AA
Sbjct: 292 GMLFEGGIKVPFFITWPKKLKGGQHYEKMVSSLDLFPTFLEAAG 335
>gi|260819479|ref|XP_002605064.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
gi|229290394|gb|EEN61074.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
Length = 485
Score = 209 bits (531), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 113/324 (34%), Positives = 164/324 (50%), Gaps = 56/324 (17%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+FI+ADDLGWNDVG+H D + TP +D LA G+IL Y +CTPSR+A MTG
Sbjct: 28 PNILFIVADDLGWNDVGWHNPD-VKTPVLDQLANEGVILNQCYVNYVCTPSRTAFMTGYF 86
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P H G QH V + + G+P LP+ LK+LGY T +VG
Sbjct: 87 PYHVGSQHTVFFPQQAQGIPSKFPFLPEKLKQLGYATHMVG------------------- 127
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
+ D + G+Y T FT VDI+ H D
Sbjct: 128 ---------------------------KDDKDVVKTRSGQYGTYFFTDRVVDIVEKHPAD 160
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLF+YL + H +PLQ P + +++ ++D R K ++ LDE+VG V A+
Sbjct: 161 TPLFMYLPYQNVH-----QPLQVPQRFEDMYASVQDENRRKLLGMVSALDEAVGNVTMAM 215
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ + N++++F +DNGG + + +N+PLRG K TLWEGG RG +L+
Sbjct: 216 KKAGLWDNTLVIFTTDNGGWISA----SGNNYPLRGGKVTLWEGGTRGVAFAHGKMLQKT 271
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
G + +H DW PT+L+AA +
Sbjct: 272 GYTNNEMIHAVDWFPTILAAAGGT 295
>gi|241601772|ref|XP_002405049.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215500554|gb|EEC10048.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 451
Score = 208 bits (530), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 115/269 (42%), Positives = 158/269 (58%), Gaps = 5/269 (1%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
I +GMQ V+ GLPL KI+PQ+ K+LGY+ ++GKWHLGF++ YTP RGF+S
Sbjct: 1 ILSGMQAEVIRDAAPWGLPLELKIMPQHFKDLGYQVNMIGKWHLGFFQTPYTPPKRGFDS 60
Query: 181 HLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF-TAEAVDIIHN-HST 238
GY+TG DY++H+A + L + + KYST + EA ++ N HS
Sbjct: 61 FFGYYTGENDYYNHTAGTLNRPPLYGTFKTKQTRCVLAKYSTHTWGLLEAFPLLLNLHS- 119
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
+P F Y ++ A H+A P QAP+ + +I + R+ +A +L LDES+G+VVE
Sbjct: 120 -QPFFCYFSYQAVHAALMNVPFQAPERNVLKFPYIGETNRTLYAGMLDSLDESIGRVVEI 178
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L ML ++II F SDNGG N NWPLRG K TLWEGGVR +WS
Sbjct: 179 LGNAGMLEDTIIAFSSDNGGLPYNAFSNRGFNWPLRGAKGTLWEGGVRVPAFVWSSKFLK 238
Query: 359 RGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
+ V++Q +H+SDWLPTL SAA D+ N
Sbjct: 239 KPRVSDQLMHISDWLPTLYSAAG-GDVSN 266
>gi|442749327|gb|JAA66823.1| Putative arylsulfatase b precursor [Ixodes ricinus]
Length = 274
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 103/227 (45%), Positives = 149/227 (65%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
S P+IIFILADDLGW+DV FHG QIPTPN+DALA GIIL +Y LCTPSR+A+MT
Sbjct: 45 SSMPNIIFILADDLGWDDVSFHGSPQIPTPNMDALAADGIILNQHYAQALCTPSRAALMT 104
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
G++PI+TGMQH V+ E GLPL +++P++ +LGY+T +VGKWHLG +KK++ P R
Sbjct: 105 GRYPIYTGMQHFVIQPGEPWGLPLEYRLMPEFFSDLGYKTHMVGKWHLGSFKKDFIPVRR 164
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
GF+S G++ QDY++ + E + G D + + + +YST +T AV +I +H
Sbjct: 165 GFDSFYGFYNADQDYYNKTLTEGEHTGYDFWLNEDIHIYPNNRYSTHHYTERAVSLIRSH 224
Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
+ +PLFLYL++ L+AP+ +N +I + R+ +A
Sbjct: 225 NPSQPLFLYLSYXXXXVGTGPSLLEAPEENVNKFLYIPEKNRTIYAG 271
>gi|291230930|ref|XP_002735418.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
partial [Saccoglossus kowalevskii]
Length = 480
Score = 207 bits (527), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 125/322 (38%), Positives = 176/322 (54%), Gaps = 12/322 (3%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHII I DD+GWNDV +H D I PN+ LA G+I YT CTPSR+A+MTG +
Sbjct: 10 PHIILIFVDDMGWNDVHWHNPD-IAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLY 68
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P TG QH +++ G+PL K+LP+ KE+GY T +VGKWHLGF K EY PT RGF+
Sbjct: 69 PFKTGNQHQMVFNLHPSGVPLEFKLLPEKFKEVGYSTHMVGKWHLGFCKDEYLPTNRGFD 128
Query: 180 SHLGYWT-GHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
SH G WT G DY + G D R ++ +G Y + A I++NH
Sbjct: 129 SHYGIWTLGVGDYDKMNGVLSPSAGYDFRDNMGVVPKSNG-YLALMLGDRAEHIVNNHYP 187
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
PLFL A P + L+ P+ Y + IED + +F L +D+ +G VV+A
Sbjct: 188 GTPLFL-----AFTLDIPAKHLEIPEEYEEPYADIEDDRTRQFYGKLTMMDDIIGGVVDA 242
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L+ R M +S+I+F+ DNG ++ A SN+P RG+ TL+EG R ++ ++
Sbjct: 243 LKNRGMYDDSLIIFIGDNGALSS----QAGSNYPFRGIAGTLFEGATRVPSIVSGKGIKK 298
Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
G + + + D T+L A
Sbjct: 299 TGYESNELYSIVDIHRTILDVA 320
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 61/128 (47%), Gaps = 22/128 (17%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTKGKWKLVKVVKVMRYQVDLTGG 631
E+DG+ +W S ++PS RN IL+NIDD+ +A+ G +KL+ TG
Sbjct: 327 ELDGLSMWPTWSEDKPSPRNEILYNIDDDPVSPGAAIRIGDYKLI------------TGH 374
Query: 632 PDQVYLS---GLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKN 688
PD +Y L+D + A+ + + PQ LF++ DP E+N
Sbjct: 375 PDLLYPYRDISLTDNYY----NYGDPPASGLPSTDQEANAPAPQNVEYLFNVVVDPEERN 430
Query: 689 NL-ADRSE 695
++ ADR +
Sbjct: 431 DISADRPD 438
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 30/45 (66%), Gaps = 2/45 (4%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTRGKWKLV 551
E+DG+ +W S ++PS RN IL+NIDD+ +A+ G +KL+
Sbjct: 327 ELDGLSMWPTWSEDKPSPRNEILYNIDDDPVSPGAAIRIGDYKLI 371
>gi|117558328|gb|AAI27451.1| Arsb protein [Rattus norvegicus]
Length = 523
Score = 207 bits (527), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 102/236 (43%), Positives = 149/236 (63%), Gaps = 15/236 (6%)
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM------KMWGLDMR 207
Y T +VGKWHLG Y+KE PT RGF+++ GY G +DY+ H A LD+R
Sbjct: 128 YATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLR 187
Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
EPA + YST++FT A +I NH ++PLFLYLA + H +PLQ P+ Y+
Sbjct: 188 DGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYM 242
Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
+ I+D R +A ++ LDE+VG V +AL+ R + +N++++F +DNGG +
Sbjct: 243 EPYDFIQDKHRRIYAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQTR----SG 298
Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+NWPLRG K TLWEGG+RGAG + SPLL+ +G+ + + +H++DWLPTL++ A S
Sbjct: 299 GNNWPLRGRKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAGGS 354
>gi|1030057|emb|CAA63067.1| murine arylsulfatase B [Mus musculus]
Length = 252
Score = 207 bits (526), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 102/237 (43%), Positives = 149/237 (62%), Gaps = 15/237 (6%)
Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDM 206
GY T +VGKWHLG Y+KE PT RGF+++ GY G +DY+ H A E + LD+
Sbjct: 2 GYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDL 61
Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
R EPA + + YST++FT A +I NH ++PLFLYLA + H +PLQ P+ Y
Sbjct: 62 RDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEY 116
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN 326
+ + I+D R +A ++ +DE+VG V +AL+ + +N++ +F +DNGG +
Sbjct: 117 MEPYGFIQDKHRRIYAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----S 172
Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+NWPLRG K TLWEGG+RG G + SPLL+ +G+ + + +H+SDWLPTL+ A S
Sbjct: 173 GGNNWPLRGRKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGS 229
>gi|323453557|gb|EGB09428.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 1605
Score = 206 bits (525), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 127/351 (36%), Positives = 191/351 (54%), Gaps = 27/351 (7%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIP-TPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
S PH++F+L DD+G NDVG+ D + TP ID LA G+ LK YY++ +CTP+R+A+M
Sbjct: 89 SAKPHVLFVLVDDMGSNDVGYQSHDMVGVTPFIDGLAEQGVRLKEYYSMHMCTPARAALM 148
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
+G +P+ GMQ + G+P + + LK LGY T VGKW LG + + P
Sbjct: 149 SGHYPMRIGMQLENIKPDSPWGMPRELTTMAETLKNLGYNTHGVGKWGLGHSQHGFLPVN 208
Query: 176 RGFESHLGYWTGHQDYFDH-----------SAEEMKMWG--LDMRR----DLEPAWDLHG 218
RGF++ GY + DY+ H A M + + M R DL+ DL+G
Sbjct: 209 RGFDTWYGYLSDEIDYYSHEYPAPFETVEDGATVMASFTDYVFMERSRPYDLQYMPDLNG 268
Query: 219 KYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQA--PDHYLNIHRHIED 275
+S++++T I+ + +++ EPLF+Y A TH P A + I I+
Sbjct: 269 THSSELYTQRVQQIVKSANASREPLFVYYASQMTHGPLDVPPRSAFSDAEWARIGADIKG 328
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
R KFA++L LD+SVG+VVE +++ + +++ VF SDNGG A +N+PLRG
Sbjct: 329 TVRRKFASLLLALDQSVGRVVEEMKRSDVWRDTLFVFASDNGGCWA----QGGTNYPLRG 384
Query: 336 VKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKSD 384
K+ LWEGGV+ ++S L+ S R HV+DW PTL+ S+
Sbjct: 385 GKHFLWEGGVKVPAFVYSELIPSSYRNRAHRGLFHVTDWYPTLVKLVASSN 435
>gi|443734654|gb|ELU18562.1| hypothetical protein CAPTEDRAFT_195389, partial [Capitella teleta]
Length = 330
Score = 206 bits (524), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 117/318 (36%), Positives = 171/318 (53%), Gaps = 36/318 (11%)
Query: 60 PHIIFILADDLG------------------WNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
P+ +FILADDLG +ND+G D I TPN+DALA G+IL N
Sbjct: 21 PNFVFILADDLGGMLASKLKNFQACSFSVGYNDLGLRDPDVI-TPNMDALASKGVILTNN 79
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y +C+PSR A+MTG++P + MQ V+ E L K LPQYLK+LGY+ ++GK
Sbjct: 80 YVQAVCSPSRHALMTGRYPSASAMQSIVIQPMEAKCSGLKYKFLPQYLKDLGYKNHMIGK 139
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
WHLG+ ++E PT RGF++ G + DY++H M W + D E G ++
Sbjct: 140 WHLGYCREECLPTSRGFDTFYGLYASSGDYWEHGIMGMYDWHTEAGVDFEA----RGTHA 195
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH--IEDFKRS 279
D+ + H +PLFLY A +H+ PLQ D ++N++ D +R
Sbjct: 196 QDLEIERLDKVFDEHDNKDPLFLYFAPQNSHT-----PLQPTDEFINLYPEDKFSDIRR- 249
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
K+ + LD VGK+V+ L + M++N+ ++FVSDNG N P RG K +
Sbjct: 250 KYLGLTSGLDAMVGKIVDKLTENDMMNNTYVIFVSDNGADPI-----EGRNAPFRGGKGS 304
Query: 340 LWEGGVRGAGLIWSPLLE 357
L+EGG + + I+SPLL+
Sbjct: 305 LFEGGTKSSSFIYSPLLK 322
>gi|291513548|emb|CBK62758.1| Arylsulfatase A and related enzymes [Alistipes shahii WAL 8301]
Length = 467
Score = 205 bits (522), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 129/349 (36%), Positives = 188/349 (53%), Gaps = 27/349 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+FI+ADDLGW DVG+HG IPTPNIDALA GI + +YT + +P+R+ +MTG++
Sbjct: 32 PNILFIVADDLGWGDVGYHG-SVIPTPNIDALAARGIEMNRFYTAPVSSPTRAGLMTGRY 90
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR-IVGKWHLGFYKKEYTPTFRGF 178
P G++ V+ GL E+ L L GY R IVGKWHLG ++ Y P RGF
Sbjct: 91 PSRFGIRKTVIPPWRDYGLDPEEQTLADMLAANGYAHRAIVGKWHLGHGRRAYYPLNRGF 150
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
G G DYF H E LD D E D YSTD+ EAV I +++
Sbjct: 151 THFYGCLNGALDYFTHEREGE----LDWHNDWESCRDE--GYSTDLIADEAVRCIGGYAS 204
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK----RSK----FAAILHKLDE 290
+ P FLY+A A H+ P QAP+ + H E F R K + A++ ++D+
Sbjct: 205 EGPFFLYVAFNAPHT-----PFQAPEDEIAEHISPEKFAALTPREKDAYTYRAMVTRMDK 259
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW-PLRGVKNTLWEGGVRGAG 349
VG+++E LE+ ++ N++++F+SDNGG NL S+ PLRG K W+GG+R
Sbjct: 260 GVGRILEKLERSGLVENTLVLFMSDNGGIP---NLPGGSSCAPLRGHKGREWDGGLRVPA 316
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
++ + G+ +EQ D PT+ + + P + N++P
Sbjct: 317 VLCRKGRFAEGVRSEQLAAFVDMAPTIAALVGAGEPPRPYDGV--NLLP 363
>gi|443706557|gb|ELU02545.1| hypothetical protein CAPTEDRAFT_109345 [Capitella teleta]
Length = 370
Score = 205 bits (521), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 115/315 (36%), Positives = 174/315 (55%), Gaps = 15/315 (4%)
Query: 71 GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
G++D+GF D I TPNIDALA G+I N Y +CTPSR A++TG++P + MQ+ V+
Sbjct: 10 GYHDLGFRNPDVI-TPNIDALATEGVIFTNNYVQSVCTPSRHALLTGRYPHRSAMQNLVI 68
Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
+ L K LP+YLK+LGY T VGKWH+G+ ++E PT RGF+S G + G
Sbjct: 69 MSNQARCTGLGYKFLPEYLKDLGYSTHAVGKWHVGYCREECLPTHRGFDSFFGLYDGDGY 128
Query: 191 YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAA 250
Y++H++ + D + G +S D+ I+ + EP FLY +
Sbjct: 129 YWNHTSTVIP-GAFDWNNSTGVYLEARGIHSEDLGAERLTAILDGQNAKEPFFLYFS--- 184
Query: 251 THSANPYEPLQAPDHYLNIH--RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNS 308
NP+ P Q +LN++ D +R K+ ++ LD VG +V+ L++ +++N+
Sbjct: 185 --PQNPHTPSQPQAEFLNLYPENRYPDIRR-KYLGLVSGLDRMVGGIVDGLKRNGLMNNT 241
Query: 309 IIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVH 368
I+FVSDNG + N P RG K +L+EGG + A I+SPLL G +H
Sbjct: 242 YIIFVSDNGA-----DPEEGLNDPFRGGKGSLFEGGTKSASFIYSPLLNKAGYENNGLMH 296
Query: 369 VSDWLPTLLSAANKS 383
++DW+PT+L A S
Sbjct: 297 ITDWMPTILKLAGGS 311
>gi|149197407|ref|ZP_01874458.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149139425|gb|EDM27827.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 454
Score = 204 bits (520), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 130/352 (36%), Positives = 183/352 (51%), Gaps = 36/352 (10%)
Query: 44 FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK-NYY 102
F + L+ ++ P+I+ ILADDLG+ DVG+HGL++IPTPNID +A G+ Y
Sbjct: 4 FITLLFSCSLLWATDKPNILIILADDLGYADVGYHGLEEIPTPNIDRIANEGVQFSAGYS 63
Query: 103 TVQLCTPSRSAIMTGKHPIHTGMQ----------HNVLYGCERGGLPLSEKILPQYLKEL 152
+C P+R+A+M+G + G + H V+ G+P K L QY +E
Sbjct: 64 NGSICGPTRAALMSGVYQQRIGCEGICGGRKLNEHVVV------GMPREVKTLAQYFQEA 117
Query: 153 GYRTRIVGKWHLG---FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD 209
GY T + GKWHLG + K PT RGF+ G G Y D E K +R+D
Sbjct: 118 GYATGLFGKWHLGGERLFDKTLMPTSRGFDEFFGILEGASLYDDTVNRERKY----IRQD 173
Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
D G+Y TD EAV I D+P FLYL A H+ P+QA + Y+
Sbjct: 174 T--VIDYEGEYFTDAIGREAVSFI-TRKGDKPFFLYLPFTAVHA-----PMQASEKYMQR 225
Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
HI D R FAA+L +D+++G+V +ALE + +L N++IVF SDNGG + N +
Sbjct: 226 FAHIADPNRRVFAAMLSAMDDNIGRVFDALEHQGILDNTLIVFWSDNGGKP---DNNYSL 282
Query: 330 NWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
N PL+G K +EGG+R + W G +Q V + D P+ L AA
Sbjct: 283 NHPLKGQKTQFYEGGIRVPACVRWPKGQIPAGKTLDQPVFLMDIFPSALEAA 334
>gi|311748319|ref|ZP_07722104.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
gi|126576822|gb|EAZ81070.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
Length = 472
Score = 204 bits (520), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 126/344 (36%), Positives = 199/344 (57%), Gaps = 22/344 (6%)
Query: 62 IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGKHP 120
++ I+ADDLG+ D+GF G QI TP++D LA +G+ + Y + +C+PSR+ +TG +
Sbjct: 37 LVLIVADDLGYGDLGFTGSTQIKTPHLDQLATNGVTFTQGYVSSAVCSPSRAGFITGINQ 96
Query: 121 IHTGMQHNVLYGCERG------GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+ G +N L G E G G+PLS+K + +L +LGY ++GKWHLG + ++ P
Sbjct: 97 VEFGHDNN-LAGVEPGFDIAYNGMPLSQKTIADHLNKLGYVNGLIGKWHLG-KEPQFHPL 154
Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
RGF+ GY G DYF+ S K + + + + + Y TD E+VD I
Sbjct: 155 KRGFDEFWGYTGGGHDYFE-SLPNGKGYKEPLESNFKTPDPI--TYITDDVGNESVDFIE 211
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
H DEP FL+ A A H+ P+QA + L +++HIED KR +AA++H+LD +VGK
Sbjct: 212 RHK-DEPFFLFAAFNAPHT-----PMQALEEDLALYQHIEDKKRRTYAAMVHRLDLNVGK 265
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
++ +LE++ + N+++VF SDNGG + NA+ N P RG K L EGG+ ++ P
Sbjct: 266 IMTSLEEQGLSENTLVVFFSDNGGPT---DSNASLNAPYRGQKGILLEGGIHVPFVMNLP 322
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
L G++ ++ V D +PT L+ A ++ + S V+ +IP
Sbjct: 323 GLLPEGLIYQEQVTSLDVVPTFLALAGDTETSMDMFSGVD-LIP 365
>gi|298706913|emb|CBJ29740.1| Formylglycine-dependent sulfatase, C-terminal fragment [Ectocarpus
siliculosus]
Length = 597
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 130/366 (35%), Positives = 190/366 (51%), Gaps = 53/366 (14%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLD--QIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
+SSG P++IFIL DD+G NDVG+ D Q+ TP ++ LA G+IL +YY+ ++CTPSR+
Sbjct: 69 SSSGQPNVIFILVDDVGTNDVGYESTDLWQL-TPFMNTLAAEGVILDDYYSNEICTPSRA 127
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
++MTG+ TGMQ V+ GLP+ E L + + GY T + GKWHLG Y
Sbjct: 128 SLMTGRDSFRTGMQFGVVEDSAAWGLPIDEVTLAERFQAAGYSTHMTGKWHLGVYSNANY 187
Query: 173 PTFRGFESHL----------------------GYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
P RGF++ L G +T +QD+ + + + R
Sbjct: 188 PFSRGFDTFLGYTGGGEGYYTHRECVTPEFEGGQYTCYQDFGYGDKDGYINFTTNTTRK- 246
Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNH---STDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
++ GKYST V T A+++ H S +PLFLY+AH A H EPL +P
Sbjct: 247 --GPEMTGKYSTTVITDRAIEVAREHVEKSPSDPLFLYVAHQAVH-----EPLDSPPPGS 299
Query: 268 NIHRHIE-----DF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
+E D+ +R A+I++ LD+ V ++ AL+ +L N++IVF SDNG
Sbjct: 300 FTDEEVELLEGLDYSHRERRALASIVYYLDKEVRRLHTALDALGVLDNAVIVFASDNGAC 359
Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL-----ESRGIVAEQYVHVSDWLP 374
+ SN+PLRG K+T ++GGVR G ++S E+RG +H DW P
Sbjct: 360 PS----KGGSNYPLRGFKHTTFQGGVRVPGFVYSKSADLIPEEARGTRYSGMMHSMDWTP 415
Query: 375 TLLSAA 380
TL S
Sbjct: 416 TLGSVV 421
>gi|372210445|ref|ZP_09498247.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 474
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 130/380 (34%), Positives = 211/380 (55%), Gaps = 32/380 (8%)
Query: 26 KELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPT 85
K GY T ++AF L S P+I+ IL+DD GW DVGF+G IPT
Sbjct: 3 KYYGYLTLLLAFCCTTLQ----------AQKSKQPNILIILSDDQGWGDVGFNGATDIPT 52
Query: 86 PNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERG----GLPL 140
PN++ALA G+I Y+ C+PSR+ ++TG++ G ++N + GLPL
Sbjct: 53 PNLNALAKDGVIFSQGYSSHPYCSPSRAGLLTGRYQQKFGHENNPENEKQNEDTVIGLPL 112
Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
+E ++ + L++ Y T +GKWHLG K + P RGF+ G+ G +Y+ + + K
Sbjct: 113 NELMISEVLQQNNYHTCAIGKWHLGNAHK-FLPNQRGFKDWFGFSGGGFNYWGKTTPKNK 171
Query: 201 MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEP 259
G+ ++ +P + Y TD F+ +A++ I +S T++P F+YLA+ A P+ P
Sbjct: 172 ELGV--MKNGKPVPENTLTYLTDDFSNQAINYIDQYSKTEQPFFMYLAYNA-----PHAP 224
Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
+QA Y N+ HIE+ +R+ +AA++ +D +GKV++ L++ + +N++IVF SDNGG
Sbjct: 225 IQATKEYTNLVTHIENGERAAYAAMVAGMDAGIGKVIQKLKETGVYNNTLIVFYSDNGGH 284
Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLS 378
+ G AS+ P RG K L+EGG+R +I W ++ E + + D PT+L+
Sbjct: 285 SHG-----ASSAPYRGHKGMLFEGGIREPFVISWPNKIKGNRTYKEPIIAL-DIFPTILA 338
Query: 379 AANKSDIPNYVNSTVENIIP 398
A N + +P N N++P
Sbjct: 339 ATN-TQVPASKNIDGVNLLP 357
>gi|443704179|gb|ELU01354.1| hypothetical protein CAPTEDRAFT_182406 [Capitella teleta]
Length = 548
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 117/323 (36%), Positives = 179/323 (55%), Gaps = 16/323 (4%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II+ ADD G++D G+ D I TPNID L GI N Y+ Q C+PSRS+ ++G++
Sbjct: 29 PNIIWFFADDYGYHDAGYRNSD-IHTPNIDQLVADGISFTNAYSAQQCSPSRSSFLSGRY 87
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
+GMQH V+ + L L YLKEL Y T GKWHLG+ KE TPT+RGF+
Sbjct: 88 AYTSGMQHGVISDTAAHCMDLKYNFLSDYLKELNYNTHASGKWHLGYCNKECTPTYRGFD 147
Query: 180 SHLGYWTGHQDYFDHSA-EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+ G ++G Y++H+ + M W + D + G +S D+ A+ ++ + + +
Sbjct: 148 TFSGGYSGEGKYYEHTTFQGMYDWHNGTKLDRSAS----GIHSQDLIEADLLNQL-DEND 202
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
PLF Y A THS P +P P+H R K+ ++ +D +GKVV
Sbjct: 203 GTPLFYYAAFHNTHS--PMQP--KPEHEALYDDLDAPSTRKKYLGLVSGMDAVIGKVVAK 258
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L+++ M N+ I F SDNGG +++ N P RG K++L++GG SPLL+
Sbjct: 259 LKEKDMFDNTYIFFSSDNGG-----DVHEGDNSPKRGAKSSLFDGGCHAHSWASSPLLKK 313
Query: 359 RGIVAEQYVHVSDWLPTLLSAAN 381
+G+ A+ +H++DWLPT++ A
Sbjct: 314 KGVEADGLLHITDWLPTIVKLAG 336
>gi|406830958|ref|ZP_11090552.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 441
Score = 204 bits (518), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 132/361 (36%), Positives = 194/361 (53%), Gaps = 24/361 (6%)
Query: 43 AFTLSMVFVDLVASSGP---PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
F +++++ L +++ P+I+ I+ADDLG+ DVGFHG IPTP+ID+LA SG
Sbjct: 9 GFFFAILWMGLASAADAAKLPNILVIVADDLGYADVGFHGGKDIPTPHIDSLAASGTRFS 68
Query: 100 N-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
+ Y + C+P+R+ ++TG++ G + N G GLPL+E + L+ GY T +
Sbjct: 69 SGYVSGPYCSPTRAGLLTGRYQQRFGHEFNP--GGANKGLPLTETTIADRLQAAGYATGL 126
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
VGKWHLG K + P RGF G+ GH YFD +++ + EP
Sbjct: 127 VGKWHLGTDPK-FHPLKRGFGEFFGFLAGHHTYFDKQEADIQR---GTTKVTEPG----- 177
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
Y TD F EAV I H + P FLYLA A H+ PL+A D LN I+ R
Sbjct: 178 -YLTDAFGREAVSFIERHQ-NHPFFLYLAFNAVHT-----PLEADDDRLNAFSSIDATPR 230
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGFNLNAASNWPLRGVK 337
+AA+L +D+++GKV+ +L + N++I F+SDNGG G +N++ N PLRG K
Sbjct: 231 RTYAAMLSAMDDAIGKVLASLRETAQDENTLIFFISDNGGPTMPGTTINSSINAPLRGSK 290
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENII 397
T EGG+R ++ P G V V D PT L+AA + P++ V N++
Sbjct: 291 RTTLEGGIRVPFVVRWPGHVPAGAVYAHPVIQLDIQPTALAAAGVTIQPDWKLDGV-NLL 349
Query: 398 P 398
P
Sbjct: 350 P 350
>gi|283779108|ref|YP_003369863.1| sulfatase [Pirellula staleyi DSM 6068]
gi|283437561|gb|ADB16003.1| sulfatase [Pirellula staleyi DSM 6068]
Length = 468
Score = 203 bits (517), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 129/353 (36%), Positives = 192/353 (54%), Gaps = 29/353 (8%)
Query: 30 YRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNID 89
+R + A L L L+ + A + P+I+ I+ DD+G++D+G HG IPTP++D
Sbjct: 3 FRYSLRALVALGL---LTAATTSMAADASRPNIVVIVGDDMGYHDLGVHGCKDIPTPHLD 59
Query: 90 ALAYSGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNV-LYGCERGGLPLSEKILPQ 147
ALA SG+ Y + C+P+R+ ++TG++ G + N GLPLSE L
Sbjct: 60 ALATSGVRCTSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNPGPTPTGEIGLPLSETTLAD 119
Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF----DHSAEEMKMWG 203
LK++GY+T +VGKWHLG +K + P RGF+ G+ G + YF + SA + G
Sbjct: 120 RLKKVGYKTGMVGKWHLGNDEKRH-PLSRGFDEFFGFLGGARTYFATPGNASAGTKLLRG 178
Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
++ + E Y TD F EAV I + S P FLYL A H+ P++A
Sbjct: 179 REVVDEKE--------YLTDAFAREAVAYI-DRSKASPFFLYLTFNAVHT-----PMEAS 224
Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
YL+ + D KR K+ A++ +D++VG+VV LE+ ++L N++I FVSDNGG A
Sbjct: 225 QKYLDRFTAVSDPKRQKYCAMMSAMDDAVGQVVAKLEREKLLENTLIFFVSDNGGPTAA- 283
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPT 375
N N PLRG K T WEGG+R + W + + G +Q V D++PT
Sbjct: 284 --NTGDNTPLRGFKATTWEGGIRVPYFVSWKGKIPA-GKTYDQPVIQIDFVPT 333
>gi|403182689|gb|EJY57565.1| AAEL017303-PA [Aedes aegypti]
Length = 176
Score = 203 bits (517), Expect = 3e-49, Method: Composition-based stats.
Identities = 91/162 (56%), Positives = 121/162 (74%), Gaps = 8/162 (4%)
Query: 44 FTLSMVFVDLVASSGP--------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
F L ++ + ++++SG P+I+ I+ DDLGWNDV FHG QIPTPNIDALAY G
Sbjct: 6 FGLVLLLLAVISASGDRSRGSNERPNIVVIVTDDLGWNDVSFHGSSQIPTPNIDALAYQG 65
Query: 96 IILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYR 155
IIL +YT LCTPSR+++M+GKHPI+ GMQH+V+ E GL L +K++P+Y +E GYR
Sbjct: 66 IILNRHYTPPLCTPSRASLMSGKHPINVGMQHHVIESNEPWGLGLDQKLMPEYFREAGYR 125
Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE 197
TR+VGKWHLGF++K YTPT RGF+SH GY + DY+DHS +
Sbjct: 126 TRLVGKWHLGFFRKAYTPTRRGFDSHFGYIGPYIDYWDHSLQ 167
Score = 42.4 bits (98), Expect = 1.3, Method: Composition-based stats.
Identities = 17/36 (47%), Positives = 26/36 (72%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
MQH+V+ E GL L +K++P+Y +E GYRTR++
Sbjct: 95 MQHHVIESNEPWGLGLDQKLMPEYFREAGYRTRLVG 130
>gi|298715187|emb|CBJ27859.1| Formylglycine-dependent sulfatase, C-terminal fragment
Formylglycine-dependent sulfatase, N-terminal
[Ectocarpus siliculosus]
Length = 610
Score = 203 bits (517), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 136/361 (37%), Positives = 194/361 (53%), Gaps = 50/361 (13%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIP-TPNIDALAYSGIILKNYYTVQLCTPSRSA 113
++SG PH+IFIL DD+G ND+G+ D TP +D+LA SG++L NYYT QLCTPSR++
Sbjct: 65 SASGQPHVIFILIDDVGTNDMGYRSTDLWELTPFLDSLASSGVVLDNYYTNQLCTPSRAS 124
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+MTG+ TGMQH ++ GLP E L K GY T + GKWHLG + P
Sbjct: 125 LMTGRDSFRTGMQHGIVDYSAPWGLPFEEVTLADRFKAAGYSTHMTGKWHLGVFSDASYP 184
Query: 174 TFRGFESHLGYWTGHQDYFDHSA-----------EEMKMWG-------LDMRRDLE---P 212
RGF++ LGY G + Y++HS +K +G +D + P
Sbjct: 185 FSRGFDTFLGYTGGGEGYYNHSTCFTPTFEGGEYSCLKDFGYGDEDGYIDYTTNTTKEGP 244
Query: 213 AWDLHGKYSTDVFTAEAVDIIHNH----STDEPLFLYLAHAATHSANPYEPLQAP----- 263
A + YST + T A+D+ H S+D+PLFLY+A+ A H+ PL AP
Sbjct: 245 A--MVDNYSTTIMTDRAIDVAREHTGTASSDDPLFLYVAYQAAHT-----PLDAPPPGTF 297
Query: 264 --DHYLNIHRHIEDF-KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
+ + R + D R FA I++ LD+ V ++ +ALE +L N++IV SDNG +
Sbjct: 298 TEEEEETLARVVSDEPDRHVFARIIYYLDKEVRRLHDALEALGVLDNAVIVLASDNGACS 357
Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL-----ESRGIVAEQYVHVSDWLPT 375
+ + SN PLRG KNT ++GGVR ++S E+RG +H +DW T
Sbjct: 358 S----SGGSNHPLRGFKNTTFQGGVRVPAFVYSKSTELIPEEARGTTYSGMMHSTDWTQT 413
Query: 376 L 376
L
Sbjct: 414 L 414
>gi|406832516|ref|ZP_11092110.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 453
Score = 203 bits (516), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 126/342 (36%), Positives = 188/342 (54%), Gaps = 24/342 (7%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN- 100
L F+++ V +S P+I+ I+ADDLG+ D+GF G IPTP ID LA SG+ +
Sbjct: 15 LLFSMTAVVGWAADASSKPNIVLIVADDLGYGDLGFQGGRDIPTPRIDGLARSGVTCSSG 74
Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQH---NVLYGCERGGLPLSEKILPQYLKELGYRTR 157
Y + C+P+R+ ++TG++ G + N E GL L E+ LPQ LK+ GY T
Sbjct: 75 YVSGPYCSPTRAGLLTGRYQQRFGHEFNPGNAARVTETFGLSLEERTLPQRLKQAGYATG 134
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF-DHSAEEMKMWGLDMRRDLEPAWDL 216
IVGKWHLGF ++ P RGF+ G+ G YF D ++ + +RR E +
Sbjct: 135 IVGKWHLGF-APQFQPLERGFDEFFGFLGGAHPYFPDANSND------PIRRGREAV--V 185
Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
+Y TD F E+V I + + + P FLYLA A H+ P A +L + I D
Sbjct: 186 ESEYLTDAFARESVAYI-DRNKNRPFFLYLAFNAVHN-----PQHAKPEHLERFKSIADE 239
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
+R +A +L +D++VG+V++ L ++ +++VF+SDNGG A N ++N PL G
Sbjct: 240 RRRTYAGMLTAMDDAVGQVLDKLAAEKLEQKTLVVFISDNGGPPA----NGSTNTPLNGQ 295
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
K T WEGGVR ++ P + G E+ V D LPT+L+
Sbjct: 296 KATTWEGGVRVPFVVSWPGQLAAGKKYEEPVIQLDLLPTILT 337
>gi|326433598|gb|EGD79168.1| hypothetical protein PTSG_09899 [Salpingoeca sp. ATCC 50818]
Length = 515
Score = 203 bits (516), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 120/343 (34%), Positives = 188/343 (54%), Gaps = 22/343 (6%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHG--------LDQIPTPNIDALAYSGIILKNYYTVQ 105
VA + P+I+ I+ DD+GW + H + TP + L G+ L +Y +
Sbjct: 22 VARTSQPNIVLIIGDDVGWANTELHCNISDPAQVCSDVKTPFLLQLKDEGVHLLRHYAYK 81
Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
C+P+RSA+M+G+ PIH + N Y + GG+ L+ +LP+ L +GY T VGKWHLG
Sbjct: 82 FCSPTRSALMSGRLPIHVN-EVNPTY-IKGGGVSLNFTMLPEKLSRVGYATHHVGKWHLG 139
Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
PT RGF+S LGY G +D++ + + G+D+ + PA+ + +Y +F
Sbjct: 140 MSSHSRVPTGRGFDSSLGYLLGAEDHWKQTVTQGNATGVDLWLNEGPAYLRNNEYGDLLF 199
Query: 226 TAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
AV++IH+H+ D PLFLY+ HS P + P Y + + I+ R+
Sbjct: 200 GRRAVEVIHDHAKNHRDSPLFLYVGLQVNHS-----PAEVPKEYQDRYPGIQ-VDRAIVN 253
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
+ +D+++GK+ AL + M ++++V++SDNGG A+G N N +N+PLRG K+T +E
Sbjct: 254 GMSSVMDDAIGKITTALWETTMWEDTLVVYISDNGG-ASGINANNGNNYPLRGGKHTDFE 312
Query: 343 GGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAANKS 383
GGVR A L+ L + RG + Y+H DW TL A S
Sbjct: 313 GGVRTAALVSGGFLPDDQRGSTRDGYMHACDWYTTLAKLAGAS 355
>gi|323456975|gb|EGB12841.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 536
Score = 203 bits (516), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 126/344 (36%), Positives = 179/344 (52%), Gaps = 21/344 (6%)
Query: 60 PHIIFILADDLGWNDVGFHG----LDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
PH+ I+ADDLG+ DV ++G + + TP +D LA GI L +Y+ CTP+R+A++
Sbjct: 33 PHVFLIVADDLGYGDVSYNGDGSLTNAVATPYLDRLAADGITLTRFYSQCDCTPARAALL 92
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TG++P +TGMQH V+ + LP +LP L E GYR +GKW +G + TPT
Sbjct: 93 TGRYPSNTGMQHEVVTAQSQWSLPHEFALLPSALPE-GYRKHAIGKWDVGHARAADTPTA 151
Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGL--DMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
RGF+SHLGY+ Y +H+A G DM D +YST +F A+ ++
Sbjct: 152 RGFDSHLGYYGAEITYDEHAALRSCSNGTIRDMNHDGATLAATEDRYSTHLFADHAMALV 211
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPD----HYLNIHRHIEDFKRSKFAAILHKLD 289
+ + LFLYL A H +PL A + + D RS FAA+ +LD
Sbjct: 212 DREADEYKLFLYLCFQAVH-----QPLAADAALVKRFADAFLDESDAARSTFAAVALELD 266
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
+V + VE + R +++ F+SDNG A SNWPLRG K T +EGGVR
Sbjct: 267 RAVERFVEFTKARGAYDDAVFFFMSDNGATLA--QSGGGSNWPLRGSKFTAYEGGVRVPA 324
Query: 350 LIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAANKSD-IPNYVN 390
+ S + RG HV D LPTL + A +D +P V+
Sbjct: 325 FLHSARVPPARRGAAHAGLFHVVDVLPTLAAFAESADRVPAGVD 368
>gi|325285341|ref|YP_004261131.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
gi|324320795|gb|ADY28260.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
Length = 460
Score = 203 bits (516), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 125/345 (36%), Positives = 185/345 (53%), Gaps = 21/345 (6%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
L LAF F L+ + P+II +L DD GW DVGF+G IPTPN+D +A G+I
Sbjct: 7 LILAFCF-FYFAQLITAQKQPNIIVVLTDDQGWADVGFNGATDIPTPNLDRIASEGVIFS 65
Query: 100 NYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERG---GLPLSEKILPQYLKELGYR 155
N Y C+PSR+ ++TG++ G N+ Y E G PLSEK++ + LKE GYR
Sbjct: 66 NGYVSHPYCSPSRAGLLTGRYQARFGHDCNMPYEGENDATVGTPLSEKLISEALKEQGYR 125
Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
T +GKWH+G + + P +GF+ G+ G +Y+ + +++ + R+ +P +
Sbjct: 126 TSAIGKWHIGDHPNLHPPA-QGFDHWFGFPGGSMNYWGKATSKIQT----IYRNTKPVAE 180
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
Y TD FT EA++ I N P F+YLA+ A P+ P A YL +HIE
Sbjct: 181 EELTYLTDDFTNEAINFI-NKKDKNPFFIYLAYNA-----PHAPDHATKKYLEQTKHIEY 234
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
RS +AA+++ +D ++GK+ L + N+I+VF+SDNGG + A N P RG
Sbjct: 235 AGRSVYAAMVNAVDANIGKIDSTLVANGIKDNTILVFLSDNGG-----RIQHADNRPYRG 289
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
K L+EGG++ I P + + D PTLL+AA
Sbjct: 290 HKGMLFEGGIKVPFFISWPSKIKEDSKYTKPISSLDLFPTLLNAA 334
>gi|402820941|ref|ZP_10870501.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
IMCC14465]
gi|402510173|gb|EJW20442.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
IMCC14465]
Length = 496
Score = 202 bits (514), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 120/348 (34%), Positives = 191/348 (54%), Gaps = 25/348 (7%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
L L L + F+ +S P+I+ ILADDLG+ D+GF G D I TPN+D LA GI+L
Sbjct: 3 LGLLILLFVAFISGAKASEKPNIVIILADDLGYADLGFRGSD-IQTPNLDRLAAEGIVLN 61
Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
+Y++ +CTP+RSA+MT + PI G + L E GG+ E +P+ K+ GY+T ++
Sbjct: 62 RFYSLPICTPTRSALMTARDPIKLGTAYAGLQPWENGGVSPDEHFMPESFKKAGYQTAMI 121
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA---EEMKMWGLDMRRDLEPAWDL 216
GKWH+G + P RGF+ G+ DY+ H++ +++ G ++RD
Sbjct: 122 GKWHIGRQYESLVPHHRGFDHFFGHLNTQVDYYTHASAGGHDLQENGKSLKRD------- 174
Query: 217 HGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
Y+TD+ E+V + +P LY+ A HS P+QAP+ ++ +
Sbjct: 175 --AYATDIHGDESVRYLKEIRDPSKPFLLYVPFLAPHS-----PMQAPEALEAKYKSRLN 227
Query: 276 FKRSK--FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
F K +AA++ +D+++G++++ L+++ + N+II+F SDNGG AGF N P
Sbjct: 228 FPVPKRTYAAMVDSMDQAIGRIMDELDRQGVSDNTIILFFSDNGG-FAGF---GGVNKPY 283
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
RG K +EGG+R ++ P G Q + V D PTL +AA+
Sbjct: 284 RGGKLETFEGGIRVNAIMRWPGAIPAGSKTAQAISVMDVFPTLAAAAD 331
>gi|296121469|ref|YP_003629247.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296013809|gb|ADG67048.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 487
Score = 201 bits (511), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 118/317 (37%), Positives = 174/317 (54%), Gaps = 27/317 (8%)
Query: 32 TRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
++I AFA LA ++ V ++ P+I+ I+ DD+G+ DVGFHG IPTPN+DAL
Sbjct: 14 SQISAFAFCMLALVITPVI-----AADRPNILLIVGDDMGYADVGFHGCKDIPTPNLDAL 68
Query: 92 AYSGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
A SG+ Y T C+P+R+ ++TG++ G + N GLPL+E + LK
Sbjct: 69 AKSGVQFTSGYVTGPYCSPTRAGLLTGRYQQRFGHEFNP--SGANTGLPLTEVTIADRLK 126
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
++GY T +VGKWHLG + P RGFE +G+ G +FD + R
Sbjct: 127 QVGYTTGLVGKWHLG-SQPAMHPQERGFEEFIGFLGGAHSFFDAQG---------ILRGH 176
Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
EP + Y+TD+F EAV I H D+P FLYL+ A H+ P+ A + +
Sbjct: 177 EPVKTI--DYTTDLFGREAVSFIEKHR-DKPWFLYLSFNAVHT-----PMHATEDRMAKL 228
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGFNLNAAS 329
I D +R +AA++ +DE++GKV+ LE ++++F+SDNGG G +N +
Sbjct: 229 ASISDQERRTYAAMMLAMDEAIGKVLTQLETTGQKQKTLVMFISDNGGPTMPGVTINGSI 288
Query: 330 NWPLRGVKNTLWEGGVR 346
N PLRG K T EGG+R
Sbjct: 289 NTPLRGSKRTTLEGGIR 305
>gi|323456753|gb|EGB12619.1| hypothetical protein AURANDRAFT_70521 [Aureococcus anophagefferens]
Length = 913
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 131/417 (31%), Positives = 209/417 (50%), Gaps = 44/417 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI++ L DDLGW+DV +H + TP + ALA G++L +Y + C+PSRS++++G++
Sbjct: 411 PHIVYFLVDDLGWHDVPWHN-PSLKTPTLAALAADGVVLDRFYAYRFCSPSRSSLLSGRY 469
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P+H Q+N+ GG+ ++ + + LK GY T +GKWH G + P RGF+
Sbjct: 470 PMHVN-QYNMAGDALGGGVHVNMTTIAKKLKGAGYATHQLGKWHAGQSSADLVPAARGFD 528
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWG-LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+ LGY G +D++ + + +D+ PA+ +G Y ++ A+DII +H
Sbjct: 529 TSLGYLNGAEDHWTQARPACGVGNFVDLYATDGPAFGKNGTYGAQIYHDAALDIIADHDA 588
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK----RSKFAAILHKLDESVGK 294
PLF+Y A H+ P+Q PD Y ++ +D K RS + A+ D+ V
Sbjct: 589 SVPLFVYFAFQINHA-----PMQVPDAYARVY-PCDDPKTCGVRSTYQAMTAMADDVVRN 642
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
AL+ + M +++IV SDNGG +G + ++++N+PLRG K + EGGVR AGL+
Sbjct: 643 ATAALKAKGMWDDTLIVLTSDNGG-PSGTDADSSNNFPLRGGKYSDLEGGVRAAGLVAGG 701
Query: 355 LL--ESRGIV---AEQYVHVSDWLPTL-------------------LSAANKSDIPNYVN 390
L +RG A YVH+ DW T L A + D+ +++
Sbjct: 702 FLPDAARGRTLDGAGAYVHICDWYATFARLAGVDAADDASDADGRPLPAVDSLDVWDFL- 760
Query: 391 STVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYEN--RYENGT 445
S + PR E + G H+ RY+ + +P N Y NGT
Sbjct: 761 SGAADASPRTE---IVLTAGRHQPGGSGGGYIGGRYKVVFNTQSPAIWNGPEYPNGT 814
>gi|323454930|gb|EGB10799.1| hypothetical protein AURANDRAFT_22180 [Aureococcus anophagefferens]
Length = 530
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 121/340 (35%), Positives = 171/340 (50%), Gaps = 25/340 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPT--PNIDALAYSGIILKNYYTVQLCTPSRSAIMTG 117
PHI F+L DDLG+ND + P++ ALA + + YYT LCTP+R A MTG
Sbjct: 20 PHIAFMLVDDLGFNDAWAVNTSDVAGAWPHVGALANESVRIATYYTTMLCTPTRGAFMTG 79
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
+ P G+ H V+ G + GLP +E + LK GY T VGKWHLG + PT RG
Sbjct: 80 RLPQRLGLHHGVIGGFQDYGLPANETTIADKLKGAGYATAHVGKWHLGNFDDASEPTRRG 139
Query: 178 FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD------LEPAWDLHGKYSTDVFTAEAVD 231
F++ GY G +D+F + + + D ++ ++G +++ +F AV
Sbjct: 140 FDASYGYQNGEEDHFTRILQGLGDFQASSLADGASYYTVDDLLKINGTHNSFLFVDRAVA 199
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY--------LNIHRHIEDFKRSKFAA 283
++ H PLFLYLA TH+ P+QAP Y + RH D R +F
Sbjct: 200 VVGAHDASIPLFLYLALQDTHA-----PIQAPAGYADDAACASIGSSRHYPDGARREFCG 254
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
++ DES+ V A++ NS+IV DNGG G +NWPLRG K WEG
Sbjct: 255 MVRAADESIANVTAAIDANFGKRNSVIVIAGDNGGIVKG----GGNNWPLRGQKGQPWEG 310
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
GVR LI +P + G+ + VHV D TL++ A +
Sbjct: 311 GVRNHALIRAPGRLAPGLYDKGMVHVVDLHATLVALAGGA 350
>gi|340367649|ref|XP_003382366.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 495
Score = 200 bits (509), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 119/331 (35%), Positives = 180/331 (54%), Gaps = 21/331 (6%)
Query: 53 LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
VA++ P+I+F+L DD G+ DVGF I +PN D LA +G++L +Y + C PSR+
Sbjct: 17 FVATAKLPNIVFVLVDDWGFADVGFRN-PAISSPNFDQLAKTGLVLNRHYVFKYCAPSRA 75
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
+++TG+ P H Q N+ G L+ + P LK Y T +VGKWH GF+ Y
Sbjct: 76 SLLTGRWPHHV-YQWNLATD-ATAGTNLNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYL 133
Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
P RGF++ G+ G +D+ +A + +D ++ P +G Y ++ + DI
Sbjct: 134 PINRGFDTSSGFLCGSEDHMTQNA----ICAIDYWKNNAPD-PRNGTYDAYIYRDDLTDI 188
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF-KRSKFAAILHKLDES 291
I++H+TDEPLFLYL H PLQAP +L+I+ KR A++ D
Sbjct: 189 INSHNTDEPLFLYLPLHNVH-----LPLQAPKEWLDIYPANSTCDKRHTLQAMVSVADNV 243
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
G +VE L+++ + N+I+V +DNGGA + SN+PLRG K TL+EGGVR +
Sbjct: 244 TGHLVELLKKKGLWDNTIMVISADNGGAHC-----SGSNYPLRGCKETLFEGGVRSLAFV 298
Query: 352 WSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
LL RG + ++H++DW T A
Sbjct: 299 NGGLLPESRRGQSTDGFIHIADWYTTFCKLA 329
>gi|443700441|gb|ELT99395.1| hypothetical protein CAPTEDRAFT_208054 [Capitella teleta]
Length = 558
Score = 200 bits (509), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 121/338 (35%), Positives = 183/338 (54%), Gaps = 24/338 (7%)
Query: 60 PHIIFILADDL-----GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
P++I+ ADD G+ D G+ I TPNID L GI N Y+ Q C+PSRS+
Sbjct: 66 PNVIWFFADDYDLALPGFQDAGYRN-SAIHTPNIDKLVGDGISFTNAYSSQQCSPSRSSF 124
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
++G++P +GMQH V+ + L K L YLK+L Y T VGKWHLG+ K+ TPT
Sbjct: 125 LSGRYPYKSGMQHGVISDEGPNCMDLKFKFLSDYLKDLNYNTHAVGKWHLGYCNKKCTPT 184
Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKM-WGLDMRRDLEPAWDLHGKYSTDVFTAEAV-DI 232
+RGF++ G ++G DY++H+ W DL A G +S D+ A+ + ++
Sbjct: 185 YRGFDTFSGGYSGEGDYYEHTTHGGWYDWHNGTETDLTAA----GVHSQDLVEADMMKNL 240
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
N T P F Y A THS P +P P+H + R K+ ++ +D +
Sbjct: 241 DENDGT--PFFYYTAFHNTHS--PLQP--KPEHKALYNDMDATPSRKKYLGLVSGMDAVI 294
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G VV L+ + M N+ I F SDNGG ++N N P RG K++L+EGG
Sbjct: 295 GNVVAKLKAKGMFDNTYIFFSSDNGG-----DVNEGDNSPWRGAKSSLFEGGCHAHSWAS 349
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVN 390
SPL+ +G+ A+ +H++DWLPT+L+ A + +P+ ++
Sbjct: 350 SPLIRKKGVEADGLLHMTDWLPTILTLAGGT-VPDGID 386
>gi|32470862|ref|NP_863855.1| arylsulfatase B [Rhodopirellula baltica SH 1]
gi|32443007|emb|CAD71528.1| arylsulfatase B [Rhodopirellula baltica SH 1]
Length = 520
Score = 200 bits (508), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 126/361 (34%), Positives = 190/361 (52%), Gaps = 30/361 (8%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F LPL F + A S PP+I+ ILADD+G+ D+G G + TPN+D LA SG+
Sbjct: 34 FVFLPLIFVFATE-ASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGV 92
Query: 97 ILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNV-------LYGCERGGLPLSEKILPQY 148
+ Y +C+PSR+ ++T + P G + N+ E GLP SEK L +
Sbjct: 93 LCSQAYVASAVCSPSRAGLLTSRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTLADH 152
Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
L GY T ++GKWHLG + + P RGF+ G TG YF + + + + R
Sbjct: 153 LGAAGYATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGSHHYFPATMKHV------IER 205
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDH 265
+ + D +Y TD FT E + I H + D+P F++ ++ A P+ P+ A +
Sbjct: 206 NGKRVDDFSSEYLTDFFTDEGLRFIDQHKSANPDQPWFVFFSYNA-----PHTPMHATEA 260
Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
L +I++ KR +AA+++ LD VG++ E LE+ N+++VF SDNGGA
Sbjct: 261 DLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGATN---- 316
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
N + N PLRGVK ++ EGG+R +IW+ P G++ + V D LPT SAA
Sbjct: 317 NGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVLYDGVVSSLDLLPTFCSAAGAEP 375
Query: 385 I 385
+
Sbjct: 376 L 376
>gi|340367651|ref|XP_003382367.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 494
Score = 200 bits (508), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 121/331 (36%), Positives = 182/331 (54%), Gaps = 21/331 (6%)
Query: 53 LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
VA++ P+IIF+L DD G+ DVGF I +PN D LA +G++L +Y C+PSR+
Sbjct: 16 FVATAELPNIIFVLVDDWGFADVGFKN-PAISSPNFDHLAKTGLVLNRHYVYMYCSPSRA 74
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
+++TG+ P HT Q N L G L+ ++P LK Y T +VGKWH GF+ Y
Sbjct: 75 SLLTGRWPHHT-HQWN-LGNNSTAGTNLAMTMIPAKLKAANYATHMVGKWHQGFFDPRYL 132
Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
P RGF++ G+ G +D+ +A + +D ++ P +G Y ++ + DI
Sbjct: 133 PINRGFDTSSGFLCGSEDHMTQNA----ICAIDYWKNNAPD-PRNGTYDAYIYRDDLTDI 187
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF-KRSKFAAILHKLDES 291
I++H+T+EPLFLYL H PLQAP +L+I+ KR A++ D
Sbjct: 188 INSHNTNEPLFLYLPLHNVH-----LPLQAPKEWLDIYPANSTCEKRHTLQAMISVADNV 242
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
G +VE L+++ + N+I+V +DNGGA + SN+PLRG K TL+EGGVR +
Sbjct: 243 TGHLVELLKKKGIWDNTIMVISADNGGAHC-----SGSNYPLRGCKETLFEGGVRSLAFV 297
Query: 352 WSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
LL RG + ++H++DW T A
Sbjct: 298 NGGLLPESRRGQSTDGFIHIADWYTTFCKLA 328
>gi|291244839|ref|XP_002742301.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 263
Score = 200 bits (508), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 95/186 (51%), Positives = 127/186 (68%), Gaps = 2/186 (1%)
Query: 203 GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA 262
G D+R ++ + ++G Y+T+VFT +A II H+ PLFLYLAH A HS NP PL+A
Sbjct: 2 GYDLRDNVNVTYYMNGSYTTEVFTTKAEQIIKKHNKQNPLFLYLAHLAVHSGNPSNPLEA 61
Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
P+ Y++ HI+D R FA ++ LDESVG + + L+ M +N+II+F +DNGG AAG
Sbjct: 62 PEKYMSRFPHIQDKNRKTFAGMVSALDESVGNLTKVLKDSGMYNNTIIIFTTDNGGPAAG 121
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL--LSAA 380
FN N ASNWPLRG+K TLWEGGVRG G + SPLLE G V+ +HV DWLPT+ ++
Sbjct: 122 FNNNHASNWPLRGIKATLWEGGVRGDGFVHSPLLEKPGRVSTDMIHVCDWLPTIYHIAGG 181
Query: 381 NKSDIP 386
N S +P
Sbjct: 182 NVSSLP 187
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 33/59 (55%)
Query: 501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN 559
N +S +DG +VW LSR S R ILHNID QI+AL GK+K++ GN
Sbjct: 181 GNVSSLPTNLDGFNVWDTLSRGSVSPRTEILHNIDPILQIAALRVGKYKIIMGECYRGN 239
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 31/51 (60%)
Query: 566 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
N +S +DG +VW LSR S R ILHNID QI+AL GK+K++
Sbjct: 181 GNVSSLPTNLDGFNVWDTLSRGSVSPRTEILHNIDPILQIAALRVGKYKII 231
>gi|313247306|emb|CBY15582.1| unnamed protein product [Oikopleura dioica]
Length = 486
Score = 200 bits (508), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 110/312 (35%), Positives = 171/312 (54%), Gaps = 20/312 (6%)
Query: 81 DQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL 140
D I TPNIDA++ +G+ L+NYY +CTPSRS +++G++ IHTG+QH +++ LPL
Sbjct: 9 DIIKTPNIDAISAAGVRLENYYVQPICTPSRSQLLSGRYQIHTGLQHQLIWMGMPSALPL 68
Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS-AEEM 199
++LP+ ++ GY T GKWHLG+ K TP RGF + GY G +DY+ + +
Sbjct: 69 DTELLPETMRNCGYHTMAAGKWHLGYAKTANTPWGRGFHNFTGYLGGSEDYYKKTRCIDH 128
Query: 200 KMWGLDMRRDLEPAWDL-----HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA 254
G+D D E + +YS + +A + I D+P FLYL + H+
Sbjct: 129 HKCGIDQNTDGEIFGERVYNADASEYSAFKYIRQAKNYIDGRDKDKPFFLYLPMQSVHA- 187
Query: 255 NPYEPLQAPDHYLNIHRH-IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
PL+AP Y+ + IED R +A ++ +DE +G + L++ + +N+I +F
Sbjct: 188 ----PLEAPAEYIAKYDGIIEDKPRKIYAGMVSVMDEGIGNITSHLKESGLWNNTIFIFS 243
Query: 314 SDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR----GIVAEQYVHV 369
+DNGG A +N PLRG K WEGG+ G G + E R +V + +H+
Sbjct: 244 TDNGGQA----YVGGNNLPLRGNKAGYWEGGIHGNGFVAGGYFEDRRKGSELVNNELIHI 299
Query: 370 SDWLPTLLSAAN 381
SDW PT++ A +
Sbjct: 300 SDWYPTIMEATS 311
>gi|32473617|ref|NP_866611.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
gi|32398297|emb|CAD78392.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
Length = 543
Score = 199 bits (507), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 129/339 (38%), Positives = 180/339 (53%), Gaps = 28/339 (8%)
Query: 53 LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSR 111
+V + P+I+ I+ADDLG++DVGF+G +IPTP++D LA SG++ N Y C+PSR
Sbjct: 38 VVGAKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSR 97
Query: 112 SAIMTGKHPIHTGMQHNV-----LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
+ ++TG+H G N +G + G+PLSE L LKE GY T +GKWHLG
Sbjct: 98 AGLLTGRHQQRFGHGSNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLG- 156
Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT 226
K + P RGF+ G+ G Y+ MK L + R EP + TD F+
Sbjct: 157 DAKPFWPNRRGFDEWFGFSGGGFSYWGDLG--MKDPLLGVHRGDEPVDPKTLTHLTDDFS 214
Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH----RHIEDFKRSKFA 282
EAV I H T EP FLYLA+ A H APDH H HIE R+ +
Sbjct: 215 TEAVKFIQRHET-EPFFLYLAYNAPH---------APDHATRAHLQKTAHIEYGGRAVYG 264
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
A++ +DE +G+VV+ + + + N++I+F SDNGG A N+P RG K L+E
Sbjct: 265 AMVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG-----RREHAVNFPYRGHKGMLFE 319
Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
GG+R L+ P G+ E + D PT L+AA
Sbjct: 320 GGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAG 358
>gi|440715767|ref|ZP_20896296.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
gi|436439253|gb|ELP32723.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
Length = 826
Score = 199 bits (507), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 125/339 (36%), Positives = 182/339 (53%), Gaps = 28/339 (8%)
Query: 53 LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSR 111
+V + P+I+ I+ADDLG++DVGF+G +IPTP++D LA SG++ N Y C+PSR
Sbjct: 38 VVGAKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSR 97
Query: 112 SAIMTGKHPIHTGMQHNV-----LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
+ ++TG+H G + N +G + G+PLSE L LKE GY T +GKWHLG
Sbjct: 98 AGLLTGRHQQRFGHESNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLG- 156
Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT 226
K + P RGF+ G+ G Y+ ++ + G+ R EP + TD F+
Sbjct: 157 DAKPFWPNHRGFDEWFGFSGGGFSYWGDLGKKDPLLGV--HRGDEPVDPKTLTHLTDDFS 214
Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH----RHIEDFKRSKFA 282
EAV I H + EP FLYLA+ A H APDH H HIE R+ +
Sbjct: 215 TEAVKFIQRHES-EPFFLYLAYNAPH---------APDHATRAHLQKTAHIEYGGRAVYG 264
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
A++ +DE +G+VV+ + + + N++++F SDNGG A N+P RG K L+E
Sbjct: 265 AMVAGMDEGIGRVVDQIRESGLGENTMVIFYSDNGG-----RREHAVNFPYRGHKGMLFE 319
Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
GG+R L+ P G+ E + D PT L+AA
Sbjct: 320 GGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAG 358
>gi|326428223|gb|EGD73793.1| arylsulfatase [Salpingoeca sp. ATCC 50818]
Length = 697
Score = 199 bits (507), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 114/326 (34%), Positives = 175/326 (53%), Gaps = 18/326 (5%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHI++ L DDLG+ + G D+ TP+ID LA +G +L+ +YT + C+P+RS++++G+
Sbjct: 74 PPHIVYFLVDDLGYANTGVTN-DEPLTPHIDELARNGTMLRRFYTYKYCSPTRSSLLSGR 132
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
PIH Q N GG+ + L + L+ GY T VGKWH G + P RGF
Sbjct: 133 LPIHVN-QENRPPPVPGGGIDQNMTTLAEVLQRAGYATHQVGKWHCGMSSPDRLPVHRGF 191
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
S GY +G + H +++ +D+ RD +PA+ +G Y T ++ EA IIH H
Sbjct: 192 NSSFGYLSGAE----HHYSQIRQGYVDLWRDTQPAYGENGTYGTFMYAREAERIIHAHDP 247
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
PLFLYLA HS PL+ PD +L+ + ++ R A++ +DE++ V A
Sbjct: 248 TTPLFLYLAFQNVHS-----PLEVPDRFLDKYPDVDYRPRKHCLAMVSAVDEAINNVTLA 302
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L + + +++I+F SDNGG + A+N+P RG K +EGGVR + ++
Sbjct: 303 LRRAGLFDDTLIIFSSDNGGPH-----DHANNYPFRGAKTADFEGGVRAVAFVSGGVVPK 357
Query: 359 R--GIVAEQYVHVSDWLPTLLSAANK 382
G +HV DW TL A +
Sbjct: 358 NMVGTNVHGLMHVCDWYSTLARLAGE 383
>gi|417303628|ref|ZP_12090677.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
gi|327540049|gb|EGF26644.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
Length = 826
Score = 199 bits (506), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 126/339 (37%), Positives = 182/339 (53%), Gaps = 28/339 (8%)
Query: 53 LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSR 111
+V + P+I+ I+ADDLG++DVGF+G +IPTP++D LA SG++ N Y C+PSR
Sbjct: 38 VVGAKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSR 97
Query: 112 SAIMTGKHPIHTGMQHNV-----LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
+ ++TG+H G + N +G + G+PL+E L LKE GY T +GKWHLG
Sbjct: 98 AGLLTGRHQQRFGHESNPEPDTQWHGEDTPGMPLTETTLADALKEAGYVTGAIGKWHLG- 156
Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT 226
K + P RGF+ G+ G Y+ ++ + G+ R EP + TD F+
Sbjct: 157 DAKPFWPNRRGFDEWFGFSGGGFSYWGDLGKKDPLLGV--HRGDEPVDPKTLTHLTDDFS 214
Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH----RHIEDFKRSKFA 282
EAV I H T EP FLYLA+ A H APDH H HIE R+ +
Sbjct: 215 TEAVKFIQRHET-EPFFLYLAYNAPH---------APDHATRAHLQKTAHIEYGGRAVYG 264
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
A++ +DE +G+VV+ + + + N++I+F SDNGG A N+P RG K L+E
Sbjct: 265 AMVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG-----RREHAVNFPYRGHKGMLFE 319
Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
GG+R L+ P G+ E + D PT L+AA
Sbjct: 320 GGIRVPFLVSWPGTVRSGMKEELPITALDLFPTALAAAG 358
>gi|313228605|emb|CBY07397.1| unnamed protein product [Oikopleura dioica]
Length = 492
Score = 199 bits (506), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 180/365 (49%), Gaps = 67/365 (18%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PH+IF+L DDLG++D+G+ D I +PNIDALA + LK +Y CTPSR+A +TG++
Sbjct: 24 PHVIFVLVDDLGFDDLGYVNRDVI-SPNIDALAKDALHLKKHYVQPSCTPSRAAFLTGRY 82
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
I GMQ V+ E G+PL E +L + K+ GYRT + GKWHLGFY ++ P RGF+
Sbjct: 83 NIRMGMQSGVIRPSEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQNRGFD 142
Query: 180 SHLGYWTGHQDYFDHSAEEM-------------------------KMWGLDMRRDLEPAW 214
G++ G QD++ H + + + WG D R
Sbjct: 143 RFYGFYLGSQDFYFHDSGRLSSHKFSYEDEVSGVNNKRWLQGPHGRPWGYDFREAYPANG 202
Query: 215 DL-----------HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
D+ +G YST +F + ++ + H LF Y++ H PLQ
Sbjct: 203 DVENDTILDDFHTNGTYSTKLFVNDFINDLAKHDPAVSLFNYVSFQDVHG-----PLQTV 257
Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
+ + +++ D + I K+ S++VF SDNGG
Sbjct: 258 NKFKKHYKNKTDTWTHERILISTKI------------------TSLLVFTSDNGGRPK-- 297
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR--GIVAEQYVHVSDWLPTLLSAAN 381
ASNWPLRG K T+++GG++ + SPLL++R G HVSDW PTLL+ +
Sbjct: 298 --EGASNWPLRGSKGTIYDGGLKSRAFVASPLLQNRMKGQNYNYLFHVSDWFPTLLTLSG 355
Query: 382 KSDIP 386
D+P
Sbjct: 356 -CDVP 359
>gi|421613320|ref|ZP_16054406.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408495914|gb|EKK00487.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 826
Score = 199 bits (505), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 125/332 (37%), Positives = 179/332 (53%), Gaps = 28/332 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+I+ I+ADDLG++DVGF+G +IPTP++D LA SG++ N Y C+PSR+ ++TG+
Sbjct: 45 PNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGR 104
Query: 119 HPIHTGMQHNV-----LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
H G + N +G + G+PLSE L LKE GY T +GKWHLG K + P
Sbjct: 105 HQQRFGHESNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLG-DAKPFWP 163
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
RGF+ G+ G Y+ ++ + G+ R EP + TD F+ EAV I
Sbjct: 164 NHRGFDEWFGFSGGGFSYWGDLGKKDPLLGV--HRGDEPVEPKTLTHLTDDFSTEAVKFI 221
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH----RHIEDFKRSKFAAILHKLD 289
+ T EP FLYLA+ A H APDH H HIE R+ + A++ +D
Sbjct: 222 QRNET-EPFFLYLAYNAPH---------APDHATRAHLQKTTHIEYGGRAVYGAMVAGMD 271
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
E +G+VV+ + + + N++I+F SDNGG A N+P RG K L+EGG+R
Sbjct: 272 EGIGRVVDQIRESGLGENTMIIFYSDNGG-----RREHAVNFPYRGHKGMLFEGGIRVPF 326
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
L+ P G+ E + D PT L+AA
Sbjct: 327 LVSWPGTVRSGMKEESPITALDLFPTALAAAG 358
>gi|313219585|emb|CBY30507.1| unnamed protein product [Oikopleura dioica]
Length = 617
Score = 199 bits (505), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 122/356 (34%), Positives = 189/356 (53%), Gaps = 42/356 (11%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+FI+ADDLG+ND+G++ ++ TPNI+ LA +GIIL ++Y+ +C+PSRS +TG++
Sbjct: 158 PNIVFIMADDLGYNDIGYNSIEAF-TPNINYLAKNGIILDSHYSQPVCSPSRSQFLTGRY 216
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
GMQH + + G+PL+EK+LP+ KE GY T GKWH G + + Y PT RGF+
Sbjct: 217 SFRYGMQHRNILPTQPHGVPLTEKMLPEVFKECGYSTFGTGKWHQGMFHESYLPTSRGFD 276
Query: 180 SHLGYWTG------HQDYFDHSAEE--MKMWGLDMRRD--LEPAWDLHGKYSTDVFTAEA 229
+G ++G H+ F+ + G DMR +E +L G +D E
Sbjct: 277 KFVGSYSGSSQHSTHEKCFNSPQRDSPFSFCGYDMREAEVVEARQNLTG---SDFLLREI 333
Query: 230 V--DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
+H T +P YE L HR+ +KR+K A++
Sbjct: 334 CIRITVHGPLTTDP----------RDEKIYEEL------YERHRNETIYKRTKMLAMVTG 377
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
LD ++G+++E+ ++ N+I F SDNGG+ A+N P RG KN L EGGVR
Sbjct: 378 LDRAIGRIIESFKKYGKYKNTIFAFASDNGGSLT----YRANNMPFRGGKNDLLEGGVRT 433
Query: 348 AGLIWSPLLESRGIVAE--QYVHVSDWLPTLLSA--ANKSDIPNYVN--STVENII 397
+ S L+ + I + ++DW PTLL A ++ ++PN ++ S EN
Sbjct: 434 LSFLHSSLIPNDKIGSRYSNLFDITDWFPTLLDAGKCSQKNLPNQLDGKSHFENFF 489
>gi|119504674|ref|ZP_01626753.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
gi|119459696|gb|EAW40792.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
Length = 545
Score = 199 bits (505), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 123/363 (33%), Positives = 193/363 (53%), Gaps = 36/363 (9%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L ++ V A S P+I+ ++ADDLGW DVG+HG D I TP++D LA G+ L +
Sbjct: 15 LLMMCVILGVPASAQSQKPNILIMVADDLGWADVGYHGGD-IDTPSLDRLAQQGVRLNRF 73
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
YT +C+P+R+A+MTG+ PI G+ + V++ + G+ E +P+ + GY+T I+GK
Sbjct: 74 YTTPICSPTRAALMTGRDPIRLGVTYGVIFPWDNIGVHPDEHFMPETFQAAGYQTAIIGK 133
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
WHLG + Y P RGFE G+ ++ + + G D +R+ + D G Y
Sbjct: 134 WHLGHAQMTYHPNNRGFEHFYGHLHTEVGFYPPFSNQG---GKDFQRN-GVSIDDQG-YE 188
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP----DHYLNIH------- 270
T + E I D P +Y+ A H+ PL AP D Y +I
Sbjct: 189 TYLLADEVSRYIRERDRDRPFLVYMPFIAPHT-----PLDAPVELQDKYKDIETDLPMAR 243
Query: 271 -RHIEDFK-----------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
R +D + R +AA++ +D+++G+V++ L+Q + N+I++F SDNGG
Sbjct: 244 SRQTDDTRLISRVMLQPSARPMYAAVVDAMDQAIGRVLDTLDQEGISDNTIVLFFSDNGG 303
Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
AA ++ A+N PLRG K +EGG+R L+ P + G + EQ + V D PTL+
Sbjct: 304 AA--YSYGGANNAPLRGGKGETFEGGIRVTSLMRWPAMLEPGQIFEQIMSVMDVFPTLVD 361
Query: 379 AAN 381
AA+
Sbjct: 362 AAD 364
>gi|440716553|ref|ZP_20897058.1| arylsulfatase B [Rhodopirellula baltica SWK14]
gi|436438412|gb|ELP31962.1| arylsulfatase B [Rhodopirellula baltica SWK14]
Length = 498
Score = 199 bits (505), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 126/361 (34%), Positives = 188/361 (52%), Gaps = 30/361 (8%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F LPL F + A S PP+I+ ILADD+G+ D+G G + TPN+D LA SG+
Sbjct: 12 FVFLPLIFVFATE-ASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGV 70
Query: 97 ILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNV-------LYGCERGGLPLSEKILPQY 148
+ Y +C+PSR+ ++TG+ P G + N+ E GLP SEK L +
Sbjct: 71 LCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTLADH 130
Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
L GY T ++GKWHLG + + P RGF+ G TG YF + + + R
Sbjct: 131 LGAAGYATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGGHHYFPTTMNHV------IER 183
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDH 265
+ + + +Y TD FT E + I H + D+P F++ ++ A P+ P+ A +
Sbjct: 184 NGKRVENFSSEYLTDFFTDEGLRFIDQHESANPDQPWFVFFSYNA-----PHTPMHATEA 238
Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
L +I+ KR +AA+++ LD VG++ E LE+ N+++VF SDNGGA
Sbjct: 239 DLARFANIQTQKRRTYAAMMYALDRGVGRIREHLEESGQWGNTLLVFFSDNGGATN---- 294
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
N + N PLRGVK ++ EGG+R +IW+ P G+ + V D LPT SAA
Sbjct: 295 NGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVRYDGVVSSLDLLPTFCSAAGAEP 353
Query: 385 I 385
+
Sbjct: 354 L 354
>gi|29348898|ref|NP_812401.1| arylsulfatase [Bacteroides thetaiotaomicron VPI-5482]
gi|383125065|ref|ZP_09945724.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
gi|29340804|gb|AAO78595.1| arylsulfatase B precursor [Bacteroides thetaiotaomicron VPI-5482]
gi|251837419|gb|EES65514.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
Length = 458
Score = 198 bits (504), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 122/348 (35%), Positives = 188/348 (54%), Gaps = 28/348 (8%)
Query: 39 VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
+L + +S + P+I+ +LADDLGW DVGFHG +I TP ++ALA G++L
Sbjct: 5 ILGASMVVSATTLSFAQQVERPNIVIVLADDLGWGDVGFHG-SEIKTPCLNALAAEGVVL 63
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR- 157
+YT + TP+R+ +MTG++P G++ V+ GL +E+ L L GY R
Sbjct: 64 DRFYTAPISTPTRAGLMTGRYPNRFGIRTTVIPPWREDGLDENEETLADMLARNGYSNRA 123
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH 217
I+GKWHLG +K + P RGF G+ G DYFDH M+ LD D E +D
Sbjct: 124 IIGKWHLGHTRKVHYPINRGFSHFYGHLNGAIDYFDH----MREGELDWHNDWETCYD-- 177
Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF- 276
YST++ T EAV I+ + + P LY+A+ A H+ PLQA + +I + +DF
Sbjct: 178 KGYSTELITQEAVRCINTYEKEGPFLLYVAYNAPHT-----PLQAQEK--DIELYCDDFG 230
Query: 277 -------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
KR + A++ +D +G +V+AL+++ ++ N+ ++F SDNG A + +S
Sbjct: 231 SLTPKEQKRVTYQAMVSCMDRGIGTIVDALKKKGIMDNTFLIFFSDNGPAG----VPGSS 286
Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESR-GIVAEQYVHVSDWLPTL 376
+ LRG K W+GGVR + + ES ++ Q D +PTL
Sbjct: 287 SGKLRGRKFDEWDGGVRVPAVFYWKRAESNYKNLSSQVTGFVDIVPTL 334
>gi|392390175|ref|YP_006426778.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
15997]
gi|390521253|gb|AFL96984.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
15997]
Length = 467
Score = 198 bits (503), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 124/371 (33%), Positives = 206/371 (55%), Gaps = 30/371 (8%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL- 98
L ++ + +++ ++ P+II IL+DD G+ D +G +IPTPNI+ LA G +
Sbjct: 8 LSISLLVVLIYTTQAQTTTKPNIIVILSDDAGYADFECYGNKEIPTPNINRLAKEGTLFS 67
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL------YGCERGGLPLSEKILPQYLKEL 152
K Y + +C PSR+ ++TG++ G ++N + E GL LSEK + +KE
Sbjct: 68 KAYVSASVCAPSRAGLLTGRYQQRFGFENNPTGKPREGFKKEDMGLALSEKTIGDRMKEE 127
Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS---AEEMKMWGLDMRRD 209
GYRT VGKWHLG K + P RGF+ G+ GH+D+F AE+ +W D
Sbjct: 128 GYRTLAVGKWHLGNDAK-FFPLKRGFDEFYGFQEGHRDFFSFKKKRAEKYALWDND---K 183
Query: 210 LEPAWDLHGKYSTDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
+ P ++ Y TD+FT +A+ I N +P F+YLA+ A H+ PLQA + L+
Sbjct: 184 IIPEEEI--TYLTDMFTDKALKFIDENADKKQPFFIYLAYNAVHT-----PLQAKKNDLD 236
Query: 269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
+ R + A+L +D ++G+V++ L+ + + N++++F++DNGGA N +
Sbjct: 237 KFAQVGSEGRQTYDAMLSNMDYNIGRVMQELKNKGIDDNTLVIFLNDNGGATT----NYS 292
Query: 329 SNWPLRGVKNTLWEGGVR-GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
N LRG+K ++WEGGVR G + W+ + + +V ++ V D LPT L+AA +
Sbjct: 293 DNGQLRGMKGSVWEGGVRVGYIMRWNGKIPAN-VVYDKAVSSLDILPTSLAAAGNTRKDK 351
Query: 388 YVNSTVENIIP 398
+++ N++P
Sbjct: 352 HLDGV--NLLP 360
>gi|421611065|ref|ZP_16052220.1| arylsulfatase B [Rhodopirellula baltica SH28]
gi|408498167|gb|EKK02671.1| arylsulfatase B [Rhodopirellula baltica SH28]
Length = 498
Score = 198 bits (503), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 125/361 (34%), Positives = 189/361 (52%), Gaps = 30/361 (8%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F LPL F + A S PP+I+ ILADD+G+ D+G G + TPN+D LA SG+
Sbjct: 12 FVFLPLIFVFATE-ASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGV 70
Query: 97 ILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNV-------LYGCERGGLPLSEKILPQY 148
+ Y +C+PSR+ ++TG+ P G + N+ E GLP SEK L +
Sbjct: 71 LCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTLADH 130
Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
L GY T ++GKWHLG + + P RGF+ G TG YF + + + + R
Sbjct: 131 LGAAGYATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGSHHYFPTTMKHV------IER 183
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDH 265
+ + +Y TD FT E + I H + D+P F++ ++ A P+ P+ A +
Sbjct: 184 NGKRVDGFSSEYLTDFFTDEGLRFIDQHESANPDQPWFVFFSYNA-----PHTPMHATEA 238
Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
L +I++ KR +AA+++ LD VG++ E LE+ N+++VF SDNGGA
Sbjct: 239 DLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGATN---- 294
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
N + N PLRGVK ++ EGG+R +IW+ P G+ + + D LPT SAA
Sbjct: 295 NGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVRYDGVISSLDLLPTFCSAAGAEP 353
Query: 385 I 385
+
Sbjct: 354 L 354
>gi|417301111|ref|ZP_12088281.1| arylsulfatase B [Rhodopirellula baltica WH47]
gi|327542540|gb|EGF29014.1| arylsulfatase B [Rhodopirellula baltica WH47]
Length = 498
Score = 197 bits (502), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 125/361 (34%), Positives = 189/361 (52%), Gaps = 30/361 (8%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F LPL F + A S PP+I+ ILADD+G+ D+G G + TPN+D LA SG+
Sbjct: 12 FVFLPLIFVFATE-ASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGV 70
Query: 97 ILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNV-------LYGCERGGLPLSEKILPQY 148
+ Y +C+PSR+ ++TG+ P G + N+ E GLP SEK L +
Sbjct: 71 LCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPKSEKTLADH 130
Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
L GY T ++GKWHLG + + P RGF+ G TG YF + + + R
Sbjct: 131 LGAAGYATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGSHHYFPTTMNHV------IER 183
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDH 265
+ + + +Y TD FT E + I H + D+P F++ ++ A P+ P+ A +
Sbjct: 184 NGKRVENFSSEYLTDFFTDEGLRFIDQHKSANPDQPWFVFFSYNA-----PHTPMHATEA 238
Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
L +I++ KR +AA+++ LD VG++ E LE+ N+++VF SDNGGA
Sbjct: 239 DLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGATN---- 294
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
N + N PLRGVK ++ EGG+R +IW+ P G+ + + D LPT SAA
Sbjct: 295 NGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVRYDGVISSLDLLPTFCSAAGAEP 353
Query: 385 I 385
+
Sbjct: 354 L 354
>gi|449671825|ref|XP_002165184.2| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
Length = 160
Score = 197 bits (502), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 91/148 (61%), Positives = 115/148 (77%), Gaps = 1/148 (0%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHII I+ADDLGWND+GFHG +I TPNID LA +G++L NYY + +CTPSRSAIMTG++
Sbjct: 10 PHIILIVADDLGWNDIGFHGSKEISTPNIDRLATNGVVLDNYYVLPICTPSRSAIMTGRY 69
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIHTGMQ + +YG G+ L+EK LPQYLK+ GY+T VGKWHLGF+ KEYTPT+RGF+
Sbjct: 70 PIHTGMQQDTIYGPNPYGVSLNEKFLPQYLKQQGYKTYGVGKWHLGFFAKEYTPTYRGFD 129
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMR 207
S+ G + G DY++HS E GLD+
Sbjct: 130 SYYGSYLGKGDYWNHSNTE-TFTGLDLH 156
>gi|340367645|ref|XP_003382364.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 493
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 120/349 (34%), Positives = 185/349 (53%), Gaps = 24/349 (6%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
M+ L L FV A++ P+I+F+L DDLG+ DVGF I +PN D LA +
Sbjct: 1 MSILQCLLVCVLCSCFV---ATAKLPNIVFVLIDDLGFADVGFRN-PAISSPNFDQLAKT 56
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
G++L +Y + C+PSR++++TG+ P H Q N L G ++ +LP LK Y
Sbjct: 57 GLVLNRHYVFKYCSPSRASLLTGRWP-HHAHQWNPLMD-STIGTNINMTMLPAKLKAANY 114
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
T +VGKWHLGF+ Y P RGF++ G++ +D+ + E +D ++ P
Sbjct: 115 STHMVGKWHLGFFDPRYLPINRGFDTSTGFFGCCEDHMN----EKSGCSIDYWKNNAPD- 169
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
+G Y + + D++ N++T+ PLFLYL H+ P QAP +L+I+
Sbjct: 170 PRNGTYDAYNYRDDLTDVMSNYNTENPLFLYLPLHNVHA-----PFQAPKEWLDIYPANS 224
Query: 275 DF-KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
KR A++ D G +VE L+++ + N+I+V +DNGGA A SN+PL
Sbjct: 225 TCEKRHTLQAMISVADNVTGHLVELLKKKGIWDNTIMVISADNGGAPC-----AGSNYPL 279
Query: 334 RGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
+G K T +EGGVR + LL RG + ++H++DW PT A
Sbjct: 280 KGCKTTFFEGGVRSVAFVNGGLLPESRRGQSTDGFIHIADWYPTFCKLA 328
>gi|291225027|ref|XP_002732497.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 461
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 119/320 (37%), Positives = 170/320 (53%), Gaps = 16/320 (5%)
Query: 70 LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNV 129
+GWNDV +H D I PN+ LA G+I YT CTPSR+A+MTG +P TG QH +
Sbjct: 1 MGWNDVHWHNSD-IAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59
Query: 130 LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT-GH 188
++ G+PL K+LP+ LKE+GY T +VGKWHLGF K EY PT RGF+SH G WT G
Sbjct: 60 VFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119
Query: 189 QDYFDHSAEEMKMWGLDMRRDLE--PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYL 246
DY + G D R ++ P D Y + A I++ H PLFL
Sbjct: 120 SDYDKMNGVLSPSAGYDFRDNMGVVPKSD---DYLALMLGDRAEHIVNTHYPGTPLFL-- 174
Query: 247 AHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLS 306
A P + L+ P+ Y + IED + +F L +D+ +G VV+AL+ R M
Sbjct: 175 ---AFTLDIPAKHLEIPEEYEEKYAEIEDDRTRQFYGKLTMMDDIIGGVVDALKNRGMYD 231
Query: 307 NSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
+S+I+F+ DNG ++ + SN+P RG+ TL+EG R ++ ++ G + +
Sbjct: 232 DSLIIFIGDNGALSS----QSGSNYPFRGIAGTLFEGATRVPSIVSGKGIKKTGYESNEL 287
Query: 367 VHVSDWLPTLLSAANKSDIP 386
+ D T+L A P
Sbjct: 288 YSIVDIHRTILDVAGAEAEP 307
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 21/123 (17%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTKGKWKLVKVVKVMRYQVDLTGG 631
E+DG+ +W S+ +PS RN IL+NIDD+ +A+ G +KL+ TG
Sbjct: 308 ELDGLSMWPTWSKGKPSPRNEILYNIDDDPVSPGAAIRIGDYKLI------------TGH 355
Query: 632 PDQVYLS---GLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKN 688
PD +Y L+D + A+ + + PQ LF+I DP E+N
Sbjct: 356 PDLLYPYRDISLTDNYY----NYGDAPASGLPSTDQQSHAPAPQNVEYLFNIVVDPEERN 411
Query: 689 NLA 691
+++
Sbjct: 412 DIS 414
Score = 39.7 bits (91), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 30/45 (66%), Gaps = 2/45 (4%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTRGKWKLV 551
E+DG+ +W S+ +PS RN IL+NIDD+ +A+ G +KL+
Sbjct: 308 ELDGLSMWPTWSKGKPSPRNEILYNIDDDPVSPGAAIRIGDYKLI 352
>gi|326426859|gb|EGD72429.1| hypothetical protein PTSG_00448 [Salpingoeca sp. ATCC 50818]
Length = 540
Score = 197 bits (500), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 123/348 (35%), Positives = 186/348 (53%), Gaps = 34/348 (9%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHIIF L DDLGWN FH ++I TP + L +G+ L ++YT C+P+R++ +TG+
Sbjct: 30 PHIIFFLTDDLGWNAPSFHN-NEIITPTLHHLHANGVELYSHYTYMFCSPTRASFLTGRF 88
Query: 120 PI-HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
P H N+L GL LS L LK+ Y T +GKWHLG YKKEYTP +RGF
Sbjct: 89 PYKHEMTNTNLLPPTRMLGLDLSYTTLADKLKQANYSTHHIGKWHLGMYKKEYTPRYRGF 148
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
++ G+ TG ++++ A + +D+ + PA+ ++G ++ +FT A+DII N++
Sbjct: 149 DTTFGFLTGGENHYTQRA-FVSPPAVDLWDEEAPAYGMNGTWTGKMFTDAALDIIRNNAQ 207
Query: 239 -------DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK--RSKFAAILHKLD 289
PLF+Y A H+ P Q+P ++ D +S F ++ +D
Sbjct: 208 LRNATGDAPPLFIYFALHDVHA-----PTQSPVRLAQLYNFTGDDAGLKSTFYGMISGVD 262
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG-FN--------------LNAASNWPLR 334
E+V V +AL+ M +++++V+ SDNG G FN SN+PLR
Sbjct: 263 EAVRNVTDALKAAGMWNDTLLVWTSDNGSPTMGSFNGGCKNVVPDSRGGACMCGSNYPLR 322
Query: 335 GVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAA 380
G K+T +EGGVR + P+L + RG + +HV DW TL S A
Sbjct: 323 GSKHTNFEGGVRTPTFVSGPVLPAHMRGKRLDGLMHVCDWYSTLCSVA 370
>gi|449137658|ref|ZP_21772978.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
gi|448883711|gb|EMB14224.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
Length = 810
Score = 196 bits (499), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 130/350 (37%), Positives = 184/350 (52%), Gaps = 26/350 (7%)
Query: 44 FTLSMVFVDLVASSGP------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
F SM+ LV ++ P+I+ I+ADDLG++DVGF+G +IPTP +D LA G++
Sbjct: 7 FLSSMLIGLLVLAASSVDAKERPNIVLIVADDLGYSDVGFNGCKEIPTPRLDELAGEGVV 66
Query: 98 LKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNV-----LYGCERGGLPLSEKILPQYLKE 151
N Y C+PSR+ ++TG++ G + N +G + G+PLSE L LKE
Sbjct: 67 FTNGYASHPYCSPSRAGLLTGRYQQRFGHEGNPEPDPQWHGDDTPGMPLSETTLADALKE 126
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
GY T +GKWHLG K + P RGF+ G+ G Y+ + G+ R E
Sbjct: 127 AGYVTGAIGKWHLG-DAKPFWPNRRGFDEWFGFSGGGLSYWGDLGRMDPLLGV--HRGDE 183
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
P Y TD F+ EAV I H TD P FLYLA+ A P+ P QA +L
Sbjct: 184 PVDRKSLTYLTDDFSTEAVKFIQRHETD-PFFLYLAYNA-----PHAPDQATRAHLEKTA 237
Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
HIE R+ + A++ +DE +G+VV+ + + + N++IVF SDNGG A N
Sbjct: 238 HIEYGGRAVYGAMVAGMDEGIGRVVDQIRESGLDDNTLIVFYSDNGG-----RREHAMNL 292
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
P RG K L+EGG+R L+ P G+ + V D PT+L+AA
Sbjct: 293 PYRGHKGMLFEGGIRVPFLVSWPGTLPGGVREQSPVTALDVFPTVLAAAG 342
>gi|119626708|gb|EAX06303.1| hCG20717, isoform CRA_a [Homo sapiens]
Length = 427
Score = 196 bits (499), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 99/225 (44%), Positives = 138/225 (61%), Gaps = 11/225 (4%)
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL 216
+VGKWHLGFY+KE PT RGF++ G G DY+ H + M G D+ + AWD
Sbjct: 1 MVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDY 60
Query: 217 -HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
+G YST ++T I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I +
Sbjct: 61 DNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIIN 115
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
R ++AA+L LDE++ V AL+ +NSII++ SDNGG SNWPLRG
Sbjct: 116 INRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRG 171
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
K T WEGG+R G + SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 172 SKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA 216
>gi|254515652|ref|ZP_05127712.1| arylsulfatase B [gamma proteobacterium NOR5-3]
gi|219675374|gb|EED31740.1| arylsulfatase B [gamma proteobacterium NOR5-3]
Length = 507
Score = 196 bits (498), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 141/399 (35%), Positives = 202/399 (50%), Gaps = 63/399 (15%)
Query: 25 LKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGP---PHIIFILADDLGWNDVGFHGLD 81
+K G R ++ +V LA + AS G P+++ ILADD+GWNDVG+HG D
Sbjct: 3 IKIPGSRIALIFLSVTVLAAAQTGAQTRSSASPGAIVRPNVVIILADDMGWNDVGYHGSD 62
Query: 82 QIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLS 141
I TP+ID LA G+ L +Y C+P+R+A+++G+ G+ ++ L GL L
Sbjct: 63 -IHTPHIDQLAAEGLELDRFYAQTACSPTRAALLSGQSSQSLGI-YSPLSKLNPTGLALD 120
Query: 142 EKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKM 201
+KI+P Y ++ GY+T +VGKWHLGFY+ EY P RGF+ G TG Y++H +
Sbjct: 121 QKIMPAYFRDAGYQTFMVGKWHLGFYEPEYRPLARGFDHFYGNLTGGVGYWNH----VHG 176
Query: 202 WGLDMRRDLEPAWDLHGK------YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSAN 255
GLD W +GK YST + +AE +I ++PLFLY A A
Sbjct: 177 GGLD--------WQRNGKTLRQEGYSTHLQSAEITRLIQQRDPEKPLFLYAAFNA----- 223
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
P+ P +AP L + HIE+ R AA++ +LD ++G+++E L ML N++I F+SD
Sbjct: 224 PHLPNEAPADTLARYAHIENPNRRIHAAMVTELDSAIGQLMETLSTEGMLENTLIWFMSD 283
Query: 316 NGG---AAAGFNLNAAS-----------------------------NWPLRGVKNTLWEG 343
NGG A L + S N P R K +++EG
Sbjct: 284 NGGLNRTAMPSGLVSMSQRLEDWFGKPLFPKTLEFIRTNALDGGSDNSPHRKGKQSIYEG 343
Query: 344 GVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
G R + W L I Q V V D LPTLLSA +
Sbjct: 344 GARVPSFVYWKGRLSPERIT--QMVTVKDVLPTLLSATD 380
>gi|421614608|ref|ZP_16055661.1| arylsulfatase B [Rhodopirellula baltica SH28]
gi|408494617|gb|EKJ99222.1| arylsulfatase B [Rhodopirellula baltica SH28]
Length = 472
Score = 196 bits (498), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 129/375 (34%), Positives = 189/375 (50%), Gaps = 47/375 (12%)
Query: 37 FAVLP-LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
FA++ AF + V + S PPHI+ I+ADDLGWNDVGFHG +I TPNID LA
Sbjct: 5 FAIMTCFAFVGANVSAETNGGSRPPHIVHIIADDLGWNDVGFHG-SEIRTPNIDRLASES 63
Query: 96 IILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYR 155
+ L +Y +C+P+R+ ++TG +P G+ V+ ++ GLP + P++L +LGY
Sbjct: 64 VTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVVSPTKKHGLPPQLETTPEHLSKLGYD 123
Query: 156 TR-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
R + GKWHLG + P G G++ G DYF + LD RD +
Sbjct: 124 HRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAIDYFSRE----RFGQLDWHRDFD--- 176
Query: 215 DLHGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA----------- 262
+H + YST++ VD I ++ P++ Y+A A HS PLQA
Sbjct: 177 SVHEEGYSTELVGNAVVDFIDRNANAGPVYAYVAFNAPHS-----PLQALRSDLDEYGFD 231
Query: 263 PDHYL--NIHRHIE------DFK--------RSKFAAILHKLDESVGKVVEALEQRRMLS 306
P L N R I D+ R FAA+ +D +G++++A+++ M
Sbjct: 232 PTGKLAPNTDRKIAKREKALDYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRE 291
Query: 307 NSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
N+++VF SDNG + SN PLRG K T WEGGVR ++ P GI +
Sbjct: 292 NTLVVFHSDNGADPK----HGGSNEPLRGNKFTTWEGGVRVVAMMRWPRELPAGIAYDSV 347
Query: 367 VHVSDWLPTLLSAAN 381
D LPT+++AA
Sbjct: 348 TSYVDLLPTMVAAAG 362
>gi|323451693|gb|EGB07569.1| hypothetical protein AURANDRAFT_2707, partial [Aureococcus
anophagefferens]
Length = 351
Score = 195 bits (496), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 129/358 (36%), Positives = 180/358 (50%), Gaps = 51/358 (14%)
Query: 59 PPHIIFILADDLGWNDVGFH----GLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
PP++I LADDLG NDVG+ G +I +P IDALA ++L +Y +CTP+R+A+
Sbjct: 10 PPNLIVFLADDLGRNDVGYAHRDGGPGRIASPRIDALAAESLVLDRFYAQPMCTPTRAAL 69
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
MTG+H TG+ + VL + GLP +E + + L++ GY T ++GKWHLGF KK PT
Sbjct: 70 MTGRHAYRTGLAYFVLLANQGTGLPAAEVTVAERLRDAGYSTHMIGKWHLGFAKKAMLPT 129
Query: 175 FRGFESHLGYWTGHQDYFDHSAEEM--KMWGLDMRRDLEPA---------WD-------- 215
RGF+ GY G DY+ H + E + D EP WD
Sbjct: 130 SRGFDRFFGYCLGSSDYWLHQSPEWVPGVPSTDRATGAEPPTTGGMGHDLWDGATPLPRT 189
Query: 216 --LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI-HRH 272
+S D+F A A + D+PLFLY A A H+ H++ RH
Sbjct: 190 PKTENVHSADLFAARATETFATAPRDKPLFLYYASQAPHA-----------HFVGAPPRH 238
Query: 273 IEDF---------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
+ R+ AA++ LDE VG VV+ + R N+++ F+ DNG
Sbjct: 239 VAAAAAMGFAAGDPRADVAALVAALDEQVGNVVDGVRASRGWENTVLWFMGDNGPEPG-- 296
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
AS+WPLRG K TL+EGGVR +++P + G + VHV D PTLL A
Sbjct: 297 --TGASSWPLRGAKRTLFEGGVREPSFVYAPGRVAPG-ASSAPVHVIDVAPTLLGLAG 351
>gi|319954018|ref|YP_004165285.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
gi|319422678|gb|ADV49787.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
Length = 434
Score = 195 bits (496), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 122/321 (38%), Positives = 173/321 (53%), Gaps = 20/321 (6%)
Query: 64 FILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIH 122
IL DD GW DVGF+G IPTPN+D LA G+I N Y C+PSR+ ++TG++
Sbjct: 1 MILTDDQGWADVGFNGATDIPTPNLDRLASEGVIFSNGYVSHPYCSPSRAGLLTGRYQAR 60
Query: 123 TGMQHNVLYGCERG---GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
G N+ Y + G PLSEK++ + LKE GYRT +GKWHLG + Y P +GF+
Sbjct: 61 FGHDCNMPYDGKNDASVGTPLSEKMISEALKEQGYRTSAIGKWHLGDHPDLYPPA-QGFD 119
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
G+ G +Y+ S E++ + R + P +L Y TD FT EA+ I +
Sbjct: 120 HWFGFPGGGMNYWGESKNEIQT--IYRNRKVVPEEEL--TYLTDDFTTEAIRFI-TQKDE 174
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
+P F+YLA+ A H+ P QA YL +HIE RS +AA+++ +D +VGK+ L
Sbjct: 175 KPFFMYLAYNAPHA-----PDQATKEYLEKTKHIEYAGRSVYAAMVNAVDANVGKIDSTL 229
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
+ N+I+VF+SDNGG + A N P RG K L+EGG++ I P
Sbjct: 230 IANGLKENTILVFLSDNGG-----RIEHADNRPYRGHKGMLFEGGIKVPFFITWPTKIKE 284
Query: 360 GIVAEQYVHVSDWLPTLLSAA 380
+ + D PT L+AA
Sbjct: 285 KYAYNEPISSLDLFPTFLNAA 305
>gi|86141258|ref|ZP_01059804.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
gi|85831817|gb|EAQ50272.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
Length = 461
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 118/354 (33%), Positives = 187/354 (52%), Gaps = 41/354 (11%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+ + I+ADD GWND FHG +I TPN+D LA G+ L +YT C+P+R++++TG+
Sbjct: 39 PNFLVIIADDAGWNDFSFHG-SEIQTPNLDQLAGKGLTLDRFYTYPTCSPARASLLTGRP 97
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
G+ + G LP S LPQ L +L Y+T ++GKWHLG K E P GF+
Sbjct: 98 ASRMGIVAPI-SGRSELNLPDSITTLPQALSKLNYKTALMGKWHLGL-KPESGPEVYGFD 155
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY------STDVFTAEAVDII 233
G+ G D + H+ ++ + W +GK+ TD+ T AV I
Sbjct: 156 FSYGFLHGQLDQYAHT-----------YKNGDSTWYRNGKFISEKGHVTDLLTQSAVHYI 204
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
TD+ +L +A++A P+ PLQ P +L + I+D R +AA + +D +G
Sbjct: 205 DTLQTDQNFYLQVAYSA-----PHIPLQEPQEWLEKYTGIKDSSRRAYAAAMTHMDAGIG 259
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGG-----------AAAGFNLNAASNWPLRGVKNTLWE 342
++++ L+ + + N++++FVSDNG G N + SN PLR K + +E
Sbjct: 260 EILQKLKDKDLEKNTVVLFVSDNGAQEKWVPNTQYDGKYGPNYSLGSNLPLRDFKTSNYE 319
Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENI 396
G +R +I P + G + Y++V+DW+PT L+ AN ++P STVE +
Sbjct: 320 GALRVPAIISWPENLNSG-TSTNYINVTDWMPTFLNWANAEELP----STVEGV 368
>gi|149197521|ref|ZP_01874572.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139539|gb|EDM27941.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 465
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 135/375 (36%), Positives = 201/375 (53%), Gaps = 29/375 (7%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHG-LDQIPTPNIDALAYSGIIL 98
L L F L V L AS+ P+II ILADDLG+ DV +HG L + TP+ID++A SG
Sbjct: 4 LALQF-LCFVLASLSASAAKPNIIVILADDLGYGDVSYHGTLKETTTPHIDSIAQSGAWF 62
Query: 99 KNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNV----LYGCERGGLPLSEKILPQYLKELG 153
+N Y+ +C PSR+ +++G++ G N+ L GLPLS+K++P+ L + G
Sbjct: 63 QNGYSAAPVCGPSRAGLLSGRYQQRFGYYDNIGPFTLNKDVEAGLPLSQKLIPEILVKEG 122
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF-----DHSAEEMKMWGLDMRR 208
Y T +VGKWH G K + P RGF+ G+ G + + +H+ +E WG R
Sbjct: 123 YATGMVGKWHDGDQHK-FWPYNRGFQEFYGFNNGAINNWVLKGENHTVDE---WGAVHRE 178
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
+ + G+Y T+ F EAV+ I H T EP FLYL+ A H PLQAP Y N
Sbjct: 179 NKRV--ENSGEYMTEAFGREAVEFIDRHKT-EPFFLYLSFNAVHG-----PLQAPKSYTN 230
Query: 269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
+HI+ R+ A+L +D+++G V+E L + + N+II F SDNGG G N +
Sbjct: 231 QFKHIKPENRALCLAMLKSMDDNIGLVLEKLRKEGLEENTIIFFTSDNGGKLKG---NYS 287
Query: 329 SNWPLRGVKNTLWEGGVRGA-GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
N RG KNT+++GG+ + W + ++ E VH D T+ +AA +I +
Sbjct: 288 FNGKYRGEKNTVFDGGLHVPYAVQWKAQIPAQTKALEAPVHSIDLAHTIFAAAG-VEIKD 346
Query: 388 YVNSTVENIIPRYEN 402
N++P +N
Sbjct: 347 EYKLDGRNLLPYLKN 361
>gi|449138178|ref|ZP_21773473.1| arylsulfatase B [Rhodopirellula europaea 6C]
gi|448883202|gb|EMB13740.1| arylsulfatase B [Rhodopirellula europaea 6C]
Length = 489
Score = 194 bits (494), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 127/352 (36%), Positives = 181/352 (51%), Gaps = 46/352 (13%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHI+ I+ADDLGWNDVGFHG D I TPNID LA ++L +Y +C+P+R+ ++TG
Sbjct: 45 PPHIVHIIADDLGWNDVGFHGSD-IRTPNIDRLARESVVLDRFYVTPICSPTRAGVLTGL 103
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR-IVGKWHLGFYKKEYTPTFRG 177
+P G+ V+ ++ GLP + P++L +LGY R + GKWHLG + P G
Sbjct: 104 YPFRFGIWGGVVSPTKKHGLPSELETTPEHLAKLGYDHRAMFGKWHLGLASTLFHPLRHG 163
Query: 178 FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTDVFTAEAVDIIHNH 236
G++ G DYF + LD R+ + +H + YST++ VD I H
Sbjct: 164 MTEFYGHYNGAIDYFSRE----RFGQLDWHRNHD---SVHEEGYSTELVGNAVVDFIDRH 216
Query: 237 STDEPLFLYLAHAATHSANPYEPLQA-----------PDHYL--NIHRHI-------EDF 276
+ +PL+ Y+A A HS PLQA P L N R I E
Sbjct: 217 AGQQPLYAYVAFNAPHS-----PLQALRSDLDDYGFDPTGKLAPNTDRKIAKRENALEYG 271
Query: 277 KRSK-------FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
KR K FAA+ +D +G++++A+++ M N+++VF SDNG + S
Sbjct: 272 KRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGADPK----HGGS 327
Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
N PLRG K T WEGGVR + P G+ E D LPT+++AA
Sbjct: 328 NEPLRGNKFTTWEGGVRVVATMRWPREFPAGVTYESVASYVDLLPTMVAAAG 379
>gi|325109725|ref|YP_004270793.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969993|gb|ADY60771.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 471
Score = 194 bits (494), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 126/354 (35%), Positives = 189/354 (53%), Gaps = 25/354 (7%)
Query: 30 YRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNID 89
+R ++ VL + F+L A++G P++I I++DD GW+DVGF+G +IPTP++D
Sbjct: 3 FRLVLVCLCVLGVLFSLLPE-----ANAGQPNVIVIVSDDQGWSDVGFNGCKEIPTPHLD 57
Query: 90 ALAYSGIILK-NYYTVQLCTPSRSAIMTGKHPIHTGMQ-----HNVLYGCERGGLPLSEK 143
ALA SG+ Y + C+PSR+ ++TG++ G + H E GLPLSE
Sbjct: 58 ALAKSGVAFDCGYASHPYCSPSRAGLLTGRYQQRFGHECNPGAHGNDDAIEMEGLPLSET 117
Query: 144 ILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWG 203
+L + GYRT +GKWHLG + ++ PT RGFE G+ G Y+ ++ + G
Sbjct: 118 LLSTVFRNAGYRTGAIGKWHLG-DEPQFWPTERGFEEWFGFSGGGLSYWGDLGKKPPLHG 176
Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
+ D+ P +L Y TD F+ EAV + + P FLYLA+ A H+ P Q
Sbjct: 177 VLRNGDVVPKDEL--TYLTDDFSTEAVKFVEENRA-RPFFLYLAYNAPHA-----PDQVT 228
Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
+L HIE R+ + A++ +DE +G+VV+AL+ + + ++I F SDNGG A
Sbjct: 229 REHLKKTSHIEYGGRAVYGAMVAAMDEGIGRVVQALDDQNLRDETVIFFYSDNGGRA--- 285
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
A N+P RG K L+EGG+R + P G E V D PT+L
Sbjct: 286 --TLARNFPYRGHKGMLFEGGIRVPFCVSWPEQIEGGRRYEPPVTALDIFPTVL 337
>gi|440713713|ref|ZP_20894310.1| arylsulfatase B [Rhodopirellula baltica SWK14]
gi|436441429|gb|ELP34656.1| arylsulfatase B [Rhodopirellula baltica SWK14]
Length = 472
Score = 194 bits (494), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 130/378 (34%), Positives = 190/378 (50%), Gaps = 49/378 (12%)
Query: 35 MAFA---VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
MAF V AF + V + S PPHI+ I+ADDLGWNDVGFHG +I TPNID L
Sbjct: 1 MAFPFAIVACFAFVGANVSAEANGGSQPPHIVHIIADDLGWNDVGFHG-SEIRTPNIDRL 59
Query: 92 AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
A + L +Y +C+P+R+ ++TG +P G V+ ++ GLP + P++L +
Sbjct: 60 ANESVTLDRFYVTPICSPTRAGVLTGLYPFRFGFWGGVVSPTKKHGLPPQLETTPEHLSK 119
Query: 152 LGYRTR-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
LGY R + GKWHLG + P G G++ G DYF + LD R+
Sbjct: 120 LGYDHRAMFGKWHLGLASTLFHPLQHGMTEFYGHYNGAIDYFSRE----RFGQLDWHRNF 175
Query: 211 EPAWDLHGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA------- 262
+ +H + YST++ VD I ++ P++ Y+A A HS PLQA
Sbjct: 176 D---SVHEEGYSTELVGNAVVDFIDRNANAGPVYAYVAFNAPHS-----PLQALRSDLDE 227
Query: 263 ----PDHYL--NIHRHIE------DFK--------RSKFAAILHKLDESVGKVVEALEQR 302
P++ L N R I D+ R FAA+ +D +G++++A+++
Sbjct: 228 YGFDPNNKLAPNTDRKIAKREKALDYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRN 287
Query: 303 RMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIV 362
M N+++VF SDNG + SN PLRG K T WEGGVR ++ P GI
Sbjct: 288 GMRENTLVVFHSDNGADPK----HGGSNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGIT 343
Query: 363 AEQYVHVSDWLPTLLSAA 380
+ D LPT+++AA
Sbjct: 344 YDSVTSYVDLLPTMVAAA 361
>gi|345510992|ref|ZP_08790548.1| arylsulfatase B [Bacteroides sp. D1]
gi|229442597|gb|EEO48388.1| arylsulfatase B [Bacteroides sp. D1]
Length = 498
Score = 194 bits (493), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 122/357 (34%), Positives = 192/357 (53%), Gaps = 28/357 (7%)
Query: 30 YRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNID 89
Y++ IM L +++ L A + P+I+ +LADDLGW DVGFHG +I TP++D
Sbjct: 36 YKSYIMKEEFFGLLCGCTLLPAFLHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPSLD 94
Query: 90 ALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
AL G+ L+ +YT + TP+R+ +MTG++P G++ V+ GL +E+ + L
Sbjct: 95 ALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADML 154
Query: 150 KELGYRTR-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
GY+ R I+GKWHLG KK + P RGF G+ G DYFD + E LD
Sbjct: 155 ARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHN 210
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
D E D YST++ T EA+ I + + P LY+A+ A H+ PLQA + +
Sbjct: 211 DWETCHD--KGYSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--D 261
Query: 269 IHRHIEDF--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
I + ++F K++ ++A++ +D +G +V+AL+++ ++ N+ +F SDNG A
Sbjct: 262 IKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG 321
Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+ +S+ PLRG K W+GG A L W + ++ Q D +PTL
Sbjct: 322 ----VPGSSSGPLRGHKFDEWDGGGHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 374
>gi|449138580|ref|ZP_21773837.1| arylsulfatase B [Rhodopirellula europaea 6C]
gi|448882842|gb|EMB13399.1| arylsulfatase B [Rhodopirellula europaea 6C]
Length = 498
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 128/364 (35%), Positives = 189/364 (51%), Gaps = 33/364 (9%)
Query: 38 AVLPLAFTLSMVFVDLVAS----SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
AV L F VF AS S PP+I+ ILADD+G+ D+G G + TPN+D LA
Sbjct: 8 AVSRLVFLSLTVFFATKASRAAESAPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAE 67
Query: 94 SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKIL 145
SG++ Y +C+PSR+ ++TG+ P G + N+ ER GLP+SEK L
Sbjct: 68 SGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDERYATRPELLGLPVSEKTL 127
Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
+L GY T ++GKWHLG + + P RGF+ G TG YF + +
Sbjct: 128 GDHLGAAGYATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGSHHYFPTTMNHV------ 180
Query: 206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQA 262
+ R+ + D +Y TD FT E + I H D+P F++ ++ A P+ P+ A
Sbjct: 181 IERNGQRVEDFSNEYLTDFFTDEGLRFIDQHEAAKPDQPWFVFYSYNA-----PHTPMHA 235
Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
+ L +I++ KR +AA++ LD VG++ + LE+ N+++VF SDNGGA
Sbjct: 236 TEADLARFANIQNKKRRTYAAMMFALDRGVGRIRKHLEETGQWKNTLLVFFSDNGGATN- 294
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
N + N PLRGVK ++ EGG+R +IW+ P G + + D LPT SAA
Sbjct: 295 ---NGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGKRYDGVISSLDLLPTFCSAAG 350
Query: 382 KSDI 385
+
Sbjct: 351 AQPL 354
>gi|340377481|ref|XP_003387258.1| PREDICTED: arylsulfatase I-like [Amphimedon queenslandica]
Length = 507
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 121/362 (33%), Positives = 191/362 (52%), Gaps = 32/362 (8%)
Query: 42 LAFTLSMV-FVDLVAS---SGPPHIIFILADDLGWNDVGFH---GLDQIPTPNIDALAYS 94
A TL+++ + LVA + PHI+ +L DD GW +VG+H ++ TPNID L
Sbjct: 2 FAVTLALLSMIGLVAGQPVTLKPHIVMMLVDDWGWANVGYHRNPPTREVVTPNIDDLVKQ 61
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQH---------NVLYGCERGGLPLSEKIL 145
G+ L +Y + C+PSRS++++G+ PIH Q+ + + G +P + +
Sbjct: 62 GLELNQHYAYRCCSPSRSSLISGRLPIHVSDQNIAPTNYNPNDPISGFS--AIPRNMTGI 119
Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGL- 204
+ +KE GY T VGKW G ++TP RGF++ GY+ + DY+ + G+
Sbjct: 120 AEKMKEAGYATHQVGKWDAGMATPDHTPKGRGFDTSFGYFHHYNDYYTEVVDSCNGTGVV 179
Query: 205 DMRRDLEPAWDLHG----KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPL 260
D+ +PA ++G KY +F +DI+ H PLFLY A H+ PL
Sbjct: 180 DLWNTDQPAHGINGTGPDKYEEALFRERLLDIVSKHDPSTPLFLYYAPHIVHT-----PL 234
Query: 261 QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
Q PD YLN I+D R + A+++ LD+ VG +V AL+++ M N + V SDNGG
Sbjct: 235 QVPDEYLNKFSFIDDKDRMYYHAMVNYLDDVVGDLVSALKKKGMWDNLLYVTSSDNGGPV 294
Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLS 378
+ A+N+PL+G K + W+GG+R + L + RG + Y+H++DW T +
Sbjct: 295 --YPFGGANNYPLKGGKLSDWQGGIRVNAFVSGGHLPEKMRGQKTDGYIHLADWYGTFCA 352
Query: 379 AA 380
A
Sbjct: 353 IA 354
>gi|340367643|ref|XP_003382363.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 493
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 115/331 (34%), Positives = 177/331 (53%), Gaps = 21/331 (6%)
Query: 53 LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
A++ P+I+F+L DD G+ DVGF I +PN D LA +G++L +Y + C+PSR+
Sbjct: 16 FAATAKVPNIVFVLVDDWGFADVGFRN-PAISSPNFDQLAKTGLVLNRHYVFKYCSPSRA 74
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
+ +TG+ P H Q N L G L+ +LP LK Y T +VGKWHLGF+ Y
Sbjct: 75 SFLTGRWP-HHAHQWNPLMD-NMIGTNLNMTMLPAKLKAANYATHMVGKWHLGFFDPRYL 132
Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
P RGF++ G+ G +D+ + E +D ++ P +G Y + + D+
Sbjct: 133 PINRGFDTSTGFLGGGEDHMN----EKSGCSIDYWKNNAPD-PRNGTYDAYNYRDDLTDV 187
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF-KRSKFAAILHKLDES 291
++NH+ D PLF YL H+ PLQAP +L+I+ KR A++ D
Sbjct: 188 MNNHNADNPLFFYLPLHNVHT-----PLQAPKEWLDIYPANSTCEKRHILQAMVSVADNV 242
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
G +VE L+++ M N+I+V +DNGGA + SN+PL+G K+T +EGG+R +
Sbjct: 243 TGHLVELLKKKGMWDNTIMVISADNGGAPC-----SGSNYPLKGCKSTFFEGGIRSLAFV 297
Query: 352 WSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
LL RG + ++H++DW T A
Sbjct: 298 NGGLLPESRRGQSTDGFIHIADWYTTFCKLA 328
>gi|374619563|ref|ZP_09692097.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374302790|gb|EHQ56974.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 539
Score = 194 bits (492), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 124/374 (33%), Positives = 188/374 (50%), Gaps = 41/374 (10%)
Query: 32 TRIMAFAVLPLAF-TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
RI AF + L F T S VF + P+II ++ADDLGW DVGFHG I TP++D
Sbjct: 2 VRIGAFCLSVLVFLTSSSVF-----AGDKPNIIVMVADDLGWADVGFHGNQIIETPSLDR 56
Query: 91 LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
+A G L +YT +C+P+R+A+MTG+ PI G+ ++ + G+ E LP+
Sbjct: 57 IAAEGTQLNRFYTTPICSPTRAALMTGRDPIRLGVAYSTIMPWHNNGIHPEETFLPELFA 116
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
GY+T +VGKWHLG ++ Y P RGFE G+ +F A + G D +R+
Sbjct: 117 GAGYQTAMVGKWHLGHAQQTYHPNARGFEHFYGHLHTEVGFFPPFA---SLGGKDFQRN- 172
Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
+ D G Y + + E I +P F+Y+ A H+ PL AP+ +
Sbjct: 173 GVSIDDQG-YESYLLADEVSRYIRERDAAKPFFIYMPFIAPHT-----PLDAPEDLKAKY 226
Query: 271 RHIEDFK-----------------------RSKFAAILHKLDESVGKVVEALEQRRMLSN 307
+ED + R +AA++ +D+S+G+V++ L+ + N
Sbjct: 227 ADMEDDRGKSRSKMADNTRFMAKVTRRESARPMYAAVVDGMDQSIGRVLDTLDAEGLTDN 286
Query: 308 SIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYV 367
+I++F SDNGGA + + A N PLRG K +EGG+R + P + G + +
Sbjct: 287 TIVLFFSDNGGAV--YAIGGADNAPLRGGKGDTFEGGIRVVATMRWPEKIAPGGKVDSIM 344
Query: 368 HVSDWLPTLLSAAN 381
V D PTLL+AA
Sbjct: 345 SVMDVFPTLLAAAG 358
>gi|431899642|gb|ELK07596.1| Arylsulfatase J [Pteropus alecto]
Length = 637
Score = 194 bits (492), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 110/241 (45%), Positives = 141/241 (58%), Gaps = 11/241 (4%)
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+V+ + LPL LPQ L+E GY T +VGKWHLGFY+ E PT RGF+S
Sbjct: 112 IHTGLQHSVIRPTQPNCLPLDAATLPQKLREAGYATHMVGKWHLGFYRTECLPTRRGFDS 171
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHST 238
G G DY+ H + + G D+ + AW G YST ++T I+ +H
Sbjct: 172 FFGSLLGSGDYYTHYKCDSPGVCGYDLHENDRAAWGRGGGAYSTQMYTQRVQRILASHDP 231
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
+PLFLY+A+ A HS PLQAP YL +R I + +R ++AA+L LDE+V V A
Sbjct: 232 GKPLFLYIAYQAVHS-----PLQAPGRYLGHYRSIANARRRRYAAMLSCLDEAVANVTLA 286
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L+ NS+IV+ SDNGG A SNWPLRG K T WEGGVR G + SPLL
Sbjct: 287 LKAHGFYDNSVIVYSSDNGGQPAA----GGSNWPLRGRKGTYWEGGVRAVGFVHSPLLRQ 342
Query: 359 R 359
R
Sbjct: 343 R 343
>gi|340367647|ref|XP_003382365.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 490
Score = 193 bits (491), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 115/331 (34%), Positives = 176/331 (53%), Gaps = 21/331 (6%)
Query: 53 LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
VA++ P+I+F+L DD G+ DVGF I +PN D LA +G++L +Y + C PSR+
Sbjct: 17 FVATAKLPNIVFVLVDDWGFADVGFRN-PAISSPNFDQLAKTGLVLNRHYVFKYCAPSRA 75
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
+++TG+ P H Q N+ G L+ + P LK Y T +VGKWH GF+ Y
Sbjct: 76 SLLTGRWPHHV-YQWNLATD-ATAGTNLNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYL 133
Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
P RGF++ GY G D+F+ + + +D ++ P +G Y + + + +I
Sbjct: 134 PINRGFDTSSGYLCGWVDHFN----QKQGCAVDCWKNNAPD-PRNGTYDSYYYRDDLTNI 188
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK-RSKFAAILHKLDES 291
++NH + PLF+YL H+ PL+AP +L+I+ K R A++ D
Sbjct: 189 VNNHDANNPLFIYLPLHNVHT-----PLEAPKEWLDIYPANSTCKTRHTLQAMVSVADNV 243
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
G +VE L+++ M N+I+V +DNGG ASN+PLRG K T +EGGVR +
Sbjct: 244 TGHLVELLKEKEMWDNTIMVISADNGGDQC-----VASNYPLRGCKYTFFEGGVRSLAFV 298
Query: 352 WSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
LL RG + ++H+SDW T A
Sbjct: 299 NGGLLPESRRGQSTDGFIHISDWYTTFCKLA 329
>gi|298710054|emb|CBJ31771.1| Formylglycine-dependent sulfatase, C-terminal fragment
Formylglycine-dependent sulfatase, N-terminal
[Ectocarpus siliculosus]
Length = 588
Score = 193 bits (490), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 130/391 (33%), Positives = 195/391 (49%), Gaps = 79/391 (20%)
Query: 66 LADDLGWNDVGFHGLD-QIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTG 124
+ DD+GW+D+G+ +D + TP +D LA G+ + NYYT+ CTP+R+++MTG++ + G
Sbjct: 1 MIDDMGWDDIGYQSVDLKGVTPVLDKLAAGGVKITNYYTMNTCTPARASLMTGRYTVRYG 60
Query: 125 MQHNVLYG-CERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG 183
MQ+NV E G+PLSEK+LP+Y KE GY T +VGKWHLG + E+ P+ RGF++++G
Sbjct: 61 MQYNVAINPGEPWGVPLSEKMLPEYFKEAGYGTHLVGKWHLGSHSPEHIPSQRGFDTYMG 120
Query: 184 YWTGHQDYFDHSA----------------------------EEMKMWGLDMRRDLEPA-- 213
Y G + Y+ H ++G D R D + A
Sbjct: 121 YVGGFEAYWTHETVGVISDGRHVCDFGFGNASGYYDIIDRPSSQDIYGSDHREDQKEAEE 180
Query: 214 ----WDLHGKYST-------------------------DVFTAEAVDIIHNHS--TDEPL 242
D G +ST +F A++I+ S +EP+
Sbjct: 181 NRDDGDGEGVHSTTPTKTPWASSFRSNSGVMKGQQYSTQMFQERAIEILETKSPFDEEPI 240
Query: 243 FLYLAHAATHSANPYEPLQ--APDHYL---NIHRHIEDFK----RSKFAAILHKLDESVG 293
+++LAH A HS P + D L I I D + R +L LD+SVG
Sbjct: 241 YMHLAHQAVHSPLGLPPADSFSEDETLLLDEIEGKIGDVQGAQTRRHLTEVLMFLDKSVG 300
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
++ LE + NSI+V SDNGG + SN+PLRG K + WEGG + ++S
Sbjct: 301 HLIHYLEDEGWMENSIVVVASDNGGDSN----EGGSNYPLRGKKASYWEGGSKVPAFVYS 356
Query: 354 P--LLES-RGIVAEQYVHVSDWLPTLLSAAN 381
+ E+ RG + +HV+DWLPTL + A
Sbjct: 357 TSHIPEARRGSEYDGLMHVTDWLPTLAAGAG 387
>gi|32471439|ref|NP_864432.1| arylsulfatase B [precursor] [Rhodopirellula baltica SH 1]
gi|32443280|emb|CAD72111.1| Arylsulfatase B [Precursor] [Rhodopirellula baltica SH 1]
Length = 579
Score = 193 bits (490), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 129/409 (31%), Positives = 199/409 (48%), Gaps = 47/409 (11%)
Query: 4 NVLYGCERGGLPLSEKILPQYLKELGYRTRIMAFAVLP-LAFTLSMVFVDLVASSGPPHI 62
+ +G G+ + ++I + FA++ A + V + S PPHI
Sbjct: 79 GICFGVVGDGIGMVDRIRSVFSHCACDLKMAFPFAIMTCFALVGANVSAETNGGSQPPHI 138
Query: 63 IFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIH 122
+ I+ADDLGWNDVGFHG +I TPNID LA + L +Y +C+P+R+ ++TG +P
Sbjct: 139 VHIIADDLGWNDVGFHG-SEIRTPNIDRLASESVTLDRFYVTPICSPTRAGVLTGLYPFR 197
Query: 123 TGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR-IVGKWHLGFYKKEYTPTFRGFESH 181
G+ V+ ++ GLP + P++L +LGY R + GKWHLG + P G
Sbjct: 198 FGIWGGVVSPSKKHGLPPQLETAPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEF 257
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTDVFTAEAVDIIHNHSTDE 240
G++ G DYF + LD RD + +H + YST++ VD I ++
Sbjct: 258 YGHYNGAIDYFSRE----RFGQLDWHRDFD---SVHEEGYSTELVGNAVVDFIDRNANAG 310
Query: 241 PLFLYLAHAATHSANPYEPLQA-----------PDHYL--NIHRHIE------DFK---- 277
P++ Y+A A HS PLQA P++ L N R I D+
Sbjct: 311 PVYAYVAFNAPHS-----PLQALRSDLDEYGFDPNNKLAPNTDRKIAKREKALDYGKRGK 365
Query: 278 ----RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
R FAA+ +D +G++++A+++ M N+++VF SDNG + +N PL
Sbjct: 366 GNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGADPK----HGGNNEPL 421
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
RG K T WEGGVR ++ P GI + D LP+++ AA
Sbjct: 422 RGNKFTTWEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPSMVGAAGS 470
>gi|291225025|ref|XP_002732499.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 307
Score = 193 bits (490), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 118/320 (36%), Positives = 168/320 (52%), Gaps = 16/320 (5%)
Query: 70 LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNV 129
+GWNDV +H D I PN+ LA G+I YT CTPSR+A+MTG +P TG QH +
Sbjct: 1 MGWNDVHWHNPD-IAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59
Query: 130 LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT-GH 188
+ G+PL K+LP+ LKE+GY T +VGKWHLGF K EY PT RGF+SH G WT G
Sbjct: 60 AFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119
Query: 189 QDYFDHSAEEMKMWGLDMRRDLE--PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYL 246
DY G D R ++ P D Y + A I++ H PLFL
Sbjct: 120 GDYDKMDGVLSPSAGYDFRDNMGVVPKSD---DYLALMLGDRAEHIVNTHYPGTPLFL-- 174
Query: 247 AHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLS 306
A P + L+ P+ Y + IED + +F L +D+ +G VV+A++ R M
Sbjct: 175 ---AFTLDIPAKHLEIPEEYEEKYAEIEDDRTRQFYGKLTMMDDIIGGVVDAMKNRGMYD 231
Query: 307 NSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
+S+I+F+ DN G ++ + SN+P RG+ TL+EG R ++ ++ G + +
Sbjct: 232 DSLIIFIGDNEGLSS----QSGSNYPFRGIAGTLFEGATRVPSIVSGKGIKKTGYESNEL 287
Query: 367 VHVSDWLPTLLSAANKSDIP 386
+ D T+L A P
Sbjct: 288 YSIVDIHRTILDVAGAKAEP 307
>gi|241634070|ref|XP_002410502.1| arylsulfatase J, putative [Ixodes scapularis]
gi|215503435|gb|EEC12929.1| arylsulfatase J, putative [Ixodes scapularis]
Length = 480
Score = 192 bits (489), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 122/320 (38%), Positives = 170/320 (53%), Gaps = 11/320 (3%)
Query: 71 GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
GW DV FHG QIPTPNID LA G+IL NYY + LCTPSR+A+MTG +PIHTGM V+
Sbjct: 5 GWGDVSFHGSTQIPTPNIDVLAGDGVILDNYYALPLCTPSRAALMTGLYPIHTGMHAGVI 64
Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL-GFYKKEYTPTFRGFE-SHLGYWTGH 188
GL L KI+PQ+ ++LGY ++GK H G + + T + G + H +
Sbjct: 65 QDAAPWGLTLETKIMPQHFEDLGYEVNMIGKSHHDGCHNFDSTKS--GIDLLHTPLISSI 122
Query: 189 QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAH 248
D++ K + R + L + V ++ HS +P F YL+H
Sbjct: 123 PGQHDNTWMYGKTYNFYRTRAIC---RLQKRVIQGVVVYLIRFLLDLHS--QPFFCYLSH 177
Query: 249 AATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES-VGKVVEALEQRRMLSN 307
A HSA +P QAP L +I + R+ +A K +G + +L + + +
Sbjct: 178 QAVHSALMKDPFQAPARNLLKFPYIGETNRTVYAGETKKTHLPWLGLALVSLVKTKNYVD 237
Query: 308 SIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYV 367
+++ SDNGG+ G N NWPLRG K LWEGGVR +WS + V++Q +
Sbjct: 238 ILLMGFSDNGGSPFGLFSNRGFNWPLRGAKANLWEGGVRVPAFVWSSKFLKKPRVSDQLM 297
Query: 368 HVSDWLPTLLSAANKSDIPN 387
H+SDWLPTL SAA D+ N
Sbjct: 298 HISDWLPTLYSAAG-GDVSN 316
>gi|325106503|ref|YP_004276157.1| sulfatase [Pedobacter saltans DSM 12145]
gi|324975351|gb|ADY54335.1| sulfatase [Pedobacter saltans DSM 12145]
Length = 470
Score = 192 bits (489), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 123/371 (33%), Positives = 196/371 (52%), Gaps = 21/371 (5%)
Query: 43 AFTLSMVFVDLVASSGP--PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
AF L + V +VA + P++I I++DD G+ D G +G +IPTPNIDALA +G + +
Sbjct: 8 AFLLCISCVTVVAQAQQKKPNVIVIVSDDAGYADFGCYGGKEIPTPNIDALAKNGTLFTD 67
Query: 101 -YYTVQLCTPSRSAIMTGKHPIHTGMQHNVL------YGCERGGLPLSEKILPQYLKELG 153
Y + +C PSR+ I+TG + G +HN+ Y G+ K + +K G
Sbjct: 68 AYVSASVCAPSRAGILTGMYQQRFGFEHNISELPVKPYTLNDVGMDPKIKTIGDQMKHNG 127
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
YRT +GKWH G +Y P RGF+ G+ GH+ +F + + L + + +
Sbjct: 128 YRTIAIGKWHQGDL-PQYFPLKRGFDEFYGFVGGHRSFFGYPGGKAPSHELALFDNDKIV 186
Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
+ Y TD+FT +A+ + + + +P F+YLA+ A H P+ A ++ +I
Sbjct: 187 PENTIGYLTDMFTDKAISFVKENKS-KPFFMYLAYNAVHV-----PMNAKKELMDRFPNI 240
Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
D R +AA++ LD+ VG++V+ L+++ + N++IVFV+DNG A N A N L
Sbjct: 241 TDPGRKAYAAMMVSLDDGVGELVKTLKEQNLYDNTLIVFVNDNGAATG----NYADNGKL 296
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTV 393
RG+K + WEGG+R A ++ P G V + V D +PT + A I V
Sbjct: 297 RGLKGSKWEGGIRVAYIMQYPDQIPAGKVYSKMVSGLDIMPTAVGAGKGELISGQVTDG- 355
Query: 394 ENIIPRYENSI 404
+N+IP N+I
Sbjct: 356 KNLIPYLNNTI 366
>gi|372209242|ref|ZP_09497044.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 479
Score = 192 bits (488), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 130/356 (36%), Positives = 185/356 (51%), Gaps = 25/356 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+I+ IL DDLG+NDVGF+G I TPN+D LA +G+I+ Y C PSR++IMTGK
Sbjct: 28 PNIVIILCDDLGYNDVGFNGSKDIKTPNLDKLADNGMIMTAGYVAHPFCGPSRTSIMTGK 87
Query: 119 HPIHTGMQHNVLYGCERGG--LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+ G Q N+ E G +PL+ K + + L+E GY T GKWHLG + P R
Sbjct: 88 YAHTMGAQFNIPSESEGTGYGIPLNNKFISKELQEAGYYTGAFGKWHLG-ADTPFHPNKR 146
Query: 177 GFESHLGYWTGHQDY----FDHSAEEMKMWGLDMRRDLEPAWDLHG------KYSTDVFT 226
GF+ G+ G DY + E +K G RD + +G +Y TD +
Sbjct: 147 GFDEFYGFLGGGHDYIPEQYKPKYEFLKQRGSKNIRDYIKPLEHNGTEVDEKEYITDGLS 206
Query: 227 AEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
EAV+ ++ S +P F+YLA+ A P+ PLQA + + + I+D KR +AA++
Sbjct: 207 REAVNFVYKASEKKQPFFMYLAYNA-----PHVPLQAKKEDMAVFKSIKDEKRRTYAAMV 261
Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
+ +D VGK+VEAL+ + L N+++VF SDNGG A+N+PL K + +EGG
Sbjct: 262 YAVDRGVGKLVEALKANQQLENTLLVFFSDNGGKLG----KGANNFPLTEGKGSAYEGGH 317
Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYE 401
R P G V D PT S A K+ N N NI+P +
Sbjct: 318 RVPMFFHWPNRIKAGQKFHNPVSALDLYPTFASLA-KTKPSNAKNLAGTNILPHLQ 372
>gi|374620849|ref|ZP_09693383.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374304076|gb|EHQ58260.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 551
Score = 192 bits (488), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 122/384 (31%), Positives = 197/384 (51%), Gaps = 42/384 (10%)
Query: 29 GYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNI 88
G T ++A + FT ++ D V P++I ++ADDLGWNDVG+HG I TP++
Sbjct: 9 GLTTTLIALVFVQTVFTQAVYAEDGVR----PNVIVMVADDLGWNDVGYHG-GNIDTPSL 63
Query: 89 DALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQY 148
D LA G+ L +YT +C+P+R+A+MTG+ P+ G+ + V+ + G+ +E +PQ
Sbjct: 64 DKLAEQGVQLNRFYTTPICSPTRAALMTGRDPMRLGIAYGVILPWDNIGVNPAEHFMPQS 123
Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
+ GY+T +VGKWHLG + Y P RGFE G+ ++ A + G D +
Sbjct: 124 FQAAGYQTAMVGKWHLGHAQMTYHPNQRGFEHFYGHLHTEVGFYPPFA---NVGGKDFQE 180
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
+ + D G Y T + E I + ++P F+Y+ A H+ PL AP
Sbjct: 181 N-GVSIDDEG-YETYLLADEVSRYIRDRDEEKPFFIYMPFIAPHT-----PLDAPQELQE 233
Query: 269 IHRHIED-----------------------FKRSKFAAILHKLDESVGKVVEALEQRRML 305
++ IE R +AA++ +D+++G+V++ L++ +
Sbjct: 234 KYKDIETDLAPARSNQTDSTRRMAKLMMQPSARPMYAAVVDAMDQAIGQVLDTLDEEGLA 293
Query: 306 SNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAE 364
N+I++F SDNGGAA ++ A N PLRG K +EGG+R L+ W +LE +
Sbjct: 294 DNTIVLFFSDNGGAA--YSYGGADNAPLRGGKGETFEGGIRVVSLMRWPGVLEPAQSF-D 350
Query: 365 QYVHVSDWLPTLLSAANKSDIPNY 388
Q + V D PTL A + + N+
Sbjct: 351 QIMTVMDVFPTLAEATGVTPLNNF 374
>gi|167519809|ref|XP_001744244.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777330|gb|EDQ90947.1| predicted protein [Monosiga brevicollis MX1]
Length = 328
Score = 192 bits (487), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 124/337 (36%), Positives = 172/337 (51%), Gaps = 27/337 (8%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS-GIILKNYYTVQLCTPSRSAIMTGKH 119
H + I+ADDLG+ D+G+ D I TPNID LA G+IL N Y + C+PSR++ +TG+
Sbjct: 1 HHLQIMADDLGYYDLGYRNPDSI-TPNIDQLATQEGVILDNAYGYRYCSPSRASFLTGRV 59
Query: 120 PIHT--GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
PIH G GC L+ ++P L+ GYRT +VGKWH G E P RG
Sbjct: 60 PIHVHQGNPGLAAAGCTN----LNYTMIPAQLRRAGYRTAMVGKWHQGASLPECLPVNRG 115
Query: 178 FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
F++ GY +G +D+ D + + D D PA +G YS+ + VDII H+
Sbjct: 116 FDTSFGYLSGEEDHMDQTTNGGQCNVTDFWLDSGPAIRRNGTYSSFQYNDAIVDIIQQHA 175
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYL-----------NIHRHIEDFKRSKFAAILH 286
++PL LY A H PYE P+ Y R A+++
Sbjct: 176 PEQPLMLYAALQNVHG--PYE---VPERYRAKFPADKNCTETGDNPTSQCTRDFMLAMVN 230
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA-SNWPLRGVKNTLWEGGV 345
LD+ VG + AL+ M +N+++VF SDNGGA G + + +N PLRG K +EGGV
Sbjct: 231 VLDDIVGNITRALKANGMWNNTVLVFSSDNGGAVPGAPQHGSMNNHPLRGGKVAYFEGGV 290
Query: 346 RGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
R A + SPLL RG +H+SDW T A
Sbjct: 291 RTAAFVASPLLPKSIRGTELRGLIHISDWYATFCHLA 327
>gi|340367653|ref|XP_003382368.1| PREDICTED: arylsulfatase J-like [Amphimedon queenslandica]
Length = 504
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 117/343 (34%), Positives = 182/343 (53%), Gaps = 33/343 (9%)
Query: 53 LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY---------- 102
VA++ P+I+F+L DD G+ DVGF I +PN D LA +G++L +Y
Sbjct: 17 FVATAKLPNIVFVLVDDWGFADVGFRN-PAISSPNFDQLAKTGLVLNRHYLAKTGLVLNR 75
Query: 103 --TVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
+ C+PSR++ +TG+ P H Q N L + G L+ ++P LK Y T +VG
Sbjct: 76 HYVFKYCSPSRASFLTGRWP-HHAHQWN-LPQSDMAGTNLNMTMIPAKLKAAKYSTHMVG 133
Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
KWH GF+ Y P RGF++ G+ G +D+ AE +D ++ P +G Y
Sbjct: 134 KWHQGFFDPRYLPINRGFDTSSGFLGGGEDHMTQEAE----CAIDYWKNDAPD-PRNGTY 188
Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK-RS 279
+ + DI++NH+ ++PLF+YL H+ PLQAPD +LNI+ + R
Sbjct: 189 DAYNYGDDLTDIMNNHNVNDPLFIYLPLHNVHA-----PLQAPDEWLNIYSVNSTCENRR 243
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
+ A++ D G +V+ L+++ M N+II+ +DNGGAA G SN+PL+G K +
Sbjct: 244 TYQAMVSVADNVTGYLVQLLKKKGMWDNTIIIISADNGGAACG-----GSNYPLKGCKGS 298
Query: 340 LWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
+EGGVR + LL RG + ++H++DW T A
Sbjct: 299 FFEGGVRALAFVNGGLLPESRRGESTDGFIHIADWYTTFCKLA 341
>gi|345317091|ref|XP_001516952.2| PREDICTED: arylsulfatase I-like, partial [Ornithorhynchus anatinus]
Length = 264
Score = 191 bits (486), Expect = 1e-45, Method: Composition-based stats.
Identities = 103/243 (42%), Positives = 145/243 (59%), Gaps = 12/243 (4%)
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
++ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFYKKE PT R
Sbjct: 5 ARYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRR 64
Query: 177 GFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
GF++ LG TG+ DY+ + + + + G D+ AW+ GKYST ++ A I+ +
Sbjct: 65 GFDTFLGSLTGNVDYYTYDNCDGPGVCGYDLHEGENVAWEQSGKYSTLLYAQRAGKILAS 124
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
H PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V +
Sbjct: 125 HDPRRPLFLYVAFQAVHT-----PLQSPREYLYRYRAMGNVARRKYAAMVTCMDEAVRNI 179
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
AL++ NS+IVF +DNGG + SNWPLRG K T WEGG A L SP
Sbjct: 180 TRALKKYGYYDNSVIVFSTDNGGQ----TFSGGSNWPLRGRKGTYWEGGTPPATL--SPS 233
Query: 356 LES 358
L +
Sbjct: 234 LTA 236
>gi|149197772|ref|ZP_01874821.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149138993|gb|EDM27397.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 441
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 123/361 (34%), Positives = 192/361 (53%), Gaps = 47/361 (13%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
+ + F L + + ++ PP+II ILADD G +D +G Q+ TP+ID++A++GI
Sbjct: 1 MKILFCLFSLLCTSLLANEPPNIIIILADDAGSSDFSCYGSKQLLTPHIDSIAHNGIKFT 60
Query: 100 NYYTV-QLCTPSRSAIMTGKHPIHTGMQHNVLYG------CERGGLPLSEKILPQYLKEL 152
YT +C+PSR+ ++TG++ G N+ + E GLP++E L LKEL
Sbjct: 61 QAYTASSVCSPSRAGLLTGRYQQTFGHLANIPHSKHSANDPELLGLPVTEITLADSLKEL 120
Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL-- 210
GY T +GKWHLG + P RGF++ G+ +G + YF G ++R D+
Sbjct: 121 GYSTHCIGKWHLG-EADHFHPNARGFDNFYGFLSGARTYF---------LGGELRGDMDR 170
Query: 211 --------EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA 262
EP+ Y+T+VFT EA+ II D+P F+YL+H A H P+ A
Sbjct: 171 IMRNKEFAEPS----SGYTTEVFTQEAIRIIQEEQ-DKPFFIYLSHNAVHG-----PMDA 220
Query: 263 PDHYLNIHRHIEDFK---RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
D + + DFK R K++ ++ LD+ G +++AL+ + N++I F+SDNGG
Sbjct: 221 KDEDIMSY----DFKNPLRKKYSGLMKNLDDQTGLLLQALKDSKQYENTLIFFMSDNGGP 276
Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
N +SNWPLRG K + +EGG R L+ P S G+ +++ + D T + A
Sbjct: 277 TTH---NGSSNWPLRGFKGSEFEGGNRTPFLLQWPEKISAGLSSDKPIIAYDVFATCIQA 333
Query: 380 A 380
A
Sbjct: 334 A 334
>gi|430741545|ref|YP_007200674.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430013265|gb|AGA24979.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 474
Score = 191 bits (485), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 128/352 (36%), Positives = 183/352 (51%), Gaps = 26/352 (7%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSA 113
A +G P+++ ++ADDLG+ D+GF G IPTP++DALA G+ Y + C+P+R+
Sbjct: 32 APAGRPNVVLLVADDLGYGDLGFQGARDIPTPHLDALAQGGVRCTSGYVSGPYCSPTRAG 91
Query: 114 IMTGKHPIHTGMQHN-----VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
++TG++ G + N + GLP++E L LK GY T +VGKWHLG
Sbjct: 92 LLTGRYQQRFGHEFNPGGGGGAAAAKNVGLPVTETTLADRLKAAGYATGLVGKWHLGSEA 151
Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
K + P RGF+ G+ G YF + D+ R E + Y TD F+ E
Sbjct: 152 K-FHPQKRGFDEFFGFLGGQHTYFASKSG-------DVYRGTEVVKEE--AYLTDAFSRE 201
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
A+ I H D P FL L+ A H+ P+ A + + IED KR +AA+L L
Sbjct: 202 ALSFIDRHK-DHPFFLQLSFNAVHT-----PMDATEDRVARFSSIEDPKRRTYAAMLSAL 255
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
DE+ G V+E L + ++I+F SDNGG AG N + N PLRG K T EGGVR
Sbjct: 256 DEAAGNVIERLRAAGLEEKTLIIFFSDNGGPTMAGTTTNGSKNDPLRGSKRTTLEGGVRV 315
Query: 348 AGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
++ W L + G V +Q + D LPT L+AA P + V N++P
Sbjct: 316 PFVLSWKGTLPA-GKVYDQPIIQLDILPTALAAAGVDAEPEWKLDGV-NLLP 365
>gi|340619110|ref|YP_004737563.1| sulfatase [Zobellia galactanivorans]
gi|339733907|emb|CAZ97284.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
Length = 511
Score = 191 bits (484), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 136/400 (34%), Positives = 202/400 (50%), Gaps = 37/400 (9%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F + A LS+ + L A P+I+ +L DDLG+ DVGF+G I TP +D LA +G
Sbjct: 26 FKLFLYAAVLSVANIGL-AQETRPNILVVLCDDLGYADVGFNGSTDILTPELDNLAQNGS 84
Query: 97 ILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGC-----ERGGLPLSEKILPQYLK 150
I + Y C PSRSAI+TG++P TG +N+ + + G+P+ E + + L+
Sbjct: 85 IFTSAYVAHPFCGPSRSAILTGRYPHLTGTAYNLFHNSSEDDKDNMGVPVEETYMSKVLQ 144
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF----DHSAEEMKMWGLDM 206
GY T +GKWHLG K + P RGF+ G+ G DYF + + K G
Sbjct: 145 NAGYYTSAIGKWHLGAAPK-FHPNKRGFDDFYGFLGGGHDYFPSEYQKTYKAQKKAGNPN 203
Query: 207 RRDLEPAWDLHGK------YSTDVFTAEAV-DIIHNHSTDEPLFLYLAHAATHSANPYEP 259
RD + +GK Y TD F+ EA+ +I + +P F+YLA+ A P+ P
Sbjct: 204 IRDYVFPMEHNGKPANETEYITDGFSREAIKNIKIAAAKKQPFFIYLAYNA-----PHVP 258
Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
LQA + HI+D R +AA+++ +D VGK+V+ L++ + N++IVF+SDNGG
Sbjct: 259 LQAKAEDVAKFAHIKDKDRRTYAAMVYAVDRGVGKIVQTLKETKQFDNTLIVFLSDNGG- 317
Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL--- 376
FN + A+N+PL+G K WEGG R P + + V D PT
Sbjct: 318 --NFN-HGANNYPLKGTKGDTWEGGYRVPMFFHWPKKIKKDQRFDFPVSSLDLYPTFTGL 374
Query: 377 ----LSAANKSDIPNYVNSTVENIIPRYENSI--LRYENG 410
L + D N ++ ++N P + I LRY G
Sbjct: 375 AEAKLPKGKQLDGKNIMDDVLKNTEPYKDEMIYSLRYREG 414
>gi|291225031|ref|XP_002732500.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 286
Score = 191 bits (484), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 110/275 (40%), Positives = 154/275 (56%), Gaps = 12/275 (4%)
Query: 70 LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNV 129
+GWNDV +H D I PN+ LA G+I YT CTPSR+A+MTG +P TG QH +
Sbjct: 1 MGWNDVHWHNPD-IAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59
Query: 130 LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT-GH 188
++ G+PL+ K+LP+ LKE+GY T +VGKWHLGF K EY PT RGF+SH G WT G
Sbjct: 60 VFNLHPSGVPLNFKLLPEKLKEVGYATHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119
Query: 189 QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAH 248
DY + G D R ++ G Y + A I++ H PLFL
Sbjct: 120 GDYDKLNGVLSPSAGYDFRDNMGVVPKSDG-YLALMLGDRAEHIVNTHYPGTPLFLTFTL 178
Query: 249 AATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNS 308
P + L+ P+ Y + IED + +F L +D+ +G VV+AL+ R M +S
Sbjct: 179 DI-----PAKHLEIPEEYEEAYADIEDDRTRQFYGKLTMMDDIIGGVVDALKNRGMYDDS 233
Query: 309 IIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
+I+F+ DNG ++ + SN+P RG+ TL+EG
Sbjct: 234 LIIFIGDNGALSS----QSGSNYPFRGIAGTLFEG 264
>gi|323451422|gb|EGB07299.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 512
Score = 191 bits (484), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 117/338 (34%), Positives = 179/338 (52%), Gaps = 27/338 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PH+IF+LADD GWN+ GF+ D + TP+ DAL +G+ L N+Y + C+P+RS+ ++G+
Sbjct: 21 PHLIFLLADDEGWNN-GFYNEDVL-TPHTDALVAAGVKLTNHYVYKFCSPTRSSFLSGRL 78
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
PIH Q N G+PL + + ++ GYRT VGKWH G +TP RGF+
Sbjct: 79 PIHVN-QENSATEQAGAGVPLGMSTIGETMRAAGYRTAHVGKWHCGMATPAHTPYGRGFD 137
Query: 180 SHLGYWTGHQDYF---------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
S +G++ +D++ + + G+D+ RD PA +G Y FT AV
Sbjct: 138 SSIGFFNFGEDHYTQVRGGQALEEGGASGQCSGVDLWRDGAPARGENGTYGGYTFTRAAV 197
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
+ + T +PLFL+ A H PL+ P Y + R+ + R+ + LD+
Sbjct: 198 AAVEANDT-KPLFLFAAFQNLH-----PPLEVPPEY--VARYANETLRTTINGMAAFLDD 249
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
SVG V +AL+++ M +S+IV+ DNG G+ + P RG K + ++GGVR A
Sbjct: 250 SVGNVTKALKKQNMYDSSLIVYTPDNG----GYLTQGGDDAPFRGGKFSDFQGGVRVAAF 305
Query: 351 IWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKSDIP 386
I L+ + RG + VHV DW+ T +A IP
Sbjct: 306 ISGGLVPAGLRGTASAALVHVCDWMAT-FAALGGGAIP 342
>gi|383110963|ref|ZP_09931781.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
gi|313694533|gb|EFS31368.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
Length = 458
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 124/350 (35%), Positives = 187/350 (53%), Gaps = 30/350 (8%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F L TL F L A + P+I+ +LADDLGW DVGFHG +I TP +DAL G+
Sbjct: 5 FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
L+ +YT + TP+R+ +MTG++P G++ V+ GL +E+ + L GY+
Sbjct: 62 ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121
Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
R I+GKWHLG KK + P RGF G+ G DYFD + E LD D E D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
YST++ T EA+ I + + P LY+A+ A H+ PLQA + +I + ++
Sbjct: 178 --KGYSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYCDN 228
Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
F K+ ++A++ +D +G +V+AL+++ ++ N+ +F SDNG A +
Sbjct: 229 FDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGSAG----VPG 284
Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+S+ PLRG K W+GGV A L W + ++ Q D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLCWKKAEKQYKNLSSQVTGFVDLVPTL 334
>gi|423290535|ref|ZP_17269384.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
CL02T12C04]
gi|392665922|gb|EIY59445.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
CL02T12C04]
Length = 458
Score = 190 bits (483), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 124/350 (35%), Positives = 187/350 (53%), Gaps = 30/350 (8%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F L TL F L A + P+I+ +LADDLGW DVGFHG +I TP +DAL G+
Sbjct: 5 FFGLLCGCTLLPAF--LYAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
L+ +YT + TP+R+ +MTG++P G++ V+ GL +E+ + L GY+
Sbjct: 62 ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121
Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
R I+GKWHLG KK + P RGF G+ G DYFD + E LD D E D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
YST++ T EA+ I + + P LY+A+ A H+ PLQA + +I + ++
Sbjct: 178 --KGYSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228
Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
F K+ ++A++ +D +G +V+AL+++ ++ N+ +F SDNG A +
Sbjct: 229 FDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284
Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+S+ PLRG K W+GGV A L W + ++ Q D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334
>gi|414070343|ref|ZP_11406329.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410807260|gb|EKS13240.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 469
Score = 190 bits (482), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 116/308 (37%), Positives = 170/308 (55%), Gaps = 18/308 (5%)
Query: 46 LSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ 105
+S+ ++ AS P+I+ ILADDLG+ D+GF G +I TPNIDALA +G KN Y
Sbjct: 3 MSINSFNVNASEQKPNIVVILADDLGYGDLGFTGSKEIKTPNIDALASNGTRFKNAYVTH 62
Query: 106 -LCTPSRSAIMTGKHPIHTGMQHNVLYGCERG--GLPLSEKILPQYLKELGYRTRIVGKW 162
C PSR ++TG++ GM++NV Y + GLPLSE L+++GY T + GKW
Sbjct: 63 PYCGPSRVGLLTGRYQARLGMENNVSYMPQDKYMGLPLSENTFANRLQDVGYHTSVFGKW 122
Query: 163 HLGFYKKEYTPTFRGFESHLGYWTGHQDYF-DHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
HLG + P RGF+ G+ G +Y D + L + R+ + +Y
Sbjct: 123 HLGG-APHFQPNKRGFDYFYGFLDGGHNYMPDQVTVGGDGYSLPLMRNTQVT--EFDEYL 179
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T + +AV IH + P F+YL++ A H+ PLQAP Y+ ++HIED R +
Sbjct: 180 TTALSRDAVKYIHRQQ-ESPFFMYLSYNAPHT-----PLQAPAEYIEKYKHIEDEDRRVY 233
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW----PLRGVK 337
AA++ +DE +G VV+AL+ L N++I+F+SDNGG + S+W P R K
Sbjct: 234 AAMIDVMDEGIGMVVDALKDINQLDNTLIIFLSDNGGVFPE-SWQPTSDWADNSPFRRGK 292
Query: 338 NTLWEGGV 345
L EGG+
Sbjct: 293 AALLEGGI 300
>gi|336415441|ref|ZP_08595781.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
3_8_47FAA]
gi|335941037|gb|EGN02899.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
3_8_47FAA]
Length = 458
Score = 190 bits (482), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 124/350 (35%), Positives = 188/350 (53%), Gaps = 30/350 (8%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F L TL F L A + P+I+ +LADDLGW DVGFHG +I TP +DAL G+
Sbjct: 5 FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
L+ +YT + TP+R+ +MTG++P G++ V+ GL +E+ + L GY+
Sbjct: 62 ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121
Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
R I+GKWHLG KK + P RGF G+ G DYFD + E LD D E D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
YST++ T EA+ I + + P LY+A+ A H+ PLQA + +I + ++
Sbjct: 178 --KGYSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228
Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
F K++ ++A++ +D +G +V+AL+++ ++ N+ +F SDNG A +
Sbjct: 229 FDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284
Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+S+ PLRG K W+GGV A L W + ++ Q D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334
>gi|160885330|ref|ZP_02066333.1| hypothetical protein BACOVA_03329 [Bacteroides ovatus ATCC 8483]
gi|156108952|gb|EDO10697.1| arylsulfatase [Bacteroides ovatus ATCC 8483]
Length = 458
Score = 190 bits (482), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 123/348 (35%), Positives = 187/348 (53%), Gaps = 26/348 (7%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F L TL F L A + P+I+ +LADDLGW DVGFHG +I TP +DAL G+
Sbjct: 5 FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
L+ +YT + TP+R+ +MTG++P G++ V+ GL +E+ + L GY+
Sbjct: 62 ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121
Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
R I+GKWHLG KK + P RGF G+ G DYFD + E LD D E D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
YST++ T EA+ I + + P LY+A+ A H+ PLQA + + ++ + D
Sbjct: 178 --KGYSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIKLYTNNFD 230
Query: 276 F------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
K++ ++A++ +D +G +V+AL+++ ++ N+ +F SDNG A + +S
Sbjct: 231 SLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPGSS 286
Query: 330 NWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+ PLRG K W+GGV A L W + ++ Q D +PTL
Sbjct: 287 SGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334
>gi|299147176|ref|ZP_07040243.1| arylsulfatase B [Bacteroides sp. 3_1_23]
gi|298515061|gb|EFI38943.1| arylsulfatase B [Bacteroides sp. 3_1_23]
Length = 458
Score = 190 bits (482), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 124/350 (35%), Positives = 188/350 (53%), Gaps = 30/350 (8%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F L TL F L A + P+I+ +LADDLGW DVGFHG +I TP +DAL G+
Sbjct: 5 FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
L+ +YT + TP+R+ +MTG++P G++ V+ GL +E+ + L GY+
Sbjct: 62 ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121
Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
R I+GKWHLG KK + P RGF G+ G DYFD + E LD D E D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
YST++ T EA+ I + + P LY+A+ A H+ PLQA + +I + ++
Sbjct: 178 --KGYSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228
Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
F K++ ++A++ +D +G +V+AL+++ ++ N+ +F SDNG A +
Sbjct: 229 FDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284
Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+S+ PLRG K W+GGV A L W + ++ Q D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334
>gi|423214938|ref|ZP_17201466.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692201|gb|EIY85439.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
CL03T12C04]
Length = 458
Score = 190 bits (482), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 124/350 (35%), Positives = 188/350 (53%), Gaps = 30/350 (8%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F L TL F L A + P+I+ +LADDLGW DVGFHG +I TP +DAL G+
Sbjct: 5 FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
L+ +YT + TP+R+ +MTG++P G++ V+ GL +E+ + L GY+
Sbjct: 62 ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121
Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
R I+GKWHLG KK + P RGF G+ G DYFD + E LD D E D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
YST++ T EA+ I + + P LY+A+ A H+ PLQA + +I + ++
Sbjct: 178 --KGYSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228
Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
F K++ ++A++ +D +G +V+AL+++ ++ N+ +F SDNG A +
Sbjct: 229 FDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284
Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+S+ PLRG K W+GGV A L W + ++ Q D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334
>gi|414072362|ref|ZP_11408307.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410805226|gb|EKS11247.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 473
Score = 189 bits (481), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 127/345 (36%), Positives = 180/345 (52%), Gaps = 35/345 (10%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSA 113
A + P+I+ I ADD G+ D GF G Q+ TPN+D LA SG+ + Y + C PSR+
Sbjct: 24 AQTEKPNIVLIFADDAGFGDFGFQGSTQLKTPNLDKLAQSGVRFTQGYVSDSTCGPSRAG 83
Query: 114 IMTGKHPIHTG---------MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
+MTGK+ G M N GLPL +K + YLKE GY+T + GKWHL
Sbjct: 84 LMTGKYQQRFGYEEINVPGFMSDNSALKGADMGLPLDQKTMGDYLKEQGYKTAVFGKWHL 143
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK-----MWGLDMRRDL----EPAWD 215
G + + P RGF++ LG+ G + YF++S +EMK + + RD EP
Sbjct: 144 GDADR-FHPLKRGFDTFLGFRGGDRSYFNYSEQEMKNGNKHFFDKKLERDFGNYEEPK-- 200
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
+Y TDV EA I + DEP F+YLA A H+ PL++ L ++
Sbjct: 201 ---EYLTDVLGKEAAKYIE-QNKDEPFFIYLAFNAVHT-----PLESDPKDLAKFPNLTG 251
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
KR + AA+ LD + G V++ L++ + N+I+VF +DNGG + + NA++N PL G
Sbjct: 252 -KRKELAAMTLGLDRASGYVLDKLKELGLDDNTIVVFSNDNGGPS---DKNASNNAPLAG 307
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
K+ EGG+R LI P G + V D LPT SAA
Sbjct: 308 TKSNQLEGGIRVPFLISWPKHIKPGSTYDYPVSTLDLLPTFYSAA 352
>gi|255530697|ref|YP_003091069.1| sulfatase [Pedobacter heparinus DSM 2366]
gi|255343681|gb|ACU03007.1| sulfatase [Pedobacter heparinus DSM 2366]
Length = 472
Score = 189 bits (481), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 112/331 (33%), Positives = 183/331 (55%), Gaps = 20/331 (6%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRS 112
V ++ P++I I++DD G+ D G +G QIPTPNIDA+A G + Y + +C PSR+
Sbjct: 23 VKTAAKPNVIVIVSDDAGYVDFGCYGGKQIPTPNIDAIAKQGTRFTDAYVSASVCAPSRA 82
Query: 113 AIMTGKHPIHTGMQHN---VL---YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
I+TG++ G +HN VL Y G+ SE+ + ++ GY+T +GKWH G
Sbjct: 83 GILTGRYQQRFGFEHNTSNVLAPGYKITDVGMDPSEQTIGNEMQANGYKTIAIGKWHQGD 142
Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT 226
K + P RGF G+ GH+D+F + + L +++ P ++ Y TD+FT
Sbjct: 143 EPKHF-PLNRGFNEFYGFTGGHRDFFAYKGKRTNEHALYNNKEIVPENEI--TYLTDMFT 199
Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
+A I + D+P F+YL++ A H+ P+ A + + I D R +AA++
Sbjct: 200 DKATSFI-TANKDKPFFMYLSYNAVHT-----PMNAKKDLMERYASIADTGRRAYAAMMT 253
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
LD+ +GKV+ L+ ++ N++I+F++DNGGA +N++ N PLRG+K + WEGG+R
Sbjct: 254 SLDDGIGKVMATLKANQLDKNTLIIFINDNGGA----TVNSSDNGPLRGMKGSKWEGGIR 309
Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
A ++ P + + V D LPT +
Sbjct: 310 VAMMMKWPGHIAANKTDSRPVSSLDILPTAI 340
>gi|294053963|ref|YP_003547621.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613296|gb|ADE53451.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 478
Score = 189 bits (481), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 110/310 (35%), Positives = 170/310 (54%), Gaps = 22/310 (7%)
Query: 47 SMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQ 105
S++ L + + P+II I+ADD+G+ D GF G I TPN+D LA SG+I + Y T
Sbjct: 9 SLLIASLASGADRPNIIVIMADDMGYADAGFTGATDILTPNLDKLAESGVIFNQGYVTHA 68
Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLY--GCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
C PSR+ + G++ G +HN Y G+ + E + P L+++GY T I+GKWH
Sbjct: 69 FCGPSRAGFLAGRYQHRFGFEHNTPYDPANPLAGIDVRETLFPARLQDVGYTTGIIGKWH 128
Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFD----HSAEEMKMWGLDMRRDLEPAWDLHGK 219
LG + P RGF+ G+ TG DYF+ + + GL + + + G
Sbjct: 129 LG-ASSPFYPLNRGFDYFYGFLTGGHDYFEIDVTQPVKSAYLQGLFRNKRVA---NFEG- 183
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
Y T + +AV + N + + P FL+L++ A P++PLQAP + + HI+D KR
Sbjct: 184 YLTTALSRDAVQFV-NDNKENPFFLFLSYNA-----PHQPLQAPQEDIARYAHIKDKKRR 237
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG----FNLNAASNWPLRG 335
+AA++ +D +G+VV AL++ + N++I F+SDNGG + N +SN P RG
Sbjct: 238 VYAAMVDVMDRGIGEVVTALDRNGLRENTLIFFLSDNGGPVSSKKHPGKGNGSSNGPFRG 297
Query: 336 VKNTLWEGGV 345
K +EGGV
Sbjct: 298 GKTDFYEGGV 307
>gi|298706919|emb|CBJ29746.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 616
Score = 189 bits (481), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 123/359 (34%), Positives = 174/359 (48%), Gaps = 51/359 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIP-TPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
P++IFIL DD+G ND+G+ D TP +D+L+ G+ L YYT QLCTPSR+++MTG+
Sbjct: 120 PNVIFILIDDVGTNDIGYQSTDLWELTPFMDSLSSEGVRLTKYYTNQLCTPSRASLMTGR 179
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
TGMQ+ V+ GLPL E L + K GKWHLG Y + P RGF
Sbjct: 180 DTFRTGMQYEVVEDSGAWGLPLEEVTLAERFK--------TGKWHLGMYSDAHYPFARGF 231
Query: 179 ESHLGYWTGHQDYFDHSA-----------EEMKMWGLD--------MRRDLEPAWDLHGK 219
++ LGY + Y H K +G + G
Sbjct: 232 DTFLGYMGAVRGYSSHEGCNTPTFEGGEYSCFKDFGYGDKDGYINHITNTTRQGPSFVGN 291
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI------ 273
YST + T A+++ H D P FLY++H A HS P +P PD Y H
Sbjct: 292 YSTTIITDRAIEVAKEHGED-PFFLYVSHQAVHS--PVDP--PPDDYFTEEEHATLARVV 346
Query: 274 -EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
ED R +FA ++ LD+ + ++ + L+ L N+++V SDNG SN+P
Sbjct: 347 SEDAYRQRFARVILFLDKEMRRLHDELDALGALDNTVLVVASDNGACPTA----GGSNYP 402
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLL-----ESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
LRG K+T++EGGVR ++S E+RG +H +DW PT S D+P
Sbjct: 403 LRGYKHTIFEGGVRVPAFVYSKSTDLIPEEARGTRYSGMMHSTDWTPTFGSVV--PDLP 459
>gi|149196558|ref|ZP_01873612.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149140238|gb|EDM28637.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 443
Score = 189 bits (480), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 118/291 (40%), Positives = 161/291 (55%), Gaps = 19/291 (6%)
Query: 62 IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGKHP 120
+ ILADDLG+ DVGF G QI TP+ID LA G+I + Y + +C PSR+ +MTGK+
Sbjct: 11 FVLILADDLGYGDVGFTGSSQIKTPHIDRLAKDGVIFSQGYVSSSVCGPSRAGLMTGKNQ 70
Query: 121 IHTGMQHNVL-----YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
+ G +N+ + E GLP+SEK L L E GY +VGKWHLG K++Y P
Sbjct: 71 VRFGFDNNLTNYLPQFKDEFHGLPISEKTLATRLAEKGYVNGLVGKWHLG-DKEQYHPLK 129
Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
RGF GY G YF S K + + + + + Y TD E VD I
Sbjct: 130 RGFHEFWGYLGGGHHYF-RSKPNGKGYDCPIECNYKTPQPI--SYITDDKGDECVDFIRR 186
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
H DEP FL+ + A P+ P+ A + L ++ HIE KR + A+ H LD +VGK+
Sbjct: 187 HK-DEPFFLFASFNA-----PHAPMHAKEEDLKLYSHIEGEKRRAYCAMTHNLDLNVGKI 240
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
V L+ + + N+++VF+SDNGG NA+ N PLRG K L EGG+R
Sbjct: 241 VAELKAQNIYDNTVVVFLSDNGGPTPN---NASLNAPLRGGKGILLEGGIR 288
>gi|417301514|ref|ZP_12088666.1| arylsulfatase B [Rhodopirellula baltica WH47]
gi|327542201|gb|EGF28693.1| arylsulfatase B [Rhodopirellula baltica WH47]
Length = 489
Score = 189 bits (479), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 122/355 (34%), Positives = 180/355 (50%), Gaps = 46/355 (12%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
S PPHI+ I+ADDLGWNDVGFHG +I TPNID LA + L +Y +C+P+R+ +
Sbjct: 41 GGSQPPHIVHIIADDLGWNDVGFHG-SEIRTPNIDRLASESVTLDRFYVTPICSPTRAGV 99
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR-IVGKWHLGFYKKEYTP 173
+TG +P G+ V+ ++ GLP + P++L +LGY R + GKWHLG + P
Sbjct: 100 LTGLYPFRFGIWGGVVSPTKKHGLPTLLETTPEHLSKLGYDHRAMFGKWHLGLASTLFHP 159
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTDVFTAEAVDI 232
G G++ G DYF + LD RD + +H + YST++ VD
Sbjct: 160 LHHGMTEFYGHYNGAIDYFSRE----RFGQLDWHRDFD---SVHEEGYSTELVGNAVVDF 212
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQA-----------PDHYL--NIHRHIE----- 274
I ++ ++ Y+A A HS PLQA P++ L N R I
Sbjct: 213 IDRNANAGAVYAYVAFNAPHS-----PLQALRSDLDEYGFDPNNKLSPNTDRKIAKREKA 267
Query: 275 -DFK--------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
D+ R FAA+ +D +G++++A+++ M N+++VF SDNG
Sbjct: 268 LDYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGADPK---- 323
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+ SN PLRG K T WEGGVR ++ P GI + D LP +++AA
Sbjct: 324 HGGSNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPIMVAAA 378
>gi|295086308|emb|CBK67831.1| Arylsulfatase A and related enzymes [Bacteroides xylanisolvens
XB1A]
Length = 458
Score = 189 bits (479), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 123/350 (35%), Positives = 188/350 (53%), Gaps = 30/350 (8%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F L TL F L A + P+I+ +LADDLGW DVGFHG +I TP++DAL G+
Sbjct: 5 FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPSLDALVGEGV 61
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
L+ +YT + TP+R+ +MTG++P G++ V+ GL +E+ + L GY+
Sbjct: 62 ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121
Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
R I+GKWHLG KK + P RGF G+ G DYFD + E LD D E D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
YST++ T EA+ I + + P LY+A+ A H+ PLQA + +I + ++
Sbjct: 178 --KGYSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228
Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
F K++ ++A++ +D +G +V+AL+++ ++ N+ +F SDNG A +
Sbjct: 229 FDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284
Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+S+ PLRG K W+GGV A L W + ++ Q D +P L
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPML 334
>gi|340368073|ref|XP_003382577.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 507
Score = 189 bits (479), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 121/368 (32%), Positives = 184/368 (50%), Gaps = 42/368 (11%)
Query: 42 LAFTLSMVFVDLVASSG---PPHIIFILADDLGWNDVGFH---GLDQIPTPNIDALAYSG 95
+A + + LVA PHI+ +L DD GW +VG+H ++ TPNID L G
Sbjct: 2 IAAVVLLCIAGLVAGQPVRQKPHIVLMLVDDWGWANVGYHRNPPTREVVTPNIDDLVKQG 61
Query: 96 IILKNYYTVQLCTPSRSAIMTGKHPIHTG---------MQHNVLYGCERGGLPLSEKILP 146
+ L +Y + C+PSRS +M+G+ PIH ++ + G +P + +
Sbjct: 62 LELDQHYAYKFCSPSRSCLMSGRLPIHVNDLNLAPTNYNPNDPVSGFS--AIPRNMTGIA 119
Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-------SAEEM 199
+ +KE GY T VGKW G ++TP RGF++ GY+ DY+ + +
Sbjct: 120 EKMKEAGYATHQVGKWDAGMATPDHTPKGRGFDTSFGYYHHDNDYYTEVVGPQCSGSPIV 179
Query: 200 KMWGLDMRRDLEPAWDLHG----KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSAN 255
+W D PA ++G KY +F +D++ H + PLFLY A H+
Sbjct: 180 DLWDTD-----HPAHGINGTGPDKYEEGLFKERLMDVVSKHDPNTPLFLYYAPHIAHT-- 232
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
PLQ PD YLN I+D R + A+++ LD+ VG +V AL+Q+ M N + V SD
Sbjct: 233 ---PLQVPDDYLNKFSFIDDSDRKYYHAMVNYLDDVVGDLVAALKQKGMWDNLLYVTSSD 289
Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWL 373
NGG + A+N+PL+G K T W+GG+R + L + RG + YVH++DW
Sbjct: 290 NGGPI--YAGGGANNYPLKGGKLTDWQGGIRVNAFVSGGYLPEKMRGQKTDGYVHLADWY 347
Query: 374 PTLLSAAN 381
T + A
Sbjct: 348 GTFCAIAG 355
>gi|293372058|ref|ZP_06618453.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
gi|292632962|gb|EFF51547.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
Length = 458
Score = 189 bits (479), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 124/350 (35%), Positives = 187/350 (53%), Gaps = 30/350 (8%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F L TL F L A + P+I+ +LADDLGW DVGFHG +I TP +DAL G+
Sbjct: 5 FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
L+ +YT + TP+R+ +MTG++P G++ V+ GL +E+ + L GY+
Sbjct: 62 ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121
Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
R I+GKWHLG KK + P RGF G+ G DYFD + E LD D E D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
YST++ T EA+ I + + P LY+A+ A H+ PLQA + +I + ++
Sbjct: 178 --KGYSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228
Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
F K+ ++A++ +D +G +V+AL+++ ++ N+ +F SDNG A +
Sbjct: 229 FDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284
Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+S+ PLRG K W+GGV A L W + ++ Q D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334
>gi|392966318|ref|ZP_10331737.1| sulfatase [Fibrisoma limi BUZ 3]
gi|387845382|emb|CCH53783.1| sulfatase [Fibrisoma limi BUZ 3]
Length = 461
Score = 189 bits (479), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 124/345 (35%), Positives = 192/345 (55%), Gaps = 25/345 (7%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
A++ PP+I+FILADDLG+ D+ G + TP+ID+L +G+ N+Y +C+PSR+A
Sbjct: 31 AANRPPNIVFILADDLGYGDLSCFGSTDLKTPHIDSLIGAGMRFTNFYANSSVCSPSRAA 90
Query: 114 IMTGKHPIHTGMQHNVLYGCER--GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
++TG++P G+ + + G L S +LP YL++ GY + +GKWHLG +
Sbjct: 91 LLTGRYPERVGVPGVIRDEVQDSWGYLASSATLLPTYLRKQGYHSANIGKWHLGL-ESPN 149
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
P RGF+ G G D + G + R D G ++TDVFT AV
Sbjct: 150 LPNERGFQEFYGLLEGMMDDYVVKLRH----GQNFLRHNGQVIDPPG-HATDVFTDAAVR 204
Query: 232 IIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE---DFKRSKFAAILHK 287
+++ + +P FLYLA+ A P++PLQ P YLN + + D +R+K A++
Sbjct: 205 YLNDRKAKKDPFFLYLAYTA-----PHDPLQPPVAYLNRVQKRQPGIDPQRAKLVALIEH 259
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR- 346
LD+++G+V+ L L+N+++VF SDNGG G A+N RGVK +EGG+R
Sbjct: 260 LDDNIGRVLATLRANGQLANTLVVFTSDNGGWGPG----KANNGQFRGVKGQFYEGGIRI 315
Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNS 391
G +WS +E+ G + + + DW PTLL AA S +P+ +N
Sbjct: 316 PTGAMWSGHIEA-GSESGALLQLMDWFPTLLQAAGAS-VPDGING 358
>gi|196231680|ref|ZP_03130537.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196224152|gb|EDY18665.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 474
Score = 188 bits (478), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 122/341 (35%), Positives = 173/341 (50%), Gaps = 54/341 (15%)
Query: 48 MVFVDLVASSGP--PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTV 104
+F +A + P P+I+FI+ADDLG+ + G +G IPTPNID L SG+ + Y +
Sbjct: 14 FLFCAQLAIAAPKRPNILFIVADDLGYGEPGCYGGKDIPTPNIDKLVASGVRFSSGYVSA 73
Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVG 160
C SR+A+MTG++ G ++N + G + GLP++EK + L+++GY T +VG
Sbjct: 74 PFCAASRAALMTGRYQTRFGFEYNPI-GAKNADPGTGLPVNEKTVADRLRDVGYATGLVG 132
Query: 161 KWHLGFYKKEYTPTFRGFESHLGY-----------WTGHQDYFDHSA------------E 197
KWHLG + P RGF+ G+ W+G + A +
Sbjct: 133 KWHLG-GTAPFHPQRRGFDEFFGFLHEGHFYLPPPWSGATTWLRRKALPDGSQGRWTSPD 191
Query: 198 EMKMWGLDMRRDLEPAWDLHGKYS------------TDVFTAEAVDIIHNHSTDEPLFLY 245
+W D+ + EPA+D TD FT EA I H +P FLY
Sbjct: 192 GHTVWSTDLHEN-EPAYDADNPLLRNSQPVEEKANLTDAFTREACSFIDRHQA-QPWFLY 249
Query: 246 LAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRML 305
LA+ A HS PLQ D Y+ HI D +R FAA+L LDE +GKV L +
Sbjct: 250 LAYNAVHS-----PLQGEDTYMEKFSHIGDIQRRIFAAVLAHLDEDIGKVRAQLRADGLE 304
Query: 306 SNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
N+++VF+SDNGG +SN PLRG K LW+GG+R
Sbjct: 305 ENTLVVFLSDNGGPTKELT---SSNLPLRGGKGDLWDGGIR 342
>gi|410617068|ref|ZP_11328044.1| sulfatase [Glaciecola polaris LMG 21857]
gi|410163337|dbj|GAC32182.1| sulfatase [Glaciecola polaris LMG 21857]
Length = 488
Score = 188 bits (478), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 117/327 (35%), Positives = 182/327 (55%), Gaps = 26/327 (7%)
Query: 35 MAFAVLPLA---FTLSMVFVDLVASSGP------PHIIFILADDLGWNDVGFHGLDQIPT 85
M F+ L LA LS + VA +G P+II ILADDLG+ D+GF G +I T
Sbjct: 1 MKFSALILAAKCLCLSTLLATSVAYAGQTKADSRPNIIVILADDLGYGDLGFTGSREIKT 60
Query: 86 PNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERG--GLPLSE 142
PNIDALA+ G++ N Y C PSR+ ++TG++ GM+ N + + GLP+ E
Sbjct: 61 PNIDALAHKGVVFSNAYVTHPYCGPSRAGLLTGRYQARFGMEINAAHSPDDPFMGLPVDE 120
Query: 143 KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE-EMKM 201
+ +++ GY+T ++GKWH+G + + P RGF+ G+ G DYF S + +
Sbjct: 121 PTFAKRMQKAGYKTAVIGKWHMGSHPN-FHPNNRGFDYFYGFLGGGHDYFPESVKVSNEE 179
Query: 202 WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQ 261
+ + + R+ +PA +Y T + EA + +T +P +Y+A+ A P++PL+
Sbjct: 180 YSIPLSRNGKPAQ--LNEYLTTAISKEAAEFA--MTTSQPFMMYVAYNA-----PHQPLE 230
Query: 262 APDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA-- 319
A L ++HIED R +AA++ +D+ +G++V L+Q N++I F+SDNGG
Sbjct: 231 ATQKDLAKYQHIEDINRRTYAAMVDSMDQGIGRLVNGLKQSGKFENTLIFFLSDNGGVYP 290
Query: 320 -AAGFNLNAASNWPLRGVKNTLWEGGV 345
+ N N ASN P R K +L EGGV
Sbjct: 291 ESWMPNSNWASNKPFRRGKVSLTEGGV 317
>gi|262405390|ref|ZP_06081940.1| arylsulfatase B [Bacteroides sp. 2_1_22]
gi|262356265|gb|EEZ05355.1| arylsulfatase B [Bacteroides sp. 2_1_22]
Length = 458
Score = 188 bits (478), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 123/350 (35%), Positives = 188/350 (53%), Gaps = 30/350 (8%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F L TL F L A + P+I+ +LADDLGW DVGFHG +I TP++DAL G+
Sbjct: 5 FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPSLDALVGEGV 61
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
L+ +YT + TP+R+ +MTG++P G++ V+ GL +E+ + L GY+
Sbjct: 62 ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121
Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
R I+GKWHLG KK + P RGF G+ G DYFD + E LD D E D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
YST++ T EA+ I + + P LY+A+ A H+ PLQA + +I + ++
Sbjct: 178 --KGYSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228
Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
F K++ ++A++ +D +G +V+AL+++ ++ N+ +F SDNG A +
Sbjct: 229 FDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284
Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+S+ PLRG K W+GG A L W + ++ Q D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGGHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334
>gi|423294191|ref|ZP_17272318.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
CL03T12C18]
gi|392676448|gb|EIY69884.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
CL03T12C18]
Length = 458
Score = 188 bits (478), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 124/350 (35%), Positives = 187/350 (53%), Gaps = 30/350 (8%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F L TL F L A + P+I+ +LADDLGW DVGFHG +I TP +DAL G+
Sbjct: 5 FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
L+ +YT + TP+R+ +MTG++P G++ V+ GL +E+ + L GY+
Sbjct: 62 ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETVADMLARNGYKN 121
Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
R I+GKWHLG KK + P RGF G+ G DYFD + E LD D E D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
YST++ T EA+ I + + P LY+A+ A H+ PLQA + +I + ++
Sbjct: 178 --KGYSTELITKEAIHCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228
Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
F K+ ++A++ +D +G +V+AL+++ ++ N+ +F SDNG A +
Sbjct: 229 FDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284
Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+S+ PLRG K W+GGV A L W + ++ Q D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334
>gi|196231892|ref|ZP_03130748.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196224014|gb|EDY18528.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 486
Score = 188 bits (478), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 122/346 (35%), Positives = 189/346 (54%), Gaps = 41/346 (11%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+FILADD+GW+D+G +G D TPNID A + + Y + +C+PSRS +MTGK
Sbjct: 26 PNILFILADDMGWSDLGCYGADLHETPNIDRFASGAVRFTSAYAMSVCSPSRSTLMTGK- 84
Query: 120 PIHTGMQHNVLY--GCERGG--------------LPLSEKILPQYLKELGYRTRIVGKWH 163
H H ++ G + GG LP SEK + YLK GY T ++GKWH
Sbjct: 85 --HAARLHFTIWAEGAQEGGAKNRELREAESIWNLPNSEKTIATYLKSAGYLTALIGKWH 142
Query: 164 LGFYKKEYTPTFRGFESHLG--YWTGHQDY-FDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
LG + E+ P GF+ ++G W Q + + +S G + R + G+Y
Sbjct: 143 LGDW--EHYPEAHGFDINIGGTNWGAPQTFWWPYSGSGTH--GPEFRYIPHLEYGHPGEY 198
Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP----DHYLNIHRHIEDF 276
TD T EA+ +I +H+ D+P F+YLAH A H+ P++A H+ +R +
Sbjct: 199 LTDRLTDEAIKVI-DHAGDQPFFVYLAHHAVHT-----PIEAKADDIQHFDAKYRDGMNH 252
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-----AAAGFNLNAASNW 331
+ + +AA+ +LDE+VG+V+E L++R + N++++F SDNGG +G N+ +N
Sbjct: 253 RHTIYAAMNKELDENVGRVLEHLKERGLDKNTVVIFASDNGGYIGVDKVSGKNMPVTNNA 312
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
PLR K L+EGG+R +I P + G ++ V ++D L T L
Sbjct: 313 PLRSGKGALYEGGIRVPLIIRWPGVTPNGATCDEPVILTDMLQTFL 358
>gi|196229618|ref|ZP_03128482.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196225944|gb|EDY20450.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 490
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 127/332 (38%), Positives = 172/332 (51%), Gaps = 20/332 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
P+II I++DD G+ D F G I TPN+DALA SG+ + Y T +C+PSR+ +MTG+
Sbjct: 39 PNIIVIVSDDQGYADASFQGSKDILTPNLDALAKSGVRCTRGYVTAPVCSPSRAGLMTGR 98
Query: 119 HPIHTGMQHNVL--YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+ G +N++ LP +E +LPQ L + GY T +VGKWHLG + P R
Sbjct: 99 YQERFGHHNNIVAEAALPIAHLPSNETLLPQVLAKAGYYTAMVGKWHLGL-QDGCRPYER 157
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE---PAWDLHGKYSTDVFTAEAVDII 233
GF+ G TG DYF + EE + + +E P + Y TD F A+AV II
Sbjct: 158 GFDEFFGIITGGHDYFVNHPEERAVGDQSYKARIERNGPVGEAVPGYLTDAFGADAVRII 217
Query: 234 ---HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDFKRSKFAAILHKLD 289
H D+PLFLYLA A H+ P QAP ++ + +E R +AA + +D
Sbjct: 218 RESHTKRPDQPLFLYLAFNAPHT-----PTQAPKDLVDTMPATLESKDRRTYAAQITSMD 272
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
SVGKV AL++ M ++ IVF SDNGGA N N PLR K +L+EGG+R
Sbjct: 273 ASVGKVRAALKENGMEKDTFIVFFSDNGGA----NHPYYDNTPLRDHKGSLYEGGIRVPF 328
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
P G V E V D T + A
Sbjct: 329 FAVYPGHIPAGSVCELPVTSLDVFATACALAG 360
>gi|410039868|ref|XP_003950702.1| PREDICTED: arylsulfatase I isoform 1 [Pan troglodytes]
Length = 426
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 96/227 (42%), Positives = 139/227 (61%), Gaps = 10/227 (4%)
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL 216
+VGKWHLGFY+KE PT RGF++ LG TG+ DY+ + + + + G D+ AW L
Sbjct: 1 MVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGL 60
Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
G+YST ++ A I+ +HS PLFLY+A A H+ PLQ+P YL +R + +
Sbjct: 61 SGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNV 115
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
R K+AA++ +DE+V + AL++ +NS+I+F SDNGG + SNWPLRG
Sbjct: 116 ARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGR 171
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
K T WEGGVRG G + SPLL+ + + +H++DW PTL+ A +
Sbjct: 172 KGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGT 218
>gi|430741674|ref|YP_007200803.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430013394|gb|AGA25108.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 454
Score = 188 bits (477), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 123/345 (35%), Positives = 189/345 (54%), Gaps = 39/345 (11%)
Query: 53 LVASSGP--PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTP 109
L A+ G P+++ +LADDLGW DVGF+G + TPN+D LA G K +YT +C P
Sbjct: 31 LAAAEGAKRPNVLLLLADDLGWGDVGFNGRTEWATPNLDRLAARGTTFKRFYTAAVVCAP 90
Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK- 168
SR+A+MTG++ IH G+ N LP E L + K GY T + GKWH G +
Sbjct: 91 SRAALMTGRYTIHDGVSRN------NDDLPAREVTLAEAFKTHGYDTALFGKWHHGQPRD 144
Query: 169 --KEYT-PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
K Y P +GF+ G+ T + ++ E+ +W R+L+P YS D+F
Sbjct: 145 GSKTYVHPMDQGFDEFFGF-TDAKHAWEKYPEQ--LW---HGRELKPV----SGYSDDMF 194
Query: 226 TAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR----HIEDFK--R 278
A+D + H P FLY+ TH ++AP + +H+ ++ K R
Sbjct: 195 ADHAIDFLKRHKEKPTPFFLYVPFINTHFN-----IEAPAEEVALHQGKFTEVDPSKPIR 249
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA----SNWPLR 334
+ +AA++ +LD++VG+++ ALE + +++++VFVSD+G NL A+ SN P R
Sbjct: 250 ATYAAMVTQLDKNVGRIMTALEGLGLAADTLVVFVSDHGATFESGNLGASDYHDSNRPFR 309
Query: 335 GVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
G K TLWEGG+R G+ P G+++ + VH++D PTLL+A
Sbjct: 310 GQKRTLWEGGIRVPGVACWPGHVPAGVISNEVVHMTDLFPTLLAA 354
>gi|443700719|gb|ELT99563.1| hypothetical protein CAPTEDRAFT_110993 [Capitella teleta]
Length = 339
Score = 187 bits (476), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 117/339 (34%), Positives = 182/339 (53%), Gaps = 31/339 (9%)
Query: 46 LSMVFVDLVASSGP-----------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
+S++ V ++A++G P+II+ ADD G+ D G+ D I TPNID L
Sbjct: 7 VSVLVVMVLATAGKLKQRKRQGESKPNIIWFFADDYGYQDAGYRNSD-IHTPNIDKLVAD 65
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
GI N Y+ Q C+PSRSA ++G++ +GMQH V+ + + L + YLKEL Y
Sbjct: 66 GISFTNAYSAQQCSPSRSAFLSGRYAYTSGMQHGVIGDTKAHCMDLKYNFISDYLKELKY 125
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA-EEMKMW--GLDMRRDLE 211
T GKWHLG+ KE TPT+RGF++ G ++G +Y++H+ + M W G ++ R
Sbjct: 126 NTHASGKWHLGYCNKECTPTYRGFDTFSGGYSGEGEYYEHTTFQGMYDWHNGTELDRSAS 185
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
G +S D+ A+ ++ + + + PLF Y A THS P P P+H
Sbjct: 186 ------GIHSQDMIEADLLNQL-DKNDGTPLFYYAAFHNTHS--PMHP--KPEHEALYDD 234
Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
R K+ ++ +D +GKVV L+++ M N+ I F SDNGG +++ N
Sbjct: 235 LDAPPTRRKYLGLVSGMDAVIGKVVTKLKEKDMFDNTYIFFSSDNGG-----DVHEGDNS 289
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
P RG K++L++GG SPLL+ +G+ A+ +V S
Sbjct: 290 PKRGAKSSLFDGGCHAHSWASSPLLKKKGVEADGWVFAS 328
>gi|119582144|gb|EAW61740.1| arylsulfatase family, member I [Homo sapiens]
Length = 426
Score = 187 bits (475), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 96/227 (42%), Positives = 139/227 (61%), Gaps = 10/227 (4%)
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL 216
+VGKWHLGFY+KE PT RGF++ LG TG+ DY+ + + + + G D+ AW L
Sbjct: 1 MVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGL 60
Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
G+YST ++ A I+ +HS PLFLY+A A H+ PLQ+P YL +R + +
Sbjct: 61 SGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNV 115
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
R K+AA++ +DE+V + AL++ +NS+I+F SDNGG + SNWPLRG
Sbjct: 116 ARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGR 171
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
K T WEGGVRG G + SPLL+ + + +H++DW PTL+ A +
Sbjct: 172 KGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGT 218
>gi|193784090|dbj|BAG53634.1| unnamed protein product [Homo sapiens]
Length = 426
Score = 187 bits (475), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 96/227 (42%), Positives = 139/227 (61%), Gaps = 10/227 (4%)
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL 216
+VGKWHLGFY+KE PT RGF++ LG +TG+ DY+ + + + + G D+ AW L
Sbjct: 1 MVGKWHLGFYRKECLPTRRGFDTFLGSFTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGL 60
Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
G+YST + A I+ +HS PLFLY+A A H+ PLQ+P YL +R + +
Sbjct: 61 SGQYSTMPYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNV 115
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
R K+AA++ +DE+V + AL++ +NS+I+F SDNGG + SNWPLRG
Sbjct: 116 ARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGR 171
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
K T WEGGVRG G + SPLL+ + + +H++DW PTL+ A +
Sbjct: 172 KGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGT 218
>gi|149177301|ref|ZP_01855906.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
gi|148843826|gb|EDL58184.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
Length = 501
Score = 187 bits (475), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 130/400 (32%), Positives = 202/400 (50%), Gaps = 73/400 (18%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F L L F L F L A+ PP+II I++DD G+ D+G G ++I TP++D LA G
Sbjct: 16 FISLALLFCLCSAF-GLKAAETPPNIIMIVSDDQGYRDLGSFGSEEIMTPHLDRLAKEGA 74
Query: 97 ILKNYY-TVQLCTPSRSAIMTGKHPIHTGM------------------QHNVLYGCER-G 136
L ++Y T CTPSR +++TG++P G+ ++ V + ER G
Sbjct: 75 KLTSFYVTWPACTPSRGSLLTGRYPQRNGIYDMIRNEAPDFGHKYKPAEYEVTF--ERIG 132
Query: 137 GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA 196
G+ + EK+LP LK GY + I GKW LG +K+ + P RGF+ G+ DYF H
Sbjct: 133 GMDVREKLLPALLKPAGYVSAIYGKWDLGIHKR-FLPLARGFDDFYGFTNTGIDYFTH-- 189
Query: 197 EEMKMWGL-DMRRDLEPAWDLHGKYSTDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSA 254
+ +G+ M R+ +P + G Y T +F EAV I NH +P FLYL A H A
Sbjct: 190 ---ERYGVPSMYRNNQPTEEDKGTYCTYLFQREAVRFIKENHQ--KPFFLYLPFNAPHGA 244
Query: 255 NPYEP-----LQAPDHYLNIHRHIEDF-----------------------------KRSK 280
+ +P QAP+ Y N++ H++D +R +
Sbjct: 245 SSLDPRIRGGAQAPEKYKNMYPHLKDTLVTKKKTGRYEFRERPDGPVIHQGVSASKRRLE 304
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
+ A + +D+++G+V+ L++ ++ N+I+VF SDNGG+ A N PL+G K +
Sbjct: 305 YVASITCMDDAIGEVLGLLDEYQIADNTIVVFFSDNGGSG------GADNSPLKGKKGMM 358
Query: 341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+EGG+R L+ P G V ++ + + +PT L A
Sbjct: 359 FEGGIRVPCLVRYPAKIKPGTVNDELLTSLELVPTFLKEA 398
>gi|313228866|emb|CBY18017.1| unnamed protein product [Oikopleura dioica]
Length = 482
Score = 187 bits (474), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 125/391 (31%), Positives = 195/391 (49%), Gaps = 48/391 (12%)
Query: 23 QYLKELGYRTRIMAFAVLPLAFTL--------SMVFVDLVA--SSGPPHIIFILADDLGW 72
+YL E+ +RT+ + ++ F + S LVA SS P++I ILADDLG+
Sbjct: 36 RYLYEIRFRTKSVFLFLVRNPFCVRNPSNSFSSFFAAFLVAKGSSSKPNVILILADDLGF 95
Query: 73 NDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG 132
ND+ ++ I PN+ +LA +G IL N+Y +CTPSRSA+MT ++PI G+Q +V+
Sbjct: 96 NDMPWNN-PAIIAPNLHSLAKNGTILSNFYVQPVCTPSRSALMTSRYPIRLGLQTDVITA 154
Query: 133 CERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF 192
+ LPL E + + GY T IVGKWHLG Y + P RGF++ GY TG +DY+
Sbjct: 155 PQPSCLPLDEVTIGNEFQSAGYTTHIVGKWHLGHYCPQCLPNNRGFDTFRGYLTGAEDYY 214
Query: 193 DHSA-------EEMKMWGLDMRRDLEPAWDLHGKYSTDV-------------FTAEAVDI 232
+ + G D + +G YST F + ++
Sbjct: 215 KKTFCIPLVPNQRPAACGFDFYDNENRMPKANGTYSTYQVLIYLFIHTIILKFADASREV 274
Query: 233 IHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I +H + P FLYL + H P+ P +Y +++ + ++ R ++ ++ +DE+
Sbjct: 275 IKSHEGSKTPFFLYLPFQSVH-----YPVMVPKNYSDLYPNEKNHTRKEYMGMVTAMDEA 329
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
VG +V+ L+ N++ + +DNGG NA P RG K TLW+GG+R I
Sbjct: 330 VGHIVDELKAAGAYENTLFFYATDNGGLVGAGGRNA----PFRGQKATLWQGGLRVPAFI 385
Query: 352 WSPLLESRGIVAEQY---VHVSDWLPTLLSA 379
P + S A Y +H+ D T+L A
Sbjct: 386 SGPGVPS----ASNYKNMMHIVDVQMTVLDA 412
>gi|220919599|ref|YP_002494902.1| sulfatase [Methylobacterium nodulans ORS 2060]
gi|219952019|gb|ACL62410.1| sulfatase [Methylobacterium nodulans ORS 2060]
Length = 482
Score = 187 bits (474), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 118/351 (33%), Positives = 180/351 (51%), Gaps = 43/351 (12%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAI 114
++GPP+I+FI+ADDLG+ DVG +G + TP+ID +A G+ + Y +C+ +R+A+
Sbjct: 33 AAGPPNIVFIMADDLGYADVGCYGRPDLKTPHIDRIAARGVRFTQGYANSAVCSATRTAL 92
Query: 115 MTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
+TG++ Q+ + G E GGLP LP L+ GYRT +VGKWHLG
Sbjct: 93 ITGRY------QYRLPVGLEEPVTTRFTGGLPAEHPTLPGLLRRAGYRTALVGKWHLGSV 146
Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK--MWGLDMRRDLEPAWDLHGKYSTDVF 225
K + P RG+++ G+ G DYF H + K +W D+R + Y+T +
Sbjct: 147 AK-FGPLQRGYDTFWGFRGGTIDYFSHRGTDDKPDLWDGDVRVE-------EVGYATHLI 198
Query: 226 TAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH-------RHIEDFK 277
AV+ I P FL L H P+ P + P+ RH +
Sbjct: 199 GRRAVETIGTAVRAGRPFFLSL-----HFNAPHWPWETPEDEAESRRIAGSNLRHFDGGT 253
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
++ + I+ ++D +G+V++ALEQ + ++I+VF SDNGG + WP G K
Sbjct: 254 QATYHRIVAEMDAQIGRVLDALEQHGIAQDTIVVFTSDNGGE------RFSDTWPFTGRK 307
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
L EGG+R ++ P G+V+ Q + DWLPTLL+AA S P Y
Sbjct: 308 TELLEGGLRIPAIVSWPRRLPAGVVSAQPMITMDWLPTLLAAAGASPDPVY 358
>gi|410628682|ref|ZP_11339400.1| arylsulfatase B [Glaciecola mesophila KMM 241]
gi|410151686|dbj|GAC26169.1| arylsulfatase B [Glaciecola mesophila KMM 241]
Length = 510
Score = 187 bits (474), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 119/331 (35%), Positives = 187/331 (56%), Gaps = 28/331 (8%)
Query: 33 RIMAFAVLPLAFTLS-MVFV-DLVASSGP---------PHIIFILADDLGWNDVGFHGLD 81
R F++L +A L+ +VF DL A + P+I+ ILADDLG+ D+GF G
Sbjct: 19 RFNLFSLLFVALCLAGIVFSSDLAAQTKKTHKLVAKERPNIVVILADDLGYADLGFTGSK 78
Query: 82 QIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCE--RGGL 138
+I TPNIDALA +G++ KN Y T C PSR+ ++TG++ GM+ N + + GL
Sbjct: 79 EIFTPNIDALANNGVVFKNGYVTHPYCGPSRAGLLTGRYQARFGMEVNAAHSPDDPYMGL 138
Query: 139 PLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE 198
P+ E + +++ GY+T ++GKWH+G + + P RGF+ G+ G DYF S +
Sbjct: 139 PVEELTFAKRMQQAGYKTAVMGKWHMGSHPN-FHPNNRGFDEFFGFLGGGHDYFPESVKV 197
Query: 199 MKM-WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
+ + + R+ +PA +Y T + EA + +T++P +Y+A+ A HS
Sbjct: 198 SSAEYSIALSRNGKPAQ--LNEYLTTAISKEAARFVS--ATEQPFMMYVAYNAPHS---- 249
Query: 258 EPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG 317
PLQA + L ++HI D R +AA++ +D+ VG++V AL+Q L N++I F+SDNG
Sbjct: 250 -PLQATEQDLAKYQHIADLDRRTYAAMIDSMDQGVGRIVAALKQSEKLHNTLIFFLSDNG 308
Query: 318 GAAA---GFNLNAASNWPLRGVKNTLWEGGV 345
G + + A+N P R K +L EGGV
Sbjct: 309 GVYPEEWMPDSDWANNAPFRRGKVSLTEGGV 339
>gi|347528892|ref|YP_004835639.1| putative arylsulfatase [Sphingobium sp. SYK-6]
gi|345137573|dbj|BAK67182.1| putative arylsulfatase [Sphingobium sp. SYK-6]
Length = 458
Score = 186 bits (473), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 129/377 (34%), Positives = 188/377 (49%), Gaps = 30/377 (7%)
Query: 31 RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
R + A LPLA + A + PPHI+FI+ADDLG+ D+ F G + TP+ID+
Sbjct: 5 RAFLGTAAALPLAARVE------AAPARPPHIVFIMADDLGYADLSFTGRTEYATPHIDS 58
Query: 91 LAYSGIILKNYYT-VQLCTPSRSAIMTGKHP--IHTGMQHNVLYGCERGGLPLSEKILPQ 147
+A G + Y +C+P+R A+ TG++ + G+ + G E GLP +
Sbjct: 59 IAEQGCFFSDAYANSSVCSPTRMALATGRYQYRLRGGLDEPIGGGPEPVGLPPEHPTIAS 118
Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR 207
L+E GYRT +VGKWHLG ++P G++ G G DYF HS + M+ G
Sbjct: 119 RLREAGYRTVLVGKWHLG-SPPRFSPLLSGYDRFFGIHGGGADYFTHS-QSMQGGGSSDL 176
Query: 208 RDLEPAWDLHGKYSTDVFTAEAV-DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP-DH 265
D D HG Y TD+ T AV +I T +PLFL L H P+ P + P D
Sbjct: 177 YDGSSVTDRHG-YLTDILTERAVAEIDLAARTRKPLFLSL-----HYTAPHWPWEGPEDE 230
Query: 266 YLNIH----RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA 321
L + + S FA ++ LD+ VGKV+ ALE + M ++I+VF SDNGG
Sbjct: 231 ALAAQIKSLLNYDGGSDSTFAKMVQALDKGVGKVLAALEAQGMTQDTIVVFTSDNGGE-- 288
Query: 322 GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ WP G+K L EGG+R ++ P G + Q D++PT L+ A
Sbjct: 289 ----RFSKVWPFDGMKGDLLEGGIRVPMMMRWPGKIPAGSRSAQIAMTMDYMPTFLATAG 344
Query: 382 KSDIPNYVNSTVENIIP 398
S P + + + N++P
Sbjct: 345 GSADPAFPSDGI-NLMP 360
>gi|109897214|ref|YP_660469.1| sulfatase [Pseudoalteromonas atlantica T6c]
gi|109699495|gb|ABG39415.1| sulfatase [Pseudoalteromonas atlantica T6c]
Length = 500
Score = 186 bits (473), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 120/332 (36%), Positives = 171/332 (51%), Gaps = 28/332 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+I+F+LADDLG+NDVGF+G I TPN+D LA +G+ Y C PSR+AIMTG+
Sbjct: 40 PNILFVLADDLGYNDVGFNGSTDIKTPNLDGLAKNGMTFDAAYVAHPFCGPSRAAIMTGR 99
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+P G Q N+ G+ E + Q +K GY T +GKWHLG EY P GF
Sbjct: 100 YPHKIGAQFNLPEDNSNVGVSADELFIAQTMKSAGYFTGAMGKWHLG-EASEYHPNKHGF 158
Query: 179 ESHLGYWTGHQDYFDHSAEE-------MKMWGLDMRRDLEPAWDLHGK------YSTDVF 225
+ G+ G +YF E M ++M L P + +GK Y TD
Sbjct: 159 DEFYGFLGGGHNYFPEQFEAAYNKRVAQGMTNINMY--LTPL-EHNGKEVRETEYITDGL 215
Query: 226 TAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAI 284
+ EAV+ + + + P FLYLA+ A P+ PLQA + + + I+D KR +A +
Sbjct: 216 SREAVNFVDKAAAKKKPFFLYLAYNA-----PHVPLQAKEEDMAMFSQIKDKKRRTYAGM 270
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
++ +D VG++VE L++ N++IVF SDNGG A+N+PL+ K ++ EGG
Sbjct: 271 VYAVDRGVGRIVEQLKKNGQFDNTVIVFTSDNGGKLG----QGANNYPLKEGKGSVQEGG 326
Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
R L+ P G V D PT
Sbjct: 327 FRTPMLVHWPKHMKAGSRFSHPVLALDLYPTF 358
>gi|149197396|ref|ZP_01874447.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139414|gb|EDM27816.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 465
Score = 186 bits (472), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 122/355 (34%), Positives = 194/355 (54%), Gaps = 33/355 (9%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P++I I+ADDLG+NDVGF+G +IPTP ID++A +G+ N YT +C PSR+ +TG+
Sbjct: 21 PNLIVIMADDLGYNDVGFNGCTEIPTPGIDSIAQNGVKFTNGYTSYSVCGPSRAGFITGR 80
Query: 119 HPIHTGMQHNVLYGCE--RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+ G + N + LP SE + + L ++GY I+GKWHLG + P R
Sbjct: 81 YQQRFGFERNPQWNLTDPNSALPKSEMTIAESLTQVGYHCGIIGKWHLG-AEPSLRPNKR 139
Query: 177 GFESHLGYWTGHQDYF------DHSAE---EMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
GF+ G+ G + H+ E E+ + + R+ P KY T+ F+
Sbjct: 140 GFDEFFGHLGGGHRFMPEDLVIQHTEEVKNELDSYRSWITRNDTPVKTT--KYLTEEFSD 197
Query: 228 EAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
EAV I NH +P FL+L++ A P+ PLQA + YL HI+D KR +AA++
Sbjct: 198 EAVSFIKRNHQ--KPFFLFLSYNA-----PHLPLQATEKYLARFPHIKDPKRKTYAAMVS 250
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
+D+ V +V+++L++ + N+I+ F+SDNGG + + N + N+PL+G K+ +WEGG R
Sbjct: 251 AVDDGVSQVMQSLKETNIADNTIVFFLSDNGGPS---HKNKSDNFPLKGQKSDVWEGGFR 307
Query: 347 GAGLIWSPLLESRGIVAEQ-YVHVSDWLPTLLSAANKSDIPNYVNSTVE--NIIP 398
+ P I A+Q Y H L + A+ + P + + ++ N+IP
Sbjct: 308 VPFAMQYP----AAIQAKQVYDHPVSSLDIFATIASLAQSPTHADKPLDGVNLIP 358
>gi|294647729|ref|ZP_06725288.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294809280|ref|ZP_06767994.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
gi|292636934|gb|EFF55393.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294443524|gb|EFG12277.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
Length = 458
Score = 186 bits (472), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 122/350 (34%), Positives = 187/350 (53%), Gaps = 30/350 (8%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F L TL F L A + P+I+ +LADDLGW DVGFHG +I TP++DAL G+
Sbjct: 5 FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPSLDALVGEGV 61
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
L+ +YT + TP+R+ +MTG++P G++ V+ GL +E+ + L GY+
Sbjct: 62 ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121
Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
R I+GKWHLG KK + P RG G+ G DYFD + E LD D E D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGLSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
YST++ T EA+ I + + P LY+A+ A H+ PLQA + +I + ++
Sbjct: 178 --KGYSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228
Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
F K++ ++A++ +D +G +V+AL+++ ++ N+ +F SDNG A +
Sbjct: 229 FDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284
Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+S+ PLRG K W+GG A L W + ++ Q D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGGHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334
>gi|340369111|ref|XP_003383092.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 507
Score = 186 bits (471), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 186/362 (51%), Gaps = 32/362 (8%)
Query: 42 LAFTLSMVFV-DLVASSGP----PHIIFILADDLGWNDVGFH---GLDQIPTPNIDALAY 93
LA + VF+ L + + P PHI+ +L DD GW +VG+H ++ TPNID L
Sbjct: 2 LAVLVVGVFLLGLASGASPNANLPHIVMMLVDDWGWANVGYHRNPPTREVVTPNIDKLVK 61
Query: 94 SGIILKNYYTVQLCTPSRSAIMTGKHPIHTG---------MQHNVLYGCERGGLPLSEKI 144
G+ L +Y + C+PSRS++++G+ PIH ++ + G +P +
Sbjct: 62 EGLELNQHYAFRFCSPSRSSLISGRLPIHVNDLNLAPTNYNPNDPVSGFS--AIPRNMTG 119
Query: 145 LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGL 204
+ + ++ GY T VGKW G ++TP RGF++ GY+ DY+ A + K+ +
Sbjct: 120 IAEKMRGAGYATHQVGKWDAGMATSDHTPKGRGFQTSFGYFHHANDYYTEIAGKCKIPIV 179
Query: 205 DMRRDLEPAWDLHGK----YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPL 260
D+ +PA ++G Y +F + ++ NH PLFLY A H+ P
Sbjct: 180 DLWETDKPAHGINGTGPDNYEEGLFKERVLSVVSNHDPSTPLFLYYAPHIVHA-----PY 234
Query: 261 QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
+ PD YLN I++ R + A+++ LD+ VG++ +AL+ + M N + V SDNGG
Sbjct: 235 EVPDRYLNKFSFIDNHLRQYYHAMVNYLDDVVGELTDALKNKGMWDNLLFVTSSDNGGPV 294
Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLS 378
A+N+PL+G K T W+GGVR + L + RG + Y+H++DW T +
Sbjct: 295 --HEGTGANNYPLKGGKTTDWQGGVRVNAFVSGGYLPEKMRGQKTDGYIHLADWYGTFCA 352
Query: 379 AA 380
A
Sbjct: 353 IA 354
>gi|156368526|ref|XP_001627744.1| predicted protein [Nematostella vectensis]
gi|156214663|gb|EDO35644.1| predicted protein [Nematostella vectensis]
Length = 157
Score = 185 bits (470), Expect = 8e-44, Method: Composition-based stats.
Identities = 78/157 (49%), Positives = 111/157 (70%), Gaps = 2/157 (1%)
Query: 38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
AV F SM+ ++ PPHI+FI DDLGW+DVG+H + + TPNID LA G++
Sbjct: 2 AVFFYVFVASMLTA--CHAAKPPHIVFIFVDDLGWSDVGYHNITDLKTPNIDRLAGEGVV 59
Query: 98 LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
L+NYY +CTP+R ++G++PIHTG+QH+ ++ E GLPL +LPQ LK+ GY T
Sbjct: 60 LENYYVQPICTPARGTFLSGRYPIHTGLQHSNIHETEPFGLPLDFTLLPQKLKKAGYSTH 119
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH 194
VGKWHLGF++KEYTP +RGF++ GY++G +D++ H
Sbjct: 120 AVGKWHLGFFEKEYTPLYRGFDTFFGYYSGSEDHYTH 156
>gi|6863178|gb|AAF30403.1|AF109925_1 sulfatase 2 precursor [Helix pomatia]
Length = 266
Score = 185 bits (470), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 101/268 (37%), Positives = 151/268 (56%), Gaps = 42/268 (15%)
Query: 42 LAFTLSMVFVD-----LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
LA+ S + V+ + A P+I+ ++ADD G+ D+G+HG + TPN+D LA G+
Sbjct: 5 LAWLSSFLIVNCLRMVICAKQQQPNIVIVVADDYGYRDIGYHG-AEFATPNLDKLAAEGV 63
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
L+NYY +C+P+RS +MTG++ IHTG+QH++++ + GLPL + LK +GY T
Sbjct: 64 KLENYYVQPICSPTRSQLMTGRYQIHTGLQHDIIWPSQPYGLPLQFPTIADMLKSVGYST 123
Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH---------------------- 194
+GKWHLG YKKEYTP +RGF+S+ GY G +DY+ +
Sbjct: 124 HAIGKWHLGLYKKEYTPLYRGFDSYYGYLEGGEDYYTYYNCDTFHNRTTPADTSILESYS 183
Query: 195 --------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST-DEPLFLY 245
+E K G D+R EP D++G YST ++T +A+DII+ ST +P LY
Sbjct: 184 PKNILLGKHEDENKWCGYDLRDMNEPVTDMNGTYSTHLYTKKAIDIINGASTGGKPFLLY 243
Query: 246 LAHAATHSANPYEPLQAPDHYLNIHRHI 273
LA+ A HS P++ P Y + I
Sbjct: 244 LAYQAVHS-----PMEVPAEYTKPYTFI 266
>gi|300771261|ref|ZP_07081137.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
gi|300761931|gb|EFK58751.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
Length = 466
Score = 185 bits (469), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 113/354 (31%), Positives = 187/354 (52%), Gaps = 26/354 (7%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
+ FTL M+ V++ P+II IL+DD G+ D G +G IPTP+IDALA G+
Sbjct: 7 IVFLFTLFMLTTGSVSAQQRPNIILILSDDGGYEDFGCYGSQDIPTPHIDALAKGGVRFT 66
Query: 100 N-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVL------YGCERGGLPLSEKILPQYLKEL 152
N Y T +C PSR+ ++ G++ +G +HNV Y + GL + + + ++
Sbjct: 67 NSYVTASVCAPSRAGLLMGQYQQRSGFEHNVSDLPADGYQMQDIGLSDTVRTIADQMQSN 126
Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF---DHSAEEMKMWGLDMRRD 209
GY T +GKWH G K + P +GF G+ GH+ +F +E K+ D
Sbjct: 127 GYETMAIGKWHQGNETKHH-PLHKGFNHFFGFIGGHRSFFPIRTAIKQEEKILNDYTEVD 185
Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
+ + Y TD+FT +A+ + D+P F+YL++ A H+ P++A L
Sbjct: 186 EKDVY-----YLTDMFTDKAISYM-RQKRDKPYFIYLSYNAVHT-----PVEATPQKLAQ 234
Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
H++D +R +A ++ +D+ +G++VE L++ L N++++F++DNG A N A
Sbjct: 235 FAHLKDAQRRSYAGLMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAATN----NGAD 290
Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
N PLRG+K + WEGG+R ++ P G + V D +PT ++AA +
Sbjct: 291 NGPLRGLKGSKWEGGIRVPMIMHWPKQLPAGKDYKGLVSSLDLVPTCVAAAQGT 344
>gi|388257121|ref|ZP_10134301.1| sulfatase [Cellvibrio sp. BR]
gi|387939325|gb|EIK45876.1| sulfatase [Cellvibrio sp. BR]
Length = 484
Score = 185 bits (469), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 115/297 (38%), Positives = 171/297 (57%), Gaps = 21/297 (7%)
Query: 60 PHIIFILADDLGWNDVGF-HGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTG 117
P+I+ ILADDLG+NDVGF +G +I TP +DALA G++ +N Y T C PSR+ ++TG
Sbjct: 28 PNIVVILADDLGYNDVGFTNGQTEIKTPRLDALANEGVVFENGYVTHPYCGPSRAGLITG 87
Query: 118 KHPIHTGMQHNVLYGCE--RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
++ GM++NV Y + GLPL+EK P L+E+GY+T I GKWHLG + P
Sbjct: 88 RYQARFGMENNVTYSPDDKYMGLPLTEKTFPARLQEVGYKTAIFGKWHLGG-APHFQPNE 146
Query: 176 RGFESHLGYWTGHQDYFD---HSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
RGF+ G+ G +Y H + + MR +D +Y T + +A
Sbjct: 147 RGFDYFYGFLDGGHNYMPGEVHLGAGGYLLPI-MRNKGVAEFD---EYLTTALSRDAARY 202
Query: 233 IHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I S ++ P F+Y+++ A H+ PLQAP +YL + HI+D KR +AA++ +DE
Sbjct: 203 IERTSKEQAPFFIYMSYNAPHA-----PLQAPQNYLEKYAHIKDEKRRTYAAMVDAMDEG 257
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGA---AAGFNLNAASNWPLRGVKNTLWEGGV 345
+G +++AL++ L N++I F+SDNGG + + A N P R K L EGGV
Sbjct: 258 IGMLIDALDKSGELDNTLIFFLSDNGGLYPYSWWPDYTWADNSPFRFGKVALTEGGV 314
>gi|449692603|ref|XP_004213101.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
Length = 181
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 92/182 (50%), Positives = 131/182 (71%), Gaps = 2/182 (1%)
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
WHLGF+ K+YTPT+RGF+S+ G + G DY++HS E GLD+ + + G YS
Sbjct: 1 WHLGFFAKQYTPTYRGFDSYYGSYLGKGDYWNHSNTETYS-GLDLHDNENGVFSQDGNYS 59
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE-PLQAPDHYLNIHRHIEDFKRSK 280
T+++TAEA+ I+NH++ EPLFLYLA+ A HSAN E PLQAP +++ +I+ +R K
Sbjct: 60 TEMYTAEAISCINNHNSSEPLFLYLAYQAVHSANTEEDPLQAPQEWIDKFSYIKHEQRRK 119
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
+AA+L +D VG+V +AL +++ML NSII+F +DNGG A GF+ N A+N+PLRGVK TL
Sbjct: 120 YAAMLGYMDYGVGRVHDALAEKKMLDNSIIIFTTDNGGPANGFDYNWANNFPLRGVKATL 179
Query: 341 WE 342
+E
Sbjct: 180 FE 181
>gi|187735676|ref|YP_001877788.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
gi|187425728|gb|ACD05007.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
Length = 465
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 120/339 (35%), Positives = 183/339 (53%), Gaps = 22/339 (6%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIM 115
S PP++I ILADDLG+ D+G G QI TP++D LA G+ + Y T +C+PSR ++
Sbjct: 26 SSPPNMIVILADDLGYGDLGCTGSKQIKTPSLDRLAREGVFCSRAYVTAPMCSPSRMGLL 85
Query: 116 TGKHPIHTGMQHN----VLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
TG+ P G+ N + Y E GLP +EK++P+YL GYR+ + GKWHLG + K
Sbjct: 86 TGRFPKRYGITTNPNIQMDYLPESHYGLPQTEKLIPEYLAPCGYRSAVFGKWHLG-HTKG 144
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
YTP RGF G+ G + YF E + + + D+ Y TD T AV
Sbjct: 145 YTPPERGFTHWWGFLGGSRHYFPVKKEAEGLNPSMIVSNFTDKTDI--TYLTDDITDRAV 202
Query: 231 DIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
+ + D +P F+++++ A H N +P + R++++ +R + A+++ +D
Sbjct: 203 EFLQEAGKDKKPFFMFVSYNAPHWPNEAKP-----EDIAKFRNVQNGERRVYCAMVYAMD 257
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
+G++++AL+ + ++I+VF+SDNGGA + NA P RG K +EGGVR
Sbjct: 258 RGIGRILDALKADGLEKDTIVVFLSDNGGAPEASSCNA----PFRGAKRQHFEGGVRVPF 313
Query: 350 LIWSPLLESR--GIVAEQYVHVSDWLPTLLSAANKSDIP 386
+I P + G V Q V D LP LL AN IP
Sbjct: 314 IIRYPADKRLVPGSVCRQPVSSVDLLPALLK-ANGRHIP 351
>gi|418530519|ref|ZP_13096442.1| sulfatase [Comamonas testosteroni ATCC 11996]
gi|371452238|gb|EHN65267.1| sulfatase [Comamonas testosteroni ATCC 11996]
Length = 440
Score = 184 bits (467), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 120/359 (33%), Positives = 179/359 (49%), Gaps = 27/359 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I+FILADDLGW D+G +G TP++D LA G+ Y +C+ +R A++TG+
Sbjct: 4 PNIVFILADDLGWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGR 63
Query: 119 HP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+ + G++ + + GLP LP LK+ GY T ++GKWHLG Y P
Sbjct: 64 YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPAYGPQLS 122
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
G++ G +G DYF H + + D+ + EP Y T + EA +H
Sbjct: 123 GYDYFFGNHSGAIDYFTHKPGVGEQFSSDLWQGNEPV--ERTGYYTYILGDEATRYVHER 180
Query: 237 -STDEPLFLYLAHAATHSANPYEPLQAPD-----HYLNIHRHIEDFKRSKFAAILHKLDE 290
D+P FL L H P+ P + PD H + H + K+ I+ LD+
Sbjct: 181 KGQDKPFFLSL-----HFTAPHWPWEGPDDEHVAHSIKDLFHYDGGSLKKYGEIVEALDK 235
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
+VG+V++AL+ N+I++F SDNGG + WP G K L EGG+R L
Sbjct: 236 AVGQVLQALDDSGQADNTIVIFTSDNGGE------RFSKTWPFTGQKTELLEGGIRVPTL 289
Query: 351 I-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYE 408
+ WS ++ + V EQ DWLPTLL+AA P++ S +NI+P E + E
Sbjct: 290 LRWSARIQPQ--VQEQVTASFDWLPTLLAAAGARQHPDF-PSDGQNILPILEGHAAQTE 345
>gi|299530773|ref|ZP_07044188.1| sulfatase [Comamonas testosteroni S44]
gi|298721289|gb|EFI62231.1| sulfatase [Comamonas testosteroni S44]
Length = 440
Score = 184 bits (467), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 120/359 (33%), Positives = 179/359 (49%), Gaps = 27/359 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I+FILADDLGW D+G +G TP++D LA G+ Y +C+ +R A++TG+
Sbjct: 4 PNIVFILADDLGWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGR 63
Query: 119 HP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+ + G++ + + GLP LP LK+ GY T ++GKWHLG Y P
Sbjct: 64 YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPAYGPQLS 122
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
G++ G +G DYF H + + D+ + EP Y T + EA +H
Sbjct: 123 GYDYFFGNHSGAIDYFTHKPGVGEQFSSDLWQGNEPV--ERTGYYTYILGDEATRYVHER 180
Query: 237 -STDEPLFLYLAHAATHSANPYEPLQAPD-----HYLNIHRHIEDFKRSKFAAILHKLDE 290
D+P FL L H P+ P + PD H + H + K+ I+ LD+
Sbjct: 181 KGQDKPFFLSL-----HFTAPHWPWEGPDDEHVAHSIKDLFHYDGGSLKKYGEIVEALDK 235
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
+VG+V++AL+ N+I++F SDNGG + WP G K L EGG+R L
Sbjct: 236 AVGQVLQALDDSGQADNTIVIFTSDNGGE------RFSKTWPFTGQKTELLEGGIRVPTL 289
Query: 351 I-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYE 408
+ WS ++ + V EQ DWLPTLL+AA P++ S +NI+P E + E
Sbjct: 290 LRWSARIQPQ--VQEQITASFDWLPTLLAAAGARQHPDF-PSDGQNILPILEGHAAQTE 345
>gi|326428402|gb|EGD73972.1| arylsulfatase B [Salpingoeca sp. ATCC 50818]
Length = 545
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 126/396 (31%), Positives = 192/396 (48%), Gaps = 47/396 (11%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+FI+ DD G+++ G + TPN+D LA G++L Y+ CTPSRS+ ++G+
Sbjct: 41 PHIVFIMIDDWGFHNFGIRNQTEAKTPNMDKLARDGLLLDQAYSYFWCTPSRSSFLSGRL 100
Query: 120 PIHT-----------GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
P+H QH G G+P + +P ++++ GY+T +VGKW G
Sbjct: 101 PLHVFHSNRVSSASWDSQHPDTAGV---GIPRNMTTIPAFMRKAGYKTHMVGKWDAGIAT 157
Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEM--------------KMWGLDMRRDLEPAW 214
+++P RGF+S L Y+ +Y+ ++ + +D+ EPA
Sbjct: 158 PQHSPLGRGFDSSLHYFNHDNNYYAYNYTDTVSVQFPVKCQLLKYVTGFVDLWNSTEPAD 217
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
G Y VF A+D+I H PLFLY A H+ PLQ P YL+ +HI
Sbjct: 218 APIGTYEEHVFRDHALDVISKHDASTPLFLYYASHIAHA-----PLQVPQAYLDRFQHIP 272
Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA----GFN---LNA 327
D R + A++ D+ VG + AL+ + M N+++V SDNGGA G +
Sbjct: 273 DPIRRTYQAMVLAADDVVGNITAALKAKGMWENTLLVVHSDNGGAIYQCLDGMDASLCGG 332
Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAAN-KSD 384
ASNWPLRG K + ++GG+R + L S RG V V ++DW+ T A +
Sbjct: 333 ASNWPLRGGKLSPFQGGIRVNAFVNGGFLPSHRRGQVETGLVVLADWMATFCELAGVACE 392
Query: 385 IPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIE 420
P + + I + S+L N T SPR+E
Sbjct: 393 DPIAKKANLPAIDSQSLWSLLSGANST----SPRVE 424
>gi|167523060|ref|XP_001745867.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775668|gb|EDQ89291.1| predicted protein [Monosiga brevicollis MX1]
Length = 221
Score = 183 bits (465), Expect = 3e-43, Method: Composition-based stats.
Identities = 94/207 (45%), Positives = 131/207 (63%), Gaps = 16/207 (7%)
Query: 51 VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPS 110
V VA+ PH++FI+ DDLG++D+ F G QI TPNIDAL G++ Y +C+PS
Sbjct: 21 VHAVAAGTKPHVLFIVIDDLGYDDLYFRG-HQIRTPNIDALQEEGLLFTQMYMQDVCSPS 79
Query: 111 RSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
R++I++G++ +H G+ + + + GL L++ L ++E GY TR VGKWH+GFYK
Sbjct: 80 RASILSGRYAMHHGVT-DWIPPRDSYGLMLNDTTLADKMREAGYDTRAVGKWHMGFYKWA 138
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE---------PAWDLHGKYS 221
YTPTFRGF S LGY++G +DYF H + DM RD PAWDL G+YS
Sbjct: 139 YTPTFRGFNSFLGYYSGGEDYFTHETDN----AYDMHRDEGRHCGPNCSIPAWDLKGQYS 194
Query: 222 TDVFTAEAVDIIHN-HSTDEPLFLYLA 247
T +F+ EA+ II+ + D PLFLYLA
Sbjct: 195 TTIFSEEAIRIINQRQAADPPLFLYLA 221
>gi|449677596|ref|XP_004208885.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
Length = 193
Score = 183 bits (465), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 82/144 (56%), Positives = 105/144 (72%), Gaps = 3/144 (2%)
Query: 69 DLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGM--- 125
+LG+NDV FHG QIPTPNID +A G+IL NYY + +CTPSRSAIMTG++PIHTG+
Sbjct: 44 NLGFNDVSFHGSKQIPTPNIDKIAKEGVILNNYYVLPICTPSRSAIMTGRYPIHTGIFYH 103
Query: 126 QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYW 185
++ ++ G+ L EK LPQYLK +GY+T +GKWHLGF+ KEYTPT+RGF+S GY+
Sbjct: 104 TYDTIFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTPTYRGFDSFYGYY 163
Query: 186 TGHQDYFDHSAEEMKMWGLDMRRD 209
G DY+DHS WGLD+ D
Sbjct: 164 GGQADYWDHSLASNGWWGLDLHYD 187
>gi|242002216|ref|XP_002435751.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215499087|gb|EEC08581.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 433
Score = 183 bits (465), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 107/267 (40%), Positives = 146/267 (54%), Gaps = 25/267 (9%)
Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
+TGMQH VL E GLPL+ K+LP++L +LGY K L + + F
Sbjct: 1 YTGMQHFVLMEMEPRGLPLNFKLLPEWLGDLGYVHNQTTKQTLLHKELRFVYAFPYLRR- 59
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEP 241
G D RR L + G+Y T++ T EA+ II H ++P
Sbjct: 60 ---------------------GFDFRRGLLGGREFDGRYYTELMTEEAIRIIETHPLEKP 98
Query: 242 LFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQ 301
LFLYL+H A HS N +PLQ P + +++ I R+ +A ++ LDESVG VVEAL +
Sbjct: 99 LFLYLSHLAPHSGNLRDPLQVPKKFSDMYHDIGSRNRTIYAGMVSALDESVGAVVEALGR 158
Query: 302 RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE-SRG 360
R ML ++++VF SDNG + G N N AS WP +G K T WEGGVR +IWSP L ++G
Sbjct: 159 RGMLGDTVLVFSSDNGANSRGENPNYASAWPFKGQKITPWEGGVRAPAIIWSPSLSGTQG 218
Query: 361 IVAEQYVHVSDWLPTL--LSAANKSDI 385
H+SDWLPTL L+ + SD+
Sbjct: 219 RDYSNIFHISDWLPTLYQLAGGDPSDL 245
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 64/145 (44%), Gaps = 17/145 (11%)
Query: 566 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQ 625
D S +IDGI L + R +L NID ++A+ +G +KLV
Sbjct: 239 GGDPSDLGDIDGISQLEALRHGTEAPRKEVLINIDPIDNVTAIIEGHFKLVSGTVKGGAL 298
Query: 626 VDLTGGPDQVYLSGLSDREWL--ALAMRKLRDAA-SIQCG--------PVK------EVP 668
D P V R ++A + LR++ + CG PVK P
Sbjct: 299 DDWFQIPGNVAWDTDRTRHECETSVAAQVLRNSGHDVTCGSGQGSYAIPVKCGRRDTSKP 358
Query: 669 CEPQIAPCLFDIKNDPCEKNNLADR 693
C P +APCLFD+ DPCE NN+A++
Sbjct: 359 CTPTLAPCLFDLSEDPCEYNNVAEQ 383
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 30/45 (66%), Gaps = 2/45 (4%)
Query: 762 AASIQCGPVKEV-PCEPQIAPCLFDIKNDPCEKNNLADR-SEVQR 804
A ++CG PC P +APCLFD+ DPCE NN+A++ SEV R
Sbjct: 345 AIPVKCGRRDTSKPCTPTLAPCLFDLSEDPCEYNNVAEQHSEVVR 389
>gi|251798133|ref|YP_003012864.1| sulfatase [Paenibacillus sp. JDR-2]
gi|247545759|gb|ACT02778.1| sulfatase [Paenibacillus sp. JDR-2]
Length = 434
Score = 182 bits (462), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 118/339 (34%), Positives = 180/339 (53%), Gaps = 35/339 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+II DDLG+ D+G +G D + TP++D LA GI N+Y+ +C+PSR++++TGK
Sbjct: 4 PNIIVFYCDDLGYGDLGCYGSDAMKTPHLDQLASEGIRFTNWYSNSPVCSPSRASLLTGK 63
Query: 119 HPIHTGMQHNVLYGCERG--GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+P G+ + G +RG GL L + L LKE GY T + GKWHLG EY P
Sbjct: 64 YPAKAGV--TSILGGKRGTKGLSLEQTTLASALKEHGYHTALFGKWHLG-ASAEYGPNAH 120
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGL--------DMRRDLEPAWDLHGKYSTDVFTAE 228
GF+ G+ G DY+ H + WG D+ R+ W+ +G+Y T+ T E
Sbjct: 121 GFDQFYGFRAGCIDYYSH----IFYWGQGGGVNPVHDLWRNETEVWE-NGEYMTEAITRE 175
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
A I DEP F+Y+A+ A P+ P+ AP YL+ + R AA++ +
Sbjct: 176 ATSYIDAAPDDEPYFMYVAYNA-----PHYPMHAPKAYLDRFPDLPP-DRRIMAAMIAAV 229
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--------LNAASNWPLRGVKNTL 340
D+ VG++V+AL+Q+ ++II F SDNG + N S RG K +L
Sbjct: 230 DDGVGEIVKALKQKGAYEDTIIFFSSDNGPSTESRNWLDGTEDLYYGGSAGRFRGHKASL 289
Query: 341 WEGGVRGAGLIWSP--LLESRGIVAEQYVHVSDWLPTLL 377
+EGG+R ++ P L E +G ++++ + D PT+L
Sbjct: 290 FEGGIREPAILSYPAGLAEQQGQISDEMFAMMDIFPTML 328
>gi|149199999|ref|ZP_01877025.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149136872|gb|EDM25299.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 512
Score = 182 bits (462), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 121/368 (32%), Positives = 191/368 (51%), Gaps = 54/368 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMT-- 116
P+II I ADD+G++DVG+HG +I TPNID++A G+ + Y + +C PSR+ ++T
Sbjct: 21 PNIILIFADDMGYDDVGYHGNKRIITPNIDSIAEQGVQFSQGYVSASVCGPSRAGLLTGV 80
Query: 117 -------GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
G++P +G + + Y GLP S+ ++ + LK LGY ++GKWH+GF
Sbjct: 81 YQQRFGCGENPNGSGYPNQMKY--PMAGLPQSQSMISEELKTLGYTNGMIGKWHMGF-DM 137
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPA-----WDLHGK-- 219
P RG++ G+ G DY + + E K W + ++EPA D+ +
Sbjct: 138 SLRPNQRGYDFFYGFINGSHDYTEWTQEFAKGKSRWPIFRNEEMEPANKAQYIDVFKEKG 197
Query: 220 -------YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
Y TD+FT EAV+ I + + D+P FLYLA+ A H P Q H L+ H
Sbjct: 198 VKVVDENYLTDLFTDEAVNFI-DRNADKPFFLYLAYNAVH-----HPWQTTQHALDKTAH 251
Query: 273 IEDFKRSK-FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA----------- 320
++D K FA++++ +DE +GKV++ L+++ + N+II+F+SDNG
Sbjct: 252 LKDDKNYHVFASMVYAMDEGIGKVMKKLKEKNIDDNTIIIFLSDNGSPQGQGIEHSPKDP 311
Query: 321 ----AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
GF + +S RG K +EGG+R I P +G + + D PTL
Sbjct: 312 NRHRGGFTM--SSTGIFRGYKGDTYEGGIRVPFCIKWPQQIQKGTKYDMPISALDLQPTL 369
Query: 377 LSAANKSD 384
+ AA +D
Sbjct: 370 VKAAGGND 377
>gi|296124181|ref|YP_003631959.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296016521|gb|ADG69760.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 470
Score = 182 bits (462), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 124/347 (35%), Positives = 179/347 (51%), Gaps = 22/347 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
P++I ADDLGW + G G QIPTP+ID++A +G+ + + C+PSR+ ++TG+
Sbjct: 41 PNVIIFYADDLGWGETGIQGNPQIPTPHIDSIAKNGVRCTQGFVAATYCSPSRAGLLTGR 100
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+P G + N + GL L E L L LGY+T VGKWHLG EY PT RGF
Sbjct: 101 YPTRFGHEFNRIANVS--GLDLQETTLADRLHGLGYKTACVGKWHLG-DGPEYRPTKRGF 157
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+ G T F H K + D+ D Y+TD + +V+ I
Sbjct: 158 DEFFG--TLANTPFFHPT---KFVDSRVSNDVAEVSD-ENFYTTDEYAKRSVEWI-GQQQ 210
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
P FLYL A H+ PLQAP YL+ I D KR FAA++ +D+++G+V+
Sbjct: 211 QSPWFLYLPFNAQHA-----PLQAPQKYLDRFESIADPKRKLFAAMMSAMDDAIGQVLGK 265
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLE 357
+ + N+++ F+SDNGG G + N PLRG K T +EGG R L+ W L
Sbjct: 266 VRELGQEENTLVFFISDNGGPTQG---TTSQNGPLRGFKMTTFEGGTRVPFLVQWKGKLP 322
Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSI 404
+ G + V D LPT+L+AA P + V+ ++P + +SI
Sbjct: 323 A-GKTYDNPVINLDVLPTVLTAAGSKIDPAWKLDGVD-LVPYFTSSI 367
>gi|449138314|ref|ZP_21773584.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
gi|448883087|gb|EMB13631.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
Length = 485
Score = 182 bits (462), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 124/387 (32%), Positives = 199/387 (51%), Gaps = 61/387 (15%)
Query: 42 LAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
L TL+M+ V + AS+ P+I+ I++DD G+ND+G G + I TP +D LA G L N
Sbjct: 11 LFTTLAMLLVAIKASAADRPNILLIVSDDQGYNDLGQLG-NGIITPALDRLANEGTRLTN 69
Query: 101 YYTV-QLCTPSRSAIMTGKHPIHTGM------------------QHNVLYGCER-GGLPL 140
+Y CTPSR++++TG++P G+ Q+ V + ER GG+
Sbjct: 70 FYVAWPACTPSRASLLTGRYPQRNGIYDMIRNEAPDYGHRYTPDQYAVTF--ERIGGMDE 127
Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
E I+P LK GY++ I GKW LG ++ P RGF+ G+ DYF H +
Sbjct: 128 REVIIPAVLKRAGYKSGIYGKWDLGALRR-MLPNSRGFDDFYGFVNTGIDYFTHERYGVP 186
Query: 201 MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP- 259
M R+LEP + G Y T +F EA+ + H+ +EP FLY+ A H+++ +P
Sbjct: 187 C----MVRNLEPTEEDKGTYCTYLFQREALRFLDEHAGEEPFFLYVPFNAPHNSSSLDPK 242
Query: 260 ----LQAPDHYLNIHRHIED--------------------FKRSKFAAILHKLDESVGKV 295
+QAP+ + ++ +E +R + A + +D ++G++
Sbjct: 243 IRSSVQAPERFKAMYPPVESETRVTDRYRYGTPATVATPQARRRDYRAAVTCMDAAIGEM 302
Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSP 354
++ LE ++ML ++I+VF SDNGG+ A N PLRG K WEGG+R ++ W+
Sbjct: 303 LDRLEAKQMLEDTIVVFFSDNGGSG------GADNSPLRGHKAQTWEGGIRVPCIVRWTA 356
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAAN 381
G V ++++ + LP+ SAA
Sbjct: 357 GKIPAGAVNDEFLTSLELLPSFASAAG 383
>gi|241244123|ref|XP_002402253.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215496292|gb|EEC05932.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 196
Score = 182 bits (461), Expect = 9e-43, Method: Composition-based stats.
Identities = 96/223 (43%), Positives = 123/223 (55%), Gaps = 42/223 (18%)
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH 217
I+G WHLG+YK+ YTPT RGFES GY+ +DY+
Sbjct: 10 ILGAWHLGYYKEAYTPTERGFESFYGYYNFGEDYY------------------------- 44
Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
NH+ D PLFLYL+H A H+ Y P+QAP+ + +I
Sbjct: 45 -----------------NHTLDLPLFLYLSHLAVHAGTQYGPIQAPEENVQKFDYIGVKN 87
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
RS +A + LD+SVG V +ALE++ ML N+++VF+SDNG G NA SNWPLRG K
Sbjct: 88 RSLYAGAVDALDQSVGIVFQALEKKDMLQNTVLVFLSDNGAIPWGEWSNAGSNWPLRGAK 147
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
TLWEGGVR +WSPLL V+ Q +HVSDW+PTL S A
Sbjct: 148 FTLWEGGVRVPAFVWSPLLRKPQRVSNQLMHVSDWVPTLYSLA 190
>gi|296121201|ref|YP_003628979.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296013541|gb|ADG66780.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 479
Score = 182 bits (461), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 129/365 (35%), Positives = 197/365 (53%), Gaps = 27/365 (7%)
Query: 51 VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTP 109
V+ + SG P+I+ I+ADDLG+ D+G G +IPTP++D LA SGI N Y + C+P
Sbjct: 29 VNQTSKSGRPNILVIMADDLGYADLGVQGGCEIPTPHLDQLAASGIRCTNAYVSAPYCSP 88
Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
SR+ +TGK+ G + N G E + GLPL E + L+ GYRT ++GKWH GF
Sbjct: 89 SRAGFLTGKYQTRFGHEFNPHVGEEAKLGLPLEEVTIANLLQTEGYRTALIGKWHQGF-S 147
Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGL---DM---RRDLEPAWDLHGKYST 222
K++ P RGF+ G+ G +Y H + + DM R++EP Y+T
Sbjct: 148 KDHHPQSRGFDEFFGFLVGGHNYLLHKEVKARFGTAHSHDMIYRGREVEP----QEGYAT 203
Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQ-APDHYLNIHRHIEDFKRSKF 281
D+FT EA+ + ++P FLYL++ A H+ PL+ AP I ++ R +
Sbjct: 204 DLFTNEALRWMSG-PPNKPWFLYLSYNAVHT-----PLEIAPHLQKRIPESVKLPARRGY 257
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA--AAGFNLNAASNWPLRGVKNT 339
++L LD+S+G++ + L Q + ++I+F+SDNGG+ A N+ N PLRG K
Sbjct: 258 LSLLAGLDDSIGRITQHLSQHGLREKTLIIFLSDNGGSGRAPILAYNSGLNHPLRGDKGQ 317
Query: 340 LWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLS-AANKSDIPNYVNSTVE--N 395
EGG+R + W L +R I EQ + D LPT+ AAN P + ++ N
Sbjct: 318 TLEGGIRVPFFVSWPGQLPARTIY-EQPIISLDLLPTVCQLAANNPAKPQPLPQGIDGVN 376
Query: 396 IIPRY 400
++P +
Sbjct: 377 LMPYW 381
>gi|167520280|ref|XP_001744479.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776810|gb|EDQ90428.1| predicted protein [Monosiga brevicollis MX1]
Length = 314
Score = 182 bits (461), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 111/309 (35%), Positives = 168/309 (54%), Gaps = 25/309 (8%)
Query: 74 DVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGC 133
DV F + IPTP+ID LA SG+ L++YY QLC+P+R+++++G++ GM V+
Sbjct: 21 DVAFGPM--IPTPSIDELALSGVRLESYYVNQLCSPTRTSLLSGRYAYTLGMNAEVIVDG 78
Query: 134 ERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFD 193
LPL+ K + ++L+E G+ T GKW G TPT RGF++ G++ DYF
Sbjct: 79 HPDQLPLTVKTIGEHLQEGGWATFAGGKWDAGMTSWGCTPTCRGFDTFSGFYNAFNDYFT 138
Query: 194 HSAEEMKMWGLDMRRDLEP--AWDLH-GKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHA 249
H LD+R D P A D H G Y T++ T+ I + T++ F Y+AH
Sbjct: 139 HRVGNY----LDLRHDFAPDLADDNHTGVYMTELLTSRVQQFITTAAETEQSTFAYVAHQ 194
Query: 250 ATHSANPYEPLQAPDHYLN---IHRHIEDF-KRSKFAAILHKLDESVGKVVEALEQRRML 305
A H+ N Q P YL + ED R ++ +DESV + +Q +
Sbjct: 195 AVHAPN-----QVPMSYLEGYCLDTIPEDRPTRRILCGMMRAVDESVRNITATYKQLGLW 249
Query: 306 SNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVA 363
++++++F +DNGG SN+PLRG K T +EGG+RG G + SPLL RG ++
Sbjct: 250 NDTVLIFTTDNGGNPE----TGGSNYPLRGQKATTFEGGMRGVGFVNSPLLNESQRGFIS 305
Query: 364 EQYVHVSDW 372
++ +HVSDW
Sbjct: 306 DELIHVSDW 314
>gi|241601300|ref|XP_002405277.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215502504|gb|EEC11998.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 234
Score = 182 bits (461), Expect = 1e-42, Method: Composition-based stats.
Identities = 94/225 (41%), Positives = 136/225 (60%), Gaps = 2/225 (0%)
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW-DL 216
I+ +WHLG+Y +Y P RGF++ LG+++ DY+ + G D + W D
Sbjct: 9 ILLQWHLGYYDSKYVPVNRGFDTFLGFYSPALDYYTQNFTFDNHTGHDFWNGDKIYWVDK 68
Query: 217 HGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
K YST +T + V +I H+ PLFL L+H A H++ LQ P + +I++
Sbjct: 69 EKKQYSTHYYTRKTVQLIQVHNKSTPLFLLLSHQAPHTSGGPTLLQVPKKGIRDFSYIQE 128
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
R+ FA ++ LD+SVG VV+AL++ ML+++I+VF +DNG GF N SNWPLRG
Sbjct: 129 ENRTLFAGMVDALDQSVGSVVDALQKADMLTDTILVFCTDNGALPWGFKSNRGSNWPLRG 188
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
K TL+EGGVR IWSPL+ + V+ Q +H++DWLPTL SAA
Sbjct: 189 TKFTLYEGGVRSTAFIWSPLVNNSKRVSSQLMHITDWLPTLYSAA 233
>gi|343084004|ref|YP_004773299.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342352538|gb|AEL25068.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 445
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 117/332 (35%), Positives = 180/332 (54%), Gaps = 33/332 (9%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I+ I+ADDLG+ D+ +G + I TPN+D LA G++ +Y++ +C+P+R+A+MTGK
Sbjct: 28 PNIVLIMADDLGYGDLSCYGNEYINTPNLDLLASEGVLFTDYHSNGSVCSPTRAALMTGK 87
Query: 119 HPIHTGMQHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
+ TG++ V R GL L+E L + LK+LGY T + GKWHLG Y K + PT +G
Sbjct: 88 YQQRTGVEGVVTAKSHRDVGLALAEVTLAEELKQLGYNTGMFGKWHLG-YDKAFNPTLQG 146
Query: 178 FESHLGYWTGHQDYFDHSAEE--MKMW-GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
F+ +G+ +G+ DY H +E + W G+ ++ + Y+TD+ + V I
Sbjct: 147 FDEFVGFVSGNVDYHGHIDQEGYLDWWDGVKIKNE--------KGYTTDLISEYGVKFIQ 198
Query: 235 NHSTD---EPLFLYLAHAATHSANPYEPLQAPDHYL------NIHRHIEDFKRSKFAAIL 285
H+ + P FLYL H A HS PY+ + D L I+D S + ++
Sbjct: 199 EHNPEVKRAPFFLYLPHEAPHS--PYQ--RRIDKVLREIGTAGTQEVIQDSISSIYKEMV 254
Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
+DE VG+++++L++ N+I++F+SDNG G N LRG K +EGG
Sbjct: 255 EVMDEGVGRIMQSLKETGQYENTIVIFISDNGANHYG------DNGGLRGFKAGPYEGGS 308
Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
R + P G V Q V D LPTLL
Sbjct: 309 RVPAIFSFPKEVKGGSVNNQTVLSMDLLPTLL 340
>gi|445495948|ref|ZP_21462992.1| sulfatase [Janthinobacterium sp. HH01]
gi|444792109|gb|ELX13656.1| sulfatase [Janthinobacterium sp. HH01]
Length = 471
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 115/334 (34%), Positives = 168/334 (50%), Gaps = 24/334 (7%)
Query: 53 LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSR 111
L A P+IIFILADDLG D+G +G I TPN+D LA GI + Y +C+ +R
Sbjct: 26 LAAGKEQPNIIFILADDLGHADLGVYGQTDIRTPNLDKLAGQGIRFTQAYANSAVCSATR 85
Query: 112 SAIMTGKHP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
A++TG++ + G++ + + GLP + LP LK+ GY T ++GKWHLG Y
Sbjct: 86 FALITGRYQYRLRGGLEEPIAGASDTLGLPRTHPTLPSLLKKQGYGTALIGKWHLG-YLP 144
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
+ P G++S G + G DYF H D+ P + Y TD+ A A
Sbjct: 145 TFGPLKSGYDSFFGNYGGAIDYFTHKPGVGPQVKEDLYEGEVPVHQI--GYYTDLLGARA 202
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF------KRSKFAA 283
VD + +P L L H P+ P + P I R I++ + +
Sbjct: 203 VDFVQKQQAGKPFLLSL-----HYTAPHWPWEGPGDE-AISRDIDNIFHYDGGDLATYGR 256
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
++ LD S+G+V++ALE + + N+I++F SDNGG + WPL G K L EG
Sbjct: 257 MVESLDASIGRVLKALEDKGLAENTIVIFTSDNGGE------RFSKTWPLTGQKTELLEG 310
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
G+R ++ P + G V+EQ DWLPTLL
Sbjct: 311 GIRVPAIVRWPAKIAAGQVSEQVAISMDWLPTLL 344
>gi|264677229|ref|YP_003277135.1| sulfatase [Comamonas testosteroni CNB-2]
gi|262207741|gb|ACY31839.1| sulfatase [Comamonas testosteroni CNB-2]
Length = 440
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 119/353 (33%), Positives = 177/353 (50%), Gaps = 27/353 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I+FILADDLGW D+G +G TP++D LA G+ Y +C+ +R A++TG+
Sbjct: 4 PNIVFILADDLGWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGR 63
Query: 119 HP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+ + G++ + + GLP LP LK+ GY T ++GKWHLG Y P
Sbjct: 64 YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPSYGPQRS 122
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
G++ G +G DYF H + + D+ + EP Y T + EA +H
Sbjct: 123 GYDYFFGNHSGAIDYFTHKPGVGEQFSPDLWQGSEPV--ERTGYYTYILGDEATRYVHER 180
Query: 237 -STDEPLFLYLAHAATHSANPYEPLQAPD-----HYLNIHRHIEDFKRSKFAAILHKLDE 290
D+P FL L H P+ P + PD H + H + K+ I+ LD+
Sbjct: 181 KGQDKPFFLSL-----HFTAPHWPWEGPDDEHVAHNIKDLFHYDGGSLKKYGEIVEALDK 235
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
+VG+V++AL+ N+I++F SDNGG + WP G K L EGG+R L
Sbjct: 236 AVGQVLQALDDTGQADNTIVIFTSDNGGE------RFSKTWPFTGQKTELLEGGIRVPTL 289
Query: 351 I-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
+ WS ++ + V EQ DWLPTLL+AA P++ S +NI+P E
Sbjct: 290 LRWSARIQPQ--VQEQVTVSFDWLPTLLAAAGARPHPDF-PSDGQNILPILEG 339
>gi|241176208|ref|XP_002399501.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215495150|gb|EEC04791.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 418
Score = 181 bits (459), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 104/260 (40%), Positives = 146/260 (56%), Gaps = 5/260 (1%)
Query: 124 GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG 183
G+QH + E GLPL ++PQ+ K+LGY T +VGK +G + T R + H
Sbjct: 1 GLQHLTIEEGEAWGLPLRFTLMPQHFKKLGYATHMVGK--VGQFIISDTSCRRYVKRHTQ 58
Query: 184 YWTGHQDYFDHSAEE---MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE 240
+ + +E GLD+ + G Y TD+ T +AV II +H T +
Sbjct: 59 KSCSQRCKRERKVKEPCGGDHIGLDLWDNEVAVRTECGNYDTDMLTDKAVSIIASHDTTK 118
Query: 241 PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALE 300
PLFLYL+H + HSA PLQAP+ + +I + R+ +A ++ +D++ G++VEAL
Sbjct: 119 PLFLYLSHRSAHSAFTSAPLQAPEKNIAKFGYIGERNRTIYAGMVDSMDDAFGRLVEALS 178
Query: 301 QRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRG 360
+L+N+I+VF SDNG N NWPLRG K TLWEG VR IWSPLLES G
Sbjct: 179 GAGVLNNTIVVFSSDNGAVPIIEFPNRGFNWPLRGAKKTLWEGAVRVPAFIWSPLLESSG 238
Query: 361 IVAEQYVHVSDWLPTLLSAA 380
V++Q +H+ DWLPT S A
Sbjct: 239 RVSDQMMHIVDWLPTFYSVA 258
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 54/125 (43%), Gaps = 19/125 (15%)
Query: 570 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLT 629
S+ E DG D+W LS S R +L NID S+L ++K KVV
Sbjct: 263 SHLGEQDGFDMWKALSEGTKSPREEMLLNIDPVLNTSSL---RYKNHKVVL--------- 310
Query: 630 GGPDQVYLSGLSDREWLALAMRK-LRDAASIQCGPVKEVP--CEPQIAPCLFDIKNDPCE 686
Y +G D + R R A+++CG +P P LFD+ DPCE
Sbjct: 311 ----GTYHNGSFDHRFKTTGRRPGWRRRATVRCGECNLAKDNVKPSSPPYLFDVVADPCE 366
Query: 687 KNNLA 691
NNLA
Sbjct: 367 LNNLA 371
>gi|440716877|ref|ZP_20897380.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
gi|436438070|gb|ELP31646.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
Length = 485
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 122/374 (32%), Positives = 187/374 (50%), Gaps = 56/374 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+I+ I++DD G+ND+G G + I TP +D LA G L N+Y CTPSR++++TG+
Sbjct: 30 PNILLIVSDDQGYNDLGQLG-NGIITPALDRLAKEGTRLTNFYVAWPACTPSRASLLTGR 88
Query: 119 HPIHTGMQHNVL-----YG-----------CER-GGLPLSEKILPQYLKELGYRTRIVGK 161
+P G+ + YG ER GG+ E ILP L+ GY++ I GK
Sbjct: 89 YPQRNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAVLRPAGYKSGIFGK 148
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
W LG ++ PT RGF+ G+ DYF H + M R+LEP G Y
Sbjct: 149 WDLGALQR-MLPTSRGFDDFYGFVNTGIDYFTHERYGVPC----MVRNLEPTEADKGTYC 203
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP-----LQAPDHYLNIHRHIE-- 274
T +F EA+ + H+ +EP FLY+ A H+++ +P +QAPD + ++ +E
Sbjct: 204 TYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLDPTIRSSVQAPDQFKAMYPPVEVE 263
Query: 275 ------------------DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
+R + A + +D ++G++++ LE ++ML N+I+VF SDN
Sbjct: 264 TRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEMLDRLEAKQMLDNTIVVFFSDN 323
Query: 317 GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPT 375
GG+ A N PLRG K WEGG+R L+ W G+V +++ + LP+
Sbjct: 324 GGSG------GADNSPLRGHKAQTWEGGIRVPCLVRWPAGKIPAGVVNNEFLTSLELLPS 377
Query: 376 LLSAANKSDIPNYV 389
AA P V
Sbjct: 378 FAIAAGVEPPPGVV 391
>gi|319952005|ref|YP_004163272.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
gi|319420665|gb|ADV47774.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
Length = 484
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 122/356 (34%), Positives = 177/356 (49%), Gaps = 30/356 (8%)
Query: 38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
L A L ++ + + P+I+ IL DDLG+NDVGF+G I TPN+D LA G I
Sbjct: 9 CALVSAVVLFSSCINAQSQTEQPNILLILCDDLGYNDVGFNGSTDITTPNLDQLAQDGTI 68
Query: 98 LKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNV-LYGCERG-GLPLSEKILPQYLKELGY 154
+ Y C PSR+A++TG++P G Q N+ G G G+ + EK + +++ GY
Sbjct: 69 FTSAYVAHPFCGPSRAALLTGRYPHTLGSQFNLPANGASTGKGISVEEKFMGVPMQKAGY 128
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKM-------WGLDMR 207
T +GKWHLG EY P RGF G+ G YF EE K+ G
Sbjct: 129 YTGAIGKWHLG-ETAEYHPNKRGFNDFYGFLGGGHKYF---PEEYKLQYKHQKEMGTKNI 184
Query: 208 RDLEPAWDLHGK------YSTDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPL 260
D + +G Y TDV + E + H +P FLYLA+ A P+ PL
Sbjct: 185 NDYVLPLEHNGAIVEENDYLTDVLSREGIRFTKEAHDKKKPFFLYLAYNA-----PHVPL 239
Query: 261 QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
+A + L + IED R +AA+++ +D VG++V +L++ N++I+F+SDNGG
Sbjct: 240 EAKEKDLEKFKDIEDIDRRTYAAMVYAVDRGVGEIVASLKKTGQFDNTLIIFLSDNGGHT 299
Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+ A+N+PL G K WEGG R P +G + V D PT+
Sbjct: 300 G----HGANNYPLTGRKGDTWEGGFRVPMFFHWPKKIKKGQKFDYPVSALDLYPTI 351
>gi|32475143|ref|NP_868137.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH 1]
gi|32445684|emb|CAD78415.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH 1]
Length = 485
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 121/374 (32%), Positives = 187/374 (50%), Gaps = 56/374 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+I+ I++DD G+ND+G G + I TP +D LA G L N+Y CTPSR++++TG+
Sbjct: 30 PNILLIVSDDQGYNDLGQLG-NGIITPALDRLAKEGTRLTNFYVAWPACTPSRASLLTGR 88
Query: 119 HPIHTGMQHNVL-----YG-----------CER-GGLPLSEKILPQYLKELGYRTRIVGK 161
+P G+ + YG ER GG+ E ILP L+ GY++ I GK
Sbjct: 89 YPQRNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAVLRPAGYKSGIFGK 148
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
W LG ++ PT RGF+ G+ DYF H + M R+LEP G Y
Sbjct: 149 WDLGALQR-MLPTSRGFDDFYGFVNTGIDYFTHERYGVPC----MVRNLEPTEADKGTYC 203
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP-----LQAPDHYLNIHRHIE-- 274
T +F EA+ + H+ +EP FLY+ A H+++ P +QAPD + ++ +E
Sbjct: 204 TYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLVPTIRSSVQAPDQFKAMYPPVEVE 263
Query: 275 ------------------DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
+R + A + +D ++G++++ LE ++ML +I+VF SDN
Sbjct: 264 TRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEILDRLEAKQMLDETIVVFFSDN 323
Query: 317 GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPT 375
GG+ A N PLRG K WEGG+R L+ W G+V ++++ + LP+
Sbjct: 324 GGSG------GADNSPLRGHKAQTWEGGIRVPCLVRWPAGQIPAGVVNDEFLTSLELLPS 377
Query: 376 LLSAANKSDIPNYV 389
+AA P V
Sbjct: 378 FAAAAGVEPPPGVV 391
>gi|227540472|ref|ZP_03970521.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
spiritivorum ATCC 33300]
gi|227239796|gb|EEI89811.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
spiritivorum ATCC 33300]
Length = 466
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 112/354 (31%), Positives = 185/354 (52%), Gaps = 26/354 (7%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
+ FTL M+ V++ P+II IL+DD G+ D G +G IPTP+IDALA GI
Sbjct: 7 IVFLFTLFMLTTGSVSAQQRPNIILILSDDGGYEDFGCYGSQDIPTPHIDALAKGGIRFT 66
Query: 100 N-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVL------YGCERGGLPLSEKILPQYLKEL 152
N Y T +C PSR+ ++ G++ +G +HNV Y + GL + + + ++
Sbjct: 67 NSYVTASVCAPSRAGLLMGQYQQRSGFEHNVSDLPADGYQIQDIGLSDTVRTIADQMQSN 126
Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF---DHSAEEMKMWGLDMRRD 209
GY T +GKWH G K + P +GF G+ GH+ +F +E K+ D
Sbjct: 127 GYETMAIGKWHQGNETKHH-PLHKGFNHFFGFIGGHRSFFPIRTAIKQEEKILNDYTEVD 185
Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
+ + Y TD+FT +A+ + D+P F+YL++ A H+ P++A L
Sbjct: 186 EKDVY-----YLTDMFTDKAISYM-RQKRDKPYFIYLSYNAVHT-----PVEATPQKLAQ 234
Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
+++ R +A ++ +D+ +G++VE L++ L N++++F++DNG A N A
Sbjct: 235 FARLKNAHRRSYAGLMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAATN----NGAD 290
Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
N PLRG+K + WEGG+R ++ P G + V D +PT ++AA +
Sbjct: 291 NGPLRGLKGSKWEGGIRVPMIMHWPKQLQAGKDYKGLVSSLDLVPTCVAAAQGT 344
>gi|326433715|gb|EGD79285.1| hypothetical protein PTSG_12912 [Salpingoeca sp. ATCC 50818]
Length = 562
Score = 180 bits (457), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 123/405 (30%), Positives = 192/405 (47%), Gaps = 56/405 (13%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGL----DQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
S+ PPHI+ +L DDLGW DVG+H I TP ID L GI L+ +Y ++C+P+R
Sbjct: 49 SNQPPHILMVLVDDLGWADVGYHRSGPHKSDIQTPTIDKLVSQGIALERHYVHKVCSPTR 108
Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
++ +G+ P+H G+ V+ R G+P + + Q+L + GY + VGKW +G +
Sbjct: 109 ASFQSGRLPVH-GIDGQVVLCAPRAGIPENMTTVAQHLNKAGYASHFVGKWDVGMATPSH 167
Query: 172 TPTFRGFESHLGY------------WTGHQDYFDH-----SAEEMK-MWGLDMRRDLEPA 213
TP RG+ + L Y W G Q+ H + + K W D PA
Sbjct: 168 TPHGRGYNTSLNYFGHANWMWNQDEWQGSQNNVSHRPPCKAPDCFKDFWDTD-----RPA 222
Query: 214 WDLHGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
+L G Y +F DII H +PLFL A H PLQAP Y +
Sbjct: 223 HNLSGTLYEEQLFVQRITDIIEAHDPSQPLFLTYASKVAH-----YPLQAPIEYQQQFAN 277
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA----- 327
IE R + A++ LD+++ + ++++ M +N+++VF SDNGG F+ +
Sbjct: 278 IEPPSRRVYHAMVKYLDDNLYYITTLMQKKGMWNNTLMVFSSDNGGPVMAFDGDCDASHP 337
Query: 328 ------------ASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWL 373
A+N+PLRG K + +EGG+R + L+ + RG +H++DW
Sbjct: 338 SRGYMCYNGEAGANNYPLRGGKYSFFEGGIRVNAFVSGGLIPADQRGSNRTGIMHIADWY 397
Query: 374 PTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPR 418
T + A P + N+ P +++ Y G + SPR
Sbjct: 398 ATFCALAGVD--PTDHRAAAANLPPIDSHNMWPYLAGQARF-SPR 439
>gi|294053770|ref|YP_003547428.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613103|gb|ADE53258.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 491
Score = 180 bits (457), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 115/334 (34%), Positives = 177/334 (52%), Gaps = 35/334 (10%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
M + + L T + ++++ P+I+ ILADDLG++D+G+ G +I +P ID LA +
Sbjct: 1 MTWFRIQLLATFACALTAVLSADERPNILIILADDLGYSDLGYTGSTEIESPVIDKLANN 60
Query: 95 GIILKN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGC--ERGGLPLSEKILPQYLKE 151
G+I N Y T C PSR+ ++TG+H GM+ N Y + GLP+ E + L+
Sbjct: 61 GVIFANGYVTHPYCGPSRAGLITGRHQARFGMEINATYSPFDQHMGLPVDEPTFAKRLQP 120
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF-----------------DH 194
GYRT I+GKWHLG ++ P RGF+ G+ +G DYF ++
Sbjct: 121 AGYRTGIIGKWHLG-AAPQFHPNNRGFDYFYGFLSGGHDYFPESVNTHLELVLPNGKPNY 179
Query: 195 SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA 254
A E + L +R +D Y T + +A + S+++P LYLA+ A H+
Sbjct: 180 GANEGTLLPL-LRNKNAAEFD---DYLTTALSKDAARFVT--SSEQPFCLYLAYNAPHT- 232
Query: 255 NPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
PLQAP + + HI+D KR +AA++ ++D +G VV+ALEQ L N++I F+S
Sbjct: 233 ----PLQAPKETIAKYSHIKDPKRRIYAAMIDEMDAGIGLVVDALEQSGKLDNTLIFFLS 288
Query: 315 DNGGA---AAGFNLNAASNWPLRGVKNTLWEGGV 345
DNGG A + A N P + K + +EGG
Sbjct: 289 DNGGCVPQAWDSYSDFADNGPFKDGKTSFYEGGT 322
>gi|355689580|gb|AER98880.1| galactosamine -6-sulfate sulfatase [Mustela putorius furo]
Length = 503
Score = 180 bits (457), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 176/351 (50%), Gaps = 43/351 (12%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRS 112
V + PP+++ +L DD+GW D+G +G TPN+D +A G++ N+Y+ LC+PSR+
Sbjct: 6 VGAPKPPNLVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRA 65
Query: 113 AIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
A++TG+ PI G N E GG+P E++LP+ LK GY ++IVGKWHLG
Sbjct: 66 ALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPAEERLLPELLKGAGYASKIVGKWHLG 125
Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS---- 221
++ ++ P RGF+ G H +D+ A + + RD W++ G+Y
Sbjct: 126 -HRPQFHPLKRGFDEWFGSPNCHFGPYDNRARP----NIPVYRD----WEMVGRYYEEFP 176
Query: 222 ----------TDVFTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
T ++T EA+D + H+ P FLY A ATH+ P+ A +L
Sbjct: 177 INLKTGEANLTQLYTQEALDFVQRQHAARRPFFLYWAIDATHA-----PVYASRPFLGTS 231
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
R R ++ + ++D+SVGK++ L R+ ++ + F SDNG A SN
Sbjct: 232 R------RGRYGDAVREVDDSVGKILSLLRDLRIAEDTFVFFTSDNGAALISAPKQGGSN 285
Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
P K T +EGG+R + W P G V+ Q + D T LS A
Sbjct: 286 GPFLCGKQTTFEGGMREPAIAWWPGRVPAGQVSHQLGSIMDLFTTSLSLAG 336
>gi|325108643|ref|YP_004269711.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324968911|gb|ADY59689.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 484
Score = 180 bits (456), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 122/388 (31%), Positives = 197/388 (50%), Gaps = 59/388 (15%)
Query: 42 LAFTLSMVFVDL-VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
L TL+ + A + PP+I+ I++DD G+ND+G D I TP++D LA G L +
Sbjct: 9 LCLTLAAFCLPFSCAEAAPPNILLIVSDDQGYNDLGVLNSDLI-TPHLDRLAAEGTRLTD 67
Query: 101 YYTV-QLCTPSRSAIMTGKHPIHTGMQHNVL-----YG-----------CER-GGLPLSE 142
+Y CTPSR++++TG++P G+ + YG ER GG+ E
Sbjct: 68 FYVAWPACTPSRASLLTGRYPQRNGIYDMIRNEAPDYGYKYKPAEYEVSFERIGGMDQRE 127
Query: 143 KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW 202
K+LP+YLK+LGY + I GKW LG K+ + PT RGF+ G+ DYF H + +
Sbjct: 128 KLLPEYLKKLGYTSAIFGKWDLGSLKR-FLPTNRGFDEFYGFVNTGIDYFTH-----ERY 181
Query: 203 GL-DMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP-- 259
G+ M R + G+Y+T++F EA+ + EP LYL A H+++ +P
Sbjct: 182 GVPSMFRQTSLTEEDRGEYATELFKREALAFLDRAEASEPFLLYLPFNAPHNSSSLDPRI 241
Query: 260 ---LQAPDHYLNIHRHIEDFKRS--------------------KFAAILHKLDESVGKVV 296
+QAP+ Y ++ ++ R+ + A + +D ++G+V+
Sbjct: 242 RSTVQAPEEYKAMYPAVDPEYRTVNRYRYGNPAQVVTPVARQRDYRAAVTCMDAAIGEVL 301
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPL 355
LE + +L N+I++F SDNGG+ A N PLRG K +WEGG+R L+ W
Sbjct: 302 GKLEDKGVLDNTIVIFFSDNGGSG------GADNSPLRGHKAQVWEGGIRVPCLVRWPDG 355
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAANKS 383
G + ++ + +P+L+ AA +
Sbjct: 356 GVPAGATNDSFLSSLELVPSLVKAAGGT 383
>gi|340367689|ref|XP_003382386.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 493
Score = 180 bits (456), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 170/336 (50%), Gaps = 35/336 (10%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A S P+ +F+L DD G+ DV F I +PN + LA G+IL +Y + C+PSR++
Sbjct: 17 AFSTKPNFVFVLVDDWGYADVSFRN-PAIHSPNFEKLAKEGLILDRHYVFKYCSPSRASF 75
Query: 115 MTGKHPIHTGMQH---NVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
+TG+ P H + + L G L ++P LK Y+T ++GKWH G YK+ Y
Sbjct: 76 LTGRWPHHAHQWNPPEDALVGAN-----LKMTMIPAKLKLARYKTHMIGKWHEGLYKEAY 130
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAE-EMKMW---GLDMRRDLEPAWDLHGKYSTDVFTA 227
P RGF++ G+ G +++ + W G D R +G Y +
Sbjct: 131 LPINRGFDTMSGFLGGGENHMNQQVGCATDFWKNDGPDSR---------NGSYDAYTYRD 181
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK-RSKFAAILH 286
+ DII NH+ +P FLYL H+ PLQAPD +LNI+ + R + A++
Sbjct: 182 DLTDIITNHNPSDPFFLYLPLHNVHT-----PLQAPDEWLNIYPANSTCQNRRIYQAMVS 236
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
D G +V+ L+ M N+++V +DNGGA G SN+PL+G K T +EGGVR
Sbjct: 237 VADNVTGHLVKLLQDNGMWDNTLLVISADNGGAPCG-----GSNYPLKGSKGTFYEGGVR 291
Query: 347 GAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAA 380
+ LL + +G + ++H++DW T A
Sbjct: 292 SLAFVNGGLLPTNRKGKSTQGFIHIADWYTTFCQLA 327
>gi|340373299|ref|XP_003385179.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 508
Score = 179 bits (455), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 118/378 (31%), Positives = 191/378 (50%), Gaps = 47/378 (12%)
Query: 43 AFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFH---GLDQIPTPNIDALAYSGIIL 98
AF L L P P+I+ +L DD GW +VG+H ++ TPNID+L G+ L
Sbjct: 7 AFILLCTTACLAVPPPPGPNIVMMLVDDWGWANVGYHRNPPTKEVVTPNIDSLVRQGLEL 66
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNV---LYGCER-----GGLPLSEKILPQYLK 150
+Y +C+PSRS++M+G+ PIH N+ Y + +P + + Q +K
Sbjct: 67 DQHYVFNVCSPSRSSLMSGRLPIHVN-DLNIEPDYYNPDDPVSGFSAIPRNMTGIAQKMK 125
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFD-------HSAEEMKMWG 203
GY T VGKW G +TP RGF+S GY+ D++ + + + +W
Sbjct: 126 LGGYDTHQVGKWDAGMATHTHTPKGRGFDSSFGYFHHANDFYTEIDGKPCNKTKIVDIWV 185
Query: 204 LDMRRDLEPAWDLHGK----YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
D +P + L+G Y +F + + +++ H T +PLFLY A H+ P
Sbjct: 186 TD-----KPGYGLNGTGPDNYEEGLFKEQLLKVVNEHDTGKPLFLYYAPHIVHA-----P 235
Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
LQ P Y + I+D R + A+++ LD+ VG++V+AL+++ + N + V SDNGG
Sbjct: 236 LQVPQRYQDKFSFIDDHDRQIYHAMVNYLDDVVGELVDALKKKGLWENLLFVTSSDNGGP 295
Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLL 377
+ A+N+PL+G K + W+GG+R + L + RG + Y+H++DW T
Sbjct: 296 V--YPGGGANNYPLKGGKASDWQGGIRSNAFVSGGYLPEKMRGKKLDGYIHLADWYATFC 353
Query: 378 SAAN---------KSDIP 386
+ AN K+D+P
Sbjct: 354 ALANVDPTDEAAAKADLP 371
>gi|149199924|ref|ZP_01876952.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149136993|gb|EDM25418.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 455
Score = 179 bits (454), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 136/371 (36%), Positives = 197/371 (53%), Gaps = 32/371 (8%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGL--DQIPTPNIDAL 91
I +F L LA L + FV L S P+II IL+DD G+ DV ++ D I TP+ DAL
Sbjct: 2 INSFTKLFLAL-LCVNFVALADSK--PNIIVILSDDQGYADVSYNPEHDDYISTPHTDAL 58
Query: 92 AYSGIIL-KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGL--PLSEKILPQY 148
A SG+I + Y + +C+ +RS +MTG++ Q +Y GG L+ K +P Y
Sbjct: 59 AKSGVIFHRGYTSGSVCSTTRSGLMTGRY-----QQRYGIYTAGEGGTGTDLNAKFIPNY 113
Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT-GHQDYFDHSAEEMKMWGLDMR 207
LKE GY++ GKWHLG ++ +Y P RGF+ G+ G D+F E +G +
Sbjct: 114 LKEAGYKSMAFGKWHLG-HEMKYHPLHRGFDDFYGFMGRGAHDFFRLEKEYDGKFGGPIY 172
Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
R LEP D Y T T E V I + D+P F Y+A+ A H+ P QAP
Sbjct: 173 RGLEPIDD--KGYLTTRITEETVKFIEENK-DKPFFAYVAYNAVHT-----PAQAPAE-- 222
Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
+I D R A+L LD VG++V+ L++ + N+II+++SDNGGA +
Sbjct: 223 DIKAVSGDETRDILVAMLKHLDLGVGEIVKTLKKHDIYENTIIIYLSDNGGAKS----MV 278
Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
A+N PLRGVK+ +++GG+R L+ P G + V D LPTLL AA +P
Sbjct: 279 ANNKPLRGVKHDIYDGGIRVPFLMSWPAQIKAGQDTQSPVISLDILPTLLDAAG---LPA 335
Query: 388 YVNSTVENIIP 398
+ E+++P
Sbjct: 336 LSDIDGESMLP 346
>gi|340369113|ref|XP_003383093.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 374
Score = 179 bits (454), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 108/324 (33%), Positives = 170/324 (52%), Gaps = 28/324 (8%)
Query: 43 AFTLSMVFVDLVASSGP----PHIIFILADDLGWNDVGFH---GLDQIPTPNIDALAYSG 95
+ + + L + + P PHI+ +L DD GW +VG+H ++ TPNID L G
Sbjct: 4 VLAVGVFLLGLASGASPNANLPHIVMMLVDDWGWANVGYHRNPPTCEVVTPNIDKLVKEG 63
Query: 96 IILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL---YGCER-----GGLPLSEKILPQ 147
+ L +Y + C+PSRS++++G+ PIH Q N+L Y + +P + + +
Sbjct: 64 LELNQHYVYKFCSPSRSSLISGRLPIHVNDQ-NLLPTNYNPDDPVSGFSAIPRNMTGIAE 122
Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM-KMWGLDM 206
++ GY T VGKW G ++TP RGF++ GY+ DY+ +A + +D+
Sbjct: 123 KMRGAGYATHQVGKWDAGMATSDHTPKGRGFQTSFGYFHHANDYYKETASACNRTQIVDL 182
Query: 207 RRDLEPAWDLHGK----YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA 262
+PA ++G Y +F +D++ NH PLFLY A H+ PLQ
Sbjct: 183 WETDKPAHGINGTGPDNYEEGLFKERVLDVVSNHDPSTPLFLYYAPHIVHT-----PLQV 237
Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
PD YL + I D R + A+++ LD+ VG++ +AL+ + M N + V SDNGG
Sbjct: 238 PDRYLKLFSFINDHDRQYYHAMVNYLDDVVGEITDALKNKGMWDNLLFVTSSDNGGPV-- 295
Query: 323 FNLNAASNWPLRGVKNTLWEGGVR 346
+ A+N+PLRG K T W+GGVR
Sbjct: 296 YPGGGANNYPLRGGKVTDWQGGVR 319
>gi|388257120|ref|ZP_10134300.1| sulfatase [Cellvibrio sp. BR]
gi|387939324|gb|EIK45875.1| sulfatase [Cellvibrio sp. BR]
Length = 474
Score = 179 bits (453), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 113/357 (31%), Positives = 186/357 (52%), Gaps = 28/357 (7%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
+ L LA + + A P+I+FIL+DD G+ D GF G +IPTPN+D LA G+
Sbjct: 13 LSYLGLALLICSAWACYGAELAKPNIVFILSDDAGYADFGFQGSTEIPTPNLDQLAQEGV 72
Query: 97 ILKN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG--------CERGGLPLSEKILPQ 147
+ K Y + +C PSR+ ++TGK+P G + N + G + G+ L + +
Sbjct: 73 VFKQAYVSASVCGPSRAGLLTGKYPQRFGFEENNVPGYMSSSGATGDDMGMRLDQLTMAN 132
Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR 207
YL E GYRT ++GKWH G + + P RGF+ G+ G + YF + R
Sbjct: 133 YLAERGYRTSLIGKWHQG-NEDRFHPLKRGFDEFFGFRGGARSYFPFTQAHPS----SRR 187
Query: 208 RD-LEPAWDLHGK---YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
D LE ++ +G+ Y TD E ++ I + +P F +L+ +A P+ P++A
Sbjct: 188 EDFLERNFNNYGESPLYLTDALANETIEFI-KRNKHQPFFTFLSLSA-----PHYPMEAE 241
Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
L+ H+ +R +AA++ +D ++G+V++ L+Q + N++++F +DNGG +
Sbjct: 242 KSDLDQFPHLTG-ERKIYAAMMLNMDRAIGRVLQTLQQEGLSENTLVIFTNDNGGPS--- 297
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+ N + N PL G K L EGG+R ++ P + G + V D LPT +AA
Sbjct: 298 DHNGSINLPLSGTKANLLEGGIRVPMIMRWPGVTKPGSSYDPMVSTLDLLPTFYAAA 354
>gi|119587161|gb|EAW66757.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
mucopolysaccharidosis type IVA), isoform CRA_a [Homo
sapiens]
Length = 708
Score = 179 bits (453), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 113/362 (31%), Positives = 179/362 (49%), Gaps = 42/362 (11%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ N+
Sbjct: 13 LLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNF 72
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELG 153
Y+ LC+PSR+A++TG+ PI G + GG+P SE++LP+ LK+ G
Sbjct: 73 YSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG 132
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + RD
Sbjct: 133 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD---- 183
Query: 214 WDLHGKYS--------------TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
W++ G+Y T ++ EA+D I + P FLY A ATH+ P
Sbjct: 184 WEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----P 238
Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
+ A +L +R ++ + ++D+S+GK++E L+ + N+ + F SDNG A
Sbjct: 239 VYASKPFLGTS------QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAA 292
Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
SN P K T +EGG+R L W P + G V+ Q + D T L+
Sbjct: 293 LISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLAL 352
Query: 380 AN 381
A
Sbjct: 353 AG 354
>gi|160891516|ref|ZP_02072519.1| hypothetical protein BACUNI_03967 [Bacteroides uniformis ATCC 8492]
gi|317478375|ref|ZP_07937539.1| sulfatase [Bacteroides sp. 4_1_36]
gi|156858923|gb|EDO52354.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
gi|316905534|gb|EFV27324.1| sulfatase [Bacteroides sp. 4_1_36]
Length = 525
Score = 179 bits (453), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 110/349 (31%), Positives = 177/349 (50%), Gaps = 30/349 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGK 118
P+I+ ++ADD+GW DVG+ G + TPNIDALA G+ + Y + + PSR+ I+TG
Sbjct: 32 PNIVLVIADDMGWGDVGYQGAVDVSTPNIDALARRGVQFSQGYVSCSISGPSRAGILTGV 91
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G +N+ + +P + L + +++ GY T VGKWH+ E +P RGF
Sbjct: 92 YQQRFGFYNNLHPWAK---IPEGQSTLGEMVRDCGYATGFVGKWHMA-DSPEQSPNRRGF 147
Query: 179 ESHLGYWTGHQDYFDHS----AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+ G+W+ DY+ + E L +++P G+Y TD FT EAV+ I
Sbjct: 148 DQFYGFWSDTHDYYRSTDKPGVELYDFCPLYRNGEIQPPLHESGEYITDCFTREAVEFID 207
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI--HRHIEDFKRSKFAAILHKLDESV 292
H++ P L L++ A HS P Q P+HY+N R R FAA++ LD+ +
Sbjct: 208 KHAS-SPFLLCLSYNAVHS-----PWQVPEHYVNRLEGRRFHHEDRKVFAAMVLALDDGI 261
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGG-------AAAGF------NLNAASNWPLRGVKNT 339
G+V+E+L + + N++ + +SDNG + G+ N +S P RG K
Sbjct: 262 GRVMESLRKNGLEENTLFILISDNGSPRGQGIECSTGYEYKDRGNTTMSSPGPFRGYKAD 321
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
+EGG+R ++ P +G+V + V D PT++ A + Y
Sbjct: 322 TYEGGIRVPYIMSWPSELPQGMVYDNPVISLDIFPTVMQAVGGTSRQKY 370
>gi|4503899|ref|NP_000503.1| N-acetylgalactosamine-6-sulfatase precursor [Homo sapiens]
gi|462148|sp|P34059.1|GALNS_HUMAN RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|618426|gb|AAC51350.1| N-acetylgalactosamine 6-sulphatase [Homo sapiens]
gi|870751|dbj|BAA04535.1| N-acetylgalactosamine 6-sulfate sulfatase [Homo sapiens]
gi|33440495|gb|AAH56151.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
gi|37589093|gb|AAH50684.2| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
gi|119587163|gb|EAW66759.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
mucopolysaccharidosis type IVA), isoform CRA_c [Homo
sapiens]
Length = 522
Score = 179 bits (453), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 113/362 (31%), Positives = 179/362 (49%), Gaps = 42/362 (11%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ N+
Sbjct: 13 LLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNF 72
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELG 153
Y+ LC+PSR+A++TG+ PI G + GG+P SE++LP+ LK+ G
Sbjct: 73 YSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG 132
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + RD
Sbjct: 133 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD---- 183
Query: 214 WDLHGKYS--------------TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
W++ G+Y T ++ EA+D I + P FLY A ATH+ P
Sbjct: 184 WEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----P 238
Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
+ A +L +R ++ + ++D+S+GK++E L+ + N+ + F SDNG A
Sbjct: 239 VYASKPFLGTS------QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAA 292
Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
SN P K T +EGG+R L W P + G V+ Q + D T L+
Sbjct: 293 LISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLAL 352
Query: 380 AN 381
A
Sbjct: 353 AG 354
>gi|323448512|gb|EGB04410.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 572
Score = 179 bits (453), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 119/362 (32%), Positives = 172/362 (47%), Gaps = 48/362 (13%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFH-------GLDQIPTPNIDALAYSGIILKNYYTVQLC 107
++ PPHII I+ADDLGW+ V +H D TP +DAL +G+ L+ +Y Q C
Sbjct: 31 GAAPPPHIIHIVADDLGWSYVDWHRDPRTADPPDTPLTPRLDALRRAGVTLERHYAFQYC 90
Query: 108 TPSRSAIMTGKHPIHTGMQHNV---------LYGCERGGLPLSEKILPQYLKELGYRTRI 158
+PSRSA+++G++PI +Q+ V + G + G+P + L + L+ GYRT
Sbjct: 91 SPSRSALLSGRNPISVNVQNVVPEVSNPEDPVGGWQ--GIPTAMTGLAEVLRRRGYRTEA 148
Query: 159 VGKWHLGFY-KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK------MWGLDMRRDLE 211
VGKW +G + P RGF++ GYW DY+ + E +W + D
Sbjct: 149 VGKWDVGMATAAHHHPRARGFDAWTGYWHHANDYWTFAEETCDGEAVQDLWHYNATFDGP 208
Query: 212 PAWDLHGK----------------YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSAN 255
W +G Y + AV + H PLFLY A H
Sbjct: 209 ATWLRNGPSCTQGNQTHTAASPCVYEEQLLADHAVRAVEEHDDTAPLFLYWALHLVHM-- 266
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
PLQ PD YL I D R A++ D+ VG++V+ALE+R ML+N+++ F SD
Sbjct: 267 ---PLQVPDAYLARFDGIPDEPRRFMTAMVRYADDEVGRLVDALERRGMLANAVVTFHSD 323
Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWL 373
NGG G +NWPL G K + +EGG+R + + RG Q + DW
Sbjct: 324 NGGEIMGAGACGGNNWPLTGGKFSNFEGGIRVNAFVSGGAVPEARRGAALSQLSTLWDWY 383
Query: 374 PT 375
T
Sbjct: 384 AT 385
>gi|189069200|dbj|BAG35538.1| unnamed protein product [Homo sapiens]
Length = 522
Score = 179 bits (453), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 113/362 (31%), Positives = 179/362 (49%), Gaps = 42/362 (11%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ N+
Sbjct: 13 LLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAGGLLFPNF 72
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELG 153
Y+ LC+PSR+A++TG+ PI G + GG+P SE++LP+ LK+ G
Sbjct: 73 YSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG 132
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + RD
Sbjct: 133 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD---- 183
Query: 214 WDLHGKYS--------------TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
W++ G+Y T ++ EA+D I + P FLY A ATH+ P
Sbjct: 184 WEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----P 238
Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
+ A +L +R ++ + ++D+S+GK++E L+ + N+ + F SDNG A
Sbjct: 239 VYASKPFLGTS------QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAA 292
Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
SN P K T +EGG+R L W P + G V+ Q + D T L+
Sbjct: 293 LISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLAL 352
Query: 380 AN 381
A
Sbjct: 353 AG 354
>gi|149196020|ref|ZP_01873076.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149140867|gb|EDM29264.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 462
Score = 179 bits (453), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 117/371 (31%), Positives = 189/371 (50%), Gaps = 29/371 (7%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L F LS+ +V +S P++I I+ADDLG+ DVGF+G IPTP ID++A +G+ +
Sbjct: 5 LLFILSLGLSLIVQASSKPNLIVIMADDLGYADVGFNGCKDIPTPGIDSIANNGVKFSSG 64
Query: 102 YT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCE--RGGLPLSEKILPQYLKELGYRTRI 158
YT +C PSR+ +TG++ G + N + LP SE + + L+++GY I
Sbjct: 65 YTSYSVCGPSRAGFITGRYQQRFGFERNPQWNLTDPNSALPKSEMTIAESLQQVGYHCGI 124
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF---------DHSAEEMKMWGLDMRRD 209
+GKWHLG + P RGF G+ G YF + EM + + R+
Sbjct: 125 IGKWHLG-AEPSLRPNQRGFNEFFGHLGGGHAYFPEKLRIIKTEDVKNEMDSYASYITRN 183
Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
P KY TD F+ EA+ + + ++P FL+L++ A H+ PLQA YL+
Sbjct: 184 DTPVKTT--KYLTDEFSDEAIRFVEKN-YEQPFFLFLSYNAPHT-----PLQATQKYLDR 235
Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
HIED R + A++ +D+ + ++++ L+ ++ ++I+ F+SDNGG N +
Sbjct: 236 FPHIEDQNRRTYCAMVSAMDDGIVRLMKKLDALKIKDDTIVFFLSDNGGPTTK---NNSD 292
Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYV 389
N LRG K ++EGG R + P G + V D T+ S A P +
Sbjct: 293 NGDLRGKKGDVFEGGFRVPFAMQYPRELKPGQTYDLPVSSLDIFATIASLAQS---PTHA 349
Query: 390 NSTVE--NIIP 398
+ ++ N++P
Sbjct: 350 DKPLDGVNLLP 360
>gi|421613760|ref|ZP_16054831.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408495346|gb|EKJ99933.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 485
Score = 179 bits (453), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 121/375 (32%), Positives = 190/375 (50%), Gaps = 58/375 (15%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+I+ I +DD G+ND+G G + I TP +D LA G L N+Y CTPSR++++TG+
Sbjct: 30 PNILLIASDDQGYNDLGQLG-NGIITPALDRLAKEGTRLTNFYVAWPACTPSRASLLTGR 88
Query: 119 HPIHTGMQHNVL-----YG-----------CER-GGLPLSEKILPQYLKELGYRTRIVGK 161
+P G+ + YG ER GG+ E ILP L+ GY++ I GK
Sbjct: 89 YPQRNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAMLRPAGYKSGIYGK 148
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD-MRRDLEPAWDLHGKY 220
W LG ++ PT RGF+ G+ DYF H + +G+ M R+LEP G Y
Sbjct: 149 WDLGALQR-MLPTSRGFDDFYGFVNTGIDYFTH-----ERYGVPCMVRNLEPTEADKGTY 202
Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP-----LQAPDHYLNIHRHIE- 274
T +F EA+ + H+ +EP FLY+ A H+++ +P +QAPD + ++ +E
Sbjct: 203 CTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLDPTIRSSVQAPDQFKAMYPPVEV 262
Query: 275 -------------------DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
+R + A + +D ++G++++ LE ++ML +++VF SD
Sbjct: 263 ETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEMLDRLEAKQMLDETMVVFFSD 322
Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLP 374
NGG+ A N PLRG K WEGG+R L+ W G+V ++++ + LP
Sbjct: 323 NGGSG------GADNSPLRGHKAQTWEGGIRVPCLVRWPAGKIPAGVVNDEFLTSLELLP 376
Query: 375 TLLSAANKSDIPNYV 389
+ +AA P V
Sbjct: 377 SFAAAAGVEPPPGVV 391
>gi|149198444|ref|ZP_01875489.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149138450|gb|EDM26858.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 458
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 109/316 (34%), Positives = 176/316 (55%), Gaps = 24/316 (7%)
Query: 47 SMVFVDLVASS---GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYY 102
+ +F+ L+ + P++I IL DD G+ DVGF+G IPTP++D++A +G+ + +
Sbjct: 4 TFIFLYLILMNSIIAGPNLIIILTDDQGYQDVGFNGCKDIPTPHLDSIAQNGVNCIDAHV 63
Query: 103 TVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER--GGLPLSEKILPQYLKELGYRTRIVG 160
+ +C PSR+ ++TG++ G N E GLPL EK + + LKE+GY + I+G
Sbjct: 64 SYPVCGPSRAGLLTGRYQDRFGFTTNPTVNPENPIAGLPLEEKNIAEVLKEVGYSSSIIG 123
Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS------AEEMKMWG---LDMRRDLE 211
KWH+G + + P RGF+ G+ +G DYF +E ++W + RD E
Sbjct: 124 KWHMGTHPIHH-PLNRGFDHFFGFLSGGHDYFPAKYNLKDLSEVKRIWDWYRTHLIRDRE 182
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
G Y TD+ T AVD I ++++ P LYL++ A H+ PLQA + YL
Sbjct: 183 RIQVSEG-YLTDILTDAAVDFIDKKASEKKPFMLYLSYNAPHT-----PLQASEKYLKRF 236
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
HI+D KR +AA++ +D+ VG+V+ L++ ++ ++++ F+SDNGG G + N
Sbjct: 237 THIKDSKRRTYAAMVSAVDDGVGRVMARLKKHKIEEDTLVFFLSDNGGPLKG-PMPFTDN 295
Query: 331 WPLRGVKNTLWEGGVR 346
P K +L EGG R
Sbjct: 296 GPFSKGKGSLHEGGTR 311
>gi|383114003|ref|ZP_09934770.1| hypothetical protein BSGG_4804 [Bacteroides sp. D2]
gi|382948682|gb|EFS34104.2| hypothetical protein BSGG_4804 [Bacteroides sp. D2]
Length = 496
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 120/335 (35%), Positives = 182/335 (54%), Gaps = 31/335 (9%)
Query: 60 PHIIFILADDLGWNDVG-FHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTG 117
P+I+ ILADD G+ V + + +I TPNID LA SG+ ++ Y + L +P+R+ +MTG
Sbjct: 37 PNIVIILADDQGYGGVNCYPHIKKIVTPNIDKLAASGVQCMQGYTSGHLSSPTRAGLMTG 96
Query: 118 KHPIHTGMQHNVLYGCER---GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
K+ Q YG GG+P +K+L +YL E GY T +GKWHLG Y + + P
Sbjct: 97 KY-----QQSFGFYGLSTPHVGGIPQDQKLLSEYLVENGYNTACIGKWHLGDYIRSH-PN 150
Query: 175 FRGFESHLGYWTGHQDYFD---HSAEEMKMWGLDMRRD-LEPAWDLHGKYSTDVFTAEAV 230
RGF++ G+ G DY+D + + GL D +EP ++ +YST +T AV
Sbjct: 151 NRGFQTFFGFINGLHDYYDPLVGGSWDGVYNGLAFTLDNMEPVTEM--EYSTYEYTKRAV 208
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDFKRSKFA-AILHKL 288
D I ++ D P FLYL + A HS PLQAP+ + + + ++ + A A+ L
Sbjct: 209 DFIQKNA-DHPFFLYLPYNAIHS-----PLQAPEELIGELAINPQEIGKDDIARAMTFAL 262
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
D+ VGKVVE LEQ + N+II ++SDNG + + W RG K + +EGG+R
Sbjct: 263 DQGVGKVVETLEQLGLRDNTIIFYLSDNGA------VEYSDKWEFRGRKGSYYEGGIRVP 316
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
++ P ++G + + V D PT++ A S
Sbjct: 317 FIVSYPAKLAKGTIYNKPVMSIDIAPTVMELAGLS 351
>gi|414068777|ref|ZP_11404774.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410808616|gb|EKS14585.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 480
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 119/338 (35%), Positives = 173/338 (51%), Gaps = 22/338 (6%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
AS P+II I+ADDLG+ DVGF+G I TPNID LA SG + Y C PSR+A
Sbjct: 25 ASEQKPNIILIVADDLGYADVGFNGSKDIITPNIDDLAKSGTSFSDAYVAHPFCGPSRAA 84
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+MTG++P G Q N+ G+P K + + L E Y T +GKWH+G +Y P
Sbjct: 85 LMTGRYPHKIGSQFNLPTRGSNVGVPTDAKFISKLLNENNYFTGALGKWHMG-DAPQYHP 143
Query: 174 TFRGFESHLGYWTGHQDYF----DHSAEEMKMWGLDMRRDLEPAWDLHGK------YSTD 223
RGF+ + G+ G +YF ++ + GL + + +GK Y TD
Sbjct: 144 NKRGFDEYYGFLGGGHNYFPDQYQPQYKKQQAQGLKNIFEYITPLEHNGKEVKETQYITD 203
Query: 224 VFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
+ EAV+ + + P FLYLA+ A P+ PLQA D + + +I++ R +A
Sbjct: 204 ALSREAVNFVDKAVNKKNPFFLYLAYNA-----PHVPLQAKDEDMAMFPNIKNKDRKTYA 258
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
+++ +D VGK+VE L++ N++IVF+SDNGG + A+N+PL+ K + E
Sbjct: 259 GMVYAVDRGVGKLVETLKKNNQYDNTLIVFMSDNGGKLS----KGANNFPLKAGKGSTQE 314
Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
GG R L P G V D PT + A
Sbjct: 315 GGFRVPMLFHWPKHVPAGKRFSHPVSALDLYPTFAALA 352
>gi|410446790|ref|ZP_11300893.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
gi|409980462|gb|EKO37213.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
Length = 540
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 126/364 (34%), Positives = 187/364 (51%), Gaps = 38/364 (10%)
Query: 44 FTLSMV-FVDLV-ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
F +S+V F L A P+II L DDLGW D+ G I TP ID+L G+ L +
Sbjct: 6 FIVSLVTFCALANAEEKRPNIIIFLVDDLGWADISLRGA-PIDTPAIDSLFSEGLTLDRF 64
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
YT +C+P+R+A+MTG+ P+ G+ ++V+ G+ E +P+ K GY+T +VGK
Sbjct: 65 YTTPICSPTRAALMTGRDPLRLGISYSVVMPWMNNGVHPDEHFMPESFKAAGYQTAMVGK 124
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
WHLG ++ + P RGF+ G+ YF A + G+D +R+ D Y
Sbjct: 125 WHLGHSQEIFHPNARGFDDFYGHLHTEVGYFLPFANQG---GVDFQRNGVTIAD--EGYE 179
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK---- 277
T + EA I D+P FLY+ A HS PL+APD + + ++ED +
Sbjct: 180 TFLLADEASRWIKARDKDKPFFLYMPFIAPHS-----PLEAPDDLVKKYENLEDTRELTR 234
Query: 278 -----------------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
R +AA++ LD+++G+V++ L + + +II+F SDNGGAA
Sbjct: 235 SAAIDRTRTLNGFSPSARPMYAAVVDGLDQAIGQVLDTLTKEGIEEETIILFSSDNGGAA 294
Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVR-GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
A N+PLRG K +EGG+R A + W L++ G E + V D PTL SA
Sbjct: 295 Y--AGGGADNFPLRGGKGDTYEGGIRVVAAMRWKGTLDA-GKSFESIMTVMDVFPTLASA 351
Query: 380 ANKS 383
A S
Sbjct: 352 AGVS 355
>gi|260816809|ref|XP_002603280.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
gi|229288598|gb|EEN59291.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
Length = 377
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 104/242 (42%), Positives = 136/242 (56%), Gaps = 13/242 (5%)
Query: 38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
VL LA + PHI+ I+ADDLGW+DV ++ + TPN+ LA +G+I
Sbjct: 3 VVLFLAIVGCFFGHGTAQDTAKPHILLIVADDLGWSDVSWNN-PYVVTPNLHTLATTGVI 61
Query: 98 LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
Y C+PSR+A++TGK P GMQ V+ + GLPL E++LPQ LK+LGY T
Sbjct: 62 FNQTYAQPTCSPSRTALLTGKFPFRLGMQR-VMDSKKPHGLPLDEELLPQKLKKLGYATH 120
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL 216
+VGKWHLG K EYTPT RGF+S GY G QDY+ H SA + W D + + D
Sbjct: 121 MVGKWHLGSCKWEYTPTERGFDSFYGYHHGSQDYYTHKSARGLDFW--DGKTSIS---DQ 175
Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
+G YST+ F A +II H + PLFLYL + H+ P Q P YL I+D
Sbjct: 176 NGVYSTESFATRAENIISQHDPNTPLFLYLPFQSVHT-----PHQVPSSYLQTFSTIQDD 230
Query: 277 KR 278
R
Sbjct: 231 NR 232
>gi|315644664|ref|ZP_07897795.1| sulfatase [Paenibacillus vortex V453]
gi|315279923|gb|EFU43222.1| sulfatase [Paenibacillus vortex V453]
Length = 439
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 183/338 (54%), Gaps = 26/338 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+ + + DDLG+ D+G +G D + TP++D LA G+ N+Y+ +C+PSR++++TGK
Sbjct: 4 PNFVIVYCDDLGYGDLGCYGSDSVRTPHLDGLADEGVRFTNWYSNSPVCSPSRASLLTGK 63
Query: 119 HPIHTGMQHNVLYGCERG--GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+P+ G+ + G +RG GLP +E L + LK GYRT + GKWHLG KE +P
Sbjct: 64 YPVRAGVGE--ILGAKRGSHGLPAAEVTLAKALKPAGYRTALYGKWHLGL-SKETSPNAH 120
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL----EPAWDLHGKYSTDVFTAEAVDI 232
GF+ G+ G D++ H + G++ DL W+ +G+Y T++ T +VD
Sbjct: 121 GFDEFFGFKAGCVDFYSHIFYWGQGHGVNPLHDLWENETEVWE-NGRYMTELITERSVDF 179
Query: 233 I-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I + + P FL+ ++ A P+ P+ AP Y++ H+ + R AA++ +D+
Sbjct: 180 IKRSREQEAPFFLFASYNA-----PHYPMHAPQEYMDRFAHLP-WDRQVMAAMIAAVDDG 233
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--------LNAASNWPLRGVKNTLWEG 343
VG++VEAL++ +++I F SDNG ++ N S RG K +L+EG
Sbjct: 234 VGEIVEALKEAGCYEDTVIFFSSDNGPSSESRNWLDGTEDIYYGGSAGIFRGHKASLFEG 293
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
G+R ++ P G V ++ + D +PT L A
Sbjct: 294 GIREPAILNWPNGWEGGQVRDEVAAMMDIVPTFLDLAG 331
>gi|313216787|emb|CBY38029.1| unnamed protein product [Oikopleura dioica]
Length = 383
Score = 178 bits (451), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 88/208 (42%), Positives = 128/208 (61%), Gaps = 11/208 (5%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PH +FILADD+G++D+G+H +I TP +D LA +G+ L+ YY +CTP+R +MTG++
Sbjct: 176 PHFLFILADDMGYHDIGYHQA-EILTPFMDKLATTGVRLEQYYVQPVCTPTRVQLMTGRY 234
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
I GMQH V+ + G+PL EK+LP+ L++ GY T ++GKWHLG + ++Y P RGF+
Sbjct: 235 QIRYGMQHGVVRPPQPDGVPLDEKLLPEALRKCGYNTEMIGKWHLGMFTEDYLPQNRGFD 294
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL--EPA---WDLHGKYSTDVFTAEAVDIIH 234
+G++TG QD++ H+ M G D R +P WDL+ YST VF E +
Sbjct: 295 HFMGFYTGSQDFYSHNKCFSGMCGYDFREATAGQPEVIRWDLNNTYSTGVFADELEKRLS 354
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQA 262
+ EP F YL+ A HS PLQ
Sbjct: 355 KMNPSEPSFTYLSFQAVHS-----PLQG 377
>gi|410215590|gb|JAA05014.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
gi|410254514|gb|JAA15224.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
gi|410288780|gb|JAA22990.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
gi|410330541|gb|JAA34217.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
Length = 522
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 113/362 (31%), Positives = 179/362 (49%), Gaps = 42/362 (11%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ N+
Sbjct: 13 LLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNF 72
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELG 153
Y+ LC+PSR+A++TG+ PI G + GG+P SE++LP+ LK+ G
Sbjct: 73 YSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG 132
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + RD
Sbjct: 133 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD---- 183
Query: 214 WDLHGKYS--------------TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
W++ G+Y T ++ EA+D I + P FLY A ATH+ P
Sbjct: 184 WEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----P 238
Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
+ A +L +R ++ + ++D+S+GK++E L+ + N+ + F SDNG A
Sbjct: 239 VYASKPFLGTS------QRGRYGDAVREIDDSIGKMLELLQDLHVADNTFVFFTSDNGAA 292
Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
SN P K T +EGG+R L W P + G V+ Q + D T L+
Sbjct: 293 LISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVAAGQVSHQLGSIMDLFTTSLAL 352
Query: 380 AN 381
A
Sbjct: 353 AG 354
>gi|380796101|gb|AFE69926.1| N-acetylgalactosamine-6-sulfatase precursor, partial [Macaca
mulatta]
Length = 503
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 112/345 (32%), Positives = 174/345 (50%), Gaps = 42/345 (12%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
PP+I+ +L DD+GW D+G +G TPN+D +A G++ N+Y+ LC+PSR+A++TG
Sbjct: 11 PPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTG 70
Query: 118 KHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
+ PI G + GG+P SE++LP+ LK+ GY ++IVGKWHLG ++ +
Sbjct: 71 RLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQ 129
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
+ P GF+ G H +D+ A + + RD W++ G+Y
Sbjct: 130 FHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKT 181
Query: 222 -----TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
T ++ EA+D I + P FLY A ATH+ P+ A +L
Sbjct: 182 GEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----PVYASKPFLGTS------ 230
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
+R ++ + ++D+SVGK++E L + N+ + F SDNG A A SN P
Sbjct: 231 QRGRYGDAVREIDDSVGKMLELLHDLHVADNTFVFFTSDNGAALISAPEQAGSNGPFLCG 290
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K T +EGG+R L W P + G V+ Q + D T L+ A
Sbjct: 291 KQTTFEGGMREPALAWWPGHIAAGQVSHQLGSIMDLFTTSLALAG 335
>gi|348550278|ref|XP_003460959.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cavia porcellus]
Length = 502
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 114/346 (32%), Positives = 173/346 (50%), Gaps = 43/346 (12%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++Y+ LC+PSR+A++TG
Sbjct: 9 PPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTG 68
Query: 118 KHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
+ PI G N E GG+P SE++LPQ LKE GY T+IVGKWHLG ++ +
Sbjct: 69 RLPIRNGFYTTNGHARNAYTPQEIVGGIPDSERLLPQLLKEAGYATKIVGKWHLG-HRPQ 127
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
+ P GF+ G H +D+ A + + R+ WD+ G++
Sbjct: 128 FHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRN----WDMVGRFYEEFPINVKT 179
Query: 222 -----TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
T ++ EA+D I + P FLY A ATH+ P+ A +L I
Sbjct: 180 GESNLTQIYLQEALDFIRQQQAAQHPFFLYWAVDATHA-----PVYASRPFLGIS----- 229
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
+R ++ + ++D+SVGK++ L + NS + F SDNG A SN P
Sbjct: 230 -QRGRYGDAVREIDDSVGKILSLLRTLSIAENSFVFFTSDNGAALISAPSQGGSNGPFLC 288
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K T +EGG+R + W P G V+ Q + D T L+ A
Sbjct: 289 GKQTTFEGGMREPAIAWWPKHIPAGQVSHQLGSIMDLFTTSLALAG 334
>gi|221068581|ref|ZP_03544686.1| sulfatase [Comamonas testosteroni KF-1]
gi|220713604|gb|EED68972.1| sulfatase [Comamonas testosteroni KF-1]
Length = 440
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 119/350 (34%), Positives = 176/350 (50%), Gaps = 29/350 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I+FILADDLGW D+G +G TP++D LA G+ Y +C+ +R A++TG+
Sbjct: 4 PNIVFILADDLGWADLGVYGATDFKTPHLDHLASQGVRFNQAYANSAVCSATRIALITGR 63
Query: 119 HP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+ + G++ + + GLP LP LK+ GY T ++GKWHLG Y P
Sbjct: 64 YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPAYGPQRS 122
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
G++ G +G DYF H + + D+ + EP Y T + EA +H
Sbjct: 123 GYDYFFGNHSGAIDYFTHKPGVGEQFSPDLWQGNEPV--ERTGYYTYILGDEATRYVHER 180
Query: 237 -STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF------KRSKFAAILHKLD 289
D+P FL L H P+ P + P ++ R I+D K+ I+ LD
Sbjct: 181 KDQDKPFFLSL-----HFTAPHWPWEGPGDE-HVARSIKDLFHYDGGSLKKYGEIVEALD 234
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
++VG+V++AL+ N+I++F SDNGG + WP G K L EGG+R
Sbjct: 235 KAVGQVLQALDDTGQADNTIVIFTSDNGGE------RFSKTWPFTGQKTELLEGGIRVPT 288
Query: 350 LI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
L+ WS + R V EQ DWLPTLL+AA P++ S +NI+P
Sbjct: 289 LLRWSARI--RPQVQEQVTASFDWLPTLLAAAGARPHPDF-PSDGQNILP 335
>gi|329928435|ref|ZP_08282305.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
gi|328937871|gb|EGG34277.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
Length = 443
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 113/338 (33%), Positives = 180/338 (53%), Gaps = 26/338 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+ I I DDLG+ D+G +G D + TP++D LA GI N+Y+ +C+PSR++++TGK
Sbjct: 8 PNFIVIYCDDLGYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLTGK 67
Query: 119 HPIHTGMQHNVLYGCERG--GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+P G+ + G +RG GLP E L + LK GYRT + GKWHLG +E +P
Sbjct: 68 YPARAGVGE--ILGAKRGSHGLPADEVTLAKALKPAGYRTALFGKWHLGL-SEETSPNAH 124
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL----EPAWDLHGKYSTDVFTAEAVDI 232
GF+ G+ G D++ H + G++ DL W+ +G+Y T++ T +VD
Sbjct: 125 GFDEFFGFKAGCVDFYSHIFYWGQAHGVNPLHDLWENETEVWE-NGRYMTELITERSVDF 183
Query: 233 IHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I E P FL+ ++ A P+ P+ AP Y++ H+ + R AA++ +D+
Sbjct: 184 IQRSREQEAPFFLFASYNA-----PHYPMHAPQEYMDRFAHLP-WDRQVMAAMIAAVDDG 237
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--------LNAASNWPLRGVKNTLWEG 343
VG++V+AL++ +++I F SDNG ++ N S RG K +L+EG
Sbjct: 238 VGEIVKALKEAGCYEDTVIFFSSDNGPSSESRNWLDGTEDVYYGGSAGIFRGHKASLFEG 297
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
G+R ++ P G V ++ V + D PT L A
Sbjct: 298 GIREPAILSWPNGWEGGQVRDEVVAMMDIAPTFLDLAG 335
>gi|431907857|gb|ELK11464.1| Arylsulfatase B [Pteropus alecto]
Length = 274
Score = 177 bits (449), Expect = 2e-41, Method: Composition-based stats.
Identities = 88/209 (42%), Positives = 127/209 (60%), Gaps = 13/209 (6%)
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
AI+T K IHTG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE
Sbjct: 16 AILTLK--IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECL 73
Query: 173 PTFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT 226
PT RGF+++ GY G +DY+ H +A + LD R E A YST++FT
Sbjct: 74 PTRRGFDTYFGYLLGSEDYYSHERCTLINALNVTRCALDFRDGEEVATGYENMYSTNIFT 133
Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
A +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 134 QRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVS 188
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSD 315
+DE+VG V AL+ + +N++ +F ++
Sbjct: 189 LMDEAVGNVTAALKSHGLWNNTVFIFSTE 217
Score = 43.1 bits (100), Expect = 0.70, Method: Composition-based stats.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 26 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 61
>gi|323456816|gb|EGB12682.1| hypothetical protein AURANDRAFT_60668 [Aureococcus anophagefferens]
Length = 534
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 121/358 (33%), Positives = 169/358 (47%), Gaps = 46/358 (12%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PH++F+L DD+G+ DVG+ D I +P +D LA G+ L +Y C PSR+A++TG +
Sbjct: 27 PHLLFVLGDDVGFGDVGYSSPDVI-SPTLDRLAAEGLKLGRHYAYMWCAPSRAALLTGYY 85
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKE-LGYRTRIVGKWHLGFYKKEYTPTFRGF 178
P TG+ + LPL +LP L++ GYRT VGKWHLGF + P RGF
Sbjct: 86 PSTTGVYST---SGAQNALPLEFALLPGLLRDRAGYRTAAVGKWHLGFMSEADLPERRGF 142
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL------EPAWD---LHGKYSTDVFTAEA 229
+ G+ G +D++ G D DL PA D G+YS +++ A
Sbjct: 143 DGFFGFLDGGEDHYSRVGAGAP--GCDRVFDLWDSRRRGPATDDPSAFGRYSAELYGEAA 200
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH------------------- 270
D I H EPLFLY A HS P + D Y +
Sbjct: 201 ADAIRGHDAAEPLFLYAAFQVAHSPLEVPPAKYFDAYADAALCANASTKGFACARPGADA 260
Query: 271 -----RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
H R AA++ LD +V + AL + M ++ ++V+ DNGG +
Sbjct: 261 PYRAPNHDCGCDRLVVAAMVTALDAAVANMTTALASKNM-ADFVLVYSGDNGGP----EI 315
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
N LRG K T +EGGVR A + SPLL RG +H+ DWLPT L+ A +
Sbjct: 316 EGHWNGGLRGGKWTWFEGGVRPAAFVHSPLLTRRGW-HNGTLHLVDWLPTFLALAGAA 372
>gi|261404208|ref|YP_003240449.1| sulfatase [Paenibacillus sp. Y412MC10]
gi|261280671|gb|ACX62642.1| sulfatase [Paenibacillus sp. Y412MC10]
Length = 452
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 113/338 (33%), Positives = 179/338 (52%), Gaps = 26/338 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+ I I DDLG+ D+G +G D + TP++D LA GI N+Y+ +C+PSR++++TGK
Sbjct: 17 PNFIVIYCDDLGYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLTGK 76
Query: 119 HPIHTGMQHNVLYGCERG--GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+P G+ + G +RG GLP E L + LK GYRT + GKWHLG +E +P
Sbjct: 77 YPARAGVGE--ILGAKRGSHGLPADEVTLAKALKPAGYRTALYGKWHLGL-SEETSPNAH 133
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL----EPAWDLHGKYSTDVFTAEAVDI 232
GF+ G+ G D++ H + G++ DL W+ +G+Y T++ T +VD
Sbjct: 134 GFDEFFGFKAGCVDFYSHIFYWGQAHGVNPLHDLWENETEVWE-NGRYMTELITERSVDF 192
Query: 233 IHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I E P FL+ ++ A P+ P+ AP Y++ H+ + R AA++ +D+
Sbjct: 193 IQRSREQEAPFFLFASYNA-----PHYPMHAPQKYMDRFAHLP-WDRQVMAAMIAAVDDG 246
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--------LNAASNWPLRGVKNTLWEG 343
VGK+V+AL++ +++I F SDNG ++ N S RG K +L+EG
Sbjct: 247 VGKIVKALKEAGCYEDTVIFFSSDNGPSSESRNWLDGTEDVYYGGSAGIFRGHKASLFEG 306
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
G+R ++ P G V ++ + D PT L A
Sbjct: 307 GIREPAILSWPNGWEGGQVRDEVAAMMDLAPTFLDLAG 344
>gi|340373733|ref|XP_003385394.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 389
Score = 177 bits (449), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 117/348 (33%), Positives = 182/348 (52%), Gaps = 25/348 (7%)
Query: 38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY-SGI 96
A+ L F LS + + P+++F+L DD G+ DV F I +P+ ++LA G+
Sbjct: 2 ALFKLVF-LSFFAIAAATVNAKPNLVFVLVDDWGFADVSFRN-PAISSPHFESLATKEGL 59
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
IL +Y + C+PSR++ +TG+ P H + G G ++ ++P LK GY+T
Sbjct: 60 ILDRHYVFKYCSPSRASFLTGRFPHHAHQWNPPQSGLV--GANINMTMIPAKLKTAGYKT 117
Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL 216
+VGKWH GFY K++ P RGF++ G+ G + DH +E+ + +D ++ A D
Sbjct: 118 HMVGKWHEGFYLKKFLPINRGFDTMSGFLGGGE---DHMTQEI-ICAVDYWKN--DAIDT 171
Query: 217 -HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH-RHIE 274
+G Y + + I+ NH +P FLYL H+ PLQAPD +LN++ +
Sbjct: 172 RNGTYDDYTYVEDLSKILTNHDPSDPFFLYLPLHNVHA-----PLQAPDEFLNLYPVNST 226
Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLR 334
R A++ D G VV+ L+ + M N++IV +DNGGA SN+PL+
Sbjct: 227 CSDRRTIQAMVSVADNVTGFVVKMLKDKGMWDNTLIVVSADNGGAPC-----EGSNYPLK 281
Query: 335 GVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAA 380
G K T +EGGVR + LL S RG E ++H++DW T A
Sbjct: 282 GSKMTFYEGGVRSLAFVSGGLLPSDRRGKKTEGFIHIADWYTTFCQLA 329
>gi|390361685|ref|XP_789654.2| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like
[Strongylocentrotus purpuratus]
Length = 401
Score = 176 bits (447), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 100/248 (40%), Positives = 140/248 (56%), Gaps = 20/248 (8%)
Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-------SAEEMKM 201
L++ GY T VGKWH+G YK PT RGF+S G TG ++ H + K
Sbjct: 5 LRKAGYATHAVGKWHMGLYKTCCWPTNRGFDSFFGLLTGRGGFYTHKHYGGHPGLVDSKN 64
Query: 202 W-GLDMRRDLEP-AWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
W G D+R +LE A D G YST +FT ++ +II H+ +PLFLY + A H P
Sbjct: 65 WSGYDLRDNLEQVAQDYQGVYSTHLFTQKSQNIIRRHNRSKPLFLYHSFQAVH-----YP 119
Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
L+ P Y+ +I D +R +A ++ +DE+VG + L++ + +N+II+F SDNG
Sbjct: 120 LEVPPRYMEDFNYIADERRRTYAGMVKCMDEAVGNLTRTLKKTGLWNNTIIIFSSDNG-- 177
Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLL 377
A FN SNWPLRG+K +LWEGG++ G I SPLL RG V HV+DW PTL+
Sbjct: 178 -ANFNY-GGSNWPLRGMKRSLWEGGIKSVGFIASPLLPKLVRGTVNNNLFHVTDWFPTLV 235
Query: 378 SAANKSDI 385
+ +
Sbjct: 236 RGVARGSL 243
>gi|354581367|ref|ZP_09000271.1| sulfatase [Paenibacillus lactis 154]
gi|353201695|gb|EHB67148.1| sulfatase [Paenibacillus lactis 154]
Length = 446
Score = 176 bits (446), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 112/338 (33%), Positives = 184/338 (54%), Gaps = 26/338 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+ I I DDLG+ D+G +G D + TPN+D+LA GI N+Y+ +C+PSR++++TGK
Sbjct: 11 PNFIDIYCDDLGYGDLGCYGSDTVTTPNLDSLAGEGIRFTNWYSNSPVCSPSRASLLTGK 70
Query: 119 HPIHTGMQHNVLYGCERG--GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+P G+ + G +RG GLP +E L + LK GYRT + GKWHLG +E +P
Sbjct: 71 YPARAGVGE--ILGAKRGLDGLPSTEVTLAKALKPAGYRTALYGKWHLGV-SEETSPNAH 127
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL----EPAWDLHGKYSTDVFTAEAVDI 232
GF+ G+ G D++ H + G++ DL W+ +G+Y T++ T +VD
Sbjct: 128 GFDEFFGFKAGCIDFYSHIFYWGQGHGVNPLHDLWENETEVWE-NGRYMTELITERSVDF 186
Query: 233 I-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I + ++P FL++++ A P+ P+ AP Y++ H+ + R AA++ +D+
Sbjct: 187 IKRSREQEDPFFLFVSYNA-----PHYPMHAPKEYMDRFAHLP-WDRQVMAAMIAAVDDG 240
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--------LNAASNWPLRGVKNTLWEG 343
VG++V+AL+ N++I F SDNG ++ N + RG K +L+EG
Sbjct: 241 VGEIVKALKAAGCYENTVIFFSSDNGPSSESRNWLDGTEDVYYGGTAGIFRGHKASLFEG 300
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
G+R ++ P G V ++ + D +PT L A
Sbjct: 301 GIREPAILSWPNGLEGGQVRDEVGVMMDIVPTFLDLAG 338
>gi|313236789|emb|CBY12041.1| unnamed protein product [Oikopleura dioica]
gi|313242643|emb|CBY39450.1| unnamed protein product [Oikopleura dioica]
Length = 622
Score = 176 bits (446), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 117/384 (30%), Positives = 189/384 (49%), Gaps = 62/384 (16%)
Query: 40 LPLAFTLSMVFVDLVASSGP-----PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
L L F + VF+ + + P+I++IL DDLGW DVG++ TP +D L +
Sbjct: 4 LCLCFAIVAVFLKRESFAQETEDLRPNIVYILVDDLGWADVGWNNKGLESTPFMDKLVKN 63
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
G Y+ C+PSR+ +TG++ +GM + G+ +K LP+YLKE+GY
Sbjct: 64 GTQFTQMYSSHRCSPSRAMALTGRYAFRSGMGSFPIAREVPFGMNTQDKTLPEYLKEVGY 123
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTG---------------HQDYFDHSAEEM 199
T VGKWHLG Y PT RGF++ G+++G + D+FD++ E+
Sbjct: 124 DTHAVGKWHLGVCNSSYLPTSRGFDTFYGHYSGAVDYRGHFIKRSKNFYHDFFDNTIEQH 183
Query: 200 KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYE 258
K LD+ D G+++TD+F +DI+ + P ++YLA A P+E
Sbjct: 184 K---LDLESD--------GQWTTDLFRDRTIDILKEAKRSKTPAYVYLAFNA-----PHE 227
Query: 259 PLQAPDHYLNIHRHIED-----FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
P +AP I R +E + R++ A ++++D +V ++ + ++ + ++IVF+
Sbjct: 228 PTRAPADL--IARILEKHPNLPYTRAEHLAAVNQIDTAVRQIYQQ-AKKDVKRETLIVFM 284
Query: 314 SDNGGAAAGF-----------------NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
SDNGGA F +L A N+P +G KN+L+EGG+ + S
Sbjct: 285 SDNGGAVKKFGGGRSPDNTGHEYDDEESLPRACNYPFKGFKNSLFEGGILSPSFLISTAR 344
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA 380
+ H+ DW PT L A
Sbjct: 345 KFSRTRFNFPFHIIDWAPTFLDFA 368
>gi|294053962|ref|YP_003547620.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613295|gb|ADE53450.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 494
Score = 176 bits (446), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 112/308 (36%), Positives = 159/308 (51%), Gaps = 33/308 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P++I ILADDLG+ DVGF G +I TP +D LA G+I N Y T C PSR+ ++TG+
Sbjct: 23 PNLILILADDLGYADVGFTGSTEIQTPVLDRLAAGGVIFNNGYVTHAYCGPSRAGLITGR 82
Query: 119 HPIHTGMQHNVLYGC--ERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+ G++ N Y GLP EK LK+ GYRT ++GKWHLG Y P R
Sbjct: 83 YQARFGVEVNFPYAPFDPHSGLPTDEKTFATRLKQSGYRTAMIGKWHLG-AAYPYHPNNR 141
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLD----------------MRRDLEPAWDLHGKY 220
GF+ G+ G DY + L+ MR ++ +D +Y
Sbjct: 142 GFDYFYGFLGGAHDYMPENTSTTVPLTLENGKVNHMANAGSYLPLMRNNVNAEFD---EY 198
Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
T + +A I T+ P +YL++ A H+ PLQAP + + HIE KR
Sbjct: 199 LTTALSRDAARFIEK--TEGPFCVYLSYNAPHT-----PLQAPKALIEKYAHIESQKRRT 251
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG--AAAGF-NLNAASNWPLRGVK 337
+AA++ +D+ +G+VV+ALE+ L N++I F SDNGG G+ + A N P RG K
Sbjct: 252 YAAMIDSMDQGIGRVVDALEKSGKLENTLIFFFSDNGGPRPKPGYEGEDFADNTPYRGGK 311
Query: 338 NTLWEGGV 345
+ EGG
Sbjct: 312 GAMLEGGT 319
>gi|325286699|ref|YP_004262489.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
gi|324322153|gb|ADY29618.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
Length = 494
Score = 176 bits (445), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 116/342 (33%), Positives = 179/342 (52%), Gaps = 25/342 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+I+ +L DDLG++DVGF+G I TP +D LA +G I + Y C PSR+A++TGK
Sbjct: 42 PNILLVLCDDLGYSDVGFNGSTDIKTPELDKLANAGTIFTSAYVAHPFCGPSRAALLTGK 101
Query: 119 HPIHTGMQHNVLYGCER--GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+P G Q N+ E G+ +E+ + + L+E GY T +GKWHLG +E+ P R
Sbjct: 102 YPHTIGSQFNLPANGESLGKGIDTNEQFIAKTLQESGYYTGAIGKWHLG-ATEEFHPNQR 160
Query: 177 GFESHLGYWTGHQDYF----DHSAEEMKMWGLDMRRDLEPAWDLHGK------YSTDVFT 226
GF G+ G +YF ++ K + RD + +GK Y TD F+
Sbjct: 161 GFTDFYGFLGGGHNYFPEQYQAQYQKQKKAKKKIIRDYILPLEHNGKEVKETEYLTDAFS 220
Query: 227 AEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
EA + S + P FLYLA+ A P+ PL+A + L I+D R +AA++
Sbjct: 221 REASRFVKEASNKKKPFFLYLAYNA-----PHVPLEAKEEDLEKFSVIKDKDRRTYAAMV 275
Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
+ +D + ++V L++ L N++I+F+SDNGG + A+N PL G K WEGG
Sbjct: 276 YAVDRGIKQIVNTLKETNQLENTLIIFLSDNGGNTD----HGATNNPLHGRKGDTWEGGY 331
Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
R ++ P ++G + D PT L+ K+ +P+
Sbjct: 332 RVPMFMYCPKNIAKGHTNNFPITSLDLYPT-LAHIGKAKLPD 372
>gi|260062070|ref|YP_003195150.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
gi|88783632|gb|EAR14803.1| arylsulphatase A [Robiginitalea biformata HTCC2501]
Length = 459
Score = 176 bits (445), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 123/362 (33%), Positives = 189/362 (52%), Gaps = 34/362 (9%)
Query: 39 VLPLAFTLSMVFVD-LVASSG---------PPHIIFILADDLGWNDVGFHGLDQIPTPNI 88
+ PL L++ + L A++G P+I+ IL DDLG+ D+ G + +PNI
Sbjct: 11 LCPLNAILALFSIGCLAAATGTCYAQERPDAPNILCILVDDLGYGDLSCQGATDLQSPNI 70
Query: 89 DALAYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER--GGLPLSEKIL 145
DALA +G+ N+Y +C+PSR+A++TG++P G+ + E G L ++
Sbjct: 71 DALAANGMRFTNFYANSTVCSPSRAALLTGRYPDLVGVPGVIRQNPENNWGNLADDAVLI 130
Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGL 204
P L GY T I+GKWHLG + + TP RGF G+ D Y+DH + L
Sbjct: 131 PSELNPAGYHTGIIGKWHLGLEEPD-TPNDRGFTYFKGFLGDMMDDYWDHRRGGINWMRL 189
Query: 205 DMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAP 263
+ R +++P ++TD+FT +D + +E P FLYLA+ A P+ P+Q P
Sbjct: 190 N-REEIDPK-----GHATDLFTDWTIDFLKERQGEEQPFFLYLAYNA-----PHFPIQPP 238
Query: 264 DHYLNIHRHIE---DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
+L+ R E KR+K A + LD SVG+V+EAL+ + N+++VFVSDNGGA
Sbjct: 239 REWLDKVREREPNLTEKRAKNVAFVEHLDYSVGRVMEALKTTGLEENTLVVFVSDNGGAL 298
Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
A SN PLRG K ++EGG+R + + + G ++ + D PT A
Sbjct: 299 ----WYAQSNGPLRGGKQDMYEGGIRVPAIFYWKGKIAPGTTSDNTALLMDLFPTFCELA 354
Query: 381 NK 382
+
Sbjct: 355 GR 356
>gi|410617069|ref|ZP_11328045.1| arylsulfatase B [Glaciecola polaris LMG 21857]
gi|410163338|dbj|GAC32183.1| arylsulfatase B [Glaciecola polaris LMG 21857]
Length = 482
Score = 176 bits (445), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 111/334 (33%), Positives = 175/334 (52%), Gaps = 22/334 (6%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIM 115
+ PP+I+FIL+DD G+ D GF G D + TPN+D LA G + + Y + +C PSR+ I+
Sbjct: 35 TSPPNILFILSDDAGYADFGFQGSDVMRTPNLDKLASQGTVFTQAYVSAAVCGPSRAGIL 94
Query: 116 TGKHPIHTGMQHNVL--YGCERG------GLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
TGK+ G + N + Y + G GLPL +K + YL+E GY+T ++GKWH G
Sbjct: 95 TGKYQQRFGYEENNVPGYMSQSGLTGDDMGLPLDQKTMADYLRERGYKTALIGKWHQG-N 153
Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
+ PT RGF+ G+ G + YF A+ + D + +Y T+
Sbjct: 154 ADRFHPTKRGFDEFYGFRGGARSYFGFGAQNPVSYPEDKLERGFAHFQESKRYLTEALAT 213
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
E V+ I + P F++L+ A H+ P++A L +++ KR + AA+
Sbjct: 214 ETVEFIKRNQK-HPFFVFLSFNAVHT-----PMEAKPADLAQFSNLKG-KRQQLAAMTLS 266
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+D +GKV+ L++ + N+++VF +DNGG + + NA++N PL G K EGG+R
Sbjct: 267 MDREIGKVLNTLDELGLSDNTLVVFTNDNGGPS---DTNASNNGPLSGTKANHLEGGIRV 323
Query: 348 AGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
++ W S+G + D LPT SAA
Sbjct: 324 PFIMRWPAGKVSQGEYPNP-ISTLDLLPTFFSAA 356
>gi|422293430|gb|EKU20730.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
gi|422295486|gb|EKU22785.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
Length = 703
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 125/349 (35%), Positives = 188/349 (53%), Gaps = 47/349 (13%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIP----TPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
PPH++ I+ DD G DVG++ + P TP +D+LA + LK YY +CTP+R+A+
Sbjct: 61 PPHVLMIVVDDAGVQDVGYNASPESPLRGKTPVLDSLAAESVRLKEYYVHPVCTPTRAAL 120
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK-ELGYRTRIVGKWHLGFYKKEYTP 173
+TG++ ++ GM L G GL S LP+ LK E Y T +VGKWHLG K + P
Sbjct: 121 LTGRYAVNVGMPFP-LIGDAISGLDGSIPTLPEMLKSEANYSTHLVGKWHLGAAKAKNRP 179
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMK-MWGLDMRRDLE-PAWDLHGK-YSTDVFTAEAV 230
RGF+S G D++ E++ +W + + E PA ++ K ++T +F+ EAV
Sbjct: 180 LARGFDSFYGLLGASFDHYTKKMGEVRDLW----KNEAEVPAKEVDEKEHATTLFSREAV 235
Query: 231 DIIHNHSTD----------EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
+I HS +PLFLYLA++A P+ PLQA + ++ + + + R
Sbjct: 236 KVIEEHSARGHAGAKDGDMDPLFLYLAYSA-----PHAPLQADEKFMKLCSDVPNRHRRT 290
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG--AAAGFNLNAASNWPLRGVKN 338
F A++ +LDE +G V +L+ M ++++VF SDNGG GFN N RG K+
Sbjct: 291 FCAMMAQLDEGIGNVTASLKAHGMWEDTVVVFTSDNGGYFLVGGFNYN------YRGGKS 344
Query: 339 TLWEGGVRGAGLIWSPL------LESRGIVAEQYVHVSDWLPTLLSAAN 381
WEGGVR I +P + RG+ HV+DW+PTL+ A
Sbjct: 345 GGWEGGVRVPAFIRAPKRYNFAPRDFRGM-----AHVADWMPTLMGMAT 388
>gi|285808548|gb|ADC36070.1| sulfatase [uncultured bacterium 213]
Length = 478
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 110/334 (32%), Positives = 172/334 (51%), Gaps = 29/334 (8%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTG 117
PP+I+FI+ADDLG+ DV +G + TPN+D +A G+ L+ Y +C+ +R+A++TG
Sbjct: 38 PPNIVFIMADDLGYADVSCYGRPDLNTPNVDRVALKGVRFLQAYANSAVCSATRTALITG 97
Query: 118 KHP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
++ + G++ + G + GLP LP L++ GYRT ++GKWHLG K + P
Sbjct: 98 RYQYRLPIGLEEPLGIGRDV-GLPPEHPTLPSLLRKAGYRTTLLGKWHLGALPK-FGPLQ 155
Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
G++ G+ G DY+ H+ + + D L Y TD+ + AVD+I+
Sbjct: 156 SGYDHFYGFRGGSVDYYTHAGPDQRDDLWDDDVPLR-----QSGYLTDLLGSRAVDVING 210
Query: 236 HS-TDEPLFLYLAHAATHSANPYEPLQAPDHYL-------NIHRHIEDFKRSKFAAILHK 287
+S +D P + L H + P+ P +AP RH + + + ++ +
Sbjct: 211 YSHSDRPFLVSL-----HFSAPHWPWEAPGDEAESNRIRSTTLRHYDGGTQKTYQRMIEQ 265
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+D +G+V+ AL+ N+I+VF SDNGG A WP G K L EGG+R
Sbjct: 266 MDLQIGRVLGALDANGATDNTIVVFTSDNGGE------RFADTWPFTGRKTELLEGGLRI 319
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
LI P +G +Q DWLPTLL+AA
Sbjct: 320 PALISWPARIRQGATTDQVAMSMDWLPTLLAAAG 353
>gi|372210595|ref|ZP_09498397.1| sulfatase [Flavobacteriaceae bacterium S85]
Length = 472
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 109/319 (34%), Positives = 170/319 (53%), Gaps = 28/319 (8%)
Query: 42 LAFTLSMVFVDL--VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
L FTL+++ + + P+II +ADD+G+ D GF G I TPN+D LA +G K
Sbjct: 6 LLFTLTIIGITFGHAQKNKKPNIIVFMADDMGYADTGFTGATDIQTPNLDNLAKNGAFFK 65
Query: 100 NYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLY--GCERGGLPLSEKILPQYLKELGYRT 156
Y C PSR+A+++G++ G + N Y G+ + EK+ P+ L+E GY+T
Sbjct: 66 QGYANHAYCGPSRAALLSGRYQHRFGFETNPAYDPANPHMGIDVGEKLFPKRLQEAGYKT 125
Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP---- 212
+GKWHLG E+ P RGF+ G+ G DYF + K+W L+P
Sbjct: 126 GAIGKWHLG-AAAEFHPLNRGFDYFYGFLGGGHDYFRIDGTK-KVW----EAYLQPLVRN 179
Query: 213 --AWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
A + G Y T + +A + ++ + P FLY+A+ A P+ PLQAP + +
Sbjct: 180 KRADNFEG-YLTTALSNDAAQFVKDNK-ENPFFLYVAYNA-----PHMPLQAPKEDIARY 232
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG----AAAGFNLN 326
HI+D KR +AA++ +D +G V+++L+ + N++I F+SDNGG + N
Sbjct: 233 AHIKDNKRRVYAAMVDVMDRGIGTVIQSLKDSGIYENTLIFFLSDNGGPQSKGGSTKGWN 292
Query: 327 AASNWPLRGVKNTLWEGGV 345
+ N P RG K L++GGV
Sbjct: 293 GSDNQPFRGGKGNLYDGGV 311
>gi|126304968|ref|XP_001376926.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Monodelphis
domestica]
Length = 520
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 119/371 (32%), Positives = 187/371 (50%), Gaps = 47/371 (12%)
Query: 38 AVLPLAFTLSMVF----VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
A + L +S++F + A+S PP+I+F+L DD+GW D+G G TP++D +A
Sbjct: 2 ATVALVLGVSLLFGWAPLGAGATSQPPNIVFLLMDDMGWGDLGVFGEPSRETPHLDQMAA 61
Query: 94 SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKIL 145
G++ N+YT LC+PSR+A++TG+ PI G N E GG+P SE +L
Sbjct: 62 EGMLFPNFYTANPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDSEFLL 121
Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
P+ LK+ GY +IVGKWHLG ++ ++ P GF+ G H +D+ A M +
Sbjct: 122 PELLKKAGYVNKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKA----MPNIP 176
Query: 206 MRRDLEPAWDLHGKYS--------------TDVFTAEAVDIIH-NHSTDEPLFLYLAHAA 250
+ R+ W++ G++ T ++ EAVD I + +P FLY A A
Sbjct: 177 VYRN----WEMVGRFYEDFPINHKTGEANLTQIYLKEAVDFIKKQQAHQQPFFLYWAIDA 232
Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
TH+ P+ A +L +R ++ + ++D+S+GK+++ L+ + N+ +
Sbjct: 233 THA-----PVYASKSFLGTS------QRGRYGDAVREIDDSIGKILKLLQDLGIKENTFV 281
Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
F SDNG A SN P K T +EGG+R + W P G V+ Q +V
Sbjct: 282 FFTSDNGAALISAPKEGGSNGPFLCGKQTTFEGGLREPAIAWWPGHIPAGQVSHQLGNVM 341
Query: 371 DWLPTLLSAAN 381
D T LS A
Sbjct: 342 DLFTTGLSLAG 352
>gi|301608268|ref|XP_002933701.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Xenopus
(Silurana) tropicalis]
Length = 520
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 117/371 (31%), Positives = 184/371 (49%), Gaps = 48/371 (12%)
Query: 39 VLPLAFTLSMVFVDLVASSGP-----PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
L + + L++ F++ V G P+I+ +L DD+GW D+G G TPN+D +A
Sbjct: 2 ALYIMYALALGFLNPVLVLGGMNGTRPNILLLLMDDMGWGDLGVFGEPSRETPNLDKMAS 61
Query: 94 SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKIL 145
G++ N+YT LC+PSR+A++TG+ PI G N E GG+P SE +
Sbjct: 62 EGMLFPNFYTANPLCSPSRAALLTGRLPIRNGFYTTNSHARNAYTPQEIMGGIPDSEVLF 121
Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
P+ LK+ GY +I+GKWHLG ++ Y P RGF+ G H +D+ +
Sbjct: 122 PELLKKAGYVNKIIGKWHLG-HRPSYHPLRRGFDEWFGSPNCHFGPYDNKQRP----NIP 176
Query: 206 MRRDLEPAWDLHGKYS--------------TDVFTAEAVDIIHNHSTDE-PLFLYLAHAA 250
+ RD WD+ G+Y T ++ EA++ I+ + D+ P FLY + A
Sbjct: 177 VYRD----WDMVGRYYEDFNIDHKTGESNLTQIYLQEALEFINRQAGDQQPFFLYWSIDA 232
Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
TH+ P+ A +L R R + + ++D S+GK++ L Q+ ++ ++ +
Sbjct: 233 THA-----PVYASRPFLGTSR------RGLYGDAVREIDFSIGKILGLLNQKDIIKDTFV 281
Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
F SDNG A + SN P K T +EGG+R G+ W P + G V+ Q +
Sbjct: 282 FFTSDNGAALISAPMQGGSNGPFLCGKQTTFEGGMRVPGIAWWPGHVAPGQVSYQLGSIM 341
Query: 371 DWLPTLLSAAN 381
D T LS A
Sbjct: 342 DLFTTSLSLAG 352
>gi|254444367|ref|ZP_05057843.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
gi|198258675|gb|EDY82983.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
Length = 462
Score = 174 bits (442), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 108/324 (33%), Positives = 179/324 (55%), Gaps = 26/324 (8%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
PP+I+FI ADDLG+ND+ +G I TP ID+L GI ++Y+ +C+PSR+A++TG
Sbjct: 34 PPNIVFIFADDLGYNDLSSYGATDIATPAIDSLGEQGIRFTDFYSASPVCSPSRAALLTG 93
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
++PI G+ V + G+ +E + + L+E GYRT +VGKWHLG ++K + P G
Sbjct: 94 RYPIRQGIT-GVFWPQSFDGIDPAETTIAELLQENGYRTGLVGKWHLGHHQK-HLPLQNG 151
Query: 178 FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
F S+ G + +M M D+E ++++ Y+T +T EAV I +
Sbjct: 152 FHSYFGI---------PYSNDMDMVVYMRGNDVE-SYEVDQHYTTRRYTEEAVQFIEQNK 201
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
D+P FLYLAH+ P+ P+ A ++++ KR + ++ +LD SV ++++
Sbjct: 202 -DQPFFLYLAHSM-----PHVPIYASENFVGTS------KRGLYGDVIQELDWSVAQILD 249
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
L++ ++ N+++VF SDNG A +L ++ PLR K ++GG+R L+ P
Sbjct: 250 TLDKHQLSENTLVVFTSDNGPWTALKHLGGSAA-PLREGKMFTFDGGMRVPCLVRWPAQI 308
Query: 358 SRGIVAEQYVHVSDWLPTLLSAAN 381
G + ++ DW PT AN
Sbjct: 309 PAGQTSHAMANMMDWFPTFSRIAN 332
>gi|423219918|ref|ZP_17206414.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
CL03T12C61]
gi|392624181|gb|EIY18274.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
CL03T12C61]
Length = 463
Score = 174 bits (442), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 107/294 (36%), Positives = 156/294 (53%), Gaps = 31/294 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+I+ IL DD G+ND GF G ++ TPNIDAL G++ + + + +PSR+ ++TG+
Sbjct: 32 PNILVILIDDAGYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPSRACLITGR 91
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G + N+ GLPL E+ + + K GYRT +GKWHLG + E P RGF
Sbjct: 92 YGHRFGYECNL--SDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLG-SRDEQHPNNRGF 148
Query: 179 ESHLGYWTGHQDYF------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
+ G G +DYF D +E + D + E KY TD F+ +AV+
Sbjct: 149 DLFYGMKAGGRDYFYNEKKSDRPGDERNLLLNDRQVKFE-------KYLTDAFSEKAVEF 201
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I+ S +P +YLA+ A H+ P+QA D + E R K AA+ + LD V
Sbjct: 202 INESS--QPFMMYLAYNAVHT-----PMQATDEDM---AKFEGHPRQKLAAMTYALDRGV 251
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
G V+ L+ N++I F+SDNGGA N +SN+PL+G K +EGG R
Sbjct: 252 GTVIRGLKDSGKFDNTLIFFLSDNGGATT----NQSSNYPLKGFKGNKFEGGHR 301
>gi|414070344|ref|ZP_11406330.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410807261|gb|EKS13241.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 470
Score = 174 bits (442), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/334 (33%), Positives = 175/334 (52%), Gaps = 23/334 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+I+ ++ADD G++D GF G + + TP ID LA ++ + Y T +C PSR+ + TGK
Sbjct: 27 PNILLVIADDAGYHDFGFQGSEVMQTPTIDKLASQSVVFEQAYVTAAVCGPSRAGLYTGK 86
Query: 119 HPIHTGMQHNVLYG--------CERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
+ G + N + G ++ GLP ++ + ++LKELGY T + GKWH G + +
Sbjct: 87 YQQRFGFEENNVPGYMSKSGFTGDKMGLPFTQVTMAEHLKELGYHTGLFGKWHQGNH-DD 145
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW-GLDMRRDLEPAWDLHGKYSTDVFTAEA 229
Y PT RGF++ G+ G + YF +S EE + + M RD + + H Y TD +
Sbjct: 146 YHPTKRGFDNFYGFREGARGYFAYSNEEQQAYPSQKMERDFKHYIE-HEGYLTDALATQT 204
Query: 230 VDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
I + +P F L+ +A H+ P+QA + L H+ KR AA+ +
Sbjct: 205 SHFIGQSVVNKQPFFAVLSFSAVHA-----PMQATNADLAQFPHLTG-KRKILAAMNFAM 258
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
D ++ V++ L ++ N+++VF++DNGG + N A+N+PL G K EGG+R
Sbjct: 259 DRAIAVVLDKLNALKISDNTLVVFINDNGGPT---DQNYANNYPLSGTKANHLEGGIRVP 315
Query: 349 GLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
L+ W L S V D LPT + AAN
Sbjct: 316 MLLKWPKRLSSNNTRYTYPVSSLDLLPTFIHAAN 349
>gi|153807102|ref|ZP_01959770.1| hypothetical protein BACCAC_01379 [Bacteroides caccae ATCC 43185]
gi|149130222|gb|EDM21432.1| arylsulfatase [Bacteroides caccae ATCC 43185]
Length = 463
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 107/294 (36%), Positives = 156/294 (53%), Gaps = 31/294 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+I+ IL DD G+ND GF G ++ TPNIDAL G++ + + + +PSR+ ++TG+
Sbjct: 32 PNILVILIDDAGYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPSRACLITGR 91
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G + N+ GLPL E+ + + K GYRT +GKWHLG + E P RGF
Sbjct: 92 YGHRFGYECNL--SDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLG-SRDEQHPNNRGF 148
Query: 179 ESHLGYWTGHQDYF------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
+ G G +DYF D +E + D + E KY TD F+ +AV+
Sbjct: 149 DLFYGMKAGGRDYFYNEKKSDRPGDERNLLLNDRQVKFE-------KYLTDAFSEKAVEF 201
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I+ S +P +YLA+ A H+ P+QA D + E R K AA+ + LD V
Sbjct: 202 INESS--QPFMMYLAYNAVHT-----PMQATDEDM---AKFEGHPRQKLAAMTYALDRGV 251
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
G V+ L+ N++I F+SDNGGA N +SN+PL+G K +EGG R
Sbjct: 252 GTVIRGLKDSGKFDNTLIFFLSDNGGATT----NQSSNYPLKGFKGNKFEGGHR 301
>gi|340380159|ref|XP_003388591.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 500
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 113/350 (32%), Positives = 177/350 (50%), Gaps = 24/350 (6%)
Query: 37 FAVLPLA-FTLSMVFVDLVASSGP--PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
FA L L F+LS+ V + A++ P PHI+F+L DD G+ D F I TP+ L
Sbjct: 2 FAQLLLGLFSLSVCAVTVGAAAPPDKPHIVFVLVDDWGFADASFRN-PAIKTPSFQYLVD 60
Query: 94 SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
+G+IL +Y + C+PSR++ +TG+ P H + G G ++ +LP LK G
Sbjct: 61 NGLILNRHYVFKYCSPSRASFLTGRFPHHVHQWNPTPPGMV--GTNINMTMLPAKLKTAG 118
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y T +VGKWH G Y Y P RGF++ G+ + +F+ + +D ++ P
Sbjct: 119 YSTHMVGKWHQGLYDPAYLPVNRGFDTSSGFLQAGEGHFNQTIG----CAVDFWKNHAPD 174
Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH-RH 272
+G Y + ++ + I H +PLFLYL H+ P++AP +L+++ +
Sbjct: 175 -TRNGTYDSYIYNKDLTTIFSKHDASKPLFLYLPLHNVHA-----PIEAPQEWLDLYPVN 228
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
R A++ D G VV L++ M N+I+V +DNGGA SN+P
Sbjct: 229 STCATRRVLQAMVSVADNVTGHVVNLLKENDMWDNTIMVISADNGGADC-----KGSNYP 283
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAA 380
L+G K+T +EGGVR L+ + RG + ++HV+DW T A
Sbjct: 284 LKGSKHTFFEGGVRVIAFASGGLIPAARRGKSTDGFIHVADWYTTFCKMA 333
>gi|325286704|ref|YP_004262494.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
gi|324322158|gb|ADY29623.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
Length = 484
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 123/356 (34%), Positives = 181/356 (50%), Gaps = 36/356 (10%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKN 100
L F L + + + P+IIF+ +DD G+ D GF G + TPN+D LA SG +
Sbjct: 5 LIFILLLTVAINLNAQEKPNIIFLFSDDAGYADFGFQGSKIMKTPNLDKLAKSGAKFTQG 64
Query: 101 YYTVQLCTPSRSAIMTGKHPIHTG---------MQHNVLYGCERGGLPLSEKILPQYLKE 151
Y T C PSR+ ++TGK+ G M N + + GLPL + + +LK+
Sbjct: 65 YVTDATCGPSRAGLITGKYQQRFGYEEINVPGYMSANSKFLADDMGLPLDQLTIADHLKK 124
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD-- 209
LGY+T + GKWHLG + Y PT RGF+ G+ G ++YF ++ ++ LD R +
Sbjct: 125 LGYKTAMYGKWHLGDADR-YHPTKRGFDEFYGFRGGARNYFGYN--DVSKANLDNRMERG 181
Query: 210 ----LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
EP +Y TD EAV I + + P F+YLA A H+ P+QA
Sbjct: 182 FGNYQEPT-----EYVTDALAKEAVSFIEKNKGN-PFFIYLAFNAVHT-----PMQATKE 230
Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
LN ++ KR + AA+ LD + G V+ L++ + N+I+VF +DNGG +
Sbjct: 231 DLNKFPNLTG-KRKELAAMTLALDRACGTVLNKLKELGLDKNTIVVFSNDNGGPT---DK 286
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
NA+ N PL G K+ EGG+R LI W ++S+ V V D LPT +A
Sbjct: 287 NASLNLPLSGTKSNHLEGGIRVPFLISWPKQIKSK-TVYNFPVSTLDLLPTFYAAG 341
>gi|149178145|ref|ZP_01856740.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
gi|148843065|gb|EDL57433.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
Length = 460
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 113/358 (31%), Positives = 181/358 (50%), Gaps = 32/358 (8%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L F F + ++ P+I+ I DD G NDVG +G +IPTP+ID LA G++ + Y
Sbjct: 10 LMFLSLFAFCSQLQAAERPNILIIFTDDQGINDVGCYG-SEIPTPHIDQLAKEGLLFRQY 68
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGC--------ERGGLPLSEKILPQYLKEL 152
Y+ +CTPSR I+TG++P + + L G + G+ E + L++
Sbjct: 69 YSASAICTPSRFGILTGRNPTRS---QDQLLGALMFMSDIDQNRGIQPGETTIADVLQQN 125
Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP 212
GY+T ++GKWHLG + + PT GF+ G+ G DYF + + W + R E
Sbjct: 126 GYQTALLGKWHLGHGTESFLPTAHGFDLFRGHTGGCIDYFTMTYGNIPDWYHNQRHVSEN 185
Query: 213 AWDLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEP--------LQAP 263
Y+TD+ T EA + + +TD+P FL+L++ A H + P +QA
Sbjct: 186 G------YATDLITEEAEHFLKDQQTTDKPFFLFLSYNAPHFGKGWSPGDQSPVNIMQAR 239
Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
L I+D R +FAA+ LD+ +G+V+ +L+ + N++++F++D+GG
Sbjct: 240 GDDLKRVGTIKDKVRREFAAMTVSLDDGIGRVMSSLKNNGLDQNTLVIFMTDHGGDY--- 296
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ +N P RG K TL+EGG+R +I P G + D PT+ AN
Sbjct: 297 -VYGGNNQPFRGAKATLFEGGIRVPCIIRWPGKIKAGTETNEVAWALDLFPTICHFAN 353
>gi|319951998|ref|YP_004163265.1| n-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
gi|319420658|gb|ADV47767.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
Length = 471
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 117/353 (33%), Positives = 172/353 (48%), Gaps = 30/353 (8%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKN 100
L LS+ + ++ P+IIF+ +DD G+ D GF G ++ TPN+D LA SG+ +
Sbjct: 4 LILFLSICCFSISSAQEKPNIIFLFSDDAGFADFGFQGSTEMKTPNLDKLANSGVKFTQG 63
Query: 101 YYTVQLCTPSRSAIMTGKHPIHTG---------MQHNVLYGCERGGLPLSEKILPQYLKE 151
Y T C PSR+ ++TGK+ G M N + + GLPL + + YLK+
Sbjct: 64 YVTDATCGPSRAGLITGKYQQRFGYEEINVPGYMSENSKFLADDMGLPLDQLTIGDYLKK 123
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL- 210
LGY T + GKWHLG + P RGF+ G+ G + YF + M R
Sbjct: 124 LGYNTAMYGKWHLG-NADRFHPMNRGFDEFYGFRGGARSYFGYDVASSAHHDTKMERGFG 182
Query: 211 ---EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
EP +Y TD EA+ I + + P F+YLA A H+ P++A L
Sbjct: 183 NFEEPQ-----EYVTDALADEAISFIEKNKKN-PFFIYLAFNAVHT-----PMEATKEDL 231
Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
N ++ KR + AA+ LD + G+V++ L+Q + N+IIVF +DNGG + NA
Sbjct: 232 NQFPNLTG-KRKELAAMTLALDRACGRVLDKLKQLGLDKNTIIVFSNDNGGPT---DKNA 287
Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+ N PL G K+ EGG+R L+ P + D LPT +A
Sbjct: 288 SLNLPLSGTKSNHLEGGIRVPFLMSWPKHIQPKTTYTFPISTMDLLPTFYAAG 340
>gi|87306602|ref|ZP_01088749.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
gi|87290781|gb|EAQ82668.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
Length = 468
Score = 174 bits (440), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 128/381 (33%), Positives = 194/381 (50%), Gaps = 48/381 (12%)
Query: 30 YRTRIMAFAVLPLAFTLSMVFVDLVA---SSGPPHIIFILADDLGWNDVGFHGLDQIPTP 86
+ TR+M F A +++ + VA S PP I+ I++DD G+ D+ G + TP
Sbjct: 2 FSTRLMTFVC---ALASALLVSNAVAAEKSKRPPSIVLIVSDDQGFADLSCIGDNGCRTP 58
Query: 87 NIDALAYSGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGM---------QHNVLYGCERG 136
+D LA SG L ++Y CTPSR+++MTG++P G ++ LY E
Sbjct: 59 RLDQLAASGTRLTSFYVSWPACTPSRASLMTGRYPQRNGTYDMIRNEAPDYDYLYTPEEY 118
Query: 137 --------GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH 188
G L E L LK+ GY + + GKW G K+ Y P RGF+ + G+
Sbjct: 119 AVTAERILGTDLQEVFLADVLKQAGYVSAVFGKWDGGQLKR-YLPLQRGFDQYYGFANTG 177
Query: 189 QDYFDHSAEEMKMWGL-DMRRDLEPAWDLHGKYSTDVFTAEAVDII-HNHSTDEPLFLYL 246
DYF H + +G+ M RD +P + G Y TD+F EA+ I NH D P FLYL
Sbjct: 178 VDYFTH-----ERYGVPSMFRDNQPTEEDKGTYLTDLFEREAIRFIDENH--DRPFFLYL 230
Query: 247 AHAATHSANPYEP-----LQAPDHYLNIHRHIE---DFKRSKFAAILHKLDESVGKVVEA 298
A HSA+ + QAP YL+ E + +R + A + ++DE++GKVV+
Sbjct: 231 PFNAPHSASNLDRSIRGFAQAPQEYLDHFPGGESKQEKRRQAYLAAVERMDEAIGKVVDQ 290
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L+Q ++ N++I+F+SDNG A N PLRG K ++EGG R ++ P
Sbjct: 291 LQQHQIADNTLIIFLSDNG------GGGGADNSPLRGGKAKMFEGGNRVPCIVHWPGKVP 344
Query: 359 RGIVAEQYVHVSDWLPTLLSA 379
G V+ Q++ + PT+++A
Sbjct: 345 AGKVSNQFLTSLEVFPTVIAA 365
>gi|403072042|pdb|4FDI|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A
gi|403072043|pdb|4FDI|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A
gi|403072044|pdb|4FDJ|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
With Galnac
gi|403072045|pdb|4FDJ|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
With Galnac
Length = 502
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 109/345 (31%), Positives = 173/345 (50%), Gaps = 42/345 (12%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
PP+I+ +L DD+GW D+G +G TPN+D +A G++ N+Y+ L +PSR+A++TG
Sbjct: 4 PPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLXSPSRAALLTG 63
Query: 118 KHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
+ PI G + GG+P SE++LP+ LK+ GY ++IVGKWHLG ++ +
Sbjct: 64 RLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQ 122
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
+ P GF+ G H +D+ A + + RD W++ G+Y
Sbjct: 123 FHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKT 174
Query: 222 -----TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
T ++ EA+D I + P FLY A ATH+ P+ A +L
Sbjct: 175 GEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----PVYASKPFLGTS------ 223
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
+R ++ + ++D+S+GK++E L+ + N+ + F SDNG A SN P
Sbjct: 224 QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCG 283
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K T +EGG+R L W P + G V+ Q + D T L+ A
Sbjct: 284 KQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAG 328
>gi|47522740|ref|NP_999120.1| N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
gi|75054309|sp|Q8WNQ7.1|GALNS_PIG RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|18028088|gb|AAL55968.1|AF322917_1 N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
Length = 522
Score = 173 bits (439), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 115/363 (31%), Positives = 177/363 (48%), Gaps = 43/363 (11%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++
Sbjct: 12 LLLVLSAAGLGVTGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSF 71
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
Y LC+PSR+A++TG+ PI TG N E GG+P E +LP+ LK G
Sbjct: 72 YAANPLCSPSRAALLTGRLPIRTGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAG 131
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + RD
Sbjct: 132 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARP----NIPVYRD---- 182
Query: 214 WDLHGKYS--------------TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYE 258
W++ G++ T ++ EA+D I +T P FLY A ATH+
Sbjct: 183 WEMVGRFYEEFPINLKTGESNLTQIYLQEALDFIKRQQATHHPFFLYWAIDATHA----- 237
Query: 259 PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
P+ A +L +R ++ + ++D+SVG++V L ++ N+ + F SDNG
Sbjct: 238 PVYASRAFLGTS------QRGRYGDAVREIDDSVGRIVGLLRDLKIAGNTFVFFTSDNGA 291
Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
A SN P K T +EGG+R + W P G V+ Q V D T LS
Sbjct: 292 ALVSAPKQGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSVMDLFTTSLS 351
Query: 379 AAN 381
A
Sbjct: 352 LAG 354
>gi|320102821|ref|YP_004178412.1| sulfatase [Isosphaera pallida ATCC 43644]
gi|319750103|gb|ADV61863.1| sulfatase [Isosphaera pallida ATCC 43644]
Length = 480
Score = 173 bits (439), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 118/350 (33%), Positives = 180/350 (51%), Gaps = 31/350 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+I+ ILADDLGW D+ ++G Q TP++D LA G +YT +C PSR+A++TGK
Sbjct: 36 PNIVLILADDLGWGDLSYNGRIQWSTPHLDDLAARGARFDRFYTAGVVCAPSRAALLTGK 95
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE----YTPT 174
+PIH G N LP E + + LK GY T + GKWH G +E P
Sbjct: 96 NPIHCGTWRN------DHDLPAEEVTIAEALKAKGYHTALYGKWHQGRPTREGGTRTHPL 149
Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+GF+ G+ + H+ E+ + R +E G+Y+ D FT + V+ I
Sbjct: 150 DQGFDETFGFLSA-----THAWEKFPNHLFNGRDRVEVP---QGRYADDWFTDKGVEFIA 201
Query: 235 NHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFAAILHKLDESV 292
H D P FLYL +TH ++AP+ + HR D ++ +AAI+ ++D V
Sbjct: 202 RHKDDPTPFFLYLPLTSTHFH-----IEAPEDEIAKHRSTFSDPIQTAYAAIVTRMDAQV 256
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA----SNWPLRGVKNTLWEGGVRGA 348
G++V+A+ + +++IVF SD+G G + A+ SN P RG K T++EGG+R
Sbjct: 257 GRIVQAVRDAGLEDSTLIVFTSDHGATFEGGSKEASSSLDSNHPFRGQKRTVYEGGIRVP 316
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
G++ P + G + V D LPT L A P++ + NI+P
Sbjct: 317 GIMTYPGVIPAGRIVSTPVQTIDLLPTFLELAGGRPEPDWKVDGL-NILP 365
>gi|372210513|ref|ZP_09498315.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 465
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 121/372 (32%), Positives = 190/372 (51%), Gaps = 35/372 (9%)
Query: 42 LAFTLSMVFVDLVASS-GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-K 99
+ F + + LV ++ P+I+ + ADD G+ D GF G + TPN+D LA SG+ +
Sbjct: 4 ITFLICFFGIHLVVNAQNKPNIVLLFADDAGYMDFGFQGSKVMKTPNLDKLAKSGVTFTQ 63
Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQ----------HNVLYGCERGGLPLSEKILPQYL 149
Y + C PSR+ +MTGK+ G + H+ L G E G LPL +K++ YL
Sbjct: 64 GYVSDPTCGPSRAGMMTGKYQARFGYEEINVPGYMSSHSALKGDEMG-LPLDQKLMSNYL 122
Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD 209
K+LGY+T + GKWHLG + P RGF+ G+ G + YF + R
Sbjct: 123 KDLGYKTAVYGKWHLG-NADRFHPLNRGFDEFYGFRGGARSYFAYKNPTGD-------RK 174
Query: 210 LEPAWDLH---GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
+E + + Y+TDVF +AV I + + P F+YL+ A H+ P++A +
Sbjct: 175 METNFGQYEEPNHYATDVFAEKAVHFIERNK-EHPFFIYLSFNAVHT-----PMEATEAD 228
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN 326
L ++ KR + AA+ LD + GKV+ L++ + N+I+VF +DNGG + + N
Sbjct: 229 LAQFPNLTG-KRQQLAAMTLALDRACGKVLNKLKELGIDRNTIVVFSNDNGGPS---DKN 284
Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
A++N PL G K+ EGG+R L+ P + V D LPT +AA D+
Sbjct: 285 ASNNKPLSGTKSNHLEGGIRVPFLMSWPKEIKKKSRFNYPVVTFDLLPTFYAAAG-GDLK 343
Query: 387 NYVNSTVENIIP 398
+ + N++P
Sbjct: 344 DLKDINGVNVLP 355
>gi|75910438|ref|YP_324734.1| twin-arginine translocation pathway signal protein [Anabaena
variabilis ATCC 29413]
gi|75704163|gb|ABA23839.1| Twin-arginine translocation pathway signal [Anabaena variabilis
ATCC 29413]
Length = 457
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 119/378 (31%), Positives = 191/378 (50%), Gaps = 35/378 (9%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
M A +A + +F A S P+++FIL DD+GW D+ +G TPN+D LA
Sbjct: 17 MTAAGTLMATASANLFSRATAQSSRPNVVFILVDDMGWGDLSIYGRTDYETPNLDRLARQ 76
Query: 95 GIILKNYYTVQ-LCTPSRSAIMTGKHP--IHTGMQHNVLYGCERG----GLPLSEKILPQ 147
G+ N Y Q +CTP+R A +TG++ + G++ + + G+P ++ +
Sbjct: 77 GVRFTNAYANQTVCTPTRIAFLTGRYQARLPVGLREPLGARSQPASNNIGIPANQPTIAS 136
Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR 207
LK GY T +VGKWH G Y + P +GF+ + G+ +G +YF H+ + LD+
Sbjct: 137 LLKANGYETALVGKWHAG-YPPNFGPLQKGFDEYFGHLSGGIEYFTHTGTDRI---LDLY 192
Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
+ P Y TD+FT AV+ I HS P +L L H P+ P Q P+
Sbjct: 193 ENDVPV--QRSGYVTDLFTDRAVEFIQRPHS--RPFYLSL-----HYNAPHWPWQGPNDQ 243
Query: 267 LNIHRHIED-----FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA 321
+ ++ + ++ +AA++ LD+ VG+V++ALE N++++F SDNGG
Sbjct: 244 ASTAFYLTNGYTVGGSQATYAAMVKSLDDGVGRVLDALEASGQADNTLVIFTSDNGG--- 300
Query: 322 GFNLNAASNW-PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SN+ P RG K +L+EGG+R +I P + V+ Q + D T+L+A
Sbjct: 301 ----ERFSNFGPFRGQKASLYEGGIRVPAIIRYPGVTQANQVSNQVIITFDLTATILAAT 356
Query: 381 NKSDIPNYVNSTVENIIP 398
S PNY +N++P
Sbjct: 357 GTSFHPNYPPDG-QNLLP 373
>gi|320105193|ref|YP_004180784.1| sulfatase [Isosphaera pallida ATCC 43644]
gi|319752475|gb|ADV64235.1| sulfatase [Isosphaera pallida ATCC 43644]
Length = 481
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 121/357 (33%), Positives = 184/357 (51%), Gaps = 31/357 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
PHI+ I+ DDLG+ D+G +G I TP ID+LA G L ++++ +CTP+R+A++TG+
Sbjct: 65 PHIVLIMTDDLGYADLGCYGAPDIATPRIDSLARDGARLSHFHSPGPVCTPTRAALLTGR 124
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
P G++ + GLP+ E IL + LKE+GYRT +VGKWHLG Y+ E+ P GF
Sbjct: 125 WPQRVGLEWALSASDTEPGLPVEEPILSRPLKEVGYRTVMVGKWHLG-YRPEFGPNAHGF 183
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+ G +G+ D++ H K D + +P + G YSTD+ + AV I +
Sbjct: 184 DEFFGLLSGNVDHYSHREINGKE---DWYENTKPV-RVEG-YSTDLLSDRAVAAIQKTAA 238
Query: 239 D-----EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF--KRSKFAAILHKLDES 291
+PL+LY+A+ A H P Q P +I F R+ + ++ +D
Sbjct: 239 QPPDQRQPLWLYVAYNAVH-----WPFQPPGRPQDIRDRSTWFNGTRADYVKMVESIDAG 293
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
VG++++ALE M +++++F +DNGG + N P K T+WEGG R LI
Sbjct: 294 VGRILDALEAGGMADHTLVIFTNDNGGE------RLSDNGPFFHHKGTVWEGGHRVPCLI 347
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYE 408
P + G Q D T+L+AA P V N++P ILR E
Sbjct: 348 RWPGRIAAGTEFGQPTIGMDLTATILAAAKIPPNPERPLDGV-NLVP-----ILRGE 398
>gi|171184398|ref|NP_057931.3| N-acetylgalactosamine-6-sulfatase isoform 1 precursor [Mus
musculus]
gi|124007189|sp|Q571E4.2|GALNS_MOUSE RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|74198064|dbj|BAE35212.1| unnamed protein product [Mus musculus]
gi|148679747|gb|EDL11694.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_b [Mus
musculus]
Length = 520
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 112/346 (32%), Positives = 174/346 (50%), Gaps = 43/346 (12%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++Y+ LC+PSR+A++TG
Sbjct: 27 PPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTG 86
Query: 118 KHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
+ PI G N E GG+P SE +LP+ LK+ GY +IVGKWHLG ++ +
Sbjct: 87 RLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQ 145
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
+ P GF+ G H +D+ A+ + + RD W++ G++
Sbjct: 146 FHPLKHGFDEWFGSPNCHFGPYDNKAKP----NIPVYRD----WEMVGRFYEEFPINRKT 197
Query: 222 -----TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
T ++T EA+D I H+ P FLY A ATH+ P+ A +L
Sbjct: 198 GEANLTQLYTQEALDFIQTQHARQSPFFLYWAIDATHA-----PVYASRQFLGTSL---- 248
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
R ++ + ++D+SVGK++ L+ + N+ + F SDNG A SN P
Sbjct: 249 --RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLC 306
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K T +EGG+R + W P + G V+ Q + D T LS A
Sbjct: 307 GKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 352
>gi|149175233|ref|ZP_01853855.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
gi|148845842|gb|EDL60183.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
Length = 459
Score = 172 bits (437), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 125/373 (33%), Positives = 191/373 (51%), Gaps = 39/373 (10%)
Query: 33 RIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALA 92
R + FA+ L + F A+ PP+I+ I+ADDLG+ D+ +G Q+ TP+ID LA
Sbjct: 8 RSVLFALFIAVSCLLIRFSAAEAAQQPPNIVLIMADDLGYGDLACYGNKQVKTPHIDRLA 67
Query: 93 YSGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERG---GLPLSEKILPQY 148
S + ++++ +CTP+R+A++TG++ G Q + GLP + +
Sbjct: 68 ASALKFTDFHSAGAMCTPTRAAMLTGQYQQRFGRQFESALSGKSNHDIGLPHQAVTMAEL 127
Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTG---HQDYFDHSAEEMKMWGLD 205
LK+ GY T GKWHLG Y+ + PT +GF+ G +G H + D S E W +
Sbjct: 128 LKQQGYATACFGKWHLG-YQPPWLPTNQGFDLFRGLTSGDGDHHTHVDRSGNE--DWWHN 184
Query: 206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
+E Y+ D+ + +V + + T P FLY+ H A H P++ Q P H
Sbjct: 185 NEISMEKG------YTADLLSKYSVAFMEANRT-RPFFLYVPHLAIHF--PWQGPQDPPH 235
Query: 266 YLNIHRHIEDFKRSKFA-------------AILHKLDESVGKVVEALEQRRMLSNSIIVF 312
+ +D+ K+ A++ LD+SVGK++ AL++ + N++++F
Sbjct: 236 ----RKAGQDYHAGKWGIIPDPGNVSPHTTAMIESLDQSVGKILSALKRLDLEQNTLVIF 291
Query: 313 VSDNGG-AAAGFNL-NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
SDNGG G N N +SN PLRG K TL+EGG R LI P + + G V +Q H
Sbjct: 292 TSDNGGYLTYGKNFQNISSNGPLRGQKATLYEGGHRVPCLISWPGVITAG-VTDQTAHSV 350
Query: 371 DWLPTLLSAANKS 383
D LPTL AA S
Sbjct: 351 DLLPTLAQAAGIS 363
>gi|114326198|ref|NP_001041585.1| N-acetylgalactosamine-6-sulfatase precursor [Canis lupus
familiaris]
gi|122138594|sp|Q32KH5.1|GALNS_CANFA RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|81158068|tpe|CAI85008.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Canis lupus
familiaris]
Length = 522
Score = 172 bits (437), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 116/368 (31%), Positives = 176/368 (47%), Gaps = 53/368 (14%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++
Sbjct: 12 LLLVLSAAGLGAAGAPQPPNILLLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSF 71
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER------------GGLPLSEKILPQY 148
Y+ LC+PSR+A++TG+ PI G Y R GG+P E +LP+
Sbjct: 72 YSANPLCSPSRAALLTGRLPIRNG-----FYTTNRHARNAYTPQEIVGGIPDQEHVLPEL 126
Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
LKE GY ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + R
Sbjct: 127 LKEAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARP----NIPVYR 181
Query: 209 DLEPAWDLHGKYS--------------TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHS 253
D W++ G+Y T V+ EA+D I + P FLY A ATH+
Sbjct: 182 D----WEMVGRYYEEFPINLKTGEANLTQVYLQEALDFIKRQQAAQRPFFLYWAIDATHA 237
Query: 254 ANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
P+ A +L +R ++ + ++D SVGK++ L+ R+ N+ + F
Sbjct: 238 -----PVYASRPFLGTS------QRGRYGDAVREIDNSVGKILSLLQDLRISENTFVFFT 286
Query: 314 SDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWL 373
SDNG A SN P K T +EGG+R + W P G V+ Q + D
Sbjct: 287 SDNGAALISAPNQGGSNGPFLCGKQTTFEGGMREPAIAWWPGRIPAGRVSHQLGSIMDLF 346
Query: 374 PTLLSAAN 381
T LS A
Sbjct: 347 TTSLSLAG 354
>gi|149177363|ref|ZP_01855968.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
gi|148843888|gb|EDL58246.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
Length = 466
Score = 172 bits (436), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 118/335 (35%), Positives = 178/335 (53%), Gaps = 22/335 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+I+ I AD+LG+ D+G +G + TP +D LA G+ L ++YT CT SR+ ++TG+
Sbjct: 35 PNILLITADNLGYGDLGCYGNPVMKTPMLDQLASEGVRLTDFYTASPTCTVSRATLLTGR 94
Query: 119 HPIHTGMQHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
+P G+ H + G GL SE ++P+YLK+ GYRT GKW++GF PT RG
Sbjct: 95 YPQRIGLNHQLSADENYGDGLRKSEVLIPEYLKQQGYRTACFGKWNVGFSPGS-RPTERG 153
Query: 178 FESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
F+ G+ G+ DY+ H A +W R L+ + + G YSTD+F A I
Sbjct: 154 FDEFFGFAAGNIDYYHHYYAGRHDLW-----RGLKEVF-VEG-YSTDLFADAACQYISAE 206
Query: 237 STDEPLFLYLAHAATH--SANPYEP-----LQAPDHYLNIHRHIEDFK--RSKFAAILHK 287
S D+P F+YL A H S +P QAPD + + K + ++ A++
Sbjct: 207 S-DQPFFIYLPFNAPHFPSQRNKQPGQGNEWQAPDLAFEKYGYDPQTKNPQERYRAVVTA 265
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG-FNLNAASNWPLRGVKNTLWEGGVR 346
LD ++G+V++ L+ + +I+++ SDNG L ASN PLR TLWEGG+R
Sbjct: 266 LDSAIGRVLKQLDTSGLRDQTIVIWYSDNGAFMLKERGLEVASNKPLRDGGVTLWEGGIR 325
Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+I P G V + + D LPTL++ A
Sbjct: 326 VPAIIRYPGHLKAGTVNQSPLISLDILPTLITLAG 360
>gi|372221524|ref|ZP_09499945.1| sulfatase [Mesoflavibacter zeaxanthinifaciens S86]
Length = 461
Score = 172 bits (436), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 120/380 (31%), Positives = 191/380 (50%), Gaps = 51/380 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+I+ ++ADD GWNDVG+HG +I TP +D+LA +G L+ +Y C+PSR+A++TG
Sbjct: 36 PNILLVIADDAGWNDVGYHG-SKIKTPVLDSLANNGAKLERFYVAPTCSPSRAALLTGIP 94
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
G+ + G + LP S LP+ +K+LGYRT + GKWHLG E P GF+
Sbjct: 95 ASRLGIVAPIA-GKSKIALPDSLVTLPKAMKKLGYRTALFGKWHLGL-TPENGPQAYGFD 152
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY------STDVFTAEAVDII 233
+ G+ G D + H ++ +P+W +GK+ TD+ T A+
Sbjct: 153 TSYGFLHGQIDQYTHE-----------YKNGDPSWHKNGKFLKEDGHVTDLLTDAAIAYF 201
Query: 234 HNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI-EDFKRSKFAAILHKLDES 291
+ + T+ P F+ LA++A P+ PLQ + Y +R I D R +AA + LD +
Sbjct: 202 NQETKTETPSFVTLAYSA-----PHFPLQEEERYKRPYRTIFTDSSRVDYAAAMTHLDIA 256
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGG-----------AAAGFNLNAASNWPLRGVKNTL 340
G+++ +L++ L N++I+F+SDNG G N N PL+ K T
Sbjct: 257 FGRLLNSLKRSGKLDNTLILFMSDNGAMKNWYPKDQYQGKHGPNTTLGDNTPLKDYKTTN 316
Query: 341 WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPR 399
+EG +R + W + + + ++ V D LPTL+ S PN T+E
Sbjct: 317 YEGAIRVPAFVYWKNHFNGKTL--DDFIGVIDVLPTLVELTGASVKPN----TIEG---- 366
Query: 400 YENSILRYENGTHEYNSPRI 419
+S L NG + P++
Sbjct: 367 --SSFLNLLNGASQTKIPQL 384
>gi|7527462|gb|AAF63155.1|AF111346_1 N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
gi|7576473|gb|AAF63858.1| N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
Length = 520
Score = 172 bits (436), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 112/346 (32%), Positives = 173/346 (50%), Gaps = 43/346 (12%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++Y+ LC+PSR+A++TG
Sbjct: 27 PPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTG 86
Query: 118 KHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
+ PI G N E GG+P SE +LP+ LK+ GY +IVGKWHLG ++ +
Sbjct: 87 RLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQ 145
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
+ P GF G H +D+ A+ + + RD W++ G++
Sbjct: 146 FHPLKHGFNEWFGSPNCHFGPYDNKAKP----NIPVYRD----WEMVGRFYEEFPINRKT 197
Query: 222 -----TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
T ++T EA+D I H+ P FLY A ATH+ P+ A +L
Sbjct: 198 GEANLTQLYTQEALDFIQTQHARQSPFFLYWAIDATHA-----PVYASRQFLGTSL---- 248
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
R ++ + ++D+SVGK++ L+ + N+ + F SDNG A SN P
Sbjct: 249 --RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLC 306
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K T +EGG+R + W P + G V+ Q + D T LS A
Sbjct: 307 GKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 352
>gi|60359902|dbj|BAD90170.1| mFLJ00319 protein [Mus musculus]
Length = 537
Score = 172 bits (436), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 112/346 (32%), Positives = 174/346 (50%), Gaps = 43/346 (12%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++Y+ LC+PSR+A++TG
Sbjct: 44 PPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTG 103
Query: 118 KHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
+ PI G N E GG+P SE +LP+ LK+ GY +IVGKWHLG ++ +
Sbjct: 104 RLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQ 162
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
+ P GF+ G H +D+ A+ + + RD W++ G++
Sbjct: 163 FHPLKHGFDEWFGSPNCHFGPYDNKAKP----NIPVYRD----WEMVGRFYEEFPINRKT 214
Query: 222 -----TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
T ++T EA+D I H+ P FLY A ATH+ P+ A +L
Sbjct: 215 GEANLTQLYTQEALDFIQTQHARQSPFFLYWAIDATHA-----PVYASRQFLGTSL---- 265
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
R ++ + ++D+SVGK++ L+ + N+ + F SDNG A SN P
Sbjct: 266 --RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLC 323
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K T +EGG+R + W P + G V+ Q + D T LS A
Sbjct: 324 GKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 369
>gi|346464573|gb|AEO32131.1| hypothetical protein [Amblyomma maculatum]
Length = 418
Score = 172 bits (435), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 94/213 (44%), Positives = 121/213 (56%), Gaps = 3/213 (1%)
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
Y PT RGF++ GY+ + YF + G D + D G Y T + T AV
Sbjct: 1 YLPTRRGFDTFYGYYNTAKYYFTQKLNFYNICGRDYWDNETLVTDKDGTYDTHLITDRAV 60
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
DII +H D+PLF+Y+A A H + AP L +I D R+ A + LDE
Sbjct: 61 DIISSHDPDKPLFIYMAPLAAHGQTTNLTVDAPQRNLAKFPYIGDHNRTLLAGAVDALDE 120
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
SVG++V AL QRRML+NS++VF SDNGG G N SNWPLRG K T+WEGGVR +
Sbjct: 121 SVGRIVAALHQRRMLANSVLVFTSDNGGIPWGDFSNTGSNWPLRGTKGTMWEGGVRLPAV 180
Query: 351 IWSPLLE--SRGIVAEQYVHVSDWLPTLLSAAN 381
+WS +L SR +V +H+ DWLPTL SAA
Sbjct: 181 VWSSMLAPGSR-VVVPHPMHLVDWLPTLYSAAG 212
>gi|301788958|ref|XP_002929896.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like, partial
[Ailuropoda melanoleuca]
Length = 519
Score = 172 bits (435), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 115/363 (31%), Positives = 174/363 (47%), Gaps = 43/363 (11%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ N+
Sbjct: 9 LLLVLSAAGFGAMGTPKPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNF 68
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
Y+ LC+PSR+A++TG+ PI G N E GG+P E +LP+ LK G
Sbjct: 69 YSANPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDGEHLLPELLKGAG 128
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + RD
Sbjct: 129 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARP----NIPVYRD---- 179
Query: 214 WDLHGKYS--------------TDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYE 258
W++ G+Y T V+ EA+D + + P FLY A TH+
Sbjct: 180 WEMVGRYYEEFPINLQTGEANLTQVYLQEALDFMKRQQVAQRPFFLYWAIDGTHA----- 234
Query: 259 PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
P+ A +L R R ++ + ++D+SVGK++ L R+ N+ + F SDNG
Sbjct: 235 PVYASRPFLGTSR------RGRYGDAVREIDDSVGKILSLLRDLRITENTFVFFTSDNGA 288
Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
A SN P K T +EGG+R + W P G V+ Q V D T L+
Sbjct: 289 ALISAPKQGGSNGPFLCGKQTTFEGGMREPAIAWWPGRIPAGQVSHQLGSVMDLFTTSLA 348
Query: 379 AAN 381
A
Sbjct: 349 LAG 351
>gi|109897220|ref|YP_660475.1| sulfatase [Pseudoalteromonas atlantica T6c]
gi|109699501|gb|ABG39421.1| sulfatase [Pseudoalteromonas atlantica T6c]
Length = 471
Score = 172 bits (435), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 114/342 (33%), Positives = 176/342 (51%), Gaps = 29/342 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
P+I+F+ +DD G+ D GF G + + TPN+D LA G+ + Y + C PSR+ IMTG+
Sbjct: 27 PNIVFLFSDDAGYADFGFQGSETMKTPNLDQLASEGVRFTQGYVSDSTCGPSRAGIMTGR 86
Query: 119 HPIHTGM----------QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
+ G +H+ + G E G+PL E + Y+K LGYRT GKWHLG
Sbjct: 87 YQQKFGYEEINVPGYMSEHSAIKGAEM-GIPLDEVTMGDYMKSLGYRTAFYGKWHLG-GT 144
Query: 169 KEYTPTFRGFESHLGYWTGHQDY--FDHSAEEMKMWGLDMRRDLEPAWDL---HGKYSTD 223
E P RGF+ G+ G + Y ++ +A E K + + LE D H Y TD
Sbjct: 145 DELHPMHRGFDEFYGFRGGDRSYWAYEVNAPERKS-AVFTDKKLEHGIDQFQEHEGYLTD 203
Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
V +A I + D+P F++L+ A H+ P++A L ++ KR + AA
Sbjct: 204 VLAEKANQFIEK-APDKPFFIFLSFNAVHT-----PMEATPEDLAKFPQLKG-KRKEVAA 256
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
+ LD + G V+ L++ + ++++VF +DNGG + NA+SN+PL G K+ EG
Sbjct: 257 MTLALDRASGAVLNKLKELGLEDDTLVVFSNDNGGPT---DKNASSNYPLAGTKSNFLEG 313
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
G+R L+ P + G V ++ V D LPT A ++
Sbjct: 314 GIRVPFLVKWPAKLAAGKVYDKPVSTLDLLPTFFKAGGGEEV 355
>gi|386821789|ref|ZP_10109005.1| arylsulfatase A family protein [Joostella marina DSM 19592]
gi|386426895|gb|EIJ40725.1| arylsulfatase A family protein [Joostella marina DSM 19592]
Length = 474
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 116/357 (32%), Positives = 181/357 (50%), Gaps = 27/357 (7%)
Query: 33 RIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALA 92
++M F++L L++ +S + + P+I+FILADD G+ D GF G + TP++D LA
Sbjct: 16 KVMLFSML-LSWCVSTL------AQKQPNIVFILADDAGYADFGFQGSSEFKTPHLDQLA 68
Query: 93 YSGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG--------CERGGLPLSEK 143
I + Y + +C PSR+ I+TGK+ G + N + G + GLPL +K
Sbjct: 69 SQSIRFSQAYVSAAVCGPSRAGILTGKYQQRFGYEENNVPGYMSASATTGDEMGLPLDQK 128
Query: 144 ILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWG 203
+LP+YLKE GY+T + GKWH+G K + PT RGF++ G+ G + Y++ +
Sbjct: 129 LLPEYLKEQGYKTALFGKWHMGNADK-FHPTKRGFDTFYGFRGGARSYYEFNENNKNNRQ 187
Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
D ++ Y TD D I + +P F+YL+ A H+ P++A
Sbjct: 188 EDRLERGFGNFEESKLYLTDALAEATTDFIEKNQ-KQPFFVYLSFNAVHT-----PMEAR 241
Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
L +++ KR AA+ LD + G V+ L++ + N++IVF +DNGG +
Sbjct: 242 PDDLKQFPNLKG-KRKTLAAMTLGLDRACGNVLAKLDELGVRENTLIVFTNDNGGPSG-- 298
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
NA+SN PL G K EGG+R L+ P V + + D LPT A
Sbjct: 299 -TNASSNKPLGGTKANHLEGGIRVPFLMSWPKQLKENTVYDYPISTLDLLPTFYQIA 354
>gi|336424342|ref|ZP_08604383.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336003446|gb|EGN33530.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 460
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 110/333 (33%), Positives = 173/333 (51%), Gaps = 24/333 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P++I+ILADD+G+ D G TPN+D L G + + Y +C P+R+A++TG+
Sbjct: 8 PNVIYILADDMGYGDFGIFSDGSARTPNLDRLVRQGCAMSHCYAASPVCAPARAALLTGR 67
Query: 119 HPIHTGMQHNV-LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
+P TG G +R + L E L + GYRT ++GKWHLG KEY P RG
Sbjct: 68 YPHRTGAVDTYEAIGGDR--MALREVTLADVYRANGYRTGLIGKWHLGLIGKEYHPCRRG 125
Query: 178 FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
F++ +G+ G DY+ + LD LE + G Y TDV T E++ I +
Sbjct: 126 FDTFIGFRGGWSDYYQYK--------LDRNGILEAS---DGTYMTDVITEESIRFIRENR 174
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
++P FL+ A+ A P+ P Q P+ Y R + + ++ +DE +GK+++
Sbjct: 175 -EQPFFLHAAYNA-----PHFPFQCPEEYAAPFRKRFNPTLATLYGMIACMDEGIGKLLD 228
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
LE+ + N+I++F SDNG G + N + + L G K +EGG+R +I P
Sbjct: 229 VLEETGLSDNTIVIFASDNGPQLYG-DTNRYNCY-LNGQKGEAFEGGIRVPAVIRWPGHI 286
Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVN 390
+ + H DW PTLLSA + ++P+ V+
Sbjct: 287 APDSRCHAFFHGVDWFPTLLSAC-RLELPDGVS 318
>gi|431797524|ref|YP_007224428.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
gi|430788289|gb|AGA78418.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
Length = 442
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 121/356 (33%), Positives = 177/356 (49%), Gaps = 36/356 (10%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
++ F LPLA LV + P+II I++DD G+ D F TPNID +A
Sbjct: 7 LLLFIWLPLA---------LVVAQERPNIIVIVSDDAGYADFSFQSTKLTTTPNIDRIAQ 57
Query: 94 SGIILKN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERG------GLPLSEKILP 146
G N Y T +C+PSR+ +++G + G +N + G E G+P + K +
Sbjct: 58 EGAKFTNAYVTASVCSPSRAGLLSGTNQPEFGHIYNYIKGVEYSIQQHEYGIPHNIKTIG 117
Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDM 206
+YL+ LGY T GKWH GF +E+ P +GF+ G+ G Y H+ + +D+
Sbjct: 118 EYLQPLGYATAAFGKWHEGF-AEEFQPEQKGFDHFWGFLWGANHY--HTGK-----AIDV 169
Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
D +P Y TD T EA+ H D P FLY+A+ A HS PL+A +
Sbjct: 170 IDDGKPVDPTTIPYMTDAITDEALSFASAHEED-PFFLYIAYNAVHS-----PLEAKPEH 223
Query: 267 LNIH--RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN 324
+ ++ + +D KR AA+ H LDE+VGK+ LE L N+II F++DNGG+
Sbjct: 224 IALYEGKFPDDPKREILAAMTHSLDENVGKIFNRLEAMGELDNTIIFFINDNGGSKH--- 280
Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+A N PL+G K L+EGG+R + P V D LP++L A
Sbjct: 281 -ISADNSPLKGYKGQLYEGGIRVPFAVRWPGKIKGKTVCHSITSSLDILPSILDAV 335
>gi|398828648|ref|ZP_10586848.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
gi|398217506|gb|EJN04023.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
Length = 470
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 111/331 (33%), Positives = 164/331 (49%), Gaps = 32/331 (9%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
P+IIFILADDLG+ D+ +G I TP ID + G+ +L+ Y +C+ +R+AIMTG+
Sbjct: 36 PNIIFILADDLGYADLSSYGHPTIRTPAIDKIGNDGVRLLQAYSNSPVCSATRTAIMTGQ 95
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ + GLP S+ LP LK+ GY T ++GKWHLG Y K Y P G+
Sbjct: 96 YQYRLALGLEEPLAGRDIGLPPSQTTLPSLLKQAGYETTLIGKWHLGAYPK-YGPLKSGY 154
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH-NHS 237
+ G+ Y++H G D D P Y TD+ + V++I + S
Sbjct: 155 DHFYGFRGSALSYYNH--------GKDFWEDDAPV--EKAGYFTDLLGDKTVELIQKSDS 204
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHY--------LNIHRHIEDFKRSKFAAILHKLD 289
+ P F A+ H P+ P +AP N + + + I+ ++D
Sbjct: 205 CERPFF-----ASVHFNAPHWPWEAPGEQGKKEAARKKNSLAAFDAGTQDVYRQIVERMD 259
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
VG++++AL +++ N+I+VF SDNGG ++ WP G K+ L EGG+R
Sbjct: 260 FQVGRILQALTDKKIRENTIVVFTSDNGGE------RFSNTWPFSGKKSELLEGGIRIPS 313
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
LI P G +EQ DWLPTLL AA
Sbjct: 314 LICWPAKIPSGQTSEQVTLSMDWLPTLLEAA 344
>gi|13278373|gb|AAH04002.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Mus musculus]
Length = 520
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 112/351 (31%), Positives = 175/351 (49%), Gaps = 43/351 (12%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRS 112
V + PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++Y+ LC+PSR+
Sbjct: 22 VGAPQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRA 81
Query: 113 AIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
A++TG+ PI G N E GG+P SE +LP+ LK+ GY +IVGKWHLG
Sbjct: 82 ALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG 141
Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS---- 221
++ ++ P GF+ G H +D+ A+ + + RD W++ G++
Sbjct: 142 -HRPQFHPLKHGFDEWFGSPNCHFGPYDNKAKP----NIPVYRD----WEMVGRFYEEFP 192
Query: 222 ----------TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
T ++ EA+D I H+ P FLY A ATH+ P+ A +L
Sbjct: 193 INRKTGEANLTQLYLQEALDFIRTQHARQGPFFLYWAIDATHA-----PVYASRQFLGTS 247
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
R ++ + ++D+SVGK++ L+ + N+ + F SDNG A SN
Sbjct: 248 L------RGRYGDAVREIDDSVGKILNLLQNLGISKNTFVFFTSDNGAALISAPNEGGSN 301
Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
P K T +EGG+R + W P + G V+ Q + D T LS A
Sbjct: 302 GPFLCGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 352
>gi|298375849|ref|ZP_06985805.1| arylsulfatase [Bacteroides sp. 3_1_19]
gi|423330336|ref|ZP_17308120.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
CL03T12C09]
gi|298266886|gb|EFI08543.1| arylsulfatase [Bacteroides sp. 3_1_19]
gi|409231952|gb|EKN24800.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
CL03T12C09]
Length = 452
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 137/416 (32%), Positives = 213/416 (51%), Gaps = 41/416 (9%)
Query: 38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGI 96
A LAF+ + + S P+I+ IL DDLG+ D+ G + I TP+ID L G+
Sbjct: 8 AACGLAFSAASAQTKALPDS--PNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGV 65
Query: 97 ILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELG 153
N++ + +PSR++++TG++P G+ + E LSE +LPQ LK+ G
Sbjct: 66 RFTNFHANCPVSSPSRASLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRG 125
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEP 212
Y +VGKW+LG + TPT RGF+ + G+ D Y+ H + +G + R+
Sbjct: 126 YHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH-----RRFGNNYMRENLK 179
Query: 213 AWDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
D G ++T++F+ A+ + + EP FLYLA+ A H+ P+Q P +L +
Sbjct: 180 EIDPQG-HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVK 233
Query: 272 HIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
E KR+K A++ LD +VG+V EALEQ L N+II+F SDNGG A
Sbjct: 234 KREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQDDA----GA 289
Query: 329 SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
+N P RG K ++EGG+R AG I W + R V + +V +SD PTL + + +P
Sbjct: 290 NNGPFRGAKQDMYEGGIRVAGGIYWKN--QIRPAVRDNFVMLSDMFPTL---CDLTAVP- 343
Query: 388 YVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYEN 443
V+ ++ I SIL G + R+ + R N + Y +RY++
Sbjct: 344 -VSHEIDGI------SILPLLRGEEQDTGDRMVHWVRREGNSRYGGKAYYASRYKD 392
>gi|256841083|ref|ZP_05546590.1| arylsulphatase A [Parabacteroides sp. D13]
gi|256736926|gb|EEU50253.1| arylsulphatase A [Parabacteroides sp. D13]
Length = 452
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 138/416 (33%), Positives = 212/416 (50%), Gaps = 41/416 (9%)
Query: 38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGI 96
A LAF+ + + S P+I+ IL DDLG+ D+ G + I TP+ID L G+
Sbjct: 8 AACGLAFSAASAQTKALPDS--PNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGV 65
Query: 97 ILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELG 153
N++ + +PSR++++TG++P G+ + E LSE +LPQ LK+ G
Sbjct: 66 RFTNFHANCPVSSPSRASLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRG 125
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEP 212
Y +VGKW+LG + TPT RGF+ + G+ D Y+ H +G + R+
Sbjct: 126 YHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTHCR-----FGNNYMRENLK 179
Query: 213 AWDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
D G ++T+VF+ A+ + + EP FLYLA+ A H+ P+Q P +L +
Sbjct: 180 EIDPQG-HATEVFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVK 233
Query: 272 HIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
E KR+K A++ LD +VG+V EALEQ L N+II+F SDNGG A
Sbjct: 234 KREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQDDA----GA 289
Query: 329 SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
+N P RG K ++EGG+R AG I W + R V + +V +SD PTL + + +P
Sbjct: 290 NNGPFRGAKQDMYEGGIRVAGGIYWKN--QIRPAVRDNFVMLSDMFPTL---CDLTAVP- 343
Query: 388 YVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYEN 443
V+ ++ I SIL G + R+ + R N + Y +RY++
Sbjct: 344 -VSHEIDGI------SILPLLRGEEQDTGDRMVHWVRREGNSRYGGKAYYASRYKD 392
>gi|426242290|ref|XP_004015007.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Ovis aries]
Length = 522
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 113/359 (31%), Positives = 175/359 (48%), Gaps = 35/359 (9%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ N+
Sbjct: 12 LLLMLSAAELGVAPAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDQMATEGMLFPNF 71
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
YT LC+PSR+A++TG+ PI +G N E GG+P SE +LP LK G
Sbjct: 72 YTANPLCSPSRAALLTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAG 131
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + RD E
Sbjct: 132 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRDQEMV 186
Query: 214 WDLHGKYS----------TDVFTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQA 262
+ ++ T ++ EA++ I + P FLY A ATH+ P+ A
Sbjct: 187 GRFYEEFPINLKTGEANLTQIYLQEALEFIQRQQAAHRPFFLYWAVDATHA-----PVYA 241
Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
+L +R ++ + +LD+SVG+++ L+ + N+ + F SDNG A
Sbjct: 242 SKPFLGTS------QRGRYGDAIRELDDSVGRILRLLQDLGIAENTFVFFTSDNGAALIS 295
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
SN P K T +EGG+R + W P G V+ Q + D T LS A
Sbjct: 296 APRQGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAG 354
>gi|114145565|ref|NP_001041316.1| N-acetylgalactosamine-6-sulfatase precursor [Rattus norvegicus]
gi|123779981|sp|Q32KJ6.1|GALNS_RAT RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|81158026|tpe|CAI84987.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus
norvegicus]
Length = 524
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 112/360 (31%), Positives = 177/360 (49%), Gaps = 43/360 (11%)
Query: 45 TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
LS + + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++Y+
Sbjct: 17 VLSALGLLAAGAPQPPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSA 76
Query: 105 Q-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRT 156
LC+PSR+A++TG+ PI G N E GG+P SE +LP+ LK+ GY
Sbjct: 77 NPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTN 136
Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL 216
+IVGKWHLG ++ ++ P GF+ G H +D+ + + + RD W++
Sbjct: 137 KIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKVKP----NIPVYRD----WEM 187
Query: 217 HGKYS--------------TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQ 261
G++ T ++ EA+D I H+ P FLY A ATH+ P+
Sbjct: 188 VGRFYEEFPINLKTGEANLTQLYLQEALDFIRTQHARQSPFFLYWAIDATHA-----PVY 242
Query: 262 APDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA 321
A +L R ++ + ++D+SVGK++ L+ + N+ + F SDNG A
Sbjct: 243 ASKQFLGTSL------RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI 296
Query: 322 GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
SN P K T +EGG+R + W P + G V+ Q + D T LS A
Sbjct: 297 SAPKEGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 356
>gi|329744562|ref|NP_001193258.1| N-acetylgalactosamine-6-sulfatase precursor [Bos taurus]
gi|296478055|tpg|DAA20170.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Bos taurus]
Length = 522
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 113/359 (31%), Positives = 174/359 (48%), Gaps = 35/359 (9%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ N+
Sbjct: 12 LLLVLSAAELGVARALQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNF 71
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
YT LC+PSR+A++TG+ PI +G N E GG+P SE +LP LK G
Sbjct: 72 YTANPLCSPSRAALLTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAG 131
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + RD E
Sbjct: 132 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRDQEMV 186
Query: 214 WDLHGKYS----------TDVFTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQA 262
+ ++ T ++ EA++ I + P FLY A ATH+ P+ A
Sbjct: 187 GRFYEEFPINLKTGEANLTQIYLQEALEFIQRQQAAHRPFFLYWAVDATHA-----PIYA 241
Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
+L +R ++ + +LD+SVG+++ L + N+ + F SDNG A
Sbjct: 242 SKPFLGTS------QRGRYGDAIRELDDSVGRILRLLRDLSIAENTFVFFTSDNGAALIS 295
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
SN P K T +EGG+R + W P G V+ Q + D T LS A
Sbjct: 296 APRQGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAG 354
>gi|301311961|ref|ZP_07217883.1| putative arylsulfatase [Bacteroides sp. 20_3]
gi|423339483|ref|ZP_17317224.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
CL09T03C24]
gi|300830063|gb|EFK60711.1| putative arylsulfatase [Bacteroides sp. 20_3]
gi|409230864|gb|EKN23725.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
CL09T03C24]
Length = 452
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 124/349 (35%), Positives = 187/349 (53%), Gaps = 30/349 (8%)
Query: 38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGI 96
A LAF+ + + S P+I+ IL DDLG+ D+ G + I TP+ID L G+
Sbjct: 8 AACGLAFSAASAQTKALPDS--PNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGV 65
Query: 97 ILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELG 153
N++ + +PSR++++TG++P G+ + E LSE +LPQ LK+ G
Sbjct: 66 RFTNFHANCPVSSPSRASLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRG 125
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEP 212
Y +VGKW+LG + TPT RGF+ + G+ D Y+ H + +G + R+
Sbjct: 126 YHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH-----RRFGNNYMRENLK 179
Query: 213 AWDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
D G ++T++F+ A+ + + EP FLYLA+ A H+ P+Q P +L +
Sbjct: 180 EIDPQG-HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVK 233
Query: 272 HIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
E KR+K A++ LD +VG+V EALEQ L N+II+F SDNGG A
Sbjct: 234 KREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQDDA----GA 289
Query: 329 SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTL 376
+N P RG K ++EGG+R AG I W + R V + +V +SD PTL
Sbjct: 290 NNGPFRGAKQDMYEGGIRVAGGIYWKN--QIRPAVRDNFVMLSDMFPTL 336
>gi|255014751|ref|ZP_05286877.1| putative secreted sulfatase precursor [Bacteroides sp. 2_1_7]
gi|410102840|ref|ZP_11297765.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
gi|409237967|gb|EKN30762.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
Length = 452
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 124/349 (35%), Positives = 187/349 (53%), Gaps = 30/349 (8%)
Query: 38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGI 96
A LAF+ + + S P+I+ IL DDLG+ D+ G + I TP+ID L G+
Sbjct: 8 AACGLAFSAASAQTKALPDS--PNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGV 65
Query: 97 ILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELG 153
N++ + +PSR++++TG++P G+ + E LSE +LPQ LK+ G
Sbjct: 66 RFTNFHANCPVSSPSRASLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRG 125
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEP 212
Y +VGKW+LG + TPT RGF+ + G+ D Y+ H + +G + R+
Sbjct: 126 YHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH-----RRFGNNYMRENLK 179
Query: 213 AWDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
D G ++T++F+ A+ + + EP FLYLA+ A H+ P+Q P +L +
Sbjct: 180 EIDPQG-HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVK 233
Query: 272 HIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
E KR+K A++ LD +VG+V EALEQ L N+II+F SDNGG A
Sbjct: 234 KREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQDDA----GA 289
Query: 329 SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTL 376
+N P RG K ++EGG+R AG I W + R V + +V +SD PTL
Sbjct: 290 NNGPFRGAKQDMYEGGIRVAGGIYWKN--QIRPAVRDNFVMLSDMFPTL 336
>gi|114798452|ref|YP_760375.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
gi|114738626|gb|ABI76751.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
Length = 508
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 127/396 (32%), Positives = 188/396 (47%), Gaps = 73/396 (18%)
Query: 59 PPHIIFILADDLGWNDVGFHGL----DQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
PP++IFILADDLG ND+ G ++ TPNID LA G + Y+ C PSR+
Sbjct: 18 PPNVIFILADDLGINDISTFGGGMADGRVQTPNIDRLAAEGALFSTAYSGTGTCAPSRAM 77
Query: 114 IMTGKHPIHTGMQHN--------------------------------VLYGCERGGLPLS 141
+MTG++P TG ++ ++ GLP
Sbjct: 78 LMTGRYPTRTGFEYTPTPPGMSRIVPMFANDMKTGLPPTEQVKENEKLMPPFAEQGLPTE 137
Query: 142 EKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKM 201
E L + LK+ GY T +GKWHLG + P +GF+ L +G F +
Sbjct: 138 EVTLAEVLKDRGYHTVHIGKWHLG-NTSPFRPNDQGFDESLDMASG---LFLPPGDPR-- 191
Query: 202 WGLDMRRDLEPA------------------WDLHGKYSTDVFTAEAVDIIHNHSTDEPLF 243
G++ R D +P W G Y TD +T E++ +I + + + P F
Sbjct: 192 -GVEARLDFDPIDKFLWARMDFAASYNGSDWFEPGGYLTDYWTDESLKVI-DANKNRPFF 249
Query: 244 LYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRR 303
LYLAH H+ PLQA IE + +AA+L +D SVG+++E LE+
Sbjct: 250 LYLAHWGVHT-----PLQATKEDYEAVGDIEPHRLRVYAAMLRAVDRSVGRIMEKLEEEG 304
Query: 304 MLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIV 362
+ N+IIVF SDNGG AG+ N P RG K TL+EGG+R + W ++ ++
Sbjct: 305 LADNTIIVFSSDNGG--AGYIGIPEVNAPYRGWKITLFEGGIRVPLFMKWPGTIQPGTVI 362
Query: 363 AEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
E H+ D +PT L+AA + +P+ V +N++P
Sbjct: 363 DEPVAHI-DLMPT-LAAATGARLPSSVEIDGQNLLP 396
>gi|407938758|ref|YP_006854399.1| sulfatase [Acidovorax sp. KKS102]
gi|407896552|gb|AFU45761.1| sulfatase [Acidovorax sp. KKS102]
Length = 452
Score = 169 bits (428), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 115/353 (32%), Positives = 177/353 (50%), Gaps = 26/353 (7%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSA 113
AS+ P+I+FILADDLGW D+ +G TP++DALA G+ + Y +C+ +R A
Sbjct: 11 ASTQAPNIVFILADDLGWADLSVYGQADFTTPHLDALAGEGVRFTQAYANSAVCSATRFA 70
Query: 114 IMTGKHP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
++TG++ + G++ ++ GLP LP L++ GY T ++GKWHLG +
Sbjct: 71 LITGRYQYRLRGGLEEPLVRKAHVHGLPPEHPTLPSLLRDAGYDTALIGKWHLGSLPT-F 129
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
P G++ G + G DYF H D+ P + Y T + EA
Sbjct: 130 GPLKSGYDRFFGNYGGAIDYFTHKPGVGDAVARDLYEGEVPVERV--GYYTQLLADEATR 187
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAP-DHYLNIHR----HIEDFKRSKFAAILH 286
I S +P FL L H P+ P P D +++ H + + +A ++
Sbjct: 188 WIGERSAAKPFFLSL-----HFTAPHWPWVGPEDEEISLGLKDLFHYDGGNLATYARMVR 242
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
LD++VG+V+EAL+ + + N+I+VF SDNGG + WP G K L EGG+R
Sbjct: 243 SLDKAVGQVLEALKAQGLADNTIVVFTSDNGGE------RFSKTWPFTGQKTELLEGGIR 296
Query: 347 GAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
L+ W + + V++Q DWLPTLL+AA + +Y ENI+P
Sbjct: 297 VPTLLRWPARIAPQ--VSDQVTATMDWLPTLLAAAGVAPHADYPPDG-ENILP 346
>gi|343086062|ref|YP_004775357.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354596|gb|AEL27126.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 444
Score = 169 bits (428), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 112/335 (33%), Positives = 169/335 (50%), Gaps = 32/335 (9%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
PP+II DD G+ D+G +G + TP++D LA GI N+Y +CTPSR+ ++TG
Sbjct: 33 PPNIIVFFTDDQGYADLGVYGAEDFETPHLDQLASEGIRFTNFYVPATVCTPSRAGLLTG 92
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
++P + + VL+ GGL + + LK GY T +GKWHLG +K EY P +G
Sbjct: 93 QYPKRSNLHEAVLFPYSEGGLSPQAFTMAELLKGAGYSTACIGKWHLG-HKDEYMPYNQG 151
Query: 178 FESHLG--YWTGHQDYF----DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
F++ G Y +Y+ D + + + + + +E D +Y T +T E V
Sbjct: 152 FDTFYGVPYSNDMDNYYYKNIDFQSPPLPFY--ENTKVIENGSD--QRYLTKRYTEETVK 207
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I N ++P F+YLAH H+ PL A + K + ++ +LD S
Sbjct: 208 RIKNRG-EKPFFIYLAHNMPHT-----PLFASPAFEGKS------KNGLYGDVIMELDWS 255
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
G+++ AL++ + N+I VF SDNG A S PLRG K WEGG R G+I
Sbjct: 256 AGEIINALKEEGIYENTIFVFTSDNGPA-------KGSAKPLRGKKAQTWEGGQRVPGII 308
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
P RG+V +++V D PT A+ + IP
Sbjct: 309 TWPGSIPRGVVTDEFVSTLDLFPTFAKLAS-TKIP 342
>gi|281346853|gb|EFB22437.1| hypothetical protein PANDA_020197 [Ailuropoda melanoleuca]
Length = 520
Score = 169 bits (428), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 116/365 (31%), Positives = 176/365 (48%), Gaps = 45/365 (12%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ N+
Sbjct: 8 LLLVLSAAGFGAMGTPKPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNF 67
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
Y+ LC+PSR+A++TG+ PI G N E GG+P E +LP+ LK G
Sbjct: 68 YSANPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDGEHLLPELLKGAG 127
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + RD
Sbjct: 128 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARP----NIPVYRD---- 178
Query: 214 WDLHGKYS--------------TDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYE 258
W++ G+Y T V+ EA+D + + P FLY A TH+
Sbjct: 179 WEMVGRYYEEFPINLQTGEANLTQVYLQEALDFMKRQQVAQRPFFLYWAIDGTHA----- 233
Query: 259 PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
P+ A +L R R ++ + ++D+SVGK++ L R+ N+ + F SDNG
Sbjct: 234 PVYASRPFLGTSR------RGRYGDAVREIDDSVGKILSLLRDLRITENTFVFFTSDNGA 287
Query: 319 A--AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
A +A SN P K T +EGG+R + W P G V+ Q V D T
Sbjct: 288 ALISAPKQGQCGSNGPFLCGKQTTFEGGMREPAIAWWPGRIPAGQVSHQLGSVMDLFTTS 347
Query: 377 LSAAN 381
L+ A
Sbjct: 348 LALAG 352
>gi|423293952|ref|ZP_17272079.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
CL03T12C18]
gi|392677173|gb|EIY70592.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
CL03T12C18]
Length = 453
Score = 169 bits (428), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 113/359 (31%), Positives = 187/359 (52%), Gaps = 26/359 (7%)
Query: 33 RIMAFAVLPLAFTLS--MVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
+I A+L L T++ P+++ IL DDLG D+ + TPNID
Sbjct: 2 KIEKIALLALGTTMTSGAALASNTVDKATPNVLLILVDDLGLGDLSCQYAKDLSTPNIDR 61
Query: 91 LAYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER--GGLPLSEKILPQ 147
+ +G+ L N+Y + +PSR+A++TG+ P G+ + ++ G S +P+
Sbjct: 62 IFETGVRLDNFYANSSVSSPSRAALLTGRFPAMVGVPGVIRPSIDQNWGYFDPSAVTMPE 121
Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDM 206
LK GYRT ++GKWHLG +K P RGFE G+ D Y+ H + L+
Sbjct: 122 VLKNGGYRTALIGKWHLG-WKSPNLPNERGFEHFHGFLADMMDDYYTHQRQGGNYMYLN- 179
Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDH 265
+++ P ++T++FT+ +VD I + ++ P FLYLA+ A HS PLQ P
Sbjct: 180 DKEITPK-----GHATELFTSWSVDYIKKEAKEKNPFFLYLAYNAPHS-----PLQPPVE 229
Query: 266 YLNIHRHIED---FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
++N + + KR++ A++ LD+++GKV++ALE+ LSN++++F SDNGG
Sbjct: 230 WVNKVQERDKSLPVKRARLIALIEHLDDNIGKVMQALEESGQLSNTLVIFASDNGGDRG- 288
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ A+N P RG K ++EGG+ + P + G +V + D +PT+ N
Sbjct: 289 ---SMANNGPTRGAKGDMFEGGIHVPCALNMPGVFEGGRRDNHFVVMMDLMPTICDFVN 344
>gi|340620621|ref|YP_004739074.1| sulfatase [Zobellia galactanivorans]
gi|339735418|emb|CAZ98795.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
Length = 462
Score = 169 bits (427), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 109/330 (33%), Positives = 166/330 (50%), Gaps = 20/330 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
P+I+FIL+DD G+ D GFHG + TP +D A + + + Y + +C PSR+ ++TGK
Sbjct: 26 PNIVFILSDDAGYADFGFHGSKEFKTPELDKFAKNAVRFSQAYVSAAVCGPSRAGLLTGK 85
Query: 119 HPIHTGMQHNVLYGC--------ERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
+ G + N + G + GLPL +K + YLKE GYRT + GKWH G
Sbjct: 86 YQQKFGFEENNVPGLMSKNGLTGDDMGLPLDQKTIADYLKEQGYRTALFGKWHQG-NADR 144
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
+ PT RGF+ G+ G + Y A+ D + H Y T+ EA+
Sbjct: 145 FHPTKRGFDEFYGFRGGARSYMPFGADNELTRNEDRLERGFGGFQEHEGYLTEELADEAI 204
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
I + + P F+YLA A H+ P++A + L + + KR AA+ +D
Sbjct: 205 AFIERNQKN-PFFVYLAFNAVHT-----PMEAKEEDLKKVKGLSG-KRKTLAAMTIAMDR 257
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
+ GKV++ ++ + N++IVF +DNGG + + N +SN PL G K EGG+R L
Sbjct: 258 ACGKVLDRIKALGLDKNTLIVFTNDNGGPS---DANESSNLPLSGTKANHLEGGIRVPFL 314
Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+ P V E + D LPT ++AA
Sbjct: 315 MSWPGKLKPNTVYENPISTLDLLPTFVTAA 344
>gi|149177395|ref|ZP_01855999.1| arylsulfatase A [Planctomyces maris DSM 8797]
gi|148843728|gb|EDL58087.1| arylsulfatase A [Planctomyces maris DSM 8797]
Length = 474
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 127/390 (32%), Positives = 197/390 (50%), Gaps = 40/390 (10%)
Query: 44 FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
F L DL A+ GP ++I I ADDLG+ D+G G I TP +D +A G+ +Y+
Sbjct: 22 FLLLTSVADLNAAPGP-NVIVIYADDLGYGDLGCFGHPTIKTPALDQMAAEGMKFTQFYS 80
Query: 104 VQ-LCTPSRSAIMTGKHPIHTGM---QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
+CTPSR+A++TG++PI +GM + VL+ GG+P SE L + LK GY+T V
Sbjct: 81 AAPVCTPSRAALLTGRYPIRSGMCSDKRRVLFPNSGGGIPASEVTLAEALKAAGYKTACV 140
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE-----------EMKMWGLDMRR 208
GKWHLG + ++ PT GF+S+ G + + D A+ E+K W + + R
Sbjct: 141 GKWHLG-HLPQFLPTNNGFDSYFG--IPYSNDMDRVADRKHGRSIFLKPEVKFWNVPLMR 197
Query: 209 DLEPAWDLHGKYSTDV--FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
+ E +L +T +T EA+ +I + +P F+YLAH P+ PL +
Sbjct: 198 NTE-VVELPADQTTITKRYTEEAIKLIQQNK-QQPFFIYLAHNM-----PHVPLFRSPEF 250
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN 326
+ R R + ++ ++D SVG+V++ L+ + + N+I+ F SDNG FN
Sbjct: 251 QDKSR------RGLYGDVIEEIDWSVGEVLKELKTQGLDQNTIVWFCSDNGPWLI-FNEQ 303
Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
S LR K + WEGG+R + W P G V+++ D T + A S
Sbjct: 304 GGSAGLLRDGKGSTWEGGMREPTIAWWPGHIPSGSVSQELGSTMDIYTTSIKLAGGSVPT 363
Query: 387 NYVNSTVE--NIIPRYENSI---LRYENGT 411
+ V V+ ++ + ENS L Y GT
Sbjct: 364 DQVVDGVDLTPVLLKKENSPREELAYYRGT 393
>gi|338212321|ref|YP_004656376.1| N-acetylgalactosamine-6-sulfatase [Runella slithyformis DSM 19594]
gi|336306142|gb|AEI49244.1| N-acetylgalactosamine-6-sulfatase [Runella slithyformis DSM 19594]
Length = 446
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 109/306 (35%), Positives = 165/306 (53%), Gaps = 22/306 (7%)
Query: 48 MVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQL 106
+ F S P+I+ IL DD+GW +VG +G TP++D LA G+ N+Y +
Sbjct: 15 LFFTSSAFSQQKPNIVMILVDDMGWGEVGIYGSHYCQTPHLDNLAKQGMRFTNFYANSTV 74
Query: 107 CTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELGYRTRIVGKWHL 164
C+P+R+A+MTG++P G+ + E S K LPQ LK GYRT +VGKWHL
Sbjct: 75 CSPTRAALMTGRYPDLVGVPGVIRGNVENSWGYFSPKAVTLPQVLKTAGYRTAMVGKWHL 134
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
G + E P GF G+ D + E K W + ++P+ ++TD+
Sbjct: 135 GL-EPENHPNRHGFTHFHGFLEDMMDDYYTHLREGKNWMRLNGQVIDPS-----GHATDL 188
Query: 225 FTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDF--KRSK 280
FT A++ ++ + +P FLYLA+ A P+ P+Q P +L + + + KR+K
Sbjct: 189 FTQWAIEYLNTQKKNPQPFFLYLAYNA-----PHFPVQPPREWLEKVKKRNPNLSEKRAK 243
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
A++ LDES+GKV+EAL Q N++IVF SDNGG + + A N +G K T+
Sbjct: 244 LTALIEHLDESIGKVMEALRQNGQADNTLIVFSSDNGGLLS----DEADNGIWKGGKQTM 299
Query: 341 WEGGVR 346
+EGG+R
Sbjct: 300 YEGGIR 305
>gi|449680632|ref|XP_004209636.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
Length = 336
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 84/176 (47%), Positives = 119/176 (67%), Gaps = 1/176 (0%)
Query: 206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE-PLQAPD 264
M ++ E ++ G YST +++ EA+D I NH++ +P+FLYLA+ A HSAN E PLQAP
Sbjct: 1 MAQNQENIFNQWGNYSTAMYSMEAIDRIRNHNSTQPMFLYLAYQAVHSANLREYPLQAPQ 60
Query: 265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN 324
+++ HI+ R +AA+L +D +G++ L ++ ML +SII+F SDNGG A G N
Sbjct: 61 EWVDKFSHIKHKGRQNYAAMLGVMDHWIGQITSELLKQNMLDDSIIIFTSDNGGPANGLN 120
Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
N A+N+PLRG K T++EGGVRGA IWS L V+ +H++DWLPTLLSAA
Sbjct: 121 NNWATNYPLRGTKTTVYEGGVRGAACIWSKELSKNPRVSNDLMHITDWLPTLLSAA 176
Score = 39.7 bits (91), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 31/55 (56%), Gaps = 1/55 (1%)
Query: 503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTR-GKWKLVKENSI 556
D + +DG+++W L PS R +L NID ++A R G WK+VK+N+
Sbjct: 179 DTTLLPNLDGLNLWDTLINQVPSPRTEVLINIDPLLYMNAALRVGDWKIVKQNNF 233
>gi|149038400|gb|EDL92760.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus norvegicus]
Length = 466
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 112/360 (31%), Positives = 177/360 (49%), Gaps = 43/360 (11%)
Query: 45 TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
LS + + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++Y+
Sbjct: 17 VLSALGLLAAGAPQPPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSA 76
Query: 105 Q-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRT 156
LC+PSR+A++TG+ PI G N E GG+P SE +LP+ LK+ GY
Sbjct: 77 NPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTN 136
Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL 216
+IVGKWHLG ++ ++ P GF+ G H +D+ + + + RD W++
Sbjct: 137 KIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKVKP----NIPVYRD----WEM 187
Query: 217 HGKYS--------------TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQ 261
G++ T ++ EA+D I H+ P FLY A ATH+ P+
Sbjct: 188 VGRFYEEFPINLKTGEANLTQLYLQEALDFIRTQHARQSPFFLYWAIDATHA-----PVY 242
Query: 262 APDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA 321
A +L R ++ + ++D+SVGK++ L+ + N+ + F SDNG A
Sbjct: 243 ASKQFLGTSL------RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI 296
Query: 322 GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
SN P K T +EGG+R + W P + G V+ Q + D T LS A
Sbjct: 297 SAPKEGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 356
>gi|323450479|gb|EGB06360.1| hypothetical protein AURANDRAFT_13369, partial [Aureococcus
anophagefferens]
Length = 304
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 104/292 (35%), Positives = 153/292 (52%), Gaps = 27/292 (9%)
Query: 107 CTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
CTPSR ++++G++P+ G+Q V+ + GL + ++LP+ L+ GYRT +VGKWHLG
Sbjct: 1 CTPSRGSLLSGRYPVRLGLQRGVVSAASKWGLDPAIRLLPETLQTAGYRTALVGKWHLGH 60
Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK-----------MWGLDMRRDLEPAWD 215
Y + TP RGF G+ G D+F H AE LD + + P
Sbjct: 61 YNESLTPARRGFGESYGFLGGFVDHFTHVAEAPPACASETPATDGTCALDWQVNGLPRPR 120
Query: 216 LHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDH---YLNIHR 271
G Y+ A ++ +H +EPLFLY A P+EP+ PD Y + +
Sbjct: 121 ATGVYAAYACRDAARRLVASHEGKEEPLFLYYA-----LPTPHEPITPPDDVETYDLLPK 175
Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
+R +AA++ D ++G + AL++ + N+++V +SDNG +AAG SN
Sbjct: 176 LDVAGRRRDYAAVVAATDGAIGDLQAALQKHALWPNTLLVLLSDNGASAAG-----GSNA 230
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAAN 381
PLRG K +EGGVR L+ LL RG E VHV+DW PTLL+AA
Sbjct: 231 PLRGAKGGYFEGGVRTRALLAGSLLPQNVRGSSHEGLVHVADWAPTLLNAAG 282
>gi|395856891|ref|XP_003800850.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Otolemur garnettii]
Length = 526
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 111/358 (31%), Positives = 173/358 (48%), Gaps = 43/358 (12%)
Query: 47 SMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ- 105
S V + V + P+I+ +L DD+GW D+G +G TPN+D +A G + N+Y+
Sbjct: 21 SSVGLGAVGAPQSPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANP 80
Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRI 158
LC+PSR+A++TG+ PI G + GG+P SE +LP+ LKE GY ++I
Sbjct: 81 LCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPRSEHLLPELLKEAGYISKI 140
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
VGKWHLG ++ ++ P GF+ G H +D+ A + + RD W++ G
Sbjct: 141 VGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARP----NIPVYRD----WEMVG 191
Query: 219 KYS--------------TDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAP 263
++ T ++ EA+D I + P FLY A ATH+ P+ A
Sbjct: 192 RFYEEFPINLKTGESNLTQIYLQEALDFIKRQQAQQHPFFLYWAIDATHA-----PVYAS 246
Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
+L +R ++ + ++D+SVGK+++ L + N+ + F SDNG A
Sbjct: 247 KAFLGTS------QRGRYGDAVREIDDSVGKILDLLRGLSIGENTFVFFTSDNGAALISA 300
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
SN P K T +EGG+R + W P V+ Q + D T LS A
Sbjct: 301 PRQGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAAQVSHQLGSIMDLFTTSLSLAG 358
>gi|332529144|ref|ZP_08405108.1| sulfatase [Hylemonella gracilis ATCC 19624]
gi|332041367|gb|EGI77729.1| sulfatase [Hylemonella gracilis ATCC 19624]
Length = 454
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 122/368 (33%), Positives = 175/368 (47%), Gaps = 47/368 (12%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAI 114
++ P+I+FILADDLGW D+G +G TPN+D LA G+ + Y +C+ +R A+
Sbjct: 8 TTAAPNILFILADDLGWADLGVYGASDFATPNLDRLAAQGVRFTQAYANSAVCSATRIAL 67
Query: 115 MTGKHPIHTGMQHNVLYGCERG------GLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
+TG++ Q+ + G E GLP LP L+E GY T ++GKWHLG
Sbjct: 68 ITGRY------QYRLPAGLEEPIARSDIGLPPEHPTLPSLLREAGYDTALIGKWHLG-KP 120
Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD-----LHGKYSTD 223
Y P G++ G G DYF H G + RDL W+ Y T+
Sbjct: 121 PTYGPLKSGYDRFFGNIGGALDYFTHKPGV----GAQVPRDL---WEGDVPVERTGYYTN 173
Query: 224 VFTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF------ 276
+ EA + + +P FL L H P+ P + P + R I D
Sbjct: 174 ILGDEASAYVRAREDEKKPFFLSL-----HFTAPHWPWEGPGDE-AVSREITDLFHYDGG 227
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
+K+ ++ LD +VG+V+ AL++ N+++VF SDNGG + WP G
Sbjct: 228 NLAKYGELVEALDAAVGQVLAALDETGQADNTLVVFTSDNGGE------RFSKTWPFTGQ 281
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENI 396
K L EGG+R L+ P R V Q +DWLPTLL+AA P+Y S ENI
Sbjct: 282 KTELLEGGLRVPTLLRWP-ARVRPQVQAQVTITADWLPTLLAAAQVWPHPDY-PSDGENI 339
Query: 397 IPRYENSI 404
+P E +
Sbjct: 340 LPVLEGQV 347
>gi|340378605|ref|XP_003387818.1| PREDICTED: hypothetical protein LOC100637044 [Amphimedon
queenslandica]
Length = 2318
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 100/324 (30%), Positives = 159/324 (49%), Gaps = 21/324 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
PHI+F+L DD G+ D F I TPN L +G+IL +Y + C+PSR++ +TG+
Sbjct: 1141 PHIVFVLVDDWGFADASFRN-PAIKTPNFQYLVDNGLILNRHYVFKYCSPSRASFLTGRF 1199
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P H + G G ++ +LP LK GY T +VGKWH G Y Y P RGF+
Sbjct: 1200 PHHVHQWNPTPLG--MVGTNINMTMLPAKLKNAGYATHMVGKWHQGLYDPAYLPINRGFD 1257
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
+ G+ + +F+ + +D ++ P +G + ++ + + + H
Sbjct: 1258 TSSGFLQAEEGHFNQTIG----CAVDFWKNDAPD-TRNGTCDSYIYNKDLTTVFNEHDAS 1312
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF-KRSKFAAILHKLDESVGKVVEA 298
+PLFLYL H+ P++AP +L+++ R A++ D G VV
Sbjct: 1313 KPLFLYLPLHNVHA-----PIEAPQEWLDLYPANSTCDTRRILQAMVSVADNVTGHVVNL 1367
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L + M N+++V +DNGGA SN+PL+G K T +EGGVR L+ +
Sbjct: 1368 LRENDMWDNTLMVVSADNGGADC-----KGSNYPLKGSKYTFFEGGVRVIAFASGGLIPA 1422
Query: 359 --RGIVAEQYVHVSDWLPTLLSAA 380
RG + ++H++DW T A
Sbjct: 1423 GRRGKSTDGFIHIADWYTTFCKMA 1446
>gi|374369744|ref|ZP_09627765.1| sulfatase [Cupriavidus basilensis OR16]
gi|373098668|gb|EHP39768.1| sulfatase [Cupriavidus basilensis OR16]
Length = 433
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 183/366 (50%), Gaps = 31/366 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGK 118
P++I+I+ADDLG+ D+G +G + +P +D+LA +G++L + Y +C+P+R A+MT +
Sbjct: 3 PNLIYIVADDLGYADLGCYGGREPVSPVLDSLAANGLMLTQGYANSPVCSPTRFALMTAR 62
Query: 119 HPIHT-GMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ G + RG GLP + LP L+ GYRT ++GKWHLGF + P
Sbjct: 63 YQYRLRGAAEEPINSKSRGSSALGLPPAHPTLPSLLRAQGYRTALIGKWHLGF-APAFGP 121
Query: 174 TFRGFESHLGYWTGHQDYFDH--SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
G+++ G G DYF H S+ +W D EP Y TD+ + AVD
Sbjct: 122 LLSGYQAFFGPMAGGVDYFTHCDSSGRHDLWHNDAEHQ-EPG------YLTDLLSRRAVD 174
Query: 232 IIHNHSTD-EPLFLYLAHAATHSANPYE----PLQAPDHYLNIHRHIEDFKRSKFAAILH 286
+ S + P FL L + A H P+E AP N+ H++ + ++
Sbjct: 175 YVTRMSGEGAPFFLSLHYTAPHW--PWETRDDAALAPQVSNNLF-HLDGGNIHTYRRMIS 231
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
+DE +G +V+AL Q L N++IVF SDNGG + NWPL G K L EGG+R
Sbjct: 232 HMDEGIGWLVQALRQLGQLDNTLIVFTSDNGGE------RFSDNWPLVGGKMDLTEGGIR 285
Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILR 406
+ P + + G + Q+ DW TLL AA P Y V +++P ++ R
Sbjct: 286 VPWIAHWPAMIAPGGTSAQHCMTMDWSATLLDAAGVQADPGYPLDGV-SLMPLLRDAAHR 344
Query: 407 YENGTH 412
+ H
Sbjct: 345 FHRPLH 350
>gi|262383795|ref|ZP_06076931.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
gi|262294693|gb|EEY82625.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
Length = 452
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 123/349 (35%), Positives = 187/349 (53%), Gaps = 30/349 (8%)
Query: 38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGI 96
A LAF+ + + S P+I+ IL DDLG+ D+ G + I TP+ID L G+
Sbjct: 8 AACGLAFSAASAQTKALPDS--PNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGV 65
Query: 97 ILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELG 153
N++ + +PSR++++TG++P G+ + E LSE +LPQ LK+ G
Sbjct: 66 RFTNFHANCPVSSPSRASLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRG 125
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEP 212
Y +VGKW+LG + TPT RGF+ + G+ D Y+ H + +G + R+
Sbjct: 126 YHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH-----RRFGNNYMRENLK 179
Query: 213 AWDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
D G ++T++F+ A+ + + EP FLYLA+ A H+ P+Q P +L +
Sbjct: 180 EIDPQG-HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVK 233
Query: 272 HIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
E KR+K A++ LD +VG+V EAL++ L N+II+F SDNGG A
Sbjct: 234 KREPSLPEKRAKIVALIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNGGQDDA----GA 289
Query: 329 SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTL 376
+N P RG K ++EGG+R AG I W + R V E +V +SD PTL
Sbjct: 290 NNGPFRGAKQDMYEGGIRVAGGIYWKNRI--RPAVRENFVMLSDMFPTL 336
>gi|374374259|ref|ZP_09631918.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
gi|373233701|gb|EHP53495.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
Length = 476
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 124/371 (33%), Positives = 183/371 (49%), Gaps = 35/371 (9%)
Query: 37 FAVLPLAF-TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
FA+ +F TL + +A+ P+I+ ILADDLG+ DVG +G +I TP+IDALA G
Sbjct: 4 FAIFFASFLTLLIRLPTRLAAQTKPNIVLILADDLGYGDVGVYGQQKIKTPHIDALAKEG 63
Query: 96 IILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG---LPLSEKILPQYLKE 151
+ +Y +C PSRS+++TG+H HT ++ N+ G + G LP + + LK
Sbjct: 64 LRFTRFYCGTAVCAPSRSSLLTGQHTGHTPIRGNI--GVKPEGQWPLPGNTVTIATLLKS 121
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY--WTGHQDYF-DHSAEEMKMWGLDMRR 208
GY T GKW LGF P +GF+ GY T DYF DH +W + R
Sbjct: 122 AGYSTGCFGKWGLGFIGTSGDPLKQGFDRFYGYNCQTLAHDYFPDH------LWNNEQRV 175
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA----------NPY- 257
+L + YS + +A+ I H +P FL LA+ H+A N Y
Sbjct: 176 ELSNTHEKQTIYSAQLIHEQAMQFIRQHQR-QPFFLLLAYTLPHAALQLPKDDTLYNYYK 234
Query: 258 -----EPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVF 312
+P+ +H+ + + + +AA++ KLD VG+V + L+ + N++I+F
Sbjct: 235 KLFNEKPVPVAEHWNGKGYEPQAYPHAAYAAMVGKLDAYVGEVQQQLKVLGLDKNTMILF 294
Query: 313 VSDNGGAAAGFNLNAA--SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
SDNG G N A SN LRG+K L+EGG+R + W P G + Q
Sbjct: 295 ASDNGPHREGGNDPAFFNSNGGLRGIKRDLYEGGIRSPFIAWWPGTIQAGTQSNQVGAFW 354
Query: 371 DWLPTLLSAAN 381
D LPT + A
Sbjct: 355 DLLPTFAALAG 365
>gi|50753991|ref|XP_414208.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Gallus gallus]
Length = 513
Score = 167 bits (423), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 113/368 (30%), Positives = 182/368 (49%), Gaps = 50/368 (13%)
Query: 35 MAFAVL-PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
MA AVL PL ++ + PP+++ +L DD+GW D+G G TPN+D +A
Sbjct: 1 MARAVLVPLWLCWAL------GCAAPPNVVLLLMDDMGWGDLGAFGEPSKETPNLDQMAS 54
Query: 94 SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKIL 145
G++ ++Y LC+PSR+A++TG+ P+ G + GG+ SE +L
Sbjct: 55 EGMLFLDFYAANPLCSPSRAALLTGRLPVRNGFYTTNAHARNAYTPQDIVGGIQDSEILL 114
Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
P+ LK+ GY +I+GKWHLG ++ ++ P GF+ G H +D+ A + +
Sbjct: 115 PELLKKAGYTNKIIGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRA----LPNIP 169
Query: 206 MRRDLEPAWDLHGKYS--------------TDVFTAEAVDIIH-NHSTDEPLFLYLAHAA 250
+ RD W++ G+Y T ++ EA+D I ++ +P FLY A A
Sbjct: 170 VYRD----WEMIGRYYEDFKIDLRTGEANLTQIYLQEALDFISKQQASQQPFFLYWAIDA 225
Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
TH+ P+ A H+L +R ++ + ++D+SVGK+++ L++ + N+ +
Sbjct: 226 THA-----PVYASKHFLGTS------QRGRYGDAVREIDDSVGKILKHLQKLGISENTFV 274
Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
F SDNG A SN P K T +EGG+R + W P G V+ Q V
Sbjct: 275 FFTSDNGAALISAPKQGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGSVSRQLGSVM 334
Query: 371 DWLPTLLS 378
D T LS
Sbjct: 335 DLFTTSLS 342
>gi|330505678|ref|YP_004382547.1| putative sulfatase [Pseudomonas mendocina NK-01]
gi|328919964|gb|AEB60795.1| probable sulfatase precursor [Pseudomonas mendocina NK-01]
Length = 629
Score = 167 bits (423), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 125/388 (32%), Positives = 192/388 (49%), Gaps = 46/388 (11%)
Query: 29 GYRTRIMAFAVLPLAFTLSMVFVDLVASS-GPPHIIFILADDLGWNDVGFHGLDQIPTPN 87
G R R++ +L L + A S P+I+ +L DDLG ND+ G + PTP
Sbjct: 10 GARMRLVLLGIL-----LCTAYAGNAAESHAKPNILLVLVDDLGNNDIASWGDGRAPTPT 64
Query: 88 IDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQ 147
+DAL+ S + L+ +YT C+PSR+++++G+HP+ G Q + L GL LP+
Sbjct: 65 LDALSASAVRLRRHYTDSTCSPSRASLISGRHPVSVGFQADGL------GLSPDLVTLPK 118
Query: 148 YLKELGYRTRIVGKWHLG--FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
L+ LGYRT VGKWHLG +E P +GF+ YW G ++F G
Sbjct: 119 SLRSLGYRTLHVGKWHLGEALEYEEIQPGQQGFD----YWFGMLNHFVLQGPGPD--GRP 172
Query: 206 MRR----------DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSAN 255
+RR D A H Y D+ T +AV+++ + ++P F+ L + H+
Sbjct: 173 VRRQPTHINPWLQDNGSAPAQHQGYLDDILTDKAVELVKSGVGEKPWFINLWLFSPHT-- 230
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
PY+P A D K+ AIL +LD ++ ++++ALE+ N++++F SD
Sbjct: 231 PYQPSPA------FSTQFPDTPEGKYFAILSQLDHNMARLLKALEESGQADNTVVIFASD 284
Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
NGGA N+N +N+PL G K T EGGVR L+ P + E V + D PT
Sbjct: 285 NGGA----NINMDNNYPLIGKKATYTEGGVRTPLLLRWPGHYENQDIREPTVFM-DIFPT 339
Query: 376 LLSAANKSDIPNYVNSTVENIIPRYENS 403
L+S A + N++P +E
Sbjct: 340 LVSIAGGNPPKGLAG---RNLLPLFEGG 364
>gi|150008923|ref|YP_001303666.1| secreted sulfatase [Parabacteroides distasonis ATCC 8503]
gi|149937347|gb|ABR44044.1| putative secreted sulfatase precursor [Parabacteroides distasonis
ATCC 8503]
Length = 452
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 135/416 (32%), Positives = 213/416 (51%), Gaps = 41/416 (9%)
Query: 38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGI 96
A LAF+ + + S P+I+ IL DDLG+ D+ G + I TP+ID L G+
Sbjct: 8 AACGLAFSAASAQTKALPDS--PNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGV 65
Query: 97 ILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELG 153
N++ + +PSR++++TG++P G+ + E LSE +LPQ LK+ G
Sbjct: 66 RFTNFHANCPVSSPSRASLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRG 125
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEP 212
Y +VGKW+LG + TPT RGF+ + G+ D Y+ H + +G + R+
Sbjct: 126 YHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH-----RRFGNNYMRENLK 179
Query: 213 AWDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
D G ++T++F+ A+ + + EP FLYLA+ A H+ P+Q P +L +
Sbjct: 180 EIDPQG-HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVK 233
Query: 272 HIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
E KR+K A++ LD +VG+V EAL++ L N+II+F SDNGG A
Sbjct: 234 KREPSLPEKRAKIVALIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNGGQDDA----GA 289
Query: 329 SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
+N P RG K ++EGG+R AG I W + R V + +V +SD PTL + + +P
Sbjct: 290 NNGPFRGAKQDMYEGGIRVAGGIYWKN--QIRPAVRDNFVMLSDMFPTL---CDLTAVP- 343
Query: 388 YVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYEN 443
V+ ++ I SIL G + R+ + R N + Y +RY++
Sbjct: 344 -VSHEIDGI------SILPLLRGEEQDTGDRMVHWVRREGNSRYGGKAYYASRYKD 392
>gi|255690672|ref|ZP_05414347.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
gi|260623696|gb|EEX46567.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
Length = 459
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/337 (32%), Positives = 179/337 (53%), Gaps = 34/337 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+ + I+ADD+G+ DVG +G + I TPNID +A G++ ++++ + +P+R ++TG+
Sbjct: 29 PNFVIIVADDMGYGDVGIYGNEYIKTPNIDQIAREGMMFTDFHSNGSVSSPTRCGLLTGR 88
Query: 119 HPIHTGMQHNVLYGCE----RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+ G++ +L + GLP E + L + GYRT ++GKWHLG+ +K +
Sbjct: 89 YQQRAGLEKVLLVPRDDKDKEVGLPSEEITFAKILGDNGYRTALIGKWHLGYLQKHHPMN 148
Query: 175 FRGFESHLGYWTGHQDYFDHSAE--EMKMW-GLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
F GF+ +G+ +G+ DY H +M W GL+M+ D+ G Y+T + T + D
Sbjct: 149 F-GFQKFVGFKSGNVDYQSHRNRYGDMDWWDGLEMK-------DMSG-YTTTLLTTLSED 199
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH-YLNIHRHIEDFKRSK------FAAI 284
I + D+P LY+AHAA HS P+Q PD + E K S + +
Sbjct: 200 YI-KENKDKPFCLYIAHAAPHS-----PMQGPDEKAVRTEATPEGDKNSDRSNKEIYKDM 253
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
+ +LD SVG+++E L++ ++ N+ +VF SDNG N S +G K + WEGG
Sbjct: 254 VEELDWSVGRILETLKKYKLDENTFVVFFSDNGPVIN----NGGSAGGYKGAKGSPWEGG 309
Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
R G+ + P G EQ V D PT+L A+
Sbjct: 310 HRVPGICYMPGTIKEGTTCEQTVMSFDLFPTMLDMAD 346
>gi|325109298|ref|YP_004270366.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969566|gb|ADY60344.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 463
Score = 167 bits (422), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 124/411 (30%), Positives = 204/411 (49%), Gaps = 47/411 (11%)
Query: 36 AFAVLP-LAFTLSMVFVDLVASSGP------PHIIFILADDLGWNDVGFHGLDQIPTPNI 88
+ A +P L +S+V V L S P+++ IL DDLG+ D+ +G + +PNI
Sbjct: 3 SLATVPKLQLLMSLVAVVLFGVSSTAQAADKPNVLLILVDDLGYGDLSCYGATDLQSPNI 62
Query: 89 DALAYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER--GGLPLSEKIL 145
D L G+ N+Y +C+P+R+AI++GK+P G+ + + G L ++L
Sbjct: 63 DKLVSRGLKFTNFYANCPVCSPTRAAILSGKYPDRVGVPGVIRTHADNSWGYLAPEAELL 122
Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQ-DYFDHSAEEMKMWGL 204
P L+ GY + I+GKWHLG + P RGF+ GY DY+DH + G+
Sbjct: 123 PSLLQPAGYHSAIIGKWHLGL-EAPNRPNDRGFDHFKGYLGDMMDDYYDH-----RRHGI 176
Query: 205 DMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAP 263
+ R+ E D G ++TD+FT + D + + +E P FLYLA+ A H+ P+Q P
Sbjct: 177 NYMRENEQEIDPEG-HATDLFTEWSCDYLKEQADNEQPFFLYLAYNAPHT-----PIQPP 230
Query: 264 DHYLNIHRHIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
+L + E +R+K A++ +D+ +G+V+ LEQ +++++F SDNGG
Sbjct: 231 QDWLEKVKKREAGIGEQRAKLVALIEHMDDGIGQVLTCLEQTGQADDTLVIFTSDNGG-- 288
Query: 321 AGFNLN-AASNWPLRGVKNTLWEGGVR-GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
LN A+N R K +++EGG++ ++W +E+ G E D+ T+L
Sbjct: 289 ---QLNVGANNGATRDGKQSMYEGGLKVPTAVVWPGKIEA-GQTTEFQALSMDFCSTILD 344
Query: 379 AA---------NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIE 420
A +S +P + + + P + S R G YN IE
Sbjct: 345 VAGVEQPSDCDGRSILPTLLGNEQK---PLRDLSFFRRREGGIRYNGKTIE 392
>gi|302370951|ref|NP_001180574.1| N-acetylgalactosamine-6-sulfatase isoform 2 precursor [Mus
musculus]
gi|26329565|dbj|BAC28521.1| unnamed protein product [Mus musculus]
Length = 440
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 104/319 (32%), Positives = 163/319 (51%), Gaps = 43/319 (13%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++Y+ LC+PSR+A++TG
Sbjct: 27 PPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTG 86
Query: 118 KHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
+ PI G N E GG+P SE +LP+ LK+ GY +IVGKWHLG ++ +
Sbjct: 87 RLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQ 145
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
+ P GF+ G H +D+ A+ + + RD W++ G++
Sbjct: 146 FHPLKHGFDEWFGSPNCHFGPYDNKAKP----NIPVYRD----WEMVGRFYEEFPINRKT 197
Query: 222 -----TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
T ++T EA+D I H+ P FLY A ATH+ P+ A +L
Sbjct: 198 GEANLTQLYTQEALDFIQTQHARQSPFFLYWAIDATHA-----PVYASRQFLGTSL---- 248
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
R ++ + ++D+SVGK++ L+ + N+ + F SDNG A SN P
Sbjct: 249 --RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLC 306
Query: 336 VKNTLWEGGVRGAGLIWSP 354
K T +EGG+R + W P
Sbjct: 307 GKQTTFEGGMREPAIAWWP 325
>gi|359400556|ref|ZP_09193534.1| sulfatase [Novosphingobium pentaromativorans US6-1]
gi|357597898|gb|EHJ59638.1| sulfatase [Novosphingobium pentaromativorans US6-1]
Length = 493
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 116/346 (33%), Positives = 170/346 (49%), Gaps = 36/346 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+++ IL DDL W DV +G I TPNIDA+ +G + Y T +C+ SR+ +++G+
Sbjct: 45 PNVVVILVDDLAWTDVSANGASTIQTPNIDAIGAAGATFSSGYVTASVCSVSRAGLLSGR 104
Query: 119 HPIHTGMQHNVLYGCERG------GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
+P G +N+ ERG GLP+ L + LK LGY T + GKWH G E+
Sbjct: 105 NPTTFGFMYNI---TERGAVDKGAGLPVDVPTLAERLKPLGYDTAVFGKWHQG-STPEFY 160
Query: 173 PTFRGFESHLGYWTGHQDY--------------FD--HSAEEMKMWGLDMRRDLEPAWDL 216
PT RGF+ G+ G Y FD E L D+EP
Sbjct: 161 PTRRGFDEFFGFLAGESIYATPDTPGIVTTPTSFDKYQPKEREGAQRLFQGPDMEPVAQP 220
Query: 217 HGKYSTDVFTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
H KY T+ T AVD I + +P FLY+A+ A P+ PLQ P Y + H I D
Sbjct: 221 H-KYLTEEITEHAVDFIDRKADGKDPFFLYVAYNA-----PHWPLQVPRTYYDRHSEIAD 274
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
R + A++ LD VG++++AL+ + + ++++VF+SDN G F ++ P G
Sbjct: 275 PARRTYVAMIDALDAGVGQILKALDDKGLREDTLVVFLSDN-GCPVQFGYCTQTH-PWGG 332
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K T EGG+R + P + G V D +PT+L A +
Sbjct: 333 GKFTYLEGGMRVPFMASWPGHIAAGQRIATPVSSLDIVPTVLEAVS 378
>gi|198432447|ref|XP_002128343.1| PREDICTED: similar to galactosamine (N-acetyl)-6-sulfate sulfatase
[Ciona intestinalis]
Length = 513
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 119/390 (30%), Positives = 190/390 (48%), Gaps = 36/390 (9%)
Query: 43 AFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY 102
F L ++ +A S P+IIF+L DD+GW D+G +G TPN+D +A G + ++Y
Sbjct: 4 CFLLLLMCGYSLAESSKPNIIFMLMDDMGWGDLGINGQPSKETPNLDNMAKEGTLFTDFY 63
Query: 103 TVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGY 154
+ LC+PSR+A++TG+ PI G + +G GG+P E ++ + L GY
Sbjct: 64 SANPLCSPSRAALLTGRLPIRNGFYTSNYHGHNGYTPQHIVGGIPDHEILVSELLSSAGY 123
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWG---LDMRRDL 210
+++GKWHLG +++Y P GF G H Y D + + ++ + R
Sbjct: 124 TNKLIGKWHLG-QQEQYLPLKHGFHEWFGSPNCHFGPYDDKTTPNIPVYNNTEMVGRYYE 182
Query: 211 EPAWDLHGKYS--TDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYL 267
E A + H S T + EA+D I +E P FLY A ATHS P+ + +
Sbjct: 183 EFAIESHKYLSNMTQYYIQEALDFIERMERNEKPFFLYWAPDATHS-----PVYSSPMFR 237
Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
R R + + +LD VG +++ L+Q + N++++F SDNG A G
Sbjct: 238 GASR------RGPYGDAVMELDYGVGVIIQKLKQLGLDKNTLVLFSSDNGAAMIGSAFEN 291
Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
SN P K T +EGG+R + W P +G V+ Q + DW T L A+ + +P
Sbjct: 292 GSNGPFLCGKQTTFEGGIREPTIAWGPGFVQQGQVSHQVGSLMDWFATALDLASIA-VPK 350
Query: 388 YVNSTVENIIPRYENSILRYENGTHEYNSP 417
+ ++ + S+L NG E++ P
Sbjct: 351 --DRIIDGV------SLLPVLNGAAEFDRP 372
>gi|196232399|ref|ZP_03131252.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196223471|gb|EDY17988.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 452
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 127/372 (34%), Positives = 184/372 (49%), Gaps = 52/372 (13%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
M FAV A T S VD + P+II ILADDLG + G I TPNID L +
Sbjct: 10 MMFAV---ALTCSAFAVD----ASKPNIIVILADDLGRGEYSDFGTKDIQTPNIDRLCHE 62
Query: 95 GIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER---GGLPLSEKILPQYLK 150
G+ N+Y +C+PSR+A+MTG P G+ + + G L ++ PQ LK
Sbjct: 63 GLTFDNFYANSCVCSPSRAALMTGCFPNRVGVPGVIREEDQDNNWGWLAQGVRLFPQLLK 122
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG--------YWT---GHQDYFDHSAEEM 199
GY++ IVGKWHLG TPT RGF+ G YWT Q++ H+ E +
Sbjct: 123 PAGYQSAIVGKWHLGINSPN-TPTERGFDRFDGFLGDMMDDYWTHLRHGQNFMRHNQEVV 181
Query: 200 KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYE 258
G ++TDVFT A+ + S + EP FLYLA+ A P++
Sbjct: 182 SPEG----------------HATDVFTDWALRYLDERSKSKEPFFLYLAYNA-----PHD 220
Query: 259 PLQA-PDHYLNIHRHIEDF--KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
P+Q PD + KR+K A++ LD+ +GKV+ L++ + N+I++F SD
Sbjct: 221 PIQPKPDWLAKVKAREPGLSEKRAKLVALIEHLDDGIGKVLAKLDETGLAKNTIVLFSSD 280
Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
NGG + N A+N P R K ++EGG+R ++ P + G + + + D LPT
Sbjct: 281 NGGVLS----NEANNGPWRSGKTHMYEGGLRVPCMVRWPGHVAPGTHSNRVALLMDILPT 336
Query: 376 LLSAANKSDIPN 387
L ++ PN
Sbjct: 337 ALEIVGQTPPPN 348
>gi|410628681|ref|ZP_11339399.1| sulfatase [Glaciecola mesophila KMM 241]
gi|410151685|dbj|GAC26168.1| sulfatase [Glaciecola mesophila KMM 241]
Length = 502
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 109/330 (33%), Positives = 164/330 (49%), Gaps = 20/330 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY-SGIILKNYYTVQLCTPSRSAIMTGK 118
P+I+ IL+DD G+ D GF G QI TPN+D LA S + + Y + +C PSR+ I+TGK
Sbjct: 60 PNIVLILSDDAGYGDFGFQGSSQIRTPNLDNLAVQSTVFTQAYVSAAVCGPSRAGILTGK 119
Query: 119 HPIHTGMQHNVL--YGCERG------GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
+ G + N + Y + G GLPL+++ + YL GY T ++GKWH G K
Sbjct: 120 YQQRFGFEENNVPGYMSDSGLTGDDMGLPLNQRTIGDYLTHFGYSTALIGKWHQGNADK- 178
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
+ PT RGFE G+ G + YF+ + D + Y T EA+
Sbjct: 179 FHPTKRGFEHFYGFRGGARSYFEFGPNNPVSYPEDRLERGFAHYKESPHYLTQALADEAI 238
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
I + EP FL+L+ A H+ P+ A L + KR + AA+ +D
Sbjct: 239 KFIKQNQR-EPFFLFLSFNAVHT-----PMDANKEDLAQFPQLSG-KRQRVAAMTLSMDR 291
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
+GKV+ +L++ + N+++VF +DNGG + + NA+ N PL G K EGG+R L
Sbjct: 292 EIGKVLASLDELGLSENTLLVFTNDNGGPS---DTNASDNTPLSGTKANHLEGGIRVPFL 348
Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+ P + + D LPT +AA
Sbjct: 349 MRWPAGNIGPNTYHEPISTLDLLPTFFAAA 378
>gi|423214668|ref|ZP_17201196.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692574|gb|EIY85811.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
CL03T12C04]
Length = 453
Score = 166 bits (420), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 109/354 (30%), Positives = 183/354 (51%), Gaps = 24/354 (6%)
Query: 36 AFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
A L A T P+++ IL DDLG D+ + TPNID + +G
Sbjct: 7 ALLALGAAMTSGAALASNTVDKSTPNVLLILVDDLGLGDLSCQYAKDLSTPNIDRIFETG 66
Query: 96 IILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER--GGLPLSEKILPQYLKEL 152
+ L N+Y + +PSR+A++TG+ P G+ + ++ G S +P+ LK
Sbjct: 67 VRLDNFYANSSVSSPSRAALLTGRFPAMVGVPGVIRPSVDQNWGYFDPSAVTMPEVLKNG 126
Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLE 211
GYRT ++GKWHLG ++ P RGF+ G+ D Y+ H + L+ ++++
Sbjct: 127 GYRTALIGKWHLG-WESPNLPNERGFDHFHGFLADMMDDYYTHRRQGGNYMYLN-DKEID 184
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
P ++T++FT+ +VD I + ++ P FLYLA+ A HS PLQ P ++N
Sbjct: 185 PK-----GHATELFTSWSVDYIKKEAKEKNPFFLYLAYNAPHS-----PLQPPVEWVNKV 234
Query: 271 RHIED---FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
+ + KR++ A++ LD ++GKV+++LE+ L+N++++F SDNGG +
Sbjct: 235 QERDKSLPVKRARLIALIEHLDYNIGKVIQSLEESGQLNNTLVIFASDNGGDRG----SM 290
Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
A+N P RG K ++EGG+ A + P + G +V + D +PT+ N
Sbjct: 291 ANNGPTRGAKGDMFEGGIHVACALNMPGVFEGGRRDNHFVVMMDLMPTICDFVN 344
>gi|148679746|gb|EDL11693.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_a [Mus
musculus]
Length = 462
Score = 166 bits (420), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 104/319 (32%), Positives = 163/319 (51%), Gaps = 43/319 (13%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++Y+ LC+PSR+A++TG
Sbjct: 49 PPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTG 108
Query: 118 KHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
+ PI G N E GG+P SE +LP+ LK+ GY +IVGKWHLG ++ +
Sbjct: 109 RLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQ 167
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
+ P GF+ G H +D+ A+ + + RD W++ G++
Sbjct: 168 FHPLKHGFDEWFGSPNCHFGPYDNKAKP----NIPVYRD----WEMVGRFYEEFPINRKT 219
Query: 222 -----TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
T ++T EA+D I H+ P FLY A ATH+ P+ A +L
Sbjct: 220 GEANLTQLYTQEALDFIQTQHARQSPFFLYWAIDATHA-----PVYASRQFLGTSL---- 270
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
R ++ + ++D+SVGK++ L+ + N+ + F SDNG A SN P
Sbjct: 271 --RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLC 328
Query: 336 VKNTLWEGGVRGAGLIWSP 354
K T +EGG+R + W P
Sbjct: 329 GKQTTFEGGMREPAIAWWP 347
>gi|223936836|ref|ZP_03628745.1| sulfatase [bacterium Ellin514]
gi|223894405|gb|EEF60857.1| sulfatase [bacterium Ellin514]
Length = 477
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 125/365 (34%), Positives = 179/365 (49%), Gaps = 51/365 (13%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L+ L VF ++ P+I+FILADDLG+ DV +G TPNID LA GI +
Sbjct: 5 LSLLLMAVFCLSTKAANKPNIVFILADDLGYTDVACYGSKYYETPNIDKLAKDGIKFTDG 64
Query: 102 YTVQL-CTPSRSAIMTGKHPIHTGM-----------QHNVLYGCER-GGLPLSEKILPQY 148
+T C P+R+++M+G++ TG+ Q L+ E LPL + L Q
Sbjct: 65 HTCGPNCQPTRASLMSGQYGPRTGVYTVGSIDRFAWQTRSLHPVENVTKLPLDKITLAQS 124
Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
LK+ GY T + GKWHLG KE+ P RGF+ L H D+ + +D +
Sbjct: 125 LKKAGYATGMFGKWHLG-EDKEHHPAQRGFDEALVSMGVHFDFVTNPK-------VDYPK 176
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL- 267
D +Y D T +A+D I H DEP FLYL H A H +PLQA +
Sbjct: 177 D---------EYLADFLTDKALDFIKRHK-DEPFFLYLPHYAVH-----KPLQAKKELIQ 221
Query: 268 --NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA----- 320
+ + ++ +AA++ +DESVG+VV L++ ++ N++++F SDNGG
Sbjct: 222 KFSAKQGVDGHHNPTYAAMIASVDESVGRVVALLDELKLSDNTLVIFSSDNGGVGGYQRE 281
Query: 321 ----AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
AG + N PLRG K L+EGG R + P G V +Q + D PTL
Sbjct: 282 GIKKAG---DVTDNNPLRGGKGMLYEGGHRVPYIFRWPGKIPAGKVCDQPIISIDLYPTL 338
Query: 377 LSAAN 381
L A
Sbjct: 339 LELAG 343
>gi|294053769|ref|YP_003547427.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613102|gb|ADE53257.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 490
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 105/307 (34%), Positives = 162/307 (52%), Gaps = 32/307 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+ IFI+ADDLG+ D+GF+G + TP +D LA++G+I +N Y T C PSR+ ++TG+
Sbjct: 23 PNFIFIVADDLGYGDLGFNGSKVVETPTLDRLAHNGVIFQNGYVTHPYCGPSRAGLITGR 82
Query: 119 HPIHTGMQHNVLYGC--ERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+ GM+ N+ Y GLPL EK + ++ GYRT ++GKWHLG + + P R
Sbjct: 83 YQARFGMEINLTYSPFDLHQGLPLDEKTFAERVRPAGYRTGMIGKWHLG-ASEPFHPNSR 141
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGL---------------DMRRDLEPAWDLHGKYS 221
GF+ G+ +G DYF L + R+ A +Y
Sbjct: 142 GFDYFYGFLSGGHDYFPEMVNTHPKLLLPNGKPHYSHNEGCTQPLLRNKNAA--EFNEYL 199
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE-DFKRSK 280
T + +A + + +++P LYLA+ A H PL AP + + +E D +R+
Sbjct: 200 TTALSKDAARFVSD--SEKPFLLYLAYNAPHG-----PLHAPKELVEKYSKLEKDKRRAT 252
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA--GFNLNA-ASNWPLRGVK 337
+ A++ +D+ +G +VEALEQ L N++I F+SDNGG + G A N P R K
Sbjct: 253 YLAMIDSMDQGIGLIVEALEQSGKLDNTLIFFMSDNGGVQSKPGHEYETWADNGPFRNGK 312
Query: 338 NTLWEGG 344
++ EGG
Sbjct: 313 GSMREGG 319
>gi|149197520|ref|ZP_01874571.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139538|gb|EDM27940.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 446
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 114/354 (32%), Positives = 181/354 (51%), Gaps = 26/354 (7%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL- 98
+ F+L M FV L+ ++ P+I+ + ADD+GW DV +HG++ TP IDA+A G+
Sbjct: 1 MKFLFSL-MGFVALLRAADKPNIVLVFADDMGWGDVAYHGVEDAQTPAIDAIAKGGVWFE 59
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
+ Y +C PSR+ I+TG++ G+ N G G+P S+K + + LK GY++
Sbjct: 60 QGYAAASVCGPSRAGILTGRYQQLFGVVTN---GDADKGIPKSQKNIAELLKPAGYKSGA 116
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF--DHSAEEMKMWGLDMRRDLEPAWDL 216
GKWHLG K ++ P RGF++ G+ G DY+ D + K + + +
Sbjct: 117 FGKWHLGSKKGQF-PNDRGFDTFYGFHFGAHDYYRADKKLNKKKKGYAPIYFNQDIVDYK 175
Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL-NIHRHIED 275
G Y T+ T AV+ I + D+P F+Y+A+ + HS P Q PD YL I +
Sbjct: 176 EGDYLTEKITDHAVEFIEENK-DQPFFMYVAYNSVHS-----PWQVPDEYLARIPESVPA 229
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG--------FNLNA 327
++R F A++ +D+ VG++ L++ + N+I VF +DNG G + ++
Sbjct: 230 YRRL-FLAMVLAMDDGVGRIRAKLKELNLDENTIFVFTTDNGSPKIGNKKPNEGQYRMSM 288
Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ RG K +EGG+R + P G E V D PT LSAA+
Sbjct: 289 SQG--FRGYKGDTYEGGIRVPFCMSWPKKIKSGNKFEAPVIAYDLAPTFLSAAS 340
>gi|440908775|gb|ELR58760.1| N-acetylgalactosamine-6-sulfatase, partial [Bos grunniens mutus]
Length = 525
Score = 166 bits (419), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 114/359 (31%), Positives = 174/359 (48%), Gaps = 36/359 (10%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS V + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ N+
Sbjct: 16 LLLVLSAAEVGVARALQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNF 75
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
YT LC+PSR+A++TG+ PI +G N E GG+P SE +LP LK G
Sbjct: 76 YTANPLCSPSRAALLTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAG 135
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + RD E
Sbjct: 136 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRDQEMV 190
Query: 214 WDLHGKYS----------TDVFTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQA 262
+ ++ T ++ EA++ I + P FLY A ATH+ P+ A
Sbjct: 191 GRFYEEFPINLKTGEANLTQIYLQEALEFIQRQQAAHRPFFLYWAVDATHA-----PVYA 245
Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
+L +R ++ + +LD+SVG+++ L + N+ + F SDNG A
Sbjct: 246 SKPFLGTS------QRGRYGDAIRELDDSVGRILRLLRDLSIAENTFVFFTSDNGAALIS 299
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
SN P K T +EGG+R + W P G V+ Q + D T LS A
Sbjct: 300 AP-RQGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAG 357
>gi|311746665|ref|ZP_07720450.1| sulfatase family protein [Algoriphagus sp. PR1]
gi|311302556|gb|EAZ82500.2| sulfatase family protein [Algoriphagus sp. PR1]
Length = 465
Score = 166 bits (419), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 128/366 (34%), Positives = 188/366 (51%), Gaps = 43/366 (11%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
FT ++ +A S P+ I I DD G+ DVG G I TPN+D +A G N+
Sbjct: 11 FLFTFLLIANYSLAQSQKPNFIIIFTDDQGYGDVGTFGHPTIKTPNLDQMAMEGQKWTNF 70
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHN---VLYGCERGGLPLSEKILPQYLKELGYRTR 157
Y +CTPSRSAIMTG+ P+ TGM N VL+ GGLP +E + + LK GY T
Sbjct: 71 YVAANVCTPSRSAIMTGRLPVRTGMYSNTRRVLFPDSGGGLPATENTIAKLLKTSGYSTA 130
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGY-WTGHQDYF-DHSAEEM------KMWGLDMRRD 209
+GKWHLG + EY PT GF+++ G ++ D D +A+E + + + + RD
Sbjct: 131 AIGKWHLG-HLPEYLPTSHGFDTYFGIPYSNDMDRINDVTAQEAFASPKPEYFNVPLMRD 189
Query: 210 LE----PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
E PA T +T EAV I + D+P F+YLAH+ P+ PL A +
Sbjct: 190 KEIIERPA---DQTTITKRYTEEAVSYIKANK-DQPFFIYLAHSL-----PHIPLFASED 240
Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
+L +R + ++ ++D SVG+++ L+ + N+ ++F SDNG +N
Sbjct: 241 FLTKS------ERGLYGDVIEEIDWSVGQILSTLKSEGIDKNTYVIFTSDNGPWLV-YNE 293
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYV----HVSDWLPTLLSAAN 381
S+ L G K T +EGGVR +IW P G + Q V D LPT +S+ +
Sbjct: 294 QGGSSGGLFGGKGTSYEGGVRVPTIIWGP-----GNIKPQVVSKIGSTLDLLPT-ISSLS 347
Query: 382 KSDIPN 387
++IPN
Sbjct: 348 GTEIPN 353
>gi|340616348|ref|YP_004734801.1| sulfatase [Zobellia galactanivorans]
gi|339731145|emb|CAZ94409.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
Length = 489
Score = 165 bits (418), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 121/339 (35%), Positives = 175/339 (51%), Gaps = 35/339 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+I+FI ADDLG+ D+GF G D TPN+D LA + Y+ C PSR +IMTGK
Sbjct: 39 PNIVFIFADDLGFADLGFTGSDTHLTPNLDKLAKESVYFDRAYSSHPTCAPSRMSIMTGK 98
Query: 119 HPIHTG-MQHNVLYGCER-----GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
+P G + H L G GLP++E + + LK+ GY T +GKWH+G K E
Sbjct: 99 YPARLGAVSHGKLGGVAHPGPNDNGLPMTETTIGEALKKEGYTTAHIGKWHIG--KGENN 156
Query: 173 PTFRGFESHLG----YWTGHQDY-FDHSAEEMKMWG-LDMRRDLEPAWDLHGKYSTDVFT 226
P RGF+ + G Y F+ + E+ ++ + D +P G + TD
Sbjct: 157 PGTRGFDVDIASNEFCCPGSYMYPFESNNEKQRVASKIPDLEDRKP-----GDFLTDALA 211
Query: 227 AEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQA-PDHYLNIHRHIEDFKRSK---F 281
EAV IH STDE P FL ++ A H+ P+ A P+ R I R K +
Sbjct: 212 EEAVKFIH--STDEKPFFLNMSFYAVHT-----PITAIPEKVEKYKRLIGPDARQKNPTY 264
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
A ++ LD++VG +++ALE++ ++ N+IIVF SDNGG N+PLR K + +
Sbjct: 265 AGLVEHLDDAVGAILKALEEKGIIDNTIIVFTSDNGGEILH---GITDNFPLRDGKGSSY 321
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
EGG R L+ P + V+ + + D+ PT LS A
Sbjct: 322 EGGTRVPLLVKWPGVTQVNTVSHERIIGFDYYPTFLSMA 360
>gi|319954036|ref|YP_004165303.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
gi|319422696|gb|ADV49805.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
Length = 467
Score = 165 bits (418), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 168/345 (48%), Gaps = 20/345 (5%)
Query: 46 LSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTV 104
L V + + P+I+FIL+DD G+ D GF G + TP +D LA I + Y +
Sbjct: 13 LGFFKVSFLTAQQQPNIVFILSDDAGYADFGFQGSKEFKTPELDKLAKKSIKFSQAYVSA 72
Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYG--------CERGGLPLSEKILPQYLKELGYRT 156
+C PSR+ I+TGK+ G + N + G + GLPL + + YL++LGY+T
Sbjct: 73 AVCGPSRAGILTGKYQQKFGFEENNVPGYMSTSGLVGDEMGLPLDQITIANYLQDLGYKT 132
Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL 216
+ GKWH G + PT RGF+ G+ G + Y + D +
Sbjct: 133 ALFGKWHQG-NADRFHPTKRGFDEFYGFRGGARSYMPYDDSNPLSKNEDRLERGFGNFLE 191
Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
H Y TD EA+ I N + P F+YL+ A H+ P++A L H++
Sbjct: 192 HEGYLTDELAHEAISFI-NRNKKHPFFIYLSFNAVHT-----PMEATAEDLEQFPHLKG- 244
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
KR AA+ +D ++G V++ LE + + +N++IVF +DNGG + + N + N PL G
Sbjct: 245 KRKTLAAMTLAMDRAIGTVLKELETQGLSNNTLIVFTNDNGGPS---DANDSDNSPLSGT 301
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K EGG+R L+ P + + D PT LSAA+
Sbjct: 302 KANHLEGGIRVPFLMSWPKQLQKNKNYNYPISTLDLFPTFLSAAS 346
>gi|282900986|ref|ZP_06308919.1| Twin-arginine translocation pathway signal [Cylindrospermopsis
raciborskii CS-505]
gi|281194077|gb|EFA69041.1| Twin-arginine translocation pathway signal [Cylindrospermopsis
raciborskii CS-505]
Length = 434
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 126/395 (31%), Positives = 185/395 (46%), Gaps = 48/395 (12%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I+FILADDLGW D+ G TP +D LA GI Y+ +CTP+R TG+
Sbjct: 12 PNIVFILADDLGWGDLSIDGQKNYQTPYLDQLARDGIRFNQAYSDSPVCTPTRIGFFTGR 71
Query: 119 HP--IHTGMQHNVLYGCERG---GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+P + G +L + G GLP + L++ GY T +VGKWH G Y Y+P
Sbjct: 72 YPGRLAIGNYEPLLSFQQIGDSVGLPPEHPTIASLLRDNGYETVLVGKWHCG-YLPRYSP 130
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMK--MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
GF G ++G DYF H K +W D + Y TD+FT AVD
Sbjct: 131 LKSGFNKFFGNFSGAIDYFRHVDTNGKPDLWEADTPIE-------KIGYVTDIFTERAVD 183
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL---------NIHRHIEDFKRSKFA 282
I + P +L L + A P+ P Q P+ N+ I + +
Sbjct: 184 FIKGPHVN-PFYLSLHYTA-----PHWPWQGPEDVELSNDLIGRDNLENWINTGTKESYK 237
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA-AAGFNLNAASNWPLRGVKNTLW 341
A++ LD VGKV++ALE + + +I++F SDNGG + F L +G K L+
Sbjct: 238 AVVQSLDNGVGKVLQALEDVGIANRTIVIFASDNGGERFSNFGL-------FQGKKGNLY 290
Query: 342 EGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY------VNSTVE 394
EGG+R LI WS ++ I ++Q + D T+L+A + P+Y + ++
Sbjct: 291 EGGIRVPTLIRWSGVIAPNQI-SDQVIITHDITATILAATKTNTNPDYPLDGEDLLPVIQ 349
Query: 395 NIIPRYENSIL-RYENGTHEYNSPRIENSNTRYEN 428
N IP Y+ + RY+ PR+ R EN
Sbjct: 350 NEIPTYDRVLFWRYKGNNRLGLPPRVRQGAVRSEN 384
>gi|225012438|ref|ZP_03702874.1| sulfatase [Flavobacteria bacterium MS024-2A]
gi|225003415|gb|EEG41389.1| sulfatase [Flavobacteria bacterium MS024-2A]
Length = 471
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 118/346 (34%), Positives = 171/346 (49%), Gaps = 19/346 (5%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
L L + F + PP+IIFIL DD G+ D+G +G I TPN+D LA G
Sbjct: 5 LHLLILVLFCFCNTQDKVTPPNIIFILTDDQGFGDLGVYGATDIKTPNLDRLAGEGARFT 64
Query: 100 NYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
+YY Q +C+ SR++I+TG +P G+ HN + GL E L + LKE GY T I
Sbjct: 65 SYYATQPVCSASRASILTGCYPDRIGI-HNAYSPGSKVGLNPEETTLAELLKEKGYATGI 123
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR--RDLEPAWDL 216
GKWHLG E+ P GF+ + G + + H + D++ + P L
Sbjct: 124 FGKWHLG-DAPEFQPRKHGFDEYYGILYSNDMWPKHPQQGAVFNFPDIKLYENETPLRVL 182
Query: 217 HGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
+ + T T A+D I + + P F+YL H P+ PL A D N
Sbjct: 183 EDQTFLTGALTDRAIDFIKKNK-ENPFFVYLPH-----PQPHVPLFAIDSLKNTQ----- 231
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
KR + ++H++D SVG+++ LEQ ++ N+I++F SDNG + + ++ S R
Sbjct: 232 -KRGLYGDVIHEIDASVGRILITLEQEDLVENTIVIFTSDNGPWLS-YGGHSGSPGIFRE 289
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K T WEGG R G+I P + +V E DWLPTL AN
Sbjct: 290 GKGTNWEGGHRVPGIIRFPNKIAANMVIEAPAMGIDWLPTLAEFAN 335
>gi|405952520|gb|EKC20320.1| Arylsulfatase B [Crassostrea gigas]
Length = 500
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/318 (34%), Positives = 166/318 (52%), Gaps = 54/318 (16%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
FAV + LS L A++ P+IIFI DD+G ND+G++ ++ TPN+D LA +G+
Sbjct: 12 FAVCQITIQLS-----LGAANQKPNIIFIAVDDMGNNDIGYNN-PEVDTPNLDNLANNGV 65
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQ-----HNVLYGCERGGLPLSEKILPQYLKE 151
IL++ Y +C+PSR+A MTG++ G Q H E ++EK+ Y
Sbjct: 66 ILESNYVYPVCSPSRAAFMTGRYAHKIGFQRGPVEHKQPAYIESNYKTVAEKLTTNY--- 122
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
GY ++GKWHLG+ K TPT RGF+S G++ G ++Y+ +++ K D R +L
Sbjct: 123 -GYAAHMIGKWHLGYCKDAVTPTNRGFDSFYGFYGGQENYYTYTSARYK----DFRDNLT 177
Query: 212 ---------PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA 262
P D+ G YST + A++I+ NH PLFLYLA A HS PLQA
Sbjct: 178 AVTPQNPNYPREDVDG-YSTFEYKKRAIEIVGNHDKSVPLFLYLAFQAPHS-----PLQA 231
Query: 263 PDH----------YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIV- 311
P Y + + R+ F A++ +D++VG+V +E + +L N+IIV
Sbjct: 232 PQELKDKYPGDWTYTDWDGSTKTTSRATFYAMVTAVDQAVGEVYTEMETQGLLDNAIIVL 291
Query: 312 ---------FVSDNGGAA 320
F+S GG+A
Sbjct: 292 LHASDWLPTFISLAGGSA 309
>gi|403260898|ref|XP_003922887.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Saimiri boliviensis
boliviensis]
Length = 482
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 106/334 (31%), Positives = 165/334 (49%), Gaps = 42/334 (12%)
Query: 70 LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
+GW D+G +G TPN+D +A G + N+Y+ LC+PSR+A++TG+ PI G
Sbjct: 1 MGWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 60
Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
+ GG+P SE++LP+ LKE GY T+IVGKWHLG ++ ++ P GF+
Sbjct: 61 NAHARNAYTPQEIVGGIPDSEQLLPELLKEAGYVTKIVGKWHLG-HRPQFHPLKHGFDEW 119
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
G H +D+ A + + RD W++ G+Y T ++
Sbjct: 120 FGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKTGEANLTQIYLQ 171
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
EA+D I + P FLY A ATH+ P+ A +L R R ++ + +
Sbjct: 172 EALDFIKRQARRRPFFLYWAVDATHA-----PVYASKPFLGTSR------RGRYGDAVRE 220
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+D+S+GK+++ L+ R+ ++ + F SDNG A SN P K T +EGG+R
Sbjct: 221 MDDSIGKMLQLLQDLRVADSTFVFFTSDNGAALISAPEQGGSNSPFLCGKQTTFEGGMRE 280
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
L W P + G V+ Q + D T L+ A
Sbjct: 281 PALAWWPGHITPGQVSHQLGSIMDLFTTSLALAG 314
>gi|323452297|gb|EGB08171.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 889
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/333 (33%), Positives = 168/333 (50%), Gaps = 49/333 (14%)
Query: 88 IDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQ 147
++ LA G+ L ++Y CT SR+A+ TGK+P+H G+Q +V++ E G+PL E L Q
Sbjct: 1 MNKLASGGVTLTSFYASPTCTASRAALFTGKYPMHLGLQDSVIHPSEPRGVPLKETFLSQ 60
Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDM 206
LK+ GY T VGKWHLGF++ YTP+ RGF+ G TG D+ H S E M G D
Sbjct: 61 KLKDAGYGTVFVGKWHLGFHQAAYTPSQRGFDEFFGILTGGGDHVAHASTESFAMRGPDY 120
Query: 207 R-------------------RDLEPAWDLHGKYSTDVFTAEAVDIIHNHS--TDEPLFLY 245
+ D E D H ST ++TA++ + ++ D PLFL
Sbjct: 121 KSKITTMTGSNLWHNGRPVGEDEEGVRDTH---STHLYTAKSRAYVAKYAEMADVPLFLT 177
Query: 246 LAHAATHSANPYEPLQAPDHYLNIHRH---IEDFK---------RSKFAAILHKLDESVG 293
+++ A H+ P+Q PD Y++ + E + R + ++ +DES+
Sbjct: 178 VSYQAVHA-----PIQVPDEYVDGSAYANGCESLRAGSASATRLRRQLCGMVSMVDESLA 232
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-- 351
+ + L+ M N+++ F+SDNG G + +SN LRG K + +EGG+R +
Sbjct: 233 SIHDDLKAEAMWENTLLFFLSDNG----GILRHGSSNLDLRGEKGSYYEGGIRVPAFVSG 288
Query: 352 -WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+ L S G V E VH++D T L A S
Sbjct: 289 GYLAKLLSAGRVFEGLVHITDLHATALRVAGVS 321
>gi|294011191|ref|YP_003544651.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
gi|292674521|dbj|BAI96039.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
Length = 468
Score = 164 bits (416), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 116/365 (31%), Positives = 181/365 (49%), Gaps = 29/365 (7%)
Query: 29 GYRTRIMAFAVLPLAFTLSMVFVDLVASSGP--PHIIFILADDLGWNDVGFHGLDQIPTP 86
G TR A A L A V SGP P+I++I+ADD+G D+G +G I TP
Sbjct: 7 GAMTRRTALAALGSAVAAPAVARG--GRSGPRRPNILYIMADDMGHADLGCYGSRDIRTP 64
Query: 87 NIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHP--IHTGMQHNVLYGCERGGLPLSEK 143
IDA+A G+ N Y +C+P+R A++TG++ G++ + + + LP +
Sbjct: 65 AIDAIAARGVKFGNAYANSCVCSPTRIALLTGRYQGRFRIGLEEPIAFNGDELSLPRGTR 124
Query: 144 ILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWG 203
LP L++LGY T +VGKWH+G +P G++ G +G DYF H+ + G
Sbjct: 125 TLPGLLRDLGYATSLVGKWHVGELPAS-SPLDHGYDYFFGIASGGTDYFAHAT---TING 180
Query: 204 LDMRRDLEPAWDLH-GKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQ 261
+M + E ++ Y TD+ A+A+D + + D P F+ L H P+ P Q
Sbjct: 181 HEMGKLFENRTEIQRPGYLTDLLGAKAIDRMQQAARQDRPFFISL-----HFTAPHWPWQ 235
Query: 262 APDHYLNIH-----RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
P R ++ +AA++ +D +VG+V+ L + ++I++F SDN
Sbjct: 236 GPGDAAQGSAAADPRQMDGGNVRAYAAMMEGMDANVGRVLAELAALGLEEDTIVIFTSDN 295
Query: 317 GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
GG + +WPL G+K L EGG+R ++ P +G + Q D LPTL
Sbjct: 296 GGE------RFSDSWPLTGMKGELLEGGIRVPLIVSWPAHLPKGRDSAQVTMSMDALPTL 349
Query: 377 LSAAN 381
LSAA
Sbjct: 350 LSAAG 354
>gi|334145289|ref|YP_004538499.1| sulfatase [Novosphingobium sp. PP1Y]
gi|333937173|emb|CCA90532.1| sulfatase [Novosphingobium sp. PP1Y]
Length = 461
Score = 164 bits (416), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 108/345 (31%), Positives = 171/345 (49%), Gaps = 37/345 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I+FILADD+G+ D+G +G TP ID +A G++ + Y +C+ SR+A++TG+
Sbjct: 21 PNILFILADDMGYADLGCYGRTDYATPAIDRIAREGMLFRQAYANSAVCSASRTALITGR 80
Query: 119 HP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+ + G++ + G GLP S +P L+ GY T ++GKWHLG K Y P
Sbjct: 81 YQNRLPVGLEEPL--GRRVRGLPPSHPTMPSLLRSQGYETTLIGKWHLGQLPK-YGPLQS 137
Query: 177 GFESHLGYWTGHQDYFDHSAEEM-KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
G++ G G DYF H + +W D R++E Y TD+ A+ I +
Sbjct: 138 GYDHFWGLRGGGIDYFSHDFNGVPDLWEDD--REIE-----QPGYMTDLLAERAMTAIDD 190
Query: 236 HST-DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK-----------RSKFAA 283
++ +P F+ L H P+ P + PD R K R +A
Sbjct: 191 YAARQQPFFMSL-----HFTAPHWPWEGPDDQTESQRLAASDKPFSMFDTDGGSRKTYAE 245
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
++ ++D +G+++E L++ + ++IIVF SDNGG + WP G+K L EG
Sbjct: 246 MVTRMDMEIGRILERLDRLGLAEDTIIVFTSDNGGE------RYSKTWPYNGLKTELLEG 299
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
G+R ++ P + G +EQ + DW+PT LS S P Y
Sbjct: 300 GLRVPAVMRWPRRIAAGQESEQVIMSMDWMPTFLSLIQASPDPRY 344
>gi|284040883|ref|YP_003390813.1| sulfatase [Spirosoma linguale DSM 74]
gi|283820176|gb|ADB42014.1| sulfatase [Spirosoma linguale DSM 74]
Length = 453
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 124/386 (32%), Positives = 199/386 (51%), Gaps = 38/386 (9%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I+ ILADDLG+ D+ +G I TP+ID+L +G+ ++Y +C+PSR+A+++G+
Sbjct: 29 PNILLILADDLGYGDLSSYGAPDIRTPHIDSLVRAGMRFSHFYANSSVCSPSRAALLSGR 88
Query: 119 HPIHTGMQHNVLYGCER--GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+P G+ + + G L S +LP LK+ GY T +VGKWHLG + P R
Sbjct: 89 YPEQVGVPGVIRTMPDDNWGYLSPSAVLLPSILKKNGYYTALVGKWHLGL-EPPNLPNDR 147
Query: 177 GFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
GF+ G+ D Y+ H + L+ R+ + P ++TD+FT A D +
Sbjct: 148 GFDLFHGFEGDMMDDYYTHLRHDRNYMRLN-RQTINPQ-----GHATDLFTQWATDYLEQ 201
Query: 236 HSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE---DFKRSKFAAILHKLDES 291
+ P FLYLA+ A P++P+Q P +L + + KR+K ++ +D+
Sbjct: 202 RAGQSNPFFLYLAYNA-----PHDPIQPPADWLAKVKARQPGISEKRAKLVGLIEHMDDG 256
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR-GAGL 350
+GKV++ L + + N++IVFVSDNGG + A+N PLR K ++EGG+R A +
Sbjct: 257 IGKVIQTLRAKGLYENTLIVFVSDNGGKL----FDGATNGPLRSGKGHMYEGGIRIPACV 312
Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENG 410
+W + ++ ++Q + + D PTL A +N ++ R SILR E
Sbjct: 313 VWPGKVAAQS-QSQQPLLLMDIFPTLAEATG-----TVINYPIDG---RSFLSILRGERQ 363
Query: 411 THEYNSPRIENSNTRYENGTHEYNPK 436
P R E G+ EYN K
Sbjct: 364 LLAAERPLF---FIRREGGS-EYNGK 385
>gi|372210598|ref|ZP_09498400.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 468
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 113/355 (31%), Positives = 180/355 (50%), Gaps = 32/355 (9%)
Query: 46 LSMVFVDLVASSGPP-HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYT 103
+++VF L S+ +I+F+ ADD G+ D GF G PTPN+D LA G++ K Y T
Sbjct: 11 VALVFFSLSTSAQKKKNIVFLFADDAGYFDFGFQGSKTFPTPNLDQLAKEGMVFKQAYTT 70
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYG---------CERGGLPLSEKILPQYLKELGY 154
+C PSR+ ++TG++ G + N + G + GLPL EK + YL +LGY
Sbjct: 71 AAVCGPSRAGLLTGRYQQRFGFEENNVPGYMSKSSKLLGDDMGLPLDEKTMADYLGKLGY 130
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD-LEPA 213
++ ++GKWH+G Y P RGF G+ G + ++ + ++ D D LE
Sbjct: 131 QSIVLGKWHMG-NADRYHPLKRGFTEFYGFRGGARSFYPLTQKQ----AADKPEDRLEIG 185
Query: 214 WDLHG---KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
+ + KY T A D I + + +P F+Y++ A HS P+QA L
Sbjct: 186 YKKYQEPKKYLTYDLADAACDFI-DRNKKKPFFMYVSFNAVHS-----PMQATQEDLKKI 239
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
+ + KR AA+ LD++ G+++E L++ + +++IVF +DNGG N
Sbjct: 240 KGLTG-KRKVLAAMTLALDKACGQILEKLKEEGLEDDTLIVFSNDNGGPDG----TKTCN 294
Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS-AANKSD 384
+PL G K+ EGG+R ++ P + G E V D LPT ++ A K+D
Sbjct: 295 YPLSGCKSNNLEGGIRVPFVMKLPKVIQPGSSYEYPVSTLDLLPTFVNIAGGKAD 349
>gi|323452121|gb|EGB07996.1| hypothetical protein AURANDRAFT_64538 [Aureococcus anophagefferens]
Length = 1591
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 109/353 (30%), Positives = 172/353 (48%), Gaps = 33/353 (9%)
Query: 47 SMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPN-IDALAYSGIILKNYYTVQ 105
S V A G P I+ + DD+G ND+ + D +P P + L G+ L +YY
Sbjct: 1082 SAAHVARFADDGRPDIVLVFMDDVGLNDL-WRSTD-LPKPTEMSKLVRDGVELTSYYGQS 1139
Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGL-----------PLSEKILPQYLKELGY 154
LC+P+R+ +MTGK H + + ++GG+ PL +LPQ +K LGY
Sbjct: 1140 LCSPARATLMTGK------FAHKIGFSDQQGGVREVTAYSNFSVPLGHDMLPQGMKRLGY 1193
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
+T +GKW++G +Y P RGF++ +GY+T Y DH ++ + ++
Sbjct: 1194 QTHAIGKWNIGHCNVKYMPWQRGFDTFVGYFTDGIGYTDHVSDTANTYTVNDGGLAFNGS 1253
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP-------DHYL 267
+ G Y+T +FT A ++H+ D PLF++LA+ H + D +
Sbjct: 1254 EYEGTYTTALFTERAEKVLHDAPEDAPLFMWLAYHGMHDNDGVTDADTCAQTGDDDDDAI 1313
Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
+ R F R F L +D VG+V AL+ R + ++V SDNGG A G++
Sbjct: 1314 FMERGHLPFARFNFGCGLRAIDRGVGQVRTALDDRA--RDYVLVVTSDNGGFACGYHCQ- 1370
Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPL---LESRGIVAEQYVHVSDWLPTLL 377
SN+P RG K +EGG+R G+++SP RG + +H DW T +
Sbjct: 1371 GSNYPYRGAKFYEFEGGLRLPGVVYSPSRVPARRRGAAYDGLMHHVDWFGTFV 1423
>gi|239626021|ref|ZP_04669052.1| arylsulfatase E [Clostridiales bacterium 1_7_47_FAA]
gi|239520251|gb|EEQ60117.1| arylsulfatase E [Clostridiales bacterium 1_7_47FAA]
Length = 483
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 121/372 (32%), Positives = 184/372 (49%), Gaps = 29/372 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+II L DD G+ D+ G + TPN+D LA G ++Y +C+PSR+ ++TG+
Sbjct: 17 PNIIVFLTDDQGYGDLSCMGSTDVCTPNLDILAAGGARFTDFYAGSAVCSPSRACLLTGR 76
Query: 119 HPIHTGMQHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
+P TG++ ++L G + GL LK+LGY T +VGKWHLG E PT G
Sbjct: 77 YPYMTGVR-SILGGIKTTTGLNPGIPTFASALKDLGYTTGMVGKWHLGAV-PECRPTHMG 134
Query: 178 FESHLGYWTGHQDYFD--HSAEEMKMWGLDMRRDL----EPAWDLHGKYSTDVFTAEAVD 231
F+ G+ +G DYF H E G++ DL E G+YST++F + ++
Sbjct: 135 FDYFCGFLSGVNDYFSHIHYTEANSHPGINPNHDLWENDERCLKYTGEYSTELFARKGLE 194
Query: 232 IIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
I D P LY A A P+ P+ AP YL +H+ + R AA+L +D+
Sbjct: 195 FIREQVEKDMPFALYCAFNA-----PHYPMHAPYKYLERFKHLPE-DRQIMAAMLSAVDD 248
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--------LNAASNWPLRGVKNTLWE 342
VG+++ L++R + +++II F SDNG + N S L+G K +L++
Sbjct: 249 GVGEIMNYLKRRGIFNDTIIYFQSDNGPSKESRNWLDERKDYYYGGSTGGLKGHKFSLFD 308
Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA--NKSD--IPNYVNSTVENIIP 398
GG+R + P + G V + +D PT ++AA N SD I V I
Sbjct: 309 GGIRVPAIFSWPAMVPAGQVISEPCMGTDIFPTFINAAGGNASDYEISGCDILPVMTIGA 368
Query: 399 RYENSILRYENG 410
R + L +E G
Sbjct: 369 RRDKDCLYWEMG 380
>gi|223936859|ref|ZP_03628768.1| sulfatase [bacterium Ellin514]
gi|223894428|gb|EEF60880.1| sulfatase [bacterium Ellin514]
Length = 460
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 118/358 (32%), Positives = 174/358 (48%), Gaps = 70/358 (19%)
Query: 32 TRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
TRI + +L +AF ++ F V ++ P+I+ ILADDLG+ D+G +G I TPN+D +
Sbjct: 2 TRISKY-LLVVAFLIAATFAPGVFAAKAPNIVIILADDLGYGDLGCYGHPSIRTPNLDRM 60
Query: 92 AYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQ---HNVLYGCERGGLPLSEKILPQ 147
A GI ++Y +CTPSR+ +MTG+ PI +GM ++VL GGLP +E +
Sbjct: 61 AAEGIRFTDFYVAANVCTPSRAGLMTGRWPIRSGMAGSTNDVLRVFSMGGLPTNEITIAA 120
Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR 207
LK GY T +GKWHLG ++ ++ PT GF DYF +GL
Sbjct: 121 ALKSKGYATACIGKWHLG-HELQFLPTHHGF-----------DYF---------YGLRFS 159
Query: 208 RDLEP---------AWDLHGKYS----------------TDV------FTAEAVDIIHNH 236
D+EP + LH K TD+ +T EA+ IH +
Sbjct: 160 NDMEPVRGKIPKNASSSLHPKLEWWNSALLQNDKILEQPTDLSTLTRRYTEEAIKFIHQN 219
Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
+P FLY H P+ PL A D + E R + ++ +LD SVG+V+
Sbjct: 220 K-KQPFFLYFPHTF-----PHVPLFASDAF------KEHSARGLYGDVVEELDWSVGQVL 267
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
L + + N+++ F SDNG +L S PL+ K + WEGG+R + W P
Sbjct: 268 NTLRKEGLAENTLVFFTSDNGPWLIR-DLAGGSAGPLKDGKGSTWEGGMREPAIAWWP 324
>gi|325110321|ref|YP_004271389.1| arylsulfatase [Planctomyces brasiliensis DSM 5305]
gi|324970589|gb|ADY61367.1| Arylsulfatase [Planctomyces brasiliensis DSM 5305]
Length = 980
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 121/369 (32%), Positives = 186/369 (50%), Gaps = 40/369 (10%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHG------LDQIPTPNIDALAYSG 95
+ F L + + ++ P+I+ ADD G D GF G + TP++DALA G
Sbjct: 7 IVFRLITILPSVAGAADQPNILIFFADDAGNADFGFQGGGINGDFANL-TPHLDALAEGG 65
Query: 96 IILKN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG-CERGGLPLSEKILPQYLKELG 153
+ N Y + +CTPSR+ ++TG++ G++ +YG GLP SE + L++ G
Sbjct: 66 VRFTNGYVSAAVCTPSRAGMLTGRYQHRFGVE--TVYGRIPEAGLPASEITMADTLRKAG 123
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD---- 209
YRT +GKWHLG + E+ P RGF+ G TG + +F + G ++R+
Sbjct: 124 YRTYAIGKWHLGEHLHEHLPNQRGFDEFYGALTGARTFFPYRGNNP---GSKLQRNGVFL 180
Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE---PLFLYLAHAATHSANPYEPLQAPDHY 266
EP L Y TD+ + V I +H + P FLYLA A H+ PL+A
Sbjct: 181 PEP---LDQPYFTDLLARQTVAYIDDHVANHANAPFFLYLAFTAVHT-----PLEADPKR 232
Query: 267 LNIHRHIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
L+ R I+D +R AA+ +D++VG V+ L + + N+I+VF+SDNGG
Sbjct: 233 LD-DRRIQDISPPQRKTLAAMTIAMDDAVGTVMAKLRENNLTENTIVVFLSDNGGPEDNR 291
Query: 324 NLNA---ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY---VHVSDWLPTLL 377
+L A + N L+G K+ +EGG+R +I P +RG+ + V D PT
Sbjct: 292 SLRAPNWSDNGALQGNKSQNFEGGIRVPFVIHWPRGIARGLSGKSLPDVVTALDLFPTFA 351
Query: 378 SAANKSDIP 386
AA K+ +P
Sbjct: 352 EAA-KATLP 359
>gi|86142609|ref|ZP_01061048.1| arylsulfatase A [Leeuwenhoekiella blandensis MED217]
gi|85830641|gb|EAQ49099.1| arylsulfatase A [Leeuwenhoekiella blandensis MED217]
Length = 477
Score = 163 bits (413), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 115/337 (34%), Positives = 171/337 (50%), Gaps = 58/337 (17%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+ + I ADD+G+ D+ +G I TPN+D +A G N+Y +CTPSR+AIMTG+
Sbjct: 32 PNFVIIFADDMGYGDLSSYGHPTIKTPNLDKMAMEGQKWTNFYVGSSVCTPSRAAIMTGR 91
Query: 119 HPIHTGMQ---HNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
P+ GM H VL GLP SE + + LK+ GY T GKWHLG K+EY P
Sbjct: 92 LPVRNGMTSKVHRVLQPWSTNGLPQSEITIAEQLKKGGYGTAAFGKWHLG-DKEEYLPVN 150
Query: 176 RGFESHLGYWTG---------------HQDYFD----HSAEEMKMWGLDMRRDLEPAWDL 216
GF+ YW G QDY++ +E+ + + + R+ E
Sbjct: 151 NGFD----YWYGVPYSNDMDITLEMNEPQDYWNFWKSEERKEISSFNIPLMRNTEIIERP 206
Query: 217 HGKYS-TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
+++ T +T EA++ I H + P F+YLA+ P+ PL A + +
Sbjct: 207 ANQHTITKRYTNEAIEYIKEHK-ENPFFIYLAYNL-----PHVPLFASEEFEG------K 254
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG--------GAAAGFNLNA 327
KR + ++ ++D +VG+V+EAL+++ + N+I+VF SDNG G +AG
Sbjct: 255 SKRGLYGDVVEEIDHNVGRVMEALKEQGIAENTIVVFTSDNGPWLVMDEEGGSAGL---- 310
Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAE 364
LR K + WEGG+R G+ WSP GIV E
Sbjct: 311 -----LRDGKGSTWEGGMREPGIFWSPGNLKPGIVTE 342
>gi|325108489|ref|YP_004269557.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324968757|gb|ADY59535.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 455
Score = 163 bits (412), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 116/338 (34%), Positives = 170/338 (50%), Gaps = 27/338 (7%)
Query: 50 FVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCT 108
+ L A + P+++ I +DD G D G +G D + TP ID LA +GI Y +C+
Sbjct: 22 WTSLEARAERPNVLVIYSDDQGTIDAGCYGSDDLTTPAIDHLAETGIRFTQMYAPSCICS 81
Query: 109 PSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
SR+ ++TG+ P+ G+ NV G+P SE L K+ GYRT VGKWHLG Y
Sbjct: 82 ASRAGLLTGRLPVRAGVPANVSSKKGAEGMPASEVTLGDVFKQAGYRTAHVGKWHLG-YT 140
Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD-----LH-GKYST 222
P +GF+S G+ G D + H W R DL W H G++
Sbjct: 141 PATMPLGQGFDSSFGHMGGCIDNYSH----FFYWNGPNRHDL---WRDGKEVFHDGQFFP 193
Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
D+ E ++ S ++PLF+Y A P+ P Q D + N + +E +R ++A
Sbjct: 194 DLLVDEMQRLLTEKS-EQPLFVYWAINV-----PHYPYQGTDKWRNYYADLETPRR-EYA 246
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA---AGFNLNAASNWPLRGVKNT 339
A + LDE +G+V+ LE+ N+I+VF SD+G + A F +A P RG K
Sbjct: 247 AFVSTLDEKLGEVLATLEETGQRENTIVVFQSDHGHSTEERAFFGGGSAG--PYRGAKGC 304
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
L+EGG+R +I P +G V +Q DWLPTL+
Sbjct: 305 LFEGGIRVPSVISWPAGLPQGEVRDQMAVGCDWLPTLV 342
>gi|410912979|ref|XP_003969966.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Takifugu
rubripes]
Length = 519
Score = 163 bits (412), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 119/373 (31%), Positives = 181/373 (48%), Gaps = 44/373 (11%)
Query: 35 MAFAVLPLAFTLSM-VFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
M F L L T++ + P+II +L DD+GW D+G G TPN+DA+A
Sbjct: 1 MGFETLILLCTITCYCLAQKPGNVSSPNIIIMLMDDMGWGDLGAFGQPSKETPNLDAMAA 60
Query: 94 SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKIL 145
G++L N+YT LC+PSR+A++TG+ P+ G N E GG+ E +L
Sbjct: 61 QGMLLLNFYTANPLCSPSRAALLTGRLPVRNGFYTTNGHARNAYTPQEIVGGISKDEILL 120
Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
PQ LK+ GY +IVGKWHLG ++ +Y P GF+ G H +++S +
Sbjct: 121 PQMLKKRGYFNKIVGKWHLG-HRPQYLPLEHGFDEWFGAPNCHFGPYNNSVRP----NIP 175
Query: 206 MRRDLEPAWDLHGKYS--------------TDVFTAEAVDIIHNHS-TDEPLFLYLAHAA 250
+ R+ +W L G+Y T ++ E +D I + + +P FLY A A
Sbjct: 176 VYRN---SWML-GRYYEEFKIDKKTGESNLTQMYLLEGLDFIQSQAEAQKPFFLYWAPDA 231
Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
TH+ P+ A +L +R ++ + +LD SVG+++ L + + SN+ +
Sbjct: 232 THA-----PVYASKDFLG------KSQRGRYGDAVMELDYSVGQILSLLRRLGIDSNTFV 280
Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
F SDNG A + SN P K T +EGG+R + W P G V+ Q +V
Sbjct: 281 FFTSDNGAALMSGPNESGSNGPFLCGKETTFEGGMREPAIAWWPGHIREGTVSFQLANVM 340
Query: 371 DWLPTLLSAANKS 383
D T L+ A S
Sbjct: 341 DLFTTSLALAGMS 353
>gi|390167238|ref|ZP_10219235.1| putative arylsulfatase A [Sphingobium indicum B90A]
gi|389590183|gb|EIM68184.1| putative arylsulfatase A [Sphingobium indicum B90A]
Length = 468
Score = 163 bits (412), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 113/363 (31%), Positives = 180/363 (49%), Gaps = 25/363 (6%)
Query: 29 GYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNI 88
G TR A A L A V + + P+I++I+ADD+G D+G +G I TP I
Sbjct: 7 GAMTRRTALAALGSAVAAPAVARGGRSGARRPNILYIMADDMGHADLGCYGSRDIRTPAI 66
Query: 89 DALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHP--IHTGMQHNVLYGCERGGLPLSEKIL 145
DA+A G+ N Y +C+P+R A++TG++ G++ + + + LP + L
Sbjct: 67 DAIAARGVKFGNAYANSCVCSPTRIALLTGRYQGRFRIGLEEPIAFNGDELSLPRGTRTL 126
Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
P L++LGY T +VGKWH+G +P G++ G +G DYF H+ + G +
Sbjct: 127 PGLLRDLGYATSLVGKWHVGELPAS-SPLDHGYDYFFGIASGGTDYFAHAT---TINGHE 182
Query: 206 MRRDLEPAWDLH-GKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAP 263
M + E ++ Y TD+ A+A+D + + D P F+ L H P+ P Q P
Sbjct: 183 MGKLFENRTEIQRPGYLTDLLGAKAIDRMRLAARQDRPFFISL-----HFTAPHWPWQGP 237
Query: 264 DHYLNIH-----RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
R ++ +AA++ +D +VG+V+ L + ++I++F SDNGG
Sbjct: 238 GDAAQGSAAADPRQMDGGNVRAYAAMMEGMDANVGRVLAELAALGLQEDTIVIFTSDNGG 297
Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
+ +WPL G+K L EGG+R ++ P +G + Q D LPTLLS
Sbjct: 298 E------RFSDSWPLTGMKGELLEGGIRVPLIVSWPAHLPKGRDSAQVTMSMDALPTLLS 351
Query: 379 AAN 381
AA
Sbjct: 352 AAG 354
>gi|87198499|ref|YP_495756.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
gi|87134180|gb|ABD24922.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
Length = 491
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 111/357 (31%), Positives = 173/357 (48%), Gaps = 47/357 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I++I+ADDLG+ D+ +G TP +D LA G+ N Y +CT +R ++TG+
Sbjct: 56 PNILYIMADDLGYADLSCYGRRDFETPVLDKLAAQGLRFTNAYANSAVCTATRVGLITGR 115
Query: 119 HPIHTGMQHNVLYGCERG-------GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
+ Q+ + G E GLP S LP L + GYRT ++GKWHLG ++
Sbjct: 116 Y------QYRLPVGLEEPLAFRPNIGLPPSHPTLPSLLAKAGYRTSLIGKWHLGSLP-DF 168
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
P G+++ G +G DY+ H+ + +W D +E A Y TD+ A
Sbjct: 169 DPLKSGYQTFWGIRSGGVDYYTHATSNGQPDLW--DGPTPVERA-----GYLTDLLADRA 221
Query: 230 VDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHR-----------HIEDFK 277
V I S+ E P F+ L H P+ P + PD R H +
Sbjct: 222 VSEIREASSGEAPWFMSL-----HFTAPHWPWEGPDDASESARIAKLKDPSALFHFDGGS 276
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
+ +AA++ +LD +G+V+EAL+ R ++I+VF SDNGG + WP G K
Sbjct: 277 AAIYAAMVRRLDYQIGRVLEALKANRAEQDTIVVFTSDNGGE------RFSDTWPFSGRK 330
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVE 394
L EGG+R ++ P + G ++ + DWLPT L+AA + P + + V+
Sbjct: 331 TELLEGGLRIPAIVRWPGVTRAGTTSDAQIISMDWLPTFLAAAGSAPDPGHPSDGVD 387
>gi|260060774|ref|YP_003193854.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
gi|88784904|gb|EAR16073.1| arylsulfatase A (precursor) [Robiginitalea biformata HTCC2501]
Length = 526
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 113/334 (33%), Positives = 170/334 (50%), Gaps = 31/334 (9%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
PP+I+ I DD G++DVG +G IPTPN+DA+A G++L N+Y Q +C+ SR+ ++TG
Sbjct: 73 PPNIVIIFTDDQGYSDVGVYGARDIPTPNLDAMAADGLLLTNFYAAQPVCSASRAGLLTG 132
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
+P G+ HN L GL +E+ L + L++ GYRT I GKWHLG + ++ PT G
Sbjct: 133 CYPNRVGI-HNALMPNSPVGLNPAEETLAELLRQQGYRTGIFGKWHLGDH-PDFLPTRHG 190
Query: 178 FESHLG------YWTGHQ---DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
F+ G W H FD + + R ++ D + T T
Sbjct: 191 FDEFFGIPYSNDMWPLHPLQGPVFDFGPLPL----YEQERVVDTLEDQ--RLLTRQITER 244
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
+VD I+ H +EP FLY+ H P+ PL D + R + ++ ++
Sbjct: 245 SVDFINRHK-EEPFFLYVPH-----PQPHVPLFVSDAFRGKS------GRGLYGDVIMEI 292
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
D SVG+V+ ALE + ++ ++F SDNG A N + + PLR K T WEGGVR
Sbjct: 293 DWSVGQVLGALEDNGLTDDTWVIFTSDNGPWLAYGNHSGRAE-PLREGKGTNWEGGVREP 351
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
++ P RG V ++ + D LPT+ S
Sbjct: 352 CIMKFPGRLPRGKVLDEPLMAIDLLPTIASVTGS 385
>gi|343086301|ref|YP_004775596.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354835|gb|AEL27365.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 481
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 113/341 (33%), Positives = 174/341 (51%), Gaps = 29/341 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+II I ADDLG+ D+G G I TPN+D +A+ G N+Y +CTPSR+ ++TG+
Sbjct: 33 PNIIVIFADDLGYGDLGVFGHPSIKTPNLDKMAFEGQKWTNFYVAAPVCTPSRAGLLTGR 92
Query: 119 HPIHTGMQHN---VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
PI +GM + VL+ GGLP SE + + LK GY+T +GKWHLG +K Y PT
Sbjct: 93 LPIRSGMSSDDRRVLFPDSDGGLPQSEITIAKALKGNGYQTAAIGKWHLG-HKSPYLPTD 151
Query: 176 RGFESHLGY-WTGHQDYFDHSA-------EEMKMWGLDMRRDLEPA-WDLHGKYSTDVFT 226
GF+S+ G ++ D D + E+ + + + +D E A + T +T
Sbjct: 152 HGFDSYFGIPYSNDMDKIDKTDHFTLTDNEKFDAYNVPLMKDAEIAERPTDQRTLTKRYT 211
Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
EAV I DEP F+YLAH H + D L + ++
Sbjct: 212 EEAVSKIKAFK-DEPFFIYLAHNLPHIPL-FRSAAFKDQSLG----------GIYGDVIE 259
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
++D SVG+++ L++ + N+++VF SDNG F + + LRG K +EGG+R
Sbjct: 260 EIDWSVGQILSTLKEEGIAENTLVVFTSDNGPWHV-FKTHGGTAGLLRGAKGGTFEGGMR 318
Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
+ W P G+V + + D LPT A + +++P+
Sbjct: 319 EPTVFWWPAQIKPGVVMDMGTTM-DLLPT-FCAISGTELPD 357
>gi|149197416|ref|ZP_01874467.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139434|gb|EDM27836.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 455
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 115/357 (32%), Positives = 181/357 (50%), Gaps = 35/357 (9%)
Query: 42 LAFTLSMVFVD---LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-I 97
+ S++F L ++ P+II ILADDLG+ D+GF G I TP+IDALA SG+
Sbjct: 1 MKLIFSLIFFTYSTLALAAQKPNIILILADDLGYEDLGFLGAPDIKTPHIDALARSGMNF 60
Query: 98 LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER------GGLPLSEKILPQYLKE 151
+ Y + +C PSR+ ++TG++ G N E G+PL E+++ LK
Sbjct: 61 TQGYQSASVCGPSRAGLLTGRYQQLFGSGENPPETGELSKRFPDAGIPLDEQMIFDLLKP 120
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR---- 207
Y T ++GKWH+G E PT R + + G+ G Y +M M G M
Sbjct: 121 AAYTTGVIGKWHMGL-SHEQRPTQRSVDYYYGFLNGAHSY---REAKMDMKGAPMTWPIF 176
Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
R+ EP Y+T+VF E V+ I + D+P FLY+++ + H P +A L
Sbjct: 177 RNNEPV--PFSGYTTEVFNDEGVNFI-KRNKDKPFFLYMSYNSVHG-----PWEAQPKDL 228
Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA-------A 320
HI+ R ++A+L +D+ VG++++ L+ + N++++F+SDNG
Sbjct: 229 QRSDHIKKKWRRIYSAMLISMDDGVGRLIQTLKDEGIYENTLVIFMSDNGAPNNLHEAER 288
Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
AG L ASN LRG K +EGG+R ++ P + + + V D +PTL+
Sbjct: 289 AGDYL--ASNGSLRGRKGDTYEGGIRVPYIMSWPQVIPKQSTYQHPVSGLDIVPTLI 343
>gi|325285326|ref|YP_004261116.1| arylsulfatase [Cellulophaga lytica DSM 7489]
gi|324320780|gb|ADY28245.1| Arylsulfatase [Cellulophaga lytica DSM 7489]
Length = 466
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 103/340 (30%), Positives = 167/340 (49%), Gaps = 20/340 (5%)
Query: 51 VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTP 109
V L+ P+I+FIL+DD G+ D GF G + TP +D LA + + Y + +C P
Sbjct: 18 VSLLFGQQKPNIVFILSDDAGYADFGFQGSTEFKTPELDKLAKKSVKFTQAYVSAAVCGP 77
Query: 110 SRSAIMTGKHPIHTGMQHNVLYG--------CERGGLPLSEKILPQYLKELGYRTRIVGK 161
SR+ ++TG++ G + N + G ++ GLP ++ + YLK+ GY T + GK
Sbjct: 78 SRAGMLTGQYQQKFGFEENNVPGYMSSSGLLGDQMGLPTNQLTIADYLKKQGYTTALFGK 137
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
WH+G + PT RGF+ G+ G + Y + + D + H Y
Sbjct: 138 WHMG-NADLFHPTKRGFDEFYGFRGGSRSYLPYDKNNVLTKNEDRLEKGFGNYLEHKGYL 196
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
TD EA+ I N + P F+YL+ A H+ P++A L H+ KR
Sbjct: 197 TDKLANEAISFI-NKNKKTPFFIYLSFNAVHT-----PMEATKEDLKYFSHLTG-KRKTL 249
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
AA+ +D ++G V++ L+++ + N+++VF +DNGG + + N ++N PL G K
Sbjct: 250 AAMTLAMDRAIGTVLDELKKQGLDKNTLVVFTNDNGGPS---DANQSNNNPLSGTKANHL 306
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
EGG+R L+ P + + D PT L+ AN
Sbjct: 307 EGGIRVPFLMSWPGKLKKNTTYNYPISTLDLFPTFLNVAN 346
>gi|297180066|gb|ADI16291.1| arylsulfatase a and related enzymes [uncultured bacterium
HF0010_16H03]
Length = 543
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 135/431 (31%), Positives = 199/431 (46%), Gaps = 76/431 (17%)
Query: 33 RIMAFAVLPLAFTLS----------MVFVDLVASSGPPHIIFILADDLGWNDVGFH---- 78
R +A ++P+A LS V+ + P+II ILADDLG+NDV F+
Sbjct: 21 RDIALTLIPIALKLSNPISENQEINWPIVESSVITTKPNIILILADDLGFNDVSFYNGGA 80
Query: 79 GLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQH---------- 127
+ TPNID LA G+ N Y +C+PSR+AIMTG++ G +
Sbjct: 81 ADGSLMTPNIDNLAKEGVAFLNGYAASPVCSPSRAAIMTGRYSSRYGFEFTPYPAQAAKI 140
Query: 128 -NVL------------------YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
N+L G GGLP E + + L++ GY T +GKWHLG +
Sbjct: 141 MNLLREDDELETIYLEDVQLDEVGLTIGGLPDKEVTVAEMLQKNGYYTAHIGKWHLGGFT 200
Query: 169 KEYTPTFRGFE------SHLGYWTGHQDY----FDHSAEEMKMWG-------LDMRRDLE 211
P +GF+ S L H D D S E+M +W + +
Sbjct: 201 DGMKPNDQGFDDSLMLNSSLYLPKNHPDIVNAKIDSSIEDM-VWASSQFAASFNGSEPFK 259
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
P G Y TD +T EA+ +I+N+ D P FLYL H A P+ PLQ+ H
Sbjct: 260 P-----GGYITDYYTDEAIKVINNNK-DRPFFLYLGHFA-----PHNPLQSLRKDYEKHN 308
Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
H+E+ ++ ++ LD S+GK+V LE+ + N++I+F SDNGG AG+ N
Sbjct: 309 HMENHTLQVYSGMIEALDRSIGKIVTTLEENGLTENTLIIFTSDNGG--AGYIGLDNINK 366
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNS 391
P RG K T +EGG+ P + + ++ +H +D T+L AA+ + P +
Sbjct: 367 PYRGWKLTHFEGGMHIPFFAKWPAKIKKDMRYDKRIHHTDIFSTILGAAS-IEPPKEIKI 425
Query: 392 TVENIIPRYEN 402
N+IP N
Sbjct: 426 DGINLIPFLTN 436
>gi|255690671|ref|ZP_05414346.1| twin-arginine translocation pathway signal [Bacteroides finegoldii
DSM 17565]
gi|260623695|gb|EEX46566.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
Length = 453
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 105/330 (31%), Positives = 178/330 (53%), Gaps = 24/330 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+++ IL DDLG D+ + TPNID + +G+ L N+Y + +PSR+A++TG
Sbjct: 31 PNVLLILVDDLGLGDLSCQYAKDLSTPNIDRIFETGVRLDNFYANSSVSSPSRAALLTGC 90
Query: 119 HPIHTGMQHNVLYGCER--GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
P G+ + ++ G S +P+ LK GYRT ++GKWHLG ++ P R
Sbjct: 91 FPAMVGVPGVIRPSIDQNWGYFGPSAVTMPEVLKNGGYRTALIGKWHLG-WESPNLPNER 149
Query: 177 GFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
GF+ G+ D Y+ H + L+ ++++P ++T++FT+ +VD I
Sbjct: 150 GFDHFHGFLADMMDDYYTHRRQGGNYMYLN-DKEIDPK-----GHATELFTSWSVDYIKK 203
Query: 236 HSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED---FKRSKFAAILHKLDES 291
+ ++ P FLYLA+ A HS PLQ P ++N + + KR++ A++ LD +
Sbjct: 204 EAKEKNPFFLYLAYNAPHS-----PLQPPVEWVNKVQERDKSLPVKRARLIALIEHLDYN 258
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+GKV+++LE+ L+N++++F SDNGG + A+N P RG K ++EGG+ A +
Sbjct: 259 IGKVIQSLEESGQLNNTLVIFASDNGGDRG----SMANNGPTRGAKGDMFEGGIHVACAL 314
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
P + G +V + D +PT+ N
Sbjct: 315 NMPGVFEGGRRDNHFVVMMDLMPTICDFVN 344
>gi|344292944|ref|XP_003418184.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Loxodonta
africana]
Length = 513
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 116/365 (31%), Positives = 179/365 (49%), Gaps = 47/365 (12%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + V + PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++
Sbjct: 13 LLLVLSAAGLGAVGAPQPPNIVLMLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSF 72
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER---------GGLPLSEKILPQYLKE 151
Y+ LC+PSR+A++TG+ PI G G R GG+P SE +LP+ LK+
Sbjct: 73 YSANPLCSPSRAALLTGRLPIRNGFY--TTNGHARNAYTPQDIVGGIPDSEHLLPELLKK 130
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
Y T+IVGKWHLG ++ ++ P GF+ G H +D+ A+ + + RD
Sbjct: 131 ANYATKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRAKP----NIPVYRD-- 183
Query: 212 PAWDLHGKYS--------------TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANP 256
W++ G++ T ++ EA+D I S P FLY A ATH+
Sbjct: 184 --WEMVGRFYEEFPINLKTGEANLTQIYLQEALDFIKRQQSQQRPFFLYWAIDATHA--- 238
Query: 257 YEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
P+ A +L +R ++ + ++D+SVG++++ L + ++ I F SDN
Sbjct: 239 --PVYASKPFLGTS------QRGRYGDAVREVDDSVGQLLQLLRGLGIAEHTFIFFTSDN 290
Query: 317 GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
G A A SN P K T +EGGVR + W P G V+ Q + D T
Sbjct: 291 GAALVSAPKEAGSNGPFLCGKQTTFEGGVREPAIAWWPGHIPAGQVSHQLGSLMDLFTTS 350
Query: 377 LSAAN 381
LS A
Sbjct: 351 LSLAG 355
>gi|355710478|gb|EHH31942.1| hypothetical protein EGK_13112 [Macaca mulatta]
Length = 482
Score = 162 bits (411), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 106/334 (31%), Positives = 165/334 (49%), Gaps = 42/334 (12%)
Query: 70 LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
+GW D+G +G TPN+D +A G++ N+Y+ LC+PSR+A++TG+ PI G
Sbjct: 1 MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 60
Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
+ GG+P SE++LP+ LK+ GY ++IVGKWHLG ++ ++ P GF+
Sbjct: 61 NAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEW 119
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
G H +D+ A + + RD W++ G+Y T ++
Sbjct: 120 FGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKTGEANLTQIYLQ 171
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
EA+D I + P FLY A ATH+ P+ A +L +R ++ + +
Sbjct: 172 EALDFIKRQARHHPFFLYWAVDATHA-----PVYASKPFLGTS------QRGRYGDAVRE 220
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+D+SVGK++E L + N+ + F SDNG A A SN P K T +EGG+R
Sbjct: 221 IDDSVGKMLELLHDLHVADNTFVFFTSDNGAALISAPEQAGSNGPFLCGKQTTFEGGMRE 280
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
L W P + G V+ Q + D T L+ A
Sbjct: 281 PALAWWPGHIAAGQVSHQLGSIMDLFTTSLALAG 314
>gi|198275209|ref|ZP_03207740.1| hypothetical protein BACPLE_01368 [Bacteroides plebeius DSM 17135]
gi|198271792|gb|EDY96062.1| arylsulfatase [Bacteroides plebeius DSM 17135]
Length = 509
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 118/394 (29%), Positives = 184/394 (46%), Gaps = 59/394 (14%)
Query: 39 VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
L TL+ + + + P+++FI+ DD GW DVG++G TPNID LA G+I
Sbjct: 10 TLAGGVTLAANMLHAASDNRQPNVVFIMVDDYGWADVGYNGSRFYETPNIDRLASEGMIF 69
Query: 99 KN-YYTVQLCTPSRSAIMTGKHPIHTGM-----------------QHNVLYGCERGGLPL 140
+ Y + +PSR ++MTGK+P TG+ Q+ +L +PL
Sbjct: 70 TDGYAAASISSPSRVSLMTGKYPARTGITDWIPGYQYGLKPEQLKQYKMLAPEMPLNMPL 129
Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT-GHQDYFDHSAEEM 199
E + + KE GY T VGKWH Y P ++GF+ ++G W G + S
Sbjct: 130 EEVTMAEAFKEHGYATYHVGKWHCA-EDSLYYPQYQGFDVNIGGWLKGSPNGIRRSQGGK 188
Query: 200 KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
+ R P G++ TD E++ +I N S D+P FLYLA A H+ P
Sbjct: 189 GAYCSPYRNPYLPD-GPEGEFLTDRLGDESIKLIKNSSADKPFFLYLAFYAVHT-----P 242
Query: 260 LQAPDHYLN----------------IHRHIEDFKR----------------SKFAAILHK 287
++A Y+ R++E +K +++AA+++
Sbjct: 243 IEAKPEYVKYFKWKAQRMGLDTIVPFTRNLEWYKNAEYKAGHWKERTIQSDAEYAALIYS 302
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG+V++AL+ + N+I+ +SDNGG + + N PLR K L+EGG+R
Sbjct: 303 MDENVGRVMQALKDNGLDKNTIVCLLSDNGGLSTAEG-SPTCNAPLRAGKGWLYEGGIRE 361
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+I P + G V V D+ PTLL A
Sbjct: 362 PFIIKYPQMVEAGSVCHTPVVAVDFYPTLLDMAG 395
>gi|426383228|ref|XP_004058189.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Gorilla gorilla
gorilla]
Length = 528
Score = 162 bits (410), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 104/334 (31%), Positives = 165/334 (49%), Gaps = 42/334 (12%)
Query: 70 LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
+GW D+G +G TPN+D +A G++ N+Y+ LC+PSR+A++TG+ PI G
Sbjct: 47 MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 106
Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
+ GG+P SE++LP+ LK+ GY ++IVGKWHLG ++ ++ P GF+
Sbjct: 107 NAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEW 165
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
G H +D+ A + + RD W++ G+Y T ++
Sbjct: 166 FGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKTGEANLTQIYLQ 217
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
EA+D I + P FLY A ATH+ P+ A +L +R ++ + +
Sbjct: 218 EALDFIKRQARHHPFFLYWAVDATHA-----PVYASKPFLGTS------QRGRYGDAVRE 266
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+D+S+GK++E L+ + N+ + F SDNG A SN P K T +EGG+R
Sbjct: 267 IDDSIGKMLELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMRE 326
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
L W P + G V+ Q + D T L+ A
Sbjct: 327 PALAWWPGHVAAGQVSHQLGSIMDLFTTSLALAG 360
>gi|410635995|ref|ZP_11346602.1| arylsulfatase [Glaciecola lipolytica E3]
gi|410144672|dbj|GAC13807.1| arylsulfatase [Glaciecola lipolytica E3]
Length = 499
Score = 162 bits (410), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 112/349 (32%), Positives = 173/349 (49%), Gaps = 32/349 (9%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P++I I DD G+ DVG G D + TPN+D +A G++L ++Y LC+PSR+ +MTG
Sbjct: 45 PNVIVIFTDDQGYEDVGVFGGDHVLTPNLDKMAEEGLMLTDFYVPSPLCSPSRAGLMTGS 104
Query: 119 HP----IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+P + TG VL + GL +E + + LK +GY T I GKWHLG + E+ PT
Sbjct: 105 YPRRVDMATGSNFPVLLAADTKGLNPAEITIAEVLKSVGYATGIFGKWHLG-DQPEFLPT 163
Query: 175 FRGFESHLGYWTGHQDYFDHSAE-EMKMWGLDMRRDLEPAWDL--HGKYSTDVFTAEAVD 231
+GF+ G H H + K L + + E +L + +Y T T A+D
Sbjct: 164 RQGFDEFFGLPYSHDIAPTHKRQAHFKFPDLPLMEN-ENVIELNPNPEYLTRRITERAID 222
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI-------HRHIEDF------KR 278
I H D P FLYL H P+ PL +LN +I D K+
Sbjct: 223 FIERHQ-DAPFFLYLPHPM-----PHGPLHVSQEFLNTVDKATLARLNISDADIAANSKK 276
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
+ + ++ +LD +VG +++ L++ + SN+I++F SDNG +G + + L G K
Sbjct: 277 AMYPLVIGELDNAVGNILDTLKRLNLDSNTIVIFTSDNG--PSGRRNSDGTERYLSGHKT 334
Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
EGG+R ++W P V++ D LPT+ A +D+P
Sbjct: 335 LTQEGGMRVPTVVWGPSQIPANTVSQALTTSMDILPTVAFLAG-ADVPT 382
>gi|449136530|ref|ZP_21771910.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
gi|448884847|gb|EMB15319.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
Length = 480
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 116/360 (32%), Positives = 178/360 (49%), Gaps = 56/360 (15%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
I+ ++ + ++ A S P+++ I+ADDLG+ + G G +IPTP IDALA
Sbjct: 9 IVGRCIVCVTLATLTLYWQSTAVSSQPNLVVIIADDLGYGETGMMGNAEIPTPAIDALAR 68
Query: 94 SGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER-----GGLPLSEKILPQ 147
SG+ Y T C+PSR+ M+G++ G N ER GLP +K +
Sbjct: 69 SGVRCTSGYVTSSYCSPSRAGFMSGRYQSRFGYDLNPT--GERNNHPNAGLPPQQKTFVE 126
Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY----------------WTGHQD- 190
+L+ GY T ++GKWHLG + PT +GF+ G+ WT +D
Sbjct: 127 HLQSAGYHTSLIGKWHLG-TRPPQVPTSKGFDRFFGFLHEGHFYVPGPPYENVWTMLRDN 185
Query: 191 -----YFDHSAEEMKMWGLDMRRDLEPAWD-----LHG-------KYSTDVFTAEAVDII 233
F+ + ++ + R EPA+D L G Y TD T +AVD I
Sbjct: 186 SLPAGQFETNQRTIRG---NYARINEPAYDTGNPVLDGGEPIDDWNYLTDTITDKAVDTI 242
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
+ + P + +++ A HS P+QA HI D +R FA +L LD VG
Sbjct: 243 -SQAASNPFAMVVSYNAVHS-----PMQASLEDHAAMDHIADPQRRIFAGMLIALDRGVG 296
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
++++ L+++++ +++VF SDNGG A +SN PLRG K +L+EGGVR +IWS
Sbjct: 297 RIIKKLDRQKLRQETLVVFFSDNGGPTAELT---SSNAPLRGGKGSLYEGGVR-IPMIWS 352
>gi|397468273|ref|XP_003805816.1| PREDICTED: N-acetylgalactosamine-6-sulfatase isoform 1 [Pan
paniscus]
Length = 482
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 104/334 (31%), Positives = 165/334 (49%), Gaps = 42/334 (12%)
Query: 70 LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
+GW D+G +G TPN+D +A G++ N+Y+ LC+PSR+A++TG+ PI G
Sbjct: 1 MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 60
Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
+ GG+P SE++LP+ LK+ GY ++IVGKWHLG ++ ++ P GF+
Sbjct: 61 NAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEW 119
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
G H +D+ A + + RD W++ G+Y T ++
Sbjct: 120 FGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKTGEANLTQIYLQ 171
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
EA+D I + P FLY A ATH+ P+ A +L +R ++ + +
Sbjct: 172 EALDFIKRQARHHPFFLYWAVDATHA-----PVYASKPFLGTS------QRGRYGDAVRE 220
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+D+S+GK++E L+ + N+ + F SDNG A SN P K T +EGG+R
Sbjct: 221 IDDSIGKMLELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMRE 280
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
L W P + G V+ Q + D T L+ A
Sbjct: 281 PALAWWPGHVAAGQVSHQLGSIMDLFTTSLALAG 314
>gi|414344125|ref|YP_006985646.1| sulfatase [Gluconobacter oxydans H24]
gi|411029460|gb|AFW02715.1| sulfatase [Gluconobacter oxydans H24]
Length = 478
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/335 (32%), Positives = 170/335 (50%), Gaps = 29/335 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGK 118
P+IIFILADDLG+ DV G TP+ID +A+ GI L + Y +C+ +R+A++TG+
Sbjct: 45 PNIIFILADDLGYADVSCFGTPGFTTPSIDRIAHEGIKLTRAYANSAVCSATRTALITGR 104
Query: 119 HP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+ I G++ + + GLP LP+ L+ GY+T +VGKWHLG E+ P
Sbjct: 105 YQDRIACGLEEPIASSSGKIGLPPGLPTLPEQLRRAGYKTALVGKWHLG-NPPEFGPLKS 163
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLD-MRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
G++ G G DYF H + G D RD D HG Y T + EAV ++
Sbjct: 164 GYDHFYGISGGAADYFTHQGKP----GEDQFYRDDHLIHD-HG-YLTGLLGDEAVRLVET 217
Query: 236 HSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHR---HIEDF---KRSKFAAILHKL 288
+++ P FL L H P+ P + P+ R I D+ R + ++ +
Sbjct: 218 FASETSPFFLSL-----HFNAPHWPWEGPEDEAESKRLKGKIFDYDGGTRKTYGRMIKAM 272
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
D+ +G+V+ L+ + N+I+VF SDNGG + WP G+K L EGG+R
Sbjct: 273 DDQIGRVLATLDSYHLTENTIVVFTSDNGGE------RFSDTWPFTGMKGELLEGGLRVP 326
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
++ P + G ++Q + DWLP+ L+A +
Sbjct: 327 AVLRWPARVAAGSESDQTLITMDWLPSFLAATGAT 361
>gi|116621986|ref|YP_824142.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
gi|116225148|gb|ABJ83857.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
Length = 461
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 118/324 (36%), Positives = 163/324 (50%), Gaps = 33/324 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+I+ ILADDLG+ D+G +G I TPNID LA G ++Y+ +C+PSR+A+MTG+
Sbjct: 28 PNIVVILADDLGYGDLGCYG-SPIATPNIDRLAEEGARFTSFYSASPVCSPSRAALMTGR 86
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+P T ++ V+ G GLP SE + Q LK GYRT +GKWH+G Y PT RGF
Sbjct: 87 YP--TRVEVPVVLGPGDAGLPDSEITMAQVLKSAGYRTSCIGKWHIG-STPGYLPTNRGF 143
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+ G +SA+ + + PA D S+ FT EA+D + +
Sbjct: 144 DEFFG--------VPYSADITPCPLMRGSSVVAPAVDCSTLTSS--FTQEALDFMR-RAQ 192
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
D P FLYLAH A P+ PL A R +A ++ +LD S G+V+ A
Sbjct: 193 DNPFFLYLAHTA-----PHLPLAAS------PRFAGQSGLGMYADVVQELDWSTGQVMAA 241
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L+ + SN++++F SDNG G S LRG K +EGG+R L P +
Sbjct: 242 LKATGLDSNTLVMFSSDNGPWYQG------SQGKLRGRKGETYEGGMREPFLARYPGVIP 295
Query: 359 RGIVAEQYVHVSDWLPTLLSAANK 382
GI D LPTL A
Sbjct: 296 SGIGCAGLATTMDLLPTLARLAGA 319
>gi|443716207|gb|ELU07840.1| hypothetical protein CAPTEDRAFT_136127, partial [Capitella teleta]
Length = 311
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 92/275 (33%), Positives = 146/275 (53%), Gaps = 13/275 (4%)
Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
SR + ++G++ + MQH V+ + L L ILP YLKELGY T GKWHLG+ +
Sbjct: 1 SRGSFLSGRYSYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRD 60
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
E TPT RGF+S G ++G ++++H+ + D E +D GK+S D+
Sbjct: 61 ECTPTHRGFDSFSGGFSGEGEFYEHTTATGGYY--DWHLGTEVDYDAIGKHSEDLIGYYV 118
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE-DFKRSKFAAILHKL 288
+ + PLF+Y+A HS PL +L ++ ++ R K+ ++ +
Sbjct: 119 NKTLDEYDQSSPLFMYVAFHNVHS-----PLDPKPEFLALYDDLDVTDDRKKYLGLVSGM 173
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
D +G +V+ +++ + N+ I+F SDNGG ++ N P RG K+TLWEGG +
Sbjct: 174 DYIIGGIVDKIKEIGIYENTYILFSSDNGG-----DVGEGDNSPHRGGKSTLWEGGCKAN 228
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
SPLL + G+ +HV+DWLPT++ A S
Sbjct: 229 SWTHSPLLGASGVENNGLMHVTDWLPTIVDLAGGS 263
>gi|311745970|ref|ZP_07719755.1| arylsulfatase [Algoriphagus sp. PR1]
gi|311302452|gb|EAZ80458.2| arylsulfatase [Algoriphagus sp. PR1]
Length = 476
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 115/365 (31%), Positives = 172/365 (47%), Gaps = 51/365 (13%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
AS P+IIFILADDLG+ D+GF G + I TPNID LA G+ ++Y+ +C PSRS+
Sbjct: 14 ASPEKPNIIFILADDLGYGDLGFLGQEYIETPNIDRLAKEGMFFSDFYSGSTVCAPSRSS 73
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKIL--PQYLKELGYRTRIVGKWHLGFYKKEY 171
+TG H HT ++ N E G P+ + +L + +K+ GY T GKW LGF
Sbjct: 74 FLTGMHTGHTPIRGNSEVQPE-GQFPMPDSVLTVAKVMKQAGYVTGAFGKWGLGFIGSTG 132
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA--WDLHGKYSTDVFTAEA 229
P +GFE GY + + H +W D + DL P W G Y+ D+ +
Sbjct: 133 EPASQGFEEFYGY---NCQRYAHRYYPAYLWHNDEKIDL-PGNDWTTKGDYAPDLIQEKT 188
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD-HYLNIHRHI--------------- 273
++ I + + + P FL++ H+ L APD + +R +
Sbjct: 189 LEFI-DENKENPFFLFMPIVTPHAE-----LAAPDDEIMEKYRAMFPDEKPYVAGNGADY 242
Query: 274 -----------EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN-----G 317
+ + + FAA++ ++D VG+V++ LE + N++I+F SDN G
Sbjct: 243 GPDMRIPGYQSQPYPHATFAAMVERIDRYVGEVLQKLESNGLSENTVIIFTSDNGAHREG 302
Query: 318 GAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
GA F SN RG K L+EGGVR + W P G + D LPT
Sbjct: 303 GADPEF---FDSNGKFRGFKRDLYEGGVRAPMIAWWPNKIKAGSTTDHVSAFWDLLPTFA 359
Query: 378 SAANK 382
+ +
Sbjct: 360 AIGGE 364
>gi|343082868|ref|YP_004772163.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342351402|gb|AEL23932.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 462
Score = 161 bits (408), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 108/348 (31%), Positives = 184/348 (52%), Gaps = 38/348 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY-TVQLCTPSRSAIMTGK 118
P+++ I++DD G++D+G G I TP +D LA G+ L N+Y T CTPSRS ++TG+
Sbjct: 25 PNVLLIISDDQGYHDLGSFGNKDILTPYLDQLAAEGVRLTNFYVTSSACTPSRSGLLTGR 84
Query: 119 HPIHTG----MQHNVL-YGCERG------------GLPLSEKILPQYLKELGYRTRIVGK 161
+P G ++N++ YG + G L E + + LKE GY +GK
Sbjct: 85 YPQRNGTFELFRNNMVNYGHQYTDYEYSVSPERILGTDLREIFISELLKEGGYTNGYIGK 144
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
W LG K+ Y P +G++ + G+ DYF H + M + + + G+Y+
Sbjct: 145 WDLGQLKR-YLPMQQGYDKYYGFANTGIDYFTHE----RYGSPSMIDGNQLSLEDKGRYT 199
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP-----LQAPDHYLNIH---RHI 273
T +F EA++ I N S +P FL LA A HSA+ +P +QA + YL ++
Sbjct: 200 TALFEREALEFI-NGSQGKPFFLTLAFNAPHSASNLDPEIKGTVQATEKYLKMYPKSNTK 258
Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
K+ + A + ++D+S+G ++E +++ + N++++F+SDNGG ++ A N+PL
Sbjct: 259 RGEKQRAYKAAVTQMDQSIGTILERIKEMGLEENTLVIFLSDNGG------MSLADNYPL 312
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+G K +EGGVR L+ P G + ++ + P +++AA
Sbjct: 313 KGGKAQFFEGGVRVPCLVKWPEKIDAGQTNDAFLSSLELFPMIVAAAG 360
>gi|355757044|gb|EHH60652.1| hypothetical protein EGM_12064 [Macaca fascicularis]
Length = 482
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 105/334 (31%), Positives = 164/334 (49%), Gaps = 42/334 (12%)
Query: 70 LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
+GW D+G +G TPN+D +A G + N+Y+ LC+PSR+A++TG+ PI G
Sbjct: 1 MGWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 60
Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
+ GG+P SE++LP+ LK+ GY +++VGKWHLG ++ ++ P GF+
Sbjct: 61 NAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKVVGKWHLG-HRPQFHPLKHGFDEW 119
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
G H +D+ A + + RD W++ G+Y T ++
Sbjct: 120 FGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKTGEANLTQIYLQ 171
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
EA+D I + P FLY A ATH+ P+ A +L +R ++ + +
Sbjct: 172 EALDFIKRQARHHPFFLYWAVDATHA-----PVYASKPFLGTS------QRGRYGDAVRE 220
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+D+SVGK++E L + N+ + F SDNG A A SN P K T +EGG+R
Sbjct: 221 IDDSVGKMLELLHDLHIADNTFVFFTSDNGAALISAPEQAGSNGPFLCGKQTTFEGGMRE 280
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
L W P + G V+ Q + D T L+ A
Sbjct: 281 PALAWWPGHIAAGQVSHQLGSIMDLFTTSLALAG 314
>gi|47228126|emb|CAF97755.1| unnamed protein product [Tetraodon nigroviridis]
Length = 650
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 122/380 (32%), Positives = 170/380 (44%), Gaps = 68/380 (17%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLC------------ 107
PHIIFIL DD G+ND+G+H I TP +D LA G+ L+NYY +C
Sbjct: 12 PHIIFILTDDQGFNDIGYHN-PTIKTPTLDKLAAEGVRLENYYVQPICTPSRSQLITGRY 70
Query: 108 -TPSRSAIMTGKH---------------PIHTGMQHNVLYGCERGGL-----------PL 140
TPS SA++ +H Q L RG P
Sbjct: 71 ATPSTSAMLVFFFLFIFFLQIPDPHRPPALHHQAQPAQLSAFARGHPARAPPPGRILHPP 130
Query: 141 SEKILPQYLKE------------LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH 188
++ P L+E L R G +G + + + R L + +
Sbjct: 131 GGQMAPGLLQEGVSSHPERLRHLLRLLDRQCGLLQVGLWPRPGS--VRRLSLGLVFLSPA 188
Query: 189 QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLA 247
Y S + + G D+ W GKYST +FT A I+ +H E PLFL L+
Sbjct: 189 PSYL--SCDGPGVCGYDLHDGEGVVWGQEGKYSTALFTRRARQILESHDPAERPLFLLLS 246
Query: 248 HAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSN 307
A H+ PLQ P Y+ +R + + R K AA++ +DE+V V AL + N
Sbjct: 247 LQAVHT-----PLQTPKSYIYPYRDMTNVARRKLAAMVSTVDEAVRNVTYALRKYGYYRN 301
Query: 308 SIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYV 367
S+I++ +DNG SNWPLRG K T WEGG+RG + SPLL R V++ +
Sbjct: 302 SVIIYSTDNGAQP----FTGGSNWPLRGRKGTYWEGGIRGVAFVHSPLLRRRRRVSKALL 357
Query: 368 HVSDWLPTL--LSAANKSDI 385
H++DW PTL L+ N S I
Sbjct: 358 HITDWFPTLVGLAGGNVSQI 377
>gi|423293951|ref|ZP_17272078.1| hypothetical protein HMPREF1070_00743 [Bacteroides ovatus
CL03T12C18]
gi|392677172|gb|EIY70591.1| hypothetical protein HMPREF1070_00743 [Bacteroides ovatus
CL03T12C18]
Length = 461
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 108/337 (32%), Positives = 179/337 (53%), Gaps = 34/337 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+ + I+ADD+G+ DVG +G + I TPNID +A G++ ++++ + +P+R ++TG+
Sbjct: 29 PNFVIIVADDMGYGDVGIYGNEYIKTPNIDQMAREGMMFTDFHSNGSVSSPTRCGLLTGR 88
Query: 119 HPIHTGMQHNVLYGCE----RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+ G++ +L + GL E + L + GYRT ++GKWHLG+ +K +
Sbjct: 89 YQQRAGLEKVLLVPRDDKDKEVGLQSEEITFAKILGDNGYRTALIGKWHLGYLQKHHPMN 148
Query: 175 FRGFESHLGYWTGHQDYFDHSAE--EMKMW-GLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
F GF+ +G+ +G+ DY H ++ W GL+M+ ++ G Y+T + T + D
Sbjct: 149 F-GFQKFVGFKSGNVDYQSHRNRYGDVDWWDGLEMK-------NMPG-YTTTLLTTLSED 199
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH-YLNIHRHIEDFKRSK------FAAI 284
I + D+P LY+AHAA HS P+Q P+ + +E K S + +
Sbjct: 200 YI-KENKDKPFCLYIAHAAPHS-----PMQGPNEKAIRTEATLEGDKNSDRPNKEIYKDM 253
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
+ +LD SVG+++E L++ ++ N+ +VF SDNG N S RG K WEGG
Sbjct: 254 VEELDWSVGRILETLKKYKLDKNTFVVFFSDNGPVIN----NGGSTGGYRGAKGAPWEGG 309
Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
R G+ + P G + +Q V D PT+L AN
Sbjct: 310 HRVPGICYMPGTVKGGTMCKQTVMSFDLFPTMLDMAN 346
>gi|351729940|ref|ZP_08947631.1| sulfatase [Acidovorax radicis N35]
Length = 446
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 113/354 (31%), Positives = 171/354 (48%), Gaps = 27/354 (7%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTG 117
PP+I+FILADDLGW D+ +G TP++DALA G+ + Y +C+ +R A++TG
Sbjct: 8 PPNIVFILADDLGWADLSVYGQADFSTPHLDALAGEGVRFTQAYANSAVCSATRFALITG 67
Query: 118 KHP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
++ + G++ ++ GLP LP L+ GY T ++GKWHLG + P
Sbjct: 68 RYQYRLRGGLEEPLVRKAHVHGLPPEHPTLPSLLRNAGYDTALIGKWHLGSLPT-FGPLK 126
Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
G+ G + G DYF H + D+ P + Y T + EA +
Sbjct: 127 SGYNRFFGNYGGAIDYFTHKPGVGEAVPRDLYEGEVPVERV--GYYTQLLADEASGWLRG 184
Query: 236 HST-DEPLFLYLAHAATHSANPYEPLQAPDHY-----LNIHRHIEDFKRSKFAAILHKLD 289
S +P FL L H P+ P P+ L H + + ++ LD
Sbjct: 185 RSAGSKPFFLSL-----HFTAPHWPWVGPEDEEVSLGLKDLFHYDGGNLHTYGKMVRSLD 239
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
+VG+V+ AL+++ + N+I+VF SDNGG + WP G K L EGG+R
Sbjct: 240 GAVGQVLHALQEQGLADNTIVVFTSDNGGE------RFSKTWPFTGQKTELLEGGIRVPT 293
Query: 350 LI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
L+ W + + V++Q DWLPTLL+AA + +Y S ENI+P E
Sbjct: 294 LLRWPARIAPQ--VSDQVTATMDWLPTLLAAAGVAPDADY-PSDGENILPVLEG 344
>gi|182677550|ref|YP_001831696.1| sulfatase [Beijerinckia indica subsp. indica ATCC 9039]
gi|182633433|gb|ACB94207.1| sulfatase [Beijerinckia indica subsp. indica ATCC 9039]
Length = 486
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/355 (30%), Positives = 174/355 (49%), Gaps = 30/355 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
P+IIFILADDLG+ DV +G + TPNID + G +L+ Y +C+ +R+A++TG+
Sbjct: 37 PNIIFILADDLGYADVSIYGRPDLSTPNIDGIGLKGARLLQAYANSAVCSATRTALLTGR 96
Query: 119 HPI--HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+ G++ + G GLP LP LK+ GY T ++GKWHLG ++ P
Sbjct: 97 YQYRERVGLEEPIA-GNIHVGLPPQRPTLPSLLKKAGYTTTLIGKWHLGTL-PDFGPLQS 154
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
G++ G+ G DY+ H + + D + Y T++ A++ I+
Sbjct: 155 GYDHFYGFRGGAVDYYSHKGTDDQDDLWDQDTKVH-----QTGYLTELLGDRAIETINAS 209
Query: 237 S-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR----HIEDF---KRSKFAAILHKL 288
+ T +P F+ L H P+ P +AP R + DF ++ + ++ +
Sbjct: 210 AKTGQPFFISL-----HFNAPHWPWEAPGDEAESARVAGTRLFDFDGGSQATYRGMIAAM 264
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
D +G++V+AL+ + N+I++F SDNGG A WP G K L EGG+R
Sbjct: 265 DLQIGRIVQALQANGISENTIVIFTSDNGGE------RFADTWPFTGRKTELLEGGLRIP 318
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
L+ P +Q DWLPTLL+AA PN+ + + N++P S
Sbjct: 319 ALVSWPARIKADQTIDQVSISMDWLPTLLAAAGSEPDPNFPSDGI-NLLPFLSES 372
>gi|325107240|ref|YP_004268308.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324967508|gb|ADY58286.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 469
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 119/368 (32%), Positives = 185/368 (50%), Gaps = 48/368 (13%)
Query: 42 LAFTLSMVFVDL------VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
LAF L +F +L S PP+ + I DDLG+ D+G G I TP +D +A G
Sbjct: 8 LAFGL--LFCNLPGDGQAAEKSTPPNFVVIFCDDLGYGDLGCFGHPSISTPQLDQMADEG 65
Query: 96 IILKNYYT-VQLCTPSRSAIMTGKHPIHTGM---QHNVLYGCERGGLPLSEKILPQYLKE 151
+Y +CTPSR+A++TG+ PI GM + VL+ +GGLP E + + LK+
Sbjct: 66 QRWTQFYVGASVCTPSRAALLTGRLPIRNGMMSAKRRVLFPDSKGGLPAQELTIAELLKK 125
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY--------WTGHQDYFDHSAEEMKMWG 203
GY T +GKWHLG ++ E+ PT GF+S+ G GH +Y +AE+ +
Sbjct: 126 NGYATAAIGKWHLG-HRSEFLPTEHGFDSYWGIPYSNDMDREKGHPNYRQKAAEDPHYFS 184
Query: 204 ---------LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA 254
+D + +E + H T +T +A++ I + ++P FLYLAH
Sbjct: 185 PIEQYQVPIIDNKEVVERPANQH--TITKRYTEKAIEFIKGNR-EQPFFLYLAHNL---- 237
Query: 255 NPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
P+ PL A D +L E R + ++ ++D VG++++ L++ + N+++VF S
Sbjct: 238 -PHIPLYASDSFL------ETSPRGIYGDVVTEIDHGVGQILDTLKETGLAENTLVVFTS 290
Query: 315 DNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP-LLESRGIVAEQYVHVSDWL 373
DNG F+ + S LR K +EGG+R ++W P LE + V + D L
Sbjct: 291 DNGPWLL-FDTHGGSAGLLREGKGATFEGGMRVPTVMWWPGTLEPK--VQTELGTTMDLL 347
Query: 374 PTLLSAAN 381
PT S A
Sbjct: 348 PTFCSLAG 355
>gi|332263239|ref|XP_003280658.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Nomascus leucogenys]
Length = 528
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 104/334 (31%), Positives = 164/334 (49%), Gaps = 42/334 (12%)
Query: 70 LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
+GW D+G +G TPN+D +A G++ N+Y+ LC+PSR+A++TG+ PI G
Sbjct: 47 MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 106
Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
+ GG+P SE++LP+ LK+ GY ++IVGKWHLG ++ ++ P GF+
Sbjct: 107 NAHARNAYTPQEIVGGVPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEW 165
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
G H +D+ A + + RD W++ G+Y T ++
Sbjct: 166 FGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKTGEANLTQIYLQ 217
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
EA+D I + P FLY A ATH+ P+ A +L +R ++ + +
Sbjct: 218 EALDFIKRQARRHPFFLYWAVDATHA-----PVYASKPFLGTS------QRGRYGDAIRE 266
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+D+S+GK++E L + N+ + F SDNG A SN P K T +EGG+R
Sbjct: 267 IDDSIGKMLELLHNLHIADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMRE 326
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
L W P + G V+ Q + D T L+ A
Sbjct: 327 PALAWWPGHVAAGQVSHQLGSIMDLFTTSLALAG 360
>gi|149174645|ref|ZP_01853271.1| sulfatase [Planctomyces maris DSM 8797]
gi|148846755|gb|EDL61092.1| sulfatase [Planctomyces maris DSM 8797]
Length = 405
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 105/339 (30%), Positives = 172/339 (50%), Gaps = 20/339 (5%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY-TVQLCTPSRSAI 114
SS P++I I DD G D+ +G + TP++D++A GI +Y + +C+PSR+ +
Sbjct: 5 SSEKPNVIIIFTDDQGSVDLNCYGAKDLITPHMDSIARRGIRFTQFYASAPVCSPSRAGM 64
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TG+ P G+ NV + G+P + + + +++ GY+T +GKWHLG Y E P
Sbjct: 65 LTGRFPARAGVPGNVSSHHGKSGMPTEQITIAEMMQQAGYQTAHIGKWHLG-YTPETMPH 123
Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL----EPAWDLHGKYSTDVFTAEAV 230
+GFE+ G+ G D + H W R DL + W G + D+ +
Sbjct: 124 GQGFETSFGHMGGCIDNYSH----FFYWNGPNRHDLWENGKEVWR-DGAFFPDLMVEQCQ 178
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
D I + D+P FLY A P+ PLQ + + + H+ R K+AA + +D+
Sbjct: 179 DYIRK-AGDKPFFLYWAINV-----PHYPLQGKEKWRKTYAHLSS-PRDKYAAFVSTMDD 231
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL-NAASNWPLRGVKNTLWEGGVRGAG 349
+G+V+ L+ ++ +II+F SD+G + S P RG K +L+EGG+R
Sbjct: 232 CIGEVLATLDACQLREKTIIIFQSDHGHSHEERTFGGGGSAGPYRGAKFSLFEGGIRVPA 291
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
+I P + G V +Q DWLPT +SA + +P +
Sbjct: 292 MISWPGTIAEGEVRDQLATGCDWLPT-ISALTGAPLPAH 329
>gi|343084600|ref|YP_004773895.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342353134|gb|AEL25664.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 472
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 116/359 (32%), Positives = 188/359 (52%), Gaps = 47/359 (13%)
Query: 52 DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPS 110
+L A PP+I++IL DDLGW D+G +G + TPNID L G+ Y+ +C+PS
Sbjct: 21 NLSAQERPPNIVYILIDDLGWKDLGCYGSEFYETPNIDKLRDQGMKFTAAYSASPVCSPS 80
Query: 111 RSAIMTGKHPIHTGMQHNVL------YGCERGGLP--------LSEKILPQYLKELGYRT 156
R++I+TGK+P + G ++ Y E +P L EK++P+ L + GY +
Sbjct: 81 RASILTGKNPANIGFTGHITAIGKHRYPEEGRIIPPDDYMHVSLEEKMIPEILLQSGYTS 140
Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHL-GYWTGHQDYFDHSAEEMKMW-----GLDMRRDL 210
+GKWH+G +K + PT +GF ++ GY G + E K W LD R +
Sbjct: 141 ASIGKWHVGEEEK-FFPTHQGFAINIAGYEHGSPPTYWGPFESEKSWNPVIKNLDNREE- 198
Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD-----H 265
G+Y T+ T EA++ I + + + P FLYL+H A H+ PL+APD +
Sbjct: 199 -------GQYLTNRLTDEAINFI-DENKEGPFFLYLSHYAVHT-----PLEAPDSLIKKY 245
Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
L + +E K + +AA++ +D +VG+++++L+ + N+I++ SDNGG
Sbjct: 246 ELKLEGQMEQ-KNAIYAAMIENMDWNVGRLLKSLDSLGLEGNTIVILGSDNGGEG----- 299
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
+N PLR K ++EGG+R +I P ++ V D LPT++ A S+
Sbjct: 300 RVTNNVPLREGKGYIYEGGIRVPLVIKWPGKVKPNSSSDVPVITDDMLPTIVEMAGLSN 358
>gi|87309613|ref|ZP_01091747.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
gi|87287377|gb|EAQ79277.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
Length = 496
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 132/469 (28%), Positives = 210/469 (44%), Gaps = 73/469 (15%)
Query: 22 PQYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGP---PHIIFILADDLGWNDVGFH 78
P YL+ ++ F ++ A S S+ P P+I+F L DDLGW D+G +
Sbjct: 7 PSYLQH-----KLTTFTLVIGAILFSFTLEKQANSAEPKRTPNIVFFLVDDLGWKDIGVY 61
Query: 79 GLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHNV-------- 129
G TPN+D LA SG+ N Y Q+C+P+R++IMTGK+P G+ +
Sbjct: 62 GSSFYHTPNVDGLAASGMRFTNAYAACQVCSPTRASIMTGKYPQRVGITDYIGAAQPDKW 121
Query: 130 ------LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG 183
L + L L E L + LK+ GY T GKWHLG + P +GF+ ++G
Sbjct: 122 KRNTPLLPAPYQTRLALEETTLAEALKQRGYATFFAGKWHLG--PEGNWPEDQGFDVNIG 179
Query: 184 YWT-----GHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
G + YF G D G++ D +E V I H
Sbjct: 180 GIDRGGPYGGKKYFSPYGNPRLTDGPD------------GEHLPDRLASETVKFIEQHQ- 226
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR-------------------- 278
D+P YL+ + H+ PL A + + I+ R
Sbjct: 227 DQPFLAYLSFYSVHT-----PLMAREDLKQKYDEIKQRIRFAGPIWGEEGKSKLRLVQEH 281
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
S +A ++ +D +VGKV++AL++ ++ N++++F SDNGG + + SN PLRG K
Sbjct: 282 SVYAGMVEAMDAAVGKVLDALDRLKLTDNTLVIFTSDNGGLSTSEG-HPTSNLPLRGGKG 340
Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
++EGG+R ++ P + S G ++ V D+LPT+L+ +K + +++ +II
Sbjct: 341 WMYEGGIREPLVVRYPGVTSPGSESDALVTSPDFLPTILAVVDKPG--DKIDTDGVSIIS 398
Query: 399 RYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHE 447
E L + G ++ P N E + K YE+G E
Sbjct: 399 ALEGKPL--DRGPIFWHYPHYGNQGGSPTAAVREGDWKLIEWYEDGKVE 445
>gi|149175125|ref|ZP_01853748.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
8797]
gi|148846103|gb|EDL60443.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
8797]
Length = 413
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 108/334 (32%), Positives = 173/334 (51%), Gaps = 33/334 (9%)
Query: 66 LADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGKHPIHTG 124
+ADDLG+ D+ +G TP++D LA +GI ++++ +C+P+R+ ++TG++ G
Sbjct: 1 MADDLGYGDLSCYGSQNCNTPHLDRLAANGIRFTDFHSSGAVCSPTRAGLLTGRYQQRAG 60
Query: 125 MQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
+ V+Y + GL +E L Q L++ GY+T + GKWHLG Y+++Y PTFRGF+
Sbjct: 61 ID-GVVYANPKKNRHHGLQKNEITLAQCLQDAGYQTGMFGKWHLG-YQRQYNPTFRGFQQ 118
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
+GY +G+ DYF H + W + + E Y T + A++ I +
Sbjct: 119 FVGYVSGNVDYFAHLDGTGVFDWWHNAELNREEQ-----GYVTHLINDHALEFIRQQQ-E 172
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH-------KLDESV 292
+P F+Y+AH A HS P Q P D K +K I + ++D+ +
Sbjct: 173 KPFFVYIAHEAVHS-----PYQGPHDQPMRKEGGGDIKSAKRKDIANAYREMNTEMDKGI 227
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G++V+ L++ + + I F+SDNG G SN LRG K +LWEGG R +
Sbjct: 228 GQIVDVLKEVNLTEKTFIFFLSDNGANKNG------SNGKLRGFKGSLWEGGHRVPAIAC 281
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
P G V ++ V D +PT+L AN + IP
Sbjct: 282 WPGRIPEGTVCDEPVISIDLMPTILELAN-AKIP 314
>gi|296124231|ref|YP_003632009.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296016571|gb|ADG69810.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 467
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 114/360 (31%), Positives = 171/360 (47%), Gaps = 32/360 (8%)
Query: 33 RIMAFAVLPLAFTLSMVFVDLVASSGP----PHIIFILADDLGWNDVGFHGLDQIPTPNI 88
R+ +L FT + + P+I+ IL+DD G + G + TP+I
Sbjct: 8 RLWLVLILAFQFTSQLALAQRATTETTSERRPNILLILSDDCGHAEFSIQGHPRYKTPHI 67
Query: 89 DALAYSGIILK-NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG-CERGGLPLSEKILP 146
D++ +G+ + Y + +C+PSR+ ++ G++ G + N+ E GLP SE +LP
Sbjct: 68 DSIGKNGVHFRQGYVSGCVCSPSRAGLLAGRYQQRFGHEFNIPPAYSETNGLPRSETLLP 127
Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDM 206
Q LKE GYRT +GKWHLG Y ++ P RGF + G+ G + YF M
Sbjct: 128 QLLKEDGYRTIALGKWHLG-YAPQFHPMERGFTDYYGFLQGSRSYFPLKKPTRLN---QM 183
Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP--LQAPD 264
RD + Y TD EA+ I + +P +YLA ATHS N LQA D
Sbjct: 184 LRDRTAIPEEQFGYMTDHLADEAIAYIKQWQS-QPWMMYLAFNATHSPNDATAVDLQAAD 242
Query: 265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN 324
+K A+ LD +VGKV++AL++ + +++++F++DNGGA
Sbjct: 243 G-------------NKIYAMTIALDRAVGKVLDALKECGLSKDTLVIFINDNGGAG---- 285
Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
N L G K + WEGG R L+ P G V ++ V D PT+L A D
Sbjct: 286 --GHDNGSLHGKKGSTWEGGTRIPFLVQYPAKIPSGQVIDEPVIALDLFPTILDVAGLGD 343
>gi|149701806|ref|XP_001488119.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Equus caballus]
Length = 491
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 107/338 (31%), Positives = 166/338 (49%), Gaps = 47/338 (13%)
Query: 69 DLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQH 127
++GW D+G +G TPN+D +A G++ N+YT LC+PSR+A++TG+ PI G
Sbjct: 8 NMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYTANPLCSPSRAALLTGRLPIRNGFYT 67
Query: 128 NVLYGCER---------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
G R GG+P SE++LP+ LKE GY ++IVGKWHLG ++ ++ P GF
Sbjct: 68 TS--GHARNAYTPQEIVGGIPDSERLLPELLKEAGYVSKIVGKWHLG-HRPQFHPLKHGF 124
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDV 224
+ G H +D+ A + + RD W++ G+Y T +
Sbjct: 125 DEWFGSPNCHFGPYDNRARP----NIPVYRD----WEMVGRYYEEFPINLKTGEANLTQI 176
Query: 225 FTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
+ EA+D I + P FLY A ATH+ P+ A +L +R ++
Sbjct: 177 YLQEALDFIRRQQAARRPFFLYWAVDATHA-----PVYASKPFLGTS------QRGRYGD 225
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
+ ++D+SVGK++ +L+ + ++ + F SDNG A SN P K T +EG
Sbjct: 226 AVREIDDSVGKILHSLQDLGIAESTFVFFTSDNGAALISAPKQGGSNGPFLCGKQTTFEG 285
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
G+R + W P G V+ Q + D T LS A
Sbjct: 286 GMREPAVAWWPGRIPAGQVSHQLGSIMDLFTTSLSLAG 323
>gi|430745365|ref|YP_007204494.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430017085|gb|AGA28799.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 476
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 133/422 (31%), Positives = 200/422 (47%), Gaps = 59/422 (13%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
++ A +P AF S +G P+++ ILADDLG+ D+ +G + TPNIDAL
Sbjct: 12 VVVLAGVPEAFGAS-------PDAGRPNVLLILADDLGYGDLSSYGAADLKTPNIDALVA 64
Query: 94 SGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER--GGLPLSEKILPQYLK 150
SG+ +Y +C+P+R+A++TG +P G+ + + G L +LPQ LK
Sbjct: 65 SGVRFDRFYANSPVCSPTRAALLTGCYPDLVGVPGVIRTHPDDSWGVLSPQAVLLPQVLK 124
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQ-DYFDHSAEEMKMWGLDMRRD 209
GY T +VGKWHLG P+ RGF+ G+ DY +H + MRRD
Sbjct: 125 GAGYHTALVGKWHLGLSGAS-LPSRRGFDLFHGFLGDMMDDYHNHRRHGINY----MRRD 179
Query: 210 ---LEPAWDLHGKYSTDVFTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
++P ++TD+F+ A+D ++ + D P FL LA+ H+ P+Q P
Sbjct: 180 DREIDPK-----GHATDLFSQWAIDFLNESKGQDRPFFLELAYNVPHT-----PVQPPQE 229
Query: 266 YLNIHRHIE---DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
+L R D KR++ A++ LD +G+V+ AL ++IVF SDNGG
Sbjct: 230 WLEKVRRRAPNLDPKRARLVALIEHLDHGIGQVLAALRANGQAERTLIVFTSDNGG---- 285
Query: 323 FNLNAASNW-PLRGVKNTLWEGGVR-GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
L+A ++ P RG K ++EGG+R +W +E G +++ D PTL AA
Sbjct: 286 -QLDAGAHCGPYRGGKQDMYEGGIRVPLCAVWPGRIEP-GTRSDRVAVSMDLFPTLCEAA 343
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENS--NTRYENGTHEYNPKYE 438
V+ I S+L G H P IE R E G PKY
Sbjct: 344 GAR-----FEHPVDGI------SLLSTFQGHHNKAEPPIERDLIFVRREGG-----PKYH 387
Query: 439 NR 440
+
Sbjct: 388 GQ 389
>gi|392969626|ref|ZP_10335041.1| sulfatase [Fibrisoma limi BUZ 3]
gi|387841820|emb|CCH57099.1| sulfatase [Fibrisoma limi BUZ 3]
Length = 477
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 123/411 (29%), Positives = 192/411 (46%), Gaps = 44/411 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
P+I+FILADDLG+ D+ +G TP++D+LA GI + Y +C+PSR+AI+TGK
Sbjct: 37 PNIVFILADDLGYMDLRCYGNPYNETPHLDSLARRGIRFTQAYSACPVCSPSRAAILTGK 96
Query: 119 HPIHTGMQHNVLYGCERGG-------------LPLSEKILPQYLKELGYRTRIVGKWHLG 165
HP + + + G ER LP SE L + LK+ GY T +VGKWHLG
Sbjct: 97 HPARLHLTNFI--GGERVDTTSSLLPAEWRRYLPASETTLAELLKQQGYVTGMVGKWHLG 154
Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
TPT +GF+ DY+++S + D +Y TD
Sbjct: 155 NTGDSLTPTAQGFDYERQISKNGLDYYNYSIASNNKTVFE---------DTGKEYLTDKL 205
Query: 226 TAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAP-----DHYLNIHRHIEDFKRS 279
T A++ I N + +PLFLYLA++A H L P +L +
Sbjct: 206 TDYALEFIDQNKAGQKPLFLYLAYSAPHI------LLVPRADKLSKFLYRYPRFNGRYNP 259
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA---AGFNLNAASNWPLRGV 336
+AA+L DE VG+V++ L+ + N+++VF SDNGG AG N PLR
Sbjct: 260 SYAAMLLSFDEGVGRVLQKLKDTGLADNTLVVFTSDNGGLGMPEAG--PTPTDNSPLRKW 317
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENI 396
K ++EGG+R ++ + +R ++Y+ +D++PT + + P + ++
Sbjct: 318 KGHVYEGGIRVPLIMAGSGITARNATCDRYLTGTDFVPTFMEMLQTNWKPAVPDG--KSF 375
Query: 397 IPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHE 447
N ++ G ++ P N +R + + K YE G E
Sbjct: 376 YALLSNPAAAHKRGPVFWHYPHFSNQASRPSAAIRDGDYKLVEHYETGNTE 426
>gi|395537754|ref|XP_003770857.1| PREDICTED: arylsulfatase A [Sarcophilus harrisii]
Length = 507
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 112/359 (31%), Positives = 171/359 (47%), Gaps = 34/359 (9%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRS 112
+A + PP+II I ADDLG+ D+G +G TPN+D +A G+ ++Y V LCTPSR+
Sbjct: 15 LAVARPPNIILIFADDLGYGDLGSYGHPSSSTPNLDRMAAYGLRFTDFYVPVSLCTPSRA 74
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF-YKKEY 171
A++TG++P+ +G+ VLY RGGLPL E + + L + GY T + GKWHLG +
Sbjct: 75 ALLTGRYPVRSGLYPGVLYPASRGGLPLDEVTIAEVLAKRGYLTGMAGKWHLGVGLNGTF 134
Query: 172 TPTFRGFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-------MRRDL----EPAWD 215
P ++GF +LG H Q + + G D + +L +P W
Sbjct: 135 LPPYQGFHRYLGIPYSHDQGPCQKLICFPPDTLCFGGCDQGNVPIVLMANLTVKEQPPW- 193
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
L G V A+ ++ D P FLY A TH + +E+
Sbjct: 194 LPGLEKQYVAFAQNF-MVDAARQDRPFFLYYASHHTHYPQ-----------FSGESFVEN 241
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
R F L ++D +VG ++ +E + +++ F +DNG S+ LR
Sbjct: 242 SGRGPFGDALMEMDSAVGALLNTVESLGLTEETLVFFTADNGPETMRL-FRGGSSGILRC 300
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVE 394
K T +EGGVR L + P R V+ + D LPT L+A ++ +PN V+
Sbjct: 301 GKGTTYEGGVRQPALAFWP-GHIRPGVSHELASTLDLLPT-LAALAQAQLPNVTLDGVD 357
>gi|340369799|ref|XP_003383435.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Amphimedon
queenslandica]
Length = 523
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 110/348 (31%), Positives = 171/348 (49%), Gaps = 44/348 (12%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P++IF+L DD+GW D+G +G TPN+D +A G++L ++Y+ LC+PSR+A++TG+
Sbjct: 33 PNMIFMLMDDMGWGDLGVYGHPVKETPNLDKMALEGMLLPDFYSANPLCSPSRAAMLTGR 92
Query: 119 HPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
PI G + GG+P SE + P+ L++ GY T I+GKWHLG + Y
Sbjct: 93 LPIRNGFYTTNAHARNAYTPQDIVGGIPDSEILYPELLQKNGYATMIIGKWHLG-QQTHY 151
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV------- 224
P GF+ G H FD + + + R+ A G+Y D
Sbjct: 152 HPLKHGFDEFFGSTNCHFGPFDGKEQP----NMPVYRNATMA----GRYYQDFPINHKTG 203
Query: 225 -------FTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
+T EA+ I+ N + +P FLY ATH+ PL A +L
Sbjct: 204 ESNLTVEYTQEAIKFINKNAANKKPFFLYWTPDATHT-----PLFASKDFLGTS------ 252
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
+R + + +LD SVG++++ L+ + +++++F SDNGGA + SN P
Sbjct: 253 QRGLYGDAVKELDYSVGQILDTLKTLGIDKDTLVIFSSDNGGATYA-KESGGSNSPFLCG 311
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
K T +EGG+R + W P G V+ Q + DW T L A S+
Sbjct: 312 KETTFEGGMREPTIAWWPGTIQPGQVSHQLGSLMDWYSTALDLAGISE 359
>gi|296120840|ref|YP_003628618.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296013180|gb|ADG66419.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 517
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 109/340 (32%), Positives = 166/340 (48%), Gaps = 46/340 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P++IFI+ADDLG+ D+G G TP+ID LA G+ N++ Q C P+R+AIMTG++
Sbjct: 52 PNVIFIMADDLGYTDLGCFGSRYYETPHIDQLARQGVRFLNHHHCQNCAPTRAAIMTGQY 111
Query: 120 PIHTGM-----------QHNVLYGCER-GGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
TG+ Q L E LPL + Q ++ GY+T + GKWHLG
Sbjct: 112 GPRTGVYTVGSIDRFDWQSRPLRPVENVEKLPLDPATIAQQVQRSGYQTGMFGKWHLGL- 170
Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
Y P+ RGF + H ++ A++ D GKY D T
Sbjct: 171 NGSYHPSQRGFHEAVESSGQHFNFKTTPAQK----------------DTEGKYLADYLTD 214
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH---IEDFKRSKFAAI 284
A+D I H + EP FLYL H HS P QA ++ + + + +AA+
Sbjct: 215 RAIDFIERHQS-EPFFLYLPHFGVHS-----PFQAKKEWIEKFENKPGVGGHRNPVYAAM 268
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA----GFNL--NAASNWPLRGVKN 338
+ +DESVG++++ L++ ++ ++++F SDNGG G + + N PLR K
Sbjct: 269 IASVDESVGRILDKLDELKLADKTVVIFASDNGGVGGYEREGLHKANDVTDNAPLRSGKG 328
Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQ-YVHVSDWLPTLL 377
+L+EGG R ++ P + G+ +HV D PT L
Sbjct: 329 SLYEGGTRVPLIVRWPGVAPAGVECRTPTIHV-DLYPTFL 367
>gi|170726755|ref|YP_001760781.1| sulfatase [Shewanella woodyi ATCC 51908]
gi|169812102|gb|ACA86686.1| sulfatase [Shewanella woodyi ATCC 51908]
Length = 548
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 128/377 (33%), Positives = 183/377 (48%), Gaps = 65/377 (17%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQL-CTPSRSAIMTGK 118
P+I+ ILADD+G NDV G I TPNID LA G + N Y+ C PSR+A++TG+
Sbjct: 63 PNIVLILADDMGINDVSTFGGGMIETPNIDKLAAKGALFTNGYSGHANCAPSRAALLTGR 122
Query: 119 H---------PIHTGMQHNVL--------------YGCERG---------GLPLSEKILP 146
PI GM + Y E GLP SE ++P
Sbjct: 123 DATRTGYDTTPIPDGMSRIIAAIENNEDNGRPEMSYSAEADATNPTYDNRGLPGSEILIP 182
Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE---EMKMWG 203
+ LKE GY T +GKWHLG E P +GF+ L +G DH ++ G
Sbjct: 183 EILKESGYHTMHIGKWHLG-RSPEMMPNAQGFDESLMMDSGLYLPVDHPESVNAPVESSG 241
Query: 204 LD------MRRDLEPAWDLH-----GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATH 252
LD MR + W+ Y TD FT EA I ++ + P FLYLAH
Sbjct: 242 LDRFIWATMRYSVN--WNGGEIFKPNGYLTDYFTEEAEKAIEANA-NRPFFLYLAHWG-- 296
Query: 253 SANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVF 312
P+ P+QA I+ + +AA+L +D SV +V+ LE++ + N+I++
Sbjct: 297 ---PHNPVQAKRADYEAVGDIQPHNKRVYAAMLRSIDRSVERVMAKLEKQGIADNTIVIL 353
Query: 313 VSDNGGA--AAGFNLNAASNWPLRGVKNTLWEGGVRGA-GLIWSPLLESRGIVAEQYVHV 369
SDNGGA A +LN P RG KNT +EGG+R + W +++ ++ E H+
Sbjct: 354 SSDNGGADYVAINDLNK----PYRGWKNTFFEGGIRVPFSVTWPNVIDESTVIEEPVNHI 409
Query: 370 SDWLPTLLSAANKSDIP 386
D +PT+++ AN +D+P
Sbjct: 410 -DLMPTIINMAN-ADLP 424
>gi|196231555|ref|ZP_03130413.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196224408|gb|EDY18920.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 467
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 117/357 (32%), Positives = 172/357 (48%), Gaps = 42/357 (11%)
Query: 46 LSMVFVDLVASSGP--PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
+V L A + P P+ IFILADDLGW DVGFH +PTPN+D LA G+ L +Y
Sbjct: 25 FGLVVSSLAADTAPLRPNFIFILADDLGWGDVGFH-HGNVPTPNLDHLAGEGLELMQHYV 83
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
+C+P+R A ++G++ + +V L + LK +GY T + GKWH
Sbjct: 84 YPVCSPTRCAFLSGRY----ASRFSVTTPQNPRAFRWDTVTLARALKSVGYDTALCGKWH 139
Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS---AEEMKMWGLDMRRDLEPAWDLHGKY 220
LG K E+ P GF+ G G +DH E + W D + E +
Sbjct: 140 LG-SKPEWGPQKFGFDHSYGSLAGGVGPWDHHYKIGEFTQTWHRDGKLIEEQG------H 192
Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDFKRS 279
TD+ T EAV+ + + TD+P FLY+ A H P++ PD L + I
Sbjct: 193 VTDLITKEAVEWLESR-TDKPFFLYVPFTAVHI-----PIREPDEILQRVPASITKPSLR 246
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN------------- 326
+ A + LD+SVGK++ ALE+ N++++F SDN GA G N
Sbjct: 247 HYGANVMHLDDSVGKILVALEKTGKAGNTLVIFGSDN-GAIPGVENNDPLYPPDHYPPGP 305
Query: 327 -AASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
SN PL G+K ++EGG+ A + W L+ + H++DW+PT + A
Sbjct: 306 AGGSNEPLHGMKGEVYEGGIHTAAVARWPGQLKPGKFLG--LAHITDWMPTFCALAG 360
>gi|148223113|ref|NP_001088386.1| galactosamine (N-acetyl)-6-sulfate sulfatase precursor [Xenopus
laevis]
gi|54038351|gb|AAH84661.1| LOC495239 protein [Xenopus laevis]
Length = 520
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 112/370 (30%), Positives = 181/370 (48%), Gaps = 48/370 (12%)
Query: 39 VLPLAFTLSMVFVDLVASSGP-----PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
VL + +L++ F++ V G P+I+ +L DD+GW D+G G TPN+D +A
Sbjct: 2 VLYIMCSLALAFLNPVLVLGGMNVTRPNILLLLMDDMGWGDLGVFGEPSRETPNLDKMAS 61
Query: 94 SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKIL 145
G++ N+YT LC+PSR+A++TG+ PI G N E GG+ SE +
Sbjct: 62 EGMLFPNFYTANPLCSPSRAALLTGRLPIRNGFFTTNSHARNAYTPQEIVGGISDSEILF 121
Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
P+ LK+ GY +I+GKWHLG ++ Y P GF+ G H +D+ ++ +
Sbjct: 122 PELLKKAGYINKIIGKWHLG-HRPSYHPLRHGFDEWFGSPNCHFGPYDNK----QIPNIP 176
Query: 206 MRRDLEPAWDLHGKYS--------------TDVFTAEAVDIIHNHST-DEPLFLYLAHAA 250
+ RD W++ G+Y T ++ EA++ I++ + +P FLY + A
Sbjct: 177 LYRD----WNMVGRYYEEFNINHKTGESNLTQIYLQEALEFIYSQAVAQQPFFLYWSIDA 232
Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
TH+ P+ A +L R + + ++D S+GK++ L + ++ +
Sbjct: 233 THA-----PVYASRPFLGTS------NRGLYGDAVREIDFSIGKILNLLNLTGIAKDTFV 281
Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
F SDNG A + SN P K T +EGG+R G+ W P + G V+ Q +
Sbjct: 282 FFTSDNGAALISAPMQGGSNGPFLCGKQTTFEGGMRVPGIAWWPGHVAPGQVSYQLGSIM 341
Query: 371 DWLPTLLSAA 380
D T LS A
Sbjct: 342 DLFTTSLSLA 351
>gi|119713178|gb|ABL97246.1| sulfatase [uncultured marine bacterium EB0_50A10]
Length = 544
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 126/407 (30%), Positives = 190/407 (46%), Gaps = 66/407 (16%)
Query: 30 YRTRIMAFAVLPLAFTLSMVFVDLVASSGP--------PHIIFILADDLGWNDVGFH--- 78
Y+ I+ +A+ ++ + S GP P+II +LADD+G+ND+ H
Sbjct: 22 YKVNILVWAIPKISNVTVQENIPTTWSKGPDTPVDDNRPNIILVLADDMGYNDISIHNGG 81
Query: 79 -GLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERG 136
+ T NIDALA SGI+ + Y C PSR++IMTGK+P G + + R
Sbjct: 82 AADGTLQTKNIDALAKSGILFTRGYAANATCAPSRASIMTGKYPTRFGYEFTPIPAFGRT 141
Query: 137 -------------------------------GLPLSEKILPQYLKELGYRTRIVGKWHLG 165
G+P + + + L++ GY T +GKWHLG
Sbjct: 142 VLGWLAEEDNFELKQRIDREVVSNMPPFMEQGMPTEQITIAEVLRDAGYYTAHIGKWHLG 201
Query: 166 FYKKEYTPTFRGFESHLGYW------TGHQDY----FDHSAEEMKMWGLDMRRDLEPAWD 215
++ P +GF+ LG H D FD ++M +WG+ D
Sbjct: 202 -HEYGMDPMSQGFQDSLGLVGPLYLPEDHPDVVNAKFDTRIDKM-IWGMGQYSANFNGGD 259
Query: 216 LHG--KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
L KY TD +T EA+ +I N+ + P FLYL+H A H+ PLQA H+
Sbjct: 260 LFAPDKYVTDYYTDEALKVIENNK-NRPFFLYLSHWAIHN-----PLQALRSDFEQMSHM 313
Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
++ +++ LD SVGK++E L++ + ++I+F SDNGGA LN N P
Sbjct: 314 HGHNLQVYSGMINSLDRSVGKIIEKLKELDIYGKTLIIFTSDNGGANY-IELNDI-NKPY 371
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
RG K + ++GG+R +I P + G +E VH D PT+L AA
Sbjct: 372 RGWKISFFDGGIRVPYIISWPDEINPGKKSENAVHHFDIFPTILKAA 418
>gi|341889947|gb|EGT45882.1| CBN-SUL-3 protein [Caenorhabditis brenneri]
Length = 432
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 88/228 (38%), Positives = 132/228 (57%), Gaps = 23/228 (10%)
Query: 51 VDLVASSGPP----HIIFILADDLGWNDVGFHGLDQIPTPNIDALAY--SGIILKNYYTV 104
D+V PP +++FI+ADDLG+ND+ + + TPN+ LA+ + +L N Y
Sbjct: 20 TDVVNGQSPPTQKPNVLFIMADDLGFNDLDWKD-STLHTPNLRNLAFHKNTALLTNSYVN 78
Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
QLCTP+RSA MTG +P G Q V E G+P L + +++L Y T +VGKWHL
Sbjct: 79 QLCTPTRSAFMTGYYPFRVGTQAGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHL 138
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK------MWGLDMRRDLE-----PA 213
G+ KKE+ PT RGF+ G++ YF+HSA++ + GLD+ ++ P
Sbjct: 139 GYCKKEFLPTNRGFDYFYGFYGPQTGYFNHSADQYHRELRRVVKGLDLFEEVGNGKSVPD 198
Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQ 261
+ +G YSTD+FT A+ +I NH+T +P F++L++ A H PLQ
Sbjct: 199 FSQNGVYSTDLFTDVAMSVIDNHNTTKPFFMFLSYQAVH-----PPLQ 241
>gi|260062069|ref|YP_003195149.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
gi|88783631|gb|EAR14802.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
Length = 492
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 106/327 (32%), Positives = 160/327 (48%), Gaps = 35/327 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+ I + ADDLG+ D+ G I T N+D +A G N+Y +CTPSR+ ++TG+
Sbjct: 50 PNFIIVFADDLGYGDLSSFGHPTIHTKNLDRMAAEGQKWTNFYVAASVCTPSRAGLLTGR 109
Query: 119 HPIHTGMQHN---VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
P+ G+ N V + G+P SE L + LK+ GY T +VGKWHLG +K+EY P
Sbjct: 110 LPVRNGLTSNEIGVFFPDSHNGMPASEITLAEQLKKAGYATGMVGKWHLG-HKEEYLPPN 168
Query: 176 RGFESHLGY-----------WTGHQDYFDHSAEEMKMWGLD------MRRDLEPAWDLHG 218
GF+ + G +T +QDYF E + + +R E ++
Sbjct: 169 HGFDDYFGIPYSNDMDFTGQFTSYQDYFGRYTERYESLKTEEYNVPLIRGTEEIERPVNQ 228
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
T + EAV I H DEP F+YLAH+ P+ PL D + R
Sbjct: 229 NTITKRYNDEAVKWIREHK-DEPFFMYLAHSL-----PHVPLFTSDEFRGTS------AR 276
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
+ ++ ++D VG+++E LE + N+I+VF SDNG ++ S LR K
Sbjct: 277 GLYGDVVEEIDHGVGQIMELLEAEGLAENTIVVFTSDNGPWLPT-GISGGSAGLLREGKG 335
Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQ 365
T WEGG+R + W+P + +V +
Sbjct: 336 TTWEGGMREPTIFWAPGMLPAKVVMDM 362
>gi|427388367|ref|ZP_18884250.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
12058]
gi|425724950|gb|EKU87824.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
12058]
Length = 457
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 124/374 (33%), Positives = 188/374 (50%), Gaps = 40/374 (10%)
Query: 40 LPLAFTLSMVFVDLVA-SSGP-------PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
+PL F + V + VA +S P P+I+ I+ DDLG D+ + TPNID L
Sbjct: 9 IPLLFAVGGVAISSVAHASSPQKLDEQKPNILLIMVDDLGLGDLSCQYARDLQTPNIDKL 68
Query: 92 AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLY--GCERGGLPLSEKILPQY 148
G+ L N+Y + +PSR+ ++TG +P G+ + G L +LPQ
Sbjct: 69 FSDGVRLNNFYANSSVSSPSRAGLLTGCYPDMVGVPGVIRTETTASWGYLSPEAILLPQM 128
Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMR 207
+K+ Y T I+GKWHLG ++ P RGF+ G+ D Y+ H L+
Sbjct: 129 MKKAEYNTAIIGKWHLGL-EEPNLPNSRGFDFFHGFLGDMMDNYYTHLRRGHNYMRLN-E 186
Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
+EP ++TD+F++ A+D + P FLYLA+ A P+ PLQ P +
Sbjct: 187 EVIEPT-----GHATDLFSSWALDYFDTMRKSKVPFFLYLAYNA-----PHVPLQPPTEW 236
Query: 267 LNIHRHIE---DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
L R E KR+K A++ LD ++G+V +ALE+ L N++I+F SDNGG A
Sbjct: 237 LERVREREPGISEKRAKLVALIEHLDYNIGRVYKALEESGQLDNTLIIFCSDNGGDAGA- 295
Query: 324 NLNAASNWPLRGVKNTLWEGGVR-GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
A+N P+RG K ++EGG++ + W L+ R I + V +SD PTL +
Sbjct: 296 ---EANNGPVRGSKGDMYEGGIKVPCAVYWRNHLKPRSI--DDLVIMSDIFPTLCDLVS- 349
Query: 383 SDIPNYVNSTVENI 396
IP VN ++ I
Sbjct: 350 --IP--VNHRIDGI 359
>gi|298706368|emb|CBJ29377.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 653
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 110/350 (31%), Positives = 167/350 (47%), Gaps = 50/350 (14%)
Query: 71 GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
GW DVGFH TPN+DA+ G+ L +YT CTPSR+ +MTG++ GMQ +VL
Sbjct: 48 GWKDVGFHDT-TFSTPNLDAMVAEGVELSTFYTAPTCTPSRAQLMTGRYSYRIGMQDSVL 106
Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
+ E G+PL+E + + L+ GY T VGKWHLG + ++ P RGF+ G TG
Sbjct: 107 HTTEPRGVPLTETFVGEKLQAAGYSTAAVGKWHLGMHMPQFLPVERGFDDFYGILTGGGG 166
Query: 191 YFDH---SAEEMKMWGLDMRRDLEPAWDLHGK-----------YSTDVFTAEAVDIIHNH 236
++ H S E G R + G +ST+++T +A + +
Sbjct: 167 HYTHMSVSPEFTPRGGSGKTRTFSGPNIVEGSEVSDDNQDTRTHSTELYTKKAAEYVEAM 226
Query: 237 ST--DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE---------------DFKRS 279
S D+P FLYL++ A H +P++ D ++N R E D+
Sbjct: 227 SAKEDQPWFLYLSYQAIH-----DPIETDDSWIN-GRSCESISAENSSIGETDETDYDNR 280
Query: 280 KFA-AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
+ A ++ ++D +G + LE N++++F SDNGG + + + N P RG K
Sbjct: 281 RIACGMVAQMDNGLGALRGLLENLGEWDNTVVIFFSDNGGLPS----HGSVNRPFRGGKG 336
Query: 339 TLWEGGVR----GAGLIWSPLLESRGIVAEQY---VHVSDWLPTLLSAAN 381
WEGGV +G S L G+ +Y HV+D T+L A
Sbjct: 337 DYWEGGVHVPAFVSGGFVSAALRRTGVEPYRYGHLTHVTDVHATILGLAG 386
>gi|375143552|ref|YP_005005993.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
gi|361057598|gb|AEV96589.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
Length = 482
Score = 159 bits (402), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 172/354 (48%), Gaps = 51/354 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+IIFILADDLG+ D+G +G I TPN+DA+A G +Y +C PSRS+ +TG+
Sbjct: 35 PNIIFILADDLGYGDLGCYGQKLIQTPNLDAMAKQGTRFTQFYAGTAVCAPSRSSFLTGQ 94
Query: 119 HPIHTGMQHNVLYGCERGG---LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
H HT ++ N G + G +P S + + LK+ GY T GKW LG P
Sbjct: 95 HTGHTPIRGNK--GVQPEGQWPIPDSAVTIAEVLKKAGYATGDFGKWGLGPVASSGDPIK 152
Query: 176 RGFESHLGY---WTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH-GKYSTDVFTAEAVD 231
+GF++ GY H Y DH +W D R D H YS D+ +A+
Sbjct: 153 QGFDAFYGYNCQTLAHDYYPDH------LWENDTRIDFAANTPEHPTDYSADLIHQKALQ 206
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL-------------------NIHRH 272
I H + P F++L++ H+A LQ PD L +
Sbjct: 207 FIDQHKGN-PFFVFLSYTLPHAA-----LQVPDDSLFEKYKKLLNEKPIPVPKWNGKNYA 260
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG------FNLN 326
+ + R+ +AA++ +LD VG+V++ L++ + N++++F SDNG G FN
Sbjct: 261 PQAYPRAAYAAMVSRLDVYVGQVLQKLKEAGIDKNTLVIFSSDNGPHKEGGYDPDYFN-- 318
Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SN PLRG+K L+EGG+R + W P G ++ D+LPT A
Sbjct: 319 --SNGPLRGIKRDLYEGGMREPMIAWWPGKIKAGATSDYVGAFWDFLPTFAELA 370
>gi|171910115|ref|ZP_02925585.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
Length = 460
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 112/325 (34%), Positives = 168/325 (51%), Gaps = 51/325 (15%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSA 113
A+ PP+I+ I ADDLG+ D+G +G I TP++D +A G+ ++Y ++CTPSR+A
Sbjct: 25 AADRPPNIVIIFADDLGYGDLGCYGSPTIATPHLDQMAAEGLRFTDFYVASEVCTPSRAA 84
Query: 114 IMTGKHPIHTGM--QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
++TG++P+ +GM + VL+ GGLP E LP+ LK GY T VGKWHLG ++
Sbjct: 85 LLTGRYPVRSGMYGKRRVLFPNSTGGLPAGEITLPEALKARGYATAHVGKWHLGIHEGS- 143
Query: 172 TPTFRGFESHLGY-WTGHQDY--------FDHSAEEMKMWGLDMRRDLE----PAWDLHG 218
P +GF+ G ++ D + W + + R+ E PA +H
Sbjct: 144 RPLDQGFDQSFGLPYSNDMDARPDLPKGSTGSPTPPIDGWNVPLLRNGEVVEKPADQVH- 202
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
T +T EAV I + +P FLY+AH+ P+ PL A E+FK
Sbjct: 203 --LTGHYTEEAVKFIQQKKS-QPFFLYMAHSF-----PHVPLFA----------SENFKG 244
Query: 279 SKFAAI----LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG-----GAAAGFNLNAAS 329
A I + +LD SVG+V++ L + + N+++ F SDNG GA G S
Sbjct: 245 RSRAGIYGDTVEELDWSVGQVLQTLRKEGLSENTLVFFTSDNGPWLIMGAQGG------S 298
Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSP 354
PL+ K + WEGG+R G+ W P
Sbjct: 299 AGPLKDGKGSHWEGGMRVPGIAWWP 323
>gi|423214667|ref|ZP_17201195.1| hypothetical protein HMPREF1074_02727 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692573|gb|EIY85810.1| hypothetical protein HMPREF1074_02727 [Bacteroides xylanisolvens
CL03T12C04]
Length = 459
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 107/337 (31%), Positives = 178/337 (52%), Gaps = 34/337 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+ + I+ADD+G+ DVG +G + I TPNID +A G++ ++++ + +P+R ++TG+
Sbjct: 29 PNFVIIVADDMGYGDVGIYGNEYIRTPNIDQMARKGMMFTDFHSNGSVSSPTRCGLLTGR 88
Query: 119 HPIHTGMQHNVLY----GCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+ G++ +L + GLP E + + + GYRT ++GKWHLG+ +K +
Sbjct: 89 YQQRAGLERVLLVPRNDKDKEVGLPSEEITFAKVMGDNGYRTALIGKWHLGYLQKHHPMN 148
Query: 175 FRGFESHLGYWTGHQDYFDHSAE--EMKMW-GLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
F GF+ +G+ +G+ DY H +M W GL+++ D+ G Y+T + T + D
Sbjct: 149 F-GFQKFVGFKSGNVDYQSHRNRYGDMDWWDGLEVK-------DMPG-YTTTLLTTLSED 199
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAP-DHYLNIHRHIEDFKRSK------FAAI 284
I + D+P LY+AHAA HS P+Q P + + E K S + +
Sbjct: 200 YI-KENKDKPFCLYIAHAAPHS-----PMQGPHEKAIRTEATPEGDKNSDRSNKEIYKDM 253
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
+ +LD SVG+++E L++ ++ N+ +VF SDNG N S +G K + WEG
Sbjct: 254 VEELDWSVGRILETLKKYKLDENTFVVFFSDNGPVIN----NGGSAGGYKGAKGSPWEGA 309
Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
R G+ + P G EQ V D PT+L A+
Sbjct: 310 HRVPGICYMPGTIKEGTTCEQTVMSFDLFPTMLDMAD 346
>gi|72161757|ref|YP_289414.1| N-acetylgalactosamine-6-sulfatase [Thermobifida fusca YX]
gi|71915489|gb|AAZ55391.1| n-acetylgalactosamine-6-sulfate sulfatase [Thermobifida fusca YX]
Length = 471
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 109/348 (31%), Positives = 170/348 (48%), Gaps = 36/348 (10%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTG 117
PP+I+FILADDLGW D+G +G I TPN+D LA GI Y C+ +R ++ TG
Sbjct: 20 PPNILFILADDLGWADLGCYGSTTIRTPNLDQLAAQGIRFTHGYAGSPWCSSTRISLYTG 79
Query: 118 KHP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
++P + G++ ++ G+P L L E GY T + GKWH G+ Y+P
Sbjct: 80 RYPGRLQAGLEEPLVTRSPENGIPEGHPTLSSLLVEAGYATAMFGKWHCGWLPW-YSPLR 138
Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
GFE+ G + G DYF+H K D+ P ++ Y T++ + A + I
Sbjct: 139 IGFETFFGNFDGALDYFEHVDTLGKA---DLYEGETPVEEV--GYYTEIISERAAEYITA 193
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHY---LNIHR------------HIEDFKRSK 280
H + P ++ L + A P+ P + PD + I R H++ +K
Sbjct: 194 HR-NRPFYVQLNYTA-----PHWPWEGPDDHEVGQEIRRRYQQRWEHSPLMHLDGGSIAK 247
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
+ ++ +D +G+V+ AL++ N+I+VF SDNGG + NWP G K L
Sbjct: 248 YGELVEAMDAGIGQVLAALDRAGAADNTIVVFSSDNGGE------RWSKNWPFVGEKGDL 301
Query: 341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
EGG+R ++ P + V++ V DW TLL+AA P++
Sbjct: 302 TEGGIRVPLIVAWPEAIAGNQVSDHPVITMDWTATLLAAAGTEPHPDW 349
>gi|343087313|ref|YP_004776608.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342355847|gb|AEL28377.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 477
Score = 159 bits (401), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/318 (34%), Positives = 171/318 (53%), Gaps = 30/318 (9%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
SS PP+I+ I D++G+ D F D I TPN++ LA G +L N+YT CT SR++
Sbjct: 34 SSAPPNILIITVDNIGYGDFKFLNKDSPIITPNLNKLAKQGAVLTNFYTGSPTCTASRAS 93
Query: 114 IMTGKHPIHTGMQHNV--LYGCERGGLPLSEKILPQYLKEL--GYRTRIVGKWHLGFYKK 169
++TG+ P + + + L G GLP +E I+P+ LK GY+T GKW++GF
Sbjct: 94 LLTGRIPQRNKLDYQLGGLAGNYGIGLPQTEIIIPKVLKSSPHGYQTGAFGKWNIGFAPG 153
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL---EPAWDLHGKYSTDVFT 226
PT RGF+ LG +G+ D+F H R DL + + HG+YSTD+F
Sbjct: 154 S-RPTERGFDEFLGIASGNVDHFSHVYAG--------RHDLYHNTTSINRHGEYSTDLFA 204
Query: 227 AEAVDIIHNHSTD-EPLFLYLAHAATH-----SANPYEP--LQAPDHYLNIHRHIEDFK- 277
A+D I+ +++ +P +YL A H + P E +APD+ + D +
Sbjct: 205 NAAIDFINEKTSENKPWMVYLPFDAPHHPGDRNIAPGEENIWKAPDYAFQPYGLSPDERD 264
Query: 278 -RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
+ +F AI+ +D +VG+++++L++ + N+ + F SDNG L+ SN PL G
Sbjct: 265 PKKRFNAIVTAIDMAVGRLMDSLDELGIADNTFVFFYSDNGAFYP--YLDIQSNAPLNGA 322
Query: 337 KNTLWEGGVRGAGLIWSP 354
TLWEGG+R L+ P
Sbjct: 323 GVTLWEGGIRVPALVRWP 340
>gi|395508489|ref|XP_003758543.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Sarcophilus harrisii]
Length = 492
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 103/334 (30%), Positives = 163/334 (48%), Gaps = 42/334 (12%)
Query: 70 LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
+GW D+G G TP++D +A G++ N+YT LC+PSR+A++TG+ PI G
Sbjct: 33 MGWGDLGVFGEPSKETPHLDQMAAEGMLFPNFYTANPLCSPSRAALLTGRLPIRNGFYTT 92
Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
+ GG+P SE +LP+ LK+ GY +IVGKWHLG ++ ++ P GF+
Sbjct: 93 NAHARNAYTPQEIVGGIPDSEFLLPELLKKAGYVNKIVGKWHLG-HRPQFHPLKHGFDEW 151
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
G H +D+ A + + R+ W++ G++ T ++
Sbjct: 152 FGAPNCHFGPYDNKARP----NIPVYRN----WEMVGRFFEDFPINLKTGEANLTQIYLQ 203
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
EAVD I + +P FLY A ATH+ P+ A +L +R ++ + +
Sbjct: 204 EAVDFIKQQAHQQPFFLYWAVDATHA-----PVYASKSFLGTS------QRGRYGDAVRE 252
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+D+ VGK+++ L+ + N+ + F SDNG A + SN P K T +EGG+R
Sbjct: 253 IDDCVGKILKLLQDLGIQENTFVFFTSDNGAALISAPIQGGSNGPFLCGKQTTFEGGMRE 312
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ W P G V+ Q +V D T LS A
Sbjct: 313 PAIAWWPGHIPAGRVSHQLGNVMDLFTTSLSLAG 346
>gi|399033016|ref|ZP_10732099.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
gi|398068627|gb|EJL60037.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
Length = 460
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/353 (31%), Positives = 179/353 (50%), Gaps = 47/353 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II I+ADD GWNDV +HG I TPNID LA +G+ L +Y C+PSR+++ TG+
Sbjct: 35 PNIILIIADDAGWNDVEYHG-SVIQTPNIDFLAKNGVELNRFYANPTCSPSRASLFTGRP 93
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
G+ + + LP S LP+ L + Y+T ++GKWHLG ++ P GF+
Sbjct: 94 ASRMGIVAPI-SDKSQFKLPDSIATLPKLLHQNNYQTALIGKWHLGL-QQSSGPKAYGFD 151
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY------STDVFTAEAVD-I 232
G+ G D + H + ++ + +W +G++ +TD+ T EA+ +
Sbjct: 152 YSYGFLHGQIDQYTH-----------LYKNGDKSWYRNGEFIDEQGHATDLITNEAIHWL 200
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAP----DHYLNIHRHIEDFKRSKFAAILHKL 288
+++ FL +A++A P+ PLQ D Y+N I+D R FAA + L
Sbjct: 201 SEKRDSNKNFFLEVAYSA-----PHFPLQEEQKWKDPYMN---SIKDSSRRDFAAAMSHL 252
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGG-----------AAAGFNLNAASNWPLRGVK 337
D S+G ++E L+Q + N++++F+SDNG N N PLR K
Sbjct: 253 DYSIGILLEKLKQENLDKNTLVIFMSDNGAMENWDSRNEYNGVHPSNTVLGDNSPLRDWK 312
Query: 338 NTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYV 389
+ +EGG+R + W L + +E Y+ + D LP++L A ++P +
Sbjct: 313 TSNYEGGIRVPCIFYWKDHL--KNYKSESYISIIDLLPSILFLAGDKNLPQSI 363
>gi|430741911|ref|YP_007201040.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430013631|gb|AGA25345.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 469
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 166/340 (48%), Gaps = 39/340 (11%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQL-CTPSRSAIMTGK 118
P+I+FILADDLG+ DV +G TP+ID LA G+ + YT C P+R+A+M+G+
Sbjct: 32 PNIVFILADDLGFTDVACYGSRYYETPHIDKLAAQGLKFTDGYTCGPNCQPTRAALMSGQ 91
Query: 119 HPIHTGM-----------QHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
+ TG+ + L + GLPL + + Q L+ GY T + GKWHLG
Sbjct: 92 YGPRTGIYTVGSIDRFDWKSRPLRPVDNAQGLPLGKVTIAQTLQAAGYATGMFGKWHLG- 150
Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT 226
+ +Y P RGF+ + H D+ +A E+ PA G Y D T
Sbjct: 151 QQGDYHPRRRGFDEAIVSMGQHFDF--TTAPEVTY----------PA----GTYLADFLT 194
Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
+AVD I H DEP FLYL H HS PY+ + +AA+L
Sbjct: 195 DKAVDFITRHK-DEPFFLYLPHFGVHS--PYQAKKDLVKKFAEKAAAGGHHDPTYAAMLA 251
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA------GFNLNAASNWPLRGVKNTL 340
+DESVG+VV+ L++ + +++++F SDNGG G N PLRG K L
Sbjct: 252 SVDESVGRVVKTLDELGLSDDTLVIFSSDNGGVGGYEREGIGKAGAVTDNAPLRGGKGML 311
Query: 341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+EGG+R L P G V ++ ++ D PTL+ A
Sbjct: 312 YEGGIRAPYLFRWPGKIPAGTVCDRAINSVDLYPTLVELA 351
>gi|332666885|ref|YP_004449673.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
1100]
gi|332335699|gb|AEE52800.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
1100]
Length = 443
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 112/360 (31%), Positives = 178/360 (49%), Gaps = 26/360 (7%)
Query: 30 YRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNID 89
Y T+ LPL MV+ + A P+II+I+ DDLG+ D+ +G TPNID
Sbjct: 3 YLTQQYVKKCLPLFVLTFMVYSTVHAQH--PNIIYIMTDDLGYGDLSGYGRKDFLTPNID 60
Query: 90 ALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHT--GMQHNVLYGCERG---GLPLSEK 143
LA GI N Y+ LCTP+R+A MTG++P T G+ L +R GL +
Sbjct: 61 KLAAQGIKFVNAYSAAPLCTPTRTAFMTGRYPARTPVGLMEP-LTPSKRDSTVGLTAAFP 119
Query: 144 ILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWG 203
+ ++ GY T ++GKWHLGF + +P GF+ G +G DY H
Sbjct: 120 SVATLMRASGYETALIGKWHLGFLPQN-SPVKNGFDYFFGIHSGAADYISHKTGPAGRRI 178
Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
D+ + + + Y TD+FT +AV + ++P FL L + A H P Q P
Sbjct: 179 HDLYENDQAVYP--EGYLTDLFTQKAVTFL-KQKHNKPFFLTLTYNAAH-----WPWQGP 230
Query: 264 DH--YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA 321
+ Y++ + +AA++++LDE +G++++ L++ ++ + +I++F +DNGG
Sbjct: 231 NDKPYVDSVDLRIGGSAAVYAAMMNRLDEGIGEIIKTLDEEQLSNQTIVIFTNDNGGE-- 288
Query: 322 GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ N L K LWEGG+R ++ P + G +Q DW T+LSA
Sbjct: 289 ----KYSDNGGLAKAKGALWEGGIRVPAIVKWPGKITAGSSTQQVAITMDWTKTILSAGG 344
>gi|410030097|ref|ZP_11279927.1| sulfatase [Marinilabilia sp. AK2]
Length = 476
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 109/327 (33%), Positives = 172/327 (52%), Gaps = 32/327 (9%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+II I DD G++DVG G I TP++D LA G+ N+Y Q +C+ SR+A++TG
Sbjct: 30 PNIIIIFTDDQGYHDVGVFGASDIATPHLDQLAAEGVQFTNFYVAQAVCSASRAALLTGV 89
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G+ H L R GL E + LK LGY T + GKWHLG Y E+ PT +GF
Sbjct: 90 YSNRLGI-HGALDHMSRYGLHPEEATIADILKPLGYATAMFGKWHLGHY-PEFLPTNQGF 147
Query: 179 ESHLG------YWTGH---QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
+ +LG W H +DY+ + ++ D + ++ W+ ++T +FT ++
Sbjct: 148 DEYLGIPYSNDMWPNHPQTKDYY----PPLPLYQND--KVIDTIWNDQSMFTT-LFTEKS 200
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
+D I + D P FLYLAH P+ PL D + R + ++ ++D
Sbjct: 201 IDFIERNK-DRPFFLYLAHPM-----PHVPLFVSDKFKG------KSNRGLYGDVIMEID 248
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
SVG+++E L+ + N+II+F+SDNG + ++ ++ S +PLR K T W+GGV+
Sbjct: 249 WSVGQIMETLKHYDLDENTIIIFLSDNGPWLS-YSGHSGSAYPLREGKGTSWDGGVKVPV 307
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTL 376
++ + G+V D LPTL
Sbjct: 308 IMRWKGTWASGLVQPNPAMSIDILPTL 334
>gi|410611985|ref|ZP_11323071.1| N-acetylgalactosamine-6-sulfatase [Glaciecola psychrophila 170]
gi|410168398|dbj|GAC36960.1| N-acetylgalactosamine-6-sulfatase [Glaciecola psychrophila 170]
Length = 508
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 106/335 (31%), Positives = 168/335 (50%), Gaps = 50/335 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+I+FI ADDLG+ D+G +G I TPNID +A G +Y +CTPSR+ ++TG+
Sbjct: 50 PNIVFIFADDLGYGDIGAYGSTTINTPNIDKMAAQGAKFDEFYAASPVCTPSRAGLLTGR 109
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+PI G+ HNV + G+ E + + LK GY T +VGKWHLG + ++Y P +GF
Sbjct: 110 YPIRQGI-HNVFFPESFQGMDPEEITIAEVLKGAGYATGLVGKWHLG-HHEQYMPWNQGF 167
Query: 179 ESHLGYW----TGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
+ G G YF++ + + ++ +Y T +T +A+ I
Sbjct: 168 DEFFGLPYSNDMGGLYYFNNKDIDFE--------------EVDQRYMTKTYTDQALQFID 213
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
H ++P FLYLAH P+ P+ A + + ++ +LD SVG+
Sbjct: 214 KHQ-EQPFFLYLAHNM-----PHVPIYASPEFEGQS------AGGAYGDVVEELDWSVGQ 261
Query: 295 VVEALEQRRMLSNSIIVFVSDNG--------GAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
V++ L++ + N++++F SDNG G +AG PLR K +EGG+R
Sbjct: 262 VLKRLDELNLSENTLVIFTSDNGPWLLMGDEGGSAG---------PLREGKQFTFEGGMR 312
Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ + P S+G + + DWLPT + AN
Sbjct: 313 VPTVAYWPGTISQGTKPQGLATMMDWLPTFANLAN 347
>gi|348516447|ref|XP_003445750.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Oreochromis
niloticus]
Length = 525
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 114/362 (31%), Positives = 172/362 (47%), Gaps = 35/362 (9%)
Query: 42 LAFTLSMVFVDLVASS-------GP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
+A TL F ++ S P P+II +L DD+GW D+G G TPN+DA+A
Sbjct: 5 VALTLITFFGAIICCSWTKSIPNAPTPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAA 64
Query: 94 SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKIL 145
G++L N+YT LC+PSR+A++TG+ PI G + GG+ E +L
Sbjct: 65 EGMLLPNFYTANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGISKDEILL 124
Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGL 204
PQ LK GY ++IVGKWHLG ++ +Y P GF+ G H Y D S + ++
Sbjct: 125 PQLLKTKGYVSKIVGKWHLG-HRPQYLPLKNGFDEWFGSPNCHFGPYNDQSKPNIPVYNN 183
Query: 205 D-----MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYE 258
D + + T ++ E +D I + +P FLY A ATH+
Sbjct: 184 SEMLGRFYEDFKIDRNTGESNLTQIYLMEGLDFILRQTKAQQPFFLYWAVDATHA----- 238
Query: 259 PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
P+ A +L +R ++ + +LD S+G++++ L + +N+ + F SDNG
Sbjct: 239 PVYASKRFLG------KSQRGRYGDAVMELDYSIGQILKWLRTLGIDNNTFVFFTSDNGA 292
Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
A + SN P K T +EGG+R + W P G V Q +V D T L
Sbjct: 293 AVMSGPNESGSNGPFLCGKETTFEGGMREPAIAWWPGHIKEGSVNFQLANVMDLFTTSLV 352
Query: 379 AA 380
A
Sbjct: 353 LA 354
>gi|432851911|ref|XP_004067103.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 2
[Oryzias latipes]
Length = 523
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 116/374 (31%), Positives = 175/374 (46%), Gaps = 53/374 (14%)
Query: 38 AVLPLAFTLSMVFVDLVASSGP----PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
A +P LS ++ + P P+II +L DD+GW D+G G TPN+DA+A
Sbjct: 5 ASVPFLSVLSATICCFLSENTPNTSSPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAA 64
Query: 94 SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKIL 145
GI+ ++YT LC+PSR+A++TG+ PI G N E GG+ E +L
Sbjct: 65 QGILFPDFYTANPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGISKDEILL 124
Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
PQ LKE GY +IVGKWHLG ++ +Y P GF+ G H ++++
Sbjct: 125 PQLLKEKGYVNKIVGKWHLG-HRPQYLPLENGFDEWFGAPNCHFGPYNNTV--------- 174
Query: 206 MRRDLEPAW---DLHGKYS--------------TDVFTAEAVDIIHNHS-TDEPLFLYLA 247
R P + ++ G+Y T ++ +D I + P FLY A
Sbjct: 175 --RPNIPVYNNSEMLGRYFEEFKIDKKTGESNLTQMYLEAGLDFISRQAEAKRPFFLYWA 232
Query: 248 HAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSN 307
ATHS P+ A +L +R ++ + +LD SVG+++ L++ + +N
Sbjct: 233 ADATHS-----PVYASKPFLG------KSQRGRYGDAVMELDYSVGRILTKLQRLGIDNN 281
Query: 308 SIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYV 367
+ + F SDNG A SN P K T +EGG+R + W P G V+ Q
Sbjct: 282 TFVFFTSDNGAALMSGPNEGGSNGPFLCGKETTFEGGMREPAIAWWPGRIKGGTVSFQPA 341
Query: 368 HVSDWLPTLLSAAN 381
+V D T L+ A
Sbjct: 342 NVMDLFTTSLALAG 355
>gi|405970955|gb|EKC35816.1| N-acetylgalactosamine-6-sulfatase [Crassostrea gigas]
Length = 511
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 106/344 (30%), Positives = 172/344 (50%), Gaps = 45/344 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+ I +L DD+GW D+G G TP +D +A G++ ++Y+ LC+PSR+A++TG+
Sbjct: 20 PNFILMLMDDMGWGDLGVFGEPNKETPYLDQMAAEGMLFPDFYSANPLCSPSRAALLTGR 79
Query: 119 HPIHTGM------------QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
PI G N++ GG+P E +LP+ L++ GY++++VGKWHLG
Sbjct: 80 LPIRNGFYTTNGHARNAYTPQNIV-----GGIPDEEILLPELLQKAGYKSKLVGKWHLG- 133
Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS----- 221
++ +Y P GF+ G H +D+ + + R+ E A + +
Sbjct: 134 HQAKYLPLKHGFDEWFGAPNCHFGPYDNVHTP----NIPVYRNEEMAGRYYQDFKIEKNG 189
Query: 222 ----TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
T ++T EAV+ I H+ +P FLY A ATH EPL A +L
Sbjct: 190 ESNLTQLYTKEAVEFITRMHNKSKPFFLYWAVDATH-----EPLYASKPFLGTSN----- 239
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
R + ++ +LD +VGK++ +L ++ +N+++ F SDNGGA + SN P
Sbjct: 240 -RGLYGDVVRELDSAVGKILASLRSLKIDNNTLVFFSSDNGGATYA-KQHGGSNGPFLCG 297
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
K T +EGG+R + W P +G V+ Q + D T + A
Sbjct: 298 KETTFEGGMREPTIAWWPGHIKKGQVSHQVGSLMDLYTTFVDFA 341
>gi|432342876|ref|ZP_19592103.1| arylsulfatase [Rhodococcus wratislaviensis IFP 2016]
gi|430772100|gb|ELB87901.1| arylsulfatase [Rhodococcus wratislaviensis IFP 2016]
Length = 459
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/346 (31%), Positives = 161/346 (46%), Gaps = 37/346 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+++FIL D++GW D G +G PTP ID LA GI +Y CTP+RSAIMTG+H
Sbjct: 7 PNVVFILVDNVGWGDFGVYG-GTTPTPRIDKLASEGIRFNDYNVEAQCTPTRSAIMTGRH 65
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
P+ +G G G+ E + L E GY T + GKWHLG + PT +GF+
Sbjct: 66 PVRSGTYKLSWPGEGMSGMAPWEYTIADLLSEAGYATALYGKWHLGNTEGR-LPTDQGFD 124
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD-------- 231
W G +D +D + W L L G+ A +D
Sbjct: 125 E----WWGIKDSWDEAG--YTAWPLYQELGLPAPMIWEGRKGEQSTPAMPLDLDVRPIVD 178
Query: 232 -----------IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
I N + +P F+YL ++ H P+ A +Y N +
Sbjct: 179 EKYLVPKTVEFIKKNAAAKKPFFVYLGYSQMHP-----PMIANPNYANTSTA----RSGV 229
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG-VKNT 339
+A L ++D+ VG++V+A++ + N+IIV SDN G A ++ SN P RG NT
Sbjct: 230 YADCLAEMDDRVGQIVDAVKAAGIEDNTIIVLSSDNAGGGALPHVGPGSNGPWRGNFFNT 289
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
+EG +R + +I P G+V Q + DWLPTL S++
Sbjct: 290 PFEGSMRVSAMIRWPGAIPAGVVTAQTLAAVDWLPTLAGMVGASEL 335
>gi|432851909|ref|XP_004067102.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 1
[Oryzias latipes]
Length = 525
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 116/374 (31%), Positives = 175/374 (46%), Gaps = 53/374 (14%)
Query: 38 AVLPLAFTLSMVFVDLVASSGP----PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
A +P LS ++ + P P+II +L DD+GW D+G G TPN+DA+A
Sbjct: 5 ASVPFLSVLSATICCFLSENTPNTSSPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAA 64
Query: 94 SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKIL 145
GI+ ++YT LC+PSR+A++TG+ PI G N E GG+ E +L
Sbjct: 65 QGILFPDFYTANPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGISKDEILL 124
Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
PQ LKE GY +IVGKWHLG ++ +Y P GF+ G H ++++
Sbjct: 125 PQLLKEKGYVNKIVGKWHLG-HRPQYLPLENGFDEWFGAPNCHFGPYNNTV--------- 174
Query: 206 MRRDLEPAW---DLHGKYS--------------TDVFTAEAVDIIHNHS-TDEPLFLYLA 247
R P + ++ G+Y T ++ +D I + P FLY A
Sbjct: 175 --RPNIPVYNNSEMLGRYFEEFKIDKKTGESNLTQMYLEAGLDFISRQAEAKRPFFLYWA 232
Query: 248 HAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSN 307
ATHS P+ A +L +R ++ + +LD SVG+++ L++ + +N
Sbjct: 233 ADATHS-----PVYASKPFLG------KSQRGRYGDAVMELDYSVGRILTKLQRLGIDNN 281
Query: 308 SIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYV 367
+ + F SDNG A SN P K T +EGG+R + W P G V+ Q
Sbjct: 282 TFVFFTSDNGAALMSGPNEGGSNGPFLCGKETTFEGGMREPAIAWWPGRIKGGTVSFQPA 341
Query: 368 HVSDWLPTLLSAAN 381
+V D T L+ A
Sbjct: 342 NVMDLFTTSLALAG 355
>gi|406661802|ref|ZP_11069914.1| Arylsulfatase [Cecembia lonarensis LW9]
gi|405554347|gb|EKB49446.1| Arylsulfatase [Cecembia lonarensis LW9]
Length = 486
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 120/357 (33%), Positives = 177/357 (49%), Gaps = 56/357 (15%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+IIFILADDLG+ D+GF G I TPN+D LA GI N+Y +C PSRS +MTG
Sbjct: 34 PNIIFILADDLGYGDLGFLGQKIIETPNLDRLAAEGIFFSNHYAGATVCAPSRSTLMTGL 93
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKIL--PQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
H HT ++ N E G PL + IL P+ KE GY T GKW LGF E P +
Sbjct: 94 HTGHTPVRGNFEIQPE-GQYPLPDTILTIPRLFKEAGYATGAFGKWGLGFVGTEGDPNHQ 152
Query: 177 GFESHLGY---WTGHQ---DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
GF+ GY H+ +Y +AE++ + G D W Y+ D+ E++
Sbjct: 153 GFDQFFGYNCQRIAHRYYPEYLWDNAEKVFLEGND--------WTQKITYAADIIHKESL 204
Query: 231 DIIHNHSTDEPLFLYLAHAATHS----------------ANPYEPLQAP------DHYLN 268
I +++ + P F+++ H+ P EP AP D +N
Sbjct: 205 KFIRDNANN-PFFMFVTSVMPHAELAVPDGELLAYYREKIGPEEPHIAPRGSDYGDEPIN 263
Query: 269 IHRHIED-FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--- 324
I + + + ++ +AA++ LD VG++++ L++ + +II+F SDNG G N
Sbjct: 264 IPGYQSNPYPKASYAAMVALLDRQVGEIMDKLDELGLTEQTIIIFTSDNGPHQEGGNDPD 323
Query: 325 -LNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVS---DWLPTL 376
N SN RG K L+EGG+R ++ W +++ Q HVS D+LPT
Sbjct: 324 FFN--SNGIFRGYKRDLYEGGIRVPMILKWPGVIQG----GAQSGHVSAFWDFLPTF 374
>gi|346225028|ref|ZP_08846170.1| sulfatase [Anaerophaga thermohalophila DSM 12881]
gi|346226998|ref|ZP_08848140.1| sulfatase [Anaerophaga thermohalophila DSM 12881]
Length = 488
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 114/341 (33%), Positives = 169/341 (49%), Gaps = 43/341 (12%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+ I I ADDLG+ D+G G I TPN+D +A+ G N+Y +CTPSR+ ++TG+
Sbjct: 36 PNFIVIFADDLGYGDLGIFGHPTINTPNLDRMAFEGQKWTNFYVAAPVCTPSRAGLLTGR 95
Query: 119 HPIHTGM---QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
PI +GM + VL+ GGLP SE + + LKE GY+T +GKWHLG +K + PT
Sbjct: 96 LPIRSGMCSDKRRVLFPDSNGGLPQSEITIARALKENGYQTAAIGKWHLG-HKSPFLPTD 154
Query: 176 RGFESHLGY----------WTGHQDYFDHS---AEEMKMWGLD---MRRDLEPAWDLHGK 219
GF+S+ G TG +Y AEE + +R D +
Sbjct: 155 HGFDSYFGIPYSNDMDRVPGTGTGNYITSQIKLAEEENFHAYNVPILRNDKIIERPADQR 214
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
T +T EAV I + D P F+YLAH H + + R E+FK
Sbjct: 215 TITKRYTEEAVAKIKSMK-DGPFFIYLAHNLPH--------------IPLFRS-EEFKDV 258
Query: 280 KFAA----ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
A ++ ++D SVG++++ LE+ + N+++VF SDNG F + + LRG
Sbjct: 259 SLAGFYGDVIEEIDWSVGQILKTLEEEGLDENTLVVFTSDNGPWHI-FETHGGTAGLLRG 317
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
K +EGG+R + W P +G+V + + D PT
Sbjct: 318 AKGGTFEGGMREPTIFWWPGKLKQGVVMDMATTM-DLFPTF 357
>gi|32475083|ref|NP_868077.1| N-acetylgalactosamine-6-sulfate sulfatase [Rhodopirellula baltica
SH 1]
gi|32445623|emb|CAD75624.1| N-acetylgalactosamine-6-sulfate sulfatase [Rhodopirellula baltica
SH 1]
Length = 484
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 112/345 (32%), Positives = 182/345 (52%), Gaps = 40/345 (11%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
P+I+ ILADDLG+ D+G +G D+ TP +D LA G+ + Y C+P+R+A++TG+
Sbjct: 39 PNIVLILADDLGYGDLGCYGNDEQATPVLDRLATQGVRWTQAYANGPECSPTRAALLTGR 98
Query: 119 HPIHTGMQH------NV--------LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
+ H G NV L+ GLP + L + L +GY T + GKWHL
Sbjct: 99 YQQHVGGLECAIGVGNVGRYDDAIRLHLVNELGLPANRPTLAKRLSSVGYETALFGKWHL 158
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTD 223
G Y+ +++P GF+ L G DY+ H + + + L + G+ Y TD
Sbjct: 159 G-YEAKFSPMMHGFDEALYCIGGAMDYY-HYLDSVATYNL-----FHNGRPISGEGYFTD 211
Query: 224 VFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYE-PLQAPDHYLNIHRHIEDFKRSK- 280
T +AV I + ++ D+P FLYL + A H+ PY+ P ++P L I + +K++
Sbjct: 212 TITDQAVRFIGDRNANDKPFFLYLPYTAPHT--PYQAPGESPVDPLPIDSPL--WKQNAD 267
Query: 281 ----FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
+ A++ +DE +GKV+ A+E+ +M ++++F SDNGG + A+ N PLRG
Sbjct: 268 PPGVYRAMVRHMDEGIGKVLHAIEESKMTDRTLVIFASDNGGTS------ASRNEPLRGF 321
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K +EGG+R + P G+V++Q D ++L+AA
Sbjct: 322 KGQAFEGGIRVPLIARWPGHLPEGVVSDQVTITFDLTASMLAAAG 366
>gi|383780433|ref|YP_005464999.1| putative sulfatase [Actinoplanes missouriensis 431]
gi|381373665|dbj|BAL90483.1| putative sulfatase [Actinoplanes missouriensis 431]
Length = 464
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 117/364 (32%), Positives = 170/364 (46%), Gaps = 34/364 (9%)
Query: 43 AFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNY 101
A T F SS P+I+FILADDLGW D+ +G +I TPN+D LA SG+ + Y
Sbjct: 31 AVTKETPFRARPGSSKRPNILFILADDLGWADLSSYGSPEIRTPNLDRLARSGVRFTQGY 90
Query: 102 YTVQLCTPSRSAIMTGKHPIHT--GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
+C+P+R A+ TG+ P T G+ + ER G+P + L LK+ GY T ++
Sbjct: 91 SAGAVCSPTRVALYTGRLPGRTPGGLPEPIGSVNERDGIPATHPTLASLLKKTGYATALI 150
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
GKWH GF ++P G++ G ++G DY+ + K D+ D G+
Sbjct: 151 GKWHGGFLPN-FSPLRTGWDYFFGNYSGGVDYYSKYSHTGKY---DLFEGETEVPD-DGR 205
Query: 220 YSTDVFTAEAVD-IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
Y TDV +A D I +P L L + +P+ P +AP K
Sbjct: 206 YYTDVIADKAEDWITKAKKQGKPWLLNL-----NFTSPHWPWEAPGDRKISEEITAKIKA 260
Query: 279 SKFAAILH--------------KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN 324
AI H LD +G+V+ AL + N+++VF SDNGG +
Sbjct: 261 GNTQAIFHNDGGSLETYTKMVENLDARIGQVLNALRRSGQEENTLVVFASDNGGERFSY- 319
Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
NWPL G K +L EGG+R ++ P G V+ + V DW TLL A +
Sbjct: 320 -----NWPLTGNKGSLSEGGIRVPTILRWPARIRSGQVSHEPVVTHDWTATLLEIAGAAP 374
Query: 385 IPNY 388
P++
Sbjct: 375 SPDH 378
>gi|406661522|ref|ZP_11069640.1| Arylsulfatase [Cecembia lonarensis LW9]
gi|405554671|gb|EKB49747.1| Arylsulfatase [Cecembia lonarensis LW9]
Length = 477
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 108/328 (32%), Positives = 174/328 (53%), Gaps = 34/328 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+II I DD G++DVG +G I TP++D LA G+ N+Y Q +C+ SR+A++TG
Sbjct: 31 PNIILIFTDDQGYHDVGVYGASDIETPHLDQLASEGLQFTNFYVAQAVCSASRAALLTGT 90
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+P G+ H L + GL E + LK LGY T + GKWHLG + E+ PT +GF
Sbjct: 91 YPNRLGI-HGALDHSSKHGLHPEEATIADLLKPLGYATAVFGKWHLG-HHPEFLPTNQGF 148
Query: 179 ESHLG------YWTGH---QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
+ + G W H +DY+ + ++ D + ++ W+ ++T FT ++
Sbjct: 149 DEYFGIPYSNDMWPNHPQTKDYY----PPLPIYQND--KVVDTIWNDQSMFTT-WFTEKS 201
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
+D I + D P FLYLAH P+ PL D + R + ++ ++D
Sbjct: 202 IDFIERNK-DRPFFLYLAHPM-----PHVPLFVSDKFKG------KSNRGLYGDVIMEID 249
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
SVG++++ L+Q + N+II+F+SDNG + ++ ++ S +PLR K T W+GGV+
Sbjct: 250 WSVGQIMQTLKQNDLDENTIIIFLSDNGPWLS-YSEHSGSAYPLREGKGTSWDGGVKVPA 308
Query: 350 LI-WSPLLESRGIVAEQYVHVSDWLPTL 376
++ W S G+V + D LP L
Sbjct: 309 IMRWKGTWPS-GLVQKNPAMTIDILPML 335
>gi|300773187|ref|ZP_07083056.1| N-acetylgalactosamine-6-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
gi|300759358|gb|EFK56185.1| N-acetylgalactosamine-6-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
Length = 443
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/353 (31%), Positives = 175/353 (49%), Gaps = 25/353 (7%)
Query: 46 LSMVFVDLVA-----SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
L +F+ L+A + P+ I I DD+G+ DVG +G I TPN+D +A G+ N
Sbjct: 7 LITLFICLLAVFNSSAQTQPNFIIIYVDDMGYGDVGINGNPNIETPNLDRMAMEGMRFSN 66
Query: 101 YYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
YY+ CT SR A++TGK+P G + VL ++ G+ E + + LKE GYRT I
Sbjct: 67 YYSASPACTASRYALLTGKYPSRAGFRW-VLNPTDQIGIHQQESTIAERLKEKGYRTAIY 125
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
GKWHLG +KE+ P GF+ ++G + + + G D +L P
Sbjct: 126 GKWHLGSTRKEFLPLANGFDEYVGLPYSNDMIPPKYPDIALLSGYD-TLELNP----DQS 180
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
T ++T +A+ I ++ +P F+YL +A H+ PL A + +L KR
Sbjct: 181 KLTRLYTEKAIAFITKNA-KQPFFIYLPYAMPHT-----PLHASEDFLGKS------KRG 228
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
+ ++ +LD +G+++ L++ ++ + +VF SDNG N N S R K +
Sbjct: 229 LYGDVVQELDHHIGRLLTFLKENKLDQQTYVVFTSDNGPWLIQ-NQNGGSAGLFRDGKGS 287
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNST 392
WEGG+R +W +G V + D LPT+ + A S PN ++ T
Sbjct: 288 TWEGGMREPFFLWGHHTIPKGYVENEVFTALDMLPTITALAGISAGPNKIDGT 340
>gi|423348778|ref|ZP_17326457.1| hypothetical protein HMPREF1060_04129 [Parabacteroides merdae
CL03T12C32]
gi|409212738|gb|EKN05774.1| hypothetical protein HMPREF1060_04129 [Parabacteroides merdae
CL03T12C32]
Length = 461
Score = 157 bits (397), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 108/337 (32%), Positives = 168/337 (49%), Gaps = 40/337 (11%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+I+ +LADD G+ D GF G I TPNID LA G I + + + +PSRS ++TG+
Sbjct: 31 PNILILLADDAGYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLTGR 90
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G + N+ + GLP E++LP LK YRT +GKWHLG + P +GF
Sbjct: 91 YGQRYGYECNLDKPGD--GLPDDEELLPALLKRYDYRTGCIGKWHLG-SEPSQRPNAKGF 147
Query: 179 ESHLGYWTGHQDYF--------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
++ G GH+ YF D + ++ + G + D Y TD ++A
Sbjct: 148 DTFYGLLAGHRSYFYDPETSDKDGNLQQYQYNGRKLSFD---------SYFTDELASKAQ 198
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
+ +++P LY++ A HS N +A + L E R K+AA+++ LD
Sbjct: 199 QFVTE--SEQPFMLYMSFTAPHSPN-----EATEEDLA---RFEGQPRQKYAAMMYALDR 248
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA-G 349
VGK+V+ L+ N+II F+SDNGG+ N +SN PL+G K +EGG R
Sbjct: 249 GVGKIVDELKAAGKFDNTIIFFLSDNGGSTT----NQSSNLPLKGFKGNKFEGGQRVPFF 304
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
++W R +++ ++ L + + DIP
Sbjct: 305 VVWG----DRFKRDQRFTGLTSSLDIFATVVDALDIP 337
>gi|354465430|ref|XP_003495183.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cricetulus
griseus]
Length = 493
Score = 157 bits (397), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 106/338 (31%), Positives = 165/338 (48%), Gaps = 43/338 (12%)
Query: 67 ADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM 125
A+ +GW D+G +G TPN+D +A G++ N+Y+ LC+PSR+A++TG+ PI G
Sbjct: 8 AEVMGWGDLGVYGEPSRETPNLDQMALEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGF 67
Query: 126 ------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
N E GG+P SE +LP+ LK+ GY +IVGKWHLG ++ ++ P GF
Sbjct: 68 YTSNGHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGF 126
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDV 224
+ G H +D+ + + + RD W++ G++ T +
Sbjct: 127 DEWFGSPNCHFGPYDNKVKP----NIPVYRD----WEMVGRFYEEFPINLKTGEANLTQL 178
Query: 225 FTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
+ EA+D I H+ P FLY A ATH+ P+ A +L R ++
Sbjct: 179 YLQEALDFIRTQHARQSPFFLYWAIDATHA-----PVYASRPFLGTSH------RGRYGD 227
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
+ ++D+SVGK++ L+ + N+ + F SDNG A SN P K T +EG
Sbjct: 228 AVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPKEGGSNGPFLCGKQTTFEG 287
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
G+R + W P + G V+ Q + D T LS A
Sbjct: 288 GMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSIAG 325
>gi|87308809|ref|ZP_01090948.1| arylsulfatase A [Blastopirellula marina DSM 3645]
gi|87288520|gb|EAQ80415.1| arylsulfatase A [Blastopirellula marina DSM 3645]
Length = 474
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 112/355 (31%), Positives = 169/355 (47%), Gaps = 49/355 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+ +F L DDLGW DVG +G TPNID LA SG+ N Y +C+P+R++IMTG+
Sbjct: 24 PNFVFFLVDDLGWADVGCNGSTFHETPNIDKLAGSGMRFTNGYAACPVCSPTRASIMTGR 83
Query: 119 HPIHT-------GMQ------HNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
HP+ GM+ H + +R L E + + L+ GY T GKWHLG
Sbjct: 84 HPVRVDITDWIPGMKAKGAKNHRFQHIHDRDDLAPEETTIAESLRADGYSTFFAGKWHLG 143
Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWG---LDMRRDLEPAWDLHGKYST 222
+ PT +GF+ ++G GH D W L+ + D G+Y T
Sbjct: 144 --DQGNWPTEQGFDINIG---GH-DRGSPPGGYYSPWNNPVLEAKHD--------GEYLT 189
Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA-PDHYLNIHRHIEDFK---- 277
+ T E++ + +P LYLA+ H+ P +P Q HY + E
Sbjct: 190 ERLTEESIHFLETRDQAKPFLLYLAYYNVHT--PIQPYQKRVKHYQEKAKQFEGETPVQK 247
Query: 278 -----------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN 326
++A+++ +D+SVG ++ AL++ + N++++F SDNGG
Sbjct: 248 EGGGKSRMRQDNPQYASMVAAVDDSVGAILAALDRLSLTDNTVVIFFSDNGGLCTTGGAG 307
Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
SN PLR K L+EGGVR +I +P + G V + V D+ PT+ S A
Sbjct: 308 PTSNLPLRSGKGWLYEGGVREPMIIRAPGVTKAGSVCDTPVVSMDFFPTIRSLAG 362
>gi|255038246|ref|YP_003088867.1| sulfatase [Dyadobacter fermentans DSM 18053]
gi|254951002|gb|ACT95702.1| sulfatase [Dyadobacter fermentans DSM 18053]
Length = 481
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 117/357 (32%), Positives = 177/357 (49%), Gaps = 49/357 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I+FILADDLG+ DVGF+G I TPNID LA G+I +Y +C PSRS+++TG+
Sbjct: 30 PNIVFILADDLGYGDVGFNGQKLIKTPNIDKLAKEGMIFNQFYAGTSVCAPSRSSLLTGQ 89
Query: 119 HPIHTGMQHNVLYGCE-RGGLPLSEKI--LPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
H HT ++ N G E G P+++ + L + LK+ GY T GKW LG E P
Sbjct: 90 HTGHTYIRGNK--GVEPEGQQPIADSVTTLAEVLKKSGYVTAAFGKWGLGPVGSEGDPNK 147
Query: 176 RGFESHLGY---WTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTDVFTAEAVD 231
+GF+ GY H+ Y +H + K L+ + L +H K Y+ D+ +A+
Sbjct: 148 QGFDRFYGYNCQSLAHRYYPEHLWDNSKKILLEGNKGL-----IHNKEYAPDLIQKKALS 202
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL----------NIHRHI-------- 273
++ +P FL+L + H+ L PD L H+
Sbjct: 203 FVNAQDGKQPFFLFLPYILPHAE-----LVVPDDSLFRYYKGKFEEKPHKGADYGPGANG 257
Query: 274 -----EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG----GAAAGFN 324
+DF + FAA++ +LD VG+V+ AL+++ + N++++F SDNG G A
Sbjct: 258 GGYASQDFPHATFAAMVARLDLYVGQVMNALKKKGLDKNTLVIFTSDNGPHVEGGADPRF 317
Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
N+ + + RGVK L+EGG+R P G ++ D LPT AN
Sbjct: 318 FNSGAGF--RGVKRDLYEGGIREPFAARWPAAIKPGSKSDYIGAFWDILPTFAELAN 372
>gi|406834549|ref|ZP_11094143.1| Cerebroside-sulfatase [Schlesneria paludicola DSM 18645]
Length = 470
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 104/344 (30%), Positives = 169/344 (49%), Gaps = 17/344 (4%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
FT S + ++G P++I IL DD G D G + TP+ID LA G+ L +
Sbjct: 29 FCFTASGLASAADQATGKPNVIVILTDDQGTIDANCFGAKDLETPHIDGLAARGVRLTQF 88
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
Y +C+PSR+ ++TG++P+ G+ NV G+P ++ + + K GY T +G
Sbjct: 89 YAAAPVCSPSRAGLLTGRYPVRGGVPGNVSSTAGVAGMPTTQVTMAETFKAAGYATAHIG 148
Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP---AWDLH 217
KWHLG Y ++ P +GF++ G+ G D F H W R DL
Sbjct: 149 KWHLG-YSEDTMPLAQGFQTSFGHMGGCIDNFSH----FFYWQGPNRHDLHKDGREVQRP 203
Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
G++ D+ EA I + + P F+Y A P+ P Q +L ++ + +
Sbjct: 204 GRFFPDLMVEEASQFIGTNR-ERPFFVYFA-----LNGPHYPYQGDPRWLERYKDLP-YP 256
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA-AGFNLNAASNWPLRGV 336
R+ +AA + +DE +G +++ +++ + +I+VF SD+G + +L S RG
Sbjct: 257 RNLYAAFVSSIDERIGGLLKRVDELGLREKTIVVFQSDHGHSTEERAHLGGGSAGIYRGA 316
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
K +L+EGG+R +I P +G Q H DWLPTL + A
Sbjct: 317 KFSLFEGGIRVPAIISWPGRLPQGEARSQLAHGCDWLPTLAALA 360
>gi|255035933|ref|YP_003086554.1| sulfatase [Dyadobacter fermentans DSM 18053]
gi|254948689|gb|ACT93389.1| sulfatase [Dyadobacter fermentans DSM 18053]
Length = 497
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 119/374 (31%), Positives = 183/374 (48%), Gaps = 58/374 (15%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I++I ADDLG+ ++G +G +I TPN+D LA GI +YT +C P+R+ +MTGK
Sbjct: 27 PNIVYIYADDLGYGELGCYGQQKIKTPNLDRLAKEGIRFTQHYTGTPVCAPARAMLMTGK 86
Query: 119 HPIHTGMQHNVLYGC-----ERGGLPL--SEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
H H+ ++ N G ERG +PL +E + + LK+ GY T + GKW +G E
Sbjct: 87 HAGHSAIRGNFELGGFRDEEERGQMPLPANELTVAELLKQKGYATALTGKWGMGMNNTEG 146
Query: 172 TPTFRGFESHLGYW---TGHQDYFDHSAEEMKMWGL-----DMRRDLEPA-------WDL 216
TPT +GF+ + GY H Y H E + L D+ R L+PA
Sbjct: 147 TPTRQGFDYYYGYLDQKQAHNLYPSHLWENDRWDTLAQPWQDIHRKLDPAKATDADFESF 206
Query: 217 HGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
GK Y+ T +A+ I + S P FLY+ + H + LQAPD Y + ++I
Sbjct: 207 KGKEYAPAKMTEKALAFI-DRSKAGPFFLYMPYTLPHVS-----LQAPDEY--VKKYIGQ 258
Query: 276 FKR----------------SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
F S +A+++ LD+ VG +++ L+ + N+I++F SDNG
Sbjct: 259 FDEKPYYGEKNYASTKYPLSTYASMITFLDDQVGIILDKLKALGLDDNTIVMFSSDNGAT 318
Query: 320 AAG------FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWL 373
G FN A LRG+K ++EGG+R ++ P G V++ D +
Sbjct: 319 FNGGVNPQFFNSVAG----LRGLKMDVYEGGIREPFIVRWPGKIKPGRVSDHVSAQFDLM 374
Query: 374 PTLLSAANKSDIPN 387
PTL ++ P
Sbjct: 375 PTLAELTGQASPPT 388
>gi|393784381|ref|ZP_10372546.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
CL02T12C01]
gi|392666157|gb|EIY59674.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
CL02T12C01]
Length = 456
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 112/329 (34%), Positives = 170/329 (51%), Gaps = 29/329 (8%)
Query: 58 GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMT 116
G P+I+ I+ DDLG D+ + TPNID L + G+ + N+Y + +PSR+ ++T
Sbjct: 34 GNPNILLIMVDDLGLGDLSCQYAKDVQTPNIDRLFHQGVRMDNFYANSSVSSPSRAGLLT 93
Query: 117 GKHPIHTGMQHNVLYGCERGG---LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
G P G+ V+ +G L +LP+ +K+ GY+T I+GKWHLG + P
Sbjct: 94 GCFPDMVGVP-GVIRTDPKGSWGYLSPDATLLPEMMKKAGYQTAIIGKWHLGL-ESPNLP 151
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD-I 232
RGF+ G+ D ++ G + R D G ++TD+FT A D I
Sbjct: 152 NERGFDFFHGFLGDMMDDYNTHLRR----GFNYMRLNTEEIDPQG-HATDLFTDWASDYI 206
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY----LNIHRHIEDFKRSKFAAILHKL 288
D+P FLYLA+ A HS PLQ P + H +I + R K A++ +
Sbjct: 207 FKARKQDKPFFLYLAYNAPHS-----PLQPPVEWEKKVRKRHPNISE-TRGKLVALIEHM 260
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
D+ VG+V+ +LE+ L N++I+F SDNGG + A+N P+RG K +++GG++ A
Sbjct: 261 DDGVGRVINSLEKSGQLDNTLIIFCSDNGGDRK----SEANNGPVRGDKGDMYDGGIKVA 316
Query: 349 -GLIWSPLLESRGIVAEQYVHVSDWLPTL 376
L W LE R + V +SD PTL
Sbjct: 317 CSLYWKGHLEHRRV--NNLVMMSDIFPTL 343
>gi|406661473|ref|ZP_11069592.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
gi|405554747|gb|EKB49822.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
Length = 478
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 120/372 (32%), Positives = 182/372 (48%), Gaps = 51/372 (13%)
Query: 52 DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPS 110
D S P+ +FI ADDLG+ D+G G I TPNID +A GI ++ + +C+PS
Sbjct: 39 DQAIKSDLPNFVFIFADDLGYGDLGCFGASDIATPNIDRIAAEGIKFTSFLSASPVCSPS 98
Query: 111 RSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
R+ ++TG+ P G+ + V + G+ E + + LK GYRT IVGKWHLG ++
Sbjct: 99 RAGLLTGRMPQRMGI-NTVFFPESLTGMDPEEITIAEILKTKGYRTGIVGKWHLGHLER- 156
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
+ P +GF + G Y + A + M G ++ A+ + +Y T +T E++
Sbjct: 157 FLPLNQGFYEYFGI-----PYSNDMASVVYMRGNEVE-----AYHVDQRYMTRTYTEESL 206
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
I + S D+P FLYLAH P+ P+ A + R + ++ +LD
Sbjct: 207 KFI-DASGDQPFFLYLAHNM-----PHVPIYASPEFEGTS------NRGLYGDVIQELDW 254
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNG--------GAAAGFNLNAASNWPLRGVKNTLWE 342
SVG+V+ LE++ +L N++IVF SDNG G +AG LR K +E
Sbjct: 255 SVGEVLRKLEEKGILENTLIVFSSDNGPWLVMEDHGGSAGH---------LREGKQFTFE 305
Query: 343 GGVRGAGL-IWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYE 401
GGVR + +W +E G V E DW PT S ++IP + II +
Sbjct: 306 GGVRVPTVAMWKGKIEP-GQVYEDLATQMDWFPT-FSQLVGAEIPQ------DRIIDGKD 357
Query: 402 NSILRYENGTHE 413
+ +ENG E
Sbjct: 358 LGPVLFENGNRE 369
>gi|154495003|ref|ZP_02034008.1| hypothetical protein PARMER_04049 [Parabacteroides merdae ATCC
43184]
gi|154085553|gb|EDN84598.1| arylsulfatase [Parabacteroides merdae ATCC 43184]
Length = 464
Score = 157 bits (396), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 108/337 (32%), Positives = 168/337 (49%), Gaps = 40/337 (11%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+I+ +LADD G+ D GF G I TPNID LA G I + + + +PSRS ++TG+
Sbjct: 34 PNILILLADDAGYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLTGR 93
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G + N+ + GLP E++LP LK YRT +GKWHLG + P +GF
Sbjct: 94 YGQRYGYECNLDKPGD--GLPDDEELLPALLKRYDYRTGCIGKWHLG-SEPSQRPNAKGF 150
Query: 179 ESHLGYWTGHQDYF--------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
++ G GH+ YF D + ++ + G + D Y TD ++A
Sbjct: 151 DTFYGLLAGHRSYFYDPETSDKDGNLQQYQYNGRKLSFD---------GYFTDELASKAQ 201
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
+ +++P LY++ A HS N +A + L E R K+AA+++ LD
Sbjct: 202 QFVTE--SEQPFMLYMSFTAPHSPN-----EATEEDLA---RFEGQPRQKYAAMMYALDR 251
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA-G 349
VGK+V+ L+ N+II F+SDNGG+ N +SN PL+G K +EGG R
Sbjct: 252 GVGKIVDELKAAGKFDNTIIFFLSDNGGSTT----NQSSNLPLKGFKGNKFEGGQRVPFF 307
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
++W R +++ ++ L + + DIP
Sbjct: 308 VVWG----DRFKRDQRFTGLTSSLDIFATVVDALDIP 340
>gi|311748006|ref|ZP_07721791.1| sulfatase family protein [Algoriphagus sp. PR1]
gi|126574986|gb|EAZ79344.1| sulfatase family protein [Algoriphagus sp. PR1]
Length = 529
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 127/414 (30%), Positives = 194/414 (46%), Gaps = 72/414 (17%)
Query: 42 LAFTLSMVFVDLVASSGPP---HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
L + S+ V ++ PP +I+FIL DDLG+NDV F G + TPNID +A G+I
Sbjct: 8 LLYIFSLAMVSCKSNPEPPSNPNIVFILVDDLGYNDVSFMGSEFYETPNIDKVASKGMIF 67
Query: 99 KN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG-----------LP------L 140
N Y +C+PSR+++MTG+ G+ + G +G LP L
Sbjct: 68 TNGYANSAVCSPSRASLMTGRFTAVHGITDWI--GAPQGEAWRSYKRYSKLLPAEYNHQL 125
Query: 141 SEKI--LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE-----SHL------GYWTG 187
S++I LP+ LK+ GY+T GKWHLG ++ PT GF+ H+ GY++
Sbjct: 126 SKEITTLPEALKQEGYKTFFAGKWHLGSAEENSLPTDHGFDINKGGYHVGGPYSGGYFSP 185
Query: 188 HQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLA 247
++ F EE K L M+ A+ ++ + +E YL+
Sbjct: 186 FENPFMEDFEEEKGISLSMK------------------LAKETNLFIEQNKEEKFLAYLS 227
Query: 248 HAATHSA------------NPYEPLQAPDHYLNIHRHIEDFKRSK---FAAILHKLDESV 292
A H+ N E + + R + K +A ++ ++DE+V
Sbjct: 228 FYAVHAPIQTSKEKWSKYRNKAEAMGIDSTGFEMERVLPARKYQDNPVYAGLIEQMDEAV 287
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G VV+ LE+ + N+IIVF SDNGG +G N + + PLRG K WEGG R +I+
Sbjct: 288 GAVVDQLEKLGLSENTIIVFTSDNGGVVSGDNYSTNLD-PLRGGKGYQWEGGTRIPYMIY 346
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILR 406
P LE+ G + V D+ PTLL A ++I ++ ++I P E L
Sbjct: 347 VPWLENNGQRNDTPVSGVDFYPTLLDFAGVNEISQPLDG--QSIRPVLEGGTLE 398
>gi|423725143|ref|ZP_17699283.1| hypothetical protein HMPREF1078_03177 [Parabacteroides merdae
CL09T00C40]
gi|409234771|gb|EKN27595.1| hypothetical protein HMPREF1078_03177 [Parabacteroides merdae
CL09T00C40]
Length = 461
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 108/337 (32%), Positives = 168/337 (49%), Gaps = 40/337 (11%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+I+ +LADD G+ D GF G I TPNID LA G I + + + +PSRS ++TG+
Sbjct: 31 PNILILLADDAGYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLTGR 90
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G + N+ + GLP E++LP LK YRT +GKWHLG + P +GF
Sbjct: 91 YGQRYGYECNLDKPGD--GLPDDEELLPALLKRYDYRTGCIGKWHLG-SEPSQRPNAKGF 147
Query: 179 ESHLGYWTGHQDYF--------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
++ G GH+ YF D + ++ + G + D Y TD ++A
Sbjct: 148 DTFYGLLAGHRSYFYDPETSDKDGNLQQYQYNGRKLSFD---------GYFTDELASKAQ 198
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
+ +++P LY++ A HS N +A + L E R K+AA+++ LD
Sbjct: 199 QFVTE--SEQPFMLYMSFTAPHSPN-----EATEEDLA---RFEGQPRQKYAAMMYALDR 248
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA-G 349
VGK+V+ L+ N+II F+SDNGG+ N +SN PL+G K +EGG R
Sbjct: 249 GVGKIVDELKAAGKFDNTIIFFLSDNGGSTT----NQSSNLPLKGFKGNKFEGGQRVPFF 304
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
++W R +++ ++ L + + DIP
Sbjct: 305 VVWG----DRFKRDQRFTGLTSSLDIFATVVDALDIP 337
>gi|408371886|ref|ZP_11169642.1| arylsulfatase A family protein [Galbibacter sp. ck-I2-15]
gi|407742679|gb|EKF54270.1| arylsulfatase A family protein [Galbibacter sp. ck-I2-15]
Length = 480
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 104/350 (29%), Positives = 178/350 (50%), Gaps = 44/350 (12%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I+F+LADDLG D+G +G + I TPN+D +A G++ ++Y+ +C PSR+++MTGK
Sbjct: 36 PNIVFVLADDLGIGDLGSYGQEYIKTPNLDRMAKEGMLFTDFYSGSTVCAPSRASLMTGK 95
Query: 119 HPIHTGMQHNVLYGCERGGLPL--SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
H H ++ N G PL ++ + Q LK GY T + GKW LG +P +
Sbjct: 96 HTGHLTIRGN-------GEFPLKSTDTTIAQILKSQGYHTAMFGKWGLGLQGTSGSPEKK 148
Query: 177 GFESHLGYWTGHQDYFD-HSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
G++ +TGH + D H + +W ++ + A + ++ ++FT + ++ I
Sbjct: 149 GWDE----FTGHLHHVDGHFQKPDSLWSIEKGGVKKHAIE-DDAFANEIFTQKGINFIDK 203
Query: 236 HSTDEPLFLYLAHAATHS--------ANPY-----EPLQAPDHYLNIHRHI--EDFKRSK 280
S ++P FL++++ H+ PY E + P++ RH + + ++
Sbjct: 204 QSDNQPFFLFMSYTVPHAELRVEDKYLAPYLNKHGESIFEPENPWPTGRHYGQQKYPKAA 263
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG---------FNLNAASNW 331
+AA++ LD VGK++E L+ + + N++++F SDNG G FN A
Sbjct: 264 YAAMVTSLDSYVGKLLEKLKSKGLEENTLVIFTSDNGTHIEGGRTKEDVKFFNSTAG--- 320
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
LRGVK L+EGG+R + P +G ++ D LPT A
Sbjct: 321 -LRGVKRDLYEGGIRTPFIAHWPGTIPKGSISSVTAAYWDLLPTFTELAG 369
>gi|325109705|ref|YP_004270773.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324969973|gb|ADY60751.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 443
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 117/374 (31%), Positives = 187/374 (50%), Gaps = 41/374 (10%)
Query: 36 AFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
+ VL LAF S+ P+++ IL DDLG+ D+G +G + I TP +D +A SG
Sbjct: 10 SLQVLCLAFWCSIPTAQSAEIQQQPNVVLILVDDLGYGDLGCYGSESIRTPRLDRMAASG 69
Query: 96 IILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
+ L ++Y +CTP+R+A+MTG + G+ L+ + G+ SE L + +++ GY
Sbjct: 70 MKLTSFYAAAPICTPTRAALMTGCYATRVGLP-TPLHVYDEIGINESEFTLGEAMQQCGY 128
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKM-----WGLDMRRD 209
T VGKWHLG ++ + PT GF + G GH F+ A + LD R+
Sbjct: 129 ETVCVGKWHLG-HQPRFYPTEHGFNHYWGTPLGHM--FNRPAVGKAIGDTSDLFLDDTRE 185
Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
+ D T+ T +AV+ I D P FL+LAH P+EPL A + + +
Sbjct: 186 IPFPEDAD---LTERLTEKAVEFIEAKR-DRPFFLFLAHPM-----PHEPLAASEKFADR 236
Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG-----GAAAGFN 324
+ + ++ LD S G+V++ALE+ N+++ F+SDNG G AG
Sbjct: 237 S------QAGLYGDVVECLDWSTGEVLDALERTGQAENTLVAFLSDNGPKKGHGTTAG-- 288
Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
LRG K+ +EGG+R +I + L G +++ +V DW PTL++ A +
Sbjct: 289 --------LRGFKHDPYEGGMRVPCIIAAQGLIPAGQTSDEICNVMDWYPTLVNLAGGTT 340
Query: 385 IPNYVNSTVENIIP 398
+ V +N+ P
Sbjct: 341 VSEQVRDG-KNLTP 353
>gi|335058614|gb|AEH26503.1| arylsulphatase [uncultured Acidobacteria bacterium A11]
Length = 491
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 168/364 (46%), Gaps = 59/364 (16%)
Query: 51 VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPS 110
L A P+I+FIL DD GW +G +G I TP++D LA G Y + CTP+
Sbjct: 14 TSLFAGESKPNILFILTDDQGWPTLGCYGNKLIATPHLDRLATEGARFTAAYAMPQCTPT 73
Query: 111 RSAIMTGKHPIHTGMQH---NVLYG---------CERGGLPLSEKILPQYLKELGYRTRI 158
R+A+ TG+H TGM H N YG R GLP LP+ L+ GY T
Sbjct: 74 RAALFTGQHTARTGMWHVLTNPWYGYPWAPMREPTWREGLPRDWFTLPKGLRAAGYVTGT 133
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
GKWHL + +R + G G FD A R D + A G
Sbjct: 134 AGKWHL---TADADGDYRALKPEAGDAFG----FDFVAP---------RGDSQSA----G 173
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
D T A+ I ++P F YLAH H P+ AP+ + HR ++
Sbjct: 174 DKQVDFLTDAAIGFIREQR-EKPWFFYLAHHTVHG-----PVLAPEALIARHRARGAPEK 227
Query: 279 ----SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN---------- 324
+ F A L LD SVG+++ AL++ + +++VF+SDNGG + ++
Sbjct: 228 GSGNATFLAALEHLDNSVGRLMAALDELGLRERTLVVFMSDNGGVSRSYDPKPFTEGPGT 287
Query: 325 ---LNAAS----NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
L+ A+ N PLRG K + +EGG+R L+ P + + G V E HV DWLPTLL
Sbjct: 288 DTQLHLANAEFPNAPLRGWKGSPYEGGIRVPCLVRWPGVVAAGRVVEAPAHVVDWLPTLL 347
Query: 378 SAAN 381
A
Sbjct: 348 EVAG 351
>gi|114799529|ref|YP_761144.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
gi|114739703|gb|ABI77828.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
Length = 459
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 109/334 (32%), Positives = 165/334 (49%), Gaps = 39/334 (11%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
A++ PP+II I+ADDLGW D+ +G I TPNID + GI L ++Y +C+PSR+A
Sbjct: 34 AAAKPPNIIIIMADDLGWGDISLNGAALIETPNIDRIGQEGIQLTDFYAGSNVCSPSRAA 93
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++PI +GMQH V++ + GLP E + + LK GYRT +VGKWHLG +++EY P
Sbjct: 94 LLTGRYPIRSGMQH-VIFPHSQDGLPAEEITISEMLKNAGYRTGMVGKWHLG-HQEEYWP 151
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
T +GF D+F M D+ R E + + A+A
Sbjct: 152 TNQGF-----------DWFYGVPYSNDMAPFDLYRGKEIIESPADQSQLSLNYAKAAKEF 200
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
S+D+P FLY A P+ PL P+ + ++ +D +G
Sbjct: 201 IEDSSDKPFFLYYAETF-----PHIPLFVPEDRSGTS------DAGLYGDVVETVDAGIG 249
Query: 294 KVVEALEQRRMLSNSIIVFVSDNG----GAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
V++ L++ + +++I+F SDNG G+A F RG K EGG R
Sbjct: 250 IVLDTLDEAGVADDTLIIFTSDNGPWFEGSAGEF----------RGRKGETHEGGFRVPF 299
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
L P +G V+ + D LPT S + +
Sbjct: 300 LARWPGHIPKGSVSHEMAMNIDLLPTAASLSGAT 333
>gi|254294928|ref|YP_003060951.1| sulfatase [Hirschia baltica ATCC 49814]
gi|254043459|gb|ACT60254.1| sulfatase [Hirschia baltica ATCC 49814]
Length = 487
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 102/352 (28%), Positives = 179/352 (50%), Gaps = 39/352 (11%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRS 112
+A P+IIFI+ADDLG+ DV +G + TP ID+LA++G+ Y+ +C+ +R+
Sbjct: 49 IAQKKRPNIIFIMADDLGYADVSCYGRREYKTPAIDSLAHTGVKFTQAYSNSPVCSATRT 108
Query: 113 AIMTGKHP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
+++TG++ + G++ + + GLP + LP LK+ GY+T +VGKWHLG K
Sbjct: 109 SVITGRYQYRLPVGLEEPLTF--RDVGLPTTHPTLPGQLKKSGYQTALVGKWHLGALPK- 165
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHS--AEEMKMWGLDMRRDLEPAWDLH-GKYSTDVFTA 227
Y P G++ G G DYF H A++ +W ++ ++H +Y T +
Sbjct: 166 YGPLKSGYDHFWGVRGGSVDYFTHDSMAKQNDLWDGEV--------EIHETEYLTKLIGG 217
Query: 228 EAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR----------HIEDF 276
+++++I + +EP F+ L H P+ P + P+ R H++
Sbjct: 218 KSIELIEKMNENEEPFFMSL-----HYTAPHWPWEGPEDKTESDRLKQGGPFSLLHLDGG 272
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
+AA++ +D +G+++ L++ + ++IVF SDNGG + WP G+
Sbjct: 273 SLETYAAMVTSMDNQIGRLLNKLDELGITEETLIVFTSDNGGE------RFSDTWPFSGI 326
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
K L EGG+R +I P + ++Q D +PTLL A + P++
Sbjct: 327 KTELLEGGLRVPLIIRWPGITKANTTSDQVSMSMDLMPTLLKAGQAAPHPDF 378
>gi|372210171|ref|ZP_09497973.1| sulfatase [Flavobacteriaceae bacterium S85]
Length = 651
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 122/407 (29%), Positives = 190/407 (46%), Gaps = 89/407 (21%)
Query: 60 PHIIFILADDLGWNDVGFH---------GLDQIPTPNIDALAYSGIILKNYYTVQ-LCTP 109
P+II I+ADDLG+ DVGF+ G+ IPTP +D LA +GII N + C P
Sbjct: 28 PNIIVIVADDLGYADVGFNRDANFPAEKGV--IPTPELDQLANNGIICTNGHVAHPFCGP 85
Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
SR+A+MTG P G+Q+N+ G+PL E P+ L++ Y T GKWHLGF +
Sbjct: 86 SRAALMTGVQPSRIGVQYNLPNDINTSLGIPLEETYFPKILQQNNYHTAAFGKWHLGFTQ 145
Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEM---KMWG-----------LDMRRDLEPAW 214
+Y P RGF+ G+ G + YF+ E++ ++ G L +RD
Sbjct: 146 GKYQPLDRGFDYFFGFLGGGKAYFEREYEDLYYRRLGGSNPVTNEYQDPLQRQRDYVAKD 205
Query: 215 DL-HGKYSTDVFTAEAVD-IIHNHSTDEPLFLYLAHAATHSA--------------NP-Y 257
+ +Y TD+ T EA++ I N + +P F+Y+A+ A H+ NP +
Sbjct: 206 EFNQDEYLTDILTDEAINYIAENKTKSDPFFMYVAYNAPHTPLQAPAEEIAQFKEENPNF 265
Query: 258 EPLQAPDHYLN---------IHRHIEDFK------------------------------R 278
E L YL + R ++ K R
Sbjct: 266 ESLVRNSDYLKESGAYELDKVERELKKVKDYDYNSLSSSEKEEKVADYREHLLDKFTESR 325
Query: 279 SKFAAILHKLDESVGKVVEALEQ-RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
+A ++ +D+++G++V L++ +++ N++IVF+SDNGG A N PL +K
Sbjct: 326 IVYATMVSNMDKNIGRIVAELKKDQKVFDNTMIVFLSDNGGYTYS---KGAVNHPLYALK 382
Query: 338 NTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+++EGG R I W ++S Q + D PTL+ A S
Sbjct: 383 GSVFEGGHRVPFFIHWPNKIKSPKKYKYQ-ISSLDLYPTLVDLAGAS 428
>gi|149199736|ref|ZP_01876767.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149137141|gb|EDM25563.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 585
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 178/360 (49%), Gaps = 55/360 (15%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
+A L + + A + P+II L DD GW D+ +G I TPNID+LA+ G + +N+
Sbjct: 4 IALVLIALIQFMAAENKKPNIIIFLTDDQGWGDLSINGNKDISTPNIDSLAHDGALFENF 63
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y +C+P+R+ ++TG++ +G++ + G ER L E+ + K+ GY T GK
Sbjct: 64 YVQPVCSPTRAELLTGRYAFRSGVR-STSEGGER--FNLDEQTIADVFKKAGYATGAFGK 120
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-- 219
WH G + Y P RGF+ G+ +GH WG P + +GK
Sbjct: 121 WHSGM-QYPYHPNGRGFDEFYGFCSGH-------------WG----NYYSPMLEHNGKIV 162
Query: 220 ----YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR---- 271
+ D FT++A+D I + ++P +YL + HS P+Q PD + ++
Sbjct: 163 KGQGFCVDDFTSKAMDFIEENK-EKPFLVYLPYNTPHS-----PMQVPDRFWEKYQDKTL 216
Query: 272 -----------HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
H EDF ++ A+ +D +VG+V+ L++ ++ N+I+++ SDNG +
Sbjct: 217 TMKNTYNKKWGHGEDFVKAAL-AMCENIDWNVGRVLNKLDELQIADNTIVLYFSDNGPNS 275
Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+N ++G K ++ EGGVR + P ++G V ++ D LPTL S A
Sbjct: 276 PRWNGE------MKGKKGSVDEGGVRSPLHLRWPQKVAQGQVIKEVTGAIDLLPTLSSMA 329
>gi|296122901|ref|YP_003630679.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296015241|gb|ADG68480.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 519
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 111/358 (31%), Positives = 174/358 (48%), Gaps = 35/358 (9%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
M V+ T++ + V+ S P+II ++ DD G+ D+ HG + TP++D L
Sbjct: 22 MIATVIGAGLTIARI-VEADESKTRPNIILMMTDDQGYGDLSLHGNPVVKTPHLDQLGRQ 80
Query: 95 GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
+ + ++ C P+R++IMT +H +G+ H +L ER L L ILPQ+LK GY
Sbjct: 81 SVRFEQFHVSPTCAPTRASIMTSRHEFSSGVTHTIL---ERERLSLKATILPQFLKRAGY 137
Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFES---HLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
T I GKWHLG + Y P RGF+ H G G Q Y + + +
Sbjct: 138 TTGIFGKWHLG-DEDAYQPGKRGFDEVFIHGGGGIG-QSYPGSCGDA------PLNKYFN 189
Query: 212 PAWDLHGK------YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
P +GK Y T VF +A+ I + ++P F Y+ A H+ PL P
Sbjct: 190 PVIRHNGKFVATNGYCTKVFVDQAITWISSQPDNQPFFCYITPNAPHA-----PLDCPKE 244
Query: 266 YLNIH-RHI-EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
Y + H+ ED R F ++ D+ +G++++ALE R + ++I++F++DNG A
Sbjct: 245 YYEPYLEHVPEDVAR--FYGMITHWDDQLGRLLKALEDRDISKDTIVIFMTDNGSATGAK 302
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ +A +R K T +EGG+R WS + V ++ D LPTL AN
Sbjct: 303 HFSAG----MRANKGTPYEGGIRVPAF-WSWAGHWQPQVRQEVTCHYDILPTLTELAN 355
>gi|325109525|ref|YP_004270593.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969793|gb|ADY60571.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 453
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 113/364 (31%), Positives = 183/364 (50%), Gaps = 35/364 (9%)
Query: 32 TRIMAFAVLPLAF---TLSMVFV---DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPT 85
T++ F+ +PL LS+V L A+ P+I+FIL DD G+ D+G +G +I T
Sbjct: 2 TKLTLFSPIPLVLFETLLSLVLFAGGQLAAADQRPNIVFILTDDQGYGDLGCYGSTEIAT 61
Query: 86 PNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKIL 145
P+ID LA G+ +++Y C+P+R+A MTG H G+ V+Y ++ GL E +
Sbjct: 62 PHIDRLAAEGMKFESFYVHNRCSPTRAAFMTGCHAQRVGLG-KVIYRKDKTGLSADEITV 120
Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
+ L+ GYRT IVGKWHLG Y ++++P GF++ G+ ++ D+ +
Sbjct: 121 AELLQGAGYRTGIVGKWHLGEY-EQFSPVHHGFDTFYGF----MEWDDNRTTAIDNNRTI 175
Query: 206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA-NPYEPLQAPD 264
+ RD+ G +S + A I N D P FLY A H+ P+E
Sbjct: 176 VERDVSKT---DGVHSPKLLAAGIEFITENQ--DRPFFLYYASPLPHTRWKPHE------ 224
Query: 265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN 324
R ++ + ++ ++D VG++++ L++ + N+++VF SDNG N
Sbjct: 225 ------RFAGSSEQGTYGDVVQEIDWQVGELMQTLDKLGLTDNTLVVFASDNG---PQLN 275
Query: 325 LNA-ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+ S PLR K T +EGG+R L+ P + G V + V + D LPT A
Sbjct: 276 VEGHGSAGPLRDGKWTNFEGGIRVPCLMRWPSVVPAGSVNNEIVGIIDMLPTFCEIAG-V 334
Query: 384 DIPN 387
D+P+
Sbjct: 335 DVPS 338
>gi|168702101|ref|ZP_02734378.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Gemmata
obscuriglobus UQM 2246]
Length = 596
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 132/435 (30%), Positives = 202/435 (46%), Gaps = 47/435 (10%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
+ AFAVL L F F +++G P+++ I+ DDLG D+G +G TPNID +A
Sbjct: 2 LRAFAVLALGF-----FALPASAAGKPNVVLIVIDDLGQRDLGCYGSTFYKTPNIDRMAK 56
Query: 94 SGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG-----------LPLS 141
G+ ++Y +C+P+R++IMTGK+P G+ + + G L L
Sbjct: 57 DGVRFTDFYAACPVCSPTRASIMTGKYPQRVGITDWLPGRKDLPGQRLKRPELKNELALE 116
Query: 142 EKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHL-GYWTGHQ-DYFDHSAEE- 198
E + + LK GY T +GKWHLG K + P +GF+ ++ G TG YF A +
Sbjct: 117 EVTVAETLKGHGYVTAHIGKWHLG--GKGFEPEKQGFDVNVAGDHTGTPLSYFAPFANKA 174
Query: 199 -MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
M GL+ E Y TD AEA I + D+P FLYL H H+
Sbjct: 175 GATMPGLEKAAPDE--------YLTDRLAAEAETFI-TANKDKPFFLYLPHYGVHT---- 221
Query: 258 EPLQAPDHYLNIHRHIEDFKRSK---FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
PL+AP ++ ++ R +AA++ +D +VG+V++ L+ ++ N++++F S
Sbjct: 222 -PLRAPQPLVDKYKTQAVHGRQSNPVYAAMVESMDAAVGRVLKRLDDLKLSDNTLVLFTS 280
Query: 315 DNGGAAA--GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDW 372
DNGG A G N PLR K L+EGGVR + P G V +Q D+
Sbjct: 281 DNGGLATLEGMPFAPTINAPLREGKGYLYEGGVRVPLIAKWPGKVKPGTVMDQVACSIDF 340
Query: 373 LPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHE 432
T+L A + + +++P + L+ Y P N +R
Sbjct: 341 FDTILEATGAT---SAARRDGVSLVPAFGGEKLKPRALYWHY--PHYANQGSRPGGAVRA 395
Query: 433 YNPKYENRYENGTHE 447
N K YE+G E
Sbjct: 396 GNYKLVEYYEDGRRE 410
>gi|291232668|ref|XP_002736267.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 518
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 118/407 (28%), Positives = 192/407 (47%), Gaps = 57/407 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P++I +L DD+GW D+G G TPN+D +A G ++ ++Y LC+PSR++++TG+
Sbjct: 27 PNVILMLMDDMGWGDLGVLGNPAKETPNLDRMASEGALMTDFYAPNPLCSPSRASLLTGR 86
Query: 119 HPIHTGM-QHNVLYGCER------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
PI G N C GG+P SE +LP+ L + GYR++I+GKWHLG ++ +Y
Sbjct: 87 LPIRNGFYTTNDHARCSYTPQYIVGGIPDSEIVLPELLNKAGYRSKIIGKWHLG-HQTQY 145
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS---------- 221
P GF+ + G H +D+ + + + RD D+ G+Y
Sbjct: 146 HPLKHGFDEYFGAPNCHVGPYDNKKQP----NIPVYRDA----DMIGRYYEEFKIDKSGE 197
Query: 222 ---TDVFTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
T +F EA+ I H T E FLY A+HS P A +L +
Sbjct: 198 SNLTQMFIEEAIAFIEKQHQTGEQFFLYWTPDASHS-----PHYASKSFLGTS------Q 246
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
R + + +LD VG+++ L++ ++ +N+ + F SDNG A +G SN P K
Sbjct: 247 RELYGDAVMELDYGVGQILGKLKELQIENNTFVFFSSDNGAALSG-GAYVGSNGPFLCGK 305
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA-----------NKSDIP 386
+T +EGG+R + W P G V+ Q + D T L A S +P
Sbjct: 306 HTTFEGGMREPSIAWWPGRIKPGKVSHQLGTIMDLFTTALDIAGVTLPTDRIIDGTSLLP 365
Query: 387 NYVNSTVEN---IIPRYENSILRYENGTHEYNSPRIENSNTRYENGT 430
VN +++ I + ++ +G ++ + NS ++ GT
Sbjct: 366 ALVNGKIDSNKTIFFYRGDELMAARHGLYKAHFWTWTNSEEEFKRGT 412
>gi|146275662|ref|YP_001165822.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
gi|145322353|gb|ABP64296.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
Length = 462
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 167/332 (50%), Gaps = 25/332 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK-NYYTVQLCTPSRSAIMTGK 118
P+I+FI+ADDLG+ D G I TP ID++ G++L+ Y + +C+P+R+A++TG
Sbjct: 35 PNIVFIMADDLGYADTSATGSRHIRTPAIDSIGAGGVMLRQGYSSTPICSPTRTALLTGC 94
Query: 119 HP--IHTGMQHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
+ G++ + G G+PL + +K LGYRT +VGKWHLG + P
Sbjct: 95 YAQRFAIGVEEPLGPNAPAGIGVPLDRPTIASVMKALGYRTSLVGKWHLG-EPPAHGPLK 153
Query: 176 RGFESHLGYWTGHQDYFDHS-AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
G++ LG G DYF H K G+ + D + D G Y TD+F EAV +I
Sbjct: 154 HGYDHFLGIVEGGADYFVHRMVMSGKPAGVGLAED-DAQTDRTG-YLTDIFGDEAVRVIE 211
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPD-----HYLNIHRHIEDFKRSKFAAILHKLD 289
++P FL L H P+ P + + L H E +K+ ++ +D
Sbjct: 212 -EGGNQPFFLSL-----HFTAPHWPWEGREDEKLARALPSSFHYEGGNLAKYREMVETMD 265
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
++V KV+ A+++ N+++VF SDNGG + WP G K + EGGVR
Sbjct: 266 QNVAKVLAAIDRSGKADNTVVVFTSDNGGE------RFSDTWPFVGHKGEVLEGGVRVPL 319
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
++ P G +EQ + D+LPTLL A
Sbjct: 320 MVRWPRRIKAGSRSEQVMVSMDFLPTLLGMAG 351
>gi|440714153|ref|ZP_20894738.1| N-acetylgalactosamine-6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
gi|436440952|gb|ELP34233.1| N-acetylgalactosamine-6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
Length = 484
Score = 155 bits (393), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 112/349 (32%), Positives = 183/349 (52%), Gaps = 40/349 (11%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAI 114
S+ P+I+ ILADDLG+ D+G +G D+ TP +D LA G+ + Y C+P+R+A+
Sbjct: 35 SNARPNIVLILADDLGYGDLGCYGNDEQATPVLDRLATQGVRWTQAYANGPECSPTRAAL 94
Query: 115 MTGKHPIHTGMQH------NV--------LYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
+TG++ H G NV L+ GLP + L + L +GY T + G
Sbjct: 95 LTGRYQQHVGGLECAIGVGNVGRYDDAIRLHLVNELGLPANRPTLAKRLSSVGYETALFG 154
Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK- 219
KWHLG Y+ +++P GF+ L G DY+ H + + + L + G+
Sbjct: 155 KWHLG-YEAKFSPMMHGFDEALYCIGGAMDYY-HYLDSVATYNL-----FHNGRPISGEG 207
Query: 220 YSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYE-PLQAPDHYLNIHRHIEDFK 277
Y TD T +AV I + ++ D+P FLYL + A H+ PY+ P ++P L + + +K
Sbjct: 208 YFTDTITDQAVRYIGDRNANDKPFFLYLPYTAPHT--PYQAPGESPVDPLPLDSPL--WK 263
Query: 278 RSK-----FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
++ + A++ +DE +GKV+ A+E+ M ++++F SDNGG + A+ N P
Sbjct: 264 QNADPPGVYRAMVRHMDEGIGKVLHAIEESEMTDRTLVIFASDNGGTS------ASRNEP 317
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
LRG K +EGG+R + P G+V++Q D ++L+AA
Sbjct: 318 LRGFKGQAFEGGIRVPLIARWPGHLPEGVVSDQVTITFDLTASMLAAAG 366
>gi|344237970|gb|EGV94073.1| N-acetylgalactosamine-6-sulfatase [Cricetulus griseus]
Length = 483
Score = 155 bits (393), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 105/335 (31%), Positives = 163/335 (48%), Gaps = 43/335 (12%)
Query: 70 LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM--- 125
+GW D+G +G TPN+D +A G++ N+Y+ LC+PSR+A++TG+ PI G
Sbjct: 1 MGWGDLGVYGEPSRETPNLDQMALEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTS 60
Query: 126 ---QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
N E GG+P SE +LP+ LK+ GY +IVGKWHLG ++ ++ P GF+
Sbjct: 61 NGHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEW 119
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
G H +D+ + + + RD W++ G++ T ++
Sbjct: 120 FGSPNCHFGPYDNKVKP----NIPVYRD----WEMVGRFYEEFPINLKTGEANLTQLYLQ 171
Query: 228 EAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
EA+D I H+ P FLY A ATH+ P+ A +L R ++ +
Sbjct: 172 EALDFIRTQHARQSPFFLYWAIDATHA-----PVYASRPFLGTSH------RGRYGDAVR 220
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
++D+SVGK++ L+ + N+ + F SDNG A SN P K T +EGG+R
Sbjct: 221 EIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPKEGGSNGPFLCGKQTTFEGGMR 280
Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ W P + G V+ Q + D T LS A
Sbjct: 281 EPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSIAG 315
>gi|404447701|ref|ZP_11012695.1| sulfatase [Indibacter alkaliphilus LW1]
gi|403766287|gb|EJZ27159.1| sulfatase [Indibacter alkaliphilus LW1]
Length = 486
Score = 155 bits (393), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 106/337 (31%), Positives = 178/337 (52%), Gaps = 30/337 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+++ I DD G+NDVG G I TPN+D +A G+ N+Y Q +C+ SR+A++TG
Sbjct: 41 PNVVIIFTDDQGYNDVGVFGSKDISTPNLDNMAKEGVRFSNFYVAQAVCSASRAALLTGT 100
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G+ H L GL E + + LK LGY+T I GKWHLG + E+ P +GF
Sbjct: 101 YSNRLGI-HGALDHSSNHGLNPDETTIAEMLKPLGYQTAIFGKWHLG-HHPEFLPVQQGF 158
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--TDVFTAEAVDIIHNH 236
+ + G + + +H + L + + E LH S T V T +++D I +
Sbjct: 159 DEYFGIPYSNDMWPNHPETKNYYPPLPLYENDEVVDTLHHDQSMLTTVLTEKSLDFIERN 218
Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
+ P FLYLAH P+ PL D + +R + ++ ++D SVG+V+
Sbjct: 219 K-ETPFFLYLAHPM-----PHVPLFVSDKFEG------KSERGLYGDVIMEIDWSVGQVL 266
Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
E+L+ + + N++++F+SDNG + ++L+A S +PL+ K T W+GGV+ ++
Sbjct: 267 ESLKSKGLEENTLVIFLSDNGPWLS-YSLHAGSAYPLKEGKGTSWDGGVKIPS-----IM 320
Query: 357 ESRGIVAEQYVHVS-----DWLPTL--LSAANKSDIP 386
+ +G + E V + D LPT+ ++ A ++P
Sbjct: 321 QWKGKIPEGKVQTNPAMSIDILPTIAEITGAKLPELP 357
>gi|449138380|ref|ZP_21773646.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
gi|448882925|gb|EMB13473.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
Length = 467
Score = 155 bits (392), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 110/343 (32%), Positives = 178/343 (51%), Gaps = 36/343 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
P+I+ ILADDLG+ D+G +G D TP +D LA G+ + Y C+P+R+A++TG+
Sbjct: 22 PNIVLILADDLGYGDLGCYGNDAQATPTLDRLARQGVRWTQAYANGPECSPTRAALLTGR 81
Query: 119 HPIHTGMQH------NV--------LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
+ H G NV L+ GLP L + L ++GY T + GKWHL
Sbjct: 82 YQQHVGGLECAIGVGNVGRYDDAIRLHLVSELGLPADRPTLAKRLSDVGYETALFGKWHL 141
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTD 223
G Y+ +++P GFE L G DY+ H + + + ++ R+ P + G+ Y TD
Sbjct: 142 G-YEAQFSPMQHGFEEALYCIGGAMDYY-HYLDSVATY--NLFRNGRP---ISGEGYFTD 194
Query: 224 VFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPL-QAPDHYLNIHRHI---EDFKR 278
T AV + + + +++P FLYL + A H+ PY+P Q+P L + + D
Sbjct: 195 TITDHAVQYVRDRNDSEQPFFLYLPYTAPHT--PYQPPGQSPVDPLPLDSPLWKQSDDPP 252
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
+ +++ +DE +GKV+ A+E+ M ++++F SDNGG + A+ N PLRG K
Sbjct: 253 GVYRSMVRHMDEGIGKVLHAIEESGMTDRTLVIFASDNGGTS------ASRNEPLRGFKG 306
Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+EGG+R + P V++Q D ++L+AA
Sbjct: 307 QTFEGGIRVPLIARWPGHLPEDTVSQQVTITFDLTASMLAAAG 349
>gi|322434839|ref|YP_004217051.1| sulfatase [Granulicella tundricola MP5ACTX9]
gi|321162566|gb|ADW68271.1| sulfatase [Granulicella tundricola MP5ACTX9]
Length = 477
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 109/342 (31%), Positives = 164/342 (47%), Gaps = 29/342 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
P+I+FILADD+G+ D +G +I TPNID +A G+ + Y +C PSR +MTG
Sbjct: 35 PNIVFILADDMGYGDPAVYGQTKIRTPNIDRMAAGGMRFTQGYAGAPVCAPSRCTLMTGM 94
Query: 119 HPIHTGMQHNVLYGC-----------ERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
H H ++ N R L ++ + YL+ GYRT ++GKWHL Y
Sbjct: 95 HGGHARVRDNFALAAGHVGHKKKEEIRRASLTTEDRTVADYLRGAGYRTGLMGKWHLDGY 154
Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
+ PT GFE G W D E +M G + E A G+Y T + T
Sbjct: 155 DPDAVPTKHGFEEFKG-WLTQIDETQGYWPEKRMHGETLIDIPENAGGKQGRYDTTMITE 213
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH--YLNIHRHIEDFKRSKFAAIL 285
++VD I H D P FLY+A+ + HS P APD Y + +D K +AA++
Sbjct: 214 DSVDYIERHKAD-PFFLYVAYDSPHS-----PYTAPDFGPYKDHPTWADDEK--TYAAMI 265
Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA------SNWPLRGVKNT 339
+D +GK+++ L++ ++ ++++ F SDNG + SN L G K
Sbjct: 266 WYMDLGIGKILDTLKRLKLDEDTVVFFASDNGPRSEPTVQQTRVIDFFDSNGNLTGYKRD 325
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
++EGG+R + P G V + + D+LPT L A
Sbjct: 326 MYEGGIRDPLIARWPGHIPAGAVNQMSAYFPDFLPTALDLAG 367
>gi|255035424|ref|YP_003086045.1| sulfatase [Dyadobacter fermentans DSM 18053]
gi|254948180|gb|ACT92880.1| sulfatase [Dyadobacter fermentans DSM 18053]
Length = 457
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 108/343 (31%), Positives = 171/343 (49%), Gaps = 28/343 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I+FILADDLG+ D+G HG + TPNIDALA G+I + + +C+PSRS ++TG
Sbjct: 23 PNIVFILADDLGYGDIGAHGQKLLRTPNIDALAKEGMIFTDIHAGAPVCSPSRSVLITGL 82
Query: 119 HPIHTGMQHNV-----LYG------CERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
H HT ++ N + G R L + + + + + GY T + GKWHL Y
Sbjct: 83 HTGHTTIRGNATIRGGIVGNKGKQTVRRANLAAGDFTVGKLMAQSGYTTALTGKWHLDGY 142
Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH-GKYSTDVFT 226
TP RGF+ G+ + + + K + + +D+E + G Y+ D+ T
Sbjct: 143 DTLATPIHRGFDQFSGWLIAYPGTYANGYWPAKRYVNGVLKDVEQNENGRKGYYADDLTT 202
Query: 227 AEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI-EDFKRSKFAAI 284
E++ + +P L + + + HS PL A D R +D K + A
Sbjct: 203 DESLAFLAAQKDAKKPFVLMINYNSPHS-----PLDAADSSAYKDRDWPQDMK--IYGAQ 255
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA------SNWPLRGVKN 338
+H LDE+VGK+ + L + + N+I+ F SDNG + G A SN LRG K
Sbjct: 256 VHHLDENVGKIKKYLTESGLAKNTIVFFCSDNGPRSEGTPQQTAIAEFFDSNGRLRGYKR 315
Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
++EGG+R ++W+P + G V+ + + +D +PT A
Sbjct: 316 DMYEGGIRVPMVVWAPGIVKPGSVSSEPAYFADIMPTFADIAG 358
>gi|443694992|gb|ELT96000.1| hypothetical protein CAPTEDRAFT_113428 [Capitella teleta]
Length = 521
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 104/345 (30%), Positives = 170/345 (49%), Gaps = 49/345 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+II +L DD+GW D+G +G TPN+D +A G++ ++Y+ +C+PSR+++MTG+
Sbjct: 27 PNIIVMLMDDMGWGDLGSYGHPARETPNLDRMAAEGMLFTDFYSAAGICSPSRASVMTGR 86
Query: 119 HPIHTGM-------QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
PI G +++ L GG+ SE + P+ LK+ GYR +IVGKWHLG +++++
Sbjct: 87 LPIRNGFYSTNQFAKNSYLPQDVMGGISDSEVLFPELLKQNGYRNKIVGKWHLG-HREQF 145
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS---------- 221
P GF+ G H Y D + ++ + + G+Y
Sbjct: 146 LPLQHGFDEWFGAPDCHYKYDDKVMPNIPVYKDSL---------MQGRYYDHFKIDIKTG 196
Query: 222 ----TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
T ++ EA+D + D P FLY A ATH+ PL A + + +
Sbjct: 197 LSNLTQLYIQEALDFLEKER-DRPFFLYWAPDATHT-----PLYASEKFRGKS------E 244
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN-----LNAASNWP 332
R + + +LD VGK++ L++ + +N+ F SDNG A F ++ SN P
Sbjct: 245 RGLYGDAVMELDSGVGKILNKLKELGIENNTFAFFSSDNGAATYAFTEVSIIISGGSNGP 304
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
K T +EGG+R L W P G +++Q ++ D T+L
Sbjct: 305 FLCGKQTTFEGGMREPALAWWPTKIRPGSISKQVGNLMDIFSTVL 349
>gi|218262868|ref|ZP_03477199.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
DSM 18315]
gi|218223078|gb|EEC95728.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
DSM 18315]
Length = 461
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 107/337 (31%), Positives = 169/337 (50%), Gaps = 40/337 (11%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+I+ +LADD G+ D GF G I TPNID LA G I + + + +PSRS ++TG+
Sbjct: 31 PNILILLADDAGYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLTGR 90
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G + N+ + G+P E++LP LK GYRT +GKWHLG + P +GF
Sbjct: 91 YGQRYGYECNLDKPGD--GIPDDEELLPALLKRYGYRTGCIGKWHLG-SEPSQRPNAKGF 147
Query: 179 ESHLGYWTGHQDYF--------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
++ G GH+ YF D + ++ + G + D Y TD ++A
Sbjct: 148 DTFYGLLAGHRSYFYDPETSDKDGNLQQYQYNGQKLSFD---------GYFTDELASKAR 198
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
+ +++P LY++ A HS N +A + L E R K+AA+++ LD
Sbjct: 199 QFVAE--SEQPFMLYMSFTAPHSPN-----EAAEEDLA---RFEGQPRQKYAAMMYALDR 248
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA-G 349
VGK+V+ L+ ++II F+SDNGG+ N +SN PL+G K +EGG R
Sbjct: 249 GVGKIVDELKAAGKFDDTIIFFLSDNGGSTT----NQSSNLPLKGFKGNKFEGGQRVPFF 304
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
++W R +++ ++ L + + DIP
Sbjct: 305 VVWG----DRFKRDQRFTGLTSSLDIFATVVDALDIP 337
>gi|319794759|ref|YP_004156399.1| sulfatase [Variovorax paradoxus EPS]
gi|315597222|gb|ADU38288.1| sulfatase [Variovorax paradoxus EPS]
Length = 439
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 112/347 (32%), Positives = 169/347 (48%), Gaps = 36/347 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIP----TPNIDALAYSGIIL-KNYYTVQLCTPSRSAI 114
P+IIFI+ADDLG+ D+G +G +P +D LA +G+ L + Y +C+P+R A+
Sbjct: 5 PNIIFIVADDLGYADLGCYGGRDAAFGPVSPVLDGLAANGLKLTQGYANSPVCSPTRFAM 64
Query: 115 MTGKHPIHT-GMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
+T ++ G + RG GLP LP LK GY+T ++GKWHLG Y
Sbjct: 65 ITARYQYRLRGAAEEPIRSSSRGSTTLGLPTDHPTLPSLLKASGYQTALIGKWHLG-YPP 123
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDH--SAEEMKMWGL--DMRRDLEPAWDLHGKYSTDVF 225
+ P G++ G +G DYF H SA + +W D + D Y TD+
Sbjct: 124 HFGPLRSGYDEFFGPMSGGVDYFTHCDSAGQHDLWFGEEDKKED---------GYLTDML 174
Query: 226 TAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK---F 281
+ AVD + + E P FL L + A H P+E + I ++ D +
Sbjct: 175 SKRAVDYVERMAKQEQPFFLSLHYTAPHW--PWETRDDAEKAPAIKDNLFDLAGGNIHVY 232
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
++H +DE +G ++ AL++ M N+++VF SDNGG + NWPL G K L
Sbjct: 233 RRMIHHMDEGIGWIMAALQKHGMSDNTLVVFTSDNGGE------RFSDNWPLVGGKMDLT 286
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
EGG+R + P + ++G + Q DW T+L AA NY
Sbjct: 287 EGGIRVPWIAHWPAVIAKGGESAQLCMTMDWSATMLEAAGVKADANY 333
>gi|384098332|ref|ZP_09999449.1| chondroitinsulfatase [Imtechella halotolerans K1]
gi|383835828|gb|EID75248.1| chondroitinsulfatase [Imtechella halotolerans K1]
Length = 464
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 113/350 (32%), Positives = 171/350 (48%), Gaps = 46/350 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+IIFILADDLG D+G +G I TP+ID LA G+ N+Y +C PSR+A++TG+
Sbjct: 25 PNIIFILADDLGIGDIGVYGQSIIQTPHIDRLAEEGMKFNNFYAGSTVCAPSRAALLTGQ 84
Query: 119 HPIHTGMQHNVLYGCERGGLPL--SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
H HT ++ N G PL ++KI+P+ LK GY I GKW +G TP R
Sbjct: 85 HTGHTQVRGN-------GEFPLDPAKKIIPEVLKNAGYTNAIFGKWGMGLNGSGSTPDKR 137
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLD--MRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
GF+ + GH + SA K +D + DL+ Y + FT +A+ I+
Sbjct: 138 GFD----VFAGHIHHV--SAHYQKPDSIDAIVHGDLKRIGLPKDTYVNEYFTEQALQFIN 191
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED---------------FKRS 279
N D+P FL++++ H+ P H L+ + D F ++
Sbjct: 192 NQPKDKPFFLFMSYTIPHAELVVPPKYLQKHLLSAEQSKHDSEKEWPAGRHYGPQPFPKA 251
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA-----ASNWPLR 334
+AA++ LD+ G+++ AL+ + + N+I++F SDNG G S+ +
Sbjct: 252 AYAALVESLDDYTGQILAALKAKGIDENTIVIFTSDNGTHTEGGRTQEDVDYFGSSAHYQ 311
Query: 335 GVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVS---DWLPTLLSAA 380
GVK L+ GG++ LI W + +R Q HVS D PT A
Sbjct: 312 GVKRDLYTGGIKEPFLIRWPGTIPAR----TQSNHVSAFWDLYPTFAELA 357
>gi|332665095|ref|YP_004447883.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
gi|332333909|gb|AEE51010.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
Length = 531
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 112/395 (28%), Positives = 187/395 (47%), Gaps = 77/395 (19%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+IIFI+ADDLG++D+G +G + TPN+D LA GI L+++Y C P+R++++TG +
Sbjct: 43 PNIIFIMADDLGYSDIGCYG-GEAQTPNLDKLATKGIKLRSFYNAGRCCPTRASLLTGNY 101
Query: 120 PIHTGMQHNVLY-------GCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
GM + V + G +G L + + ++L+++GY T + GKWH+G + E+
Sbjct: 102 SHAAGMGNMVSFDDQKVTPGPYQGYLDPNTPTIAEHLRQVGYHTYMTGKWHVG-ERPEHW 160
Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
P RGF+ + G +G +F+ E+ K + + +D E G Y+TD FT A++
Sbjct: 161 PLKRGFDRYFGLISGASSFFEILQEKRKRYMV--LQDQEWVLPQEGFYATDAFTDRAIEF 218
Query: 233 IHNHS-TDEPLFLYLAHAATH-SANPYE-------------------------------- 258
I + +P FLYLA+ A H + YE
Sbjct: 219 IQGQAPQSKPFFLYLAYTAPHFPLHAYEADIAKYEALYLQGWDAIRQQRYEKMQGLGLVD 278
Query: 259 --------PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
P PD + K + +AA++ ++D+++GK+++ LE R+ L N++I
Sbjct: 279 KRYQLSPRPANIPDWNSVQDKQTWARKMAVYAAMIDRMDQNIGKLIQTLEARQELDNTLI 338
Query: 311 VFVSDNGGAAAGFN-----------------------LNAASNWPLRGVKNTLWEGGVRG 347
VF+SDNGG + + SN P + K+ + EGG+
Sbjct: 339 VFLSDNGGTSENVDDRKLGLPGKRIGERGSYGTYDAPWANVSNTPFKLYKSYMHEGGIIS 398
Query: 348 AGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
++ W ++ + VHV D LPT L A
Sbjct: 399 PCILHWPSKIKPKLGYLNTPVHVIDLLPTSLELAQ 433
>gi|323455471|gb|EGB11339.1| hypothetical protein AURANDRAFT_2359, partial [Aureococcus
anophagefferens]
Length = 367
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 117/348 (33%), Positives = 175/348 (50%), Gaps = 33/348 (9%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIP-----TPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
PHI+ +LAD++GW V +H +P TP +DAL SG+ L +YT + C+PSRSA+
Sbjct: 1 PHIVIMLADNVGWAVVEWHRPPALPAREISTPKLDALRASGLELDRHYTYKFCSPSRSAL 60
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY-TP 173
M+G+ P H + +N G P +++P L++ GYRT VGKWHLG + TP
Sbjct: 61 MSGRLPAHVNV-YNDDPAMAGAGAPAHMRLMPAMLRDAGYRTHFVGKWHLGMASRSLQTP 119
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG-------KYSTDVFT 226
RGF++ LGY+ +Y+D + E G DL WD G Y VF
Sbjct: 120 AARGFDTSLGYFHSTNNYYDGTRAEGC--GDAAHADL---WDGDGPSHLNGTAYEELVFR 174
Query: 227 AEAVDIIHNHSTDEPLFLYLA-HAATHSANP-------YEPLQAPDHYLNIHRHIEDFKR 278
VD++ H PLF+Y A H + +P + LQ Y + I+D R
Sbjct: 175 DRVVDVVEAHDAARPLFIYYALHTSCVGYDPDGTAGGERDSLQPDRAYYDKFSFIDDDDR 234
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN-WPLRGVK 337
A++ +D+ G VV AL + + +++++ SDNGGA +L SN WPLRG
Sbjct: 235 RANVAMVALMDDVAGDVVGALRTAGLWAKTLLLWSSDNGGA---VHLGGGSNTWPLRGGY 291
Query: 338 NTLWEGGVRGAGLIWSPLLE--SRGIVAEQYVHVSDWLPTLLSAANKS 383
+EGGVR + L +RG + ++++++DW T A S
Sbjct: 292 YNNFEGGVRAPAFLAGGALPEVARGKLLTEWIYIADWYATFARLAGAS 339
>gi|390944774|ref|YP_006408535.1| arylsulfatase A family protein [Belliella baltica DSM 15883]
gi|390418202|gb|AFL85780.1| arylsulfatase A family protein [Belliella baltica DSM 15883]
Length = 470
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 101/322 (31%), Positives = 160/322 (49%), Gaps = 44/322 (13%)
Query: 53 LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSR 111
+ S P+I+FI+ADDLG D+ +G TPN+ LA GI+ K++Y +C PSR
Sbjct: 25 VFGQSKKPNIVFIMADDLGVGDIEPYGQSIFETPNLSRLAQEGIMFKDFYAGSTVCAPSR 84
Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSE--KILPQYLKELGYRTRIVGKWHLGFYKK 169
+++MTG+H ++ N G PL KILP+ LKE GY + GKW LG +
Sbjct: 85 ASLMTGQHTGKIRIRGN-------GEFPLDPDVKILPEMLKENGYTNLMFGKWGLGLQES 137
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
E +P RG++ ++ GH + ++ L + L G Y+ D FT A
Sbjct: 138 ESSPEKRGWD----FFLGHLHHVSAHFQKPDSLDLFLNNTLGRVKVPEGTYANDWFTEGA 193
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL--------------------NI 269
++ I S +EP F+YL++ H+ L PD Y
Sbjct: 194 INYIETSSNEEPFFIYLSYTIPHAE-----LLVPDRYFKDFVNENGQSVFPNEIAWPDGK 248
Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA-- 327
H +++ ++ +AA++H LD+ VG+V++ALE + + N+I++F SDNG G
Sbjct: 249 HYGQQNYPKAAYAALVHSLDDYVGQVMDALESKGIADNTILIFTSDNGTHIEGGRRMEDV 308
Query: 328 ---ASNWPLRGVKNTLWEGGVR 346
S+ RGVK L++GG++
Sbjct: 309 DYFQSSGQYRGVKRDLYDGGIK 330
>gi|149199845|ref|ZP_01876874.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
gi|149137016|gb|EDM25440.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
Length = 491
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 111/355 (31%), Positives = 179/355 (50%), Gaps = 41/355 (11%)
Query: 47 SMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQL 106
SM + S P+IIFIL DD G+ D+ HG + TPN+D L + +Y
Sbjct: 15 SMTLCSMAKQSKSPNIIFILTDDQGYGDMAVHGHPYLETPNMDRLHSESVRFDRFYVSPS 74
Query: 107 CTPSRSAIMTGKHPIHTGMQHNV-----LYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
C+P+R+A+MTG H G+ H V LY +G L +++ LKE GY+T VGK
Sbjct: 75 CSPTRAALMTGMHEFRNGVTHTVQPREKLY---KGALTIAD-----ILKEGGYKTGFVGK 126
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
WHLG K Y P +RGF+ + G ++FD ++M R+ + + G +
Sbjct: 127 WHLG-NDKGYAPQYRGFDWYAKNAKGPHNHFD----------VEMIRNGK-RFQTKG-FR 173
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH--IEDFKRS 279
D F EA+ + + ++P FLYL + +P+ PL AP+ L ++ + D +
Sbjct: 174 EDAFFDEAMTFM-KEAGEQPFFLYLC-----TYSPHTPLGAPEDLLKKYKAKGLND-NHA 226
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
+ A++ +D+++G++ + L++ + ++I++F++DN G G ++ A +RG K T
Sbjct: 227 AYLAMIENIDDNLGRLDQFLKKENLYDDTILIFMNDN-GVTVGLDVYNAD---MRGPKCT 282
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVE 394
+WEGG R L P V H+ D LPTL A D+P V +E
Sbjct: 283 IWEGGTRAFSLWRWPKKWQPKTVENLTAHL-DVLPTLCELAG-VDVPEKVQGELE 335
>gi|386819701|ref|ZP_10106917.1| arylsulfatase A family protein [Joostella marina DSM 19592]
gi|386424807|gb|EIJ38637.1| arylsulfatase A family protein [Joostella marina DSM 19592]
Length = 479
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 111/358 (31%), Positives = 175/358 (48%), Gaps = 52/358 (14%)
Query: 51 VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTP 109
V + S P+I+FI+ADD+G D+G +G I TPNID LA +G+ N+Y +C P
Sbjct: 32 VTISNHSQKPNIVFIIADDMGIGDLGCYGQQYIQTPNIDKLASTGMKFTNFYAGSTVCAP 91
Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELGYRTRIVGKWHLGFY 167
SR+++MTG+ HTG+ Y G PL K I+ Q LK+ GY T + GKW LG
Sbjct: 92 SRASLMTGQ---HTGIS----YIRGNGEFPLRTKDTIISQILKKRGYTTAMFGKWGLGTE 144
Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFD-HSAEEMKMWGL---DMRRDLEPAWDLHGKYSTD 223
E +P +G++ Y+TGH + D H + +W + +M + P Y+ +
Sbjct: 145 TTEGSPEKKGWD----YFTGHLHHVDAHFQQPDSLWTIKKGEMAQIATPP----NSYANE 196
Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH------------- 270
+FT +AV I N + + P FLYL+ P+ L P+ YL ++
Sbjct: 197 IFTDKAVGFIENQTKENPFFLYLSFTV-----PHAELVVPNKYLKLYKNDQGQSIFMPEK 251
Query: 271 -----RHI--EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
RH + + ++ +AA++ +D +G+V+E ++ + N+++VF SDNG G
Sbjct: 252 AWPSGRHYGKQKYPKAAYAALVTSMDAYIGRVMEVIKSKGFDENTLVVFTSDNGTHIEGG 311
Query: 324 NL-----NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
S+ +GVK L+EGG+R + P G + D PTL
Sbjct: 312 RTIEDVEFFKSSSIYKGVKRDLYEGGIREPFIASWPEKIKAGQTSNHIAAFWDITPTL 369
>gi|109094666|ref|XP_001113032.1| PREDICTED: arylsulfatase A isoform 2 [Macaca mulatta]
gi|109094668|ref|XP_001113064.1| PREDICTED: arylsulfatase A isoform 3 [Macaca mulatta]
gi|109094670|ref|XP_001113092.1| PREDICTED: arylsulfatase A isoform 4 [Macaca mulatta]
Length = 507
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 114/350 (32%), Positives = 162/350 (46%), Gaps = 40/350 (11%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
PP+I+ I ADDLG+ D+G +G TPN+D LA G+ L ++Y LCTPSR+A++TG
Sbjct: 20 PPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRLTDFYVPASLCTPSRAALLTG 79
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF-YKKEYTPTFR 176
+ P+ TGM VL RGGLPL E L + L GY T + GKWHLG + + P +
Sbjct: 80 RLPVRTGMYPGVLVPSSRGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGSEGAFLPPHQ 139
Query: 177 GFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------MRRDLEPAW--DLHG 218
GF LG H Q+ G D + + +P W L
Sbjct: 140 GFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPRLEA 199
Query: 219 KYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
+Y A A D++ + D P FLY A TH + E
Sbjct: 200 RY-----VAFARDLMADAQRQDRPFFLYYASHHTHYPQ-----------FSGQNFAERSG 243
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
R F L +LD +VG ++ A+ +L ++++F +DNG + S LR K
Sbjct: 244 RGPFGDSLMELDAAVGTLMTAIRDLGLLEETLVIFTADNGPETMRMSRGGCSGL-LRCGK 302
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
T +EGGVR L + P + G+ E + D LPTL + A + +PN
Sbjct: 303 GTTYEGGVREPALAFWPGHIAPGVTHELASSL-DLLPTLAALAG-APLPN 350
>gi|423288214|ref|ZP_17267065.1| hypothetical protein HMPREF1069_02108 [Bacteroides ovatus
CL02T12C04]
gi|392671103|gb|EIY64579.1| hypothetical protein HMPREF1069_02108 [Bacteroides ovatus
CL02T12C04]
Length = 473
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 106/352 (30%), Positives = 166/352 (47%), Gaps = 46/352 (13%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
A S +I+ + DDLGW D+G TPN+D L G+ + Y Q +PSR++
Sbjct: 33 AESKRMNIVMMFVDDLGWADLGLRN-PHYNTPNVDRLFADGMNFERAYVTQPASSPSRAS 91
Query: 114 IMTGKHPIHTGMQHNV----------------------LYGCERGGLPLSEKILPQYLKE 151
+MTGK P+ + ++ Y R LP+ E I PQ +KE
Sbjct: 92 LMTGKQPVRFQLVRHIDNKVAGTTEPYSYWPENSPELNEYMPSRNWLPMEETIFPQLMKE 151
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
GY VGKWHLG E P RG++ G + + ++ A +M G D + +
Sbjct: 152 CGYYNMFVGKWHLG--GPEVWPDKRGYDEVYG-LSDFGNVSNYYAPFFRMRGADTTKKIA 208
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
+G Y TD T +AV +I N+ D+P L + + HS N + D +
Sbjct: 209 K----NGDYITDFLTDKAVSMIENYDRDQPFMLSMHYYNVHSPN----IGRKDLFDKYKA 260
Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
++ +AA + +DESVG++ +A++ R + N+I+++ SD GG SN+
Sbjct: 261 KGMTDIQANYAAQIEAVDESVGRIRKAIKDRGIEDNTIVIYFSDQGG--------LYSNY 312
Query: 332 PLRGVK---NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
PLRG K +T+ EGG+R I+ P + +G V + V D+ PT + A
Sbjct: 313 PLRGCKLIDDTMAEGGIRVPLAIYYPGVTKKGSVCQTPVQALDFFPTFMEIA 364
>gi|126663704|ref|ZP_01734700.1| putative arylsulfatase [Flavobacteria bacterium BAL38]
gi|126624287|gb|EAZ94979.1| putative arylsulfatase [Flavobacteria bacterium BAL38]
Length = 468
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 115/363 (31%), Positives = 178/363 (49%), Gaps = 40/363 (11%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
I+ FA+L F + ++ P+I+FILADD+G+N++G +G I TPNID LA
Sbjct: 6 ILTFAILIATFGIQ---AQETKNTKKPNIVFILADDMGYNELGSYGGKIIETPNIDQLAK 62
Query: 94 SGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERG-GLPLSEKILPQYLKE 151
G+ N+Y +C PSR +MTGKH H ++ N E +P SE + + LK
Sbjct: 63 EGMKFSNHYCGSNICAPSRGTLMTGKHTGHAYIRDNKPLPYEGNEPIPASEITVAEILKT 122
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY--WTGHQDYFDHSAEEMKMWGLDMRRD 209
GY T GKW LG+ E +P +GF+ GY +YF + + L+ D
Sbjct: 123 AGYTTGAFGKWGLGYPASEGSPNNQGFDQFYGYNGQIHAHNYFTSYLRKNDLVELNANID 182
Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
+ YS D+ A++ + + + P FLY H NPY Q D L
Sbjct: 183 AP-----YSVYSADIIKDRALEFVEVNKNN-PFFLYFCPTLPH--NPYH--QPDDKTLEY 232
Query: 270 H----------RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
+ H E+F K+AA+ +LD+ VG+++ L++ +L N++I+F SDNG A
Sbjct: 233 YAKKTGFPIGDAHSEEFSVPKYAALSSRLDQQVGEIMAKLKELNLLDNTLIIFASDNGSA 292
Query: 320 AAG-FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL---ESRGIVAEQYVHVS---DW 372
+ + LRG K+ ++EGG++ SPL+ + + I H+S D+
Sbjct: 293 LTKEEDSYLRTGGDLRGRKSEVYEGGIK------SPLIAFWKGKIIPGSSSNHISAFWDF 346
Query: 373 LPT 375
LPT
Sbjct: 347 LPT 349
>gi|402824519|ref|ZP_10873876.1| sulfatase family protein [Sphingomonas sp. LH128]
gi|402261924|gb|EJU11930.1| sulfatase family protein [Sphingomonas sp. LH128]
Length = 455
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 106/335 (31%), Positives = 161/335 (48%), Gaps = 48/335 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+ I IL DDLG+ D+ G IPTPN++ +A G++L +YY Q +CTPSR+ ++TG+
Sbjct: 32 PNFIVILCDDLGYGDIEPTGGTAIPTPNLNRMAREGMVLTDYYAPQNICTPSRAGLLTGR 91
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+PI TG+ + ++ LPL E +P+ L+ GYR+ + GKWHLG ++ PT GF
Sbjct: 92 YPIRTGLAWGAILQPDKRILPLDEVTIPKALQPAGYRSGLFGKWHLGHLGPDWQPTRHGF 151
Query: 179 ESHLGYWTGH-----QDYFDHSA------EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
+ G H Q + D S EE M L R F
Sbjct: 152 DRFYGIPYSHDMKPLQIFTDTSPDRRASEEEPVMAELQQR-----------------FYE 194
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
EAV+ I + D+P F+ LA +A H AN P RH +D S + ++ +
Sbjct: 195 EAVNFIEANK-DQPFFVELALSAPHLANYPNP-----------RHRDDKMASAYGEVVEE 242
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL-WEGGVR 346
+D VG ++ L + +++++F SDNG G S+ PLR K ++GG R
Sbjct: 243 IDAIVGALMVKLRTLGLDRDTLVLFTSDNGPWYEG------SSGPLRDRKGGAGYDGGYR 296
Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ W P G+ + D+LPT + A
Sbjct: 297 VPCIAWQPGTVPAGLRCDSIAMGIDFLPTFCAMAG 331
>gi|423341845|ref|ZP_17319560.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
CL02T12C29]
gi|409219938|gb|EKN12897.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
CL02T12C29]
Length = 461
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 102/296 (34%), Positives = 154/296 (52%), Gaps = 35/296 (11%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+I+ +LADD G+ D GF G I TPNID LA G I + + + +PSRS ++TG+
Sbjct: 31 PNILILLADDAGYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLTGR 90
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G + N+ + G+P E++LP LK GYRT +GKWHLG K P +GF
Sbjct: 91 YGQRYGYECNLDKPGD--GIPDDEELLPALLKRYGYRTGCIGKWHLG-SKPFQRPNAKGF 147
Query: 179 ESHLGYWTGHQDYF--------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
++ G GH+ YF D + ++ + G + D Y TD ++A
Sbjct: 148 DTFYGLLAGHRSYFYDPETSDKDGNLQQYQYNGQKLSFD---------GYFTDELASKAR 198
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
+ +++P LY++ A HS N +A + L E R K+AA+++ LD
Sbjct: 199 QFVAE--SEQPFMLYMSFTAPHSPN-----EATEEDLA---RFEGQPRQKYAAMMYALDR 248
Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
VGK+V+ L+ ++II F+SDNGG+ N +SN PL+G K +EGG R
Sbjct: 249 GVGKIVDELKAVGKFDDTIIFFLSDNGGSTT----NQSSNLPLKGFKGNKFEGGQR 300
>gi|298377639|ref|ZP_06987590.1| arylsulfatase [Bacteroides sp. 3_1_19]
gi|298265342|gb|EFI07004.1| arylsulfatase [Bacteroides sp. 3_1_19]
Length = 589
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 106/344 (30%), Positives = 167/344 (48%), Gaps = 49/344 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II +L+DD GW D+GF G + TPNID +A+ G IL+N+Y + +P+R+ +TG++
Sbjct: 27 PNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYVCPVSSPTRAEFLTGRY 86
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
+ +G+ ++ G ER L EK + +Y +E GY T + GKWH G + Y P RGFE
Sbjct: 87 HVRSGV-NSTTGGGER--FNLGEKTIAEYFREAGYATSLFGKWHSG-TQYPYHPNARGFE 142
Query: 180 SHLGYWTGH-----QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
G+ +GH +H+ E + G + D T +A+D I
Sbjct: 143 EFYGFCSGHWGNYWNPVLEHNGEIISGEG----------------FIIDDLTDKALDYIR 186
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF------------A 282
+H + P F++L++ HS P+Q PD + N + +R+ F
Sbjct: 187 DHK-EHPFFMFLSYNTPHS-----PMQVPDSWWNRVKDRTLSQRATFPEQEDTTFTKAAL 240
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
A+ LD ++G+V+ L + +I+++ SDNG + F N ++G K + E
Sbjct: 241 ALAENLDWNIGRVLSLLHSLDLEQETIVIYFSDNGPNS--FRWNGG----MKGRKGSTDE 294
Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
GGVR I P +G V Q D +PTLL A P
Sbjct: 295 GGVRSPFCIRWPGHIRKGAVETQLSGAIDLIPTLLGLAGIEYTP 338
>gi|417301368|ref|ZP_12088525.1| arylsulfatase A [Rhodopirellula baltica WH47]
gi|327542298|gb|EGF28785.1| arylsulfatase A [Rhodopirellula baltica WH47]
Length = 470
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 101/339 (29%), Positives = 167/339 (49%), Gaps = 33/339 (9%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P++I + DD G+ND+G +G I TPN+D LA G ++Y+ +C+PSR+A++TG
Sbjct: 28 PNVIVVFTDDQGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGC 87
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+P G+ +VL+ + GL E + +LK GY T VGKWHLG + KE PT GF
Sbjct: 88 YPKRVGLHQHVLFPQSKHGLHPDEVTIADHLKSAGYATACVGKWHLG-HHKETLPTSNGF 146
Query: 179 ESH----------------LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA-WDLHGKYS 221
+S+ LG + + D S+ + +W + +D E + +
Sbjct: 147 DSYYGIPYSNDMNHPDNKRLGKMSSDDRWTDQSS-AVTLWNTPLVQDEEIIELPVDQRTV 205
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T +T A++ + + D+P FLYL H+ P+ PL P+ + ++ +
Sbjct: 206 TRRYTDRAIEFVEANQ-DKPFFLYLPHSM-----PHIPLYVPEDVYDPD------PQNAY 253
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
++ +D VG++V+ + + ++IV+ SDN G F + S PLR K T +
Sbjct: 254 KCVIEHIDTEVGRLVQTVRDLGLSEKTLIVYTSDN-GPWLQFKNHGGSAGPLRAGKGTTF 312
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
EGG R ++W+P G + + D LPT+ S
Sbjct: 313 EGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASVT 351
>gi|325110588|ref|YP_004271656.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324970856|gb|ADY61634.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 458
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/384 (27%), Positives = 189/384 (49%), Gaps = 42/384 (10%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGPPH--IIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
+ AV +A T+S A+ P H ++ I+ADDLG+ D+ +G ++ TP +D +
Sbjct: 13 FLLLAVTTVALTVSQG----TAAEKPSHRNVVVIMADDLGYGDLSCYGCERYQTPFLDQM 68
Query: 92 AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERG-GLPLSEKILPQYL 149
A G+ ++++ +C+P+R+A++TG++ G+ + R GLPL +
Sbjct: 69 AAEGMKWSDFHSNGNVCSPTRAALLTGQYQQRWGVDRVITAAGHRDRGLPLEAVTFAETF 128
Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH--SAEEMKMWGLDMR 207
+ GY T I GKWH G Y+ + P +GF+ +GY +G+ D+ H A + W D
Sbjct: 129 SQHGYATAIYGKWHQG-YQARFNPVRQGFDHFVGYVSGNVDFQSHIDQAGQADWWHNDEL 187
Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
+ Y T + T AV I ++ ++P LYL H P+ P Q P+
Sbjct: 188 TE-------EAGYVTHLITQYAVKFIEDNK-EQPFCLYLPHE-----TPHYPYQGPNDLA 234
Query: 268 N-------IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
+ ++ D K + +A ++ ++D +G+V++ L++ + ++++F SDNGG +
Sbjct: 235 DRTVGGKFVNHGSRDDKAAAYAEMMIEMDRGIGEVLQTLQKHGLAEQTLVLFFSDNGGTS 294
Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G N LRG K + WEGG R + W+P G +Q D PTL A
Sbjct: 295 LG------DNGGLRGTKGSDWEGGHRVPMIAWNPETVPAGRDCDQLAITHDIFPTL---A 345
Query: 381 NKSDIPNYVNSTVENI--IPRYEN 402
+ +D+P + +++ I +P E+
Sbjct: 346 DYADLPLPKDQSLDGISLLPYLED 369
>gi|227540350|ref|ZP_03970399.1| arylsulfatase [Sphingobacterium spiritivorum ATCC 33300]
gi|227239674|gb|EEI89689.1| arylsulfatase [Sphingobacterium spiritivorum ATCC 33300]
Length = 473
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 119/371 (32%), Positives = 177/371 (47%), Gaps = 46/371 (12%)
Query: 39 VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
+L AF L + + L A P+I+FILADDLG+ D+G +G +I TPNID LA G+
Sbjct: 9 LLTAAF-LCLSTIGLQAQERKPNIVFILADDLGYGDLGAYGQKKIQTPNIDRLAAEGMKF 67
Query: 99 KNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKI--LPQYLKELGYR 155
+ +Y +C PSRS++MTG+H HT ++ N E G PL++ + Q L++ GY
Sbjct: 68 EQFYAGTSVCAPSRSSLMTGQHTGHTYIRGNREIEPE-GQQPLADSVQTFVQLLQKAGYA 126
Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
T GKW LG P +GF+ GY Q H +W D R +L+ D
Sbjct: 127 TGAFGKWGLGMVGTTGAPDQKGFDEFYGYNCQRQ---SHRYFPTHLWHNDQRVELQ-GND 182
Query: 216 LHGK--YSTDVFTAEAVDIIHNHSTDEPLFLY----LAHA-------------------A 250
L K Y+ + + + I H DEP FL+ L HA
Sbjct: 183 LKNKVQYAPALIQEQTLRFIDAHK-DEPFFLFIPTVLPHAELSGIDDEHYQKYASRFEET 241
Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
H N Y P Y ++ + R+ FA+++ ++D VG+V++ L+ + N+I+
Sbjct: 242 PHKGNDYGPGATVTGYASVEK-----PRATFASMVSRMDAYVGQVLKKLDDLGLTENTIV 296
Query: 311 VFVSDNG----GAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
+F SDNG G A N++ LRG K L+EGG+R ++ P +E
Sbjct: 297 IFSSDNGAHREGGADPVFFNSSGG--LRGYKRDLYEGGIRTPFIVKWPHKVKSHSTSEFT 354
Query: 367 VHVSDWLPTLL 377
D +PTLL
Sbjct: 355 GAFWDVMPTLL 365
>gi|150010519|ref|YP_001305262.1| N-acetylgalactosamine 6-sulfatase [Parabacteroides distasonis ATCC
8503]
gi|149938943|gb|ABR45640.1| N-acetylgalactosamine 6-sulfatase [Parabacteroides distasonis ATCC
8503]
Length = 589
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 106/344 (30%), Positives = 167/344 (48%), Gaps = 49/344 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II +L+DD GW D+GF G + TPNID +A+ G IL+N+Y + +P+R+ +TG++
Sbjct: 27 PNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYVCPVSSPTRAEFLTGRY 86
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
+ +G+ ++ G ER L EK + +Y +E GY T + GKWH G + Y P RGFE
Sbjct: 87 HVRSGV-NSTTGGGER--FNLGEKTIAEYFREAGYATSLFGKWHSG-TQYPYHPNARGFE 142
Query: 180 SHLGYWTGH-----QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
G+ +GH +H+ E + G + D T +A+D I
Sbjct: 143 EFYGFCSGHWGNYWNPVLEHNGEIISGEG----------------FIIDDLTDKALDYIR 186
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF------------A 282
+H + P F++L++ HS P+Q PD + N + +R+ F
Sbjct: 187 DHK-EHPFFMFLSYNTPHS-----PMQVPDSWWNRVKDRTLSQRATFPEQEDTTFTKAAL 240
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
A+ LD ++G+V+ L + +I+++ SDNG + F N ++G K + E
Sbjct: 241 ALAENLDWNIGRVLSLLHSLDLEQETIVIYFSDNGPNS--FRWNGG----MKGRKGSTDE 294
Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
GGVR I P +G V Q D +PTLL A P
Sbjct: 295 GGVRSPFCIRWPGHIRKGAVETQLSGAIDLIPTLLGLAGIEYTP 338
>gi|291225019|ref|XP_002732502.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 197
Score = 154 bits (389), Expect = 2e-34, Method: Composition-based stats.
Identities = 72/143 (50%), Positives = 92/143 (64%), Gaps = 1/143 (0%)
Query: 44 FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
L++V +A S PH+I I DD+GWNDV +H D I PN+ LA G+I YT
Sbjct: 11 LALAVVSKPSIAESDQPHVILIFVDDMGWNDVHWHNPD-IAMPNLMDLAADGVIFNQSYT 69
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
CTPSR+A+M G +P TG QH +++ G+PL K+LP+ LKE+GY T +VGKWH
Sbjct: 70 HPTCTPSRAAMMKGLYPFKTGNQHQMVFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWH 129
Query: 164 LGFYKKEYTPTFRGFESHLGYWT 186
LGF K EY PT RGF+SH G WT
Sbjct: 130 LGFCKDEYQPTNRGFDSHYGLWT 152
>gi|301308937|ref|ZP_07214882.1| arylsulfatase-like protein [Bacteroides sp. 20_3]
gi|423338414|ref|ZP_17316156.1| hypothetical protein HMPREF1059_02081 [Parabacteroides distasonis
CL09T03C24]
gi|300832963|gb|EFK63588.1| arylsulfatase-like protein [Bacteroides sp. 20_3]
gi|409233843|gb|EKN26675.1| hypothetical protein HMPREF1059_02081 [Parabacteroides distasonis
CL09T03C24]
Length = 589
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 106/344 (30%), Positives = 167/344 (48%), Gaps = 49/344 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II +L+DD GW D+GF G + TPNID +A+ G IL+N+Y + +P+R+ +TG++
Sbjct: 27 PNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYVCPVSSPTRAEFLTGRY 86
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
+ +G+ ++ G ER L EK + +Y +E GY T + GKWH G + Y P RGFE
Sbjct: 87 HVRSGV-NSTTGGGER--FNLGEKTIAEYFREAGYATSLFGKWHSG-TQYPYHPNARGFE 142
Query: 180 SHLGYWTGH-----QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
G+ +GH +H+ E + G + D T +A+D I
Sbjct: 143 EFYGFCSGHWGNYWNPVLEHNGEIISGEG----------------FIIDDLTDKALDYIR 186
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF------------A 282
+H + P F++L++ HS P+Q PD + N + +R+ F
Sbjct: 187 DHK-EHPFFMFLSYNTPHS-----PMQVPDSWWNRVKDRTLSQRATFPEQEDTTFTKAAL 240
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
A+ LD ++G+V+ L + +I+++ SDNG + F N ++G K + E
Sbjct: 241 ALAENLDWNIGRVLSLLHSLDLEQETIVIYFSDNGPNS--FRWNGG----MKGRKGSTDE 294
Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
GGVR I P +G V Q D +PTLL A P
Sbjct: 295 GGVRSPFCIRWPGHIRKGAVETQLSGAIDLIPTLLGLAGIEYTP 338
>gi|404450774|ref|ZP_11015752.1| arylsulfatase A family protein [Indibacter alkaliphilus LW1]
gi|403763554|gb|EJZ24502.1| arylsulfatase A family protein [Indibacter alkaliphilus LW1]
Length = 469
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 109/368 (29%), Positives = 177/368 (48%), Gaps = 49/368 (13%)
Query: 42 LAFTLSMVFVDLVA-----SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
L L M+F S P+I+FI+ADDLG D+G +G IPTPN+D +A +G+
Sbjct: 8 LGVMLVMIFGPFAGLYAQHFSEKPNIVFIIADDLGIGDIGPYGQTIIPTPNLDKMASTGM 67
Query: 97 ILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYR 155
++Y +C PSR+++MTG+H HT ++ N Y E G KILP+ LKE GY
Sbjct: 68 KFNDFYAGSTVCAPSRASLMTGQHTGHTLVRGNGEYPLESG-----RKILPERLKEAGYT 122
Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD--MRRDLEPA 213
+ GKW LG +P G++ ++ GH + SA K LD + ++
Sbjct: 123 NAMFGKWGLGLANTHSSPEKVGWD----HFVGHLHHV--SAHYQKPDSLDCIIEGEISRM 176
Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL------ 267
GKY+ D+F +++ I + ++P FLYL+ H+ L P+ Y
Sbjct: 177 PVPKGKYANDLFVENSIEFIETQTDEQPFFLYLSLTVPHAE-----LLVPERYFQPFLDE 231
Query: 268 --------------NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
+H + + ++ +AA++ +D+ VG+V+ ALE+ + N++++F
Sbjct: 232 SGKSIFPSEKAHPEGLHYGAQPYPKAAYAALVSSIDDYVGRVMSALEKIGIAENTLVIFT 291
Query: 314 SDNGGAAAGFNLNA-----ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVH 368
SDNG G S+ RGVK L++GG++ ++ P G ++
Sbjct: 292 SDNGTHVEGGRRMEDVDYFKSSAHFRGVKRDLFDGGIKVPLMVNWPGKIKEGSTSDFRGA 351
Query: 369 VSDWLPTL 376
D PTL
Sbjct: 352 FWDLFPTL 359
>gi|223938010|ref|ZP_03629909.1| sulfatase [bacterium Ellin514]
gi|223893411|gb|EEF59873.1| sulfatase [bacterium Ellin514]
Length = 500
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 124/394 (31%), Positives = 186/394 (47%), Gaps = 64/394 (16%)
Query: 31 RTRIMAFAVLPLAFTLSMVFVDLVAS--------SGPPHIIFILADDLGWNDVGFHGLDQ 82
R +IM AV + F ++V+ L+ S + P+ +FILADDLGW DVGF+G
Sbjct: 2 RMKIMKTAVERIVFGGNLVWALLLTSLCATRVHAADRPNFVFILADDLGWKDVGFNGSTF 61
Query: 83 IPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG---- 137
TPN+D LA G+ + Y +C+P+R++IMTGK+P + + ++
Sbjct: 62 YETPNLDRLAREGMRFTDAYAACSVCSPTRASIMTGKYPARLHLTDWLPGRPDKPDQILK 121
Query: 138 -------LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHL-GYWTGH- 188
LP +E L + L+E GY+T +GKWHLG + P GF+ ++ G GH
Sbjct: 122 HPKIITELPAAEITLAKALQEGGYKTAFIGKWHLGGLG--HWPEQAGFDINIGGCGMGHP 179
Query: 189 QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAH 248
YF G P G+Y D T EAV I N + P LYL+H
Sbjct: 180 SSYFSPYKNPTLKDG--------PV----GEYLADRLTDEAVKFIEN-TKGTPFLLYLSH 226
Query: 249 AATHSANPYEPLQAPDHYLN-IHRHIEDFKRSK--------------------FAAILHK 287
+ H+ PLQA + + + +K +AA++
Sbjct: 227 YSVHT-----PLQAKKGLIEKYQKKVMQLPPTKGPEFVTEGNTNARQVQNQPIYAAMMQS 281
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
LDESVG+V++ L++ + N++I+F SDNGG + SN PLR K +EGGVR
Sbjct: 282 LDESVGRVLDKLKELGLDKNTVIIFTSDNGGLSTAEG-APTSNMPLRAGKGWPYEGGVRE 340
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
++ P + V++ V +D+ PTLL A
Sbjct: 341 PLVVKWPGVTKAASVSDHQVMSTDYYPTLLEIAG 374
>gi|421612498|ref|ZP_16053605.1| arylsulfatase A, partial [Rhodopirellula baltica SH28]
gi|408496794|gb|EKK01346.1| arylsulfatase A, partial [Rhodopirellula baltica SH28]
Length = 487
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 102/343 (29%), Positives = 169/343 (49%), Gaps = 33/343 (9%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAI 114
+S P++I I DD G+ND+G +G I TPN+D LA G ++Y+ +C+PSR+A+
Sbjct: 43 TSEKPNVIVIFTDDQGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAAL 102
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TG +P G+ +VL+ + GL E + +LK GY T VGKWHLG + KE PT
Sbjct: 103 LTGCYPKRVGLHQHVLFPQSKHGLHPDEVTIADHLKSAGYATACVGKWHLG-HHKETLPT 161
Query: 175 FRGFESH----------------LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA-WDLH 217
GF+S+ LG + + D S+ + +W + +D E +
Sbjct: 162 SNGFDSYYGIPYSNDMNHPDNKRLGKMSSDDRWTDQSS-AVTLWNTPLVQDEEIIELPVD 220
Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
+ T +T A++ + + D+P FLYL H+ P+ PL P+ +
Sbjct: 221 QRTITRRYTDRAIEFVEANQ-DKPFFLYLPHSM-----PHIPLYVPEDVYDPD------P 268
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
++ + ++ +D VG++++ + + ++IV+ SDN G F + S PLR K
Sbjct: 269 QNAYKCVIEHVDTEVGRLIQTVRDLGLSEKTLIVYTSDN-GPWLQFKNHGGSAGPLRAGK 327
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
T +EGG R ++W+P G + + D LPT+ S
Sbjct: 328 GTTFEGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASVT 370
>gi|444722183|gb|ELW62881.1| N-acetylgalactosamine-6-sulfatase [Tupaia chinensis]
Length = 764
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 109/351 (31%), Positives = 170/351 (48%), Gaps = 48/351 (13%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
A G P +F + GW D+G +G TPN+D +A G++ N+Y+ LC+PSR+A
Sbjct: 61 AGPGTPTALFFM----GWGDLGVYGEPSRETPNLDQMAAEGMLFPNFYSANPLCSPSRAA 116
Query: 114 IMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
++TG+ PI TG N E GG+P SE +LP+ LK GY ++IVGKWHLG
Sbjct: 117 LLTGRLPIRTGFYTTNAHARNAYTPQEIVGGIPSSEHLLPELLKGAGYVSKIVGKWHLG- 175
Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY------ 220
++ ++ P GF+ G H +D+ A + + RD W++ G++
Sbjct: 176 HRPQFHPLRHGFDEWFGAPNCHFGPYDNKARP----NIPVYRD----WEMVGRFYEEFPI 227
Query: 221 --------STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
T ++ EA+D I + P FL+ A ATH+ P+ A +L
Sbjct: 228 SLKTGEANLTQIYLQEALDFIKRQAGRRPFFLHWAIDATHA-----PVYASQPFLGTS-- 280
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA--AAGFNLNAASN 330
+R ++ + ++D+SVGK++ L + +++ + F SDNG A +A SN
Sbjct: 281 ----QRGRYGDAVREIDDSVGKILALLRALGIGNSTFVFFTSDNGAALISAPRQGKCGSN 336
Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
P K T +EGG+R + W P G V+ Q + D T LS A
Sbjct: 337 GPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSLMDLFTTSLSLAG 387
>gi|241998988|ref|XP_002434137.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215495896|gb|EEC05537.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 167
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/161 (47%), Positives = 105/161 (65%)
Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
+FT +A+ +I +H +P+FLY +H A H + L+AP + +I++ RS A
Sbjct: 1 LFTKKALSLIKDHDPAKPMFLYFSHQAVHCGDYKVKLEAPASAIAHFPYIKEQNRSIHAG 60
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
+ +LD+SVG V+EAL +R ML NSIIVF +DNGG G N+ NWPLRG K TLWEG
Sbjct: 61 AVFELDKSVGLVMEALNKRGMLKNSIIVFSTDNGGLPWGKEPNSGYNWPLRGSKFTLWEG 120
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
GVRGA +WSPLL G ++ Q +H++DWLPT SAA + +
Sbjct: 121 GVRGAAFVWSPLLSKTGRLSNQLMHITDWLPTFYSAAGEKE 161
>gi|418528067|ref|ZP_13094017.1| sulfatase [Comamonas testosteroni ATCC 11996]
gi|371454443|gb|EHN67445.1| sulfatase [Comamonas testosteroni ATCC 11996]
Length = 442
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 105/341 (30%), Positives = 167/341 (48%), Gaps = 24/341 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQI------PTPNIDALAYSGI-ILKNYYTVQLCTPSRS 112
P++IFILADDLG+ D+G G +P +DA+A G+ + Y +C+P+R
Sbjct: 6 PNLIFILADDLGYADLGCTGARDAHNNATDVSPRLDAMAAQGLRFTRGYSNSSVCSPTRF 65
Query: 113 AIMTGK--HPIHTGMQHNV--LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
A+ TG+ + + + + ++G + GLP + L++ GY T +VGKWHLG Y
Sbjct: 66 ALATGRWQYRLRGAAEEPIASVHGDKVLGLPPDHPTVASLLRDAGYATALVGKWHLG-YP 124
Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
+ P G+E G+ G DYF H + D + +P Y TD+ +
Sbjct: 125 PHFGPRLSGYEEFYGFHAGGADYFAHCDPRGRP---DFWINEKP--HAEDGYLTDLLSRR 179
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL-NIHRHIEDFKRSKFAAILHK 287
AVD + + +P +L L ++A H +A L + +HI+ + ++H
Sbjct: 180 AVDFVKRQTAKQPFYLSLHYSAPHWPWLTREDRAESERLQGMGKHIDGGSIDTYQRMIHH 239
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE +G +++ALE++ M N++IVF SDNGG ++ WP G K L EGG+R
Sbjct: 240 MDEGIGWLLDALEEKGMSENTLIVFTSDNGGE------RFSNTWPFVGQKMDLLEGGIRV 293
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
L P + G V + DW T L+AA S +Y
Sbjct: 294 PLLARWPARMTAGAVCDTPSLTMDWSATFLAAAGVSADADY 334
>gi|355785121|gb|EHH65972.1| hypothetical protein EGM_02852 [Macaca fascicularis]
Length = 507
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 113/350 (32%), Positives = 162/350 (46%), Gaps = 40/350 (11%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
PP+I+ I ADDLG+ D+G +G TPN+D LA G+ L ++Y LCTPSR+A++TG
Sbjct: 20 PPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRLTDFYVPAPLCTPSRAALLTG 79
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF-YKKEYTPTFR 176
+ P+ TGM VL RGGLPL E L + L GY T + GKWHLG + + P +
Sbjct: 80 RLPVRTGMYPGVLVPSSRGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGSEGAFLPPHQ 139
Query: 177 GFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------MRRDLEPAW--DLHG 218
GF LG H Q+ G D + + +P W L
Sbjct: 140 GFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPRLEA 199
Query: 219 KYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
+Y A A D++ + D P FLY A TH + +
Sbjct: 200 RY-----VAFARDLMADAQRQDRPFFLYYASHPTHYPQ-----------FSGQNFAKRSG 243
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
R F L +LD +VG ++ A+ +L ++++F +DNG + S LR K
Sbjct: 244 RGPFGDSLMELDAAVGTLMTAIRDLGLLEETLVIFTADNGPETMRMSRGGCSGL-LRCGK 302
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
T +EGGVR L + P + G+ E + D LPTL + A + +PN
Sbjct: 303 GTTYEGGVREPALAFWPGHIAPGVTHELASSL-DLLPTLAALAG-APLPN 350
>gi|196234349|ref|ZP_03133178.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196221616|gb|EDY16157.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 483
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 110/354 (31%), Positives = 171/354 (48%), Gaps = 41/354 (11%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSA 113
A+ P++IFILADDLG D+G +G +I TPNID LA G+ L++Y +C PSR A
Sbjct: 22 ATPAKPNVIFILADDLGIGDLGCYGQQKIRTPNIDHLAADGMRFLQHYTGCSVCAPSRCA 81
Query: 114 IMTGKHPIHTGMQHNVLYGCERGG---LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
+MTG+H H ++ N G G +P + + ++ GY T I+GKW LG +
Sbjct: 82 LMTGRHMGHAAIRDNAQRGPSEEGQRPMPQDTFTVARLMQNAGYYTGIIGKWGLGMPEDH 141
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE--PAWDLHGK--------- 219
+P GF GY H+ +W + R L P++D+ K
Sbjct: 142 SSPRDMGFNYSFGYLC---QSMAHTYYPPYLWRNNERETLAGNPSYDVSMKGVIEPKGEI 198
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATH--------SANPYE------PLQAPDH 265
YS DV ++A+ + +H D+P FLYLA H S + Y P + H
Sbjct: 199 YSHDVMASDALKFVRDHH-DKPFFLYLAFTIPHLSLQVPEDSMSEYHGQWTETPFRNTKH 257
Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA--AAGF 323
Y N + R+ +A ++ ++D VG+++ L++ + N+++ F SDNG AG
Sbjct: 258 YAN-----NETPRAAYAGMITRMDRDVGRLMALLKELGIDDNTLVFFSSDNGAVFPLAGT 312
Query: 324 N-LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+ + S RG K L+EGG+R + P G+ +Q D+LPT+
Sbjct: 313 DPVFFQSTGGFRGYKQDLYEGGIRTPLIARWPGKIETGVTTDQASVFYDFLPTM 366
>gi|440714613|ref|ZP_20895192.1| arylsulfatase A [Rhodopirellula baltica SWK14]
gi|436440809|gb|ELP34113.1| arylsulfatase A [Rhodopirellula baltica SWK14]
Length = 470
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 109/363 (30%), Positives = 178/363 (49%), Gaps = 38/363 (10%)
Query: 39 VLPLAFTLSMVFVDLVAS-----SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
V+ LA L++V +A+ S P++I I DD G+ND+G +G I TPN+D LA
Sbjct: 2 VIGLAALLTLVGRPGLAAESNEKSDKPNVIVIFTDDQGYNDLGCYGSPNIKTPNLDRLAS 61
Query: 94 SGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKEL 152
G ++Y+ +C+PSR+A++TG +P G+ +VL+ GL E + +LK
Sbjct: 62 EGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQHVLFPQSTYGLHPDEVTIADHLKSA 121
Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESH----------------LGYWTGHQDYFDHSA 196
GY T VGKWHLG + KE PT GF+S+ LG + + D S+
Sbjct: 122 GYATACVGKWHLG-HHKETLPTSNGFDSYYGIPYSNDMNHPDNKRLGKMSSDDRWTDQSS 180
Query: 197 EEMKMWGLDMRRDLEPA-WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSAN 255
+ +W + +D E + + T +T A++ + + D+P FLYL H+
Sbjct: 181 -AVTLWNTPLVQDEEIIELPVDQRTVTRRYTDRAIEFVEANQ-DKPFFLYLPHSM----- 233
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
P+ PL P+ + ++ + ++ +D VG++V+ + + ++IV+ SD
Sbjct: 234 PHIPLYVPEDVYDPD------PQNAYKCVIEHIDTEVGRLVQTVRDLGLSEKTLIVYTSD 287
Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
N G F + S PLR K T +EGG R ++W+P G + + D LPT
Sbjct: 288 N-GPWLQFKNHGGSAGPLRAGKGTTFEGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPT 346
Query: 376 LLS 378
+ S
Sbjct: 347 IAS 349
>gi|221069561|ref|ZP_03545666.1| sulfatase [Comamonas testosteroni KF-1]
gi|220714584|gb|EED69952.1| sulfatase [Comamonas testosteroni KF-1]
Length = 442
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 106/341 (31%), Positives = 169/341 (49%), Gaps = 24/341 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQI------PTPNIDALAYSGI-ILKNYYTVQLCTPSRS 112
P++IFILADDLG+ D+G G +P +DA+A G+ + Y +C+P+R
Sbjct: 6 PNLIFILADDLGYADLGCTGARDAHNNATDVSPRLDAMAAQGLRFTRGYSNSSVCSPTRF 65
Query: 113 AIMTGK--HPIHTGMQHNV--LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
A+ TG+ + + + + ++G + GLP + L++ GY T +VGKWHLG Y
Sbjct: 66 ALATGRWQYRLRGAAEEPIASVHGDKVLGLPPDHPTVASLLRDAGYATALVGKWHLG-YP 124
Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
+ P G+E G+ G DYF H + D R+ EP Y TD+ +
Sbjct: 125 PHFGPRLSGYEEFYGFHAGGADYFAHCDPRGRP---DFWRNEEP--HTEDGYLTDLLSRR 179
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL-NIHRHIEDFKRSKFAAILHK 287
AVD + + ++P +L L ++A H +A L + +HI+ + ++H
Sbjct: 180 AVDFVKRQTAEQPFYLSLHYSAPHWPWLTREDRAESERLQGMGKHIDGGSIDTYQRMIHH 239
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE +G +++ALE++ M N++IVF SDNGG ++ WP G K L EGG+R
Sbjct: 240 MDEGIGWLLDALEEKGMSENTLIVFTSDNGGE------RFSNTWPFVGQKMDLLEGGIRV 293
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
L P + G V + DW T L+AA + +Y
Sbjct: 294 PLLARWPARMTAGAVCDTPSLTMDWSATFLAAAGVAAHADY 334
>gi|32476258|ref|NP_869252.1| arylsulfatase A [Rhodopirellula baltica SH 1]
gi|32446802|emb|CAD76638.1| arylsulfatase A [Rhodopirellula baltica SH 1]
Length = 489
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 102/337 (30%), Positives = 166/337 (49%), Gaps = 33/337 (9%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P++I I DD G+ND+G +G I TPN+D LA G ++Y+ +C+PSR+A++TG
Sbjct: 47 PNVIVIFTDDQGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGC 106
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+P G+ +VL+ GL E + +LK GY T VGKWHLG + KE PT GF
Sbjct: 107 YPKRVGLHQHVLFPQSTYGLHPDEVTIADHLKSAGYATACVGKWHLG-HHKETLPTSNGF 165
Query: 179 ESH----------------LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA-WDLHGKYS 221
+S+ LG + + D S+ + +W + +D E + +
Sbjct: 166 DSYYGIPYSNDMNHPDNKRLGKMSSDDRWTDQSS-AVTLWNTPLVQDEEIIELPVDQRTV 224
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T +T A++ + + D+P FLYL H+ P+ PL P+ + ++ +
Sbjct: 225 TRRYTDRAIEFVEANQ-DKPFFLYLPHSM-----PHIPLYVPEDVYDPD------PQNAY 272
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
++ +D VG++V+ + + ++IV+ SDN G F + S PLR K T +
Sbjct: 273 KCVIEHIDTEVGRLVQTVRDLGLSEKTLIVYTSDN-GPWLQFKNHGGSAGPLRAGKGTTF 331
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
EGG R ++W+P G + + D LPT+ S
Sbjct: 332 EGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIAS 368
>gi|408674712|ref|YP_006874460.1| sulfatase [Emticicia oligotrophica DSM 17448]
gi|387856336|gb|AFK04433.1| sulfatase [Emticicia oligotrophica DSM 17448]
Length = 518
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 114/399 (28%), Positives = 188/399 (47%), Gaps = 87/399 (21%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II I+ADD+G++D+G +G +I TPN+D LA +G+ L+N+Y C P+R++++TGK+
Sbjct: 32 PNIIIIMADDMGFSDIGCYG-SEISTPNLDKLAANGLKLRNFYNAGRCCPTRASLLTGKY 90
Query: 120 PIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
GM + V + ++ G L S + + LK++GY T + GKWH+G +Y
Sbjct: 91 SHAVGMGNMVSFEDQKVPKDNYQGYLEPSVPTIAEDLKKVGYHTYMTGKWHVG-ESPDYW 149
Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
P RGFE + G +G YF+ E+ K ++ D E G Y+TD FT +A++
Sbjct: 150 PLKRGFERYFGLISGASSYFEVLQEKRKR--FVVQDDKEYVLPKDGYYATDAFTDKAIEF 207
Query: 233 IHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDH----YLNIHRHIED------FKRSK- 280
+ + + P FLYLA+ A P+ PL A + Y N + D FK+ K
Sbjct: 208 LESSDKQNNPFFLYLAYTA-----PHFPLHAYEEDIAKYENFYLQGWDKTRTDRFKKMKQ 262
Query: 281 -----------------------------------FAAILHKLDESVGKVVEALEQRRML 305
+AA++ ++D+++GK+VE L++ + +
Sbjct: 263 IGLLDKRYQLTERPNDIQPWEDVKEKKSWARKMAVYAAMIDRMDKNIGKLVEYLKKSQQI 322
Query: 306 SNSIIVFVSDNGGA-----------------------AAGFNLNAASNWPLRGVKNTLWE 342
N+ I+F++DNG + F SN P + K + E
Sbjct: 323 DNTFIIFLADNGASPENVDDRKLGEPNKRIGERGSYGTYDFQWANVSNTPFKLYKKFMHE 382
Query: 343 GGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
GG+ +I W ++ + + HV D LPT L A
Sbjct: 383 GGIISPCIIHWPAKIKPKKEFIDAPSHVIDLLPTSLELA 421
>gi|305665425|ref|YP_003861712.1| arylsulfatase A [Maribacter sp. HTCC2170]
gi|88710180|gb|EAR02412.1| arylsulfatase A (precursor) [Maribacter sp. HTCC2170]
Length = 489
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 170/358 (47%), Gaps = 38/358 (10%)
Query: 42 LAFTLSMVFVDL---------VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALA 92
F +S +F+ L S PP+++ I DD G+ DVG G I TP++D +A
Sbjct: 16 FLFVISFLFILLSCKDNDQKVTKKSPPPNVVLIFTDDQGYQDVGVFGSPNIKTPHLDRMA 75
Query: 93 YSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
G+ L +YY+ Q +C+ SR+ ++TG +P G+ HN L G+ SE + + LK
Sbjct: 76 SEGVKLTSYYSAQAVCSASRAGLLTGCYPNRIGI-HNALGPGNTHGINSSETTMAEMLKN 134
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG--YWTGHQDYFDHSAEEMKMWGLDMRRD 209
GY+T I GKWHLG + E+ P GF+ G Y Y L + +
Sbjct: 135 NGYKTAIFGKWHLG-HHPEFMPNKHGFDEFFGIPYSNDMWPYHPQQGPIFNFPDLPLYEN 193
Query: 210 LEPAWDLHGKYS--TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
E D + S T T +VD I N + D P FLY+ H P+ PL
Sbjct: 194 -ETVIDTLTEQSNLTTQITERSVDFI-NRNKDNPFFLYVPH-----PQPHVPLFVS---- 242
Query: 268 NIHRHIEDFK----RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
E FK R + ++ ++D SVG+++EAL++ + N+I++F SDNG +
Sbjct: 243 ------EKFKGKSDRGLYGDVIMEIDWSVGQIIEALKKNGLEENTIVIFTSDNGPWLSYG 296
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
N +A S P R K T WEGG R ++ P G V + + D LPTL S N
Sbjct: 297 N-HAGSALPFREGKGTAWEGGQREPFIMKYPNKLKAGEVIDVPMMAIDLLPTLASLTN 353
>gi|398990930|ref|ZP_10694093.1| arylsulfatase A family protein [Pseudomonas sp. GM24]
gi|399012076|ref|ZP_10714404.1| arylsulfatase A family protein [Pseudomonas sp. GM16]
gi|398116682|gb|EJM06441.1| arylsulfatase A family protein [Pseudomonas sp. GM16]
gi|398142218|gb|EJM31121.1| arylsulfatase A family protein [Pseudomonas sp. GM24]
Length = 568
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 125/435 (28%), Positives = 196/435 (45%), Gaps = 116/435 (26%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK- 118
P+I+ I+ADD+G++D+ G +I TPN+DA+A +G+ NY+T C+P+RS +M+G
Sbjct: 38 PNIVLIVADDMGYSDLSAFG-SEIQTPNLDAIARAGVTFSNYHTAPTCSPTRSMLMSGTD 96
Query: 119 -HPIHTGMQHNVLYGCERGGLP----LSEKIL--PQYLKELGYRTRIVGKWHLGFYKKEY 171
H G ++ + G L+++++ P+ L+ GYRT +VGKWHLG E
Sbjct: 97 NHIAGVGTMAELVLPEQAGKRGYEGYLTDRVVAFPKLLQANGYRTMMVGKWHLG-KTPEL 155
Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK---------YST 222
P RGF+ G D+F G R+ + ++ L+G+ YS+
Sbjct: 156 GPKARGFDQSFALLEGSADHFTQ--------GGTTDRNPKASYRLNGEDIDLPKNFDYSS 207
Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED------- 275
D FT A++ I +TD+P F YLA+ A P+ PLQ P YL +R + D
Sbjct: 208 DYFTRTAIEQI-GAATDKPFFAYLAYTA-----PHWPLQVPTAYLERYRGLYDKGYANIA 261
Query: 276 ---FKRSK--------------------------------------FAAILHKLDESVGK 294
KR K +AA++ LD++VGK
Sbjct: 262 EARLKRMKDKGLVAQDVQINPGPGVWPTWEKLSPRERALESRRMEVYAAMITNLDDNVGK 321
Query: 295 VVEALEQRRMLSNSIIVFVSDNG-------------------------------GAAAGF 323
VV+ L+ N++ VF SDNG G+ +G+
Sbjct: 322 VVKHLKDTGQYDNTVFVFFSDNGAEGSNPDDVTPNNAKWIAENFDNSLDNIGKPGSFSGY 381
Query: 324 NLNAA--SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
N A S+ P R K +EGG+R A I P + G ++ Y+ V D PT L A
Sbjct: 382 GPNWARVSSTPFRMYKAFTYEGGIRTAAFIAGPGMR-HGEASDAYMSVKDLAPTFLELA- 439
Query: 382 KSDIPNYVNSTVENI 396
+ + P+ +++VE +
Sbjct: 440 RVEHPSTRDTSVEKM 454
>gi|403282838|ref|XP_003932845.1| PREDICTED: arylsulfatase A [Saimiri boliviensis boliviensis]
Length = 509
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 113/350 (32%), Positives = 162/350 (46%), Gaps = 40/350 (11%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
PP+I+ I ADDLG+ D+G +G TPN+D LA G+ ++Y V LCTPSR+A++TG
Sbjct: 22 PPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLTG 81
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK-EYTPTFR 176
+ P+ GM VL RGGLPL E L + L GY T + GKWHLG + + P +
Sbjct: 82 RLPVRMGMYPGVLVPSSRGGLPLEEVTLAEVLAAQGYLTGMAGKWHLGVGPEGAFLPPHQ 141
Query: 177 GFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------MRRDLEPAW--DLHG 218
GF LG H Q+ + G D + + +P W L
Sbjct: 142 GFHRFLGIPYSHDQGPCQNLTCFPPDTPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEA 201
Query: 219 KYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
+Y A A D++ + D P FLY A TH + E
Sbjct: 202 RY-----IAFAHDLMADAQRQDRPFFLYYASHHTHYPQ-----------FSGQSFAERSG 245
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
R F L +LD +VG ++ A+ +L ++++F +DNG + S LR K
Sbjct: 246 RGPFGDSLMELDAAVGALMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGL-LRCGK 304
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
T +EGGVR L + P + G+ E + D LPTL + A + +PN
Sbjct: 305 GTTYEGGVREPALAFWPGHITPGVTHELSSSL-DLLPTLAALAG-APLPN 352
>gi|300771164|ref|ZP_07081040.1| N-acetylgalactosamine 6-sulfate sulfatase [Sphingobacterium
spiritivorum ATCC 33861]
gi|300761834|gb|EFK58654.1| N-acetylgalactosamine 6-sulfate sulfatase [Sphingobacterium
spiritivorum ATCC 33861]
Length = 473
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 120/371 (32%), Positives = 176/371 (47%), Gaps = 46/371 (12%)
Query: 39 VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
+L AF L + + L A P+I+FILADDLG+ D+G +G QI TPNID LA G+
Sbjct: 9 LLTAAF-LCLSTIGLQAQERKPNIVFILADDLGYGDLGAYGQKQIQTPNIDRLAAEGMKF 67
Query: 99 KNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKI--LPQYLKELGYR 155
+ +Y +C PSRS++MTG+H HT ++ N E G PL++ + Q L++ GY
Sbjct: 68 EQFYAGASVCAPSRSSLMTGQHTGHTYIRGNREIEPE-GQQPLADSVQTFVQLLQKAGYA 126
Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
T GKW LG P +GF+ GY Q H +W D R +L D
Sbjct: 127 TGAFGKWGLGMVGTTGAPDQKGFDEFYGYNCQRQ---SHRYFPTHLWHNDQRVELH-GND 182
Query: 216 LHGK--YSTDVFTAEAVDIIHNHSTDEPLFLY----LAHA-------------------A 250
L K Y+ + + + I H DEP FL+ L HA
Sbjct: 183 LKNKVQYAPALIQEQTLRFIDVHK-DEPFFLFIPTVLPHAELSGIDDEHYQKYASRFQET 241
Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
H N Y P Y ++ + R+ FA+++ ++D VG+V++ L+ + N+I+
Sbjct: 242 PHKGNDYGPGATVTGYASVEK-----PRATFASMVSRMDAYVGQVLKKLDDLGLTENTIV 296
Query: 311 VFVSDNG----GAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
+F SDNG G A N++ LRG K L+EGG+R ++ P +E
Sbjct: 297 IFSSDNGAHREGGADPVFFNSSGG--LRGYKRDLYEGGIRTPFIVKWPHKVKPNSTSEFA 354
Query: 367 VHVSDWLPTLL 377
D +PTLL
Sbjct: 355 GAFWDVMPTLL 365
>gi|154250816|ref|YP_001411640.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
gi|154154766|gb|ABS61983.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
Length = 553
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 121/371 (32%), Positives = 175/371 (47%), Gaps = 53/371 (14%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
A + PP+I+ ILADDLG+ND+ G +PTPNID++A G + Y+ C PSR+
Sbjct: 66 AGNRPPNIVVILADDLGFNDISHFGGGIVPTPNIDSIARGGANFTSAYSGTAACAPSRAM 125
Query: 114 IMTGKHPIHTGMQ----------------------HNVLYGCERG---------GLPLSE 142
IMTG++ TG + H +L E GLP SE
Sbjct: 126 IMTGRYGTRTGFEFTPTPPGMTRIVDMFYNDGTRTHEMLVDREAAAKAPPFREQGLPGSE 185
Query: 143 KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTG---HQDYFDHSAEEM 199
L + LK GY +GKWHLG E+ P +GF+ + +G +D D ++
Sbjct: 186 ITLAEALKPKGYHNIHIGKWHLG-NAPEFLPNAQGFDESVMLESGLFLPEDSPDVVNAKL 244
Query: 200 KMWGLD------MRRDLE---PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAA 250
+D M+ AW Y TD +T EA+ I + + P FLYLAH
Sbjct: 245 PFDPIDQFLWARMQYATSYNGSAWFEPKGYLTDFYTDEAIKAIEANR-NRPFFLYLAHWG 303
Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
H+ PLQA + HIED + +AA++ LD SVG+V+++L++ + N+++
Sbjct: 304 VHT-----PLQASKADYDALSHIEDERLRVYAAMIVALDRSVGRVLQSLKENGLEENTLV 358
Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
+F SDNG A G+ N P RG K T +EGG+R P G V
Sbjct: 359 IFSSDNG--APGYIGLPDVNKPYRGWKLTFFEGGIRVPFFAKWPARIPAGTERTTPVAHL 416
Query: 371 DWLPTLLSAAN 381
D PT+++AA
Sbjct: 417 DMFPTIVAAAG 427
>gi|296121822|ref|YP_003629600.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296014162|gb|ADG67401.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 638
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 172/366 (46%), Gaps = 39/366 (10%)
Query: 25 LKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIP 84
L G + +M A + + + A PP+I+ +LADD GW D G +
Sbjct: 15 LMGFGISSYVMLLACCVVLISPTSSEAQQSARPQPPNIVVLLADDAGWGDYSVSGNPYVK 74
Query: 85 TPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKI 144
TP+ID++A G+ L N+Y +C+P+R+ +TG++ + TG++ V G ER L L E+
Sbjct: 75 TPHIDSIAQQGVSLTNFYVCPVCSPTRAEFLTGRYALRTGVR-GVSLGEER--LNLDEQS 131
Query: 145 LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWG 203
+ ++ ++ GYRT I GKWH G + Y P RGF+ LGY GH +Y D E G
Sbjct: 132 IAEHFRKAGYRTGIFGKWHNG-SQGPYHPLARGFDVQLGYTAGHWSEYIDAPLESQ---G 187
Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
+ + Y D A+D I S+ P F Y+ HS P P
Sbjct: 188 RPVTSE---------GYIVDTCMNAAIDFI--SSSQPPFFCYVPLTTPHS-----PWCVP 231
Query: 264 DHYLN--------IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
Y N + D R + A++ + D++VG+++ L+ + SN+I+++ SD
Sbjct: 232 QTYWNRWKDRDVGLTGKEADAVRCVY-AMMEQQDDAVGRLLARLDSLHLSSNTIVLYFSD 290
Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
NG +N + +RG K T+ EGGVR G + P G + D LPT
Sbjct: 291 NGPNTVRWNGD------MRGRKGTVDEGGVRSVGFLRWPGHIPPGSTQTGLIGAIDLLPT 344
Query: 376 LLSAAN 381
L AN
Sbjct: 345 LAGLAN 350
>gi|390958067|ref|YP_006421824.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
gi|390958408|ref|YP_006422165.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
gi|390412985|gb|AFL88489.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
gi|390413326|gb|AFL88830.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
Length = 470
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 119/368 (32%), Positives = 172/368 (46%), Gaps = 47/368 (12%)
Query: 43 AFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKN 100
AF LS + A + P+IIFILADD+G+ D +G + TP+ID LA G+ +
Sbjct: 21 AFALSNLTKSFAAQTKRRPNIIFILADDMGYADTSVYGQRKFKTPHIDRLAAEGVRFAQA 80
Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGC-----------ERGGLPLSEKILPQYL 149
Y +C PSRS +MTG + HT ++ N R L ++ + YL
Sbjct: 81 YAGAPVCAPSRSVLMTGLNTGHTRVRDNFALAAGHVGFKGKEEIRRASLTDADHTIADYL 140
Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT---GHQDYFDHSAEEMKMWGLDM 206
GY+T + GKWHL Y TPT GF++ G+ T Q YF
Sbjct: 141 ARSGYKTALFGKWHLDGYDPAATPTRHGFQTFKGWLTQTASTQGYFP-----------TQ 189
Query: 207 RRDLEPAWDL-------HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
R D E DL G+Y T + T ++V+ I S+D P FLY+A+ A HS P
Sbjct: 190 RYDNERLIDLPENANGKQGRYDTTMITDDSVNYIQ-QSSDTPFFLYVAYDAPHS-----P 243
Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
APD + + D +AA++ LD VGK++ AL+Q+ + ++I+ F SDNG
Sbjct: 244 YIAPDQ-GSYAKEPWDEDEKIYAAMIGHLDNGVGKILAALKQKGIDEDTIVFFASDNGPR 302
Query: 320 AAGFNLNAA------SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWL 373
+ SN L+G K ++EGG+R + P G V++ + D L
Sbjct: 303 SEPTPQQTKVVNFFDSNGALQGYKRDMYEGGIREPFIARWPGHIPAGKVSQVPIFFPDVL 362
Query: 374 PTLLSAAN 381
PT L A
Sbjct: 363 PTALDLAG 370
>gi|431799329|ref|YP_007226233.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
gi|430790094|gb|AGA80223.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
Length = 454
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 110/350 (31%), Positives = 172/350 (49%), Gaps = 25/350 (7%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
PP+II I ADDLG+ D+G +G I TP +D +A G+ +Y +CTPSR+A++TG
Sbjct: 29 PPNIIVIFADDLGYGDLGCYGHPTIQTPYLDRMAKEGMRFTQFYVGADVCTPSRAALLTG 88
Query: 118 KHPIHTGM---QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+ PI GM + VL+ GLP +E + LK GYRT IVGKWHLG + EY PT
Sbjct: 89 RLPIRYGMAGEERGVLFPDSSKGLPHTEATMASALKAAGYRTGIVGKWHLG-HLPEYLPT 147
Query: 175 FRGFESHLG--YWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
GF+ + G Y + + + ++ + DL D+ + T +T EA+
Sbjct: 148 THGFDFYFGIPYSNDMRPNPHNHVPPLPLYRNEEVIDL----DIDQRQLTKRYTKEAIRF 203
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I + + P FLY + P+ PL A + KR + ++ +LD SV
Sbjct: 204 IEENQ-ERPFFLYYPNNF-----PHVPLYASSDFEG------KSKRGIYGDVVSELDWSV 251
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G+++E L+ + N++++F SDNG + ++ G K + +EGG+R + W
Sbjct: 252 GRILEKLKALDLAENTLVIFTSDNGPWLSQKEKGGSAGLLFEG-KASTYEGGMRVPAIAW 310
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
P ++ V D PT L+ A ++P +NI+P N
Sbjct: 311 WPGTIKPNQISTALVASMDLYPTFLTMAGL-NMPQDKTLDGDNILPILRN 359
>gi|449138001|ref|ZP_21773306.1| arylsulfatase A [Rhodopirellula europaea 6C]
gi|448883380|gb|EMB13908.1| arylsulfatase A [Rhodopirellula europaea 6C]
Length = 470
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 102/339 (30%), Positives = 166/339 (48%), Gaps = 33/339 (9%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P++I I DD G+ND+G +G I TPN+D LA G ++Y+ +C+PSR+A++TG
Sbjct: 28 PNVIVIFTDDQGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGC 87
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+P G+ +VL+ GL E + +LK GY T VGKWHLG + KE PT GF
Sbjct: 88 YPKRVGLHQHVLFPQSNYGLHPEEVTIADHLKSAGYATACVGKWHLG-HHKETLPTSNGF 146
Query: 179 ESH----------------LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA-WDLHGKYS 221
+S+ LG + + D S+ + +W + +D E + +
Sbjct: 147 DSYYGIPYSNDMNHPDNKRLGKMSSDDRWTDQSS-AVTLWNTPLVQDEEIIELPVDQRTI 205
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T +T A++ + + D+P FLYL H+ P+ PL P+ + ++ +
Sbjct: 206 TRRYTDRAIEFVEANQ-DKPFFLYLPHSM-----PHIPLYVPEDVYDPD------PQNAY 253
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
++ +D VG++V+ + + ++IV+ SDN G F + S PLR K T +
Sbjct: 254 KCVIEHIDTEVGRLVQTVRDLGLSKKTLIVYTSDN-GPWLQFKNHGGSAGPLRAGKGTTF 312
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
EGG R ++W+P G + + D LPT+ S
Sbjct: 313 EGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASVT 351
>gi|351700549|gb|EHB03468.1| Arylsulfatase A [Heterocephalus glaber]
Length = 507
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 111/372 (29%), Positives = 172/372 (46%), Gaps = 42/372 (11%)
Query: 45 TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT- 103
L + + +A++ PP+I+ I ADDLG+ D+G +G TPN+D LA G+ ++Y
Sbjct: 6 ALCVAWAASLATTSPPNIVLIFADDLGYGDLGAYGHPSSTTPNLDRLAAGGLRFTDFYVP 65
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
V LCTPSR+A++TG+ P+ TGM VL +GGLPL E L + L GY T + GKWH
Sbjct: 66 VSLCTPSRAALLTGRLPVRTGMYPGVLGPTSQGGLPLEEVTLAEVLAARGYLTGMAGKWH 125
Query: 164 LGFYKK-EYTPTFRGFESHLGYWTGHQD-----------------YFDHSAEEMKMWGLD 205
LG + + P +GF LG H D + + +
Sbjct: 126 LGVGPRGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPAATRCDGVCDQGLVPIPLLA-N 184
Query: 206 MRRDLEPAW--DLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQA 262
+ +++P W L +Y A A D++ + P FLY A TH
Sbjct: 185 LSVEVQPPWLPGLEARY-----VAFARDLMADAQHQGRPFFLYYASHHTHYPQ------- 232
Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
+ + R F L +LD +VG ++ A+ +L ++++F +DNG
Sbjct: 233 ----FSGQSFVGSSGRGPFGDSLMELDAAVGALMMAVGDLGLLGETLVIFTADNGPETMR 288
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
+ S LR K T +EGGVR + + P + G+ E + D LPT L+A +
Sbjct: 289 MSRGGCSGL-LRCGKGTTFEGGVREPAVAFWPGHITPGVTHELASSL-DLLPT-LAALSS 345
Query: 383 SDIPNYVNSTVE 394
+ +PN V+
Sbjct: 346 APLPNVTLDGVD 357
>gi|410965904|ref|XP_003989478.1| PREDICTED: arylsulfatase A [Felis catus]
Length = 507
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 115/381 (30%), Positives = 171/381 (44%), Gaps = 60/381 (15%)
Query: 45 TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT- 103
L++ F +A++GPP+I+ I ADDLG+ D+G +G TPN+D LA G+ ++Y
Sbjct: 6 ALALAFAIGLATAGPPNIMLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYVP 65
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
V LCTPSR+A++TG+ P+ G+ VL RGGLPL E L + L GY T I GKWH
Sbjct: 66 VSLCTPSRAALLTGRLPVRMGLYPGVLEPSSRGGLPLEEVTLAEVLAARGYLTGIAGKWH 125
Query: 164 LGFYKK-EYTPTFRGFESHLGYWTGHQD-----------------YFDHSAEEMKMWGLD 205
LG + + P +GF LG H D + + +
Sbjct: 126 LGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPSTPCDGSCDQGLVPIPLLA-N 184
Query: 206 MRRDLEPAW--DLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQA 262
+ + +P W L +Y A A D++ + P FLY
Sbjct: 185 LSVEAQPPWLPGLEARY-----VAFARDLMADAQRQGRPFFLY----------------- 222
Query: 263 PDHYLNIHRHIEDFK---------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
Y + H H F R F L +LD +VG ++ A+ +L ++++F
Sbjct: 223 ---YASHHTHYPQFSGQSFSGRSGRGPFGDSLMELDAAVGALMTAVGDLGLLGETLVIFT 279
Query: 314 SDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWL 373
+DNG + S LR K T +EGGVR L + P + G+ E + D L
Sbjct: 280 ADNGPETMRMSHGGCSGL-LRCGKGTTFEGGVREPALAFWPGHIAPGVTHELASSL-DLL 337
Query: 374 PTLLSAANKSDIPNYVNSTVE 394
PT L+A + +PN V+
Sbjct: 338 PT-LAALTGAPLPNVTLDGVD 357
>gi|121609253|ref|YP_997060.1| sulfatase [Verminephrobacter eiseniae EF01-2]
gi|121553893|gb|ABM58042.1| sulfatase [Verminephrobacter eiseniae EF01-2]
Length = 442
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 111/346 (32%), Positives = 166/346 (47%), Gaps = 31/346 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIP----TPNIDALAYSGIIL-KNYYTVQLCTPSRSAI 114
P+IIFI+ADDLG+ D+G +G +P +DALA G+ L + Y +C+P+R A+
Sbjct: 5 PNIIFIVADDLGYADLGCYGARAAGFGPVSPTLDALAAGGLRLTQGYSNSPVCSPTRFAL 64
Query: 115 MTGKHPIHT-GMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
MT ++ G + RG GLP LP L+ GYRT ++GKWHLG Y
Sbjct: 65 MTARYQYRLRGAAEEPINSQSRGSTTLGLPPEHPTLPSLLRGAGYRTALMGKWHLG-YPP 123
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
+ P G+E G +G DYF H + + L + +P Y TD+ T A
Sbjct: 124 AFGPLRSGYEEFFGPMSGGVDYFTHCSSSGQH-DLYLGAQEQP----QDGYLTDLITEHA 178
Query: 230 VDIIHNHS----TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS---KFA 282
+D + + P FL L + A H P+E +HRH+ +
Sbjct: 179 LDYVARMAPGAKAGTPFFLSLHYTAPHW--PWETRDDQALAPQLHRHLFHLHGGSIHSYR 236
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
++H +DE +G+++ L Q + ++++VF SDNGG + NWPL G K L E
Sbjct: 237 RMIHHMDEGIGRLMALLAQHGLTRDTLLVFTSDNGGE------RFSDNWPLVGGKMDLTE 290
Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
GG+R + P + + G V+ Q DW TLL AA + +Y
Sbjct: 291 GGIRVPWIAHWPAVIAPGGVSAQTCMTMDWSATLLDAATVAPDADY 336
>gi|264676341|ref|YP_003276247.1| sulfatase [Comamonas testosteroni CNB-2]
gi|262206853|gb|ACY30951.1| sulfatase [Comamonas testosteroni CNB-2]
Length = 442
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 104/336 (30%), Positives = 165/336 (49%), Gaps = 28/336 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQI------PTPNIDALAYSGI-ILKNYYTVQLCTPSRS 112
P++IFILADDLG+ D+G G +P +DA+A G+ + Y +C+P+R
Sbjct: 6 PNLIFILADDLGYADLGCTGARDAHNNATDVSPRLDAMAAQGLRFTRGYSNSSVCSPTRF 65
Query: 113 AIMTGK--HPIHTGMQHNV--LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
A+ TG+ + + + + ++G + GLP + L++ GY T +VGKWHLG Y
Sbjct: 66 ALATGRWQYRLRGAAEEPIASVHGDKVLGLPPDHPTVASLLRDAGYATALVGKWHLG-YP 124
Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
+ P G+E G+ G DYF H + D + EP Y TD+ +
Sbjct: 125 PHFGPRLSGYEEFYGFHAGGADYFAHCDPRGRP---DFWINEEP--HTEDGYLTDLLSRR 179
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPY---EPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
AVD + S ++P +L L ++A H P+ + L+ +HI+ + ++
Sbjct: 180 AVDFVKRQSAEQPFYLSLHYSAPHW--PWLTRDDLEESRRIGGFGKHIDGGSIDTYQRMI 237
Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
H +DE +G +++ALE+ + N++IVF SDNGG ++ WP G K L EGG+
Sbjct: 238 HHMDEGIGWLLDALEENGLSENTLIVFTSDNGGE------RFSNTWPFVGQKMDLLEGGI 291
Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
R L P + G V + DW T L+AA
Sbjct: 292 RVPLLARWPARMTAGAVCDTPSLTMDWSATFLAAAG 327
>gi|87309459|ref|ZP_01091594.1| arylsulphatase A [Blastopirellula marina DSM 3645]
gi|87287767|gb|EAQ79666.1| arylsulphatase A [Blastopirellula marina DSM 3645]
Length = 457
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 120/373 (32%), Positives = 180/373 (48%), Gaps = 42/373 (11%)
Query: 28 LGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPN 87
+ YR I A VL L+ + A+ P+I+FIL DD+G D G +G TP+
Sbjct: 5 MNYRQCIAAILVL-----LASGALHSDAAPTKPNIVFILIDDMGCKDAGCYGATNFSTPH 59
Query: 88 IDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER---GG------- 137
ID LA G+ + Y +C+P+R+++MTGKHP + N + R G
Sbjct: 60 IDRLANQGMRFTDAYAAPVCSPTRASLMTGKHPARLHLT-NFIPQIGRQLPAGKLIPPGF 118
Query: 138 ---LPLSEKILPQYLKELGYRTRIVGKWHLG-FYKKEYTPTFRGFES-HLGYWTGHQDYF 192
LPL EK + Q L GY+ ++GKWHLG + EY P RGF+ L G +YF
Sbjct: 119 NHVLPLDEKTIAQELHADGYQCAMIGKWHLGEEHGPEYRPQNRGFDRVVLSEHHGIFNYF 178
Query: 193 DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATH 252
++ K W P G Y D T EA+D + + + P FLYL+H + H
Sbjct: 179 YPFVDQQK-WPYAGPLPGNP-----GDYLPDRLTDEAIDFVRENR-ERPFFLYLSHWSVH 231
Query: 253 SANPYEPLQAPDHYLNIHRH--IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
AP+ + +R +E+ + + +AA++ +D SVG+++ L++ + N++
Sbjct: 232 GR-----YFAPESLIAKYRERGLEE-RPAIYAAMMETVDNSVGRLMATLDELNLADNTLF 285
Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
VF+SDNGG S PLRG K +L+EGGVR ++ P + V
Sbjct: 286 VFMSDNGGE------RITSMAPLRGSKGSLYEGGVRVPLIVRYPGVVKPNTTCSVPVISH 339
Query: 371 DWLPTLLSAANKS 383
D PT L A +S
Sbjct: 340 DLFPTFLDFAERS 352
>gi|430746414|ref|YP_007205543.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430018134|gb|AGA29848.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 590
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 116/349 (33%), Positives = 170/349 (48%), Gaps = 54/349 (15%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PP+++ ILADD GW D+ HG + TPNID+LA G + + +Y +C P+R+ +TG+
Sbjct: 23 PPNVVVILADDQGWGDLSVHGNTNLKTPNIDSLARDGALFERFYVCPVCAPTRAEFLTGR 82
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G++ V G ER L L+EK + + K GY T GKWH G + Y P RGF
Sbjct: 83 YHPRGGVR-GVTSGGER--LDLNEKTIAETFKSAGYATGAFGKWHNG-TQFPYHPNARGF 138
Query: 179 ESHLGYWTGH-QDYFD----HSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
+ + G+ +GH +YFD H+ ++ G + TD T A+ I
Sbjct: 139 DEYYGFTSGHWGEYFDPPLEHNGRPVQGNG----------------FITDDLTDHAISFI 182
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY--------LNIHRH---IEDFKRSKFA 282
S D P F YL HS P+Q PD + L + H ED ++ A
Sbjct: 183 -KASKDRPFFCYLPFNTPHS-----PMQVPDRFYDKFKNAALKLRAHDAEREDLMMTRAA 236
Query: 283 -AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
A+ +DE+VG+V++ LE + +I+++ SDNG + +N +RG K +
Sbjct: 237 LAMCENIDENVGRVLKTLEDLSLDQKTIVLYFSDNGPNSWRWNGG------MRGRKGSTD 290
Query: 342 EGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN---KSDIP 386
EGGVR LI W + VA+ + D LPTL A KSD P
Sbjct: 291 EGGVRSPLLIRWPKQIRPETRVAKISAAI-DLLPTLTDLAGIPVKSDKP 338
>gi|449680623|ref|XP_004209633.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
Length = 327
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 84/165 (50%), Positives = 120/165 (72%), Gaps = 4/165 (2%)
Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANP-YEPLQAPDHYLNIHRHIEDFKRSKFA 282
++T++A+ I NH+ EPLFLYLA A HS+N EPLQAP +++ +I+ +R K+A
Sbjct: 1 MYTSKAISYITNHNCSEPLFLYLAFQAVHSSNSEVEPLQAPQEWIDKFSYIKHEQRRKYA 60
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
A+L +D +G+V EAL +++ML NS+I+F +DNGG + GFN N A+N+PLRGVK TL+E
Sbjct: 61 AMLAYMDYGIGRVHEALIEKKMLDNSVIIFTTDNGGPSNGFNNNWANNFPLRGVKATLFE 120
Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDI 385
GGVRG G ++S L+E + V+ +HV+DWLPTL+S A N SDI
Sbjct: 121 GGVRGVGFVYSKLIE-KSRVSHDLIHVTDWLPTLVSLAGGNVSDI 164
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 64/126 (50%), Gaps = 13/126 (10%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDE-WQISALTRGKWKLVKENSINGNG----TSEN 564
+DG D W+ L + S+RN IL NID+ W+ AL G WKL+KE N +G S N
Sbjct: 167 LDGFDQWATLQNKQSSQRNEILLNIDEHVWKNEALRVGSWKLIKEGLKNWDGWYPPPSYN 226
Query: 565 RSNDNSYQNEI--DGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM 622
S D S + I G D V++ + + H ++D +I L+K K V+ M
Sbjct: 227 ESMDESLNSSILKCGHDFPVVINHCD---DYCLFHVVEDPCEIVDLSK---KFPDVLATM 280
Query: 623 RYQVDL 628
Y++++
Sbjct: 281 MYRLNI 286
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 60/132 (45%), Gaps = 33/132 (25%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDE-WQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
+DG D W+ L + S+RN IL NID+ W+ AL G WKL+K
Sbjct: 167 LDGFDQWATLQNKQSSQRNEILLNIDEHVWKNEALRVGSWKLIK---------------- 210
Query: 634 QVYLSGLSDRE-WLAL-----AMRKLRDAASIQCG---PVKEVPCEPQIAPCLFDIKNDP 684
GL + + W +M + +++ ++CG PV C+ CLF + DP
Sbjct: 211 ----EGLKNWDGWYPPPSYNESMDESLNSSILKCGHDFPVVINHCDDY---CLFHVVEDP 263
Query: 685 CEKNNLADRSED 696
CE +L+ + D
Sbjct: 264 CEIVDLSKKFPD 275
>gi|340622362|ref|YP_004740814.1| chondroitinsulfatase [Capnocytophaga canimorsus Cc5]
gi|339902628|gb|AEK23707.1| Chondroitinsulfatase [Capnocytophaga canimorsus Cc5]
Length = 456
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 102/323 (31%), Positives = 165/323 (51%), Gaps = 37/323 (11%)
Query: 48 MVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQL 106
+ F+ ++ + P+IIFILADD+G DV +G I TPNI L+ G+ ++Y +
Sbjct: 2 LFFLGVIQAQDKPNIIFILADDMGIGDVSPYGQKTIQTPNIQKLSDEGMTFSDFYAGSTV 61
Query: 107 CTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELGYRTRIVGKWHL 164
C PSR++++TG+H HT ++ N G PL +K I P+ LK+ GY+ I GKW +
Sbjct: 62 CAPSRASMLTGQHTGHTKVRGN-------GEYPLDDKKQIFPEILKKAGYQNAIFGKWGM 114
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD--LEPAWDLHGKYST 222
G TP RGF++ G+ H D + + LD+ R LE G Y+
Sbjct: 115 GLKNSPSTPLSRGFDAFAGFLH-HIDAHFQTPDS-----LDVIRQGRLERIALQQGAYAN 168
Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLA----HAATHSANPYEPLQAPDHYLNIHRH------ 272
D F + +D I ++ P F+YL+ HA A+ + Q + +IH +
Sbjct: 169 DYFLNQTLDFIDKNANKSPFFIYLSLTVPHAELSVADIHYEKQFDSNGTSIHPNEKAFKG 228
Query: 273 ----IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN-----GGAAAGF 323
++F ++ +AA++ +D VG++++ + + + N+II+F SDN GG A
Sbjct: 229 GHYGAQEFPKAAYAAMVSSIDYYVGQILQKVADKNIDDNTIIIFSSDNGTHVEGGRTAQD 288
Query: 324 NLNAASNWPLRGVKNTLWEGGVR 346
S+ RGVK L+EGG+R
Sbjct: 289 VAYFQSSGEYRGVKRDLYEGGIR 311
>gi|126339031|ref|XP_001366628.1| PREDICTED: arylsulfatase A [Monodelphis domestica]
Length = 506
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 176/371 (47%), Gaps = 34/371 (9%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
++++L++ F +A + PP+ I I ADDLG+ D+G +G TPN++ +A G+ L ++
Sbjct: 3 ISWSLTLAFTTSLAFARPPNFILIFADDLGYGDLGCYGHPSSSTPNLNRMAAKGLRLTDF 62
Query: 102 YT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
Y V LCTPSR+A++TG++P+ +G+ VLY GGLPLSE + + L E GY T + G
Sbjct: 63 YVPVPLCTPSRAALLTGRYPVRSGLYPGVLYPGSLGGLPLSEVTMAEVLAERGYLTGMAG 122
Query: 161 KWHLGFYK-KEYTPTFRGFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-------MR 207
KWHLG + P +GF LG H Q+ + G D +
Sbjct: 123 KWHLGVGPNGSFLPPHQGFHRFLGVPYSHDQGPCQNLTCFPPATLCYGGCDQGVVPIPLM 182
Query: 208 RDL----EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
+L +PAW L G V A+ I+ +P FLY A TH
Sbjct: 183 ANLSVKEQPAW-LPGLEKQYVAFAQNF-IVDAARQGQPFFLYYASHHTHYPQ-------- 232
Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
+ R F L +LD +VG +++ +E + +++ F +DNG
Sbjct: 233 ---FGEETFVGRSGRGPFGDALMELDSAVGTLLDTVENLGLSEETLVFFTADNGPETKRM 289
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+S LR K T +EGG+R L + P G V+ + D LPT+ + A+ +
Sbjct: 290 FRGGSSGL-LRCGKGTTYEGGIREPALAFWPGHIMPG-VSHELASSLDLLPTMAALAS-A 346
Query: 384 DIPNYVNSTVE 394
+PN V+
Sbjct: 347 HLPNVTLDGVD 357
>gi|444523022|gb|ELV13435.1| Arylsulfatase J [Tupaia chinensis]
Length = 383
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 78/181 (43%), Positives = 109/181 (60%), Gaps = 10/181 (5%)
Query: 201 MWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
M G D+ + AWD +G YST ++T I+ +H+ +P+FLY+A+ A HS P
Sbjct: 1 MCGYDLYENDHAAWDQDNGIYSTQMYTQRVQQILASHNPAKPIFLYIAYQAVHS-----P 55
Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
LQAP Y +R I + R ++AA+L LDE++ V AL+ NSI+++ SDNGG
Sbjct: 56 LQAPGRYFEHYRSIININRRRYAAMLACLDEAIHNVTLALKTYGFYDNSILIYSSDNGGQ 115
Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
SNWPLRG K T WEGG+R G + SPLL++RG V + VH++DW PTL+S
Sbjct: 116 PTA----GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNRGTVCRELVHITDWYPTLISL 171
Query: 380 A 380
A
Sbjct: 172 A 172
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 45/184 (24%), Positives = 76/184 (41%), Gaps = 28/184 (15%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-----ALTKGKW--KLVKVVKVMRYQV 626
++DG D+W +S S R ILHNID + + A G W + ++V +++
Sbjct: 181 QLDGYDIWETISEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKL 240
Query: 627 DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVK----EVPCEPQIAPCLFDIKN 682
LTG P G SD W+ A GP + V + LF+I
Sbjct: 241 -LTGNP------GYSD--WVP-------PQAFSNLGPTRWHNERVTLSTGKSVWLFNITA 284
Query: 683 DPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKK 742
DP E+ +L+ R + + +FN+ A P + ++ + + K++
Sbjct: 285 DPYERVDLSSRYPGI-VKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVWGPWYKEEY 343
Query: 743 KKKK 746
KKK+
Sbjct: 344 KKKQ 347
>gi|171910116|ref|ZP_02925586.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
Length = 480
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 112/358 (31%), Positives = 170/358 (47%), Gaps = 44/358 (12%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+++ L DDLG D+G G TP IDALA SG+ ++Y+ +C+P+R+A+MTGK
Sbjct: 26 PNVLLFLVDDLGSQDLGVEGSKFYETPAIDALAASGVRFSSFYSAHPVCSPTRAALMTGK 85
Query: 119 HPIHTGMQHNVLYGCERGG--LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
P G+ Y + G LP +E + + GY+T VGKWHLG + P
Sbjct: 86 VPQRVGITD---YIKPKSGVALPTAETTIGEAFAAQGYQTGYVGKWHLGEADAD-QPAQH 141
Query: 177 GFESHLGYWTGHQD---YFDHSAEEMK--MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
GF+ G Q Y+ + ++ K +W + DLEP + G Y TD T ++++
Sbjct: 142 GFQWTAAVNRGGQPASYYYPYRKKDGKDTLWDVP---DLEPGTE--GDYLTDALTGKSLE 196
Query: 232 IIHNHSTDEPLFLYLAHAATHSA---------------------NPYEPLQAPDHYLNIH 270
+ T +P FL +H A H+ P P++AP +
Sbjct: 197 FLKQRDTTKPFFLCFSHYAVHTPIQPPQELVAKYQAKAKAMYGDAPAPPVEAPFGAQSRP 256
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA--AGFNLNAA 328
R + +AA+L LD ++G+V++ALE++++ +I+VF SDNGG A
Sbjct: 257 RQDD----PAYAALLENLDTNIGRVLKALEEQKLREKTIVVFTSDNGGLCTLAKGRTGPT 312
Query: 329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
SN PLR K +EGG+R I P G V + D PTLL + +P
Sbjct: 313 SNLPLRCGKGWNYEGGIRTPCYISWPGHLKGGTVIGTPAYTPDLYPTLLELSGLPQLP 370
>gi|239816550|ref|YP_002945460.1| sulfatase [Variovorax paradoxus S110]
gi|239803127|gb|ACS20194.1| sulfatase [Variovorax paradoxus S110]
Length = 438
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 117/394 (29%), Positives = 184/394 (46%), Gaps = 40/394 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIP----TPNIDALAYSGIIL-KNYYTVQLCTPSRSAI 114
P+IIFI+ADDLG+ D+G +G +P +D LA +G+ L + Y +C+P+R A+
Sbjct: 4 PNIIFIVADDLGYADLGCYGGRDAAFGPVSPVLDGLAANGLKLTQGYSNSPVCSPTRFAM 63
Query: 115 MTGKHPIHT-GMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
+T ++ G + RG GLP LP LK GY+T ++GKWHLG Y
Sbjct: 64 ITARYQYRLRGAAEEPINSKSRGSTTLGLPTEHPTLPSLLKAGGYQTALIGKWHLG-YPP 122
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDH--SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
+ P G++ G +G DYF H S +W + + E Y TD+ +
Sbjct: 123 TFGPLRSGYDEFFGPMSGGVDYFTHCDSTGRHDLWFGEEDKQEE-------GYLTDILSK 175
Query: 228 EAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK---FAA 283
AVD + + D P FL L + A H P+E D + ++ D +
Sbjct: 176 RAVDYVERMAKQDAPFFLSLHYTAPHW--PWETRDDADKAPLVKDNLFDLAGGNIHVYRR 233
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
++H +DE +G ++ AL + M N+++VF SDNGG + NWPL G K L EG
Sbjct: 234 MIHHMDEGIGWIMAALRKHGMADNTLVVFTSDNGGE------RFSDNWPLVGGKMDLTEG 287
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
G+R + P + ++G + Q DW T+L AA +Y V +++P +++
Sbjct: 288 GIRVPWIAHWPAVIAKGGESRQLCMTMDWSATMLDAAGVKADADYPLDGV-SLMPVLKDA 346
Query: 404 ILRYENGTHEYNSPRIENSNTRYENGTHEYNPKY 437
H + P N R + + + KY
Sbjct: 347 -------RHSFRRPLHWRMNHRGQQAMRDGDWKY 373
>gi|340620655|ref|YP_004739108.1| sulfatase [Zobellia galactanivorans]
gi|339735452|emb|CAZ98829.1| Sulfatase, family S1-17 [Zobellia galactanivorans]
Length = 609
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 118/360 (32%), Positives = 177/360 (49%), Gaps = 50/360 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P++I + ADD G+ D+ HG I TPN+DA A + L N++ C PSR+ +MTG++
Sbjct: 40 PNVIIVFADDQGYGDLASHGNPYIKTPNLDAFAKESLELTNFHVGTTCAPSRAGLMTGRN 99
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
H + GC L EK + + KE GY T + GKWHLG + P RGFE
Sbjct: 100 GNRNNAWHTIA-GCSI--LLEDEKTMAEVFKEGGYNTAMFGKWHLG-DNYPFRPHDRGFE 155
Query: 180 SHL-----------GYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
L YW + YFD + R+ EP L G Y TDV+ E
Sbjct: 156 HALYNGGGGVQQTPDYW--NNTYFDDT----------YFRNGEPV-KLEG-YCTDVWFNE 201
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH--IEDFKRSKFAAILH 286
A+ I + DEP FLYL A HS P P+ Y ++ + + D+++ +F ++
Sbjct: 202 AIKHIQ-RTEDEPFFLYLPLNAAHS-----PFNVPESYAKMYENEPLADYQK-RFYGMIS 254
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN-----AASNWPLRGVKNTLW 341
+DE+ GK+V L+ R++ N+I+++ +DN G AAG + N L+G K + +
Sbjct: 255 NIDENFGKLVRFLKDRKLFDNTIVIYTTDN-GTAAGIKTDKDGNVTGYNAGLKGTKGSHY 313
Query: 342 EGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVE--NIIP 398
+GG + I W +G V + V D LPTL A+ S IP V+ ++ N+ P
Sbjct: 314 DGGHKVPFFISWPDGDIKKGSVNNELVANVDLLPTL---AHMSGIPFEVDKPLDGANVAP 370
>gi|406830821|ref|ZP_11090415.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 497
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 104/320 (32%), Positives = 155/320 (48%), Gaps = 18/320 (5%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
++A ++ L FT L A+ PP+II I+ DD G+ D HG + TP++D L
Sbjct: 22 VLAASICLLLFTWGGRLPRLHAA--PPNIIVIITDDQGYGDFSCHGNPVLKTPHLDRLHD 79
Query: 94 SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
G+ +++ C P+RSAIMTG+H G+ H +L ER L L+ LPQ LK +G
Sbjct: 80 EGVRFLDFHVSPTCAPTRSAIMTGRHEFRNGVTHTIL---ERERLTLTATTLPQVLKTVG 136
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTG--HQDYFDHSAEEMKMWGLDMRRDLE 211
Y T I GKWHLG + Y P RGF+ + G Q Y + D
Sbjct: 137 YTTGIFGKWHLG-DEPAYRPGQRGFDETFIHGGGGIGQTYPGSCGDAPGNTYFDPAIMHN 195
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA-PDHYLNIH 270
++ Y TDVF +A+ I+ +P F Y+A A P+ PLQ P+
Sbjct: 196 GMFEKTQGYCTDVFFGQAMTWINTVKGRQPFFCYIATNA-----PHGPLQVRPEDEQRYR 250
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
+ D +KF ++ +D++VG+++ L + +++VF++DNGG A N
Sbjct: 251 GKVADSNVAKFFGMISNIDDNVGRLLAKLTDWGIERETLVVFMNDNGGTVG----VAIHN 306
Query: 331 WPLRGVKNTLWEGGVRGAGL 350
+RG K T W GG R A
Sbjct: 307 AGMRGQKGTPWLGGTRAASF 326
>gi|256421633|ref|YP_003122286.1| sulfatase [Chitinophaga pinensis DSM 2588]
gi|256036541|gb|ACU60085.1| sulfatase [Chitinophaga pinensis DSM 2588]
Length = 452
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 97/321 (30%), Positives = 164/321 (51%), Gaps = 13/321 (4%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+++ I DD G DV +G + TPNID LA G++ +Y +C+PSR++++TG+
Sbjct: 28 PNVLIIYTDDQGTLDVNCYGAKDLHTPNIDRLAKEGVLFSQFYAAAPVCSPSRASLLTGR 87
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+P + +N G+P S+ + + K+ GY T +GKWH+G Y E P +GF
Sbjct: 88 YPQRAQLDNNAPSEEGHAGMPGSQYTMAEMFKDGGYTTAHIGKWHIG-YSPETMPNQQGF 146
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH-NHS 237
+ G+ G D + H D+ R+ + W+ GK+ D+ E + N
Sbjct: 147 DYSFGFMGGCIDNYSHYFYWAGPNRHDLWRNGQEIWE-DGKFFADLTVQEVNGFLEKNKR 205
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
D+P FLY A P+ PLQ + + ++ + +R +AA + +DE +G+V++
Sbjct: 206 ADKPFFLYWAINM-----PHYPLQGQEKWRQYYKDLPAPRR-MYAAAVSTMDEKIGQVLQ 259
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS-NWPLRGVKNTLWEGGVRGAGLI-WSPL 355
L++ + N+I+VF SD G + + P RG K +L+EGG+R +I W+
Sbjct: 260 QLDRLGLAENTIVVFQSDQGHSTEDRSFGGGGFTGPYRGAKFSLFEGGIRVPAIIRWTGH 319
Query: 356 LESRGIVAEQYVHVSDWLPTL 376
L + + V++ DW PTL
Sbjct: 320 LPKNEVRDQLCVNI-DWYPTL 339
>gi|417303302|ref|ZP_12090360.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
gi|327540274|gb|EGF26860.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
Length = 419
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 102/338 (30%), Positives = 164/338 (48%), Gaps = 59/338 (17%)
Query: 98 LKNYYTV-QLCTPSRSAIMTGKHPIHTGM------------------QHNVLYGCER-GG 137
+ N+Y CTPSR++++TG++P G+ ++ V + ER GG
Sbjct: 1 MTNFYVAWPACTPSRASLLTGRYPQRNGIYDMIRNEAPDFGHRYTPEEYAVTF--ERIGG 58
Query: 138 LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE 197
+ E ILP L+ GY++ I GKW LG ++ PT RGF+ G+ DYF H
Sbjct: 59 MDEREVILPAVLRPAGYKSGIYGKWDLGALRR-MLPTSRGFDDFYGFVNTGIDYFTHERY 117
Query: 198 EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
+ M R+LEP G Y T +F EA+ + H+ +EP FLY+ A H+++
Sbjct: 118 GVPC----MVRNLEPTEADKGTYCTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSL 173
Query: 258 EP-----LQAPDHYLNIHRHIE--------------------DFKRSKFAAILHKLDESV 292
+P +QAPD + ++ +E +R + A + +D ++
Sbjct: 174 DPTIRSSVQAPDQFKAMYPTVEVETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAI 233
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI- 351
G++++ LE ++ML +I+VF SDNGG+ A N PLRG K WEGG+R L+
Sbjct: 234 GEMLDRLEAKQMLDETIVVFFSDNGGS------GGADNAPLRGHKAQTWEGGIRVPCLVR 287
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYV 389
W G+V ++++ + LP+ +AA P V
Sbjct: 288 WPAGQIPAGVVNDEFLTSLELLPSFAAAAGVEPPPGVV 325
>gi|443722642|gb|ELU11403.1| hypothetical protein CAPTEDRAFT_152949 [Capitella teleta]
Length = 486
Score = 152 bits (384), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 107/339 (31%), Positives = 167/339 (49%), Gaps = 44/339 (12%)
Query: 65 ILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHT 123
+L DD+GW D+G G TPN+D +A G++L ++Y+ LC+PSR+A++TG+ PI
Sbjct: 1 MLMDDMGWGDLGVFGHPAKETPNLDKMAAEGLLLPDFYSANPLCSPSRAALLTGRLPIRN 60
Query: 124 GMQHNVLYG-------CERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
G + + GG+P E + P+ L++ GY +I+GKWHLG + +Y P
Sbjct: 61 GFYSDNAHARNAYTPQTIVGGIPDEEILFPELLQKTGYTNKIIGKWHLG-QQAQYHPLLH 119
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS-------------TD 223
GF+ G H FD ++ + + R+ E + G+Y T
Sbjct: 120 GFDEWFGAPNCHFGPFDDKSQP----NIPVYRNAE----MIGRYYEEFEITKNGESNLTQ 171
Query: 224 VFTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
++T EA D I+ S+D +P LY A ATH +PL A +L +R +
Sbjct: 172 MYTQEATDFIYRLSSDKQPFLLYWAVDATH-----DPLYASKKFLGTS------QRGLYG 220
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
+ +LD VG++++ALE ++L N++++F SDN GAA SN P K T E
Sbjct: 221 DAMRELDWGVGEILKALETAQVLDNTLVIFSSDN-GAATYAKTGGGSNGPFLCGKETTLE 279
Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
GG+R + + P G V+ Q + D T L A
Sbjct: 280 GGMREPTIAYWPGKIKPG-VSHQLGSLMDIFTTTLDLAG 317
>gi|326802120|ref|YP_004319939.1| sulfatase [Sphingobacterium sp. 21]
gi|326552884|gb|ADZ81269.1| sulfatase [Sphingobacterium sp. 21]
Length = 471
Score = 152 bits (384), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 118/367 (32%), Positives = 175/367 (47%), Gaps = 45/367 (12%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+IIFILADDLG+ DVGF+G I TPN+D LA GI +Y +C PSRS++M+G+
Sbjct: 26 PNIIFILADDLGYGDVGFNGQRLINTPNLDRLAKEGIRFNQFYAGTAVCAPSRSSLMSGQ 85
Query: 119 HPIHTGMQHNVLYGCERGG---LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
H HT ++ N+ G E G +P S + + + LK+ GY+T GKW LG E P
Sbjct: 86 HTGHTFIRGNL--GVEPEGQYPIPDSVETIAETLKKAGYQTGAFGKWGLGPVGSEGDPNK 143
Query: 176 RGFESHLGY---WTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
+GF+ GY H+ Y H + + L DLE H +Y+ D+ +A+
Sbjct: 144 QGFDRFYGYNCQSLAHRYYPTHLWDNSEKVVLTENGDLE----YHKQYAPDLIQEQALGF 199
Query: 233 IHNHSTDEPLFLYLAHAATHS------------------ANPYEPLQAPDHYLNIHRHIE 274
I H D P FL+L + H+ P++ + +
Sbjct: 200 IEEHK-DRPFFLFLPYILPHAELLVPDDSIFQYYKGKFEEKPFKGFDYGKGAKSGGYASQ 258
Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN-----GGAAAGFNLNAAS 329
+ S FAA++ +LD VG++ E L + N++IVF SDN GGA F ++
Sbjct: 259 QYPHSTFAAMVARLDLYVGQIEEKLRSLGLDENTLIVFTSDNGPHKEGGADPNFFQSSGG 318
Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL-----LSAANKSD 384
LRG+K L+EGG+R + P G +E D PT + A +D
Sbjct: 319 ---LRGIKRDLYEGGIRVPFVAKWPKAIRAGQQSEFIGAFWDIFPTFAELGGAATAASTD 375
Query: 385 IPNYVNS 391
++VN+
Sbjct: 376 GISFVNT 382
>gi|299533802|ref|ZP_07047173.1| sulfatase [Comamonas testosteroni S44]
gi|298718218|gb|EFI59204.1| sulfatase [Comamonas testosteroni S44]
Length = 442
Score = 152 bits (384), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 106/341 (31%), Positives = 168/341 (49%), Gaps = 24/341 (7%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQI------PTPNIDALAYSGI-ILKNYYTVQLCTPSRS 112
P++IFILADDLG+ D+G G +P +DA+A G+ + Y +C+P+R
Sbjct: 6 PNLIFILADDLGYADLGCTGARDAHNNATDVSPRLDAMAAQGLRFTRGYSNSSVCSPTRF 65
Query: 113 AIMTGK--HPIHTGMQHNV--LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
A+ TG+ + + + + ++G + GLP + L++ GY T +VGKWHLG Y
Sbjct: 66 ALATGRWQYRLRGAAEEPIASVHGDKVLGLPPDHPTVASLLRDAGYATALVGKWHLG-YP 124
Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
+ P G+E G+ G DYF H + D + EP Y TD+ +
Sbjct: 125 PHFGPRLSGYEEFYGFHAGGADYFAHCDPRGRP---DFWINEEP--HTEDGYLTDLLSRR 179
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL-NIHRHIEDFKRSKFAAILHK 287
AVD + S ++P +L L ++A H +A L + +HI+ + ++H
Sbjct: 180 AVDFVKRQSAEQPFYLSLHYSAPHWPWLTREDRAESERLQGMGKHIDGGSIDTYQRMIHH 239
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE +G +++ALE++ M N++IVF SDNGG ++ WP G K L EGG+R
Sbjct: 240 MDEGIGWLLDALEEKGMSENTLIVFTSDNGGE------RFSNTWPFVGQKMDLLEGGIRV 293
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
L P + G V + DW T L+AA + +Y
Sbjct: 294 PLLARWPARMTAGAVCDTPSLTMDWSATFLAAAGVAAHADY 334
>gi|196230145|ref|ZP_03129008.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196225742|gb|EDY20249.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 487
Score = 152 bits (383), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 111/337 (32%), Positives = 167/337 (49%), Gaps = 38/337 (11%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRS 112
+A+ PP++I I+ADDLG+ DVG +G TPN D LA+ G +++ Q +C+ SR+
Sbjct: 27 LAADRPPNVILIVADDLGYADVGVYGAKGFETPNFDRLAHEGRRFTDFHVAQAVCSASRA 86
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
AIMTG +P G++ + + G+ E +PQ K GY T +VGKWHLG E+
Sbjct: 87 AIMTGCYPNRIGIE-GAMEPWYKFGISDQELTMPQMFKRKGYATGMVGKWHLG-TPTEFL 144
Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDM------------RRDLEPAWDLHGKY 220
PT RGF+ G + + H + K L + RD+E
Sbjct: 145 PTHRGFDEWFGLPYSNDQWPLHPEKPGKFPPLPLYEGDKVINPGINHRDME--------Q 196
Query: 221 STDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
T +T AV+ I NH D+P FLY+A P+ PL D + KR
Sbjct: 197 LTTQYTERAVNFIDRNH--DKPFFLYVAQTM-----PHVPLAVSDKFRGAT------KRG 243
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
+ + ++D S G+++ AL++ + ++II+F+SDNG F +A S +PLR K T
Sbjct: 244 LYGDAVEEIDWSTGEILAALKKYGLEKDTIIMFLSDNGPWLI-FGNHAGSAYPLREGKTT 302
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
W+GG R +I P G V+ + D +PTL
Sbjct: 303 TWDGGTRVPFIISWPGHIPAGTVSTEMACAIDLMPTL 339
>gi|443704175|gb|ELU01350.1| hypothetical protein CAPTEDRAFT_214223 [Capitella teleta]
Length = 336
Score = 152 bits (383), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 89/252 (35%), Positives = 136/252 (53%), Gaps = 11/252 (4%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II+ ADD G+ D G+ D I TPNID L GI N Y+ Q C+PSRS+ ++G++
Sbjct: 94 PNIIWFFADDYGYQDAGYRNSD-IHTPNIDQLVADGISFTNAYSAQQCSPSRSSFLSGRY 152
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
+GMQH V+ + L L YLKEL Y T GKWHLG+ KE TPT+RGF+
Sbjct: 153 AYTSGMQHGVISDTAAHCMDLKYNFLSDYLKELNYNTHASGKWHLGYCNKECTPTYRGFD 212
Query: 180 SHLGYWTGHQDYFDHSA-EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+ G ++G Y++H+ + M W + D + G +S D+ A+ ++ + + +
Sbjct: 213 TFSGGYSGEGKYYEHTTFQGMYDWHNGTKLDRSAS----GIHSQDLIEADLLNQL-DEND 267
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
PLF Y A THS P +P P+H R K+ ++ +D +GKVV
Sbjct: 268 GTPLFYYAAFQNTHS--PMQP--KPEHEALYDDLDAPSTRKKYLGLVSGMDAVIGKVVAK 323
Query: 299 LEQRRMLSNSII 310
L+++ M N+++
Sbjct: 324 LKEKDMFDNTVM 335
>gi|323449289|gb|EGB05178.1| hypothetical protein AURANDRAFT_72312 [Aureococcus anophagefferens]
Length = 514
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 98/275 (35%), Positives = 134/275 (48%), Gaps = 41/275 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLD-QIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PHI+F+LADDLG+NDVG+ LD TPN+DA+ G+ L + Y CTPSR+A++T +
Sbjct: 103 PHIVFVLADDLGYNDVGYGSLDLHACTPNLDAMWRDGVELTSLYAAPTCTPSRAALLTAR 162
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+P GMQH L GL S L + + GY T VGKWHLG + + PT RGF
Sbjct: 163 YPTTLGMQHWQLEAAAPYGLDSSTATLGEVFQRQGYATVFVGKWHLGHFSEAVLPTRRGF 222
Query: 179 ESHLGYWTGHQDYFDHSAEEM---------------------KMWGL---DMRRDLEPAW 214
E G++TG ++YF +E W + DL
Sbjct: 223 EKFYGFYTGGENYFTRVSEGACAPVVNTSVPQVSEVGDIAKECFWDAIDNEAHEDLPAPE 282
Query: 215 DLHGK-YSTDVFTAEAVDIIHNHST------DEPLFLYLAHAATHSANPYEPLQAPDHYL 267
+LHGK +ST F A D++ H + +P+ L LA NP+ PL AP
Sbjct: 283 ELHGKSHSTYAFAMRASDLVARHGSFSGGGLAQPILLVLA-----LPNPHVPLLAPAAVF 337
Query: 268 NIH----RHIEDFKRSKFAAILHKLDESVGKVVEA 298
H R I + +R FAA+ DE++G V A
Sbjct: 338 RTHDAILRQITNNQRRTFAALTILWDEALGNVSAA 372
>gi|291231158|ref|XP_002735532.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 191
Score = 152 bits (383), Expect = 1e-33, Method: Composition-based stats.
Identities = 74/175 (42%), Positives = 107/175 (61%), Gaps = 9/175 (5%)
Query: 35 MAFAVLPLAFTLSMVFVDLVAS-SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
M+ +VL T+ ++F+ A P+I+FI ADD GWND+G+H TPN+++LA
Sbjct: 1 MSISVL---LTVPVLFLSAFARCDSQPNIVFIFADDYGWNDIGYHN-PIFQTPNLNSLAA 56
Query: 94 SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
GI L+NYY +CTPSR ++TG++ + G+ H + +R LPL E LP+ +K+ G
Sbjct: 57 DGIKLENYYVAPVCTPSRGQLLTGRYAMRYGLVHRNIRPAQRMCLPLDEVTLPEKMKQAG 116
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
Y T +VGKWH GFY PT RGF+S G++ +DYF HSA G D++R
Sbjct: 117 YATHMVGKWHQGFYTPACIPTQRGFDSFFGFYICTEDYFTHSASG----GFDLKR 167
>gi|149196404|ref|ZP_01873459.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
araneosa HTCC2155]
gi|149140665|gb|EDM29063.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
araneosa HTCC2155]
Length = 456
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 110/350 (31%), Positives = 171/350 (48%), Gaps = 34/350 (9%)
Query: 42 LAFTLSMVFVDLVASSG-PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
+ + VF+ A+S P+IIFI+ DD+G+ +G +G I TP +D +A G+ L +
Sbjct: 1 MKYISVFVFLMFAANSADKPNIIFIMCDDMGYGQLGSYGQKMIKTPRLDQMAKEGLRLTD 60
Query: 101 YYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
YY +C PSR ++MTG+H HT ++ N Y + +P + + +KE GY T ++
Sbjct: 61 YYAGTAVCAPSRCSLMTGQHVGHTYIRGNKEYPTGQEPIPAETITVAEKMKEAGYATALI 120
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGY-----WTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
GKW LG+ E P +GF+ GY H F EE + +++E
Sbjct: 121 GKWGLGYPGSEGEPNKQGFDYFFGYNDQKHAHNHFPKFLLRNEETLTLKNNSGKEIE--- 177
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP---DHYLNIHR 271
YS + T EA I + D P FLYLA+ HS LQ P + YL
Sbjct: 178 -----YSQYMLTDEAKGFIKKNK-DNPFFLYLAYVIPHSR-----LQIPGDDECYLQYKD 226
Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG------FNL 325
K+ K A ++ +LD+ VG +++ L++ + N+++VF SDNG G FN
Sbjct: 227 ESWPEKQKKHAGMISRLDKDVGSILDLLKEMNLAENTLVVFTSDNGAHREGGARPEFFN- 285
Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
+ PL G+K +++EGGVR + P + G V+ D +PT
Sbjct: 286 ---DSGPLSGIKRSMYEGGVRVPFIAHWPGVIKPGQVSNHIGAHWDLMPT 332
>gi|333912861|ref|YP_004486593.1| arylsulfatase [Delftia sp. Cs1-4]
gi|333743061|gb|AEF88238.1| Arylsulfatase [Delftia sp. Cs1-4]
Length = 441
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 111/346 (32%), Positives = 170/346 (49%), Gaps = 45/346 (13%)
Query: 60 PHIIFILADDLGWNDVGFHG---LDQIP-TPNIDALAYSGIIL-KNYYTVQLCTPSRSAI 114
P+I+FI+ADDLG+ D+G +G D P +P +D LA G+ L + Y +C+P+R A+
Sbjct: 4 PNILFIVADDLGYADLGCYGGRAADFGPVSPMLDRLAAGGLRLTQGYANSPVCSPTRFAL 63
Query: 115 MTGKH----------PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
T ++ PI++ + L E+ GLP + L++ GYRT ++GKWHL
Sbjct: 64 ATARYQYRLRGAAEEPINSKTRGTPL--GEKLGLPTDMPTVASMLRDAGYRTALIGKWHL 121
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDH--SAEEMKMWGLDMRRDLEPAWDLHGKYST 222
G Y + P G+E + G +G DYF H S+ + +W + E Y T
Sbjct: 122 G-YPPHFGPLRSGYEEYFGPMSGGVDYFTHLSSSGQHDLWVGEEEHHDE-------GYLT 173
Query: 223 DVFTAEAVDIIHNHST-DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR------HIED 275
D+ + +VD +H + D P FL L H P+ P + D H++
Sbjct: 174 DLLSQRSVDFVHRMAQGDAPFFLSL-----HYTAPHWPWETRDDRSTAEALGAGIAHLDG 228
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
++ ++H +DE +G +VEAL L N++IVF SDNGG + NWPL G
Sbjct: 229 GNIHQYRRMIHHMDEGIGWIVEALRANGQLDNTLIVFTSDNGGE------RFSDNWPLVG 282
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K L EGG+R + P + + G + Q+ DW T+L AA
Sbjct: 283 GKMDLTEGGIRVPWIAHWPAVIAPGRSSPQHCMSMDWSATVLDAAG 328
>gi|332663784|ref|YP_004446572.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
gi|332332598|gb|AEE49699.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
Length = 580
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 137/473 (28%), Positives = 202/473 (42%), Gaps = 103/473 (21%)
Query: 31 RTRIMAFAVLPLAFTLSMVF---VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPN 87
R + FA+ LAF+ S + P+II I+ DDLG++D G +G +I TPN
Sbjct: 2 RPILFVFALF-LAFSASAQKKGKTTPAPTPSKPNIILIMVDDLGYSDFGAYG-SEIQTPN 59
Query: 88 IDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQH---NVLYGCERGGLPLSEKI 144
ID LAY G+ LK +Y +C P+R++++TG++P G+ + N+ +G L
Sbjct: 60 IDKLAYGGLRLKEFYNNSICAPTRASLITGQYPHKAGLGYFNTNLGLPAYQGWLNQESLT 119
Query: 145 LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS-----AEEM 199
+ L++ GY T + GKWH+G Y P RGF G+ G +Y+D S A +
Sbjct: 120 FGEVLQQGGYNTYLTGKWHVG-NDSLYWPNQRGFNKFYGFIGGASNYYDISPYPEKAPPV 178
Query: 200 KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
++ + R +L P GKY TD T A+ I N S D+P FLYLA A P+ P
Sbjct: 179 ELVENNQRINLAP-----GKYLTDEITNHALSYI-NESKDKPFFLYLAFNA-----PHWP 227
Query: 260 LQAPDHYLNIHRHIEDFK----------------------RSKFA--------------- 282
LQAP +I ++ +K R K A
Sbjct: 228 LQAPAE--DIAKYKGKYKIGWDSLRAQRYRRQLALGIADPRQKVAERDPEVVAWESLTFD 285
Query: 283 -------------AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG------------ 317
A++ +D+SVG+VVE L++ + N++IV +SDNG
Sbjct: 286 EKELWQRKMEVYAAMVDHVDQSVGRVVEELKKLKKDDNTLIVLISDNGAQGGLNQLGQRR 345
Query: 318 --------GAAAGF-----NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAE 364
G+A + N SN P R K T EGG+ + W P G + +
Sbjct: 346 RQRNSGPIGSAGSYDYQEQNWAYVSNTPFRNYKATSHEGGISSPFIAWFPKKIKAGTITK 405
Query: 365 QYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSP 417
H+ D PT A + NY N +P L +E N P
Sbjct: 406 GTGHLIDLAPTFYDVAGVNYPTNY-RGVQTNTLPGVSLKNLLFEGNDLVRNEP 457
>gi|260824685|ref|XP_002607298.1| hypothetical protein BRAFLDRAFT_88247 [Branchiostoma floridae]
gi|229292644|gb|EEN63308.1| hypothetical protein BRAFLDRAFT_88247 [Branchiostoma floridae]
Length = 178
Score = 152 bits (383), Expect = 1e-33, Method: Composition-based stats.
Identities = 72/142 (50%), Positives = 93/142 (65%), Gaps = 8/142 (5%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTP-------SRS 112
P+I+FI+ADDLGWNDVG+H D + TP +D LA G+IL Y +CTP SR+
Sbjct: 28 PNILFIVADDLGWNDVGWHNPD-VKTPVLDQLANEGVILNQSYVNYVCTPFPVVKSRSRT 86
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
A MTG P H G QH V + + G+P + LP+ LK+LGY T +VGKWHLGF YT
Sbjct: 87 AFMTGYFPYHVGTQHQVFFPFQAQGIPSNFSFLPEKLKDLGYATHMVGKWHLGFCNWNYT 146
Query: 173 PTFRGFESHLGYWTGHQDYFDH 194
PT+RGF+S GY+ G +DY+ H
Sbjct: 147 PTYRGFDSFFGYYNGAEDYYTH 168
>gi|325108958|ref|YP_004270026.1| sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324969226|gb|ADY60004.1| sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 499
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 115/385 (29%), Positives = 191/385 (49%), Gaps = 50/385 (12%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
++A + + F D +S PP+++ I+ADDLG+ ++G +G I TP +D LA
Sbjct: 19 VVAMLIAIVMFAAESTAADKGEASPPPNVVLIMADDLGYQELGCYGQKWIRTPFVDQLAA 78
Query: 94 SGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHN----------VLYGCERGG---LP 139
GI L +Y+ +C PSR +MTGKHP H +++N Y E G LP
Sbjct: 79 EGIKLTQFYSGNAVCAPSRCCLMTGKHPGHAWVRNNGDPKLPEELREKYQWEFAGQFPLP 138
Query: 140 LSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM 199
SE + + LK+ GY T +GKW LG + P +G + G+ + H+
Sbjct: 139 ESEVTVAELLKQKGYATAAIGKWGLGHFGTTGAPDRQGIDLFYGF---NSQRHAHNHYPR 195
Query: 200 KMWGLDMRRDLEPAWD--LHGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANP 256
+W D +++++P D L+G+ YS D FT A++ I + + D+P FLYL A H +
Sbjct: 196 FLWRND-QKEMQPGNDRTLNGETYSQDRFTEVALEFI-DQNKDQPFFLYLPFAIPHLS-- 251
Query: 257 YEPLQAPDHYLNIHRH---IEDFK----------RSKFAAILHKLDESVGKVVEALEQRR 303
+Q P+ L + E++K R+ +AA++ +D +GK++ L +
Sbjct: 252 ---IQVPESSLAEYTDEIPEEEYKHRGYLKHPKPRAGYAAMITHMDRDIGKIMARLNEYG 308
Query: 304 MLSNSIIVFVSDN-------GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
+ N++I+F SDN GG+ + F +A +RG+K +L+EGG+R + P
Sbjct: 309 LTDNTLIIFTSDNGPTYDRLGGSDSDFFESAGK---MRGLKGSLYEGGIRVPCVARWPGQ 365
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAAN 381
G ++ + D PT+ AA
Sbjct: 366 IPAGNTSDLISALWDLFPTIAEAAG 390
>gi|262384881|ref|ZP_06078013.1| N-acetylgalactosamine 6-sulfatase [Bacteroides sp. 2_1_33B]
gi|262293597|gb|EEY81533.1| N-acetylgalactosamine 6-sulfatase [Bacteroides sp. 2_1_33B]
Length = 589
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 105/344 (30%), Positives = 166/344 (48%), Gaps = 49/344 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II +L+DD GW D+GF G + TPNID +A+ G IL+N+Y + +P+R+ +TG++
Sbjct: 27 PNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYVCPVSSPTRAEFLTGRY 86
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
+ +G+ ++ G ER EK + +Y +E GY T + GKWH G + Y P RGFE
Sbjct: 87 HVRSGV-NSTTGGGER--FNQGEKTIAEYFREAGYATSLFGKWHSG-TQYPYHPNARGFE 142
Query: 180 SHLGYWTGH-----QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
G+ +GH +H+ E + G + D T +A+D I
Sbjct: 143 EFYGFCSGHWGNYWNPVLEHNGEIISGEG----------------FIIDDLTDKALDYIR 186
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF------------A 282
+H + P F++L++ HS P+Q PD + N + +R+ F
Sbjct: 187 DHK-EHPFFMFLSYNTPHS-----PMQVPDSWWNRVKDRTLSQRATFPEQEDTTFTKAAL 240
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
A+ LD ++G+V+ L + +I+++ SDNG + F N ++G K + E
Sbjct: 241 ALAENLDWNIGRVLSLLHSLDLEQETIVIYFSDNGPNS--FRWNGG----MKGRKGSTDE 294
Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
GGVR I P +G V Q D +PTLL A P
Sbjct: 295 GGVRSPFCIRWPGHIRKGAVETQLSGAIDLIPTLLGLAGIEYTP 338
>gi|326927530|ref|XP_003209945.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Meleagris
gallopavo]
Length = 485
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 101/332 (30%), Positives = 163/332 (49%), Gaps = 43/332 (12%)
Query: 70 LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
+GW D+G G TPN+D +A G++ ++Y LC+PSR+A++TG+ P+ G
Sbjct: 3 MGWGDLGAFGEPSKETPNLDQMASEGMLFLDFYAASPLCSPSRAALLTGRLPVRNGFYTT 62
Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
+ GG+ SE +LP+ LK+ GY +I+GKWHLG ++ ++ P GF+
Sbjct: 63 NAHARNAYTPQDIVGGIQDSEILLPELLKKAGYTNKIIGKWHLG-HRPQFHPLKHGFDEW 121
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
G H +D+ A + + + RD W++ G+Y T ++
Sbjct: 122 FGSPNCHFGPYDNRA----LPNIPVYRD----WEMIGRYYEDFKIDLKTGEANLTQIYLQ 173
Query: 228 EAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
EA+D I ++ +P FLY A ATH+ P+ A H+L +R ++ +
Sbjct: 174 EALDFISKQQASQQPFFLYWAIDATHA-----PVYASKHFLGTS------QRGRYGDAVR 222
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
++D+SVGK+++ L++ + N+ + F SDNG A SN P K T +EGG+R
Sbjct: 223 EIDDSVGKILKHLQKLGISENTFVFFTSDNGAALISAPKQGGSNGPFLCGKQTTFEGGMR 282
Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
+ W P G V+ Q V D T LS
Sbjct: 283 EPAIAWWPGHIPAGGVSRQLGSVMDLFTTSLS 314
>gi|294053911|ref|YP_003547569.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613244|gb|ADE53399.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 469
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 106/327 (32%), Positives = 163/327 (49%), Gaps = 19/327 (5%)
Query: 40 LPLAFTLSMVFVDLVASSGP-------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALA 92
L TLS+ ++ P P++I I+ADDLG+ D+G+ G IPTPNID LA
Sbjct: 5 LSTVLTLSIAACASTFAAHPTENRQDQPNVIVIMADDLGYGDLGYTGSKHIPTPNIDRLA 64
Query: 93 YSGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER--GGLPLSEKILPQYL 149
G+ Y T Q C PSR+ +TG++ G + N Y G+P SE++ + L
Sbjct: 65 NEGVECTYGYVTHQYCGPSRAGFLTGRYQQRFGFETNPPYDRHNTIAGVPASERLFAERL 124
Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD 209
+ +GY+T IVGKWH+G + + GH + + E M LD
Sbjct: 125 QAVGYKTGIVGKWHIGSHSIHHPNNRGFDFFFGFLGGGHDFFRVDTREPMDEGYLDPMMR 184
Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
+ D+ G Y T T EA+ I + D P FL+L++ A P+ PLQAP+ +
Sbjct: 185 NGSSVDVEG-YLTTQLTDEAIGFIERNEKD-PFFLFLSYNA-----PHAPLQAPEESIAK 237
Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA--GFNLNA 327
H+E +R ++A+++++D +G+++ LE R + +II F+SDNGG +
Sbjct: 238 FSHVEGKERRVYSAMVYEMDLEIGRILSMLEDRDLAEETIIFFLSDNGGPPHWDKSQDSY 297
Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSP 354
SN RG K ++GGV L + P
Sbjct: 298 TSNGAFRGYKGDTYDGGVHVPFLAYWP 324
>gi|333911967|ref|YP_004485699.1| N-acetylgalactosamine-6-sulfatase [Delftia sp. Cs1-4]
gi|333742167|gb|AEF87344.1| N-acetylgalactosamine-6-sulfatase [Delftia sp. Cs1-4]
Length = 442
Score = 151 bits (382), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 105/336 (31%), Positives = 166/336 (49%), Gaps = 28/336 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQI------PTPNIDALAYSGI-ILKNYYTVQLCTPSRS 112
P++IFILADDLG+ D+G G +P +DA+A G+ + Y +C+P+R
Sbjct: 6 PNLIFILADDLGYADLGCTGARDAHSRATDVSPRLDAMAAQGMRFTRGYSNSAVCSPTRF 65
Query: 113 AIMTGK--HPIHTGMQHNV--LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
A+ TG+ + + + + ++G + GLP + L++ GY T +VGKWHLG Y
Sbjct: 66 ALATGRWQYRLRGAAEEPIASVHGDKVLGLPPDHPTVASLLRDAGYATALVGKWHLG-YP 124
Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
+ P G+E G+ G DYF H + D + EP Y TD+ +
Sbjct: 125 PHFGPRMSGYEEFYGFHAGGADYFAHCDPRGRP---DFWINEEP--HEEEGYLTDLLSRR 179
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPY---EPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
AVD + + D+P +L L ++A H P+ + L+ +H + + ++
Sbjct: 180 AVDFVERQTADKPFYLSLHYSAPHW--PWLTRDDLEESRRIGGFGKHTDGGTIETYQRMI 237
Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
H +DE +G V++ALE++ M N++IVF SDNGG ++NWP G K L EGG+
Sbjct: 238 HHMDEGIGWVLDALEKKGMAENTLIVFTSDNGGE------RFSNNWPFIGQKMDLLEGGI 291
Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
R + P G V++ DW T L+AA
Sbjct: 292 RVPLVARWPDRMPAGAVSDTPNLTMDWSATFLAAAG 327
>gi|421613814|ref|ZP_16054885.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408495400|gb|EKJ99987.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 468
Score = 151 bits (382), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 178/345 (51%), Gaps = 40/345 (11%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
P+I+ ILADDLG+ D+G +G D+ T +D LA G+ + Y C+P+R+A++TG+
Sbjct: 23 PNIVLILADDLGYGDLGCYGNDEQATRVLDRLATQGVRWTQAYANGPECSPTRAALLTGR 82
Query: 119 HPIHTGMQH------NV--------LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
+ H G NV L+ GLP L + L +GY T + GKWHL
Sbjct: 83 YQQHVGGLECAIGVGNVGRYDDAIRLHLVNELGLPADRPTLAKRLSSVGYETALFGKWHL 142
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTD 223
G Y+ +++P GF+ L G DY+ H + + + L + G+ Y TD
Sbjct: 143 G-YEAKFSPMMHGFDEALYCIGGAMDYY-HYLDSVATYNL-----FHNGRPISGEGYFTD 195
Query: 224 VFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYE-PLQAPDHYLNIHRHIEDFKRSK- 280
T +AV I + ++ D+P FLYL + A H+ PY+ P ++P L + + +K++
Sbjct: 196 TITDQAVRYIGDRNANDKPFFLYLPYTAPHT--PYQAPGESPVDPLPLDSPL--WKQNAD 251
Query: 281 ----FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
+ A++ +DE +GKV+ A+E+ M ++++F SDNGG + A+ N PLRG
Sbjct: 252 PPGVYRAMVRHMDEGIGKVLHAIEESEMTDRTLVIFASDNGGTS------ASRNEPLRGF 305
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K +EGG+R + P G+V++Q D +L+AA
Sbjct: 306 KGQAFEGGIRVPLIARWPGHLPEGVVSDQVTITFDLTALMLAAAG 350
>gi|334344099|ref|YP_004552651.1| N-acetylgalactosamine-6-sulfatase [Sphingobium chlorophenolicum
L-1]
gi|334100721|gb|AEG48145.1| N-acetylgalactosamine-6-sulfatase [Sphingobium chlorophenolicum
L-1]
Length = 487
Score = 151 bits (382), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 118/392 (30%), Positives = 187/392 (47%), Gaps = 61/392 (15%)
Query: 55 ASSGP---PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPS 110
A GP P+I+FI+ADDLG+ DV +G TP ID LA G+ L+ Y +C+ +
Sbjct: 40 APVGPARKPNILFIVADDLGYADVSCYGRRDYRTPAIDRLAAEGLRFLQGYANSSVCSAT 99
Query: 111 RSAIMTGKHPIHTGMQHNVLYGCERG------GLPLSEKILPQYLKELGYRTRIVGKWHL 164
R+A++TG++ Q+ + G E GLP S LP L+ GY T ++GKWHL
Sbjct: 100 RTALITGRY------QYRLPLGLEEPLDERDVGLPPSHPTLPSILRRQGYGTTLIGKWHL 153
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
G Y+P G++ G+ G DYF H K +G D + + +G Y TD+
Sbjct: 154 GALPN-YSPLKSGYDHFWGFRAGGVDYFTH-----KYFGKPDLWDGDVPIERNG-YLTDL 206
Query: 225 FTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR-----------H 272
A++ I ++ D P + L H P+ P + P R H
Sbjct: 207 LAERAIESIDAYAKADRPFLISL-----HFNAPHWPWEGPADQAESVRLDESGKPEAMLH 261
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
+ + +A ++ ++D +G++++ L+ ++ ++I++F SDNGG A WP
Sbjct: 262 FDGGSQRIYAEMVTRMDMQIGRILDRLDALKIADDTIVIFTSDNGGE------RFADTWP 315
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK---------- 382
G K L EGG+R +I P + G ++Q + DW+PTLL+AA
Sbjct: 316 FTGRKTELLEGGLRVPAIIRWPGRVAAGEASQQVMMSMDWMPTLLAAAGASADPAYPLDG 375
Query: 383 -SDIPNYVNSTVENIIPRYENSILRYENGTHE 413
S +P N++ +I+PR RY+N E
Sbjct: 376 VSLLPALANTS--HIVPR--TLCWRYKNLDQE 403
>gi|441597518|ref|XP_003266414.2| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Nomascus
leucogenys]
Length = 431
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 72/134 (53%), Positives = 97/134 (72%), Gaps = 1/134 (0%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 48 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 181 HLGYWTGHQDYFDH 194
G TG+ DY+ +
Sbjct: 167 FXGSLTGNVDYYTY 180
>gi|337280754|ref|YP_004620226.1| arylsulfatase [Ramlibacter tataouinensis TTB310]
gi|334731831|gb|AEG94207.1| arylsulfatase precursor-like protein [Ramlibacter tataouinensis
TTB310]
Length = 439
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 115/376 (30%), Positives = 184/376 (48%), Gaps = 43/376 (11%)
Query: 58 GPPHIIFILADDLGWNDVGFHGLDQIP----TPNIDALAYSGI-ILKNYYTVQLCTPSRS 112
G P+++FI+ADDLG+ D+G +G + +P +D LA G+ + Y +C+P+R
Sbjct: 3 GRPNVVFIVADDLGFADLGCYGGREAGFSRVSPVLDGLAAKGLRFTQGYANSPVCSPTRF 62
Query: 113 AIMTGKHPIHT-GMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
A+MT ++ G + RG GLP +P LK GYRT ++GKWHLG Y
Sbjct: 63 ALMTARYQYRLRGAAEEPINSRSRGSTTLGLPPEHPTVPSLLKAAGYRTALIGKWHLG-Y 121
Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
+ P G+E G +G DYF H + + +W + R E Y TD+
Sbjct: 122 PPAFGPLRSGYEEFFGPMSGGVDYFSHCSSNGQHDLWLGEQERAEE-------GYLTDLL 174
Query: 226 TAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEP-------LQAPDHYLNIH-RHIEDF 276
+ AVD + + D P FL L + A H P+E + D+ ++H ++E +
Sbjct: 175 SRRAVDYVARMARQDAPFFLSLHYTAPHW--PWETREDAGKVAEVKDNLFHLHGGNVETY 232
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
+R ++H +DE +G V++AL + + ++++VF SDNGG + NWPL G
Sbjct: 233 RR-----MIHHMDEGIGWVMDALRRHGLERDTLVVFTSDNGGE------RFSDNWPLVGG 281
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENI 396
K L EGG+R + P + + G + Q DW T+L AA + P Y V ++
Sbjct: 282 KMDLTEGGIRVPWIAHWPAVIAPGGASAQPCMTMDWSATVLDAAGAAADPAYPLDGV-SL 340
Query: 397 IPRYENSILRYENGTH 412
+P ++ R+ H
Sbjct: 341 LPVLRDASRRFARPMH 356
>gi|160895745|ref|YP_001561327.1| sulfatase [Delftia acidovorans SPH-1]
gi|160361329|gb|ABX32942.1| sulfatase [Delftia acidovorans SPH-1]
Length = 442
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 105/336 (31%), Positives = 166/336 (49%), Gaps = 28/336 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQI------PTPNIDALAYSGI-ILKNYYTVQLCTPSRS 112
P++IFILADDLG+ D+G G +P +DA+A G+ + Y +C+P+R
Sbjct: 6 PNLIFILADDLGYADLGCTGARDAHSRATDVSPRLDAMAAQGMRFTRGYSNSAVCSPTRF 65
Query: 113 AIMTGK--HPIHTGMQHNV--LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
A+ TG+ + + + + ++G + GLP + L++ GY T +VGKWHLG Y
Sbjct: 66 ALATGRWQYRLRGAAEEPIASVHGDKVLGLPPDHPTVASLLRDAGYATALVGKWHLG-YP 124
Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
+ P G+E G+ G DYF H + D + EP Y TD+ +
Sbjct: 125 PHFGPRMSGYEEFYGFHAGGADYFAHCDPRGRP---DFWINEEP--HEEEGYLTDLLSRR 179
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPY---EPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
AVD + + D+P +L L ++A H P+ + L+ +H + + ++
Sbjct: 180 AVDFVERQTADKPFYLSLHYSAPHW--PWLTRDDLEESRRIGGFGKHTDGGTIETYQRMI 237
Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
H +DE +G V++ALE++ M N++IVF SDNGG ++NWP G K L EGG+
Sbjct: 238 HHMDEGIGWVLDALEKKGMAENTLIVFTSDNGGE------RFSNNWPFIGQKMDLLEGGI 291
Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
R + P G V++ DW T L+AA
Sbjct: 292 RVPLVARWPDRMPAGAVSDTPNLTMDWSATFLAAAG 327
>gi|325298612|ref|YP_004258529.1| arylsulfatase [Bacteroides salanitronis DSM 18170]
gi|324318165|gb|ADY36056.1| Arylsulfatase [Bacteroides salanitronis DSM 18170]
Length = 465
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 125/404 (30%), Positives = 201/404 (49%), Gaps = 59/404 (14%)
Query: 28 LGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPN 87
LG+ T + +F LP A + V+ P+IIFILADD+G+ D+ +G I TPN
Sbjct: 7 LGWGT-LASFFFLPPADASADAGQAGVSHPEKPNIIFILADDMGYADLSCYGSRYIETPN 65
Query: 88 IDALAYSGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG--------- 137
ID LA SG + Y + +PSR A+MTG++ +T ++ N C+ GG
Sbjct: 66 IDKLATSGTRFTQCYAGSGISSPSRCALMTGRNTGNTTIRDNF---CKAGGIEGKKGKAT 122
Query: 138 ------LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTG---- 187
LP ++ + L GYRT +V KWHL + E TP RGF+ G+
Sbjct: 123 IRRMHLLP-NDTTIATVLSAAGYRTCLVNKWHLDGFNPEATPLNRGFDEFYGWLISTAYS 181
Query: 188 ----HQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLF 243
+ Y+ EE+ ++++ + + HG ++TD+ T A+D I + + P F
Sbjct: 182 NDPYYYPYWRFENEEL----VNIQANADGK---HGIHNTDLSTRNAIDFIRRNR-ENPFF 233
Query: 244 LYLAHAATHSANPYEPLQAPD-HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQR 302
LYLA+ A P+EP D H+ + + KR +A+++ +D+++G++++ LE+
Sbjct: 234 LYLAYDA-----PHEPYHIDDTHWYDAREWSMNTKR--YASLVTHMDQAIGRLLDELERL 286
Query: 303 RMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGI 361
+ N++I+F SDNGGA N RG K L+EGG+R ++ W RG
Sbjct: 287 HLRENTLIIFASDNGGAVQAPLQQLGCNGVFRGRKGQLYEGGIRVPFIVNW------RGK 340
Query: 362 VAEQ----YVHVSDWLPTLLSAANKSD-IPNYVNSTVENIIPRY 400
V Q ++ D +PTL A S +P+ +N +I P +
Sbjct: 341 VPVQTLDNLIYFPDVMPTLARLAGASGKLPSRLNGI--DITPLF 382
>gi|323450061|gb|EGB05945.1| hypothetical protein AURANDRAFT_66030 [Aureococcus anophagefferens]
Length = 611
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 114/343 (33%), Positives = 166/343 (48%), Gaps = 32/343 (9%)
Query: 60 PHIIFILADDLGWNDVGFHGLD-QIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PH++ L DD+G+ND + D P LA G++L NYY Q CTPSR+A++TG+
Sbjct: 73 PHVVIALLDDMGYNDAPWTSTDLGAAMPFARRLAADGVVLSNYYADQSCTPSRAALLTGR 132
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKEL---GYRTRIVGKWHLGFYKKEYTPTF 175
H + G+ V++ GL + K LP +++ Y VGKW +G Y P
Sbjct: 133 HAVELGVYSTVVWDSAW-GLDENAKTLPALIEDARPGAYHKACVGKWDVGHSTAAYLPLA 191
Query: 176 RGFESHLGYWTGHQ-DYFDHSAEEMKMWGL-DMRRDLEPAWDLHGKYSTDVF-TAEAVDI 232
RGF++ G DY H AE + D+ P++D Y+TD++ +A +
Sbjct: 192 RGFDTFTGMLGDMMTDYSTHMAEFTSSGPVYDLFEGDAPSYDYQNTYATDIWTSAATSAV 251
Query: 233 IHNHSTDEPLFLYLAHAATHSA----------NPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
H + L LYLA+ A H+A + Y+ L AP L+ + E +R A
Sbjct: 252 AAAHEAAKGLLLYLAYNAIHTAITLPKNFEEDDEYKALVAP---LDAGDYTE--QRQLAA 306
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
L +D S+ + + L+ M ++++ F SDNGG GF SN+PLRG K TL+E
Sbjct: 307 GALLVVDRSLETLFQTLDAAGMTDDTLLFFGSDNGG-LTGFG---GSNYPLRGEKLTLFE 362
Query: 343 GGVRGAGLIWSP-----LLESRGIVAEQYVHVSDWLPTLLSAA 380
GGVR W+ + G VHVSD PTL AA
Sbjct: 363 GGVRVPAFFWAGASFPRFRATAGAAYGGIVHVSDVAPTLFGAA 405
>gi|224536296|ref|ZP_03676835.1| hypothetical protein BACCELL_01168 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522090|gb|EEF91195.1| hypothetical protein BACCELL_01168 [Bacteroides cellulosilyticus
DSM 14838]
Length = 465
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 110/363 (30%), Positives = 179/363 (49%), Gaps = 42/363 (11%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKH 119
+IIFILADD+G+ D+ +G I TPNID LA +G Y + +PSR A+MTG++
Sbjct: 37 NIIFILADDMGYCDLACYGNKYIETPNIDKLAATGTSFTQCYAGSGISSPSRCALMTGRN 96
Query: 120 PIHTGMQHNVLYGCERGGL-------PL-------SEKILPQYLKELGYRTRIVGKWHLG 165
+T ++ N C GG+ P+ ++ + L GYRT +V KWHL
Sbjct: 97 TGNTTIRDNF---CNAGGIEGMKGTKPIRRMHLLPNDTTIATVLSSAGYRTCLVNKWHLD 153
Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD---MRRDLEPAWDLHGKYST 222
+ E TP RGF+ G+ + + W + + E D H K++T
Sbjct: 154 GFNPEATPLNRGFDEFYGWLI--STVYSNDPYYYPYWRFNNEKLENIKENEGDKHIKHNT 211
Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
D+ T +A+ I+ + T+ P FLYLA+ A H PY D + D ++A
Sbjct: 212 DLSTEDAIKFINRNKTN-PFFLYLAYDAPHE--PY----IIDETVWYDDETWDMNTKRYA 264
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
A++ +D ++G+++ L++ + N++I+F SDNG A L+G+K L+E
Sbjct: 265 ALITHMDRAIGRLLAELDRLGLRENTMIIFASDNGAAKQAPLAELGCKGSLKGMKGQLYE 324
Query: 343 GGVRGAGLIWSPLLESRGIVAEQ----YVHVSDWLPTLLSAANKSD-IPNYVNSTVENII 397
GG+R ++ P G V Q ++ D +PTL + AN +D +P +N N++
Sbjct: 325 GGIRVPFIVNQP-----GKVPVQKLNNIIYFPDVMPTLAALANATDKLPQNLNGI--NVL 377
Query: 398 PRY 400
P +
Sbjct: 378 PLF 380
>gi|240977207|ref|XP_002402636.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215491217|gb|EEC00858.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 159
Score = 151 bits (381), Expect = 2e-33, Method: Composition-based stats.
Identities = 70/159 (44%), Positives = 102/159 (64%)
Query: 225 FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAI 284
+T AV +I +H+ +PLFLYL++ A H L+AP+ + HI + R+ +A +
Sbjct: 1 YTERAVSLIRSHNPSQPLFLYLSYQAPHVGTGPSLLEAPEENMKKFLHIPEKNRTTYAGM 60
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
+ LD+S+G ++EAL++ ++ N+I+VF SDNG G N NWPLRG K TLWEGG
Sbjct: 61 VDALDQSIGTIMEALQEADLIGNTIVVFTSDNGALPWGVKSNRGYNWPLRGGKFTLWEGG 120
Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+R IWSPLLE V+ Q +H++DWLPTL SAA ++
Sbjct: 121 LRTTTFIWSPLLEMSRRVSNQMMHITDWLPTLYSAAGET 159
>gi|417301182|ref|ZP_12088351.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
gi|327542503|gb|EGF28978.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
Length = 443
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 110/342 (32%), Positives = 177/342 (51%), Gaps = 40/342 (11%)
Query: 63 IFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGKHPI 121
+ ILADDLG+ D+G +G D+ TP +D LA G+ + Y C+P+R+A++TG++
Sbjct: 1 MLILADDLGYGDLGCYGNDEQATPVLDRLATQGVRWTQAYANGPECSPTRAALLTGRYQQ 60
Query: 122 HTGMQH------NV--------LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
H G NV L+ GLP L + L +GY T + GKWHLG Y
Sbjct: 61 HVGGLECAIGVGNVGRYDDAIRLHLVNELGLPADRPTLAKRLSSVGYETALFGKWHLG-Y 119
Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTDVFT 226
+ +++P GF+ L G DY+ H + + + L + G+ Y TD T
Sbjct: 120 EAKFSPMMHGFDEALYCIGGAMDYY-HYLDSVATYNL-----FHNGRPISGEGYFTDTIT 173
Query: 227 AEAVDIIHN-HSTDEPLFLYLAHAATHSANPYE-PLQAPDHYLNIHRHIEDFKRSK---- 280
AV I + ++ D+PLFLYL + A H+ PY+ P ++P L + + +K++
Sbjct: 174 DHAVRYIGDRNANDKPLFLYLPYTAPHT--PYQAPGESPVDPLPLDSPL--WKQNADPPG 229
Query: 281 -FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
+ A++ +DE +GKV+ A+E+ M ++++F SDNGG +A+ N PLRG K
Sbjct: 230 VYRAMVRHMDEGIGKVLHAIEESEMTDRTLVIFASDNGGT------SASRNEPLRGFKGQ 283
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+EGG+R + P G+V++Q D ++L+AA
Sbjct: 284 AFEGGIRVPLIARWPGHLPEGVVSDQVTITFDLTASMLAAAG 325
>gi|340619489|ref|YP_004737942.1| sulfatase [Zobellia galactanivorans]
gi|339734286|emb|CAZ97663.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
Length = 483
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 118/411 (28%), Positives = 189/411 (45%), Gaps = 83/411 (20%)
Query: 23 QYLKELGYRTRIMAFAVL-PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLD 81
+L +L YR + VL P F + P+I+FIL DDLGW D+ +G
Sbjct: 1 MHLHKLFYRNLWVLLTVLGPTVFA---------QTDKQPNIVFILVDDLGWMDIAANGST 51
Query: 82 QIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG--- 137
TPNID LA GI K Y +C+P+R++I+TGK+P + + GG
Sbjct: 52 FYETPNIDQLAKEGIRFTKAYAASPICSPTRASILTGKNPARIDLTQWI------GGPGN 105
Query: 138 ------LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE--YTPTFRGFESHL-GYWTGH 188
LPL E + P+ L++ GY+T +GKWHL E + P +GF+ ++ G++ G
Sbjct: 106 PDYLRNLPLEEVLFPELLQDAGYKTAFMGKWHLNNVAGEEKFWPDKQGFDVNVAGHFRG- 164
Query: 189 QDYFDHSAEEMKMWGLDMRRDLEPAWDL-------HGKYSTDVFTAEAVDIIHNHSTDEP 241
GL ++ W++ G+Y TD EA+D I +S P
Sbjct: 165 --------------GLYIKNKYFSPWNIPNLENGPEGEYMTDRLAKEAIDFIDENSK-AP 209
Query: 242 LFLYLAHAATHSANPYEPLQAPDHYLNIH-------------RHIED------FK----- 277
LY + + H+ P AP + + R E+ FK
Sbjct: 210 FLLYFSLYSVHA-----PFDAPADRVEKYEKKKKALALTDKERFAEEKNAQKPFKYRIKQ 264
Query: 278 -RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
+AA++ +D +VGK++ L+++ + N++++F SDNGG + + A N PLR
Sbjct: 265 DHPTYAAMVESMDMAVGKILGKLQEKGIADNTVVIFFSDNGGLSTSEGIPTA-NTPLRAG 323
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
K L+EGG+R +I P G V++ + D+ PT+L ++ P+
Sbjct: 324 KGWLYEGGIREPAIIRWPGTIKPGTVSDAVITSMDFYPTILEMTGQALRPD 374
>gi|421610710|ref|ZP_16051876.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408498494|gb|EKK02987.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 606
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 105/340 (30%), Positives = 169/340 (49%), Gaps = 34/340 (10%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
AS P++I ++ DD G+ D+ HG + TPN+D LA + L+N++ CTP+R+A
Sbjct: 41 TASESRPNVILVVTDDQGYGDMSCHGNPWLDTPNLDRLATQSVRLENFHVDPFCTPTRAA 100
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+MTG++ G V G R L E + + +E GYRT + GKWHLG + P
Sbjct: 101 LMTGRYCTRVGAW-AVTEG--RQLLDPDETTMAEIFRESGYRTGMFGKWHLG-DPPPFAP 156
Query: 174 TFRGFESHLGYWTGHQ---------DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
RGFE+ + + G DYFD + R+ +P + Y TD+
Sbjct: 157 RERGFETVVRHMAGGADEIGNPTGNDYFDDT----------YYRNGKP--ESFDGYCTDI 204
Query: 225 FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDFKRSKFAA 283
+ EA+D I S ++P F+Y+ A HS P D Y + R + +R+ F
Sbjct: 205 WFDEAIDFIQKES-EQPFFVYIPTNAMHS-----PYLVADRYSDPFKRQGIEPQRAAFYG 258
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA--AAGFNLNAASNWPLRGVKNTLW 341
++ DE++G++++ L+Q + +N++++F+SDNG A A+ N N +RG K +++
Sbjct: 259 MIQNFDENLGRLLKRLDQDNLRNNTLLIFMSDNGTAQGASEQNRKVGFNAGMRGKKGSVY 318
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
EGG R P +Q DWLPTL+ +
Sbjct: 319 EGGHRVPCFASWPAKWDGNRPVDQLTCHRDWLPTLIELCD 358
>gi|386287174|ref|ZP_10064349.1| sulfatase family protein [gamma proteobacterium BDW918]
gi|385279706|gb|EIF43643.1| sulfatase family protein [gamma proteobacterium BDW918]
Length = 555
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 122/383 (31%), Positives = 180/383 (46%), Gaps = 63/383 (16%)
Query: 51 VDLVASSGPPHIIFILADDLGWNDVGFHGL----DQIPTPNIDALAYSGIILKNYYT-VQ 105
D + PP++I ILADDLG ND+ +G Q+ TPNID LA SG Y+
Sbjct: 56 ADAASQDRPPNVIVILADDLGINDISTYGGGLANGQVQTPNIDELAASGANFSQAYSGAG 115
Query: 106 LCTPSRSAIMTGKHPIHTGMQH--------------------------------NVLYGC 133
C PSR+ ++TG++P TG + + +
Sbjct: 116 TCAPSRAMLLTGRYPTRTGFEFTPTPDGMGAMATMIASSHDNGLPPSIYNKAGADKIPSF 175
Query: 134 ERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTG------ 187
+ GLP SE L + L++ GY T +GKWHLG K ++P +GFE L +G
Sbjct: 176 DEQGLPSSEITLAEILRDRGYYTAHIGKWHLG-RGKGFSPEEQGFEQSLLMHSGLYLPVD 234
Query: 188 HQDY----FDHSAEEMKMWGLDMR-----RDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
H D D + +W MR D P ++ Y TD +T E++ II +
Sbjct: 235 HPDVVNAKVDFDPIDQFLWA-KMRYAAAFNDGGPVFEPK-SYLTDYWTDESIKIIKANK- 291
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
+ P FLYLAH H+ PLQA I+ + +AA++ +D SVG+++
Sbjct: 292 NRPFFLYLAHWGAHT-----PLQATREDYEAVGDIKPHRARVYAAMVRAVDRSVGRILAT 346
Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
L++ + N+I+VF SDNGG AG+ N P RG K T++EGG+R + P +
Sbjct: 347 LKEEGLSDNTIVVFSSDNGG--AGYIGIPDINAPYRGWKLTMFEGGIRVPLFMRWPAHIA 404
Query: 359 RGIVAEQYVHVSDWLPTLLSAAN 381
R + D+LPTL++AA
Sbjct: 405 RNTSIDTPAAHIDFLPTLVAAAQ 427
>gi|343085667|ref|YP_004774962.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354201|gb|AEL26731.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 465
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 111/352 (31%), Positives = 174/352 (49%), Gaps = 32/352 (9%)
Query: 43 AFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY 102
+ ++ + + P+I+FI+ADDLG+ D+G +G + I TPNIDALA G++ Y
Sbjct: 16 SMAFQSIYAQQINNGDRPNIVFIMADDLGYGDLGVYGQELIQTPNIDALAKEGMVFSQAY 75
Query: 103 TVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
+CT SR +MTG H HT + NV + L + L LKE GYRT VGK
Sbjct: 76 AGGPVCTSSRYVLMTGLHNGHTVARDNVPHYPTY--LQEEDITLAMVLKEAGYRTGGVGK 133
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGH--QD--------YFDHSAEEMKMWGLDMRRDLE 211
W LG E T +GF++ WTG+ QD Y DH ++++ + R+
Sbjct: 134 WSLGDANTEGRATNKGFDT----WTGYLNQDHAHYYYPSYLDHDDRKIQLTDNPILRN-- 187
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP---LQAPDHYLN 268
YS D+ T ++ I H P F Y A+ + H + E L P Y
Sbjct: 188 -------NYSHDILTNATLNFIRMHRK-VPFFFYAAYTSPHFSAAEEDEHGLTVPSTYPY 239
Query: 269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
+ + + K+AA++++LD+ VGK+V+ L + + N++I+F SDNGG + ++
Sbjct: 240 TEKDWPEAAK-KYAAMIYRLDKDVGKIVKLLNELGIRENTLIIFTSDNGGHSNVWD-KFR 297
Query: 329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+N L+G K L+EGG+R + P + ++ + D LPT A
Sbjct: 298 TNGKLKGHKRDLYEGGIRVPFIASWPGIIPASRKSDALIGFQDMLPTFAEIA 349
>gi|149174163|ref|ZP_01852791.1| putative secreted sulfatase ydeN precursor [Planctomyces maris DSM
8797]
gi|148847143|gb|EDL61478.1| putative secreted sulfatase ydeN precursor [Planctomyces maris DSM
8797]
Length = 470
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 111/363 (30%), Positives = 174/363 (47%), Gaps = 37/363 (10%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
+ + AV+ L F S+ A P +++F L DDLGW D+G +G D +PNID LA
Sbjct: 8 LFSQAVILLCFLSSITQPTHAADEKPWNVVFFLVDDLGWTDLGCYGSDFYQSPNIDQLAA 67
Query: 94 SGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL-YGCERGGLPLSE--------- 142
G+ +NY C+P+R A++TG +P T + + + PL
Sbjct: 68 EGMKFTQNYSACNACSPTRGALLTGMYPARTHLTDWIPGWAKSYTDFPLKPPEWKKHLDQ 127
Query: 143 --KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHL-GYWTG--HQDYFDHSAE 197
LP+ L+ GY+T VGKWHLG + P GF+ ++ G G +F + +
Sbjct: 128 KYTTLPEALRTAGYQTFHVGKWHLG--GRGNLPQDHGFDVNISGTNRGLPRSYHFPYGGD 185
Query: 198 EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
MK W + + +Y TD EAV +I D+P FLY + + HS
Sbjct: 186 AMK-WDSSLTEA-----ERQDRYLTDRMADEAVALIRQQQ-DKPFFLYCSFYSVHS---- 234
Query: 258 EPLQAPDHYLNIHRHIEDFKRSK---FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
P+Q + ++ + KR K +AA++ +DE++G+V L++ + ++IVF S
Sbjct: 235 -PIQGRPDLVKKYKGLPAGKRHKNPEYAAMIQSVDEAIGRVRAQLKESGIADRTLIVFTS 293
Query: 315 DNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLP 374
DNGG ++N PLRG K WEGG R ++ P + G V + + D+ P
Sbjct: 294 DNGGV----RRKTSNNDPLRGEKGQHWEGGTRVPAIVLWPGVTPAGSVCAEPIITMDFYP 349
Query: 375 TLL 377
T+L
Sbjct: 350 TIL 352
>gi|436838037|ref|YP_007323253.1| sulfatase [Fibrella aestuarina BUZ 2]
gi|384069450|emb|CCH02660.1| sulfatase [Fibrella aestuarina BUZ 2]
Length = 478
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 108/356 (30%), Positives = 172/356 (48%), Gaps = 50/356 (14%)
Query: 59 PPHIIFILADDLGWNDVGFH-----GLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRS 112
PP++I ILADD+G+ DVG + G I TPN++ +A G+ ++YT +C PSR
Sbjct: 34 PPNLIVILADDMGYGDVGCYRAASAGPALIQTPNLNRMAAQGMRFTDFYTGSTVCAPSRC 93
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELGYRTRIVGKWHLGFYKKE 170
A+MTG+H HT ++ N G PL ++ ++P+ LK+ Y T + GKW LG E
Sbjct: 94 ALMTGQHTGHTQIRGN-------GEKPLRQEDVVIPELLKKANYTTGMFGKWGLGMPDTE 146
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
P +G+++ G+ + +F ++ + +PA G Y+ D FT +A+
Sbjct: 147 GAPHRKGWDAFFGHVNHEEAHFQQHPFLWQISDGATVKVEQPA----GSYNNDAFTQQAL 202
Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR------------ 278
+ +T +P LYLA H+ L PD YLN +R R
Sbjct: 203 AFLDRQTT-KPFLLYLAFTLPHAE-----LHTPDTYLNQYRDATGKSRFQPEKPWPAGRH 256
Query: 279 --------SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN---- 326
+ +AA++ ++D VG+V+ L+Q+ + +N++++F SDNG G
Sbjct: 257 YGEQPEPKAAYAAMVSQIDGYVGQVLAKLDQKGLANNTLVLFASDNGTHIEGGRTQDDVS 316
Query: 327 -AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
S+ PLRGVK L++GG+R ++ P G D LPT AN
Sbjct: 317 YMQSSGPLRGVKRDLFDGGIRTPFIVRWPGHVKAGSTTAFVGAFYDLLPTFCELAN 372
>gi|336413800|ref|ZP_08594149.1| hypothetical protein HMPREF1017_01257 [Bacteroides ovatus
3_8_47FAA]
gi|335934817|gb|EGM96800.1| hypothetical protein HMPREF1017_01257 [Bacteroides ovatus
3_8_47FAA]
Length = 465
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 114/368 (30%), Positives = 185/368 (50%), Gaps = 41/368 (11%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVA--SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
+M ++L + L FV++ A +S P+IIF+LADD+G D+G +G +I TPNID L
Sbjct: 1 MMNKSILLMGTLLVSTFVEIAAQKNSENPNIIFVLADDMGIGDLGCYGQKKIKTPNIDKL 60
Query: 92 AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG----LPLSEKILP 146
A G++ N+Y+ + PSR +++TGKH H ++ N E G LP E +
Sbjct: 61 AEKGVLFTNHYSGSTVSAPSRCSLLTGKHTGHAYIRGNKGIKSEEGFFDLHLPSDEVTVA 120
Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT---GHQDYFDHSAEEMKMWG 203
+ LK GY T VGKW LG +P +GF+ GY + H+ Y ++ E
Sbjct: 121 EILKRKGYATMCVGKWGLGGPHTTGSPVRKGFDYFFGYLSQGAAHRYYPEYLYENEDKVM 180
Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPL-QA 262
L+ + YS + + +D I ++ P F Y A H+ Y L Q
Sbjct: 181 LNKKV-----------YSHFLIMEKGLDFIRKNA-GHPFFAYFAITPPHADLDYPDLSQY 228
Query: 263 PDHYLN---IHRHIEDFK-----RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
D + I+ + FK ++ +A+++ ++D++VG++++ L+++ +L N+II+F S
Sbjct: 229 EDAFPETPYINNKQKGFKTQMKPKAAYASMVSEIDKNVGQIIQLLKEKGILENTIIIFSS 288
Query: 315 DNGGAAAG------FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVH 368
DNG G F+ SN P RG K L+EGGVR ++ P + + E
Sbjct: 289 DNGVHCVGGHEPDFFD----SNGPYRGYKRDLYEGGVRAPFIVSWPKMIKKKRTVEHITT 344
Query: 369 VSDWLPTL 376
D+LPT+
Sbjct: 345 FWDFLPTV 352
>gi|204927175|ref|ZP_03218377.1| sulfatase [Salmonella enterica subsp. enterica serovar Javiana str.
GA_MM04042433]
gi|374999764|ref|ZP_09724105.1| arylsulfatase [Salmonella enterica subsp. enterica serovar Infantis
str. SARB27]
gi|416423753|ref|ZP_11691142.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 315996572]
gi|416429490|ref|ZP_11694552.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 495297-1]
gi|416441871|ref|ZP_11701958.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 495297-3]
gi|416446887|ref|ZP_11705399.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 495297-4]
gi|416455076|ref|ZP_11710701.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 515920-1]
gi|416458094|ref|ZP_11712696.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 515920-2]
gi|416477886|ref|ZP_11721589.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. NC_MB110209-0054]
gi|416487277|ref|ZP_11725587.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. OH_2009072675]
gi|416495848|ref|ZP_11728755.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. CASC_09SCPH15965]
gi|416506199|ref|ZP_11734417.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. SARB31]
gi|416545192|ref|ZP_11753251.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 19N]
gi|416558161|ref|ZP_11760062.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 42N]
gi|416572463|ref|ZP_11767208.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 4441 H]
gi|416580385|ref|ZP_11771776.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 81038-01]
gi|416587997|ref|ZP_11776533.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. MD_MDA09249507]
gi|416592560|ref|ZP_11779370.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 414877]
gi|416600365|ref|ZP_11784312.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 366867]
gi|416609010|ref|ZP_11789742.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 413180]
gi|416615356|ref|ZP_11793268.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 446600]
gi|416625575|ref|ZP_11798548.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 609458-1]
gi|416629908|ref|ZP_11800425.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 556150-1]
gi|416644798|ref|ZP_11807012.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 609460]
gi|416652444|ref|ZP_11811765.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 507440-20]
gi|416655742|ref|ZP_11812718.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 556152]
gi|416667431|ref|ZP_11818234.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. MB101509-0077]
gi|416683049|ref|ZP_11824165.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. MB102109-0047]
gi|416694147|ref|ZP_11826960.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. MB110209-0055]
gi|416708252|ref|ZP_11833114.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. MB111609-0052]
gi|416710327|ref|ZP_11834432.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 2009083312]
gi|416717531|ref|ZP_11839783.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 2009085258]
gi|416726451|ref|ZP_11846512.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 315731156]
gi|416732451|ref|ZP_11849752.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. IA_2009159199]
gi|416742001|ref|ZP_11855518.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. IA_2010008282]
gi|416742918|ref|ZP_11855868.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. IA_2010008283]
gi|416752811|ref|ZP_11860623.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. IA_2010008284]
gi|416763578|ref|ZP_11867252.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. IA_2010008285]
gi|416770088|ref|ZP_11871440.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. IA_2010008287]
gi|417472114|ref|ZP_12167921.1| Arylsulfatase [Salmonella enterica subsp. enterica serovar Rubislaw
str. A4-653]
gi|418483635|ref|ZP_13052642.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 80959-06]
gi|418487815|ref|ZP_13056006.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. CT_02035278]
gi|418494224|ref|ZP_13060680.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. CT_02035318]
gi|418501265|ref|ZP_13067654.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. CT_02035320]
gi|418503986|ref|ZP_13070345.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. CT_02035321]
gi|418508644|ref|ZP_13074947.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. CT_02035327]
gi|418513428|ref|ZP_13079658.1| sulfatase [Salmonella enterica subsp. enterica serovar Pomona str.
ATCC 10729]
gi|418524081|ref|ZP_13090069.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. IA_2010008286]
gi|452121665|ref|YP_007471913.1| arylsulfatase [Salmonella enterica subsp. enterica serovar Javiana
str. CFSAN001992]
gi|204323840|gb|EDZ09035.1| sulfatase [Salmonella enterica subsp. enterica serovar Javiana str.
GA_MM04042433]
gi|322615792|gb|EFY12712.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 315996572]
gi|322620640|gb|EFY17500.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 495297-1]
gi|322621731|gb|EFY18581.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 495297-3]
gi|322627457|gb|EFY24248.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 495297-4]
gi|322630763|gb|EFY27527.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 515920-1]
gi|322638017|gb|EFY34718.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 515920-2]
gi|322645937|gb|EFY42455.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. NC_MB110209-0054]
gi|322651173|gb|EFY47558.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. OH_2009072675]
gi|322656570|gb|EFY52858.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. CASC_09SCPH15965]
gi|322658771|gb|EFY55028.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 19N]
gi|322661786|gb|EFY58002.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 81038-01]
gi|322666467|gb|EFY62645.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. MD_MDA09249507]
gi|322672374|gb|EFY68486.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 414877]
gi|322676314|gb|EFY72385.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 366867]
gi|322679593|gb|EFY75638.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 413180]
gi|322684304|gb|EFY80308.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 446600]
gi|323191787|gb|EFZ77036.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 609458-1]
gi|323198944|gb|EFZ84042.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 556150-1]
gi|323201010|gb|EFZ86079.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 609460]
gi|323212904|gb|EFZ97706.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 556152]
gi|323216648|gb|EGA01373.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. MB101509-0077]
gi|323219854|gb|EGA04333.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. MB102109-0047]
gi|323226080|gb|EGA10297.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. MB110209-0055]
gi|323228733|gb|EGA12862.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. MB111609-0052]
gi|323236653|gb|EGA20729.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 2009083312]
gi|323239846|gb|EGA23893.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 2009085258]
gi|323242106|gb|EGA26135.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 315731156]
gi|323247453|gb|EGA31408.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. IA_2009159199]
gi|323250554|gb|EGA34436.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. IA_2010008282]
gi|323259284|gb|EGA42926.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. IA_2010008283]
gi|323263837|gb|EGA47358.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. IA_2010008284]
gi|323265567|gb|EGA49063.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. IA_2010008285]
gi|323270013|gb|EGA53461.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. IA_2010008287]
gi|353077982|gb|EHB43741.1| arylsulfatase [Salmonella enterica subsp. enterica serovar Infantis
str. SARB27]
gi|353655321|gb|EHC96360.1| Arylsulfatase [Salmonella enterica subsp. enterica serovar Rubislaw
str. A4-653]
gi|363554797|gb|EHL39029.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. SARB31]
gi|363573015|gb|EHL56902.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 4441 H]
gi|363577470|gb|EHL61293.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 42N]
gi|366061128|gb|EHN25381.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 80959-06]
gi|366062541|gb|EHN26772.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. CT_02035318]
gi|366066779|gb|EHN30937.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. CT_02035320]
gi|366070570|gb|EHN34679.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. CT_02035278]
gi|366072314|gb|EHN36406.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. CT_02035321]
gi|366079899|gb|EHN43881.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. CT_02035327]
gi|366081789|gb|EHN45729.1| sulfatase [Salmonella enterica subsp. enterica serovar Pomona str.
ATCC 10729]
gi|366827048|gb|EHN53958.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 507440-20]
gi|372208393|gb|EHP21889.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. IA_2010008286]
gi|451910669|gb|AGF82475.1| arylsulfatase [Salmonella enterica subsp. enterica serovar Javiana
str. CFSAN001992]
Length = 497
Score = 150 bits (380), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 123/415 (29%), Positives = 203/415 (48%), Gaps = 54/415 (13%)
Query: 31 RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
+T I + L L T S + V A+ P+++FI+ADDLG+ D+ +G + TPNID
Sbjct: 3 KTVIASMIGLALCAT-STISVVHAAAEKRPNLVFIVADDLGYGDLATYGHQIVKTPNIDR 61
Query: 91 LAYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
LA G+ YY LC+PSR+ ++TG+ P TG++ + G + +E + L
Sbjct: 62 LAKEGVKFTEYYAPAPLCSPSRAGMLTGRMPFRTGVRSWIPEGTNV-SIGRNELTIANLL 120
Query: 150 KELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLD 205
K+ GY T ++GK HL G + P GF+ L G D +A+E +G+
Sbjct: 121 KQQGYDTAMMGKLHLNAGGDRTDQPQPKELGFDYSLVNPAGFVTDATLDNAKERPRYGV- 179
Query: 206 MRRDLEP-AWDLHGKY-------STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
+ P W +GK+ S + ++E V+ + N D+P FLY+A HS
Sbjct: 180 ----VHPTGWMRNGKHIDRADSISGEFVSSEVVNWLDNKKDDKPFFLYVAFTEVHS---- 231
Query: 258 EPLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEA 298
PL +P YL+++ +++ ++++ ++ A + +D VGKV++
Sbjct: 232 -PLASPKKYLDMYSQYMSEYQKQHPDLFYGDWADKPWRGTGEYYANISYMDAQVGKVLDK 290
Query: 299 LEQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWS 353
++ N+I++F SDNG A + LN A LRG K+ LWEGG+R +I
Sbjct: 291 IKAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKY 350
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
+G+V + V+ DW+PTL AN D +P ++++P E L+
Sbjct: 351 GKHIPQGVVTDTPVYGLDWMPTL---ANMMDFKLPTDRTYDGQSLVPLLEQKTLK 402
>gi|320103547|ref|YP_004179138.1| sulfatase [Isosphaera pallida ATCC 43644]
gi|319750829|gb|ADV62589.1| sulfatase [Isosphaera pallida ATCC 43644]
Length = 502
Score = 150 bits (380), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 115/344 (33%), Positives = 167/344 (48%), Gaps = 46/344 (13%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTG 117
PP+ + ++ADDLGW+D+ +G TPN+DALA G Y +C+P+R++I TG
Sbjct: 49 PPNFVILVADDLGWSDLACYGHPFHETPNLDALAAGGARFTAAYAAAPVCSPTRASIQTG 108
Query: 118 KHPIHTGMQHNV----------LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
+HP G+ ++ LPL + L + LK GYRT GKWHLG
Sbjct: 109 RHPARYGLTAHIPGHWRPFEKLAEPPPAQALPLEAETLGERLKAHGYRTGYFGKWHLG-- 166
Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
+ PT +G++ L ++ H+ + D R P +V
Sbjct: 167 GNGFGPTDQGYDEAL-------EFSSHNYPPGQQAKPDQPRRRGP----------EVLAD 209
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK-----RSKFA 282
+AVD I N D P L + + A H PLQA +R + R +A
Sbjct: 210 KAVDFI-NRQDDRPFLLQVHYFAVHI-----PLQADPRLEAKYRAKPQVEGRPPARPDYA 263
Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA---AAGFNLNAASNWPLRGVKNT 339
A+L ++D S+G+++ AL QR + SN+ +VF+SDNGG + G+ SN PLR K T
Sbjct: 264 ALLEEMDTSIGRILNALNQRGLDSNTWVVFLSDNGGLERESGGW--PGTSNRPLRDQKGT 321
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
L+EGG+R + P L G V E V D PTLL+AA+ S
Sbjct: 322 LYEGGIRIPMIARIPGLTKPGTVVETPVVTHDLAPTLLAAAHPS 365
>gi|291414059|ref|XP_002723273.1| PREDICTED: arylsulfatase A [Oryctolagus cuniculus]
Length = 507
Score = 150 bits (380), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 113/350 (32%), Positives = 160/350 (45%), Gaps = 40/350 (11%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
PP+I+ I ADDLG+ D+G +G TPN+D LA G+ ++Y V LCTPSR+A++TG
Sbjct: 20 PPNIVLIFADDLGYGDLGSYGHPSSTTPNLDQLATGGLRFTDFYVPVSLCTPSRAALLTG 79
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK-EYTPTFR 176
+ P+ GM VL RGGLPL E L + L GY T + GKWHLG + + P +
Sbjct: 80 RLPVRMGMYPGVLEPSSRGGLPLEEVTLAEVLSARGYLTGMAGKWHLGVGPEGAFLPPHQ 139
Query: 177 GFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------MRRDLEPAW--DLHG 218
GF LG H Q+ G D + + +P W L
Sbjct: 140 GFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEA 199
Query: 219 KYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
+Y A A D++ + P FLY A TH + H
Sbjct: 200 RY-----VAFARDLMADAQRQGRPFFLYYASHHTHYPQ-----------FSGHSFAGHSG 243
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
R F L +LD +VG ++ A+ +L ++++F +DNG + S LR K
Sbjct: 244 RGPFGDSLMELDAAVGALMTAVGDLGLLGETLVIFTADNGPETMRMSRGGCSGL-LRCGK 302
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
T +EGGVR L + P + G+ E + D LPTL S A + +PN
Sbjct: 303 GTTFEGGVREPALAFWPGRITPGVTHELASSL-DLLPTLASLAG-APLPN 350
>gi|168230293|ref|ZP_02655351.1| sulfatase [Salmonella enterica subsp. enterica serovar Kentucky
str. CDC 191]
gi|194473008|ref|ZP_03078992.1| sulfatase [Salmonella enterica subsp. enterica serovar Kentucky
str. CVM29188]
gi|238910791|ref|ZP_04654628.1| sulfatase [Salmonella enterica subsp. enterica serovar Tennessee
str. CDC07-0191]
gi|194459372|gb|EDX48211.1| sulfatase [Salmonella enterica subsp. enterica serovar Kentucky
str. CVM29188]
gi|205334980|gb|EDZ21744.1| sulfatase [Salmonella enterica subsp. enterica serovar Kentucky
str. CDC 191]
Length = 497
Score = 150 bits (380), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 122/415 (29%), Positives = 203/415 (48%), Gaps = 54/415 (13%)
Query: 31 RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
+T I + L L T ++ V A+ P+++FI+ADDLG+ D+ +G + TPNID
Sbjct: 3 KTVIASMIGLALCATSTLSVVH-AAAEKRPNLVFIVADDLGYGDLATYGHQIVKTPNIDR 61
Query: 91 LAYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
LA G+ YY LC+PSR+ ++TG+ P TG++ + G + +E + L
Sbjct: 62 LAKEGVKFTEYYAPAPLCSPSRAGMLTGRMPFRTGVRSWIPEGTNV-SIGRNELTIANLL 120
Query: 150 KELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLD 205
K+ GY T ++GK HL G + P GF+ L G D +A+E +G+
Sbjct: 121 KQQGYDTAMMGKLHLNAGGDRTDQPQPKELGFDYSLVNPAGFVTDATLDNAKERPRYGV- 179
Query: 206 MRRDLEP-AWDLHGKY-------STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
+ P W +GK+ S + ++E V+ + N D+P FLY+A HS
Sbjct: 180 ----VHPTGWMRNGKHIDRADSISGEFVSSEVVNWLDNKKDDKPFFLYVAFTEVHS---- 231
Query: 258 EPLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEA 298
PL +P YL+++ +++ ++++ ++ A + +D VGKV++
Sbjct: 232 -PLASPKKYLDMYSQYMSEYQKQHPDLFYGDWADKPWRGTGEYYANISYMDAQVGKVLDK 290
Query: 299 LEQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWS 353
++ N+I++F SDNG A + LN A LRG K+ LWEGG+R +I
Sbjct: 291 IKAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKY 350
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
+G+V + V+ DW+PTL AN D +P ++++P E L+
Sbjct: 351 GKHIPQGVVTDTPVYGLDWMPTL---ANMMDFKLPTDRTYDGQSLVPLLEQKTLK 402
>gi|161612357|ref|YP_001586322.1| hypothetical protein SPAB_00045 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
gi|167552131|ref|ZP_02345884.1| sulfatase [Salmonella enterica subsp. enterica serovar Saintpaul
str. SARA29]
gi|168244457|ref|ZP_02669389.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
str. SL486]
gi|194451527|ref|YP_002044003.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
str. SL476]
gi|200388296|ref|ZP_03214908.1| sulfatase [Salmonella enterica subsp. enterica serovar Virchow str.
SL491]
gi|386589918|ref|YP_006086318.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
Heidelberg str. B182]
gi|417536328|ref|ZP_12189519.1| Arylsulfatase [Salmonella enterica subsp. enterica serovar
Wandsworth str. A4-580]
gi|419731865|ref|ZP_14258774.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
str. 41579]
gi|419733203|ref|ZP_14260104.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
str. 41563]
gi|419741457|ref|ZP_14268151.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
str. 41573]
gi|419745011|ref|ZP_14271656.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
str. 41566]
gi|419750318|ref|ZP_14276781.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
str. 41565]
gi|421569549|ref|ZP_16015251.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00322]
gi|421576762|ref|ZP_16022356.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00325]
gi|421581248|ref|ZP_16026794.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00326]
gi|421583838|ref|ZP_16029354.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00328]
gi|161361721|gb|ABX65489.1| hypothetical protein SPAB_00045 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
gi|194409831|gb|ACF70050.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
str. SL476]
gi|199605394|gb|EDZ03939.1| sulfatase [Salmonella enterica subsp. enterica serovar Virchow str.
SL491]
gi|205323154|gb|EDZ10993.1| sulfatase [Salmonella enterica subsp. enterica serovar Saintpaul
str. SARA29]
gi|205336651|gb|EDZ23415.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
str. SL486]
gi|353670566|gb|EHD07127.1| Arylsulfatase [Salmonella enterica subsp. enterica serovar
Wandsworth str. A4-580]
gi|381290892|gb|EIC32148.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
str. 41579]
gi|381291686|gb|EIC32922.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
str. 41573]
gi|381300912|gb|EIC41969.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
str. 41563]
gi|381306844|gb|EIC47712.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
str. 41566]
gi|381307271|gb|EIC48132.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
str. 41565]
gi|383796962|gb|AFH44044.1| Arylsulfatase [Salmonella enterica subsp. enterica serovar
Heidelberg str. B182]
gi|402517722|gb|EJW25120.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00325]
gi|402517774|gb|EJW25169.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00326]
gi|402526800|gb|EJW34068.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00322]
gi|402531739|gb|EJW38944.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00328]
Length = 497
Score = 150 bits (380), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 122/415 (29%), Positives = 203/415 (48%), Gaps = 54/415 (13%)
Query: 31 RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
+T I + L L T ++ V A+ P+++FI+ADDLG+ D+ +G + TPNID
Sbjct: 3 KTVIASMIGLALCATSTLSVVH-AAAEKRPNLVFIVADDLGYGDLATYGHQIVKTPNIDR 61
Query: 91 LAYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
LA G+ YY LC+PSR+ ++TG+ P TG++ + G + +E + L
Sbjct: 62 LAKEGVKFTEYYAPAPLCSPSRAGMLTGRMPFRTGVRSWIPEGTNV-SIGRNELTIANLL 120
Query: 150 KELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLD 205
K+ GY T ++GK HL G + P GF+ L G D +A+E +G+
Sbjct: 121 KQQGYDTAMMGKLHLNAGGDRTDQPQPKELGFDYSLVNPAGFVTDATLDNAKERPRYGV- 179
Query: 206 MRRDLEP-AWDLHGKY-------STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
+ P W +GK+ S + ++E V+ + N D+P FLY+A HS
Sbjct: 180 ----VHPTGWMRNGKHIDRADSISGEFVSSEVVNWLDNKKDDKPFFLYVAFTEVHS---- 231
Query: 258 EPLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEA 298
PL +P YL+++ +++ ++++ ++ A + +D VGKV++
Sbjct: 232 -PLASPKKYLDMYSQYMSEYQKQHPDLFYGDWADKPWRGTGEYYANISYMDAQVGKVLDK 290
Query: 299 LEQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWS 353
++ N+I++F SDNG A + LN A LRG K+ LWEGG+R +I
Sbjct: 291 IKAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKY 350
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
+G+V + V+ DW+PTL AN D +P ++++P E L+
Sbjct: 351 GKHIPQGVVTDTPVYGLDWMPTL---ANMMDFKLPTDRTYDGQSLVPLLEQKTLK 402
>gi|167762012|ref|ZP_02434139.1| hypothetical protein BACSTE_00358 [Bacteroides stercoris ATCC
43183]
gi|167700104|gb|EDS16683.1| arylsulfatase [Bacteroides stercoris ATCC 43183]
Length = 463
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 119/406 (29%), Positives = 190/406 (46%), Gaps = 45/406 (11%)
Query: 30 YRTRIMAFAVLPLAFTLSMVFV--DLVASSG-PPHIIFILADDLGWNDVGFHGLDQIPTP 86
+R I+ V F + + LV G P+IIFILADD+G+ D+ +G I TP
Sbjct: 2 FRKDIVLGGVCAGMFVMPQLMTARQLVTDGGDKPNIIFILADDMGYCDLSCYGNKYIETP 61
Query: 87 NIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG-------- 137
NID LA +G Y + +PSR A+MTGK+ +T ++ N C GG
Sbjct: 62 NIDRLAATGTAFTQCYAGSGISSPSRCALMTGKNTGNTTIRDNF---CIAGGIEGLKGTK 118
Query: 138 ------LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDY 191
L ++ + L GYRT +V KWHL + E TP RGF+ G+
Sbjct: 119 TIRRMHLQPNDTTIATVLGAAGYRTCLVNKWHLDGFNPEATPLNRGFDEFYGWLISTA-- 176
Query: 192 FDHSAEEMKMWGLD---MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAH 248
+ + W + + E D H K++TD+ T +A+ I N + + P FLYLA+
Sbjct: 177 YSNDPYYYPYWRFNNEKLENVKENEGDKHIKHNTDLSTEDAIKFI-NRNKNNPFFLYLAY 235
Query: 249 AATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNS 308
A P+EP D D ++A+++ +D ++G+++ L++ + N+
Sbjct: 236 DA-----PHEPYNI-DETTWYDDEAWDMNTKRYASLITHMDRAIGRLLAELDRLGLRENT 289
Query: 309 IIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQ--- 365
+++F SDNG A L+G+K L+EGG+R ++ P G V Q
Sbjct: 290 LVIFASDNGAAKQAPLEELGCKGSLKGMKGQLYEGGIRVPFIVNQP-----GKVPVQKLN 344
Query: 366 -YVHVSDWLPTLLSAANKSD-IPNYVNSTVENIIPRYENSILRYEN 409
++ D +PTL + A +D +P +N NI+P + L +N
Sbjct: 345 NIIYFPDVMPTLAALAGATDKLPQKLNGI--NILPLFYGQQLDTDN 388
>gi|351712929|gb|EHB15848.1| N-acetylgalactosamine-6-sulfatase [Heterocephalus glaber]
Length = 482
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 165/335 (49%), Gaps = 44/335 (13%)
Query: 70 LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
+GW D+G +G TPN+D +A G++ N+Y+ LC+PSR+A+++G+ PI +G
Sbjct: 1 MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLSGRPPIRSGFYTT 60
Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
+ GG+P SE++LP LKE GY T+IVGKWHLG ++ ++ P GF+
Sbjct: 61 NAHARNAYTPQEIVGGIPDSERLLPSLLKEAGYATKIVGKWHLG-HRPQFHPLKHGFDEW 119
Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
G H +D+ A+ + + +D W++ G++ T ++
Sbjct: 120 FGSPNCHFGPYDNKAKP----NIPVYKD----WEMVGRFYEEFPINVKTGESNLTQIYLQ 171
Query: 228 EAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
EA+D I + P FLY A ATH+ P+ A +L +R ++ +
Sbjct: 172 EALDFIKRQQAARRPFFLYWAVDATHA-----PVYASRPFLG------SSQRGRYGDAVR 220
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
++D+SVGK++ L+ + ++ + F SDNG A + SN P K T +EGG+R
Sbjct: 221 EIDDSVGKMLSLLQVLGLAESTFVFFTSDNGAALVS-APSQGSNGPFLCGKQTTFEGGMR 279
Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+ W P G V+ Q + D T L+ A
Sbjct: 280 EPAIAWWPRHIPPGQVSHQLGSIMDLFTTSLALAG 314
>gi|326331907|ref|ZP_08198194.1| twin-arginine translocation pathway signal [Nocardioidaceae
bacterium Broad-1]
gi|325950404|gb|EGD42457.1| twin-arginine translocation pathway signal [Nocardioidaceae
bacterium Broad-1]
Length = 463
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 105/350 (30%), Positives = 171/350 (48%), Gaps = 43/350 (12%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+++ IL DDLGW D+ +G I TP++D LA SG+ + Y+ +C+P+R + TG+
Sbjct: 49 PNLLVILGDDLGWADLSAYGSPDIKTPHLDRLAASGVRFTDAYSASSVCSPTRFGLYTGR 108
Query: 119 HP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+P + G+ + + G+PL L LKE GY T + GKWH G Y Y+PT
Sbjct: 109 YPGRLAGGLPEPISAPNKLHGIPLDHPTLASLLKEQGYATALFGKWHCG-YLPWYSPTRV 167
Query: 177 GFESHLGYWTGHQDYF---DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G++ G ++G DYF +H+ D+ D DL +Y T++ T AV I
Sbjct: 168 GWDEFFGNFSGGLDYFSKINHNGAH------DLYEDEVEVEDL--RYYTEIVTERAVSFI 219
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL--------------NIHRHIEDFKRS 279
D P L L + P+ P + P+ + H + +
Sbjct: 220 ERDH-DTPWLLNL-----NFTTPHWPWEGPEDKAVSDELTARIEGGEKGVLFHTDGGSLA 273
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
+ ++ LD+S+G+V++ALE+ L ++++ F SDNGG ++ WP G K+
Sbjct: 274 TYKTMVEDLDQSIGQVIKALERTGQLRDTVVFFASDNGGE------RFSNVWPFTGAKSG 327
Query: 340 LWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
L EGG+R ++ W +L R V+++ V +DW T L A + +Y
Sbjct: 328 LNEGGIRVPTILSWPGILRGRQ-VSDEPVVTTDWTATFLELAGATPAEDY 376
>gi|296122982|ref|YP_003630760.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296015322|gb|ADG68561.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 466
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 114/360 (31%), Positives = 173/360 (48%), Gaps = 56/360 (15%)
Query: 51 VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTP 109
A S P+I+ I ADDLG+ D+G +G I TP++D +A G+ ++Y+ ++CTP
Sbjct: 27 ASFAADSTKPNIVVIFADDLGYGDLGCYGSPTIRTPHLDEMAAEGLRFTDFYSAAEVCTP 86
Query: 110 SRSAIMTGKHPIHTGM--QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
SR+A++TG+ PI +GM VL+ +GGLP +E + + LKE GY T +GKWHLG +
Sbjct: 87 SRAALLTGRLPIRSGMCGARRVLFPNSKGGLPPAEITIAEALKEKGYATAQIGKWHLGIH 146
Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAE-----------EMKMWGLDMRRDLEPAWDL 216
P +GF+ G + + D A+ + W + + R+ E
Sbjct: 147 PGS-RPLDQGFDQSFGL--PYSNDMDARADLPKGSTGSPNPPLDGWNVALLRNGEVVEQP 203
Query: 217 HGKYS-TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
+ + T +T EA+ I + P FLY+ H P+ P+ A +D
Sbjct: 204 ANQTTLTKRYTEEAIKFI-TEKKNVPFFLYMPH-----TFPHVPMFAS----------QD 247
Query: 276 FKRSKFAAI----LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG--------GAAAGF 323
FK A I + +LD SVG+V+ AL + + N+++ F SDNG G +AG
Sbjct: 248 FKGKSRAGIYGDAVEELDWSVGQVLGALCREGIAENTLVFFSSDNGPWLIMGDQGGSAGL 307
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
L+ K + WEGG+R G+ W P G V Q D PT L+ A S
Sbjct: 308 ---------LKDGKGSTWEGGMRVPGIAWMPSRIKPG-VTSQLASAMDVFPTALALAGAS 357
>gi|160900836|ref|YP_001566418.1| sulfatase [Delftia acidovorans SPH-1]
gi|160366420|gb|ABX38033.1| sulfatase [Delftia acidovorans SPH-1]
Length = 457
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 109/346 (31%), Positives = 168/346 (48%), Gaps = 45/346 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIP----TPNIDALAYSGIIL-KNYYTVQLCTPSRSAI 114
P+I+FI+ADDLG+ D+G +G +P +D LA G+ L + Y +C+P+R A+
Sbjct: 20 PNILFIVADDLGYADLGCYGGRAADFGAVSPVLDRLAAGGLRLTQGYANSPVCSPTRFAL 79
Query: 115 MTGKH----------PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
T ++ PI++ + L E+ GLP + L++ GYRT ++GKWHL
Sbjct: 80 ATARYQYRLRGAAEEPINSKTRGTPL--GEKLGLPPDMPTVASMLRDAGYRTALIGKWHL 137
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDH--SAEEMKMWGLDMRRDLEPAWDLHGKYST 222
G Y + P G+E + G +G DYF H S+ + +W + E Y T
Sbjct: 138 G-YPPHFGPLRSGYEEYFGPMSGGVDYFTHLSSSGQHDLWVGEEEHHDE-------GYLT 189
Query: 223 DVFTAEAVDIIHNHST-DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR------HIED 275
D+ + +VD +H + D P FL L H P+ P + D H++
Sbjct: 190 DLLSQRSVDFVHRMAQGDAPFFLSL-----HYTAPHWPWETRDDRSTAEALGAGIAHLDG 244
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
++ ++H +DE +G +VEAL L N++IVF SDNGG + NWPL G
Sbjct: 245 GNIHQYRRMIHHMDEGIGWIVEALRANGQLDNTLIVFTSDNGGE------RFSDNWPLVG 298
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K L EGG+R + P + + G + Q+ DW T+L AA
Sbjct: 299 GKMDLTEGGIRVPWIAHWPAVIAPGRSSPQHCMSMDWSATVLDAAG 344
>gi|339998071|ref|YP_004728954.1| secreted sulfatase [Salmonella bongori NCTC 12419]
gi|339511432|emb|CCC29132.1| putative secreted sulfatase [Salmonella bongori NCTC 12419]
Length = 497
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 123/415 (29%), Positives = 203/415 (48%), Gaps = 54/415 (13%)
Query: 31 RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
+T I + L L T S + V A+ P+++FI+ADDLG+ D+ +G + TPNID
Sbjct: 3 KTVIASMIGLALCAT-STLSVAHAAAEKRPNLVFIVADDLGYGDLATYGHQIVKTPNIDR 61
Query: 91 LAYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
LA G+ YY LC+PSR+ ++TG+ P TG++ + G + +E + L
Sbjct: 62 LAKEGVKFTEYYAPAPLCSPSRAGMLTGRMPFRTGVRSWIPEGTNV-SIGRNELTIANLL 120
Query: 150 KELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLD 205
K+ GY T ++GK HL G + P GF+ L G D +A+E +G+
Sbjct: 121 KQQGYDTAMMGKLHLNAGGDRTDQPQPKELGFDYSLVNPAGFVTDATLDNAKERPRYGV- 179
Query: 206 MRRDLEP-AWDLHGKY-------STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
+ P W +GK+ S + ++E V+ + N D+P FLY+A HS
Sbjct: 180 ----VHPTGWMRNGKHIDRADSISGEFVSSEVVNWLDNKKDDKPFFLYVAFTEVHS---- 231
Query: 258 EPLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEA 298
PL +P YL+++ +++ ++++ ++ A + +D VGKV++
Sbjct: 232 -PLASPKKYLDMYSQYMSEYQKQHPDLFYGDWADKPWRGTGEYYANISYMDAQVGKVLDK 290
Query: 299 LEQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWS 353
++ N+I++F SDNG A + LN A LRG K+ LWEGG+R +I
Sbjct: 291 IKAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKY 350
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
+G+V + V+ DW+PTL AN D +P ++++P E L+
Sbjct: 351 GKHIPQGVVTDTPVYGLDWMPTL---ANMMDFKLPTDRTYDGQSLVPLLEQKTLK 402
>gi|114798724|ref|YP_760874.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
gi|114738898|gb|ABI77023.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
Length = 505
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 98/325 (30%), Positives = 162/325 (49%), Gaps = 41/325 (12%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRS 112
A+S P+I+ I DD+G+ D+G G TPN+D LA G ++Y +CTPSR+
Sbjct: 40 AAASEQPNIVLIFVDDMGYADIGSFGSPIARTPNLDRLAMEGQKWTSFYAPAPVCTPSRA 99
Query: 113 AIMTGKHPIHTGM-----QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
+MTG+ + +GM +VL+ GGLP SE + + L++ GY + GKWH+G +
Sbjct: 100 GLMTGRLAVRSGMAGLVQARHVLFPTSTGGLPQSEVTIAELLQQEGYVSAAFGKWHMG-H 158
Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA----WDLHGKYSTD 223
E+ PT GF+S+ G + + + W +D+ EP WD+ +
Sbjct: 159 LPEFLPTSHGFQSYFG--IPYSNDMNMPGGGETPWSIDLF--FEPPNIQNWDVPLMQDEE 214
Query: 224 V-------------FTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
+ +T A++ + +H+ +P FLYLAH H+ PL + + +
Sbjct: 215 IIERPADQFTLTQRYTERAIEFMETSHAEGQPFFLYLAHNMPHT-----PLFTSEGFTGV 269
Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
+ ++ +LD SVG++V+AL+ ++ N++++F SDNG A ++ S
Sbjct: 270 S------AGGAYGDVIEELDWSVGEIVDALKDMKIEKNTLVIFTSDNGPWLA-MKTHSGS 322
Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSP 354
LR K T WEGG+R + W P
Sbjct: 323 AGMLRDGKGTTWEGGMRVPAIFWWP 347
>gi|149177572|ref|ZP_01856174.1| hypothetical protein PM8797T_00157 [Planctomyces maris DSM 8797]
gi|148843552|gb|EDL57913.1| hypothetical protein PM8797T_00157 [Planctomyces maris DSM 8797]
Length = 599
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 105/352 (29%), Positives = 176/352 (50%), Gaps = 43/352 (12%)
Query: 45 TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
TL + + ++ P+++ I+ DD GW DV H I TP D LA G + +Y
Sbjct: 16 TLILSRGSFLQAAERPNVLLIMTDDQGWGDVRSHDNPLIETPQQDLLASQGARFERFYVS 75
Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
+C P+RS+++TG++ + TG+ H V G E + E + + K GY+T GKWH
Sbjct: 76 PVCAPTRSSLLTGRYSLRTGV-HGVTRGFEN--MRAEETTIAEMFKAAGYKTGAFGKWHN 132
Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQD-YFD----HSAEEMKMWGLDMRRDLEPAWDLHGK 219
G + + P +GF+ G+ GH + YFD H+ + +K G
Sbjct: 133 GRHYPMH-PNGQGFDEFFGFCGGHWNRYFDTNLEHNKQPVKTEG---------------- 175
Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN--IHRHIEDFK 277
Y TDV T A+D I + D+P F Y+ + A HS P P+ Y + ++ ++D
Sbjct: 176 YITDVLTDRAIDFI-KQNKDQPFFCYVPYNAPHS-----PWIVPEKYWDKYANKGLDDKA 229
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
R + A++ +D+++G++++ L+ ++ N+I++F++DNG + +N N +RG K
Sbjct: 230 RCAY-AMVECVDDNLGRLMQTLDDLKLSDNTIVLFLTDNGPNSNRYNGN------MRGRK 282
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL---SAANKSDIP 386
++ EGG+R + P G V + D LPTLL S N +D P
Sbjct: 283 GSIHEGGIRVPLFVRYPGKIKAGTVVKPIAAHIDILPTLLELCSVENTADQP 334
>gi|449138176|ref|ZP_21773471.1| arylsulfatase [Rhodopirellula europaea 6C]
gi|448883200|gb|EMB13738.1| arylsulfatase [Rhodopirellula europaea 6C]
Length = 516
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 110/366 (30%), Positives = 186/366 (50%), Gaps = 50/366 (13%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSA 113
A++ P+I+ I+ADDLG++++G +G +I TP +D LA GI L N+Y+ +C PSR
Sbjct: 47 ATASRPNIVLIVADDLGYSELGCYGQTKIRTPRLDQLAAEGIKLTNFYSGNAVCAPSRCC 106
Query: 114 IMTGKHPIHTGMQHNV-------------LYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
+MTGKHP H +++N L + LP+ E + +YLK +GYRT G
Sbjct: 107 LMTGKHPGHAYVRNNGDPKIDPAVREELKLEFPGQYPLPVDEVTIAEYLKSVGYRTGAFG 166
Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD--LHG 218
KW LG + P +GF+ G+ + H+ +W + ++++P D LHG
Sbjct: 167 KWGLGHFGTTGDPNEQGFDLFYGF---NCQRHAHNHYPKFLW-RNRVKEVQPGNDRTLHG 222
Query: 219 K-YSTDVFTAEAVDIIHN---HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI- 273
+ YS D F EA + I T +P F YL A H + +Q P+ ++ + +
Sbjct: 223 ETYSQDQFVNEACEFIRQSVAEDTTQPFFAYLPFAVPHLS-----IQVPEEEVDAYEGVI 277
Query: 274 --EDFK----------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG---- 317
D++ R+ +AA++ ++DE VG++V+ ++ + N++I+F SDNG
Sbjct: 278 EEADYEHHGYLKHPRPRAGYAAMVTRMDEGVGQIVDLIDSMGLGENTLILFTSDNGPTYD 337
Query: 318 --GAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
G + ++AS ++G+K L EGG+R + + G ++ D+LPT
Sbjct: 338 RLGGSDSDYFDSASG--MKGLKGQLDEGGIRVPMIARQTGVVPAGRTSDWIGAWWDFLPT 395
Query: 376 LLSAAN 381
+ AA
Sbjct: 396 ITDAAG 401
>gi|116620007|ref|YP_822163.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
gi|116223169|gb|ABJ81878.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
Length = 443
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 176/371 (47%), Gaps = 41/371 (11%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
+A +L+ + ++ P+++ ++ DDLG +D+G+ G + TP+IDALA G+ +N+
Sbjct: 9 MAASLAAPLIRAAPAADRPNVLVVVLDDLGCHDLGYLGAADLKTPHIDALAARGLKFRNW 68
Query: 102 YT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKI--LPQYLKELGYRTRI 158
Y+ +C P+RSAI+TG+ P G+ N G L+ I L LK GY+T
Sbjct: 69 YSNAPVCAPARSAILTGRFPASAGVPDN--------GPALAHGIPTLASVLKGSGYQTGC 120
Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH- 217
GKWHLG E PT GF+S G+ +G DY+ H WG D DL W
Sbjct: 121 FGKWHLG-STDETAPTGHGFDSFYGFHSGCVDYYSHR----FYWG-DNYHDL---WHNRT 171
Query: 218 -----GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
G+Y T+ EA I + P Y+A A P+ P+ AP Y +
Sbjct: 172 EIFEDGRYLTERIADEAAGFI---GRNRPFLGYVAFNA-----PHYPMHAPAQYKARFPN 223
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA---AGFN---LN 326
+ +R +AA++ +D+ +G++ ALE N+++ F+ DNG AG N
Sbjct: 224 LAP-ERQTYAAMIAAVDDGIGQIQRALETTGAAENTLMFFIGDNGATTEKRAGLNGDFAT 282
Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
A N +G K +L++GG+ G + P +G ++ D LPT+ A P
Sbjct: 283 AGDNGVFKGYKFSLFDGGMHVPGFVSWPAGIRKGGWTDELAMSMDILPTICRATGAPLPP 342
Query: 387 NYVNSTVENII 397
S + N I
Sbjct: 343 RVDGSDLLNTI 353
>gi|167524284|ref|XP_001746478.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775240|gb|EDQ88865.1| predicted protein [Monosiga brevicollis MX1]
Length = 520
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 108/360 (30%), Positives = 164/360 (45%), Gaps = 68/360 (18%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQ----IPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
PHI ++ DD GW +VG+H ++ + TP IDAL GI L +Y CTP+
Sbjct: 31 PHIFMVIVDDFGWAEVGYHRSEEYKQDVQTPTIDALVQEGIELNRHYVHYACTPNP---- 86
Query: 116 TGKHPIHTGMQHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
C+ G G+P + + ++L+ GY T VGKW +G ++TP
Sbjct: 87 -----------------CQPGCGMPQNMTGVAEHLQSAGYATHFVGKWDVGMTTPQHTPK 129
Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL----------------EPAWDLHG 218
RG+++ L Y+ GH ++ AE + + R DL +PA +L+G
Sbjct: 130 GRGYDTSLNYF-GHGNWMWSEAEWQGSY--NHRSDLPPCEPPACFKDFWETDKPAAELNG 186
Query: 219 -KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
+Y +F I+ H PLFL A H PLQAP Y IE+
Sbjct: 187 TQYEEAIFRDRVYSILEQHDPSTPLFLTYASKVAHY-----PLQAPIEYQEKFSFIEEPH 241
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG---------------AAAG 322
R + A++ LD+++ L+ + M +N+++V SDNGG A
Sbjct: 242 RRVYHAMVAFLDDNLANFTSLLKSKGMWNNTLMVLTSDNGGYVKSFLGDCDNNTSHGYAC 301
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
FN A +N+PL+G K + EGGVR + + E RG ++ VHV+DW T + A
Sbjct: 302 FNGEAGANYPLQGGKYSTLEGGVRVNAFVSGGFVPEERRGTREDEMVHVADWYATFAALA 361
>gi|149198815|ref|ZP_01875857.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
gi|149138013|gb|EDM26424.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
Length = 598
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 110/376 (29%), Positives = 181/376 (48%), Gaps = 50/376 (13%)
Query: 44 FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
L + ++ L A+ P+ I I DD G+ D+G G +I TP ID +A G N+Y+
Sbjct: 8 LVLGLAYLHLQATDKKPNFIVIFTDDQGYQDLGCFGSPKIKTPEIDQMAKEGARYTNFYS 67
Query: 104 V-QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKW 162
+C+ SR+A++TG++P G+ H V Y GL SE + + LK GYRT I+GKW
Sbjct: 68 ANAICSASRAALLTGRYPSRNGVFH-VYYPGASQGLKPSEITIAEVLKTAGYRTSIIGKW 126
Query: 163 HLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS---AEEMKMWG---------------- 203
HLG + ++ PT +GF+S+ G + + A+++K++G
Sbjct: 127 HLG-DRNQFLPTNQGFDSYFGIPFSNDMWMSKDLALADDIKLFGGVTVEQIKSGEASKAV 185
Query: 204 --------LDMRRDLEPA-WDLHGKYSTDVFTAEAV-DIIHNHSTDEPLFLYLAHAATHS 253
+ + RD E + + Y T +T EA+ I + +P F+YLA+A
Sbjct: 186 KGEKRGGKVPLMRDEEVVEYPVDQTYITQRYTDEALKIIKESEKKKQPYFIYLAYAM--- 242
Query: 254 ANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
P+ PL A + R + + ++D VG++++ L+ N++++F
Sbjct: 243 --PHVPLYASPKFAGKS------ARGPYGDTVEEMDYHVGRILKHLKSSGADKNTLVIFT 294
Query: 314 SDNGGAAAGFNLN--AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSD 371
SDNG +NL S PLRG K + +EGG R ++W P G + + D
Sbjct: 295 SDNG----PWNLGERGGSALPLRGAKFSTYEGGHRVPCVMWWPGTIPAGTDSAEIATTLD 350
Query: 372 WLPTLLSAANKSDIPN 387
++PT AN + +PN
Sbjct: 351 FMPTFAKLAN-AQLPN 365
>gi|170727990|ref|YP_001762016.1| sulfatase [Shewanella woodyi ATCC 51908]
gi|169813337|gb|ACA87921.1| sulfatase [Shewanella woodyi ATCC 51908]
Length = 484
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 102/309 (33%), Positives = 157/309 (50%), Gaps = 52/309 (16%)
Query: 62 IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGKHP 120
++ I DDLG D G +G Q PTPNID LA SG+ + Y C PSR+++MTG P
Sbjct: 40 VVIIYVDDLGIMDTGIYGSAQYPTPNIDKLANSGVRFTQAYANAANCAPSRASLMTGLTP 99
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILP---------------QYLKELGYRTRIVGKWHLG 165
G+ + G G K++P K+ GY T ++GKWHLG
Sbjct: 100 AEHGI---LTVGSSERGESQYRKLIPVTNNTELNPDLTTIADLFKQQGYATAVIGKWHLG 156
Query: 166 FYKKEYTPTFRGFE-----SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
+ PT GF+ SHLG+ + ++ +S + K+ GL+ L +Y
Sbjct: 157 ----KTAPTEYGFDTAIAASHLGHPPSY--FYPYSKGKRKLIGLE-------EGGLKDEY 203
Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH---RHIEDFK 277
++ T EAV+ I S +P FLYL A H+ P++AP ++N H + + K
Sbjct: 204 LSNRITREAVNYI--SSQRQPFFLYLPFYAVHT-----PIEAPKEWVNQHNARQQAGEIK 256
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
+ +AA++ LD VGK+++AL++ N+++VF SDNG A S+ P RG K
Sbjct: 257 SAAYAAMIANLDRDVGKLLQALDKSGQRENTLVVFASDNGAYDP-----ATSSLPYRGYK 311
Query: 338 NTLWEGGVR 346
++L+EGG++
Sbjct: 312 SSLFEGGIK 320
>gi|392965484|ref|ZP_10330903.1| sulfatase [Fibrisoma limi BUZ 3]
gi|387844548|emb|CCH52949.1| sulfatase [Fibrisoma limi BUZ 3]
Length = 477
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 118/360 (32%), Positives = 177/360 (49%), Gaps = 46/360 (12%)
Query: 55 ASSGP--PHIIFILADDLGWNDVGFH-----GLDQIPTPNIDALAYSGIILKNYYT-VQL 106
+ +GP P+IIFILADDLG+ D+G + G I TPN+D +A G+ +Y +
Sbjct: 30 SPAGPRKPNIIFILADDLGYGDLGCYRNTGSGQQTIRTPNLDRMAREGMRFTQHYAGSTV 89
Query: 107 CTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
C PSRS +MTG +HTG H + G R L +K + +YLK GY T ++GKW LG
Sbjct: 90 CAPSRSTLMTG---LHTG--HTRIRGNARVPLQPDDKTVAEYLKGAGYTTALIGKWGLGE 144
Query: 167 YKKEYTPTFRGFESHLGYWT---GHQDYFDHSAEEMKMWGLDMR---------------R 208
P +GF+ GY H Y D +W + + +
Sbjct: 145 DGSTGIPRQQGFDYFYGYLNQTHAHNHYPDF------LWRNETKVKLPNEVIYVENGGVK 198
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA-----P 263
+ A +++ D+FT EA+ I S +P FLYL++ + H+ N ++ + A P
Sbjct: 199 GIGSAATKRVEWTQDLFTNEALQFI-EQSKQQPFFLYLSYVSPHANNEHKLVAAHGIEVP 257
Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
D+ + D +++ FAA + LD VGKV+ L + ++ N++++F SDNG A G
Sbjct: 258 DYGPYANEEFPDVQKA-FAASITYLDAHVGKVLAKLHELKLDDNTLVIFSSDNGPHAEGG 316
Query: 324 NLNAA--SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
N A SN LRG+K L+EGG+R L P G ++ D+LPT A
Sbjct: 317 NDPAYFNSNGGLRGIKRDLYEGGIRVPMLARWPGRIKAGTISNHPSAFWDFLPTACEVAG 376
>gi|440750638|ref|ZP_20929879.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
gi|436480856|gb|ELP37068.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
Length = 478
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 109/355 (30%), Positives = 180/355 (50%), Gaps = 26/355 (7%)
Query: 41 PLAFTLSMVFVDLVASSGP------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
PL + + M+++ L A S P P++I I DD+G+ D+G +G + I TPN+D +A +
Sbjct: 9 PLKWGMLMLWISL-AISFPTLAQDRPNVILIFVDDMGYGDLGAYGAEDIATPNLDQMAEN 67
Query: 95 GIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
G+ N+Y Q +C+ SR++++TG + G+ H L + GL E + + LKE G
Sbjct: 68 GVKFTNFYVSQAVCSASRASLLTGTYANRLGI-HGALDHTAKHGLNPEEITIAEMLKEQG 126
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y+T + GKWHLG + + PT +GF+ G + + H + L + +
Sbjct: 127 YQTAMFGKWHLG-HHPPFLPTKQGFDEFYGIPYSNDMWPQHPQTKNYYPPLPLFFNDNVV 185
Query: 214 WDLHGKYS--TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
LH S T T AV+ I D+P FLYLAH P+ PL + +
Sbjct: 186 DTLHNDQSRLTTELTERAVNFIQRKK-DQPFFLYLAHPM-----PHVPLFVSEKFRG--- 236
Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
KR + +L ++D SVG++++ L + ++++F+SDNG + F+ +A S +
Sbjct: 237 ---KSKRGLYGDVLMEIDWSVGQILKTLRDNGLEEKTLVIFISDNGPWLS-FSGHAGSAY 292
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
PLR K T W+GGV+ L++ G V D LPT ++ ++D+P
Sbjct: 293 PLREGKGTSWDGGVKVPALMYWKGKIPAGTVQTNPAMTIDMLPT-IAKLTRADLP 346
>gi|149174691|ref|ZP_01853316.1| hypothetical protein PM8797T_26185 [Planctomyces maris DSM 8797]
gi|148846385|gb|EDL60723.1| hypothetical protein PM8797T_26185 [Planctomyces maris DSM 8797]
Length = 600
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 97/350 (27%), Positives = 177/350 (50%), Gaps = 29/350 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+II ++ DD G+ D G +I TP I LA G+ +Y +C P+R+ +MTG+H
Sbjct: 35 PNIILVMTDDQGYWDTEISGNPKIKTPTIKKLAAEGVTFTRFYANMVCAPTRAGLMTGRH 94
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
+ TG+ +N +G + L +E + Q L++ GY+T + GKWHLG Y +Y P RGF+
Sbjct: 95 YLRTGL-YNTRFGGDT--LGPNETTIAQVLQKAGYKTGLFGKWHLGRY-AQYQPQRRGFD 150
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
G++ GH + + + +++ + G + Y TD+FT A+D I +
Sbjct: 151 HFFGHYHGHIERYT-NPDQVVVNGTPVETR---------GYVTDLFTDAAIDFIQRNQ-Q 199
Query: 240 EPLFLYLAHAATHSANPYEP--LQAPDHYLNIHRHIED---FKRSKFAAILHKLDESVGK 294
+P F YLA+ A HS + P+ I +++ + ++ A++ ++D+++ +
Sbjct: 200 QPFFCYLAYNAPHSPFLLDTSHFGQPEGDKLIEKYLAKGLPLREARIYAMIERIDQNLSR 259
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WS 353
+++ + ++ ++++F SDNGG + GF L+G K + +EGG R ++ W+
Sbjct: 260 LLQTVHDLKLDQETVVIFTSDNGGVSRGFKAG------LKGSKASAYEGGTRVPFVVRWT 313
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
+ G + V +D PT A +P+ V E+I+ E
Sbjct: 314 DHFPA-GKTTDAMVAQTDLFPTFCQLAGVP-VPSNVKLDGESILSLMEQG 361
>gi|32475708|ref|NP_868702.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
gi|32446250|emb|CAD76079.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
Length = 667
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 104/337 (30%), Positives = 161/337 (47%), Gaps = 53/337 (15%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+++ +L DD GW D+ HG + TP+ID+LA G+ +KN+Y +C+P+R+ +TG++
Sbjct: 93 PNVLVVLTDDQGWGDLSLHGNPNLQTPHIDSLARDGVQIKNFYVCAVCSPTRAEFLTGRY 152
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
+G+ + G ER LSE+ + + GYRT GKWH G + Y P RGF+
Sbjct: 153 HTRSGV-FSTSAGGER--FDLSERTIGDAFQAAGYRTAAFGKWHSGM-QAPYHPNARGFD 208
Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK------YSTDVFTAEAVDII 233
G+ +GH WG P +L+G+ + D T A+D +
Sbjct: 209 EFYGFCSGH-------------WG----NYFSPMLELNGEIVKGDGFIVDDLTQHAIDFM 251
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDH-YLNIH-RHI-----------EDFKRSK 280
+ P F+YL HS P+Q PD + N + I ED + ++
Sbjct: 252 E-RDRENPFFIYLPLNTPHS-----PMQVPDEDWQNFEGKEIVPDPRPENAKKEDVQHTR 305
Query: 281 FA-AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
A A+ +D++VG++++ALE+ + N+I+VF DNG + FN LRG K
Sbjct: 306 AALALCENIDDNVGQLLDALERLSLSENTIVVFFCDNGPNGSRFNGG------LRGRKGA 359
Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+ EGG+R LI P G D PTL
Sbjct: 360 VHEGGLRSPCLIRYPSKIPAGQTVGGIAGAIDLFPTL 396
>gi|417306372|ref|ZP_12093283.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
gi|327537358|gb|EGF24091.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
Length = 606
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 105/340 (30%), Positives = 167/340 (49%), Gaps = 34/340 (10%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
AS P++I ++ DD G+ D+ HG + TPN+D LA + L+N++ CTP+R+A
Sbjct: 41 TASESRPNVILVVTDDQGYGDMSCHGNPWLNTPNLDRLATQSVRLENFHVDPFCTPTRAA 100
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+MTG++ G V G R L E + + +E GYRT + GKWHLG + P
Sbjct: 101 LMTGRYCTRVGAW-AVTEG--RQLLDPDETTMAEIFRESGYRTGMFGKWHLG-DPPPFAP 156
Query: 174 TFRGFESHLGYWTGHQ---------DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
RGFE+ + + G DYFD + R ++D Y TD+
Sbjct: 157 RDRGFETVVRHMAGGADEIGNPTGNDYFDDTY---------YRNGTPESFD---GYCTDI 204
Query: 225 FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDFKRSKFAA 283
+ EA+D I S ++P F+Y+ A HS P D Y + R + +R+ F
Sbjct: 205 WFDEAIDFIQKES-EQPFFVYIPTNAMHS-----PYLVADRYSDPFKRQGIEPQRAAFYG 258
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA--AAGFNLNAASNWPLRGVKNTLW 341
++ DE++G++++ L+Q + N++++F+SDNG A A+ N N +RG K +++
Sbjct: 259 MIQNFDENLGRLLKRLDQDNLRDNTLLIFMSDNGTAQGASEQNRKVGFNAGMRGKKGSVY 318
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
EGG R P +Q DWLPTL+ +
Sbjct: 319 EGGHRVPCFASWPAKWDGNRPVDQLTCHRDWLPTLIELCD 358
>gi|16763425|ref|NP_459040.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
Typhimurium str. LT2]
gi|167989888|ref|ZP_02570988.1| sulfatase [Salmonella enterica subsp. enterica serovar 4,[5],12:i:-
str. CVM23701]
gi|168262300|ref|ZP_02684273.1| sulfatase [Salmonella enterica subsp. enterica serovar Hadar str.
RI_05P066]
gi|168464434|ref|ZP_02698337.1| sulfatase [Salmonella enterica subsp. enterica serovar Newport str.
SL317]
gi|194443287|ref|YP_002039265.1| sulfatase [Salmonella enterica subsp. enterica serovar Newport str.
SL254]
gi|197265267|ref|ZP_03165341.1| sulfatase [Salmonella enterica subsp. enterica serovar Saintpaul
str. SARA23]
gi|205351379|ref|YP_002225180.1| sulfatase [Salmonella enterica subsp. enterica serovar Gallinarum
str. 287/91]
gi|207855551|ref|YP_002242202.1| secreted sulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. P125109]
gi|374982441|ref|ZP_09723762.1| Arylsulfatase [Salmonella enterica subsp. enterica serovar
Typhimurium str. TN061786]
gi|375122149|ref|ZP_09767313.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Gallinarum str. SG9]
gi|378443490|ref|YP_005231122.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Typhimurium str. D23580]
gi|378448318|ref|YP_005235677.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Typhimurium str. 14028S]
gi|378698019|ref|YP_005179976.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Typhimurium str. SL1344]
gi|378982577|ref|YP_005245732.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Typhimurium str. T000240]
gi|378987440|ref|YP_005250604.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Typhimurium str. UK-1]
gi|379699254|ref|YP_005240982.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Typhimurium str. ST4/74]
gi|418761586|ref|ZP_13317728.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35185]
gi|418767351|ref|ZP_13323415.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35199]
gi|418770519|ref|ZP_13326540.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21539]
gi|418776596|ref|ZP_13332538.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 33953]
gi|418778881|ref|ZP_13334789.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35188]
gi|418785063|ref|ZP_13340896.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21559]
gi|418803046|ref|ZP_13358671.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35202]
gi|418805727|ref|ZP_13361305.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21550]
gi|418810088|ref|ZP_13365629.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 22513]
gi|418816483|ref|ZP_13371975.1| secreted sulfatase [Salmonella enterica subsp. enterica serovar
Newport str. CVM 21538]
gi|418820794|ref|ZP_13376225.1| secreted sulfatase [Salmonella enterica subsp. enterica serovar
Newport str. CVM 22425]
gi|418830698|ref|ZP_13385659.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM N18486]
gi|418834225|ref|ZP_13389136.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM N1543]
gi|418842721|ref|ZP_13397530.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21554]
gi|418843982|ref|ZP_13398777.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19443]
gi|418849810|ref|ZP_13404532.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 37978]
gi|418852784|ref|ZP_13407480.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19593]
gi|418859340|ref|ZP_13413945.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19470]
gi|418861375|ref|ZP_13415934.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19536]
gi|419790263|ref|ZP_14315937.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. Levine 1]
gi|419794733|ref|ZP_14320341.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. Levine 15]
gi|421356939|ref|ZP_15807254.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 622731-39]
gi|421363662|ref|ZP_15813903.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 639016-6]
gi|421365681|ref|ZP_15815892.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 640631]
gi|421371888|ref|ZP_15822045.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 77-0424]
gi|421374966|ref|ZP_15825082.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 607307-6]
gi|421379270|ref|ZP_15829341.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 485549-17]
gi|421384252|ref|ZP_15834279.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 596866-22]
gi|421391480|ref|ZP_15841447.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 596866-70]
gi|421394207|ref|ZP_15844150.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 629164-26]
gi|421399851|ref|ZP_15849744.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 629164-37]
gi|421402263|ref|ZP_15852122.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 639672-46]
gi|421406273|ref|ZP_15856089.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 639672-50]
gi|421410732|ref|ZP_15860505.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 77-1427]
gi|421415823|ref|ZP_15865545.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 77-2659]
gi|421420220|ref|ZP_15869898.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 78-1757]
gi|421424731|ref|ZP_15874371.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 22510-1]
gi|421429251|ref|ZP_15878850.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 8b-1]
gi|421437106|ref|ZP_15886631.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648905 5-18]
gi|421438407|ref|ZP_15887905.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648901 6-18]
gi|421442608|ref|ZP_15892056.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 50-3079]
gi|422024148|ref|ZP_16370644.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm1]
gi|422029150|ref|ZP_16375427.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm2]
gi|427542722|ref|ZP_18925933.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm8]
gi|427576467|ref|ZP_18935084.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm9]
gi|427597934|ref|ZP_18940002.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm3]
gi|427616845|ref|ZP_18943594.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm4]
gi|427624819|ref|ZP_18945198.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm6]
gi|427658888|ref|ZP_18954487.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm10]
gi|427659234|ref|ZP_18954812.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm11]
gi|427664472|ref|ZP_18959672.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm12]
gi|427686227|ref|ZP_18964513.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm5]
gi|436640158|ref|ZP_20516274.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 22704]
gi|436650106|ref|ZP_20516727.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. SE30663]
gi|436794434|ref|ZP_20522137.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CHS44]
gi|436805911|ref|ZP_20526431.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1882]
gi|436816887|ref|ZP_20534074.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1884]
gi|436846224|ref|ZP_20539155.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1594]
gi|436848790|ref|ZP_20540259.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1566]
gi|436856562|ref|ZP_20545656.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1580]
gi|436863842|ref|ZP_20550061.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1543]
gi|436873421|ref|ZP_20556178.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1441]
gi|436880834|ref|ZP_20560453.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1810]
gi|436890643|ref|ZP_20565921.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1558]
gi|436892808|ref|ZP_20566935.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1018]
gi|436900219|ref|ZP_20571299.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1010]
gi|436908107|ref|ZP_20575654.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1729]
gi|436916454|ref|ZP_20580301.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_0895]
gi|436926456|ref|ZP_20586410.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_0899]
gi|436935208|ref|ZP_20590758.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1457]
gi|436939569|ref|ZP_20593875.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1747]
gi|436954674|ref|ZP_20602038.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_0968]
gi|436964862|ref|ZP_20606436.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1444]
gi|436968632|ref|ZP_20607873.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1445]
gi|436976668|ref|ZP_20611978.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1559]
gi|436988995|ref|ZP_20616351.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1565]
gi|437002748|ref|ZP_20621366.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1808]
gi|437015410|ref|ZP_20625696.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1811]
gi|437034454|ref|ZP_20632968.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_0956]
gi|437040513|ref|ZP_20634762.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1455]
gi|437052769|ref|ZP_20642171.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1575]
gi|437057662|ref|ZP_20644584.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1725]
gi|437063227|ref|ZP_20647902.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1745]
gi|437072173|ref|ZP_20652345.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1791]
gi|437084625|ref|ZP_20659790.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1795]
gi|437092595|ref|ZP_20663791.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 576709]
gi|437100632|ref|ZP_20666012.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 635290-58]
gi|437123361|ref|ZP_20672971.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 607308-16]
gi|437132890|ref|ZP_20678289.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 607308-19]
gi|437136026|ref|ZP_20679576.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 607307-2]
gi|437142727|ref|ZP_20683786.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 607308-9]
gi|437154122|ref|ZP_20690948.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 629163]
gi|437160648|ref|ZP_20694808.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. SE15-1]
gi|437172431|ref|ZP_20701043.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CVM_N202]
gi|437177812|ref|ZP_20704256.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CVM_56-3991]
gi|437183316|ref|ZP_20707668.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CVM_76-3618]
gi|437197348|ref|ZP_20711192.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 13183-1]
gi|437262676|ref|ZP_20719065.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CVM_81-2490]
gi|437268696|ref|ZP_20722148.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. SL909]
gi|437280533|ref|ZP_20727963.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. SL913]
gi|437291929|ref|ZP_20731771.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CVM_69-4941]
gi|437312720|ref|ZP_20736666.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 638970-15]
gi|437323311|ref|ZP_20739275.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 17927]
gi|437335206|ref|ZP_20742712.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CHS4]
gi|437387395|ref|ZP_20750868.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 22558]
gi|437410148|ref|ZP_20752711.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 543463 22-17]
gi|437447830|ref|ZP_20759096.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 543463 40-18]
gi|437465442|ref|ZP_20764008.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 561362 1-1]
gi|437475322|ref|ZP_20766495.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 642044 4-1]
gi|437489818|ref|ZP_20770601.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 642046 4-7]
gi|437508578|ref|ZP_20776377.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648898 4-5]
gi|437524653|ref|ZP_20779474.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648899 3-17]
gi|437559350|ref|ZP_20785766.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648900 1-16]
gi|437569968|ref|ZP_20788215.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648901 1-17]
gi|437596182|ref|ZP_20796230.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648901 39-2]
gi|437610477|ref|ZP_20800788.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648902 6-8]
gi|437618677|ref|ZP_20803290.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648903 1-6]
gi|437649574|ref|ZP_20809504.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648904 3-6]
gi|437664336|ref|ZP_20814294.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 653049 13-19]
gi|437678211|ref|ZP_20817621.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 642044 8-1]
gi|437691463|ref|ZP_20820806.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 561362 9-7]
gi|437706009|ref|ZP_20825211.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 543463 42-20]
gi|437732765|ref|ZP_20831769.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648901 16-16]
gi|437796549|ref|ZP_20837611.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 76-2651]
gi|437803845|ref|ZP_20838659.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 33944]
gi|437929892|ref|ZP_20851095.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 6.0562-1]
gi|438018315|ref|ZP_20854794.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 50-5646]
gi|438085126|ref|ZP_20858588.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 81-2625]
gi|438103369|ref|ZP_20865284.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 62-1976]
gi|438113035|ref|ZP_20869443.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 53-407]
gi|445128077|ref|ZP_21380069.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Gallinarum str. 9184]
gi|445167160|ref|ZP_21394296.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. SE8a]
gi|445182501|ref|ZP_21398565.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 20037]
gi|445231003|ref|ZP_21405623.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. SE10]
gi|445246091|ref|ZP_21408258.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 436]
gi|445339825|ref|ZP_21416472.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 18569]
gi|445344607|ref|ZP_21417722.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 13-1]
gi|445357353|ref|ZP_21422183.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. PT23]
gi|16418530|gb|AAL18999.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Typhimurium str. LT2]
gi|194401950|gb|ACF62172.1| sulfatase [Salmonella enterica subsp. enterica serovar Newport str.
SL254]
gi|195632839|gb|EDX51293.1| sulfatase [Salmonella enterica subsp. enterica serovar Newport str.
SL317]
gi|197243522|gb|EDY26142.1| sulfatase [Salmonella enterica subsp. enterica serovar Saintpaul
str. SARA23]
gi|205271160|emb|CAR35946.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Gallinarum str. 287/91]
gi|205331405|gb|EDZ18169.1| sulfatase [Salmonella enterica subsp. enterica serovar 4,[5],12:i:-
str. CVM23701]
gi|205348763|gb|EDZ35394.1| sulfatase [Salmonella enterica subsp. enterica serovar Hadar str.
RI_05P066]
gi|206707354|emb|CAR31626.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Enteritidis str. P125109]
gi|261245269|emb|CBG23054.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Typhimurium str. D23580]
gi|267991696|gb|ACY86581.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Typhimurium str. 14028S]
gi|301156667|emb|CBW16137.1| hypothetical secreted sulfatase [Salmonella enterica subsp.
enterica serovar Typhimurium str. SL1344]
gi|312911005|dbj|BAJ34979.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Typhimurium str. T000240]
gi|321222395|gb|EFX47467.1| Arylsulfatase [Salmonella enterica subsp. enterica serovar
Typhimurium str. TN061786]
gi|323128353|gb|ADX15783.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Typhimurium str. ST4/74]
gi|326626399|gb|EGE32742.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Gallinarum str. SG9]
gi|332986987|gb|AEF05970.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Typhimurium str. UK-1]
gi|392613900|gb|EIW96352.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. Levine 1]
gi|392614000|gb|EIW96451.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. Levine 15]
gi|392734643|gb|EIZ91824.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35199]
gi|392737113|gb|EIZ94274.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21539]
gi|392739935|gb|EIZ97063.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35185]
gi|392745782|gb|EJA02805.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 33953]
gi|392752160|gb|EJA09101.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21559]
gi|392755088|gb|EJA12003.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35188]
gi|392776443|gb|EJA33131.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 35202]
gi|392782199|gb|EJA38836.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 22513]
gi|392783954|gb|EJA40563.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21550]
gi|392789844|gb|EJA46346.1| secreted sulfatase [Salmonella enterica subsp. enterica serovar
Newport str. CVM 21538]
gi|392791532|gb|EJA48001.1| secreted sulfatase [Salmonella enterica subsp. enterica serovar
Newport str. CVM 22425]
gi|392801388|gb|EJA57616.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM N18486]
gi|392806133|gb|EJA62248.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM N1543]
gi|392806434|gb|EJA62532.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21554]
gi|392816031|gb|EJA71962.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19443]
gi|392819884|gb|EJA75741.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 37978]
gi|392828027|gb|EJA83724.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19593]
gi|392830508|gb|EJA86157.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19470]
gi|392838672|gb|EJA94226.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19536]
gi|395985903|gb|EJH95068.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 639016-6]
gi|395992836|gb|EJI01947.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 622731-39]
gi|395992876|gb|EJI01986.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 640631]
gi|396002412|gb|EJI11404.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 77-0424]
gi|396006135|gb|EJI15105.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 607307-6]
gi|396007260|gb|EJI16218.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 485549-17]
gi|396011888|gb|EJI20794.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 596866-70]
gi|396016552|gb|EJI25420.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 629164-26]
gi|396019886|gb|EJI28736.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 596866-22]
gi|396026440|gb|EJI35208.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 629164-37]
gi|396032547|gb|EJI41269.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 639672-46]
gi|396034345|gb|EJI43045.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 639672-50]
gi|396045337|gb|EJI53930.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 78-1757]
gi|396046434|gb|EJI55019.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 77-2659]
gi|396047823|gb|EJI56391.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 77-1427]
gi|396051564|gb|EJI60081.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648905 5-18]
gi|396059533|gb|EJI67987.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 8b-1]
gi|396059688|gb|EJI68141.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 22510-1]
gi|396072743|gb|EJI81051.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 50-3079]
gi|396073577|gb|EJI81878.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648901 6-18]
gi|414024521|gb|EKT07894.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm1]
gi|414024756|gb|EKT08114.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm8]
gi|414026042|gb|EKT09326.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm9]
gi|414026640|gb|EKT09905.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm3]
gi|414026814|gb|EKT10072.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm2]
gi|414033200|gb|EKT16164.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm4]
gi|414040786|gb|EKT23388.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm10]
gi|414054826|gb|EKT36758.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm6]
gi|414058843|gb|EKT40475.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm11]
gi|414062440|gb|EKT43756.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm12]
gi|414067999|gb|EKT48231.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
str. STm5]
gi|434956385|gb|ELL50116.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 22704]
gi|434962770|gb|ELL55931.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CHS44]
gi|434969924|gb|ELL62598.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1882]
gi|434971942|gb|ELL64435.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1884]
gi|434977871|gb|ELL69949.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1594]
gi|434989218|gb|ELL80783.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1566]
gi|434991336|gb|ELL82844.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1580]
gi|434996839|gb|ELL88134.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1543]
gi|434998963|gb|ELL90175.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1441]
gi|435004802|gb|ELL95751.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1810]
gi|435006463|gb|ELL97358.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1558]
gi|435020507|gb|ELM10911.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1018]
gi|435022546|gb|ELM12863.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1010]
gi|435028094|gb|ELM18178.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. SE30663]
gi|435030435|gb|ELM20453.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1729]
gi|435032943|gb|ELM22860.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_0895]
gi|435037632|gb|ELM27436.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_0899]
gi|435040243|gb|ELM30010.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1457]
gi|435047063|gb|ELM36665.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_0968]
gi|435047681|gb|ELM37255.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1444]
gi|435047836|gb|ELM37409.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1747]
gi|435060074|gb|ELM49346.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1445]
gi|435071554|gb|ELM60496.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1559]
gi|435071746|gb|ELM60685.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1565]
gi|435073758|gb|ELM62614.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1808]
gi|435075956|gb|ELM64753.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_0956]
gi|435084088|gb|ELM72675.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1811]
gi|435087491|gb|ELM75998.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1455]
gi|435089940|gb|ELM78345.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1575]
gi|435095344|gb|ELM83658.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1725]
gi|435102209|gb|ELM90314.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1745]
gi|435106059|gb|ELM94085.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1795]
gi|435110360|gb|ELM98278.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CDC_2010K_1791]
gi|435111566|gb|ELM99455.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 576709]
gi|435121426|gb|ELN08966.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 607308-19]
gi|435121651|gb|ELN09184.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 607308-16]
gi|435124407|gb|ELN11864.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 635290-58]
gi|435136762|gb|ELN23835.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 607307-2]
gi|435140179|gb|ELN27143.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 629163]
gi|435142032|gb|ELN28957.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 607308-9]
gi|435144796|gb|ELN31626.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CVM_N202]
gi|435147039|gb|ELN33820.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. SE15-1]
gi|435154272|gb|ELN40858.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CVM_56-3991]
gi|435161856|gb|ELN48070.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CVM_81-2490]
gi|435162865|gb|ELN49018.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CVM_76-3618]
gi|435170680|gb|ELN56427.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. SL913]
gi|435173991|gb|ELN59458.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. SL909]
gi|435181325|gb|ELN66397.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CVM_69-4941]
gi|435182734|gb|ELN67731.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 638970-15]
gi|435195043|gb|ELN79457.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 17927]
gi|435198125|gb|ELN82350.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. CHS4]
gi|435200940|gb|ELN84899.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 22558]
gi|435204656|gb|ELN88327.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 543463 22-17]
gi|435209441|gb|ELN92763.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 543463 40-18]
gi|435216872|gb|ELN99344.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 561362 1-1]
gi|435223188|gb|ELO05222.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 642044 4-1]
gi|435224957|gb|ELO06892.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 13183-1]
gi|435231467|gb|ELO12719.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 642046 4-7]
gi|435233327|gb|ELO14369.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648898 4-5]
gi|435239771|gb|ELO20260.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648900 1-16]
gi|435248791|gb|ELO28642.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648899 3-17]
gi|435249895|gb|ELO29655.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648901 1-17]
gi|435250092|gb|ELO29835.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648901 39-2]
gi|435251907|gb|ELO31504.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648902 6-8]
gi|435259296|gb|ELO38525.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648903 1-6]
gi|435265810|gb|ELO44608.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 653049 13-19]
gi|435268863|gb|ELO47438.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648904 3-6]
gi|435273656|gb|ELO51918.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 642044 8-1]
gi|435280834|gb|ELO58522.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 561362 9-7]
gi|435287413|gb|ELO64612.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 648901 16-16]
gi|435290524|gb|ELO67441.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 76-2651]
gi|435291871|gb|ELO68661.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 543463 42-20]
gi|435303211|gb|ELO79123.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 33944]
gi|435312120|gb|ELO86107.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 6.0562-1]
gi|435319123|gb|ELO91977.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 81-2625]
gi|435323006|gb|ELO95175.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 62-1976]
gi|435329881|gb|ELP01179.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 53-407]
gi|435333700|gb|ELP04473.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 50-5646]
gi|444855729|gb|ELX80774.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Gallinarum str. 9184]
gi|444863770|gb|ELX88586.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. SE10]
gi|444865716|gb|ELX90481.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. SE8a]
gi|444871319|gb|ELX95760.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 20037]
gi|444872305|gb|ELX96662.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 18569]
gi|444880005|gb|ELY04090.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 13-1]
gi|444886638|gb|ELY10387.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. PT23]
gi|444890351|gb|ELY13694.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 436]
Length = 497
Score = 150 bits (378), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 122/414 (29%), Positives = 202/414 (48%), Gaps = 54/414 (13%)
Query: 33 RIMAFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
R + +++ LA V A++ P+++ ILADDLG+ D+ +G + TPNID L
Sbjct: 3 RTVVASMIGLALCAGCVLSTAQAATAKRPNLVIILADDLGYGDLATYGHRIVKTPNIDKL 62
Query: 92 AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
A G+ +YY LC+PSR+ ++TG+ P TG++ + G + L +E + LK
Sbjct: 63 AQEGVKFTDYYAPAPLCSPSRAGLLTGRMPFRTGIRSWIPEGKDV-ALGRNELTIANLLK 121
Query: 151 ELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDM 206
+ GY T ++GK HL G + GF+ L G D +A+E +G+
Sbjct: 122 QQGYDTAMMGKLHLNAGGDRTDQPQAKDMGFDYTLVNPAGFVTDATLDNAKERPRYGV-- 179
Query: 207 RRDLEP-AWDLHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
+ P W +G K S + ++E V+ + N D P FLY+A HS
Sbjct: 180 ---VHPTGWIRNGQHIGRADKMSGEFVSSEVVNWLDNKKDDNPFFLYVAFTEVHS----- 231
Query: 259 PLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEAL 299
PL +P YL+++ +++ D+++ ++ A + +DE VGKV++ +
Sbjct: 232 PLASPKKYLDMYSQYMTDYQKQHPDLFYGDWADKPWRGTGEYYANISYMDEQVGKVLDKI 291
Query: 300 EQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSP 354
+ N+I++F SDNG A + LN A LRG K+ LWEGG+R +I
Sbjct: 292 KAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKYG 351
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
+G+V + V+ DWLPTL AN D +P ++++P ++ L+
Sbjct: 352 KHIPQGMVTDTPVYGLDWLPTL---ANMMDFKLPTDRTYDGQSLVPLLKDKTLK 402
>gi|189230138|ref|NP_001121393.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Xenopus (Silurana)
tropicalis]
gi|158253505|gb|AAI54124.1| Galns protein [Danio rerio]
gi|183986312|gb|AAI66093.1| LOC100158481 protein [Xenopus (Silurana) tropicalis]
Length = 488
Score = 150 bits (378), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 102/334 (30%), Positives = 165/334 (49%), Gaps = 27/334 (8%)
Query: 65 ILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHT 123
+L DD+GW D+G G TP +D +A G++ N+YT LC+PSR+A++TG+ P+
Sbjct: 1 MLMDDMGWGDLGVFGEPSKETPYLDLMAAQGMLFPNFYTANPLCSPSRAALLTGRLPVRN 60
Query: 124 GMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
G + GG+ E +LP++LK Y ++IVGKWHLG ++ +Y P
Sbjct: 61 GFYTTNAHARNAYTPQEIVGGISADEILLPEHLKNKHYISKIVGKWHLG-HRTQYLPLKH 119
Query: 177 GFESHLGYWTGH-QDYFDHSAEEMKMWG-LDMRRDLEPAWDLHGKYS----TDVFTAEAV 230
GF+ G H Y D S + ++ +M+ ++++ K T ++ E +
Sbjct: 120 GFDEWFGAPNCHFGPYNDSSRPNIPVYNNSEMKGRYYEEFEINVKTGESNLTQLYLKEGL 179
Query: 231 DIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
D I + + P FLY A ATH+ P+ A +L +R ++ + +LD
Sbjct: 180 DFISQQAMAQRPFFLYWAPDATHA-----PVYASKPFLG------KSQRGRYGDAVMELD 228
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
+S+G+++ L + +++++ F SDNG A L + SN P K T +EGG+R
Sbjct: 229 DSIGQILAHLVSLGIQNDTLVFFTSDNGAALMSGPLQSGSNGPFLCGKETTFEGGMREPA 288
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+ W P G V+ Q V D T LS A S
Sbjct: 289 IAWWPGQIPAGTVSHQLASVMDLFSTSLSVAGVS 322
>gi|386381787|ref|ZP_10067486.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
gi|385670744|gb|EIF93788.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
Length = 426
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 108/347 (31%), Positives = 167/347 (48%), Gaps = 38/347 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+++FIL DDLGW D+ +G +I TPN+D LA G+ + Y+ C+P+R ++ TG+
Sbjct: 12 PNVLFILGDDLGWADLSSYGAPEIRTPNLDRLARQGVRFTDAYSGSATCSPTRFSLYTGR 71
Query: 119 HPIHT--GMQHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
+P T G+ + +R GL + L L++ GYRT ++GKWH G Y +Y+PT
Sbjct: 72 YPGRTKGGLAEPI---ADRSVGLEPTHPTLASLLRDAGYRTALIGKWHCG-YLPDYSPTR 127
Query: 176 RGFESHLGYWTGHQDYFD--HSAEEMKMWGLDMR-RDLEPAWDLHGKYSTDVFTAEAVDI 232
G+E G + G +Y+ S E ++ D RDL +Y T + T AV+
Sbjct: 128 SGWEEFFGNFGGALEYYSKLSSGGEYDLYEGDAEYRDL--------RYYTRILTERAVEY 179
Query: 233 IHNHSTDEPLFLYLAHAATH-----------SANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
+ P L L H SA ++A D H+ + +K+
Sbjct: 180 VGRDHGRTPWLLNLNFTTPHWPWIADGDEETSAEIVRRIKAGDRRALWHQ--DGGSLAKY 237
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
++ LD SVGKV+ AL++ ++++ F SDNGG + NWPL G K +L
Sbjct: 238 RQLVEDLDRSVGKVLAALKRSGQEEDTLVFFASDNGGERFSY------NWPLSGNKGSLL 291
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
+GG+R ++ P G V+ V+ DW TLL P Y
Sbjct: 292 DGGIRVPSILRWPARIEGGQVSGVPVYSPDWTATLLDVGGARPHPAY 338
>gi|407940996|ref|YP_006856637.1| sulfatase [Acidovorax sp. KKS102]
gi|407898790|gb|AFU47999.1| sulfatase [Acidovorax sp. KKS102]
Length = 444
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 110/355 (30%), Positives = 175/355 (49%), Gaps = 49/355 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIP----TPNIDALAYSGIIL-KNYYTVQLCTPSRSAI 114
P+IIFI+ADDLG+ D+G +G + +P +D LA +G+ L + Y +C+P+R A+
Sbjct: 7 PNIIFIVADDLGYADLGCYGGREAAFGPVSPVLDQLAANGLKLTQGYSNSPVCSPTRFAL 66
Query: 115 MTGKHPIHT-GMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
MT ++ G + RG GLP LP L+ GYRT ++GKWHLG Y
Sbjct: 67 MTARYQYRLRGAAEEPINSKSRGSTTLGLPPEHPTLPSLLRASGYRTALMGKWHLG-YPP 125
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDH----SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
+ P G+E G +G DYF H A ++ + + ++D Y TD+
Sbjct: 126 AFGPLRSGYEEFFGPMSGGVDYFTHCSSNGAHDLYLGEEEKQQD---------GYLTDLI 176
Query: 226 TAEAVDIIHNHS----TDEPLFLYLAHAATHSANPYE-------PLQAPDHYLNIH-RHI 273
T A+D + + P FL L + A H P+E + + ++H +I
Sbjct: 177 TDHALDYVQRMAPGAQAGTPFFLSLHYTAPHW--PWETRDDEALAQEVKTNLFHLHGGNI 234
Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
E ++R ++H +DE +G+++ LE + + ++++VF SDNGG + NWPL
Sbjct: 235 ETYRR-----MIHHMDEGIGRIMALLEAKGLARDTLVVFTSDNGGE------RFSDNWPL 283
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
G K L EGG+R + P + + G V+ Q DW T+L AA + +Y
Sbjct: 284 VGGKMDLTEGGIRVPWIAHWPAVIAPGGVSAQTCMTMDWSATMLDAAGAATHTDY 338
>gi|418787538|ref|ZP_13343339.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19447]
gi|418792641|ref|ZP_13348381.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19449]
gi|418796450|ref|ZP_13352142.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19567]
gi|392765795|gb|EJA22579.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19447]
gi|392766594|gb|EJA23367.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19449]
gi|392771319|gb|EJA28040.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19567]
Length = 497
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 122/414 (29%), Positives = 202/414 (48%), Gaps = 54/414 (13%)
Query: 33 RIMAFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
R + +++ LA V A++ P+++ ILADDLG+ D+ +G + TPNID L
Sbjct: 3 RTVVASMIGLALCAGCVLSTAQAATAKRPNLVIILADDLGYGDLATYGHRIVKTPNIDKL 62
Query: 92 AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
A G+ +YY LC+PSR+ ++TG+ P TG++ + G + L +E + LK
Sbjct: 63 AQEGVKFTDYYAPAPLCSPSRAGLLTGRMPFRTGIRSWIPEGKDV-ALGRNELTIANLLK 121
Query: 151 ELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDM 206
+ GY T ++GK HL G + GF+ L G D +A+E +G+
Sbjct: 122 QQGYDTAMMGKLHLNAGGDRTDQPQAKDMGFDYTLVNPAGFVTDATLDNAKERPRYGV-- 179
Query: 207 RRDLEP-AWDLHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
+ P W +G K S + ++E V+ + N D P FLY+A HS
Sbjct: 180 ---VHPTGWIRNGQHIGRADKMSGEFVSSEVVNWLDNKKDDNPFFLYVAFTEVHS----- 231
Query: 259 PLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEAL 299
PL +P YL+++ +++ D+++ ++ A + +DE VGKV++ +
Sbjct: 232 PLASPKKYLDMYSQYMTDYQKQHPDLFYGDWADKPWRGTGEYYANISYMDEQVGKVLDKI 291
Query: 300 EQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSP 354
+ N+I++F SDNG A + LN A LRG K+ LWEGG+R +I
Sbjct: 292 KAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKYG 351
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
+G+V + V+ DWLPTL AN D +P ++++P ++ L+
Sbjct: 352 KHIPQGMVTDTPVYGLDWLPTL---ANMMDFKLPTDRTYDGQSLVPLLKDKTLK 402
>gi|168701882|ref|ZP_02734159.1| Arylsulfatase [Gemmata obscuriglobus UQM 2246]
Length = 467
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 118/370 (31%), Positives = 183/370 (49%), Gaps = 44/370 (11%)
Query: 44 FTLSMVF--VDLVASSG------PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
F + VF V L+A SG P+I+ I+ADDLG ++G +G +I TP+ID LA G
Sbjct: 2 FRTAAVFLAVALLAPSGRAADAPKPNIVLIVADDLGCFELGCYGQTKIKTPHIDKLAQGG 61
Query: 96 IILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL--SEKILPQYLKEL 152
+Y+ +C PSR +MTGKH H +++NV E G P+ + + LK
Sbjct: 62 AKFTRFYSGSPVCAPSRCVLMTGKHSGHATVRNNVEAKPE-GQFPIRAEDVTVADALKAH 120
Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP 212
GY T +GKW LG + +P GF+ GY + HS ++ D R +L+
Sbjct: 121 GYATGAMGKWGLGMFDTAGSPLKHGFDLFFGY---NCQRHAHSHYPTYIYRNDKRVELKG 177
Query: 213 AWDLHGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
GK ++ D+F EA+ I + +P FLYL H A +Q P+ LN ++
Sbjct: 178 NDGKTGKQFTQDLFEEEALGFIEANKA-KPFFLYLPFTVPHVA-----VQVPEDSLNEYK 231
Query: 272 H------IEDFKR---------SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
D K+ + +AA++ ++D SVG+VVE L + N++++F SDN
Sbjct: 232 GQLGDDPAYDGKKGYQPHPAPHAGYAAMVTRMDRSVGRVVEKLNALGLEKNTLVLFTSDN 291
Query: 317 G-----GAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSD 371
G G A N+A LRG+K +++EGG+R + + P G ++ ++ D
Sbjct: 292 GPTHNVGGADSSFFNSAGK--LRGLKGSVYEGGIRVPFIAYQPGTIKAGTESDAPLYFPD 349
Query: 372 WLPTLLSAAN 381
LPTL + A
Sbjct: 350 VLPTLCAFAG 359
>gi|421450242|ref|ZP_15899618.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 58-6482]
gi|396067684|gb|EJI76042.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Enteritidis str. 58-6482]
Length = 497
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 122/414 (29%), Positives = 202/414 (48%), Gaps = 54/414 (13%)
Query: 33 RIMAFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
R + +++ LA V A++ P+++ ILADDLG+ D+ +G + TPNID L
Sbjct: 3 RTVVASMIGLALCAGCVLSTAQAATAKRPNLVIILADDLGYGDLATYGHRIVKTPNIDKL 62
Query: 92 AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
A G+ +YY LC+PSR+ ++TG+ P TG++ + G + L +E + LK
Sbjct: 63 AQEGVKFTDYYAPAPLCSPSRAGLLTGRMPFRTGIRSWIPEGKDV-ALGRNELTIANLLK 121
Query: 151 ELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDM 206
+ GY T ++GK HL G + GF+ L G D +A+E +G+
Sbjct: 122 QQGYDTAMMGKLHLNAGGDRTDQPQAKDMGFDYTLVNPAGFVTDATLDNAKERPRYGV-- 179
Query: 207 RRDLEP-AWDLHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
+ P W +G K S + ++E V+ + N D P FLY+A HS
Sbjct: 180 ---VHPMGWIRNGQHIGRADKMSGEFVSSEVVNWLDNKKDDNPFFLYVAFTEVHS----- 231
Query: 259 PLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEAL 299
PL +P YL+++ +++ D+++ ++ A + +DE VGKV++ +
Sbjct: 232 PLASPKKYLDMYSQYMTDYQKQHPDLFYGDWADKPWRGTGEYYANISYMDEQVGKVLDKI 291
Query: 300 EQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSP 354
+ N+I++F SDNG A + LN A LRG K+ LWEGG+R +I
Sbjct: 292 KAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKYG 351
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
+G+V + V+ DWLPTL AN D +P ++++P ++ L+
Sbjct: 352 KHIPQGMVTDTPVYGLDWLPTL---ANMMDFKLPTDRTYDGQSLVPLLKDKTLK 402
>gi|198243820|ref|YP_002213990.1| sulfatase [Salmonella enterica subsp. enterica serovar Dublin str.
CT_02021853]
gi|445139185|ref|ZP_21384062.1| sulfatase [Salmonella enterica subsp. enterica serovar Dublin str.
SL1438]
gi|445149882|ref|ZP_21389433.1| sulfatase [Salmonella enterica subsp. enterica serovar Dublin str.
HWS51]
gi|197938336|gb|ACH75669.1| sulfatase [Salmonella enterica subsp. enterica serovar Dublin str.
CT_02021853]
gi|444853819|gb|ELX78885.1| sulfatase [Salmonella enterica subsp. enterica serovar Dublin str.
SL1438]
gi|444857455|gb|ELX82464.1| sulfatase [Salmonella enterica subsp. enterica serovar Dublin str.
HWS51]
Length = 497
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 122/414 (29%), Positives = 202/414 (48%), Gaps = 54/414 (13%)
Query: 33 RIMAFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
R + +++ LA V A++ P+++ ILADDLG+ D+ +G + TPNID L
Sbjct: 3 RTVVASMIGLALCAGCVLSTAQAATAKRPNLVIILADDLGYGDLATYGHRIVKTPNIDKL 62
Query: 92 AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
A G+ +YY LC+PSR+ ++TG+ P TG++ + G + L +E + LK
Sbjct: 63 AQEGVKFTDYYAPAPLCSPSRAGLLTGRMPFRTGIRSWIPEGKDV-ALGRNELTIANLLK 121
Query: 151 ELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDM 206
+ GY T ++GK HL G + GF+ L G D +A+E +G+
Sbjct: 122 QQGYDTAMMGKLHLNAGGDRTDQPQAKDMGFDYTLVNPAGFVTDATLDNAKERPRYGV-- 179
Query: 207 RRDLEP-AWDLHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
+ P W +G K S + ++E V+ + N D P FLY+A HS
Sbjct: 180 ---VHPTGWIRNGQHIGRADKMSGEFVSSEVVNWLDNKKDDNPFFLYVAFTEVHS----- 231
Query: 259 PLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEAL 299
PL +P YL+++ +++ D+++ ++ A + +DE VGKV++ +
Sbjct: 232 PLASPKKYLDMYSQYMTDYQKQHPDLFYGDWADKPWRGTGEYYANISYMDEQVGKVLDKI 291
Query: 300 EQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSP 354
+ N+I++F SDNG A + LN A LRG K+ LWEGG+R +I
Sbjct: 292 KAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKYG 351
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
+G+V + V+ DWLPTL AN D +P ++++P ++ L+
Sbjct: 352 KHIPQGMVTDTPVYGLDWLPTL---ANMMDFKLPTDRTYDGQSLVPLLKDKTLK 402
>gi|297284666|ref|XP_002802639.1| PREDICTED: hypothetical protein LOC697850 [Macaca mulatta]
Length = 1113
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 97/300 (32%), Positives = 153/300 (51%), Gaps = 42/300 (14%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + PP+I+ +L DD+GW D+G +G TPN+D +A G + N+
Sbjct: 13 LLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGTLFPNF 72
Query: 102 YTV-QLCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
Y+ LC+PSR+A++TG+ PI G N E GG+P SE++LP+ LK+ G
Sbjct: 73 YSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG 132
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + RD
Sbjct: 133 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD---- 183
Query: 214 WDLHGKY--------------STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
W++ G+Y T ++ EA+D I + P FLY A ATH+ P
Sbjct: 184 WEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----P 238
Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
+ A +L +R ++ + ++D+SVGK++E L + N+ + F SDNG A
Sbjct: 239 VYASKPFLGTS------QRGRYGDAVREIDDSVGKMLELLHDLHVADNTFVFFTSDNGAA 292
>gi|398810130|ref|ZP_10568960.1| arylsulfatase A family protein [Variovorax sp. CF313]
gi|398083821|gb|EJL74525.1| arylsulfatase A family protein [Variovorax sp. CF313]
Length = 438
Score = 149 bits (377), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 108/345 (31%), Positives = 167/345 (48%), Gaps = 32/345 (9%)
Query: 60 PHIIFILADDLGWNDVG-FHGLDQI---PTPNIDALAYSGIIL-KNYYTVQLCTPSRSAI 114
P+IIFI+ADDLG+ D+G + G D + +P +D LA +G+ L + Y +C+P+R +
Sbjct: 4 PNIIFIVADDLGYADLGCYGGRDAVFGPVSPVLDGLAANGLKLTQGYANSPVCSPTRFGM 63
Query: 115 MTGKHPIHT-GMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
+TG++ G + RG GLP LP LK GY T ++GKWHLG Y
Sbjct: 64 ITGRYQYRLRGAAEEPINSRSRGSATLGLPTDHPTLPSLLKASGYSTALIGKWHLG-YPP 122
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDH--SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
+ P G++ G +G DYF H S +W + + E Y TD+ +
Sbjct: 123 SFGPLRSGYDEFFGPMSGGVDYFTHCDSTGRHDLWSGEEDKVEE-------GYLTDILSK 175
Query: 228 EAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK---FAA 283
AV+ + + E P FL L + A H P+E + ++ D +
Sbjct: 176 RAVEYVERMAKQEAPFFLSLHYTAPHW--PWETRDDAAKAPAVKDNLFDLAGGNIHVYRR 233
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
++H +DE +G ++ AL++ M N+++VF SDNGG + NWPL G K L EG
Sbjct: 234 MIHHMDEGIGWIMAALQKHGMADNTLVVFTSDNGGE------RFSDNWPLVGGKMDLTEG 287
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
G+R + P + +G + Q DW T+L AA + +Y
Sbjct: 288 GIRVPWIAHWPAVIGKGGESTQLCMTMDWSATMLDAAGVAAHADY 332
>gi|410029536|ref|ZP_11279368.1| sulfatase [Marinilabilia sp. AK2]
Length = 486
Score = 149 bits (377), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 116/353 (32%), Positives = 174/353 (49%), Gaps = 48/353 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+IIFILADDLG+ D+GF G I TPN+D LA G+ N+Y +C PSRSA+MTG
Sbjct: 34 PNIIFILADDLGYGDLGFLGQKIIETPNLDRLAAEGMFFSNHYAGATVCAPSRSALMTGL 93
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKIL--PQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
H HT ++ N E G PL + IL P+ KE GY T GKW LGF E P +
Sbjct: 94 HTGHTPVRGNFEIQPE-GQYPLPDTILTIPRLFKEAGYATGAFGKWGLGFVGTEGDPNHQ 152
Query: 177 GFESHLGY---WTGHQ---DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
GF+ GY H+ +Y +AE++ + G D W Y+ D+ E++
Sbjct: 153 GFDQFFGYNCQRIAHRYYPEYLWENAEKVFLEGND--------WIQKITYAPDIIHQESL 204
Query: 231 DIIHNHSTDEPLFLYLAHAATHS--ANP--------------YEPLQAP------DHYLN 268
I +++ + P +++ H+ A P +P AP D +N
Sbjct: 205 KFIGDNANN-PFLMFIPSVMPHAELAVPDGELLAYYREKIGEEKPHIAPVGWDYGDEPIN 263
Query: 269 IHRHIED-FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--- 324
I + + + ++ +AA++ LD VG++++ L++ + +II+F SDNG G N
Sbjct: 264 IPGYQSNPYPKASYAAMVALLDRQVGEIMDKLDELGLTEQTIIIFTSDNGPHQEGGNDPD 323
Query: 325 -LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
N SN RG K L+EGG+R ++ P + G + D+LPT
Sbjct: 324 FFN--SNGIFRGYKRDLYEGGIRVPMIVKWPGVIQGGAQSGHVSAFWDFLPTF 374
>gi|429093555|ref|ZP_19156139.1| Arylsulfatase [Cronobacter dublinensis 1210]
gi|426741528|emb|CCJ82252.1| Arylsulfatase [Cronobacter dublinensis 1210]
Length = 502
Score = 149 bits (377), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 115/377 (30%), Positives = 184/377 (48%), Gaps = 53/377 (14%)
Query: 47 SMVFVDLVASSGP---------------PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
S+V + L AS+G P+++ I+ADDLG+ D G +G + TPNID+L
Sbjct: 8 SLVSLMLAASAGAQSPAPTPGSDTTAKRPNVVIIMADDLGYGDTGIYGHPIVKTPNIDSL 67
Query: 92 AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
A G+ YY LC+PSR+ ++TG+ P TG++ + G + L +EK + YLK
Sbjct: 68 AQQGMRFTEYYAPAPLCSPSRAGLLTGRTPFRTGIRSWIPSGGKNVALGRNEKTIASYLK 127
Query: 151 ELGYRTRIVGKWHL-GFYKKEYTPTFR--GFESHL----GYWTGHQDYFDHSAEEMKMWG 203
E GY T ++GK HL + P + GF+ L G+ T D ++
Sbjct: 128 EQGYDTAMMGKLHLNAGADRTDQPQAKDMGFDYSLVNAAGFVTSDLDKVKTRPRYGVVYP 187
Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
R+ +P + + S ++ ++EA+ + + ++P FLY+A H+ PL +P
Sbjct: 188 NGFYRNGQPIGTVK-QMSGELVSSEAIHWLDSRKDNKPFFLYVAFTEVHT-----PLASP 241
Query: 264 DHYLNIHR-HIEDFKR------------------SKFAAILHKLDESVGKVVEALEQRRM 304
YL++++ ++ +++R ++ A + LDE VGKVV ++
Sbjct: 242 QKYLDMYKSYMSEYERQHQDLFYADWADKPWRGTGEYYANISYLDEQVGKVVAKIKAMGE 301
Query: 305 LSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSPLLESR 359
N+II+F SDNG A + LN A LRG K+ LWEGG+R +I
Sbjct: 302 EDNTIILFTSDNGPVTREARKWYELNMAGETSGLRGRKDNLWEGGIRVPAIIKYGSHIKA 361
Query: 360 GIVAEQYVHVSDWLPTL 376
G V++ V D LPTL
Sbjct: 362 GTVSDSPVTGLDVLPTL 378
>gi|168822186|ref|ZP_02834186.1| sulfatase [Salmonella enterica subsp. enterica serovar Weltevreden
str. HI_N05-537]
gi|197249148|ref|YP_002145016.1| sulfatase [Salmonella enterica subsp. enterica serovar Agona str.
SL483]
gi|409248332|ref|YP_006889021.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar
Weltevreden str. 2007-60-3289-1]
gi|416526182|ref|ZP_11742236.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. LQC 10]
gi|416533378|ref|ZP_11746346.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. SARB30]
gi|416555343|ref|ZP_11758828.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 29N]
gi|421887223|ref|ZP_16318385.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Senftenberg str. SS209]
gi|437816210|ref|ZP_20842570.1| sulfatase [Salmonella enterica subsp. enterica serovar Enteritidis
str. SARB17]
gi|440765646|ref|ZP_20944661.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar Agona
str. SH11G1113]
gi|440769763|ref|ZP_20948718.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar Agona
str. SH08SF124]
gi|440772152|ref|ZP_20951060.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar Agona
str. SH10GFN094]
gi|197212851|gb|ACH50248.1| sulfatase [Salmonella enterica subsp. enterica serovar Agona str.
SL483]
gi|205341423|gb|EDZ28187.1| sulfatase [Salmonella enterica subsp. enterica serovar Weltevreden
str. HI_N05-537]
gi|320089066|emb|CBY98822.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar
Weltevreden str. 2007-60-3289-1]
gi|363558247|gb|EHL42440.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. LQC 10]
gi|363560160|gb|EHL44307.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. 29N]
gi|363568662|gb|EHL52640.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
str. SARB30]
gi|379983257|emb|CCF90658.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
Senftenberg str. SS209]
gi|435308800|gb|ELO83718.1| sulfatase [Salmonella enterica subsp. enterica serovar Enteritidis
str. SARB17]
gi|436411975|gb|ELP09920.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar Agona
str. SH11G1113]
gi|436413160|gb|ELP11096.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar Agona
str. SH08SF124]
gi|436419847|gb|ELP17720.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar Agona
str. SH10GFN094]
Length = 497
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 122/414 (29%), Positives = 202/414 (48%), Gaps = 54/414 (13%)
Query: 33 RIMAFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
R + +++ LA V A++ P+++ ILADDLG+ D+ +G + TPNID L
Sbjct: 3 RTVVASMIGLALCAGSVLSTAQAATAKRPNLVIILADDLGYGDLATYGHRIVKTPNIDKL 62
Query: 92 AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
A G+ +YY LC+PSR+ ++TG+ P TG++ + G + L +E + LK
Sbjct: 63 AQEGVKFTDYYAPAPLCSPSRAGLLTGRMPFRTGIRSWIPEGKDV-ALGRNELTIANLLK 121
Query: 151 ELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDM 206
+ GY T ++GK HL G + GF+ L G D +A+E +G+
Sbjct: 122 QQGYDTAMMGKLHLNAGGDRTDQPQAKDMGFDYTLVNPAGFVTDATLDNAKERPRYGV-- 179
Query: 207 RRDLEP-AWDLHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
+ P W +G K S + ++E V+ + N D P FLY+A HS
Sbjct: 180 ---VHPTGWIRNGQHIGRADKMSGEFVSSEVVNWLDNKKDDNPFFLYVAFTEVHS----- 231
Query: 259 PLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEAL 299
PL +P YL+++ +++ D+++ ++ A + +DE VGKV++ +
Sbjct: 232 PLASPKKYLDMYSQYMTDYQKQHPDLFYGDWADKPWRGTGEYYANISYMDEQVGKVLDKI 291
Query: 300 EQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSP 354
+ N+I++F SDNG A + LN A LRG K+ LWEGG+R +I
Sbjct: 292 KAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKYG 351
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
+G+V + V+ DWLPTL AN D +P ++++P ++ L+
Sbjct: 352 KHIPQGMVTDTPVYGLDWLPTL---ANMMDFKLPTDRTYDGQSLVPLLKDKTLK 402
>gi|115906036|ref|XP_797340.2| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 162
Score = 149 bits (376), Expect = 6e-33, Method: Composition-based stats.
Identities = 69/117 (58%), Positives = 80/117 (68%), Gaps = 6/117 (5%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHIIFI+ADDLGW+DV HG QI TPNID LA G+ L NYY +CTP+RSAIMTG+
Sbjct: 32 PPHIIFIVADDLGWDDVSLHGSSQILTPNIDTLAQEGVTLTNYYVSPICTPTRSAIMTGR 91
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
HPIHTGMQH+ + E GL L EK + Q+LK LGY T VGK +TP F
Sbjct: 92 HPIHTGMQHDTIGAAEPWGLGLDEKTMAQHLKSLGYSTHAVGK------SSNWTPQF 142
>gi|373956153|ref|ZP_09616113.1| sulfatase [Mucilaginibacter paludis DSM 18603]
gi|373892753|gb|EHQ28650.1| sulfatase [Mucilaginibacter paludis DSM 18603]
Length = 468
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 116/374 (31%), Positives = 183/374 (48%), Gaps = 40/374 (10%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
M L L F L+ V + P+IIFILADDLG+ DVG +G +I TPNID LA
Sbjct: 10 MKIRSLFLLFCLTGVLSVKAQTRQQPNIIFILADDLGYGDVGVYGQTKILTPNIDLLARE 69
Query: 95 GIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNV-----LYG------CERGGLPLSE 142
G L ++Y +C+PSR ++TG + H ++ N+ L G R GL +
Sbjct: 70 GTKLTDFYAGAPVCSPSRGVLLTGLNTGHATIRGNMAIKGGLAGNKGSDKIRRAGLLPQD 129
Query: 143 KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW 202
+ L+ GY T ++GKWH+ + TP GF+ G+ + + S W
Sbjct: 130 STIGDLLQRSGYTTGLIGKWHVDGFDTLATPQRHGFDYFYGWLVSYPQTY-ASTYWPDTW 188
Query: 203 GLDMRRDLEPAWDLHGK---YSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYE 258
D + P + +G+ Y++++ T +A+ + H + +P FL + H +NP+
Sbjct: 189 YRDGKLVAVPQ-NQNGQKNYYTSEIITDDAIRYLAKHKKSKKPFFLMVNH-----SNPHS 242
Query: 259 PLQAPDHYLNIHRHIEDFK-----RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
PL AP H I + K + +AA++ LD SVGK+ + L + + N+I++F
Sbjct: 243 PLDAP------HNTIYEGKDWTSDQKTYAAMVTYLDSSVGKIRKYLIESGLDKNTIVIFT 296
Query: 314 SDNGG-AAAGFNLNAASNW-----PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYV 367
SDNG + +L A S++ PLRG K + EGG+R ++W+ V+
Sbjct: 297 SDNGPRSEPTTSLTAVSDFFHSSGPLRGYKRDVTEGGIRVPMIVWNKNRIKSDAVSHTPG 356
Query: 368 HVSDWLPTLLSAAN 381
+ +D LPTL A
Sbjct: 357 YFADILPTLAGIAG 370
>gi|343086294|ref|YP_004775589.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354828|gb|AEL27358.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 511
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 120/382 (31%), Positives = 183/382 (47%), Gaps = 64/382 (16%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
PP+II+ILADDLG+ D+G +G ++I TPNIDALA SG++ +Y+ +C P+R+ ++TG
Sbjct: 28 PPNIIYILADDLGYGDLGCYGQEKIETPNIDALAASGMMFTQHYSGAPVCAPARAVLLTG 87
Query: 118 KHPIHTGMQHNVLYGCE---------------RGGLPLSEKI--LPQYLKELGYRTRIVG 160
+H H+ ++ N +G G P++ L + L+E GY T +VG
Sbjct: 88 QHSGHSQVRGNDEWGSRGDVWNYHAMLADSTLEGQRPIATGTYTLGRMLQEAGYTTAVVG 147
Query: 161 KWHLGFYKKEYTPTFRGFESHLGY---WTGHQDYFDHSAE-EMKMW-------------- 202
KW LG E PT +GF+ GY H Y H E E +++
Sbjct: 148 KWGLGAPHTEGIPTNQGFDYFFGYNCQRQAHTLYPVHLYENEHRVYLNNDTIAPNKKLAE 207
Query: 203 GLDMRRDLEPAWD-LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQ 261
G D +LE D Y+ ++ + + + S++EP FLY A + P+ PLQ
Sbjct: 208 GAD-PYNLENYGDYFLTDYAPELMFDKITNFVDEQSSEEPFFLYWA-----TPIPHVPLQ 261
Query: 262 AP----DHYLNIHRHIEDF-----------KRSKFAAILHKLDESVGKVVEALEQRRMLS 306
AP DHY+ E + R+ +AA++ +DE VG++VE L+ + +
Sbjct: 262 APTKWIDHYVEKFGEEEPYLGENSYYPSRYPRATYAAMVSYMDEQVGELVEQLKSKGLYE 321
Query: 307 NSIIVFVSDNGGAAAG------FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRG 360
N++I+F SDNG G FN A K L+EGG+R + P G
Sbjct: 322 NTLIIFTSDNGPTFNGGTDSPWFNSGGAFREEKGFGKGFLYEGGIRVPMIASWPGKIKEG 381
Query: 361 IVAEQYVHVSDWLPTLLSAANK 382
V++ D +PTL A K
Sbjct: 382 SVSDHASVFYDVMPTLAEVAGK 403
>gi|410638700|ref|ZP_11349253.1| sulfatase [Glaciecola lipolytica E3]
gi|410141228|dbj|GAC16458.1| sulfatase [Glaciecola lipolytica E3]
Length = 522
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 115/361 (31%), Positives = 176/361 (48%), Gaps = 51/361 (14%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P++I+IL DDLG D+ +G + I TPN+ +A G++ +Y +C PSR+A+MTG+
Sbjct: 44 PNVIYILVDDLGIGDIEPYGQELINTPNLQRMADQGMVFSQHYAGNPVCAPSRAALMTGQ 103
Query: 119 HPIHTGMQHNVLYGC-----ERGGLPL--SEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
H H ++ N G E G +PL + + + +K+ GY T ++GKW LG
Sbjct: 104 HSGHNQIRGNFELGGFADEEEFGQMPLQPGTETVAKLMKKAGYTTALIGKWGLGGPGSYG 163
Query: 172 TPTFRGFESHLGYW---TGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL------------ 216
PT +GF+ GY H Y H + + L+ L+P L
Sbjct: 164 VPTKQGFDYFFGYLDQKQAHNHYPTHLWQNETWFPLNNEY-LDPHKGLGKHDDPNDEASY 222
Query: 217 ----HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
G ++ T EA+ I N DEP FLYL++AA H+A LQAPD + + H
Sbjct: 223 EKFKRGDFAQQRLTDEALSYIDN-KKDEPFFLYLSYAAPHAA-----LQAPDEEIAKYNH 276
Query: 273 IED--------------FK-RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG 317
ED F+ R+ AA++ +DE VG V++ L++ + N++++F SDNG
Sbjct: 277 FEDTPYDGVADGGYVPSFRPRATRAAMISHIDEGVGLVLDKLKKLGLDENTLVIFSSDNG 336
Query: 318 GA-AAGFNLNAA-SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
+ G +L SN RG K L++GG+R + P G V++ D LPT
Sbjct: 337 PSFEGGADLEFFDSNAQYRGFKRDLYDGGIRMPTIARWPTKVPAGKVSDHVSAFWDVLPT 396
Query: 376 L 376
L
Sbjct: 397 L 397
>gi|392391665|ref|YP_006428268.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
15997]
gi|390522743|gb|AFL98474.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
15997]
Length = 510
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 118/412 (28%), Positives = 190/412 (46%), Gaps = 64/412 (15%)
Query: 38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
AVL +FT+ V L S P+IIFILADDLG+ D+ +G + TPNID LA G++
Sbjct: 8 AVLSASFTIPAV---LWGQSDKPNIIFILADDLGYGDLACYGQKHVETPNIDKLAKQGMM 64
Query: 98 LKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHN---------VLYGCER-----GGLPL-- 140
Y +C PSR+ IMTG+H HT ++ N ++YG G P
Sbjct: 65 FTQAYAGAPVCAPSRATIMTGQHTGHTHVRGNKEYWQKSPKIMYGQNEEYSLVGQEPYDA 124
Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY---WTGHQDYFD---- 193
++ ILP+ +K+ GY T + GKW G+ P RG + GY + H Y +
Sbjct: 125 NQVILPEVMKKNGYTTGMFGKWAGGYEGSHSVPEKRGVDEFYGYICQFQAHSYYPNFLNR 184
Query: 194 HSAEEMKMWGLDMRRDLEPAWDLHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYL 246
+S +E + + D + + G +YS D+ +A++ + + +P + +
Sbjct: 185 YSQKEGDTQTIRVVMDENVKYPMTGEDYYKRPQYSADMIHHKALEWLDEQNGKKPFYGFF 244
Query: 247 AHAATHSANPYEPLQAPDHYLNIHR-----------------HIEDFKRSKFAAILHKLD 289
+ H+ E +Q D L ++ H ++FAA++ +LD
Sbjct: 245 TYTLPHA----ELMQPNDSLLKKYKKQFFEDKTWGGSDGSRYHATTHTHAQFAAMITRLD 300
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDN-----GGAAAGFNLNAASNWPLRGVKNTLWEGG 344
VG+++E L+++ + N+I+VF SDN GGA F + LRG+K +EGG
Sbjct: 301 SYVGEIMEKLKEKGLDKNTIVVFSSDNGPHEEGGADPEF---FGRDGKLRGIKRQCYEGG 357
Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP-NYVNSTVEN 395
+R ++ P G V++ + D +PT A P Y N +N
Sbjct: 358 IRVPFIVKWPGKIKPGTVSDHQIAFYDIMPTFADIAGDKKFPKRYFNKKSQN 409
>gi|149198313|ref|ZP_01875359.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
HTCC2155]
gi|149138609|gb|EDM27016.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
HTCC2155]
Length = 476
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 123/382 (32%), Positives = 187/382 (48%), Gaps = 57/382 (14%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSG--PPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
I+A A PL+ L A G P+II I DD G+ D+ G + TP ID +
Sbjct: 12 IVALAQWPLS--------SLAAEQGEAQPNIIIIFTDDQGYADLSCFGGTHVSTPRIDQM 63
Query: 92 AYSGIILKNYYTVQ-LCTPSRSAIMTGKHP----IHTGMQHNVLYGCERGGLPLSEKILP 146
A G L ++Y +CTPSR+A+MTG +P + G VL ++ GL E +
Sbjct: 64 AAEGAKLTSFYVAAPVCTPSRAALMTGTYPKRIDMARGSNFVVLLAGDKKGLNPKEITIA 123
Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH--------QDYFDHSAEE 198
+ LK +GY+T + GKWHLG + E+ PT +GF+ G H Q +F +
Sbjct: 124 EVLKAVGYKTGMFGKWHLG-DQPEFLPTRQGFDEFFGLPYSHDIHPYHPQQSHFKFPSLP 182
Query: 199 MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
+ LD +E D Y T FT AV I + D+P FLY+ H H+
Sbjct: 183 L----LDGEEVIE--MDPDADYLTKRFTERAVQFIEKNK-DQPFFLYMPHPIPHT----- 230
Query: 259 PLQAPDHYL-NIHRHIE----------DFK-RSKFAA-ILHKLDESVGKVVEALEQRRML 305
PL A Y+ ++ I+ D+K R++ + + ++D SVG++++AL++ +
Sbjct: 231 PLHASPAYMKDLPEEIKAKLSKENNYIDYKTRTQLLSRAIGEVDWSVGQILDALKKNGID 290
Query: 306 SNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQ 365
N+I++F SDNG A + A+ PLRG KN EGG+R +I P GI ++
Sbjct: 291 DNTIVIFSSDNGPA-----IGKAT--PLRGRKNQTLEGGMRVPTVIHWPGKIPAGIANDE 343
Query: 366 YVHVSDWLPTLLSAANKSDIPN 387
+ D LPT A ++IP+
Sbjct: 344 IMTTMDLLPTFAKLAG-AEIPS 364
>gi|421614606|ref|ZP_16055659.1| arylsulfatase [Rhodopirellula baltica SH28]
gi|408494615|gb|EKJ99220.1| arylsulfatase [Rhodopirellula baltica SH28]
Length = 524
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 111/366 (30%), Positives = 185/366 (50%), Gaps = 50/366 (13%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSA 113
A++ P+I+ I+ADDLG+ ++G +G +I TP +D LA GI L N+Y+ +C PSR
Sbjct: 55 ATASRPNIVLIVADDLGYGELGCYGQTKIRTPRLDQLAAEGIKLTNFYSGNAVCAPSRCC 114
Query: 114 IMTGKHPIHTGMQHNV----------LYGCERGG---LPLSEKILPQYLKELGYRTRIVG 160
+MTGKHP H +++N E G LP+ E + +YLK +GYRT G
Sbjct: 115 LMTGKHPGHAHVRNNGDPKIDPAVREALKLEFPGQYPLPVDEVTIAEYLKSVGYRTGAFG 174
Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD--LHG 218
KW LG + P +GF+ G+ + H+ +W + ++++P D LHG
Sbjct: 175 KWGLGHFGTTGDPNEQGFDLFYGF---NCQRHAHNHYPNFLW-RNRVKEVQPGNDRTLHG 230
Query: 219 K-YSTDVFTAEAVDIIHNHSTD---EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI- 273
+ YS D F EA + I + +P F YL A H + +Q P+ ++ + +
Sbjct: 231 ETYSQDQFVNEACEFIRQSVAEDKTQPFFAYLPFAVPHLS-----IQVPEEEVDAYEGVI 285
Query: 274 --EDFK----------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG---- 317
D++ R+ +AA++ ++DE VG++V+ ++ + N++I+F SDNG
Sbjct: 286 EEADYEHHGYLKHPRPRAGYAAMVTRMDEGVGQIVDLVDSLGLGENTLIMFTSDNGPTYD 345
Query: 318 --GAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
G + N+AS ++G+K L EGG+R + + G ++ D+LPT
Sbjct: 346 RLGGSDSDYFNSASG--MKGLKGQLDEGGIRVPMIARQTGVVPAGRTSDWIGAWWDFLPT 403
Query: 376 LLSAAN 381
+ AA
Sbjct: 404 ITDAAG 409
>gi|387791821|ref|YP_006256886.1| arylsulfatase A family protein [Solitalea canadensis DSM 3403]
gi|379654654|gb|AFD07710.1| arylsulfatase A family protein [Solitalea canadensis DSM 3403]
Length = 469
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 104/336 (30%), Positives = 166/336 (49%), Gaps = 52/336 (15%)
Query: 43 AFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY 102
F+ +F + S P+IIFILADD+G+ D+G +G I TPNID LA G+ ++Y
Sbjct: 21 CFSPFCLFSQDLRSKEKPNIIFILADDMGYGDLGCYGQQLIKTPNIDQLANQGMRFTDFY 80
Query: 103 T-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELGYRTRIV 159
+C PSR+++MTG+H H ++ N LPL ++ L QYLK+ GY +V
Sbjct: 81 AGSTVCAPSRASLMTGQHTGHAFIKGN-------DELPLRQQDSTLSQYLKQAGYVNGMV 133
Query: 160 GKWHLGFYKKEYTPTFRG---FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL 216
GKW LG P +G F HL + GH DH +W ++ + D
Sbjct: 134 GKWGLGLQGTSGAPEKKGWDFFVGHLHHVEGHFQQTDH------LWKIENGESKKEMID- 186
Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL--------- 267
Y ++FT A+ I + P FLY+++ H+ L+ P+ YL
Sbjct: 187 STVYVNELFTQSALQFIQKNK-KHPFFLYVSYTLPHAE-----LKVPEKYLQLYLNKDGS 240
Query: 268 -----------NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
+H + + ++ +AA++ +D+ VG++++ L+ + N+I+VF SDN
Sbjct: 241 SKFAPEIAQPAGLHYGQQPYPKAAYAAMITSMDDYVGQIMQQLKAYGIDKNTIVVFTSDN 300
Query: 317 GG-AAAGFNLNAASNW-----PLRGVKNTLWEGGVR 346
G G + A+ + PLRG+K L+EGG+R
Sbjct: 301 GTHIEGGRKMTDATTFFKSSGPLRGIKRDLYEGGIR 336
>gi|390356764|ref|XP_792809.2| PREDICTED: N-acetylgalactosamine-6-sulfatase-like
[Strongylocentrotus purpuratus]
Length = 417
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 109/348 (31%), Positives = 173/348 (49%), Gaps = 38/348 (10%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRS 112
V + PP+II +L DD+GW D+G +G TPN+D +A GI+L ++Y+ LC+PSR+
Sbjct: 23 VVVAAPPNIIVMLMDDMGWGDLGIYGNPAKETPNLDQMAAEGILLPDFYSGNPLCSPSRA 82
Query: 113 AIMTGKHPIHTGMQHNVLYG-------CERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
A++TG+ PI G + GG+ +E +LP+ L++ GYR ++VGKWHLG
Sbjct: 83 ALLTGRLPIRNGFYTTNEHARNAYTPQIIVGGIQDNEILLPELLQKAGYRNKLVGKWHLG 142
Query: 166 FYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD- 223
++ ++ P GF+ G H Y D + + ++ +D E A + +++ D
Sbjct: 143 -HQPQFHPLKHGFDEWFGAPNCHFGPYNDKNTPNIPVY-----QDAEMAGRYYEEFAIDK 196
Query: 224 ---------VFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
++ E V I + +P FLY ATH EPL A +L
Sbjct: 197 KTGESNLTQLYIQEGVSFIEKSVAAMQPFFLYWTPDATH-----EPLYASKPFLGRS--- 248
Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
+R + + +LD VG+V+ + + + +N+ ++F SDN GAA + + SN P
Sbjct: 249 ---QRGLYGDAVMELDYGVGQVLGKIRELGVANNTFVLFSSDN-GAALYASESGGSNGPF 304
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K T +EGG+R + W P G V Q V D T LS A+
Sbjct: 305 LCGKETTYEGGMREPTIAWWPNHIKPGRVTHQIGSVMDIFTTALSVAD 352
>gi|423138563|ref|ZP_17126201.1| arylsulfatase [Salmonella enterica subsp. houtenae str. ATCC
BAA-1581]
gi|379051117|gb|EHY69008.1| arylsulfatase [Salmonella enterica subsp. houtenae str. ATCC
BAA-1581]
Length = 497
Score = 149 bits (376), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 122/414 (29%), Positives = 202/414 (48%), Gaps = 54/414 (13%)
Query: 33 RIMAFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
R + +++ LA V A++ P+++ ILADDLG+ D+ +G + TPNID L
Sbjct: 3 RTVVASMIGLALCAGSVLSTAQAATAKRPNLVIILADDLGYGDLATYGHRIVKTPNIDKL 62
Query: 92 AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
A G+ +YY LC+PSR+ ++TG+ P TG++ + G + L +E + LK
Sbjct: 63 AQEGVKFTDYYAPAPLCSPSRAGLLTGRMPFRTGIRSWIPEGKDV-ALGRNELTIANLLK 121
Query: 151 ELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDM 206
+ GY T ++GK HL G + GF+ L G D +A+E +G+
Sbjct: 122 QQGYDTAMMGKLHLNAGGDRTDQPQAKDMGFDYTLVNPAGFVTDATLDNAKERPRYGV-- 179
Query: 207 RRDLEP-AWDLHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
+ P W +G K S + ++E V+ + N D P FLY+A HS
Sbjct: 180 ---VHPTGWIRNGQHIGRADKMSGEFVSSEVVNWLDNKKDDNPFFLYVAFTEVHS----- 231
Query: 259 PLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEAL 299
PL +P YL+++ +++ D+++ ++ A + +DE VGKV++ +
Sbjct: 232 PLASPKKYLDMYSQYMTDYQKQHPDLFYGDWADKPWRGTGEYYANISYMDEQVGKVLDKI 291
Query: 300 EQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSP 354
+ N+I++F SDNG A + LN A LRG K+ LWEGG+R +I
Sbjct: 292 KAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKYG 351
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
+G+V + V+ DWLPTL AN D +P ++++P ++ L+
Sbjct: 352 KHIPQGMVTDTPVYGLDWLPTL---ANMMDFKLPTDRTYDGQSLVPLLKDKTLK 402
>gi|161504832|ref|YP_001571944.1| hypothetical protein SARI_02956 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:- str. RSK2980]
gi|160866179|gb|ABX22802.1| hypothetical protein SARI_02956 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:-]
Length = 497
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 119/392 (30%), Positives = 194/392 (49%), Gaps = 53/392 (13%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRS 112
A++ P+++ ILADDLG+ D+ +G + TPNID LA G+ +YY LC+PSR+
Sbjct: 25 AATAKRPNLVIILADDLGYGDLATYGHRIVKTPNIDKLAQEGVKFTDYYAPAPLCSPSRA 84
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL---GFYKK 169
++TG+ P TG++ + G + L +E + LK+ GY T ++GK HL G
Sbjct: 85 GLLTGRMPFRTGIRSWIPEGKDV-ALGRNELTIANLLKQQGYDTAMMGKLHLNAGGDRTD 143
Query: 170 EYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDMRRDLEP-AWDLHG-------KY 220
+ GF+ L G D +A+E +G+ + P W +G K
Sbjct: 144 QPQAKDMGFDYTLVNPAGFVSDATLDNAKERPRYGM-----VHPTGWIRNGQHIGRADKM 198
Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH-RHIEDFKR- 278
S + ++E V+ + N D P FLY+A HS PL +P YL+++ +++ D+++
Sbjct: 199 SGEFVSSEVVNWLDNKKDDNPFFLYVAFTEVHS-----PLASPKKYLDMYSQYMTDYQKQ 253
Query: 279 -----------------SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG--- 318
++ A + +DE VGKV++ ++ N+II+F SDNG
Sbjct: 254 HPDLFYGDWADKPWRGTGEYYANISYMDEQVGKVLDKIKAMGEEDNTIIIFTSDNGPVTR 313
Query: 319 -AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
A + LN A LRG K+ LWEGG+R +I +G+V + V+ DWLPTL
Sbjct: 314 EARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKYGKHIPQGMVTDTPVYGLDWLPTL 373
Query: 377 LSAANKSD--IPNYVNSTVENIIPRYENSILR 406
AN D +P ++++P ++ L+
Sbjct: 374 ---ANMMDFKLPTDRTYDGQSLVPLLKDKTLK 402
>gi|340617020|ref|YP_004735473.1| sulfatase [Zobellia galactanivorans]
gi|339731817|emb|CAZ95082.1| Sulfatase, family S1-20 [Zobellia galactanivorans]
Length = 452
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 108/340 (31%), Positives = 177/340 (52%), Gaps = 35/340 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+IIFILADDLG+ D+G +G + I TP+ID +A G+ +Y +C PSR+++MTG+
Sbjct: 22 PNIIFILADDLGYGDLGCYGQETISTPHIDRMASEGLRFTQHYAGATVCAPSRNSLMTGQ 81
Query: 119 HPIHTGMQHNVLYGCERGGLPL--SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
H M + + E+ P+ S+ + + LK+ GYRT ++GKW LG +
Sbjct: 82 H-----MGNTTIKSMEK---PIKDSDLTVAEVLKDAGYRTGVIGKWGLGNVGTSGYANAQ 133
Query: 177 GFESHLGYWT---GHQDYFDHSAEEMKMWGL-----------DMRRDLEPAWDLHGKYST 222
GF+ GY+ H Y D+ E + + L + + A +YS
Sbjct: 134 GFDYSFGYYDQIRAHNYYPDYLMENGEKFPLKNEVVYVSDSTNYAVGIGNAAIKKKEYSN 193
Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY-----EPLQAPDHYLNIHRHIEDFK 277
D+FT +A+D I + EP FLYLA+ H+ N ++ PD+ + K
Sbjct: 194 DLFTEKALDFIKTDAK-EPFFLYLAYTIPHANNESFLINEHGMEVPDYGIYDKEPWPSEK 252
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG-GAAAGFNLNAA-SNWPLRG 335
++ AA++ +LD VG++++ L++ + N++++F SDNG G+ ++ SN PLRG
Sbjct: 253 KAG-AAMITRLDSYVGQILDLLKKNNLDGNTLVIFTSDNGPHQEGGWQVDYFDSNGPLRG 311
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
+K L+EGG+R + P +G +Q D+LPT
Sbjct: 312 MKRDLYEGGIRVPFIAKWPGKIEKG-ETDQVATFWDFLPT 350
>gi|406836223|ref|ZP_11095817.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 456
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 112/328 (34%), Positives = 168/328 (51%), Gaps = 22/328 (6%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I++I+ADDLG +D+G +G I TPNID LA G Y +C PSRS +MTG
Sbjct: 26 PNIVYIMADDLGPHDLGCYGQKLIQTPNIDKLAIEGTRYTQVYAGASVCAPSRSVLMTGL 85
Query: 119 HPIHTGMQHN-------VLYGCERGGLPL-SEKI-LPQYLKELGYRTRIVGKWHLGFYKK 169
H HT ++ N + E+G +PL SE + + + LK GY T I GKW LG
Sbjct: 86 HTGHTRVRGNHSKFGGVIGESGEKGRVPLRSEDVTVAEVLKRAGYSTGITGKWGLGEPHT 145
Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD-LHGKYSTDVFTAE 228
P +GF+ LGY + HS +W R DL D +YS D+FT
Sbjct: 146 NGVPNRKGFDQWLGYLNQN---VAHSYFPSALWLNGERFDLPGNSDGRRQQYSHDLFTGF 202
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
A++ I + +EP FLY+ H+A + PD + R ++A+++ ++
Sbjct: 203 ALNFIRANR-EEPFFLYVPFTIPHAA-----WEVPDLGPYADQTWPKGAR-EYASMVTRM 255
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
D VG++V L++ ++ N+I+ F SDNGG A F +N LRG K ++EGG+R
Sbjct: 256 DADVGRIVALLDELKLRDNTIVFFCSDNGGPAP-FGETFQTNGALRGKKGQVYEGGLRVP 314
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTL 376
+ P G +++ + +D LPTL
Sbjct: 315 MIARWPGHVPAGKISDVPWYFADVLPTL 342
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.314 0.133 0.403
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 16,137,002,189
Number of Sequences: 23463169
Number of extensions: 785727486
Number of successful extensions: 11847664
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 16641
Number of HSP's successfully gapped in prelim test: 19134
Number of HSP's that attempted gapping in prelim test: 9335316
Number of HSP's gapped (non-prelim): 1141554
length of query: 905
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 753
effective length of database: 8,792,793,679
effective search space: 6620973640287
effective search space used: 6620973640287
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 82 (36.2 bits)