BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy1088
         (905 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|270005303|gb|EFA01751.1| hypothetical protein TcasGA2_TC007349 [Tribolium castaneum]
          Length = 543

 Score =  513 bits (1322), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 246/377 (65%), Positives = 298/377 (79%), Gaps = 13/377 (3%)

Query: 44  FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
             L  +FV + AS  P ++IFILADDLGWNDVGFHG  QIPTPNIDALAYSG+IL+NYY 
Sbjct: 8   LILFPLFVTIRASERP-NVIFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLILQNYYV 66

Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
             +CTPSRSA+MTGK+PIHTGMQH VL+G E  GLPL+EKILP+YL+ELGY  R+VGKWH
Sbjct: 67  TPICTPSRSALMTGKYPIHTGMQHTVLFGAEPRGLPLTEKILPEYLRELGYTNRLVGKWH 126

Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD 223
           LG Y KEYTP +RGF+SHLGYWTGHQDY+DH+A E   WG DMRR+++ A+DLHG+YSTD
Sbjct: 127 LGSYTKEYTPLYRGFDSHLGYWTGHQDYYDHTAVENPGWGFDMRRNMDLAYDLHGQYSTD 186

Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
           VFT EAV II NH+T  PLFLYLAH A HSANPY PL APD  +    +I  +KR +FAA
Sbjct: 187 VFTQEAVKIIENHNTTNPLFLYLAHVAVHSANPYNPLPAPDETVEKFSNIPSYKRQRFAA 246

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
           +L KLD+SVG VVE+L +R+ML NSII+F +DNGGAAAGFNLNAASN+PLRGVKNT+WEG
Sbjct: 247 MLSKLDDSVGAVVESLSKRKMLKNSIIIFSTDNGGAAAGFNLNAASNFPLRGVKNTVWEG 306

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
           GVRGAGL+WSPL++    VA+Q++H+SDWLPTLLSA          NS+++NI     + 
Sbjct: 307 GVRGAGLLWSPLIKQPQRVAQQFMHISDWLPTLLSAVG-------ANSSLKNI-----DG 354

Query: 404 ILRYENGTHEYNSPRIE 420
           +  +E+ +   NSPR E
Sbjct: 355 LDLWESLSQNKNSPRSE 371



 Score = 88.6 bits (218), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 58/162 (35%), Positives = 82/162 (50%), Gaps = 17/162 (10%)

Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKV------- 621
           NS    IDG+D+W  LS+N+ S R+ ILHNIDD +  +ALT G WKLVK           
Sbjct: 346 NSSLKNIDGLDLWESLSQNKNSPRSEILHNIDDIYGNAALTVGAWKLVKGTTYNGAWDGW 405

Query: 622 -----MRYQVDLT---GGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQI 673
                  Y  DL+     P  V L  + +   +   + KLR+A+ I C   + +PC+P  
Sbjct: 406 YGPSGRDYTYDLSLVVNSPTGVALKSI-NMSLVPDEITKLRNASRIDCATDQPIPCKPLE 464

Query: 674 APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
             CLF++  DPCE  NLA  +    +N   T + ++N  A P
Sbjct: 465 ETCLFNVIQDPCELRNLAKENPKIVMNLEET-LNKYNNTALP 505



 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 29/36 (80%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH VL+G E  GLPL+EKILP+YL+ELGY  R++ 
Sbjct: 88  MQHTVLFGAEPRGLPLTEKILPEYLRELGYTNRLVG 123



 Score = 48.5 bits (114), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 27/42 (64%)

Query: 756 MRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
           + KLR+A+ I C   + +PC+P    CLF++  DPCE  NLA
Sbjct: 441 ITKLRNASRIDCATDQPIPCKPLEETCLFNVIQDPCELRNLA 482



 Score = 41.6 bits (96), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 31/52 (59%)

Query: 854 MKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGDDLK 905
           + K  P ++  +E+ L   N TA+ P N P D  G+PK +++ W+ FGD +K
Sbjct: 481 LAKENPKIVMNLEETLNKYNNTALPPGNLPLDPRGNPKYWNNTWTNFGDYIK 532


>gi|383853606|ref|XP_003702313.1| PREDICTED: arylsulfatase J-like [Megachile rotundata]
          Length = 544

 Score =  504 bits (1299), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 239/351 (68%), Positives = 281/351 (80%), Gaps = 8/351 (2%)

Query: 31  RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
           R  + AF ++P        F+  V ++  PHI+FILADDLGWNDVGFHG  QIPTPNIDA
Sbjct: 5   RAVLWAFLIVP--------FLRHVTAAERPHIVFILADDLGWNDVGFHGSGQIPTPNIDA 56

Query: 91  LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           LAYSG++L  YY   +CTPSRSA+MTGK+PIHTGMQH VL   E  GLPL EK+LP+YLK
Sbjct: 57  LAYSGLLLDRYYVSPICTPSRSALMTGKYPIHTGMQHGVLKCAEPRGLPLQEKLLPEYLK 116

Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
           ELGYRT IVGKWHLGFY K+YTPT+RGF+SH+G+W+GHQDYFDH+A E   WGLDMRR +
Sbjct: 117 ELGYRTHIVGKWHLGFYTKQYTPTYRGFDSHIGFWSGHQDYFDHTAVESPYWGLDMRRGM 176

Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
           E AWDLHG+YSTDVFT+EAV +I+NH+  +PLFLYLAHAA HS NPY+PL APD  +   
Sbjct: 177 EAAWDLHGQYSTDVFTSEAVKLINNHNDSKPLFLYLAHAAVHSGNPYDPLPAPDVDVAKF 236

Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
            +I D+ R +FAA+L KLDESVG+VVEAL+++ ML NSIIVF +DNGG  AGFNLNAASN
Sbjct: 237 TNIFDYNRRRFAAMLSKLDESVGEVVEALQRKGMLKNSIIVFSTDNGGPPAGFNLNAASN 296

Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           WPLRG KNTLWEGGVRGAGLIWSP L   G V+ Q  H+SDWLPTLL+AA 
Sbjct: 297 WPLRGTKNTLWEGGVRGAGLIWSPRLIRPGRVSRQLFHISDWLPTLLTAAG 347



 Score = 82.8 bits (203), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 53/142 (37%), Positives = 74/142 (52%), Gaps = 34/142 (23%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
            IDG+D+W  LS +  S R ++LHNIDD + +S +T G+WK ++       Q D   GP 
Sbjct: 354 SIDGLDLWHALSEDTESPRQSVLHNIDDIFGVSGITIGEWKYIQ-GSTYEGQWDGWYGP- 411

Query: 634 QVYLSGLSDREWL-------------ALA----------MRKLRDAASIQCGPVKE-VP- 668
                  S REW+             A+A          +R+LR+ A I+C P  + +P 
Sbjct: 412 -------SGREWVYDIGGVIGSPTGRAVASVGYSLSSENVRRLREQAMIKCRPRNDTLPL 464

Query: 669 CEPQIAPCLFDIKNDPCEKNNL 690
           C+P IAPCLF +  DPCE NNL
Sbjct: 465 CKPLIAPCLFHVLKDPCEDNNL 486



 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 48/158 (30%), Positives = 65/158 (41%), Gaps = 60/158 (37%)

Query: 749 YSNEEEGMRKLRDAASIQCGPVKE-VP-CEPQIAPCLFDIKNDPCEKNNLADRSEVQRIN 806
           YS   E +R+LR+ A I+C P  + +P C+P IAPCLF +  DPCE NNL        IN
Sbjct: 437 YSLSSENVRRLREQAMIKCRPRNDTLPLCKPLIAPCLFHVLKDPCEDNNL--------IN 488

Query: 807 HYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQME 866
            Y                                                  P++   M 
Sbjct: 489 EY--------------------------------------------------PEIARAMR 498

Query: 867 KELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGDDL 904
           ++L   N TA+ P N P+DK  +P+ +DH W  FGD L
Sbjct: 499 EDLKKWNATALLPGNLPWDKRANPELWDHTWINFGDHL 536



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/36 (66%), Positives = 27/36 (75%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH VL   E  GLPL EK+LP+YLKELGYRT I+ 
Sbjct: 91  MQHGVLKCAEPRGLPLQEKLLPEYLKELGYRTHIVG 126


>gi|328789569|ref|XP_624454.2| PREDICTED: arylsulfatase J-like [Apis mellifera]
          Length = 546

 Score =  502 bits (1292), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 232/341 (68%), Positives = 272/341 (79%)

Query: 40  LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           + L   L++  VD V +S  PHI+FILADDLGWNDVGFHGL QIPTPNIDALAY+G++L 
Sbjct: 8   MTLCICLTVFVVDHVVASARPHIVFILADDLGWNDVGFHGLSQIPTPNIDALAYTGLLLD 67

Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
            YY   +CTPSRSA+MTGKHPIHTGMQH VL   E  GLPL EK+LP+YL+ LGY T +V
Sbjct: 68  RYYVSPICTPSRSALMTGKHPIHTGMQHGVLKCAEPRGLPLQEKLLPEYLRNLGYSTHMV 127

Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
           GKWHLGFY KEYTPT+RGF+SHLG+W+GH DYFDH+A E   WGLDMRR LEPAWDLHG+
Sbjct: 128 GKWHLGFYTKEYTPTYRGFDSHLGFWSGHHDYFDHTAVEEPYWGLDMRRGLEPAWDLHGQ 187

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
           YSTDVFT EAV +I NH+T  P+FLYL+HAA HS NPY PL A DH +     I D+ R 
Sbjct: 188 YSTDVFTKEAVRLIDNHNTSRPMFLYLSHAAVHSGNPYNPLPAHDHDVAKFPKILDYNRR 247

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
           +FA +L KLDESVG VVEAL +R+ML NS+IVF +DNGG  AGFNLNAASN+PLRG KNT
Sbjct: 248 RFAGMLSKLDESVGLVVEALRKRKMLENSVIVFSTDNGGPPAGFNLNAASNFPLRGTKNT 307

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           LWEGGVRG GL+WSP L + G ++ Q +H+SDWLPTLL+A 
Sbjct: 308 LWEGGVRGTGLVWSPKLINPGRISRQLIHISDWLPTLLTAV 348



 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 72/135 (53%), Gaps = 16/135 (11%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG--GP 632
           IDG+D+W  L+ +  S R  +LHNIDD + I+ +T G WK ++         D  G  G 
Sbjct: 357 IDGVDMWRALNEDTISPRKMVLHNIDDIYGIAGITYGDWKFIQGSTYNGQWDDWYGPSGR 416

Query: 633 DQVY-----LSGLSDREWLALAM-------RKLRDAASIQCGPVKE-VP-CEPQIAPCLF 678
           D  Y     +   + R   ++ +        KL+  A+I+C    E +P C+P  APCLF
Sbjct: 417 DWAYDPYAVMESAAGRAAASVGLSLTAGNIEKLQQGAAIKCSTRNENLPLCKPLEAPCLF 476

Query: 679 DIKNDPCEKNNLADR 693
           +++ DPCEKNNL ++
Sbjct: 477 NVREDPCEKNNLIEK 491



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 60/151 (39%)

Query: 756 MRKLRDAASIQCGPVKE-VP-CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVG 813
           + KL+  A+I+C    E +P C+P  APCLF+++                          
Sbjct: 446 IEKLQQGAAIKCSTRNENLPLCKPLEAPCLFNVRE------------------------- 480

Query: 814 YLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANIN 873
             DP ++ N I     EK                          YP ++S ++++L+ +N
Sbjct: 481 --DPCEKNNLI-----EK--------------------------YPSIVSDLKEKLSLLN 507

Query: 874 RTAVAPINKPFDKGGDPKNFDHAWSIFGDDL 904
            +A+ P N P+DK G+P  +DH W+ FGD L
Sbjct: 508 ASAILPGNLPWDKRGNPDYWDHTWTNFGDYL 538



 Score = 45.1 bits (105), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH VL   E  GLPL EK+LP+YL+ LGY T ++ 
Sbjct: 93  MQHGVLKCAEPRGLPLQEKLLPEYLRNLGYSTHMVG 128


>gi|380026538|ref|XP_003697007.1| PREDICTED: arylsulfatase J-like [Apis florea]
          Length = 543

 Score =  496 bits (1276), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 234/341 (68%), Positives = 269/341 (78%), Gaps = 2/341 (0%)

Query: 40  LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           + L   L++  V  VA+   PHI+FILADDLGWNDVGFHG  QIPTPNIDALAYSG++L 
Sbjct: 7   ITLCLCLTVFVVGHVAAR--PHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLD 64

Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
            YY   +CTPSRSA+MTGKHPIHTGMQH VL   E  GLPL EK+LP+Y ++LGY T IV
Sbjct: 65  RYYVSPICTPSRSALMTGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPEYFRDLGYSTHIV 124

Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
           GKWHLGFY KEYTP +RGF+SH+G+W+GH DYFDHSA E   WGLDMRR LEPAWDLHG+
Sbjct: 125 GKWHLGFYTKEYTPMYRGFDSHIGFWSGHHDYFDHSAVEEPYWGLDMRRGLEPAWDLHGQ 184

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
           YSTDVFT EAV +I NH+T  P+FLYLAHAA HS NPY PL A D  +    +I ++ R 
Sbjct: 185 YSTDVFTKEAVKLIDNHNTSRPMFLYLAHAAVHSGNPYNPLPAHDQDVAKFTNIFNYNRR 244

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
           +FA IL KLDESVG VVEAL +R+ML NS+IVF +DNGGA AGFNLNAASNWPLRG KNT
Sbjct: 245 RFAGILSKLDESVGLVVEALRRRKMLENSVIVFSTDNGGAPAGFNLNAASNWPLRGTKNT 304

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           LWEGGVRG GL+WSP L   G V  Q +H+SDWLPTLL+AA
Sbjct: 305 LWEGGVRGVGLLWSPKLIKPGRVGRQLIHISDWLPTLLTAA 345



 Score = 82.4 bits (202), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 52/144 (36%), Positives = 75/144 (52%), Gaps = 34/144 (23%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
           IDG+D+W  LS +  S R  +LHNIDD + ++ +T G WK ++            G  D 
Sbjct: 354 IDGMDIWHALSEDTESPRKMVLHNIDDIYGVAGITYGDWKFIQ-------GSTYNGQWDG 406

Query: 635 VYLSGLSDREWL-------------ALA----------MRKLRDAASIQCGPVKE-VP-C 669
            Y  G S REW+             A+A          +RKLR+ A I+C P  + +P C
Sbjct: 407 WY--GPSGREWVYDVGGVIGSAAGRAVASVGLSLSADNIRKLREDAMIKCPPRNDNLPLC 464

Query: 670 EPQIAPCLFDIKNDPCEKNNLADR 693
           +P  APCLF+++ DPCE NNL ++
Sbjct: 465 KPLEAPCLFNVRQDPCEDNNLIEK 488



 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 33/49 (67%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
           IDG+D+W  LS +  S R  +LHNIDD + ++ +T G WK ++ ++ NG
Sbjct: 354 IDGMDIWHALSEDTESPRKMVLHNIDDIYGVAGITYGDWKFIQGSTYNG 402



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 36/52 (69%), Gaps = 2/52 (3%)

Query: 750 SNEEEGMRKLRDAASIQCGPVKE-VP-CEPQIAPCLFDIKNDPCEKNNLADR 799
           S   + +RKLR+ A I+C P  + +P C+P  APCLF+++ DPCE NNL ++
Sbjct: 437 SLSADNIRKLREDAMIKCPPRNDNLPLCKPLEAPCLFNVRQDPCEDNNLIEK 488



 Score = 45.8 bits (107), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 21/47 (44%), Positives = 30/47 (63%)

Query: 858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGDDL 904
           YP V+S +++ L  +N +AV P N P+DK  +P  +DH W  FGD L
Sbjct: 489 YPSVVSDLKERLRVLNASAVLPGNLPWDKRANPDYWDHTWINFGDYL 535



 Score = 45.4 bits (106), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH VL   E  GLPL EK+LP+Y ++LGY T I+ 
Sbjct: 90  MQHGVLKCAEPRGLPLHEKLLPEYFRDLGYSTHIVG 125


>gi|340710385|ref|XP_003393772.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
          Length = 545

 Score =  491 bits (1264), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 228/337 (67%), Positives = 269/337 (79%)

Query: 44  FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
           F L++  V  VA++  PHI+FILADDLGWNDVGFHG  QIPTPNIDALAYSG++L  YY 
Sbjct: 11  FYLTVFVVCHVAAAVQPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYV 70

Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
             +CTPSRSA+MTGKHPIHTGMQH VL   E  GLPL EK+LPQYL+ELGY T IVGKWH
Sbjct: 71  TPICTPSRSALMTGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWH 130

Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD 223
           LGFY KEYTP +RGF+SH+G+W+GH DYFDHSA E   WGLDMRR L  AWDLHG+YSTD
Sbjct: 131 LGFYTKEYTPMYRGFDSHIGFWSGHHDYFDHSAVESPYWGLDMRRGLNSAWDLHGQYSTD 190

Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
           +FT EAV +I++H+   P+FLYL+HAA HS N Y PL APD  +    +I +++R +FA 
Sbjct: 191 IFTKEAVKLINDHNASRPMFLYLSHAAVHSGNSYNPLPAPDQDVAKFTNIFNYERRRFAG 250

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
           +L KLDESVG+VVEAL ++ ML NS+IVF +DNGG  AGFNLNAASNWPLRG KNTLWEG
Sbjct: 251 MLSKLDESVGQVVEALRKKGMLKNSVIVFSTDNGGPPAGFNLNAASNWPLRGTKNTLWEG 310

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           GVRGAGL+WSP L   G V+ Q  H+SDWLPTL++AA
Sbjct: 311 GVRGAGLLWSPRLVRPGRVSRQLFHISDWLPTLITAA 347



 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 53/145 (36%), Positives = 75/145 (51%), Gaps = 34/145 (23%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
            IDG+D+W  LS +  S R  +LHNIDD + ++ +T G WK ++       Q D   GP 
Sbjct: 355 SIDGMDLWYALSEDTESPRKMVLHNIDDIFGVAGITYGDWKYIQ-GSTYNGQWDGWYGP- 412

Query: 634 QVYLSGLSDREW--------------------LAL---AMRKLRDAASIQCGPVKE-VP- 668
                  S REW                    LAL   A+R+LR+ A I+C P  + +P 
Sbjct: 413 -------SGREWVYDIGGVIGSTAGRAVASVGLALSADAIRRLREDAMIKCPPKNDSLPL 465

Query: 669 CEPQIAPCLFDIKNDPCEKNNLADR 693
           C+P  APCLF+++ DPCE NNL ++
Sbjct: 466 CKPLEAPCLFNVQQDPCEDNNLINK 490



 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 62/155 (40%), Gaps = 60/155 (38%)

Query: 752 EEEGMRKLRDAASIQCGPVKE-VP-CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYT 809
             + +R+LR+ A I+C P  + +P C+P  APCLF+++ DPCE NNL        IN Y 
Sbjct: 441 SADAIRRLREDAMIKCPPKNDSLPLCKPLEAPCLFNVQQDPCEDNNL--------INKYP 492

Query: 810 TEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKEL 869
           + V  L  K R                                                 
Sbjct: 493 SLVTELQDKLR------------------------------------------------- 503

Query: 870 ANINRTAVAPINKPFDKGGDPKNFDHAWSIFGDDL 904
             +N TA+ P N P+D   +P  +DH W+ FGD L
Sbjct: 504 -KLNATAILPGNLPWDNKANPDYWDHTWTNFGDYL 537



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 39/70 (55%)

Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSND 568
            IDG+D+W  LS +  S R  +LHNIDD + ++ +T G WK ++ ++ NG        + 
Sbjct: 355 SIDGMDLWYALSEDTESPRKMVLHNIDDIFGVAGITYGDWKYIQGSTYNGQWDGWYGPSG 414

Query: 569 NSYQNEIDGI 578
             +  +I G+
Sbjct: 415 REWVYDIGGV 424



 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 26/36 (72%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH VL   E  GLPL EK+LPQYL+ELGY T I+ 
Sbjct: 92  MQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVG 127


>gi|350415537|ref|XP_003490674.1| PREDICTED: arylsulfatase J-like [Bombus impatiens]
          Length = 545

 Score =  488 bits (1257), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 227/337 (67%), Positives = 267/337 (79%)

Query: 44  FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
           F L++  V  VA++  PHI+FILADDLGWNDVGFHG  QIPTPNIDALAYSG++L  YY 
Sbjct: 11  FYLTVFVVCHVAAAVQPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYV 70

Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
             +CTPSRSA+MTGKHPIHTGMQH VL   E  GLPL EK+LPQYL+ELGY T IVGKWH
Sbjct: 71  TPICTPSRSALMTGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWH 130

Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD 223
           LGFY KEYTP +RGF+SH+G+W+GH DYFDHSA E   WGLDMRR L  AWDLHG+YSTD
Sbjct: 131 LGFYTKEYTPMYRGFDSHIGFWSGHHDYFDHSAVESPYWGLDMRRGLNSAWDLHGQYSTD 190

Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
           +FT EAV +I++H+   P+FLYL HAA HS N Y PL  PD  +    +I +++R +FA 
Sbjct: 191 IFTKEAVKLINDHNASRPMFLYLPHAAVHSGNSYNPLPVPDQDVAKFTNIFNYERRRFAG 250

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
           +L KLDESVG+VVEAL ++ ML NS+IVF +DNGG  AGFNLNAASNWPLRG KNTLWEG
Sbjct: 251 MLSKLDESVGQVVEALRKKGMLKNSVIVFSTDNGGPPAGFNLNAASNWPLRGTKNTLWEG 310

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           GVRGAGL+WSP L   G V+ Q  H+SDWLPTL++AA
Sbjct: 311 GVRGAGLLWSPRLVRPGRVSRQLFHISDWLPTLITAA 347



 Score = 82.0 bits (201), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 74/145 (51%), Gaps = 34/145 (23%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
            IDG+D+W  LS +  S R  +LHNIDD + ++ +T G WK ++            G  D
Sbjct: 355 SIDGMDLWYALSEDTESPRKMVLHNIDDIFGVAGITYGDWKYIQ-------GSTYNGQWD 407

Query: 634 QVYLSGLSDREWL-----------------------ALAMRKLRDAASIQCGPVKE-VP- 668
             Y  G S REW+                       A A+R+LR+ A I+C P  + +P 
Sbjct: 408 GWY--GPSGREWVYDIGGVIGSTAGRAVASVGLTLSADAIRRLREDAMIKCPPKNDSLPL 465

Query: 669 CEPQIAPCLFDIKNDPCEKNNLADR 693
           C+P  APCLF+++ DPCE NNL ++
Sbjct: 466 CKPLEAPCLFNVQQDPCEDNNLINK 490



 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 64/155 (41%), Gaps = 60/155 (38%)

Query: 752 EEEGMRKLRDAASIQCGPVKE-VP-CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYT 809
             + +R+LR+ A I+C P  + +P C+P  APCLF+++ DPCE NNL        IN Y 
Sbjct: 441 SADAIRRLREDAMIKCPPKNDSLPLCKPLEAPCLFNVQQDPCEDNNL--------INKYP 492

Query: 810 TEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKEL 869
           + V                 E + + +K                                
Sbjct: 493 SLV----------------TELRDRLRK-------------------------------- 504

Query: 870 ANINRTAVAPINKPFDKGGDPKNFDHAWSIFGDDL 904
             +N TA+ P N P+D   +P  +DH W+ FGD L
Sbjct: 505 --LNATAILPGNLPWDNRANPDYWDHTWTNFGDYL 537



 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 26/36 (72%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH VL   E  GLPL EK+LPQYL+ELGY T I+ 
Sbjct: 92  MQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVG 127


>gi|345495280|ref|XP_001606377.2| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 545

 Score =  486 bits (1252), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 229/343 (66%), Positives = 269/343 (78%), Gaps = 6/343 (1%)

Query: 50  FVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTP 109
            V     + PPHIIFI+ADDLGWNDVGFHG  QIPTPNIDALAYSG+IL  YY   +CTP
Sbjct: 18  IVAAAPETTPPHIIFIVADDLGWNDVGFHGSGQIPTPNIDALAYSGLILNRYYVSPICTP 77

Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
           SRSA+MTGK+PIHTGMQ  VL G E  GLPL EK+LP+YL+ELGYRT IVGKWHLGFY K
Sbjct: 78  SRSALMTGKYPIHTGMQRGVLKGAEPRGLPLKEKLLPEYLRELGYRTHIVGKWHLGFYTK 137

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           EYTPT+RGFESHLGYWTGHQDY+DHSA E   WG+DMRR++EPAWDLHG+YSTDVFT EA
Sbjct: 138 EYTPTYRGFESHLGYWTGHQDYYDHSAVEEPYWGMDMRRNMEPAWDLHGQYSTDVFTKEA 197

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
           V +I +H+  +P+FLYLAHAA HSANPY PL   D  ++  + I ++KR +FA +   +D
Sbjct: 198 VKLIKSHNASQPMFLYLAHAAVHSANPYNPLPVSDEIVDKIK-IPEYKRRRFAGMASAMD 256

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
            SVG VVEALE+R ML + +IVF +DNGG A GFNLNAASNWPLRGVK+TL+EGGVRG G
Sbjct: 257 ASVGAVVEALEKRNMLRDCLIVFSTDNGGPAEGFNLNAASNWPLRGVKDTLFEGGVRGVG 316

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNST 392
           L+WS  L   G ++E   H++DWLPTLL AA       YVN+T
Sbjct: 317 LVWSERLARPGRLSEATFHIADWLPTLLRAAG-----GYVNAT 354



 Score = 80.9 bits (198), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 53/162 (32%), Positives = 82/162 (50%), Gaps = 21/162 (12%)

Query: 573 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGP 632
            +IDG+D W  L+ ++ S R  +LHNIDD    +A++  +WKL+K     R + +   GP
Sbjct: 355 GKIDGLDAWEALADDKESPRRVVLHNIDDVLGTAAISVDQWKLIK-GSTYRGRWNQWYGP 413

Query: 633 ---DQVY-LSGLSDREWLALAMRK------------LRDAASIQCGPVKEVP---CEPQI 673
              D  Y +  +++   +A A+++            LR+ A+I CG   + P   C P  
Sbjct: 414 SGRDYPYDVEAVAEGCLVAKALKRLGRGLDRGKLVELRNQATIHCGEQPQQPQKACNPLE 473

Query: 674 APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
           A CL D++ DPCE+ NLAD   D  +     E+ R N  A P
Sbjct: 474 AACLIDVRADPCERRNLADEQPDV-VRRLEEELARLNASAVP 514



 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 50/203 (24%), Positives = 76/203 (37%), Gaps = 68/203 (33%)

Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           +T  GR+NQ   P   +     +   +     K  K+  +   + K       + +LR+ 
Sbjct: 401 STYRGRWNQWYGPSGRDYPYDVEAVAEGCLVAKALKRLGRGLDRGK-------LVELRNQ 453

Query: 763 ASIQCGPVKEVP---CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQ 819
           A+I CG   + P   C P  A CL D++                            DP +
Sbjct: 454 ATIHCGEQPQQPQKACNPLEAACLIDVRA---------------------------DPCE 486

Query: 820 RFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAP 879
           R N    L  E+                           PDV+ ++E+ELA +N +AV P
Sbjct: 487 RRN----LADEQ---------------------------PDVVRRLEEELARLNASAVPP 515

Query: 880 INKPFDKGGDPKNFDHAWSIFGD 902
            N P+D   DP  +DH W+ FGD
Sbjct: 516 GNLPWDSQADPALWDHTWNNFGD 538



 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 27/36 (75%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQ  VL G E  GLPL EK+LP+YL+ELGYRT I+ 
Sbjct: 93  MQRGVLKGAEPRGLPLKEKLLPEYLRELGYRTHIVG 128


>gi|307167595|gb|EFN61139.1| Arylsulfatase B [Camponotus floridanus]
          Length = 519

 Score =  480 bits (1236), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 218/310 (70%), Positives = 258/310 (83%)

Query: 71  GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
           GWNDVGFHG  QIPTPNIDALAYSG++L  YY   +CTPSRSA+MTGK+PIHTGMQH++L
Sbjct: 10  GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTSICTPSRSALMTGKYPIHTGMQHSIL 69

Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
            G E  GLPL EKILP+YL+ELGY T IVGKWHLGFYK+EYTPT+RGF++H+GYWTGH D
Sbjct: 70  KGAEPRGLPLHEKILPEYLRELGYSTHIVGKWHLGFYKREYTPTYRGFDTHIGYWTGHHD 129

Query: 191 YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAA 250
           Y+DH+A E   WGLDMRR ++PAWDLHG+YSTDVFT EAV +I+NH++  P+FLYLAHAA
Sbjct: 130 YYDHTAVENPYWGLDMRRGMKPAWDLHGEYSTDVFTKEAVKLINNHNSSRPMFLYLAHAA 189

Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
            HS NPY PL APD  +    +I D+ R +FA IL KLD+SVG+VVEAL +  ML +S+I
Sbjct: 190 VHSGNPYNPLPAPDEEVAKFNNIFDYNRRRFAGILSKLDQSVGQVVEALYKNNMLQDSVI 249

Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
           VF +DNGG AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP L   G V+ Q +H++
Sbjct: 250 VFSTDNGGPAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPRLVRAGRVSRQMIHIT 309

Query: 371 DWLPTLLSAA 380
           DWLPTL++AA
Sbjct: 310 DWLPTLITAA 319



 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 56/165 (33%), Positives = 77/165 (46%), Gaps = 35/165 (21%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
            IDG+++W  LS +  S R  ILHNIDD + +SA+T G WK ++            G  D
Sbjct: 328 SIDGVNLWDALSEDTESPRTNILHNIDDIYGVSAITVGDWKFIQ-------GSTYNGVWD 380

Query: 634 QVYLSGLSDREWL-------------ALA----------MRKLRDAASIQCGPVKE-VP- 668
             Y  G S REW+             A+A          +  LR+ A I+C P  + +P 
Sbjct: 381 GWY--GPSGREWIYDADGVISSTAGRAIASVGLSVTTEIISSLRENAMIKCPPRNDSLPI 438

Query: 669 CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIA 713
           C P   PCLF+I  DPCE NNL  +     +     E+ +FN  A
Sbjct: 439 CRPLQEPCLFNIYQDPCEDNNLV-KQFPTIVRKLQEELKKFNNTA 482



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 28/36 (77%)

Query: 1  MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
          MQH++L G E  GLPL EKILP+YL+ELGY T I+ 
Sbjct: 64 MQHSILKGAEPRGLPLHEKILPEYLRELGYSTHIVG 99



 Score = 48.5 bits (114), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 39/155 (25%), Positives = 58/155 (37%), Gaps = 60/155 (38%)

Query: 750 SNEEEGMRKLRDAASIQCGPVKE-VP-CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINH 807
           S   E +  LR+ A I+C P  + +P C P   PCLF+I                     
Sbjct: 412 SVTTEIISSLRENAMIKCPPRNDSLPICRPLQEPCLFNI--------------------- 450

Query: 808 YTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
                 Y DP +  N +                               K +P ++ ++++
Sbjct: 451 ------YQDPCEDNNLV-------------------------------KQFPTIVRKLQE 473

Query: 868 ELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
           EL   N TA+ P N P+D   DP  +DH W+ FGD
Sbjct: 474 ELKKFNNTAILPGNLPWDSKADPNLWDHTWNNFGD 508


>gi|307207313|gb|EFN85063.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 532

 Score =  476 bits (1226), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 218/311 (70%), Positives = 253/311 (81%)

Query: 71  GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
           GWNDVGFHG  QIPTPNIDALAYSG++L  YY   +CTPSRSA+MTGK+PIH GMQH VL
Sbjct: 26  GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKYPIHIGMQHGVL 85

Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
            G E  GLPL EKILP+YL++LGY T IVGKWHLGFY KEYTPT+RGF SH G+WTGHQD
Sbjct: 86  KGAEPRGLPLHEKILPEYLRDLGYSTHIVGKWHLGFYTKEYTPTYRGFASHTGFWTGHQD 145

Query: 191 YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAA 250
           YFDH+A E   WGLDMRRD+EPAWDLHG+YSTDVFT EA+ +I  H++  PLFLYLAHAA
Sbjct: 146 YFDHTAVESPYWGLDMRRDMEPAWDLHGQYSTDVFTKEALRLIDRHNSSRPLFLYLAHAA 205

Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
            HS NPY PL APD  +    +I D+ R +FA IL KLD+SVG+VVE L ++ ML +S+I
Sbjct: 206 VHSGNPYNPLPAPDEEVAKFDNIFDYNRRRFAGILSKLDKSVGQVVEGLRKKGMLRDSVI 265

Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
           +F +DNGG AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP L   G V+ Q +H++
Sbjct: 266 IFSTDNGGPAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPKLTRSGRVSRQLLHIT 325

Query: 371 DWLPTLLSAAN 381
           DWLPTL++AA 
Sbjct: 326 DWLPTLITAAG 336



 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/167 (36%), Positives = 82/167 (49%), Gaps = 36/167 (21%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
           IDG+D+WS LS +  S R ++LHNIDD + +SA+T G WKLV+            G  D 
Sbjct: 344 IDGMDLWSALSDDTESPRTSVLHNIDDIYGVSAVTVGDWKLVQ-------GSTYNGAWDG 396

Query: 635 VYLSGLSDREWL-----------------------ALAMRKLRDAASIQCGPVKE-VP-C 669
            Y  G S REW+                       A  +R LR+ A I+C P  + +P C
Sbjct: 397 WY--GPSGREWVYDVGGVIGSEAGRAVASVGLGITAETVRMLREKAMIKCPPKNDSLPLC 454

Query: 670 EPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIA-YP 715
           +P   PCLF++  DPCE NNL  R     +     E+ +FN  A YP
Sbjct: 455 KPLQEPCLFNVYQDPCEDNNLV-RQSPTIVRKLQEELKKFNSSAIYP 500



 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 27/36 (75%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH VL G E  GLPL EKILP+YL++LGY T I+ 
Sbjct: 80  MQHGVLKGAEPRGLPLHEKILPEYLRDLGYSTHIVG 115



 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 38/154 (24%), Positives = 61/154 (39%), Gaps = 60/154 (38%)

Query: 754 EGMRKLRDAASIQCGPVKE-VP-CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTE 811
           E +R LR+ A I+C P  + +P C+P   PCLF++                         
Sbjct: 431 ETVRMLREKAMIKCPPKNDSLPLCKPLQEPCLFNV------------------------- 465

Query: 812 VGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELAN 871
             Y DP +  N +                               +  P ++ ++++EL  
Sbjct: 466 --YQDPCEDNNLV-------------------------------RQSPTIVRKLQEELKK 492

Query: 872 INRTAVAPINKPFDKGGDPKNFDHAWSIFGDDLK 905
            N +A+ P N P+D  GDP  +DH W+ FGD +K
Sbjct: 493 FNSSAIYPGNLPWDNKGDPSLWDHTWNNFGDYVK 526


>gi|332024600|gb|EGI64798.1| Arylsulfatase J [Acromyrmex echinatior]
          Length = 528

 Score =  476 bits (1225), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 219/319 (68%), Positives = 261/319 (81%), Gaps = 1/319 (0%)

Query: 62  IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
           +I +LA   GWNDVGFHG  QIPTPNIDALAYSG++L  YY   +CTPSRSA+MTGK+PI
Sbjct: 13  LILLLAVH-GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKYPI 71

Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
           HTGMQH VL G E  GLPL EK+LP+YL+ELGY T IVGKWHLGFYKKEYTPT+RGF++H
Sbjct: 72  HTGMQHGVLKGAEPRGLPLREKLLPEYLRELGYNTHIVGKWHLGFYKKEYTPTYRGFDTH 131

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEP 241
           +G+WTGH DYFDH+A E   WGLD+RR ++PAWDLHG+YSTD+FT EAV +I NH++  P
Sbjct: 132 IGFWTGHHDYFDHTAVENPYWGLDIRRGMQPAWDLHGQYSTDIFTKEAVRLIDNHNSSRP 191

Query: 242 LFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQ 301
           +FLYLAHAA HS NPY PL APD  +    +I D+ R +FA +L KLD+SVG+VV+AL +
Sbjct: 192 MFLYLAHAAVHSGNPYNPLPAPDEEVAKFNNIFDYNRRRFAGMLSKLDQSVGQVVDALRK 251

Query: 302 RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGI 361
           R ML +SII+F SDNGG AAGFNLNAASNWPLRGVKNTLWEGGVRG GLIWSP L   G 
Sbjct: 252 RDMLRDSIIIFSSDNGGPAAGFNLNAASNWPLRGVKNTLWEGGVRGTGLIWSPRLVRPGR 311

Query: 362 VAEQYVHVSDWLPTLLSAA 380
           V+ Q +H++DWLPTL++AA
Sbjct: 312 VSRQMLHITDWLPTLITAA 330



 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 51/143 (35%), Positives = 72/143 (50%), Gaps = 36/143 (25%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
           +DG+D+W+ LS +  S R  +LHNIDD + +SA+T G WKL++            G  D 
Sbjct: 339 VDGMDLWNALSEDTESPRANVLHNIDDIYGVSAITVGDWKLIR-------GSTYKGVWDG 391

Query: 635 VYLSGLSDREWL-------------ALA----------MRKLRDAASIQCGPVK--EVP- 668
            Y  G S REW+             A+A          +  LR+ A I+C P++   +P 
Sbjct: 392 WY--GPSGREWVYDVDGVISSTTAHAIASVGLGITTDVVLMLRENALIKC-PLRNDSLPI 448

Query: 669 CEPQIAPCLFDIKNDPCEKNNLA 691
           C+P    CLF+I  DPCE NNL 
Sbjct: 449 CKPLEEACLFNIYQDPCEDNNLV 471



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 27/36 (75%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH VL G E  GLPL EK+LP+YL+ELGY T I+ 
Sbjct: 75  MQHGVLKGAEPRGLPLREKLLPEYLRELGYNTHIVG 110



 Score = 47.8 bits (112), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 31/49 (63%)

Query: 854 MKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
           + K +P ++ +++ EL   N +A+ P N P+D  GDP  +DH W+ FGD
Sbjct: 470 LVKQFPTIVRKLQDELKKFNSSALLPGNLPWDSKGDPNLWDHTWNNFGD 518


>gi|170050440|ref|XP_001861313.1| arylsulfatase b [Culex quinquefasciatus]
 gi|167872047|gb|EDS35430.1| arylsulfatase b [Culex quinquefasciatus]
          Length = 552

 Score =  476 bits (1225), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 228/347 (65%), Positives = 269/347 (77%), Gaps = 4/347 (1%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           ++  AVL LAFT+ +V     +S  PP+IIFILADDLGWNDVGFHG  QIPTPN+DALAY
Sbjct: 3   LVVIAVLILAFTIVLV----TSSKPPPNIIFILADDLGWNDVGFHGSAQIPTPNLDALAY 58

Query: 94  SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
           SGIIL  YY   +CTPSR+A+MTG++PIHTGMQH VLYG E  GLPL EK+LP+YL+ELG
Sbjct: 59  SGIILNRYYVTPICTPSRAALMTGRYPIHTGMQHAVLYGMEPRGLPLEEKLLPEYLRELG 118

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y+  IVGKWHLG Y + YTP  RGF+SH+G+WTGH   FDHSA E + WGLDMRR  + A
Sbjct: 119 YKNHIVGKWHLGHYTRRYTPLERGFDSHVGFWTGHHHMFDHSAVETETWGLDMRRGYDVA 178

Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
           +DLHGKY+T V   EAV  I NHS  +PLFLY+AHAA HSANPY+ L APD  +    H+
Sbjct: 179 YDLHGKYTTHVIRDEAVARIGNHSIGDPLFLYVAHAAVHSANPYDFLPAPDVTVAGLEHV 238

Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
           E + R KFAA+L +LDESVG +VEAL+ + ML N+IIVF SDNGG A GFN NAASNWPL
Sbjct: 239 EPYPRRKFAAMLSELDESVGAIVEALKIKGMLDNTIIVFSSDNGGPAEGFNSNAASNWPL 298

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           RGVKNTLWEGGVR AG IWSPL+     V++Q +H+SDWLPTLL AA
Sbjct: 299 RGVKNTLWEGGVRAAGFIWSPLIAESRRVSQQMIHISDWLPTLLDAA 345



 Score = 73.2 bits (178), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 50/144 (34%), Positives = 68/144 (47%), Gaps = 18/144 (12%)

Query: 567 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQV 626
           N  +    +DGI+VW  L     ++R  ILHNIDD W  +ALT   WK++K     + Q 
Sbjct: 348 NVETLPKTLDGINVWKSLLNGNSTERREILHNIDDIWGSAALTVNNWKILKGTN-YKGQW 406

Query: 627 DLTGGPD-------------QVYLSG--LSDREWLAL--AMRKLRDAASIQCGPVKEVPC 669
           D   GP              Q   +G  LS+   L     +R LR  A++ C       C
Sbjct: 407 DNWYGPPGDRDPEAYNITAVQHCPTGKALSEMNLLPTEATIRNLRLNATVHCSKEVRATC 466

Query: 670 EPQIAPCLFDIKNDPCEKNNLADR 693
            P   PCL+D+  DPCE +NLA+R
Sbjct: 467 NPLEQPCLYDVLQDPCELDNLAER 490



 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/153 (28%), Positives = 55/153 (35%), Gaps = 58/153 (37%)

Query: 753 EEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEV 812
           E  +R LR  A++ C       C P   PCL+D+  DPCE +NLA+R             
Sbjct: 444 EATIRNLRLNATVHCSKEVRATCNPLEQPCLYDVLQDPCELDNLAERFP----------- 492

Query: 813 GYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANI 872
                       A LD                        M+KK            L   
Sbjct: 493 ------------AILD-----------------------SMLKK------------LDAY 505

Query: 873 NRTAVAPINKPFDKGGDPKNFDHAWSIFGDDLK 905
           N TAV P N   D  GDP+ + + W  FGDDL+
Sbjct: 506 NATAVPPGNLALDTRGDPRFWGYTWHNFGDDLE 538



 Score = 52.8 bits (125), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 28/36 (77%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH VLYG E  GLPL EK+LP+YL+ELGY+  I+ 
Sbjct: 90  MQHAVLYGMEPRGLPLEEKLLPEYLRELGYKNHIVG 125


>gi|321470034|gb|EFX81012.1| hypothetical protein DAPPUDRAFT_303738 [Daphnia pulex]
          Length = 557

 Score =  456 bits (1174), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 210/328 (64%), Positives = 251/328 (76%), Gaps = 2/328 (0%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P++IFILADD GWNDV FHG  QIPTPN+DALA+SG+IL+NYY   LCTPSRSA+MTGKH
Sbjct: 26  PNLIFILADDFGWNDVSFHGSKQIPTPNLDALAFSGLILQNYYVTPLCTPSRSALMTGKH 85

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTGMQH+VLYG  R GLPLSE  LP+YLK+LGY+  IVGKWHLG YK  YTP FRGF+
Sbjct: 86  PIHTGMQHDVLYGYSRYGLPLSEITLPEYLKDLGYKNHIVGKWHLGHYKSVYTPLFRGFD 145

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           SH GYWTGHQDY+DH+A E   WG DMRR+    W  +GKY+T + T EA D+I  H   
Sbjct: 146 SHYGYWTGHQDYYDHTAVEWNAWGYDMRRNHSVDWSAYGKYTTTLLTDEACDVITKHDVS 205

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLY+AH A HSANPY PLQAP+  + +   IE+ +R ++AA++H+LD SVGK+V++L
Sbjct: 206 SPLFLYVAHLAVHSANPYSPLQAPEETVEMFSSIENLQRRRYAAMVHELDVSVGKIVKSL 265

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--E 357
               ML N++IVF +DNGG A GFN NAASNWPL+GVKNT WEGGVR AGLIWSPL+   
Sbjct: 266 GDNNMLENTVIVFSTDNGGPAEGFNQNAASNWPLKGVKNTPWEGGVRAAGLIWSPLIPKS 325

Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKSDI 385
            RG V    + +SDWLPTL  AA +S I
Sbjct: 326 RRGQVMSNLMDISDWLPTLFEAAGQSRI 353



 Score = 73.9 bits (180), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 37/152 (24%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVK----------------- 617
           +DG+  W+ +  ++PS RN +LHNIDD+   +A+ K  WKLVK                 
Sbjct: 363 LDGVSHWNTVLYDKPSARNHVLHNIDDQLGYAAIRKENWKLVKGTTYEGSWDGWYGPSGR 422

Query: 618 --------------VVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCG- 662
                         V  + +  V  +     + + G+S  + L    ++L DA  I CG 
Sbjct: 423 TNESSTNFINSVDDVSLIYKNAVLDSDAAQYIAMLGMSATQNL---YQQLLDA-EISCGE 478

Query: 663 -PVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 693
            PV   PC P  A CLFDI+ DPCE NNLA++
Sbjct: 479 RPVDAHPCLPLDALCLFDIEQDPCEYNNLAEK 510



 Score = 52.8 bits (125), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 28/36 (77%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH+VLYG  R GLPLSE  LP+YLK+LGY+  I+ 
Sbjct: 91  MQHDVLYGYSRYGLPLSEITLPEYLKDLGYKNHIVG 126



 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/45 (55%), Positives = 30/45 (66%), Gaps = 3/45 (6%)

Query: 757 RKLRDAASIQCG--PVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 799
           ++L DA  I CG  PV   PC P  A CLFDI+ DPCE NNLA++
Sbjct: 467 QQLLDA-EISCGERPVDAHPCLPLDALCLFDIEQDPCEYNNLAEK 510


>gi|357612332|gb|EHJ67925.1| hypothetical protein KGM_21236 [Danaus plexippus]
          Length = 563

 Score =  454 bits (1169), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 208/321 (64%), Positives = 257/321 (80%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+ I+ADDLGWNDVGFHG +QIPTPNID +A+SG+ L NYY   +CTPSR+A+MTGK+
Sbjct: 49  PNIVLIIADDLGWNDVGFHGSNQIPTPNIDIMAWSGVSLHNYYVTPICTPSRAALMTGKY 108

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTGMQH V++  E  GLPL+EKILPQYLKELGY+T +VGKWHLG YKKEY P  RGF+
Sbjct: 109 PIHTGMQHTVIFAAEPRGLPLTEKILPQYLKELGYKTHLVGKWHLGSYKKEYLPLNRGFD 168

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           SHLG+W G  D +DH+ +E   WG D RRD   A DL G+Y+TDV+T EAV II +H+T 
Sbjct: 169 SHLGFWNGKIDMYDHTNQEKGYWGFDFRRDFSTAHDLFGQYATDVYTNEAVKIIKSHNTS 228

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFL L+H+A H+ NP EP++AP+       HI+DF+R KFAA+L KLDESVG+VV AL
Sbjct: 229 SPLFLMLSHSAVHTGNPSEPIRAPEKLFVNFTHIQDFQRRKFAAVLTKLDESVGEVVAAL 288

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           + + +L++SI+VF +DNGGAAAGFN NAASN+PLRGVKNTLWEGGVRGAG +WSP ++ R
Sbjct: 289 KAKGVLNDSIVVFTTDNGGAAAGFNDNAASNYPLRGVKNTLWEGGVRGAGWLWSPFIDKR 348

Query: 360 GIVAEQYVHVSDWLPTLLSAA 380
             VA Q +H+ DWLPTLLSAA
Sbjct: 349 SRVATQRMHLVDWLPTLLSAA 369



 Score = 86.3 bits (212), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 56/158 (35%), Positives = 80/158 (50%), Gaps = 21/158 (13%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG---- 630
           IDG+D W  LS++ PS R +++HNIDDE   +++T  +WK   V K   Y     G    
Sbjct: 379 IDGVDQWCALSQDLPSARESLVHNIDDESGSASITYKQWK---VHKGTNYGGSWDGWYGP 435

Query: 631 -GPDQVY-----LSGLSDREWLALAMR-------KLRDAASIQCGPVKEVPCEPQIAPCL 677
            G +  Y     L+  +    L + M        +LR  A+++CG  + +PC P  APCL
Sbjct: 436 AGREGAYDTTRLLASKAAGALLDIGMLPDTEHILRLRSEATVECGDREALPCRPLEAPCL 495

Query: 678 FDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
           F+I  DPCE  NLAD   D  +     E+ R N+ A P
Sbjct: 496 FNIDEDPCETRNLADIHPDV-LQVMLKELDRVNRTAVP 532



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 32/47 (68%)

Query: 752 EEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLAD 798
           + E + +LR  A+++CG  + +PC P  APCLF+I  DPCE  NLAD
Sbjct: 464 DTEHILRLRSEATVECGDREALPCRPLEAPCLFNIDEDPCETRNLAD 510



 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 29/36 (80%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH V++  E  GLPL+EKILPQYLKELGY+T ++ 
Sbjct: 114 MQHTVIFAAEPRGLPLTEKILPQYLKELGYKTHLVG 149



 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/45 (53%), Positives = 30/45 (66%)

Query: 858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
           +PDVL  M KEL  +NRTAV P N+P   GGDPK + +  + FGD
Sbjct: 512 HPDVLQVMLKELDRVNRTAVPPNNQPLTPGGDPKYWGYVITNFGD 556


>gi|242024962|ref|XP_002432895.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
 gi|212518404|gb|EEB20157.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
          Length = 533

 Score =  450 bits (1158), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 210/326 (64%), Positives = 266/326 (81%), Gaps = 2/326 (0%)

Query: 57  SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
           S PPHIIFI+ADDLGWNDVG+HG D+IPTPNIDALAY+GIIL  YY + +CTPSRSA+MT
Sbjct: 7   SKPPHIIFIMADDLGWNDVGYHGSDEIPTPNIDALAYNGIILNRYYVLPVCTPSRSALMT 66

Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           G+HPIH GMQH VL+G E  GLPL+EK+LP+YL++LGY T IVGKWHLGFYKKEYTP +R
Sbjct: 67  GRHPIHNGMQHRVLFGVETRGLPLTEKLLPEYLQKLGYSTHIVGKWHLGFYKKEYTPLYR 126

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           GFESH+G+WTGHQDY+DH+AEE ++WGLDMR  ++PAW LHG+YST V+T E+V II N+
Sbjct: 127 GFESHIGFWTGHQDYYDHTAEEERLWGLDMRHGMKPAWYLHGEYSTHVYTRESVKIIKNY 186

Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
           ++ +PLFLY+AHAA HS N Y PL APD  ++   HI+++ R ++AA++ +LD SVG ++
Sbjct: 187 NSTKPLFLYVAHAAVHSGNKYNPLPAPDKTVDKLDHIQNYNRRRYAAMVSELDTSVGNII 246

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
            AL+   MLSNSI++F +DNGG A G N N ASNWP+RGVKNTLWEGGVRG G++WS   
Sbjct: 247 SALKDANMLSNSIVIFSTDNGGPANGLNYNYASNWPMRGVKNTLWEGGVRGVGVVWSSKF 306

Query: 357 E--SRGIVAEQYVHVSDWLPTLLSAA 380
           +  S G V+ + +H+ DWLPTL +AA
Sbjct: 307 KPNSGGRVSNRMMHICDWLPTLYAAA 332



 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 71/141 (50%), Gaps = 24/141 (17%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG---- 630
           +DG ++W+ L  + PS RN ILHNIDD +  SALT  +WKLVK      +     G    
Sbjct: 343 LDGYNMWNSLLDDLPSPRNEILHNIDDIFGNSALTMSQWKLVKGTTYNGFYDSWFGPTGR 402

Query: 631 GPDQVY-----LSGLSDREWLAL--------AMRKLRDAASIQCGPVKEVP-------CE 670
            P   Y     LS LS++    L        ++R++R+ A+I CG  + +        C 
Sbjct: 403 NPGYFYNIGQVLSSLSNKALSTLQTVPLNPESIRRIREEATIDCGNHQLISSESRFSNCN 462

Query: 671 PQIAPCLFDIKNDPCEKNNLA 691
           P    CLF++  DPCE +N+A
Sbjct: 463 PLKESCLFNVDEDPCEIDNVA 483



 Score = 52.8 bits (125), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 22/36 (61%), Positives = 29/36 (80%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH VL+G E  GLPL+EK+LP+YL++LGY T I+ 
Sbjct: 75  MQHRVLFGVETRGLPLTEKLLPEYLQKLGYSTHIVG 110



 Score = 39.7 bits (91), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 28/49 (57%)

Query: 854 MKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
           + K +P V+ +ME+ L   N TAV P     D  G+PK + + W+ FGD
Sbjct: 482 VAKSFPQVVKEMEERLLYWNSTAVKPAKVINDLRGNPKFWGYVWTNFGD 530


>gi|158300602|ref|XP_552160.3| AGAP012047-PA [Anopheles gambiae str. PEST]
 gi|157013239|gb|EAL38777.3| AGAP012047-PA [Anopheles gambiae str. PEST]
          Length = 564

 Score =  447 bits (1151), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 216/333 (64%), Positives = 253/333 (75%), Gaps = 2/333 (0%)

Query: 57  SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
           S  P+IIFILADDLGWNDVGFHG  QIPTPN+DALAYSGIIL  YY   +CTPSRSA+MT
Sbjct: 36  SARPNIIFILADDLGWNDVGFHGSAQIPTPNLDALAYSGIILNRYYVNPICTPSRSALMT 95

Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           GK+PIHTGMQH VLY  E  GLPLSEK+LPQYLK+LGY   IVGKWHLG Y+  +TP  R
Sbjct: 96  GKYPIHTGMQHTVLYAMEPRGLPLSEKLLPQYLKDLGYSNHIVGKWHLGHYQLRFTPMQR 155

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           GF+SH G+WTGH    DH+A E   WGLDMRR  + A+DLHG+Y+T V  AEA+ I+  H
Sbjct: 156 GFDSHTGFWTGHHHMNDHTAVEHGHWGLDMRRGYDVAYDLHGQYTTHVLGAEAIAIVQGH 215

Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
           +   PLFLY+AHAA HSANPY+ L APD  +    HIE+++R KFAA++ +LD +VG +V
Sbjct: 216 NKSSPLFLYVAHAAVHSANPYDFLPAPDETVANLGHIENYRRRKFAAMMVELDRTVGSLV 275

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
           +AL  R ML N+IIVF SDNGG A GFN NAASNWPLRGVKNTLWEGG+R AG IWSPLL
Sbjct: 276 DALHARDMLENTIIVFSSDNGGPADGFNDNAASNWPLRGVKNTLWEGGLRAAGFIWSPLL 335

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA--NKSDIPN 387
           ++   V+ Q V V DWLPTL  AA  N S +P 
Sbjct: 336 QNVSRVSHQMVQVCDWLPTLYEAAGGNVSALPT 368



 Score = 89.0 bits (219), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 58/187 (31%), Positives = 88/187 (47%), Gaps = 31/187 (16%)

Query: 567 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQV 626
           N ++   ++DGI VW  LS   P++R  ILHNIDD W  +ALT G WKLVK     R   
Sbjct: 362 NVSALPTDLDGISVWHELSSGAPTRRIEILHNIDDIWGTAALTVGNWKLVKGSHYNR-TW 420

Query: 627 DLTGGPDQVYLSGLSDREWLAL----------------------AMRKLRDAASIQCGPV 664
           D   GP     +G+ D +  AL                       + +LR  A++ CG  
Sbjct: 421 DGWYGP-----AGIRDEKAYALDKLANSPAGKVMEELSLLPTPERITQLRREATVSCGAG 475

Query: 665 KEVP--CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEE 722
             +   C+P   PCL+D+++DPCE NNLA+      +      +  +N  A P    E++
Sbjct: 476 AHMASECDPLERPCLYDVESDPCEYNNLAEE-HLHILQSLLARLADYNSTAVPPSNLEDD 534

Query: 723 KKKKKKK 729
            + + ++
Sbjct: 535 PRGEPQR 541



 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 24/36 (66%), Positives = 27/36 (75%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH VLY  E  GLPLSEK+LPQYLK+LGY   I+ 
Sbjct: 104 MQHTVLYAMEPRGLPLSEKLLPQYLKDLGYSNHIVG 139



 Score = 48.9 bits (115), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 38/76 (50%), Gaps = 2/76 (2%)

Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP--CEPQIAPC 782
           + +K     K       K  ++       E + +LR  A++ CG    +   C+P   PC
Sbjct: 430 RDEKAYALDKLANSPAGKVMEELSLLPTPERITQLRREATVSCGAGAHMASECDPLERPC 489

Query: 783 LFDIKNDPCEKNNLAD 798
           L+D+++DPCE NNLA+
Sbjct: 490 LYDVESDPCEYNNLAE 505


>gi|281363223|ref|NP_610807.3| CG8646 [Drosophila melanogaster]
 gi|17945274|gb|AAL48694.1| RE14504p [Drosophila melanogaster]
 gi|272432448|gb|AAF58475.2| CG8646 [Drosophila melanogaster]
          Length = 562

 Score =  447 bits (1151), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 212/326 (65%), Positives = 253/326 (77%), Gaps = 1/326 (0%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           S   P+IIFILADDLG+NDVGFHG  +IPTPNIDALAYSGIIL  YY   +CTPSRSA+M
Sbjct: 22  SPAKPNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALM 81

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           TGK+PIHTGMQH VLY  E  GLPL EKILPQYL ELGY + I GKWHLG +K +YTP +
Sbjct: 82  TGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLY 141

Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
           RGF SH+G+W+GHQDY DH+A E   WGLDMR   + A+DLHG Y+TDV T  +V +I N
Sbjct: 142 RGFSSHVGFWSGHQDYNDHTAVENNQWGLDMRNGTQVAYDLHGHYTTDVITDHSVKVIAN 201

Query: 236 H-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
           H +T  PLFLY+AHAA HS+NPY PL  PD+ +    HI ++KR KFAA++ K+D SVG+
Sbjct: 202 HNATKGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDNSVGQ 261

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           +V+ L +  ML NSII+F SDNGG A GFNLN ASN+PL+GVKNTLWEGGVR AGL+WSP
Sbjct: 262 IVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSP 321

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
           LL+    V+ Q +H+ DWLPTLL AA
Sbjct: 322 LLKKSQRVSNQTMHIIDWLPTLLEAA 347



 Score = 79.7 bits (195), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 67/235 (28%), Positives = 99/235 (42%), Gaps = 41/235 (17%)

Query: 515 VWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNE 574
           +WS L +      N  +H ID            W      +  G     N S       +
Sbjct: 318 MWSPLLKKSQRVSNQTMHIID------------WLPTLLEAAGGQPALSNLSK------Q 359

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD- 633
           IDG  +W  L +++ S R  +LHNIDD W  +AL+ G WKLVK     R   D   GP  
Sbjct: 360 IDGQSIWRALVQDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTN-YRGSWDGWYGPAG 418

Query: 634 ---------QVYLSGLSDREWLALAM-------RKLRDAASIQC--GPVKEVPCEPQI-- 673
                    Q+     + +   AL M       +++R AA++ C     +   C      
Sbjct: 419 ERDPRLYDWQLVGRSRAGKALEALKMLPSRADQQRIRAAATVSCPGQSSQGTSCVATAFS 478

Query: 674 APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
           APCLF I++DPCE+ NLA +   + +N   TE+ RFN  A P   +  + +   +
Sbjct: 479 APCLFHIRDDPCEQYNLA-KQYPEVVNALMTELERFNATAVPPSNKPADPRADPR 532



 Score = 52.8 bits (125), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 24/34 (70%), Positives = 25/34 (73%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
           MQH VLY  E  GLPL EKILPQYL ELGY + I
Sbjct: 91  MQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHI 124



 Score = 48.5 bits (114), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 31/49 (63%)

Query: 854 MKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
           + K YP+V++ +  EL   N TAV P NKP D   DP+ +++ W+ FGD
Sbjct: 495 LAKQYPEVVNALMTELERFNATAVPPSNKPADPRADPRFWNYTWTNFGD 543


>gi|242023422|ref|XP_002432133.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212517507|gb|EEB19395.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 514

 Score =  439 bits (1128), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 207/326 (63%), Positives = 249/326 (76%), Gaps = 2/326 (0%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHII ILADDLGWNDVGFHG +QIPTPNIDALA++GIIL NYY   +CTPSRSA++TGK+
Sbjct: 31  PHIIIILADDLGWNDVGFHGSNQIPTPNIDALAFTGIILNNYYVAPVCTPSRSALLTGKY 90

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTG+QH V++G    GL L+EK+LP+YL+ L Y TR VGKWHLG +KK+YTP +RGF+
Sbjct: 91  PIHTGLQHGVIHGSAPYGLNLNEKLLPEYLRSLNYVTRHVGKWHLGSFKKDYTPEYRGFD 150

Query: 180 SHLGYWTGHQDYFDHSA-EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           SH GYWTGHQDY+DH+A E    WG DMRR +       G Y+TD+FT EAV +I  H +
Sbjct: 151 SHYGYWTGHQDYYDHTAIENPGFWGYDMRRGMNVTRSDFGYYTTDLFTNEAVKVIKGHDS 210

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
           ++PLFLYLAH ATHS N Y PLQAP   +    +I+D  R  FA +L KLDESVGKVVEA
Sbjct: 211 NKPLFLYLAHLATHSGNKYSPLQAPAETVAKFNYIKDKNRRLFAGMLSKLDESVGKVVEA 270

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L    M++N +I+F +DNGG A GFNLNAASNWPLRGVK+TLWEGGVRG G IWSP L S
Sbjct: 271 LADSNMINNCVILFSTDNGGPAGGFNLNAASNWPLRGVKDTLWEGGVRGVGFIWSPFLPS 330

Query: 359 RGIVAEQYVHVSDWLPTLLSAANKSD 384
              V+   +H++DWLPTLLS  N S+
Sbjct: 331 SK-VSNAMIHITDWLPTLLSLTNASN 355



 Score = 46.2 bits (108), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 27/59 (45%), Positives = 37/59 (62%)

Query: 501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN 559
           +N ++  ++IDGI+VW  LS+  PS R  IL NID E +I+AL    WK  K N+ N N
Sbjct: 351 TNASNSISDIDGINVWPNLSKGLPSVRKEILLNIDTERKIAALRYKNWKYKKGNANNWN 409



 Score = 42.7 bits (99), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 32/49 (65%)

Query: 566 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWK 614
           +N ++  ++IDGI+VW  LS+  PS R  IL NID E +I+AL    WK
Sbjct: 351 TNASNSISDIDGINVWPNLSKGLPSVRKEILLNIDTERKIAALRYKNWK 399



 Score = 40.0 bits (92), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 24/33 (72%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 33
           +QH V++G    GL L+EK+LP+YL+ L Y TR
Sbjct: 96  LQHGVIHGSAPYGLNLNEKLLPEYLRSLNYVTR 128


>gi|328783191|ref|XP_396281.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
          Length = 713

 Score =  421 bits (1081), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 197/326 (60%), Positives = 243/326 (74%), Gaps = 5/326 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHII ILADDLGWNDV FHG DQIPTPNIDALAY+GIIL N+Y   LCTPSRSA+MTGK+
Sbjct: 72  PHIIIILADDLGWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKN 131

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIH GMQH+VL+  E  GLPLSEK+LP+YL+E+GY+T  VGKWHLG++KKEYTPT+RGF+
Sbjct: 132 PIHLGMQHSVLFPTEPRGLPLSEKLLPEYLREIGYKTHAVGKWHLGYFKKEYTPTYRGFD 191

Query: 180 SHLGYWTGHQDYFDHSAEE-----MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           SH GYW G QDY+ H  +E      +  G DMRR+L  AWD  GKYSTD+FT EA+ +I+
Sbjct: 192 SHFGYWNGLQDYYTHITQEPDPAFSEFKGFDMRRNLTVAWDTVGKYSTDLFTNEAIRLIN 251

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
            H TD P+FLYLAH A H  N  +  +APD  +    +I D +R   AA++ KLD+SVG 
Sbjct: 252 EHDTDRPMFLYLAHLAVHKGNEEQLFRAPDEEIAKFSYILDPERRIQAAVVSKLDQSVGD 311

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V++AL  R ML NSI++F+SDNG  + G   N  SN+PLRG+KN+ WEGG RG   IWSP
Sbjct: 312 VMDALRNRGMLENSIVLFMSDNGAPSQGILNNKGSNYPLRGIKNSPWEGGTRGVAAIWSP 371

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
           L++    V+ Q + +SDWLPTLLSAA
Sbjct: 372 LIKKSKRVSNQMMFISDWLPTLLSAA 397



 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 60/190 (31%), Positives = 90/190 (47%), Gaps = 29/190 (15%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN---GTSENRS 566
           IDG D+WS L  N+ S R+ ++ NIDD    ++  RG +K +   +  G+   G S + S
Sbjct: 407 IDGFDLWSALVLNKVSSRSEVVLNIDDLSDYASFRRGNFKYIIGRTETGSDWLGASGDPS 466

Query: 567 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQV 626
              S Q      D + +L     SK    +  I    Q   L + + + +K+     Y  
Sbjct: 467 EGISPQ-----YDPYKILY----SKTGVAISGIITAKQAMELKERRKRSIKI-----YDT 512

Query: 627 DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKE---VPCEPQIAPCLFDIKND 683
            L    ++  LS           + +LR  A I+C  V+E   +PCEP  APCLF+I+ D
Sbjct: 513 ALETKFEEKILS--------VENISELRRKAQIKCN-VQEKDRIPCEPMTAPCLFNIEKD 563

Query: 684 PCEKNNLADR 693
           PCE  NLA+R
Sbjct: 564 PCEMVNLAER 573



 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 16/194 (8%)

Query: 615 LVKVVKVMRYQVDLTGGPD---QVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
           L+    V R Q+    G D    + L+ +S R  + L +  L D AS + G  K +    
Sbjct: 393 LLSAAGVNRKQLGNIDGFDLWSALVLNKVSSRSEVVLNIDDLSDYASFRRGNFKYIIGRT 452

Query: 672 QIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKK 731
           +          DP E   ++ + +  +I +  T V     I      E +E++K+  K  
Sbjct: 453 ETGSDWLGASGDPSE--GISPQYDPYKILYSKTGVAISGIITAKQAMELKERRKRSIKIY 510

Query: 732 KKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE---VPCEPQIAPCLFDIKN 788
               + K ++K           E + +LR  A I+C  V+E   +PCEP  APCLF+I+ 
Sbjct: 511 DTALETKFEEKILSV-------ENISELRRKAQIKCN-VQEKDRIPCEPMTAPCLFNIEK 562

Query: 789 DPCEKNNLADRSEV 802
           DPCE  NLA+R  V
Sbjct: 563 DPCEMVNLAERRPV 576



 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/32 (65%), Positives = 28/32 (87%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
           MQH+VL+  E  GLPLSEK+LP+YL+E+GY+T
Sbjct: 137 MQHSVLFPTEPRGLPLSEKLLPEYLREIGYKT 168


>gi|195148952|ref|XP_002015426.1| GL11077 [Drosophila persimilis]
 gi|194109273|gb|EDW31316.1| GL11077 [Drosophila persimilis]
          Length = 545

 Score =  420 bits (1080), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 210/360 (58%), Positives = 252/360 (70%), Gaps = 24/360 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHIIFILADDLG+NDVGFHG  QIPTPNIDALAYSGIIL  YY   +CTPSRSA+MTGK+
Sbjct: 26  PHIIFILADDLGFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKY 85

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTGMQH VLY  E  GLPL EKILPQYL +LGY + I GKWHLG +K EYTP FRGF 
Sbjct: 86  PIHTGMQHTVLYAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFRGFS 145

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
           SH                   +WGLDMR   + A++LHG+Y+TDV T  +V +I NH + 
Sbjct: 146 SH-----------------KNLWGLDMRNGTDVAYNLHGQYTTDVITKHSVSVIANHDAA 188

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
             PLFLY+AHAA HS NPY PL  PD  +    HI  +KR ++AA++ K+DESVG +V+ 
Sbjct: 189 KGPLFLYVAHAAGHSGNPYNPLPVPDDDVMKLDHILHYKRRRYAAMVAKMDESVGVIVDQ 248

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L +  ML NSII+F SDNGG A GFNLN ASN+PLRGVKNTLWEGGVR AGL+WSPLLE 
Sbjct: 249 LRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLRGVKNTLWEGGVRAAGLVWSPLLEK 308

Query: 359 RGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPR 418
           R  V ++ +H+SDWLPTL+ AA    IP   N+ +   +    + +  ++   H Y SPR
Sbjct: 309 RQRVVDETMHISDWLPTLIEAAGG--IPALANANLSQSL----DGMSIWQTLVHAYPSPR 362



 Score = 76.6 bits (187), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 87/192 (45%), Gaps = 24/192 (12%)

Query: 556 INGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKL 615
           I   G     +N N  Q+ +DG+ +W  L    PS R T+LHNIDD W  +ALT G WK+
Sbjct: 327 IEAAGGIPALANANLSQS-LDGMSIWQTLVHAYPSPRKTVLHNIDDIWGSAALTVGDWKM 385

Query: 616 VKVVKVMRYQVDLTGGPD----------QVYLSGLSDREWLALAM-------RKLRDAAS 658
           ++       + D   GP            + L   S +    L M       ++LR+ A+
Sbjct: 386 IQGTN-YNGRWDGWYGPGGERDPHLYDWNLVLKSSSGKAMTGLKMLPSRADQQRLRETAT 444

Query: 659 IQCG--PVKEVPCEPQI--APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAY 714
           + C     +   C      APCLF +++DPCE+ NLA++   + ++    E+   N  A 
Sbjct: 445 VTCQGQSTQGTSCVATAFSAPCLFHVRDDPCEQFNLAEQYP-KVLDLLVAELQHINSTAV 503

Query: 715 PDKEEEEEKKKK 726
           P   + ++ +  
Sbjct: 504 PPSNQPDDPRAS 515



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/34 (67%), Positives = 25/34 (73%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
           MQH VLY  E  GLPL EKILPQYL +LGY + I
Sbjct: 91  MQHTVLYAAEPRGLPLKEKILPQYLNDLGYTSHI 124



 Score = 48.1 bits (113), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 39/170 (22%), Positives = 58/170 (34%), Gaps = 62/170 (36%)

Query: 737 KKKKKKKKKKKKYSNEEEGMRKLRDAASIQCG--PVKEVPCEPQI--APCLFDIKNDPCE 792
           K    K     K        ++LR+ A++ C     +   C      APCLF +++DPC 
Sbjct: 417 KSSSGKAMTGLKMLPSRADQQRLRETATVTCQGQSTQGTSCVATAFSAPCLFHVRDDPC- 475

Query: 793 KNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
                                     ++FN                              
Sbjct: 476 --------------------------EQFN------------------------------ 479

Query: 853 MMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
            + + YP VL  +  EL +IN TAV P N+P D    P  F++ W+ FGD
Sbjct: 480 -LAEQYPKVLDLLVAELQHINSTAVPPSNQPDDPRASPMYFNYTWTNFGD 528


>gi|198455736|ref|XP_001360091.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
 gi|198135374|gb|EAL24665.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
          Length = 545

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 210/360 (58%), Positives = 252/360 (70%), Gaps = 24/360 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHIIFILADDLG+NDVGFHG  QIPTPNIDALAYSGIIL  YY   +CTPSRSA+MTGK+
Sbjct: 26  PHIIFILADDLGFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKY 85

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTGMQH VLY  E  GLPL EKILPQYL +LGY + I GKWHLG +K EYTP FRGF 
Sbjct: 86  PIHTGMQHTVLYAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFRGFS 145

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
           SH                   +WGLDMR   + A++LHG+Y+TDV T  +V +I NH + 
Sbjct: 146 SH-----------------KNLWGLDMRNGTDVAYNLHGQYTTDVITKHSVSVIANHDAA 188

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
             PLFLY+AHAA HS NPY PL  PD  +    HI  +KR ++AA++ K+DESVG +V+ 
Sbjct: 189 KGPLFLYVAHAAGHSGNPYNPLPVPDDDVMKLDHILHYKRRRYAAMVAKMDESVGVIVDQ 248

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L +  ML NSII+F SDNGG A GFNLN ASN+PLRGVKNTLWEGGVR AGL+WSPLLE 
Sbjct: 249 LRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLRGVKNTLWEGGVRAAGLVWSPLLEK 308

Query: 359 RGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPR 418
           R  V ++ +H+SDWLPTL+ AA    IP   N+ +   +    + +  ++   H Y SPR
Sbjct: 309 RQRVVDETMHISDWLPTLIEAAGG--IPALANANLSQSL----DGMSIWQTLVHAYPSPR 362



 Score = 76.3 bits (186), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 87/192 (45%), Gaps = 24/192 (12%)

Query: 556 INGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKL 615
           I   G     +N N  Q+ +DG+ +W  L    PS R T+LHNIDD W  +ALT G WK+
Sbjct: 327 IEAAGGIPALANANLSQS-LDGMSIWQTLVHAYPSPRKTVLHNIDDIWGSAALTVGDWKM 385

Query: 616 VKVVKVMRYQVDLTGGPD----------QVYLSGLSDREWLALAM-------RKLRDAAS 658
           ++       + D   GP            + L   S +    L M       ++LR+ A+
Sbjct: 386 IQGTN-YNGRWDGWYGPGGERDPHLYDWNLVLKSSSGKAMTGLKMLPSRADQQRLRETAT 444

Query: 659 IQCG--PVKEVPCEPQI--APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAY 714
           + C     +   C      APCLF +++DPCE+ NLA++   + ++    E+   N  A 
Sbjct: 445 VTCQGQSTQGTSCVATAFSAPCLFHVRDDPCEQFNLAEQYP-KVLDLLMAELQHINSTAV 503

Query: 715 PDKEEEEEKKKK 726
           P   + ++ +  
Sbjct: 504 PPSNQPDDPRAS 515



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/34 (67%), Positives = 25/34 (73%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
           MQH VLY  E  GLPL EKILPQYL +LGY + I
Sbjct: 91  MQHTVLYAAEPRGLPLKEKILPQYLNDLGYTSHI 124



 Score = 48.1 bits (113), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 39/170 (22%), Positives = 58/170 (34%), Gaps = 62/170 (36%)

Query: 737 KKKKKKKKKKKKYSNEEEGMRKLRDAASIQCG--PVKEVPCEPQI--APCLFDIKNDPCE 792
           K    K     K        ++LR+ A++ C     +   C      APCLF +++DPC 
Sbjct: 417 KSSSGKAMTGLKMLPSRADQQRLRETATVTCQGQSTQGTSCVATAFSAPCLFHVRDDPC- 475

Query: 793 KNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
                                     ++FN                              
Sbjct: 476 --------------------------EQFN------------------------------ 479

Query: 853 MMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
            + + YP VL  +  EL +IN TAV P N+P D    P  F++ W+ FGD
Sbjct: 480 -LAEQYPKVLDLLMAELQHINSTAVPPSNQPDDPRASPMYFNYTWTNFGD 528


>gi|380012883|ref|XP_003690503.1| PREDICTED: arylsulfatase J-like [Apis florea]
          Length = 671

 Score =  419 bits (1078), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 197/326 (60%), Positives = 244/326 (74%), Gaps = 5/326 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+ ILADDLGWNDV FHG DQIPTPNIDALAY+GIIL N+Y   LCTPSRSA+MTGK+
Sbjct: 28  PHIVIILADDLGWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKN 87

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIH GMQH+VL+  E  GLPLSEK+LP+YL+E+GY+T  VGKWHLG++KKEYTPT+RGF+
Sbjct: 88  PIHLGMQHSVLFPAEPRGLPLSEKLLPEYLREVGYKTHAVGKWHLGYFKKEYTPTYRGFD 147

Query: 180 SHLGYWTGHQDYFDHSAEE-----MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           SH GYW G QDY+ H  +E      +  G DMRR+L  AWD  GKYSTD+FT EAV +I+
Sbjct: 148 SHFGYWNGLQDYYTHITQEPDPVYSEYKGFDMRRNLTVAWDTVGKYSTDLFTNEAVRLIN 207

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
            H+ D+P+FLYLAH A H  N  +  +APD  +    +I D +R   AA++ KLD+SVG 
Sbjct: 208 EHNIDQPMFLYLAHLAPHKGNEEQLFRAPDEEIAKFSYILDPERRIQAAVVSKLDQSVGD 267

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V++AL  R ML NSI+VF+SDNG  + G   N  SN+PLRG+KN+ WEGG RG   IWSP
Sbjct: 268 VMDALRNRGMLENSIVVFMSDNGAPSQGLLSNEGSNYPLRGIKNSPWEGGTRGVAAIWSP 327

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
           L++    V+ Q + +SDWLPTLLSAA
Sbjct: 328 LIKKSKRVSNQMMFMSDWLPTLLSAA 353



 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 62/190 (32%), Positives = 89/190 (46%), Gaps = 28/190 (14%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN---GTSENRS 566
           IDG D+WS L  N+ S R+ ++ NIDD    +A  RG +K +   +  G+   G S + S
Sbjct: 363 IDGYDLWSALVLNKISSRSEVMLNIDDLSDYAAFRRGDFKYIIGRTETGSAWLGASGDPS 422

Query: 567 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQV 626
              S Q +   I           SK    +  I    Q   L + + K  + VK+  Y  
Sbjct: 423 EGISPQYDPHKILY---------SKTGVAISGIITAKQAMELKEERRK--RSVKI--YDT 469

Query: 627 DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKE---VPCEPQIAPCLFDIKND 683
            L    ++  LS           + +LR  A I+C  VKE   +PCEP  APCLF+I+ D
Sbjct: 470 ALKTNFEEKILS--------VENISELRRKAQIKCN-VKEKDRIPCEPMTAPCLFNIEKD 520

Query: 684 PCEKNNLADR 693
           PCE  NLA++
Sbjct: 521 PCEMVNLAEK 530



 Score = 65.9 bits (159), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 86/191 (45%), Gaps = 15/191 (7%)

Query: 615 LVKVVKVMRYQVDLTGGPD---QVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
           L+    V R Q+    G D    + L+ +S R  + L +  L D A+ + G  K +    
Sbjct: 349 LLSAAGVDRRQLGNIDGYDLWSALVLNKISSRSEVMLNIDDLSDYAAFRRGDFKYIIGRT 408

Query: 672 QIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKK 731
           +          DP E   ++ + +  +I +  T V     I      E +E+++K+  K 
Sbjct: 409 ETGSAWLGASGDPSE--GISPQYDPHKILYSKTGVAISGIITAKQAMELKEERRKRSVKI 466

Query: 732 KKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE---VPCEPQIAPCLFDIKN 788
                K   ++K          E + +LR  A I+C  VKE   +PCEP  APCLF+I+ 
Sbjct: 467 YDTALKTNFEEKILSV------ENISELRRKAQIKCN-VKEKDRIPCEPMTAPCLFNIEK 519

Query: 789 DPCEKNNLADR 799
           DPCE  NLA++
Sbjct: 520 DPCEMVNLAEK 530



 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/32 (65%), Positives = 28/32 (87%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
           MQH+VL+  E  GLPLSEK+LP+YL+E+GY+T
Sbjct: 93  MQHSVLFPAEPRGLPLSEKLLPEYLREVGYKT 124


>gi|307191747|gb|EFN75189.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 583

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 200/330 (60%), Positives = 248/330 (75%), Gaps = 4/330 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHII ILADDLGWNDV FHG +QIPTPNIDALAY G++LKN+Y   LCTPSR+A++TGK+
Sbjct: 41  PHIIIILADDLGWNDVSFHGSNQIPTPNIDALAYYGVLLKNHYVAALCTPSRAALLTGKY 100

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIH GMQH  ++  E  GLPL EK+LPQYLK++ Y T IVGKWHLG+YK EYTP +RGF+
Sbjct: 101 PIHLGMQHEAIFPSEPRGLPLEEKLLPQYLKDMNYVTHIVGKWHLGYYKMEYTPLYRGFD 160

Query: 180 SHLGYWTGHQDYFDH-SAEEMKM-WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           +H GYW G QDY+ H +AE   +  G+DMRR+   AWD  GKYS D++T EAV +I+ H+
Sbjct: 161 THFGYWNGLQDYYSHKTAEPYTLNIGMDMRRNFTVAWDTMGKYSVDLYTDEAVRLINTHN 220

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
           TD P+FLYLA  A H+ N  +  QA    +    +I D KR ++AA++ KLDESVGKVVE
Sbjct: 221 TDNPMFLYLAQIAPHAGNANQLPQALPEEIEKFSYIIDPKRKRYAAVVSKLDESVGKVVE 280

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL+ R MLSNSII+F+SDNG   +GF  N  SN+P RG+K TLWEGGVRGA  IWSPL++
Sbjct: 281 ALKNRNMLSNSIILFMSDNGAPTSGFLSNGGSNYPFRGIKKTLWEGGVRGAAAIWSPLIK 340

Query: 358 SRGIVAEQYVHVSDWLPTLLSAA--NKSDI 385
            R  V+   +HV+DWLPTLLSAA  NK +I
Sbjct: 341 KRERVSYHLMHVTDWLPTLLSAAGGNKEEI 370



 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 64/141 (45%), Gaps = 27/141 (19%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG---- 630
           IDG+D+W  +   + S R  +L NIDD     A+  GK+K V+     +   D  G    
Sbjct: 373 IDGLDMWETIVYGDCSPRKEVLINIDDICDSEAIRIGKYKYVR--GESKKNNDWVGDSGR 430

Query: 631 ---------GPDQVY----------LSGLSDREWLALAMRKLRDAASIQCGPVKE--VPC 669
                     P+ V           L G ++    +  M +LR  A + C   +E  V C
Sbjct: 431 NSSEQRPPYSPETVLNSKVGIAIASLPGNNNVALTSAKMLELRQQAEVHCNVTEEEKVQC 490

Query: 670 EPQIAPCLFDIKNDPCEKNNL 690
           + Q+APCLFD++ DPCE  N+
Sbjct: 491 DTQLAPCLFDLERDPCEMINI 511



 Score = 48.1 bits (113), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 28/43 (65%), Gaps = 2/43 (4%)

Query: 756 MRKLRDAASIQCGPVKE--VPCEPQIAPCLFDIKNDPCEKNNL 796
           M +LR  A + C   +E  V C+ Q+APCLFD++ DPCE  N+
Sbjct: 469 MLELRQQAEVHCNVTEEEKVQCDTQLAPCLFDLERDPCEMINI 511



 Score = 44.3 bits (103), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH  ++  E  GLPL EK+LPQYLK++ Y T I+ 
Sbjct: 106 MQHEAIFPSEPRGLPLEEKLLPQYLKDMNYVTHIVG 141


>gi|194756524|ref|XP_001960527.1| GF13402 [Drosophila ananassae]
 gi|190621825|gb|EDV37349.1| GF13402 [Drosophila ananassae]
          Length = 541

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 212/345 (61%), Positives = 246/345 (71%), Gaps = 22/345 (6%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           M F  L +   L       V +S P P+IIFILADDLG+NDVGFHG  QIPTPNIDALAY
Sbjct: 1   MLFLWLFIFLILPQFLKGEVGNSSPKPNIIFILADDLGFNDVGFHGSAQIPTPNIDALAY 60

Query: 94  SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
           SGIIL  YY   +CTPSRSA+MTGK+PIHTGMQH VLY  E  GL L EKILPQYL +LG
Sbjct: 61  SGIILNRYYVTPICTPSRSALMTGKYPIHTGMQHAVLYAAEPRGLSLKEKILPQYLNDLG 120

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y + I GKWHLG +K +YTP FRGF SH                    WGLDMR   E A
Sbjct: 121 YTSHIAGKWHLGHWKLKYTPLFRGFSSH--------------------WGLDMRNGTEVA 160

Query: 214 WDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
           +DLHG+Y+TDV T  AV +I NH +T  PLFLY+AHAA HS+NPY PL  PD+ +    H
Sbjct: 161 YDLHGRYTTDVITDHAVKVIANHNTTSGPLFLYVAHAACHSSNPYNPLPVPDNEVMKLGH 220

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
           I  +KR KFAA++ K+DESVG +V+ L +  ML NSII+F SDNGG A GFNLN ASN+P
Sbjct: 221 IPHYKRRKFAAMVTKMDESVGLIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYP 280

Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
           L+GVKNTLWEGGVR AGLIWSPLL+SR  V+ Q +H+SDWLPTLL
Sbjct: 281 LKGVKNTLWEGGVRAAGLIWSPLLKSRQRVSTQTIHISDWLPTLL 325



 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 54/177 (30%), Positives = 82/177 (46%), Gaps = 27/177 (15%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD- 633
           IDG  +W  L  +EPS R  ++HNIDD W  + L+ G WKLVK       + D   GP  
Sbjct: 341 IDGQSIWQALVHDEPSPRLKVIHNIDDIWGSAGLSVGDWKLVKGTNYYG-KWDGWYGPAG 399

Query: 634 ---------QVYLSGLSDREWLALAM-------RKLRDAASIQCGPVKEVP----CEPQI 673
                    Q      S +    L M       ++LR AA+++C  + + P    C+   
Sbjct: 400 ERDPHLYDWQQIPKSRSGKALNGLKMLPSQAEQQRLRAAATVKC--LGQSPQGSSCKATA 457

Query: 674 --APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
             APCLF+I +DPCE+ NLA++   + +     E+   N  A P   +  +++   K
Sbjct: 458 FSAPCLFNIHDDPCEQFNLAEQYP-KVLKALMEELKYQNSTAVPPANKPGDRRASPK 513



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 30/49 (61%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
           IDG  +W  L  +EPS R  ++HNIDD W  + L+ G WKLVK  +  G
Sbjct: 341 IDGQSIWQALVHDEPSPRLKVIHNIDDIWGSAGLSVGDWKLVKGTNYYG 389



 Score = 47.4 bits (111), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 22/34 (64%), Positives = 24/34 (70%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
           MQH VLY  E  GL L EKILPQYL +LGY + I
Sbjct: 92  MQHAVLYAAEPRGLSLKEKILPQYLNDLGYTSHI 125



 Score = 45.4 bits (106), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 30/49 (61%)

Query: 854 MKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
           + + YP VL  + +EL   N TAV P NKP D+   PK +++ W+ FGD
Sbjct: 476 LAEQYPKVLKALMEELKYQNSTAVPPANKPGDRRASPKYWNYTWTNFGD 524


>gi|383859596|ref|XP_003705279.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
          Length = 689

 Score =  414 bits (1063), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 192/326 (58%), Positives = 244/326 (74%), Gaps = 5/326 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHII ILADDLGWNDV FHG DQIPTPNIDA+AY+GIIL ++Y   LCTPSR+A+MTGK+
Sbjct: 59  PHIIIILADDLGWNDVSFHGSDQIPTPNIDAIAYNGIILNSHYVAALCTPSRTALMTGKN 118

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIH GMQH+VL   E  GLPLSEK+LP+YL+E+GYRT  VGKWHLG++++EYTPTFRGF+
Sbjct: 119 PIHLGMQHSVLLPSEPRGLPLSEKLLPEYLREVGYRTHAVGKWHLGYFRREYTPTFRGFD 178

Query: 180 SHLGYWTGHQDYFDHSAEE-----MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           +H GYW G QDY+ H  +E      +  GLDMRR+L  AWD  GKYSTD+FT EAV +I+
Sbjct: 179 THFGYWNGLQDYYTHITQEPDPQFGEFMGLDMRRNLTAAWDTQGKYSTDLFTEEAVRLIN 238

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
            H  D+P+FLYLAH A H  NP   L+A D  +    +I D +R   AA++ K+D+SVG+
Sbjct: 239 EHDKDDPMFLYLAHLAPHKGNPNRLLRASDEDIARFSYILDPERRIQAAVVSKVDQSVGE 298

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V++AL +R ML NSI++F+SDNG    G   N  SN+PLRG+K++ WEGG RG   IWSP
Sbjct: 299 VMDALRKRGMLENSIVLFMSDNGAPTQGSLSNQGSNYPLRGIKDSPWEGGTRGVAAIWSP 358

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
           L++    V+ Q + ++DWLPTLLSAA
Sbjct: 359 LIKESQRVSNQMMFIADWLPTLLSAA 384



 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 52/168 (30%), Positives = 79/168 (47%), Gaps = 43/168 (25%)

Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWK-LVKVVKVMRYQV 626
           D+     IDG D+W  L  ++ S R+ ++ NIDD  + +++ +G +K ++   +  R  V
Sbjct: 387 DSKQLGNIDGFDLWPALRSDKISPRSDVVLNIDDISKYASIRRGDFKYVIGETETGRAWV 446

Query: 627 DLTGGPDQ-----------------VYLSGL------------------SDREWLALAMR 651
             TG P +                 V +SG+                   D E   L+  
Sbjct: 447 GATGAPSEGIPPKYDPDKVLYSKAGVAISGVLTAKQAKELKKKKNTSSKIDFEEKILSAE 506

Query: 652 K---LRDAASIQCGPVKE---VPCEPQIAPCLFDIKNDPCEKNNLADR 693
           K   LR+ A +QC  VKE   +PC+P  APCLF+I+ DPCE  NLAD+
Sbjct: 507 KIFQLRNDARVQCN-VKEENRIPCDPTAAPCLFNIEKDPCEMVNLADK 553



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/49 (53%), Positives = 35/49 (71%), Gaps = 4/49 (8%)

Query: 754 EGMRKLRDAASIQCGPVKE---VPCEPQIAPCLFDIKNDPCEKNNLADR 799
           E + +LR+ A +QC  VKE   +PC+P  APCLF+I+ DPCE  NLAD+
Sbjct: 506 EKIFQLRNDARVQCN-VKEENRIPCDPTAAPCLFNIEKDPCEMVNLADK 553



 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 22/32 (68%), Positives = 27/32 (84%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
           MQH+VL   E  GLPLSEK+LP+YL+E+GYRT
Sbjct: 124 MQHSVLLPSEPRGLPLSEKLLPEYLREVGYRT 155


>gi|195333848|ref|XP_002033598.1| GM21416 [Drosophila sechellia]
 gi|194125568|gb|EDW47611.1| GM21416 [Drosophila sechellia]
          Length = 542

 Score =  411 bits (1056), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 200/322 (62%), Positives = 239/322 (74%), Gaps = 21/322 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+IIFILADDLG+NDVGFHG  +IPTPNIDALAYSGIIL  YY   +CTPSRSA+MTGK+
Sbjct: 26  PNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKY 85

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTGMQH VLY  E  GLPL EKILPQYL ELGY + I GKWHLG +K +YTP +RGF 
Sbjct: 86  PIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFS 145

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
           SH                    WGLDMR   + A+DLHG Y+TDV T ++V +I NH +T
Sbjct: 146 SH--------------------WGLDMRNGTQVAYDLHGHYTTDVITDQSVKVIANHNAT 185

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
             PLFLY+AHAA HS+NPY PL  PD+ +    HI ++KR KFAA++ K+D+SVG++V+ 
Sbjct: 186 KGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDDSVGQIVDQ 245

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L +  ML NSII+F SDNGG A GFNLN ASN+PL+GVKNTLWEGGVR AGL+WSPLL+ 
Sbjct: 246 LRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKK 305

Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
              V+ Q +H+ DWLPTLL AA
Sbjct: 306 SQRVSNQTMHIVDWLPTLLEAA 327



 Score = 75.1 bits (183), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 54/176 (30%), Positives = 82/176 (46%), Gaps = 23/176 (13%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
           +IDG  +W  L  ++ S R  +LHNIDD W  +AL+ G WKLVK     R   D   GP 
Sbjct: 339 QIDGQSIWRALVHDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTN-YRGSWDGWYGPA 397

Query: 634 ----------QVYLSGLSDREWLALAM-------RKLRDAASIQC--GPVKEVPCEPQI- 673
                     Q+     + +   AL M       +++R AA++ C     +   C     
Sbjct: 398 GERDPRLYDWQLVGRSRAGKALEALKMLPSRADQQRIRAAATVSCPGQSSQGTSCVATAF 457

Query: 674 -APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
            APCLF I++DPCE+ NLA +   + ++    E+ RFN  A P   +  + +   +
Sbjct: 458 SAPCLFHIRDDPCEQYNLA-KQYPEVVSALMAELERFNATAVPPSNKPADPRADPR 512



 Score = 52.8 bits (125), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 24/34 (70%), Positives = 25/34 (73%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
           MQH VLY  E  GLPL EKILPQYL ELGY + I
Sbjct: 91  MQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHI 124



 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 30/49 (61%)

Query: 854 MKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
           + K YP+V+S +  EL   N TAV P NKP D   DP+ ++  W+ FGD
Sbjct: 475 LAKQYPEVVSALMAELERFNATAVPPSNKPADPRADPRFWNFTWTNFGD 523


>gi|195582835|ref|XP_002081231.1| GD10911 [Drosophila simulans]
 gi|194193240|gb|EDX06816.1| GD10911 [Drosophila simulans]
          Length = 633

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 200/322 (62%), Positives = 238/322 (73%), Gaps = 21/322 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+IIFILADDLG+NDVGFHG  +IPTPNIDALAYSGIIL  YY   +CTPSRSA+MTGK+
Sbjct: 26  PNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKY 85

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTGMQH VLY  E  GLPL EKILPQYL ELGY + I GKWHLG +K +YTP +RGF 
Sbjct: 86  PIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFS 145

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
           SH                    WGLDMR   + A+DLHG Y+TDV T  +V +I NH +T
Sbjct: 146 SH--------------------WGLDMRNGTQVAYDLHGHYTTDVITEHSVKVIANHNAT 185

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
             PLFLY+AHAA HS+NPY PL  PD+ +    HI ++KR KFAA++ K+D+SVG++V+ 
Sbjct: 186 KGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDDSVGQIVDQ 245

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L +  ML NSII+F SDNGG A GFNLN ASN+PL+GVKNTLWEGGVR AGL+WSPLL+ 
Sbjct: 246 LRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKK 305

Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
              V+ Q +H+ DWLPTLL AA
Sbjct: 306 SQRVSNQTMHIVDWLPTLLEAA 327



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/34 (70%), Positives = 25/34 (73%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
           MQH VLY  E  GLPL EKILPQYL ELGY + I
Sbjct: 91  MQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHI 124



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 40/76 (52%), Gaps = 6/76 (7%)

Query: 484 IPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL 543
           +P L  +  G     N S       +IDG  +W  L +++ S R  +LHNIDD W  +A+
Sbjct: 320 LPTLLEAAGGQPALANLSK------QIDGQSIWRALVQDKASPRLNVLHNIDDIWGSAAI 373

Query: 544 TRGKWKLVKENSINGN 559
           + G WKLVK  +  G+
Sbjct: 374 SVGDWKLVKGTNYRGS 389



 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 30/44 (68%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVK 617
           +IDG  +W  L +++ S R  +LHNIDD W  +A++ G WKLVK
Sbjct: 339 QIDGQSIWRALVQDKASPRLNVLHNIDDIWGSAAISVGDWKLVK 382


>gi|156552077|ref|XP_001604760.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 710

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/332 (56%), Positives = 244/332 (73%), Gaps = 5/332 (1%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PP+I+ I+ADD+GWNDV FHG DQIPTPNIDALAY+G+IL ++Y   LCTPSRSA++TGK
Sbjct: 43  PPNIVMIIADDMGWNDVSFHGSDQIPTPNIDALAYNGVILNSHYVSALCTPSRSALLTGK 102

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +PIHTGMQH V+   E  GLPL EKILPQYLKE GY T  +GKWH GF+++EYTPT+RGF
Sbjct: 103 YPIHTGMQHLVILEAEPRGLPLHEKILPQYLKEAGYATHAIGKWHQGFHRREYTPTYRGF 162

Query: 179 ESHLGYWTGHQDYFDH----SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           +SH GYW G QDY+ H    S  +    G DMRR++  A D +GKYSTD+FT EAV +I 
Sbjct: 163 DSHFGYWQGLQDYYTHEVGSSNPKEGFLGFDMRRNMSLARDTYGKYSTDLFTDEAVRLIE 222

Query: 235 NHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
            H  +  P+FLYLAH A HS N  EPLQAPD  +    ++ED +R  +AA++ KLD+SVG
Sbjct: 223 EHRPEAGPMFLYLAHLAPHSGNDNEPLQAPDEEVAKFSYVEDPERRIYAAMMSKLDQSVG 282

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
           +VV AL ++ ML NSI+VF++DNG A  G + N  SN+PLRG+K + WEG VRGA  +WS
Sbjct: 283 EVVSALRRKNMLQNSIVVFMADNGAATQGIHYNRGSNYPLRGIKASAWEGAVRGAAAVWS 342

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
           PL++    +  + + ++DWLPTLLSA+   D+
Sbjct: 343 PLIQRPKRIYNELMSIADWLPTLLSASGLRDV 374



 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 65/190 (34%), Positives = 87/190 (45%), Gaps = 30/190 (15%)

Query: 510 IDGIDVWSVLSRNEPSK-RNTILHNIDDEWQISALTRGKWKLVKENSINGNG--TSENRS 566
           IDG+D W  +S   PS  RN IL NID  +  SAL RG++K V     NG        R 
Sbjct: 381 IDGVDQWPAISGVAPSPPRNEILVNIDPIFNYSALRRGEFKYVLGTVGNGEEWYGETGRP 440

Query: 567 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS---ALTKGKWKLVKVVKVMR 623
            +   +      D  +VL     SK  T +  +    Q+S   A+ K  +  VK      
Sbjct: 441 ENQGLEGASPTYDPETVLM----SKAGTAIAGLLTAKQVSEIRAVRKDTYGRVK------ 490

Query: 624 YQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCG-PVKE-VPCEPQIAPCLFDIK 681
                    D++  S L   + L     KLR +AS++C  P  E V C P  +PCLF+IK
Sbjct: 491 --------DDEMATSKLLTVDELL----KLRSSASLRCTVPESERVACHPLQSPCLFNIK 538

Query: 682 NDPCEKNNLA 691
            DPCE+ NLA
Sbjct: 539 EDPCEQRNLA 548



 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 43/77 (55%), Gaps = 3/77 (3%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCG-PVKE-VPCEPQIA 780
           K+  + +  +K    + K  +    K    +E + KLR +AS++C  P  E V C P  +
Sbjct: 473 KQVSEIRAVRKDTYGRVKDDEMATSKLLTVDE-LLKLRSSASLRCTVPESERVACHPLQS 531

Query: 781 PCLFDIKNDPCEKNNLA 797
           PCLF+IK DPCE+ NLA
Sbjct: 532 PCLFNIKEDPCEQRNLA 548



 Score = 45.8 bits (107), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 22/32 (68%), Positives = 23/32 (71%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
           MQH V+   E  GLPL EKILPQYLKE GY T
Sbjct: 109 MQHLVILEAEPRGLPLHEKILPQYLKEAGYAT 140


>gi|194883566|ref|XP_001975872.1| GG20328 [Drosophila erecta]
 gi|190659059|gb|EDV56272.1| GG20328 [Drosophila erecta]
          Length = 478

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 200/322 (62%), Positives = 236/322 (73%), Gaps = 21/322 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+IIFILADDLG+NDVGFHG   IPTPNIDALAYSGIIL  YY   +CTPSRSA+MTGK+
Sbjct: 26  PNIIFILADDLGFNDVGFHGSADIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKY 85

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTGMQH VLY  E  GLPL EKILPQYL ELGY + I GKWHLG +K +YTP +RGF 
Sbjct: 86  PIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYASHIAGKWHLGHWKLKYTPLYRGFS 145

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
           SH                    WGLDMR   E A+DLHG Y+TDV T  +V +I +H +T
Sbjct: 146 SH--------------------WGLDMRNGTEVAYDLHGHYTTDVITDHSVKVIASHNAT 185

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
             PLFLY+AHAA HS+NPY PL  PD+ +    HI  +KR KFAA++ K+D+SVG++V+ 
Sbjct: 186 KGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMAHIPHYKRRKFAAMVSKMDDSVGQIVDQ 245

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L +  ML NSII+F SDNGG A GFNLN ASN+PL+GVKNTLWEGGVR AGL+WSPLL+ 
Sbjct: 246 LRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKK 305

Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
              V+ Q +H+ DWLPTLL AA
Sbjct: 306 SQRVSNQTMHIIDWLPTLLEAA 327



 Score = 73.2 bits (178), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 61/212 (28%), Positives = 91/212 (42%), Gaps = 59/212 (27%)

Query: 484 IPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL 543
           +P L  +  G     N S       +IDG  +W  L +++ S R  +LHNIDD W  +AL
Sbjct: 320 LPTLLEAAGGPAAVANLSK------QIDGQSIWRALVQDKASPRLNVLHNIDDIWGSAAL 373

Query: 544 TRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEW 603
           + G WKLVK       GT+   S D  Y                   +RN  L+    +W
Sbjct: 374 SVGDWKLVK-------GTNYRGSWDGWY---------------GPAGERNPRLY----DW 407

Query: 604 QISALTKGKWKLVKVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGP 663
           Q+ A ++   K ++ +K++  Q D                       +++R AA++ C  
Sbjct: 408 QMVARSRAG-KALEALKMLPSQAD----------------------QQRIRAAATVSCPA 444

Query: 664 --VKEVPCEPQI--APCLFDIKNDPCEKNNLA 691
              K   C      APCLF I++DPCE+ NLA
Sbjct: 445 QSSKGTSCLATAFSAPCLFHIRDDPCEQYNLA 476



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/34 (70%), Positives = 25/34 (73%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
           MQH VLY  E  GLPL EKILPQYL ELGY + I
Sbjct: 91  MQHTVLYAAEPRGLPLEEKILPQYLNELGYASHI 124


>gi|195057745|ref|XP_001995315.1| GH22700 [Drosophila grimshawi]
 gi|193899521|gb|EDV98387.1| GH22700 [Drosophila grimshawi]
          Length = 542

 Score =  406 bits (1043), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 204/341 (59%), Positives = 240/341 (70%), Gaps = 24/341 (7%)

Query: 41  PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
           P++F L      +   S  P+IIFILADDLG+NDVGF G  QIPTPNIDALAYSG+IL  
Sbjct: 19  PISFAL------VACQSTQPNIIFILADDLGFNDVGFRGSAQIPTPNIDALAYSGLILNR 72

Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
           YY   +CTPSRSA+MTGK+PIHTGMQH VLY  E  GLPL  KILPQYL ELGY + I G
Sbjct: 73  YYVNPICTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLDLKILPQYLNELGYTSHIAG 132

Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
           KWHLG +K+ YTP +RGF SH GY                 WGLDMR   E A+DLHG+Y
Sbjct: 133 KWHLGHWKRVYTPLYRGFSSHHGY-----------------WGLDMRNGTEIAYDLHGQY 175

Query: 221 STDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
           +TDV T  ++ +I NH   + PLFLY+AHAA HS NPY PL A D  +     I+ +KR 
Sbjct: 176 TTDVITQHSLQVIANHKPAKGPLFLYVAHAAVHSGNPYNPLPASDDAVRRLDKIQHYKRR 235

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
           K+AA++  +DESVGK+VE L + RML NSIIVF +DNGG A GFN N ASN+PLRGVKNT
Sbjct: 236 KYAALVTAMDESVGKIVEQLRKSRMLENSIIVFSTDNGGPAEGFNSNFASNYPLRGVKNT 295

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           LWEGGVRGA L+WSP L  R   AEQ +H+ DWLPTL+ AA
Sbjct: 296 LWEGGVRGAALLWSPQLTKRPRTAEQLMHMVDWLPTLVEAA 336



 Score = 80.5 bits (197), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 55/175 (31%), Positives = 82/175 (46%), Gaps = 28/175 (16%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
           ++DG+ +W  L RNE S RN+ILHNIDD W  +AL+ G WKL   V+   Y     G   
Sbjct: 349 KLDGVSLWQALVRNEASPRNSILHNIDDIWGSAALSVGVWKL---VQGSHYNGSWDGWYG 405

Query: 634 QVYLSGLSDREWLAL-------AMR------------KLRDAASIQC-----GPVKEVPC 669
              L    D  W  +       AM+            +LR AA++ C      P  ++ C
Sbjct: 406 PAGLRNPHDYNWQLVTKCAAGQAMQQLKMLPQQADIIRLRKAATVSCKEAKVKPSPDIAC 465

Query: 670 EPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKK 724
            P    CLF++ +DPCE+ NLA +   + +     E+  FN  A P   + ++ +
Sbjct: 466 NPLKEICLFNVLDDPCEQRNLA-KQHPKVLALLLNELQLFNATAIPPANKPDDPR 519



 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 23/34 (67%), Positives = 24/34 (70%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
           MQH VLY  E  GLPL  KILPQYL ELGY + I
Sbjct: 97  MQHTVLYAAEPRGLPLDLKILPQYLNELGYTSHI 130



 Score = 43.5 bits (101), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 20/66 (30%), Positives = 35/66 (53%), Gaps = 5/66 (7%)

Query: 737 KKKKKKKKKKKKYSNEEEGMRKLRDAASIQCG-----PVKEVPCEPQIAPCLFDIKNDPC 791
           K    +  ++ K   ++  + +LR AA++ C      P  ++ C P    CLF++ +DPC
Sbjct: 422 KCAAGQAMQQLKMLPQQADIIRLRKAATVSCKEAKVKPSPDIACNPLKEICLFNVLDDPC 481

Query: 792 EKNNLA 797
           E+ NLA
Sbjct: 482 EQRNLA 487



 Score = 43.1 bits (100), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 34/53 (64%)

Query: 850 KKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
           +++ + K +P VL+ +  EL   N TA+ P NKP D  GDP+ ++H W+ FGD
Sbjct: 482 EQRNLAKQHPKVLALLLNELQLFNATAIPPANKPDDPRGDPRLWNHTWTNFGD 534


>gi|291244830|ref|XP_002742299.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
           partial [Saccoglossus kowalevskii]
          Length = 559

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 188/336 (55%), Positives = 244/336 (72%), Gaps = 2/336 (0%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           +A+S  PHII I+ADDLGW+DV FHG DQIPTPNID LAYSG++L NYY   +CTP+R+A
Sbjct: 22  LATSSQPHIIVIIADDLGWDDVSFHGSDQIPTPNIDELAYSGVLLHNYYVQPICTPTRAA 81

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           +MTG+HPIH G+Q  V+      GLPL+E I+PQYLK LGY T IVGKWHLGF+  +YTP
Sbjct: 82  LMTGRHPIHLGLQDGVIVASHPYGLPLNETIMPQYLKPLGYDTHIVGKWHLGFFAWQYTP 141

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
            +RGF++H GY+ G + Y+DH+AEE K  GLD R + E     +G+YST++FT+ A  II
Sbjct: 142 LYRGFDTHFGYYNGEEGYYDHTAEEPKYIGLDFRNNTELFKSAYGEYSTELFTSYAEKII 201

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
           HNH+ ++PLFLYLAH A HS N Y PL+AP  Y +   +I+D +R  FA ++  LD+SVG
Sbjct: 202 HNHNKNKPLFLYLAHQAVHSGNSYSPLEAPYKYTSRFPYIQDERRRTFAGMVSALDDSVG 261

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
            +  AL+   +  N II+F +DNGG AAGF+ N ASNWPLRG+K+TLWEGGVRG G I S
Sbjct: 262 NITRALKHSGLYENCIILFSTDNGGPAAGFDANYASNWPLRGIKHTLWEGGVRGDGFIHS 321

Query: 354 PLLESRGIVAEQYVHVSDWLPTL--LSAANKSDIPN 387
           PLLE  G ++   +HV DWLPT+  ++  N S +P 
Sbjct: 322 PLLEKPGRISTDMIHVCDWLPTIYHIAGGNVSSLPT 357



 Score = 44.3 bits (103), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 38/75 (50%), Gaps = 9/75 (12%)

Query: 502 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTRGKWKLVKENSINGN 559
           N +S    +DG +VW  LSR  PS R  ILHNID       +AL  G++K++        
Sbjct: 351 NVSSLPTNLDGFNVWDTLSRGTPSPRTEILHNIDPHKNHNAAALRVGEFKIIL------- 403

Query: 560 GTSENRSNDNSYQNE 574
           GT E+   D  Y+ E
Sbjct: 404 GTEEHGEWDGWYEPE 418



 Score = 43.1 bits (100), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 2/52 (3%)

Query: 567 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTKGKWKLV 616
           N +S    +DG +VW  LSR  PS R  ILHNID       +AL  G++K++
Sbjct: 351 NVSSLPTNLDGFNVWDTLSRGTPSPRTEILHNIDPHKNHNAAALRVGEFKII 402


>gi|195485249|ref|XP_002091013.1| GE12487 [Drosophila yakuba]
 gi|194177114|gb|EDW90725.1| GE12487 [Drosophila yakuba]
          Length = 544

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 199/322 (61%), Positives = 236/322 (73%), Gaps = 21/322 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+IIFILADDLG+NDVGFHG  +IPTPNIDALAYSGIIL  YY   +CTPSRSA+MTGK+
Sbjct: 26  PNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKY 85

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTGMQH VLY  E  GLPL EKILPQYL ELGY + I GKWHLG +K +YTP  RGF 
Sbjct: 86  PIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLHRGFS 145

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
           SH                    WGLDMR   + A+DLHG Y+TDV T  +V +I +H +T
Sbjct: 146 SH--------------------WGLDMRNGTQVAYDLHGHYTTDVITDHSVKVIASHNAT 185

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
             PLFLY+AHAA HS+NPY PL  PD+ +    HI  +KR KFAA++ K+D+SVG++V+ 
Sbjct: 186 KGPLFLYVAHAACHSSNPYNPLPVPDNDVLKMSHIPHYKRRKFAAMVSKMDDSVGQIVDQ 245

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L +  ML NSII+F SDNGG A GFNLN ASN+PL+GVKNTLWEGGVR AGL+WSPLL+ 
Sbjct: 246 LRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKK 305

Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
              V+ Q +H+ DWLPTLL AA
Sbjct: 306 SQRVSNQTMHIIDWLPTLLEAA 327



 Score = 79.3 bits (194), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 57/177 (32%), Positives = 84/177 (47%), Gaps = 25/177 (14%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
           +IDG  +W  L +++ S R  +LHNIDD W  +AL+ G WKLVK     R   D   GP 
Sbjct: 339 QIDGQSIWRALVQDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTN-YRGSWDGWYGPA 397

Query: 634 ----------QVYLSGLSDREWLALAM-------RKLRDAASIQCGPVKE---VPCEPQI 673
                     Q+       +   AL M       +++R AA++ C P +      C    
Sbjct: 398 GERDPRQYDWQLVAKSRVGKALDALKMLPSRADQQRIRAAATVTC-PAQSSQGTSCLATA 456

Query: 674 --APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
             APCLF I++DPCE+ NLA +   + +N   TE+ RFN  A P   +  + +   +
Sbjct: 457 FSAPCLFHIRDDPCEQYNLA-KQYPEVVNALMTELERFNATAVPPSNKPADPRADPR 512



 Score = 52.8 bits (125), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 24/34 (70%), Positives = 25/34 (73%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
           MQH VLY  E  GLPL EKILPQYL ELGY + I
Sbjct: 91  MQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHI 124



 Score = 48.5 bits (114), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 31/49 (63%)

Query: 854 MKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
           + K YP+V++ +  EL   N TAV P NKP D   DP+ +++ W+ FGD
Sbjct: 475 LAKQYPEVVNALMTELERFNATAVPPSNKPADPRADPRFWNYTWTNFGD 523


>gi|242025556|ref|XP_002433190.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212518731|gb|EEB20452.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 570

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 192/343 (55%), Positives = 248/343 (72%), Gaps = 1/343 (0%)

Query: 39  VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           +L +  TLS+ F ++V     P+II ILADDLGWNDV FHG +QI TPNIDALAY+GIIL
Sbjct: 26  ILLIFITLSIFFQNVVDGIERPNIIIILADDLGWNDVSFHGSNQIQTPNIDALAYNGIIL 85

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
            ++Y   LCTPSR+++MTGK+P   GMQH V+   E  GLPL+E ++P+Y  + GY T  
Sbjct: 86  NSHYVPALCTPSRASLMTGKYPTSLGMQHLVILSPEPWGLPLNETLMPEYFNKNGYATHA 145

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
           VGKWHLGF+KKEYTP +RGF+SH G+W G QDY+DH+     + G DMRR+ E  +   G
Sbjct: 146 VGKWHLGFFKKEYTPIYRGFDSHFGHWNGFQDYYDHTTMSDSLKGYDMRRNFEVDYSYQG 205

Query: 219 KYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
            Y+TDVFT EA+ II NH++ + PLFLYL+H A HS NP  P QAP+  ++ H  I D  
Sbjct: 206 MYTTDVFTKEAIKIIDNHNSQKGPLFLYLSHLAPHSGNPDNPFQAPEDEISKHECINDPG 265

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           R  +AA++ KLDESVG+VV ALE+ +ML+NSII+F+SDNG A  G + N  SN+PLRG+K
Sbjct: 266 RKIYAAMVTKLDESVGQVVSALEKNKMLNNSIIIFMSDNGAATYGLHSNRGSNYPLRGLK 325

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            + WEGGVRG   IWSP L     V++Q +H+SDWLPTLL+AA
Sbjct: 326 ESPWEGGVRGTAAIWSPFLNKTKRVSKQLMHMSDWLPTLLTAA 368



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 48/190 (25%), Positives = 81/190 (42%), Gaps = 48/190 (25%)

Query: 508 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSN 567
           N+IDGID+W+VLS + PS R  + +N D+    S+L    WK V+       GT++    
Sbjct: 379 NKIDGIDMWNVLSNDLPSPRKEVFNNYDEIENYSSLMIDSWKYVE-------GTAQEGKA 431

Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVD 627
           D  ++               EPS+ N   + + +E                  + R + D
Sbjct: 432 DYWFE---------------EPSRNNCSEYRVSNE-----------------DIFRLRRD 459

Query: 628 LTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAP-CLFDIKNDPCE 686
            T   D    S        +L++ +  +   ++      + C+P +   CLF++ +DPCE
Sbjct: 460 STIICDNPTFSS-------SLSITR-NNHTDVKNKTKYVLTCDPLLKRFCLFNLNDDPCE 511

Query: 687 KNNLADRSED 696
           + NLAD   D
Sbjct: 512 RLNLADVFPD 521


>gi|195431744|ref|XP_002063888.1| GK15669 [Drosophila willistoni]
 gi|194159973|gb|EDW74874.1| GK15669 [Drosophila willistoni]
          Length = 556

 Score =  400 bits (1029), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 199/351 (56%), Positives = 244/351 (69%), Gaps = 20/351 (5%)

Query: 44  FTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY 102
           + L ++ + + +   P PHII+ILADDLG+NDVGFHG  QIPTPNIDALAYSG+IL  YY
Sbjct: 9   WLLPLMVISVSSQQSPKPHIIYILADDLGFNDVGFHGSAQIPTPNIDALAYSGLILNRYY 68

Query: 103 TVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKW 162
              +CTPSRS++MTGK+ IHTGMQH VLYG E  GLPL EK+LPQYL +LGY + I GKW
Sbjct: 69  VAPICTPSRSSLMTGKYAIHTGMQHTVLYGAEPRGLPLEEKLLPQYLNDLGYTSHIAGKW 128

Query: 163 HLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYST 222
           HLG +K  YTP  RGF SH G+                 WGLDMR     A++LHG+Y+T
Sbjct: 129 HLGHWKMPYTPLRRGFNSHHGF-----------------WGLDMRNGSRVAYELHGQYTT 171

Query: 223 DVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
           DV T  ++D+I NH   + PLFLY+AHAA HS NPY PL   D  +     IE +KR ++
Sbjct: 172 DVITQHSIDVIANHPVSKGPLFLYVAHAAAHSGNPYNPLPVADDEVIKLGRIEHYKRRRY 231

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
           AA++ KLDESVG +V+ L +  ML NSIIVF +DNGGAA GFNLN ASN+PLRGVKNTLW
Sbjct: 232 AAMVSKLDESVGIIVDQLRRSNMLENSIIVFATDNGGAAEGFNLNFASNYPLRGVKNTLW 291

Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS-AANKSDIPNYVNS 391
           EGGVRG GL+WS     R  VAEQ +H++DWLPT +  A  K  + N+  S
Sbjct: 292 EGGVRGVGLLWSQRYLKRPRVAEQTMHITDWLPTFVEVAGGKEALANFTTS 342



 Score = 85.9 bits (211), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 58/180 (32%), Positives = 86/180 (47%), Gaps = 24/180 (13%)

Query: 571 YQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG 630
           +   +DG  +W  L ++E S R ++LHNIDD W  SALT G WKL+K     +   D   
Sbjct: 339 FTTSLDGQSIWQALVQDEASPRKSVLHNIDDIWNSSALTVGDWKLIKGTN-YKGAWDGWY 397

Query: 631 GPD-----QVY--------LSGLSDREWLALAMR----KLRDAASIQC-----GPVKEVP 668
           GP      Q+Y        L+G +  +   L  R    +LR AA++ C        K   
Sbjct: 398 GPAGERDPQLYNWQQVAKSLAGKAMLQLKVLPSRADQQRLRSAATVNCPNQQSNTQKGSV 457

Query: 669 CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
           C P   PCLF++++DPCE+ NLA +     +N   +E+  FN  A P     ++ +   K
Sbjct: 458 CNPLQTPCLFNLRDDPCEQYNLATQYP-LVLNSLLSELKHFNSTAIPPSNLPDDPRGNPK 516



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/34 (67%), Positives = 26/34 (76%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
           MQH VLYG E  GLPL EK+LPQYL +LGY + I
Sbjct: 91  MQHTVLYGAEPRGLPLEEKLLPQYLNDLGYTSHI 124



 Score = 46.2 bits (108), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 38/175 (21%), Positives = 62/175 (35%), Gaps = 63/175 (36%)

Query: 733 KKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQC-----GPVKEVPCEPQIAPCLFDIK 787
           ++  K    K   + K        ++LR AA++ C        K   C P   PCLF+++
Sbjct: 411 QQVAKSLAGKAMLQLKVLPSRADQQRLRSAATVNCPNQQSNTQKGSVCNPLQTPCLFNLR 470

Query: 788 NDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKK 847
           +D                           P +++N                         
Sbjct: 471 DD---------------------------PCEQYN------------------------- 478

Query: 848 KKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
                 +   YP VL+ +  EL + N TA+ P N P D  G+PK +++ W+ FGD
Sbjct: 479 ------LATQYPLVLNSLLSELKHFNSTAIPPSNLPDDPRGNPKFWNYTWTNFGD 527


>gi|443694453|gb|ELT95582.1| hypothetical protein CAPTEDRAFT_115907 [Capitella teleta]
          Length = 561

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/360 (54%), Positives = 257/360 (71%), Gaps = 17/360 (4%)

Query: 30  YRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNID 89
           +R R +AF +L L          + A+S  P+I+FILADDLGWNDVGFHG +Q+ TPN+D
Sbjct: 4   HRFRCVAFLLLQL--------TAVSATSPRPNIVFILADDLGWNDVGFHGSEQVLTPNLD 55

Query: 90  ALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
           ALAY G+IL+NYY   +CTPSR+A+MTG+HPIHTGMQ NV+Y  E  GL L+E I PQYL
Sbjct: 56  ALAYDGVILENYYVQPICTPSRAALMTGRHPIHTGMQQNVIYSAEPYGLGLNEIIFPQYL 115

Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH---SAEEMKM---WG 203
           K++GY+T IVGKWHLGF+ ++YTP  RGF+SH GY+ G +DY+ H   +A E+     WG
Sbjct: 116 KQIGYKTHIVGKWHLGFFAEQYTPIERGFDSHYGYYMGAEDYWVHIAGNAHEVSFNASWG 175

Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY---EPL 260
           LD RR+ E      G+YST++FT EA +II +H+  EPLF+Y+A  A HSANPY     L
Sbjct: 176 LDFRRNGEVVKTAFGQYSTELFTTEAENIIASHNQSEPLFMYVAQQAVHSANPYTGDAEL 235

Query: 261 QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
           +AP  Y     HI++ KR KFAA++  LD+SVG + +AL  + +L N+II F +DNGG A
Sbjct: 236 EAPFKYYEKFPHIKNEKRRKFAAMVSALDDSVGNITKALHAKGILENTIICFSTDNGGPA 295

Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           +GF+ N ASNWPLRGVK +LWEGGVRG G +WSPL+++ G  +   + + D+LPTLL AA
Sbjct: 296 SGFDFNDASNWPLRGVKESLWEGGVRGTGFVWSPLMKTSGYTSPAMMQIFDFLPTLLKAA 355



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 49/169 (28%), Positives = 69/169 (40%), Gaps = 35/169 (20%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
           ++DGID W ++S  + + R  +LHNID+     AL   + K+V           L    D
Sbjct: 365 DLDGIDQWEIISERKDAVRTEMLHNIDNTEGTRALRINEMKVVVSQGGSTRSWPLWYSTD 424

Query: 634 QVY---------LSGLSDREW-----------------LALAMRKLRDAAS--IQCGPVK 665
           Q Y         +S     +W                 L    R LR      I CGPV 
Sbjct: 425 QFYGGYNHTDHSVSADLSAKWDIDSEVAPRVNSDLTSVLRNLGRPLRSGTPLKIHCGPVP 484

Query: 666 EVP---CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQ 711
                 C+    PCLF+I  DPCE +NLA    D+R +  +T + R +Q
Sbjct: 485 PSARGSCDVGKKPCLFNITADPCEFHNLA----DERPDMLSTMLDRLDQ 529



 Score = 45.8 bits (107), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 26/36 (72%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQ NV+Y  E  GL L+E I PQYLK++GY+T I+ 
Sbjct: 91  MQQNVIYSAEPYGLGLNEIIFPQYLKQIGYKTHIVG 126


>gi|195380485|ref|XP_002049001.1| GJ21349 [Drosophila virilis]
 gi|194143798|gb|EDW60194.1| GJ21349 [Drosophila virilis]
          Length = 531

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 196/319 (61%), Positives = 230/319 (72%), Gaps = 18/319 (5%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHIIFILADDLG+NDVGF G  QIPTPNIDALAYSG+IL  YY   +CTPSRSA+MTGK+
Sbjct: 21  PHIIFILADDLGFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTGKY 80

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTGMQH VLY  E  GLPL  KILPQYL +LGY + I GKWHLG +++ YTP +RGF 
Sbjct: 81  PIHTGMQHTVLYAAEPRGLPLDLKILPQYLNDLGYTSHIAGKWHLGHWQRVYTPLYRGFS 140

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
           SH GY                 WGLDMR   E A+DLHG+YSTDV T  ++++I  H +T
Sbjct: 141 SHHGY-----------------WGLDMRNGTEVAYDLHGQYSTDVITQHSLNVISKHNAT 183

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
             PLFLY+AHAA HS NPY PL   D  +     I+ +KR K+AA++  +DESVGK+VE 
Sbjct: 184 KGPLFLYVAHAAVHSGNPYNPLPVKDDAVRRLDTIQHYKRRKYAALVTAMDESVGKIVEQ 243

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L + RML  SIIVF +DNGG A GFN N ASN+PLRGVKNTLWEGGVRGA L+WSP L  
Sbjct: 244 LRKSRMLETSIIVFSTDNGGPAEGFNSNFASNYPLRGVKNTLWEGGVRGAALLWSPQLTK 303

Query: 359 RGIVAEQYVHVSDWLPTLL 377
           R   AEQ +H+ DWLPTL+
Sbjct: 304 RPRTAEQTMHIVDWLPTLV 322



 Score = 80.5 bits (197), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 49/142 (34%), Positives = 70/142 (49%), Gaps = 27/142 (19%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
           ++DGI +W  L +NE S R ++LHNIDD W  +AL+ G WKLVK      Y     G   
Sbjct: 338 KLDGISLWQALLKNETSPRKSVLHNIDDIWGSAALSVGDWKLVKGTN---YNGSWDGWYG 394

Query: 634 QVYLSGLSDREWLALA-------------------MRKLRDAASIQCGPV--KEVP---C 669
              +    D +W ++A                   M +LR+AA++ C     K+ P   C
Sbjct: 395 PAGMRNPHDYDWQSVAKSATGQAMQQLKMLPKTADMIRLREAANVICKDANPKDAPHIAC 454

Query: 670 EPQIAPCLFDIKNDPCEKNNLA 691
            P   PCLF+++ DPCE+ N A
Sbjct: 455 NPLSEPCLFNVREDPCEQRNQA 476



 Score = 48.9 bits (115), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 22/34 (64%), Positives = 24/34 (70%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
           MQH VLY  E  GLPL  KILPQYL +LGY + I
Sbjct: 86  MQHTVLYAAEPRGLPLDLKILPQYLNDLGYTSHI 119



 Score = 43.5 bits (101), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 35/66 (53%), Gaps = 5/66 (7%)

Query: 737 KKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPV--KEVP---CEPQIAPCLFDIKNDPC 791
           K    +  ++ K   +   M +LR+AA++ C     K+ P   C P   PCLF+++ DPC
Sbjct: 411 KSATGQAMQQLKMLPKTADMIRLREAANVICKDANPKDAPHIACNPLSEPCLFNVREDPC 470

Query: 792 EKNNLA 797
           E+ N A
Sbjct: 471 EQRNQA 476


>gi|328699373|ref|XP_001945817.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 567

 Score =  397 bits (1020), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 195/365 (53%), Positives = 253/365 (69%), Gaps = 20/365 (5%)

Query: 32  TRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           TR   +A+L  A  L +        S  P+I+FILADDLGWND+ FHG D+IPTPNIDAL
Sbjct: 4   TRSSLWALLAFATILQLA----RTQSSKPNIVFILADDLGWNDLSFHGSDEIPTPNIDAL 59

Query: 92  AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
           A++GI+L N YT  +CTPSR A+MTGK+PI  GMQ    YG E  GLPLSEK+LP+YL+E
Sbjct: 60  AFNGIVLNNLYTQPVCTPSRVALMTGKYPIKLGMQGPPTYGAEPNGLPLSEKLLPEYLRE 119

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM-----KMWGLDM 206
           LGY TR +GKWHLGFYK+ YTPT RGF+SH GY+TG+  Y+D+  +++     +  G DM
Sbjct: 120 LGYTTRAIGKWHLGFYKQAYTPTRRGFDSHFGYYTGYVSYYDYLLQDVYQNFGEFQGFDM 179

Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
           RR+   AWD+ GKY+TDVFT EAV +I     ++PLF+YLAH A H+ N  + L+AP   
Sbjct: 180 RRNDTIAWDVVGKYATDVFTDEAVRLIKEQPANQPLFMYLAHVAVHTGNRGKYLEAPQSE 239

Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA------ 320
           +N   HI D  R  +AA++ KLDESVG+VVEAL +++ML N+IIVF+SDNG  +      
Sbjct: 240 VNKFNHILDPNRRIYAAMVSKLDESVGRVVEALTEKKMLQNTIIVFMSDNGSPSFDGSGR 299

Query: 321 -----AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
                AG   N  SN+P RG+KNTLWEGGV+ A  IW+P  +    V++Q +H++DWLPT
Sbjct: 300 NFRPEAGVTANWGSNFPYRGIKNTLWEGGVKSASFIWAPYFQENPRVSKQLMHITDWLPT 359

Query: 376 LLSAA 380
           L SAA
Sbjct: 360 LYSAA 364



 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 80/189 (42%), Gaps = 46/189 (24%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN-----GTSEN 564
           +DG+D W+ L+ N PS+RN +L NID++ + + + +  WKL+  ++ NG      G +  
Sbjct: 375 LDGVDQWTSLTLNLPSQRNFVLLNIDEKQRYAGILKDNWKLIVGSTANGTLDGFFGATRP 434

Query: 565 RSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRY 624
           R+  N+           +VL    P+ R   L  + D    +    G    V+ +  MRY
Sbjct: 435 RTPYNAT----------AVLY--SPAGR--ALTKLSDSLPFATAGFGPMSSVRELLAMRY 480

Query: 625 QVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDP 684
           +  +   P                             GP  + PC P    CLFD+  DP
Sbjct: 481 ESTIRCNPS--------------------------AAGPRTQSPC-PAGEACLFDLVADP 513

Query: 685 CEKNNLADR 693
           CE NN+A +
Sbjct: 514 CETNNVAKK 522



 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 22/36 (61%), Positives = 26/36 (72%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQ    YG E  GLPLSEK+LP+YL+ELGY TR + 
Sbjct: 93  MQGPPTYGAEPNGLPLSEKLLPEYLRELGYTTRAIG 128


>gi|195124259|ref|XP_002006611.1| GI18487 [Drosophila mojavensis]
 gi|193911679|gb|EDW10546.1| GI18487 [Drosophila mojavensis]
          Length = 528

 Score =  394 bits (1012), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 192/317 (60%), Positives = 230/317 (72%), Gaps = 21/317 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHIIFILADDLG+NDVGF G  QIPTPNIDALAYSG+IL  YY   +CTPSRSA+MT K+
Sbjct: 21  PHIIFILADDLGFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTAKY 80

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTGMQH VLY  E  GLPL+ KILPQYL +LGY + I GKWHLG +K+ YTP +RGF 
Sbjct: 81  PIHTGMQHTVLYAAEPRGLPLNLKILPQYLNDLGYTSHIAGKWHLGHWKRVYTPLYRGFS 140

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-ST 238
           SH                    WGLDMR   E A+DLHG+Y+TDV T  ++++I  H ST
Sbjct: 141 SH--------------------WGLDMRNGTELAYDLHGQYTTDVITQHSLNVIAKHNST 180

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
             PLFLY+AHAA HS NPY PL A D  +     I+D+KR K+AA++  +D+SVG++VE 
Sbjct: 181 KGPLFLYVAHAAVHSGNPYNPLPAKDDIVRRLGTIQDYKRRKYAALVTAMDDSVGRIVEQ 240

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L +  ML NSII+F +DNGG A GFN N ASN+PLRGVKNTLWEGGVRGAGL+WS  L  
Sbjct: 241 LRKSHMLDNSIIIFSTDNGGPAEGFNSNFASNYPLRGVKNTLWEGGVRGAGLVWSAQLSK 300

Query: 359 RGIVAEQYVHVSDWLPT 375
           R  VA+Q +H++DWLPT
Sbjct: 301 RPRVADQTMHIADWLPT 317



 Score = 85.9 bits (211), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 52/143 (36%), Positives = 71/143 (49%), Gaps = 27/143 (18%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
           ++DG+ VW  L +NE S R ++LHNIDD W  +AL+ G WKLVK      Y     G   
Sbjct: 335 QLDGMSVWQALLKNEVSPRKSVLHNIDDIWGSAALSVGDWKLVKGTN---YNGSWDGWYG 391

Query: 634 QVYLSGLSDREWLALA-------------------MRKLRDAASIQC--GPVKEVP---C 669
                 L D +W  +A                   M + R+AA++ C    +K+ P   C
Sbjct: 392 PAGTRNLRDYDWYLVAKSLAGQALQQLKVLPKTADMMRAREAANVICKDARLKDAPNISC 451

Query: 670 EPQIAPCLFDIKNDPCEKNNLAD 692
            P  APCLF+I+ DPCE+ N AD
Sbjct: 452 NPLKAPCLFNIREDPCEQRNQAD 474



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 43/152 (28%), Positives = 58/152 (38%), Gaps = 63/152 (41%)

Query: 756 MRKLRDAASIQC--GPVKEVP---CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTT 810
           M + R+AA++ C    +K+ P   C P  APCLF+I+                       
Sbjct: 427 MMRAREAANVICKDARLKDAPNISCNPLKAPCLFNIRE---------------------- 464

Query: 811 EVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELA 870
                DP ++ NQ                                  YP VL  M KEL 
Sbjct: 465 -----DPCEQRNQ-------------------------------ADVYPKVLELMLKELQ 488

Query: 871 NINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
            IN TAV P NKP D  G P+ +++ W+ FGD
Sbjct: 489 LINSTAVPPANKPDDPLGYPRFWNYTWTNFGD 520



 Score = 48.9 bits (115), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 22/34 (64%), Positives = 25/34 (73%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
           MQH VLY  E  GLPL+ KILPQYL +LGY + I
Sbjct: 86  MQHTVLYAAEPRGLPLNLKILPQYLNDLGYTSHI 119


>gi|189236827|ref|XP_972832.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
          Length = 646

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 189/344 (54%), Positives = 244/344 (70%), Gaps = 5/344 (1%)

Query: 48  MVFVDLVAS---SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
           + F  +VAS   +  P+II I+ADD+G+NDVGFHG ++IPTPNIDALAY+G+IL ++YT 
Sbjct: 8   IFFGSVVASFAQTKKPNIIVIVADDMGFNDVGFHGSNEIPTPNIDALAYNGVILNSHYTQ 67

Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
            LCTPSRSA +TGK+PIH GMQH V+   E  GLPL+E ILPQYLK  GY T  +GKWHL
Sbjct: 68  ALCTPSRSAFLTGKYPIHLGMQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIGKWHL 127

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
           GF++KEYTPT+RGF+SH GYW G QDY+ H+       G DMRR++   W   GKYST +
Sbjct: 128 GFFRKEYTPTYRGFDSHFGYWQGLQDYYKHTVHFTPEHGYDMRRNMTVDWSAQGKYSTTL 187

Query: 225 FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAI 284
           FT EAV +I  H+T+ P+F+YLAH A HS N  +PLQAPD  +    HI D +R  +AA+
Sbjct: 188 FTDEAVRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDEEIAKFGHIADPERRIYAAM 247

Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
           +  LD+SVG V+ AL  + ML NSIIVF+SDNG    G + N  SN+PLRG KN+ WEG 
Sbjct: 248 VSMLDKSVGSVIAALRDKHMLENSIIVFMSDNGAKPDGIHANHGSNYPLRGNKNSAWEGA 307

Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDIP 386
           +R    IWSPL++    V+   +H+SDWLPT  +AA  NK+++P
Sbjct: 308 MRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNKTELP 351



 Score = 67.4 bits (163), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 58/217 (26%), Positives = 91/217 (41%), Gaps = 40/217 (18%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG--G 631
           ++DG+D+W+ +S  + S R  +LHNID+ +   AL  G WK +             G  G
Sbjct: 352 KMDGVDMWASISEGKDSPRTELLHNIDEIYNYGALRVGNWKYLYGSTTNGKSDGWYGSSG 411

Query: 632 PDQVY---------------LSGLSDREWL----------------ALAMRKLRDAASIQ 660
            D +Y               L+GL+  + +                +  ++ LR AA ++
Sbjct: 412 RDPLYTYDDSAVLASQTGSTLAGLTTYQQIKEKHQGDTNFTHKLLDSETIKTLRGAAEVK 471

Query: 661 CGPV--KEVP----CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAY 714
           C  V  +E+P    C    +PCLF+IK DPCE+ NLA       +N     + RF Q A 
Sbjct: 472 CPRVNFEEIPESKKCNAVESPCLFNIKEDPCEQINLAAERPMIVLN-MEMALARFKQTAL 530

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
           P +    +      K        +  +  KK+K Y N
Sbjct: 531 PIRNVPRDPNADPAKWNNTWVNWQDYEDVKKQKIYFN 567



 Score = 46.6 bits (109), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 36/61 (59%), Gaps = 6/61 (9%)

Query: 752 EEEGMRKLRDAASIQCGPV--KEVP----CEPQIAPCLFDIKNDPCEKNNLADRSEVQRI 805
           + E ++ LR AA ++C  V  +E+P    C    +PCLF+IK DPCE+ NLA    +  +
Sbjct: 457 DSETIKTLRGAAEVKCPRVNFEEIPESKKCNAVESPCLFNIKEDPCEQINLAAERPMIVL 516

Query: 806 N 806
           N
Sbjct: 517 N 517



 Score = 40.4 bits (93), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 23/36 (63%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH V+   E  GLPL+E ILPQYLK  GY T  + 
Sbjct: 88  MQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIG 123


>gi|156408341|ref|XP_001641815.1| predicted protein [Nematostella vectensis]
 gi|156228955|gb|EDO49752.1| predicted protein [Nematostella vectensis]
          Length = 512

 Score =  391 bits (1004), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 187/335 (55%), Positives = 240/335 (71%), Gaps = 1/335 (0%)

Query: 46  LSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ 105
           +S+ FV L A+  P HI+FI+ADDLGW+DV FHG  QIPTPNID LA +G+IL NYY   
Sbjct: 11  MSLPFVALTANKKP-HIVFIVADDLGWDDVSFHGSGQIPTPNIDGLAKTGVILNNYYVSP 69

Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
           +CTP+RSAIMTGK+PIHTGMQH+V+   +  GL L+E ++PQYLK LGY T  VGKWHLG
Sbjct: 70  ICTPTRSAIMTGKYPIHTGMQHSVILAAQPYGLGLNETLMPQYLKRLGYATHGVGKWHLG 129

Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
           F+K EYTP  RGF+S+ GYW G  DY+DHS  E   WGLD+    +  W   G YS+D+F
Sbjct: 130 FFKYEYTPIQRGFDSYFGYWCGKGDYWDHSNNEKYGWGLDLHDSEQDVWTEWGHYSSDLF 189

Query: 226 TAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
             +AV++I  H+   PLFLYL   A HSAN  +PLQAP   ++  ++I+D +R  FAA++
Sbjct: 190 AEKAVNVISTHNASVPLFLYLPFQAVHSANFIQPLQAPPDLIDKFKNIKDERRRIFAAMV 249

Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
             +D ++ KVV++L+ R M +NSIIVF +DNGG A GF+ N ASN+PLRGVK TLWEGG+
Sbjct: 250 SSMDGAIKKVVDSLKARSMYNNSIIVFTTDNGGPANGFDSNMASNFPLRGVKRTLWEGGI 309

Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           RG   I SPL+   G V  + +HVSDWLPTL + A
Sbjct: 310 RGTAFIHSPLITKPGRVMTELMHVSDWLPTLYTVA 344



 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/121 (36%), Positives = 62/121 (51%), Gaps = 14/121 (11%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
           +DG D+W  +S +  S R  ++HNID     +A    +WK+V  V   +Y       P+ 
Sbjct: 354 LDGFDLWDSISTDAMSPREEMVHNIDPVNWEAAYRFREWKIV--VNQTKYMSGWYPLPN- 410

Query: 635 VYLSGLSDREWLALAMRKLRDAASIQCGPVKEVP--CEPQIAPCLFDIKNDPCEKNNLAD 692
                + +RE        LRDA  ++CGP  E+P  C     PCLF+IKNDPCE  NLA 
Sbjct: 411 -----IEERE---PHPATLRDAV-VKCGPPPEIPVNCTASDGPCLFNIKNDPCEYVNLAK 461

Query: 693 R 693
           +
Sbjct: 462 K 462



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/50 (52%), Positives = 34/50 (68%), Gaps = 4/50 (8%)

Query: 759 LRDAASIQCGPVKEVP--CEPQIAPCLFDIKNDPCEKNNLADRSEVQRIN 806
           LRDA  ++CGP  E+P  C     PCLF+IKNDPCE  NLA + E++ +N
Sbjct: 421 LRDAV-VKCGPPPEIPVNCTASDGPCLFNIKNDPCEYVNLA-KKELEILN 468



 Score = 40.0 bits (92), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 23/32 (71%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
           MQH+V+   +  GL L+E ++PQYLK LGY T
Sbjct: 89  MQHSVILAAQPYGLGLNETLMPQYLKRLGYAT 120


>gi|270006267|gb|EFA02715.1| hypothetical protein TcasGA2_TC008439 [Tribolium castaneum]
          Length = 648

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 190/346 (54%), Positives = 245/346 (70%), Gaps = 7/346 (2%)

Query: 48  MVFVDLVAS---SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
           + F  +VAS   +  P+II I+ADD+G+NDVGFHG ++IPTPNIDALAY+G+IL ++YT 
Sbjct: 8   IFFGSVVASFAQTKKPNIIVIVADDMGFNDVGFHGSNEIPTPNIDALAYNGVILNSHYTQ 67

Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
            LCTPSRSA +TGK+PIH GMQH V+   E  GLPL+E ILPQYLK  GY T  +GKWHL
Sbjct: 68  ALCTPSRSAFLTGKYPIHLGMQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIGKWHL 127

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHS--AEEMKMWGLDMRRDLEPAWDLHGKYST 222
           GF++KEYTPT+RGF+SH GYW G QDY+ H+  A      G DMRR++   W   GKYST
Sbjct: 128 GFFRKEYTPTYRGFDSHFGYWQGLQDYYKHTVHATFTPEHGYDMRRNMTVDWSAQGKYST 187

Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
            +FT EAV +I  H+T+ P+F+YLAH A HS N  +PLQAPD  +    HI D +R  +A
Sbjct: 188 TLFTDEAVRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDEEIAKFGHIADPERRIYA 247

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
           A++  LD+SVG V+ AL  + ML NSIIVF+SDNG    G + N  SN+PLRG KN+ WE
Sbjct: 248 AMVSMLDKSVGSVIAALRDKHMLENSIIVFMSDNGAKPDGIHANHGSNYPLRGNKNSAWE 307

Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDIP 386
           G +R    IWSPL++    V+   +H+SDWLPT  +AA  NK+++P
Sbjct: 308 GAMRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNKTELP 353



 Score = 67.4 bits (163), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 58/217 (26%), Positives = 91/217 (41%), Gaps = 40/217 (18%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG--G 631
           ++DG+D+W+ +S  + S R  +LHNID+ +   AL  G WK +             G  G
Sbjct: 354 KMDGVDMWASISEGKDSPRTELLHNIDEIYNYGALRVGNWKYLYGSTTNGKSDGWYGSSG 413

Query: 632 PDQVY---------------LSGLSDREWL----------------ALAMRKLRDAASIQ 660
            D +Y               L+GL+  + +                +  ++ LR AA ++
Sbjct: 414 RDPLYTYDDSAVLASQTGSTLAGLTTYQQIKEKHQGDTNFTHKLLDSETIKTLRGAAEVK 473

Query: 661 CGPV--KEVP----CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAY 714
           C  V  +E+P    C    +PCLF+IK DPCE+ NLA       +N     + RF Q A 
Sbjct: 474 CPRVNFEEIPESKKCNAVESPCLFNIKEDPCEQINLAAERPMIVLN-MEMALARFKQTAL 532

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
           P +    +      K        +  +  KK+K Y N
Sbjct: 533 PIRNVPRDPNADPAKWNNTWVNWQDYEDVKKQKIYFN 569



 Score = 47.0 bits (110), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 36/61 (59%), Gaps = 6/61 (9%)

Query: 752 EEEGMRKLRDAASIQCGPV--KEVP----CEPQIAPCLFDIKNDPCEKNNLADRSEVQRI 805
           + E ++ LR AA ++C  V  +E+P    C    +PCLF+IK DPCE+ NLA    +  +
Sbjct: 459 DSETIKTLRGAAEVKCPRVNFEEIPESKKCNAVESPCLFNIKEDPCEQINLAAERPMIVL 518

Query: 806 N 806
           N
Sbjct: 519 N 519



 Score = 40.8 bits (94), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 23/36 (63%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH V+   E  GLPL+E ILPQYLK  GY T  + 
Sbjct: 88  MQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIG 123


>gi|242025544|ref|XP_002433184.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212518725|gb|EEB20446.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 610

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 181/328 (55%), Positives = 239/328 (72%), Gaps = 8/328 (2%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+IIFILADDLGWND+ FHG DQI TPN+DALAY+G+IL + Y + +CTPSR+A+MTG +
Sbjct: 41  PNIIFILADDLGWNDLSFHGSDQIQTPNLDALAYNGVILNSQYVLPVCTPSRAALMTGMY 100

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIH GMQ   L   E   LP + K+LP YLK+LGY TR+VGKWHLG+Y+KE+TPT+RGF+
Sbjct: 101 PIHNGMQGLPLEASEPRALP-AGKLLPSYLKDLGYTTRMVGKWHLGYYQKEFTPTYRGFD 159

Query: 180 SHLGYWTGHQDYFDHSAEE-------MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           SHLGYW G   Y+D+  +E         + G DMRR++ PA+DL G+Y+T++FT EAV +
Sbjct: 160 SHLGYWNGIVSYYDYILQEDDNRKPRSSLNGFDMRRNITPAYDLQGRYATEMFTDEAVHL 219

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I +H+ + PLFLY++H A H+ NP + L+AP   +N   HI D  R  FA ++ KLDESV
Sbjct: 220 IRSHNKNTPLFLYMSHLAVHAGNPGKFLEAPQEAINKFLHIADPNRRTFAGMVQKLDESV 279

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G+++ ALE+  ML NSIIVF +DNG    G   N  SN+PLRG+K TLWEGG RG   IW
Sbjct: 280 GRLIAALEESNMLHNSIIVFSADNGAPTIGGYPNWGSNYPLRGIKETLWEGGTRGVSFIW 339

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           SPL+++   V+ + +H+SDWLPTL +AA
Sbjct: 340 SPLIKNTPRVSNELIHISDWLPTLYAAA 367



 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 66/211 (31%), Positives = 86/211 (40%), Gaps = 55/211 (26%)

Query: 515 VWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNE 574
           +WS L +N P   N ++H  D            W      +  GN   EN  N       
Sbjct: 338 IWSPLIKNTPRVSNELIHISD------------WLPTLYAAAGGN--MENVKN------- 376

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL----TKGKWKL-VKVVKVMRYQVDLT 629
           +DG++ W  +S N PS+RN IL NID+  Q +A+     K  WKL V  V    Y     
Sbjct: 377 LDGVNQWPSISENLPSERNEILLNIDEILQTAAIRMNSRKYHWKLVVGTVNNGTYDEYFG 436

Query: 630 GGPD-----------------QVY--LSGLSDREWLALA-----MRKLRDAASIQCGPVK 665
            G +                 QVY  L  L+ +  L        M+ LR  A+I+C    
Sbjct: 437 EGTNKFPENNPPYDFNAVANSQVYRTLERLAQKRLLTYPTSKEKMKVLRRDATIRCDYKI 496

Query: 666 EVP--CEPQIAP---CLFDIKNDPCEKNNLA 691
             P  C P   P   CL+DIK DPCE NNL 
Sbjct: 497 INPNRCNPSDGPNQVCLYDIKRDPCELNNLT 527



 Score = 45.4 bits (106), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 35/67 (52%), Gaps = 5/67 (7%)

Query: 736 KKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP--CEPQIAP---CLFDIKNDP 790
           +  ++  +K+   Y   +E M+ LR  A+I+C      P  C P   P   CL+DIK DP
Sbjct: 461 RTLERLAQKRLLTYPTSKEKMKVLRRDATIRCDYKIINPNRCNPSDGPNQVCLYDIKRDP 520

Query: 791 CEKNNLA 797
           CE NNL 
Sbjct: 521 CELNNLT 527


>gi|260795396|ref|XP_002592691.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
 gi|229277914|gb|EEN48702.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
          Length = 485

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 188/337 (55%), Positives = 234/337 (69%), Gaps = 8/337 (2%)

Query: 71  GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
           GWNDV FHG DQIPTPN+D+LAYSG+IL NYY   +CTP+RSAIMTG+HPIHTG+QH V+
Sbjct: 3   GWNDVSFHGSDQIPTPNLDSLAYSGVILGNYYVSPICTPTRSAIMTGRHPIHTGLQHGVI 62

Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
            G    GLPL+E ILPQYLK LGY T IVGKWHLG +  E+TPTFRGF+S+ GY TG  +
Sbjct: 63  SGATPFGLPLNETILPQYLKPLGYATHIVGKWHLGHHAWEFTPTFRGFDSYFGYLTGKDN 122

Query: 191 YFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFL 244
           Y+DH      S EE+   GLD+R   EP W  +G YST++F  EA  II +H T +PLFL
Sbjct: 123 YYDHTDDESNSPEELGYKGLDLRNGTEPVWTENGTYSTELFATEAERIITSHDTSKPLFL 182

Query: 245 YLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRM 304
           YL H A HS NP  PLQAP  Y++   HI+   R  FAA++  LD++VG V +AL  R M
Sbjct: 183 YLPHQAVHSGNPDNPLQAPQKYIDKFPHIQHPGRRTFAAMVSALDDAVGNVTKALSARGM 242

Query: 305 LSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAE 364
           L NS+I+F +DNGG AAGF+ N ASNWPLRGVKNTLWEGGV G G + SPL++       
Sbjct: 243 LENSVIIFTTDNGGPAAGFDQNYASNWPLRGVKNTLWEGGVHGTGFVHSPLIKQPKRTTH 302

Query: 365 QYVHVSDWLPTL--LSAANKSDIPNYVNSTVENIIPR 399
           + +HV D LPT+  L+  + +++ N   + V   I R
Sbjct: 303 ELLHVCDLLPTIYELAGGDSTELKNLDGTNVWETISR 339



 Score = 47.4 bits (111), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 25/36 (69%)

Query: 1  MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
          +QH V+ G    GLPL+E ILPQYLK LGY T I+ 
Sbjct: 57 LQHGVISGATPFGLPLNETILPQYLKPLGYATHIVG 92



 Score = 40.8 bits (94), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 26/94 (27%), Positives = 46/94 (48%), Gaps = 10/94 (10%)

Query: 498 ENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSIN 557
           E    D++    +DG +VW  +SR   S R  +LHNID + + +AL  G +K++   +  
Sbjct: 316 ELAGGDSTELKNLDGTNVWETISRGVQSPRVEVLHNIDPKRKTAALRYGDYKIILGEAYK 375

Query: 558 G--------NGTSENRSNDNSYQN--EIDGIDVW 581
           G         G + N S + ++++  E + I  W
Sbjct: 376 GAWDGWYPPEGVTTNHSEEANHEDPCEFNNIADW 409



 Score = 40.0 bits (92), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 31/54 (57%)

Query: 563 ENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
           E    D++    +DG +VW  +SR   S R  +LHNID + + +AL  G +K++
Sbjct: 316 ELAGGDSTELKNLDGTNVWETISRGVQSPRVEVLHNIDPKRKTAALRYGDYKII 369


>gi|156402612|ref|XP_001639684.1| predicted protein [Nematostella vectensis]
 gi|156226814|gb|EDO47621.1| predicted protein [Nematostella vectensis]
          Length = 380

 Score =  380 bits (976), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 177/331 (53%), Positives = 238/331 (71%), Gaps = 3/331 (0%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHII I+ADDLGW+DV FHG  QIPTPN+D LA  G+IL NYY   +CTP+R+++MTGK+
Sbjct: 5   PHIIVIVADDLGWDDVSFHGSPQIPTPNLDYLATRGVILNNYYVSPICTPTRASLMTGKY 64

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIH GMQH V+Y  +  GLPL E  LPQYL+  GY+T  +GKWHLGF+ KEYTPT+RGF+
Sbjct: 65  PIHLGMQHFVIYAAQPYGLPLGEITLPQYLQIQGYKTAGIGKWHLGFFAKEYTPTYRGFD 124

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           S  G W+   DY++H++ E   WG DMR ++EP      KY+T+VFT EA+ +I NH+  
Sbjct: 125 SFYGMWSAKADYWNHTSFENGFWGTDMRNNMEPVTTDKDKYATEVFTREALKVIENHNKS 184

Query: 240 EPLFLYLAHAATHSANPYEPLQAP-DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
           EPLFLY+AH A HSANP++PLQAP D        I+  +R ++AA++  +D+S+G+V  A
Sbjct: 185 EPLFLYIAHQAPHSANPHDPLQAPEDKVKKFSGVIDKIERQQYAAMVTCVDDSIGEVFRA 244

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           LE+ RML+NS+I+F +DNGGA  GFN N  SN+PLRG K+ +WEGGVRG   I+S L++ 
Sbjct: 245 LEKNRMLNNSVILFTTDNGGAPHGFNRNQGSNYPLRGGKDMMWEGGVRGTAFIYSDLIKH 304

Query: 359 RGIVAEQYVHVSDWLPTLLSAANKSDIPNYV 389
           +G V+   + V+DW+PTL   A  +  P Y+
Sbjct: 305 KGRVSTDLIDVTDWVPTLYYLAGGT--PGYL 333



 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 22/32 (68%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
           MQH V+Y  +  GLPL E  LPQYL+  GY+T
Sbjct: 70  MQHFVIYAAQPYGLPLGEITLPQYLQIQGYKT 101


>gi|118779434|ref|XP_309303.3| AGAP011348-PA [Anopheles gambiae str. PEST]
 gi|116131546|gb|EAA05277.3| AGAP011348-PA [Anopheles gambiae str. PEST]
          Length = 573

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 197/410 (48%), Positives = 269/410 (65%), Gaps = 15/410 (3%)

Query: 41  PLAFTLSMVFVDLVASSG---PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
           P   TL ++++   AS+     P+I+ I+ADDLGWNDV FHG +QIPTPNIDALAY GII
Sbjct: 6   PYLVTLWLLWLGGGASTAHQERPNIVIIVADDLGWNDVSFHGSNQIPTPNIDALAYDGII 65

Query: 98  LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
           L  +Y   LCTPSR+++MTGKHP++ GMQ +V+   E  GL L +K++PQY +E GYRT 
Sbjct: 66  LNRHYVPPLCTPSRASLMTGKHPMNIGMQDHVIISDEPWGLGLDQKLMPQYFREAGYRTH 125

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDLEPAWD 215
           +VGKWHLGF+++ YTPT+RGF+SH GY   + DY+DHS +  E    GLDMRR+    +D
Sbjct: 126 LVGKWHLGFFRRAYTPTYRGFDSHFGYLGPYIDYWDHSLQMNETSARGLDMRRNTAVNYD 185

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
            +G Y+TD+F  EAV +I +H+  +PLFL L H A H+ N  +PLQAP   +    +I+D
Sbjct: 186 ANGTYATDLFNDEAVRLIDSHNRSKPLFLVLTHLAPHTGNEDDPLQAPADEIAKFDYIQD 245

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
            KR   AA++ ++D  VG++  +LE+RRML+N+II+F +DNG    G + N+ SN+PLRG
Sbjct: 246 PKRRTLAAMVSRIDTGVGRIYRSLERRRMLNNTIILFYADNGAPTLGIHANSGSNYPLRG 305

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVEN 395
            K + WEG VRGA LIWS LL  +GIV+ Q++HVSDWLPTL  AA     PN  NS ++ 
Sbjct: 306 QKESPWEGAVRGAALIWSRLLPRKGIVSNQWLHVSDWLPTLGHAAGIRISPN--NSPIDG 363

Query: 396 IIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGT 445
                +N   ++   T   NS R    N  +   T+    K   +Y NGT
Sbjct: 364 -----QN---QWSTLTSSSNSGRTVVMNNAHNEFTYSSYIKRGWKYVNGT 405



 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 83/172 (48%), Gaps = 23/172 (13%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM----RYQVDLTG 630
           IDG + WS L+ +  S R  +++N  +E+  S+  K  WK V          R+   L  
Sbjct: 361 IDGQNQWSTLTSSSNSGRTVVMNNAHNEFTYSSYIKRGWKYVNGTSFKGAYDRWLGQLKD 420

Query: 631 GPDQV----YLSGLSDREWLALAM-------RKLRDAASIQCGP------VKEVPCEPQI 673
           G +Q+    Y   L   + +  +M       ++LR  A ++C        V+ +PCEP +
Sbjct: 421 G-EQISHEEYYKRLIASQTIGSSMQLTRDDVKRLRSKAMVECSSAQGTVYVQAIPCEPLV 479

Query: 674 APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKK 725
            PCLF++  DPCE+ NLAD+   + +    ++V RF + A P + +  + + 
Sbjct: 480 KPCLFNVVEDPCERVNLADQYP-ELLKDLQSDVERFKRDALPPRNQPSDSRS 530



 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 34/52 (65%), Gaps = 6/52 (11%)

Query: 754 EGMRKLRDAASIQCGP------VKEVPCEPQIAPCLFDIKNDPCEKNNLADR 799
           + +++LR  A ++C        V+ +PCEP + PCLF++  DPCE+ NLAD+
Sbjct: 448 DDVKRLRSKAMVECSSAQGTVYVQAIPCEPLVKPCLFNVVEDPCERVNLADQ 499


>gi|390364995|ref|XP_798154.3| PREDICTED: arylsulfatase I-like isoform 2 [Strongylocentrotus
           purpuratus]
          Length = 476

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 183/366 (50%), Positives = 251/366 (68%), Gaps = 5/366 (1%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L F L    ++ V ++ PP+IIF++ADDLGWNDV FHG  QIPTP+IDALA  G+IL NY
Sbjct: 11  LTFFLLGCMLECVYTAQPPNIIFVVADDLGWNDVSFHGSSQIPTPHIDALAQEGVILTNY 70

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y   +CTP+RSAIMTGKHPIHTG+Q++V+   E  GL  +E I+PQYL+ LGYRT +VGK
Sbjct: 71  YVSPICTPTRSAIMTGKHPIHTGLQYSVIIADEPYGLGTNETIMPQYLRSLGYRTHMVGK 130

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           WHLGF+K+  TP+ RGFES+ GY+ G QDYF H + E  + G D   +      + G+YS
Sbjct: 131 WHLGFFKESLTPSHRGFESYYGYYGGMQDYFTHESTEHTLTGFDFHVNGSIYKPVFGQYS 190

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY--EPLQAPDHYLNIHRHIEDFKRS 279
           T+++T +  +II NH+  EPL++YLAH A HSAN Y  + LQAP  Y     +I +  R 
Sbjct: 191 TEIYTEKTQEIIRNHNPQEPLYIYLAHQAVHSAN-YNGQRLQAPYKYYERFPNITNENRR 249

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
           KFAA++  LD+S+G + + L++  + +N++IVF +DNGG A GF+ N A+NWPLRGVK+T
Sbjct: 250 KFAAMVSALDDSLGNITQTLKESSLYNNTVIVFTTDNGGPAHGFDANYANNWPLRGVKDT 309

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL--LSAANKSDIPNYVNSTVENII 397
            WEGG+RGAG +W  L+E  G  ++  +HV DW+PTL  L+  N S + +     V  ++
Sbjct: 310 TWEGGLRGAGFLWGALIEKPGRTSDGMMHVCDWVPTLYGLAGGNTSTLQHLDGIDVWPML 369

Query: 398 PRYENS 403
            R E S
Sbjct: 370 SRAEPS 375



 Score = 75.9 bits (185), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 49/135 (36%), Positives = 68/135 (50%), Gaps = 26/135 (19%)

Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVD 627
           + S    +DGIDVW +LSR EPS R  ILHNID    +SA+  G +KLV+          
Sbjct: 353 NTSTLQHLDGIDVWPMLSRAEPSPREEILHNIDPVRNVSAIRIGDYKLVQ---------- 402

Query: 628 LTGGPDQVYLSGLSD------REWLALAMRKLRDAASIQCGPVK----EVPCEPQIAPCL 677
                 Q Y    SD         + +  + + +A  + C P K       C+P+  PCL
Sbjct: 403 -----GQNYNGSWSDWYPPEGESSVDVDSKPVPNAFVVSC-PSKPANASTNCDPKEKPCL 456

Query: 678 FDIKNDPCEKNNLAD 692
           F+I++DPCE NN+AD
Sbjct: 457 FNIRHDPCEFNNIAD 471



 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 28/57 (49%), Positives = 37/57 (64%)

Query: 503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN 559
           + S    +DGIDVW +LSR EPS R  ILHNID    +SA+  G +KLV+  + NG+
Sbjct: 353 NTSTLQHLDGIDVWPMLSRAEPSPREEILHNIDPVRNVSAIRIGDYKLVQGQNYNGS 409



 Score = 42.7 bits (99), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 15/24 (62%), Positives = 21/24 (87%)

Query: 775 CEPQIAPCLFDIKNDPCEKNNLAD 798
           C+P+  PCLF+I++DPCE NN+AD
Sbjct: 448 CDPKEKPCLFNIRHDPCEFNNIAD 471


>gi|156406805|ref|XP_001641235.1| predicted protein [Nematostella vectensis]
 gi|156228373|gb|EDO49172.1| predicted protein [Nematostella vectensis]
          Length = 498

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 172/324 (53%), Positives = 239/324 (73%), Gaps = 1/324 (0%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHII ++ADDLGW+D+ FHG  QIPTPN+DALA SG+IL NYY   + TPSR++ MTGK+
Sbjct: 1   PHIIMVVADDLGWDDISFHGSPQIPTPNLDALANSGVILNNYYVSPMDTPSRASFMTGKY 60

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIH G+QH+ L+  +  G+PL+EK LP++L+E+GY+T  VGKW LGF+ KEYTPT+RGF+
Sbjct: 61  PIHMGVQHDTLHNRQPFGVPLTEKFLPEFLREMGYQTHAVGKWQLGFFAKEYTPTYRGFD 120

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           S  G+WT H+DY++H A +   +G+D+RR+L+ + D  G Y T++   EA ++I NHS D
Sbjct: 121 SFFGFWTSHEDYYNHVAND-GGYGIDLRRNLDVSNDHTGVYGTELLAREADEVIENHSGD 179

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           +PLFLYLAH A H  N  EPLQAP  +++  ++I D +R  FA ++  LDES+ ++V +L
Sbjct: 180 KPLFLYLAHQAVHVGNMDEPLQAPKRHVDKFKYITDERRRTFAGMVSSLDESMRQLVTSL 239

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           +++ +  NSII+F +DNGGAA G +++A SN+PLRG KNTLWEGGVRG   + SPL++  
Sbjct: 240 KRKGLYQNSIIIFTTDNGGAAGGLDMSAGSNFPLRGNKNTLWEGGVRGVAFVHSPLIKRP 299

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
           G V +  +H SDWLPTL   A  S
Sbjct: 300 GRVYDGLMHASDWLPTLHLLAGGS 323



 Score = 59.3 bits (142), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 40/127 (31%), Positives = 63/127 (49%), Gaps = 17/127 (13%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDD-EWQISALTKGKWKLVKVVKVMRYQVDLTGGP 632
           +IDG D+W  +S+  PS R  +LH ID  +   +AL  G +K++ + + +++  D    P
Sbjct: 329 QIDGYDLWDGISKGTPSPRYEVLHGIDTMQGNRAALRVGDYKVI-LNQDLKFYSDWYKRP 387

Query: 633 DQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVP--------CEPQIAPCLFDIKNDP 684
           +        + E   +   +L   A++ C      P        C PQ  PCLF+IK DP
Sbjct: 388 E-------GEHELDYIRKPRLLSNATVDCTIKSPHPLLYTHAPLCNPQKKPCLFNIKWDP 440

Query: 685 CEKNNLA 691
           CE +NLA
Sbjct: 441 CEYHNLA 447



 Score = 40.0 bits (92), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 15/36 (41%), Positives = 27/36 (75%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH+ L+  +  G+PL+EK LP++L+E+GY+T  + 
Sbjct: 66  VQHDTLHNRQPFGVPLTEKFLPEFLREMGYQTHAVG 101


>gi|390364993|ref|XP_003730725.1| PREDICTED: arylsulfatase I-like isoform 1 [Strongylocentrotus
           purpuratus]
          Length = 479

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 184/369 (49%), Positives = 250/369 (67%), Gaps = 8/369 (2%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L F L    ++ V ++ PP+IIF++ADDLGWNDV FHG  QIPTP+IDALA  G+IL NY
Sbjct: 11  LTFFLLGCMLECVYTAQPPNIIFVVADDLGWNDVSFHGSSQIPTPHIDALAQEGVILTNY 70

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y   +CTP+RSAIMTGKHPIHTG+Q++V+   E  GL  +E I+PQYL+ LGYRT +VGK
Sbjct: 71  YVSPICTPTRSAIMTGKHPIHTGLQYSVIIADEPYGLGTNETIMPQYLRSLGYRTHMVGK 130

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE---EMKMWGLDMRRDLEPAWDLHG 218
           WHLGFY KE+TP  RGFES  GY+ G QDYF H  +   +  + G D   +      + G
Sbjct: 131 WHLGFYSKEHTPIERGFESTFGYYLGQQDYFTHETQVKRKHTLTGFDFHVNGSIYKPVFG 190

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY--EPLQAPDHYLNIHRHIEDF 276
           +YST+++T +  +II NH+  EPL++YLAH A HSAN Y  + LQAP  Y     +I + 
Sbjct: 191 QYSTEIYTEKTQEIIRNHNPQEPLYIYLAHQAVHSAN-YNGQRLQAPYKYYERFPNITNE 249

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
            R KFAA++  LD+S+G + + L++  + +N++IVF +DNGG A GF+ N A+NWPLRGV
Sbjct: 250 NRRKFAAMVSALDDSLGNITQTLKESSLYNNTVIVFTTDNGGPAHGFDANYANNWPLRGV 309

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL--LSAANKSDIPNYVNSTVE 394
           K+T WEGG+RGAG +W  L+E  G  ++  +HV DW+PTL  L+  N S + +     V 
Sbjct: 310 KDTTWEGGLRGAGFLWGALIEKPGRTSDGMMHVCDWVPTLYGLAGGNTSTLQHLDGIDVW 369

Query: 395 NIIPRYENS 403
            ++ R E S
Sbjct: 370 PMLSRAEPS 378



 Score = 75.9 bits (185), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 49/135 (36%), Positives = 68/135 (50%), Gaps = 26/135 (19%)

Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVD 627
           + S    +DGIDVW +LSR EPS R  ILHNID    +SA+  G +KLV+          
Sbjct: 356 NTSTLQHLDGIDVWPMLSRAEPSPREEILHNIDPVRNVSAIRIGDYKLVQ---------- 405

Query: 628 LTGGPDQVYLSGLSD------REWLALAMRKLRDAASIQCGPVK----EVPCEPQIAPCL 677
                 Q Y    SD         + +  + + +A  + C P K       C+P+  PCL
Sbjct: 406 -----GQNYNGSWSDWYPPEGESSVDVDSKPVPNAFVVSC-PSKPANASTNCDPKEKPCL 459

Query: 678 FDIKNDPCEKNNLAD 692
           F+I++DPCE NN+AD
Sbjct: 460 FNIRHDPCEFNNIAD 474



 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 28/57 (49%), Positives = 37/57 (64%)

Query: 503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN 559
           + S    +DGIDVW +LSR EPS R  ILHNID    +SA+  G +KLV+  + NG+
Sbjct: 356 NTSTLQHLDGIDVWPMLSRAEPSPREEILHNIDPVRNVSAIRIGDYKLVQGQNYNGS 412



 Score = 42.7 bits (99), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 15/24 (62%), Positives = 21/24 (87%)

Query: 775 CEPQIAPCLFDIKNDPCEKNNLAD 798
           C+P+  PCLF+I++DPCE NN+AD
Sbjct: 451 CDPKEKPCLFNIRHDPCEFNNIAD 474


>gi|157108842|ref|XP_001650409.1| arylsulfatase b [Aedes aegypti]
 gi|108879187|gb|EAT43412.1| AAEL005134-PA [Aedes aegypti]
          Length = 675

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 175/330 (53%), Positives = 231/330 (70%), Gaps = 4/330 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+ I+ADD+GWNDVGFHG +QIPTPNIDALAY GIIL  +YT  +CTPSR+++MTGK+
Sbjct: 32  PHIVIIMADDMGWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTGKN 91

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PI+ GMQH V+   E  GL L +KI+P+Y KE GYRT +VGKWHLGF  K+YTPT RGF+
Sbjct: 92  PINIGMQHYVIVSDEPWGLGLDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRGFD 151

Query: 180 SHLGYWTGHQDYFDHS---AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           +H+GY   + DY+D++   +      G DMR +L   +D +G Y+TD FT  A  II  H
Sbjct: 152 THVGYLGPYVDYWDYTLKFSPPKSFQGYDMRNNLNVDYDSNGTYATDHFTKAASSIIERH 211

Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
            T +PLFL + H A H+AN  +PLQAP+  +    +I D +R  +AA++ KLD+SVG++ 
Sbjct: 212 DTKDPLFLVVNHLAPHAANDDDPLQAPEEDIRKFDYISDERRRIYAAMVSKLDDSVGQIF 271

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
            +L  + ML NSII+F+SDNG   A  + N  SN+PLRG+K+  WE   R    IWSPLL
Sbjct: 272 NSLRSKNMLDNSIILFMSDNGAPTAALHANTGSNYPLRGIKSVPWEAATRCVAAIWSPLL 331

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           + R  V+ Q++H+SDWLPTL SAA   DIP
Sbjct: 332 QERQRVSNQFIHISDWLPTLASAAGI-DIP 360



 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 61/141 (43%), Gaps = 25/141 (17%)

Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
            +EIDG D W  LS +  + R  +L+ ID+ +  S+  +  +K V             G 
Sbjct: 365 HSEIDGQDQWEALSYDTGNPRRVVLNMIDEIYGYSSYMENGFKFVNGTYSNGSYDGWYGQ 424

Query: 632 P---DQVYLSGLSDREWLALAMRK-----------------LRDAASIQCGPVKEV-PCE 670
           P   DQ     LSD +++ L ++                  LR  A + C    E   C 
Sbjct: 425 PNTSDQT----LSDDQYIDLVLQTEITRWAGETISRDTIKYLRKHARVNCNHQPEANKCN 480

Query: 671 PQIAPCLFDIKNDPCEKNNLA 691
           P   PCLFDI NDPCE N+L+
Sbjct: 481 PLKRPCLFDIINDPCELNDLS 501



 Score = 42.7 bits (99), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 754 EGMRKLRDAASIQCGPVKEV-PCEPQIAPCLFDIKNDPCEKNNLA 797
           + ++ LR  A + C    E   C P   PCLFDI NDPCE N+L+
Sbjct: 457 DTIKYLRKHARVNCNHQPEANKCNPLKRPCLFDIINDPCELNDLS 501



 Score = 41.6 bits (96), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH V+   E  GL L +KI+P+Y KE GYRT ++ 
Sbjct: 97  MQHYVIVSDEPWGLGLDQKIMPEYFKEAGYRTHLVG 132


>gi|170040779|ref|XP_001848165.1| arylsulfatase B [Culex quinquefasciatus]
 gi|167864376|gb|EDS27759.1| arylsulfatase B [Culex quinquefasciatus]
          Length = 585

 Score =  370 bits (951), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 182/352 (51%), Positives = 246/352 (69%), Gaps = 13/352 (3%)

Query: 42  LAFTLSMVFVDLV---------ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALA 92
           L   L ++FV ++          SS  P+I+ I+ADDLGWNDV FHG  QIPTPNIDALA
Sbjct: 4   LPVLLGLLFVAVLNVFGEKSTDRSSNRPNIVVIIADDLGWNDVSFHGSLQIPTPNIDALA 63

Query: 93  YSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKEL 152
           YSGIIL  +Y   LCTPSR+++MTGKHPI+ GMQH+V+   E  GL L +K++P+Y +E 
Sbjct: 64  YSGIILNRHYAPPLCTPSRASLMTGKHPINIGMQHHVIEVDEPWGLGLDQKLMPEYFREA 123

Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDL 210
           GYRTR+VGKWHLGF++K YTPT RGF+SH GY   + DY+DHS +       GLDMRR+L
Sbjct: 124 GYRTRLVGKWHLGFFRKAYTPTMRGFDSHYGYIGPYIDYWDHSLQMSNTSTRGLDMRRNL 183

Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNH--STDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
           +  +   G Y+TD+FT EAV +IH+H  ++  PLFL + H A H+ N  +P+QAP+  + 
Sbjct: 184 QVDYSARGTYATDLFTREAVRLIHDHNQTSANPLFLVVTHLAPHTGNEDDPMQAPEEDVE 243

Query: 269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
           +   I+D KR   AA++ KLD+ VG++V+AL++  ML+N++++F +DNG    G + N  
Sbjct: 244 LFSFIKDPKRRVLAAMISKLDDGVGQIVQALKKSGMLNNTVVLFYADNGAPTIGKHSNGG 303

Query: 329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           SN+PLRG K + WEG VR A  IWSPLL + G V+ Q++HVSDWLPTL  AA
Sbjct: 304 SNFPLRGQKYSPWEGAVRTAAAIWSPLLNNTGRVSNQWIHVSDWLPTLARAA 355



 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 66/149 (44%), Gaps = 35/149 (23%)

Query: 573 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVK------------ 620
           +EIDG D W  L+    + R T++HNID     S+ TK  WKLV                
Sbjct: 364 SEIDGQDQWEALNNPNLTGRTTVMHNIDQFQHYSSYTKNGWKLVNGTSWEGAFDDWMGSL 423

Query: 621 -----------VMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCG--PVKEV 667
                       +R    + G   +V L  ++          ++R  A++ C   P   +
Sbjct: 424 DEEDELSEEEYSVRLISSVVGRLTKVDLDDVA----------RIRRDATVHCPYIPGGTI 473

Query: 668 PCEPQIAPCLFDIKNDPCEKNNLADRSED 696
            CEPQ +PCLF+I  DPCE NN+A++  D
Sbjct: 474 DCEPQKSPCLFNILEDPCETNNIANQRPD 502



 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 32/48 (66%), Gaps = 2/48 (4%)

Query: 754 EGMRKLRDAASIQCG--PVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 799
           + + ++R  A++ C   P   + CEPQ +PCLF+I  DPCE NN+A++
Sbjct: 452 DDVARIRRDATVHCPYIPGGTIDCEPQKSPCLFNILEDPCETNNIANQ 499



 Score = 41.6 bits (96), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 28/51 (54%)

Query: 508 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
           +EIDG D W  L+    + R T++HNID     S+ T+  WKLV   S  G
Sbjct: 364 SEIDGQDQWEALNNPNLTGRTTVMHNIDQFQHYSSYTKNGWKLVNGTSWEG 414



 Score = 40.8 bits (94), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 26/36 (72%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH+V+   E  GL L +K++P+Y +E GYRTR++ 
Sbjct: 96  MQHHVIEVDEPWGLGLDQKLMPEYFREAGYRTRLVG 131


>gi|449680619|ref|XP_002157149.2| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
          Length = 502

 Score =  370 bits (949), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 178/332 (53%), Positives = 247/332 (74%), Gaps = 3/332 (0%)

Query: 50  FVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTP 109
           F+ +V  +  PHII I+ADDLGWND+ FHG ++IPTPNID LA +G+IL NYY + +CTP
Sbjct: 10  FIMIVIHADKPHIIMIVADDLGWNDISFHGSNEIPTPNIDRLANNGVILDNYYVLPICTP 69

Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
           SRSAIMTG++PIHTGMQ + ++G    G+ L+EK LPQYLK+ GY+T  VGKWHLGF+ K
Sbjct: 70  SRSAIMTGRYPIHTGMQQDTIFGPNPYGVGLNEKFLPQYLKQQGYKTHGVGKWHLGFFAK 129

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           +YTPT+RGF+S+ G + G  DY++HS  E    GLD+  +    +   G YST+++TAEA
Sbjct: 130 QYTPTYRGFDSYYGSYLGKGDYWNHSNTETYS-GLDLHDNENGVFSQDGNYSTEMYTAEA 188

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYE-PLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
           +  I+NH++ EPLFLYLA+ A HSAN  E PLQAP  +++   +I+  +R K+AA+L  +
Sbjct: 189 ISCINNHNSSEPLFLYLAYQAVHSANTEEDPLQAPQEWIDKFSYIKHEQRRKYAAMLGYM 248

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           D  VG+V +AL +++ML NSII+F +DNGG A GF+ N A+N+PLRGVK TL+EGGVRG 
Sbjct: 249 DYGVGRVHDALAKKKMLDNSIIIFTTDNGGPANGFDYNWANNFPLRGVKATLFEGGVRGV 308

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             ++S L+ES   V+ + +H++DWLPTL++ A
Sbjct: 309 SFVYSKLIESPR-VSHELIHITDWLPTLVNLA 339



 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 60/123 (48%), Gaps = 20/123 (16%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDE-WQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
           +DG D W+ L   + S+RN +L NID++ W+  AL  G WK++K            G   
Sbjct: 348 LDGFDQWATLQNKQSSQRNEVLLNIDEKVWKNEALRVGSWKIIK-----------EGNYW 396

Query: 634 QVYLSGLSDREWLALAMRKLRDAASIQCG---PVKEVPCEPQIAPCLFDIKNDPCEKNNL 690
             +    S  E    +   L  +++++CG   P+    C+   + CLF I  DPCE N+L
Sbjct: 397 DGWYPPPSFNEQSNNSFSYL--SSTVKCGHDIPIVINHCD---SYCLFHIDEDPCEINDL 451

Query: 691 ADR 693
           + +
Sbjct: 452 SKK 454



 Score = 44.7 bits (104), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 30/45 (66%), Gaps = 1/45 (2%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDE-WQISALTRGKWKLVKE 553
           +DG D W+ L   + S+RN +L NID++ W+  AL  G WK++KE
Sbjct: 348 LDGFDQWATLQNKQSSQRNEVLLNIDEKVWKNEALRVGSWKIIKE 392


>gi|157108840|ref|XP_001650408.1| arylsulfatase b [Aedes aegypti]
 gi|108879186|gb|EAT43411.1| AAEL005134-PB [Aedes aegypti]
          Length = 607

 Score =  370 bits (949), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 175/330 (53%), Positives = 231/330 (70%), Gaps = 4/330 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+ I+ADD+GWNDVGFHG +QIPTPNIDALAY GIIL  +YT  +CTPSR+++MTGK+
Sbjct: 32  PHIVIIMADDMGWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTGKN 91

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PI+ GMQH V+   E  GL L +KI+P+Y KE GYRT +VGKWHLGF  K+YTPT RGF+
Sbjct: 92  PINIGMQHYVIVSDEPWGLGLDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRGFD 151

Query: 180 SHLGYWTGHQDYFDHS---AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           +H+GY   + DY+D++   +      G DMR +L   +D +G Y+TD FT  A  II  H
Sbjct: 152 THVGYLGPYVDYWDYTLKFSPPKSFQGYDMRNNLNVDYDSNGTYATDHFTKAASSIIERH 211

Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
            T +PLFL + H A H+AN  +PLQAP+  +    +I D +R  +AA++ KLD+SVG++ 
Sbjct: 212 DTKDPLFLVVNHLAPHAANDDDPLQAPEEDIRKFDYISDERRRIYAAMVSKLDDSVGQIF 271

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
            +L  + ML NSII+F+SDNG   A  + N  SN+PLRG+K+  WE   R    IWSPLL
Sbjct: 272 NSLRSKNMLDNSIILFMSDNGAPTAALHANTGSNYPLRGIKSVPWEAATRCVAAIWSPLL 331

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           + R  V+ Q++H+SDWLPTL SAA   DIP
Sbjct: 332 QERQRVSNQFIHISDWLPTLASAAGI-DIP 360



 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 61/141 (43%), Gaps = 25/141 (17%)

Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
            +EIDG D W  LS +  + R  +L+ ID+ +  S+  +  +K V             G 
Sbjct: 365 HSEIDGQDQWEALSYDTGNPRRVVLNMIDEIYGYSSYMENGFKFVNGTYSNGSYDGWYGQ 424

Query: 632 P---DQVYLSGLSDREWLALAMRK-----------------LRDAASIQCGPVKEV-PCE 670
           P   DQ     LSD +++ L ++                  LR  A + C    E   C 
Sbjct: 425 PNTSDQT----LSDDQYIDLVLQTEITRWAGETISRDTIKYLRKHARVNCNHQPEANKCN 480

Query: 671 PQIAPCLFDIKNDPCEKNNLA 691
           P   PCLFDI NDPCE N+L+
Sbjct: 481 PLKRPCLFDIINDPCELNDLS 501



 Score = 42.7 bits (99), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 754 EGMRKLRDAASIQCGPVKEV-PCEPQIAPCLFDIKNDPCEKNNLA 797
           + ++ LR  A + C    E   C P   PCLFDI NDPCE N+L+
Sbjct: 457 DTIKYLRKHARVNCNHQPEANKCNPLKRPCLFDIINDPCELNDLS 501



 Score = 41.6 bits (96), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH V+   E  GL L +KI+P+Y KE GYRT ++ 
Sbjct: 97  MQHYVIVSDEPWGLGLDQKIMPEYFKEAGYRTHLVG 132


>gi|312382061|gb|EFR27642.1| hypothetical protein AND_05535 [Anopheles darlingi]
          Length = 881

 Score =  366 bits (940), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 179/340 (52%), Positives = 237/340 (69%), Gaps = 6/340 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+FI+ADD+G NDVGFHG +QIPTPNIDALAY GIIL  +Y+  +CTPSR+A+MTG+H
Sbjct: 34  PHIVFIMADDMGLNDVGFHGSNQIPTPNIDALAYDGIILNRHYSAPMCTPSRAALMTGRH 93

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P++ GMQH V+   E  GL L EK+LPQY +E GYRT ++GKWHLGFY + Y PT RGF+
Sbjct: 94  PMNVGMQHYVIDSDEPWGLGLQEKLLPQYFREAGYRTHLIGKWHLGFYAEPYLPTNRGFD 153

Query: 180 SHLGYWTGHQDYFDHSA--EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           +H+GY   + DY+ + +  +     G D+R++L   +  +G Y+TD FT  AV+II +H+
Sbjct: 154 THIGYLGPYIDYWSYISKMDSATFEGYDLRQNLAVNYKPNGTYATDYFTEAAVEIIRSHN 213

Query: 238 -TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
            T E + L L H A H+ N   PLQAP+  +    +I D  R  +AA++ KLD+SVG+V 
Sbjct: 214 RTGERMLLVLNHLAPHTGNDDAPLQAPEETIEKFAYIRDTDRRTYAAMMSKLDDSVGQVY 273

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
            AL+Q  ML NSIIVF+SDNGG   G + N+ SN+PLRG K++ WEG VR A LIWSPLL
Sbjct: 274 RALQQHGMLENSIIVFISDNGGVTRGMHSNSGSNYPLRGQKHSPWEGAVRTAALIWSPLL 333

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENI 396
           + R  V+ Q+ H+SDWLPTL SAA    IP Y    + +I
Sbjct: 334 KDRQRVSNQWFHISDWLPTLASAAG---IPAYRGDNLSDI 370



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 46/193 (23%), Positives = 77/193 (39%), Gaps = 37/193 (19%)

Query: 573 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGP 632
           ++IDGID W  L+    + R  +++N+D+ +  ++      K V    +        G  
Sbjct: 368 SDIDGIDQWEALAYGTGNPRQRVMNNLDEIFGYTSYVDNGLKYVNGTTLDGVNDGWYGAI 427

Query: 633 DQVYLSGLSDREWLALAMRK---------------LRDAASIQCG-PVKEVPCEPQIAPC 676
           D    +  S+ E+L   M                 LR+ A I CG  +    C P   PC
Sbjct: 428 DDTD-TLPSNAEYLRTVMASHVGRLTKPDEKVVSYLREHARIDCGNDIGNRDCNPLRRPC 486

Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKK 736
           LFDI NDPCE ++L +R                    YPDK  + +++ +K + +  K +
Sbjct: 487 LFDIINDPCEMHDLFER--------------------YPDKVRQIQERVEKYRLRAAKPR 526

Query: 737 KKKKKKKKKKKKY 749
            +         K+
Sbjct: 527 NRPHDPAADPSKF 539



 Score = 46.6 bits (109), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 1/49 (2%)

Query: 752 EEEGMRKLRDAASIQCG-PVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 799
           +E+ +  LR+ A I CG  +    C P   PCLFDI NDPCE ++L +R
Sbjct: 455 DEKVVSYLREHARIDCGNDIGNRDCNPLRRPCLFDIINDPCEMHDLFER 503



 Score = 41.6 bits (96), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH V+   E  GL L EK+LPQY +E GYRT ++ 
Sbjct: 99  MQHYVIDSDEPWGLGLQEKLLPQYFREAGYRTHLIG 134


>gi|443734861|gb|ELU18717.1| hypothetical protein CAPTEDRAFT_218441 [Capitella teleta]
          Length = 500

 Score =  365 bits (937), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 177/327 (54%), Positives = 231/327 (70%), Gaps = 3/327 (0%)

Query: 57  SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
           +G P+++FILADDLGW+D+  HG  +IPTPNID LA  GI+L NYY   +CTPSR+A+MT
Sbjct: 22  AGHPNVVFILADDLGWDDISLHGSQEIPTPNIDLLATDGILLNNYYVQPICTPSRAALMT 81

Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           G+HP+H G+QH+V+   +  GL L+E +LPQYLK LGY T +VGKWHLGFY KE+TPT R
Sbjct: 82  GRHPVHLGLQHDVIVWAQPYGLGLNETLLPQYLKTLGYSTHMVGKWHLGFYDKEHTPTKR 141

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKM-WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
           GF+SHLGY+TG +DY+DH+    K  WGLD   D E      G+YST+VFT+EA  +I  
Sbjct: 142 GFDSHLGYYTGCEDYYDHTWGFTKQDWGLDFWHDREVDRSAFGQYSTEVFTSEAERVIAE 201

Query: 236 HSTDEPLFLYLAHAATHSANPYEP--LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
           H   +PLFLYLA  A HS NP     LQAP  Y+     I+  +R  FA ++  LD+SV 
Sbjct: 202 HDVSKPLFLYLAQQAVHSGNPGNKVRLQAPWKYVKNFMGIKSEERRVFAGMVSALDDSVS 261

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
            V +AL ++ +L+++II+F +DNGG A   + N A NWPLRG K T+WEGGVRG G IWS
Sbjct: 262 NVTKALHEKGILNDTIIIFSTDNGGPANRQDYNDACNWPLRGSKRTMWEGGVRGNGFIWS 321

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA 380
           PLLE+ G V+E  + + DW+PT+L AA
Sbjct: 322 PLLENSGYVSEHLMQIVDWVPTVLEAA 348



 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 45/150 (30%), Positives = 63/150 (42%), Gaps = 22/150 (14%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
           +DG+  W  LSR+E S R  +LHNID      AL     K+V             GG  +
Sbjct: 359 LDGLSQWKALSRHESSPRKVLLHNIDPINGDGALRTTDMKIV-----------FGGGDTE 407

Query: 635 VYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRS 694
            Y       E L  +  K   + +          C+P + PC+F+I  DPCE  N+AD  
Sbjct: 408 RYPGWYEPEEALIRSPSKRPSSTA----------CKPWLKPCVFNITADPCEYYNIADFH 457

Query: 695 EDQRINHYTTEVGRFNQIAYPDKEEEEEKK 724
            D  +     +V  FNQ A P +    + K
Sbjct: 458 PDI-LQELLQKVDEFNQTAVPPRNVPSDPK 486


>gi|241619161|ref|XP_002407085.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215500931|gb|EEC10425.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 588

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 183/366 (50%), Positives = 241/366 (65%), Gaps = 11/366 (3%)

Query: 31  RTRIMAFAVLPLAFTLSMVFVDLVASS--GP---PHIIFILADDLGWNDVGFHGLDQIPT 85
           R R++  A +     L +    ++ +S   P   PHI+FILADDLGWNDV ++G  QI T
Sbjct: 31  RFRLVNIATIGAVILLFIALAAMLLTSRRSPVRQPHIVFILADDLGWNDVSYNGCPQIRT 90

Query: 86  PNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKIL 145
           PNIDALA++GI L+ YYT  +CTPSR+A+MTG++PIHTGMQH V+   E  GLPL  K+L
Sbjct: 91  PNIDALAWNGIRLQRYYTQPMCTPSRAALMTGRYPIHTGMQHFVILQNEPRGLPLKFKLL 150

Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH---SAEEMKMW 202
           PQ+L +LGY ++++GKWHLGFYKKEYTPT RGF+ H+G W G  DY+ H   +       
Sbjct: 151 PQWLGDLGYVSQMLGKWHLGFYKKEYTPTMRGFQKHIGSWGGFVDYYSHIRFNKIGFSHS 210

Query: 203 GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA 262
           GLD R+ L    +  G+Y T+  T  A  +I NH  ++PLFLYLAH A H AN ++PLQ 
Sbjct: 211 GLDFRQGLSEGREFDGQYYTEFMTEAATRVIENHPLEKPLFLYLAHLAPHGANRHDPLQV 270

Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
           P  Y + +  I  + R+ +A ++  LDESVG VVEAL +R MLS++++VF SDNGG   G
Sbjct: 271 PKKYSDKYHDIGHWNRTMYAGMVSALDESVGAVVEALGKRGMLSDTVLVFSSDNGGDTNG 330

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES-RGIVAEQYVHVSDWLPTL--LSA 379
            N N AS+WP +G K TLWEGG+   G IWSPL    RG       H+SDWLPTL  L+ 
Sbjct: 331 ENPNYASSWPFKGQKRTLWEGGIHVPGFIWSPLFSGMRGFDYNNIFHISDWLPTLYQLAG 390

Query: 380 ANKSDI 385
            + SD+
Sbjct: 391 GDPSDL 396



 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 50/145 (34%), Positives = 69/145 (47%), Gaps = 19/145 (13%)

Query: 567 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV---KVMR 623
            D S   +IDGI     LSR   + R  +L NID    +SA+ +G +KLV  V    V+ 
Sbjct: 391 GDPSDLGDIDGISHLDSLSRRSETPRKELLINIDPIENVSAIIEGHFKLVSGVVRGGVLD 450

Query: 624 YQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAA-SIQCG--------PVK------EVP 668
               + G     Y     + E  +L  R LR+A   + CG        P+K        P
Sbjct: 451 EWFQVPGNITWDYNRARQECE-TSLVARVLRNAGHDVACGSEDGSFPTPIKCGKRDPSKP 509

Query: 669 CEPQIAPCLFDIKNDPCEKNNLADR 693
           C P +APCLFD+  DPCE NN+A++
Sbjct: 510 CVPTVAPCLFDLSKDPCEYNNIAEQ 534



 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 21/38 (55%), Positives = 27/38 (71%), Gaps = 5/38 (13%)

Query: 765 IQCG---PVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 799
           I+CG   P K  PC P +APCLFD+  DPCE NN+A++
Sbjct: 499 IKCGKRDPSK--PCVPTVAPCLFDLSKDPCEYNNIAEQ 534


>gi|195441662|ref|XP_002068622.1| GK20325 [Drosophila willistoni]
 gi|194164707|gb|EDW79608.1| GK20325 [Drosophila willistoni]
          Length = 550

 Score =  359 bits (922), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 183/396 (46%), Positives = 250/396 (63%), Gaps = 18/396 (4%)

Query: 57  SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
           S  P+II IL DD+G+NDV FHG +QI TPNIDALAY+G++L   Y   LCTPSR+ ++T
Sbjct: 24  STKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGVLLNKLYVPNLCTPSRATLLT 83

Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           GK+PIHTGMQH V+   E  GLP  E+++P++ ++ GY T+++GKWHLGF++K+ TPT R
Sbjct: 84  GKYPIHTGMQHYVIITDEPWGLPKQERLMPEFFRDAGYSTQLIGKWHLGFWEKDLTPTMR 143

Query: 177 GFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
           GF+ H GY+ G+ DY+DH+   +      G+D RRDL+P    +G Y+TD FTAEA  +I
Sbjct: 144 GFDHHYGYYNGYIDYYDHTLHMLTKNYTKGVDFRRDLDPCPQDNGTYATDAFTAEAKRVI 203

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
             H    PLF+ L+H A H+ N   P+QAP+  +    HI D KR  +A ++  LD+SVG
Sbjct: 204 EQHDKSRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHITDPKRRTYAGMISSLDKSVG 263

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
           + V+AL    ML+NSII+  SDNG    G + N  SN+P RG K + WEGG+R AG IWS
Sbjct: 264 QTVKALSDNGMLNNSIILLYSDNGAPTVGIHSNQGSNYPFRGQKESPWEGGIRSAGAIWS 323

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAAN-------KSDIPNYVNSTVENIIPRYENSILR 406
           PLL+ RG V+ Q +H  DWLPTL +AAN       K D  N   S   NI P+    +L 
Sbjct: 324 PLLQQRGYVSNQIIHAIDWLPTLAAAANVELPENLKLDGVNMWPSLSGNIEPQ-PRRLLH 382

Query: 407 YENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYE 442
             +    Y+S      N +Y NG+      +E +Y+
Sbjct: 383 VMDDVFGYSS--YMRDNLKYVNGS-----SFEGQYD 411



 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 49/167 (29%), Positives = 77/167 (46%), Gaps = 28/167 (16%)

Query: 574 EIDGIDVWSVLSRN-EPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM----RYQVDL 628
           ++DG+++W  LS N EP  R  +LH +DD +  S+  +   K V          ++  DL
Sbjct: 359 KLDGVNMWPSLSGNIEPQPRR-LLHVMDDVFGYSSYMRDNLKYVNGSSFEGQYDQWLGDL 417

Query: 629 TGGPDQ----VYLSGLSDREWLALA---------MRKLRDAASIQCGPVKE-------VP 668
             G D      Y   +   E  A+          + ++R+ A+ +C P+ E         
Sbjct: 418 ESGQDDPLSPYYAQQVLSSEVHAVIGSQSVTNERIEQMREEATHRC-PLNEQDHLQEIYL 476

Query: 669 CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
           CEP  APC F++  DPCE+ NLA+    Q +   + EV +F Q A P
Sbjct: 477 CEPLKAPCFFNLAKDPCERFNLANLYPLQ-LQFLSQEVEQFRQNAVP 522



 Score = 42.4 bits (98), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 33/57 (57%), Gaps = 8/57 (14%)

Query: 754 EGMRKLRDAASIQCGPVKE-------VPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
           E + ++R+ A+ +C P+ E         CEP  APC F++  DPCE+ NLA+   +Q
Sbjct: 450 ERIEQMREEATHRC-PLNEQDHLQEIYLCEPLKAPCFFNLAKDPCERFNLANLYPLQ 505


>gi|157103779|ref|XP_001648126.1| arylsulfatase b [Aedes aegypti]
 gi|108880481|gb|EAT44706.1| AAEL003960-PA [Aedes aegypti]
          Length = 472

 Score =  359 bits (921), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 169/266 (63%), Positives = 205/266 (77%)

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           MTGK+PIHTGMQH VLYG E  GLPL+EK+LPQYLKELGY+  I GKWHLG Y +++TP 
Sbjct: 1   MTGKYPIHTGMQHAVLYGMEPRGLPLTEKLLPQYLKELGYKNHIYGKWHLGSYTRKHTPL 60

Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
            RGF+SH+G+WTGH   FDH+A E   WGLDMRR  + A+DLHG Y+T V   E+V  I 
Sbjct: 61  ERGFDSHVGFWTGHHHMFDHTAVETNAWGLDMRRGFDVAYDLHGYYTTHVIRDESVAAIR 120

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
            H+T +P+FLY++HAATHSANPY+ L APD  +     I ++ R KFAA+L +LD+SVG 
Sbjct: 121 AHNTSQPMFLYVSHAATHSANPYDFLPAPDETVERLAGISNYSRRKFAAMLTELDQSVGA 180

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           ++ AL +R ML NSIIVF +DNGG A GFN NAASNWPLRG KNTLWEGGVRGAG IWSP
Sbjct: 181 IIVALSERGMLDNSIIVFSTDNGGPAEGFNNNAASNWPLRGTKNTLWEGGVRGAGCIWSP 240

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
           L+E +  V+ Q +H+SDWLPTLL AA
Sbjct: 241 LIEEKRRVSHQIMHISDWLPTLLDAA 266



 Score = 79.3 bits (194), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 53/149 (35%), Positives = 70/149 (46%), Gaps = 26/149 (17%)

Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVK-------- 620
           N   + +DGI VW  L   + ++R+ ILHNIDD W  +ALT   WK+VK           
Sbjct: 271 NKLPSNLDGISVWPELRNGDVTRRHEILHNIDDIWGSAALTVDDWKVVKGTNYEGKWDAW 330

Query: 621 --------VMRYQV-DLTGGPDQVYLSGLS----DREWLALAMRKLRDAASIQCGPVKEV 667
                      Y +  + G P    LS L     + E     + KLR  A++QC      
Sbjct: 331 YGPAGDHDTKSYNLSSIFGCPTGKALSKLKMLPPEEE-----IIKLRREATVQCANGTRN 385

Query: 668 PCEPQIAPCLFDIKNDPCEKNNLADRSED 696
            C P  +PCLFD+ NDPCE  NLAD+  D
Sbjct: 386 KCNPLESPCLFDLFNDPCEFENLADQYPD 414



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/34 (73%), Positives = 28/34 (82%)

Query: 1  MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
          MQH VLYG E  GLPL+EK+LPQYLKELGY+  I
Sbjct: 11 MQHAVLYGMEPRGLPLTEKLLPQYLKELGYKNHI 44



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/58 (46%), Positives = 33/58 (56%)

Query: 742 KKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 799
           K   K K    EE + KLR  A++QC       C P  +PCLFD+ NDPCE  NLAD+
Sbjct: 354 KALSKLKMLPPEEEIIKLRREATVQCANGTRNKCNPLESPCLFDLFNDPCEFENLADQ 411



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 31/47 (65%)

Query: 858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGDDL 904
           YPD+L  + K+L++ N TAV P N P D  GDP+ + + W  FGD+L
Sbjct: 412 YPDILEVLLKKLSDYNATAVPPGNMPLDSRGDPRFWGYTWHNFGDEL 458



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 33/55 (60%)

Query: 504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
           N   + +DGI VW  L   + ++R+ ILHNIDD W  +ALT   WK+VK  +  G
Sbjct: 271 NKLPSNLDGISVWPELRNGDVTRRHEILHNIDDIWGSAALTVDDWKVVKGTNYEG 325


>gi|170040781|ref|XP_001848166.1| arylsulfatase b [Culex quinquefasciatus]
 gi|167864377|gb|EDS27760.1| arylsulfatase b [Culex quinquefasciatus]
          Length = 657

 Score =  358 bits (920), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 178/358 (49%), Positives = 233/358 (65%), Gaps = 20/358 (5%)

Query: 53  LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
           L  S   PHI+ I+ADD+GWNDVGFHG +QIPTPNIDALAY GIIL  +YT  +CTPSR+
Sbjct: 24  LYESGQKPHIVIIMADDMGWNDVGFHGSNQIPTPNIDALAYGGIILNRHYTAPMCTPSRA 83

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
           AIMTG++PI  GMQH V+   E  GL L +KI+P+Y +E GYRT +VGKWHLGF+ ++YT
Sbjct: 84  AIMTGRNPISVGMQHYVIDSDEPWGLGLDQKIMPEYFREAGYRTHLVGKWHLGFFAQQYT 143

Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           PT RGF+SH  Y                  G DMR +L   +D +G Y+TD FT  A  I
Sbjct: 144 PTMRGFDSHTNY-----------------TGYDMRHNLAVDYDANGTYATDHFTDAASRI 186

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I  H+  EPLFL + H A H+ N  +PLQAP+  +    HI D  R  +AA++ KLD+SV
Sbjct: 187 IDKHNPSEPLFLMVNHLAPHTGNDNDPLQAPEERIKKFEHISDENRRIYAAMVSKLDDSV 246

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G + ++L  + M+ NSII+F+SDNG  + G + N+ SN+PLRG KN+ WEG  R    IW
Sbjct: 247 GAIFKSLRAKNMIQNSIILFISDNGAPSFGLHSNSGSNFPLRGQKNSPWEGATRNVAAIW 306

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENG 410
           SPLLE R  V+ QY+H+SDWLPT  S A    +P   NS+  + + ++E   L Y+ G
Sbjct: 307 SPLLEERQRVSNQYIHISDWLPTFASIAGIR-LPQARNSSEVDGVDQWE--ALSYDTG 361



 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 44/167 (26%), Positives = 71/167 (42%), Gaps = 36/167 (21%)

Query: 573 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG- 631
           +E+DG+D W  LS +    R  IL+ ID+ +  S+  +  +K V            TGG 
Sbjct: 345 SEVDGVDQWEALSYDTGYPRRVILNMIDEIFGYSSYMENGFKYVN---------GTTGGG 395

Query: 632 ----------------PDQVYLSGLSDREWLALA--------MRKLRDAASIQC-GPVKE 666
                            D+ Y+  + + E   L+        ++ LR  + + C  P+  
Sbjct: 396 NFDGWYGQLNNEDPHPSDEEYIKTVLETEIAQLSGDVLSSELIKHLRKHSRVVCHKPLHP 455

Query: 667 VPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIA 713
             C P    CLFDI NDPCE N+++ R E  +     T+V    ++A
Sbjct: 456 TTCNPLKRACLFDIVNDPCELNDISAR-EPFKFRELRTKVETLRRLA 501



 Score = 42.7 bits (99), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 1/47 (2%)

Query: 754 EGMRKLRDAASIQC-GPVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 799
           E ++ LR  + + C  P+    C P    CLFDI NDPCE N+++ R
Sbjct: 436 ELIKHLRKHSRVVCHKPLHPTTCNPLKRACLFDIVNDPCELNDISAR 482


>gi|443701814|gb|ELU00075.1| hypothetical protein CAPTEDRAFT_177949 [Capitella teleta]
          Length = 545

 Score =  357 bits (916), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 177/351 (50%), Positives = 237/351 (67%), Gaps = 16/351 (4%)

Query: 44  FTLSMVFVDLVASSGP-----PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           FTL   FV +V ++G      P+I++ILADDLGW+D+  HG +QIPTPNIDALA  GI+L
Sbjct: 7   FTLC-SFVCIVFAAGSVRNQRPNIVYILADDLGWDDISLHGSEQIPTPNIDALAADGILL 65

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
            NYY   +CTPSR+A++TGKHP+H G+QHN +      GL L+E+ILP+YL  LGY + +
Sbjct: 66  NNYYVQPICTPSRAALLTGKHPVHLGLQHNTIPAPSAYGLGLNERILPEYLNTLGYDSHM 125

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEP 212
           VGKWHLG++  ++TPT+RGF+SH GY  G +DY DH      S   M  WGLD   D + 
Sbjct: 126 VGKWHLGYFTPQHTPTYRGFKSHFGYLNGCEDYLDHTLAYDFSTLGMDGWGLDFWNDTKI 185

Query: 213 AWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANP---YEPLQAPDHYLNI 269
                G+YST++FT  A ++I + +T EPLFLY++H A HS NP      LQAP  Y N 
Sbjct: 186 HRTSFGQYSTEIFTTRAEELIRS-NTGEPLFLYMSHQAVHSGNPGLNGSKLQAPWKYFNK 244

Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
             +I+  +R + A ++  LD+SVG +   L  + ML+N+IIVF +DNGG A  F+ N AS
Sbjct: 245 FNYIQSDERRRLAGMISALDDSVGNITRTLHAKGMLNNTIIVFSTDNGGPADRFDDNCAS 304

Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           NWPLRG K T+WEGG+RG G +WSPLL+    V+E ++ + DWLPTLL AA
Sbjct: 305 NWPLRGGKRTMWEGGLRGNGFVWSPLLKHSNYVSEHFMSIMDWLPTLLDAA 355



 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 63/139 (45%), Gaps = 18/139 (12%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVK-----------VVKVM 622
           +++G+  W  LS N PS R+ +LHNID       L  G  K+V                M
Sbjct: 365 DVEGMSQWKSLSENSPSPRDYLLHNIDPLTGEGGLRLGDMKVVFGGGNTKAHRGWYPPEM 424

Query: 623 RYQVDLTGGPDQVYLSGLSD-REWLALAMRKLRDAASIQCGPVK---EVPCEPQIAPCLF 678
           R+  +    P  ++ + L D    L     + + +  ++CGP        C+  I PCLF
Sbjct: 425 RWGNETVSAPSHLFETDLWDIFHDLGRPPVQKKISVQVECGPAPPDVTTNCKSWIEPCLF 484

Query: 679 DIKNDPCEKNNLADRSEDQ 697
           +I  DPCE +NL   SEDQ
Sbjct: 485 NISADPCEYHNL---SEDQ 500


>gi|198466304|ref|XP_002135153.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
 gi|198150538|gb|EDY73780.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
          Length = 577

 Score =  356 bits (913), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 178/398 (44%), Positives = 248/398 (62%), Gaps = 18/398 (4%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           +S  P+II IL DD+G+NDV FHG +QI TPNIDALAY+GI+L  +Y   LCTPSR+ ++
Sbjct: 24  ASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSRATLL 83

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           TGK+PIHTGMQH V+   E  GLP  E+++P+  ++ GY T +VGKWHLGF++K+ TPT 
Sbjct: 84  TGKYPIHTGMQHFVIVTDEPWGLPRQERLMPELFRDAGYATHLVGKWHLGFWRKDLTPTM 143

Query: 176 RGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           RGF+ H GY+ G+ DY+D +   +      GLD RRDLEP  +  G Y+T+ FT EA  +
Sbjct: 144 RGFDHHFGYYNGYMDYYDQTVRMLDRNYSTGLDFRRDLEPCREAEGTYATEAFTTEARKV 203

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I  H  + PLF+ L+H A H+ N   P+QAP+  +    HI D KR  +A ++  LD+SV
Sbjct: 204 IERHDKNRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHIRDPKRRTYAGMISSLDKSV 263

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G+ + AL    ML+NSI++  SDNG    G + NA SN+P RG K + WEGG+R  G +W
Sbjct: 264 GQTMRALADNGMLNNSIVLLYSDNGAPTVGIHSNAGSNYPFRGQKESPWEGGIRSVGALW 323

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS-------IL 405
           SPLL+ RG V+ Q +H  DWLPTL +AA  S +P+ +     N+ P    S       +L
Sbjct: 324 SPLLQQRGYVSNQAIHAIDWLPTLAAAAGVS-LPSDLRLDGLNLWPSLSASAQPQPRNLL 382

Query: 406 RYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYEN 443
              +    Y+S        +Y NG+      +E RY++
Sbjct: 383 HVLDDVFGYSS--YTQDTLKYVNGS-----SFEGRYDH 413



 Score = 52.8 bits (125), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 51/155 (32%)

Query: 575 IDGIDVWSVLSRN-EPSKRNTILHNIDDEWQISALTKGKWKLV----------------- 616
           +DG+++W  LS + +P  RN +LH +DD +  S+ T+   K V                 
Sbjct: 361 LDGLNLWPSLSASAQPQPRN-LLHVLDDVFGYSSYTQDTLKYVNGSSFEGRYDHWLGELE 419

Query: 617 ------------KVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPV 664
                       ++V+    Q  L        ++   DR      +R++RD A+ +C P+
Sbjct: 420 DGEEDPNSRDYGQLVRASEVQAVLGS------VAPTDDR------IRQMRDEATQRC-PI 466

Query: 665 KE-------VPCEPQIAPCLFDIKNDPCEKNNLAD 692
           +E         CEP  A CLFD+ NDPCE+ NLA+
Sbjct: 467 QERDPEDPVYLCEPLKAACLFDLANDPCERYNLAN 501



 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 47/91 (51%), Gaps = 8/91 (8%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE------- 772
           E E  ++    +   +  +  + +      +  ++ +R++RD A+ +C P++E       
Sbjct: 417 ELEDGEEDPNSRDYGQLVRASEVQAVLGSVAPTDDRIRQMRDEATQRC-PIQERDPEDPV 475

Query: 773 VPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
             CEP  A CLFD+ NDPCE+ NLA+   +Q
Sbjct: 476 YLCEPLKAACLFDLANDPCERYNLANLYPLQ 506


>gi|291242646|ref|XP_002741217.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 526

 Score =  354 bits (908), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 180/353 (50%), Positives = 235/353 (66%), Gaps = 9/353 (2%)

Query: 40  LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           L L   LS++   LV SS  P+IIFILADDLGW+DV FHG  QIPTPNID LAYSG++L 
Sbjct: 7   LILKVVLSLLSWKLVISSKQPNIIFILADDLGWDDVSFHGSHQIPTPNIDELAYSGVLLH 66

Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
           NYY   +CTP+R A++TG++P+H G+QH V+   E  GLPL+E  LP YLK+LGY T +V
Sbjct: 67  NYYVQPVCTPTRGALLTGRYPMHLGLQHFVITPNEPVGLPLNETTLPTYLKKLGYSTHMV 126

Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
           GKWHLGF+ KEYTPT+RGF+SH GY+ GHQDY+ H+A     WG D+R +++      G+
Sbjct: 127 GKWHLGFFAKEYTPTYRGFDSHYGYFLGHQDYYTHNALWNNQWGFDLRHNMDLQRSTFGE 186

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANP---YEPLQAPDHYLNIHRHIEDF 276
           Y  ++FT +A  +I++H   +PLFLY AH A H  N       L+AP  Y +   HI D 
Sbjct: 187 YGPELFTTQAEKLIYDHDHKKPLFLYFAHQAVHYGNSGPNGTLLEAPYKYTSRFPHIADH 246

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
           +R  +A ++  LD+SVG +  AL++  + +NSIIVF +DNGG         A N PLRG 
Sbjct: 247 QRRIYAGMVSALDDSVGNITRALKKSGLYNNSIIVFSTDNGGPHEC----EAFNTPLRGT 302

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL--LSAANKSDIPN 387
           KNTLWEGG+RGA  I S LLE    V E  +HV DWLPT+  ++  N SD+ N
Sbjct: 303 KNTLWEGGIRGAAFIHSVLLEKPKRVCEGMMHVVDWLPTMYHVAGGNLSDLHN 355



 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 42/125 (33%), Positives = 56/125 (44%), Gaps = 10/125 (8%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
           +DG ++W  LS    S R  ILHNID   Q +AL  G  K++       + V     P Q
Sbjct: 356 LDGFNMWDTLSSGVSSPRTEILHNIDPITQTAALRVGNLKVIVGENKSDWFV-----PPQ 410

Query: 635 VYLSGLSDREWLALAMRKLRDAASIQCGPVKEVP---CEPQIAPCLFDIKNDPCEKNNLA 691
           +      +   L            + C  V  V    C+P+  PCL++IKNDPCE  NLA
Sbjct: 411 ITFG--KNGTNLTKQFGSDHRGIIVHCSSVPPVAYKNCDPKQKPCLYNIKNDPCEYYNLA 468

Query: 692 DRSED 696
           D   D
Sbjct: 469 DEYPD 473



 Score = 44.7 bits (104), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 32/55 (58%), Gaps = 9/55 (16%)

Query: 765 IQCGPVKEVP---CEPQIAPCLFDIKNDPCEKNNLAD------RSEVQRINHYTT 810
           + C  V  V    C+P+  PCL++IKNDPCE  NLAD      +S + ++N Y +
Sbjct: 433 VHCSSVPPVAYKNCDPKQKPCLYNIKNDPCEYYNLADEYPDVLQSLLYKLNDYNS 487


>gi|24666175|ref|NP_649023.1| CG7402 [Drosophila melanogaster]
 gi|7293925|gb|AAF49287.1| CG7402 [Drosophila melanogaster]
          Length = 579

 Score =  353 bits (907), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 172/356 (48%), Positives = 234/356 (65%), Gaps = 8/356 (2%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
             +L ++  LS+ +      S  P+I+ IL DD+G NDV FHG +QI TPNIDALAY+GI
Sbjct: 7   LVLLVVSSILSLAYGS--GYSTKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGI 64

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
           +L  +Y   LCTPSR+ ++TGK+PIHTGMQH V+   E  GLP  E+++P+  ++ GY T
Sbjct: 65  LLNKHYVPNLCTPSRATLLTGKYPIHTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYST 124

Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPA 213
            +VGKWHLGF++K+ TPT RGF+ H GY+ G+ DY+DH    +      GLD RRDLEP 
Sbjct: 125 HLVGKWHLGFWRKDLTPTMRGFDHHFGYYNGYIDYYDHQVRMLDRNYSAGLDFRRDLEPC 184

Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
            + +G Y+T+ FT+EA  II  H   +PLF+ L+H A H+ N   P+QAP+  +    HI
Sbjct: 185 PEANGTYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHI 244

Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
            D KR  +A ++  LD+SV + + AL+   ML+NSII+  SDNG    G + NA SN+P 
Sbjct: 245 RDPKRRTYAGMISSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPY 304

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS---DIP 386
           RG K + WEGG+R AG +WSPLL+ RG V+ Q +H  DWLPTL  AA  S   D+P
Sbjct: 305 RGQKESPWEGGIRSAGALWSPLLKERGYVSNQAIHAVDWLPTLAGAAGVSLPQDLP 360



 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 55/214 (25%), Positives = 82/214 (38%), Gaps = 58/214 (27%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN-----GTSEN 564
           +DGI++W +LS NE  K  T++H +D+ +  S+  R   K V  +S  G      G  E 
Sbjct: 361 LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMRDTLKYVNGSSFKGRYDQWLGELET 420

Query: 565 RSND---NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKV 621
             +D    SY+  +   DV S+L                       LTK +      ++ 
Sbjct: 421 NEDDPLGESYEQHVLASDVQSLLGNR-------------------GLTKDR------IRQ 455

Query: 622 MRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIK 681
           MR +   T  P    + G +  E                        CEP  APC FD+ 
Sbjct: 456 MRSEATETCPP----IEGQNPLE--------------------SHFKCEPLKAPCFFDLA 491

Query: 682 NDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
            DPCE+ NLA     Q +     E+ +  + A P
Sbjct: 492 KDPCERYNLAQMYPLQ-LQQLADELEQIRKTAIP 524



 Score = 48.5 bits (114), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 30/55 (54%), Gaps = 7/55 (12%)

Query: 756 MRKLRDAASIQCGPVK-------EVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
           +R++R  A+  C P++          CEP  APC FD+  DPCE+ NLA    +Q
Sbjct: 453 IRQMRSEATETCPPIEGQNPLESHFKCEPLKAPCFFDLAKDPCERYNLAQMYPLQ 507


>gi|195494692|ref|XP_002094947.1| GE19935 [Drosophila yakuba]
 gi|194181048|gb|EDW94659.1| GE19935 [Drosophila yakuba]
          Length = 577

 Score =  353 bits (907), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 169/338 (50%), Positives = 227/338 (67%), Gaps = 6/338 (1%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           A S  P+I+ IL DD+G NDV FHG +QI TPNIDALAY+GI+L  +Y   LCTPSR+ +
Sbjct: 23  AYSAKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATL 82

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TGK+PIHTGMQH V+   E  GLP  E+++P+  ++ GY T +VGKWHLGF++K+ TPT
Sbjct: 83  LTGKYPIHTGMQHFVIITDEPWGLPSRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPT 142

Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
            RGF+ H GY+ G+ DY+DH    +      GLD RRDLEP  + +G Y+T+ FT+EA  
Sbjct: 143 MRGFDHHFGYYNGYIDYYDHQVRMLDRNYSHGLDFRRDLEPCPEANGTYATEAFTSEAKR 202

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           II  H   +PLF+ ++H A H+ N   P+QAP+  +    HI D KR  +A ++  LD+S
Sbjct: 203 IIEQHDKSKPLFMVMSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKS 262

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           V + + AL+   ML+NSII+  SDNG    G + NA SN+P RG K + WEGG+R AG +
Sbjct: 263 VAQTIGALKDNDMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEGGIRSAGAL 322

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS---DIP 386
           WSPLL+ RG V+ Q +H  DWLPTL  AA  S   D+P
Sbjct: 323 WSPLLKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLP 360



 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 25/165 (15%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM-RYQV------- 626
           +DGI++W +LS NE  K  T++H +D+ +  S+  K   K +       RY         
Sbjct: 361 LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMKDTLKYINGSSFQGRYDHWLGELKT 420

Query: 627 ----DLTGGPDQVYLSG-----LSDREWLALAMRKLRDAASIQCGPVK-------EVPCE 670
                L+   +Q  L+      L +R      +R++R  A+  C PV+       +  CE
Sbjct: 421 NEDHPLSESYEQHVLASDVQSILGNRWLTEDRIRQMRSEATEMCPPVEGLNPLEPQFKCE 480

Query: 671 PQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
           P  APC FD+  DPCE+ NLA+    Q +     EV +  + A P
Sbjct: 481 PLKAPCFFDLAKDPCERYNLAETYPLQ-LQQLADEVEQIRKTAIP 524



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 34/58 (58%), Gaps = 7/58 (12%)

Query: 753 EEGMRKLRDAASIQCGPVK-------EVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
           E+ +R++R  A+  C PV+       +  CEP  APC FD+  DPCE+ NLA+   +Q
Sbjct: 450 EDRIRQMRSEATEMCPPVEGLNPLEPQFKCEPLKAPCFFDLAKDPCERYNLAETYPLQ 507


>gi|194871664|ref|XP_001972882.1| GG13640 [Drosophila erecta]
 gi|190654665|gb|EDV51908.1| GG13640 [Drosophila erecta]
          Length = 578

 Score =  353 bits (905), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 172/338 (50%), Positives = 226/338 (66%), Gaps = 6/338 (1%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           A S  P+I+ IL DD+G NDV FHG +QI TPNIDALAY+GI+L  +Y   LCTPSR+ +
Sbjct: 23  AYSTKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATL 82

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TGK+PIHTGMQH V+   E  GLP  E+++P+  +E GY T +VGKWHLGF+ K+ TPT
Sbjct: 83  LTGKYPIHTGMQHFVIITDEPWGLPQRERLMPEIFREAGYSTHLVGKWHLGFWHKDLTPT 142

Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
            RGF+ H GY+ G+ DY+DH    +      GLD RRDLEP  + +G Y+T+ FTAEA  
Sbjct: 143 RRGFDHHFGYYNGYIDYYDHQVRMLDRNYSAGLDFRRDLEPCPEANGTYATEAFTAEAKR 202

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           II  H   +PLF+ ++H A H+ N   P+QAP+  +    HI D KR  +A ++  LD+S
Sbjct: 203 IIEQHDKSKPLFMVMSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKS 262

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           V + V AL+   ML+NSII+  SDNG    G + NA SN+P RG K + WEGG+R AG +
Sbjct: 263 VAQTVGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEGGIRSAGAL 322

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS---DIP 386
           WSPLL+ RG V+ Q +H  DWLPTL  AA  S   D+P
Sbjct: 323 WSPLLKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLP 360



 Score = 62.8 bits (151), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 47/156 (30%), Positives = 74/156 (47%), Gaps = 25/156 (16%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV----------KVVKVMRY 624
           +DGI++W +LS NE  K  T++H +D+ +  S+  K   K V          + +  +  
Sbjct: 361 LDGINLWPLLSGNEQPKPRTMIHVLDEVFGYSSYMKDTLKYVNGSSFQGRYDQWLGELET 420

Query: 625 QVD--LTGGPDQVYLSG-----LSDREWLALAMRKLRDAASIQCGPVK-------EVPCE 670
             D  L+   +Q  L+      L +R      +R++R  A+  C PV+          CE
Sbjct: 421 SADDPLSESYEQHVLASDVQSILGNRGLTEDRIRQMRSEATELCPPVEGQNPLEPHFKCE 480

Query: 671 PQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEV 706
           P  APC FD+ NDPCE+ NLA R+   ++     EV
Sbjct: 481 PLKAPCFFDLANDPCERYNLA-RTYPLQLQQLADEV 515



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 7/58 (12%)

Query: 753 EEGMRKLRDAASIQCGPVK-------EVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
           E+ +R++R  A+  C PV+          CEP  APC FD+ NDPCE+ NLA    +Q
Sbjct: 450 EDRIRQMRSEATELCPPVEGQNPLEPHFKCEPLKAPCFFDLANDPCERYNLARTYPLQ 507


>gi|195591209|ref|XP_002085335.1| GD14734 [Drosophila simulans]
 gi|194197344|gb|EDX10920.1| GD14734 [Drosophila simulans]
          Length = 579

 Score =  352 bits (904), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 172/351 (49%), Positives = 231/351 (65%), Gaps = 8/351 (2%)

Query: 44  FTLSMVFVDLVAS--SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
             +S +F     S  S  P+I+ IL DD+G NDV FHG +QI TPNIDALAY+GI+L  +
Sbjct: 10  LVVSTIFSLAFGSGYSTKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKH 69

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y   LCTPSR+ ++TGK+PIHTGMQH V+   E  GLP  E+++P+  ++ GY T +VGK
Sbjct: 70  YVPNLCTPSRATLLTGKYPIHTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGK 129

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHG 218
           WHLGF++K+ TPT RGF+ H GY+ G+ DY+DH    +      GLD RRDLEP  + +G
Sbjct: 130 WHLGFWRKDLTPTMRGFDHHFGYYNGYIDYYDHQVRLLDRNYSAGLDFRRDLEPCPEANG 189

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
            Y+T+ FT+EA  II  H   +PLF+ L+H A H+ N   P+QAP+  +    HI D KR
Sbjct: 190 TYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKR 249

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
             +A ++  LD+SV + + AL+   ML+NSII+  SDNG    G + NA SN+P RG K 
Sbjct: 250 RTYAGMVSSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKE 309

Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS---DIP 386
           + WEGG+R AG +WSPLL+ RG V+ Q +H  DWLPTL  AA  S   D+P
Sbjct: 310 SPWEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLP 360



 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 48/165 (29%), Positives = 73/165 (44%), Gaps = 25/165 (15%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKV-MRYQV------- 626
           +DGI++W +LS NE  K  T++H +D+ +  S+  +   K V       RY         
Sbjct: 361 LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMRDTLKYVNGSSFEGRYDQWLGELET 420

Query: 627 ----DLTGGPDQVYLSG-----LSDREWLALAMRKLRDAASIQCGPVK-------EVPCE 670
                L+   +Q  L+      L +R      +R++R  A+  C PV+          CE
Sbjct: 421 NEDDPLSESYEQHVLASDVQSLLGNRGLTEDRIRQMRSEATETCPPVEGQNPLEPHFKCE 480

Query: 671 PQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
           P  APC FD+  DPCE+ NLA     Q +     EV +  + A P
Sbjct: 481 PLKAPCFFDLAKDPCERYNLAQIYPLQ-LQQLADEVEQIRKTAIP 524



 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 32/58 (55%), Gaps = 7/58 (12%)

Query: 753 EEGMRKLRDAASIQCGPVK-------EVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
           E+ +R++R  A+  C PV+          CEP  APC FD+  DPCE+ NLA    +Q
Sbjct: 450 EDRIRQMRSEATETCPPVEGQNPLEPHFKCEPLKAPCFFDLAKDPCERYNLAQIYPLQ 507


>gi|427793479|gb|JAA62191.1| Putative arylsulfatase b, partial [Rhipicephalus pulchellus]
          Length = 512

 Score =  351 bits (900), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 178/342 (52%), Positives = 231/342 (67%), Gaps = 13/342 (3%)

Query: 71  GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
           GWNDV +HG  QI TPNIDALA++GI L+ YY   LCTPSR+A++TG++PI+ G+QH+V+
Sbjct: 1   GWNDVSYHGCPQIRTPNIDALAWNGIRLRRYYAQPLCTPSRAALLTGRYPINMGLQHSVI 60

Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
           Y  E  GLPLS+ +LPQ+L +LGY T  +GKWH+GF+KKEYTPT RGFE H+G+W  + D
Sbjct: 61  YNEEPRGLPLSDTLLPQWLADLGYVTHHLGKWHIGFFKKEYTPTMRGFERHVGFWGAYID 120

Query: 191 YFDHSAEEM---KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLA 247
           Y+ H    +   +  GLDMRR+L  A +  G+Y T + T EA+++I NH  D+PLFLYLA
Sbjct: 121 YYKHEKAYLGPTRSPGLDMRRNLFLARNDTGRYVTQLLTKEALEVIENHPVDKPLFLYLA 180

Query: 248 HAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSN 307
           H A HSA P +PLQ PD Y+  +R I   +R K+A ++  LDESVG V EAL +  MLSN
Sbjct: 181 HLAPHSAGPQDPLQVPDKYIEQYRDIGSTERMKYAGMVSALDESVGLVFEALAKNGMLSN 240

Query: 308 SIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR---GIVAE 364
           ++ VF SDNG  AAG N N AS+WPLRG K T WEGGVR A L+WS  L  R   G   E
Sbjct: 241 TLFVFTSDNGADAAGPNANYASSWPLRGQKYTPWEGGVRAAALLWSHALGPRVRAGGDYE 300

Query: 365 QYVHVSDWLPTLLSAANKS-------DIPNYVNSTVENIIPR 399
              H++DWLPTL   A          D  ++VN+ V +  P+
Sbjct: 301 HLFHITDWLPTLYQLAGGDPAHLGDIDGVSHVNALVGDASPQ 342



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 50/174 (28%), Positives = 78/174 (44%), Gaps = 29/174 (16%)

Query: 566 SNDNSYQNEIDGIDVWSVL---SRNEPSK--RNTILHNIDDEWQISALTKGKWKLVKVVK 620
             D ++  +IDG+   + L   +  +P +  R  +L NID     SA+  G++K V  V 
Sbjct: 317 GGDPAHLGDIDGVSHVNALVGDASPQPPQPPRTEVLVNIDPIDNYSAIIDGRFKFVAGVA 376

Query: 621 ---VMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRD------------AASIQCG--- 662
              +      + G  D      L   E   +A R L+             AA I CG   
Sbjct: 377 ERGIYEQWYPILGNVDWNSTEPLEQCEASPVA-RVLKSFGPEPACGPSGGAAPIDCGVRD 435

Query: 663 PVKEVPCEPQIAPCLFDIKNDPCEKNNL-ADRSEDQRINHYTTEVGRFNQIAYP 715
           P K   C+P +APCLFD++ DPCE N++ A+R E   +    T++ R+   + P
Sbjct: 436 PAKA--CQPTVAPCLFDLREDPCEYNDVSAERPE--LVKSLLTKLRRYEDASMP 485



 Score = 48.5 bits (114), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 32/47 (68%), Gaps = 6/47 (12%)

Query: 762 AASIQCG---PVKEVPCEPQIAPCLFDIKNDPCEKNNL-ADRSEVQR 804
           AA I CG   P K   C+P +APCLFD++ DPCE N++ A+R E+ +
Sbjct: 426 AAPIDCGVRDPAKA--CQPTVAPCLFDLREDPCEYNDVSAERPELVK 470



 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 25/32 (78%)

Query: 1  MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
          +QH+V+Y  E  GLPLS+ +LPQ+L +LGY T
Sbjct: 55 LQHSVIYNEEPRGLPLSDTLLPQWLADLGYVT 86


>gi|195328507|ref|XP_002030956.1| GM25726 [Drosophila sechellia]
 gi|194119899|gb|EDW41942.1| GM25726 [Drosophila sechellia]
          Length = 579

 Score =  351 bits (900), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 171/351 (48%), Positives = 231/351 (65%), Gaps = 8/351 (2%)

Query: 44  FTLSMVFVDLVAS--SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
             +S +F     S  S  P+I+ IL DD+G NDV FHG +QI TPNIDALAY+GI+L  +
Sbjct: 10  LVVSTIFSLAFGSGYSTKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKH 69

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y   LCTPSR+ ++TGK+PI+TGMQH V+   E  GLP  E+++P+  ++ GY T +VGK
Sbjct: 70  YVPNLCTPSRATLLTGKYPIYTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGK 129

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHG 218
           WHLGF++K+ TPT RGF+ H GY+ G+ DY+DH    +      GLD RRDLEP  + +G
Sbjct: 130 WHLGFWRKDLTPTMRGFDHHFGYYNGYIDYYDHQVRLLDRNYSAGLDFRRDLEPCPEANG 189

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
            Y+T+ FT+EA  II  H   +PLF+ L+H A H+ N   P+QAP+  +    HI D KR
Sbjct: 190 TYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKR 249

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
             +A ++  LD+SV + + AL+   ML+NSII+  SDNG    G + NA SN+P RG K 
Sbjct: 250 RTYAGMVSSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKE 309

Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS---DIP 386
           + WEGG+R AG +WSPLL+ RG V+ Q +H  DWLPTL  AA  S   D+P
Sbjct: 310 SPWEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLP 360



 Score = 59.3 bits (142), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 50/165 (30%), Positives = 72/165 (43%), Gaps = 25/165 (15%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM-RYQVDL----T 629
           +DGI++W +LS NE  K  T++H +D+ +  S+  K   K V       RY   L    T
Sbjct: 361 LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMKDTLKYVNGSSFEGRYDQWLGELET 420

Query: 630 GGPDQV------------YLSGLSDREWLALAMRKLRDAASIQCGPVK-------EVPCE 670
              D +              S L +R      +R++R  A+  C PV+          CE
Sbjct: 421 NEDDPLSESYEKHVLASDVQSLLGNRGLTEDRIRQMRSEATETCPPVEGQNPLEPHFKCE 480

Query: 671 PQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
           P  APC FD+  DPCE+ NLA     Q +     EV +  + A P
Sbjct: 481 PLKAPCFFDLAKDPCERYNLAQIYPLQ-LQELADEVEQIRKTAIP 524



 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 32/58 (55%), Gaps = 7/58 (12%)

Query: 753 EEGMRKLRDAASIQCGPVK-------EVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
           E+ +R++R  A+  C PV+          CEP  APC FD+  DPCE+ NLA    +Q
Sbjct: 450 EDRIRQMRSEATETCPPVEGQNPLEPHFKCEPLKAPCFFDLAKDPCERYNLAQIYPLQ 507


>gi|241619159|ref|XP_002407084.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215500930|gb|EEC10424.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 502

 Score =  350 bits (897), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 172/327 (52%), Positives = 220/327 (67%), Gaps = 5/327 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+FILADDLGWNDV +HG  QI TPNIDALA++GI L  YYT  LCTPSRSA +TG +
Sbjct: 26  PHIVFILADDLGWNDVSYHGSPQILTPNIDALAWNGIRLNRYYTQPLCTPSRSAFLTGCY 85

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P++TGMQH+V+   E  GLPL  K+LPQ+L + GY +R++GKWHLG+YK+EYTPT RGF+
Sbjct: 86  PMNTGMQHSVILTTEPRGLPLHYKLLPQWLGDFGYVSRMLGKWHLGYYKEEYTPTMRGFQ 145

Query: 180 SHLGYWTGHQDYFDH----SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
           SH+G W G  DY+ H    S +   + G D RRD++ + +  G+Y T V T EA+ II +
Sbjct: 146 SHVGSWEGFSDYYSHIMDFSWQTWSISGHDFRRDMQKSKEDDGRYYTHVMTEEALKIIKD 205

Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
           H  ++PLFLY+AH A HS N  EPL+AP  Y + +  I    R+ +A +L  LDESVG V
Sbjct: 206 HPNEKPLFLYIAHLAVHSGNQPEPLKAPTKYTDPYMDIGHPSRTLYAGMLSVLDESVGAV 265

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
            EAL +R +L+ +I+V  +DNG        +  S WP RG K T WEGGVR   ++WSPL
Sbjct: 266 FEALGRRGILNETIVVLTTDNGAGITTQFSSFGSGWPFRGQKGTAWEGGVRVPAVVWSPL 325

Query: 356 L-ESRGIVAEQYVHVSDWLPTLLSAAN 381
             E RG V E   HVSDWLPT    A 
Sbjct: 326 FSEHRGAVVESLFHVSDWLPTFYELAG 352



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 63/148 (42%), Gaps = 21/148 (14%)

Query: 563 ENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVK----- 617
           E    D S   EIDGI     L   E   R  I+ NID    +SA+    +KL++     
Sbjct: 349 ELAGGDPSNLGEIDGISQLDTLRHREKLPRTEIVLNIDPIENVSAIIVYPFKLMQGDDQG 408

Query: 618 ---------VVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRK--LRDAASIQCG---P 663
                    +  V R Q D   G +   +  +       +   +  +  A S++CG   P
Sbjct: 409 GHYDDWYPFMGNVTRTQKDAREGCEASVVYRVMKHSGFVVTCGQDPVTYATSVKCGEKDP 468

Query: 664 VKEVPCEPQIAPCLFDIKNDPCEKNNLA 691
            K   C+P + PCLFD+  DPCE N+++
Sbjct: 469 TKA--CKPIVKPCLFDLSKDPCEYNDIS 494



 Score = 43.9 bits (102), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 28/46 (60%), Gaps = 5/46 (10%)

Query: 755 GMRKLRDAASIQCG---PVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
           G   +  A S++CG   P K   C+P + PCLFD+  DPCE N+++
Sbjct: 451 GQDPVTYATSVKCGEKDPTKA--CKPIVKPCLFDLSKDPCEYNDIS 494


>gi|390361962|ref|XP_789345.3| PREDICTED: arylsulfatase I-like, partial [Strongylocentrotus
           purpuratus]
          Length = 514

 Score =  348 bits (893), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 173/324 (53%), Positives = 215/324 (66%), Gaps = 2/324 (0%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHIIFI+ADDLGW+DV  HG  QIPTPNID LA  G+ L NYY   LCTPSRSAIMTG+
Sbjct: 10  PPHIIFIVADDLGWDDVSLHGSSQIPTPNIDTLAQDGVTLTNYYVSPLCTPSRSAIMTGR 69

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           HPIHTG+Q  V+      GL L EK + QYLK LGY T  VGKWHLG++ KEYTPT+RGF
Sbjct: 70  HPIHTGLQFGVISPEAPYGLGLEEKTMAQYLKTLGYSTHAVGKWHLGYFAKEYTPTWRGF 129

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           +S  G++ G  DY+ H   + ++ G D+ ++ +      G+YSTD+F  EA  II  H+ 
Sbjct: 130 DSFFGFYNGRGDYYTHEEVQSEVSGYDLHKNGKVYRPAFGQYSTDIFNQEAEQIIKAHNA 189

Query: 239 DEPLFLYLAHAATHSA-NPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
            +PLFLYLAH A H+   P   LQAPD Y     HIE   R  +AA++  LD+SVG + +
Sbjct: 190 SQPLFLYLAHQAVHAGVYPDRLLQAPDKYYQRFPHIETEGRRMYAAMVSALDDSVGSISQ 249

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAG-FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
            L    +  NSIIVF +DNGGA    F+   ASNWPLRG K+TLWEGGVRG   + SPL+
Sbjct: 250 TLRDAGLYDNSIIVFTTDNGGATYDFFDGTHASNWPLRGGKHTLWEGGVRGTAFVNSPLI 309

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA 380
           +     ++Q +HV DWLPTL S A
Sbjct: 310 KKPRRFSDQMMHVCDWLPTLHSVA 333



 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/127 (33%), Positives = 62/127 (48%), Gaps = 15/127 (11%)

Query: 576 DGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQV 635
           DG D W  LS N PS RN ILHNI+   + +A+  G +KL        +     G     
Sbjct: 344 DGFDQWDALSNNAPSPRNEILHNIEPIEKYAAIRVGDFKL--------HYGAFGGKIGSK 395

Query: 636 YLSGLSDREWLALAMRK-LRDAASIQCGPVKEVPCEPQI-----APCLFDIKNDPCEKNN 689
           Y SG    E     M + + ++  ++C P K+             PCL++I++DPCE NN
Sbjct: 396 YFSGWYPPEGATSNMAEYVPNSFRVRC-PSKQANASTNCDYGDRKPCLYNIRDDPCEYNN 454

Query: 690 LADRSED 696
           +AD ++D
Sbjct: 455 IADWNQD 461



 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 25/40 (62%)

Query: 511 DGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKL 550
           DG D W  LS N PS RN ILHNI+   + +A+  G +KL
Sbjct: 344 DGFDQWDALSNNAPSPRNEILHNIEPIEKYAAIRVGDFKL 383


>gi|443732842|gb|ELU17406.1| hypothetical protein CAPTEDRAFT_127365 [Capitella teleta]
          Length = 502

 Score =  346 bits (887), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 175/348 (50%), Positives = 223/348 (64%), Gaps = 10/348 (2%)

Query: 39  VLPLAFTLSMVFVDLVASSG----PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           +L  AF L++ F   +  SG    PP+I+FILADDLGW+DV FHG  Q+PTPNIDALA  
Sbjct: 1   MLRCAFVLTLSF--FLECSGNSEIPPNIVFILADDLGWDDVSFHGSRQVPTPNIDALASD 58

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           GIIL NYY   LC+PSR A+MTGKHPI  G+Q  V+   +  GL L EK+LP+YL  LGY
Sbjct: 59  GIILDNYYVHTLCSPSRGALMTGKHPIQIGLQRGVIMPAQPSGLGLKEKLLPEYLNTLGY 118

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
           ++ +VGKWHLG   +EYTP  RGFESH G++ G + Y  H        GLD   + EP  
Sbjct: 119 KSHMVGKWHLGMCAEEYTPMHRGFESHFGFYQGCESYTTHMCGNS---GLDFWLNEEPDH 175

Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
              G+YST +FTA+A  ++  H T  P+FLYLAH A H  N  +   APD Y +    I 
Sbjct: 176 SAGGQYSTSLFTAKAEQLLAEHDTASPMFLYLAHQAVHVGNQDQKFYAPDKYTDKLSFIS 235

Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG-FNLNAASNWPL 333
           D +R + A +L  LDESVGKV +AL  + ML N+IIVF SDNG    G  +    SN+PL
Sbjct: 236 DDRRRQMAGMLTALDESVGKVTKALHAKGMLENTIIVFSSDNGAPHIGNSDHTVGSNFPL 295

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           RG K ++WEGG R    +WSPL +  G +++Q +H+SDWLPTLL AA 
Sbjct: 296 RGGKPSIWEGGSRVPAFVWSPLFKKSGYISKQLMHISDWLPTLLQAAG 343


>gi|427781895|gb|JAA56399.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
          Length = 554

 Score =  344 bits (883), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 169/322 (52%), Positives = 208/322 (64%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+FILADDLGW+DV FHG  QIPTPN+D LA  G+IL NYY    CTPSR+A+MTG +
Sbjct: 36  PHIVFILADDLGWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALMTGLY 95

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PI TGMQ   +   E  GLP   +ILPQYLKE GY T +VGKWHLG YK+  TPT RGF+
Sbjct: 96  PIRTGMQGMPIDVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTCRGFD 155

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           S  GY+ G  DYF H+       GLD   + +P W   G YST VFT  A  II N +  
Sbjct: 156 SFYGYYYGESDYFAHTISYENHTGLDFWLNKKPVWSEIGTYSTSVFTKRAQYIIENRTKS 215

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           +PL L + H ATH A   E LQAP   ++   +I +  R+ +A ++  LD+SVG+VV+AL
Sbjct: 216 KPLLLVITHQATHCALERERLQAPQENIDKFPYIGEKNRTIYAGMVDALDQSVGQVVKAL 275

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
               ML N+IIVF SDNGG   G + +   NWPLRG K T+WEGG R A  +WSPLL  R
Sbjct: 276 SDAGMLENTIIVFSSDNGGGPWGEHNSRGFNWPLRGAKGTVWEGGTRAAAFVWSPLLARR 335

Query: 360 GIVAEQYVHVSDWLPTLLSAAN 381
             V+ Q +H++DWLPT  S A 
Sbjct: 336 RRVSHQLMHITDWLPTFYSIAG 357



 Score = 41.6 bits (96), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 61/139 (43%), Gaps = 22/139 (15%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-ALTKGKWKLV---KVVKVMRYQVDLT 629
           E+DG D+W  LS    S R  +L+N D ++  S AL   ++KLV         RY+V   
Sbjct: 365 ELDGHDMWRPLSHGRRSPRVELLYNYDYKYTKSAALRNHRYKLVLDETGRFADRYKVPGG 424

Query: 630 GGP--DQVYLSGLSDREWLALAMRKL------------RDAASIQCGPVKEVPCE-PQIA 674
             P  D  YL+  S     A  +R              R  A + CG  K++       +
Sbjct: 425 SRPRHDLDYLAAKSTA---ASVLRDFYKKKQLNFPKSWRRKARLTCGKQKKINFSWSATS 481

Query: 675 PCLFDIKNDPCEKNNLADR 693
             LFD+  DPCE NNLA +
Sbjct: 482 VYLFDVVTDPCELNNLASK 500


>gi|158287209|ref|XP_564139.3| AGAP011347-PA [Anopheles gambiae str. PEST]
 gi|157019541|gb|EAL41524.3| AGAP011347-PA [Anopheles gambiae str. PEST]
          Length = 634

 Score =  342 bits (878), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 163/324 (50%), Positives = 222/324 (68%), Gaps = 3/324 (0%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+ I ADD+GWNDVGFHG +QI TP+IDALAY G+IL  +Y+  +CTPSR+A+MTG+H
Sbjct: 34  PHIVLITADDMGWNDVGFHGSNQIATPHIDALAYDGVILNRHYSAPMCTPSRAALMTGRH 93

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PI+ GMQH V+   E  GL L ++I+PQY +  GYRT ++GKWHLGF+ + Y PT RGF+
Sbjct: 94  PINVGMQHYVIDSDEPWGLGLDQRIMPQYFRAAGYRTHMIGKWHLGFFTEHYIPTNRGFD 153

Query: 180 SHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           +H+GY   + DY+ + ++       G DMR++    +  +G Y+TD FT+ A DII  H 
Sbjct: 154 THIGYLGPYVDYWSYVSKMNSGTFEGYDMRQNQFVNYAANGTYATDYFTSAARDIIAQHG 213

Query: 238 -TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
            + +P+ L + H A H+ N  +PLQAP   ++   +I +  R  +AA++ KLD+SVG V 
Sbjct: 214 KSGQPMLLVMNHLAPHAGNDDDPLQAPQETIDRFAYIGNRDRRTYAAMMSKLDDSVGAVY 273

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
           E+L    ML NSIIVF+SDNGG   G + N  SN+P RG K++ WEG VR A LIWSPLL
Sbjct: 274 ESLRANGMLDNSIIVFLSDNGGVTRGMHSNTGSNYPFRGQKHSPWEGAVRTAALIWSPLL 333

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA 380
           +    V+ Q+ H+SDWLPTL SAA
Sbjct: 334 KDTQRVSNQWFHISDWLPTLASAA 357



 Score = 62.8 bits (151), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 41/140 (29%), Positives = 68/140 (48%), Gaps = 17/140 (12%)

Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDL 628
           NS  ++IDGID W  L+    + R  +++N+D+ +  ++  +  +K +    +       
Sbjct: 363 NSSFSDIDGIDQWEALAYGTGNPRQRLMNNLDEIFGYTSYMENGFKYINGTTLNGVNDAW 422

Query: 629 TGGPDQV--------YLSGLSDREWLALAMR-------KLRDAASIQCG-PVKEVPCEPQ 672
            G  D +        YLS +     +A A R        LR  A++ CG  +   PC P 
Sbjct: 423 YGETDPLDQLPDDEQYLSSVMATR-IARAGRLDEKLISYLRRHATVNCGNSIATKPCNPL 481

Query: 673 IAPCLFDIKNDPCEKNNLAD 692
           + PCLFDI NDPCE ++++D
Sbjct: 482 VKPCLFDIINDPCEMHDISD 501



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 32/48 (66%), Gaps = 1/48 (2%)

Query: 752 EEEGMRKLRDAASIQCG-PVKEVPCEPQIAPCLFDIKNDPCEKNNLAD 798
           +E+ +  LR  A++ CG  +   PC P + PCLFDI NDPCE ++++D
Sbjct: 454 DEKLISYLRRHATVNCGNSIATKPCNPLVKPCLFDIINDPCEMHDISD 501


>gi|270005853|gb|EFA02301.1| hypothetical protein TcasGA2_TC007966 [Tribolium castaneum]
          Length = 558

 Score =  342 bits (876), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 169/382 (44%), Positives = 240/382 (62%), Gaps = 24/382 (6%)

Query: 50  FVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTP 109
           F  +  ++G P+++FI+ADDLGWNDVGFHG +QIPTPNIDALAY+GIIL ++Y+    TP
Sbjct: 20  FEKIHCATGQPNVVFIVADDLGWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTP 79

Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
           SR+A++TGK+P+  G+Q   +   E   LP   KI+ +Y K++GY T +VGKWHLG  + 
Sbjct: 80  SRAALLTGKYPMKLGLQGPSITPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRW 138

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAE----EMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
             TPTFRGF+   G++ G   Y+D+ +     + +  G D+RRD  P+W+  GKY+TD+F
Sbjct: 139 NDTPTFRGFDHFFGFYNGFTSYYDYVSNWKINDKEYSGFDLRRDTVPSWNDAGKYATDLF 198

Query: 226 TAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
              AVD+I  H+ + PLF+ +AH A H  N  + L+AP   +N  +HI D  R  +AA++
Sbjct: 199 AEHAVDVIQKHNVNTPLFMMIAHLAVHVGNEGKWLEAPQETVNKFKHIRDPNRRTYAAMV 258

Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
            KLD+S+G V EALE + ML N+I+VF+SDNG    G + N  SN+PLRG+K+TL+EGGV
Sbjct: 259 SKLDDSIGAVFEALEAKNMLQNTIVVFISDNGAPTVGPHHNWGSNYPLRGIKDTLFEGGV 318

Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS-------------------DIP 386
           R    IWSPLL     V+   +H++DWLPTL +A                       D+P
Sbjct: 319 RTVACIWSPLLVQSSRVSTDLIHITDWLPTLFTAVGGDLSVLDPDLDGIDQWSSLVYDLP 378

Query: 387 NYVNSTVENIIPRYENSILRYE 408
           +  N    NI  +  N+ LR+ 
Sbjct: 379 SARNDIPLNIDEKTRNAALRFS 400



 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 86/187 (45%), Gaps = 51/187 (27%)

Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSND 568
           ++DGID WS L  + PS RN I  NID++ + +AL    WKL+        GTS N S +
Sbjct: 363 DLDGIDQWSSLVYDLPSARNDIPLNIDEKTRNAALRFSYWKLIV-------GTSGNGSYN 415

Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDD-EWQISALTKGKWKLVKVVKVMRYQVD 627
             +   ++                     NI++ ++  SA+      + ++ K + Y   
Sbjct: 416 GYFGAPLN--------------------ENIEEQQYNTSAINDS--PVGRIAKKINY--- 450

Query: 628 LTGGPDQVYLSGLSDREWLALAMRKLRDAASIQC--GPVKEVPCEPQI-APCLFDIKNDP 684
                     + LS+ ++  L     R  A+++C     K  PC+P   A CL++I NDP
Sbjct: 451 ----------NPLSETDFDGL-----RRVATLKCLDAKAKRNPCDPASGAVCLYNIPNDP 495

Query: 685 CEKNNLA 691
           CE+N+LA
Sbjct: 496 CEENDLA 502



 Score = 42.7 bits (99), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 25/59 (42%), Positives = 34/59 (57%), Gaps = 5/59 (8%)

Query: 752 EEEGMRKLRDAASIQC--GPVKEVPCEPQI-APCLFDIKNDPCEKNNLADR--SEVQRI 805
            E     LR  A+++C     K  PC+P   A CL++I NDPCE+N+LA    S V+RI
Sbjct: 454 SETDFDGLRRVATLKCLDAKAKRNPCDPASGAVCLYNIPNDPCEENDLAKFFPSVVRRI 512


>gi|340727298|ref|XP_003401983.1| PREDICTED: arylsulfatase B-like [Bombus terrestris]
          Length = 563

 Score =  341 bits (875), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 172/357 (48%), Positives = 240/357 (67%), Gaps = 12/357 (3%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSGP----PHIIFILADDLGWNDVGFHGLDQIPTPNID 89
           IM+ +V+     L   +V L          P+II I+ADDLGWNDV FHG DQIPTPNID
Sbjct: 9   IMSMSVMIATGCLFYFYVVLAQQKAQAQQRPNIIVIMADDLGWNDVSFHGADQIPTPNID 68

Query: 90  ALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
           ALAY+G+IL+ +Y + +CTPSR+A +TG++PI TGMQ + L   E   +PL+  +LP+YL
Sbjct: 69  ALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGHPLDPGEVRAIPLNNTLLPEYL 128

Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD 209
           ++LGY T +VGKWH+G+Y   +TP +RGF++ LGY++G   YF+H+ E+    G D+  D
Sbjct: 129 RKLGYATHLVGKWHVGYYSDYHTPAYRGFDTFLGYYSGFMTYFNHTIEQNHHVGYDLHYD 188

Query: 210 LEPAWDLHGKYS----TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
           +  A +L  KYS    TD+ T  A DII NH+  +PL+L L+H A HS+N  + ++  D 
Sbjct: 189 V--AGNLSVKYSHEYMTDLITERAEDIILNHNHSKPLYLQLSHIAAHSSNINKTVEVRDE 246

Query: 266 YLN--IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
                   +IEDF R K A ++  LDESVG+VV+AL+Q  ML NSII+F+SDNG    G 
Sbjct: 247 EETNATLGYIEDFDRRKLAGVVTALDESVGRVVQALKQANMLENSIILFMSDNGAQTVGL 306

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             N+ SN+PLRG+K TL+EGGVRGA  ++SP +++   ++ Q +HV+DWLPT  SAA
Sbjct: 307 LDNSGSNYPLRGMKFTLFEGGVRGAACVYSPAIKNPSRISNQLIHVTDWLPTFYSAA 363



 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 35/137 (25%), Positives = 63/137 (45%), Gaps = 20/137 (14%)

Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
           +  +DG+D W+ +   E ++R  IL NID++  +S    G++KL+   K+         G
Sbjct: 371 EENLDGVDQWATIVSEENTRRKNILLNIDEKEDLSGALMGRYKLINGAKLQYGDYYGDSG 430

Query: 632 PDQVY----------------LSGLSDREWLALAMRKLRDAASIQCGPVKEVP-CEPQIA 674
             + Y                ++ +S+    +  + +LR AA + C      P C   + 
Sbjct: 431 MSESYPEYNVSNVLHSAAGFAVASVSNVTLNSERIIELRRAAMVVCDNFTSYPKC---LD 487

Query: 675 PCLFDIKNDPCEKNNLA 691
            CLFD+ NDPCE  +++
Sbjct: 488 RCLFDVYNDPCETTDVS 504


>gi|307187653|gb|EFN72625.1| Arylsulfatase B [Camponotus floridanus]
          Length = 525

 Score =  341 bits (874), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 169/335 (50%), Positives = 231/335 (68%), Gaps = 8/335 (2%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PP+I+ I+ADDLGWNDV FHG D+IPTPNIDALAY+G+IL  +Y + LCTPSR+A +TGK
Sbjct: 13  PPNIVVIMADDLGWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPLCTPSRTAFLTGK 72

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +PI TGMQ  VL   E  G+PL++ +LP+YL++LGY T +VGKWH+G++ K YTPT RGF
Sbjct: 73  YPIRTGMQGYVLQPAEPRGIPLNDTLLPEYLRKLGYATHLVGKWHVGYHTKNYTPTRRGF 132

Query: 179 ESHLGYWTGHQDYFDHSA--EEMKMWGLDMRRDLEPAWDLHGKYS--TDVFTAEAVDIIH 234
           ++ LGY+ G+  YF+H+   EE K  G D  R +     +  +Y   TD+ T E  +II 
Sbjct: 133 DTFLGYYNGYIHYFNHTILDEEQKYLGYDFHRIVGENRTIEYRYDYITDIITDEVENIIF 192

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPD--HYLNIHRHIEDFKRSKFAAILHKLDESV 292
           +H+  +PL+L ++H A HS      +Q  D         +IED  R K+A+++  LDESV
Sbjct: 193 SHNPAKPLYLQVSHDAAHSGGIGIEMQVRDWKETNATLGYIEDINRRKYASVVATLDESV 252

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G++++AL +  ML NSIIVF+SDNG    GF  N  SN+PLRG+K +L+EGGVRGA  I+
Sbjct: 253 GRIIDALRKTDMLKNSIIVFISDNGAQTEGFLQNYGSNYPLRGLKFSLFEGGVRGAAYIY 312

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDI 385
           SPL++    V+ Q  H++DWLPTL SAA  N SD+
Sbjct: 313 SPLIDRLSRVSTQLFHITDWLPTLYSAAGGNSSDL 347



 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 40/136 (29%), Positives = 64/136 (47%), Gaps = 13/136 (9%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDD----EWQISALTKGKWKLVKVVKVMRYQVDLT 629
           ++DG D WS +   E SKR +IL NID+    E+Q      G   L          +++ 
Sbjct: 349 QLDGFDQWSAIKSAEDSKRKSILINIDERDTSEYQKYYNYSGNNALYPKYNA----INIL 404

Query: 630 GGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVP-CEPQIAPCLFDIKNDPCEKN 688
             P    ++ +S     A  + +LR  A+I C    +   C  +   CLFD+K DPCE  
Sbjct: 405 ASPVASAIASISTSMLNANKIMQLRKEATIVCKNFMDFSNCTNR--SCLFDVKEDPCETT 462

Query: 689 NLADR--SEDQRINHY 702
           +L+ +   E +R+N +
Sbjct: 463 DLSAKYPKEVERLNMF 478


>gi|189236319|ref|XP_975218.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
          Length = 536

 Score =  340 bits (872), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 167/372 (44%), Positives = 235/372 (63%), Gaps = 24/372 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+++FI+ADDLGWNDVGFHG +QIPTPNIDALAY+GIIL ++Y+    TPSR+A++TGK+
Sbjct: 8   PNVVFIVADDLGWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTPSRAALLTGKY 67

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P+  G+Q   +   E   LP   KI+ +Y K++GY T +VGKWHLG  +   TPTFRGF+
Sbjct: 68  PMKLGLQGPSITPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTFRGFD 126

Query: 180 SHLGYWTGHQDYFDHSAE----EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
              G++ G   Y+D+ +     + +  G D+RRD  P+W+  GKY+TD+F   AVD+I  
Sbjct: 127 HFFGFYNGFTSYYDYVSNWKINDKEYSGFDLRRDTVPSWNDAGKYATDLFAEHAVDVIQK 186

Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
           H+ + PLF+ +AH A H  N  + L+AP   +N  +HI D  R  +AA++ KLD+S+G V
Sbjct: 187 HNVNTPLFMMIAHLAVHVGNEGKWLEAPQETVNKFKHIRDPNRRTYAAMVSKLDDSIGAV 246

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
            EALE + ML N+I+VF+SDNG    G + N  SN+PLRG+K+TL+EGGVR    IWSPL
Sbjct: 247 FEALEAKNMLQNTIVVFISDNGAPTVGPHHNWGSNYPLRGIKDTLFEGGVRTVACIWSPL 306

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAANKS-------------------DIPNYVNSTVENI 396
           L     V+   +H++DWLPTL +A                       D+P+  N    NI
Sbjct: 307 LVQSSRVSTDLIHITDWLPTLFTAVGGDLSVLDPDLDGIDQWSSLVYDLPSARNDIPLNI 366

Query: 397 IPRYENSILRYE 408
             +  N+ LR+ 
Sbjct: 367 DEKTRNAALRFS 378



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 86/187 (45%), Gaps = 51/187 (27%)

Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSND 568
           ++DGID WS L  + PS RN I  NID++ + +AL    WKL+        GTS N S +
Sbjct: 341 DLDGIDQWSSLVYDLPSARNDIPLNIDEKTRNAALRFSYWKLIV-------GTSGNGSYN 393

Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDD-EWQISALTKGKWKLVKVVKVMRYQVD 627
             +   ++                     NI++ ++  SA+      + ++ K + Y   
Sbjct: 394 GYFGAPLN--------------------ENIEEQQYNTSAINDS--PVGRIAKKINY--- 428

Query: 628 LTGGPDQVYLSGLSDREWLALAMRKLRDAASIQC--GPVKEVPCEPQI-APCLFDIKNDP 684
                     + LS+ ++  L     R  A+++C     K  PC+P   A CL++I NDP
Sbjct: 429 ----------NPLSETDFDGL-----RRVATLKCLDAKAKRNPCDPASGAVCLYNIPNDP 473

Query: 685 CEKNNLA 691
           CE+N+LA
Sbjct: 474 CEENDLA 480



 Score = 43.1 bits (100), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 25/59 (42%), Positives = 34/59 (57%), Gaps = 5/59 (8%)

Query: 752 EEEGMRKLRDAASIQC--GPVKEVPCEPQI-APCLFDIKNDPCEKNNLADR--SEVQRI 805
            E     LR  A+++C     K  PC+P   A CL++I NDPCE+N+LA    S V+RI
Sbjct: 432 SETDFDGLRRVATLKCLDAKAKRNPCDPASGAVCLYNIPNDPCEENDLAKFFPSVVRRI 490


>gi|443724925|gb|ELU12719.1| hypothetical protein CAPTEDRAFT_140387 [Capitella teleta]
          Length = 542

 Score =  340 bits (871), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 169/351 (48%), Positives = 225/351 (64%), Gaps = 9/351 (2%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS +      S  PPHIIFILADD GW+DVGFHG  +IPTPN+DALA  GIIL N+
Sbjct: 6   LVLALSFLTGSFGNSHRPPHIIFILADDFGWDDVGFHGSRKIPTPNLDALASDGIILSNH 65

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y+  LCTPSR +++TGKHPI  G+Q  V+Y  +  GL L EK+LP+YLK LGY++ +VGK
Sbjct: 66  YSQPLCTPSRGSLLTGKHPIQIGLQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGK 125

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           WHLGF+  EYTP  RGF+SH G++   +DY  H      M GLD   + +P     G YS
Sbjct: 126 WHLGFFADEYTPMRRGFDSHYGFYGASEDYMTHIG---GMGGLDFWLNGQPDRSGQGHYS 182

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP---LQAPDHYLNIHRHIEDFKR 278
           T +FT +A  ++  H+  EP+FLY +H A H+++P E    L AP  Y +    I D + 
Sbjct: 183 TTLFTTKAEQLLAEHNQTEPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVEC 242

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
              A ++  LDESVG + ++L    ML N+II+F SDNG      ++  +SN+PLRG K 
Sbjct: 243 QHMAGMISALDESVGNLTKSLHTNGMLENTIIIFSSDNGAPHENTDV-CSSNYPLRGAKT 301

Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDIPN 387
           ++WEGG R    +WSPLL+  G V++Q +H+SDWLPTLL AA  N S +P 
Sbjct: 302 SVWEGGTRVPAFVWSPLLKKSGYVSKQMMHISDWLPTLLEAAGYNMSALPG 352



 Score = 48.5 bits (114), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 47/195 (24%), Positives = 78/195 (40%), Gaps = 51/195 (26%)

Query: 502 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-ALTRGKWKLVKENSINGNG 560
           N ++   ++ G+  W  L  N PS RN++L+N D + Q S AL  G  KL++        
Sbjct: 346 NMSALPGDLYGVSQWQALQENGPSARNSMLYNADHKKQGSWALRVGDMKLLQ-------- 397

Query: 561 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVK 620
            +E  +  +S   +   I++ S L  NEP+  N +    +++ Q++++            
Sbjct: 398 -AEGATQLHSGWQQPFAINLQSQLPSNEPNNTNYVQ---EEQSQVASIL----------- 442

Query: 621 VMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCG----PVKEVPCEPQIAPC 676
                                  E L   MR       + CG     V    C     PC
Sbjct: 443 -----------------------EELGRGMRNSTTKYFVNCGFISSGVTSHSCIDTGRPC 479

Query: 677 LFDIKNDPCEKNNLA 691
           L+++  DPCE +NLA
Sbjct: 480 LYNVTADPCEYHNLA 494


>gi|350422910|ref|XP_003493325.1| PREDICTED: arylsulfatase I-like [Bombus impatiens]
          Length = 563

 Score =  339 bits (869), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 164/327 (50%), Positives = 225/327 (68%), Gaps = 8/327 (2%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II I+ADDLGWNDV FHG DQIPTPNIDALAY+G+IL+ +Y + +CTPSR+A +TG++
Sbjct: 39  PNIIVIMADDLGWNDVSFHGADQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRY 98

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PI TGMQ   L   E   +PL+  +LP+YL++LGY T +VGKWH+G+Y   +TP +RGF+
Sbjct: 99  PIRTGMQGYPLKAGEERAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPAYRGFD 158

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS----TDVFTAEAVDIIHN 235
           + LGY++G+  YF H+ E+    G D+  D+  A +L  KYS    TD+ T  A DII N
Sbjct: 159 TFLGYYSGYITYFKHTIEQNLHVGYDLHYDV--AGNLSVKYSHEYMTDLITERAEDIIFN 216

Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN--IHRHIEDFKRSKFAAILHKLDESVG 293
           H+  +PL+L L+H A HS++    ++  D         +IEDF R K A ++  +DESVG
Sbjct: 217 HNRSKPLYLQLSHVAAHSSDAKANMEVRDEEETNATLGYIEDFDRRKLAGVVTAMDESVG 276

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
           +VV+AL Q  ML NSIIVF+SDNG    G   N  SN+PLRG+K TL+EGGVRG   ++S
Sbjct: 277 RVVQALRQTNMLENSIIVFMSDNGAQTVGLLNNYGSNYPLRGLKFTLFEGGVRGVACVYS 336

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA 380
           P +++   ++ Q +HV+DWLPT  SAA
Sbjct: 337 PAIKNPSRISNQLIHVTDWLPTFYSAA 363



 Score = 53.5 bits (127), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 33/137 (24%), Positives = 63/137 (45%), Gaps = 20/137 (14%)

Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
           +  +DG+D W+ +   + ++R  +L NID++  +S    G++KL+    V         G
Sbjct: 371 EENLDGVDQWATIVSEKKTRRKNVLLNIDEKQALSGALMGRYKLINGANVSYGDYYGDSG 430

Query: 632 PDQVY----------------LSGLSDREWLALAMRKLRDAASIQCGPVKEVP-CEPQIA 674
             + Y                L+ +S+  + +  + +LR AA++ C      P C   + 
Sbjct: 431 TSESYPEYNVSNVLHSAAGSALASVSNATFNSERIIELRRAATVVCDNFTSYPRC---LD 487

Query: 675 PCLFDIKNDPCEKNNLA 691
            CLFD+  DPCE  +++
Sbjct: 488 RCLFDVYKDPCETTDVS 504


>gi|241844558|ref|XP_002415497.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215509709|gb|EEC19162.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 529

 Score =  338 bits (868), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 166/326 (50%), Positives = 218/326 (66%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           +S+ PP+IIFILADDLGW DV F G  QIPTPN+D LA  GIIL NYY   LC PSR A+
Sbjct: 15  SSTVPPNIIFILADDLGWADVSFRGDPQIPTPNLDVLASQGIILNNYYVQPLCAPSRGAL 74

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           M+G +PIHTG+QH V    E  GLP +  I+P+YLK LGY T ++GKWHLG++K+ YTPT
Sbjct: 75  MSGLYPIHTGLQHLVPGPGEPWGLPTNLTIMPEYLKNLGYATHMIGKWHLGYHKESYTPT 134

Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
            RGF+S  GY  G +DY+DH+       GLD   +  P  +    YST++FT +A  +I 
Sbjct: 135 RRGFDSFYGYLNGGEDYYDHTILWSNASGLDFWENTTPVRNEGNHYSTELFTKKAQSLIK 194

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
           +H   +P+FLY +H A H  +    L+AP   +    +I++  RS  A  +++LD+SVG 
Sbjct: 195 HHDPAKPMFLYFSHQAVHCGDYKVELEAPALAIAHFPYIKELNRSIHAGAVYELDKSVGL 254

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V+EAL +R MLSNSI++F +DNGG   G   N+  NWPLRG K T WEGG RGA  +WSP
Sbjct: 255 VMEALNKRGMLSNSIVIFSTDNGGLPWGVEPNSGYNWPLRGSKETNWEGGARGAAFVWSP 314

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
           LL   G ++ Q +H++DWLPTL SAA
Sbjct: 315 LLFKSGRLSNQMMHITDWLPTLYSAA 340



 Score = 43.9 bits (102), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 54/209 (25%), Positives = 78/209 (37%), Gaps = 59/209 (28%)

Query: 484 IPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL 543
           +P L ++  GN ++            IDG D+W  LS +  S R  +L NID     SAL
Sbjct: 333 LPTLYSAAGGNVST---------LGNIDGKDMWKALSEDLESPRQEVLINIDPIENSSAL 383

Query: 544 TRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEW 603
             G+ K+V   S N  G+ + R         +DG+D   + SR            + D +
Sbjct: 384 IVGRHKVVL-GSFN-EGSHDMRMKAPGGSRPVDGLDQMMLSSRTGKV--------LKDFY 433

Query: 604 QISALT-KGKWKLVKVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCG 662
            +  LT +  W+   VV+  RY                                      
Sbjct: 434 NVRQLTVRPNWRNEAVVRCDRY------------------------------------AP 457

Query: 663 PVKEVPCEPQIAPCLFDIKNDPCEKNNLA 691
           P   V   P   P  FD+++DPCE NNLA
Sbjct: 458 PNNFVAASP---PYYFDLEHDPCELNNLA 483


>gi|340727296|ref|XP_003401982.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
          Length = 579

 Score =  337 bits (864), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 166/348 (47%), Positives = 236/348 (67%), Gaps = 10/348 (2%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           ++ ++LPL   +   F  L  +S  PHII  +ADDLGWNDV FHG +QIPTPNIDAL Y+
Sbjct: 9   ISTSLLPLILFIDCAF-GLYETS--PHIIVFMADDLGWNDVSFHGSNQIPTPNIDALGYN 65

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           GIIL  +Y +   TPSR+A  TG++PI  GMQ   + G E  GLPL+ KILP++L+ LGY
Sbjct: 66  GIILNRHYVLPSSTPSRTAFFTGQYPIRIGMQGADIRGGEPRGLPLNIKILPEHLRGLGY 125

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
            T+++GKWH+G+Y  +YTP  RGF++ LG++  +  Y+D++     M G DM R  +PA+
Sbjct: 126 TTKLIGKWHMGYYTPQYTPLHRGFDTFLGFYNSYISYYDYNYSNQNMSGYDMHRGDDPAY 185

Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR--H 272
            ++ +Y+TD+FT+EA++II NH  + PL+L L+H A HS     PL+ P +  N     H
Sbjct: 186 GMNREYATDMFTSEAINIIENHELNRPLYLQLSHLAVHS-----PLEQPANVYNDREPIH 240

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
           I +  R K+A ++ KLDESVG++  AL ++ ML +S+I+F++DNG A+ G   N  SN+P
Sbjct: 241 IREPNRRKYAKMVSKLDESVGRIFHALGEKGMLKDSLILFLTDNGAASVGRYRNYGSNYP 300

Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           LRG K TL+EGGVRG   +WS  L+    V+ + VH++DWLPTL SAA
Sbjct: 301 LRGTKYTLYEGGVRGVAALWSSRLQRAARVSNELVHITDWLPTLYSAA 348



 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 60/145 (41%), Gaps = 28/145 (19%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGP- 632
           EIDGID W +LS      R+ +L NID+  +       ++KL++      Y     G   
Sbjct: 357 EIDGIDQWRLLSEGRGRGRDKLLLNIDEVLKTEGAIYSRFKLLRGTIESGYYDGYYGDSG 416

Query: 633 ----------DQVYLSGLSDREWLALA--------MRKLRDAASIQCGP-------VKEV 667
                     D V  S +S+     L         M +LR  A++QC             
Sbjct: 417 KTLETLPTYTDIVMKSTVSESLTYHLGGPVTQPSTMVRLRQDATVQCNANFTYYRRQMLT 476

Query: 668 PCEPQIAPCLFDIKNDPCEKNNLAD 692
           PC   +  CLFDI NDPCE  N+AD
Sbjct: 477 PC--NVTECLFDIVNDPCETRNIAD 499



 Score = 43.5 bits (101), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 26/50 (52%), Gaps = 9/50 (18%)

Query: 756 MRKLRDAASIQCGP-------VKEVPCEPQIAPCLFDIKNDPCEKNNLAD 798
           M +LR  A++QC             PC   +  CLFDI NDPCE  N+AD
Sbjct: 452 MVRLRQDATVQCNANFTYYRRQMLTPC--NVTECLFDIVNDPCETRNIAD 499


>gi|410929555|ref|XP_003978165.1| PREDICTED: arylsulfatase B-like [Takifugu rubripes]
          Length = 516

 Score =  337 bits (864), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 168/355 (47%), Positives = 230/355 (64%), Gaps = 17/355 (4%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           MA A   +   L + F   +A+  P HI+FILADD GW DVG+H L +I TP +D L+  
Sbjct: 1   MALASFLIITGLFLSFPGFLAAKQP-HIVFILADDFGWYDVGYH-LSEIRTPILDKLSSG 58

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           G+ L+NYY   LCTPSR+ +MTG++ IHTGMQH +++ C+   +PL EK+LPQ +KE GY
Sbjct: 59  GVRLENYYVQPLCTPSRNQLMTGRYQIHTGMQHQIIWPCQPYCVPLDEKLLPQLMKEAGY 118

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRR 208
            T +VGKWHLG YKK+  PT RGF+S+LGY TG +DY+ H      SA  +    LD+R 
Sbjct: 119 ATHMVGKWHLGMYKKDCLPTRRGFDSYLGYLTGSEDYYTHIRCHPISALNLTRCALDLRE 178

Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
               A    G YST++F+  AV II  H++ EPLFLY+A  A H+     PLQ P+ Y+ 
Sbjct: 179 AEAVARSYKGTYSTELFSQRAVSIIEKHTSTEPLFLYVAFQAVHA-----PLQVPERYVA 233

Query: 269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
            +  I+D  R  +A ++  +DE+VG +  AL++R +  N+++VF +DNGG      L+  
Sbjct: 234 PYSFIQDHSRRSYAGMVSAMDEAVGNITLALQKRGLWENTVLVFSTDNGGQ----TLSGG 289

Query: 329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           SNWPLRG K +LWEGGVRG G + SPLLE  G ++ Q +H+SDWLPTL+  A  S
Sbjct: 290 SNWPLRGRKWSLWEGGVRGVGFVASPLLEQPGTISRQLIHISDWLPTLVGLAGGS 344



 Score = 41.6 bits (96), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH +++ C+   +PL EK+LPQ +KE GY T ++ 
Sbjct: 89  MQHQIIWPCQPYCVPLDEKLLPQLMKEAGYATHMVG 124


>gi|443722750|gb|ELU11510.1| hypothetical protein CAPTEDRAFT_23094, partial [Capitella teleta]
          Length = 549

 Score =  336 bits (862), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 164/333 (49%), Positives = 219/333 (65%), Gaps = 9/333 (2%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHIIFILADD GW+DVGFHG  +IPTPN+DALA  GIIL N+Y+  LCTPSR +++TGKH
Sbjct: 1   PHIIFILADDFGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKH 60

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PI  G+Q  V+Y  +  GL L EK+LP+YLK LGY++ +VGKWHLGF+  EYTP  RGF+
Sbjct: 61  PIQIGLQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFD 120

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           SH G++   +DY  H      M GLD   + +P     G YST +FT +A  ++  H+  
Sbjct: 121 SHYGFYGASEDYMTHIG---GMGGLDFWLNGQPDRSGQGHYSTTLFTTKAEQLLAEHNQT 177

Query: 240 EPLFLYLAHAATHSANPYEP---LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
           EP+FLY +H A H+++P E    L AP  Y +    I D +    A ++  LDESVG + 
Sbjct: 178 EPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQHMAGMISALDESVGNLT 237

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
           ++L    ML N+II+F SDNG      ++  +SN+PLRG K ++WEGG R    +WSPLL
Sbjct: 238 KSLHTNGMLENTIIIFSSDNGAPHENTDV-CSSNYPLRGAKTSVWEGGTRVPAFVWSPLL 296

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA--NKSDIPN 387
           +  G V++Q +H+SDWLPTLL AA  N S +P 
Sbjct: 297 KKSGYVSKQMMHISDWLPTLLEAAGYNMSALPG 329



 Score = 47.8 bits (112), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 53/215 (24%), Positives = 90/215 (41%), Gaps = 58/215 (26%)

Query: 502 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-ALTRGKWKLVKENSINGNG 560
           N ++   ++ G+  W  L  N PS RN++L+N D + Q S AL  G  KL++        
Sbjct: 323 NMSALPGDLYGVSQWQALQENGPSARNSMLYNADHKKQGSWALRVGDMKLLQ-------- 374

Query: 561 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVK 620
            +E  +  +S   +   I++ S L  NEP+  N +    +++ Q++++ +          
Sbjct: 375 -AEGATQLHSGWQQPFAINLQSQLPSNEPNNTNYVQ---EEQSQVASILE---------- 420

Query: 621 VMRYQVDLTGGPDQVYLSGLSDRE---WLALAM-RKLRDAASIQ---------------C 661
                       +   L+G    E   WLA  + ++L + A IQ               C
Sbjct: 421 ------------EHETLAGKRRHEATFWLASTLCKQLAEPAVIQQARQNKRRSGRTVPNC 468

Query: 662 G----PVKEVPCEPQIAPCLFDIKNDPCEKNNLAD 692
           G     V    C     PCL+++  DPCE  NLA+
Sbjct: 469 GFASTGVAPRSCVNTGRPCLYNVTADPCEYYNLAE 503



 Score = 39.3 bits (90), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +Q  V+Y  +  GL L EK+LP+YLK LGY++ ++ 
Sbjct: 66  LQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVG 101


>gi|307187654|gb|EFN72626.1| Arylsulfatase B [Camponotus floridanus]
          Length = 525

 Score =  336 bits (862), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 163/333 (48%), Positives = 233/333 (69%), Gaps = 6/333 (1%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PP+I+ I+ADDLGWNDV FHG D+IPTPNIDALAY+G+IL  +Y + +CTPSR+A +TGK
Sbjct: 3   PPNIVVIIADDLGWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPICTPSRTAFLTGK 62

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +PI TGMQ   L G E  G+ L+  +LP+YL++LGY T +VGKWH+G++ + Y PT RGF
Sbjct: 63  YPIRTGMQGYPLQGAEPRGILLNNILLPEYLQKLGYATHLVGKWHVGYHTRNYGPTHRGF 122

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--TDVFTAEAVDIIHNH 236
           ++  GY+ G+  YF+H+  E +  G D+ R +     +  +Y   TD+ T EA +II +H
Sbjct: 123 DTFAGYYNGYIQYFNHTLYESEQLGYDLHRIIGDDHKIEYRYDYMTDLITDEAENIISSH 182

Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPD--HYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
           +  +PL+L +AH A HS++  E ++  +         +IED  R K+A+++  LDESVG+
Sbjct: 183 NPAKPLYLQVAHLAAHSSDAEEEMEVRNWKETNATLGYIEDINRRKYASVVATLDESVGR 242

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V++AL++  ML+NSIIVF+SDNG    GF  N  SN+PLRG+K +L+EGG+RGA  I+SP
Sbjct: 243 VIDALKKTDMLNNSIIVFISDNGAQTEGFLQNYGSNYPLRGLKFSLFEGGIRGAACIYSP 302

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA--NKSDI 385
           L++    V+ Q  H++DWLPTL SAA  N SD+
Sbjct: 303 LIDRPSRVSNQLFHITDWLPTLYSAAGGNSSDL 335



 Score = 44.3 bits (103), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 39/147 (26%), Positives = 65/147 (44%), Gaps = 20/147 (13%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG-- 631
           ++DG D WS +   + SKR++IL NID++    A   G +KLV      +   + +G   
Sbjct: 337 QLDGFDQWSAIKSAKNSKRSSILINIDEKENSEAALIGHYKLVTDTSEYQKYYNYSGNNA 396

Query: 632 --PDQVYLSGLSDREWL-----------ALAMRKLRDAASIQCGPVKEVP-CEPQIAPCL 677
             P     + L+                A  + +LR  A+I C    +   C  +   CL
Sbjct: 397 LYPKYNVTTILASPAASAIASISISTLNASKIMQLRKEATIICKNFTDFSNCTNR--SCL 454

Query: 678 FDIKNDPCEKNNLADR--SEDQRINHY 702
           FD+  DPCE  +L  +   + +R+N +
Sbjct: 455 FDLNKDPCETTDLFSKHPKDVKRLNMF 481


>gi|194748066|ref|XP_001956470.1| GF24578 [Drosophila ananassae]
 gi|190623752|gb|EDV39276.1| GF24578 [Drosophila ananassae]
          Length = 542

 Score =  336 bits (862), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 160/332 (48%), Positives = 221/332 (66%), Gaps = 4/332 (1%)

Query: 70  LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNV 129
           +G NDV FHG +QI TPNIDALAY+G++L  +Y   LCTPSR+ ++TGK+PIHTGMQH V
Sbjct: 1   MGMNDVSFHGSNQILTPNIDALAYNGVLLNKHYVPNLCTPSRATLLTGKYPIHTGMQHWV 60

Query: 130 LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQ 189
           +   E  GLP  E+++P+  +E GY T +VGKWHLGF++++ TPT RGF+ H GY+ G+ 
Sbjct: 61  IITDEPWGLPKKERLMPELFREAGYSTHLVGKWHLGFWRQDLTPTMRGFDHHYGYYNGYI 120

Query: 190 DYFDHSAEEMKM---WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYL 246
           DY+DH    +      GLD RRD EP    +G Y+T+ FT+EA  II  H   +PLF+ L
Sbjct: 121 DYYDHQVRLLGTNYSAGLDFRRDFEPNPKANGTYATEAFTSEAKRIIEEHDKSKPLFMVL 180

Query: 247 AHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLS 306
           +H A H+ N   P+QAP+  +    HI+D KR  +A ++  LD+SVG+ + AL+   ML+
Sbjct: 181 SHLAVHTGNEDNPMQAPEEEVAKFSHIKDPKRRTYAGMISSLDKSVGQTMRALKDNDMLN 240

Query: 307 NSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
           N+I++  SDNG    G + NA SN+P RG K + WEGG+R  G +WSPLLE +G V+ Q 
Sbjct: 241 NTIVLLYSDNGAPTVGIHSNAGSNYPYRGQKESPWEGGIRSVGALWSPLLEKQGYVSNQA 300

Query: 367 VHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
           +H  DWLPTL +AA  S +P  +    EN+ P
Sbjct: 301 IHAIDWLPTLAAAAGVS-LPQDLELDGENLWP 331



 Score = 53.5 bits (127), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 45/167 (26%), Positives = 70/167 (41%), Gaps = 27/167 (16%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV----KVMRYQVDLT 629
           E+DG ++W  LS     K    +H +D+ +  S+      K V       +   +  DL 
Sbjct: 323 ELDGENLWPSLSEKAEPKPRKFIHVLDEVFGYSSYMSDTLKYVNGSSFDGQYDYWLGDLE 382

Query: 630 GGPDQVYLSGLSDREWLALAMR--------------KLRDAASIQCGPVK-------EVP 668
              D   LSG+   + LA  ++              ++R  A+ +C  ++          
Sbjct: 383 ANEDDP-LSGIYGEQVLASEVQSVLGSHGLTEDRILQMRSEATQKCPAIEGQDPLQPHYR 441

Query: 669 CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
           CEP +APC FD+  DPCE+ NLA R    ++     EV  F Q A P
Sbjct: 442 CEPLVAPCFFDLAQDPCERYNLA-RLYPLQLEKLRQEVEEFRQTAIP 487



 Score = 45.8 bits (107), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 32/58 (55%), Gaps = 7/58 (12%)

Query: 753 EEGMRKLRDAASIQCGPVK-------EVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
           E+ + ++R  A+ +C  ++          CEP +APC FD+  DPCE+ NLA    +Q
Sbjct: 413 EDRILQMRSEATQKCPAIEGQDPLQPHYRCEPLVAPCFFDLAQDPCERYNLARLYPLQ 470


>gi|390356459|ref|XP_003728793.1| PREDICTED: arylsulfatase I-like [Strongylocentrotus purpuratus]
          Length = 613

 Score =  336 bits (861), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 178/388 (45%), Positives = 238/388 (61%), Gaps = 37/388 (9%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           +M F V  LAF +    +  V +  PP+IIFILADDLGWNDV FHG  QIPTP+IDALA 
Sbjct: 3   MMTFKVAVLAFFMG-CMLQRVYTDQPPNIIFILADDLGWNDVSFHGSSQIPTPHIDALAQ 61

Query: 94  SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
            G+IL NYY   +CTP+RSAIMTGKHPIHTG++H V+      GL L EK++PQYL+ELG
Sbjct: 62  EGVILTNYYVSPICTPTRSAIMTGKHPIHTGLEHGVIGVSHPYGLGLEEKLMPQYLRELG 121

Query: 154 YRTRIVGK-------------WHLGFY-------------------KKEYTPTFRGFESH 181
           YRT +VGK             + LG                     K+  TP+ RGFES 
Sbjct: 122 YRTHMVGKVSLEHGVIGVSHPYGLGLEEKLMHQYLRELGYRTHMVGKESLTPSHRGFESF 181

Query: 182 LGYWTGHQDYFDHS-AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE 240
            GY+ G  DY+ H    +  M G D   D      + G+YST++FT    +II  H+  E
Sbjct: 182 YGYYAGMGDYYTHEITSDGNMTGFDFHMDGSVHKPVFGQYSTEIFTERTQEIILKHNPKE 241

Query: 241 PLFLYLAHAATHSAN-PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           PL++YLAH A HSAN   + LQAP  Y     +I    R K+AA++  LD+S+G + ++L
Sbjct: 242 PLYIYLAHQAVHSANYDGQRLQAPHEYYKRFPNITHENRRKYAAMVAALDDSIGNITQSL 301

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++  + +N++IVF +DNGG A GF+ N ASNWPLRG+K+T WEGG+RG G +W  L+E  
Sbjct: 302 KESSLYNNTLIVFTTDNGGPANGFDFNYASNWPLRGMKHTTWEGGLRGVGFLWGALIEKP 361

Query: 360 GIVAEQYVHVSDWLPTL--LSAANKSDI 385
           G +++  +HV DW+PTL  L+  N S +
Sbjct: 362 GRMSDGMMHVCDWVPTLYGLAGGNTSTL 389



 Score = 82.0 bits (201), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 70/235 (29%), Positives = 104/235 (44%), Gaps = 40/235 (17%)

Query: 501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNG 560
             + S    +DGIDVW +LSR + S R  ILH ID    +SA+  G +KLV+       G
Sbjct: 383 GGNTSTLQHLDGIDVWPMLSRGKQSPRQEILHTIDPLLDLSAIRIGDYKLVQGQ----RG 438

Query: 561 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVK 620
           T  N      Y  E  G          + S R  ILH ID    +SA+  G +KLV+   
Sbjct: 439 TGRN----GWYPPEGVG----------KQSPRQEILHTIDPLLDLSAIRIGDYKLVQ--- 481

Query: 621 VMRYQVDLTGGPDQVYLSGLSDREWLALA---MRKLRDAASIQCGPVK----EVPCEPQI 673
                     G      +G    E ++LA    + + +A  + C P K       C P  
Sbjct: 482 ----------GQRGTGRNGWYPPEGVSLADLDSKPVPNAFVVSC-PYKPANASTNCNPYK 530

Query: 674 APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
            PCLF+I++DPCE NN+A+ ++D  +    T +  +     P + +E + +   K
Sbjct: 531 KPCLFNIRHDPCEFNNIANWNQDI-VELLLTRIEEYRATMVPIRNKEPDPRSYPK 584



 Score = 39.7 bits (91), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           ++H V+      GL L EK++PQYL+ELGYRT ++ 
Sbjct: 93  LEHGVIGVSHPYGLGLEEKLMPQYLRELGYRTHMVG 128


>gi|296483766|tpg|DAA25881.1| TPA: arylsulfatase B [Bos taurus]
          Length = 429

 Score =  336 bits (861), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 164/336 (48%), Positives = 218/336 (64%), Gaps = 16/336 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG   I TP +DALA  G++L NYYT  LCTPSRS 
Sbjct: 39  AGASRPPHLVFVLADDLGWNDVGFHG-SAIRTPRLDALAAGGVLLDNYYTQPLCTPSRSQ 97

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ IHTG+QH ++  C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 98  LLTGRYQIHTGLQHQIILPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 157

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST+VFT 
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTE 217

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A  +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A +   
Sbjct: 218 RATTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASV 272

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  ALE+R + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALERRGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            G + SPLL+ +G+   + +H+SDWLPTL+  A  S
Sbjct: 329 VGFVASPLLKRKGVKTRELIHISDWLPTLVKLAGGS 364



 Score = 41.2 bits (95), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH ++  C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 109 LQHQIILPCQPSCIPLDEKLLPQLLKEAGYATHMVG 144


>gi|391327192|ref|XP_003738089.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 594

 Score =  335 bits (859), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 165/320 (51%), Positives = 218/320 (68%), Gaps = 4/320 (1%)

Query: 62  IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
           ++ ILADD+GWNDV F G  QIPTPN+DALA +G+IL+N+Y    CTPSR+A++TG +PI
Sbjct: 85  VVVILADDMGWNDVSFTGSGQIPTPNLDALASAGVILQNHYVQPFCTPSRAALLTGMYPI 144

Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
           H+GMQH V+   E  GLPL  K+LPQ+LK+LGYRT ++GKWHLG +KKE+ PT RGF+SH
Sbjct: 145 HSGMQHYVIRSREPWGLPLDFKLLPQHLKDLGYRTHLIGKWHLGQFKKEFLPTRRGFDSH 204

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG-KYSTDVFTAEAVDIIHNHSTDE 240
           LGY+ G+ DYF H+    +   LD  +D  P    H  +Y+T +FT  A +II +H  D 
Sbjct: 205 LGYYNGYIDYFTHNHTYKRDSALDFFKDEVP---YHSEEYATRLFTDRAEEIIRDHDVDN 261

Query: 241 PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALE 300
           PLFLY AH A H A   +P QAP   ++   ++ D  R+ FAA+L +LD SVG+VVEAL 
Sbjct: 262 PLFLYFAHLAVHRATDRDPFQAPQETIDKFSYVGDRNRTTFAAMLAELDVSVGRVVEALA 321

Query: 301 QRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRG 360
           ++ +L N+II+F SDNGG A     N  +N+PLRG K TL+EGG R    +WSPL+    
Sbjct: 322 KKGILDNTIILFSSDNGGQATAPMENTGTNFPLRGQKRTLFEGGTRVPAFVWSPLIRRPR 381

Query: 361 IVAEQYVHVSDWLPTLLSAA 380
            V    VH+ DWLPT+ S A
Sbjct: 382 RVFYDMVHIVDWLPTIYSLA 401



 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 59/139 (42%), Gaps = 23/139 (16%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV--KVVKVMR-YQVDLTGG 631
           IDG++VW  +S    S R  IL+NID   +  A+  G++KL+   +  +   Y     G 
Sbjct: 411 IDGMNVWGAVSDGMASPRTEILYNIDPVQRDLAIRVGRYKLLYGPMYNLTEWYDNRGFGY 470

Query: 632 PDQVYLSGLSDREWLALAMRKL-------------------RDAASIQCG-PVKEVPCEP 671
                L    D+  +A  +R L                   R+   + CG P     C P
Sbjct: 471 ASHSVLERYMDKSAVARVLRDLGYWWKGREEDYSRRPTWQWREETRVHCGNPSSATACNP 530

Query: 672 QIAPCLFDIKNDPCEKNNL 690
              PCLFD+  DPCE NNL
Sbjct: 531 SEQPCLFDLATDPCEYNNL 549



 Score = 45.4 bits (106), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 760 RDAASIQCG-PVKEVPCEPQIAPCLFDIKNDPCEKNNL 796
           R+   + CG P     C P   PCLFD+  DPCE NNL
Sbjct: 512 REETRVHCGNPSSATACNPSEQPCLFDLATDPCEYNNL 549



 Score = 43.9 bits (102), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 26/36 (72%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH V+   E  GLPL  K+LPQ+LK+LGYRT ++ 
Sbjct: 148 MQHYVIRSREPWGLPLDFKLLPQHLKDLGYRTHLIG 183


>gi|301621823|ref|XP_002940244.1| PREDICTED: arylsulfatase B-like [Xenopus (Silurana) tropicalis]
          Length = 502

 Score =  334 bits (857), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 162/342 (47%), Positives = 226/342 (66%), Gaps = 16/342 (4%)

Query: 48  MVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLC 107
           M      A++ PPHI+F+LADDLGWNDVGFHG  +I TP +D L+  G+ L  YYT  LC
Sbjct: 1   MALSQPAAATSPPHIVFVLADDLGWNDVGFHG-SEILTPTLDFLSGQGVRLAGYYTQPLC 59

Query: 108 TPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
           TPSRS +++G++ IHTG+QH +++ C+    PL +K+LP+ LKE GY T +VGKWHLG Y
Sbjct: 60  TPSRSQLLSGRYQIHTGLQHQIIWPCQPHCHPLEDKLLPELLKERGYVTHMVGKWHLGMY 119

Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           K +  PT RGF+S+ GYWTG +DY+ H      +   +    LD R    PA D   KYS
Sbjct: 120 KTDCLPTRRGFDSYFGYWTGGEDYYSHERCYLITTLNITRCALDFRDGEVPATDYQMKYS 179

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
           T +FT  AVD+I NH+ ++PLFLYLA+ A HS     PLQ PD Y+  +  I D  R  +
Sbjct: 180 THLFTDRAVDLITNHNPEKPLFLYLAYQAVHS-----PLQVPDQYIEPYTSIHDKNRRLY 234

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
           A ++  LDE+V  V  AL+++ + +N++ +F +DNGG      L+  +NWPLRG K TLW
Sbjct: 235 AGMVSVLDEAVANVTNALKKQGLWNNTVFIFSTDNGGQ----TLSGGNNWPLRGRKWTLW 290

Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           EGG+RG G + S LL+ +G+ +++ +H+SDWLPTL++ A  S
Sbjct: 291 EGGIRGVGFVTSSLLKQKGVTSKELIHISDWLPTLVNLAGGS 332


>gi|155372077|ref|NP_001094645.1| arylsulfatase B precursor [Bos taurus]
 gi|151554899|gb|AAI48140.1| ARSB protein [Bos taurus]
          Length = 533

 Score =  334 bits (857), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 164/336 (48%), Positives = 218/336 (64%), Gaps = 16/336 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG   I TP +DALA  G++L NYYT  LCTPSRS 
Sbjct: 39  AGASRPPHLVFVLADDLGWNDVGFHG-SAIRTPRLDALAAGGVLLDNYYTQPLCTPSRSQ 97

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ IHTG+QH ++  C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 98  LLTGRYQIHTGLQHQIILPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 157

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST+VFT 
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTE 217

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A  +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A +   
Sbjct: 218 RATTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASV 272

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  ALE+R + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALERRGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            G + SPLL+ +G+   + +H+SDWLPTL+  A  S
Sbjct: 329 VGFVASPLLKRKGVKTRELIHISDWLPTLVKLAGGS 364



 Score = 41.6 bits (96), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH ++  C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 109 LQHQIILPCQPSCIPLDEKLLPQLLKEAGYATHMVG 144


>gi|241378410|ref|XP_002409154.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215497456|gb|EEC06950.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 511

 Score =  333 bits (855), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 172/359 (47%), Positives = 223/359 (62%), Gaps = 22/359 (6%)

Query: 71  GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
           GW+DV FHG  QIPTPN+D LA  GIIL NYY    CTPSR+A+MTG +PIHTGMQH VL
Sbjct: 2   GWDDVSFHGSAQIPTPNMDTLAADGIILNNYYVQPACTPSRAALMTGLYPIHTGMQHGVL 61

Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
              E  GLPLS  I+PQYLK LGY T IVGKW+LG YK  YTPTFRGF+S  GY++  +D
Sbjct: 62  SPAEPYGLPLSVSIMPQYLKNLGYETHIVGKWNLGNYKLSYTPTFRGFDSFYGYYSAVED 121

Query: 191 YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAA 250
           Y++H+       GLD   + +P  ++ G YST ++T     +I N    +P FLYL + A
Sbjct: 122 YYNHTVLWDNQTGLDFWLNTQPLRNVSGIYSTQLYTERTKFLIENRDVSKPFFLYLPYQA 181

Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
            H  N  +PLQAP   ++   +I +  R+ FA +   LDES+G+V  AL    ML N+II
Sbjct: 182 VHCGNFDDPLQAPQENIDKFPYIGEENRTIFAGMTDALDESIGEVFRALFDADMLDNTII 241

Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
           V  +DNGG   GF  N   N+PLRG K TLWEGG R +  +WSPLL+ +G+V++Q +H++
Sbjct: 242 VLSADNGGLPFGFESNRGYNFPLRGGKGTLWEGGCRASAFVWSPLLKRKGVVSDQMMHIT 301

Query: 371 DWLPTLLSAA----------NKSDIPNYVNSTVENIIPRYE----------NSILRYEN 409
           DW+PTL SAA          + +D+  +V     +  PRYE          NS LRY N
Sbjct: 302 DWVPTLYSAAGGNPQTLGKLDGTDM--WVPLAYGSASPRYEILYNIDPIDRNSALRYTN 358



 Score = 48.9 bits (115), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 63/138 (45%), Gaps = 22/138 (15%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV-KVVKVMRY-QVDLTGG 631
           ++DG D+W  L+    S R  IL+NID   + SAL    +KLV  +     Y Q  LT G
Sbjct: 320 KLDGTDMWVPLAYGSASPRYEILYNIDPIDRNSALRYTNYKLVLGIANGGAYDQRYLTTG 379

Query: 632 PDQVY-----LSGLSD-----REWLALAMRKL----RDAASIQCG---PVKEVPCEPQIA 674
             + Y     L   S+     RE+      K     R  A++ CG   P   V   P   
Sbjct: 380 GSRPYGDLDLLMAQSEAAQVLREYYGTGYLKFPDQWRQRATLTCGNKVPKNFVSLSP--- 436

Query: 675 PCLFDIKNDPCEKNNLAD 692
           P LFD+  DPCE NN+A+
Sbjct: 437 PYLFDLTADPCEINNIAN 454



 Score = 46.2 bits (108), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 22/36 (61%), Positives = 24/36 (66%)

Query: 1  MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
          MQH VL   E  GLPLS  I+PQYLK LGY T I+ 
Sbjct: 56 MQHGVLSPAEPYGLPLSVSIMPQYLKNLGYETHIVG 91


>gi|443705024|gb|ELU01769.1| hypothetical protein CAPTEDRAFT_23096, partial [Capitella teleta]
          Length = 354

 Score =  333 bits (853), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 163/325 (50%), Positives = 216/325 (66%), Gaps = 7/325 (2%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHIIFILADD GW+DVGFHG  +IPTPN+DALA  GIIL N+Y+  LCTPSR +++TGKH
Sbjct: 1   PHIIFILADDFGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKH 60

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PI  G+Q  V+Y  +  GL L EK+LP+YLK LGY++ +VGKWHLGF+  EYTP  RGF+
Sbjct: 61  PIQIGLQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFD 120

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           SH G++   +DY  H      M GLD   + +P     G YST +FT +A  ++  H+  
Sbjct: 121 SHYGFYGASEDYMTHIG---GMGGLDFWLNGQPDRSGQGHYSTTLFTTKAEQLLAEHNQT 177

Query: 240 EPLFLYLAHAATHSANPYEP---LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
           EP+FLY +H A H+++P E    L AP  Y +    I D +    A ++  LDESVG + 
Sbjct: 178 EPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQHMAGMISALDESVGNLT 237

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
           ++L    ML N+II+F SDN GA  G     +SN+PLRG K+++WEGG R    +WSPLL
Sbjct: 238 KSLHANGMLENTIIIFSSDN-GAPHGNTDVCSSNYPLRGAKSSVWEGGTRVPAFVWSPLL 296

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAAN 381
           +  G V++Q +H+SDWLPTLL AA 
Sbjct: 297 KKSGYVSKQMMHISDWLPTLLKAAG 321



 Score = 39.3 bits (90), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +Q  V+Y  +  GL L EK+LP+YLK LGY++ ++ 
Sbjct: 66  LQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVG 101


>gi|241638976|ref|XP_002410783.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215503545|gb|EEC13039.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 527

 Score =  332 bits (851), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 162/326 (49%), Positives = 219/326 (67%), Gaps = 6/326 (1%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           AS  PP+IIF+L D LGW+DV FHG  QIPTPN+DALA  GIIL NYY   +CTPSR+A+
Sbjct: 18  ASKLPPNIIFMLVDGLGWDDVSFHGSPQIPTPNMDALAADGIILNNYYVQPVCTPSRAAL 77

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           MTG +PIHTG+QH VL   E  GLPL  KI+P+Y K+LGY T ++GKW+LG+Y KEYTPT
Sbjct: 78  MTGMYPIHTGLQHGVLLAAEPNGLPLEFKIMPEYFKDLGYETHLIGKWNLGYYMKEYTPT 137

Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           +RGF+S  G++   +DYF H+ E +      + R    ++ ++   S D       D++ 
Sbjct: 138 YRGFDSFYGFYNYEEDYFTHNLEFVNQSNAMVWR--PSSFCVYLTLSPDT----GFDLLS 191

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
                 P FLYL+H + H A+  +PLQAP+  +    +I D +R+K+A ++  LDES G 
Sbjct: 192 ASIERGPFFLYLSHQSVHGASGNDPLQAPEENIAKFPYIGDERRTKYAGMVDALDESAGD 251

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V+EAL +  ML+N+IIV  S NGG ++G   NA SN+PLRG K  LWEGG R +  IWSP
Sbjct: 252 VLEALYEAGMLANTIIVMSSANGGLSSGVESNAGSNFPLRGGKGALWEGGTRASAFIWSP 311

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
           LL  +  V++Q +H++DWLPTL +AA
Sbjct: 312 LLYQKNRVSDQMMHITDWLPTLYAAA 337



 Score = 44.3 bits (103), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH VL   E  GLPL  KI+P+Y K+LGY T ++ 
Sbjct: 88  LQHGVLLAAEPNGLPLEFKIMPEYFKDLGYETHLIG 123



 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 46/182 (25%), Positives = 68/182 (37%), Gaps = 48/182 (26%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDN 569
           +DG ++W+ LS N PS R  +L+NID   Q SA+    +KLV   S +G+  +E+R    
Sbjct: 347 LDGQNLWNHLSYNLPSPRYELLYNIDPIDQTSAIRFLNYKLVLGES-SGSEENEHRLKTP 405

Query: 570 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLT 629
                   +D+    S+     RN   HN                  + +  M       
Sbjct: 406 GNSRPYRDLDIAMAQSKAARVLRN--FHN------------------RGILFM------- 438

Query: 630 GGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNN 689
                       D  W        R  A++ CG  +     P   P LFD+  DPCE  N
Sbjct: 439 ------------DSNW--------RQKAALNCGDDERENVVPGAPPYLFDLAKDPCEMRN 478

Query: 690 LA 691
           + 
Sbjct: 479 IV 480


>gi|307187655|gb|EFN72627.1| Arylsulfatase B [Camponotus floridanus]
          Length = 591

 Score =  331 bits (849), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 163/341 (47%), Positives = 225/341 (65%), Gaps = 7/341 (2%)

Query: 45  TLSMVFVDLVASS-----GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           TL   F+ L  ++       PHII I+ADDLGW+DVGFHG DQI TPNIDAL Y+GIIL 
Sbjct: 10  TLVSYFIVLFGTTLGLYESAPHIIVIMADDLGWDDVGFHGSDQIRTPNIDALGYNGIILN 69

Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
            +Y +   TPSR+A  TG++PI  GMQ   + G E  G+PL+ +ILP++L++LGY T+++
Sbjct: 70  RHYVLPSSTPSRTAFFTGQYPIRMGMQGEDIQGGEPRGIPLNVRILPEFLRDLGYMTKLI 129

Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
           GKWHLG+Y  ++TP  RGF+S LG++  H  Y+++      M G DM R   PA+    K
Sbjct: 130 GKWHLGYYTPQHTPLRRGFDSFLGFYNSHVSYYNYKYSFQNMSGYDMHRGDAPAYGSTDK 189

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
           Y TD FT EA+ II  +    PL+L ++H A H+  P E  Q  +HY +   HI +  R 
Sbjct: 190 YVTDFFTDEAIKIIEYYDPSRPLYLQISHLAVHA--PLEGPQDYNHYDSQFLHIREINRR 247

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
           K+A ++ +LD SVG++V+AL  + ML NS+I+F++DNG A  G   N  SN+PLRG+K T
Sbjct: 248 KYAKMVLRLDNSVGRIVQALGNKGMLKNSLILFLTDNGAAPIGKFRNYGSNYPLRGMKYT 307

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           L+EGGVRG  L+WSP L     V+++ +HV+DWLPTL S A
Sbjct: 308 LYEGGVRGVALLWSPRLRKTARVSDELMHVTDWLPTLYSIA 348



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/155 (25%), Positives = 63/155 (40%), Gaps = 35/155 (22%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVK---------------- 617
           EIDGID W +L+ + PS R+ +L NID+  +       ++KLV+                
Sbjct: 357 EIDGIDQWCMLNGSLPSARDRLLLNIDEVSKTEGAIYKQFKLVRGSIEGGLYDGYFYDHI 416

Query: 618 -----VVKVM--------------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAAS 658
                ++ +M                +  L     +   S L D       + +LR  A+
Sbjct: 417 ERNVPIMLIMPSDPKTIVQEDVPPYTETILRSVVSETITSFLGDPVTQPSTIIQLRREAT 476

Query: 659 IQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 693
           + C           +  CLFD+ NDPCE NN+A++
Sbjct: 477 VNCRSRNRSFLTCNVTECLFDVNNDPCETNNIAEQ 511



 Score = 41.6 bits (96), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 758 KLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADR-SEVQR 804
           +LR  A++ C           +  CLFD+ NDPCE NN+A++ S + R
Sbjct: 470 QLRREATVNCRSRNRSFLTCNVTECLFDVNNDPCETNNIAEQYSRISR 517


>gi|328788250|ref|XP_624148.3| PREDICTED: arylsulfatase B-like isoform 2 [Apis mellifera]
          Length = 564

 Score =  330 bits (846), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 167/351 (47%), Positives = 228/351 (64%), Gaps = 15/351 (4%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           M  ++LPL  T+      L     PPHII  +ADDLGWNDVGFHG +QIPTPNIDALAY+
Sbjct: 9   MIISLLPLILTIDCA---LGLYETPPHIIVFMADDLGWNDVGFHGSNQIPTPNIDALAYN 65

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           GIIL  +Y +   TPSR A  TG +PI  GMQ + + G E  GLPL  KILP++L+ LGY
Sbjct: 66  GIILNRHYVLPSSTPSRIAFFTGLYPIRIGMQGDGIRGGEPRGLPLHIKILPEHLRGLGY 125

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
            T+++GKWH+GF+  +YTP  RGF++  G++  H  Y+D+      M G DM    +PA+
Sbjct: 126 VTKLIGKWHMGFHTLQYTPLHRGFDTFFGFYNSHITYYDYEYSNQNMTGYDMHCGDDPAY 185

Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR--- 271
            +  +Y+TD+FT EA+ II NH    PL+L ++H A H+     P++ PD   +  R   
Sbjct: 186 GMKREYATDLFTNEAIKIIENHELPRPLYLQISHLAVHA-----PIEQPD---DSSRDEI 237

Query: 272 -HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
             I +  R K+A ++ KLDESVG+VV AL ++ ML +S+I+F++DNG A+ G   N  SN
Sbjct: 238 VQIREPNRRKYAKMVSKLDESVGRVVHALGEKGMLRDSLILFLTDNGAASIGRYRNYGSN 297

Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           +PLRG K TL+EGGVRG   +WS  LE    V ++ +H++DWLPTL SAA 
Sbjct: 298 YPLRGTKYTLYEGGVRGVAALWSSRLEKGARVFKKLIHITDWLPTLYSAAG 348


>gi|327263080|ref|XP_003216349.1| PREDICTED: arylsulfatase B-like [Anolis carolinensis]
          Length = 521

 Score =  330 bits (845), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 160/331 (48%), Positives = 224/331 (67%), Gaps = 16/331 (4%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPH++F+LADDLGW DVG+HG  QI TP +DAL+ +G+ L+ YY   LCTPSRS ++TG+
Sbjct: 32  PPHLVFVLADDLGWQDVGWHG-SQIRTPVLDALSAAGVRLERYYIQPLCTPSRSQLLTGR 90

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH +++ C+   +PL EK+LP+ LKE GY T +VGKWHLG Y+ E  PT RGF
Sbjct: 91  YQIHTGLQHEIIWPCQPSCVPLDEKLLPELLKEAGYVTHMVGKWHLGMYRNECLPTRRGF 150

Query: 179 ESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           +++ GY  G +DY+ H       ++ +    LD+R   + A      YST+VFT  A D+
Sbjct: 151 DTYFGYLLGSEDYYSHEHCVPIVSKNVTRCALDLRDGEKIADGFKNMYSTNVFTQRAQDL 210

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I NH  ++PLFLYLA  + H     EPLQ P+ Y+  +  I+D KR K+A ++  LDE+V
Sbjct: 211 IANHQPEKPLFLYLALQSVH-----EPLQVPEKYVEPYSFIKDEKRRKYAGMVSILDEAV 265

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G + EAL+ R + +N++++F +DNGG      L   +NWPLRG K TLWEGGVRGAG + 
Sbjct: 266 GNITEALKTRGLWNNTVLIFSTDNGGQ----TLAGGNNWPLRGRKWTLWEGGVRGAGFVA 321

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           SPLL+ RG+ + + +H+SDWLPTL+  A  S
Sbjct: 322 SPLLKQRGVESHELIHISDWLPTLVKLAGGS 352



 Score = 40.8 bits (94), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LP+ LKE GY T ++ 
Sbjct: 97  LQHEIIWPCQPSCVPLDEKLLPELLKEAGYVTHMVG 132



 Score = 39.3 bits (90), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 41/151 (27%), Positives = 61/151 (40%), Gaps = 39/151 (25%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKV--VKVMRYQVD----- 627
           +DG DVW  LS  EPS R  +LHNID  +   +  KG    + V     ++Y +      
Sbjct: 359 LDGFDVWKTLSEGEPSPRKELLHNIDPIFVDPSPCKGNRSCLAVDYAAALKYSIFNISIH 418

Query: 628 ----------LTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCL 677
                     LTG P      G S   W         ++ ++   P+ +          L
Sbjct: 419 AAIRQGKWKLLTGNP------GCS--HWFP-PPSLFNESKTMSSDPLTK-------KLWL 462

Query: 678 FDIKNDPCEKNNLADRSED------QRINHY 702
           FDI +D  E+N+L+D+  D       R+ HY
Sbjct: 463 FDIVHDAEERNDLSDQYPDIVKKLLIRLQHY 493



 Score = 39.3 bits (90), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 16/26 (61%), Positives = 18/26 (69%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
           +DG DVW  LS  EPS R  +LHNID
Sbjct: 359 LDGFDVWKTLSEGEPSPRKELLHNID 384


>gi|156547173|ref|XP_001603910.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 578

 Score =  329 bits (844), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 162/346 (46%), Positives = 223/346 (64%), Gaps = 13/346 (3%)

Query: 44  FTLSMVFVDLVASS-----GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           F   +    L+ SS     G PHII  +ADDLGWNDVGFHG  QIPTPNIDAL Y+GIIL
Sbjct: 4   FIALLTIAGLLGSSCADFEGRPHIIVFMADDLGWNDVGFHGATQIPTPNIDALGYNGIIL 63

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
             +Y +  C+P+R+A +TGK+PI  GMQ   + G E  GLP+  + LP+YL+ LGY T +
Sbjct: 64  NKHYVLPSCSPTRAAFLTGKYPIRMGMQGAGIAGGEPRGLPVHVQTLPEYLQGLGYETNL 123

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
           +GKWH+G++  ++ P  RGF+   G++  H  Y+D+   +  M G DM  + E A+   G
Sbjct: 124 IGKWHVGYHTPKHLPNRRGFDYFYGFYNSHIGYYDYRYSQGNMSGFDMHINGETAYGTDG 183

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP---DHYLNIHRHIED 275
            Y+TD FT  A+D+I+ H  + P++L ++H A     P+ P+  P   + Y +  RHI +
Sbjct: 184 VYATDRFTQAAIDVIYRHDLESPMYLQVSHLA-----PHAPMDVPFEDNPYDDEFRHISE 238

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
            KR  +A ++ +LD+S+G++V +L  R ML NS+IVF+SDNGGA  G   N ASNWPLRG
Sbjct: 239 PKRRAYAKMVARLDDSLGRIVSSLGDRGMLKNSVIVFMSDNGGATIGKFRNWASNWPLRG 298

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            K TL+EGGVR    +WSP L S+G V+EQ  HV+DWLPTL   A 
Sbjct: 299 TKYTLFEGGVRSVAAMWSPKLLSKGRVSEQLFHVTDWLPTLFQVAG 344



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 29/145 (20%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG---- 630
           IDG+  W VLS    + R+ IL NID+  Q  A    ++K+V+      Y  ++ G    
Sbjct: 353 IDGVSQWDVLSIGTGNVRDKILLNIDEYTQTEAAIDKRFKIVRGSLYHGYYDNVEGEIGR 412

Query: 631 -------GPDQVYLSGLSD--REWLAL------AMRKLRDAASIQCGPVK---------E 666
                     ++  S +SD  RE L +       M +L+  A++ C P            
Sbjct: 413 GHKNPMYNTSRILKSAVSDAIREHLGVPVTQESVMWELQRQATVLCRPNMTNMQYTSRYT 472

Query: 667 VPCEPQIAPCLFDIKNDPCEKNNLA 691
           +P   Q A CLFD+  DPCE NN+A
Sbjct: 473 LPSCNQTA-CLFDLGIDPCETNNVA 496


>gi|328788246|ref|XP_395125.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
          Length = 562

 Score =  329 bits (844), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 154/325 (47%), Positives = 225/325 (69%), Gaps = 4/325 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II I+ADDLGWNDV FHG +QIPTPNIDALAY+G+IL+ +Y + +CTPSR+A +TG++
Sbjct: 39  PNIIVIMADDLGWNDVSFHGANQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRY 98

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PI TGMQ   L   E   +PL+  +LP+YL++LGY T +VGKWH+G+Y   +TPT RGF+
Sbjct: 99  PIRTGMQGYPLKAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFD 158

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP--AWDLHGKYSTDVFTAEAVDIIHNHS 237
           +  GY++G+  YF+H+ ++    G D+  D     + D + +Y+TD+ T  A +II NH 
Sbjct: 159 TFFGYYSGYISYFNHTIKQDDHIGYDLHYDNSKNLSIDYNFEYTTDLITERAENIIKNHD 218

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLN--IHRHIEDFKRSKFAAILHKLDESVGKV 295
             +PL+L L H A HS++  E ++  D        ++IED+ R K+A ++  +DESVG+V
Sbjct: 219 RRKPLYLQLCHLAAHSSDAKEVMEVRDEQETNATLKYIEDYNRRKYAGVVTAMDESVGRV 278

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
           ++AL Q  ML NSIIVF+SDNG    G   N  SN+PLRG+K TL+EGG+RG   ++S L
Sbjct: 279 IKALGQSSMLENSIIVFISDNGAQTEGLLENYGSNYPLRGLKFTLFEGGIRGVACVYSRL 338

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
           +++   ++ + +H++DWLPT  SAA
Sbjct: 339 IQNSSRISNELMHITDWLPTFYSAA 363



 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 39/151 (25%), Positives = 72/151 (47%), Gaps = 23/151 (15%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRY---------- 624
           +DG+D W  +   + SKR ++L NID+   +S+   GK+KL+   K ++Y          
Sbjct: 374 MDGVDQWDTIVSGKESKRESVLLNIDEVEDVSSALIGKYKLIINGKNIQYNDYYGDNGTS 433

Query: 625 ----QVDLTGGPDQVYLSGLSDREWLALAMR---KLRDAASIQCGPVKEVP-CEPQIAPC 676
               + +++     +  S +       L ++   KLR+ A + C        C   +  C
Sbjct: 434 VSYPEYNVSSVLHSIAGSTIFHATNFTLNIKDVIKLRNKARVVCNNFTSYSKC---VDKC 490

Query: 677 LFDIKNDPCEKNNLADRSED--QRINHYTTE 705
           LFDI NDPCE  +L++   +  +++N++  E
Sbjct: 491 LFDIYNDPCETTDLSNEHPEIVKKLNYFIAE 521


>gi|296194262|ref|XP_002744878.1| PREDICTED: arylsulfatase B [Callithrix jacchus]
          Length = 534

 Score =  329 bits (843), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 160/331 (48%), Positives = 219/331 (66%), Gaps = 16/331 (4%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS ++
Sbjct: 42  ASRPPHLVFVLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           TG++ IHTG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT 
Sbjct: 101 TGRYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160

Query: 176 RGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT  A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
             +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  +D
Sbjct: 221 TTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMD 275

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           E+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG G
Sbjct: 276 EAVGNVTAALKSHGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVG 331

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLA 362



 Score = 42.7 bits (99), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145


>gi|432885639|ref|XP_004074694.1| PREDICTED: arylsulfatase B-like [Oryzias latipes]
          Length = 520

 Score =  329 bits (843), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 161/347 (46%), Positives = 224/347 (64%), Gaps = 17/347 (4%)

Query: 44  FTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY 102
           + + +V +  V  + P PHI+FI+ADD GWNDVG+H   +I TPN+D L+  G+ L+NYY
Sbjct: 12  WMMIVVMILPVTLAAPRPHIVFIMADDFGWNDVGYHN-SEIKTPNLDLLSAKGVRLQNYY 70

Query: 103 TVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKW 162
              LC+PSR+ +MTG++ IHTGMQH +++ C+   +PL EK+LPQ LKE GY T +VGKW
Sbjct: 71  VQPLCSPSRNQLMTGRYQIHTGMQHQIIWPCQPYCVPLDEKLLPQLLKEAGYATHMVGKW 130

Query: 163 HLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDL 216
           HLG YKK+  PT RGF+S+ GY+ G +DY+ H      +A  +    LD+R   E A   
Sbjct: 131 HLGMYKKDCLPTHRGFDSYFGYYLGSEDYYTHTRCYPITALNLTRCALDLRDGEEVATAY 190

Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
            G YST++F+  AV +I  H+  +PLFLY+A  A H     EPLQ P+ Y+  +  I+D 
Sbjct: 191 KGAYSTELFSQRAVSVIAKHNASQPLFLYVAMQAVH-----EPLQVPERYVTPYSFIKDV 245

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
            R K+A ++  +DE+VG +  AL Q  +  N+++VF +DNGG          SNWPLRG 
Sbjct: 246 SRRKYAGMVSAMDEAVGNISLALRQAGLWDNTVLVFSTDNGGQTRA----GGSNWPLRGR 301

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           K  LWEGG+RG G + SPLLE+ G V  + +H+SDWLPTL+  A  S
Sbjct: 302 KWALWEGGIRGVGFVSSPLLENPGTVNNELMHISDWLPTLVRLAGGS 348



 Score = 42.7 bits (99), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 93  MQHQIIWPCQPYCVPLDEKLLPQLLKEAGYATHMVG 128


>gi|427779723|gb|JAA55313.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
          Length = 593

 Score =  328 bits (841), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 170/361 (47%), Positives = 211/361 (58%), Gaps = 39/361 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+FILADDLGW+DV FHG  QIPTPN+D LA  G+IL NYY    CTPSR+A+MTG +
Sbjct: 36  PHIVFILADDLGWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALMTGLY 95

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PI TGMQ   +   E  GLP   +ILPQYLKE GY T +VGKWHLG YK+  TPT RGF+
Sbjct: 96  PIRTGMQGMPIDVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTCRGFD 155

Query: 180 SHLGYWTGHQDYFDHSAEEMK--MW----------------------------------- 202
           S  GY+ G  DYF H+   ++   W                                   
Sbjct: 156 SFYGYYYGESDYFAHTISYVRHLSWAFFLTRKCXCRGFDSFYGYYYGESDYFAHTISYEN 215

Query: 203 --GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPL 260
             GLD   + +P W   G YST VFT  A  II N +  +PL L + H ATH A   E L
Sbjct: 216 HTGLDFWLNKKPVWSEIGTYSTSVFTKRAQYIIENRTKSKPLLLVITHQATHCALERERL 275

Query: 261 QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
           QAP   ++   +I +  R+ +A ++  LD+SVG+VV+AL    ML N+IIVF SDNGG  
Sbjct: 276 QAPQENIDKFPYIGEKNRTIYAGMVDALDQSVGQVVKALSDAGMLENTIIVFSSDNGGGP 335

Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + +   NWPLRG K T+WEGG R A  +WSPLL  R  V+ Q +H++DWLPT  S A
Sbjct: 336 WGEHNSRGFNWPLRGAKGTVWEGGTRAAAFVWSPLLARRRRVSHQLMHITDWLPTFYSIA 395

Query: 381 N 381
            
Sbjct: 396 G 396



 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 61/139 (43%), Gaps = 22/139 (15%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-ALTKGKWKLV---KVVKVMRYQVDLT 629
           E+DG D+W  LS    S R  +L+N D ++  S AL   ++KLV         RY+V   
Sbjct: 404 ELDGHDMWRPLSHGRRSPRVELLYNYDYKYTKSAALRNHRYKLVLDETGRFADRYKVPGG 463

Query: 630 GGP--DQVYLSGLSDREWLALAMRKL------------RDAASIQCGPVKEVPCE-PQIA 674
             P  D  YL+  S     A  +R              R  A + CG  K++       +
Sbjct: 464 SRPRHDLDYLAAKSTA---ASVLRDFYKKKQLNFPKSWRRKARLTCGKQKKINFSWSATS 520

Query: 675 PCLFDIKNDPCEKNNLADR 693
             LFD+  DPCE NNLA +
Sbjct: 521 VYLFDVVTDPCELNNLASK 539


>gi|187956367|gb|AAI50662.1| Arsb protein [Mus musculus]
          Length = 431

 Score =  328 bits (841), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 156/335 (46%), Positives = 223/335 (66%), Gaps = 16/335 (4%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
            ++ PPH++F+LADDLGWND+GFHG   I TP++DALA  G++L NYY   LCTPSRS +
Sbjct: 41  GATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQL 99

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TG++ IH G+QH ++  C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT
Sbjct: 100 LTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPT 159

Query: 175 FRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
            RGF+++ GY  G +DY+ H A    E +      LD+R   EPA + +  YST++FT  
Sbjct: 160 RRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKR 219

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
           A  +I NH  ++PLFLYLA  + H     +PLQ P+ Y+  +  I+D  R  +A ++  +
Sbjct: 220 ATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLM 274

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           DE+VG V +AL+   + +N++ +F +DNGG       +  +NWPLRG K TLWEGG+RG 
Sbjct: 275 DEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGT 330

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           G + SPLL+ +G+ + + +H+SDWLPTL+  A  S
Sbjct: 331 GFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGS 365



 Score = 39.7 bits (91), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH ++  C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVG 145


>gi|380025784|ref|XP_003696648.1| PREDICTED: arylsulfatase B-like [Apis florea]
          Length = 579

 Score =  327 bits (839), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 164/347 (47%), Positives = 226/347 (65%), Gaps = 9/347 (2%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           +  ++LPL  T+  V   L      PHII  +ADDLGWNDVGFHG +QIPTPNIDALAY+
Sbjct: 9   ITISLLPLILTIDCV---LGLYETAPHIIVFMADDLGWNDVGFHGSNQIPTPNIDALAYN 65

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           GIIL  +Y +   TPSR A  TG +PI  GMQ + + G E  GLPL  KILP++L+ LGY
Sbjct: 66  GIILNRHYVLPSSTPSRIAFFTGLYPIRIGMQGDGIRGGEPRGLPLHIKILPEHLRGLGY 125

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
            T+++GKWH+G++  +YTP  RGF++  G++  H  Y+D+      M G DM R  +PA 
Sbjct: 126 TTKLIGKWHMGYHTPQYTPLHRGFDTFFGFYNSHITYYDYEYSNQNMTGYDMHRGDDPAH 185

Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH-RHI 273
            +  +Y TD+FT EA+ II NH    PL+L ++H A H+     P++ PD   +     I
Sbjct: 186 GIKREYVTDLFTKEAIKIIENHELPRPLYLQISHLAVHA-----PIEQPDDSSSDEIIQI 240

Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
            +  R K+A ++ KLDESVG++V AL ++ ML +S+I+F++DNG A+ G   N  SN+PL
Sbjct: 241 REPNRRKYAKMVSKLDESVGRIVHALGEKGMLRDSLILFLTDNGAASIGRYRNYGSNYPL 300

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           RG K TL+EGGVRG   +WS  LE    V ++ +H++DWLPTL SAA
Sbjct: 301 RGTKYTLYEGGVRGVAALWSSRLEKAARVFKKLIHITDWLPTLYSAA 347


>gi|825628|emb|CAA51272.1| arylsulfatase [Homo sapiens]
 gi|189067435|dbj|BAG37417.1| unnamed protein product [Homo sapiens]
          Length = 533

 Score =  327 bits (839), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 39  AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 98  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 217

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 218 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 272

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 329 VGFVASPLLKQKGVKNRELIHISDWLPTLMKLA 361



 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 109 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 144


>gi|26350439|dbj|BAC38859.1| unnamed protein product [Mus musculus]
          Length = 431

 Score =  327 bits (839), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 155/335 (46%), Positives = 223/335 (66%), Gaps = 16/335 (4%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
            ++ PPH++F+LADDLGWND+GFHG   I TP++DALA  G++L NYY   LCTPSRS +
Sbjct: 41  GATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQL 99

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TG++ IH G+QH ++  C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT
Sbjct: 100 LTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPT 159

Query: 175 FRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
            RGF+++ GY  G +DY+ H A    E +      LD+R   EPA + +  YST++FT  
Sbjct: 160 RRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKR 219

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
           A  +I NH  ++PLFLYLA  + H     +PLQ P+ Y+  +  I+D  R  +A ++  +
Sbjct: 220 ATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLM 274

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           DE+VG V +AL+   + +N++ +F +DNGG       +  +NWPLRG K TLWEGG+RG 
Sbjct: 275 DEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGT 330

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           G + SPLL+ +G+ + + +H++DWLPTL+  A  S
Sbjct: 331 GFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGS 365



 Score = 39.7 bits (91), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH ++  C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVG 145


>gi|332224806|ref|XP_003261559.1| PREDICTED: arylsulfatase B isoform 1 [Nomascus leucogenys]
          Length = 535

 Score =  327 bits (839), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 41  AGASRPPHLVFVLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 219

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 220 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 274

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 275 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 330

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 331 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 363



 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146


>gi|344272682|ref|XP_003408160.1| PREDICTED: arylsulfatase B [Loxodonta africana]
          Length = 532

 Score =  327 bits (839), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 161/329 (48%), Positives = 215/329 (65%), Gaps = 16/329 (4%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPH++F+LADDLGWNDVGFHG   I TP++DALA  G+ L NYY   LCTPSRS +++G+
Sbjct: 43  PPHLVFLLADDLGWNDVGFHG-SSIRTPHLDALAAGGVRLDNYYVQPLCTPSRSQLLSGR 101

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT RGF
Sbjct: 102 YQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGF 161

Query: 179 ESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           +++ GY  G +DY+ H       A  +    LD R   E A      YST+VFT  A  +
Sbjct: 162 DTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTERATAL 221

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  IED  R  +A ++  +DE+V
Sbjct: 222 IANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIEDKNRRHYAGMVSLMDEAV 276

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G V  AL++R +  N++ +F +DNGG      L   +NWPLRG K TLWEGGVRG G + 
Sbjct: 277 GNVTAALKKRGLWDNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWTLWEGGVRGVGFVA 332

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           SPLL+ +G+   + +H+SDWLPTL+  A 
Sbjct: 333 SPLLKQKGVKNRELIHISDWLPTLVKLAG 361



 Score = 42.7 bits (99), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 108 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 143


>gi|441598315|ref|XP_004087449.1| PREDICTED: arylsulfatase B isoform 2 [Nomascus leucogenys]
          Length = 415

 Score =  327 bits (838), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 41  AGASRPPHLVFVLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 219

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 220 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 274

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 275 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 330

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 331 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 363



 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146


>gi|179030|gb|AAA51779.1| arylsulfatase B precursor [Homo sapiens]
          Length = 533

 Score =  327 bits (838), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 39  AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 98  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 217

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 218 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 272

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 329 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 361



 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 109 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 144


>gi|77993374|ref|NP_254278.1| arylsulfatase B precursor [Rattus norvegicus]
 gi|148887336|sp|P50430.2|ARSB_RAT RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S
 gi|81158016|tpe|CAI84982.1| TPA: arylsulfatase B [Rattus norvegicus]
 gi|195539740|gb|AAI68241.1| Arylsulfatase B [Rattus norvegicus]
          Length = 528

 Score =  327 bits (838), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 156/334 (46%), Positives = 224/334 (67%), Gaps = 16/334 (4%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           ++ PPH++F+LADDLGWND+GFHG   I TP++DALA  G++L NYY   LCTPSRS ++
Sbjct: 36  AAPPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLL 94

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           TG++ IH G+QH ++  C+   +PL EK+LPQ LK+ GY T +VGKWHLG Y+KE  PT 
Sbjct: 95  TGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTR 154

Query: 176 RGFESHLGYWTGHQDYFDHSAEEM------KMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           RGF+++ GY  G +DY+ H A             LD+R   EPA +    YST++FT  A
Sbjct: 155 RGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRA 214

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
             +I NH  ++PLFLYLA  + H     +PLQ P+ Y+  +  I+D  R  +A ++  LD
Sbjct: 215 TTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLD 269

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           E+VG V +AL+ R + +N++++F +DNGG       +  +NWPLRG K TLWEGG+RGAG
Sbjct: 270 EAVGNVTKALKSRGLWNNTVLIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGAG 325

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            + SPLL+ +G+ + + +H++DWLPTL++ A  S
Sbjct: 326 FVASPLLKQKGVKSRELMHITDWLPTLVNLAGGS 359


>gi|38569405|ref|NP_000037.2| arylsulfatase B isoform 1 precursor [Homo sapiens]
 gi|114223|sp|P15848.1|ARSB_HUMAN RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
           Flags: Precursor
 gi|179077|gb|AAA51784.1| arylsulfatase B precursor (EC 3.1.6.1) [Homo sapiens]
 gi|119616227|gb|EAW95821.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
 gi|119616229|gb|EAW95823.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
          Length = 533

 Score =  327 bits (838), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 39  AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 98  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 217

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 218 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 272

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 329 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 361



 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 109 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 144


>gi|410226860|gb|JAA10649.1| arylsulfatase B [Pan troglodytes]
 gi|410292332|gb|JAA24766.1| arylsulfatase B [Pan troglodytes]
          Length = 535

 Score =  327 bits (838), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 41  AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 219

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 220 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 274

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 275 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 330

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 331 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 363



 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146


>gi|410226852|gb|JAA10645.1| arylsulfatase B [Pan troglodytes]
 gi|410226856|gb|JAA10647.1| arylsulfatase B [Pan troglodytes]
 gi|410226858|gb|JAA10648.1| arylsulfatase B [Pan troglodytes]
 gi|410292328|gb|JAA24764.1| arylsulfatase B [Pan troglodytes]
          Length = 534

 Score =  327 bits (838), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 40  AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 99  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 218

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 219 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 273

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 274 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 329

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 330 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 362



 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145


>gi|74140818|dbj|BAE34455.1| unnamed protein product [Mus musculus]
          Length = 458

 Score =  327 bits (838), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 156/335 (46%), Positives = 223/335 (66%), Gaps = 16/335 (4%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
            ++ PPH++F+LADDLGWND+GFHG   I TP++DALA  G++L NYY   LCTPSRS +
Sbjct: 41  GATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQL 99

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TG++ IH G+QH ++  C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT
Sbjct: 100 LTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPT 159

Query: 175 FRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
            RGF+++ GY  G +DY+ H A    E +      LD+R   EPA + +  YST++FT  
Sbjct: 160 RRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKR 219

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
           A  +I NH  ++PLFLYLA  + H     +PLQ P+ Y+  +  I+D  R  +A ++  +
Sbjct: 220 ATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLM 274

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           DE+VG V +AL+   + +N++ +F +DNGG       +  +NWPLRG K TLWEGG+RG 
Sbjct: 275 DEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGT 330

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           G + SPLL+ +G+ + + +H+SDWLPTL+  A  S
Sbjct: 331 GFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGS 365



 Score = 39.7 bits (91), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH ++  C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVG 145


>gi|426384277|ref|XP_004058697.1| PREDICTED: arylsulfatase B-like [Gorilla gorilla gorilla]
          Length = 408

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 41  AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 219

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 220 RAIALITNHPREKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 274

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 275 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 330

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 331 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 363



 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146


>gi|380025315|ref|XP_003696421.1| PREDICTED: arylsulfatase B-like [Apis florea]
          Length = 546

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 154/327 (47%), Positives = 225/327 (68%), Gaps = 8/327 (2%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II I+ADDLGWNDV FHG +QIPTPNIDALAY+G+IL+ +Y + +CTPSR+A +TG++
Sbjct: 24  PNIIVIMADDLGWNDVSFHGANQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRY 83

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PI TGMQ   L   E   +PL+  +LP+YL++LGY T +VGKWH+G+Y   +TPT RGF+
Sbjct: 84  PIRTGMQGYPLKAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFD 143

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMR----RDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
           +  GY+ G+  YF+H+ ++    G D+     ++L  A++   +Y TD+ T  A +II N
Sbjct: 144 TFFGYYNGYISYFNHTIKQNNHVGYDLHYHNSKNLSVAYNF--EYITDLITERAENIIKN 201

Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN--IHRHIEDFKRSKFAAILHKLDESVG 293
           H   +PL+L L+H A HS++  E ++  D         +IED+ R K+A ++  +DESVG
Sbjct: 202 HDRRKPLYLQLSHLAVHSSDAKEVMEVRDEQETNATLEYIEDYNRRKYAGVVTAMDESVG 261

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
           +V++AL Q  ML NSIIVF+SDNG    G   N  SN+PLRG+K TL+EGG+RG   ++S
Sbjct: 262 RVIKALGQSSMLENSIIVFMSDNGAQTEGLLENYGSNYPLRGLKFTLFEGGIRGVACVYS 321

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA 380
            L+++   ++ + +H++DWLPT  SAA
Sbjct: 322 RLIQNSSRISNELIHITDWLPTFYSAA 348



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/150 (28%), Positives = 70/150 (46%), Gaps = 22/150 (14%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV--KVVKVMRYQVD----- 627
           +DG+D W  +     SKR ++L NID+   +S+   GK+KL+  K ++   Y  D     
Sbjct: 359 MDGVDQWDTIVSGRESKRGSVLLNIDEVENVSSALLGKYKLINGKNIQYSDYYGDNGTSV 418

Query: 628 ------LTGGPDQVYLSGLSDREWLALAMR---KLRDAASIQCGPVKEVP-CEPQIAPCL 677
                 ++        S +       L +    KLR+ A + C      P C   +  CL
Sbjct: 419 SYPEYNVSSVLHSFVGSTIFHTTNFTLNIEDVIKLRNKARVVCNNFTSYPKC---VDKCL 475

Query: 678 FDIKNDPCEKNNLADRSED--QRINHYTTE 705
           FD+ NDPCE  +L+D   +  +++N++ TE
Sbjct: 476 FDVYNDPCETTDLSDEYPEIVKKLNYFITE 505



 Score = 45.1 bits (105), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 35/61 (57%), Gaps = 6/61 (9%)

Query: 754 EGMRKLRDAASIQCGPVKEVP-CEPQIAPCLFDIKNDPCEKNNLADRSE--VQRINHYTT 810
           E + KLR+ A + C      P C   +  CLFD+ NDPCE  +L+D     V+++N++ T
Sbjct: 448 EDVIKLRNKARVVCNNFTSYPKC---VDKCLFDVYNDPCETTDLSDEYPEIVKKLNYFIT 504

Query: 811 E 811
           E
Sbjct: 505 E 505


>gi|158255166|dbj|BAF83554.1| unnamed protein product [Homo sapiens]
          Length = 413

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 39  AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 98  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 217

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 218 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 272

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 329 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 361



 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 109 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 144


>gi|410226854|gb|JAA10646.1| arylsulfatase B [Pan troglodytes]
 gi|410292330|gb|JAA24765.1| arylsulfatase B [Pan troglodytes]
          Length = 414

 Score =  327 bits (837), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 40  AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 99  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 218

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 219 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 273

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 274 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 329

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 330 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 362



 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145


>gi|38569407|ref|NP_942002.1| arylsulfatase B isoform 2 precursor [Homo sapiens]
 gi|20809799|gb|AAH29051.1| Arylsulfatase B [Homo sapiens]
 gi|119616228|gb|EAW95822.1| arylsulfatase B, isoform CRA_b [Homo sapiens]
          Length = 413

 Score =  327 bits (837), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 39  AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 98  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 217

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 218 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 272

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 329 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 361



 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 109 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 144


>gi|410260404|gb|JAA18168.1| arylsulfatase B [Pan troglodytes]
 gi|410260406|gb|JAA18169.1| arylsulfatase B [Pan troglodytes]
 gi|410260408|gb|JAA18170.1| arylsulfatase B [Pan troglodytes]
 gi|410341765|gb|JAA39829.1| arylsulfatase B [Pan troglodytes]
 gi|410341769|gb|JAA39831.1| arylsulfatase B [Pan troglodytes]
          Length = 534

 Score =  326 bits (836), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 40  AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 99  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 218

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 219 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 273

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 274 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 329

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 330 VGFVASPLLKQKGMKNRELIHISDWLPTLVKLA 362



 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145


>gi|410260412|gb|JAA18172.1| arylsulfatase B [Pan troglodytes]
 gi|410341771|gb|JAA39832.1| arylsulfatase B [Pan troglodytes]
          Length = 535

 Score =  326 bits (836), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 41  AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 219

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 220 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 274

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 275 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 330

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 331 VGFVASPLLKQKGMKNRELIHISDWLPTLVKLA 363



 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146


>gi|242008416|ref|XP_002425002.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212508631|gb|EEB12264.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 532

 Score =  326 bits (836), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 170/350 (48%), Positives = 230/350 (65%), Gaps = 12/350 (3%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
               L++  V    +  PPHII IL DDLGW D GFHG DQI TPN+DALAYSG+IL  +
Sbjct: 10  FIIVLNLAVVCKSTAQQPPHIITILIDDLGWTDTGFHGSDQIKTPNMDALAYSGMILNRH 69

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y +  CTPSRSA++TG +PI TGMQ   L G +   LPLS K+ P++LK LGYRT +VGK
Sbjct: 70  YVLPSCTPSRSALLTGLYPIRTGMQGMPLKGGDVRNLPLSFKLKPEFLKNLGYRTHLVGK 129

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH------SAEEMK-MWGLDMRRDLEPAW 214
           WHLG+    + P  RGF+S  GY+ G+ DYF        + E+++  +G D+ R+ E   
Sbjct: 130 WHLGYRTINHLPNQRGFDSFFGYYNGYVDYFKFGHNQTVAGEKIEYFYGYDLHRNGEIYQ 189

Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP-DHYLN-IHRH 272
                Y+T +FT EA  II NH+  EPL+LY +H ATH+ +    ++ P D  +N  + H
Sbjct: 190 TDKDTYATRLFTREAEKIIKNHNESEPLYLYFSHLATHTGDDDIGMEVPEDADVNKTYGH 249

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL--NAASN 330
           I+ + R  FA  L +LD+SVG+V+EAL+++ ML NSII+ +SDNGG     +L  N +SN
Sbjct: 250 IKHYGRRAFAGCLEELDKSVGEVMEALKEKNMLDNSIILIMSDNGGHTVSVDLPPNWSSN 309

Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           WPL G K TL+EGGVR   LIWSPLL  +G++ + ++H++DWLPTL SAA
Sbjct: 310 WPLGGTKFTLFEGGVRSVALIWSPLL-PKGVINDDFIHITDWLPTLYSAA 358


>gi|109077718|ref|XP_001108389.1| PREDICTED: arylsulfatase B isoform 6 [Macaca mulatta]
          Length = 534

 Score =  326 bits (836), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 159/332 (47%), Positives = 218/332 (65%), Gaps = 16/332 (4%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS ++
Sbjct: 42  ASRPPHLVFVLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT 
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160

Query: 176 RGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT  A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
             +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  +D
Sbjct: 221 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           E+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG G
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVG 331

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            + SPLL+ +G+   + +H+SDWLPTL+  A 
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLAG 363



 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145


>gi|297675538|ref|XP_002815731.1| PREDICTED: arylsulfatase B [Pongo abelii]
          Length = 534

 Score =  326 bits (836), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 158/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 40  AGASQPPHLVFLLADDLGWNDVGFHG-SRIHTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 99  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 218

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 219 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 273

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGG+RG
Sbjct: 274 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGIRG 329

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 330 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 362



 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145


>gi|410260410|gb|JAA18171.1| arylsulfatase B [Pan troglodytes]
 gi|410341767|gb|JAA39830.1| arylsulfatase B [Pan troglodytes]
          Length = 414

 Score =  326 bits (835), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 40  AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 99  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 218

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 219 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 273

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 274 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 329

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 330 VGFVASPLLKQKGMKNRELIHISDWLPTLVKLA 362



 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145


>gi|114599506|ref|XP_001140908.1| PREDICTED: arylsulfatase B isoform 2 [Pan troglodytes]
          Length = 415

 Score =  326 bits (835), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 41  AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 219

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 220 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 274

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 275 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 330

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 331 VGFVASPLLKQKGMKNRELIHISDWLPTLVKLA 363



 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146


>gi|109077724|ref|XP_001108177.1| PREDICTED: arylsulfatase B isoform 2 [Macaca mulatta]
          Length = 414

 Score =  325 bits (834), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 159/332 (47%), Positives = 218/332 (65%), Gaps = 16/332 (4%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS ++
Sbjct: 42  ASRPPHLVFVLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT 
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160

Query: 176 RGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT  A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
             +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  +D
Sbjct: 221 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           E+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG G
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVG 331

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            + SPLL+ +G+   + +H+SDWLPTL+  A 
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLAG 363



 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145


>gi|348557289|ref|XP_003464452.1| PREDICTED: arylsulfatase B-like [Cavia porcellus]
          Length = 520

 Score =  325 bits (834), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 158/331 (47%), Positives = 215/331 (64%), Gaps = 16/331 (4%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPH++ +LADDLGWNDVGFHG  ++ TP++DALA  G+ L NYYT  LCTPSRS ++TG+
Sbjct: 31  PPHLVLVLADDLGWNDVGFHG-SRLRTPHLDALAAGGVRLDNYYTQPLCTPSRSQLLTGR 89

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT RGF
Sbjct: 90  YQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGF 149

Query: 179 ESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           +++ GY  G +DYF H       A  +    LD R   E A +    YS ++F   A+ +
Sbjct: 150 DTYFGYLLGSEDYFSHEHCVFIKALNVTRCALDFRDGEEVATEYKNMYSANIFANRAISL 209

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I D  R  +A ++  +DE+V
Sbjct: 210 IANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIRDKNRRLYAGMVSLMDEAV 264

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G V  ALE   + +N++ +F +DNGG      L   +NWPLRG K TLWEGG+RG G + 
Sbjct: 265 GNVTAALESHGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWTLWEGGIRGVGFVA 320

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           SPLL+ +G+   + +H+SDWLPTL+  A  S
Sbjct: 321 SPLLKQKGVRTRELIHISDWLPTLVKLAGGS 351



 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 96  LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYATHMVG 131


>gi|125656171|ref|NP_033842.3| arylsulfatase B precursor [Mus musculus]
 gi|81158036|tpe|CAI84992.1| TPA: arylsulfatase B [Mus musculus]
 gi|195934801|gb|AAI68412.1| Arylsulfatase B [synthetic construct]
          Length = 534

 Score =  325 bits (834), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 155/335 (46%), Positives = 223/335 (66%), Gaps = 16/335 (4%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
            ++ PPH++F+LADDLGWND+GFHG   I TP++DALA  G++L NYY   LCTPSRS +
Sbjct: 41  GATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQL 99

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TG++ IH G+QH ++  C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT
Sbjct: 100 LTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPT 159

Query: 175 FRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
            RGF+++ GY  G +DY+ H A    E +      LD+R   EPA + +  YST++FT  
Sbjct: 160 RRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKR 219

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
           A  +I NH  ++PLFLYLA  + H     +PLQ P+ Y+  +  I+D  R  +A ++  +
Sbjct: 220 ATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLM 274

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           DE+VG V +AL+   + +N++ +F +DNGG       +  +NWPLRG K TLWEGG+RG 
Sbjct: 275 DEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGT 330

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           G + SPLL+ +G+ + + +H++DWLPTL+  A  S
Sbjct: 331 GFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGS 365



 Score = 40.0 bits (92), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH ++  C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVG 145


>gi|122065132|sp|P50429.3|ARSB_MOUSE RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
           Flags: Precursor
 gi|74152170|dbj|BAE32375.1| unnamed protein product [Mus musculus]
          Length = 534

 Score =  325 bits (833), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 155/335 (46%), Positives = 223/335 (66%), Gaps = 16/335 (4%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
            ++ PPH++F+LADDLGWND+GFHG   I TP++DALA  G++L NYY   LCTPSRS +
Sbjct: 41  GATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQL 99

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TG++ IH G+QH ++  C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT
Sbjct: 100 LTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPT 159

Query: 175 FRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
            RGF+++ GY  G +DY+ H A    E +      LD+R   EPA + +  YST++FT  
Sbjct: 160 RRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKR 219

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
           A  +I NH  ++PLFLYLA  + H     +PLQ P+ Y+  +  I+D  R  +A ++  +
Sbjct: 220 ATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLM 274

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           DE+VG V +AL+   + +N++ +F +DNGG       +  +NWPLRG K TLWEGG+RG 
Sbjct: 275 DEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGT 330

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           G + SPLL+ +G+ + + +H++DWLPTL+  A  S
Sbjct: 331 GFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGS 365



 Score = 40.0 bits (92), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH ++  C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVG 145


>gi|383847821|ref|XP_003699551.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
          Length = 575

 Score =  325 bits (833), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 158/323 (48%), Positives = 221/323 (68%), Gaps = 7/323 (2%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHII I+ADDLGWNDVGFHG +QIPTPNIDAL Y+GIIL  +Y +   TPSRSA +TG +
Sbjct: 27  PHIIVIMADDLGWNDVGFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRSAFLTGLY 86

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PI  GMQ + + G E  GLPL  KILP++L++LGY T+++GKWH+G++  +YTP  RGF+
Sbjct: 87  PIRIGMQGDGIRGGEPRGLPLDIKILPEHLRDLGYTTKLIGKWHMGYHTPQYTPLHRGFD 146

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
             LG++  H  Y+D+      M G D+ R  +PA  ++ +Y+TD+FT EAV +I  H   
Sbjct: 147 FFLGFYNSHISYYDYHYSNQNMSGYDLHRGDDPAHGINREYATDLFTKEAVRMIETHELP 206

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHR--HIEDFKRSKFAAILHKLDESVGKVVE 297
            PL+L ++H A H+     PL+ P    N  R  +I +  R K+A ++ +LD+SVG++V 
Sbjct: 207 RPLYLQISHLAVHA-----PLEQPRDEYNDGRFSYIREPNRRKYARMVSRLDDSVGRIVH 261

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL ++ ML +S+I+F++DNG A+ G   N  SN+PLRG K TL+EGGVR    +WSP L+
Sbjct: 262 ALGEKGMLRDSLILFLTDNGAASVGRYRNWGSNYPLRGTKYTLYEGGVRSVAALWSPRLQ 321

Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
               V+ Q +H++DWLPTL SAA
Sbjct: 322 KVARVSNQLIHITDWLPTLYSAA 344


>gi|386771363|ref|NP_730304.2| CG32191 [Drosophila melanogaster]
 gi|229368437|gb|ACQ59088.1| MIP05773p [Drosophila melanogaster]
 gi|383291992|gb|AAN11683.2| CG32191 [Drosophila melanogaster]
          Length = 564

 Score =  324 bits (831), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 163/335 (48%), Positives = 221/335 (65%), Gaps = 6/335 (1%)

Query: 52  DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
           D  A++  P+II I+ADD+G++DV F G  +  TPNIDALAY G +L   Y   +CTPSR
Sbjct: 19  DESAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSR 78

Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
            A+++G++PIHTG QH V+   E   L L+  ++P+  KE GY T +VGKWHLGF + EY
Sbjct: 79  GALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEY 138

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKM----WGLDMRRDLEPAWDLHGKYSTDVFTA 227
           TPT RGF+ H GYW  + DYF   + +M +     G D RR++E      G Y TD+ TA
Sbjct: 139 TPTRRGFDYHFGYWGAYIDYFQRRS-KMPVANYSLGYDFRRNMELECRDRGVYVTDLLTA 197

Query: 228 EAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
           EA  +I +H+  E PLFL L+H A H+AN  +PLQAP+  +    +I+D  R K+AA++ 
Sbjct: 198 EAERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMIS 257

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
           KLD+SVG+++ AL     L NSI++F SDNG  + G   N  SN+PLRG KNT WEGGVR
Sbjct: 258 KLDQSVGRIITALSSTDQLENSIVIFYSDNGAPSVGMFSNTGSNFPLRGQKNTPWEGGVR 317

Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            AG IWS  L++RG +  Q ++V+DWLPTL  AA+
Sbjct: 318 VAGAIWSSGLQARGSIFRQPLYVADWLPTLSRAAD 352



 Score = 69.7 bits (169), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 44/140 (31%), Positives = 63/140 (45%), Gaps = 23/140 (16%)

Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM--------- 622
           ++DGID+W  LS   + P     ILH +DD W++SAL  G+WK V               
Sbjct: 360 KLDGIDLWPELSGSADAPHVPREILHILDDVWRLSALQMGQWKYVNGTTASGRYDSVLTY 419

Query: 623 -----------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
                      RY V +        LS    R      +   R  A+++CG ++   C P
Sbjct: 420 RELDDLDPRDSRYAVTVRNSATSRALSRYDLRRLTQQRISLTRRLAAVRCGDLQR-SCNP 478

Query: 672 QIAPCLFDIKNDPCEKNNLA 691
            +  CL+DI +DPCE+NNL 
Sbjct: 479 LLEECLYDILSDPCEQNNLV 498



 Score = 47.0 bits (110), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 21/95 (22%), Positives = 47/95 (49%), Gaps = 3/95 (3%)

Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           TT  GR++ +    + ++ + +  +     +     +   +   ++ + +   +   R  
Sbjct: 407 TTASGRYDSVLTYRELDDLDPRDSRYAVTVRNSATSRALSRYDLRRLTQQRISL--TRRL 464

Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
           A+++CG ++   C P +  CL+DI +DPCE+NNL 
Sbjct: 465 AAVRCGDLQR-SCNPLLEECLYDILSDPCEQNNLV 498


>gi|380795845|gb|AFE69798.1| arylsulfatase B isoform 1 precursor, partial [Macaca mulatta]
          Length = 506

 Score =  324 bits (831), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 159/332 (47%), Positives = 217/332 (65%), Gaps = 16/332 (4%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           +S PPH++F+LADDLGWNDVGFHG   I TP++DALA  G++L NYYT  LCTPSRS ++
Sbjct: 14  ASRPPHLVFVLADDLGWNDVGFHG-SCIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 72

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT 
Sbjct: 73  TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 132

Query: 176 RGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT  A
Sbjct: 133 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 192

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
             +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  +D
Sbjct: 193 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 247

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           E+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG G
Sbjct: 248 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVG 303

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            + SPLL+ +G+   + +H+SDWLPTL+  A 
Sbjct: 304 FVASPLLKQKGVKNRELIHISDWLPTLVKLAG 335



 Score = 42.7 bits (99), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 82  LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 117


>gi|260794559|ref|XP_002592276.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
 gi|229277492|gb|EEN48287.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
          Length = 520

 Score =  324 bits (830), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 160/360 (44%), Positives = 235/360 (65%), Gaps = 19/360 (5%)

Query: 31  RTR-IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNID 89
           R+R  + F +L    +    F    ++SG P+I+FILADD GWND+G+HG   I TPN+D
Sbjct: 14  RSRPFLPFLLLVCVLSGGKTFAQ--SASGKPNIVFILADDYGWNDIGYHG-SVIRTPNLD 70

Query: 90  ALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
            LA  G+ L+NYY   +CTPSRS +MTG++ IH G+QH++++  +  GLPL E  LPQ L
Sbjct: 71  RLAAEGVKLENYYIQPICTPSRSQLMTGRYQIHFGLQHSIIWPPQPSGLPLDEVTLPQRL 130

Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMW-----G 203
           KE GY T IVGKWHLGFYK+EYTP  RGF++  G+ TG ++++ H ++  M  +     G
Sbjct: 131 KEGGYSTHIVGKWHLGFYKEEYTPLHRGFDTFYGFLTGSENHYSHRNSGGMPGFRPGWNG 190

Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
           LD+R    P  D +G YST +F  +A+++I      +P+FLYL   A H+     PLQAP
Sbjct: 191 LDLRDQDRPVTDQNGTYSTHLFAKKAIEVIAQQDKSKPMFLYLPFQAVHA-----PLQAP 245

Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
             Y++++RHI D+ R  +AA+   +DE+VG + +AL+Q  +  N+++VF +DNGG     
Sbjct: 246 QKYISMYRHINDYNRRMYAAMTTAMDEAVGNITDALKQYGLWDNTVLVFSTDNGGQV--- 302

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            +   +NWPLRG K +LWEGG+RG G + SPL++++G  ++  +H+SDW PTL+  A  S
Sbjct: 303 -MAGGNNWPLRGWKGSLWEGGIRGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLAGGS 361


>gi|351697185|gb|EHB00104.1| Arylsulfatase B, partial [Heterocephalus glaber]
          Length = 503

 Score =  323 bits (828), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 158/329 (48%), Positives = 216/329 (65%), Gaps = 16/329 (4%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPH++F+LADDLGWNDVGFHG  ++ TP++DALA  G+ L NYYT  LCTPSRS ++TG+
Sbjct: 15  PPHLVFVLADDLGWNDVGFHG-SRLRTPHLDALAAGGVQLDNYYTQPLCTPSRSQLLTGR 73

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH +++ C+   +PL EK+LPQ L+E GY T +VGKWHLG Y+KE  PT RGF
Sbjct: 74  YQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLQEAGYATHMVGKWHLGMYQKECLPTRRGF 133

Query: 179 ESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           +++ GY  G +DY+ H       A  +    LD R   E A      YST++FT  A  +
Sbjct: 134 DTYFGYLLGSEDYYTHEHCVFIKALNVTRCALDFRDGEEVATQYKNLYSTNIFTNRATSL 193

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  +DE+V
Sbjct: 194 IANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHLYAGMVSLMDEAV 248

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G V  AL+   + +N++ +F +DNGG      L   +NWPLRG K TLWEGGVRG G + 
Sbjct: 249 GNVTAALKSHGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWTLWEGGVRGVGFVA 304

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           SPLL+ +G+   + +H+SDWLPTL+  A 
Sbjct: 305 SPLLKQKGVKNRELIHISDWLPTLVKLAG 333



 Score = 41.6 bits (96), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ L+E GY T ++ 
Sbjct: 80  LQHQIIWPCQPSCVPLDEKLLPQLLQEAGYATHMVG 115


>gi|405956212|gb|EKC22964.1| Arylsulfatase B [Crassostrea gigas]
          Length = 491

 Score =  323 bits (828), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 155/325 (47%), Positives = 221/325 (68%), Gaps = 13/325 (4%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+ I+ADD G+ND+G+HG  +I TPN+D LA  G+ L+NYY   +CTP+RS +M+G++
Sbjct: 24  PHILLIIADDFGYNDIGYHG-SEIKTPNLDKLAGEGVKLENYYVQPICTPTRSQLMSGRY 82

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QH V+   +  GLPL   ILPQ LKE+GY T  VGKWHLGFYK+EY PT RGF+
Sbjct: 83  QIHTGLQHGVIRPPQPNGLPLDSAILPQKLKEVGYSTHAVGKWHLGFYKEEYLPTNRGFD 142

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           SH GY TG +DYF H      M G D+R ++ PA +  G YST +F  +A ++++NH+TD
Sbjct: 143 SHFGYLTGAEDYFKHDRCFGAMCGTDLRDNMNPA-NYTGVYSTHLFAQKAAEVVNNHNTD 201

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           +PLFLYL   + H+     PLQ P+ Y   + HI+D +R  +A ++  +DE++G + +  
Sbjct: 202 KPLFLYLPFQSVHA-----PLQVPEQYTKPYMHIQDKQRRTYAGMVSAMDEAIGNLTDVF 256

Query: 300 EQRRMLSNSIIVFVSDNGGAA-AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           ++R M +N+++VF +DNGG   AG N     N+PLRG K++LWEGGV G GL+   +L+ 
Sbjct: 257 KKRGMWNNTLMVFSTDNGGQIHAGGN-----NFPLRGWKDSLWEGGVHGVGLVHGQMLKR 311

Query: 359 RGIVAEQYVHVSDWLPTLLSAANKS 383
           +G V+   +HVSDW PTL+S    +
Sbjct: 312 KGSVSRDLIHVSDWFPTLISMVGGT 336


>gi|350422929|ref|XP_003493332.1| PREDICTED: arylsulfatase B-like [Bombus impatiens]
          Length = 581

 Score =  323 bits (827), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 166/348 (47%), Positives = 235/348 (67%), Gaps = 10/348 (2%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           ++ ++LPL   +   F  L  +S  PHII  +ADDLGWNDV FHG +QIPTPNIDAL Y+
Sbjct: 9   ISTSLLPLILFIDCAF-GLYETS--PHIIVFMADDLGWNDVSFHGSNQIPTPNIDALGYN 65

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           GIIL  +Y +   TPSR+A  TG++PI  GMQ   + G E  GLPL+ KILP++L+ LGY
Sbjct: 66  GIILNRHYVLPSSTPSRTAFFTGQYPIRIGMQGADIRGGEPRGLPLNIKILPEHLRGLGY 125

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
            T+++GKWH+G+Y  +YTP  RGF++ LG++  +  Y+D+S     M G DM R  +PA+
Sbjct: 126 TTKLIGKWHMGYYTPQYTPLHRGFDTFLGFYNSYISYYDYSYSNQNMSGYDMHRGDDPAY 185

Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR--H 272
            ++ +Y+TD+FT EA++II NH  + PL+L L+H A H+     PL+ P +  N     H
Sbjct: 186 GMNREYATDMFTREAINIIENHELNRPLYLQLSHLAVHA-----PLEQPMNVYNDREPIH 240

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
           I +  R K+A ++ KLDESVG++  AL ++ ML +S+I+F++DNG A+ G   N  SN+P
Sbjct: 241 IREPNRRKYAKMVSKLDESVGRIFHALGEKGMLKDSLILFLTDNGAASVGRYRNYGSNYP 300

Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           LRG K TL+EGGVRG   +WS  L+    V+ + VH++DWLPTL SAA
Sbjct: 301 LRGTKYTLYEGGVRGVAALWSSRLQKAARVSNELVHITDWLPTLYSAA 348



 Score = 42.7 bits (99), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 37/73 (50%), Gaps = 11/73 (15%)

Query: 756 MRKLRDAASIQCGP-------VKEVPCEPQIAPCLFDIKNDPCEKNNLADR-SEVQR-IN 806
           M +LR  A++QC             PC   +  CLFDI NDPCE  N+A+    + R +N
Sbjct: 454 MVRLRQDATVQCNANFTYYRRQMLTPC--NVTECLFDIVNDPCETRNIAETYVRIARDLN 511

Query: 807 HYTTEVGYLDPKQ 819
           ++  E  ++  KQ
Sbjct: 512 YFLEEYSHILVKQ 524


>gi|114326200|ref|NP_001041598.1| arylsulfatase B precursor [Canis lupus familiaris]
 gi|81158050|tpe|CAI84999.1| TPA: arylsulfatase B [Canis lupus familiaris]
          Length = 535

 Score =  322 bits (826), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 159/329 (48%), Positives = 217/329 (65%), Gaps = 16/329 (4%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           H++F+LADDLGW+DVGFHG  +I TP++DALA +G++L NYYT  LCTPSRS ++TG++ 
Sbjct: 48  HLVFVLADDLGWHDVGFHG-SRIRTPHLDALAAAGVLLDNYYTQPLCTPSRSQLLTGRYQ 106

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT RGF++
Sbjct: 107 IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 166

Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           + GY  G +DY+ H       A  +    LD R   E A      YST++FT  A  +I 
Sbjct: 167 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIS 226

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
           NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I D  R  +A ++  +DE+VG 
Sbjct: 227 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIHDKNRRYYAGMVSLMDEAVGN 281

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V  AL+   + +N++ VF +DNGG      L   +NWPLRG K +LWEGGVRG G + SP
Sbjct: 282 VTAALKSHGLWNNTVFVFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASP 337

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           LL+ +G+ + + VH+SDWLPTL+  A  S
Sbjct: 338 LLKRKGVKSRELVHISDWLPTLVGLAGGS 366



 Score = 42.4 bits (98), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146


>gi|195494733|ref|XP_002094965.1| GE22117 [Drosophila yakuba]
 gi|194181066|gb|EDW94677.1| GE22117 [Drosophila yakuba]
          Length = 565

 Score =  322 bits (826), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 163/337 (48%), Positives = 218/337 (64%), Gaps = 10/337 (2%)

Query: 52  DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
           D  A++  P+II I+ADD+G++DV F G  +  TPNIDALAY G +L   Y   +CTPSR
Sbjct: 19  DESAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSR 78

Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
            A+++G++PIHTG QH V+   E   L L+  ++P+  KE GY T +VGKWHLGF + EY
Sbjct: 79  GALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEY 138

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKM------WGLDMRRDLEPAWDLHGKYSTDVF 225
           TPT RGF+ H GYW  + DYF   +   KM       G D RR++E      G Y TD+ 
Sbjct: 139 TPTRRGFDYHFGYWGAYIDYFQRRS---KMPVANYSLGYDFRRNMELECRDRGVYVTDLL 195

Query: 226 TAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAI 284
           TAEA  +I  H+  E PLFL L+H A H+AN  +PLQAP+  +    +I+D  R K+AA+
Sbjct: 196 TAEAERLIKGHAGKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAM 255

Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
           + KLD+SVG+++ +L     L NSI++F SDNG  + G   N  SN+PLRG KNT WEGG
Sbjct: 256 ISKLDQSVGRIMSSLAATDQLENSIVIFYSDNGAPSVGMFANTGSNFPLRGQKNTPWEGG 315

Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           VR AG IWS  L++RG +  Q ++V+DWLPTL  AA 
Sbjct: 316 VRVAGAIWSSKLQARGSIFSQPLYVADWLPTLCRAAG 352



 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 43/140 (30%), Positives = 63/140 (45%), Gaps = 23/140 (16%)

Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM--------- 622
           ++DGID+W+ LS   + P     ILH +DD W+++AL  G+WK V               
Sbjct: 360 KLDGIDLWAELSGSADAPHVPREILHILDDVWRLAALQLGQWKYVNGTTAAGRYDSLLTY 419

Query: 623 -----------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
                      RY V +        LS    R      +   R  A+++CG ++   C P
Sbjct: 420 RELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISITRRLAAVRCGDLQR-SCNP 478

Query: 672 QIAPCLFDIKNDPCEKNNLA 691
            +  CL+DI  DPCE+NNL 
Sbjct: 479 LLEECLYDISTDPCEQNNLV 498



 Score = 47.8 bits (112), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 21/95 (22%), Positives = 47/95 (49%), Gaps = 3/95 (3%)

Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           TT  GR++ +    + ++ + ++ +     +     +   +   ++ + +   +   R  
Sbjct: 407 TTAAGRYDSLLTYRELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISI--TRRL 464

Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
           A+++CG ++   C P +  CL+DI  DPCE+NNL 
Sbjct: 465 AAVRCGDLQR-SCNPLLEECLYDISTDPCEQNNLV 498


>gi|198466274|ref|XP_001353949.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
 gi|198150525|gb|EAL29685.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
          Length = 575

 Score =  322 bits (825), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 159/329 (48%), Positives = 216/329 (65%), Gaps = 4/329 (1%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           ++G P+II ILADD+G++DV F G  +  TPNIDALA+ G IL   Y   +CTPSR A++
Sbjct: 33  TAGRPNIIIILADDMGFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALL 92

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           +G++PIHTG QH V+   E  GL L+  ++P+  ++ GY T ++GKWHLGF + EYTPT 
Sbjct: 93  SGRYPIHTGTQHFVISNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTR 152

Query: 176 RGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           RGF+ H GYW  + DY+   ++        G D RR++E      G Y TD+ T EA  +
Sbjct: 153 RGFDYHYGYWGAYIDYYQRRSKMPARNYSLGYDFRRNMELECRDRGVYVTDLLTNEAERV 212

Query: 233 IHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I      +EPLFL L+H ATH+AN  +PLQAP+  +    +I+D  R K+AA++ KLD+S
Sbjct: 213 IREREGQEEPLFLVLSHLATHTANEDDPLQAPEEEIRKFAYIKDPNRRKYAAMVSKLDQS 272

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           VG++V AL     L NSI++F SDNG  + G   N  SNWPLRG KNT WEGGVR AG I
Sbjct: 273 VGRIVSALNSTGQLENSIVIFYSDNGAPSVGMFANTGSNWPLRGQKNTPWEGGVRVAGAI 332

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           WS  L++RG +  Q ++V+DWLP+L  AA
Sbjct: 333 WSAQLQARGNIFTQPIYVADWLPSLAHAA 361



 Score = 72.8 bits (177), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 66/140 (47%), Gaps = 23/140 (16%)

Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV------------ 619
           E+DGID+W  L+   + P     ILH +DDEW++++   G WK +               
Sbjct: 370 ELDGIDLWPQLAGPVDAPHVPREILHILDDEWRVTSFQMGHWKYINGTTSAGQYDQVLTY 429

Query: 620 --------KVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
                   +  RY V +   P    L+    R      +  LR +A+++CG ++   C P
Sbjct: 430 RELDDLDPRESRYVVTVRNSPTSRALARHDIRRMTQQRVMTLRRSAAVRCGDLQR-GCNP 488

Query: 672 QIAPCLFDIKNDPCEKNNLA 691
            +  C++DI +DPCEKNNL 
Sbjct: 489 LVEECVYDIWDDPCEKNNLV 508



 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 22/95 (23%), Positives = 51/95 (53%), Gaps = 3/95 (3%)

Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           TT  G+++Q+    + ++ + ++ +     +     +   +   ++ + +   +  LR +
Sbjct: 417 TTSAGQYDQVLTYRELDDLDPRESRYVVTVRNSPTSRALARHDIRRMTQQR--VMTLRRS 474

Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
           A+++CG ++   C P +  C++DI +DPCEKNNL 
Sbjct: 475 AAVRCGDLQR-GCNPLVEECVYDIWDDPCEKNNLV 508


>gi|395825538|ref|XP_003785985.1| PREDICTED: arylsulfatase B [Otolemur garnettii]
          Length = 532

 Score =  322 bits (824), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 156/329 (47%), Positives = 215/329 (65%), Gaps = 16/329 (4%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPH++F+LADDLGWNDVGFHG   I TP++DALA  G++L NYYT  LCTPSRS ++TG+
Sbjct: 43  PPHLVFVLADDLGWNDVGFHG-SSIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGR 101

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + I  G+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT RGF
Sbjct: 102 YQIRMGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGF 161

Query: 179 ESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           +++ GY  G +DY+ H      +A  +    LD R   E A      YST++FT  A  +
Sbjct: 162 DTYFGYLLGSEDYYSHERCTLINALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATAL 221

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  +DE++
Sbjct: 222 ITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAI 276

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG G + 
Sbjct: 277 GNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVA 332

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           SPLL+ +G+   + +H+SDWLPTL+  A 
Sbjct: 333 SPLLKQKGVKNRELIHISDWLPTLVKLAG 361



 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 108 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 143


>gi|194871740|ref|XP_001972898.1| GG15780 [Drosophila erecta]
 gi|190654681|gb|EDV51924.1| GG15780 [Drosophila erecta]
          Length = 565

 Score =  322 bits (824), Expect = 8e-85,   Method: Compositional matrix adjust.
 Identities = 162/335 (48%), Positives = 219/335 (65%), Gaps = 6/335 (1%)

Query: 52  DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
           D  A++  P+II I+ADD+G++DV F G  +  TPNIDALAY G +L  +Y   +CTPSR
Sbjct: 19  DESAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSR 78

Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
            A+++G++PIHTG QH V+   E   L L+  ++P+  K  GY T +VGKWHLGF + EY
Sbjct: 79  GALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEY 138

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKM----WGLDMRRDLEPAWDLHGKYSTDVFTA 227
           TPT RGF+ H GYW  + DYF   + +M +     G D RR++E      G Y TD+ TA
Sbjct: 139 TPTRRGFDYHFGYWGAYIDYFQRRS-KMPVANYSLGYDFRRNMELECRDRGAYVTDLLTA 197

Query: 228 EAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
           EA  +I +H+  E PLFL L+H A H+AN  +PLQAP+  +    +I+D  R K+AAI+ 
Sbjct: 198 EAERLIKDHADKEQPLFLMLSHLAAHTANKDDPLQAPEEEIQKFAYIKDPNRRKYAAIIS 257

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
           KLD+SVG+++  L     L NSI++F SDNG  + G   N  SN+PLRG KNT WEGGVR
Sbjct: 258 KLDQSVGRIMSTLAATDQLENSIVIFYSDNGAPSVGMFANTGSNFPLRGQKNTPWEGGVR 317

Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            AG IWS  L++RG +  Q ++V+DWLPTL  AA 
Sbjct: 318 VAGAIWSSKLQARGSIFSQPLYVADWLPTLSHAAG 352



 Score = 69.7 bits (169), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 62/139 (44%), Gaps = 23/139 (16%)

Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM--------- 622
           ++DGID+W  LS   + P     ILH +DD W+++AL  G+WK V               
Sbjct: 360 KLDGIDLWPELSGSADAPHVPREILHILDDIWRVAALQLGQWKYVNGTTAAGRYDSLLTY 419

Query: 623 -----------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
                      RY V +        LS    R      +   R  A+++CG ++   C P
Sbjct: 420 RELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISITRRLAAVRCGDLQR-SCNP 478

Query: 672 QIAPCLFDIKNDPCEKNNL 690
            +  CL+DI  DPCE+NNL
Sbjct: 479 LLEECLYDISTDPCEQNNL 497



 Score = 48.1 bits (113), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 21/94 (22%), Positives = 47/94 (50%), Gaps = 3/94 (3%)

Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           TT  GR++ +    + ++ + ++ +     +     +   +   ++ + +   +   R  
Sbjct: 407 TTAAGRYDSLLTYRELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISI--TRRL 464

Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 796
           A+++CG ++   C P +  CL+DI  DPCE+NNL
Sbjct: 465 AAVRCGDLQR-SCNPLLEECLYDISTDPCEQNNL 497


>gi|348535399|ref|XP_003455188.1| PREDICTED: arylsulfatase B [Oreochromis niloticus]
          Length = 519

 Score =  322 bits (824), Expect = 8e-85,   Method: Compositional matrix adjust.
 Identities = 154/330 (46%), Positives = 218/330 (66%), Gaps = 16/330 (4%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PH++FILADD GW D+G+HG  +I TPN+D L+  G+ L+NYY   LCTPSR+ +MTG++
Sbjct: 30  PHVVFILADDFGWYDIGYHG-SEIRTPNLDKLSAGGVRLENYYVQPLCTPSRNQLMTGRY 88

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTGMQH +++ C+   +PL EK+LPQ +KE GY T +VGKWHLG YKK+  PT RGF+
Sbjct: 89  QIHTGMQHQIIWPCQPYCVPLDEKLLPQLMKEAGYATHMVGKWHLGMYKKDCLPTRRGFD 148

Query: 180 SHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
           ++LGY TG +DYF H       +  +    LD+R   E A    G YST++ +  A+ II
Sbjct: 149 TYLGYLTGSEDYFTHFRCYQSPSLNLSRCALDLRDGEEVATGYKGVYSTELLSQRAISII 208

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
             H + +PLF+Y+A  A H+     PLQ P+ Y+  +  I+D  R  +A ++  +DE+VG
Sbjct: 209 ERHISQKPLFMYVALQAVHA-----PLQVPERYVTPYSFIKDTNRRLYAGMVSAMDEAVG 263

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
            +  AL++  + +N+++VF +DNGG      L+  SNWPLRG K +LWEGGVRG G + S
Sbjct: 264 NITMALQKEGLWNNTVLVFSTDNGGQ----TLSGGSNWPLRGRKWSLWEGGVRGVGFVAS 319

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           PLL+  G V  + +H+SDWLPTL+  A  S
Sbjct: 320 PLLKQPGTVNRELIHISDWLPTLVGLAGGS 349



 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH +++ C+   +PL EK+LPQ +KE GY T ++ 
Sbjct: 94  MQHQIIWPCQPYCVPLDEKLLPQLMKEAGYATHMVG 129


>gi|291190498|ref|NP_001167123.1| Arylsulfatase B precursor [Salmo salar]
 gi|223648254|gb|ACN10885.1| Arylsulfatase B precursor [Salmo salar]
          Length = 528

 Score =  322 bits (824), Expect = 8e-85,   Method: Compositional matrix adjust.
 Identities = 156/331 (47%), Positives = 214/331 (64%), Gaps = 16/331 (4%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHI+FILADD GWNDVG+HG  +I TPN+D L+  G+ L+NYY   LCTPSR+ +MTG+
Sbjct: 28  PPHIVFILADDFGWNDVGYHG-SEIKTPNLDKLSAKGVRLENYYVQPLCTPSRNQLMTGR 86

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + I TGMQH +++ C+   +PL EK+LPQ ++E GY T +VGKWHLG Y+K+  PT RGF
Sbjct: 87  YQIRTGMQHQIIWPCQPYCVPLDEKLLPQLMREAGYATHMVGKWHLGMYRKDCLPTRRGF 146

Query: 179 ESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           +S+ GY TG +DYF H          +    +D+R   E A    G YST +FT     I
Sbjct: 147 DSYFGYLTGSEDYFSHQRCSYVPPLNVTRCAVDLREGEEVATGYTGTYSTQLFTDRVTSI 206

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I   ++ +PLFLY+A  A H+     PLQ P+ Y+  +  I+D  R  +A ++  +DE+V
Sbjct: 207 IAKQNSKKPLFLYVALQAVHA-----PLQVPERYVAPYSFIKDPNRRLYAGMVSAMDEAV 261

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G +  AL++  +  N+++VF +DNGG      L   SNWPLRG K +LWEGGVRG G + 
Sbjct: 262 GNITLALQETGLWDNTVLVFSTDNGGQ----TLAGGSNWPLRGRKWSLWEGGVRGVGFVT 317

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           SPLL+  G V+ Q +H+SDWLPTL+  A  S
Sbjct: 318 SPLLKQPGTVSRQLIHISDWLPTLVGLAGGS 348



 Score = 40.8 bits (94), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH +++ C+   +PL EK+LPQ ++E GY T ++ 
Sbjct: 93  MQHQIIWPCQPYCVPLDEKLLPQLMREAGYATHMVG 128


>gi|270008609|gb|EFA05057.1| hypothetical protein TcasGA2_TC015152 [Tribolium castaneum]
          Length = 563

 Score =  321 bits (823), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 163/368 (44%), Positives = 227/368 (61%), Gaps = 14/368 (3%)

Query: 25  LKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGP-----PHIIFILADDLGWNDVGFHG 79
           +K L     I+   V+     L + F+  + SS P     P+II I+ADDLG+NDV FHG
Sbjct: 1   MKRLKKSQNIVIATVMVCVLGLLLYFI--LTSSKPSTAKKPNIIIIIADDLGYNDVSFHG 58

Query: 80  LDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLP 139
             QIPTPN+  +A  GIIL  +YT   CTPSR+A++TG++PI +GMQ   L   E   LP
Sbjct: 59  SSQIPTPNLAKMATRGIILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPLKAGENRSLP 118

Query: 140 LSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEE 198
           L+   +P + + LGY+T +VGKWHLG   KE TP  +GF+SH GYW G   YFD+ S  +
Sbjct: 119 LNMPTMPLHFQNLGYKTHLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVGYFDYVSFSK 178

Query: 199 MK----MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA 254
           M     + GLD+    EP W   G+Y+T++FT  ++D+I  H    PLFL ++H A H+ 
Sbjct: 179 MDNGTLVKGLDLHDQFEPVWGSQGRYATELFTERSLDVIEGHDVRVPLFLVVSHLAAHTG 238

Query: 255 NPYEPLQAPDHYLNIHR--HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVF 312
                L  PD     H   +I+D +R  +A ++  LD S+G+++  L++++ML NSI++F
Sbjct: 239 QNGSELGVPDVDQTNHEFSYIQDPRRRLYAGVVSHLDASIGRIMAKLDEKQMLDNSIVLF 298

Query: 313 VSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDW 372
            SDNG    G   N+ SNWPLRGVK + +EGGVR A  I+SPL   +G V+E  +H+SDW
Sbjct: 299 FSDNGAQTVGMYENSGSNWPLRGVKFSDFEGGVRVAATIYSPLFHKKGYVSEHLIHISDW 358

Query: 373 LPTLLSAA 380
           LPTL SAA
Sbjct: 359 LPTLYSAA 366



 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 48/169 (28%), Positives = 73/169 (43%), Gaps = 14/169 (8%)

Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVD 627
           D ++  +IDGID W  L+ N PS R  IL NID+  +  A+ + K+KL++      YQ  
Sbjct: 369 DVAHLGQIDGIDQWDALTNNNPSNRTEILINIDEVDENFAIIRDKFKLIQGTPNYYYQQT 428

Query: 628 LT----------GGPDQVYL-SGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPC 676
           L           G  DQ Y  SG               D +   C    + P       C
Sbjct: 429 LLFNCKSGSYHEGTFDQYYGDSGRGPENPTPNPNHTTTDLS--WCRAPDQTPILNCTKGC 486

Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKK 725
           LFD+  DPCE  N+ + SE +  N    ++ +F +   P + ++ + K 
Sbjct: 487 LFDLDKDPCETTNIIE-SEPEIANQLYEKIAQFWKELVPQRNKDTDPKS 534



 Score = 45.4 bits (106), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 32/50 (64%)

Query: 503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVK 552
           D ++  +IDGID W  L+ N PS R  IL NID+  +  A+ R K+KL++
Sbjct: 369 DVAHLGQIDGIDQWDALTNNNPSNRTEILINIDEVDENFAIIRDKFKLIQ 418


>gi|157831133|pdb|1FSU|A Chain A, 4-Sulfatase (Human)
          Length = 492

 Score =  321 bits (823), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 158/330 (47%), Positives = 217/330 (65%), Gaps = 16/330 (4%)

Query: 57  SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
           S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  L TPSRS ++T
Sbjct: 1   SRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLXTPSRSQLLT 59

Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           G++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT R
Sbjct: 60  GRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRR 119

Query: 177 GFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           GF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT  A+
Sbjct: 120 GFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAI 179

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
            +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  +DE
Sbjct: 180 ALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDE 234

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
           +VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG G 
Sbjct: 235 AVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGF 290

Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 291 VASPLLKQKGVKNRELIHISDWLPTLVKLA 320



 Score = 42.7 bits (99), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 68  LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 103


>gi|194919176|ref|XP_001983034.1| GG19815 [Drosophila erecta]
 gi|190647645|gb|EDV45033.1| GG19815 [Drosophila erecta]
          Length = 565

 Score =  321 bits (822), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 162/335 (48%), Positives = 219/335 (65%), Gaps = 6/335 (1%)

Query: 52  DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
           D  A++  P+II I+ADD+G++DV F G  +  TPNIDALAY G +L  +Y   +CTPSR
Sbjct: 19  DESAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSR 78

Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
            A+++G++PIHTG QH V+   E   L L+  ++P+  K  GY T +VGKWHLGF + EY
Sbjct: 79  GALLSGRYPIHTGTQHFVISNKEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEY 138

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKM----WGLDMRRDLEPAWDLHGKYSTDVFTA 227
           TPT RGF+ H GYW  + DYF   + +M +     G D RR++E      G Y TD+ TA
Sbjct: 139 TPTRRGFDYHFGYWGAYIDYFQRRS-KMPVANYSLGYDFRRNMELECRDRGAYVTDLLTA 197

Query: 228 EAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
           EA  +I +H+  E PLFL L+H A H+AN  +PLQAP+  +    +I+D  R K+AAI+ 
Sbjct: 198 EAERLIKDHADKEQPLFLMLSHLAAHTANKDDPLQAPEEEIQKFAYIKDPNRRKYAAIIS 257

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
           KLD+SVG+++  L     L NSI++F SDNG  + G   N  SN+PLRG KNT WEGGVR
Sbjct: 258 KLDQSVGRIMSTLAATDQLENSIVIFYSDNGAPSLGMFANTGSNFPLRGQKNTPWEGGVR 317

Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            AG IWS  L++RG +  Q ++V+DWLPTL  AA 
Sbjct: 318 VAGAIWSSKLQARGSIFSQPLYVADWLPTLSHAAG 352



 Score = 69.7 bits (169), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 62/139 (44%), Gaps = 23/139 (16%)

Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM--------- 622
           ++DGID+W  LS   + P     ILH +DD W+++AL  G+WK V               
Sbjct: 360 KLDGIDLWPELSGSADAPHVPREILHILDDIWRVAALQLGQWKYVNGTTAAGRYDSLLTY 419

Query: 623 -----------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
                      RY V +        LS    R      +   R  A+++CG ++   C P
Sbjct: 420 RELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISITRRLAAVRCGDLQR-SCNP 478

Query: 672 QIAPCLFDIKNDPCEKNNL 690
            +  CL+DI  DPCE+NNL
Sbjct: 479 LLEECLYDISTDPCEQNNL 497



 Score = 47.8 bits (112), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 21/94 (22%), Positives = 47/94 (50%), Gaps = 3/94 (3%)

Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           TT  GR++ +    + ++ + ++ +     +     +   +   ++ + +   +   R  
Sbjct: 407 TTAAGRYDSLLTYRELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISI--TRRL 464

Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 796
           A+++CG ++   C P +  CL+DI  DPCE+NNL
Sbjct: 465 AAVRCGDLQR-SCNPLLEECLYDISTDPCEQNNL 497


>gi|195328473|ref|XP_002030939.1| GM24306 [Drosophila sechellia]
 gi|194119882|gb|EDW41925.1| GM24306 [Drosophila sechellia]
          Length = 554

 Score =  321 bits (822), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 160/334 (47%), Positives = 218/334 (65%), Gaps = 6/334 (1%)

Query: 52  DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
           D  A++  P+II I+ADD+G++DV F G  +  TPNIDALAY G +L   Y   +CTPSR
Sbjct: 19  DQSAAARRPNIIIIMADDMGFDDVSFRGGREFVTPNIDALAYHGRLLDRLYAPAMCTPSR 78

Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
            A+++G++PIHTG QH V+   E   L L+  ++P+  KE GY T +VGKWHLGF + EY
Sbjct: 79  GALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEY 138

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKM----WGLDMRRDLEPAWDLHGKYSTDVFTA 227
           TPT RGF+ H GYW  + DYF   + +M +     G D RR+++      G Y TD+ T 
Sbjct: 139 TPTRRGFDYHFGYWGAYIDYFQRRS-KMPVANYSLGYDFRRNMDLECRDRGVYVTDLLTT 197

Query: 228 EAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
           EA  +I +H+  E PLFL L+H A H+AN  +PLQAP+  +    +I+D  R K+AA++ 
Sbjct: 198 EAERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMIS 257

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
           KLD+SVG+++  L     L NSI++F SDNG  + G   N  SN+PLRG KNT WEGGVR
Sbjct: 258 KLDQSVGRIISTLSSTDQLDNSIVIFYSDNGAPSVGMFSNTGSNFPLRGQKNTPWEGGVR 317

Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            AG IWS  L++RG +  Q ++V+DWLPTL  AA
Sbjct: 318 VAGAIWSSKLQARGSIFSQPLYVADWLPTLSRAA 351



 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 44/140 (31%), Positives = 62/140 (44%), Gaps = 23/140 (16%)

Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKV---------- 621
           ++DGID+W  LS   + P     ILH +DD W++SAL  G+WK V               
Sbjct: 360 KLDGIDLWPELSGSADAPHVPREILHILDDVWRLSALQMGQWKYVNGTTADGRYDSVLTY 419

Query: 622 ----------MRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
                      RY V +        LS    R      +   R  A+++CG ++   C P
Sbjct: 420 RELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISLTRRLAAVRCGDLQR-SCNP 478

Query: 672 QIAPCLFDIKNDPCEKNNLA 691
            +  CL+DI  DPCE+NNL 
Sbjct: 479 LLEECLYDISTDPCEQNNLV 498



 Score = 47.4 bits (111), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 21/95 (22%), Positives = 47/95 (49%), Gaps = 3/95 (3%)

Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           TT  GR++ +    + ++ + ++ +     +     +   +   ++ + +   +   R  
Sbjct: 407 TTADGRYDSVLTYRELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISL--TRRL 464

Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
           A+++CG ++   C P +  CL+DI  DPCE+NNL 
Sbjct: 465 AAVRCGDLQR-SCNPLLEECLYDISTDPCEQNNLV 498


>gi|156547171|ref|XP_001603886.1| PREDICTED: arylsulfatase B [Nasonia vitripennis]
          Length = 581

 Score =  320 bits (821), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 163/380 (42%), Positives = 241/380 (63%), Gaps = 23/380 (6%)

Query: 40  LPLAFTLSMVFVDLVASSG------------PPHIIFILADDLGWNDVGFHGLDQIPTPN 87
           LP  F + +  V +V   G            PPHI+ ILADD+GWNDV FHG ++IPTPN
Sbjct: 3   LPYKFMMILGGVAIVCFLGYGTLAWGKAIPLPPHIVIILADDMGWNDVSFHGANEIPTPN 62

Query: 88  IDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQ 147
           IDALAY+G+IL  YYT+ +CTPSRSA+MTG++PI  GMQ   +   E  G+PL+  ++P+
Sbjct: 63  IDALAYNGVILNKYYTMPICTPSRSALMTGRYPIRDGMQGTPMRPAEPRGIPLNVSLMPE 122

Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA---EEMKMWGL 204
            ++ LGY TR+VGKWHLG+  ++YTP  RGF++  GY+ G   Y+D+     +  ++ G 
Sbjct: 123 QMRRLGYETRLVGKWHLGYTTEDYTPVRRGFDTFFGYYNGFISYYDYWIGWNDTNEVTGY 182

Query: 205 DMRRDLEPAWDL--HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY--EPL 260
           D+ RD   +++L    +Y TD+ T EA  II N+   +PLFL ++H A H+ +    +PL
Sbjct: 183 DLHRDESDSFELAHSSEYFTDLITDEAEKIIRNNKNAKPLFLEISHLAVHAGSKVHDDPL 242

Query: 261 QA--PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
           +    D       +IED++  K+A ++  LDESVG+VV+AL++  ML NSII+F+SDNG 
Sbjct: 243 EVRRTDDVNASFPYIEDYQHRKYAGMMAALDESVGRVVKALKEAEMLENSIIIFMSDNGA 302

Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
              G   N  SN+P+RG+K  ++EG  R A  I+SPL+++   V+E+ +H+ DWLPTL +
Sbjct: 303 PTVGLYNNTGSNYPMRGIKGGMFEGAARAAACIFSPLIKAHSRVSEELMHIVDWLPTLYT 362

Query: 379 AANKSDIPNYVNSTVENIIP 398
           AA  +  P  + S  +  +P
Sbjct: 363 AAGGN--PMDLQSQFDGALP 380



 Score = 44.3 bits (103), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 52/211 (24%), Positives = 73/211 (34%), Gaps = 55/211 (26%)

Query: 484 IPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL 543
           +P L  +  GN        D +    +DG+  WS +    PS R ++L NID+     A 
Sbjct: 357 LPTLYTAAGGNPMDLQSQFDGAL--PLDGVSQWSSIVAGGPSSRQSLLVNIDEAQGFEAA 414

Query: 544 TRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEW 603
             G+ KLVK  +            D  Y N           S N+PS             
Sbjct: 415 IIGRHKLVKGMT----------KEDGYYGN-----------SGNDPS------------- 440

Query: 604 QISALTKGKWKLVKVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGP 663
                    +    V KV+      T G     L+G +     A     LR  + I C P
Sbjct: 441 ---------FPAYNVKKVLSS----TAGASIGKLAGFASPS--ARRALWLRQKSVITCKP 485

Query: 664 VKEVP-CEPQIAPCLFDIKNDPCEKNNLADR 693
                 C      CLFD+  DPCE  +L+ +
Sbjct: 486 FTSAANCS---GTCLFDLSKDPCETRDLSSK 513


>gi|363744029|ref|XP_003642960.1| PREDICTED: arylsulfatase B [Gallus gallus]
          Length = 514

 Score =  320 bits (821), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 153/334 (45%), Positives = 221/334 (66%), Gaps = 16/334 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
            A+  PPH++ +LADDLGW DVG+HG   I TP +DAL   G+ L+ YYT  LCTPSRS 
Sbjct: 18  AAARPPPHLVLVLADDLGWGDVGWHG-SAIRTPRLDALGAGGVRLERYYTQPLCTPSRSQ 76

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           +++G++ IHTG+QH +++ C+   LPL EK+LP+ LK+ GY T +VGKWHLG Y+KE  P
Sbjct: 77  LLSGRYQIHTGLQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLP 136

Query: 174 TFRGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A+ +    LD R   E A      YST++FT 
Sbjct: 137 TRRGFDTYFGYLLGSEDYYSHDHCVLIKAKNVTRCALDFRDGEEVATGFKNMYSTNLFTE 196

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+D+I NH T++PLFLYLA  + H     EPL+    Y+  +  I+D KR ++A ++  
Sbjct: 197 RAIDLIANHKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRRRYAGMVSL 251

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG + +AL++  + +N+++VF +DNGG      +   +NWPLRG K TLWEGGVRG
Sbjct: 252 MDEAVGNLTDALKEYGLWNNTVLVFSTDNGGQ----TMAGGNNWPLRGRKWTLWEGGVRG 307

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            G + SPLL+ +G+ + + +H+SDWLPTL+  A 
Sbjct: 308 VGFVASPLLKQKGVESHELIHISDWLPTLVHLAG 341



 Score = 40.4 bits (93), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   LPL EK+LP+ LK+ GY T ++ 
Sbjct: 88  LQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVG 123


>gi|307215080|gb|EFN89887.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 593

 Score =  320 bits (821), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 160/355 (45%), Positives = 227/355 (63%), Gaps = 16/355 (4%)

Query: 31  RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
           R+ ++    + L F  ++ F D       PHII  + DDLGWNDV FHG +QIPTPNIDA
Sbjct: 8   RSNLLVLCCI-LLFGDALGFYDTA-----PHIIVFMVDDLGWNDVSFHGSNQIPTPNIDA 61

Query: 91  LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           L Y+GIIL  +Y +   TPSR+A  TG +PI  GMQ   + G E  GLPL+ +ILP+YL+
Sbjct: 62  LGYNGIILNRHYVLPSSTPSRAAFFTGLYPIRIGMQGEGIQGGEPRGLPLNIRILPEYLR 121

Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
            LGY T+++GKWH+G++  ++TP  RGF++ LG++  H  Y+D+      M G DM R  
Sbjct: 122 GLGYTTKLIGKWHVGYHTPQHTPLHRGFDAFLGFYNSHVSYYDYRYSYQNMSGYDMHRGD 181

Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP-DHYLNI 269
            PA+ L+ +Y+TD+FT EA+ II  H    PL+L ++H A H+     P+++P D + N 
Sbjct: 182 NPAYGLNAEYATDLFTDEAMKIIQRHEPPRPLYLQISHLAVHA-----PIESPDDDHRNS 236

Query: 270 HR----HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
           +R    HI +  R  +A+++ +LDESVG++V  L  R ML +S+I+F++DNG    G   
Sbjct: 237 NRERFKHIPEVNRRNYASMVTRLDESVGRIVHTLGNRGMLRDSLILFLTDNGAPTIGKFR 296

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           N  SN PLRG+K TL+EGGVRG  ++WSP L     V  + VH++DWLPT  SAA
Sbjct: 297 NWGSNLPLRGMKYTLYEGGVRGVAVLWSPRLRKAARVCNELVHITDWLPTFYSAA 351


>gi|195021979|ref|XP_001985494.1| GH14469 [Drosophila grimshawi]
 gi|193898976|gb|EDV97842.1| GH14469 [Drosophila grimshawi]
          Length = 560

 Score =  320 bits (820), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 155/329 (47%), Positives = 217/329 (65%), Gaps = 4/329 (1%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           +S  P+II ++ADD+G++D+   G  +  TPNIDALAY G +L   Y   +CTPSR A++
Sbjct: 29  ASKLPNIIIVMADDMGFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGALL 88

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           +GK+PIHTG QH+V+   E  GL L+  ++P+  ++ GY T +VGKWHLGF + EYTPT+
Sbjct: 89  SGKYPIHTGTQHSVILNEEPWGLALNATLMPEIFRDAGYSTNLVGKWHLGFVRPEYTPTY 148

Query: 176 RGFESHLGYWTGHQDYFDHSAE---EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           RGF+ H GYW G+ DY+   ++   +    G D RR+++      G Y TD+ T EA  I
Sbjct: 149 RGFDYHYGYWGGYIDYYQRRSQMPSDNYSMGYDFRRNMQVECTDRGVYVTDLLTNEAERI 208

Query: 233 IHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I    + ++PLFL L+H A H+ N  +PLQAP+  L    HI D  R  +AA++ KLDES
Sbjct: 209 IREQPAKEQPLFLILSHLAPHTGNEIDPLQAPEEELQKFVHINDPNRRLYAAMVSKLDES 268

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           VG+V+ AL +   L NSI++F SDNG  + G   N  SNWPL+G K++ WEGG+R AG I
Sbjct: 269 VGRVITALARTDQLENSIVIFYSDNGAPSVGIFANTGSNWPLKGQKDSPWEGGLRVAGAI 328

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           WSPLL++RG +  Q ++V D+LPTL  AA
Sbjct: 329 WSPLLKARGNLFTQPIYVGDFLPTLAHAA 357


>gi|241598569|ref|XP_002404905.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215502397|gb|EEC11891.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 533

 Score =  320 bits (820), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 165/397 (41%), Positives = 234/397 (58%), Gaps = 30/397 (7%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           + F L  + + + AS  PP+IIF+L DDLGW+DV FHG  QIPTPN+D LA  G+IL NY
Sbjct: 8   MVFLLIFLALGVSASKKPPNIIFMLVDDLGWDDVSFHGSPQIPTPNMDVLAGDGVILNNY 67

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y    CTPSR+A+MTG +PIH G+Q  V+   +  GLPL  K++P++ K++GY T ++GK
Sbjct: 68  YVQHFCTPSRAALMTGLYPIHNGLQDFVIDVAQPYGLPLYLKVMPEFFKDMGYETHMIGK 127

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           WHLG+++KEYTPT+RGF+S  GY+ G +DY++HS  ++     ++R+       +     
Sbjct: 128 WHLGYFRKEYTPTYRGFDSFYGYYNGAEDYYNHSITKVISQSYNIRQGSVTKKRIENYIK 187

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
                  ++    +    +PLFLYLA+ + H      PL+AP+  +    +I +  R+ F
Sbjct: 188 NTELVLLSLTFYLSILFSQPLFLYLAYQSVHG-----PLEAPEENIMKFPYIGEENRTIF 242

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
           A  +  LD+SVG+V+E L +  ML N+IIV  +DNGG   G   N   N+PLRG K TLW
Sbjct: 243 AGAVDALDQSVGEVLETLYEADMLDNTIIVLSTDNGGLPFGVQSNRGYNFPLRGAKGTLW 302

Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN-------------------- 381
           EGG RG+  IWSPLL     V++Q +H++DWLPTL SAA                     
Sbjct: 303 EGGTRGSAFIWSPLLNQNRRVSDQMLHITDWLPTLYSAAGMRGNPVSLGKVDGYDMWTQL 362

Query: 382 KSDIPNYVNSTVENIIPRYENSILRYEN-----GTHE 413
             ++P+     + NI P   NS LRY N     GT+E
Sbjct: 363 SYNLPSSRYEVLYNIDPTDLNSALRYTNYKLVLGTYE 399



 Score = 46.2 bits (108), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 54/207 (26%), Positives = 75/207 (36%), Gaps = 62/207 (29%)

Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSND 568
           ++DG D+W+ LS N PS R  +L+NID     SAL    +KLV        GT E+   D
Sbjct: 352 KVDGYDMWTQLSYNLPSSRYEVLYNIDPTDLNSALRYTNYKLVL-------GTYEDGEFD 404

Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDL 628
             +              R     R          +    +   + K  +V+K        
Sbjct: 405 QRF--------------RTTGGSR---------PYGDLEIAMAQSKAARVLK-------- 433

Query: 629 TGGPDQVYLSGLSDREWLALAMRKLRDAASIQC----GPVKEVPCEPQIAPCLFDIKNDP 684
                     G  +R    L  R  R  A++ C    GP   V   P   P LFD+  DP
Sbjct: 434 ----------GFYNRGNFLLP-RNWRQKATVNCVSNNGPKNFVSQAP---PYLFDLTKDP 479

Query: 685 CEKNNLADRSED------QRINHYTTE 705
           CE NN+A           QR+  Y + 
Sbjct: 480 CELNNIASAEAGLVDFLKQRLQSYASS 506


>gi|390360370|ref|XP_791935.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 374

 Score =  320 bits (820), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 157/312 (50%), Positives = 204/312 (65%), Gaps = 1/312 (0%)

Query: 71  GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
           GW+DV  HG  QI TPNID LA  G+ L NYY   +CTP+RSAIMTGKHPIHTGMQH+ +
Sbjct: 43  GWDDVSLHGSSQILTPNIDTLAQEGVTLTNYYVSPICTPTRSAIMTGKHPIHTGMQHDTI 102

Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
              E  GL L EK + Q+LK LGY T  VGKWHLG++ ++Y PT RGF+S  GY+ G  D
Sbjct: 103 GADEPWGLGLDEKTMAQHLKSLGYSTHAVGKWHLGYFAEDYIPTRRGFDSFFGYYNGRGD 162

Query: 191 YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAA 250
           Y+ H   E    G D+ ++ E  W   G+YST++FT EA  II  H+  +PLFLYLAH A
Sbjct: 163 YYTHEDTEGGFGGYDLHKNGEVHWPDFGQYSTEIFTEEAQQIIKTHNASQPLFLYLAHQA 222

Query: 251 THSANPYEPL-QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSI 309
            H+    + L +AP  Y  +  +I+   R  FAA++  LD+SVG + + L+   + +NSI
Sbjct: 223 VHAGVYGKDLVEAPHKYYQMFPNIKTEGRRMFAAMVAGLDDSVGNISQTLQDAGLYNNSI 282

Query: 310 IVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHV 369
           IVF +DNGG   G++ N ASNWPLRG K+TLWEGGVRG   + SPL+E     +++ +HV
Sbjct: 283 IVFTTDNGGPTNGYDGNHASNWPLRGCKHTLWEGGVRGTAFVNSPLIEKPRRFSDRMMHV 342

Query: 370 SDWLPTLLSAAN 381
            DWLPTL   A 
Sbjct: 343 CDWLPTLYGVAG 354


>gi|193641058|ref|XP_001942872.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 575

 Score =  320 bits (820), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 170/364 (46%), Positives = 228/364 (62%), Gaps = 22/364 (6%)

Query: 32  TRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           +RI    +L +   L+  +V L      PHI+ I+ADDLGWNDVGFHG  QIPTPNIDAL
Sbjct: 6   SRITGIFILIVNLQLTD-YVLLRELKKRPHIVVIVADDLGWNDVGFHGSIQIPTPNIDAL 64

Query: 92  AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
           AY+G IL  +Y    CTPSR+A++TGK+PI  G+Q   +   +   LP +EKILPQYLKE
Sbjct: 65  AYNGAILNRHYVQPTCTPSRAALLTGKYPIRYGLQGPPIASGKASALPTNEKILPQYLKE 124

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
           LGY T +VGKWHLG Y+K +TPT RGF+SH GYW G+  Y + +     M G+D RR  E
Sbjct: 125 LGYSTHLVGKWHLGHYQKRFTPTKRGFDSHFGYWNGYISYRNSTHATRTMSGIDARRGFE 184

Query: 212 PAWDL--HGKYSTDVFTAEAVDIIHNHSTDEP-LFLYLAHAATHSANPYEPLQAPDHYLN 268
            A +     +Y+TDVFT EA  II +   +   +FL ++H A HS N       P+H   
Sbjct: 185 RAGNEMDRDRYATDVFTEEARKIIESSKRENTEMFLMVSHLAVHSGN-----SGPNHLEV 239

Query: 269 IHR--------HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
           +++        +IE+  R  +A ++  LD+SVGK++E+L +  ML NSI+VF+SDNG   
Sbjct: 240 LNKTYNDEAFGYIENENRRLYAGMVTSLDDSVGKIIESLHEHEMLENSIVVFISDNGAPT 299

Query: 321 A----GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
                G + N  SNWPLRG K ++ EGGVRG   IWSP L+ +  V+    H++DWLPTL
Sbjct: 300 DDPLWGHD-NFGSNWPLRGKKASVLEGGVRGVAAIWSPWLKEKNTVSNNLFHITDWLPTL 358

Query: 377 LSAA 380
            +AA
Sbjct: 359 YTAA 362


>gi|195591175|ref|XP_002085318.1| GD12374 [Drosophila simulans]
 gi|194197327|gb|EDX10903.1| GD12374 [Drosophila simulans]
          Length = 554

 Score =  320 bits (819), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 160/336 (47%), Positives = 217/336 (64%), Gaps = 10/336 (2%)

Query: 52  DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
           D  A++  P+II I+ADD+G++DV F G  +  TPNIDALAY G +L   Y   +CTPSR
Sbjct: 19  DQSAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSR 78

Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
            A+++G++PIHTG QH V+   E   L L+  ++P+  KE GY T +VGKWHLGF + EY
Sbjct: 79  GALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEY 138

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKM------WGLDMRRDLEPAWDLHGKYSTDVF 225
           TPT RGF+ H GYW  + DYF   +   KM       G D RR+++      G Y TD+ 
Sbjct: 139 TPTRRGFDYHFGYWGAYIDYFQRRS---KMPVANYSLGYDFRRNMDLECRDRGVYVTDLL 195

Query: 226 TAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAI 284
           T EA  +I +H+  E PLFL L+H A H+AN  +PLQAP+  +    +I+D  R K+AA+
Sbjct: 196 TTEAERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDSNRRKYAAM 255

Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
           + KLD+SVG+++  L     L NSI++F SDNG  + G   N  SN+PLRG KNT WEGG
Sbjct: 256 ISKLDQSVGRIISTLSSTDQLENSIVIFYSDNGAPSVGMFSNTGSNFPLRGQKNTPWEGG 315

Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           VR AG IW+  L++RG +  Q ++V+DWLPTL  AA
Sbjct: 316 VRVAGAIWNSKLQARGSIFSQPLYVADWLPTLSRAA 351



 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 44/140 (31%), Positives = 62/140 (44%), Gaps = 23/140 (16%)

Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM--------- 622
           ++DGID+W  LS   + P     ILH +DD W++SAL  G+WK V               
Sbjct: 360 KLDGIDLWPELSGTADAPHVPREILHILDDVWRLSALQMGQWKYVNGTTAAGRYDSVLTY 419

Query: 623 -----------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
                      RY V +        LS    R      +   R  A+++CG ++   C P
Sbjct: 420 RELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISLTRRLAAVRCGDLQR-SCNP 478

Query: 672 QIAPCLFDIKNDPCEKNNLA 691
            +  CL+DI  DPCE+NNL 
Sbjct: 479 LLEECLYDISTDPCEQNNLV 498



 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 21/95 (22%), Positives = 47/95 (49%), Gaps = 3/95 (3%)

Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           TT  GR++ +    + ++ + ++ +     +     +   +   ++ + +   +   R  
Sbjct: 407 TTAAGRYDSVLTYRELDDLDPRESRYAVTVRNSATSRALSRYDLRRLTQQRISL--TRRL 464

Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
           A+++CG ++   C P +  CL+DI  DPCE+NNL 
Sbjct: 465 AAVRCGDLQR-SCNPLLEECLYDISTDPCEQNNLV 498


>gi|260794561|ref|XP_002592277.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
 gi|229277493|gb|EEN48288.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
          Length = 598

 Score =  320 bits (819), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 161/363 (44%), Positives = 236/363 (65%), Gaps = 20/363 (5%)

Query: 31  RTRIMAFAVLPLAFTL---SMVFVDLV-ASSGPPHIIFILADDLGWNDVGFHGLDQIPTP 86
           R R++A +V     ++    +V+ D+  +SSG P+I+FILADD GWND+G+HG   I TP
Sbjct: 89  RCRVVAVSVSYFQKSVRRQDVVYSDIQESSSGKPNIVFILADDYGWNDIGYHG-SVIRTP 147

Query: 87  NIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILP 146
           N+D LA  G+ L+NYY   LC+PSR  +MTG++ I  G+QH++++  +  GLPL E  LP
Sbjct: 148 NLDRLAAEGVKLENYYVQPLCSPSRCQLMTGRYQIRYGLQHSLIWPPQPSGLPLDEVTLP 207

Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE-----EMKM 201
           Q LKE GY T IVGKWHLGFYK++YTPT RGF++  GY TG +DY+ H  +     + + 
Sbjct: 208 QRLKEGGYSTHIVGKWHLGFYKQDYTPTHRGFDTFYGYLTGAEDYWTHRQKGGLPGQPQT 267

Query: 202 W-GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPL 260
           W GLD+R    P  D +G YST +F  +A++II     ++P+FL+L+  A H     +PL
Sbjct: 268 WSGLDLRDQNRPVTDQNGTYSTHLFANKAIEIIAQQDKNKPMFLFLSFQAVH-----DPL 322

Query: 261 QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
           QAP+  ++ + HI D  R  +AA+   +D++VG V  AL+Q  +  N++++F +DNGG  
Sbjct: 323 QAPEEDISRYSHISDTNRRVYAAMTTIMDQAVGNVTRALKQYGLWDNTVLIFSTDNGGRV 382

Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
                    NWPLRG K +LWEGGVRG G + SPL++++G  ++  +H+SDW PTL+  A
Sbjct: 383 D----RGGINWPLRGWKGSLWEGGVRGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLA 438

Query: 381 NKS 383
           + S
Sbjct: 439 SGS 441


>gi|156537546|ref|XP_001607560.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 571

 Score =  320 bits (819), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 160/350 (45%), Positives = 227/350 (64%), Gaps = 20/350 (5%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
            +F L+ +F  +    G PHI+ I+ADD+GWNDVGFHG ++IPTPNIDALAY G+IL  +
Sbjct: 11  FSFILTPIFGPIY---GQPHIVVIIADDMGWNDVGFHGSNEIPTPNIDALAYGGVILNRH 67

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y +  CTPSR+A +TG+HPI  G+Q   +   E  G+PL E++LP+YL+ELGY TR+VGK
Sbjct: 68  YALPTCTPSRTAFLTGRHPIRMGLQGIPMNVAEPRGVPLHERLLPEYLRELGYVTRLVGK 127

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-- 219
           WHLG+Y  ++TPT RGF+S +GY+ G   YF+H+  + K  G+D        WD  GK  
Sbjct: 128 WHLGYYTDKHTPTRRGFDSFVGYYGGVITYFNHTVTKDKHTGIDYH------WDTSGKIE 181

Query: 220 ------YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD--HYLNIHR 271
                 Y TD  + +A  +I NH   +PLFL LAH A H++   +P++  +     +   
Sbjct: 182 PFDNDQYVTDFISDQAEAVIKNHDRKKPLFLQLAHVAAHASENRDPIEVRNMTEVNDTLS 241

Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA-GFNLNAASN 330
           +I D  R K+A ++  +D+SVG+VV+AL+   MLSNSII+F+SDNG   A     N  SN
Sbjct: 242 YIPDINRRKYAGVVTAMDDSVGRVVKALKDANMLSNSIIIFMSDNGSPTAEAPYTNYGSN 301

Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           +PLRG+K T++EGGVR    ++SP L+ R  V+++  H++DW PTL   A
Sbjct: 302 YPLRGIKATVFEGGVRVPACVFSPRLKDRFRVSDELFHITDWFPTLYKLA 351



 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 71/156 (45%), Gaps = 34/156 (21%)

Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQ-- 625
           D S   ++DG+D WS +S ++ S R ++L NID+     A   G +KL++ +   RY   
Sbjct: 354 DLSKIQDLDGVDQWSSISGSQKSNRESLLVNIDEVSNPEAAISGYYKLIRGIN--RYDDY 411

Query: 626 ------------VDLTGGPDQVYLSGLSDREWLALA---------MRKLRDAASIQCGPV 664
                        D+TG      LS L+ R   +L          + +LR+ A+++C   
Sbjct: 412 YGKDGNDYSPKTYDVTG-----VLSSLAGRAIASLGNQYLPPQKRITELRNKATLRCEKK 466

Query: 665 KEVP-CEPQIAPCLFDIKNDPCEKNNLADRSEDQRI 699
            + P C      CLFDI  DPCE  N+ D   +  I
Sbjct: 467 DDRPSCRDT---CLFDIVKDPCETTNIIDEHPEMEI 499


>gi|194748074|ref|XP_001956474.1| GF24576 [Drosophila ananassae]
 gi|190623756|gb|EDV39280.1| GF24576 [Drosophila ananassae]
          Length = 583

 Score =  319 bits (818), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 166/355 (46%), Positives = 231/355 (65%), Gaps = 5/355 (1%)

Query: 38  AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
           A++ L  T  +V +     SG P+II I+ADDLG++DV F G +   TPNIDALAYSG+I
Sbjct: 13  AIILLLTTGLVVVLAGTEISGKPNIIIIMADDLGFDDVSFRGSNNFLTPNIDALAYSGVI 72

Query: 98  LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
           L N YT  +CTPSR+A++TGK+PI+TGMQH V+   +  GLPL+E  +    +  GYRT 
Sbjct: 73  LNNLYTAPMCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPLNETTMADIFRGNGYRTS 132

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK--MWGLDMRRDLEPAWD 215
           ++GKWHLG  ++ YTPT RGF+ HLGY   + DY++ S E++     G D R +L+P  +
Sbjct: 133 LIGKWHLGMSQRNYTPTLRGFDYHLGYLGAYVDYYNQSYEQVSKGYRGHDFRENLKPNHE 192

Query: 216 LHGKYSTDVFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
              KY TD+ T  AV  I +H+     +PLFL L H A H+AN  +P+QAP   L+   +
Sbjct: 193 HVDKYVTDILTDAAVREIDDHAAKNNSKPLFLLLNHLAPHAANDADPMQAPADELSGFEY 252

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
           I D     +AA++ +LD+SVG VV+AL ++ ML NSI++F+SDNGG   G +   ASN+P
Sbjct: 253 IRDETHRYYAAMVSRLDKSVGHVVDALARQDMLQNSILLFLSDNGGPTLGEHATTASNYP 312

Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           LRG KN+ WEGG+R +  IWS   E  G V +Q +++ D LPTL +AA  S  P+
Sbjct: 313 LRGQKNSPWEGGIRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAGISPDPD 367



 Score = 66.6 bits (161), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 53/162 (32%), Positives = 75/162 (46%), Gaps = 30/162 (18%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS--ALTKGKWKLVK-VVKVMRY------Q 625
           +DG+++WS L     S    I+H ID+E      A T+GKWK+V       RY      +
Sbjct: 370 LDGLNLWSALKYGYESVEREIVHVIDEEAPEPHLAYTRGKWKVVSGTTNSGRYDGWLGER 429

Query: 626 VDLTGGP-DQVYLSGLSDRE-WLALA--------MRKLRDAASIQCGPVKE---VPCEPQ 672
           V     P  Q Y + + +   WL L         +  LR  A I+C    E    PC P 
Sbjct: 430 VTTEVDPRSQEYEALIQNTSVWLQLQKVSGGERNISGLRQQARIECPATTEDEHTPCLPL 489

Query: 673 IAPCLFDIKNDPCEKNNLADRSED--------QRINHYTTEV 706
            APCLFDI+ DPCE+NNL +  ++        QR+  +T + 
Sbjct: 490 EAPCLFDIEADPCEQNNLYEEYKNTTVFQDLWQRVQQFTAQA 531



 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 39/77 (50%), Gaps = 11/77 (14%)

Query: 747 KKYSNEEEGMRKLRDAASIQCGPVKE---VPCEPQIAPCLFDIKNDPCEKNNLAD----- 798
           +K S  E  +  LR  A I+C    E    PC P  APCLFDI+ DPCE+NNL +     
Sbjct: 455 QKVSGGERNISGLRQQARIECPATTEDEHTPCLPLEAPCLFDIEADPCEQNNLYEEYKNT 514

Query: 799 ---RSEVQRINHYTTEV 812
              +   QR+  +T + 
Sbjct: 515 TVFQDLWQRVQQFTAQA 531


>gi|91084737|ref|XP_970917.1| PREDICTED: similar to arylsulfatase B [Tribolium castaneum]
          Length = 531

 Score =  319 bits (817), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 158/351 (45%), Positives = 221/351 (62%), Gaps = 12/351 (3%)

Query: 42  LAFTLSMVFVDLVASSGP-----PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           +   L ++   ++ SS P     P+II I+ADDLG+NDV FHG  QIPTPN+  +A  GI
Sbjct: 1   MVCVLGLLLYFILTSSKPSTAKKPNIIIIIADDLGYNDVSFHGSSQIPTPNLAKMATRGI 60

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
           IL  +YT   CTPSR+A++TG++PI +GMQ   L   E   LPL+   +P + + LGY+T
Sbjct: 61  ILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPLKAGENRSLPLNMPTMPLHFQNLGYKT 120

Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMK----MWGLDMRRDLE 211
            +VGKWHLG   KE TP  +GF+SH GYW G   YFD+ S  +M     + GLD+    E
Sbjct: 121 HLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVGYFDYVSFSKMDNGTLVKGLDLHDQFE 180

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
           P W   G+Y+T++FT  ++D+I  H    PLFL ++H A H+      L  PD     H 
Sbjct: 181 PVWGSQGRYATELFTERSLDVIEGHDVRVPLFLVVSHLAAHTGQNGSELGVPDVDQTNHE 240

Query: 272 --HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
             +I+D +R  +A ++  LD S+G+++  L++++ML NSI++F SDNG    G   N+ S
Sbjct: 241 FSYIQDPRRRLYAGVVSHLDASIGRIMAKLDEKQMLDNSIVLFFSDNGAQTVGMYENSGS 300

Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           NWPLRGVK + +EGGVR A  I+SPL   +G V+E  +H+SDWLPTL SAA
Sbjct: 301 NWPLRGVKFSDFEGGVRVAATIYSPLFHKKGYVSEHLIHISDWLPTLYSAA 351



 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 44/165 (26%), Positives = 72/165 (43%), Gaps = 23/165 (13%)

Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV----KVMR 623
           D ++  +IDGID W  L+ N PS R  IL NID+  +  A+ + K+KL++         +
Sbjct: 354 DVAHLGQIDGIDQWDALTNNNPSNRTEILINIDEVDENFAIIRDKFKLIQGSYHEGTFDQ 413

Query: 624 YQVDLTGGPDQVYLS---GLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDI 680
           Y  D   GP+    +     +D  W               C    + P       CLFD+
Sbjct: 414 YYGDSGRGPENPTPNPNHTTTDLSW---------------CRAPDQTPILNCTKGCLFDL 458

Query: 681 KNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKK 725
             DPCE  N+ + SE +  N    ++ +F +   P + ++ + K 
Sbjct: 459 DKDPCETTNIIE-SEPEIANQLYEKIAQFWKELVPQRNKDTDPKS 502



 Score = 45.1 bits (105), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 32/50 (64%)

Query: 503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVK 552
           D ++  +IDGID W  L+ N PS R  IL NID+  +  A+ R K+KL++
Sbjct: 354 DVAHLGQIDGIDQWDALTNNNPSNRTEILINIDEVDENFAIIRDKFKLIQ 403


>gi|195166525|ref|XP_002024085.1| GL22750 [Drosophila persimilis]
 gi|194107440|gb|EDW29483.1| GL22750 [Drosophila persimilis]
          Length = 575

 Score =  319 bits (817), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 159/327 (48%), Positives = 213/327 (65%), Gaps = 4/327 (1%)

Query: 58  GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTG 117
           G P+II ILADD+G++DV F G  +  TPNIDALA+ G IL   Y   +CTPSR A+++G
Sbjct: 35  GRPNIIIILADDMGFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALLSG 94

Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
           ++PIHTG QH V+   E  GL L+  ++P+  ++ GY T ++GKWHLGF + EYTPT RG
Sbjct: 95  RYPIHTGTQHFVISNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTRRG 154

Query: 178 FESHLGYWTGHQDYFDHSAEEMKM---WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           F+ H GYW  + DY+   ++        G D RR++E      G Y TD+ T EA  +I 
Sbjct: 155 FDYHYGYWGAYIDYYQRRSKMPARNYSLGYDFRRNMELECRDRGVYVTDLLTNEAERVIR 214

Query: 235 NHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
                E PLFL L+H ATH+AN  +PLQAP+  +    +I+D  R K+AA++ KLD+SVG
Sbjct: 215 EREGQEQPLFLVLSHLATHTANEDDPLQAPEEEIRKFAYIKDPNRRKYAAMVSKLDQSVG 274

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
           ++V AL     L NSI++F SDNG  + G   N  SNWPLRG KNT WEGGVR AG IWS
Sbjct: 275 RIVSALNSTGQLENSIVIFYSDNGAPSVGMFANTGSNWPLRGQKNTPWEGGVRVAGAIWS 334

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA 380
             L++RG +  Q ++V+DWLP+L  AA
Sbjct: 335 AQLQARGNIFTQPIYVADWLPSLAHAA 361



 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 66/140 (47%), Gaps = 23/140 (16%)

Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV------------ 619
           E+DGID+W  L+   + P     ILH +DDEW++++   G WK +               
Sbjct: 370 ELDGIDLWPQLAGPVDAPHVPREILHILDDEWRVTSFQMGHWKYINGTTSAGQYDQVLTY 429

Query: 620 --------KVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
                   +  RY V +   P    L+    R      +  LR +A+++CG ++   C P
Sbjct: 430 RELDDLDPRESRYVVTVRNSPTSRALARHDIRRMTQQRVMTLRRSAAVRCGDLQR-GCNP 488

Query: 672 QIAPCLFDIKNDPCEKNNLA 691
            +  C++DI +DPCEKNNL 
Sbjct: 489 LVEECVYDIWDDPCEKNNLV 508



 Score = 50.1 bits (118), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 22/95 (23%), Positives = 51/95 (53%), Gaps = 3/95 (3%)

Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           TT  G+++Q+    + ++ + ++ +     +     +   +   ++ + +   +  LR +
Sbjct: 417 TTSAGQYDQVLTYRELDDLDPRESRYVVTVRNSPTSRALARHDIRRMTQQR--VMTLRRS 474

Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
           A+++CG ++   C P +  C++DI +DPCEKNNL 
Sbjct: 475 AAVRCGDLQR-GCNPLVEECVYDIWDDPCEKNNLV 508


>gi|326677480|ref|XP_003200848.1| PREDICTED: arylsulfatase B-like, partial [Danio rerio]
          Length = 358

 Score =  318 bits (816), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 153/336 (45%), Positives = 215/336 (63%), Gaps = 16/336 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+ PPH++ ILADDLGWNDVGFHG  +I TP++D LA  G+ L NYY   LCTPSR+ 
Sbjct: 17  CVSARPPHVVLILADDLGWNDVGFHG-SEIKTPHLDRLAAQGVRLDNYYVQPLCTPSRNQ 75

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           +MTG++ I TG+QH +++ C+   +PL EK+LPQ L+E GY T +VGKWHLG ++K+  P
Sbjct: 76  LMTGRYQIRTGLQHQIIWPCQPYCVPLDEKLLPQVLRERGYHTHMVGKWHLGMFQKDCLP 135

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+S  GY TG +DY+ H      +   +    LD+R     A +  G+YST++ T 
Sbjct: 136 THRGFQSFFGYLTGSEDYYTHKRCSPIAPLNVTRCALDLRDGDAVALNYSGRYSTELLTE 195

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A  II  H+ D+PLFLY+A  A H+     PLQ PDHY+  +  I+D  R ++A ++  
Sbjct: 196 RATHIITQHTPDQPLFLYVALQAVHA-----PLQVPDHYIAPYSFIQDPHRRRYAGMVSA 250

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG +   L++  +  N++++F +DNGG      L   +NWPLRG K TLWEGGVRG
Sbjct: 251 MDEAVGNITHTLQETGLWDNTVLIFSTDNGGQ----TLYGGNNWPLRGRKWTLWEGGVRG 306

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
              +  PL+E  G V  + +H+SDWLPT+   A  S
Sbjct: 307 VAFVSGPLIEKPGAVNRELIHISDWLPTIAGLAGAS 342



 Score = 39.7 bits (91), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ L+E GY T ++ 
Sbjct: 87  LQHQIIWPCQPYCVPLDEKLLPQVLRERGYHTHMVG 122


>gi|214010121|ref|NP_001135731.1| arylsulfatase B precursor [Felis catus]
 gi|461542|sp|P33727.1|ARSB_FELCA RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
           Flags: Precursor
 gi|258856|gb|AAB23941.1| arylsulfatase B [Felis catus]
          Length = 535

 Score =  318 bits (815), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 154/331 (46%), Positives = 215/331 (64%), Gaps = 16/331 (4%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPH++F+LADDLGWNDV FHG   I TP++D LA  G++L NYYT  LCTPSRS ++TG+
Sbjct: 46  PPHLVFVLADDLGWNDVSFHG-SNIRTPHLDELAAGGVLLDNYYTQPLCTPSRSQLLTGR 104

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT RGF
Sbjct: 105 YQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGF 164

Query: 179 ESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           +++ GY  G +DY+ H       +  +    LD R   + A      YST++FT  A  +
Sbjct: 165 DTYFGYLLGSEDYYSHERCALIDSLNVTRCALDFRDGEQVATGYKNMYSTNIFTERATAL 224

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I +H  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  +DE+V
Sbjct: 225 ITSHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMDEAV 279

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGG+RG G + 
Sbjct: 280 GNVTAALKSHGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGIRGVGFVA 335

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           SPLL+ +G+   + +H+SDWLPTL+  A  S
Sbjct: 336 SPLLKQKGVKNRELIHISDWLPTLVKLARGS 366



 Score = 42.7 bits (99), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146


>gi|241654408|ref|XP_002411325.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215503955|gb|EEC13449.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 510

 Score =  318 bits (814), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 156/322 (48%), Positives = 203/322 (63%), Gaps = 6/322 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+FIL DDLGW DV FHG  QIPTPNID LA  G+IL NYY +  CTPSR+A+MTG +
Sbjct: 3   PNIVFILGDDLGWGDVSFHGSTQIPTPNIDVLAGDGVILNNYYVLPTCTPSRAALMTGLY 62

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTGMQ +++      GLPL  KILPQ+ ++LGY   ++GKWHLGF+K  Y P  RGF+
Sbjct: 63  PIHTGMQSDIIEPAAPWGLPLENKILPQHFRDLGYDVNMIGKWHLGFFKTPYVPIKRGFD 122

Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           +  G++TG  DY++H S  E    G+ +   L  AW    +    + T   +++      
Sbjct: 123 TFFGFYTGSNDYYNHTSGSENNEQGVSLVATLTAAWGRIAQGVEPLLTDSPLNVY----- 177

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
            +P F Y +H A HSA   EP QAP   +    +I +  R+ +A +L  LDESVG+VVEA
Sbjct: 178 PQPFFCYFSHHAVHSALMAEPFQAPARNVLKFPYIGESNRTIYAGMLDALDESVGRVVEA 237

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L+   ML ++IIVF SDNGGA  G   N   NWPLRG K TLWEG VR    +WSP    
Sbjct: 238 LDNAGMLEDTIIVFSSDNGGAPCGEASNQGFNWPLRGAKFTLWEGSVRVPAFVWSPKFLK 297

Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
           +  V+ Q +H+SDWLPTL +AA
Sbjct: 298 QSRVSNQLMHISDWLPTLYTAA 319



 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/181 (25%), Positives = 77/181 (42%), Gaps = 31/181 (17%)

Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVD 627
           D S    IDG+D+W  L+ +  SKR  IL NID  + IS L   ++K  K+V    +  +
Sbjct: 322 DVSALGVIDGVDMWHSLTHSSASKRKEILLNIDASFNISGL---RYKQYKLVVGGGFDGE 378

Query: 628 LTGGPDQVYLSG----------LSDREWLALAMRKLRDAASIQCGPVK-----EVPCE-- 670
           L    D  Y +G          L +   +A  ++K  +       P+       V C   
Sbjct: 379 LN---DHYYFTGGTRPTNNIDELRNESTMARVLKKFYEKQRRSWSPLTWSENVAVDCRRD 435

Query: 671 -------PQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEK 723
                  P   P LF+I+ DPCE NNLA +++   +     ++  +N    P   + ++ 
Sbjct: 436 RLTENFVPMQPPYLFNIEQDPCELNNLA-KTKVHTLRFLMKKLDAYNSTVVPSLFQPDDP 494

Query: 724 K 724
           +
Sbjct: 495 R 495


>gi|449514410|ref|XP_002188440.2| PREDICTED: arylsulfatase B, partial [Taeniopygia guttata]
          Length = 491

 Score =  317 bits (813), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 153/328 (46%), Positives = 216/328 (65%), Gaps = 16/328 (4%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PH++ +LADDLGW DVG+HG   I TP +DAL   G+ L+ YYT  LCTPSRS +++G++
Sbjct: 1   PHLVLVLADDLGWGDVGWHG-SAIRTPRLDALGAGGVRLERYYTQPLCTPSRSQLLSGRY 59

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QH +++ C+   LPL EK+LP+ L+E GY T +VGKWHLG Y+KE  PT RGF+
Sbjct: 60  QIHTGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMYRKECLPTHRGFD 119

Query: 180 SHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
           ++ GY  G +DY+ H       A+ +    LD R   E A      YST++FT  A+D+I
Sbjct: 120 TYFGYLLGSEDYYTHDRCVFIKAKNVTRCALDFRDGEEVATGFKNVYSTNLFTERAIDVI 179

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
            NH T++PLFLYLA  + H     EPL+ P+ Y+  +  I+D KR  +A ++  +DE+VG
Sbjct: 180 ANHKTEKPLFLYLAFQSVH-----EPLEVPEKYVKPYSSIKDVKRRHYAGMVTLMDEAVG 234

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
            + +AL+   +  N++ VF +DNGG      L   +NWPLRG K TLWEGGVRG G + S
Sbjct: 235 NLTDALKTYGLWDNTVFVFSTDNGGQ----TLAGGNNWPLRGRKWTLWEGGVRGVGFVAS 290

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           PLL+ +G+ + + +H+SDWLPTL+  A 
Sbjct: 291 PLLKRKGVESHELIHISDWLPTLVHLAG 318



 Score = 40.8 bits (94), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   LPL EK+LP+ L+E GY T ++ 
Sbjct: 65  LQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVG 100


>gi|195021983|ref|XP_001985495.1| GH14468 [Drosophila grimshawi]
 gi|193898977|gb|EDV97843.1| GH14468 [Drosophila grimshawi]
          Length = 619

 Score =  317 bits (812), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 156/326 (47%), Positives = 215/326 (65%), Gaps = 6/326 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II ++ADD+G++D+   G  +  TPNIDALAY G +L   Y   +CTPSR A+++GK+
Sbjct: 33  PNIIIVMADDMGFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGALLSGKY 92

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTG QH V+   E  GL L+  ++P+  +E GY T +VGKWHLGF + EYTPT RGF+
Sbjct: 93  PIHTGTQHYVIGNEEPWGLALNTTLMPEIFREAGYSTNLVGKWHLGFSRPEYTPTHRGFD 152

Query: 180 SHLGYWTGHQDYFDHSAEEMKMW----GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
           +H GYW  + DY+   + +M +     G D RR+++      G Y TD+ T EA  II  
Sbjct: 153 NHYGYWGAYIDYYQRRS-QMPLGNYSVGYDFRRNMQVECTDRGVYVTDLLTNEAERIIRE 211

Query: 236 H-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
             + ++PLFL L+H A H+ N  +PLQAP+  L    HI+D  R  +AA++ KLDESVG+
Sbjct: 212 QPAKEQPLFLILSHLAPHTGNTNKPLQAPEEELRKFTHIKDPNRRLYAAMVSKLDESVGR 271

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V+ AL +   L NSI++F SDNG  + G   N  SNWPL+G KN+ WEGG+R AG IWSP
Sbjct: 272 VITALARTDQLENSIVIFYSDNGAPSVGMFSNTGSNWPLKGQKNSPWEGGLRVAGAIWSP 331

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
           LL++RG +  Q ++V D+LPTL  AA
Sbjct: 332 LLKARGNLFTQPIYVGDFLPTLAHAA 357



 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 64/139 (46%), Gaps = 23/139 (16%)

Query: 575 IDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV------------- 619
           +DGID+W  LS  ++ P     ILH +DD W++ +L +G++K V                
Sbjct: 367 LDGIDLWPQLSGPKDAPHVPREILHQLDDVWRVQSLFQGQYKYVNGTTLAGQFDQVLTYR 426

Query: 620 -------KVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQ 672
                  +  RY V++   P    L+    +      +  +  +A++ C  ++   C   
Sbjct: 427 ELDDLDPRASRYVVEVRNSPASRALARYDLQRITQARINTISRSAAVICSDMQR-GCNAL 485

Query: 673 IAPCLFDIKNDPCEKNNLA 691
           +  CL+D+  DPCE+NNLA
Sbjct: 486 LEECLYDLSTDPCEQNNLA 504


>gi|195436072|ref|XP_002066002.1| GK11604 [Drosophila willistoni]
 gi|194162087|gb|EDW76988.1| GK11604 [Drosophila willistoni]
          Length = 567

 Score =  316 bits (809), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 159/331 (48%), Positives = 214/331 (64%), Gaps = 5/331 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II ILADD+G++DV F G  +  TPNIDALAY G IL N Y   +CTPSR A+++G++
Sbjct: 28  PNIIIILADDMGFDDVSFRGGREFLTPNIDALAYHGRILDNLYAPAMCTPSRGALLSGRY 87

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P HTG QH V+   E  GL L+  ++P+  KE GY T ++GKWHLGF   EYTPT RGF+
Sbjct: 88  PAHTGTQHFVISNEEPWGLTLNATLMPEIFKEAGYSTNLIGKWHLGFASPEYTPTRRGFD 147

Query: 180 SHLGYWTGHQDYFDHSAEEMKM---WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN- 235
            H GYW  + DY+   ++        G D RR+++      G Y TD+ T EA  +I   
Sbjct: 148 YHYGYWGAYIDYYQRRSQMPVANYSMGYDFRRNMDLECQNRGVYITDLLTQEAEHVIREK 207

Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
            + +E  FL L+H ATH+AN  +PLQAP+  +    +I+D +R K+AA++ KLD+SVG++
Sbjct: 208 AAANETFFLMLSHLATHTANDNDPLQAPEEEIRKFAYIKDPRRRKYAAMVAKLDQSVGRI 267

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
           +  L Q   L NSI++F SDNG  + G   N  SNWPLRG KNT WEGG+R AG IWS  
Sbjct: 268 ITTLAQTEQLENSIVIFYSDNGAPSVGQFANTGSNWPLRGQKNTPWEGGIRVAGAIWSTQ 327

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           L++RG    Q ++V+DWLPTL  AA   D+P
Sbjct: 328 LQARGNKFTQPIYVADWLPTLAHAAGI-DLP 357



 Score = 69.3 bits (168), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 40/139 (28%), Positives = 64/139 (46%), Gaps = 23/139 (16%)

Query: 574 EIDGIDVWSVL--SRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM--------- 622
           ++DG+D+W  L  S++       ILH +DD W +S++  G WK +               
Sbjct: 362 QLDGLDLWPQLAGSKDAAHVPREILHALDDVWHVSSMLLGHWKYINGTTASGTYDQVLTY 421

Query: 623 -----------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
                      RY V++   P    L+    +      +  LR +AS++CG ++   C P
Sbjct: 422 RELDDLDPRNSRYVVEVRNSPASRALAHFDLQRLGQQKITSLRHSASVRCGNLQR-GCNP 480

Query: 672 QIAPCLFDIKNDPCEKNNL 690
            +  CL+DI  DPCE+NNL
Sbjct: 481 LVEECLYDIAADPCEQNNL 499



 Score = 47.8 bits (112), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 23/94 (24%), Positives = 48/94 (51%), Gaps = 3/94 (3%)

Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           TT  G ++Q+    + ++ + +  +   + +     +       ++   ++  +  LR +
Sbjct: 409 TTASGTYDQVLTYRELDDLDPRNSRYVVEVRNSPASRALAHFDLQRLGQQK--ITSLRHS 466

Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 796
           AS++CG ++   C P +  CL+DI  DPCE+NNL
Sbjct: 467 ASVRCGNLQR-GCNPLVEECLYDIAADPCEQNNL 499


>gi|194871676|ref|XP_001972885.1| GG13639 [Drosophila erecta]
 gi|190654668|gb|EDV51911.1| GG13639 [Drosophila erecta]
          Length = 584

 Score =  315 bits (808), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 165/371 (44%), Positives = 237/371 (63%), Gaps = 10/371 (2%)

Query: 21  LPQYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGL 80
           +  +L +    T ++A  VL +A +       ++A+S  P+II I+ADDLG++DV F G 
Sbjct: 1   MSTHLDKFSSATWLLAGFVLCIALSNG-----IIAASDRPNIIIIMADDLGFDDVSFRGS 55

Query: 81  DQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL 140
           +   TPNIDALAYSG+IL N Y   +CTPSR+A++TGK+PI+TGMQH V+   +  GLPL
Sbjct: 56  NNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPL 115

Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
           +E  + +  +E GYRT ++GKWHLGF ++ +TPT RGF+ HLGY   + DY+  S E+  
Sbjct: 116 NETTMAEIFRENGYRTSLLGKWHLGFSQRNFTPTQRGFDRHLGYLGAYVDYYTQSYEQQS 175

Query: 201 MW--GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS---TDEPLFLYLAHAATHSAN 255
               G D R +L+ + D  G Y TDV T  AV  I +H+   + +PLFL L H A H+AN
Sbjct: 176 KGYNGHDFRDNLKSSHDQVGHYITDVLTDAAVKEIEDHASKNSSQPLFLLLNHLAPHAAN 235

Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
              P+QAP   ++   +I +     +AA++ KLD+SVG V++AL ++ ML NSI++F+SD
Sbjct: 236 DDNPMQAPAEEVSRFEYIRNKTHRYYAAMVSKLDQSVGSVIDALARQEMLQNSIVLFLSD 295

Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
           NGG   G +   ASN+PLRG K++ WEG +R +  IWS   E  G V +Q +++ D LPT
Sbjct: 296 NGGPTQGQHSTTASNYPLRGQKDSPWEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPT 355

Query: 376 LLSAANKSDIP 386
           L +AA  S  P
Sbjct: 356 LAAAAGISPDP 366



 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 69/149 (46%), Gaps = 45/149 (30%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL--TKGKWKLVKVVKVMRYQVDLTGGP 632
           +DG+++WS L     S    I+H ID++  +  L  T+GKWK++            +G  
Sbjct: 370 LDGLNLWSALKYGYESVEREIVHVIDEDVAVPHLSYTRGKWKVI------------SGTT 417

Query: 633 DQ-VYLSGLSDRE--------------------WLALA--------MRKLRDAASIQC-G 662
           +Q +Y   L  RE                    W  L         + +LRD ASI+C  
Sbjct: 418 NQGLYDGWLGHRETSEVDPRAVEYEELVRNTSVWQQLQQVTSGERNISELRDQASIECPN 477

Query: 663 PVKEV-PCEPQIAPCLFDIKNDPCEKNNL 690
           P   V PC P  APCLFDI+ DPCE++NL
Sbjct: 478 PATGVRPCLPLEAPCLFDIEADPCERSNL 506



 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 56/121 (46%), Gaps = 10/121 (8%)

Query: 707 GRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
           G  NQ  Y       E  +   +  + ++  +     ++ ++ ++ E  + +LRD ASI+
Sbjct: 415 GTTNQGLYDGWLGHRETSEVDPRAVEYEELVRNTSVWQQLQQVTSGERNISELRDQASIE 474

Query: 767 C-GPVKEV-PCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQI 824
           C  P   V PC P  APCLFDI+ DPCE++NL           Y     +LD   R  Q 
Sbjct: 475 CPNPATGVRPCLPLEAPCLFDIEADPCERSNL--------YAEYKNSTTFLDLWARIQQF 526

Query: 825 A 825
           A
Sbjct: 527 A 527


>gi|91084739|ref|XP_970972.1| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
 gi|270008608|gb|EFA05056.1| hypothetical protein TcasGA2_TC015151 [Tribolium castaneum]
          Length = 558

 Score =  315 bits (808), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 154/325 (47%), Positives = 207/325 (63%), Gaps = 5/325 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHII I+ DD+G ND+G    +QIPTPNIDAL Y+G++L  YY    CTPSR+A +TG +
Sbjct: 27  PHIIVIMGDDMGHNDIGLR-TNQIPTPNIDALGYNGVVLDRYYVQNACTPSRAAFLTGNY 85

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PI + MQ   +   E   LPL+  ++PQ+LK LGYRT IVGKWHLG   +  TPT +GF+
Sbjct: 86  PIRSAMQGLPIVAGENRSLPLNMPLMPQHLKNLGYRTHIVGKWHLGSAYRSSTPTEKGFD 145

Query: 180 SHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           SH GYW G   Y+D+  +     + G D+    E      G+Y+T VFT  A+DII  H+
Sbjct: 146 SHFGYWNGFTGYYDYFTDFNSTAIEGFDLHDRFETERGYQGQYATRVFTERALDIIEGHN 205

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHY--LNIHRHIEDFKRSKFAAILHKLDESVGKV 295
           T  PLFL + H A H+      L  P+       + +I+D +R  +A I+ +LD S+G+V
Sbjct: 206 TTRPLFLLMTHLAAHAGRDGTELGVPNEVEAQRTYSYIQDPRRRLYAEIVAELDRSIGQV 265

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
           V  L +R+ML NSII+F SDNG    G   N+ SNWPLRG+K T +EGG+RG   I+SPL
Sbjct: 266 VRKLSERQMLENSIILFFSDNGAPTVGPYTNSGSNWPLRGIKLTNFEGGIRGTATIFSPL 325

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
           L+ RG V ++ +HVSDWLPT  +AA
Sbjct: 326 LKKRGYVNKELIHVSDWLPTFYAAA 350


>gi|126317548|ref|XP_001381590.1| PREDICTED: arylsulfatase B [Monodelphis domestica]
          Length = 522

 Score =  315 bits (808), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 153/332 (46%), Positives = 220/332 (66%), Gaps = 16/332 (4%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           ++G PHI+F+LADDLGWNDVG+H    I TP++DAL+  G+ L+NYYT  LCTPSRS ++
Sbjct: 30  ATGQPHIVFVLADDLGWNDVGYHD-SNIFTPHLDALSAQGVRLENYYTQPLCTPSRSQLL 88

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           TG++ IHTG+QH +++ C+   +PL EK+LP+ L+E GY T +VGKWHLG ++KE  PT 
Sbjct: 89  TGRYQIHTGLQHQIIWPCQPSCIPLDEKLLPELLREAGYVTHMVGKWHLGMFRKECLPTR 148

Query: 176 RGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           RGF++  GY  G +DY+ H       A ++    LD R   + A      YST+VFT  A
Sbjct: 149 RGFDTFFGYLLGSEDYYTHKRCVHIDALKVTRCALDFRDGEDIAAGYENMYSTNVFTERA 208

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
           V++I NH   +PLFLYLA  + H     EPLQ P+ YL  +  I++  R  +A ++  +D
Sbjct: 209 VNLIANHPAQKPLFLYLALQSVH-----EPLQVPEEYLQPYDFIQNKNRQHYAGMVTLMD 263

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           E+VG V  AL++  + +N++ +F +DNGG      L   +NWPLRG K TLWEGG+RG G
Sbjct: 264 EAVGNVTRALQKYGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWTLWEGGIRGVG 319

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            + SPLL+ + + + + +H+SDWLPTL++ A 
Sbjct: 320 FVSSPLLKRKRVKSSELIHISDWLPTLVNLAG 351



 Score = 40.4 bits (93), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 14/36 (38%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LP+ L+E GY T ++ 
Sbjct: 98  LQHQIIWPCQPSCIPLDEKLLPELLREAGYVTHMVG 133


>gi|195166561|ref|XP_002024103.1| GL22855 [Drosophila persimilis]
 gi|194107458|gb|EDW29501.1| GL22855 [Drosophila persimilis]
          Length = 559

 Score =  315 bits (807), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 166/398 (41%), Positives = 231/398 (58%), Gaps = 40/398 (10%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           +S  P+II IL DD+G+NDV FHG +QI TPNIDALAY+GI+L  +Y   LCTPSR+ ++
Sbjct: 24  ASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSRATLL 83

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           TGK+PIHTGMQH V+   E  GLP  E+++P+  ++ GY T +VGKWHLGF++K+ TPT 
Sbjct: 84  TGKYPIHTGMQHFVIVTDEPWGLPRQERLMPELFRDAGYATHLVGKWHLGFWRKDLTPTM 143

Query: 176 RGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           RGF+ H GY+ G+ DY+D +   +      GLD RRDLEP  +  G Y+T+ FT EA  +
Sbjct: 144 RGFDHHFGYYNGYMDYYDQTVRMLDRNYSTGLDFRRDLEPCREAEGTYATEAFTTEARKV 203

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I  H    PLF+ L+H A H+ N   P+QAP+  +    HI D KR  +A ++  LD+SV
Sbjct: 204 IERHDKSRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHIRDPKRRTYAGMISSLDKSV 263

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G+ + AL    ML+NSI++  SDNG    G + NA SN+P RG                 
Sbjct: 264 GQTMRALADNGMLNNSIVLLYSDNGAPTVGIHSNAGSNYPFRG----------------- 306

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS-------IL 405
                 RG V+ Q +H  DWLPTL +AA  S +P+ +     N+ P    S       +L
Sbjct: 307 -----QRGYVSNQAIHAIDWLPTLAAAAGVS-LPSDLRLDGLNLWPSLSASAQPQPRNLL 360

Query: 406 RYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYEN 443
              +    Y+S        +Y NG+      +E RY++
Sbjct: 361 HVLDDVFGYSS--YTQDTLKYVNGS-----SFEGRYDH 391



 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 51/155 (32%)

Query: 575 IDGIDVWSVLSRN-EPSKRNTILHNIDDEWQISALTKGKWKLV----------------- 616
           +DG+++W  LS + +P  RN +LH +DD +  S+ T+   K V                 
Sbjct: 339 LDGLNLWPSLSASAQPQPRN-LLHVLDDVFGYSSYTQDTLKYVNGSSFEGRYDHWLGELE 397

Query: 617 ------------KVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPV 664
                       ++V+    Q  L        ++   DR      +R++RD A+ +C P+
Sbjct: 398 DGEEDPNSKDYGQLVRASEVQAVLGS------VAPTDDR------IRQMRDEATQRC-PI 444

Query: 665 KE-------VPCEPQIAPCLFDIKNDPCEKNNLAD 692
           +E         CEP  A CLFD+ NDPCE+ NLA+
Sbjct: 445 QERDPEDPVYLCEPLKAACLFDLANDPCERYNLAN 479



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 47/91 (51%), Gaps = 8/91 (8%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE------- 772
           E E  ++    K   +  +  + +      +  ++ +R++RD A+ +C P++E       
Sbjct: 395 ELEDGEEDPNSKDYGQLVRASEVQAVLGSVAPTDDRIRQMRDEATQRC-PIQERDPEDPV 453

Query: 773 VPCEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
             CEP  A CLFD+ NDPCE+ NLA+   +Q
Sbjct: 454 YLCEPLKAACLFDLANDPCERYNLANLYPLQ 484


>gi|260794113|ref|XP_002592054.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
 gi|229277268|gb|EEN48065.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
          Length = 478

 Score =  315 bits (806), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 153/330 (46%), Positives = 219/330 (66%), Gaps = 17/330 (5%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+FILADD GWND+G+HG   I TPN+D LA  G+ L+NYY   +C+PSR  +MTG++
Sbjct: 3   PHIVFILADDYGWNDIGYHG-SFIKTPNLDRLASEGVKLENYYVQPICSPSREQLMTGRY 61

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IH G+QH+V+      GLPL E  LPQ LKE+GY T +VGKWHLGF+++EY P  RGF+
Sbjct: 62  QIHYGLQHSVITHDRPHGLPLDEVTLPQKLKEIGYSTHLVGKWHLGFFRQEYLPLRRGFD 121

Query: 180 SHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
           +  G+ TG +DY+ H      S +  +  GLD+R   EP  D +G YST +F  +A+DII
Sbjct: 122 TFYGFLTGGEDYWSHRRPNVYSTDASEYHGLDLRDQDEPVLDQNGTYSTHLFQRKAIDII 181

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
            +H  ++P+FLYL+  A H+     PLQAP+ Y+  ++ +++    K+AA++  +DE+VG
Sbjct: 182 AHHDRNKPMFLYLSFQAVHA-----PLQAPEGYIEKYKSVQNILIRKYAAMVTAMDEAVG 236

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
            V +AL+   +  N++++F +DNG   +     A SNWPLRG KNTLWEGG+RG   + S
Sbjct: 237 NVTDALKGSGLWDNTVLIFSTDNGARRS-----AGSNWPLRGWKNTLWEGGIRGVCFVTS 291

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            LLE +G  ++  +H+SDW PTL+  A  S
Sbjct: 292 NLLERKGTKSDALIHISDWFPTLIHLAQGS 321


>gi|194748096|ref|XP_001956485.1| GF25237 [Drosophila ananassae]
 gi|190623767|gb|EDV39291.1| GF25237 [Drosophila ananassae]
          Length = 570

 Score =  315 bits (806), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 160/354 (45%), Positives = 219/354 (61%), Gaps = 16/354 (4%)

Query: 44  FTLSMVFVDLVASSGP------------PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           ++LS  F+ L+   G             P+II I+ADD+G++DV F G  +  TPNIDAL
Sbjct: 2   WSLSATFLLLLCLRGSKANGAEMEATRRPNIIIIMADDMGFDDVSFRGGREFITPNIDAL 61

Query: 92  AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
           AY G +L   Y   +CTPSR A+++G++PIHTG QH V+   E   L  +  ++P+  + 
Sbjct: 62  AYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVISNEEPWALDSNATLMPEIFQR 121

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKM---WGLDMRR 208
            GY T +VGKWHLGF + EYTPT RGF+ H GYW  + DY+   ++        G D RR
Sbjct: 122 AGYSTNLVGKWHLGFSRPEYTPTHRGFDYHYGYWGAYIDYYQRRSKMPVANYSLGYDFRR 181

Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
           ++E      G Y TD+ T EA  +I   +  D+PLFL L+H ATH+AN  +PLQAP+  +
Sbjct: 182 NMELECRDRGTYVTDLLTTEAERLIKEQAGKDKPLFLMLSHLATHTANEDDPLQAPEEEI 241

Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
               +I+D  R K+AA++ KLD+SVG+++  L     L NSI++F SDNG  + G   N 
Sbjct: 242 RKFSYIKDPNRRKYAAMVSKLDQSVGRIISTLASTDQLENSIVIFYSDNGAPSVGMFANT 301

Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            SNWPLRG KNT WEGGVR AG +WS  LE+RG +  Q ++V DWLPTL  AA+
Sbjct: 302 GSNWPLRGQKNTPWEGGVRVAGAVWSSKLEARGSIFTQPIYVGDWLPTLAHAAD 355



 Score = 77.0 bits (188), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 46/139 (33%), Positives = 64/139 (46%), Gaps = 23/139 (16%)

Query: 575 IDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV------------- 619
           +DGID+W  LS   + P     ILH++DD W++ +L  G WK V                
Sbjct: 366 LDGIDLWPQLSGAADAPHVPREILHSLDDVWKVGSLLMGHWKYVNGTTSAGQYDSVLTYR 425

Query: 620 -------KVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQ 672
                  +  RY V +   P    L+ L  R      +  LR +A++ CG  +   C P 
Sbjct: 426 EFDDIDPRESRYAVTIRNSPASRALTRLDLRRLTQQRIAGLRRSAAVHCGDFQR-SCNPL 484

Query: 673 IAPCLFDIKNDPCEKNNLA 691
           +  CLFDI  DPCE+NNLA
Sbjct: 485 LEDCLFDISADPCEQNNLA 503



 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/95 (24%), Positives = 48/95 (50%), Gaps = 3/95 (3%)

Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           TT  G+++ +    + ++ + ++ +     +     +   +   ++ + +   +  LR +
Sbjct: 412 TTSAGQYDSVLTYREFDDIDPRESRYAVTIRNSPASRALTRLDLRRLTQQR--IAGLRRS 469

Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
           A++ CG  +   C P +  CLFDI  DPCE+NNLA
Sbjct: 470 AAVHCGDFQR-SCNPLLEDCLFDISADPCEQNNLA 503


>gi|195441668|ref|XP_002068625.1| GK20324 [Drosophila willistoni]
 gi|194164710|gb|EDW79611.1| GK20324 [Drosophila willistoni]
          Length = 525

 Score =  313 bits (803), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 158/326 (48%), Positives = 222/326 (68%), Gaps = 4/326 (1%)

Query: 66  LADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGM 125
           +ADDLG++DV F G +   TPNIDALAYSG+IL N YT  +CTPSRSA++TGK+PI TGM
Sbjct: 1   MADDLGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTPAMCTPSRSALLTGKYPISTGM 60

Query: 126 QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYW 185
           QH V+   +  GLPL+E  + +  ++ GY T ++GKWHLG  ++ +TPT RGF++HLGY 
Sbjct: 61  QHYVIVNDQPWGLPLNETTMAEIFQQNGYYTSLLGKWHLGMSQRNFTPTKRGFDTHLGYL 120

Query: 186 TGHQDYFDHS---AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS-TDEP 241
             + DY+D +   + +    G D R +LE + D  G Y TD+ +  AV++I  H+ T +P
Sbjct: 121 GAYIDYYDQTYLQSSQNYSRGHDFRDNLEASHDKVGHYVTDILSDAAVELIEKHNVTAKP 180

Query: 242 LFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQ 301
           LFL L+H A H+AN  +PLQAP   L+   +I++     +AA++ +LD+SVG+V+EAL  
Sbjct: 181 LFLLLSHLAPHAANDNDPLQAPMEELSQFEYIQNKSHRYYAAMVSRLDKSVGRVIEALAN 240

Query: 302 RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGI 361
           R ML NSI++F+SDNGG + G + N ASN+PLRG K+T WEGG+R +  IWS   E  G 
Sbjct: 241 REMLRNSIVLFLSDNGGPSVGEHANTASNYPLRGQKHTPWEGGIRSSAAIWSTEFERLGS 300

Query: 362 VAEQYVHVSDWLPTLLSAANKSDIPN 387
           V +Q +++ D LPTL SAA  S  P+
Sbjct: 301 VWKQRIYIGDLLPTLCSAAGISLDPS 326



 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/148 (30%), Positives = 64/148 (43%), Gaps = 23/148 (15%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID--DEWQISALTKGKWKLVKVV------------- 619
           +DG+++WS L     S    I+H+ID  D     A  +GKWK++                
Sbjct: 329 LDGLNLWSALKYGYESVEREIVHSIDAVDPQPHLAYMRGKWKIINGTTNSGQYDGWLGQR 388

Query: 620 -------KVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQC-GPVKEVPCEP 671
                  + + Y   +        L  L+D+      + +LR  A I C  P    PC P
Sbjct: 389 ITDEVDPRSVHYDQLIRNTTVWQQLQLLADKPQQERNITELRQQAQILCPTPANAKPCMP 448

Query: 672 QIAPCLFDIKNDPCEKNNLADRSEDQRI 699
            IAPC FDI+ DPCE+NNL D   +  I
Sbjct: 449 LIAPCFFDIEVDPCEQNNLYDEYRNTSI 476



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 39/75 (52%), Gaps = 9/75 (12%)

Query: 752 EEEGMRKLRDAASIQC-GPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTT 810
           +E  + +LR  A I C  P    PC P IAPC FDI+ DPCE+NNL D         Y  
Sbjct: 422 QERNITELRQQAQILCPTPANAKPCMPLIAPCFFDIEVDPCEQNNLYDE--------YRN 473

Query: 811 EVGYLDPKQRFNQIA 825
              +LD  +R +Q A
Sbjct: 474 TSIFLDLWERIDQYA 488


>gi|270008947|gb|EFA05395.1| hypothetical protein TcasGA2_TC015567 [Tribolium castaneum]
          Length = 513

 Score =  313 bits (803), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 155/322 (48%), Positives = 216/322 (67%), Gaps = 7/322 (2%)

Query: 66  LADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGM 125
           +ADD+G NDVGF+G  QIPTP+IDALAY+GIIL  +YT   CTPSR+A++TG++PI  GM
Sbjct: 1   MADDVGRNDVGFYGSGQIPTPSIDALAYNGIILDRFYTQCSCTPSRAALLTGQYPIRLGM 60

Query: 126 QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYW 185
           Q   +   E   LPL    +PQYLK LGY+T +VGKWHLG+   E TP  RGF+SH GYW
Sbjct: 61  QGLPIRAGENKSLPLDVVTMPQYLKRLGYKTHLVGKWHLGYAHIEDTPLQRGFDSHFGYW 120

Query: 186 TGHQDYFDHS-----AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE 240
            G   YF+++     A +  + G D+   + PAW   GKY+TD+FT +A+ II  H++++
Sbjct: 121 NGFVGYFNYTAVYELANDTMVKGFDLFDGVVPAWQERGKYATDLFTHKAMKIIDEHNSEK 180

Query: 241 PLFLYLAHAATHSANPYEPLQAPDHYLNIHR--HIEDFKRSKFAAILHKLDESVGKVVEA 298
           PLFL LAH A H+      L  PD      R   I++ KR  +A I+ +LD+SVG++VE 
Sbjct: 181 PLFLVLAHLAGHTGEDGVELGVPDVAQAETRFSFIKNPKRRLYADIVSRLDDSVGQIVEK 240

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L+++ +L NSII+F+SDNG    G   N+ SNWPLRG+K T +EGG+R    I+SPL   
Sbjct: 241 LDEKNLLDNSIILFLSDNGAQTIGVYENSGSNWPLRGLKLTEFEGGIRAPAAIYSPLFHQ 300

Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
           +G V+++ +H++D+LPT  +AA
Sbjct: 301 KGYVSKELIHITDFLPTFYAAA 322



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 43/157 (27%), Positives = 71/157 (45%), Gaps = 11/157 (7%)

Query: 571 YQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTG 630
           Y  +IDG++ W VLS+ +P KR+ IL N ++   +SA+  G++KLV+         D  G
Sbjct: 328 YLGKIDGVNQWEVLSQQKPGKRDEILVNFNEIDNVSAIIMGQYKLVQGRLAFPEYGDYHG 387

Query: 631 -------GPDQVYLSGLSD--REWLALAMRKLRDAASIQ-CGPVKEVPCEPQIAPCLFDI 680
                   P   + + LS    +     +  LR+   +  C   K  P       CLF++
Sbjct: 388 DSGRNPEDPPYNFTAVLSKNINQQNRNKIESLRNQTDLSWCRGEKNQPLIDCTEGCLFNL 447

Query: 681 KNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDK 717
           ++DPCE  N+  + E          +  F+QI  P +
Sbjct: 448 ESDPCETTNII-QEEKIIAGKLLGRLFEFSQILVPQQ 483



 Score = 45.4 bits (106), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 33/47 (70%)

Query: 506 YQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVK 552
           Y  +IDG++ W VLS+ +P KR+ IL N ++   +SA+  G++KLV+
Sbjct: 328 YLGKIDGVNQWEVLSQQKPGKRDEILVNFNEIDNVSAIIMGQYKLVQ 374


>gi|193641124|ref|XP_001950120.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 599

 Score =  313 bits (803), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 169/361 (46%), Positives = 229/361 (63%), Gaps = 18/361 (4%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGP----PHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
           ++ ++  L   +S +  D     GP    PHII I+ADDLGWNDVGFHG  QIPTPNIDA
Sbjct: 5   ISCSIRILIILVSSLLSDCSFLRGPNKKQPHIILIVADDLGWNDVGFHGSIQIPTPNIDA 64

Query: 91  LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           LAY+G+IL  +Y    CTPSR+A++TGK+PI  G+Q   +       LPL+EKILPQYLK
Sbjct: 65  LAYNGVILNRHYVQPTCTPSRAALLTGKYPIRYGLQGFPIIAGVPLALPLNEKILPQYLK 124

Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
           +LGY T +VGKWHLG  K ++TP  RGF+SH GYW G   Y + +     M G D RR  
Sbjct: 125 DLGYSTHLVGKWHLGANKNQHTPIKRGFDSHFGYWNGFISYRNSTHSTGLMVGKDARRGF 184

Query: 211 EPAWD-LHGKYSTDVFTAEA---VDIIHNHSTDEPLFLYLAHAATHSANPY-EPLQAPDH 265
           E A D +  +Y+TD+FT EA   + +  NH  D+P+FL ++H A H+  P    L+  + 
Sbjct: 185 ERAGDEMVDRYATDIFTDEANKVIKLCKNH--DKPMFLMVSHLAVHTGVPGPNILEVSNK 242

Query: 266 YLNIHR--HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA-- 321
             N  R  +IE+ +R  +A +L  LDESVG ++E+L+   ML +SI++F+SDNG  A   
Sbjct: 243 THNDIRFDYIENKERRLYAGMLTSLDESVGSIIESLDNNGMLEDSIVLFISDNGAPADDP 302

Query: 322 --GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
             G+  N+ SNWPLRG K  + +GGVRG   IWSP L+ +  + E   H++DWLPTL +A
Sbjct: 303 IWGYG-NSGSNWPLRGEKGAVLDGGVRGVAAIWSPWLKKKHRIFENLFHITDWLPTLYTA 361

Query: 380 A 380
           A
Sbjct: 362 A 362


>gi|391326893|ref|XP_003737944.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 528

 Score =  313 bits (802), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 157/353 (44%), Positives = 216/353 (61%), Gaps = 10/353 (2%)

Query: 43  AFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY 102
            F + +   +  ASS  P+I+ I+ DDLGW+D+  HG DQIPTPNID LA  G++L+NYY
Sbjct: 6   CFVIVLRIANCAASS--PNIVLIVIDDLGWDDISLHGSDQIPTPNIDKLAAEGVLLENYY 63

Query: 103 TVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKW 162
           T  +CTPSR A+MTGK+PIH G+Q++V+ G +  GLP   KI+PQYL    Y++ I+GKW
Sbjct: 64  TQAICTPSRGALMTGKYPIHLGLQYDVIQGAQPYGLPTDFKIMPQYLSGTCYKSHIIGKW 123

Query: 163 HLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM-----KMW-GLDMRRDLEPAWDL 216
           HLG  + E  PT RGF SH G+  GH DYF+H  EE      +M+ GLD+  +  P    
Sbjct: 124 HLGHSRSELLPTRRGFHSHFGFRLGHSDYFNHWGEESSPVKNEMYAGLDLWSNEVPIKKY 183

Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL-NIHRHIED 275
           HG Y+ D+FT  A+ I+  H+   PLFLYLAH A H  +   PLQAP  ++      I D
Sbjct: 184 HGTYANDLFTKRAISILETHNKTTPLFLYLAHQAVHVGDGENPLQAPLSFVKKFEGKIYD 243

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
            KR  +AA++  +D S+G+++  +       N+II   +DNGG   G   + +SN PLRG
Sbjct: 244 EKRKHYAAMVSAMDASIGELMHGISTNGFAENTIIFLTNDNGGPTNGMASSGSSNHPLRG 303

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
            K TLWEGG RG+ L W P     G   +   H++D LPTLL+A+  +   N+
Sbjct: 304 CKYTLWEGGTRGSALFWYPKKLGAGTY-KGMAHITDVLPTLLTASGNAVAGNF 355


>gi|156378148|ref|XP_001631006.1| predicted protein [Nematostella vectensis]
 gi|156218038|gb|EDO38943.1| predicted protein [Nematostella vectensis]
          Length = 584

 Score =  313 bits (801), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 156/362 (43%), Positives = 216/362 (59%), Gaps = 52/362 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHII I+ADDLGW+DV FHG  QIPTP ID  A  G+IL NYY   +CTPSR+++MTGK+
Sbjct: 27  PHIIIIVADDLGWDDVSFHGSPQIPTPYIDFYANRGVILNNYYVSPMCTPSRASMMTGKY 86

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PI+ GM                                    WHLGF+ KEYTP +RGF+
Sbjct: 87  PINLGM------------------------------------WHLGFFTKEYTPVYRGFD 110

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           S  G+W    DY++HS+ E   WG+D+R ++EP     G Y T++FT EAV +I  H T 
Sbjct: 111 SFYGFWNAKTDYWNHSSYENNFWGVDLRDNMEPVQSEDGTYGTELFTREAVKVIEAHDTS 170

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH---------IEDFKRSKFAAILHKLDE 290
            PLFLY+AH A H+ANP EPLQAP   +++            I+D +R  +AA++  LD+
Sbjct: 171 TPLFLYVAHQAVHTANPNEPLQAPQDKIDVSLKQRQQRFKGTIDDDQRQVYAAMVTSLDQ 230

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
           SVG +  AL +R ML +S+++F +DNGGA  G N N  SN+PLRG K+ LWEGGV+G   
Sbjct: 231 SVGDIFAALSKRHMLRDSVVIFTTDNGGAPYGLNWNRGSNFPLRGGKDMLWEGGVKGVAF 290

Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD---IPNY----VNSTVENIIPRYENS 403
           ++S L++ +G V+++ + V+DW+PT+   A  +    +PN     V ST+    P   + 
Sbjct: 291 VYSDLIKQKGRVSKELIDVTDWVPTIYHLAGGTAEFLVPNMDGKNVWSTISEGAPSPRDE 350

Query: 404 IL 405
           IL
Sbjct: 351 IL 352



 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 71/149 (47%), Gaps = 22/149 (14%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
           +DG +VWS +S   PS R+ ILHNID   + + L KGK+K+V+             G D 
Sbjct: 331 MDGKNVWSTISEGAPSPRDEILHNIDPWRKFAGLRKGKYKIVQ-------------GMDD 377

Query: 635 VYLS-GLSDR----EWLALAMRKLRDAASIQCGPV--KEVPCEPQIAP-CLFDIKNDPCE 686
            Y   G  DR       ++   +L   A I C     +E  C+      CLFD++ DPCE
Sbjct: 378 TYKGVGWYDRYPGHALSSMKQPELLPGAVIDCKKTFDEERKCDSSDGKFCLFDMEEDPCE 437

Query: 687 KNNLADRSEDQRINHYTTEVGRFNQIAYP 715
            ++L+++   + +    T +  +  IA P
Sbjct: 438 YHDLSNQLP-EVLAEMKTRLEYYKNIALP 465



 Score = 44.7 bits (104), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 30/43 (69%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVK 552
           +DG +VWS +S   PS R+ ILHNID   + + L +GK+K+V+
Sbjct: 331 MDGKNVWSTISEGAPSPRDEILHNIDPWRKFAGLRKGKYKIVQ 373


>gi|195328499|ref|XP_002030952.1| GM25725 [Drosophila sechellia]
 gi|194119895|gb|EDW41938.1| GM25725 [Drosophila sechellia]
          Length = 585

 Score =  312 bits (800), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 164/371 (44%), Positives = 234/371 (63%), Gaps = 10/371 (2%)

Query: 21  LPQYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGL 80
           +  +L +    T ++   VL +A +       ++A+S  P+II I+ADDLG++DV F G 
Sbjct: 1   MSTHLDKFSSATSLLTGFVLFIALSNG-----IIATSDKPNIIIIMADDLGFDDVSFRGS 55

Query: 81  DQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL 140
           D   TPNIDALAYSG+IL N Y   +CTPSR+A++TGK+PI+TGMQH V+   +  GLPL
Sbjct: 56  DNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPL 115

Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
           +E  + +  +E GYRT ++GKWHLG  ++ +TPT RGF+ H GY   + DY+  S E+  
Sbjct: 116 NETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHFGYLGAYVDYYTQSYEQQN 175

Query: 201 MW--GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH---STDEPLFLYLAHAATHSAN 255
               G D R +L+   D  G Y TDV T  AV  I +H   ++ +PLFL L H A H+AN
Sbjct: 176 KGYNGHDFRDNLKSTHDHVGHYVTDVLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAAN 235

Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             +P+QAP   ++   +I +     +AA++ +LD+SVG V++AL ++ ML NSII+F+SD
Sbjct: 236 DDDPMQAPAEEVSRFEYISNKAHRYYAAMVSRLDKSVGSVIDALARQEMLQNSIILFLSD 295

Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
           NGG   G +   ASN+PLRG KN+ WEG +R +  IWS   E  G V +Q +++ D LPT
Sbjct: 296 NGGPTEGQHSTTASNYPLRGQKNSPWEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPT 355

Query: 376 LLSAANKSDIP 386
           L +AA  S  P
Sbjct: 356 LAAAAGISPDP 366



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 69/146 (47%), Gaps = 21/146 (14%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL--TKGKWKLVKVVK-------VMRYQ 625
           +DG+++WS L     S    I+H ID++  +  L  T+GKWK++            + ++
Sbjct: 370 LDGLNLWSALKYGYESVEREIVHVIDEDVAVPHLSYTRGKWKVISGTTNEGLYDGWLGHR 429

Query: 626 VDLTGGPDQVYLSGL--SDREWLALA--------MRKLRDAASIQC-GPVKEV-PCEPQI 673
                 P  V    L  +   WL L         + +LRD + I+C  P   V PC P  
Sbjct: 430 ETSEVDPGAVEYEELVRNTSVWLQLQQVTSGERNISELRDKSRIECPDPATGVKPCLPLE 489

Query: 674 APCLFDIKNDPCEKNNLADRSEDQRI 699
            PCLFDI+ DPCE++NL    ++  I
Sbjct: 490 GPCLFDIEADPCERSNLYAEYQNSTI 515



 Score = 52.8 bits (125), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 42/81 (51%), Gaps = 10/81 (12%)

Query: 747 KKYSNEEEGMRKLRDAASIQC-GPVKEV-PCEPQIAPCLFDIKNDPCEKNNLADRSEVQR 804
           ++ ++ E  + +LRD + I+C  P   V PC P   PCLFDI+ DPCE++NL        
Sbjct: 455 QQVTSGERNISELRDKSRIECPDPATGVKPCLPLEGPCLFDIEADPCERSNL-------- 506

Query: 805 INHYTTEVGYLDPKQRFNQIA 825
              Y     +LD  +R  Q A
Sbjct: 507 YAEYQNSTIFLDLWERIQQFA 527


>gi|346464549|gb|AEO32119.1| hypothetical protein [Amblyomma maculatum]
          Length = 531

 Score =  312 bits (800), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 161/343 (46%), Positives = 210/343 (61%), Gaps = 11/343 (3%)

Query: 48  MVFVDLVASSG---------PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           ++F+ +   +G         PPHI+FIL DDLGW D   HG  QIPTPN+DALA +G++L
Sbjct: 5   LLFISVACVAGSSVKTPFRKPPHIVFILVDDLGWADTSLHGSAQIPTPNLDALASTGVLL 64

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
            NYY   LC+PSR A+MTG +P H G++   L G +  GLPL  KILP++LK+LGY T I
Sbjct: 65  NNYYVQPLCSPSRGALMTGLYPAHNGIRMP-LVGAQVAGLPLQFKILPEHLKDLGYETHI 123

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
           VGKWHLG++   YTPT+RGF++  G+  G  DY+    E+    GLD             
Sbjct: 124 VGKWHLGYFNLNYTPTYRGFDTFFGFHNGPIDYYRGIMEQEGHVGLDFWNGTSALPLKER 183

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
            Y+T      A  II N +T +PLFLYLAH A HS    E LQAP        +I D  R
Sbjct: 184 TYATARLQNHAKSIIANRNTSKPLFLYLAHQAVHSVYSPEFLQAPVENTKKFPYIRDSSR 243

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGFNLNAASNWPLRGVK 337
              A ++  LD+SVG++VE LE+  +L ++I+VF +DNGG      + N   NWPLRG+K
Sbjct: 244 KILAGMMDALDQSVGELVEELEKAGILEDTILVFSTDNGGHPYTRADPNRGFNWPLRGIK 303

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            T+WEGG+RG   IWSP LE    V+ Q +H+SDWLPTL SAA
Sbjct: 304 GTVWEGGIRGTAFIWSPRLEVSRRVSTQLMHISDWLPTLYSAA 346



 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 45/177 (25%), Positives = 73/177 (41%), Gaps = 37/177 (20%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV------------KVVKVM 622
           +DGID+W  LS   PS R  +L   ++E   +AL     K+V            +V+  +
Sbjct: 356 LDGIDMWQSLSTGGPSPRIEVLLEFNNETDRAALRYLNHKIVLGTFSDEVSKRYEVIGGV 415

Query: 623 RYQVDLTG---------------GPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEV 667
           R  +DL                 G  Q++L G  D+ W        R  A+++C      
Sbjct: 416 RPGIDLDALQRNSKAGRVLKRFYGNPQLFLHGF-DKSW--------RRRATVRCNEHGRT 466

Query: 668 PCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKK 724
                    LFD+++DPCE  NLAD S    +     ++G + +   P+  +E + K
Sbjct: 467 NFNSSSKVFLFDLESDPCELRNLAD-SNVTLLTDLFVKLGAYLRTLRPEINKEIDPK 522


>gi|24666163|ref|NP_649020.1| CG7408, isoform B [Drosophila melanogaster]
 gi|281366395|ref|NP_001163462.1| CG7408, isoform C [Drosophila melanogaster]
 gi|281366397|ref|NP_001163463.1| CG7408, isoform D [Drosophila melanogaster]
 gi|23093214|gb|AAF49290.2| CG7408, isoform B [Drosophila melanogaster]
 gi|272455230|gb|ACZ94733.1| CG7408, isoform C [Drosophila melanogaster]
 gi|272455231|gb|ACZ94734.1| CG7408, isoform D [Drosophila melanogaster]
          Length = 585

 Score =  312 bits (799), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 164/371 (44%), Positives = 234/371 (63%), Gaps = 10/371 (2%)

Query: 21  LPQYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGL 80
           +  +L +    T ++   VL +A +       +VA+S  P+II I+ADDLG++DV F G 
Sbjct: 1   MSTHLDKFSSATSLLTGFVLCIALSNG-----IVATSDKPNIIIIMADDLGFDDVSFRGS 55

Query: 81  DQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL 140
           +   TPNIDALAYSG+IL N Y   +CTPSR+A++TGK+PI+TGMQH V+   +  GLPL
Sbjct: 56  NNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPL 115

Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
           +E  + +  +E GYRT ++GKWHLG  ++ +TPT RGF+ HLGY   + DY+  S E+  
Sbjct: 116 NETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHLGYLGAYVDYYTQSYEQQN 175

Query: 201 MW--GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH---STDEPLFLYLAHAATHSAN 255
               G D R  L+   D  G Y TD+ T  AV  I +H   ++ +PLFL L H A H+AN
Sbjct: 176 KGYNGHDFRDSLKSTHDHVGHYVTDLLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAAN 235

Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             +P+QAP   ++   +I +     +AA++ +LD+SVG V++AL ++ ML NSII+F+SD
Sbjct: 236 DDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVGSVIDALARQEMLQNSIILFLSD 295

Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
           NGG   G +   ASN+PLRG KN+ WEG +R +  IWS   E  G V +Q +++ D LPT
Sbjct: 296 NGGPTQGQHSTTASNYPLRGQKNSPWEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPT 355

Query: 376 LLSAANKSDIP 386
           L +AA  S  P
Sbjct: 356 LAAAAGISPDP 366



 Score = 59.7 bits (143), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 48/158 (30%), Positives = 70/158 (44%), Gaps = 45/158 (28%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL--TKGKWKLVKVVKVMRYQVDLTGGP 632
           +DG+++WS L     S    I+H ID++     L  T+GKWK++            +G  
Sbjct: 370 LDGLNLWSALKYGYESVEREIVHVIDEDVAEPHLSYTRGKWKVI------------SGTT 417

Query: 633 DQ-VYLSGLSDRE--------------------WLALA--------MRKLRDAASIQC-G 662
           +Q +Y   L  RE                    WL L         + +LRD + I+C  
Sbjct: 418 NQGLYDGWLGHRETSEVDPRAVEYEELVRNTSVWLQLQQVSFGERNISELRDQSRIECPD 477

Query: 663 PVKEV-PCEPQIAPCLFDIKNDPCEKNNLADRSEDQRI 699
           P   V PC P   PCLFDI+ DPCE++NL    ++  I
Sbjct: 478 PATGVKPCLPLEGPCLFDIEADPCERSNLYAEYQNSTI 515



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 52/121 (42%), Gaps = 10/121 (8%)

Query: 707 GRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
           G  NQ  Y       E  +   +  + ++  +      + ++ S  E  + +LRD + I+
Sbjct: 415 GTTNQGLYDGWLGHRETSEVDPRAVEYEELVRNTSVWLQLQQVSFGERNISELRDQSRIE 474

Query: 767 C-GPVKEV-PCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQI 824
           C  P   V PC P   PCLFDI+ DPCE++NL           Y     +LD   R  Q 
Sbjct: 475 CPDPATGVKPCLPLEGPCLFDIEADPCERSNL--------YAEYQNSTIFLDLWSRIQQF 526

Query: 825 A 825
           A
Sbjct: 527 A 527


>gi|195591201|ref|XP_002085331.1| GD14733 [Drosophila simulans]
 gi|194197340|gb|EDX10916.1| GD14733 [Drosophila simulans]
          Length = 585

 Score =  312 bits (799), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 163/371 (43%), Positives = 234/371 (63%), Gaps = 10/371 (2%)

Query: 21  LPQYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGL 80
           +  +L +    T ++   VL +A +       ++A+S  P+II I+ADDLG++DV F G 
Sbjct: 1   MSTHLDKFSSATSLLTGFVLFIALSNG-----IIATSDKPNIIIIMADDLGFDDVSFRGS 55

Query: 81  DQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL 140
           D   TPNIDALAYSG+IL N Y   +CTPSR+A++TGK+PI+TGMQH V+   +  GLP+
Sbjct: 56  DNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPI 115

Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
           +E  + +  +E GYRT ++GKWHLG  ++ +TPT RGF+ H GY   + DY+  S E+  
Sbjct: 116 NETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHFGYLGAYVDYYTQSYEQQN 175

Query: 201 MW--GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH---STDEPLFLYLAHAATHSAN 255
               G D R +L+   D  G Y TDV T  AV  I +H   ++ +PLFL L H A H+AN
Sbjct: 176 KGYNGHDFRDNLKSTHDYVGHYVTDVLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAAN 235

Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             +P+QAP   ++   +I +     +AA++ +LD+SVG V++AL ++ ML NSII+F+SD
Sbjct: 236 DDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVGSVIDALARQEMLQNSIILFLSD 295

Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
           NGG   G +   ASN+PLRG KN+ WEG +R +  IWS   E  G V +Q +++ D LPT
Sbjct: 296 NGGPTEGQHSTTASNYPLRGQKNSPWEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPT 355

Query: 376 LLSAANKSDIP 386
           L +AA  S  P
Sbjct: 356 LAAAAGISPDP 366



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 51/202 (25%), Positives = 74/202 (36%), Gaps = 68/202 (33%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL--TRGKWKLVKENSING--------N 559
           +DG+++WS L     S    I+H ID++  +  L  TRGKWK++   +  G         
Sbjct: 370 LDGLNLWSALKYGYESVEREIVHVIDEDVAVPHLSYTRGKWKVISGTTNEGLYDGWLGHR 429

Query: 560 GTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV 619
            TSE       Y+  I    VW  L +    +RN                         +
Sbjct: 430 ETSEVDPRAVEYEELIRNTSVWLQLQQVSSGERN-------------------------I 464

Query: 620 KVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQC-GPVKEV-PCEPQIAPCL 677
             +RYQ                               + I+C  P   V PC P   PCL
Sbjct: 465 SELRYQ-------------------------------SRIECPDPATGVKPCLPLEGPCL 493

Query: 678 FDIKNDPCEKNNLADRSEDQRI 699
           FDI+ DPCE++NL    ++  I
Sbjct: 494 FDIEADPCERSNLYAEYQNSTI 515



 Score = 49.3 bits (116), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 40/81 (49%), Gaps = 10/81 (12%)

Query: 747 KKYSNEEEGMRKLRDAASIQC-GPVKEV-PCEPQIAPCLFDIKNDPCEKNNLADRSEVQR 804
           ++ S+ E  + +LR  + I+C  P   V PC P   PCLFDI+ DPCE++NL        
Sbjct: 455 QQVSSGERNISELRYQSRIECPDPATGVKPCLPLEGPCLFDIEADPCERSNL-------- 506

Query: 805 INHYTTEVGYLDPKQRFNQIA 825
              Y     +LD   R  Q A
Sbjct: 507 YAEYQNSTIFLDLWARIQQFA 527


>gi|126697478|gb|ABO26696.1| sulfatase 1B precursor [Haliotis discus discus]
          Length = 382

 Score =  311 bits (798), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 156/342 (45%), Positives = 213/342 (62%), Gaps = 12/342 (3%)

Query: 42  LAFTLSMVFVDLVAS-SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
           L  T +++FV L  S +  PH++FI+ADDLGWNDVGF    Q+ TP++D LA +G+IL +
Sbjct: 6   LVATFTLIFVYLYGSQAAKPHVVFIVADDLGWNDVGFRN-PQVLTPHLDKLAKAGVILNS 64

Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
            Y   LC+PSR+  M+G  P HTG+Q  V+     G +P+    LPQ LKELGY T  VG
Sbjct: 65  SYVQPLCSPSRNCFMSGYFPYHTGLQDGVIRPASPGFVPIKFTFLPQKLKELGYSTHAVG 124

Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
           KWHLGF   +YTPT+RGF++ +GY+ G +DY+ H+ E  K  G D+R +     +  GKY
Sbjct: 125 KWHLGFCNLKYTPTYRGFDTFVGYYIGAEDYYKHTREYDKFSGYDLRFNTSVYTEAKGKY 184

Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
           ST VF   AVDII +H TD PL+LYL   A H+     PL+ P  Y N+++HI D  R  
Sbjct: 185 STRVFAERAVDIIKSHDTDTPLYLYLPFQAVHA-----PLEVPPEYENLYKHIHDLPRRT 239

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
           +  ++  LDE+V  +  ALE+  ++ N ++VF +DNGG   G    AA+N PLRG K TL
Sbjct: 240 YCGMISALDEAVANITNALEETGLIDNLLLVFTTDNGGPFNG----AANNLPLRGCKATL 295

Query: 341 WEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           WEGG +GAG I+S  LL+  G +    +H  DW PT +  A 
Sbjct: 296 WEGGTKGAGFIYSKTLLKKTGYLNTGMMHAVDWYPTFVELAG 337


>gi|241156195|ref|XP_002407716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215494207|gb|EEC03848.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 548

 Score =  311 bits (796), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 166/347 (47%), Positives = 213/347 (61%), Gaps = 20/347 (5%)

Query: 47  SMVFVDLVASS--GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
           S+VF+  VA +   PPHI+FILADDLGW+DV FHG  QIPTPNID LA  GI L NYY  
Sbjct: 22  SLVFLATVALAKKMPPHIVFILADDLGWDDVSFHGSSQIPTPNIDVLAADGITLHNYYVQ 81

Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
            +CTPSR+A+MTG +PI TGMQH V+   E  GLPL  K++P++LK+LGY T +VGK   
Sbjct: 82  PMCTPSRAALMTGLYPIRTGMQHWVIRSPEPWGLPLELKLMPEHLKDLGYSTHLVGKVLF 141

Query: 165 GFYK----------KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
              K           E T     F S +        YF + +      GLD R   EP  
Sbjct: 142 DLKKFIVSVNRLCINESTEVCHTFVSAVTLCI----YFVYKSHA----GLDFRNGEEPFH 193

Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
           +  G+Y+T +FT  A+ II  H+  +PLFLYL+H A H A   EPLQAPD  +    +I 
Sbjct: 194 NDTGQYATTLFTDRAISIIEQHNQTKPLFLYLSHLAPHGATHDEPLQAPDENVEKFDYIG 253

Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLR 334
           +  R+ +A ++  LD SVG+VVEAL +  ML NSI+ F SDNG    GF  N   NWPLR
Sbjct: 254 EEDRTIYAGMVDALDVSVGRVVEALSRAGMLENSIVAFSSDNGAVPFGFRSNRGFNWPLR 313

Query: 335 GVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           G+K ++WEGGVR  G +WSPLL     ++ Q +H++DWLPTL +AA 
Sbjct: 314 GIKASVWEGGVRVPGFVWSPLLRKSARLSTQMMHITDWLPTLYAAAG 360



 Score = 43.1 bits (100), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 47/176 (26%), Positives = 71/176 (40%), Gaps = 23/176 (13%)

Query: 573 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV--KVVKVMRYQVDLTG 630
            ++DG ++W  LSR   S R  IL NID     + L  GK KL+   V       V  TG
Sbjct: 367 GQLDGHNMWEALSRGWRSPRTEILLNIDPITGSAGLRFGKHKLLIPGVNGTTDAHVPTTG 426

Query: 631 GP------DQVYLSGLS----------DREWLALAMRKLRDAASIQCGPVK-EVPCEPQI 673
            P      D +    L+          D  W     R   ++A + CGP + +       
Sbjct: 427 YPRPTEDMDTMMQQSLAAKVLRRFYNVDGIWFR---RGWENSAEVDCGPAEWQSNFVSGR 483

Query: 674 APCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKK 729
              LFDI+ DPCE  NLA  +    +     ++  +     P + +EE+ +   +K
Sbjct: 484 VYNLFDIERDPCEMINLA-ATRRGTVRMMMRKLQSYAAAMIPPRNKEEDPRSFPEK 538


>gi|403182690|gb|EJY57566.1| AAEL017192-PA [Aedes aegypti]
          Length = 1007

 Score =  310 bits (793), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 161/380 (42%), Positives = 221/380 (58%), Gaps = 24/380 (6%)

Query: 31  RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
           R+++ A   L L +   M+         PP++I I+ADDLGWNDV FH   QI TPNID 
Sbjct: 427 RSQLAARQALLLIWLSLMIGKIKGVDRSPPNVIVIVADDLGWNDVSFHSSKQIFTPNIDV 486

Query: 91  LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           LAY G+IL  +Y     T S+ A+MTG HP+  G Q       +   L    K++P+Y +
Sbjct: 487 LAYHGVILNRHYCAPFGTASQVALMTGSHPLSVGTQSASNEPDQPWTLDPELKLMPEYFR 546

Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE--EMKMWGLDMRR 208
           + GY T ++GKW LGF +K+YTPT RGF+SH G+   + DY+DHS         GLDMRR
Sbjct: 547 DAGYATHLIGKWGLGFSRKDYTPTQRGFDSHFGFLGPYIDYWDHSLRLRNTSTRGLDMRR 606

Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
           +L+  + ++G Y+TD+F  EAV +I  H   +PL L L H A H+ N  +P+QAP   + 
Sbjct: 607 NLDVDYSVNGSYATDLFNGEAVRLIREHDQKKPLLLVLTHLAPHTGNEDDPMQAPAEEVE 666

Query: 269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
              +I D KR   AA++ K+DE VG++++ L +R ML NSI++F +DNG    G + N+ 
Sbjct: 667 KFDYIRDEKRRVLAAMISKVDEGVGQIIQTLAERDMLDNSIVLFYADNGAPTVGMHSNSG 726

Query: 329 SNWPLRGV---------------------KNTLWEGGVRGAGLIWSPLLE-SRGIVAEQY 366
           SN+PLRGV                     K + WEG VR    IWSPLL  + G V+ Q+
Sbjct: 727 SNFPLRGVCIDNWSTLHIRCTNGCFSFQQKYSPWEGAVRTVAAIWSPLLNLTAGRVSNQW 786

Query: 367 VHVSDWLPTLLSAANKSDIP 386
           +HVSDWLPTL  AA    IP
Sbjct: 787 IHVSDWLPTLAHAAGIEGIP 806



 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 69/135 (51%), Gaps = 18/135 (13%)

Query: 573 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV-------KVVKVMRYQ 625
           +EIDG + W  L     + RN +++NID+    S+ ++  WK V       K    M   
Sbjct: 809 SEIDGRNQWEALKNPAITVRNVVMNNIDELHHYSSYSRNGWKYVNGTSWEGKFDNWMGEL 868

Query: 626 VDLTGGPDQVY---LSGLSDREWLALAMR---KLRDAASIQC---GPVKEVPCEPQIAPC 676
            D     ++ Y   L+G      + L +    +LR AA+++C   G VK   C+P  +PC
Sbjct: 869 EDEDELSEEEYVVRLAGSVVGRMMPLDLEHVARLRRAATVECEVEGQVKA--CQPMESPC 926

Query: 677 LFDIKNDPCEKNNLA 691
           LF++ +DPCE+NN+A
Sbjct: 927 LFNLMDDPCERNNVA 941



 Score = 49.3 bits (116), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 34/47 (72%), Gaps = 5/47 (10%)

Query: 754 EGMRKLRDAASIQC---GPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
           E + +LR AA+++C   G VK   C+P  +PCLF++ +DPCE+NN+A
Sbjct: 897 EHVARLRRAATVECEVEGQVKA--CQPMESPCLFNLMDDPCERNNVA 941


>gi|307215079|gb|EFN89886.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 557

 Score =  310 bits (793), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 158/355 (44%), Positives = 229/355 (64%), Gaps = 16/355 (4%)

Query: 39  VLPLAFTLSMVFVDL-----VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           VL   F   ++  ++       S  PP+I+ I+ADDLGWNDV F+G D+IPTPNID+LAY
Sbjct: 11  VLLFIFVCDVILKNISRNSQATSKQPPNIVVIMADDLGWNDVSFNGGDEIPTPNIDSLAY 70

Query: 94  SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
           +G+IL  +Y + +CTPSR+A  TG++PI +GMQ   L G E   +PL+  +LPQYL++LG
Sbjct: 71  NGVILNRHYVLPICTPSRTAFFTGQYPIRSGMQGYPLQGAEPRSIPLNNILLPQYLRKLG 130

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y T +VGKWH+G+    +TPT RGF++ LGY++G+ +YF H+  E    G D+ R +   
Sbjct: 131 YATHLVGKWHVGYQTNNHTPTNRGFDTFLGYYSGYIEYFSHNLVENGQSGYDIHRSVGDN 190

Query: 214 WDLHGKYS--TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQ-----APDHY 266
             +  +Y   TD+ T EA +II +H+  +PL+L LAH A H++   + ++     A +  
Sbjct: 191 HTIEYRYDYMTDLITDEAENIISSHNPAKPLYLQLAHLAPHASTVDDVIEVRSWKATNDT 250

Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL- 325
           L    +I D  R KFA+ +  +DES+G+VV+AL +  ML NSIIVF+SDNG       L 
Sbjct: 251 LG---YIRDINRRKFASAVVTMDESIGRVVDALRRADMLKNSIIVFMSDNGAPTKDQILY 307

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           N  SN+PLRG+K + +EG VRG   I+SPL++    V+ Q  H++DWLPTL +AA
Sbjct: 308 NFGSNYPLRGMKQSFFEGAVRGVACIYSPLIDFPSRVSTQLFHITDWLPTLYAAA 362



 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 36/141 (25%), Positives = 64/141 (45%), Gaps = 17/141 (12%)

Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV----------- 616
           D S    +DGID WS++   +  +R  +L NI+++    A   G++KL+           
Sbjct: 365 DASDLKALDGIDQWSMIKNADNGRRQFLLVNINEKTDSKAALIGRYKLIRDQSEYTKYYG 424

Query: 617 ---KVVKVMRYQV-DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQ 672
              K      Y V ++   P    ++ +S        +++LR+ + + C   +   C  +
Sbjct: 425 STGKDPSYCEYNVTNVLASPVASAIADISRSSLDIDKIKQLREKSRVVCRVSRFSRCTKR 484

Query: 673 IAPCLFDIKNDPCEKNNLADR 693
              CLFDIK DPCE  N++ +
Sbjct: 485 T--CLFDIKKDPCETTNVSSK 503


>gi|195494699|ref|XP_002094950.1| GE19934 [Drosophila yakuba]
 gi|194181051|gb|EDW94662.1| GE19934 [Drosophila yakuba]
          Length = 591

 Score =  310 bits (793), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 160/353 (45%), Positives = 227/353 (64%), Gaps = 6/353 (1%)

Query: 40  LPLAFTLSMVFVD-LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           L   F L +   + ++A+S  P+II I+ADDLG++DV F G +   TPNIDALAYSG+IL
Sbjct: 14  LITGFVLCIALSNGIIATSDKPNIIIIMADDLGFDDVSFRGSNNFLTPNIDALAYSGVIL 73

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
            N Y   +CTPSR+A++TGK+PI+TGMQH V+   +  GLPL+E  + +  +E GYRT +
Sbjct: 74  NNLYVAPMCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSL 133

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW--GLDMRRDLEPAWDL 216
           +GKWHLGF ++ +TPT RGF+ H GY   + DY+  S E+      G D R +L+   D 
Sbjct: 134 LGKWHLGFSQRNFTPTQRGFDRHFGYLGAYVDYYTQSYEQQNKGYNGHDFRDNLKSTHDH 193

Query: 217 HGKYSTDVFTAEAVDIIHNH---STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
            G+Y TDV T  AV  I +H   ++ +PLFL L H A H+AN   P+QAP   ++   +I
Sbjct: 194 VGRYITDVLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDNPMQAPAEEVSRFEYI 253

Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
            +     +AA++ KLD+SVG V++AL ++ ML NSI++F+SDNGG   G +   ASN+PL
Sbjct: 254 GNKTHRYYAAMVSKLDQSVGYVIDALARQEMLQNSIVLFLSDNGGPTQGQHSTTASNYPL 313

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           RG K++ WEG +R +  IWS   E  G V +Q +++ D LPTL + A  S  P
Sbjct: 314 RGQKDSPWEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAGAGISPDP 366



 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 67/149 (44%), Gaps = 45/149 (30%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL--TKGKWKLVKVVKVMRYQVDLTGGP 632
           +DG+++WS L     S    I+H ID++  +  L  T+GKWK++            +G  
Sbjct: 370 LDGLNLWSALKYGYESVEREIVHVIDEDVAMPHLSYTRGKWKVI------------SGTT 417

Query: 633 DQ-VYLSGLSDRE--------------------WLALA--------MRKLRDAASIQCGP 663
           +Q +Y   L  RE                    W  L         + +LRD ASI+C  
Sbjct: 418 NQGLYDGWLGHRETSEVDPRAAVYEELVRNTSVWRQLQQVSFGERNISELRDQASIECPD 477

Query: 664 VKE--VPCEPQIAPCLFDIKNDPCEKNNL 690
                 PC P  APCLFDI+ DPCE++NL
Sbjct: 478 TANGVRPCLPLEAPCLFDIEADPCERSNL 506



 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 52/121 (42%), Gaps = 10/121 (8%)

Query: 707 GRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
           G  NQ  Y       E  +   +    ++  +     ++ ++ S  E  + +LRD ASI+
Sbjct: 415 GTTNQGLYDGWLGHRETSEVDPRAAVYEELVRNTSVWRQLQQVSFGERNISELRDQASIE 474

Query: 767 CGPVKE--VPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQI 824
           C        PC P  APCLFDI+ DPCE++NL           Y     +LD   R  Q 
Sbjct: 475 CPDTANGVRPCLPLEAPCLFDIEADPCERSNL--------YAEYKNSTTFLDLWARIQQF 526

Query: 825 A 825
           A
Sbjct: 527 A 527


>gi|390360193|ref|XP_788463.2| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 537

 Score =  309 bits (791), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 161/368 (43%), Positives = 226/368 (61%), Gaps = 23/368 (6%)

Query: 25  LKELGYRTRIMAFAVLPLAFTLSMVFVDLV--ASSGPPHIIFILADDLGWNDVGFHGLDQ 82
           LK L  +    AF V  L  T+ ++   L+  A+  PPHI+FI+ADD GW DVG+H    
Sbjct: 4   LKMLKVKRCSCAFLVEALTVTV-LICTGLIKGATGKPPHIVFIVADDYGWFDVGYHN-ST 61

Query: 83  IPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSE 142
           I TPN+D LA  G+ L+NYY   +C+PSRS +MTG++ IHTG+QH V+   +   LPL+E
Sbjct: 62  IKTPNLDLLASRGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHFVIIAPQPNCLPLNE 121

Query: 143 KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-------S 195
             LPQ LKE GY T +VGKWHLGFYK E  P  RGF+S  GY +G QDY+ H        
Sbjct: 122 TTLPQKLKESGYATHLVGKWHLGFYKNECMPLQRGFDSSFGYLSGMQDYWTHFRSGSFPG 181

Query: 196 AEEMKMW-GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA 254
             E   W G+D   +   AW+  G YS  VFT  A  +I  H+ ++PLFLYL   + H  
Sbjct: 182 FPEGNHWLGIDFWDNNRVAWEYTGNYSQFVFTERAQRVIQQHNPNQPLFLYLPLQSVHG- 240

Query: 255 NPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
               PLQ P+ Y+  + H +D  R  +A ++  +DE+VGKVV++L++  + +++++VF +
Sbjct: 241 ----PLQVPEKYMKPYAHFQDVGRQTYAGMVATMDEAVGKVVDSLQEAGLWNDTVLVFTT 296

Query: 315 DNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDW 372
           DNGG        + +NWPLRG KNTLWEGGV G G I  P++ +  +G V++ ++H+SDW
Sbjct: 297 DNGGTPG----KSGNNWPLRGTKNTLWEGGVHGVGFITGPMIPAGVQGTVSKHFMHISDW 352

Query: 373 LPTLLSAA 380
            PTL+   
Sbjct: 353 FPTLIEGV 360


>gi|346465011|gb|AEO32350.1| hypothetical protein [Amblyomma maculatum]
          Length = 500

 Score =  308 bits (790), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 159/343 (46%), Positives = 211/343 (61%), Gaps = 11/343 (3%)

Query: 48  MVFVDLVASSG---------PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           ++F+ +   +G         PPHI+FIL DDLGW D   HG  QIPTPN+DALA +G++L
Sbjct: 5   LLFISVACVTGTSAKTPYPTPPHIVFILVDDLGWADTSLHGSAQIPTPNLDALASTGVLL 64

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
            NYY   LC+PSR+A+MTG +P H G++  ++ G +  GLPL  KILP++LK+LGY T +
Sbjct: 65  NNYYVQPLCSPSRAALMTGLYPAHNGIRMPLM-GAQVAGLPLQFKILPEHLKDLGYETHM 123

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
           VGKWHLG     YTPT+RGF++  G++ G  DY+    E+    GLD             
Sbjct: 124 VGKWHLGHSSLNYTPTYRGFDTFFGFYNGPIDYYHGIMEQEGHIGLDFWNGTRALPLEER 183

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
            Y+T  F   A  II N +  +PLFLYLAH A HSA   E LQAP         + D  R
Sbjct: 184 IYATTRFQDHANYIIANRNASKPLFLYLAHQAVHSAYEPEFLQAPGENTKKFPFLGDASR 243

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGFNLNAASNWPLRGVK 337
              A ++  LD+SVG++VE LE+  +L ++I+VF +DNGG      + N   NWPLRG+K
Sbjct: 244 KSLAGMVDALDQSVGELVEELEKAGILEDTILVFSTDNGGHPYTRSDPNRGFNWPLRGIK 303

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            T+WEGG+RG   IWSP LE    V+ Q +H+SDWLPTL SAA
Sbjct: 304 GTVWEGGIRGTAFIWSPRLELSSRVSTQLMHISDWLPTLYSAA 346



 Score = 48.9 bits (115), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 42/146 (28%), Positives = 61/146 (41%), Gaps = 38/146 (26%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV------------KVVKVM 622
           +DGID+W  LS   PS R  +L   ++E   +AL     K+V            +VV  +
Sbjct: 356 LDGIDMWQSLSTGGPSPRTEVLLEFNNETDRAALRYLNHKIVLGIYSNEVSERYEVVGGV 415

Query: 623 RYQVDLT----------------GGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKE 666
           R  +DL                 G P Q++L G  D+ W        R  A+++C     
Sbjct: 416 RPGIDLDVLQRNSKAGRVLARFYGNP-QLFLRGF-DKSW--------RSRATVRCNENGP 465

Query: 667 VPCEPQIAPCLFDIKNDPCEKNNLAD 692
                     LFD++ DPCE +NLAD
Sbjct: 466 TNFNSSSQVFLFDLETDPCELHNLAD 491


>gi|195379278|ref|XP_002048407.1| GJ13952 [Drosophila virilis]
 gi|194155565|gb|EDW70749.1| GJ13952 [Drosophila virilis]
          Length = 574

 Score =  308 bits (788), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 152/325 (46%), Positives = 210/325 (64%), Gaps = 4/325 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P++I I+ADD+G++D+   G  +  TPNIDAL + G +L   Y   +CTPSR A+++GK+
Sbjct: 35  PNVIIIMADDMGFDDISLRGGREFLTPNIDALGFHGRLLDRLYAPAMCTPSRGALLSGKY 94

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTG QH V+   E   L L+  ++P+  +  GY T +VGKWHLGF + EYTPT RGF+
Sbjct: 95  PIHTGTQHFVISNEEPWSLMLNTTLMPEIFRSAGYSTNLVGKWHLGFARPEYTPTHRGFD 154

Query: 180 SHLGYWTGHQDYFDHSA---EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
            H GYW  + DY+   +   E+  + G D RR++E      G Y TD+ T EA  II   
Sbjct: 155 YHYGYWGAYIDYYQRRSQMPEKTYIVGYDFRRNMEVECTDRGVYVTDLLTNEAERIIQET 214

Query: 237 STDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
           +  + PLFL + H ATH+AN  +PLQAP+  +    HI+D    K+AA++ KLD+SVG+V
Sbjct: 215 AAKQKPLFLMINHLATHTANDNDPLQAPEEEVQKFLHIKDPNHRKYAAMVSKLDQSVGRV 274

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
           + AL +   L NSI++F SDNG  + G   N  SNWPL+G KN+ WEGGVR AG+IWS L
Sbjct: 275 ITALARAEQLENSIVIFYSDNGAPSVGMFANTGSNWPLKGQKNSPWEGGVRVAGVIWSQL 334

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
           L +RG +  Q ++V D+LPTL  AA
Sbjct: 335 LTARGHLFTQPIYVGDFLPTLAHAA 359



 Score = 66.6 bits (161), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 64/140 (45%), Gaps = 23/140 (16%)

Query: 574 EIDGIDVWSVL--SRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV------------ 619
           ++DGID+W  L  S++ P     ILH++DD    S+L  G+WK V               
Sbjct: 368 QLDGIDLWPQLAGSKDAPHVPREILHSLDDVSHFSSLLLGQWKYVNGTTLAGQFDQQLTH 427

Query: 620 --------KVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
                   +  RY V++   P    L+    +      M  LR  A+++CG ++   C  
Sbjct: 428 RELDELDPRARRYVVEVRNSPASKALARYDLQRLTQHRMHTLRRLATVRCGDMQR-GCNA 486

Query: 672 QIAPCLFDIKNDPCEKNNLA 691
            +  CL+D+  DPCE NNLA
Sbjct: 487 LLEECLYDLSVDPCELNNLA 506



 Score = 47.8 bits (112), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 25/95 (26%), Positives = 47/95 (49%), Gaps = 3/95 (3%)

Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           TT  G+F+Q     + +E + + ++   + +     K   +   ++ +     M  LR  
Sbjct: 415 TTLAGQFDQQLTHRELDELDPRARRYVVEVRNSPASKALARYDLQRLTQHR--MHTLRRL 472

Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 797
           A+++CG ++   C   +  CL+D+  DPCE NNLA
Sbjct: 473 ATVRCGDMQR-GCNALLEECLYDLSVDPCELNNLA 506


>gi|156359506|ref|XP_001624809.1| predicted protein [Nematostella vectensis]
 gi|156211610|gb|EDO32709.1| predicted protein [Nematostella vectensis]
          Length = 488

 Score =  308 bits (788), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 161/345 (46%), Positives = 215/345 (62%), Gaps = 20/345 (5%)

Query: 43  AFTLSMVFVDLV--ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
           A +L ++   ++  A++ PPHI+FIL DDLGW D+G+HG   I TPNI+ LA  GIIL N
Sbjct: 6   ALSLVLLSAQILSEANAIPPHIVFILVDDLGWFDLGYHG-SVIRTPNINQLAGDGIILDN 64

Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
           YY   LCTP+RSA+MTGK+PIH G QH V+   +  GLPL    LP+ LK+ GY T IVG
Sbjct: 65  YYVQPLCTPTRSALMTGKYPIHLGTQHGVILPGQPMGLPLDSSTLPEQLKQQGYATHIVG 124

Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
           KWHLGFYK+++ PT RGF+S  GY+ G +D+F H+        LD R +     D  G Y
Sbjct: 125 KWHLGFYKEDFVPTKRGFDSFYGYYCGAEDHFTHNVLGF----LDFRDNDLIVKDQKGTY 180

Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
            T  FT  AVD IH H++  PLFLYL     H      P+QAP  Y++ +  I+D  R  
Sbjct: 181 GTRAFTKRAVDTIHRHNSSSPLFLYLPFQNVHG-----PVQAPPEYIDKYSFIKDKTRRT 235

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG--AAAGFNLNAASNWPLRGVKN 338
            AA++  +DE++G V  AL+   +  N+++VF +DNGG   A G+      N+PLRG KN
Sbjct: 236 HAAMVDIMDEAIGNVTSALKSAGLWENTLLVFSTDNGGIHTAGGY------NYPLRGEKN 289

Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           TLWEGGVRGAG +  P+    G++    +HV+DW PTL+  A  S
Sbjct: 290 TLWEGGVRGAGFVSGPMAPRHGMIYNGLMHVTDWYPTLVHLAGGS 334


>gi|260803290|ref|XP_002596523.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
 gi|229281781|gb|EEN52535.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
          Length = 492

 Score =  307 bits (787), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 151/332 (45%), Positives = 208/332 (62%), Gaps = 22/332 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+FILADD GWND+G+H    I TPN+D LA  G+ L+NYY   +C+PSR+ +MTG++
Sbjct: 16  PHIVFILADDYGWNDIGYHS-SLIQTPNLDRLAQEGVKLENYYIQPICSPSRAQLMTGRY 74

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            I  GMQH+VL      GLPL E  LPQ LKE GY T IVGKWHLG +KKEY PT+RGF+
Sbjct: 75  QIRYGMQHSVLMSDRPHGLPLGEVTLPQVLKESGYATHIVGKWHLGHFKKEYLPTWRGFD 134

Query: 180 SHLGYWTGHQDYFDH--------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
           +  G+  G +DYF H        + E  +    D     +P    +G YST VF  +++D
Sbjct: 135 TFFGFLGGGEDYFTHRIPNEIVETPETYR--AFDFWDGSKPCLSENGSYSTHVFARKSID 192

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           +I  H  D+P+FLYL   A H+     PL+AP+ ++N + HI       +A +   +DE+
Sbjct: 193 LISRHDKDKPMFLYLPFQAVHA-----PLEAPEEFINKYTHIRSKNMRTYAGVTTAMDEA 247

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           VG +  AL+   M  N++++F +DNG + AG      SNWPLRG KNTLWEGGVRG G +
Sbjct: 248 VGNITRALKDHGMWENTVLIFSTDNGASKAG------SNWPLRGFKNTLWEGGVRGVGFV 301

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            SPLL+ +  +++  +H+SDW PT++  A  S
Sbjct: 302 SSPLLKVKQRISDALLHISDWFPTIVRIAGGS 333



 Score = 40.8 bits (94), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 22/36 (61%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH+VL      GLPL E  LPQ LKE GY T I+ 
Sbjct: 80  MQHSVLMSDRPHGLPLGEVTLPQVLKESGYATHIVG 115



 Score = 40.0 bits (92), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 31/108 (28%), Positives = 48/108 (44%), Gaps = 24/108 (22%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID--------DEWQI-----------SALTRGKWKL 550
           +DG DVW  +S   PS R  ILHNID        D  +I           +A+    WKL
Sbjct: 340 LDGYDVWDTISEGTPSPRTEILHNIDPLIAATNSDASKIHGHSTFNTSVKAAIRVRDWKL 399

Query: 551 VKENSINGNGTSENRSNDNSYQNEIDGI---DVWSVLSRNEPSKRNTI 595
           +  N   GNG  + ++ D   + +  G+    +W    R++P +R  +
Sbjct: 400 LTGNP--GNGDWQLQTFDGKIKEKSTGVRGKQLWLFNIRSDPQERKDV 445


>gi|449684458|ref|XP_002164438.2| PREDICTED: arylsulfatase I-like, partial [Hydra magnipapillata]
          Length = 784

 Score =  306 bits (785), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 153/311 (49%), Positives = 209/311 (67%), Gaps = 2/311 (0%)

Query: 71  GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
           GW+DV FHG  QIPTPNID+LA SG+IL NYY       +++  MTGK+  H G QH VL
Sbjct: 1   GWDDVSFHGSPQIPTPNIDSLAKSGVILNNYYVSPSSFATKTEFMTGKYATHLGTQHGVL 60

Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH-Q 189
           +  +  GLP +EKILPQYLKE GY    VGKW LG+YK+E  P  RGF+   G  T   +
Sbjct: 61  HNKQPFGLPHTEKILPQYLKEAGYNNYAVGKWALGYYKEEMLPWKRGFDFFYGGLTSSGK 120

Query: 190 DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHA 249
           DY+ HSA + + +GLD+RR+ E   +  G Y T+V+T EAV+II N++ ++PLFLY+AH 
Sbjct: 121 DYYTHSAFD-ENYGLDLRRNNEVIHNETGNYITEVYTREAVNIIKNYNDNKPLFLYVAHQ 179

Query: 250 ATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSI 309
           A H+ N  +PLQAP+ YL    HI++ KR  FA ++  LDESV  +  AL Q+ +L N++
Sbjct: 180 AVHTGNADDPLQAPESYLKKLNHIKNIKRKLFAGMVLALDESVLNITIALAQKGLLDNTV 239

Query: 310 IVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHV 369
           IVF +D GGA  G  L++ASN+PLRG K T+WEGG+R    ++S L++++G V+ +  H 
Sbjct: 240 IVFTTDAGGAVGGQELSSASNFPLRGSKLTVWEGGIRAVAFVYSDLIKNKGRVSLEMYHS 299

Query: 370 SDWLPTLLSAA 380
           +DWL T L  A
Sbjct: 300 TDWLLTTLGLA 310



 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 62/128 (48%), Gaps = 17/128 (13%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQ-ISALTKGKWKLVKVVKVMRYQVDLTGGP 632
           ++DG DVW  +S  E S R  ILH  D   + + +L  G +KLV + + + +  D    P
Sbjct: 318 KVDGFDVWKSISEGEVSPRFEILHQFDPLTKNLCSLRVGDYKLV-INQDIGFYGDWYPRP 376

Query: 633 DQVYLSGLSDREWLALAMRKLRDAASIQCG-----PVKEV---PCEPQIAPCLFDIKNDP 684
            ++        E   L   +    A + C      P  E+   PC+P   PC+F+I+ DP
Sbjct: 377 AEI-------GELRYLKRIETLPNAKVTCDKNYPHPFLEMHAPPCDPMKKPCMFNIQWDP 429

Query: 685 CEKNNLAD 692
           CE +NLA+
Sbjct: 430 CEFHNLAE 437



 Score = 41.2 bits (95), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 19/29 (65%), Positives = 22/29 (75%)

Query: 2  QHNVLYGCERGGLPLSEKILPQYLKELGY 30
          QH VL+  +  GLP +EKILPQYLKE GY
Sbjct: 56 QHGVLHNKQPFGLPHTEKILPQYLKEAGY 84


>gi|22450123|emb|CAD33828.1| glucosinolate sulphatase [Plutella xylostella]
          Length = 547

 Score =  306 bits (785), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 151/335 (45%), Positives = 214/335 (63%), Gaps = 10/335 (2%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           V+++  PH+I I+ADD+GW+D   HG   + TPN+D L  SG+ L  YYT  LC+P+R+A
Sbjct: 17  VSAATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTA 76

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TGK+    GMQ   L   E  G+PL E+++ QYL++ GYRT++VGKWH+G    E  P
Sbjct: 77  VLTGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLP 136

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEM----KMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           T+RGFE+H G   G  DY++++A+E      + GL +  DL+P W   G Y TDV+T ++
Sbjct: 137 TYRGFENHFGVRGGFIDYYEYNAQEQLDGRPVTGLCLFDDLQPDWTTEG-YITDVYTEKS 195

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
             II NH+  EPL+L L H A H+ N    LQAP   +   RH+E   R  FAA++ KLD
Sbjct: 196 TTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAAMVKKLD 255

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           +S+G++V  LE++ ML N+II F +DNG    G   N+ SN+PLRGVK + WEGG+RG  
Sbjct: 256 DSIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANSGSNYPLRGVKKSPWEGGIRGNA 315

Query: 350 LIW-----SPLLESRGIVAEQYVHVSDWLPTLLSA 379
           +IW     +P    RG V +  +H +DW+PTLL A
Sbjct: 316 MIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEA 350



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 57/183 (31%), Positives = 79/183 (43%), Gaps = 30/183 (16%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV----KVMRYQVDLTG 630
           +DGI +WS +  N+PS R  I   IDD +  S++T G+ KLVK          Y  DL G
Sbjct: 359 LDGIPMWSHIIENKPSPRTEIF-EIDDYFNHSSVTLGRHKLVKGTIDESLSKHYGEDLRG 417

Query: 631 --GPDQVYLSGLSD-REWLALAMRKL---------RDAASIQCGPVKEVPCEPQIAP-CL 677
             G    Y   L D + W +L    +         RD A + CG V   PC P     CL
Sbjct: 418 IIGTPPDYKQKLRDSKAWESLETIGIPLDADVMADRDEAIVTCGNVVPKPCSPSAESWCL 477

Query: 678 FDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKK 737
           +DI  DPCE  +L++            E+ +  QI     E+EE K   ++ +     K 
Sbjct: 478 YDIIEDPCELRDLSE------------ELPQLAQILLYRLEQEEAKIIPREGQYVADPKS 525

Query: 738 KKK 740
             K
Sbjct: 526 APK 528



 Score = 43.5 bits (101), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 51/126 (40%), Gaps = 9/126 (7%)

Query: 680 IKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKK 739
           I+N P  +  + +   D   NH +  +GR   +     E   +   +  +         K
Sbjct: 369 IENKPSPRTEIFEI--DDYFNHSSVTLGRHKLVKGTIDESLSKHYGEDLRGIIGTPPDYK 426

Query: 740 KKKKKKKKKYSNEEEGM------RKLRDAASIQCGPVKEVPCEPQIAP-CLFDIKNDPCE 792
           +K +  K   S E  G+         RD A + CG V   PC P     CL+DI  DPCE
Sbjct: 427 QKLRDSKAWESLETIGIPLDADVMADRDEAIVTCGNVVPKPCSPSAESWCLYDIIEDPCE 486

Query: 793 KNNLAD 798
             +L++
Sbjct: 487 LRDLSE 492


>gi|149059062|gb|EDM10069.1| arylsulfatase B [Rattus norvegicus]
          Length = 517

 Score =  306 bits (785), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 148/334 (44%), Positives = 217/334 (64%), Gaps = 25/334 (7%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           ++ PPH++F+LADDLGWND+GFHG   I TP++DALA  G++L NYY   LCTPSRS ++
Sbjct: 36  AAPPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLL 94

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           TG++ IH G+QH ++  C+   +PL EK+LPQ LK+ GY T +VGKWHLG Y+KE  PT 
Sbjct: 95  TGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTR 154

Query: 176 RGFESHLGYWTGHQDYFDHSAEE------MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           RGF+++ GY  G +DY+ H A             LD+R   EPA +    YST++FT  A
Sbjct: 155 RGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRA 214

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
             +I NH  ++ +              ++PLQ P+ Y+  +  I+D  R  +A ++  LD
Sbjct: 215 TTLIANHPPEKSV--------------HDPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLD 260

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           E+VG V +AL+ R + +N++++F +DNGG       +  +NWPLRG K TLWEGG+RGAG
Sbjct: 261 EAVGNVTKALKSRGLWNNTVLIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGAG 316

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            + SPLL+ +G+ + + +H++DWLPTL++ A  S
Sbjct: 317 FVASPLLKQKGVKSRELMHITDWLPTLVNLAGGS 350


>gi|22450115|emb|CAC86338.1| glucosinolate sulfatase [Plutella xylostella]
          Length = 532

 Score =  306 bits (784), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 151/335 (45%), Positives = 214/335 (63%), Gaps = 10/335 (2%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           V+++  PH+I I+ADD+GW+D   HG   + TPN+D L  SG+ L  YYT  LC+P+R+A
Sbjct: 17  VSAATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTA 76

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TGK+    GMQ   L   E  G+PL E+++ QYL++ GYRT++VGKWH+G    E  P
Sbjct: 77  VLTGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLP 136

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEM----KMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           T+RGFE+H G   G  DY++++A+E      + GL +  DL+P W   G Y TDV+T ++
Sbjct: 137 TYRGFENHFGVRGGFIDYYEYNAQEQLDGRPVTGLCLFDDLQPDWTTEG-YITDVYTEKS 195

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
             II NH+  EPL+L L H A H+ N    LQAP   +   RH+E   R  FAA++ KLD
Sbjct: 196 TTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAAMVKKLD 255

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           +S+G++V  LE++ ML N+II F +DNG    G   N+ SN+PLRGVK + WEGG+RG  
Sbjct: 256 DSIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANSGSNYPLRGVKKSPWEGGIRGNA 315

Query: 350 LIW-----SPLLESRGIVAEQYVHVSDWLPTLLSA 379
           +IW     +P    RG V +  +H +DW+PTLL A
Sbjct: 316 MIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEA 350



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 57/183 (31%), Positives = 79/183 (43%), Gaps = 30/183 (16%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV----KVMRYQVDLTG 630
           +DGI +WS +  N+PS R  I   IDD +  S++T G+ KLVK          Y  DL G
Sbjct: 359 LDGIPMWSHIIENKPSPRTEIF-EIDDYFNHSSVTLGRHKLVKGTIDESLSKHYGEDLRG 417

Query: 631 --GPDQVYLSGLSD-REWLALAMRKL---------RDAASIQCGPVKEVPCEPQIAP-CL 677
             G    Y   L D + W +L    +         RD A + CG V   PC P     CL
Sbjct: 418 IIGTPPDYKQKLRDSKAWESLETIGIPLDADVMADRDEAIVTCGNVVPKPCSPSAESWCL 477

Query: 678 FDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKK 737
           +DI  DPCE  +L++            E+ +  QI     E+EE K   ++ +     K 
Sbjct: 478 YDIIEDPCELRDLSE------------ELPQLAQILLYRLEQEEAKIIPREGQYVSDPKS 525

Query: 738 KKK 740
             K
Sbjct: 526 APK 528



 Score = 43.1 bits (100), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 51/126 (40%), Gaps = 9/126 (7%)

Query: 680 IKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKK 739
           I+N P  +  + +   D   NH +  +GR   +     E   +   +  +         K
Sbjct: 369 IENKPSPRTEIFEI--DDYFNHSSVTLGRHKLVKGTIDESLSKHYGEDLRGIIGTPPDYK 426

Query: 740 KKKKKKKKKYSNEEEGM------RKLRDAASIQCGPVKEVPCEPQIAP-CLFDIKNDPCE 792
           +K +  K   S E  G+         RD A + CG V   PC P     CL+DI  DPCE
Sbjct: 427 QKLRDSKAWESLETIGIPLDADVMADRDEAIVTCGNVVPKPCSPSAESWCLYDIIEDPCE 486

Query: 793 KNNLAD 798
             +L++
Sbjct: 487 LRDLSE 492


>gi|22450117|emb|CAC86342.1| glucosinolate sulfatase [Plutella xylostella]
 gi|22450119|emb|CAC86343.1| glucosinolate sulfatase [Plutella xylostella]
          Length = 532

 Score =  306 bits (784), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 151/335 (45%), Positives = 214/335 (63%), Gaps = 10/335 (2%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           V+++  PH+I I+ADD+GW+D   HG   + TPN+D L  SG+ L  YYT  LC+P+R+A
Sbjct: 17  VSAATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTA 76

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TGK+    GMQ   L   E  G+PL E+++ QYL++ GYRT++VGKWH+G    E  P
Sbjct: 77  VLTGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLP 136

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEM----KMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           T+RGFE+H G   G  DY++++A+E      + GL +  DL+P W   G Y TDV+T ++
Sbjct: 137 TYRGFENHFGVRGGFIDYYEYNAQEQLDGRPVTGLCLFDDLQPDWTTEG-YITDVYTEKS 195

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
             II NH+  EPL+L L H A H+ N    LQAP   +   RH+E   R  FAA++ KLD
Sbjct: 196 TTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAAMVKKLD 255

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           +S+G++V  LE++ ML N+II F +DNG    G   N+ SN+PLRGVK + WEGG+RG  
Sbjct: 256 DSIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANSGSNYPLRGVKKSPWEGGIRGNA 315

Query: 350 LIW-----SPLLESRGIVAEQYVHVSDWLPTLLSA 379
           +IW     +P    RG V +  +H +DW+PTLL A
Sbjct: 316 MIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEA 350



 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 57/183 (31%), Positives = 79/183 (43%), Gaps = 30/183 (16%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVV----KVMRYQVDLTG 630
           +DGI +WS +  N+PS R  I   IDD +  S++T G+ KLVK          Y  DL G
Sbjct: 359 LDGIPMWSHIIENKPSPRTEIF-EIDDYFNHSSVTLGRHKLVKGTIDESLSKHYGEDLRG 417

Query: 631 --GPDQVYLSGLSD-REWLALAMRKL---------RDAASIQCGPVKEVPCEPQIAP-CL 677
             G    Y   L D + W +L    +         RD A + CG V   PC P     CL
Sbjct: 418 IIGTPPDYKQKLRDSKAWESLETIGIPLDADVMADRDEAIVTCGNVVPKPCSPSAESWCL 477

Query: 678 FDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKK 737
           +DI  DPCE  +L++            E+ +  QI     E+EE K   ++ +     K 
Sbjct: 478 YDIIEDPCELRDLSE------------ELPQLAQILLYRLEQEEAKIIPREGQYVADPKS 525

Query: 738 KKK 740
             K
Sbjct: 526 APK 528



 Score = 43.1 bits (100), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 51/126 (40%), Gaps = 9/126 (7%)

Query: 680 IKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKK 739
           I+N P  +  + +   D   NH +  +GR   +     E   +   +  +         K
Sbjct: 369 IENKPSPRTEIFEI--DDYFNHSSVTLGRHKLVKGTIDESLSKHYGEDLRGIIGTPPDYK 426

Query: 740 KKKKKKKKKYSNEEEGM------RKLRDAASIQCGPVKEVPCEPQIAP-CLFDIKNDPCE 792
           +K +  K   S E  G+         RD A + CG V   PC P     CL+DI  DPCE
Sbjct: 427 QKLRDSKAWESLETIGIPLDADVMADRDEAIVTCGNVVPKPCSPSAESWCLYDIIEDPCE 486

Query: 793 KNNLAD 798
             +L++
Sbjct: 487 LRDLSE 492


>gi|391345592|ref|XP_003747069.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 557

 Score =  306 bits (784), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 150/328 (45%), Positives = 212/328 (64%), Gaps = 5/328 (1%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           P +++ ILADD+GWND  FHG  +IPTPN+DALA SG+IL+++Y   +CTP+R+A++TG 
Sbjct: 79  PTNVVMILADDMGWNDASFHGSAEIPTPNLDALASSGVILQSHYAQPMCTPTRAALLTGL 138

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +P HTGMQ+ V+   E  GLPL  KILP YL E  Y + +VGKWHLG +   + PT R F
Sbjct: 139 YPFHTGMQNFVIRTGEPWGLPLDYKILPHYLDEAYYHSHLVGKWHLGMHNPAFLPTARHF 198

Query: 179 ESHLGYWTGHQDYFDH---SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
            +H+GY+ G  DYF H   S     + GLD    +    +    Y+T +FT  AV++I N
Sbjct: 199 NTHVGYYNGFIDYFTHEHISPGNDSLIGLDWH--INEENENEEGYATHLFTKRAVNLIEN 256

Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
           H + EPLF+ L+H A H+    +P QAP   +    HI+D  R  +AA++ +LD+SVG+V
Sbjct: 257 HKSTEPLFILLSHLAPHAGCKRDPFQAPRESIEKFAHIKDQNRKVYAAMVDELDQSVGQV 316

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
           VEAL ++R+L +++IVF+SDNGG   G   N  SN+P RG K TL+EGG R +  +WS  
Sbjct: 317 VEALYRKRLLDSTMIVFLSDNGGQTTGVMNNTGSNYPFRGQKRTLFEGGTRVSAFVWSTD 376

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAANKS 383
           +  +  +    +HV+DWLPT+L  A  S
Sbjct: 377 IVKKPRIESGLMHVTDWLPTILKRAGLS 404



 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 48/148 (32%), Positives = 71/148 (47%), Gaps = 30/148 (20%)

Query: 570 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLT 629
           SY  E+DG D WS+LS   PS R  +L+N++   Q +AL   ++KL+      R  V L 
Sbjct: 404 SYPRELDGRDQWSMLSEGAPSARAEVLYNLNYVDQSAALRNDRFKLIVGP---RNGVSLL 460

Query: 630 GGPDQVY--------LSGLSDREWLALAMRKL-------------------RDAASIQCG 662
              DQ++        L G+ D    A  +++                    RD A I+CG
Sbjct: 461 NDWDQLHKGKHSRAQLDGMMDESLTARTLKEAGLWWKMRHGDTLSADHSTWRDKAEIKCG 520

Query: 663 PVKEVPCEPQIAPCLFDIKNDPCEKNNL 690
              +  C+   +PCLFD++NDPCE NN+
Sbjct: 521 SHDKTLCDVHRSPCLFDLENDPCELNNV 548



 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 26/37 (70%)

Query: 760 RDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 796
           RD A I+CG   +  C+   +PCLFD++NDPCE NN+
Sbjct: 512 RDKAEIKCGSHDKTLCDVHRSPCLFDLENDPCELNNV 548


>gi|443709644|gb|ELU04236.1| hypothetical protein CAPTEDRAFT_53259, partial [Capitella teleta]
          Length = 476

 Score =  306 bits (783), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 158/337 (46%), Positives = 217/337 (64%), Gaps = 21/337 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+ I+ADDLG +DVG+HG   I TPNID LAY+G+ L+NYY   +CTPSRS +MTG++
Sbjct: 1   PNILLIVADDLGHHDVGYHG-SVIKTPNIDHLAYTGVRLENYYVQPICTPSRSQLMTGRY 59

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QHN++   +   +PL   +LP+ LK+ GY T +VGKWHLGFYK E  P  RGF+
Sbjct: 60  QIHTGLQHNIINPFQPNAVPLDLPMLPEVLKQNGYSTHMVGKWHLGFYKDEVLPMNRGFD 119

Query: 180 SHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
           S+ GY TG +DYF H            + G+D+R D E  W+  GKYST +F  +A D++
Sbjct: 120 SYYGYLTGSEDYFTHRRCGALPGANKTVCGIDLRNDFEVDWNQTGKYSTQLFAEKAEDVV 179

Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
             H+    D+PLFLY+A  A H+ N     Q P+ YL  +  I+D KR   A ++  +DE
Sbjct: 180 RKHAVHQPDQPLFLYVAFQAVHAPN-----QVPNEYLKPYD-IDDPKRRLLAGMVTCMDE 233

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
           +VG +  A +   +  N+++VF +DNGG     N N+ +NWPLRG K++LWEGG+RG G 
Sbjct: 234 AVGNITAAFKDTGLWDNTVVVFTTDNGGDV---NENSGNNWPLRGWKHSLWEGGMRGVGF 290

Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTL--LSAANKSDI 385
           + SPLL+S G      +HVSDW PTL  L+ AN S I
Sbjct: 291 VNSPLLKSFGYPVNGLMHVSDWFPTLLNLAGANTSSI 327


>gi|148668607|gb|EDL00926.1| arylsulfatase B [Mus musculus]
          Length = 556

 Score =  305 bits (781), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 147/335 (43%), Positives = 216/335 (64%), Gaps = 25/335 (7%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
            ++ PPH++F+LADDLGWND+GFHG   I TP++DALA  G++L NYY   LCTPSRS +
Sbjct: 72  GATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQL 130

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TG++ IH G+QH ++  C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT
Sbjct: 131 LTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPT 190

Query: 175 FRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
            RGF+++ GY  G +DY+ H A    E +      LD+R   EPA + +  YST++FT  
Sbjct: 191 RRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKR 250

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
           A  +I NH  ++ +              ++PLQ P+ Y+  +  I+D  R  +A ++  +
Sbjct: 251 ATTVIANHPPEKSV--------------HDPLQVPEEYMEPYGFIQDKHRRIYAGMVSLM 296

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           DE+VG V +AL+   + +N++ +F +DNGG       +  +NWPLRG K TLWEGG+RG 
Sbjct: 297 DEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGT 352

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           G + SPLL+ +G+ + + +H++DWLPTL+  A  S
Sbjct: 353 GFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGS 387



 Score = 40.0 bits (92), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH ++  C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 141 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVG 176


>gi|391325967|ref|XP_003737498.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 513

 Score =  305 bits (780), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 152/359 (42%), Positives = 219/359 (61%), Gaps = 12/359 (3%)

Query: 39  VLPLAFTL-SMVFVDLVASSG---PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           +L   F L S    D+   SG    P+++ I+ DDLGW+D+G HG  QIPTPNID LA  
Sbjct: 4   ILFFLFCLRSFGTCDISGPSGETKAPNVVLIVVDDLGWDDIGLHGSSQIPTPNIDKLAEE 63

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           G++L NYYT  +CTPSR+++MTGK+P+  G+QH+V+      GLP + KI+PQYL +  Y
Sbjct: 64  GVVLDNYYTQPICTPSRASLMTGKYPVRLGLQHDVISAATPFGLPSNFKIMPQYLHDKNY 123

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK------MWGLDMRR 208
              IVGKWHLG  + E+ PT RGF+ H GY  G  D++ H   +         +GLD+  
Sbjct: 124 DCHIVGKWHLGHSRSEFLPTRRGFKDHFGYRLGSSDHYSHYGADDSDVPGSLFYGLDLWH 183

Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
           +  PA + +GKYSTD++T  + DI+  H+   PLFLYLA+ A H+ NP + LQAP   ++
Sbjct: 184 NEVPAKEFNGKYSTDIYTHRSTDILRMHNKSRPLFLYLAYQAVHAGNPDQALQAPQSIVD 243

Query: 269 -IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
                I + +R ++AA++  +D ++G V+ A+       N+++ F +DNGG     + + 
Sbjct: 244 RFSSSIRNDRRRRYAAMVSAVDTAIGNVMGAIRANGFAGNTLVFFTNDNGGPINANDRSP 303

Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
            SN+PLR  K TLWEGGVRG G+ W+P +   G       H+ D LPT+LSAA  S +P
Sbjct: 304 GSNYPLRAGKFTLWEGGVRGTGIFWAPQVLKPGKFG-GLSHIVDVLPTILSAAGMSSVP 361


>gi|395851353|ref|XP_003798225.1| PREDICTED: arylsulfatase J [Otolemur garnettii]
          Length = 661

 Score =  305 bits (780), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 154/331 (46%), Positives = 211/331 (63%), Gaps = 12/331 (3%)

Query: 52  DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
            + A++  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSR
Sbjct: 130 SITAATSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSR 188

Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
           S  +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE 
Sbjct: 189 SQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKEC 248

Query: 172 TPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEA 229
            PT RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T   
Sbjct: 249 MPTKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRV 308

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
             I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LD
Sbjct: 309 QQILASHNPTKPIFLYIAYQAVHS-----PLQAPVRYFEHYRSIVNINRRRYAAMLSCLD 363

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           E++  V  AL+     +NSII++ SDNGG  A       SNWPLRG K T WEGG+R  G
Sbjct: 364 EAIHNVTLALKAYGFYNNSIIIYSSDNGGQPAA----GGSNWPLRGSKGTYWEGGIRAIG 419

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            + SPLL+++G V  + VH++DW PTL+S A
Sbjct: 420 FVHSPLLKNKGTVCRELVHITDWYPTLISLA 450


>gi|100801472|emb|CAJ18095.1| arylsulfatase J [Homo sapiens]
          Length = 596

 Score =  304 bits (779), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 70  TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388


>gi|58477551|gb|AAH89445.1| ARSJ protein [Homo sapiens]
          Length = 578

 Score =  304 bits (779), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 70  TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388


>gi|109389362|ref|NP_078866.3| arylsulfatase J precursor [Homo sapiens]
 gi|74722580|sp|Q5FYB0.1|ARSJ_HUMAN RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
 gi|58201086|gb|AAW66666.1| arylsulfatase J [Homo sapiens]
 gi|124376924|gb|AAI32880.1| ARSJ protein [Homo sapiens]
 gi|124376926|gb|AAI32882.1| ARSJ protein [Homo sapiens]
 gi|219521550|gb|AAI44266.1| ARSJ protein [Homo sapiens]
          Length = 599

 Score =  304 bits (779), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 70  TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388


>gi|403275516|ref|XP_003929486.1| PREDICTED: arylsulfatase J [Saimiri boliviensis boliviensis]
          Length = 601

 Score =  304 bits (779), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 155/329 (47%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 72  TTSASQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 130

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 131 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 190

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 191 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDSDNGIYSTQMYTQRVQQ 250

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +PLFLYLA+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 251 ILASHNPTKPLFLYLAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 305

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 306 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 361

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 362 HSPLLKNKGTVCKELVHITDWYPTLISLA 390


>gi|402870284|ref|XP_003899162.1| PREDICTED: arylsulfatase J [Papio anubis]
          Length = 597

 Score =  304 bits (779), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 70  TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 189 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388


>gi|426345299|ref|XP_004040357.1| PREDICTED: arylsulfatase J [Gorilla gorilla gorilla]
          Length = 599

 Score =  304 bits (779), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 70  TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388


>gi|410038636|ref|XP_526667.3| PREDICTED: arylsulfatase J isoform 2 [Pan troglodytes]
 gi|410210212|gb|JAA02325.1| arylsulfatase family, member J [Pan troglodytes]
 gi|410253696|gb|JAA14815.1| arylsulfatase family, member J [Pan troglodytes]
 gi|410298378|gb|JAA27789.1| arylsulfatase family, member J [Pan troglodytes]
 gi|410351985|gb|JAA42596.1| arylsulfatase family, member J [Pan troglodytes]
          Length = 598

 Score =  304 bits (779), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 70  TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388


>gi|397519903|ref|XP_003830091.1| PREDICTED: arylsulfatase J [Pan paniscus]
          Length = 596

 Score =  304 bits (779), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 70  TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388


>gi|149698442|ref|XP_001503367.1| PREDICTED: arylsulfatase J [Equus caballus]
          Length = 598

 Score =  304 bits (778), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 154/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
            AS+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 70  TASTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 189 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H   +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 249 ILASHDPRKPIFLYIAYQAVHS-----PLQAPSRYFEHYRSIVNINRRRYAAMLSCLDEA 303

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388


>gi|291231206|ref|XP_002735556.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 516

 Score =  304 bits (778), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 151/347 (43%), Positives = 221/347 (63%), Gaps = 17/347 (4%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L FT  +V +    S  P HI+FILADD G++D+G+H    I TPN+D LA  G+ L+NY
Sbjct: 22  LFFTTLIVLLLRPVSPEPTHIVFILADDYGFHDIGYHD-SIIKTPNLDRLASEGVKLENY 80

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y   +CTP+RS +M+G++ IHTG+QH +++ C+   LP++E  +PQ LKE GY T IVGK
Sbjct: 81  YVQPICTPTRSQLMSGRYQIHTGLQHGIIWPCQPSCLPINEVTIPQKLKESGYATHIVGK 140

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL---EPAWDLHG 218
           WHLG YKKE  PT RGF++  GY TG +DY+ H+    K  G+D R ++   +P ++  G
Sbjct: 141 WHLGMYKKECLPTERGFDTFFGYLTGSEDYYTHNRSYDKFHGMDFRENMQIVQPQYN--G 198

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
           +YST VF  +A +II +H    PLFLYL   A H      PLQ PD Y   + +I + +R
Sbjct: 199 QYSTHVFAEKAKNIIKSHDPQIPLFLYLPFQAVHG-----PLQVPDQYEKPYANITNKQR 253

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
             +A ++  +DE+VG + + L+   +  NS+I+F +DNGG      +   +NWPLRG K 
Sbjct: 254 RTYAGMVSCMDEAVGNITQTLKDTGLYDNSVIIFSTDNGGQV----IAGGNNWPLRGWKG 309

Query: 339 TLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAANKS 383
           +LWEGG+ G G + SPL+  ++RG  +++ +HV+DW PT++  A  S
Sbjct: 310 SLWEGGMHGVGFVNSPLINHQARGTTSKELIHVTDWFPTIVHLAGGS 356


>gi|109075466|ref|XP_001096903.1| PREDICTED: arylsulfatase J [Macaca mulatta]
          Length = 596

 Score =  304 bits (778), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 70  TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 189 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388


>gi|355687554|gb|EHH26138.1| hypothetical protein EGK_16035 [Macaca mulatta]
          Length = 599

 Score =  304 bits (778), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 70  TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 189 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388


>gi|291232535|ref|XP_002736216.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 784

 Score =  304 bits (778), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 161/345 (46%), Positives = 218/345 (63%), Gaps = 19/345 (5%)

Query: 42  LAFTLSMVFVDL-VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
           L + L +   D+ V     PHIIFILADDLGW+DVG+HG   I TPNIDALA  G+ L N
Sbjct: 23  LVYVLILEVTDIHVNGRQKPHIIFILADDLGWSDVGYHG-SVIKTPNIDALASEGVKLDN 81

Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
           YY   LCTPSR  +++G++ IHTG+QH  +   +   LP+ E ILPQ LKE GY T +VG
Sbjct: 82  YYMSLLCTPSRGQLLSGRYEIHTGLQHRTIDMMQPLCLPIDETILPQKLKERGYATHMVG 141

Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP-AWDLHGK 219
           KWHLGFY+KE  P +RGF++ +G++ G  DY+ H+       G D RR+ +  A    G+
Sbjct: 142 KWHLGFYRKECLPNYRGFDTFMGFYQGMADYYYHNISTGIYHGWDFRRNNDVIAQKYAGQ 201

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI--EDFK 277
           YST VF  EA  II  H+ + PLFL+L+  A H      PLQ P  Y ++++ +   +  
Sbjct: 202 YSTHVFADEAQIIIMKHNPEVPLFLFLSFQAIHL-----PLQVPSRYADMYKTLIPNNAD 256

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           R K+A ++  +DE+VG+VV AL+   +  NS++VF +DNGG   G       NWPLRG K
Sbjct: 257 RQKYAGMVTCMDEAVGRVVTALKHTGLWENSVLVFSTDNGGILGG-------NWPLRGSK 309

Query: 338 NTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAA 380
            +L+EGGVRG G + SPLL S  RG V  + +H++DWLPTL+  A
Sbjct: 310 ASLYEGGVRGVGFVTSPLLPSRVRGTVNRELLHMTDWLPTLVRLA 354


>gi|441658369|ref|XP_003269374.2| PREDICTED: arylsulfatase J [Nomascus leucogenys]
          Length = 597

 Score =  304 bits (778), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 69  TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 127

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 128 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 187

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 188 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 247

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 248 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 302

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 303 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 358

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 359 HSPLLKNKGTVCKELVHITDWYPTLISLA 387


>gi|355749520|gb|EHH53919.1| hypothetical protein EGM_14634 [Macaca fascicularis]
          Length = 599

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 70  TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 189 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388


>gi|198415046|ref|XP_002127641.1| PREDICTED: similar to Arylsulfatase B precursor (ASB)
           (N-acetylgalactosamine-4-sulfatase) (G4S) [Ciona
           intestinalis]
          Length = 522

 Score =  303 bits (776), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 155/343 (45%), Positives = 221/343 (64%), Gaps = 20/343 (5%)

Query: 48  MVFVDLVASSG--PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ 105
           +VFV ++  SG   P+I+ I+ADDLG+NDVG+H  D I TPNI+ LA  G+IL++YY   
Sbjct: 13  LVFVQVLYVSGHQKPNILLIVADDLGFNDVGYHNPD-IYTPNINKLAKDGVILESYYVQP 71

Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
           +CTPSRS +MTG++ IHTG+QH+V++  +   LP+ E ILPQ LKE GY T  VGKWHLG
Sbjct: 72  ICTPSRSQLMTGRYQIHTGLQHSVIFAPQPNCLPVDEIILPQKLKEAGYTTHAVGKWHLG 131

Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDY--------FDHSAEEMKMWGLDMRRDLEPAWDLH 217
           FYKKE  PT RGF++  GY+ G +DY        F    +  ++ G D   +    W+ +
Sbjct: 132 FYKKECLPTSRGFDTFYGYYCGAEDYYTKQVHANFHFGNKTRRVSGFDFHDNSRTEWEAN 191

Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
           G YS+ ++   AV II +H++  PLF+YL   + H      PLQ P  Y+  +RHI+D K
Sbjct: 192 GTYSSYLYRDRAVRIIKSHNSSIPLFMYLPFQSVHF-----PLQVPAKYIKRYRHIKDRK 246

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           R  F+A++  +DE++G VV+AL+ + +  ++++VF +DNGG      L   +NWPLRG K
Sbjct: 247 RRTFSAMVTAMDEAIGSVVDALKWKGIWQDTLVVFTTDNGGQT----LFGGNNWPLRGRK 302

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            +LWEGGVRG GL+    +  +G  + + VH+SDW PTLL  A
Sbjct: 303 ASLWEGGVRGVGLVRGYGIRDKGRSSNELVHISDWFPTLLYIA 345


>gi|291401248|ref|XP_002717219.1| PREDICTED: arylsulfatase J [Oryctolagus cuniculus]
          Length = 601

 Score =  303 bits (776), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 210/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
            A++  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 72  AAATSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 130

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 131 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 190

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 191 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDSAAWDHDNGIYSTQMYTQRVQQ 250

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 251 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 305

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           V  V  AL+     +NSI+++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 306 VSNVTLALKTYGFYNNSILIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 361

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 362 HSPLLKNKGTVCKELVHITDWYPTLISLA 390


>gi|296195717|ref|XP_002745502.1| PREDICTED: arylsulfatase J [Callithrix jacchus]
          Length = 605

 Score =  303 bits (775), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 76  TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 134

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 135 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 194

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 195 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQK 254

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 255 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 309

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 310 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 365

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 366 HSPLLKNKGTVCKELVHITDWYPTLISLA 394


>gi|195128415|ref|XP_002008659.1| GI13615 [Drosophila mojavensis]
 gi|193920268|gb|EDW19135.1| GI13615 [Drosophila mojavensis]
          Length = 576

 Score =  303 bits (775), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 148/325 (45%), Positives = 208/325 (64%), Gaps = 4/325 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II I+ADD+G++D+   G  +  TPNIDALA+ G +L   Y   +CTPSR A+++GK+
Sbjct: 35  PNIIIIMADDMGFDDLSIRGGREFLTPNIDALAFHGRLLDRLYAPAMCTPSRGALLSGKY 94

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTG QH V+   E   L L+  ++P+  +  GY T +VGKWHLG+ + E+TPT RGF+
Sbjct: 95  PIHTGTQHFVISNQEPWSLKLNTTLMPEIFRAAGYSTNLVGKWHLGYARPEFTPTHRGFD 154

Query: 180 SHLGYWTGHQDYFDHSAE---EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
            H GYW  + DY+   ++   +  + G D RR++E      G Y TD+ T EA  +I   
Sbjct: 155 YHYGYWGAYIDYYQRRSQMPDKTYIMGYDFRRNMEVECADRGVYMTDLLTNEAERVIQET 214

Query: 237 ST-DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
           +   +PLFL + H A H+ N  +PLQ P+  L    HI+D    K+AA++ KLD+SVG+V
Sbjct: 215 AAKQQPLFLMINHLAVHTGNDNDPLQVPEEELQKFTHIKDPNHRKYAAMVSKLDKSVGRV 274

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
           + AL +   L NSI++F SDNG  + G   N  SNWPL+G KN+ WEGG+R AG IWSPL
Sbjct: 275 IGALARAEQLENSIVIFYSDNGAPSVGMFANTGSNWPLKGQKNSPWEGGLRVAGAIWSPL 334

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
           L++R  V  Q ++V D+LPTL  AA
Sbjct: 335 LKARSSVFTQPIYVGDFLPTLAHAA 359



 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 40/140 (28%), Positives = 62/140 (44%), Gaps = 23/140 (16%)

Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM--------- 622
           ++DGID+W  LS  +  P     ILH++DD W +S+L  G WK V    +          
Sbjct: 368 QLDGIDLWPQLSGPKEAPHVPREILHSLDDVWNVSSLLLGHWKYVNGTTLAGQYDQQLVQ 427

Query: 623 -----------RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEP 671
                      RY V++        L+    +      M+ +   A + CG ++   C  
Sbjct: 428 RELDDLDPRANRYVVEVRNSLASRALARFDQQRLTQHRMKNIMIDAKVDCGDMQR-GCNA 486

Query: 672 QIAPCLFDIKNDPCEKNNLA 691
            +  CL+D+  DPCE+NNLA
Sbjct: 487 LLEECLYDMSVDPCERNNLA 506


>gi|338713661|ref|XP_003362935.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Equus
           caballus]
          Length = 523

 Score =  302 bits (774), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 150/328 (45%), Positives = 208/328 (63%), Gaps = 16/328 (4%)

Query: 62  IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
           ++++L DD+G +DVG H   +  TP +DALA  G++L NYYT  LC PSRS +++G++ I
Sbjct: 37  LVYMLTDDMGXHDVGLHE-SRFSTPRLDALAAGGLLLDNYYTQPLCXPSRSQLLSGRYQI 95

Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
           HTG+QH +++ C+   LPL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT RGF+++
Sbjct: 96  HTGLQHQIIWPCQPSCLPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTY 155

Query: 182 LGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
            GY  G +DY+ H       A  +    LD R   E A      YST VFT  A  +I N
Sbjct: 156 FGYLLGSEDYYSHERCTFIDALNVTRCALDFRDGEEVATGYKNMYSTSVFTERATALITN 215

Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
           H  ++PLFLYLA  + H     +PLQ P+ YL  +  I+D  R  +A ++  LDE+VG V
Sbjct: 216 HPPEKPLFLYLALQSVH-----QPLQVPEEYLKPYDFIQDKNRYHYAGMVSLLDEAVGNV 270

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
             AL+ R + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG G + SPL
Sbjct: 271 TAALKSRGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASPL 326

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAANKS 383
           L+ +G+   + +H+SDWLPTL+  A  S
Sbjct: 327 LKQKGVKNRELIHISDWLPTLVKLAGGS 354



 Score = 43.9 bits (102), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   LPL EK+LPQ LKE GY T ++ 
Sbjct: 99  LQHQIIWPCQPSCLPLDEKLLPQLLKEAGYTTHMVG 134


>gi|355669614|gb|AER94587.1| arylsulfatase family, member J [Mustela putorius furo]
          Length = 600

 Score =  302 bits (773), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 154/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
            A++  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 70  TAATSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDRLAAEGVKLENYYVQPICTPSRSQ 128

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 189 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H   +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 249 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSIIV+ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 304 INNVTLALKTYGFYNNSIIVYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388


>gi|344239533|gb|EGV95636.1| Arylsulfatase J [Cricetulus griseus]
          Length = 571

 Score =  302 bits (773), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 152/323 (47%), Positives = 208/323 (64%), Gaps = 12/323 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS  +TGK+
Sbjct: 48  PHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKY 106

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  PT RGF+
Sbjct: 107 QIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFD 166

Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHS 237
           +  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     I+ +H 
Sbjct: 167 TFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHD 226

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
             +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++  V  
Sbjct: 227 PTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEAIHNVTL 281

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL++    +NSII++ SDNGG      +   SNWPLRG K T WEGG+R  G + SPLL+
Sbjct: 282 ALKRYGFYNNSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLK 337

Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
           ++G V ++ VH++DW PTL+S A
Sbjct: 338 NKGTVCKELVHITDWYPTLISLA 360


>gi|354502405|ref|XP_003513277.1| PREDICTED: arylsulfatase J [Cricetulus griseus]
          Length = 597

 Score =  302 bits (773), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 152/323 (47%), Positives = 208/323 (64%), Gaps = 12/323 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS  +TGK+
Sbjct: 74  PHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKY 132

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  PT RGF+
Sbjct: 133 QIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFD 192

Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHS 237
           +  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     I+ +H 
Sbjct: 193 TFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHD 252

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
             +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++  V  
Sbjct: 253 PTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEAIHNVTL 307

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL++    +NSII++ SDNGG      +   SNWPLRG K T WEGG+R  G + SPLL+
Sbjct: 308 ALKRYGFYNNSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLK 363

Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
           ++G V ++ VH++DW PTL+S A
Sbjct: 364 NKGTVCKELVHITDWYPTLISLA 386


>gi|426231237|ref|XP_004009646.1| PREDICTED: arylsulfatase J [Ovis aries]
          Length = 599

 Score =  301 bits (772), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           V +   PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 71  VTAVSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 129

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 130 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 189

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 190 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 249

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H   +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 250 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEA 304

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 305 INNVTLALKMYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 360

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 361 HSPLLKNKGTVCKELVHITDWYPTLISLA 389


>gi|344277505|ref|XP_003410541.1| PREDICTED: arylsulfatase J [Loxodonta africana]
          Length = 599

 Score =  301 bits (771), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 152/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             ++  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 70  TTATSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 189 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDRAAWDYDNGIYSTQMYTQRVQQ 248

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 249 ILASHNPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLACLDEA 303

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  ALE     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 304 INNVTLALETYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL++ A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLITLA 388


>gi|119893510|ref|XP_611819.3| PREDICTED: arylsulfatase J [Bos taurus]
 gi|297475606|ref|XP_002688145.1| PREDICTED: arylsulfatase J [Bos taurus]
 gi|296486797|tpg|DAA28910.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase-like [Bos taurus]
          Length = 599

 Score =  301 bits (771), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           V +   PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 71  VTALSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 129

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 130 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 189

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 190 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGVYSTQMYTQRVQQ 249

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H   +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 250 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEA 304

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 305 INNVTLALKMYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAIGFV 360

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 361 HSPLLKNKGTVCKELVHITDWYPTLISLA 389


>gi|291231208|ref|XP_002735557.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 490

 Score =  301 bits (771), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 149/333 (44%), Positives = 214/333 (64%), Gaps = 17/333 (5%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           SS P HI+FILADD G++D+G+H    I TPN+D LA  G+ L+NYY    CTP+RS +M
Sbjct: 10  SSSPTHIVFILADDYGFHDIGYHD-SIIKTPNLDRLASEGVKLENYYVQPKCTPTRSQLM 68

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           +G++ IHTG+QH +++ C+   LP++E  +PQ LKE GY T IVGKWHLG YKKE  PT 
Sbjct: 69  SGRYQIHTGLQHGIIWPCQPSCLPINEVTIPQKLKESGYATHIVGKWHLGMYKKECLPTE 128

Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL---EPAWDLHGKYSTDVFTAEAVDI 232
           RGF++  GY TG +DY+ H+    K  G+D R ++   +P ++  G+YST VF  +A +I
Sbjct: 129 RGFDTFFGYLTGSEDYYTHNRSYDKFHGMDFRENMQIVQPQYN--GQYSTHVFAEKAKNI 186

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I +H    PLFLYL   A H      PLQ PD Y   + +I + +R  +A ++  +DE+V
Sbjct: 187 IKSHDPQIPLFLYLPLHAVHG-----PLQVPDQYEKPYTNITNKQRRTYAGMVSCMDEAV 241

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G + + LE   +  NS+I+F +DNGG          +NWPLRG K +LWEGG+ G G + 
Sbjct: 242 GNITQTLEDAGLYDNSVIIFSTDNGGQVTA----GGNNWPLRGWKGSLWEGGMHGVGFVN 297

Query: 353 SPLL--ESRGIVAEQYVHVSDWLPTLLSAANKS 383
           SPL+  ++RG  +++ +HV+DW PT++  A  S
Sbjct: 298 SPLINHQARGTTSKELIHVTDWFPTIVHLAGGS 330


>gi|281339106|gb|EFB14690.1| hypothetical protein PANDA_011975 [Ailuropoda melanoleuca]
          Length = 595

 Score =  301 bits (771), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 153/323 (47%), Positives = 206/323 (63%), Gaps = 12/323 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS  +TGK+
Sbjct: 76  PHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKY 134

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  PT RGF+
Sbjct: 135 QIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFD 194

Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHS 237
           +  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     I+ +H 
Sbjct: 195 TFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHD 254

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
             +PLFLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++  V  
Sbjct: 255 PRKPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTL 309

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G + SPLL+
Sbjct: 310 ALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLK 365

Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
           ++G V ++ VH++DW PTL+S A
Sbjct: 366 NKGTVCKELVHITDWYPTLISLA 388


>gi|301775027|ref|XP_002922933.1| PREDICTED: arylsulfatase J-like [Ailuropoda melanoleuca]
          Length = 600

 Score =  300 bits (769), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 153/323 (47%), Positives = 206/323 (63%), Gaps = 12/323 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS  +TGK+
Sbjct: 76  PHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKY 134

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  PT RGF+
Sbjct: 135 QIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFD 194

Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHS 237
           +  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     I+ +H 
Sbjct: 195 TFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHD 254

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
             +PLFLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++  V  
Sbjct: 255 PRKPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTL 309

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G + SPLL+
Sbjct: 310 ALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLK 365

Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
           ++G V ++ VH++DW PTL+S A
Sbjct: 366 NKGTVCKELVHITDWYPTLISLA 388


>gi|198466297|ref|XP_002135151.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
 gi|198150535|gb|EDY73778.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
          Length = 548

 Score =  300 bits (769), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 150/328 (45%), Positives = 215/328 (65%), Gaps = 6/328 (1%)

Query: 66  LADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGM 125
           +ADD+G++DV F G +   TPNIDALAYSG+IL N YT  +CTPSR+A++TGK+PI+TGM
Sbjct: 1   MADDMGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGM 60

Query: 126 QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYW 185
           QH V+   +  GLP  EK + +  +E GY T ++GKWHLG  ++ +TPT RGF+ HLGY 
Sbjct: 61  QHYVIVNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYL 120

Query: 186 TGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS---TD 239
             + DY+D + ++       G D R +L    D  G Y TDV +  AV++I  HS   + 
Sbjct: 121 GAYVDYYDQTYQQNGKNYARGHDFRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSS 180

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           +PLFL L+H A H+AN  +P+QAP   L    +I +     +AA++ +LD SVG+V++AL
Sbjct: 181 QPLFLLLSHLAPHAANADDPMQAPAEELAKFEYIRNETHRHYAAMVSRLDRSVGRVIDAL 240

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
            +++ML +SI++F+SDNGG   G +   ASN+PLRG KN+ WEGG+R +  IWS   E  
Sbjct: 241 ARQQMLQDSIVLFLSDNGGPTVGEHSTTASNYPLRGQKNSPWEGGLRSSAAIWSTEFERL 300

Query: 360 GIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           G V +Q  ++ D LPTL +AA  +  P+
Sbjct: 301 GTVWKQQTYIGDLLPTLAAAAGIAPDPD 328



 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/143 (34%), Positives = 67/143 (46%), Gaps = 29/143 (20%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS--ALTKGKWKLVKVV------------ 619
           ++DG+++WS L     S    I+HNID +      A T+GKWK+V               
Sbjct: 330 KLDGMNLWSALKYGYESLEREIVHNIDMDPSAPYLAYTRGKWKVVNGTTNAALFDGWLGE 389

Query: 620 KVMRYQVDLTGGPDQVYLSGLSDREWLALAM--------RKLRDAASIQC----GPVKEV 667
           +V R ++D      +  +   S  + L L M          LR  A I+C    G  K  
Sbjct: 390 RVTR-EMDPRSANYEELIRNTSVWQHLQLLMGSQNERNITSLRQQAQIECPEPSGATK-- 446

Query: 668 PCEPQIAPCLFDIKNDPCEKNNL 690
           PC P +APCLFDI  DPCE+NNL
Sbjct: 447 PCLPLLAPCLFDIDADPCERNNL 469



 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 33/80 (41%), Positives = 39/80 (48%), Gaps = 14/80 (17%)

Query: 750 SNEEEGMRKLRDAASIQC----GPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRI 805
           S  E  +  LR  A I+C    G  K  PC P +APCLFDI  DPCE+NNL +       
Sbjct: 421 SQNERNITSLRQQAQIECPEPSGATK--PCLPLLAPCLFDIDADPCERNNLYE------- 471

Query: 806 NHYTTEVGYLDPKQRFNQIA 825
             Y     +LD  QR  Q A
Sbjct: 472 -DYRNTTIFLDLLQRIEQFA 490


>gi|114326206|ref|NP_001041581.1| arylsulfatase J [Canis lupus familiaris]
 gi|81158066|tpe|CAI85007.1| TPA: arylsulfatase J [Canis lupus familiaris]
          Length = 598

 Score =  300 bits (769), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 152/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             ++  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 68  TTATSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 126

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 127 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 186

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 187 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 246

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H   +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 247 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 301

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 302 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 357

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 358 HSPLLKNKGTVCKELVHITDWYPTLISLA 386


>gi|195166553|ref|XP_002024099.1| GL22854 [Drosophila persimilis]
 gi|194107454|gb|EDW29497.1| GL22854 [Drosophila persimilis]
          Length = 548

 Score =  300 bits (768), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 150/328 (45%), Positives = 216/328 (65%), Gaps = 6/328 (1%)

Query: 66  LADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGM 125
           +ADD+G++DV F G +   TPNIDALAYSG+IL N YT  +CTPSR+A++TGK+PI+TGM
Sbjct: 1   MADDMGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGM 60

Query: 126 QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYW 185
           QH V+   +  GLP  EK + +  +E GY T ++GKWHLG  ++ +TPT RGF+ HLGY 
Sbjct: 61  QHYVIVNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYL 120

Query: 186 TGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS---TD 239
             + DY+D + ++       G D R +L    D  G Y TDV +  AV++I  HS   + 
Sbjct: 121 GAYVDYYDQTYQQNGKNYARGHDFRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSS 180

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           +PLFL L+H A H+AN  +P+QAP   L    +I +     +AA++ +LD SVG+V++AL
Sbjct: 181 QPLFLLLSHLAPHAANADDPMQAPAEELAKFEYIRNETHRYYAAMVSRLDRSVGRVIDAL 240

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
            +++ML +SI++F+SDNGG   G + + ASN+PLRG KN+ WEGG+R +  IWS   E  
Sbjct: 241 ARQQMLQDSIVLFLSDNGGPTVGEHSSTASNYPLRGQKNSPWEGGLRSSAAIWSTEFERL 300

Query: 360 GIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           G V +Q  ++ D LPTL +AA  +  P+
Sbjct: 301 GTVWKQQTYIGDLLPTLAAAAGIAPDPD 328



 Score = 67.0 bits (162), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 66/141 (46%), Gaps = 25/141 (17%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS--ALTKGKWKLVKVV------------ 619
           ++DG+++WS L     S    I+HNID +      A T+GKWK+V               
Sbjct: 330 KLDGMNLWSALKYGYESLEREIVHNIDMDPSAPYLAYTRGKWKVVNGTTNAALFDGWLGE 389

Query: 620 KVMRYQVDLTGGPDQVYLSGLSDREWLALAM--------RKLRDAASIQCG--PVKEVPC 669
           +V R +VD      +  +   S  + L L M          LR  A I+C   P    PC
Sbjct: 390 RVTR-EVDPRSANYEELIRNTSVWQHLQLLMGSQNERNITSLRQQAQIECPEPPGATKPC 448

Query: 670 EPQIAPCLFDIKNDPCEKNNL 690
            P +APCLFDI  DPCE+NNL
Sbjct: 449 LPLLAPCLFDIDADPCERNNL 469



 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 38/78 (48%), Gaps = 10/78 (12%)

Query: 750 SNEEEGMRKLRDAASIQCG--PVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINH 807
           S  E  +  LR  A I+C   P    PC P +APCLFDI  DPCE+NNL +         
Sbjct: 421 SQNERNITSLRQQAQIECPEPPGATKPCLPLLAPCLFDIDADPCERNNLYE--------D 472

Query: 808 YTTEVGYLDPKQRFNQIA 825
           Y     +LD  QR  Q A
Sbjct: 473 YRNTTIFLDLLQRIEQFA 490


>gi|68437903|ref|XP_692213.1| PREDICTED: arylsulfatase I-like [Danio rerio]
          Length = 562

 Score =  300 bits (768), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 147/324 (45%), Positives = 206/324 (63%), Gaps = 11/324 (3%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHI+F++ DD G+ND+G+HG  +I TP +D LA  G+ L+NYY   +C+PSRS +MTG+
Sbjct: 29  PPHIVFVMVDDQGYNDIGYHG-SEIQTPVLDQLAGEGVKLENYYVQPICSPSRSQLMTGR 87

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH+++   +   LP     LP+ L+E GY T +VGKWHLGF   E  PT RGF
Sbjct: 88  YQIHTGLQHSIIRARQPLCLPPDTPTLPERLQEAGYSTHMVGKWHLGFCHPECLPTSRGF 147

Query: 179 ESHLGYWTGHQDYFD-HSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           +S LG  TG  D+F   S +  +  G D+     PAW+L G YST ++T    DI+  H 
Sbjct: 148 QSFLGSLTGSGDHFSFQSCDGTEACGFDLHDGDRPAWELRGNYSTRLYTERVKDILRRHD 207

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
             +PLFLY+A  A H+     PLQAP H L  ++ + +  R  +AA++  +DESVG++V 
Sbjct: 208 HRKPLFLYVALQAVHT-----PLQAPGHLLRRYQALGNRPRRHYAAMVSGVDESVGEIVS 262

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
            L +R    NS++++ SDNGG      L+   NWPLRG K + WEGGVR  G + SPLL+
Sbjct: 263 ELRERGYYDNSVLIYSSDNGGQP----LSGGCNWPLRGGKGSYWEGGVRAVGFVHSPLLK 318

Query: 358 SRGIVAEQYVHVSDWLPTLLSAAN 381
            +G+V+   +H+SDW PTLLS A 
Sbjct: 319 RKGVVSHALIHISDWYPTLLSLAG 342


>gi|114145538|ref|NP_001041352.1| arylsulfatase J [Rattus norvegicus]
 gi|81158024|tpe|CAI84986.1| TPA: arylsulfatase J [Rattus norvegicus]
 gi|149025900|gb|EDL82143.1| arylsulfatase J [Rattus norvegicus]
          Length = 597

 Score =  300 bits (768), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 152/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
            A +  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 68  TAVTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 126

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+K+  P
Sbjct: 127 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMP 186

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   + G D+  +   AWD  +G YST ++T     
Sbjct: 187 TKRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 246

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H   +PLFLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 247 ILASHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 301

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL++    +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 302 IHNVTLALKRYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 357

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 358 HSPLLKNKGTVCKELVHITDWYPTLISLA 386


>gi|37182416|gb|AAQ89010.1| APRG372 [Homo sapiens]
          Length = 515

 Score =  300 bits (768), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 152/329 (46%), Positives = 207/329 (62%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 70  TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGF +KE  P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFNRKECMP 188

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 249 ILASHNPTKPIFLYTAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388


>gi|148680337|gb|EDL12284.1| arylsulfatase J [Mus musculus]
          Length = 572

 Score =  300 bits (768), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 152/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
            A +  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 42  TAGTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 100

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+K+  P
Sbjct: 101 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMP 160

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   + G D+  +   AWD  +G YST ++T     
Sbjct: 161 TKRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 220

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+  H   +PLFLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 221 ILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 275

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL++    +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 276 IHNVTLALKRYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 331

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 332 HSPLLKNKGTVCKELVHITDWYPTLISLA 360


>gi|301609482|ref|XP_002934299.1| PREDICTED: arylsulfatase J-like [Xenopus (Silurana) tropicalis]
          Length = 564

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 213/323 (65%), Gaps = 12/323 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+FILADD G+ D+G+HG  +I TP +D LA  G+ L+NYY   +C+PSRS  +TGK+
Sbjct: 47  PHIVFILADDQGYRDIGYHG-SEIRTPTLDKLASEGVRLENYYVQPICSPSRSQFITGKY 105

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QH+++   +   LPL    LPQ LK+ GY+T +VGKWHLGFYKKE  PT RGF+
Sbjct: 106 QIHTGLQHSIIRPSQPNCLPLDNMTLPQKLKKAGYQTHMVGKWHLGFYKKECMPTQRGFD 165

Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHS 237
           S  G   G  DY++H   +   + G D+  +   AWD  +G YST+++T   + I+ +H+
Sbjct: 166 SFFGSLLGSGDYYNHYKCDSPGICGYDLYENNNAAWDQDNGIYSTEMYTQRVLQILSSHN 225

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
            ++PLFLY+A+ A HS     PLQAP +YL+ +  I +  R ++AA+L  LD++V  V  
Sbjct: 226 PNKPLFLYIAYQAVHS-----PLQAPGNYLDNYNSILNENRRRYAAMLSCLDDAVNNVTT 280

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL++     NSII++ SDNGG      +   +NWPLRG K T WEGG+R  G + SP+L+
Sbjct: 281 ALKKYGFYDNSIIIYSSDNGGQP----MAGGNNWPLRGSKGTYWEGGIRAVGFVHSPILK 336

Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
            +G + ++ VH++DW PTL++ A
Sbjct: 337 MKGYICKELVHITDWFPTLVTLA 359



 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 42/166 (25%), Positives = 70/166 (42%), Gaps = 28/166 (16%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-----ALTKGKW--KLVKVVKVMRYQV 626
           ++DG D+W  +S  + S R  ILHNID  +  +     A   G W   +   ++V  +++
Sbjct: 368 QLDGYDIWETISEGKRSPRQEILHNIDPMYTKARNGSWAAGYGIWNTAIQSAIRVNHWKL 427

Query: 627 DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVK----EVPCEPQIAPCLFDIKN 682
            LTG P      G  D  W+          +    GP +     V      +  LF+I  
Sbjct: 428 -LTGNP------GYGD--WIP-------PQSFSNVGPNRWHNERVTLASGKSVWLFNITA 471

Query: 683 DPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
           DP E+ +L+DR  D  +      + +FN+ A P +    + +   K
Sbjct: 472 DPYERIDLSDRYPD-IVKQLLRRLAQFNKTAVPVRYPPRDPRSNPK 516


>gi|26330047|dbj|BAC28762.1| unnamed protein product [Mus musculus]
          Length = 614

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 152/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
            A +  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 68  TAGTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 126

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+K+  P
Sbjct: 127 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMP 186

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   + G D+  +   AWD  +G YST ++T     
Sbjct: 187 TKRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 246

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+  H   +PLFLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 247 ILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 301

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL++    +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 302 IHNVTLALKRYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 357

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 358 HSPLLKNKGTVCKELVHITDWYPTLISLA 386


>gi|126331176|ref|XP_001365999.1| PREDICTED: arylsulfatase J [Monodelphis domestica]
          Length = 607

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 150/323 (46%), Positives = 208/323 (64%), Gaps = 12/323 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+FILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS  +TGK+
Sbjct: 84  PHIVFILADDQGFRDVGYHG-SEIKTPTLDKLAAQGVKLENYYVQPICTPSRSQFITGKY 142

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  PT RGF+
Sbjct: 143 QIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFD 202

Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHS 237
           +  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T +   I+ +H 
Sbjct: 203 TFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHD 262

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
             +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++  V  
Sbjct: 263 PRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAINNVTL 317

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL++     NSII++ SDNGG      +   SNWPLRG K T WEGG+R  G + SPLL+
Sbjct: 318 ALKRYGFYDNSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLK 373

Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
           ++G + ++ VH++DW PTL++ A
Sbjct: 374 NKGTICKELVHITDWYPTLITLA 396


>gi|350400025|ref|XP_003485710.1| PREDICTED: arylsulfatase B-like, partial [Bombus impatiens]
          Length = 301

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 142/224 (63%), Positives = 172/224 (76%), Gaps = 5/224 (2%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHII ILADDLGWNDV FHG DQIPTPNIDALAY+GIIL N+Y   LCTPSRSA+MTGK+
Sbjct: 74  PHIIIILADDLGWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKN 133

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIH GMQH+VLY  E  GLPLSEK+LPQYL+E+GY+T  VGKWHLG++KK+YTPT+RGF+
Sbjct: 134 PIHLGMQHSVLYPTEPRGLPLSEKLLPQYLQEIGYKTHAVGKWHLGYFKKQYTPTYRGFD 193

Query: 180 SHLGYWTGHQDYFDHSAEEM-----KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           SH GYW G +DY+ H A+E      +  G DMRR+L  AWD  GKY+TD+FT EAV +I+
Sbjct: 194 SHFGYWNGLEDYYTHIAQEPDSQYNEYKGFDMRRNLTVAWDTAGKYATDLFTNEAVRLIN 253

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
            H T+ P+FLYLAH A H  N  + L+APD  +    +I D +R
Sbjct: 254 EHDTERPMFLYLAHLAVHKGNENQLLRAPDEEIAKFSYILDPER 297



 Score = 52.8 bits (125), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 23/32 (71%), Positives = 28/32 (87%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
           MQH+VLY  E  GLPLSEK+LPQYL+E+GY+T
Sbjct: 139 MQHSVLYPTEPRGLPLSEKLLPQYLQEIGYKT 170


>gi|27734088|ref|NP_775627.1| arylsulfatase J precursor [Mus musculus]
 gi|77416378|sp|Q8BM89.1|ARSJ_MOUSE RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
 gi|26329953|dbj|BAC28715.1| unnamed protein product [Mus musculus]
 gi|81158042|tpe|CAI84995.1| TPA: arylsulfatase J [Mus musculus]
 gi|109734872|gb|AAI17814.1| Arylsulfatase J [Mus musculus]
          Length = 598

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 152/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
            A +  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 68  TAGTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 126

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+K+  P
Sbjct: 127 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMP 186

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   + G D+  +   AWD  +G YST ++T     
Sbjct: 187 TKRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 246

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+  H   +PLFLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 247 ILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 301

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL++    +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 302 IHNVTLALKRYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 357

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 358 HSPLLKNKGTVCKELVHITDWYPTLISLA 386


>gi|260788446|ref|XP_002589261.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
 gi|229274436|gb|EEN45272.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
          Length = 470

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 153/324 (47%), Positives = 207/324 (63%), Gaps = 12/324 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+IIFILADD GW+D+G+H    I TPN+D LA  G+ L+NYY   +C+PSR  +MTG++
Sbjct: 1   PNIIFILADDYGWDDIGYHN-HFIHTPNLDRLASEGVKLENYYVQPVCSPSREQLMTGRY 59

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IH G+QH V+      GLPL E  LPQ LK+ GYRT +VGKWHLGF KKEYTP +RGF+
Sbjct: 60  QIHYGLQHGVIRNDRPHGLPLDEVTLPQRLKDNGYRTYMVGKWHLGFCKKEYTPLYRGFD 119

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
              G+ TG +DY+ H   +  + GLD+R   EP  D +G YST +F  +A D+I  H  +
Sbjct: 120 KFYGFLTGSEDYWTHRRYK-GVRGLDLRDQDEPVLDENGTYSTHLFARKATDMILKHDQN 178

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           +P+FLYL   A H      PLQ P+ YL  + HI       +AA +  +DE+VG V  AL
Sbjct: 179 QPMFLYLPFQAVHG-----PLQVPEKYLQEYMHINFTVDRIYAAQVTAMDEAVGNVTNAL 233

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           +   +  N++++F +DNG        +  SNWPLRG KNTLWEGGVRG G + S LL+ +
Sbjct: 234 KNSGLWENTVLIFSTDNGARR-----HTGSNWPLRGWKNTLWEGGVRGVGFVNSNLLKRK 288

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
           G  ++  +H+SDW PTLL  ++ S
Sbjct: 289 GKTSDALIHISDWFPTLLRISDGS 312



 Score = 40.8 bits (94), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 23/107 (21%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEW-------------------QISALTRGKWKL 550
           +DG DVW  +S+ + S R  +LHNID  +                     SAL  G WKL
Sbjct: 319 LDGFDVWDAISKGKTSPRKELLHNIDPFYDDQTGSYSPLSYDNIFNTSSYSALRSGDWKL 378

Query: 551 VKENSINGNGTSENRSNDNSYQNEIDGID--VWSVLSRNEPSKRNTI 595
           +      G G +  R  ++ ++   D  +  +W    R +P +RN +
Sbjct: 379 L--TGYQGAGWNRRRPYNHEFEESPDPPNKRLWLFNVREDPGERNDL 423



 Score = 40.8 bits (94), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 34/141 (24%), Positives = 58/141 (41%), Gaps = 12/141 (8%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQ 634
           +DG DVW  +S+ + S R  +LHNID  +     +        +     Y    +G  D 
Sbjct: 319 LDGFDVWDAISKGKTSPRKELLHNIDPFYDDQTGSYSPLSYDNIFNTSSYSALRSG--DW 376

Query: 635 VYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRS 694
             L+G     W     R+  +         +E P  P     LF+++ DP E+N+L+   
Sbjct: 377 KLLTGYQGAGW---NRRRPYNHE------FEESPDPPNKRLWLFNVREDPGERNDLSKMH 427

Query: 695 EDQRINHYTTEVGRFNQIAYP 715
            D  +     ++  +N+ A P
Sbjct: 428 PDI-VQDLLEKLAVYNKTAVP 447


>gi|351698063|gb|EHB00982.1| Arylsulfatase J [Heterocephalus glaber]
          Length = 593

 Score =  300 bits (767), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 159/374 (42%), Positives = 222/374 (59%), Gaps = 32/374 (8%)

Query: 29  GYRTRIMAFAVLPLAFTLSMVFVDLVASS--------------------GPPHIIFILAD 68
           G+  R++A A L   + LS++    ++                        PH+IFILAD
Sbjct: 19  GWPGRVVAMAALAGFWVLSLIAYGYLSFGQRLEKEEGSWRALASERLEPSQPHLIFILAD 78

Query: 69  DLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHN 128
           D G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS  +TGK+ IHTG+QH+
Sbjct: 79  DQGFRDVGYHG-SEIRTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHS 137

Query: 129 VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH 188
           ++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  PT RGF++  G   G 
Sbjct: 138 IIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGS 197

Query: 189 QDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHSTDEPLFLYL 246
            DY+ H   +   M G D+  +   AWD  +G YST ++T     I+ +H+  +P+FLY+
Sbjct: 198 GDYYTHYKCDSPGMCGYDLYENDNAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFLYI 257

Query: 247 AHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLS 306
           A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++  V  AL+     +
Sbjct: 258 AYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKAYGFYN 312

Query: 307 NSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
           NSII++ SDNGG          SNWPLRG K T WEGG+R  G + SPLL+++G V ++ 
Sbjct: 313 NSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKEL 368

Query: 367 VHVSDWLPTLLSAA 380
           VH++DW PTL+S A
Sbjct: 369 VHITDWYPTLISLA 382


>gi|156362330|ref|XP_001625732.1| predicted protein [Nematostella vectensis]
 gi|156212578|gb|EDO33632.1| predicted protein [Nematostella vectensis]
          Length = 491

 Score =  300 bits (767), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 147/344 (42%), Positives = 222/344 (64%), Gaps = 16/344 (4%)

Query: 42  LAFTLSMVFVDLVA--SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           L+F    + +++V   SS  PH++F+LADDLGW+DVGFHG  +I TPNID LA +G+IL 
Sbjct: 5   LSFHCFFLCLNVVVLQSSAKPHLLFVLADDLGWSDVGFHG-SKIQTPNIDRLAANGVILD 63

Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
           NYY   +CTP+R+++MTGK+PIHTG+QH +++     GLPL+  +LPQ L++ GY T ++
Sbjct: 64  NYYVQPVCTPTRASLMTGKYPIHTGLQHGIIHNGRPYGLPLNLTLLPQKLRKAGYSTHML 123

Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
           GKWHLGFY  E TPT+RGF++  G+++G ++++ H  +      LD+R + E   D +G 
Sbjct: 124 GKWHLGFYNWESTPTYRGFDTFYGFYSGAENHYTHVQDHY----LDLRDNEEIVRDQNGT 179

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
           YS  +FT  A  I+  H    PLF+Y+A    HS     P+QAP  Y++ +  I+D  R 
Sbjct: 180 YSAHLFTKRAEQIVRAHDPSTPLFMYMAFQNVHS-----PVQAPKEYIDRYSFIKDPLRR 234

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
            +AA++  +D+++G +  A ++  +  N+I++F +DNGG       N   ++PLRG K+T
Sbjct: 235 TYAAMVTIMDDALGNLTRAFDKAGLWENTILIFSTDNGGVPK----NGGYDYPLRGRKDT 290

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           LWEGGVRG   +    LE  G+  +  +HV+DW PTL+S A  S
Sbjct: 291 LWEGGVRGVAFVHGVALEQSGVKCKALMHVTDWYPTLVSLAGGS 334


>gi|26343103|dbj|BAC35208.1| unnamed protein product [Mus musculus]
          Length = 555

 Score =  299 bits (766), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 152/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
            A +  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 68  TAGTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 126

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+K+  P
Sbjct: 127 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMP 186

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   + G D+  +   AWD  +G YST ++T     
Sbjct: 187 TKRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 246

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+  H   +PLFLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 247 ILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 301

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL++    +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 302 IHNVTLALKRYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 357

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 358 HSPLLKNKGTVCKELVHITDWYPTLISLA 386


>gi|26338057|dbj|BAC32714.1| unnamed protein product [Mus musculus]
          Length = 570

 Score =  299 bits (765), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 152/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
            A +  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 68  TAGTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 126

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+K+  P
Sbjct: 127 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMP 186

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   + G D+  +   AWD  +G YST ++T     
Sbjct: 187 TKRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 246

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+  H   +PLFLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 247 ILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 301

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL++    +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 302 IHNVTLALKRYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 357

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 358 HSPLLKNKGTVCKELVHITDWYPTLISLA 386


>gi|426231093|ref|XP_004009578.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Ovis aries]
          Length = 597

 Score =  298 bits (764), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 150/326 (46%), Positives = 212/326 (65%), Gaps = 11/326 (3%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG+
Sbjct: 77  PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGR 135

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH+++   +   LPL +  LPQ L+ELGY T +VGKWHLGFY+KE  PT RGF
Sbjct: 136 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGF 195

Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           ++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++      I+ +HS
Sbjct: 196 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHS 255

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
             +PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  
Sbjct: 256 PRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 310

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 311 ALKRHGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 366

Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKS 383
            +   +   VH++DW PTL++ A  +
Sbjct: 367 RKRRTSRALVHITDWYPTLVALAGGT 392


>gi|363733898|ref|XP_420639.3| PREDICTED: arylsulfatase J [Gallus gallus]
          Length = 573

 Score =  298 bits (763), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 151/328 (46%), Positives = 211/328 (64%), Gaps = 12/328 (3%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           A S  PHI+FILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS  
Sbjct: 45  APSSQPHIVFILADDQGFRDVGYHG-SEIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQF 103

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY++E  PT
Sbjct: 104 ITGKYQIHTGLQHSIIRPTQPNCLPLDNITLPQKLKEVGYSTHMVGKWHLGFYRRECMPT 163

Query: 175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDI 232
            RGF++  G   G  DY+ H   +   + G D+  +   AWD  +G YST ++T +   I
Sbjct: 164 QRGFDTFFGSLLGSGDYYTHFKCDSPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQI 223

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           + +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++
Sbjct: 224 LASHNPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEAI 278

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
             V  AL++     NSII++ SDNGG      +   SNWPLRG K T WEGG+R  G + 
Sbjct: 279 NNVTLALKKYGYYDNSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVH 334

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           SPLL+++G V ++ VH++DW PTL++ A
Sbjct: 335 SPLLKNKGSVCKELVHITDWFPTLITLA 362


>gi|260813923|ref|XP_002601665.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
 gi|229286967|gb|EEN57677.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
          Length = 478

 Score =  298 bits (762), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 148/324 (45%), Positives = 208/324 (64%), Gaps = 17/324 (5%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+FILADD GW+D+G+HG   I TP +D LA  G+ L+NYY   +C+PSR  +MTG++
Sbjct: 1   PNIVFILADDYGWDDIGYHG-SFIQTPKLDRLAKEGVKLENYYVQPICSPSRCQLMTGRY 59

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            I  G+QH+V+      GLPL E  LPQ LKE GY T +VGKWHLGF++KE+ P  RGF+
Sbjct: 60  QIRYGLQHSVITSDRPHGLPLDEVTLPQKLKENGYSTYVVGKWHLGFFRKEHMPLQRGFD 119

Query: 180 SHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
              GY TG +DY+ H      + + +   GLD+R   +P  D +G YST +F  +A++II
Sbjct: 120 KFYGYLTGGEDYWTHRRPNLYAKDPLAFHGLDLRDQDKPVLDQNGTYSTHLFAKKAIEII 179

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
            NH   +P+FLYL   A HS     PL+AP  Y+  ++ +++     +AA++  +DE+VG
Sbjct: 180 QNHERSKPMFLYLPFQAVHS-----PLEAPTEYVEKYKDVKNPLVKTYAAMVSAMDEAVG 234

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
            V +AL++ R+  N++++F +DNG        NA SNWPLRG KNTLWEGGVRG G + S
Sbjct: 235 NVTDALKRTRLWDNTVLIFSTDNGARR-----NAGSNWPLRGWKNTLWEGGVRGVGFVTS 289

Query: 354 PLLESRGIVAEQYVHVSDWLPTLL 377
            LL       +  +H+SDW PTLL
Sbjct: 290 KLLRKGKRKCDALIHISDWFPTLL 313



 Score = 41.6 bits (96), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 62/160 (38%), Gaps = 49/160 (30%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID-------------------DEWQISALTKGKWKL 615
           +DG D+W  +S   PS R  ILHNID                   +    +A+  G WKL
Sbjct: 326 LDGFDIWDTISSGTPSPRKEILHNIDPLHPKTDLSSLDSLHDHIFNPSVYAAIRSGDWKL 385

Query: 616 VKVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAP 675
                       LTG  DQ    GL   ++   +  K  DA              P    
Sbjct: 386 ------------LTG--DQ----GLGGWKYSGTSGNKYEDAVE-----------PPNKHL 416

Query: 676 CLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
            LF+I++DP E+N+L+ +  D  +      +  +N+ A P
Sbjct: 417 WLFNIRSDPQERNDLSVQYPDV-VQELLERLAAYNKTALP 455



 Score = 40.8 bits (94), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 49/107 (45%), Gaps = 22/107 (20%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID-------------------DEWQISALTRGKWKL 550
           +DG D+W  +S   PS R  ILHNID                   +    +A+  G WKL
Sbjct: 326 LDGFDIWDTISSGTPSPRKEILHNIDPLHPKTDLSSLDSLHDHIFNPSVYAAIRSGDWKL 385

Query: 551 VKENSINGNGTSENRSNDNSYQNEIDGID--VWSVLSRNEPSKRNTI 595
           +  +   G G   + ++ N Y++ ++  +  +W    R++P +RN +
Sbjct: 386 LTGDQGLG-GWKYSGTSGNKYEDAVEPPNKHLWLFNIRSDPQERNDL 431


>gi|432879612|ref|XP_004073512.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
          Length = 673

 Score =  298 bits (762), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 148/333 (44%), Positives = 211/333 (63%), Gaps = 12/333 (3%)

Query: 50  FVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTP 109
           F D  +   PPHIIFI+ DD G+ND+G+H  D I TP +D LA  G+ L+NYY   +CTP
Sbjct: 38  FNDRASGVQPPHIIFIMTDDQGFNDIGYHSSD-IKTPTLDKLAAKGVKLENYYIQPICTP 96

Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
           SRS  +TG++ IHTG+QH+++   +   LP  +  LPQ L++LGY T +VGKWHLGFYKK
Sbjct: 97  SRSQFITGRYQIHTGLQHSIIRPRQPNCLPFDQVTLPQRLQQLGYSTHMVGKWHLGFYKK 156

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSA-EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
           E  PT RGF+++ G  TG  +Y+ +S+ +  ++ G D+      AWD  GKYST +FT  
Sbjct: 157 ECLPTRRGFDTYFGSLTGSVNYYTYSSCDGPELCGFDLHEGESVAWDQGGKYSTHLFTQR 216

Query: 229 AVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
              I+  H    +PLF++L+  A HS     PLQ+P  Y+  +  +E+  R K+AA++  
Sbjct: 217 VRKILARHDPQSQPLFIFLSFQAVHS-----PLQSPKEYICPYSGLENVARRKYAAMVST 271

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+V  +  AL +     NS+++F +DNGG      L+  SNWPLRG K T WEGG+RG
Sbjct: 272 VDEAVRNITYALRKYGYYQNSVLIFSTDNGGQP----LSGGSNWPLRGRKGTYWEGGIRG 327

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ R  V++  VH++DW PTL+  A
Sbjct: 328 LGFVHSPLLKKRKRVSKALVHITDWYPTLVGLA 360


>gi|326919013|ref|XP_003205778.1| PREDICTED: arylsulfatase J-like [Meleagris gallopavo]
          Length = 573

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 150/326 (46%), Positives = 210/326 (64%), Gaps = 12/326 (3%)

Query: 57  SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
           S  PHI+FILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS  +T
Sbjct: 47  SSQPHIVFILADDQGFRDVGYHG-SEIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQFIT 105

Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           GK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY++E  PT R
Sbjct: 106 GKYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQR 165

Query: 177 GFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIH 234
           GF++  G   G  DY+ H   +   + G D+  +   AWD  +G YST ++T +   I+ 
Sbjct: 166 GFDTFFGSLLGSGDYYTHFKCDSPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILA 225

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
           +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++  
Sbjct: 226 SHNPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEAINN 280

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V  AL++     NSII++ SDNGG      +   SNWPLRG K T WEGG+R  G + SP
Sbjct: 281 VTLALKKYGYYDNSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVHSP 336

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
           LL+++G V ++ VH++DW PTL++ A
Sbjct: 337 LLKNKGSVCKELVHITDWFPTLITLA 362


>gi|440901665|gb|ELR52564.1| Arylsulfatase I, partial [Bos grunniens mutus]
          Length = 565

 Score =  296 bits (759), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 150/326 (46%), Positives = 212/326 (65%), Gaps = 11/326 (3%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG+
Sbjct: 40  PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGR 98

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH+++   +   LPL +  LPQ L+ELGY T +VGKWHLGFY+KE  PT RGF
Sbjct: 99  YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGF 158

Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           ++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++      I+ +HS
Sbjct: 159 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHS 218

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
             +PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  
Sbjct: 219 PRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 273

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 274 ALKRHGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 329

Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKS 383
            +   +   VH++DW PTL++ A  +
Sbjct: 330 RKRRTSRALVHITDWYPTLVALAGGT 355


>gi|449267146|gb|EMC78112.1| Arylsulfatase I [Columba livia]
          Length = 573

 Score =  296 bits (758), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 153/336 (45%), Positives = 217/336 (64%), Gaps = 13/336 (3%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           A + PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS +
Sbjct: 45  AFARPPHIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQL 103

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TG++ IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFYKKE  PT
Sbjct: 104 ITGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPT 163

Query: 175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
            RGF++ LG  TG+ DY+ + + +   + G D+    + AWD  GKYST ++      I+
Sbjct: 164 RRGFDTFLGSLTGNVDYYTYDNCDGPGVCGYDLHEGEDVAWDQSGKYSTFLYAQRVSKIL 223

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
            +HS  EP+F+Y+A  A H+     PLQ+P  Y+  +R + +  R K+AA++  +DE+V 
Sbjct: 224 ASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVK 278

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
            +  AL++     NS+IVF +DNGG       +  SNWPLRG K T WEGGVRG G + S
Sbjct: 279 NITWALKKYGYYDNSVIVFSTDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGIGFVHS 334

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDIPN 387
           PL++ +   +   VH++DW PTL+S A  N S++P 
Sbjct: 335 PLIKRKRRTSWALVHITDWYPTLVSLARGNLSNVPG 370


>gi|61876881|ref|XP_593725.1| PREDICTED: arylsulfatase I [Bos taurus]
 gi|297477411|ref|XP_002689338.1| PREDICTED: arylsulfatase I [Bos taurus]
 gi|296485188|tpg|DAA27303.1| TPA: arylsulfatase I-like [Bos taurus]
          Length = 574

 Score =  296 bits (758), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 150/326 (46%), Positives = 212/326 (65%), Gaps = 11/326 (3%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG+
Sbjct: 49  PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGR 107

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH+++   +   LPL +  LPQ L+ELGY T +VGKWHLGFY+KE  PT RGF
Sbjct: 108 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGF 167

Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           ++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++      I+ +HS
Sbjct: 168 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHS 227

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
             +PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  
Sbjct: 228 PRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 282

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 283 ALKRHGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 338

Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKS 383
            +   +   VH++DW PTL++ A  +
Sbjct: 339 RKRRTSRALVHITDWYPTLVALAGGT 364


>gi|296193239|ref|XP_002744413.1| PREDICTED: arylsulfatase I [Callithrix jacchus]
          Length = 569

 Score =  296 bits (758), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 150/326 (46%), Positives = 210/326 (64%), Gaps = 11/326 (3%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG+
Sbjct: 46  PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGR 104

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF
Sbjct: 105 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGF 164

Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           ++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ NHS
Sbjct: 165 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILANHS 224

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
              PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335

Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKS 383
            +   +   +H++DW PTL+  A  +
Sbjct: 336 RKRRTSRALMHITDWYPTLVGLAGGT 361


>gi|326928585|ref|XP_003210457.1| PREDICTED: arylsulfatase I-like [Meleagris gallopavo]
          Length = 574

 Score =  296 bits (757), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 153/336 (45%), Positives = 216/336 (64%), Gaps = 13/336 (3%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           A + PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS +
Sbjct: 45  AFTRPPHIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQL 103

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TG++ IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFYKKE  PT
Sbjct: 104 ITGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPT 163

Query: 175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
            RGF++ LG  TG+ DY+ + + +   + G D+      AWD  GKYST ++      I+
Sbjct: 164 RRGFDTFLGSLTGNVDYYTYDNCDGPGVCGYDLHEGENVAWDQSGKYSTFLYAQRVSKIL 223

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
            +HS  EP+F+Y+A  A H+     PLQ+P  Y+  +R + +  R K+AA++  +DE+V 
Sbjct: 224 ASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVK 278

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
            +  AL++     NS+IVF +DNGG       +  SNWPLRG K T WEGGVRG G + S
Sbjct: 279 NITWALKKYGYYDNSVIVFSTDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGIGFVHS 334

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDIPN 387
           PL++ +   +   VH++DW PTL+S A  N S++P 
Sbjct: 335 PLIKRKRRTSWALVHITDWYPTLVSLARGNLSNVPG 370


>gi|50755099|ref|XP_425212.1| PREDICTED: arylsulfatase I [Gallus gallus]
          Length = 574

 Score =  295 bits (756), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 153/336 (45%), Positives = 216/336 (64%), Gaps = 13/336 (3%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           A + PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS +
Sbjct: 45  AFARPPHIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQL 103

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TG++ IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFYKKE  PT
Sbjct: 104 ITGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPT 163

Query: 175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
            RGF++ LG  TG+ DY+ + + +   + G D+      AWD  GKYST ++      I+
Sbjct: 164 RRGFDTFLGSLTGNVDYYTYDNCDGPGVCGYDLHEGENVAWDQSGKYSTFLYAQRVSKIL 223

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
            +HS  EP+F+Y+A  A H+     PLQ+P  Y+  +R + +  R K+AA++  +DE+V 
Sbjct: 224 ASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVK 278

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
            +  AL++     NS+IVF +DNGG       +  SNWPLRG K T WEGGVRG G + S
Sbjct: 279 NITWALKKYGYYDNSVIVFSTDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGIGFVHS 334

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDIPN 387
           PL++ +   +   VH++DW PTL+S A  N S++P 
Sbjct: 335 PLIKRKRRTSWALVHITDWYPTLVSLARGNLSNVPG 370


>gi|241676246|ref|XP_002411524.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215504222|gb|EEC13716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 490

 Score =  295 bits (756), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 154/350 (44%), Positives = 206/350 (58%), Gaps = 13/350 (3%)

Query: 44  FTLSMVFVDLVASSGP--------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
           F L  + +D V   G         P+I+FIL DDLGW DV FHG  QIPTPNID LA  G
Sbjct: 2   FVLCRILLDAVFLLGTILQCHAKRPNIVFILGDDLGWGDVSFHGSTQIPTPNIDVLAGDG 61

Query: 96  IILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYR 155
           +IL NYY + LCTPSR+A+MTG +PI  GMQ   +      GLPL  KILPQ+ K+LGY 
Sbjct: 62  VILNNYYVLPLCTPSRAALMTGLYPIRNGMQLTSIQAAGPWGLPLENKILPQHFKDLGYD 121

Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
             ++GKWHLGF+K  Y P  RGF++  G++TG  DY++H++       L +   ++    
Sbjct: 122 VNMIGKWHLGFFKTPYVPIKRGFDTFFGFYTGSNDYYNHTSGSSHRKILAVTSSVQVNTL 181

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTD-----EPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
             G+ S       +V  +H   +      +P F Y++H A H A   E  QAP   +   
Sbjct: 182 EKGRLSLWGPRELSVCFLHQIYSPLNFYLQPFFCYISHQAVHHALNAEMFQAPARNVLKF 241

Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
            +I +  R+ +A +L  LDES+G+VVEAL    +L ++IIVF SDNGG+  G   N   N
Sbjct: 242 SYIGEPNRTIYAGMLDALDESMGRVVEALSSAGILEDTIIVFSSDNGGSPYGLESNRGYN 301

Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           WPLRG K TLWEG VR    +W+P L  +  V+ Q +H+SDWLPTL +AA
Sbjct: 302 WPLRGAKFTLWEGSVRVPAFVWNPKLLKKSRVSNQLMHISDWLPTLYTAA 351



 Score = 46.2 bits (108), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 58/136 (42%), Gaps = 30/136 (22%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV------------------ 616
           IDG+D+W  L+    SKR  IL NID  + IS L   ++KLV                  
Sbjct: 361 IDGVDMWHSLTHGSASKRKEILLNIDSLYNISGLRYKQYKLVVGGGFDGELDDHYYFPGG 420

Query: 617 ----KVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKE--VPCE 670
                 +  +R Q  +     ++Y     DR W  LA  K   A   +   + E  VP +
Sbjct: 421 TRPTNNIDQLRNQSTMARVLKELYEK--QDRSWSPLAWSK-NVAVDCRRNRLTENFVPMQ 477

Query: 671 PQIAPCLFDIKNDPCE 686
           P   P LF+I+ DPCE
Sbjct: 478 P---PYLFNIEEDPCE 490



 Score = 43.9 bits (102), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 36/75 (48%), Gaps = 9/75 (12%)

Query: 484 IPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL 543
           +P L  +  GN ++            IDG+D+W  L+    SKR  IL NID  + IS L
Sbjct: 344 LPTLYTAAGGNASA---------LGGIDGVDMWHSLTHGSASKRKEILLNIDSLYNISGL 394

Query: 544 TRGKWKLVKENSING 558
              ++KLV     +G
Sbjct: 395 RYKQYKLVVGGGFDG 409


>gi|390369306|ref|XP_003731620.1| PREDICTED: uncharacterized protein LOC763377 [Strongylocentrotus
           purpuratus]
          Length = 784

 Score =  295 bits (756), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 153/355 (43%), Positives = 218/355 (61%), Gaps = 27/355 (7%)

Query: 46  LSMVFVDLVASSG----------PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
           L++ F++L+  S            P+I+FIL DDLG+NDVG+HG  +I TPNID LA  G
Sbjct: 3   LALFFINLLIISNVCSKKFSQLTKPNIVFILIDDLGYNDVGYHG-SEIYTPNIDKLAREG 61

Query: 96  IILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYR 155
           + L+NYY   +CTP+RS +++G++ IHTG+QH+ +   +   LP +       L+E GY 
Sbjct: 62  VRLENYYVQPICTPTRSQLLSGRYQIHTGLQHSYIRPAQPLCLPTNLPTFADKLREAGYA 121

Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           T  VGKWHLGFYKKE  PT RGF+S+ GY TG +DY+ H     K   L +R   + AW+
Sbjct: 122 THAVGKWHLGFYKKECLPTQRGFDSYFGYLTGGEDYWTH---HRKRPXLALRHVDKVAWE 178

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
             G YS  VFT +   I+  H  ++P  LYL   + HS     PLQ P  Y   +++I++
Sbjct: 179 YGGYYSAFVFTEKIQQIVAQHPVEQPFLLYLPFQSVHS-----PLQVPSSYEERYKNIKN 233

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGFNLNAASNWPLR 334
             R  +A ++  LDE+VGK+V +L+Q  +  N++++F +DNGG  AAG N     NWPLR
Sbjct: 234 TNRRIYAGMMTCLDEAVGKIVHSLQQAGLWDNTVLIFSTDNGGEVAAGGN-----NWPLR 288

Query: 335 GVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           G K ++WEGG+RG G + SPLL +  +G V +Q +HVSDW PTL+     S + N
Sbjct: 289 GWKRSIWEGGMRGVGFVNSPLLPASVQGTVNKQLIHVSDWFPTLVQGVAGSSLDN 343


>gi|291227280|ref|XP_002733615.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 499

 Score =  295 bits (755), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 157/332 (47%), Positives = 211/332 (63%), Gaps = 16/332 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           + SS  PHIIFILADDLGW+DVG+HG   I TP+IDALA  G+ L NYYT  LCTPSRS 
Sbjct: 16  LVSSSRPHIIFILADDLGWSDVGYHG-SVIKTPHIDALASEGVKLDNYYTSLLCTPSRSQ 74

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           +MTG++ IHTG+QH  +   +   LP+ E ILPQ LK+ GY T +VGKWHLGFY++E  P
Sbjct: 75  LMTGRYEIHTGLQHRTIDMMQPLCLPIDETILPQKLKDRGYATHMVGKWHLGFYRQECLP 134

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH-GKYSTDVFTAEAVDI 232
             RGF++ +G++    DY+ H+    K  G D RRD +   + + G+YST VF  EA DI
Sbjct: 135 NNRGFDTFMGFYQAMGDYYYHNVSTGKFNGWDFRRDNDVIAERYAGQYSTHVFADEARDI 194

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI--EDFKRSKFAAILHKLDE 290
           I  H+ D PLFL+L+  A H      PLQ P  Y +I+  +      R  +A ++  +DE
Sbjct: 195 ISKHNPDVPLFLFLSFQAIHF-----PLQVPSRYADIYNTLIPNSADRRTYAGMVTCMDE 249

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
           +VG VV  L+   +  NS++VF +DNGG  +        NWPLRG K +L+EGGVRG   
Sbjct: 250 AVGGVVATLKHSGLWENSVLVFSTDNGGIHS-----LGGNWPLRGGKASLYEGGVRGVAF 304

Query: 351 IWSPLLES--RGIVAEQYVHVSDWLPTLLSAA 380
           + SPLL S  RG + ++ +H++DW PTL+  A
Sbjct: 305 VTSPLLPSRVRGTINKELLHMTDWFPTLVRLA 336


>gi|156364432|ref|XP_001626352.1| predicted protein [Nematostella vectensis]
 gi|156213225|gb|EDO34252.1| predicted protein [Nematostella vectensis]
          Length = 270

 Score =  295 bits (755), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 138/268 (51%), Positives = 186/268 (69%), Gaps = 4/268 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+ I+ADDLGW+DV FHG  QIPTP ID LA  G+IL +YY   +CTP+R+++MTGKH
Sbjct: 3   PNIVMIVADDLGWDDVSFHGSSQIPTPTIDKLASEGVILNSYYVSPICTPTRASLMTGKH 62

Query: 120 PIHTGM---QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           P++ GM    H  ++G +  GLPL E   PQY+K LGY T  +GKWHLGF++KEYTPT+R
Sbjct: 63  PMNLGMLIHTHATVFGTQPYGLPLGETTTPQYMKSLGYVTHGIGKWHLGFFEKEYTPTYR 122

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           GF+S  G+W G +DY+DHS++E  +WG D+R + +P  +  G Y T++F   A  IIH H
Sbjct: 123 GFDSFYGFWNGKEDYWDHSSQE-DVWGTDLRDNEKPVRNESGHYGTELFAERAAQIIHLH 181

Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
           +  +PL+LYLA    HSAN  EPLQAP   +    HI   KR  +AA++  LDESV  V 
Sbjct: 182 NQTKPLYLYLAQQGVHSANGNEPLQAPKRLIKKFSHISSPKRRIYAAMVSSLDESVETVH 241

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFN 324
           +AL +  ML+N+++VF +DNGGA  GFN
Sbjct: 242 KALSETGMLNNTVLVFTTDNGGAPRGFN 269


>gi|84370328|ref|NP_001033588.1| arylsulfatase I precursor [Mus musculus]
 gi|123779975|sp|Q32KI9.1|ARSI_MOUSE RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
 gi|81158040|tpe|CAI84994.1| TPA: arylsulfatase I [Mus musculus]
 gi|148677850|gb|EDL09797.1| mCG6034 [Mus musculus]
 gi|187954139|gb|AAI38971.1| Arylsulfatase i [Mus musculus]
 gi|187954429|gb|AAI41170.1| Arylsulfatase i [Mus musculus]
          Length = 573

 Score =  295 bits (754), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 151/331 (45%), Positives = 212/331 (64%), Gaps = 11/331 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           VA   PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 41  VAPPQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQ 99

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  P
Sbjct: 100 LLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLP 159

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           T RGF++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I
Sbjct: 160 TRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHI 219

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           + +H+   PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V
Sbjct: 220 LASHNPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAV 274

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
             +  AL++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + 
Sbjct: 275 RNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVH 330

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           SPLL+ +   +   VH++DW PTL+  A  +
Sbjct: 331 SPLLKKKRRTSRALVHITDWYPTLVGLAGGT 361


>gi|432098813|gb|ELK28308.1| Arylsulfatase I [Myotis davidii]
          Length = 571

 Score =  295 bits (754), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 151/331 (45%), Positives = 212/331 (64%), Gaps = 11/331 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           VA   PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 41  VAPPQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQ 99

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  P
Sbjct: 100 LLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLP 159

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           T RGF++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++      I
Sbjct: 160 TRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHI 219

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           + +HS  +PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V
Sbjct: 220 LASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRSMGNVARRKYAAMVTCMDEAV 274

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
             +  AL++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + 
Sbjct: 275 RNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVH 330

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           SPLL+ +   +   VH++DW PTL+  A  +
Sbjct: 331 SPLLKRKRRTSRALVHITDWYPTLVGLAGGT 361


>gi|327274122|ref|XP_003221827.1| PREDICTED: arylsulfatase J-like [Anolis carolinensis]
          Length = 564

 Score =  295 bits (754), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 147/322 (45%), Positives = 208/322 (64%), Gaps = 12/322 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           H+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS  +TG++ 
Sbjct: 44  HLIFILADDQGFRDVGYHG-SEIRTPTLDRLAAEGVKLENYYVQPMCTPSRSQFITGRYQ 102

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+V+   +   LPL    LPQ LKE GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 103 IHTGLQHSVIRPTQPNCLPLDNATLPQKLKEAGYSTHMVGKWHLGFYRKECMPTQRGFDT 162

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHST 238
             G   G  DY+ H   +  +M G D+  +   AWD  +G YST ++T +   I+ +H+ 
Sbjct: 163 FFGSLLGSGDYYTHYKCDSPRMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNP 222

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
            +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++  V  A
Sbjct: 223 RKPIFLYIAYQAVHS-----PLQAPGMYYERYRSINNINRRRYAAMLACLDEAINNVTLA 277

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L++    +NS+I++ SDNGG          +NWPLRG K T WEGG+R  G + SPLL++
Sbjct: 278 LKRYGYYNNSVIIYSSDNGGQPTA----GGNNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 333

Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
           +G V ++ +H++DW PTL++ A
Sbjct: 334 KGCVCKELIHITDWFPTLITLA 355


>gi|301765544|ref|XP_002918191.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Ailuropoda
           melanoleuca]
          Length = 573

 Score =  295 bits (754), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 150/328 (45%), Positives = 210/328 (64%), Gaps = 11/328 (3%)

Query: 57  SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
           S PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++T
Sbjct: 45  SQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 103

Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           G++ IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT R
Sbjct: 104 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRR 163

Query: 177 GFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
           GF++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++      I+ +
Sbjct: 164 GFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILAS 223

Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
           HS   PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +
Sbjct: 224 HSPRRPLFLYVAFQAVHT-----PLQSPREYLYRYRAMGNVARRKYAAMVTCMDEAVRNI 278

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
             AL++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPL
Sbjct: 279 TWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPL 334

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAANKS 383
           L+ +   +   VH++DW PTL+  A  +
Sbjct: 335 LKRKRRTSRALVHITDWYPTLVGLAGGT 362


>gi|126697310|gb|ABO26612.1| arylsulfatase [Haliotis discus discus]
          Length = 481

 Score =  294 bits (753), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 149/334 (44%), Positives = 208/334 (62%), Gaps = 14/334 (4%)

Query: 52  DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
           D+ A+  P HI+FI+ADDLGWND+GFH  D I TPNID LA  G++L ++Y   LC+PSR
Sbjct: 18  DVSAAGRPRHIVFIVADDLGWNDIGFHNPDII-TPNIDKLAREGLLLNHHYVQPLCSPSR 76

Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
           +A M+G +P  TG+QH+V+   +   LPL+  ILPQ LKELGY T IVGKWH GF     
Sbjct: 77  AAFMSGYYPFKTGLQHSVILENQPVCLPLNITILPQKLKELGYATHIVGKWHNGFCSWNC 136

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
           TPT+RGF+S  GY+   +DY+ H         LD R +  P W  +G YST  FT  A D
Sbjct: 137 TPTYRGFDSFFGYYGAMEDYYTHVIRGF----LDYRNNTTPVWTDNGTYSTLRFTDVATD 192

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           II  H+  +PLFLYLA+ A      Y P++ P  Y  ++ +I+   R KF+ ++  LDE+
Sbjct: 193 IIERHNQSQPLFLYLAYQAV-----YGPIEVPAKYEAMYPNIKSENRRKFSGMVSALDEA 247

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           VG V + L QR ++ +++I+F +DNGG        + +N+PLRG K T++EGG R  G +
Sbjct: 248 VGNVTKTLRQRGLMDDTLILFTADNGGGVD----ESGNNYPLRGSKFTVYEGGTRAVGFM 303

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
           +   L+  G V +  +H  DWLPTL +AA  + +
Sbjct: 304 YGSGLQKTGTVFDGMIHAVDWLPTLTAAAGGTPV 337


>gi|355750329|gb|EHH54667.1| hypothetical protein EGM_15550 [Macaca fascicularis]
          Length = 569

 Score =  294 bits (753), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 149/326 (45%), Positives = 210/326 (64%), Gaps = 11/326 (3%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG+
Sbjct: 46  PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGR 104

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF
Sbjct: 105 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGF 164

Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           ++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS
Sbjct: 165 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHS 224

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
              PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335

Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKS 383
            +   +   +H++DW PTL+  A  +
Sbjct: 336 RKQRTSRALMHITDWYPTLVGLAGGT 361


>gi|109079349|ref|XP_001108178.1| PREDICTED: arylsulfatase I-like [Macaca mulatta]
 gi|402873074|ref|XP_003900411.1| PREDICTED: arylsulfatase I [Papio anubis]
 gi|355691752|gb|EHH26937.1| hypothetical protein EGK_17023 [Macaca mulatta]
          Length = 569

 Score =  294 bits (752), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 149/326 (45%), Positives = 210/326 (64%), Gaps = 11/326 (3%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG+
Sbjct: 46  PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGR 104

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF
Sbjct: 105 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGF 164

Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           ++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS
Sbjct: 165 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHS 224

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
              PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335

Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKS 383
            +   +   +H++DW PTL+  A  +
Sbjct: 336 RKQRTSRALMHITDWYPTLVGLAGGT 361


>gi|6863176|gb|AAF30402.1|AF109924_1 sulfatase 1 precursor [Helix pomatia]
          Length = 503

 Score =  294 bits (752), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 142/325 (43%), Positives = 216/325 (66%), Gaps = 12/325 (3%)

Query: 57  SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
           +G P+I+F+LADD G++DVG+HG  +I TP +DAL+ SG+ L+NYY   +CTP+RS +M+
Sbjct: 31  AGQPNIVFVLADDFGFHDVGYHG-SEIHTPTLDALSASGVRLENYYVQPICTPTRSQLMS 89

Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           G++ IHTG+QH ++  C+   LP     L   LKE GY T +VGKWHLGFYK+EY P  R
Sbjct: 90  GRYQIHTGLQHGIINSCQPNALPNDSPTLADKLKESGYATHMVGKWHLGFYKQEYLPWNR 149

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           GF+++ GY    +DYF+H+    ++  LD+R +  P  +  G+YS  +FT +A+D++ +H
Sbjct: 150 GFDTYFGYLNAAEDYFNHNVPWRQVRYLDLRDNNGPVRNETGQYSAHLFTGKAIDVVQSH 209

Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
           +T +PLFLYLA+ + H+     PL+ P+ Y + +R+I D  R  FA ++  LDE V  + 
Sbjct: 210 NTSKPLFLYLAYQSVHA-----PLEVPEKYEHKYRNITDKNRRTFAGMVSALDEGVANLT 264

Query: 297 EALEQRRMLSNSIIVFVSDNGGAA-AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
           +AL+ + + +N++++F +DNGG   AG N     N+PLRG K +LWEGG  G G +    
Sbjct: 265 QALKDKGLWNNTVLIFSTDNGGQIHAGGN-----NYPLRGWKASLWEGGFHGVGFVSGGA 319

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
           L+  G V++  +HVSDW PTL++ A
Sbjct: 320 LKRSGAVSKGLIHVSDWFPTLVTLA 344


>gi|332822312|ref|XP_527073.3| PREDICTED: arylsulfatase I isoform 2 [Pan troglodytes]
          Length = 569

 Score =  293 bits (751), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 149/332 (44%), Positives = 211/332 (63%), Gaps = 11/332 (3%)

Query: 53  LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
            +  S PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS
Sbjct: 40  FIYPSRPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRS 98

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
            ++TG++ IHTG+QH+++   +   LPL +  LP  L+E GY T +VGKWHLGFY+KE  
Sbjct: 99  QLLTGRYQIHTGLQHSIIRPQQPNCLPLDQVTLPHKLQEAGYSTHMVGKWHLGFYRKECL 158

Query: 173 PTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
           PT RGF++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  
Sbjct: 159 PTRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASH 218

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +HS   PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+
Sbjct: 219 ILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEA 273

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           V  +  AL++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G +
Sbjct: 274 VRNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFV 329

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            SPLL+ +   +   +H++DW PTL+  A  +
Sbjct: 330 HSPLLKRKQRTSRALMHITDWYPTLVGLAGGT 361


>gi|189521775|ref|XP_688265.2| PREDICTED: hypothetical protein LOC559800 [Danio rerio]
          Length = 1542

 Score =  293 bits (750), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 144/320 (45%), Positives = 201/320 (62%), Gaps = 12/320 (3%)

Query: 60   PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
            PHIIFI+ DD G+ DVG+HG  +I TP +D LA +G+ L+NYY   LC+PSRS +MTG++
Sbjct: 1027 PHIIFIMVDDQGFRDVGYHG-SEIKTPTLDRLAAAGVKLENYYVQPLCSPSRSQLMTGRY 1085

Query: 120  PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
             IHTG+QH+++   +   LPL    LPQ LK  GY T +VGKWHLGFYK+   PT RGF+
Sbjct: 1086 QIHTGLQHSIIRPTQPNCLPLENITLPQKLKNAGYSTHMVGKWHLGFYKRACMPTQRGFD 1145

Query: 180  SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHS 237
            +  G   G  DY+ H   +   + G D+    E AW+   G YST ++T +AV+I+ +H+
Sbjct: 1146 TFFGSLLGSGDYYSHYKCDSPGLCGYDLHEGEEAAWEQDRGVYSTIMYTQKAVNILASHN 1205

Query: 238  TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
               P+FLYLA  A HS     PLQ P  YL  ++ I +  R K+AA++  LDE+V  +  
Sbjct: 1206 PKRPIFLYLAFQAVHS-----PLQVPARYLERYKSIPNLHRRKYAAMVSCLDEAVRNLTL 1260

Query: 298  ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
            AL+Q     N ++V+ SDNGG      +   SNWPLRG K + WEGG+R  G + SPLL 
Sbjct: 1261 ALKQYGYYDNMVMVYSSDNGGQP----MAGGSNWPLRGSKGSYWEGGIRAVGFVHSPLLV 1316

Query: 358  SRGIVAEQYVHVSDWLPTLL 377
             +G  ++  +H++DW PTL+
Sbjct: 1317 KKGTKSKALIHITDWYPTLV 1336


>gi|348542810|ref|XP_003458877.1| PREDICTED: arylsulfatase J [Oreochromis niloticus]
          Length = 551

 Score =  293 bits (750), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 154/326 (47%), Positives = 204/326 (62%), Gaps = 13/326 (3%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           SS  PHIIFIL DD G+ DVG+HG D I TP +D LA  G+ L+NYY   LC+PSRS +M
Sbjct: 20  SSSQPHIIFILVDDQGFRDVGYHGSD-IKTPTLDRLAAEGVKLENYYVQPLCSPSRSQLM 78

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           TG++ IHTG+QH+V+   +   LPL    LPQ LK  GY T +VGKWHLGFYK+   PT 
Sbjct: 79  TGRYQIHTGLQHSVIRAAQPNCLPLENVTLPQKLKNAGYSTHMVGKWHLGFYKRGCLPTQ 138

Query: 176 RGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDII 233
           RGF++  G   G  DY+ H   +   M G D+    E AW+   G YST++FT +AV I+
Sbjct: 139 RGFDTFFGSLLGSGDYYSHYKCQGPSMCGYDLYEGEEAAWEQDRGLYSTEMFTQKAVSIL 198

Query: 234 HNHST-DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
            NH+   +PLFLYLA+ A HS     PLQ P  YL  ++ I +  R K+AA++  LDE++
Sbjct: 199 ANHNPRKQPLFLYLAYQAVHS-----PLQVPARYLERYKGIPNPYRRKYAAMVSCLDEAI 253

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
             +  AL+      N++IV+ SDNGG      L   SNWPLRG K T WEGG+R  G + 
Sbjct: 254 HNLTLALKHYGYYDNTVIVYSSDNGGQP----LAGGSNWPLRGSKATYWEGGIRTVGFVH 309

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLS 378
           SPLL ++G      +H++DW PTL++
Sbjct: 310 SPLLMNKGTKCRSLIHITDWFPTLVT 335



 Score = 41.2 bits (95), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 42/159 (26%), Positives = 63/159 (39%), Gaps = 41/159 (25%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID--------DEWQI----------SALTKGKWKLV 616
           +DG DVW  +S   PS R+ ILHNID          W+           +AL  G WKL+
Sbjct: 347 LDGYDVWETISEGRPSPRHDILHNIDPIYIKAKNGSWKAGYGLWNTAIQAALRVGHWKLL 406

Query: 617 KVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPC 676
             V      V     P Q +L+     + L   +   R            VP +   +  
Sbjct: 407 TGVPGYSDWV-----PPQTFLN-----QRLTTRLHNER------------VPWDRGKSIW 444

Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
           LF+I  DP E+ +L+ R     +      + ++N+ A P
Sbjct: 445 LFNITADPYERVDLSQRYP-HIVKKMLMRLAQYNKTAVP 482


>gi|224067708|ref|XP_002198824.1| PREDICTED: arylsulfatase I [Taeniopygia guttata]
          Length = 575

 Score =  293 bits (749), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 210/323 (65%), Gaps = 11/323 (3%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHIIFIL DD G++D+G+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG+
Sbjct: 49  PPHIIFILTDDQGYHDIGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGR 107

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFYKKE  PT RGF
Sbjct: 108 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGF 167

Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           ++ LG  TG+ DY+ + + +   + G D+    + AWD  GKYST ++      I+ +HS
Sbjct: 168 DTFLGSLTGNVDYYTYDNCDGPDVCGYDLHEGEDVAWDQSGKYSTFLYAQRVSKILASHS 227

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
             EP+F+Y+A  A H+     PLQ+P  Y+  +R + +  R K+AA++  +DE+V  +  
Sbjct: 228 PKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITW 282

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL++     NS++VF +DNGG       +  SNWPLRG K T WEGGVRG G + SPL++
Sbjct: 283 ALKKYGYYDNSVMVFSTDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIK 338

Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
            +   +   VH++DW PTL+S A
Sbjct: 339 RKRRTSWALVHITDWYPTLVSLA 361


>gi|444723687|gb|ELW64328.1| Arylsulfatase I [Tupaia chinensis]
          Length = 613

 Score =  292 bits (748), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 148/322 (45%), Positives = 208/322 (64%), Gaps = 11/322 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 93  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 151

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 152 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 211

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS  
Sbjct: 212 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPR 271

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           +PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 272 QPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 326

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 327 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 382

Query: 360 GIVAEQYVHVSDWLPTLLSAAN 381
              +   VH++DW PTL+  A 
Sbjct: 383 RRTSRALVHITDWYPTLVGLAG 404


>gi|391330458|ref|XP_003739677.1| PREDICTED: arylsulfatase J-like [Metaseiulus occidentalis]
          Length = 633

 Score =  292 bits (747), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 160/427 (37%), Positives = 239/427 (55%), Gaps = 16/427 (3%)

Query: 32  TRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           T +  F V+ +A +L  VF  L      P II  + DD+GW+DV FH   QIPTPNIDA+
Sbjct: 31  TVLAIFGVIIIAISLLTVF--LWKRQRAPDIILFIVDDMGWSDVSFHANGQIPTPNIDAM 88

Query: 92  AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
               ++L ++Y    CTPSR A++TGK+PI  G+Q  V+    +  L L  ++LPQYL++
Sbjct: 89  CSDAVLLNSHYVQASCTPSRGALLTGKYPIKIGLQEYVIQPGRQEALHLKHRLLPQYLRD 148

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK-MWGLDMRRDL 210
           LGY T +VGKWHLGFY ++Y P  RGF+S  G++ G   Y++HSA +     G D   + 
Sbjct: 149 LGYATHLVGKWHLGFYAEDYLPENRGFDSFYGFYNGAGTYYNHSASDADGRIGYDWHLNK 208

Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA-NPYEPLQAPDHYLNI 269
           E   D HGKY+TD+ T     +I +   ++P+FL ++H A H   N  E  +    ++  
Sbjct: 209 ESDPDAHGKYATDIITQRVKHLIQSRDPEKPMFLMISHMAPHGGDNEDELFEVDRQWIED 268

Query: 270 HR--HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL-- 325
               HI    R+K+A ++  LD++ G+ V AL++  ML NS+I+F SDNGG +       
Sbjct: 269 PEIAHIMVESRTKYAGMIRALDDAFGQTVVALKEAEMLQNSVILFTSDNGGPSIVDESYK 328

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL--LSAANKS 383
             ASNWPLRG K ++WEGGVR +  +WSPLL+S+  + +   H++DWLPT   L+     
Sbjct: 329 TGASNWPLRGQKFSMWEGGVRVSSFLWSPLLKSQRYINKHLYHITDWLPTFYKLAGGELR 388

Query: 384 DIP------NYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKY 437
           D+P      +  +S   N     +N ++ Y+ G  E  + R  N    +     E +P Y
Sbjct: 389 DLPIDIDGHDIWDSIDRNTDSPRKNLLINYDTGGTENFAVRRGNFKLMHGKPPPEVDPWY 448

Query: 438 ENRYENG 444
           ++ Y   
Sbjct: 449 DSWYATA 455


>gi|241596950|ref|XP_002404637.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215500440|gb|EEC09934.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 406

 Score =  292 bits (747), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 150/343 (43%), Positives = 207/343 (60%), Gaps = 5/343 (1%)

Query: 44  FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
           FT+++      ASS PP+++ ILADDLGW+DV +HG DQIPTPNID LA  GIIL + Y 
Sbjct: 13  FTVALFLEATSASSQPPNLVMILADDLGWHDVSYHGSDQIPTPNIDVLAMDGIILFHNYV 72

Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-- 161
             L TP+R+A++TG +PIHTG Q   +   +  GL     +LPQ    L      +G   
Sbjct: 73  QPLSTPTRAALLTGLYPIHTGTQRLDIGSADPIGLSADFTLLPQLSVTLADNFTSLGARS 132

Query: 162 -WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE--MKMWGLDMRRDLEPAWDLHG 218
            WHLGF K E+ PT RGF++  G + G  DY+ H A +  + + G  ++ +     +  G
Sbjct: 133 GWHLGFCKDEFKPTKRGFDTFYGIYNGDSDYWTHFARDNNIDVSGHALKDEKRALVEEAG 192

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
           +Y T +   +AV +IHN   ++P FLY A  A H       LQAP  Y++   ++ D+ R
Sbjct: 193 RYLTSLLANQAVQLIHNRPKNKPFFLYFAPTAVHCGGSNGSLQAPKEYISKFGYLADYDR 252

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
             FA  L +LD+SVG +VEAL     L+N++I F +DNGGA  GF+ N + NWPLRG K 
Sbjct: 253 QLFAGSLAELDKSVGLIVEALYVSGQLNNTVIAFSTDNGGAPVGFSANTSPNWPLRGTKG 312

Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           T+ EGGVRG G +WS  L +RG V +Q  HV+DW+PT  +AA 
Sbjct: 313 TVAEGGVRGPGFLWSSSLTTRGRVTQQLFHVTDWMPTFYTAAG 355


>gi|348583281|ref|XP_003477401.1| PREDICTED: arylsulfatase I [Cavia porcellus]
          Length = 572

 Score =  292 bits (747), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 151/332 (45%), Positives = 211/332 (63%), Gaps = 11/332 (3%)

Query: 53  LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
           L A    PHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS
Sbjct: 41  LAAPPQRPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRS 99

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
            ++TG++ IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  
Sbjct: 100 QLLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECL 159

Query: 173 PTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
           PT RGF++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  
Sbjct: 160 PTRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASH 219

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+   PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+
Sbjct: 220 ILASHNPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEA 274

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           V  +  AL++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G I
Sbjct: 275 VRNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFI 330

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            SPLL+ +   +   VH++DW PTL+  A  +
Sbjct: 331 HSPLLKRKRWTSHALVHITDWYPTLVGLAGGT 362


>gi|291239530|ref|XP_002739676.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 507

 Score =  292 bits (747), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 161/355 (45%), Positives = 216/355 (60%), Gaps = 20/355 (5%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           ++F +  +A  L+++ +        P+I+FILADDLGW+DVG+H    I TPNID LA  
Sbjct: 4   VSFYIAFIAGILTILPIAGAPFGTRPNIVFILADDLGWHDVGYHD-SIIRTPNIDKLAAE 62

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           G+ L+NYY   LCTP+R+ +MTG++ IHTGMQH VL   E   LP  E ++PQ LKE GY
Sbjct: 63  GVKLENYYVTPLCTPTRAVLMTGRYQIHTGMQHGVLMAQEPRCLPTDEVLMPQKLKESGY 122

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS----AEEMKMWGLDMRR-D 209
            T +VGKWHLGFYK   TP  RGF++  G +    DYF+H+       +  W  D+R  D
Sbjct: 123 TTHMVGKWHLGFYKWACTPNHRGFDTFFGMYLAGGDYFNHTRLCHGRRLAAW--DLRDGD 180

Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
              A +  GKYST VF  +A +II  H    PLFLYL+  A H+     PLQ P+ Y+N+
Sbjct: 181 QVVAPEYVGKYSTIVFAEKAQEIIKKHDPTNPLFLYLSFQAVHA-----PLQVPERYINM 235

Query: 270 HR-HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
           ++  I D  R  +A +   +DE++G V   L +R +L N+IIVF SDNGGA     L  A
Sbjct: 236 YKDDIRDESRRIYAGMTTCMDEAIGNVTRTLARRGLLDNTIIVFSSDNGGA----KLYGA 291

Query: 329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAAN 381
           SN+PLRG K TL+EGGVR    ++SPLL +  RG    + ++V DW PT L  A 
Sbjct: 292 SNYPLRGQKGTLYEGGVRAPAFVYSPLLSNRVRGTATHELMYVGDWFPTFLHLAG 346


>gi|195403369|ref|XP_002060263.1| GJ19825 [Drosophila virilis]
 gi|194140907|gb|EDW57358.1| GJ19825 [Drosophila virilis]
          Length = 324

 Score =  291 bits (746), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 138/281 (49%), Positives = 188/281 (66%), Gaps = 3/281 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II IL DD+G+NDV FHG +QI TPNIDA AY+G+IL  YY   LCTPSR+A++TGK+
Sbjct: 28  PNIIIILIDDMGFNDVSFHGSNQILTPNIDAFAYNGVILNRYYVPNLCTPSRAALLTGKY 87

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIH GMQH V    E  GLPL E+++PQ+ ++ GY T++VGKWHLGF+++++TP  RGF+
Sbjct: 88  PIHNGMQHFVQIPDEPWGLPLGERLMPQFFRDAGYSTQLVGKWHLGFWRQDHTPIMRGFD 147

Query: 180 SHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
            H GY+ G+ DY+DH+   +      G D RRDL+     +G Y+T+ FT EA  II  H
Sbjct: 148 HHFGYYNGYIDYYDHTHYMLDRNYTAGADFRRDLQRCHSDNGTYATEAFTKEARRIIEQH 207

Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
               PLF+ L+H A H+ N  +P+QAP   +    HI D KR  +A ++  LD+SVG+ +
Sbjct: 208 DLSRPLFMVLSHLAVHTGNENQPMQAPYEEVAKFVHISDPKRRTYAGMVSSLDKSVGQTI 267

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
            AL  R ML+NSI++  SDNG    G + N+ SN P RGV 
Sbjct: 268 RALSDRGMLNNSIVLLYSDNGAPTLGIHSNSGSNHPFRGVS 308


>gi|348514291|ref|XP_003444674.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
          Length = 570

 Score =  291 bits (746), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 146/334 (43%), Positives = 206/334 (61%), Gaps = 12/334 (3%)

Query: 50  FVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTP 109
           F+       PPHIIFI+ DD G+ND+G+H  D I TP +D LA  G+ L+NYY   +CTP
Sbjct: 38  FIGRFPVPQPPHIIFIMTDDQGFNDIGYHSSD-IRTPVLDKLAADGVKLENYYIQPICTP 96

Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
           SRS  +TG++ IHTG+QH+++  C+   LP  +  LPQ L+ELGY T +VGKWHLGFYKK
Sbjct: 97  SRSQFITGRYQIHTGLQHSIIRPCQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKK 156

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
           E  PT RGF+++ G  TG  +Y+ +   +   + G D+      AW   GKYST ++T  
Sbjct: 157 ECLPTRRGFDTYFGSLTGSVNYYTYDGCDGAGLCGFDLHEGESVAWGQSGKYSTHLYTQR 216

Query: 229 AVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
              I+  H    +PLF++L+  A H+     PLQ PD Y+  +  +E+  R K+AA++  
Sbjct: 217 VRKILATHDPQSQPLFIFLSFQAVHT-----PLQYPDEYIYPYLGLENVARRKYAAMVSA 271

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+V  +  AL +     NS+I+F +DNGG      L+  SNWPLRG K T WEGG+RG
Sbjct: 272 VDEAVRNITYALRKYGYYQNSVIIFSTDNGGQP----LSGGSNWPLRGRKGTYWEGGIRG 327

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            G + SPLL  +  V++  VH++DW PTL+  A 
Sbjct: 328 LGFVHSPLLRKKRRVSKALVHITDWYPTLVGLAG 361


>gi|241595184|ref|XP_002404450.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215502341|gb|EEC11835.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 311

 Score =  291 bits (745), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 141/310 (45%), Positives = 199/310 (64%)

Query: 71  GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
           GW+D   HG  QIPTPN+DA+A  GIIL  +Y   LCTPSR+A+MTG++P H GMQH+V+
Sbjct: 1   GWDDTSIHGSSQIPTPNMDAIAADGIILNQHYVQPLCTPSRAALMTGRYPFHVGMQHSVI 60

Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
              E   LPL+  ++P+Y + LGY+T +VGKWHLG+Y ++Y P  RGF++ LG++    D
Sbjct: 61  KPAEPWALPLNYTLMPEYFRCLGYKTHMVGKWHLGYYDRQYVPIKRGFDTFLGFYNPSLD 120

Query: 191 YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAA 250
           Y++ +       G D R   +  W    +Y+T  +T + V+II  H+   P+FL+L+H A
Sbjct: 121 YYNQNFTGNNHTGHDFRCGDQNYWAEEKEYATYYYTNKTVEIIRRHNKSAPMFLFLSHQA 180

Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
            H +     LQ P H +    +I +  R+ FA ++  LD+SVG VV+ L++  ML+++II
Sbjct: 181 PHVSGGRPLLQVPTHGVRNFSYIGENNRTLFAGMVDALDQSVGIVVDTLQKAGMLNDTII 240

Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
           VF SDNG    GF  N   NWPLRG K TL+EGGVRG   +WS  L++   V+ Q +H+S
Sbjct: 241 VFCSDNGALPWGFKSNRGYNWPLRGGKFTLYEGGVRGTAFLWSRRLKNTKRVSNQLMHIS 300

Query: 371 DWLPTLLSAA 380
           DWLPTL SAA
Sbjct: 301 DWLPTLYSAA 310


>gi|348501876|ref|XP_003438495.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
          Length = 571

 Score =  291 bits (745), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 145/324 (44%), Positives = 201/324 (62%), Gaps = 11/324 (3%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPH IFI+ DD G+ D+G+HG D I TP +D LA  G+ L+NYY   +C+PSRS +MTG+
Sbjct: 45  PPHFIFIMVDDQGYGDIGYHGSD-IHTPVLDRLAAEGVKLENYYVQPICSPSRSQLMTGR 103

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH+++   +   LP     LP+ L E GY T +VGKWHLGF +    PT RGF
Sbjct: 104 YQIHTGLQHSIIRPRQPLCLPPDSPTLPERLAEAGYATHMVGKWHLGFCRPSCLPTGRGF 163

Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           +S LG  TG  D+F + S +  +  G D+     PAW++ G YST ++      I+  H 
Sbjct: 164 QSFLGTLTGSGDHFSYQSCDGAEACGFDLHDGDRPAWEMAGNYSTLLYIDRVKQILKRHD 223

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
              PLFLYL+  A H+     PLQ PDH+L+++    +  R  +AA+L  LD  V +VV+
Sbjct: 224 PHTPLFLYLSLQAAHT-----PLQVPDHFLHLYDSHRNRLRRHYAAMLSCLDSGVAQVVQ 278

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
            L+ + +  NS++++ SDNGG      L+  SNWPLRG K T WEGG+R  G + SPLL+
Sbjct: 279 ELKTQGLYENSVLIYSSDNGGQP----LSGGSNWPLRGGKGTYWEGGIRAVGFVHSPLLK 334

Query: 358 SRGIVAEQYVHVSDWLPTLLSAAN 381
            +GIV+   +HVSDW PTLL  A 
Sbjct: 335 KKGIVSRALIHVSDWYPTLLGLAG 358


>gi|156380740|ref|XP_001631925.1| predicted protein [Nematostella vectensis]
 gi|156218974|gb|EDO39862.1| predicted protein [Nematostella vectensis]
          Length = 540

 Score =  290 bits (743), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 148/339 (43%), Positives = 216/339 (63%), Gaps = 19/339 (5%)

Query: 53  LVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGIILKNYYTVQLCTPSR 111
           L++ +GPPHI+FIL DDLGW+DVG+H +   + TPNID LA  G+ L +YY+  +CTPSR
Sbjct: 28  LLSMAGPPHIMFILMDDLGWSDVGYHNISHAVKTPNIDKLASQGVKLMSYYSQPMCTPSR 87

Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
            A+MTGK+PIH GMQH V+      G+P     +PQ L+ LGYRT ++GKWHLGF+  +Y
Sbjct: 88  GALMTGKYPIHLGMQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGKWHLGFFDWDY 147

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
           TP  RGF+S LG++ G QD++ HS    KM  LD RRD EPA +  G++STDVFT EA++
Sbjct: 148 TPLRRGFDSFLGFFAGEQDHWRHS----KMGFLDFRRDEEPANEYGGQHSTDVFTQEAIN 203

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I   H+  +PLFL L++AA H+     PLQA  + +N    + D  R  +  ++   D S
Sbjct: 204 IAMRHNASQPLFLLLSYAAVHT-----PLQAHPNDVNKIGGVSDKDRQNYLGMMGAADWS 258

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNG---GAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           +G++++  ++  + +N+++++ SDNG   G   G+      NWPLRG K++L+EGGVR  
Sbjct: 259 IGRLIDVYKRNGLWNNTLMIWASDNGAQPGKGGGY------NWPLRGYKSSLFEGGVRVP 312

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
             +   +L+ +G       HV+DW PTL+  A     P+
Sbjct: 313 AFVHGEMLQRKGGTVNDLFHVTDWYPTLVKLAGGEVEPD 351



 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 40/133 (30%), Positives = 56/133 (42%), Gaps = 30/133 (22%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
           +IDG+D W  LS  +PSKR  ILHNID                    +   Q +    P 
Sbjct: 351 DIDGVDQWPTLSEGKPSKREEILHNID--------------------IPANQEEERMAPR 390

Query: 634 QV-YLSGLSDREWLALAMRKLRDAASIQC------GPVKEVPCE---PQIAPCLFDIKND 683
              Y SG + R      + K+ DA   Q       GPV E   +   P +   L++I  D
Sbjct: 391 GFNYYSGAALRRGHMKLVYKMGDAGWYQLPENGHRGPVVEEMVKDRLPIVELALYNITAD 450

Query: 684 PCEKNNLADRSED 696
           P E+N+L+  + D
Sbjct: 451 PEERNDLSKLNPD 463



 Score = 40.4 bits (93), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 20/27 (74%)

Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNID 535
           +IDG+D W  LS  +PSKR  ILHNID
Sbjct: 351 DIDGVDQWPTLSEGKPSKREEILHNID 377


>gi|301621596|ref|XP_002940132.1| PREDICTED: arylsulfatase I-like [Xenopus (Silurana) tropicalis]
          Length = 575

 Score =  290 bits (743), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 147/326 (45%), Positives = 209/326 (64%), Gaps = 11/326 (3%)

Query: 57  SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
           S PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++T
Sbjct: 46  SRPPHIIFILTDDQGFHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLIT 104

Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           G++ IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFYKKE  PT R
Sbjct: 105 GRYQIHTGLQHSIIRPRQPNCLPLHQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRR 164

Query: 177 GFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
           GF++ LG  TG+ DY+ + + +   + G D+      AWD  GKYST ++      I+ +
Sbjct: 165 GFDTFLGSLTGNVDYYSYDNCDGPGVCGFDLHEGENVAWDQAGKYSTLLYAQRVNQILAS 224

Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
           H+  +P+F+Y+A  A H+     PLQ+P  Y+  +R + +  R K+AA++  +D +V  +
Sbjct: 225 HNPQQPIFIYVAFQAVHT-----PLQSPREYIYPYRGMGNVARRKYAAMVTCMDAAVKNI 279

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
             AL++    SNS+IVF SDNGG       +  SNWPLRG K T WEGGVRG G + SPL
Sbjct: 280 TRALKKYGYYSNSVIVFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPL 335

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAAN 381
           ++ +   +   +H++DW PTL+  A 
Sbjct: 336 IKKKRRTSRALMHITDWYPTLVKLAG 361


>gi|291387626|ref|XP_002710353.1| PREDICTED: arylsulfatase I-like [Oryctolagus cuniculus]
          Length = 571

 Score =  290 bits (742), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 149/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 48  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS  
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPR 226

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 227 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 337

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   VH++DW PTL+  A  S
Sbjct: 338 RRTSRALVHITDWYPTLVGLAGGS 361


>gi|351713085|gb|EHB16004.1| Arylsulfatase I [Heterocephalus glaber]
          Length = 573

 Score =  290 bits (742), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 149/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 49  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 107

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T ++GKWHLGFY+KE  PT RGF++
Sbjct: 108 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMLGKWHLGFYRKECLPTRRGFDT 167

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS  
Sbjct: 168 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQ 227

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 228 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMSNVSRRKYAAMVTCMDEAVRNITWAL 282

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NSII+F SDNGG       +  SNWPLRG K T WEGGVRG G I SPLL+ +
Sbjct: 283 KRYGFYNNSIIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFIHSPLLKRK 338

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   VH++DW PTL+  A  +
Sbjct: 339 RRTSHALVHITDWYPTLVGLARGT 362


>gi|403285505|ref|XP_003934063.1| PREDICTED: arylsulfatase I [Saimiri boliviensis boliviensis]
          Length = 572

 Score =  290 bits (742), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 148/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 48  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ NHS  
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILANHSPQ 226

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 227 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 337

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   +H++DW PTL+  A  +
Sbjct: 338 RRTSRALMHITDWYPTLVGLAGGT 361


>gi|397517762|ref|XP_003829075.1| PREDICTED: arylsulfatase I [Pan paniscus]
 gi|410214522|gb|JAA04480.1| arylsulfatase family, member I [Pan troglodytes]
 gi|410261150|gb|JAA18541.1| arylsulfatase family, member I [Pan troglodytes]
 gi|410300016|gb|JAA28608.1| arylsulfatase family, member I [Pan troglodytes]
 gi|410336277|gb|JAA37085.1| arylsulfatase family, member I [Pan troglodytes]
          Length = 569

 Score =  290 bits (741), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 147/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 48  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS  
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQ 226

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 227 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 337

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   +H++DW PTL+  A  +
Sbjct: 338 QRTSRALMHITDWYPTLVGLAGGT 361


>gi|395817250|ref|XP_003782086.1| PREDICTED: arylsulfatase I [Otolemur garnettii]
          Length = 572

 Score =  290 bits (741), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 148/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 48  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS  
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQ 226

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 227 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 337

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   VH++DW PTL+  A  +
Sbjct: 338 RRTSRALVHITDWYPTLVGLAGGT 361


>gi|114145559|ref|NP_001041346.1| arylsulfatase I precursor [Rattus norvegicus]
 gi|123779983|sp|Q32KJ8.1|ARSI_RAT RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
 gi|81158022|tpe|CAI84985.1| TPA: arylsulfatase I [Rattus norvegicus]
 gi|149064375|gb|EDM14578.1| similar to RIKEN cDNA 9330196J05 [Rattus norvegicus]
          Length = 573

 Score =  289 bits (740), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 148/324 (45%), Positives = 209/324 (64%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 48  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS  
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQ 226

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           +PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 227 KPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKKK 337

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   VH++DW PTL+  A  +
Sbjct: 338 RRTSRALVHITDWYPTLVGLAGGT 361


>gi|344265150|ref|XP_003404649.1| PREDICTED: arylsulfatase I-like [Loxodonta africana]
          Length = 573

 Score =  289 bits (740), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 149/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 50  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 108

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 109 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 168

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS  
Sbjct: 169 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTLLYAQRASHILASHSPR 228

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 229 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 283

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NSII+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 284 KRYGFYNNSIIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 339

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   VH++DW PTL+  A  +
Sbjct: 340 RRTSRALVHITDWYPTLVGLAGGT 363


>gi|311250496|ref|XP_003124150.1| PREDICTED: arylsulfatase I-like [Sus scrofa]
          Length = 573

 Score =  289 bits (740), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 147/325 (45%), Positives = 207/325 (63%), Gaps = 11/325 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++
Sbjct: 49  PHIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRY 107

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QH+++   +   LPL +  LPQ L++LGY T +VGKWHLGFY+KE  PT RGF+
Sbjct: 108 QIHTGLQHSIIRPRQPNCLPLDQVTLPQRLQQLGYATHMVGKWHLGFYRKECLPTRRGFD 167

Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           + LG  TG+ DY+ + + +   + G D+      AW L G+YST ++      I+  HS 
Sbjct: 168 TFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTLLYAQRVSRILAGHSP 227

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
             PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  A
Sbjct: 228 RRPLFLYVAFQAVHT-----PLQSPREYLYRYRGMGNVARRKYAAMVTCMDEAVRNITGA 282

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ 
Sbjct: 283 LKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKR 338

Query: 359 RGIVAEQYVHVSDWLPTLLSAANKS 383
               +   +H++DW PTL+  A  +
Sbjct: 339 TRRTSRALLHITDWYPTLVGLAGGT 363


>gi|59797060|ref|NP_001012301.1| arylsulfatase I precursor [Homo sapiens]
 gi|74722581|sp|Q5FYB1.1|ARSI_HUMAN RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
 gi|58201084|gb|AAW66665.1| arylsulfatase I [Homo sapiens]
 gi|120538357|gb|AAI29997.1| Arylsulfatase family, member I [Homo sapiens]
 gi|120538621|gb|AAI29996.1| Arylsulfatase family, member I [Homo sapiens]
 gi|220983388|dbj|BAH11166.1| arylsulfatase I [Homo sapiens]
          Length = 569

 Score =  289 bits (739), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 147/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 48  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS  
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQ 226

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 227 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 337

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   +H++DW PTL+  A  +
Sbjct: 338 QRTSRALMHITDWYPTLVGLAGGT 361


>gi|426350600|ref|XP_004042858.1| PREDICTED: arylsulfatase I [Gorilla gorilla gorilla]
          Length = 569

 Score =  289 bits (739), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 147/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 48  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS  
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQ 226

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 227 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 337

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   +H++DW PTL+  A  +
Sbjct: 338 QRTSRALMHITDWYPTLVGLAGGT 361


>gi|126291233|ref|XP_001378869.1| PREDICTED: arylsulfatase I [Monodelphis domestica]
          Length = 584

 Score =  289 bits (739), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 147/324 (45%), Positives = 209/324 (64%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 55  HIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 113

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E+GY T +VGKWHLGFYKK   PT RGF++
Sbjct: 114 IHTGLQHSIIRPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDT 173

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW+  G+YST ++   A  I+ +HS  
Sbjct: 174 FLGSLTGNVDYYTYDNCDGPGVCGYDLHEGENVAWEQSGQYSTLLYAQRASQILASHSPH 233

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           +PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 234 QPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 288

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++     NS+I+F +DNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 289 KRYGYYDNSVIIFSTDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGIGFVHSPLLKRK 344

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   VH++DW PTL++ A  S
Sbjct: 345 RRTSRALVHITDWYPTLVTLAGGS 368


>gi|114326196|ref|NP_001041583.1| arylsulfatase I precursor [Canis lupus familiaris]
 gi|81158064|tpe|CAI85006.1| TPA: arylsulfatase I [Canis lupus familiaris]
          Length = 575

 Score =  288 bits (738), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 147/324 (45%), Positives = 207/324 (63%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 51  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 109

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 110 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 169

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++      I+ +HS  
Sbjct: 170 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPR 229

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 230 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITSAL 284

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 285 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 340

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   VH++DW PTL+  A  +
Sbjct: 341 RRTSRALVHITDWYPTLVGLAGGT 364


>gi|218563492|sp|Q32KH7.2|ARSI_CANFA RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
          Length = 573

 Score =  288 bits (738), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 147/324 (45%), Positives = 207/324 (63%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 49  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 107

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 108 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 167

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++      I+ +HS  
Sbjct: 168 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPR 227

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 228 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITSAL 282

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 283 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 338

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   VH++DW PTL+  A  +
Sbjct: 339 RRTSRALVHITDWYPTLVGLAGGT 362


>gi|410949653|ref|XP_003981535.1| PREDICTED: arylsulfatase I, partial [Felis catus]
          Length = 570

 Score =  288 bits (736), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 147/324 (45%), Positives = 207/324 (63%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 46  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 104

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 105 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 164

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++      I+ +HS  
Sbjct: 165 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASHSPR 224

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 225 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 279

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 280 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 335

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   VH++DW PTL+  A  +
Sbjct: 336 RRTSRALVHITDWYPTLVGLAGGT 359


>gi|47215546|emb|CAG06276.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 527

 Score =  288 bits (736), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 146/325 (44%), Positives = 204/325 (62%), Gaps = 12/325 (3%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHIIFI+ DD G+ND+G+H  D I TP +D LA  G+ L+NYY   +CTPSRS ++TG+
Sbjct: 47  PPHIIFIMTDDQGFNDIGYHSSD-IKTPVLDKLAADGVKLENYYIQPICTPSRSQLITGR 105

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH+++   +   LP  +  LPQ L+ELGY T +VGKWHLGFYKKE  PT RGF
Sbjct: 106 YQIHTGLQHSIIRPRQPNCLPFDQITLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGF 165

Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           +++ G  TG  +Y+ + S +   + G D+      AW   GKYST ++T     I+  H 
Sbjct: 166 DTYFGSLTGSVNYYTYDSCDGPGVCGFDLHEGESVAWSQRGKYSTHLYTQRVRKILATHD 225

Query: 238 -TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
              +PLF++L+  A H+     PLQ P  Y+  +R +E+  R K+AA++  +DE+V  + 
Sbjct: 226 PQSQPLFIFLSFQAVHT-----PLQCPQEYIYPYRGLENIARRKYAAMVSAVDEAVRNIT 280

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
             L +     NSII+F +DNGG      L+  SNWPLRG K T WEGGVRG G + SPLL
Sbjct: 281 YGLRKYGYYQNSIIIFSTDNGGQP----LSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLL 336

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAAN 381
             +  V++  VH++DW PTL+  A 
Sbjct: 337 RKKRRVSKALVHITDWYPTLVGLAG 361


>gi|410913855|ref|XP_003970404.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
          Length = 570

 Score =  287 bits (734), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 146/324 (45%), Positives = 203/324 (62%), Gaps = 12/324 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHIIFI+ DD G+ND+G+H  D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++
Sbjct: 48  PHIIFIMTDDQGFNDIGYHSSD-IKTPVLDKLAADGVKLENYYIQPICTPSRSQLITGRY 106

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QH+++   +   LP  +  LPQ L+ELGY T +VGKWHLGFYKKE  PT RGF+
Sbjct: 107 QIHTGLQHSIIRPRQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFD 166

Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           ++ G  TG  +Y+ + S +   M G D+      AW   GKYST ++T     I+  H  
Sbjct: 167 TYFGSLTGSVNYYTYDSCDGPGMCGFDLHEGESVAWSQKGKYSTHLYTQRVRKILATHDP 226

Query: 239 -DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
             +PLF++L+  A H+     PLQ P  Y+  +R +E+  R K+AA++  +DE+V  +  
Sbjct: 227 RSQPLFIFLSFQAVHT-----PLQCPREYIYPYRGLENIARRKYAAMVSAVDEAVRNITY 281

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
            L +     NSI++F +DNGG      L+  SNWPLRG K T WEGGVRG G I SPLL 
Sbjct: 282 GLRKYGYYENSIMIFSTDNGGQP----LSGGSNWPLRGRKGTYWEGGVRGLGFIHSPLLR 337

Query: 358 SRGIVAEQYVHVSDWLPTLLSAAN 381
            +  V++  VH++DW PTL+  A 
Sbjct: 338 KKKRVSKALVHITDWYPTLVGLAG 361


>gi|443690889|gb|ELT92899.1| hypothetical protein CAPTEDRAFT_165852 [Capitella teleta]
          Length = 484

 Score =  286 bits (731), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 144/325 (44%), Positives = 202/325 (62%), Gaps = 13/325 (4%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+ ILADDLGWNDVG++   +I TPN+D LA +G+IL   Y +  C+PSR+A++TG++
Sbjct: 28  PNILLILADDLGWNDVGWNN-PEIKTPNLDRLASNGVILNASYALSTCSPSRTALLTGRY 86

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P   G+QH V+   +  GLPL+  +LPQ LK LGY T  +GKWHLGF + EYTPTFRGF+
Sbjct: 87  PFKLGLQHGVVKKGKPYGLPLNITLLPQKLKHLGYSTHAIGKWHLGFCRWEYTPTFRGFD 146

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           S  G+++G +DY+      ++  G D R + +       KYST  +   AV II  H   
Sbjct: 147 SFYGFYSGSEDYYKRKTAAIR--GYDFRMNTKVFKPKIKKYSTLDYGRRAVKIIQAHKRT 204

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           EPLFLY+   A HS     PLQ P  +   +R+I D  R  F+ ++  +D ++G  ++AL
Sbjct: 205 EPLFLYMPFQAVHS-----PLQVPKSFEFKYRNIVDRNRRIFSGMVSAMDSAIGMTIKAL 259

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP-LLES 358
           +QR ML+N++IVF+SDNGG          +NWPLRG K TLWEGG R    +W   LL+ 
Sbjct: 260 KQRNMLTNTMIVFLSDNGGTP----FFGGNNWPLRGSKATLWEGGTRVPSFVWGKMLLQK 315

Query: 359 RGIVAEQYVHVSDWLPTLLSAANKS 383
            G  + + +H  DW PTL+S A  +
Sbjct: 316 AGYTSNEMIHAVDWFPTLVSLAGGT 340



 Score = 42.4 bits (98), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 38/160 (23%), Positives = 71/160 (44%), Gaps = 35/160 (21%)

Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
           + ++DG+D W +LS+ +PS+R   ++ I+     +A+ +GK+KL          ++ T G
Sbjct: 342 EKDVDGVDQWQMLSQRQPSRRKEFVYGINSITGRAAIRRGKYKL----------IEGTAG 391

Query: 632 PDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA 691
               +    S R    + M++++                   +  L++I+ DP E N+L+
Sbjct: 392 HYSGWYPAPSSR---VVKMKRMKQLNKRHVN-----------STLLYNIQVDPIESNDLS 437

Query: 692 DRSED------QRINHY-TTEVGRFNQIAYPDKEEEEEKK 724
            +         +R+N Y   EV  FN    P KE   + K
Sbjct: 438 AKYPHLTQFMLRRLNRYKAQEVEAFN----PRKERYSQPK 473



 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 16/49 (32%), Positives = 33/49 (67%)

Query: 507 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENS 555
           + ++DG+D W +LS+ +PS+R   ++ I+     +A+ RGK+KL++  +
Sbjct: 342 EKDVDGVDQWQMLSQRQPSRRKEFVYGINSITGRAAIRRGKYKLIEGTA 390


>gi|241025894|ref|XP_002406215.1| arylsulfatase J, putative [Ixodes scapularis]
 gi|215491896|gb|EEC01537.1| arylsulfatase J, putative [Ixodes scapularis]
          Length = 437

 Score =  285 bits (730), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 150/324 (46%), Positives = 199/324 (61%), Gaps = 5/324 (1%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+ +  DDLGW DVGFHG  QIP PNIDALA  G+IL  YY       SR  ++TG +
Sbjct: 22  PNIVIVTVDDLGWADVGFHGSRQIPVPNIDALAADGVILNKYYAQPWPLSSRIGLLTGIY 81

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P  TG+   V+  C+   LP   +ILP + K LGYRT  VG W+LGFYKK +TP  RGF+
Sbjct: 82  PYRTGVG-RVMLPCQPVALPSVFRILPTFFKSLGYRTHFVGVWNLGFYKKRFTPVNRGFD 140

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT--AEAVDIIHNHS 237
           +    WTG  DY+ H  +  KM G D+R + +  W+  G YST +FT  A+    +    
Sbjct: 141 TAYAKWTGPGDYWTHDMQT-KMQGFDLRLNDDLMWNQTGVYSTRLFTERADPTRKLCFFV 199

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
             +PL L L+H A H+AN +  LQ P  +L+    I D KR  FA  L ++D+S+G++ E
Sbjct: 200 LHQPLLLILSHQALHTANYHGSLQCPLEHLDSFGFIRDRKRRIFAGALTEVDQSLGQIFE 259

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAG-FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
           AL  RR+L+N+I+VFVS +GG   G F  N + N+PLRG K T +EGGVR    +WSPLL
Sbjct: 260 ALHNRRLLNNTILVFVSASGGMPVGDFTNNLSFNFPLRGTKGTCFEGGVRVPAFVWSPLL 319

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA 380
                 + Q +HV+DWLPTL SAA
Sbjct: 320 NKTRRTSSQLMHVTDWLPTLFSAA 343


>gi|432904444|ref|XP_004077334.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
          Length = 572

 Score =  285 bits (729), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 140/323 (43%), Positives = 203/323 (62%), Gaps = 11/323 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PH+IFI+ DD G+ D+G+HG D + TP +D LA  G+ L+NYY   +C+PSRS +MTG++
Sbjct: 44  PHLIFIMVDDQGYADIGYHGSD-VHTPVLDQLAAEGVKLENYYVQPICSPSRSQLMTGRY 102

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QH+++   +   LP     LP+ L + GY T +VGKWHLGF +    PT RGF+
Sbjct: 103 QIHTGLQHSIIRPRQPLCLPPDIPTLPECLLKAGYHTHMVGKWHLGFCRPSCLPTRRGFQ 162

Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           S  G  TG  D+F + S +  +  G D+     PAW++ G YST ++      I+ NH  
Sbjct: 163 SFFGTLTGSGDHFSYQSCDGAEACGFDLHDGSRPAWEMRGNYSTLLYIERVKQILRNHDP 222

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
           + PLFLYL+  A H+     PLQ P H+LN ++   +  R  +AA+L  LD+ V +VV+ 
Sbjct: 223 NTPLFLYLSLQAAHT-----PLQVPAHFLNPYKSQNNRHRRHYAAMLSCLDDGVAQVVKE 277

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L+   +  NSI+++ SDNGG      L+  SNWPLRG K + WEGG+R  G + SPLL+ 
Sbjct: 278 LKTTGIYDNSILIYSSDNGGQP----LSGGSNWPLRGGKGSYWEGGIRAVGFVHSPLLKK 333

Query: 359 RGIVAEQYVHVSDWLPTLLSAAN 381
           +G+++++ +HVSDW PTLL  A 
Sbjct: 334 KGVISKEMIHVSDWYPTLLGLAG 356


>gi|1089794|dbj|BAA08412.1| Arylsulfatase B [Rattus norvegicus]
          Length = 473

 Score =  285 bits (729), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 138/307 (44%), Positives = 200/307 (65%), Gaps = 15/307 (4%)

Query: 83  IPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSE 142
           I TP++DALA  G++L NYY   LCTPSRS ++TG++ IH G+QH ++  C+   +PL E
Sbjct: 7   IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHMGLQHYLIMTCQPNCVPLDE 66

Query: 143 KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM--- 199
           K+LPQ LK+ G  T +VGKWHLG Y+KE  PT RGF+++ GY  G +DY+ H A      
Sbjct: 67  KLLPQLLKDAGSSTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIEC 126

Query: 200 ---KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANP 256
                  LD+R   EPA +    YST++FT  A  +I NH  ++PLFLYLA  + H    
Sbjct: 127 LNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH---- 182

Query: 257 YEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
            +PLQ P+ Y+  +  I+D  R  +A ++  LDE+VG V +AL+ R + +N++++F +DN
Sbjct: 183 -DPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDN 241

Query: 317 GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           GG       +  +NWPLRG K TLWEGG+RGAG + SPLL+ +G+ + + +H++DWLPTL
Sbjct: 242 GGQTR----SGGNNWPLRGRKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTL 297

Query: 377 LSAANKS 383
           ++ A  S
Sbjct: 298 VNLAGGS 304


>gi|397466741|ref|XP_003805104.1| PREDICTED: arylsulfatase B [Pan paniscus]
          Length = 513

 Score =  285 bits (729), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 139/302 (46%), Positives = 194/302 (64%), Gaps = 15/302 (4%)

Query: 85  TPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKI 144
           TP++DALA  G++L NYYT  LCTPSRS ++TG++ I TG+QH +++ C+   +PL EK+
Sbjct: 49  TPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKL 108

Query: 145 LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH------SAEE 198
           LPQ LKE GY T +VGKWHLG Y+KE  PT RGF+++ GY  G +DY+ H       A  
Sbjct: 109 LPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLIDALN 168

Query: 199 MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
           +    LD R   E A      YST++FT  A+ +I NH  ++PLFLYLA  + H     E
Sbjct: 169 VTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----E 223

Query: 259 PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
           PLQ P+ YL  +  I+D  R  +A ++  +DE+VG V  AL+   + +N++ +F +DNGG
Sbjct: 224 PLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGG 283

Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
                 L   +NWPLRG K +LWEGGVRG G + SPLL+ +G+   + +H+SDWLPTL+ 
Sbjct: 284 Q----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVK 339

Query: 379 AA 380
            A
Sbjct: 340 LA 341



 Score = 42.7 bits (99), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 89  LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 124


>gi|125820285|ref|XP_692237.2| PREDICTED: arylsulfatase I-like [Danio rerio]
          Length = 568

 Score =  283 bits (725), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 140/325 (43%), Positives = 202/325 (62%), Gaps = 12/325 (3%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHIIFIL DD G+ND+G+H   +I +P +D LA  G+ L+NYY   LCTPSRS ++TG+
Sbjct: 43  PPHIIFILTDDQGFNDIGYHS-GEIRSPTLDKLASEGVRLENYYVQPLCTPSRSQLITGR 101

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH+++   +   LPL    LPQ L+E+GY T +VGKWHLGFY+K+  PT RGF
Sbjct: 102 YQIHTGLQHSIIRPRQPNCLPLDVVTLPQRLQEIGYSTHMVGKWHLGFYRKDCLPTRRGF 161

Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
            ++ G  TG  DY+ + S +   + G D+      AW   GKYST ++T     I+  H 
Sbjct: 162 HTYFGSLTGSVDYYTYGSCDGKSLCGFDLHEGESVAWGRAGKYSTHLYTQRVRKILATHD 221

Query: 238 -TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
            T +PLF++L+  A H+     PL+ P  Y+  +R+I +  R K+AA++  +DE+V  + 
Sbjct: 222 PTSQPLFIFLSLQAVHT-----PLKPPKEYIYPYRNIGNVPRRKYAAMVSIVDEAVRNIT 276

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
            AL +     NS+++F +DNG       L   SNWPLRG K T WEGG+RG G + SPL+
Sbjct: 277 YALRKYGFYRNSVVIFSTDNGAQP----LTGGSNWPLRGCKGTYWEGGIRGVGFVHSPLI 332

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAAN 381
             R  ++   +H++DW PTL+  A 
Sbjct: 333 RHRRRISRDLIHITDWYPTLVGLAG 357


>gi|72159051|ref|XP_791089.1| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 545

 Score =  283 bits (723), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 153/331 (46%), Positives = 205/331 (61%), Gaps = 23/331 (6%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHI+FILADD G+ND+G+     + TPN+D LA  GI L NYY   +CTPSR+ +M+GK
Sbjct: 58  PPHIVFILADDYGFNDIGYRN-PAMRTPNLDYLAAEGIKLDNYYVQPICTPSRAQLMSGK 116

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH++++  +   LPL    LPQ LKE GY T + GKWHLGFYKKE  PT RGF
Sbjct: 117 YQIHTGLQHSIIWPPQPNCLPLDLPTLPQKLKEAGYATHMAGKWHLGFYKKECWPTNRGF 176

Query: 179 ESHLGYWTGHQDYFDHSAEEM-----KMW---GLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           +S LG   G  D+F H+ E         W   GLD R  L+      G YST V      
Sbjct: 177 DSFLGILLGKGDHFLHTEEGGGGPYPSTWPWEGLDFRDGLQSTNAYSGIYSTHVIAERVE 236

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDFKRSKFAAILHKLD 289
           +II  H  D+PLFLY++  A H+     PLQ P+ YL      I+D KR  +A + + +D
Sbjct: 237 NIIEKHDKDKPLFLYVSFQAVHT-----PLQVPESYLQPFESSIQDEKRRIYAGMTYCMD 291

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN-AASNWPLRGVKNTLWEGGVRGA 348
           E+VG + + L+++ +  ++++VF SDNGG     N++  ASNWPLRG K TLWEGGVR  
Sbjct: 292 EAVGNITKKLKKQGLWDDTVLVFSSDNGG-----NIDQGASNWPLRGSKTTLWEGGVRAV 346

Query: 349 GLIWSPLLESR--GIVAEQYVHVSDWLPTLL 377
           G + SPLL  R  G V+ + + +SDW PTL+
Sbjct: 347 GFVTSPLLSERMKGTVSRELIDISDWYPTLI 377


>gi|410906623|ref|XP_003966791.1| PREDICTED: arylsulfatase J-like [Takifugu rubripes]
          Length = 560

 Score =  283 bits (723), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 152/359 (42%), Positives = 211/359 (58%), Gaps = 25/359 (6%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSG------------PPHIIFILADDLGWNDVGFHGLDQ 82
           M    +PL+F L  +    V SS              PH++FIL DD G+ DVG+HG  +
Sbjct: 1   MLILWVPLSFLLGFIISHSVQSSWENWRNGGGEFDKQPHVVFILVDDQGFRDVGYHG-SE 59

Query: 83  IPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSE 142
           I TP +D LA  G+ L+NYY   LC+PSRS +MTG++ IHTG+QH+++   +   LPL  
Sbjct: 60  IKTPTLDRLAAQGVKLENYYVQPLCSPSRSQLMTGRYQIHTGLQHSIIRATQPNCLPLEN 119

Query: 143 KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKM 201
             LP  LK+ GY T +VGKWHLGFYK+   PT RGF++  G   G  D++ H   E   M
Sbjct: 120 VTLPLKLKQAGYATHMVGKWHLGFYKRGCLPTQRGFDTFFGSLLGSGDHYSHYKCEAPGM 179

Query: 202 WGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEP 259
            G D+    E AW+   G YST +FT +A+ I+  H    +PLFLYLA+ A HS     P
Sbjct: 180 CGYDLYEGEEAAWEQDRGLYSTVMFTQKAISILAKHDPHRKPLFLYLAYQAVHS-----P 234

Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           LQ P  YL  ++ I +  R K+AA++  LDE++  +  AL++     N+++V+ SDNGG 
Sbjct: 235 LQVPSRYLERYKGISNVHRRKYAAMVSCLDEAIRNLTLALKRYGYYDNTVLVYSSDNGGQ 294

Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
                L   SNWPLRG K + WEGG+R  G + SPLL ++G      +H++DW PTL+S
Sbjct: 295 P----LLGGSNWPLRGSKASYWEGGIRAVGFVHSPLLRNKGTKCRSLIHITDWFPTLVS 349



 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 53/217 (24%), Positives = 82/217 (37%), Gaps = 62/217 (28%)

Query: 517 SVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEID 576
           S L RN+ +K  +++H  D  W  + ++ G+            GT E   N       +D
Sbjct: 324 SPLLRNKGTKCRSLIHITD--WFPTLVSLGE------------GTLEEHLN-------LD 362

Query: 577 GIDVWSVLSRNEPSKRNTILHNID--------DEWQI----------SALTKGKWKLVKV 618
           G DVW  +S   PS R+ ILHNID          W+           +AL  G WKL+  
Sbjct: 363 GYDVWEAISEGRPSPRHDILHNIDPIYIKAKNGSWKAGYGIWNTAIQAALRVGHWKLLTG 422

Query: 619 VKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLF 678
           +      V     P Q + +      W    +R  R  +                   LF
Sbjct: 423 IPGYSDWV-----PPQTFSNQQMGSRWHNEHIRFDRGKSL-----------------WLF 460

Query: 679 DIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
           +I  DP E+ +L+ R   Q +      + ++N+ A P
Sbjct: 461 NITADPYERVDLSQRYP-QIVKKMLIRLAQYNKTAVP 496


>gi|115947271|ref|XP_790151.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 500

 Score =  282 bits (722), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 149/358 (41%), Positives = 218/358 (60%), Gaps = 24/358 (6%)

Query: 44  FTLSMVFVDLVAS-----SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           F ++++ +  V S     S  P+I+FIL DDLG+NDVG+HG  +I TPNID LA  G+ L
Sbjct: 6   FFINLIIISNVCSKKIIQSTKPNIVFILIDDLGYNDVGYHG-SEIYTPNIDKLAREGVRL 64

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
           +NYY   +CTP+RS +++G++ IHTG+QH+ +   +   LP +       L+E GY T  
Sbjct: 65  ENYYVQPICTPTRSQLLSGRYQIHTGLQHSYIRPAQPLCLPTNLPTFADKLREAGYATHA 124

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKM------W-GLDMRRDLE 211
           VGKWHLGFYKKE  PT RGF+S+ GY TG +DY+ H      +      W G+D+  + +
Sbjct: 125 VGKWHLGFYKKECLPTQRGFDSYFGYLTGGEDYWTHHRAGDGLLPNSNHWLGMDLWDNEK 184

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
            AW+  G YS  VFT +   I+  H  ++P  LYL   + HS     PLQ P  Y   ++
Sbjct: 185 VAWEYVGNYSAFVFTEKIQQIVAQHPVEQPFLLYLPFQSVHS-----PLQVPSSYEERYK 239

Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
           +I++  R  +A ++  LDE+VGK+V +L+Q  +  N++++F +DNGG  A       +NW
Sbjct: 240 NIKNTNRRIYAGMMTCLDEAVGKIVHSLQQAGVWDNTVLIFSTDNGGEVAA----GGNNW 295

Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           PLRG K ++WEGG+RG G + S LL +  +G V +Q +HVSDW PTL+     S + N
Sbjct: 296 PLRGWKRSIWEGGMRGVGFVNSRLLPASVQGTVNKQLIHVSDWFPTLVQGVAGSSLDN 353


>gi|126697470|gb|ABO26692.1| sulfatase 1A precursor [Haliotis discus discus]
          Length = 477

 Score =  281 bits (719), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 149/344 (43%), Positives = 204/344 (59%), Gaps = 19/344 (5%)

Query: 46  LSMVFVDLVA-------SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           LS+V   +VA       ++  PHI+FI+ DDLGWNDVG+   D I TP +D LA SG+IL
Sbjct: 3   LSVVLAIIVALCLKISTAADKPHIVFIVVDDLGWNDVGWINPD-IKTPTLDRLARSGVIL 61

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
            + Y   LC+PSR+  M+GK P HTG+Q  V++  +   +P +   +PQ LK LGY T +
Sbjct: 62  NSSYVQPLCSPSRNCFMSGKFPYHTGLQDKVVFIEQPKYMPANLTTIPQRLKTLGYDTHM 121

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW-GLDMRRDLEPAWDLH 217
           VGKWHLGF   +YTPT+RGF++ +GY+ G +DYF H  +E+  + G D R   +      
Sbjct: 122 VGKWHLGFCNWKYTPTYRGFDTFMGYYAGMEDYFTHVRDEVADYNGYDFRNMTDVYKGAQ 181

Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
           G+YST VF   A+DII NH   +P++LYL   A H      PLQ P  Y + + HI D  
Sbjct: 182 GEYSTYVFANRAIDIIMNHDKSKPMYLYLPFQAVHV-----PLQVPTKYSDRYSHIHDLN 236

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           R  ++ ++  LD++V  +   LE    + N ++VF +DNGG         A+NWPLRG K
Sbjct: 237 RKVYSGMVSALDDAVYNITTVLEDLGFMDNLLLVFTTDNGGPPT----RGANNWPLRGGK 292

Query: 338 NTLWEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAA 380
           NTLWEGG RGA  ++S  LL+ +G       H  DW PTLL  A
Sbjct: 293 NTLWEGGTRGAAFVYSKTLLKKKGYTHPGLFHAVDWYPTLLDIA 336


>gi|395736371|ref|XP_003780537.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Pongo abelii]
          Length = 481

 Score =  280 bits (717), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 143/321 (44%), Positives = 205/321 (63%), Gaps = 11/321 (3%)

Query: 64  FILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHT 123
           FI+ DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ IHT
Sbjct: 39  FIVTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHT 97

Query: 124 GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG 183
           G+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++ LG
Sbjct: 98  GLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLG 157

Query: 184 YWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPL 242
             TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS   PL
Sbjct: 158 SLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPL 217

Query: 243 FLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQR 302
           FLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL++ 
Sbjct: 218 FLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRY 272

Query: 303 RMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIV 362
              +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +   
Sbjct: 273 GFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQRT 328

Query: 363 AEQYVHVSDWLPTLLSAANKS 383
           +   +H++DW PTL+  A  +
Sbjct: 329 SRALMHITDWYPTLVGLAGGT 349


>gi|260794509|ref|XP_002592251.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
 gi|229277467|gb|EEN48262.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
          Length = 487

 Score =  279 bits (713), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 139/318 (43%), Positives = 194/318 (61%), Gaps = 14/318 (4%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+FI+ADDLGWNDVG+H  D + TP +D LA+ G+IL   Y   +CTPSR+A MTG +
Sbjct: 29  PHILFIVADDLGWNDVGWHNPD-VKTPVLDKLAHEGVILNQSYVNYVCTPSRTAFMTGYY 87

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P H G QH V    +  G+P +   LP+ LK+LGY T +VGKWHLGF   +YTPT+RGF+
Sbjct: 88  PYHAGSQHLVFLPQQAQGIPYNFTFLPEKLKDLGYATHMVGKWHLGFCNWKYTPTYRGFD 147

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           S  GY+   +DY+ H        GLD+R D E     +G+Y T  FT   VDII  H  D
Sbjct: 148 SFYGYYNADEDYYTHVVAG----GLDLRDDKEVVNTKNGQYGTYFFTDRMVDIIEKHPAD 203

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLYL     H     EPL+ P+ + NI+ ++++  R K   ++  LDE+VG V  A+
Sbjct: 204 TPLFLYLPFQNVH-----EPLEVPERFENIYMNVQNENRRKLLGMVSALDEAVGNVTMAM 258

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           +   +  N++++F +DNG    G+ + + +N+PLRG K TLWEGG RG   +   +L+  
Sbjct: 259 KNAGLWDNTLVIFTTDNG----GWIIASGNNYPLRGGKVTLWEGGTRGVAFVHGKMLQKT 314

Query: 360 GIVAEQYVHVSDWLPTLL 377
           G    + +H  DW PT+L
Sbjct: 315 GYTNNEMIHAVDWFPTIL 332


>gi|443696989|gb|ELT97571.1| hypothetical protein CAPTEDRAFT_178894 [Capitella teleta]
          Length = 503

 Score =  278 bits (712), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 146/347 (42%), Positives = 215/347 (61%), Gaps = 23/347 (6%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L  TLS     ++ +S  P+I+FILADD G++DVG+HG   I TPNID LA+ G+ L+NY
Sbjct: 16  LCITLSQ---PVLGASKQPNILFILADDYGYHDVGYHG-SAIRTPNIDRLAFEGVRLENY 71

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y   +CTP+RS +++G++ IHTG+QH++++  +   LP     L   L+E GY   IVGK
Sbjct: 72  YVQPICTPTRSQLLSGRYQIHTGLQHSIIWAAQPNALPKELPTLADKLREEGYANHIVGK 131

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS-----AEEMKMWGLDMRRDLEPAWDL 216
           WHLGFYK+EY PT RGF+S  GY TG + Y++ +          + GLD R +     + 
Sbjct: 132 WHLGFYKEEYVPTNRGFDSFYGYLTGSEFYYNKTYCLAQINRSDVCGLDFRENDRSINE- 190

Query: 217 HGKYSTDVFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
             +YST ++      ++ +H++   D+PLFLYLA  + H      PL+ P  Y   ++HI
Sbjct: 191 -SEYSTHLYAERTKQLVADHTSAHPDQPLFLYLALQSVHG-----PLEVPAQYRTPYKHI 244

Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
           +D  R  +A ++  +DE+V  V +A ++  +  N+I+VF +DNGG  A       +NWPL
Sbjct: 245 KDENRQIYAGMVSCMDEAVKNVTDAFKKYGLWDNTILVFSTDNGGQVA----EGGNNWPL 300

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           RG K +LW+GG+RG G + S LL+ +G VA Q +HVSDW PTLL+ A
Sbjct: 301 RGWKGSLWDGGMRGVGFVHSTLLKQKGAVAHQLMHVSDWFPTLLNQA 347



 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 44/169 (26%), Positives = 65/169 (38%), Gaps = 44/169 (26%)

Query: 561 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNID--------------DEWQIS 606
           T  N++  N+   ++DG D W+ +S      R  ILHNID              D    +
Sbjct: 342 TLLNQAGGNASSVKLDGFDQWAAISDGAEGPRTEILHNIDPLNGKRGKRTEEVFDNTVRA 401

Query: 607 ALTKGKWKLVKVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKE 666
           AL   KWKL            LTG P         +  W+    R+L         P K 
Sbjct: 402 ALRMNKWKL------------LTGEP--------GNGSWVPPPEREL---------PTKL 432

Query: 667 VPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
            P   +    LFDI+ DP E +N+A  +  + +     ++  +N  A P
Sbjct: 433 DPSSKKQNIWLFDIEADPYEISNVAAENP-EVVKTMLLKLAAYNATAVP 480


>gi|291239534|ref|XP_002739678.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 648

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 191/565 (33%), Positives = 280/565 (49%), Gaps = 66/565 (11%)

Query: 39  VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           V+ L  +L       VAS+ PPHI+F LADDLGW+DVG+H    I TPNID LA  G+ L
Sbjct: 7   VIILLISLISFVSSTVASTKPPHIVFFLADDLGWHDVGYHD-SIIRTPNIDKLAAEGVKL 65

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
           +NYY   +CTP+R+ +MTG++ IHT MQH VL   E+  LP  E ++PQ LKE GY T +
Sbjct: 66  ENYYVTPICTPTRAVLMTGRYQIHTTMQHGVLMAQEQRCLPTDEVLMPQKLKESGYSTHM 125

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS----AEEMKMWGL-DMRRDLEPA 213
           VGKWHLGFYK + TP  RGF++  G++    +YF H+       +  W L D  + + P 
Sbjct: 126 VGKWHLGFYKWDCTPNHRGFDTFFGFYLAGGEYFTHTRKCHGHRLDAWDLRDGDKMVGPE 185

Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH-RH 272
           +   G+YST ++  +A ++I     + P+FLY++  A H+     PL+ PD Y + + + 
Sbjct: 186 YT--GEYSTMLYARKAQEVIRKQDPNVPMFLYVSFQAVHA-----PLEVPDSYADAYGKD 238

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
           I D  R  +A +   +DE++G V + L+ R M  N+++VF +DNGG A GF    ASN+P
Sbjct: 239 IYDQSRKLYAGMTSCMDEAIGNVTQTLKDRGMWDNTVVVFSTDNGG-ATGF---GASNYP 294

Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKSDIPNYVN 390
           LRG K   +EGGVRG   + SPLL+   RG    + ++V DWLPT +  A  +       
Sbjct: 295 LRGQKCGHYEGGVRGPAFVSSPLLKPHVRGTKNTELMYVGDWLPTFVHLAGGTTHGTKPL 354

Query: 391 STVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNP 450
             V       E    R +   H  +  R ++   + E      NP ++ R        + 
Sbjct: 355 DGVNQWEAISEGLPSRRKEILHNIDPIRAKSPTLKTEPWM--LNPYFDIRVMAAIRVGHW 412

Query: 451 KYENRYENGTHEYNGPK--NENTNPRYENGTHEYNI---PRLENSING------------ 493
           K     +   +    P+  N  TNP   N    Y+I   P   N I+             
Sbjct: 413 KLLTGQKGNANWQEPPELGNRLTNPYTNNILQLYHIKEDPEERNDISATRPDVVYRILLR 472

Query: 494 ----NGTS---ENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNID----------- 535
               N T+    +   + S  + +     WS  +   PS+R  ILHNID           
Sbjct: 473 LQALNSTAVLPTHVDPEFSLSDPVKHDMAWSPWAEGAPSRRTEILHNIDQLRGKTPVFLN 532

Query: 536 DEWQ---------ISALTRGKWKLV 551
           + W          ++A+  G WKL+
Sbjct: 533 EAWMNNPYFDIRVMAAIRVGHWKLL 557


>gi|198420473|ref|XP_002123848.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 517

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 142/352 (40%), Positives = 213/352 (60%), Gaps = 17/352 (4%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ---IPTPNIDAL 91
           M    + L F L ++ V   A    P+I+FILADD G+ND+G+H ++    + TP +D+L
Sbjct: 1   MYLLRVVLLFGLHVLCVP--AELTKPNIVFILADDYGFNDIGYHAVEHHSDMKTPFLDSL 58

Query: 92  AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
           A +G+ L+NYY   +C+PSRS +M+G++ IHTG+QH V+   +R GLPL   ILP+ L +
Sbjct: 59  AMAGVRLENYYIQPICSPSRSVLMSGRYQIHTGLQHYVISPQQRNGLPLDNIILPEQLHK 118

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
            GY T +VGKWHLGFYK EY P  RGF S+ GY TG +DY+     + K+ G DM  +  
Sbjct: 119 CGYDTHMVGKWHLGFYKDEYLPWKRGFNSYFGYLTGGEDYYTKWRCDGKLCGYDMTSEKG 178

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
           P    +G+YS ++F  +A + I  H   +PLFLY+A  + HS     P++ P+ Y     
Sbjct: 179 PTNATYGQYSANLFANKANEAIDKHDKTKPLFLYVAFQSVHS-----PMEVPESYAKPFD 233

Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
           +I++  R  +  ++  +DE+V  + E L+   +  N+I+VF +DNGG      L+  +NW
Sbjct: 234 YIKNHNRKMYGGMVAAMDEAVKNITEHLQAAGLWDNTILVFSADNGGQT----LSGGNNW 289

Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRG---IVAEQYVHVSDWLPTLLSAA 380
           PLRG K TLWEGG++G G +   +L       IV  + +H+SDW PT++ A 
Sbjct: 290 PLRGRKLTLWEGGIKGVGFVHGKILNVPNPNYIVNNEMIHISDWFPTIMEAT 341


>gi|115644393|ref|XP_781330.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 588

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 146/331 (44%), Positives = 203/331 (61%), Gaps = 17/331 (5%)

Query: 59  PPHIIFILADDLGWNDVGFH---GLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           PPH+I I+ADDLG+NDVG+H   G   I TPNID +AYSG+ L+NYY   +CTP+RS ++
Sbjct: 93  PPHVIMIIADDLGYNDVGYHAKYGRSMIRTPNIDEMAYSGVRLENYYVQPVCTPTRSQLI 152

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           TG++ IHTGMQH  L+      LPL E  L Q LK+ GY T  VGKWHLG+  K+  P+ 
Sbjct: 153 TGRYQIHTGMQHLNLFPGRPCCLPLDETTLAQALKKQGYSTHAVGKWHLGYAWKDCLPSR 212

Query: 176 RGFESHLGYWTGHQDYFDHSAEEM----KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
           RGFES  G   G  D++ H+   +     + G  M  +    W   G +ST ++T  A  
Sbjct: 213 RGFESFFGNIMGSADHWSHNKTALFGDKLVMGKSMYYNERIYWKHEGTFSTTLYTNRARQ 272

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFAAILHKLDE 290
           +I     ++PLFLYL++ A H+     PL  P+ Y   +   I + KR ++A +++ LDE
Sbjct: 273 LIRKQPRNKPLFLYLSYEAVHT-----PLNVPEQYAKPYEGIIHNSKRRRYAGLVNILDE 327

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
           +V  V EAL+   +  NS+I+F +DNGG     ++   +NWPLRG K+TLWEGG+RG G 
Sbjct: 328 AVRNVTEALKYNGLYDNSVIIFTTDNGGRPKPRSV--GNNWPLRGGKSTLWEGGIRGVGF 385

Query: 351 IWSPLL--ESRGIVAEQYVHVSDWLPTLLSA 379
           + SPL+  E RG V  Q +HVSDW PT++  
Sbjct: 386 VHSPLIPWELRGTVNRQLIHVSDWFPTIVXG 416


>gi|291222022|ref|XP_002731018.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 1410

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 136/321 (42%), Positives = 196/321 (61%), Gaps = 16/321 (4%)

Query: 62  IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
           I+FI+ DD+GWNDVG+HG   I TP++D LA  G  L+NYY   LC+PSR   +TGKH I
Sbjct: 29  IVFIMIDDVGWNDVGYHG-SSISTPHMDTLAKEGTKLENYYVAHLCSPSRGMFLTGKHMI 87

Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
           H GM+  ++   ER  LP++E  + Q LK   Y T  +GKWH+G+YKK   P  RGF++ 
Sbjct: 88  HLGMEGGIIMPFERKCLPVNEATIAQELKLKNYSTHAIGKWHVGYYKKACLPNNRGFDTF 147

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEP 241
            G   G  D++ H  +    W L  R ++  A +  G YSTD++  EA ++I NH   +P
Sbjct: 148 FGIIGGCADHYTH--KNTHWWEL-YRNNISIAQEYQGHYSTDLYAREATNVIRNHDASKP 204

Query: 242 LFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQ 301
           +F+YLA  A H      PLQAP  Y++++ +IED  R  +AA++  +D+++G VV+ L +
Sbjct: 205 MFMYLAFQAAHL-----PLQAPRKYIDMYSNIEDPDRRVYAAMVTCMDDAIGNVVDQLRE 259

Query: 302 RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES--R 359
             + +N++++  +DNGGA    N     NWPL+G K +LWEGGVRG   + SP L    R
Sbjct: 260 AGLWNNTVLIVSTDNGGAKISGN-----NWPLKGSKASLWEGGVRGVAFVTSPFLADHVR 314

Query: 360 GIVAEQYVHVSDWLPTLLSAA 380
           G    Q +HV+DW PT+L  A
Sbjct: 315 GTSNHQLMHVTDWFPTMLHVA 335


>gi|443704600|gb|ELU01579.1| hypothetical protein CAPTEDRAFT_176799 [Capitella teleta]
          Length = 476

 Score =  276 bits (706), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 140/325 (43%), Positives = 207/325 (63%), Gaps = 15/325 (4%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PP+I+FI++DD GW+D+G+HG  +I TP +D LAY+GI L+NYY   +C+P+RS  M+G 
Sbjct: 7   PPNILFIMSDDYGWHDIGYHG-SKIRTPVLDDLAYNGIRLENYYVQPICSPTRSQFMSGV 65

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QHNV++  +  GLPL    +   ++E GY T + GKWHLG+YK+EY P  RGF
Sbjct: 66  YQIHTGLQHNVIWPAQANGLPLEFPTIADKMREAGYATHMAGKWHLGYYKEEYLPHNRGF 125

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH-- 236
           +++ GY  G +DY+D S       G D R + +  W+L   YST ++ +   +I+ NH  
Sbjct: 126 DTYYGYLNGCEDYYDKSYCHPYC-GYDFRLNDDIQWNL-TDYSTYLYVSRVNEILLNHKI 183

Query: 237 -STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
            S D+PLFLYL   + H     EPL+ P  Y + + HI+D  R  +A ++  +DE+VGK+
Sbjct: 184 YSPDKPLFLYLPLQSVH-----EPLEVPKEYSDKYSHIKDNNRRTYAGMVAAMDEAVGKI 238

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
            +  ++  M   +++ F +DNGG          +NWPLRG K +LWEGG+RG G + SPL
Sbjct: 239 RDLFKKYEMWDETVLAFSTDNGGQIH----RGGNNWPLRGWKISLWEGGMRGVGFVHSPL 294

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
           L + G  ++  +HVSDWLPTL+  A
Sbjct: 295 LPNSGGKSKGLMHVSDWLPTLVHVA 319


>gi|348520018|ref|XP_003447526.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
          Length = 732

 Score =  274 bits (701), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 139/323 (43%), Positives = 194/323 (60%), Gaps = 12/323 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+FIL DD G+ND+G+H    I TP +D LA  G+ L+NYY   +CTPSRS ++TG++
Sbjct: 49  PHIVFILTDDQGFNDIGYHN-PTIKTPTLDKLAAEGVKLENYYVQPICTPSRSQLITGRY 107

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QH+++   +   LP     LP+ L+E GY T +VGKWHLGFY+K   PT +GF+
Sbjct: 108 QIHTGLQHSIIRPRQPSCLPSHMDTLPERLREAGYTTHMVGKWHLGFYRKACLPTRKGFD 167

Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           +  G  TG  DY+ + S +   + G D+  D   AW   GKYST +FT  A  I+ +H  
Sbjct: 168 TFFGSLTGSVDYYSYESCDGKDLCGYDLHDDEGVAWGQEGKYSTTLFTQRARKILESHDP 227

Query: 239 DE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
            E PLFL L+  A H+     PL  P+ Y+   R + +  R  FA ++  +DE+V  V  
Sbjct: 228 AERPLFLLLSFQAVHT-----PLNPPESYIYPFRDMVNVARRNFAGMVSTVDEAVRNVTY 282

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL +     NS+I++ +DNG           +NWPLRG K T WEGGVRG G + SPLL 
Sbjct: 283 ALRKYGYYRNSVIIYSTDNGAQP----FTGGNNWPLRGRKGTYWEGGVRGVGFVHSPLLR 338

Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
            R  V++  +H++DW PTL+  A
Sbjct: 339 RRRRVSKALMHITDWFPTLVGLA 361


>gi|327265410|ref|XP_003217501.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Anolis
           carolinensis]
          Length = 580

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 138/321 (42%), Positives = 205/321 (63%), Gaps = 11/321 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           H+IFIL DD G++D G+HG D I TP +D LA  G+ L+NYY   +CT SRS ++TG++ 
Sbjct: 59  HLIFILTDDQGFHDXGYHGSD-IXTPTLDRLAAEGVKLENYYIRPICTLSRSQLITGRYQ 117

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LP ++  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 118 IHTGLQHSIIRPQQPNCLPFNQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 177

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW+  GKYST ++      I+  H+  
Sbjct: 178 FLGSLTGNVDYYTYDNCDGPGVCGYDLHDGENVAWEQSGKYSTFLYAQRVNKILAAHNPK 237

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           EP+F+Y+A  A H+     PLQ+P  Y+  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 238 EPIFIYIAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITWAL 292

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++     NS+IVF +DNGG       +  SNWPLRG K T WEGGVRG G + SPL++ +
Sbjct: 293 KKYGYYDNSVIVFSTDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIKHK 348

Query: 360 GIVAEQYVHVSDWLPTLLSAA 380
             ++   +H++DW PTL++ A
Sbjct: 349 RRISRALIHITDWYPTLVTLA 369


>gi|291226838|ref|XP_002733395.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 498

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 146/359 (40%), Positives = 222/359 (61%), Gaps = 20/359 (5%)

Query: 30  YRTR---IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTP 86
           Y TR   ++AFA++ LA ++S+V   + + S  PH++FI+ADDLGWND+G++    I TP
Sbjct: 4   YNTRSQSMLAFALVVLA-SMSLV---ICSGSRKPHVVFIVADDLGWNDIGYNN-PSIFTP 58

Query: 87  NIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILP 146
            +D LA  G+IL   Y + +CTP R+++M+G +    G+QH VL   E  GLPL+  ++P
Sbjct: 59  TLDKLAREGVILNQSYVLPMCTPDRASLMSGYYAYRVGLQHKVLDHAEPAGLPLNFTLIP 118

Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDM 206
           Q +KE GY T ++GKWHLGF K EYTPT+RGF+   G++   +DYF+H+  +     LD+
Sbjct: 119 QRMKEHGYTTYMLGKWHLGFCKWEYTPTYRGFDHFYGFYNAAEDYFNHTTSKY----LDL 174

Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
           R   E  W  +G YST ++  +A + I  H+   P+++YL   + H       ++AP  Y
Sbjct: 175 RNGKEVDWSKNGTYSTYMYAEKATEYIATHNKSTPMYMYLPFQSVHGV-----IEAPQKY 229

Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN 326
           L+++  I D  R   +A++  +D++V  V+ ALE+  +  N+++VF +DN         N
Sbjct: 230 LDMYTFIHDTNRRIKSAMVTAMDDAVKVVIAALERYELWDNTLLVFTTDN--GGPANPNN 287

Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS-AANKSD 384
           A +NWPLRG K TLWEGG RG G +   +LE +G V  + +HV+DW PTLL  A  K+D
Sbjct: 288 AGNNWPLRGSKLTLWEGGTRGVGFVHGKMLEKKGYVNNEMMHVTDWYPTLLHIAGGKAD 346


>gi|410924964|ref|XP_003975951.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
          Length = 574

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 138/324 (42%), Positives = 199/324 (61%), Gaps = 12/324 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+FIL DD G+ND+G+H    I TP +D LA  G+ L+NYY   +CTPSRS +MTG++
Sbjct: 46  PHIVFILIDDQGFNDIGYHN-PTIKTPTLDKLAAEGVRLENYYVQPICTPSRSQLMTGRY 104

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QH+++   +   LP     LP+ L++ GY T +VGKWHLGFY+K   PT +GF+
Sbjct: 105 QIHTGLQHSIIRPSQPSCLPSHMDTLPERLRQAGYSTHLVGKWHLGFYRKACLPTRKGFD 164

Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS- 237
           +  G  TG  D++++ S +   + G D+      AW   GKYST +FT  A  I+ +H+ 
Sbjct: 165 TFFGSLTGSVDHYNYLSCDGPGVCGYDLHDGEGVAWGQEGKYSTTLFTQRARKILESHNP 224

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
           T++PLFL L+  A H+     PLQ P  Y+  +R + +  R K AA++  +DE+V  V  
Sbjct: 225 TEKPLFLLLSLQAVHT-----PLQTPKSYIYPYRDMANIARRKLAAMVSTVDEAVRNVTY 279

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL +     NS+I++ +DNG           SNWPLRG K T WEGG+RG   + SPLL+
Sbjct: 280 ALRKYGFYRNSVIIYSTDNGAQP----FTGGSNWPLRGRKGTYWEGGIRGVAFVHSPLLK 335

Query: 358 SRGIVAEQYVHVSDWLPTLLSAAN 381
            R  V++  +H++DW PTL+  A 
Sbjct: 336 RRRRVSKALLHITDWFPTLVGLAG 359



 Score = 46.6 bits (109), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 47/153 (30%), Positives = 60/153 (39%), Gaps = 31/153 (20%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID---------DEWQIS---ALTKGKWKLVKVVKVM 622
           +DG DVW  +S    S R  ILHNID           W  S   A+  G WKL+      
Sbjct: 368 LDGFDVWPTISEGRDSPRQEILHNIDPLHKHVTSESLWDTSIQAAIRVGDWKLL-----T 422

Query: 623 RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKN 682
            Y       P QV L  L  R W         D       P K    +      LF+I  
Sbjct: 423 GYPGHGDWVPPQV-LPTLPGRWW---------DLERNILSPYKNSTYK---NIWLFNITA 469

Query: 683 DPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
           DPCE+++LAD+  D  +      +  +NQ A P
Sbjct: 470 DPCERHDLADQRPDV-VLQLLARLAYYNQTAVP 501


>gi|291232045|ref|XP_002735970.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 500

 Score =  273 bits (699), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 152/340 (44%), Positives = 201/340 (59%), Gaps = 14/340 (4%)

Query: 44  FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
           F L +V  +++    PPHIIFI+ DDLGWNDVG++      TP ID LA SG+ L NYY 
Sbjct: 7   FLLLLVSGNVLGREKPPHIIFIVVDDLGWNDVGYNN-PVFKTPTIDRLAGSGVKLLNYYV 65

Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
              C PSR+ +MTG+H I  G+QH+  YG     LPL+E  + + LK +GY T IVGKWH
Sbjct: 66  ASHCLPSRNMLMTGRHAIQLGIQHDD-YGFHPRSLPLNETTIAEPLKHVGYSTHIVGKWH 124

Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD 223
            GFY     P  RGF++  G+     D++ HS     M  L    D   A    GKYST 
Sbjct: 125 CGFYSDNCLPHNRGFDTFFGFVGAGIDHYTHSDHFNHMHNLRKNDDC-IAKKYIGKYSTT 183

Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFA 282
           +F  E  DII+ H  ++PLFLYL+ +A H+     PL+ P  YL  +   I D  R  +A
Sbjct: 184 IFANEGKDIINAHDQNKPLFLYLSFSAVHA-----PLEVPSSYLKQYESTIHDEDRRTYA 238

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
           A+   +DE++  + EALE++ ML NSIIVF SDNGGA      +A +NWPLRG K+T WE
Sbjct: 239 AMTSCVDEAIANITEALEEKNMLKNSIIVFTSDNGGALVN---SAGNNWPLRGGKHTSWE 295

Query: 343 GGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
           GGVR  G ++  LL   SRG      +H++DW PTL++ A
Sbjct: 296 GGVRAIGFVYGDLLPRHSRGTENTGLMHITDWFPTLITLA 335


>gi|292620475|ref|XP_002664306.1| PREDICTED: arylsulfatase I-like [Danio rerio]
          Length = 558

 Score =  273 bits (698), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 141/324 (43%), Positives = 200/324 (61%), Gaps = 12/324 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHIIFI+ DD G+ND+G+H  D I TP +D LA +G+ L+NYY   +CTPSRS  +TG++
Sbjct: 38  PHIIFIMTDDQGFNDIGYHNTD-IHTPTLDRLAAAGVKLENYYIQPICTPSRSQFITGRY 96

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QH+++   +   LP   + LPQ L+E GY T +VGKWHLGFYK++  PT RGF 
Sbjct: 97  QIHTGLQHSIIRSRQPSCLPFGLRTLPQRLQEAGYATHMVGKWHLGFYKRDCLPTRRGFN 156

Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS- 237
           ++ G  TG  DY+ + S +  K+ G D+      AW   G+YST ++T     I+  H  
Sbjct: 157 TYFGSLTGSVDYYTYKSCDGPKVCGFDLHDGERVAWGQGGRYSTHLYTQRVRKILAAHDP 216

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
           + +PLF++L+  A H+     PLQ+P  Y+  +R + +  R K+A ++  +DE+V  V  
Sbjct: 217 SSQPLFIFLSFQAVHT-----PLQSPKEYVYPYRRMGNMFRRKYAGMVSAVDEAVRNVTY 271

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL +     N++I F +DNGG      L   SNWPLRG K T WEGGVRG G + SPLL 
Sbjct: 272 ALRKYGYYKNTVIFFSTDNGGQP----LFGGSNWPLRGRKGTYWEGGVRGIGFVHSPLLR 327

Query: 358 SRGIVAEQYVHVSDWLPTLLSAAN 381
            R  V+   +H++DW PTL+  A 
Sbjct: 328 RRRRVSRALIHITDWYPTLMRLAG 351


>gi|432911274|ref|XP_004078601.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
          Length = 572

 Score =  273 bits (698), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 143/333 (42%), Positives = 198/333 (59%), Gaps = 14/333 (4%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHI+FIL DD G+ND+G+H    I TP +D LA  G+ L+NYY   +CTPSRS ++TG+
Sbjct: 43  PPHIVFILIDDQGFNDIGYHN-PTIKTPTLDKLAAEGVKLENYYVQPICTPSRSQLLTGR 101

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH+++   +   LP     LP+ L+  GY T +VGKWHLGFY+K   PT +GF
Sbjct: 102 YQIHTGLQHSIIRSRQPSCLPRHMDTLPETLRRAGYSTHMVGKWHLGFYRKSCLPTRKGF 161

Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           ++  G  TG  DY+ + S     + G D+  +   AW   GKYST +FT  A  I+ +H 
Sbjct: 162 DTFFGSLTGSVDYYSYGSCNGPGVCGYDLHDNERVAWGHEGKYSTTLFTQRAHKILKSHD 221

Query: 238 -TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
             D PLFL L+  A H      PLQ P  ++ ++R + +  R KFAA++  +DE+V  V 
Sbjct: 222 PADRPLFLLLSLQAVHG-----PLQPPKSFVYLYRDMANVDRRKFAAMVSTVDEAVRNVT 276

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
            AL +     NS+I++ +DNG           SNWPLRG K T WEGGVRG   + SPLL
Sbjct: 277 YALRKYGYYKNSVIIYSTDNGAQP----YVGGSNWPLRGRKGTYWEGGVRGVAFVHSPLL 332

Query: 357 ESRGIVAEQYVHVSDWLPTL--LSAANKSDIPN 387
           + R  V+   +H++DW PTL  L+  N S  P 
Sbjct: 333 KHRRRVSTALLHITDWFPTLVGLAGGNISQSPG 365


>gi|198428954|ref|XP_002125106.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 562

 Score =  273 bits (698), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 137/355 (38%), Positives = 212/355 (59%), Gaps = 18/355 (5%)

Query: 36  AFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ---IPTPNIDALA 92
           A  +L   F LS V          PH++F+LADD G+ND+G+H  +    + TP +D+LA
Sbjct: 10  AVRLLAFVFWLSAVACGKKQRGQRPHVVFVLADDFGFNDIGYHAREHYSDMYTPFLDSLA 69

Query: 93  YSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKEL 152
             G+IL+NYY   +C+P+R  ++TG++ IHTG+ H ++   +  GLPL   +L Q L++ 
Sbjct: 70  AKGVILENYYVQPICSPTRGQLLTGRYQIHTGLAHGIIRAAQPYGLPLDNILLSQQLRQC 129

Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK----MWGLDM-R 207
           GY+T +VGKWHLGF+++EY P  RGF++  G+  G  ++F     E K      G DM  
Sbjct: 130 GYKTNMVGKWHLGFFREEYLPWNRGFQNFFGFLNGGVNHFTRYHCEPKKTRRFCGYDMID 189

Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
               P    +G+YST++F  ++ ++I  H+  +P+FLYL+  A H      PLQ P+ YL
Sbjct: 190 SRYGPTNATYGEYSTNLFIRKSKEMIDKHNKQKPMFLYLSLQAVHG-----PLQVPNQYL 244

Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
              +HI D  R  +A +++ +D  + ++V+ L++ RM  N+I +F +DNGG         
Sbjct: 245 KRFKHIRDKNRRIYAGMVYAMDRGIRQLVKHLKRARMWKNTIFIFSTDNGGQTT----RG 300

Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            +NWPLRG K TLWEGG+RG G +   PL  +   V ++ +HVSDW PT++SA +
Sbjct: 301 GNNWPLRGKKGTLWEGGIRGVGFVHGKPLQVTTPRVNKELLHVSDWYPTIMSATH 355



 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 53/219 (24%), Positives = 84/219 (38%), Gaps = 62/219 (28%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID---------------DEWQISALTKGKWKLVKVV 619
           IDG D W  L  N+ SKR  ILHNID               D    +A+  G WKL    
Sbjct: 365 IDGYDQWETLRSNKTSKRTEILHNIDPLHKKLTESEFREGFDVSVRAAIRSGDWKL---- 420

Query: 620 KVMRYQVDLTGGPDQVYLSGLSDREWLA--------LAMRKLRDAASIQCGPVKEVPCEP 671
                   LTG       +G++  EW+         + + ++RD   +     K V    
Sbjct: 421 --------LTG------YAGMN--EWVVPPECTHGNITIGQIRDDFHVYPNSTKNVR--- 461

Query: 672 QIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKK- 730
                LF I  DP E+  ++D+  +  +N     +  +N  + P      +KK   K+  
Sbjct: 462 -----LFKITEDPHEEREVSDQYPEV-VNLLLARLKEYNTSSLPVDYPHNDKKSDPKRHG 515

Query: 731 ---------KKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
                     +K K+KK +  + +K K    +   +KL+
Sbjct: 516 GFWLPWRGVGRKYKRKKFRPHRARKPKLFRRKMKFKKLQ 554


>gi|328705055|ref|XP_001946210.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 470

 Score =  273 bits (697), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 133/264 (50%), Positives = 167/264 (63%), Gaps = 2/264 (0%)

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
            P+    QHNV+   E  GLPL+E +LP++L +LGY +  VGKWHLG++KK YTPT+RGF
Sbjct: 4   QPLLARTQHNVILEPEPWGLPLNEILLPEHLNKLGYTSHAVGKWHLGYFKKAYTPTYRGF 63

Query: 179 ESHLGYWTGHQDYFDH--SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           +S  G+W G+QDY+ H   A      G DMRRDL P W   GKYST +FT EA DII  H
Sbjct: 64  KSFYGFWNGYQDYYTHMVQATFASFEGFDMRRDLNPDWSSVGKYSTHLFTKEATDIITKH 123

Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
           +   PLFLYLAH A H+     PLQAP   +N  + I+D  R K+A ++  LD SV +V 
Sbjct: 124 NNSVPLFLYLAHLAPHAGTYENPLQAPQEDINSFQSIKDKYRRKYAGMMKNLDNSVAEVF 183

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
            AL    ML N+I VFVSDNG    G + N  SNWPL+G K T WEGG+R A  +WS LL
Sbjct: 184 NALHTNNMLDNTIFVFVSDNGAPTNGIHRNYGSNWPLKGEKATPWEGGIRTAAFVWSKLL 243

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA 380
             +  +    +H+SDWLPTL  AA
Sbjct: 244 SKKKTLPNPLMHISDWLPTLYQAA 267



 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 49/163 (30%), Positives = 77/163 (47%), Gaps = 24/163 (14%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVK---------------VV 619
           IDGI +W   S N  S R  ILHNIDD    +A+    +K +K                 
Sbjct: 277 IDGISMWESFSNNIKSPRKQILHNIDDITGYAAIRDTNFKYIKGSTFLGYLDYWSGSLSP 336

Query: 620 KVMRYQVD-LTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEV-PCEPQIAPCL 677
             +RY VD +        LS ++        +  LR++++++C   K++ PC+P I PCL
Sbjct: 337 STIRYNVDAILNSTTAGILSQINGDSLTPSLIESLRNSSTVRCDFEKDMKPCKPFIKPCL 396

Query: 678 FDIKNDPCEKNNL-------ADRSEDQRINHYTTEVGRFNQIA 713
           F I  DPCE  NL         +  + +I+++ T +G+F + A
Sbjct: 397 FHIVKDPCELVNLNYKPNSKMRKFVEAKIDYFETSLGKFRESA 439



 Score = 46.6 bits (109), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 39/66 (59%), Gaps = 8/66 (12%)

Query: 756 MRKLRDAASIQCGPVKEV-PCEPQIAPCLFDIKNDPCEKNNLADRSEVQ-------RINH 807
           +  LR++++++C   K++ PC+P I PCLF I  DPCE  NL  +   +       +I++
Sbjct: 368 IESLRNSSTVRCDFEKDMKPCKPFIKPCLFHIVKDPCELVNLNYKPNSKMRKFVEAKIDY 427

Query: 808 YTTEVG 813
           + T +G
Sbjct: 428 FETSLG 433


>gi|291220870|ref|XP_002730451.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 519

 Score =  273 bits (697), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 147/353 (41%), Positives = 216/353 (61%), Gaps = 21/353 (5%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           M FA+L +     +  +  V S   P+IIFIL+DD GWND+G+H    I +PNI+AL   
Sbjct: 1   MIFALLEILLCFHITVI--VGSQ--PNIIFILSDDHGWNDIGYHS-HIIRSPNINALCND 55

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           G+ L+NYY    CTPSRS +M+G++ IHTG+QH+V+   +   LPL E  L + LKE+GY
Sbjct: 56  GVRLENYYIQPGCTPSRSQLMSGRYQIHTGLQHSVIRNDQPNCLPLDEVTLAEKLKEVGY 115

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR-DLEPA 213
            T +VGKWHLGFY     PT RGF+S  GY  G +DY+ H  +     G D++R + + +
Sbjct: 116 ATHLVGKWHLGFYTPSCLPTRRGFDSFFGYLIGQEDYYKHIHDG----GYDLKRHETDVS 171

Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH- 272
               G Y+T VFT+EA +II +H    PLFLY+++ + H+      LQ P+HY N++   
Sbjct: 172 KQYQGDYTTHVFTSEAQNIIKSHDPSTPLFLYMSYQSVHANY----LQVPEHYSNMYNGV 227

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
           I+D  R   A ++  +DE +G + + L++  + +N+II+F SDNGG       +  +NWP
Sbjct: 228 IDDEDRRIVAGMVTCMDEGIGNITQTLKETELWNNTIIIFSSDNGGDPN----DGGNNWP 283

Query: 333 LRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAANKS 383
           LRG K T WEG + G G ++S  +  + RG V  + +HVSDW PT+++ A  S
Sbjct: 284 LRGEKGTHWEGAIHGLGFVYSSDIAKDVRGTVNTEMIHVSDWFPTIVNLAGGS 336


>gi|260832084|ref|XP_002610988.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
 gi|229296357|gb|EEN66998.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
          Length = 494

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 139/349 (39%), Positives = 204/349 (58%), Gaps = 18/349 (5%)

Query: 46  LSMVFVDLVASSGP-------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           LS++F+       P       PHI+FI+ADDLGWNDVG+H  D + TP +D LA+ G+IL
Sbjct: 2   LSLLFLATSGLQQPGGNDAAKPHIVFIVADDLGWNDVGWHNPD-VRTPVLDQLAHEGVIL 60

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
              Y   +CTPSR+A MTG  P H G QH V    +   +  +   LP+ LK LGY T +
Sbjct: 61  NQSYVNYVCTPSRTAFMTGYFPYHVGSQHLVFRPDQPSAILSNFTFLPEKLKSLGYATHM 120

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLH 217
           VGKWHLGF   ++TPTFRGF+S+ GY++G +DYF H  +      G+D   + +   D +
Sbjct: 121 VGKWHLGFCNWKFTPTFRGFDSYYGYYSGAEDYFTHFRSIRNGTGGIDFHDNKDVVTDQN 180

Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
           G YS  +F+  A DI++ H  + PLFLYL     H+     PL+ P  + +++ +++D  
Sbjct: 181 GTYSAYLFSQRAADIVNKHDPNTPLFLYLPFQNVHA-----PLEVPKRFEDMYANVQDEN 235

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           R K   ++  LDE+VG V  A+++  +  N++++F +DNGG      + + +N+PLRG K
Sbjct: 236 RRKLLGMVSALDEAVGNVTMAMKKTGLWDNTLLIFSTDNGGMV----IQSGNNYPLRGGK 291

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
            TLWEGG RG       +L+  G    + +H  DW PTLL+AA  +  P
Sbjct: 292 TTLWEGGTRGVAFAHGKMLQKTGYTNNEMIHAVDWFPTLLAAAGGTADP 340


>gi|291236588|ref|XP_002738221.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 504

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 150/351 (42%), Positives = 212/351 (60%), Gaps = 26/351 (7%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           I   +VL + F+ S + +D + SS   HIIFI+ DDLGWNDVG+H L  I TP ID LA 
Sbjct: 3   ISHISVLLVVFSSSCLLIDGL-SSDQAHIIFIVVDDLGWNDVGYHNL-YIKTPTIDRLAN 60

Query: 94  SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
           +G+ L NYY   LC PSR+ +M+G+H       H V+ G  RG LPL+E  +   LKE G
Sbjct: 61  NGVKLNNYYAANLCVPSRNMLMSGRH------VHGVIMGYPRG-LPLNETTIANKLKEAG 113

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR-DLEP 212
           Y T +VGKW+ GFY KE+ P  RGF++  G+    +D++ H   ++     D+RR DL  
Sbjct: 114 YSTHLVGKWNCGFYSKEFLPHNRGFDTFFGFVDSKEDHYTHMVHDIS----DLRRNDLCV 169

Query: 213 AWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR- 271
           A   +G YST ++  E   II NH T++PLFL+++ +A H     EPLQ P  Y   +  
Sbjct: 170 ADKYYGNYSTIMYGNEGTTIIDNHDTNKPLFLFMSFSAVH-----EPLQVPSVYEKEYIP 224

Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
            I+D  R  +A ++  +DE++  ++E LE + ML NSIIVF SDNGG      L + +NW
Sbjct: 225 TIDDTDRRIYAGMVSCVDEAIANIIETLENKDMLKNSIIVFTSDNGGDP----LYSGNNW 280

Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
           PLRG K + WEGGVR  G ++S +L   +RG    + +H++DW PTL+  +
Sbjct: 281 PLRGWKASNWEGGVRALGFVYSEILPTNARGTDNNELMHITDWFPTLVDIS 331


>gi|405975641|gb|EKC40195.1| Arylsulfatase B [Crassostrea gigas]
          Length = 684

 Score =  270 bits (690), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 136/321 (42%), Positives = 193/321 (60%), Gaps = 14/321 (4%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+FI+ADDLGWNDVGFH    + TPNID LA  G+IL   Y   LC+PSR A+MTG +
Sbjct: 226 PHILFIVADDLGWNDVGFHN-PAMKTPNIDKLAREGLILNQTYLQPLCSPSRHALMTGYY 284

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P H G+QH V+   +    PL  K LPQ LK++GY T +VGKWHLGF     TPT+RGF+
Sbjct: 285 PYHAGLQHLVILPWQPVCSPLKMKFLPQRLKDIGYATHMVGKWHLGFCSWNCTPTYRGFD 344

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           S  GY+    D++ H+        LD R + +P  +L+G YST  F + A DII  H++ 
Sbjct: 345 SFFGYYNAQGDHYSHTWYNY----LDYRDNEKPVKNLNGTYSTFTFVSRAQDIIKKHNSS 400

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           +PLFLY+A    H     +P+Q P  Y +++ +I+   R +F+ ++  LDE++G + + L
Sbjct: 401 QPLFLYMAFQNVH-----DPIQVPKQYEDMYPNIKTRGRRQFSGMVSALDEAIGNITDQL 455

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
            +  M  +++I+F SDNGG    +     +N+PLRG K T++EGG R    +    L+  
Sbjct: 456 RESGMYDDTLIIFTSDNGGWPKYY----GNNYPLRGGKFTVYEGGTRVVSFVHGAGLQKT 511

Query: 360 GIVAEQYVHVSDWLPTLLSAA 380
           G   E  +H  DW  TL+ AA
Sbjct: 512 GEAYESIMHAVDWSDTLVEAA 532


>gi|390364061|ref|XP_792027.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 524

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 139/331 (41%), Positives = 201/331 (60%), Gaps = 24/331 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQ---IPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
           P+++FILADD G+NDVG+HG      I TPN+D LA  G+ L+NYY   +C+P+RS +M+
Sbjct: 29  PNVVFILADDYGFNDVGYHGRSHGSAILTPNLDMLAGEGVKLENYYVQPICSPTRSQLMS 88

Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           G++ IHTG+QH V+   +   LPL E  LPQ LKE GY T +VGKWH+GFY     PT R
Sbjct: 89  GRYQIHTGLQHGVIRPPQPNCLPLDEVTLPQKLKENGYATNMVGKWHIGFYLDACLPTER 148

Query: 177 GFESHLGYWTGHQDYFD------HSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           GF+S+  +    +D+F                      +  P +   G+YST +FT + +
Sbjct: 149 GFDSYFAW----EDHFSCLPXXXXXXXXXXXXXXXXXANKTPVFQYEGQYSTHLFTNKTI 204

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
           D+I  H   +PLF+YLA+ A H      PL+ PD Y++ + +I D  R  +A ++  +DE
Sbjct: 205 DVIERHDKTKPLFIYLAYQAVHF-----PLEVPDSYMDPYMNITDKNRRTYAGMVSCMDE 259

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
            VG V  AL++  +  N+II+F +DNGG       + A+NWPLRG K +LWEGG+ G G 
Sbjct: 260 GVGNVTRALKEAGLYDNTIIIFSTDNGGPIQ----SGANNWPLRGSKGSLWEGGIHGVGF 315

Query: 351 IWSPLLES--RGIVAEQYVHVSDWLPTLLSA 379
           + SPLL +  +G V  + +HVSDWLPT+++ 
Sbjct: 316 VHSPLLPTSVKGTVNHELIHVSDWLPTIVAG 346


>gi|313232584|emb|CBY19254.1| unnamed protein product [Oikopleura dioica]
          Length = 506

 Score =  266 bits (680), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 130/327 (39%), Positives = 199/327 (60%), Gaps = 18/327 (5%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           VA +  P+++ ++ADDLGWND+  H    + TPN+D L    + L++YY   +CTP+RS 
Sbjct: 13  VAIAKKPNVLILIADDLGWNDISLHN-SYLSTPNVDGLIQESLHLQSYYVNPICTPTRSV 71

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           +M+G++ IHTG+QH V+ G +  GLPL++ + P+  K+ GYRT +VGKWHLGFY ++  P
Sbjct: 72  LMSGRYQIHTGLQHAVILGAQPNGLPLTDPVQPEIFKDCGYRTHMVGKWHLGFYDEKMVP 131

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
             RGFESH GY  G + +++HS       G+D R          G+YS D+F     D++
Sbjct: 132 ENRGFESHYGYLIGAEGHYNHSQFMQGQNGVDFRDGGASTNSSWGQYSADLFAKRVEDLV 191

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
             H  +E L++Y+     H      PL+AP HY++    I+D  R  +AA+   +D++VG
Sbjct: 192 EAHDVEESLYMYVGLQNVHY-----PLEAPQHYVDQFSWIKDRDRRVYAAMTKSMDDTVG 246

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
           +++ + + + +  ++I+ F +DNGG+     L   +NWPLRG+KNTLWEGG+RG G I  
Sbjct: 247 RIINSFKNKGIWEDTIVYFTTDNGGST----LYGGNNWPLRGLKNTLWEGGIRGIGSIKI 302

Query: 354 PLLESRGIVA---EQYVHVSDWLPTLL 377
           P     G+ A   +Q +HV D  PTLL
Sbjct: 303 P-----GVSADKRDQLMHVVDMFPTLL 324


>gi|313233524|emb|CBY09696.1| unnamed protein product [Oikopleura dioica]
          Length = 609

 Score =  266 bits (680), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 135/326 (41%), Positives = 201/326 (61%), Gaps = 26/326 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PH +FILADD+G++D+G+H   +I TP +D LA +G+ L+ YY   +CTP+R  +MTG++
Sbjct: 98  PHFLFILADDMGYHDIGYHQA-EILTPFMDKLATTGVRLEQYYVQPVCTPTRVQLMTGRY 156

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            I  GMQH V+   +  G+PL EK+LP+ L++ GY T ++GKWHLG + ++Y P  RGF+
Sbjct: 157 QIRYGMQHGVVRPPQPDGVPLDEKLLPEALRKCGYNTEMIGKWHLGMFTEDYLPQNRGFD 216

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL--EPA---WDLHGKYSTDVFTAEAVDIIH 234
             +G++TG QD++ H+     M G D R     +P    WDL+  YST VF  E    + 
Sbjct: 217 HFMGFYTGSQDFYSHNKCFSGMCGYDFREATAGQPEVIRWDLNNTYSTGVFADELEKRLS 276

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
             +  EP F YL+  A HS     PLQAP  +++ +R++++ KR  + A++  +D ++ K
Sbjct: 277 KMNPSEPSFTYLSFQAVHS-----PLQAPRKFIDQYRNLKNRKRRNYNAMVTAMDTAIYK 331

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           VV+A ++ +   ++I++F SDNGG   G     ASNWPLRG K +L+EGGVR  G + SP
Sbjct: 332 VVKAYKKFKFWDDTILIFSSDNGGNLKG----GASNWPLRGAKGSLFEGGVRTIGFVHSP 387

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
           LL            V+DW PTLL  A
Sbjct: 388 LLP-----------VTDWFPTLLQLA 402


>gi|167515556|ref|XP_001742119.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778743|gb|EDQ92357.1| predicted protein [Monosiga brevicollis MX1]
          Length = 339

 Score =  266 bits (679), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 141/318 (44%), Positives = 195/318 (61%), Gaps = 11/318 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+FI+ADDLGWNDV  HG  QIPTP+IDA+A+SG+ L NY+   +CTP+RS  ++G+H
Sbjct: 33  PNIVFIVADDLGWNDVSLHGSPQIPTPHIDAIAHSGVHLTNYHVQPVCTPTRSTFLSGRH 92

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+      G     L LS  +LP YLK+LGYRT  VGKWHLG   ++  PT RGF+
Sbjct: 93  VIHTGIYMPFAQGTAL-RLNLSYTLLPAYLKKLGYRTAAVGKWHLGQNVEKALPTGRGFD 151

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            +LGYW+G +DY+ H        G D +   E A   +  YST +F   AV+ I     +
Sbjct: 152 EYLGYWSGAEDYYTHDTHG----GYDFQDGTECAIKYNNTYSTYIFAERAVNTILEADPE 207

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           +PLFLY A    H      PL+AP  Y+    HI + +R   AA+   LD++VG + +AL
Sbjct: 208 QPLFLYTAFQNVH-----WPLEAPAEYVARFSHIPNSERQYVAAMTSILDDAVGNITDAL 262

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++ R+  N+I++F SDNGG     N   ++N+PLRG KNTLW GG +  G+I    +E+ 
Sbjct: 263 KRSRIADNTILIFTSDNGGPVHDEN-TESNNYPLRGGKNTLWNGGTQVVGMIAGKGIENP 321

Query: 360 GIVAEQYVHVSDWLPTLL 377
           G      +H SDWLP+L+
Sbjct: 322 GTDCHGMMHASDWLPSLV 339


>gi|410956991|ref|XP_003985119.1| PREDICTED: arylsulfatase J [Felis catus]
          Length = 621

 Score =  265 bits (678), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 134/295 (45%), Positives = 184/295 (62%), Gaps = 11/295 (3%)

Query: 88  IDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQ 147
           +D LA  G+ L+NYY   +CTPSRS  +TGK+ IHTG+QH+++   +   LPL    LPQ
Sbjct: 80  LDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQ 139

Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDM 206
            LKE+GY T +VGKWHLGFY+KE  PT RGF++  G   G  DY+ H   +   M G D+
Sbjct: 140 KLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDL 199

Query: 207 RRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
             +   AWD  +G YST ++T     I+ +H   +P+FLY+A+ A HS     PLQAP  
Sbjct: 200 YENDNAAWDYDNGLYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPGR 254

Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
           Y   +R I +  R ++AA+L  LDE++  V  AL+     +NSII++ SDNGG       
Sbjct: 255 YFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTA--- 311

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
              SNWPLRG K T WEGG+R  G + SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 312 -GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA 365


>gi|405975640|gb|EKC40194.1| Arylsulfatase B [Crassostrea gigas]
          Length = 484

 Score =  265 bits (676), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 141/346 (40%), Positives = 206/346 (59%), Gaps = 21/346 (6%)

Query: 42  LAFTLSMVFVDLVA-------SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           +  TL ++F  L+A        +  PHI+FI+ADDLGWNDVG+H    + TPNID LA  
Sbjct: 1   MGVTLLVLFGGLLALLQRKQVEAKSPHILFIVADDLGWNDVGYHN-PAMKTPNIDKLARE 59

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           G+IL   Y   LC+PSR A MTG +P   G+QH V+   +    PL+ K LPQ LK+LGY
Sbjct: 60  GLILNQTYFQPLCSPSRHAFMTGYYPYRAGLQHLVIMPWQPVCSPLNMKFLPQRLKDLGY 119

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
            T +VGKWHLG    + TPT+RGF+S  G++    DY+ H + +     LD R + +P  
Sbjct: 120 ATHMVGKWHLGMCNWDCTPTYRGFDSFFGFYHAKADYYSHISYKY----LDYRDNEKPVK 175

Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
           +L+G YST  FT+ A DII  H++ +PLFLY+A        P+EPLQ P  Y +++ +  
Sbjct: 176 NLNGTYSTFTFTSRAQDIIKKHNSSQPLFLYMAFPI-----PHEPLQVPQQYEDMYPNTM 230

Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLR 334
              R +++ ++  LDE++G + + L +  M  +++IVF SDNGG+     ++  +N+PLR
Sbjct: 231 SKGRRQYSGMVSALDEAIGNITDQLRESGMYDDTLIVFTSDNGGSP----MDYGNNYPLR 286

Query: 335 GVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           G K T++EGG R    +    L+  G   E  +H  DW  TL++AA
Sbjct: 287 GAKFTVYEGGTRVVSFVHGAGLQKTGEAYESMMHAVDWSDTLVAAA 332


>gi|291236973|ref|XP_002738412.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 843

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 143/332 (43%), Positives = 196/332 (59%), Gaps = 14/332 (4%)

Query: 52  DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
           +++    PPHIIFI+ DDLGWNDVG++      TP ID LA SG+ L NYY    C PSR
Sbjct: 358 NVLGQEKPPHIIFIVVDDLGWNDVGYNN-PVFKTPTIDRLAGSGVKLLNYYVASHCLPSR 416

Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
           + +MTG+H I  G+Q +  +G     LPL E  + Q LK++GY T I+GKWH GFY    
Sbjct: 417 NMLMTGRHAIQLGIQRHG-FGYHPRSLPLDETTIAQPLKQVGYSTHIIGKWHCGFYSDNC 475

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
            P  RGF++  G+     +++ HS     M  L    D   A    GKYST ++  E  +
Sbjct: 476 LPHNRGFDTFFGFVGAGIEHYTHSDHFNHMHNLRKNDDC-IAKQYIGKYSTTIYANEGKN 534

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFAAILHKLDE 290
           II+ H  ++P FLYL+ +A H+     PL+ P  YL  +   I D  R  +AA+   +DE
Sbjct: 535 IINAHDQNKPFFLYLSFSAVHT-----PLEVPSSYLKQYESTIYDEDRRTYAAMTSCVDE 589

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
           ++  + EALE++ ML NSIIVF SDNGGA      +A SNWPLRG K+T+WEGGV+  G 
Sbjct: 590 AIANITEALEEKDMLKNSIIVFTSDNGGAV---ERSAGSNWPLRGGKHTIWEGGVKAVGF 646

Query: 351 IWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
           ++  LL   SRG      +H++DW PTL++ A
Sbjct: 647 VYGELLPHHSRGTENTGLMHITDWFPTLVTLA 678


>gi|405977794|gb|EKC42228.1| Arylsulfatase I [Crassostrea gigas]
          Length = 545

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 135/345 (39%), Positives = 203/345 (58%), Gaps = 21/345 (6%)

Query: 43  AFTLSMVFVDLVASSG-----PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
            F + ++ +  + SS      PPHI++I+ADDLGWNDVG+H  D I TPN+D +A  G+I
Sbjct: 6   TFGVVLIGLCFLVSSAEGAKRPPHILYIVADDLGWNDVGWHNPD-IKTPNLDRMAGGGVI 64

Query: 98  LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
           L + Y   +CTPSR++ +TG +P   G+    +   +   + L    LP+ LK+LGY T 
Sbjct: 65  LNSSYVHPICTPSRNSFLTGVYPFRVGLSGTAITPHQARFMSLKTPTLPEKLKKLGYSTH 124

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH 217
           ++GKWHLGF  + YTPT RGF+S LG++TG QDY+ H+  +    G D R +    +   
Sbjct: 125 MIGKWHLGFCNERYTPTRRGFDSFLGFYTGTQDYYKHTTAK----GYDFRFNQTVFYPPK 180

Query: 218 GKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
            +YST  +   AVDII  H   + PLFLYLA  + H+     PLQ P  Y   + +I++ 
Sbjct: 181 KQYSTKTYAKRAVDIITEHKRKKNPLFLYLAFQSVHT-----PLQVPKKYEKQYNNIKNK 235

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
            R  ++ ++  +D+++G V+ A  + + + N I+VF +DNGGAA        +N+PLRG 
Sbjct: 236 DRRVYSGMVTAMDDAIGAVMAAFRKHKFMGNLIVVFTTDNGGAAH----IVGNNFPLRGS 291

Query: 337 KNTLWEGGVRGAGLIWSP-LLESRGIVAEQYVHVSDWLPTLLSAA 380
           K T+WEGG R    ++S  LL+  G   +   H +DW PT+L+ A
Sbjct: 292 KTTVWEGGTRAVSFVYSKNLLKKTGYTHDGLFHATDWFPTILAVA 336



 Score = 40.0 bits (92), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 15/48 (31%), Positives = 30/48 (62%)

Query: 507 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKEN 554
           + ++DG++ W ++   +PS+R   ++ ID    +SA+  GK+KL+  N
Sbjct: 343 EKQLDGVNQWPMIRSGKPSRRTEFVYEIDKSRPVSAIRMGKYKLIVGN 390



 Score = 39.7 bits (91), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 14/45 (31%), Positives = 29/45 (64%)

Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
           + ++DG++ W ++   +PS+R   ++ ID    +SA+  GK+KL+
Sbjct: 343 EKQLDGVNQWPMIRSGKPSRRTEFVYEIDKSRPVSAIRMGKYKLI 387


>gi|405964467|gb|EKC29949.1| Arylsulfatase B [Crassostrea gigas]
          Length = 482

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 134/343 (39%), Positives = 205/343 (59%), Gaps = 17/343 (4%)

Query: 43  AFTLSMVFVD--LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
           +F    +FV   L  S+ PPHI+FI+ADDLGWNDVGF   D + TPNID LA SG+IL +
Sbjct: 6   SFITCSIFVHFLLCVSAKPPHILFIVADDLGWNDVGFRNPDVL-TPNIDKLARSGMILNS 64

Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
            Y + +CTPSR++ MTG +   +G+QH  +   +    PL+   LPQ LKELGY T  +G
Sbjct: 65  SYVMPVCTPSRNSFMTGHYAFKSGLQHLAINPQQATCAPLNYTFLPQKLKELGYATHAIG 124

Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
           KWHLGF K E TPT+RGF++  G++ G +DY+  S       G D R +  P  +  G+Y
Sbjct: 125 KWHLGFCKWECTPTYRGFDTFFGFYNGQEDYYTLSVAG----GKDFRDNKVPV-NATGEY 179

Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
           ST +++  A  II  H   +P+++YL   + H+     PLQ P+ Y +++ ++    R  
Sbjct: 180 STFLYSRRAESIIKEHDASKPIYMYLPFQSVHA-----PLQVPNRYTDMYTNVHTKSRRT 234

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
           +  ++  +DE++G + +AL+ + + ++++I+F +DNGGA       A +N+PLRG K TL
Sbjct: 235 YMGMVTAMDEAIGNITKALKTKGIFNDTLIIFTTDNGGAVK----FAGNNYPLRGGKATL 290

Query: 341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           +EGG R    +    ++         +H  DW+PT+LSAA  +
Sbjct: 291 FEGGTRATAFVTGAGIQKFNYEYSGLIHAVDWMPTVLSAAGGT 333


>gi|428202415|ref|YP_007081004.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
 gi|427979847|gb|AFY77447.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
          Length = 538

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 148/350 (42%), Positives = 209/350 (59%), Gaps = 12/350 (3%)

Query: 34  IMAFAVLPLAFTLSMVFVD--LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           I+ F  + +A   S  FV   + A++  P+I+ I+ADDLGWNDVG+H   +I TPN+D L
Sbjct: 15  IITFISVAIAAIYSNFFVSDGVRAAAQKPNIVIIMADDLGWNDVGYHN-SEIKTPNLDKL 73

Query: 92  AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
           A S   L  +Y    CTP+R+A+MTG+HP   GM   V++  ++ GLPL EK + Q LKE
Sbjct: 74  AESSTRLDRFYVTSSCTPTRAALMTGRHPSRYGMSSGVIWPWDKVGLPLEEKTIAQTLKE 133

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
            GY T IVGKWHLG YK+EY PT RGF+ H G++ G  DYF H  +     GLD  R+ +
Sbjct: 134 AGYYTAIVGKWHLGHYKEEYLPTRRGFDYHYGHYCGSIDYFTHQLDAGIQGGLDWHRNEQ 193

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
           P  +    Y+TD+   EAV +I +   ++ PLFLY++  A     P+ PLQA +  +  +
Sbjct: 194 PVEE--EGYATDLLAQEAVKLIRDCDYNKSPLFLYVSFNA-----PHAPLQAKEKDIKNY 246

Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
            +I+D  R  FAA +  +DE+VG++V AL+++++  N+++ F SDNGG +         N
Sbjct: 247 ANIQDEGRRIFAAQVQSMDEAVGRIVAALKEKQVWDNTLLFFTSDNGGGSDQ-PWTRGDN 305

Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            PLRG K TL++GGVR   +I  P     G V EQ   V D  PTL   A
Sbjct: 306 RPLRGQKGTLYDGGVRVPTIISFPAQLKGGQVIEQVFSVVDLYPTLAKLA 355


>gi|291235057|ref|XP_002737462.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
           partial [Saccoglossus kowalevskii]
          Length = 355

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 148/342 (43%), Positives = 198/342 (57%), Gaps = 14/342 (4%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           ++F L +V  + V    PPHIIFI+ DDLGWNDVG++      TP ID LA +G+ L NY
Sbjct: 5   VSFLLLLVSGNAVGQEKPPHIIFIVVDDLGWNDVGYNN-PVFKTPTIDRLAGNGVKLLNY 63

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y    C PSR+ +MTG+H I  G+  +  +G     LPL E  + + LK  GY T IVGK
Sbjct: 64  YVASHCLPSRNMLMTGRHAIQLGIPQDG-FGYHPRSLPLDETTIAEPLKHAGYSTHIVGK 122

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           WH G+Y     P  RGF++  G+     D++ HS     M  L    D   A    GKYS
Sbjct: 123 WHCGYYADNCLPHNRGFDTFFGFVGAGIDHYTHSDHFNHMHNLRKNDDCI-AKKYIGKYS 181

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSK 280
           T +F  E  DII+ H  ++PLFLYL+ +A H+     PL+ P  YL  +   I D  R  
Sbjct: 182 TTIFANEGKDIINAHDQNKPLFLYLSFSAVHA-----PLEVPSSYLKQYESTIHDEDRRT 236

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
           +AA+   +DE++  + EALE++ ML NSIIVF SDNGGA      +A +NWPLRG K+T 
Sbjct: 237 YAAMTSCVDEAIANITEALEEKNMLKNSIIVFTSDNGGALVN---SAGNNWPLRGGKHTS 293

Query: 341 WEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
           WEGGVR  G ++  LL   SRG      +H++DW PTL++ A
Sbjct: 294 WEGGVRAVGFVYGDLLPRHSRGTENTGLMHITDWFPTLITLA 335


>gi|332016484|gb|EGI57377.1| Arylsulfatase B [Acromyrmex echinatior]
          Length = 438

 Score =  262 bits (669), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 131/291 (45%), Positives = 195/291 (67%), Gaps = 15/291 (5%)

Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
           +CTPSR A ++G+HP+ TGMQ   L   E  GL L++ +LP+YL++LGY T ++GKWH+G
Sbjct: 3   VCTPSRVAFLSGRHPLRTGMQGYPLKAAEPRGLHLNDTLLPEYLRKLGYTTHLLGKWHVG 62

Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR------DLEPAWDLHGK 219
           +  + Y PT RGF++ LGY+ G   YF+H+ EE +  G D+ R       +E  +D    
Sbjct: 63  YLTRNYVPTRRGFDTFLGYFNGVIQYFNHTIEENEQVGYDLHRIVGDNHTVEYRYD---- 118

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD--HYLNIHRHIEDFK 277
           Y T++ T EA +II +H+T++P++L LAH A+H++N  E ++  D         +I+D  
Sbjct: 119 YMTNLITEEAENIISSHNTEKPMYLQLAHLASHASNAEEAMEVYDWEETNATLGYIQDVN 178

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG-FNLNAASNWPLRGV 336
           R KFA+++  LD+S+G+VV+AL ++ ML N+II+F+SDNG    G    N  SN+PLRG+
Sbjct: 179 RRKFASVVTTLDKSIGRVVDALNKKDMLKNTIILFISDNGAQTEGPLYQNFGSNYPLRGL 238

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDI 385
           K +L+EGGVRGA  I+SPL+E+   V+ Q  H++DWLPTL SAA  N SD+
Sbjct: 239 KFSLYEGGVRGAACIYSPLIENSSKVSTQLFHITDWLPTLYSAAGGNSSDL 289



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/144 (27%), Positives = 70/144 (48%), Gaps = 19/144 (13%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM----------- 622
           ++DGID WS +   E SKR +++ NI+D    SA   G++KL++                
Sbjct: 291 QLDGIDQWSTIKTQEASKRKSVVINIED-VDYSAALMGRYKLIQDKSDFQKHYTNYKGND 349

Query: 623 ----RYQV-DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCL 677
               +Y V ++   P    ++ +S+    A  + +LR  A++ C  +    CE +   CL
Sbjct: 350 PSYPKYNVTNILESPAASAIANISNHIVTANKIEELRKEATVVCENLMTSDCESR--TCL 407

Query: 678 FDIKNDPCEKNNLADRSEDQRINH 701
           FDI  DPCE  +++ +  + R+N 
Sbjct: 408 FDIYEDPCEVTDISSQYPEVRLNQ 431


>gi|291238558|ref|XP_002739195.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 495

 Score =  262 bits (669), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 149/344 (43%), Positives = 199/344 (57%), Gaps = 21/344 (6%)

Query: 47  SMVFVDLVASSG-------PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           ++VF+ L+ +SG       PPHIIFI+ DDLGWNDVG++      TP ID LA +G+ L 
Sbjct: 3   TVVFLLLLMASGNVLGHEKPPHIIFIVVDDLGWNDVGYNN-PVFKTPTIDRLAGNGVKLL 61

Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
           NYY    C PSR+ +MTG+H I  G+  N  +G     LPL E  + + LK  GY   IV
Sbjct: 62  NYYVASHCLPSRNMLMTGRHAIQLGIP-NDGFGYHPRSLPLDETTIAEPLKHAGYSNHIV 120

Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
           GKWH G+Y     P  RGF++  G+     D++ HS     M  L    D   A    GK
Sbjct: 121 GKWHCGYYADNCLPHNRGFDTFFGFVGAGIDHYTHSDHFNHMHNLRKNDDC-IAKKYIGK 179

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKR 278
           YST +F  E  DII+ H  ++PLFLYL+ +A H+     PL+ P  YL  +   I D  R
Sbjct: 180 YSTTIFANEGKDIINAHDQNKPLFLYLSFSAVHA-----PLEVPSSYLKQYESTIHDEDR 234

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
             +AA+   +DE++  + EALE++ ML NSIIVF SDNGGA      +A +NWPLRG K+
Sbjct: 235 RTYAAMTSCVDEAIANITEALEEKNMLKNSIIVFTSDNGGALVN---SAGNNWPLRGGKH 291

Query: 339 TLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
           T WEGGVR  G ++  LL   SRG      +H++DW PTL++ A
Sbjct: 292 TSWEGGVRAVGFVYGDLLPRHSRGTENTGLMHITDWFPTLITLA 335


>gi|405964464|gb|EKC29946.1| Arylsulfatase B [Crassostrea gigas]
          Length = 482

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 132/346 (38%), Positives = 203/346 (58%), Gaps = 16/346 (4%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
             F++ + F+  V S+ PPHI+FI+ADD GWNDVGF   D + TPNID LA+SG++L + 
Sbjct: 8   FVFSIFIHFLPYV-SAKPPHILFIVADDYGWNDVGFRNPDVL-TPNIDKLAHSGMVLNSS 65

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y + +CTPSR++ MTG +   +G+QH  +   +    PL+   LPQ LKELGY T  +GK
Sbjct: 66  YVMPVCTPSRNSFMTGHYAFKSGLQHLAILPKQAACAPLNYTFLPQKLKELGYATHAIGK 125

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           WHLGF K E TPT+RGF++  G++ G +DY+  S       G D R +  P  +  G+YS
Sbjct: 126 WHLGFCKWECTPTYRGFDTFFGFYNGQEDYYTLSVAG----GKDFRDNRTPV-NATGEYS 180

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
           T ++   A  II  H   +P+++YL   + H     EPLQ P  Y +++  +    R  +
Sbjct: 181 TFLYARRAESIIKEHDASKPMYMYLPFQSVH-----EPLQVPQKYSDMYSKVHTESRRTY 235

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
             ++  +DE++G + +AL+ + M ++++I+F +DNGG        A +N+PLRG K T++
Sbjct: 236 LGMVTAMDEAIGNITKALKTKGMFNDTLIIFTADNGGWIT----FAGNNYPLRGGKATVF 291

Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           EGG R    +    ++    V    +H  DW+PT+L AA  +  P 
Sbjct: 292 EGGTRATAFVSGAGIQKSNYVYPGMIHAVDWMPTVLRAAGGTPDPT 337


>gi|291236278|ref|XP_002738066.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 508

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 140/341 (41%), Positives = 202/341 (59%), Gaps = 23/341 (6%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           VA S  P+I+F+LAD+LGW+D+G+HG  ++ TP++D LA  GI L+NYY   +CTP+RS 
Sbjct: 17  VARSSQPNIVFVLADELGWHDIGYHG-SRVQTPHLDKLASEGIKLENYYVQPMCTPTRSQ 75

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           +M+G++ IHTG+QH V+   +R  LPL E  + Q LKE GY T +VGKWHLG Y K   P
Sbjct: 76  LMSGRYQIHTGLQHTVINPDQRSCLPLDEVTIAQKLKEAGYSTHMVGKWHLGHYTKGCLP 135

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEE---------MKMWGLDMRRDLEP--AWDLHGKYST 222
           T RGF+S  G++    DY+ +   +         ++M G D+ R+ E   A    G Y T
Sbjct: 136 TKRGFDSFFGFYNCAVDYYTYEKGKFCKFENETVLRMRGTDLWRNDEEHVAPYYQGHYQT 195

Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
            V   EA D+I  H+  +PLFLYLA  A H      PL+ P  Y +++  ++D  R    
Sbjct: 196 HVLAKEAEDVIRKHNPSKPLFLYLAFGAVHV-----PLEVPKVYEDMYADVKDNSRRILL 250

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
           A+   +D+S+  V  AL+   +  N+I +F +DNG A A        +WPL+G K +L+E
Sbjct: 251 AMATCMDDSIADVTGALKDTGLWDNTIFIFSTDNGAATA----YGGCSWPLKGGKTSLFE 306

Query: 343 GGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAAN 381
           GG+R  G + SPLL  + +G    + +HVSDW PTL+  A 
Sbjct: 307 GGIRAVGFVTSPLLPPDMQGTENMELMHVSDWFPTLVHVAG 347


>gi|291241933|ref|XP_002740864.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 496

 Score =  259 bits (661), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 141/347 (40%), Positives = 201/347 (57%), Gaps = 22/347 (6%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           M+ A + + F+   +F      SG PHI+F + DDLGWNDVG+H    I TP ID LA S
Sbjct: 3   MSVANVVIIFS---IFSSAEPYSGQPHIVFFVVDDLGWNDVGYHN-SYIKTPTIDMLAKS 58

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           G+ L NYY    C PSR+ +++G+H I  G+QH  + G    GLPL E  +   LKE+GY
Sbjct: 59  GVRLNNYYVASHCVPSRNMLISGRHVIDIGLQHGEI-GYYPRGLPLDEFTIADKLKEIGY 117

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE-PA 213
            T ++GKWH G Y     P  RGF++  GY     D++ H      +   D+R + E   
Sbjct: 118 ATHLIGKWHCGCYSNHSLPHNRGFDTFFGYLGSSDDHYTHIIMSNGL--ADLRLNDECVG 175

Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH- 272
           +   G YST ++  EA +II  H  ++PLFL LA +A H     +P+Q PD Y+  ++  
Sbjct: 176 YKYFGDYSTIMYANEAKNIIAQHDENKPLFLMLAFSAVH-----KPIQVPDVYMQYYKST 230

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
           I D +R  +A +   +DE++  + EALE + ML NS+IV  +DNGG + G      +NWP
Sbjct: 231 IHDNERRTYAGMASCVDEAIANITEALENKGMLKNSVIVLTTDNGGGSVG------NNWP 284

Query: 333 LRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLL 377
           LRG KN+ WEGGVRG   ++S LL  + RG    + +H++DW PTL+
Sbjct: 285 LRGRKNSHWEGGVRGVAFVYSDLLPMDVRGTENSELMHITDWFPTLV 331


>gi|323449751|gb|EGB05637.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 533

 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 138/315 (43%), Positives = 188/315 (59%), Gaps = 13/315 (4%)

Query: 62  IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
           I+F++ADDLG+NDVGFHG  QIPTP +DALA  G+ L NY+T  +C+PSR+++++G+H I
Sbjct: 58  ILFLVADDLGFNDVGFHGSKQIPTPRLDALAADGVDLLNYHTHPVCSPSRASMLSGRHAI 117

Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
           H G+      G     L L  ++LP+ L+ LGY T  VGKWHLG   +   PT RGF+S 
Sbjct: 118 HHGIYMPFAQGTAY-HLSLEYELLPEALRRLGYETHAVGKWHLGQNTRAALPTGRGFDSF 176

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEP 241
           LGYW+G +DYF H          D   +   AW   G YS   FT  AVD++ + ST  P
Sbjct: 177 LGYWSGAEDYFAHDCAG----AYDFANNETTAWAYDGVYSAYSFTDRAVDVVASAST--P 230

Query: 242 LFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQ 301
            FLY+A    H      PL+ P  YL    H+    R   + ++  LDE+VG V  AL+ 
Sbjct: 231 YFLYVAWQNVH-----WPLEVPARYLEPFGHVPAGPRRNVSGMVSALDEAVGNVTGALKA 285

Query: 302 RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGI 361
           RR   ++++VFVSDNGG   G    A++NWPLRG KNTL+EGG R  GL+ +P + ++  
Sbjct: 286 RRAYDDAVVVFVSDNGGPTNGDEGTASNNWPLRGGKNTLYEGGTRVVGLLKAPGV-AKSS 344

Query: 362 VAEQYVHVSDWLPTL 376
            +   +H  DW PTL
Sbjct: 345 ASRAKLHAVDWFPTL 359



 Score = 45.8 bits (107), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 52/127 (40%), Gaps = 28/127 (22%)

Query: 576 DGIDVWSVLSRNEPSKRNTILH----NIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
           DG+D + VL+R  P +   +L       +D     AL  G WK +++  V          
Sbjct: 378 DGVDSYDVLARGAPGRDELLLEAHAPGSNDTVHGEALIVGDWKFLRIGTVR--------- 428

Query: 632 PDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQI-------APCLFDIKNDP 684
           PD       S+  W     ++    + + CG   E P +          APCLFD+  DP
Sbjct: 429 PD-------SEAGWTRPPGQEAHPRSVVDCGAAPE-PADANATRRQCVAAPCLFDVAADP 480

Query: 685 CEKNNLA 691
           CE+ + A
Sbjct: 481 CERVDRA 487


>gi|443321855|ref|ZP_21050894.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
 gi|442788399|gb|ELR98093.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
          Length = 469

 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 148/358 (41%), Positives = 211/358 (58%), Gaps = 20/358 (5%)

Query: 33  RIMAFAVLPLAFTLSMVFV-DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           +I+ F  L L  T       +L A    P+II ILADDLGWNDVGFHG  +I TPN+D L
Sbjct: 7   KIVIFICLTLIITYYFRGANELKAEINKPNIIVILADDLGWNDVGFHG-SEIKTPNLDKL 65

Query: 92  AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
           A SG+ L+ +Y   +CTP+R+A +TG+HP   GM   V+   ++ GLPL EK + + LKE
Sbjct: 66  AASGVRLERFYVKSVCTPTRAAFLTGRHPFRYGMSTGVIKPWDKVGLPLEEKTIAETLKE 125

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
            GY T I+GKWHLG Y++ + PT RGF+ H G++ G  DYF H+ + +    LD  R+  
Sbjct: 126 AGYYTAILGKWHLGHYQESFLPTSRGFDYHYGHYLGGIDYFTHNDDFLG--ALDWHRN-- 181

Query: 212 PAWDLHGK---YSTDVFTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYL 267
               +H K   Y+TD+   EAV +I+NH+ + +PLFLY+A  A H+     PL A    +
Sbjct: 182 ---RIHLKEEGYATDLIGQEAVKLINNHNYEQQPLFLYIAFNAPHT-----PLHAKTEDI 233

Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
             +  I+D KR  FAA +  +DE++GK++++LE +++  N+ + FVSDNGG+      N 
Sbjct: 234 EDYLTIDDEKRRVFAAQVQSMDEAIGKIIQSLENQQVCDNTFVFFVSDNGGSV--MRANR 291

Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
             N PLRG KN+L+EGGVR   ++  P   S      Q   + D  PTL   A    +
Sbjct: 292 GDNRPLRGGKNSLYEGGVRVPAIVSYPPKLSANQEINQIFSIVDLYPTLAKLAGVESV 349


>gi|291230656|ref|XP_002735281.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 522

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 150/373 (40%), Positives = 204/373 (54%), Gaps = 56/373 (15%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L+  L +      AS+  PHIIFILADDLGW+DVG+HG   + TP IDALA  G+ L+NY
Sbjct: 10  LSILLGVSLSLATASATRPHIIFILADDLGWDDVGYHG-SVMKTPYIDALAAEGVTLENY 68

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y   LCTPSRS ++TG++ IHTG+QH  +   +   LPL E  LPQ LKE GY T +VGK
Sbjct: 69  YMPSLCTPSRSVLLTGRYEIHTGLQHGTILMMQPLCLPLDEITLPQKLKEEGYDTHMVGK 128

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH--SAEEMKMWGLDMRRDLEPAWDLH-G 218
           WHLGFY+KE  P  RGF+S LG++    D+F H  SA      G D RR+ +   D + G
Sbjct: 129 WHLGFYRKECLPNNRGFDSFLGFYQAMGDHFYHNISASPGHFNGFDFRRNNDVVADQYAG 188

Query: 219 KYSTDVFTAEAVD--------------------------IIHNHSTDEPLFLYLAHAATH 252
           KYST +F    ++                          I  +++  +PLFLYL+  A H
Sbjct: 189 KYSTHIFXXXFINTQTLSFVCVNNVKGVYFRGYLSSFTPITSSYNPQQPLFLYLSFQAVH 248

Query: 253 SANPYEPLQAPDHYLNIHRHI--EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
           +     PLQ P  Y  ++  +   D  R  +A ++  +DE+VG V   L+Q  +  NS++
Sbjct: 249 T-----PLQVPSRYAELYNDLIPNDEDRRIYAGMVTCMDEAVGYVTTVLKQSALWENSVV 303

Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVH 368
           +F +                  LRG K TL+EGGVRG G + SPLL  + RG +  + +H
Sbjct: 304 IFSTX-----------------LRGGKGTLYEGGVRGVGFVTSPLLHEQVRGTINHELIH 346

Query: 369 VSDWLPTLLSAAN 381
           +SDW PTL   A 
Sbjct: 347 ISDWFPTLTRLAG 359


>gi|405964468|gb|EKC29950.1| Arylsulfatase B [Crassostrea gigas]
          Length = 483

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 137/354 (38%), Positives = 208/354 (58%), Gaps = 17/354 (4%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           M  +V    F + + F+  V S+ PPHI+FI+ADD GWNDVGF     + TPNID LA S
Sbjct: 1   METSVSVFVFCIFIHFLPYV-SAKPPHIVFIVADDYGWNDVGFRN-PSVLTPNIDKLARS 58

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           G+IL + Y + +CTPSR++ MTG++   +G+QH V+   +    PL+   LPQ LKELGY
Sbjct: 59  GMILNSSYVMPVCTPSRNSFMTGQYAFKSGLQHIVILPQQATCAPLNNTFLPQKLKELGY 118

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
            T  +GKWHLGF K E TPT+RGF++  GY+ G +DY++ S       G D R +  P  
Sbjct: 119 ATHAIGKWHLGFCKWECTPTYRGFDTFYGYYNGAEDYYNLSIAG----GKDFRDNRTPV- 173

Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP-DHYLNIHRHI 273
           +  G+YST ++   A  II +H   +P+++YL   + H     EPLQ   +   +I+  +
Sbjct: 174 NATGEYSTILYARRAESIIKDHDASKPMYMYLPFQSVH-----EPLQVSLESSSDIYSKV 228

Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
               R  +  ++  +DE++G + +AL+ + M ++++IVF +DNG    G+   A +N+PL
Sbjct: 229 HTESRRTYLGMVTAMDEAIGNITKALKTKGMFNDTLIVFTADNG----GWITYAGNNYPL 284

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           RG K T++EGG R    +    ++    V    +H  DW+PT+LSAA  +  P 
Sbjct: 285 RGGKTTVFEGGTRATAFVSGAGIQKSNYVYPGMIHAVDWMPTVLSAAGGTPDPT 338


>gi|402871949|ref|XP_003899908.1| PREDICTED: arylsulfatase B-like, partial [Papio anubis]
          Length = 316

 Score =  256 bits (654), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 124/263 (47%), Positives = 172/263 (65%), Gaps = 12/263 (4%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS ++
Sbjct: 42  ASRPPHLVFVLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT 
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160

Query: 176 RGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT  A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
             +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  +D
Sbjct: 221 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275

Query: 290 ESVGKVVEALEQRRMLSNSIIVF 312
           E+VG V  AL+   + +N++ +F
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIF 298



 Score = 41.6 bits (96), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 110 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 145


>gi|340384741|ref|XP_003390869.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 490

 Score =  255 bits (651), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 142/344 (41%), Positives = 197/344 (57%), Gaps = 23/344 (6%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
            +  +  LAF+       LV+S+  P+II  + DDLGWND  + G D I TPNID LA  
Sbjct: 4   FSLVLFLLAFS-----APLVSSANSPNIILFVIDDLGWNDTSYQGSD-IQTPNIDKLAEE 57

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           GI LK YY   LC+PSRSA++ GK+P H G+ H V+      GL L+E  +  +LK+ GY
Sbjct: 58  GIRLKQYYVQPLCSPSRSALLAGKYPYHLGLAHGVITNGHPYGLGLNETTIADHLKKGGY 117

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWG-LDMRRDLEPA 213
            T  VGKW LG +K E+TPT+RGF++  GY+   +DY+ H     K+ G LD R + +P 
Sbjct: 118 STHAVGKWDLGMHKWEFTPTYRGFDTFYGYYDADEDYYTH-----KVGGYLDFRNNTDPV 172

Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
            D  G YST +FT    D I N  +D P F+Y A+ + HS     PL+APD YL    H 
Sbjct: 173 KDEDGTYSTFLFTKAIEDAI-NAKSDSPFFIYGAYQSVHS-----PLEAPDTYLE-KCHS 225

Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
               R  F  ++  LDE +  +   L+ + +L ++II+  +DNGG  A      +SNWPL
Sbjct: 226 PYPNRKIFCGMMLALDEGISNITSLLQTKGLLDDTIIILTTDNGGQTA----LGSSNWPL 281

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
           RG K T++EGGVRG   +WS  L          +H++DW PT++
Sbjct: 282 RGNKATVFEGGVRGISFVWSTKLRKSNYDNNAMMHITDWYPTII 325



 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 35/125 (28%), Positives = 57/125 (45%), Gaps = 27/125 (21%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNI-----DDEWQI---SALTKGKWKLVKVVKVMRYQV 626
           +DG ++W  ++ + PS R  ILH +     D+  Q    +AL +G WKL+    + +   
Sbjct: 338 LDGFNMWDTINDDTPSPRTEILHQLDPPKYDERTQFIGQAALRQGDWKLI----IGQPNC 393

Query: 627 DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCE 686
               GP  VY   L    W+ L             G ++E    P +   LF++K+DP E
Sbjct: 394 SEKLGPHTVY--DLCPIGWIHLN------------GTIEEPEVNPSLT-WLFNVKDDPNE 438

Query: 687 KNNLA 691
           +N L+
Sbjct: 439 RNELS 443


>gi|443321854|ref|ZP_21050893.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
 gi|442788398|gb|ELR98092.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
          Length = 476

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 148/353 (41%), Positives = 205/353 (58%), Gaps = 17/353 (4%)

Query: 33  RIMAFAVLPLAFTLSMVFVD-LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           RI+ F  L L  T      + L A    P+II ILADDLGWNDVGFHG  +I T N+D L
Sbjct: 7   RIVIFICLTLTITYYFRGANNLKAEINKPNIIVILADDLGWNDVGFHG-SEIKTTNLDKL 65

Query: 92  AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
           A SG+ L+ +Y   +CTP+R+A +TG+HP   GM    +      GLPL EK + + LKE
Sbjct: 66  AVSGVRLERFYVKAMCTPTRAAFLTGRHPFRYGMSAINVTPWSETGLPLEEKTIAETLKE 125

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
            GY T I+GKWHLG Y++ Y PT RGF+ H G++    DYF H + +    GLD  R+  
Sbjct: 126 AGYYTAILGKWHLGHYQESYLPTSRGFDYHYGHYLAGIDYFTHKSGD----GLDWHRNNN 181

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
           P + + G YSTD+   +AV +I+NH   + PLFLY+A  A     P+ PLQA    L  +
Sbjct: 182 PVY-IEG-YSTDLIAQDAVQLINNHDYHKNPLFLYIAFNA-----PHIPLQAKAEDLEDY 234

Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL---NA 327
             IED +R  FAA +  +DE++G ++++L+ +++ +N+++VF SDNGG     ++     
Sbjct: 235 LTIEDEQRRLFAAQVQSMDEAIGSIIQSLQAKQVWNNTLLVFTSDNGGEIVANDIRFTGR 294

Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             N PLRG K  L+EGGVR   +I  P   S G   EQ   + D  PT    A
Sbjct: 295 GDNRPLRGGKRNLYEGGVRVPTIISYPGHLSSGKTVEQMFSIVDLYPTFAKLA 347


>gi|340373449|ref|XP_003385254.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 491

 Score =  253 bits (647), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 136/326 (41%), Positives = 188/326 (57%), Gaps = 16/326 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           V+S+  P+II  + DDLGWND  + G D I TPNID LA  GI LK YY   LC+PSRSA
Sbjct: 18  VSSANNPNIILFVIDDLGWNDTSYQGSD-IQTPNIDKLAEEGIRLKQYYVQPLCSPSRSA 76

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++ GK+P H G+ H V+      GL L+E  +  +LK+ GY T  VGKW LG +K E+TP
Sbjct: 77  LLAGKYPYHLGLAHGVITNGHPYGLGLNETTIADHLKKGGYSTHAVGKWDLGMHKWEFTP 136

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
           T+RGF++  GY+   +DY+ H         LD R + +P  D  G YST +FT    D I
Sbjct: 137 TYRGFDTFYGYYDADEDYYTHKVGGY----LDFRNNTDPVKDEDGTYSTFLFTKAIEDAI 192

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
            N  +D P F+Y A+ + H      PL+APD YLN   +I    R  F  ++  LDE + 
Sbjct: 193 -NAKSDSPFFIYGAYQSVHG-----PLEAPDIYLN-KCNIPYPNRKIFCGMMLALDEGIS 245

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
            +   L+ + +L ++II+  +DNGG  A      +SNWPLRG K T++EGGVRG   +WS
Sbjct: 246 NITSLLQTKGLLDDTIIILTTDNGGQTA----LGSSNWPLRGNKATVFEGGVRGISFVWS 301

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSA 379
             L          +H++DW PT++  
Sbjct: 302 TKLRKSNYDNNAMMHITDWYPTIIEG 327


>gi|430746415|ref|YP_007205544.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430018135|gb|AGA29849.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 474

 Score =  252 bits (643), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 144/352 (40%), Positives = 208/352 (59%), Gaps = 24/352 (6%)

Query: 37  FAVLPLAFTLSMVFVDLVAS------SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
           F  L  ++ L++V   +V +      S  P+++ ILADD+GW DVG+H   +I TP++D 
Sbjct: 8   FGCLSTSYVLALVIPSMVWAADQPLKSAKPNVVLILADDMGWGDVGWHD-SEIKTPHLDK 66

Query: 91  LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           LA SG  L+ +Y   +C+P+R+A+MTG++P+  G+Q  V+    + GLPL+E+ LPQ LK
Sbjct: 67  LAASGTRLEQFYVQPVCSPTRAALMTGRYPMRHGLQVGVVRPWAQYGLPLNERTLPQALK 126

Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
           E+GY T I GKWHLG ++ EY PT RGF+   G++ G  DYF H    ++  G D  RD 
Sbjct: 127 EVGYETAICGKWHLGHFQPEYLPTHRGFDHQYGHYNGALDYFTH----IRDGGFDWHRDD 182

Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
               D    YST +   EA  II +H T +PLFLY+   A H+     P Q P+ Y   +
Sbjct: 183 RVNRD--EGYSTHLIGREATRIIGHHDTSKPLFLYVPFNAVHA-----PHQVPEKYKEPY 235

Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
             +++ +R+ +A +L  +DE+VG++  A++ + M SN+I +F SDNGG A G      SN
Sbjct: 236 SKLKEPRRT-YAGMLAAMDEAVGQIASAIDGKGMRSNTIFLFSSDNGGPAPG---QVTSN 291

Query: 331 WPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            PLRG K TL+EGGVR    I W   L   G V    +H+ D  PTLL+ A 
Sbjct: 292 GPLRGQKGTLYEGGVRVPAFISWEGHLRP-GTVVNAPLHIVDLFPTLLTLAG 342


>gi|260788430|ref|XP_002589253.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
 gi|229274428|gb|EEN45264.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
          Length = 449

 Score =  251 bits (641), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 128/293 (43%), Positives = 181/293 (61%), Gaps = 16/293 (5%)

Query: 91  LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           LA  G+ L+NYY   +C+PSR  +MTG++ I  G+QH+V+      GLPL E  LPQ L+
Sbjct: 2   LASEGVKLENYYIQPICSPSRCQLMTGRYQIRYGLQHSVITSDRPHGLPLDEVTLPQKLR 61

Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH------SAEEMKMWGL 204
           E GYR+ IVGKWHLGF++KEY P  RGF+   GY TG +DY+ H      + +     GL
Sbjct: 62  ENGYRSYIVGKWHLGFFRKEYMPLQRGFDRFYGYLTGGEDYWTHRRPNGYARDPSAFHGL 121

Query: 205 DMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD 264
           D+R   +P  D +G YST +F  +A++ I +H   +P+FLYL   A HS     PL+AP 
Sbjct: 122 DLRDQDKPVLDQNGTYSTHLFAQKAIEFILSHERSKPMFLYLPFQAVHS-----PLEAPT 176

Query: 265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN 324
            Y+  ++ +       +AA++  +DE+VG V +AL++  +  N++++F +DNG       
Sbjct: 177 KYVEKYKDVTHPLIKIYAAMVTAMDEAVGNVTDALKRTGLWDNTVLIFSTDNGARE---- 232

Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
            NA SNWPLRG KNTLWEGGVRG G + S LL      ++  +H+SDW PTLL
Sbjct: 233 -NAGSNWPLRGWKNTLWEGGVRGVGFVNSKLLRKGNRKSDALIHISDWFPTLL 284



 Score = 43.1 bits (100), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 51/107 (47%), Gaps = 22/107 (20%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNI----------------DDEWQIS---ALTRGKWKL 550
           +DG D+W  +S   PS R  ILHNI                DD + IS   A+  G WKL
Sbjct: 297 LDGFDIWDTISTGTPSPRKEILHNIDPLHRKTASSSFNLLYDDIFNISVYAAIRSGDWKL 356

Query: 551 VKENSINGNGTSENRSNDNSYQNEIDGID--VWSVLSRNEPSKRNTI 595
           +  +  +G G  ++ ++ + ++  +D     +W    RN+P +R+ +
Sbjct: 357 LTGDQGHG-GWKDSDTSGSKFEYAVDPPTKHLWLFNIRNDPQERSDL 402


>gi|260816811|ref|XP_002603281.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
 gi|229288599|gb|EEN59292.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
          Length = 357

 Score =  251 bits (641), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 136/326 (41%), Positives = 189/326 (57%), Gaps = 15/326 (4%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+ I+ADDLGW+DV ++  + +  PN+  LA +G+I    Y+ +LCTPSR+A++TGK 
Sbjct: 8   PHILLIVADDLGWSDVSWNNPNVV-MPNLHTLATTGVIFNQTYSQRLCTPSRTALLTGKF 66

Query: 120 PIHTGMQ-HNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           P   GMQ    ++     GLPL E++LPQ LK+LGY T +VGKWHLG  K EYTPT RGF
Sbjct: 67  PYRLGMQVQKSMFEKNSHGLPLDEELLPQKLKKLGYATHMVGKWHLGSCKWEYTPTERGF 126

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           +S  G+  G +DY+ H +E     GLD         D +G YST+ F   A +II  H  
Sbjct: 127 DSFYGFHHGGEDYYTHMSER----GLDFWDGRTSVSDRNGVYSTESFARRAENIISQHDP 182

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
           + PLFLYL   + H+     P Q P  YL     I+D  R     +   LD+++ +V +A
Sbjct: 183 NTPLFLYLPFQSVHT-----PHQVPSSYLQTFSTIQDDNRKSILGMATALDDAIKRVTDA 237

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L+ + +  N++ +F+SDNGG      +   SNWPLRG K TLWEGG R    +   +LE 
Sbjct: 238 LKGKGLWDNTLTIFMSDNGGNY----VQGQSNWPLRGAKGTLWEGGTRVPAFVHGNMLER 293

Query: 359 RGIVAEQYVHVSDWLPTLLSAANKSD 384
            G      +H  D LPTL+S A  ++
Sbjct: 294 TGYTYHGMMHGVDILPTLVSVAGGTE 319


>gi|260786699|ref|XP_002588394.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
 gi|229273556|gb|EEN44405.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
          Length = 353

 Score =  251 bits (640), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 136/325 (41%), Positives = 190/325 (58%), Gaps = 15/325 (4%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+ I+ADDLGW+DV ++  + +  PN+  LA +G+I    Y  +LCTPSR+A++TGK 
Sbjct: 1   PHILLIVADDLGWSDVSWNNPNVV-MPNLHTLATTGVIFNQTYCQRLCTPSRTALLTGKF 59

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P   GMQ  + +  +  GLPL E++LPQ LK+LGY T ++GKWHLG  K EYTPT RGF+
Sbjct: 60  PYRLGMQRPIRHK-KAHGLPLDEELLPQKLKKLGYATHMIGKWHLGCCKWEYTPTERGFD 118

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           S  GY  G QDY+ H ++     GLD         D +G YST+ F   A +II  H  +
Sbjct: 119 SFYGYHRGSQDYYTHMSDG----GLDFWEGKTAISDQNGVYSTESFATRAENIISQHDPN 174

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLYL     H+     P Q P  YL     I+D  R     ++  LD+++ +V +AL
Sbjct: 175 TPLFLYLPLQPVHT-----PHQVPSSYLQTFSTIQDHNRKSILGMVAALDDAIKRVTDAL 229

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           + + + +N++ +F+SDNGG      L+  SNWPLRG K T+WEGG R        +LE  
Sbjct: 230 QGKGLWNNTLTIFMSDNGGDY----LDGQSNWPLRGAKGTVWEGGTRVPAFAHGNMLERT 285

Query: 360 GIVAEQYVHVSDWLPTLLSAANKSD 384
           G      +H  D LPTL+S A  ++
Sbjct: 286 GYTYHGMMHGVDILPTLVSVAGGTE 310


>gi|291233195|ref|XP_002736539.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 513

 Score =  249 bits (635), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 136/334 (40%), Positives = 203/334 (60%), Gaps = 23/334 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+FILADD GW+D+G+HG   + +P +D LA  G+ L+NYY   +C+P+R+ +M+G++
Sbjct: 26  PNIVFILADDFGWHDIGYHG-SIVRSPYMDFLASEGVKLENYYVQPMCSPTRAQLMSGRY 84

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            I  G+QH V+   +R  LP  E  + Q +KE GY T +VGKWHLGFY+KE  P  RGF+
Sbjct: 85  QIRYGLQHLVIQPDQRACLPPDEVTIAQKMKEAGYATHMVGKWHLGFYRKECLPINRGFD 144

Query: 180 SHLGYWTG---HQDY---FDHSAE---EMKMWGLDMRRDLEPAWDLH-GKYSTDVFTAEA 229
           +  G+      H  Y   + H+ E   +  M+G D+ R+ +     H G+YST +F  EA
Sbjct: 145 TFFGFLNCLIYHYTYDFGYWHTPESGNKTIMFGWDLFRNHDCVAKEHKGEYSTILFAEEA 204

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFAAILHKL 288
             +I NH  + P+FLYL  AA H+     PL  P  Y  ++   IED +R K AA++  +
Sbjct: 205 QRVIWNHDQETPMFLYLPFAAVHN-----PLDVPREYEAMYEGIIEDEQRRKKAAMVTCM 259

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           DE++  + +AL    M  N++++F +DNGGA +       +NWPLRG K +++EGG+RG 
Sbjct: 260 DEAIWNITKALMDTGMWDNTVLIFSTDNGGATS----VGGNNWPLRGGKRSMFEGGIRGV 315

Query: 349 GLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAA 380
           G + SPLL+   RG    Q +HV+DW PT +  A
Sbjct: 316 GFVTSPLLDEAVRGTENNQLIHVTDWFPTFVHLA 349


>gi|291227815|ref|XP_002733878.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 508

 Score =  249 bits (635), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 142/357 (39%), Positives = 198/357 (55%), Gaps = 22/357 (6%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           M   V+ L     M   DL  ++  PHI+FILADDLG+ DVG+     + TPNID LA  
Sbjct: 1   MDLFVIILTLISLMTNFDLARTAQRPHIVFILADDLGYFDVGYRNGSIVKTPNIDKLAAE 60

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL--SEKILPQYLKEL 152
           G+ L+ +Y    C PSRS +M G++ IHTG  +    G +R  L +      +P  LKE 
Sbjct: 61  GVKLERHYAQPSCMPSRSCLMMGRYQIHTGFDYRCKDG-KRSQLCMHPDTITMPMKLKEN 119

Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH----SAEEMKMWGLDMRR 208
           GY T ++GKWHLG  + E  P  +GF++  GY +  +DYF H     A     W    R 
Sbjct: 120 GYATHMIGKWHLGNIRWECLPNAKGFDTFFGYLSAIEDYFTHYTPAGANCHDFW----RN 175

Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
             E A D  G+YST +FT EA DII NH  ++P+F+YL++ A H      PLQ P HY +
Sbjct: 176 HDEVADDYKGQYSTHLFTKEAQDIIKNHDINQPMFMYLSYQAVHG-----PLQVPPHYYD 230

Query: 269 IHRHIEDFK--RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN 326
           ++         R  +AA+   +DE +G V   L+QR M  N++++FVSDNG   A F+  
Sbjct: 231 MYNDGPRVSTTRQTYAAMATCMDEGIGNVTSTLKQRGMWDNTVLIFVSDNG---AQFH-T 286

Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
              NWPLRG K +++EGG+R    + SPL+E    ++E  VH+ DW PTL   A  +
Sbjct: 287 GGRNWPLRGGKGSVFEGGIRVVSFVTSPLIERPRRISEDIVHLMDWFPTLTHLAGGT 343


>gi|220906870|ref|YP_002482181.1| sulfatase [Cyanothece sp. PCC 7425]
 gi|219863481|gb|ACL43820.1| sulfatase [Cyanothece sp. PCC 7425]
          Length = 495

 Score =  248 bits (634), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 152/380 (40%), Positives = 204/380 (53%), Gaps = 33/380 (8%)

Query: 25  LKELGYRTRIMAFAVLPLAFTLSMVFVDLVA------------SSGPPHIIFILADDLGW 72
           +K +  ++R   F  L   FT  + +  L              SS PPHI+FI++DD GW
Sbjct: 1   MKNIFRKSRRFLFGFLFAVFTCCITWKLLTLNQQDLPVAVAQQSSQPPHILFIMSDDQGW 60

Query: 73  NDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG 132
            DVGFHG D I TPN+D LA +G  L+ YY+  +CTPSR+A++TG++P   G+Q  V+  
Sbjct: 61  KDVGFHGSD-IRTPNLDQLAKTGARLEQYYSQPMCTPSRAALLTGRYPHRYGLQTLVIPS 119

Query: 133 CERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF 192
             + GLP  E +LPQ LKE GY T IVGKWHLG    +Y P  RGF+   G   G  DYF
Sbjct: 120 AGKYGLPTDEYLLPQALKEAGYETAIVGKWHLGHADPKYWPRQRGFDYQYGPLLGEIDYF 179

Query: 193 DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATH 252
            HSA     W  + +   E        Y T +   +AV +I  H+   PLFLYLA  A H
Sbjct: 180 THSAHGKVDWYRNNQLIKEEG------YVTTLLGQDAVKLIEKHNPKTPLFLYLAFTAPH 233

Query: 253 SANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVF 312
           +     P QAP  YL+ ++ I D  R  +AA++  +D+ +G+VV ALE+R M +N++IVF
Sbjct: 234 A-----PYQAPQKYLDQYKTIADPNRRAYAAMITAMDDQIGQVVAALEKRGMRNNTLIVF 288

Query: 313 VSDNGGA-AAGFNLNA--------ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVA 363
            SDNGG  +A F            A N P R  K +L+EGG R   L   P     G V 
Sbjct: 289 QSDNGGPRSAQFTGEVDTSGGTIPADNGPYRDGKASLYEGGTRVVALANWPGKIQPGTVV 348

Query: 364 EQYVHVSDWLPTLLSAANKS 383
              +H+ D  PTL   A+ S
Sbjct: 349 NHPIHIVDMYPTLTGLASVS 368


>gi|395510440|ref|XP_003759483.1| PREDICTED: arylsulfatase B [Sarcophilus harrisii]
          Length = 659

 Score =  248 bits (633), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 137/332 (41%), Positives = 192/332 (57%), Gaps = 27/332 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+F+LADDLGWNDVG+H    I TP++DALA  G+ L+NYYT  LCTPSRS ++TG++
Sbjct: 174 PHIVFVLADDLGWNDVGYHD-SNIFTPHLDALAAGGVRLENYYTQPLCTPSRSQLLTGRY 232

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            IHTG+QH +++ C+   LPL EK+LP+ L+E GY T +VGKWHLG ++KE  PT RGF+
Sbjct: 233 QIHTGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMFRKECLPTRRGFD 292

Query: 180 SHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
           +  GY  G +DY+ H       A  +    LD R   + A   +  YST++FT +A+D+I
Sbjct: 293 TFFGYLLGSEDYYSHKHCVHIDALNVTRCALDFRDGEDVAEGYNNTYSTNIFTEKAIDLI 352

Query: 234 HNHSTD---EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
             H      E L + L      +      L + +  ++     +    +    I+H    
Sbjct: 353 AKHPAQKGTEDLIVILDSRGVPT------LVSSEEQVSFLCKDQSSDHTSL-QIIH---- 401

Query: 291 SVGKVVEALEQRRMLSNSIIVF-VSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
             G +      R    NS   F   DNGG      L   +NWPLRG K TLWEGG+RG G
Sbjct: 402 -FGSIKNVQTPRASSENSSSAFPFPDNGGQ----TLAGGNNWPLRGRKWTLWEGGIRGVG 456

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            + SPLL+  G+ + + +H+SDWLPTL++ A 
Sbjct: 457 FVASPLLKQTGVQSRELIHISDWLPTLVNLAG 488



 Score = 41.6 bits (96), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   LPL EK+LP+ L+E GY T ++ 
Sbjct: 238 LQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVG 273


>gi|21430588|gb|AAM50972.1| RE13542p [Drosophila melanogaster]
          Length = 300

 Score =  248 bits (632), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 128/286 (44%), Positives = 182/286 (63%), Gaps = 5/286 (1%)

Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
           +CTPSR+A++TGK+PI+TGMQH V+   +  GLPL+E  + +  +E GYRT ++GKWHLG
Sbjct: 1   MCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLG 60

Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW--GLDMRRDLEPAWDLHGKYSTD 223
             ++ +TPT RGF+ HLGY   + DY+  S E+      G D R  L+   D  G Y TD
Sbjct: 61  LSQRNFTPTERGFDRHLGYLGAYVDYYTQSYEQQNKGYNGHDFRDSLKSTHDHVGHYVTD 120

Query: 224 VFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
           + T  AV  I +H +    +PLFL L H A H+AN  +P+QAP   ++   +I +     
Sbjct: 121 LLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRY 180

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
           +AA++ +LD+SVG V++AL ++ ML NSII+F+SDNGG   G +   ASN+PLRG KN+ 
Sbjct: 181 YAAMVSRLDKSVGSVIDALARQEMLQNSIILFLSDNGGPTQGQHSTTASNYPLRGQKNSP 240

Query: 341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           WEG +R +  IWS   E  G V +Q +++ D LPTL +AA  S  P
Sbjct: 241 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAGISPDP 286


>gi|198417507|ref|XP_002121051.1| PREDICTED: similar to arylsulfatase B [Ciona intestinalis]
          Length = 518

 Score =  247 bits (631), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 133/352 (37%), Positives = 205/352 (58%), Gaps = 12/352 (3%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
            +A A L L + + ++      ++ PP+I+ +LADDLGWNDV +H    +  PN+  LA 
Sbjct: 4   FVATAWLALQWLMPLIEGTNGQTTNPPNIVMVLADDLGWNDVSWHN-SIVQMPNLQDLAE 62

Query: 94  SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
            G+IL++ Y  + CTPSR+A +TG++PI+TGMQ  V+   +  GLP+  K+LP YLK+ G
Sbjct: 63  RGVILEHAYAQEKCTPSRAAFLTGRYPINTGMQEEVVVATQMSGLPIEFKLLPSYLKDQG 122

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y T ++GKWH+G+  + YTPT RGF+SH G++     Y ++S+ E    G D R DL   
Sbjct: 123 YATHMIGKWHVGYCDEAYTPTRRGFDSHYGFYNSGISYSNYSSTEGTDVGYDYRDDLALN 182

Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR-H 272
               GKY+T  FT +A  +I NH    P+FLY+A+ A H+     P +  + Y +I+  +
Sbjct: 183 LAAEGKYTTTDFTDQAKTLIDNHDQTNPMFLYMAYNAPHT-----PFEVEESYRDIYDGN 237

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
           + D  R  +  ++  LDE VG++V+ L++  M SN++ VF SDNGG        + +N+P
Sbjct: 238 LRDGNRKTYLGMISALDEQVGQLVDKLKEVGMWSNTVFVFYSDNGGTQPQSG-QSGNNFP 296

Query: 333 LRGVKNTLWEGGVRGAGLIWSPL----LESRGIVAEQYVHVSDWLPTLLSAA 380
           LRG K +L+EGG R    + +      LE     +    H+SD   T ++ A
Sbjct: 297 LRGKKGSLFEGGYRVPAFLIARTRAGNLELIASTSSTLFHISDMFATFIALA 348


>gi|449278684|gb|EMC86475.1| Arylsulfatase B, partial [Columba livia]
          Length = 431

 Score =  247 bits (631), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 120/267 (44%), Positives = 174/267 (65%), Gaps = 15/267 (5%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH +++ C+   LPL EK+LP+ L+E GY T +VGKWHLG YKKE  PT RGF++
Sbjct: 1   IHTGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMYKKECLPTHRGFDT 60

Query: 181 HLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           + GY  G +DY+ H       A+ +    LD R   E A      YST++FT  A+D+I 
Sbjct: 61  YFGYLLGSEDYYSHDRCVLIKAKNITRCALDFRDGEEVATGFKNMYSTNLFTERAIDLIA 120

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
           +H T++PLFLYLA  + H     EPL+ P+ Y+  +  I+D KR  +A ++  +DE+VG 
Sbjct: 121 HHKTEKPLFLYLAFQSVH-----EPLEVPEEYMKPYSSIKDAKRRHYAGMVTLMDEAVGN 175

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           + +AL++  + +N+++VF +DNGG      L   +NWPLRG K TLWEGGVRG G + SP
Sbjct: 176 LTDALKRYGLWNNTVLVFSTDNGGQ----TLAGGNNWPLRGRKWTLWEGGVRGVGFVASP 231

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAAN 381
           LL+ +G+ + + +H+SDWLPTL+  A 
Sbjct: 232 LLKQKGVESHELIHISDWLPTLVHLAG 258



 Score = 41.2 bits (95), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 1  MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
          +QH +++ C+   LPL EK+LP+ L+E GY T ++ 
Sbjct: 5  LQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVG 40


>gi|345316675|ref|XP_001517879.2| PREDICTED: arylsulfatase B-like [Ornithorhynchus anatinus]
          Length = 782

 Score =  247 bits (631), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 129/300 (43%), Positives = 181/300 (60%), Gaps = 20/300 (6%)

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
           KNY  V     +  +  T +  IHTG+QH +++ C+   +PL EK+LP+ LKE GY T +
Sbjct: 332 KNYGYVPSYLNATKSPHTKREQIHTGLQHQIIWACQPNCVPLDEKLLPELLKEAGYATHM 391

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEP 212
           VGKWHLG Y+KE  PT RGF+S+ GY  G +DY+ H       +  +    LD R   E 
Sbjct: 392 VGKWHLGMYRKECLPTRRGFDSYFGYLLGSEDYYSHERCVLIRSLNVTRCALDFRDGEEV 451

Query: 213 AWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
           A      YST+VF   AVD+I NH  D+PLFLYLA  + H     EPLQ P+ Y+  +  
Sbjct: 452 AVGYKNMYSTNVFAKRAVDLIANHPPDKPLFLYLAFQSVH-----EPLQVPEEYVKPYSF 506

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
           I++ KR  +A ++  +DE+VG V  AL++  +  N++++F +DNGG      L   +NWP
Sbjct: 507 IQNKKRRNYAGMVSIMDEAVGNVTAALKRHGLWDNTVLIFSTDNGGQ----TLAGGNNWP 562

Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNST 392
           LRG K TLWEGGVRG G + SPLL+ +G+   + +H+SDWLPTL++ A       +VN T
Sbjct: 563 LRGRKWTLWEGGVRGIGFVASPLLKQKGVKNRELIHISDWLPTLVNLAG-----GHVNGT 617



 Score = 41.6 bits (96), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LP+ LKE GY T ++ 
Sbjct: 358 LQHQIIWACQPNCVPLDEKLLPELLKEAGYATHMVG 393


>gi|158424485|ref|YP_001525777.1| twin-arginine translocation pathway signal [Azorhizobium
           caulinodans ORS 571]
 gi|158331374|dbj|BAF88859.1| twin-arginine translocation pathway signal precursor [Azorhizobium
           caulinodans ORS 571]
          Length = 490

 Score =  246 bits (629), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 141/332 (42%), Positives = 196/332 (59%), Gaps = 23/332 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI++ILADDLG+ DVGFHG D I TPN+D LA +G  L  +YT  +CTP+R+A+MTG++
Sbjct: 47  PHILYILADDLGFADVGFHGSD-IKTPNLDKLAATGATLGQFYTQPMCTPTRAALMTGRY 105

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P+  G+Q  V+      GL   E +LPQ LK +GY T ++GKWHLG  K+++ P  RGF+
Sbjct: 106 PLRYGLQTGVIPSGASYGLATDEFLLPQALKSVGYSTALIGKWHLGHAKQDFWPRQRGFD 165

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
              G   G  D++ H A  +  W  D ++ +E  +D      T++F  +A  +I  H   
Sbjct: 166 YFYGPLVGEIDHYKHEAHGVVDWYRDNKQVVEEGYD------TELFGTDAARLIGAHDPK 219

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLYLA  A H+     P QAP  Y++ + +I D  R  +AA++  +D+ +GKVV AL
Sbjct: 220 TPLFLYLAFTAPHT-----PFQAPQAYVDRYPNITDPARRLYAAMISAMDDQIGKVVAAL 274

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLN--------AASNWPLRGVKNTLWEGGVRGAGLI 351
           EQR M  N++IVF SDNGG  +   +          ASN P R  K TL+EGG R A L 
Sbjct: 275 EQRGMRENTLIVFHSDNGGTRSKMFVGEGAFHGELPASNAPYRDGKGTLYEGGTRVAALA 334

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTL--LSAAN 381
             P   + G  A+  +HV D +PTL  L+ AN
Sbjct: 335 NWPGRITPG-AADGVMHVVDMMPTLARLAGAN 365


>gi|357023853|ref|ZP_09086021.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
 gi|355544286|gb|EHH13394.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
          Length = 501

 Score =  246 bits (629), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 142/331 (42%), Positives = 194/331 (58%), Gaps = 23/331 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I++I+ADD+G+ D GF+G D IPTPN+D LA  G  L+ +Y + +CTP+R+A+MTG++
Sbjct: 60  PNIVYIVADDMGFGDAGFNGSD-IPTPNLDKLAAEGARLEQFYALPMCTPTRAALMTGRY 118

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P+  G+Q  V+       LP+ E +LPQ LK+ GY T +VGKWHLG  K E+ P  RGF+
Sbjct: 119 PLRYGLQVGVIPAAGTYSLPVDEYLLPQALKDTGYTTAMVGKWHLGHAKPEFWPRQRGFD 178

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
              G   G  D+F HS+  +K W  + +   E  +D        +F  EAV ++  H   
Sbjct: 179 YFYGALVGEIDHFKHSSHGVKDWYRNNKPLNETGFD------NTLFGNEAVRVVERHEGK 232

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLYLA  A H+     P QAP  YL+  + I D  R K+AA++  +D+ VGKVV AL
Sbjct: 233 SPLFLYLAFTAPHT-----PFQAPQDYLDRFKDIADDNRRKYAAMISVMDDGVGKVVAAL 287

Query: 300 EQRRMLSNSIIVFVSDNGG---------AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
           E++ M  N+IIVF SDNGG         +  G  L  ASN P R  K TL+EGG R A L
Sbjct: 288 EKKGMRDNTIIVFHSDNGGVKNSLFAGDSKVGGEL-PASNGPYRDGKGTLYEGGTRVAAL 346

Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
              P     G+V +  VHV D  PTL + A+
Sbjct: 347 ANWPGKIKPGVV-DSMVHVVDMYPTLAAVAD 376



 Score = 42.7 bits (99), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 21/66 (31%), Positives = 39/66 (59%), Gaps = 2/66 (3%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDN 569
           +DG++VW  LS  +PS R+ +++N+D      A+ +G WKLV + S+       N ++D 
Sbjct: 385 LDGMNVWDALSEGKPSPRSEVVYNVDP--LAGAVRKGDWKLVWKASLPPKVELFNLADDK 442

Query: 570 SYQNEI 575
           S  +++
Sbjct: 443 SEAHDL 448



 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 18/42 (42%), Positives = 28/42 (66%), Gaps = 2/42 (4%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
           +DG++VW  LS  +PS R+ +++N+D      A+ KG WKLV
Sbjct: 385 LDGMNVWDALSEGKPSPRSEVVYNVDP--LAGAVRKGDWKLV 424


>gi|354488427|ref|XP_003506371.1| PREDICTED: arylsulfatase I [Cricetulus griseus]
          Length = 560

 Score =  246 bits (628), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 126/281 (44%), Positives = 180/281 (64%), Gaps = 11/281 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
            A S PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 96  AAPSQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQ 154

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGF++KE  P
Sbjct: 155 LLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKECLP 214

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           + RGF++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I
Sbjct: 215 SCRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGETVAWGLSGQYSTMLYAQRASHI 274

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           + +HS   PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V
Sbjct: 275 LASHSPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAV 329

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
             +  AL++    +NS+I+F SDNGG       +  SNWPL
Sbjct: 330 RNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPL 366


>gi|291227811|ref|XP_002733876.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 539

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 134/345 (38%), Positives = 195/345 (56%), Gaps = 18/345 (5%)

Query: 44  FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
             L   ++ ++ ++  PHI+FILADDLG+ DVG+     + TPNID LA  G+ L+ +Y 
Sbjct: 45  LILLTSYLTVIQATQRPHIVFILADDLGYFDVGYRNGSIVKTPNIDKLAAEGVKLERHYA 104

Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKI-LPQYLKELGYRTRIVGKW 162
              C PSRS +M G++ IHTG  +    G         + I +P  LKE GY T +VGKW
Sbjct: 105 QPSCMPSRSCLMMGRYQIHTGFNYKCTDGSGSQLCMHPDTITIPMKLKENGYATHMVGKW 164

Query: 163 HLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA----EEMKMWGLDMRRDLEPAWDLHG 218
           HLG  + E  P  +GF++  GY    +DY+ H +    E   +W    R   + A + +G
Sbjct: 165 HLGNIRWECLPNAKGFDTFFGYHGASEDYYTHFSPAGRECRDLW----RNRDDVAQEYYG 220

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
           +YST +FT EA++II NH   +P+FLYL + A H      PLQ P+ Y N++    D  R
Sbjct: 221 QYSTHIFTNEALNIIENHDVSKPMFLYLPYQAVHG-----PLQVPEKYYNMYERPLDDIR 275

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
            K+AA+   +DE +G V  AL+QR M  N++ +FVSDNG            NWPLRG K 
Sbjct: 276 RKYAAMATCMDEGIGNVTNALKQRGMWDNTVFIFVSDNGAQFP----TGGRNWPLRGGKG 331

Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           +++EGG+R    + SPL+E     + + +H++DWLPTL   A  +
Sbjct: 332 SVFEGGIRVVSFVTSPLIERPQRSSNEMIHLTDWLPTLTHLAGGT 376


>gi|440902784|gb|ELR53530.1| Arylsulfatase B, partial [Bos grunniens mutus]
          Length = 431

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 123/269 (45%), Positives = 167/269 (62%), Gaps = 15/269 (5%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT RGF++
Sbjct: 3   IHTGLQHQIIWPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDT 62

Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           + GY  G +DY+ H       A  +    LD R   E A      YST+VFT  A  +I 
Sbjct: 63  YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTERATTLIT 122

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
           NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A +   +DE+VG 
Sbjct: 123 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASVMDEAVGN 177

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V  ALE+R + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG G + SP
Sbjct: 178 VTAALERRGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASP 233

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           LL+ +G+   + +H+SDWLPTL+  A  S
Sbjct: 234 LLKRKGVKTRELIHISDWLPTLVKLAGGS 262



 Score = 42.7 bits (99), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1  MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
          +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 7  LQHQIIWPCQPSCIPLDEKLLPQLLKEAGYATHMVG 42


>gi|154248610|ref|YP_001419568.1| sulfatase [Xanthobacter autotrophicus Py2]
 gi|154162695|gb|ABS69911.1| sulfatase [Xanthobacter autotrophicus Py2]
          Length = 491

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 149/347 (42%), Positives = 197/347 (56%), Gaps = 22/347 (6%)

Query: 46  LSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
           LS++     A+  P PHI++ILADDLG+ DVGFHG D I TPN+D LA  G  L  +YT 
Sbjct: 34  LSLLSSGARAADAPRPHIVYILADDLGFADVGFHGSD-IKTPNLDHLAAQGARLGQFYTQ 92

Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
             CTP+R+A +TG++P+H G+Q   +    + GL   E +LPQ LK++GYRT +VGKWHL
Sbjct: 93  PFCTPTRAAFLTGRYPLHYGLQVGAIPSGAKYGLATDEFLLPQALKDVGYRTALVGKWHL 152

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
           G   +++ P  RGF+S  G   G  D+F H A  +  W  D  +  E  +D      T++
Sbjct: 153 GHADQKFWPRQRGFDSFYGPLVGEIDHFKHEAHGVTDWYHDNTQVKEEGYD------TEL 206

Query: 225 FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAI 284
           F  EAV +I  H    PLFLYLA  A     P+ P QAP  YL+ + HI   +R  +AA+
Sbjct: 207 FGKEAVRLIAAHDPKTPLFLYLAFTA-----PHTPFQAPQSYLDQYAHIAAPQRRAYAAM 261

Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA----AAGFNLNA----ASNWPLRGV 336
           +  +D+ +G VV AL  R M  N++IVF SDNGG      AG    A    ASN P R  
Sbjct: 262 ITAMDDQIGHVVAALTSRGMRENTLIVFHSDNGGTRSKMFAGEGAVAGDLPASNAPYRDG 321

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           K +L+EGG R   L   P   + G  AE  +HV D LPTL   A  S
Sbjct: 322 KGSLYEGGTRVVALANWPGRIAPG-AAEGVMHVVDMLPTLAKLAGAS 367


>gi|298712440|emb|CBJ33216.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 726

 Score =  245 bits (626), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 138/368 (37%), Positives = 206/368 (55%), Gaps = 51/368 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQI-PTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           P++ FI+ DD+GWND+G+H  D    TPN+D L+ SG+ +  YY++ +CTP+R+A+MTG+
Sbjct: 112 PNVFFIMIDDMGWNDIGYHSTDLANVTPNLDRLSASGVKVSQYYSMSICTPARAALMTGR 171

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +P+  G+Q+NV+      GLPL+EK+LP+Y+ E GY + +VGKWHLG Y   +TP  RGF
Sbjct: 172 YPVRYGLQYNVIQPGAPWGLPLTEKLLPEYMNEAGYESHMVGKWHLGSYTHAHTPHRRGF 231

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWG--------------LDMRRDLEP------------ 212
           E++ GY    + Y+ H      + G               D+    +P            
Sbjct: 232 ETYFGYLNDEEMYWTHQTWTATINGRKFFDFGFGNATGFYDVIERFDPPPGDDDLVSTGP 291

Query: 213 -------AWDLHGKYSTDVFTAEAVDIIHNHS--TDEPLFLYLAHAATHSANPYEPLQA- 262
                  + ++ G YST++FT  A++I+   +   + PLFLYL+H A H      P  A 
Sbjct: 292 TSSVYSSSLEIKGDYSTEIFTDRALEILSQKTPHDENPLFLYLSHQAVHDPLGVPPADAF 351

Query: 263 -PDHYLNIHRHIEDFK-----RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
            P+ +  +   +ED       R +FA +L  LD+S+G++V+ LE    L NSIIV  SDN
Sbjct: 352 SPEEWA-VLTALEDESSDASLRVRFAKVLMYLDKSIGRLVDYLETEGWLENSIIVVASDN 410

Query: 317 GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL---ESRGIVAEQYVHVSDWL 373
           GG  +       SN+PLRG+K++ WEGGV+    ++SP     E  G      +HV+DWL
Sbjct: 411 GGCPS----TGGSNYPLRGLKHSNWEGGVKVPAFVYSPSHIPEEHWGTEYRGLMHVTDWL 466

Query: 374 PTLLSAAN 381
           PTL  AAN
Sbjct: 467 PTLADAAN 474


>gi|291227809|ref|XP_002733875.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 505

 Score =  245 bits (626), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 141/350 (40%), Positives = 198/350 (56%), Gaps = 15/350 (4%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
            AVL   F+ ++ +   V     P+IIFILADDLG+ DVG+     + TPNID LA  G+
Sbjct: 5   IAVLLAIFSFALQWAHGVTEQ--PNIIFILADDLGYFDVGYREGSIVKTPNIDKLAAEGV 62

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG-CERGGLPLSEKILPQYLKELGYR 155
            L+ YY    C PSRS +M G++ IHTG  +  L G   R  +      LP  L++ GY 
Sbjct: 63  KLERYYAQSSCMPSRSCLMMGRYQIHTGFDYRCLDGQLTRLCMAPDTVTLPMKLRQYGYA 122

Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE-PAW 214
           T ++GKWHLG  +KE  PT +GF++  GY    ++Y+ H+A        D+ R++E  A 
Sbjct: 123 THMIGKWHLGHERKECVPTHKGFDTFFGYHGAAENYYTHTALGRPR-CHDLWRNMENVAE 181

Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY-LNIHRHI 273
           D  G+YST  +T EA DII NH   +P F+YL++ A H      PL+ P +Y +  +   
Sbjct: 182 DYDGQYSTLFYTKEAQDIIKNHDKKKPFFMYLSYQAVHG-----PLEVPSNYRMPYYTSG 236

Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
            D  R  +AA+   +DE +G + E L+   ML+N+I+VF SDNG   A F+     NWPL
Sbjct: 237 VDESRQIYAAMATCMDEGIGNITETLKASGMLNNTILVFSSDNG---AQFH-TGGRNWPL 292

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           RG K T+WEGGVR    + SPLL+    V  + +H SDW PT +  A  +
Sbjct: 293 RGGKGTMWEGGVRVLAFVNSPLLKKSRTVNHEMIHTSDWFPTFVRLAGGT 342


>gi|391330456|ref|XP_003739676.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 631

 Score =  245 bits (625), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 134/330 (40%), Positives = 189/330 (57%), Gaps = 18/330 (5%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II +LADDLG++DV FHG +QIPTPN+DA+A  G+IL  +Y     TPSR A  TGK 
Sbjct: 42  PNIIVVLADDLGYDDVSFHGNEQIPTPNLDAMAADGVILNRHYAAMSGTPSRGAFFTGKL 101

Query: 120 PIHTGMQHN-VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           P+  G+    +L G    GL L  ++LP Y+++LGY T ++G+W LGFYK+   PT RGF
Sbjct: 102 PLRIGLNEGPILKGVYGTGLSLEHEVLPFYMRDLGYETHLIGRWGLGFYKESLLPTNRGF 161

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMW-------GLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
           ++H G ++    Y  H + E   W         D+ RD +P +   G Y TD++      
Sbjct: 162 DTHYGPYSDSASYSSHLSRE-DAWKSLSVPPAYDLHRDGKPDFSGFGSYVTDLYKGRFER 220

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN---IHRHIEDFKRSKFAAILHKL 288
           II      +PLF+ L+H   H A+ + PL  P    N      HI+D  R  +A  +  L
Sbjct: 221 IIEQRR--KPLFIVLSHQTPHGAS-FGPLHQPPPRTNRASQFLHIKDRSRRSYARTVESL 277

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA--GFNLNAASNWPLRGVKNTLWEGGVR 346
           D+S   +V+ L++  +L+ S+I+F SDNGG      +    ASNWPLRG KNT WEGGVR
Sbjct: 278 DDSFAHMVDVLQRNNILNESVILFSSDNGGTTFKDPYFKTGASNWPLRGQKNTQWEGGVR 337

Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
            +  +WS  + +   V+E+  H  DWLPT 
Sbjct: 338 VSSFVWSSQI-TEPYVSEELYHFVDWLPTF 366


>gi|326437895|gb|EGD83465.1| hypothetical protein PTSG_04073 [Salpingoeca sp. ATCC 50818]
          Length = 562

 Score =  244 bits (622), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 141/337 (41%), Positives = 193/337 (57%), Gaps = 44/337 (13%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           SS  P+I+FI+ DDLG++DVGF    QI TPNID                     +++I+
Sbjct: 31  SSSQPNIVFIVIDDLGFDDVGFKS-HQIKTPNID---------------------QASIL 68

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           +G++ +H G+  N +   +  GLPL+   LPQ LK  GY T  +GKWHLGFYK +YTPTF
Sbjct: 69  SGRYAMHHGIV-NWIPPKDSYGLPLNHTTLPQLLKNGGYDTHAIGKWHLGFYKWDYTPTF 127

Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP---------AWDLHGKYSTDVFT 226
           RGF S LGY++G ++YF H          DM RD  P         A+DL G+YST +F+
Sbjct: 128 RGFNSFLGYYSGGENYFTHKNGP----AYDMHRDPLPSCGQNCSQIAFDLQGQYSTTIFS 183

Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFAAIL 285
            EAV +I +H   +PLFLYLA+ A H     EP QAP  Y++ +   I D +R  FA +L
Sbjct: 184 DEAVRVIDDHIGPKPLFLYLAYQAVH-----EPAQAPQSYIDPYTDLIPDAQRRTFAGML 238

Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
             LDE +G V  AL++    +N++IVF +DNGG   G +   A N+P+RG K+ ++EGGV
Sbjct: 239 SCLDEGIGNVTRALKRNGFYNNTLIVFTTDNGGPILGGDSVGARNYPMRGGKHAVYEGGV 298

Query: 346 RGAGLIWSP--LLESRGIVAEQYVHVSDWLPTLLSAA 380
           RG   +  P       G      +H +DWLPTL  AA
Sbjct: 299 RGIAFVHPPQNTFVETGAGYHNLMHAADWLPTLCDAA 335


>gi|332016485|gb|EGI57378.1| Arylsulfatase I [Acromyrmex echinatior]
          Length = 502

 Score =  244 bits (622), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 121/258 (46%), Positives = 165/258 (63%), Gaps = 2/258 (0%)

Query: 124 GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG 183
           GMQ   + G E  GLPLS +ILP++L+ LGY T+++GKWHLG++  ++TP  RGF+S LG
Sbjct: 2   GMQGESIQGGEPRGLPLSVRILPEHLRGLGYTTKMIGKWHLGYHTPQHTPLHRGFDSFLG 61

Query: 184 YWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLF 243
           ++  H  Y+D+      M G DM R   PA+ L  KY TD+FT EAV II  H    PL+
Sbjct: 62  FYNSHVSYYDYKYSYQNMSGYDMHRGDAPAYGLTDKYVTDLFTDEAVRIIQTHDPSRPLY 121

Query: 244 LYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRR 303
           L ++H A H+  P E  Q  DHY     HI +  R K+A ++ KLD+SVG +V  L  R 
Sbjct: 122 LQISHLAVHA--PLENPQDYDHYDKRFMHIVEQNRRKYAKMVSKLDDSVGLIVNTLGSRG 179

Query: 304 MLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVA 363
           ML +S+I+F++DNG A  G   N  SN+PLRG+K TL+EGGVRG  ++WSP L     V 
Sbjct: 180 MLRDSVILFLTDNGAAPVGKFRNYGSNFPLRGMKYTLYEGGVRGVAVLWSPRLSKAARVC 239

Query: 364 EQYVHVSDWLPTLLSAAN 381
              +H++DWLPTL +AA+
Sbjct: 240 NNLMHITDWLPTLYAAAD 257


>gi|291235506|ref|XP_002737685.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 658

 Score =  243 bits (620), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 141/334 (42%), Positives = 194/334 (58%), Gaps = 28/334 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+F+LADDLGW+DVG+HG   I TPNID LA  G+ L+NYY    C PSR  ++TG++
Sbjct: 204 PNIVFVLADDLGWHDVGYHG-SIIDTPNIDHLAAEGVKLENYYVSSWCAPSRVNLLTGRY 262

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL-GFYKKEYTPTFRGF 178
            I TG+  +V   C+  G  + E  L   L E GY T +VGKWHL GF  +E  P  RGF
Sbjct: 263 RIRTGLYGDV---CDFMG--IHEITLADKLYEAGYYTAMVGKWHLSGFQHRECYPAHRGF 317

Query: 179 ESHLGYWTGHQDYFDH----SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           ++ LG+  G Q+YF H    S  E   W      D     +  G+YST V+  EA  II 
Sbjct: 318 QTFLGFHGGSQNYFTHRRGGSNSEYDFWA----NDTSIGREYDGRYSTMVYAEEAQRIIR 373

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFAAILHKLDESVG 293
           +H T++PLFLYL+  A HS     PL  P  Y + +R  IED KR  +AA+   +DE++G
Sbjct: 374 HHRTEQPLFLYLSFQAVHS-----PLLVPSAYEDKYRTGIEDDKRRVYAAMTSCMDEAIG 428

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
            V + L++  + + ++ +F SDNGG          SNWPLRG K   +EGG+R  G + S
Sbjct: 429 NVTDTLKETGLWNKTVFIFSSDNGGTPT----FGGSNWPLRGRKGLWYEGGIRAVGFVNS 484

Query: 354 PLLESR--GIVAEQYVHVSDWLPTLLSAANKSDI 385
            LL +   G    + +H++DW PTLL  A K+D+
Sbjct: 485 YLLSNHVTGSTNHELIHITDWFPTLLRLA-KTDL 517


>gi|403256717|ref|XP_003921000.1| PREDICTED: arylsulfatase B [Saimiri boliviensis boliviensis]
          Length = 551

 Score =  243 bits (620), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 119/267 (44%), Positives = 166/267 (62%), Gaps = 15/267 (5%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           I TG+QH +++ C+   +PL EK+LPQ+LKE GY T +VGKWHLG Y+KE  PT RGF++
Sbjct: 123 IRTGLQHQIIWPCQPSCVPLDEKLLPQFLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 182

Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           + GY  G +DY+ H       A  +    LD R   E A      YST++FT  A  +I 
Sbjct: 183 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAATLIT 242

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
           NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  +DE+VG 
Sbjct: 243 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGN 297

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V  AL+ R + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG G + SP
Sbjct: 298 VTAALKSRGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASP 353

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAAN 381
           LL+ +G+   + +H+SDWLPTL+  A 
Sbjct: 354 LLKQKGVKNRELIHISDWLPTLVKLAG 380



 Score = 44.7 bits (104), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ+LKE GY T ++ 
Sbjct: 127 LQHQIIWPCQPSCVPLDEKLLPQFLKEAGYTTHMVG 162


>gi|291224485|ref|XP_002732234.1| PREDICTED: jumonji domain containing 2c [Saccoglossus kowalevskii]
          Length = 1941

 Score =  241 bits (616), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 134/359 (37%), Positives = 201/359 (55%), Gaps = 26/359 (7%)

Query: 31  RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
           R R++   +  +A  + +   D    + PPHI+ ILADDLGWND+G++ L QI TP +D 
Sbjct: 16  RYRLVTAFLFAVAAIMFITLYD--KKTNPPHIVLILADDLGWNDIGWNNL-QIKTPVLDK 72

Query: 91  LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           LAY G+I    Y   LCTP+R+A+MTG +P   GMQH ++   +  GLPL  KILPQ LK
Sbjct: 73  LAYEGVIFNQTYVQPLCTPTRAALMTGYYPFRIGMQHQMVLPFQPSGLPLHLKILPQKLK 132

Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDY---------FDHSAEEMKM 201
           + GY   IVGKWHLG+   EYTP  RGF+S  G ++   ++          DHS    K 
Sbjct: 133 QAGYINHIVGKWHLGYCNWEYTPLNRGFDSFYGSFSNSVNHNNKISQLPISDHS----KY 188

Query: 202 WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQ 261
            G D R +     +  G+  T +FT   VDII NH  D P+F+Y          P +  +
Sbjct: 189 KGYDFRDNTGVVQN-DGQPLTKLFTQRVVDIISNHHKDYPMFMYFP-----INQPSKGSE 242

Query: 262 APDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA 321
             + Y  ++ ++ D +  ++ A L  +D++VG V++AL+ R M   ++++F+SDNG    
Sbjct: 243 PDEKYTALYPNVTDDQVRQWYATLSLMDDAVGHVIDALKSRGMWEETLLIFISDNG---- 298

Query: 322 GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            F+    +N PLRG   TL+EGG+R   +    +++  G +  +  H++DW  T+LS A
Sbjct: 299 AFDFMEGTNLPLRGAGTTLFEGGMRVPAIAHGKMIKKTGYINNELNHITDWHSTILSLA 357



 Score = 45.4 bits (106), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 60/124 (48%), Gaps = 24/124 (19%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNID--DEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
           ++DGI++W  +S  + S RN I++NID  DE + +AL  G WK++            TG 
Sbjct: 364 DLDGINIWGTVSEGKASARNEIVYNIDEIDECRGAALRLGDWKVI------------TGN 411

Query: 632 PDQV--YLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNN 689
           P  +  +L       WL  + R   + A      +  V         LF++K DP E++N
Sbjct: 412 PGLMCRFLDLNKQDGWLQ-SGRHSDNQAITDFQSISNV-------SYLFNLKEDPEERHN 463

Query: 690 LADR 693
           +A++
Sbjct: 464 VAEK 467



 Score = 43.9 bits (102), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 32/48 (66%), Gaps = 2/48 (4%)

Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNID--DEWQISALTRGKWKLVKEN 554
           ++DGI++W  +S  + S RN I++NID  DE + +AL  G WK++  N
Sbjct: 364 DLDGINIWGTVSEGKASARNEIVYNIDEIDECRGAALRLGDWKVITGN 411


>gi|301769831|ref|XP_002920339.1| PREDICTED: arylsulfatase B-like [Ailuropoda melanoleuca]
          Length = 519

 Score =  241 bits (616), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 119/269 (44%), Positives = 166/269 (61%), Gaps = 15/269 (5%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT RGF++
Sbjct: 91  IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 150

Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           + GY  G +DY+ H       A  +    LD R   E A      YST++FT  A  +I 
Sbjct: 151 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIT 210

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
           NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  +DE+VG 
Sbjct: 211 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRHYYAGMVSLMDEAVGN 265

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGG+RG G + SP
Sbjct: 266 VTAALKSHGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGIRGVGFVASP 321

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           LL+ +G+   + +H+SDWLPTL+  A  S
Sbjct: 322 LLKQKGVKNRELIHISDWLPTLVKLARGS 350



 Score = 43.1 bits (100), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 95  LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 130


>gi|291227813|ref|XP_002733877.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 490

 Score =  241 bits (615), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 128/326 (39%), Positives = 189/326 (57%), Gaps = 12/326 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHIIFILADDLG+ DVG+     I TPNID LA  G+ L+ +Y    C PSR+ +M G++
Sbjct: 12  PHIIFILADDLGYFDVGYRNRSVIKTPNIDKLAAEGVKLERHYAQPSCLPSRACLMMGRY 71

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKI--LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
            IHTG +   +    R    ++  I  LP  +K+ GY T ++GKWHLG    +  P  +G
Sbjct: 72  QIHTGYRDECMND-TRYQRCMNHDIVTLPMKMKQNGYVTHMIGKWHLGNNNWDCLPNAKG 130

Query: 178 FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           F+++ GY    +DY+ H     +      R  ++ A    G+YST +FT EAV+II NH 
Sbjct: 131 FDTYFGYNAAAEDYYTHMLSGRQNCSDLWRDRMDVADKYIGQYSTRIFTEEAVNIIENHD 190

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
             +P+F+YLAH A H+     PLQ P  Y +++    +  R K+AA++  +DE +G V E
Sbjct: 191 ISQPMFMYLAHQAVHA-----PLQVPQEYYSLYETTLNDVRRKYAAMVTCMDEGIGNVTE 245

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
           AL++  M  N+++VFVSDNGG  +        NWPLRG K +++EGG+R    + S L+E
Sbjct: 246 ALKESGMWDNTVLVFVSDNGGELSA----GGRNWPLRGGKGSVFEGGIRTVSFVTSSLIE 301

Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKS 383
               ++   +H++DW PTL   A  +
Sbjct: 302 RPQRISNAMIHITDWFPTLTHLAGGT 327


>gi|198434445|ref|XP_002131042.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 512

 Score =  240 bits (613), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 128/349 (36%), Positives = 199/349 (57%), Gaps = 19/349 (5%)

Query: 43  AFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ---IPTPNIDALAYSGIILK 99
           +F L  V   +  ++  PH++ ILADDLG+NDVG+ G +      TP +D+LA +G+ L+
Sbjct: 12  SFLLICVVGTVAEATKKPHVVLILADDLGYNDVGYWGQNHGSAAKTPFLDSLAENGVRLE 71

Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
           NYYT  +C+P+R A+MTG++ I  G+ H +++  +  GLPL   +LP+ L   GY T+++
Sbjct: 72  NYYTHSVCSPTRGALMTGRNRIDIGLAHGIIHTTQIEGLPLDNVLLPEQLSNCGYNTQMI 131

Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH---SAEEMKMWGLDMRRDLE-PAWD 215
           GKWHLGF   +Y P  RGF    G+  G ++Y+      A    + G+D       P  +
Sbjct: 132 GKWHLGFSSSKYAPWNRGFHGFYGFLAGSENYWSKWLPMARHSNIGGVDFTDSTTGPTNE 191

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
             G+YS  V+ + A  +I +H   +PLFLYL         P+ PL AP HY    + IED
Sbjct: 192 TWGQYSAHVYASRARYVIQHHDQSKPLFLYLP-----LQTPHTPLGAPSHYYEPFKDIED 246

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
             R K+ +++  LDE+V  V   L+   M  +++++F +DNGG       +  +NWP RG
Sbjct: 247 DDRMKYLSMVSVLDETVRNVTNYLKDAGMWEDTLLIFSTDNGGEVK----HGGNNWPYRG 302

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGI---VAEQYVHVSDWLPTLLSAAN 381
            KNTL+EGGV+  G +   +L        V  + +HV+DW PT+++AA+
Sbjct: 303 TKNTLYEGGVKAVGFVHGKILGDLSPAQRVNRELIHVTDWYPTIMAAAD 351


>gi|281353470|gb|EFB29054.1| hypothetical protein PANDA_009046 [Ailuropoda melanoleuca]
          Length = 431

 Score =  240 bits (612), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 119/269 (44%), Positives = 166/269 (61%), Gaps = 15/269 (5%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT RGF++
Sbjct: 3   IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 62

Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           + GY  G +DY+ H       A  +    LD R   E A      YST++FT  A  +I 
Sbjct: 63  YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIT 122

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
           NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  +DE+VG 
Sbjct: 123 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRHYYAGMVSLMDEAVGN 177

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGG+RG G + SP
Sbjct: 178 VTAALKSHGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGIRGVGFVASP 233

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           LL+ +G+   + +H+SDWLPTL+  A  S
Sbjct: 234 LLKQKGVKNRELIHISDWLPTLVKLARGS 262



 Score = 42.7 bits (99), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1  MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
          +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 7  LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 42


>gi|156340112|ref|XP_001620356.1| hypothetical protein NEMVEDRAFT_v1g148421 [Nematostella vectensis]
 gi|156205165|gb|EDO28256.1| predicted protein [Nematostella vectensis]
          Length = 260

 Score =  239 bits (611), Expect = 4e-60,   Method: Composition-based stats.
 Identities = 117/254 (46%), Positives = 167/254 (65%), Gaps = 10/254 (3%)

Query: 49  VFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGIILKNYYTVQLC 107
           V V   +++ PPHI+FIL DDLGW+DVG+H +   + TPNID LA  G+ L +YY+  +C
Sbjct: 14  VMVARTSATKPPHILFILMDDLGWSDVGYHNISHAVKTPNIDKLASQGVKLMSYYSQPMC 73

Query: 108 TPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
           TPSR A+MTGK+PIH GMQH V+      G+P     +PQ L+ LGYRT ++GKWHLGF+
Sbjct: 74  TPSRGALMTGKYPIHLGMQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGKWHLGFF 133

Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
             +YTP  RGF+S LG++ G QD++ HS    KM  LD RRD EPA +  G++STDVFT 
Sbjct: 134 DWDYTPLRRGFDSFLGFFAGEQDHWRHS----KMGFLDFRRDEEPANEYGGQHSTDVFTQ 189

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
           EA++I   H+  +PLFL L++AA H+     PLQA  + +N    + D  R  +  ++  
Sbjct: 190 EAINIAMRHNASQPLFLLLSYAAVHT-----PLQAHPNDVNKIGGVSDKDRQNYLGMMGA 244

Query: 288 LDESVGKVVEALEQ 301
            D S+G++++  ++
Sbjct: 245 ADWSIGRLIDVYKR 258


>gi|291221493|ref|XP_002730757.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 585

 Score =  239 bits (611), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 133/331 (40%), Positives = 195/331 (58%), Gaps = 17/331 (5%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHI+ +LADD+GWNDVG++  D +PTP ++ LA +G+IL N Y+   C+PSR+A++TGK
Sbjct: 107 PPHIVLVLADDVGWNDVGWNN-DFMPTPILNELASNGVILNNTYSQPACSPSRAALLTGK 165

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +P + G+QH V+       LPL   +L   LKELGY    +GKWHLGF   +YTP +RGF
Sbjct: 166 YPANAGIQHLVVQEQHPYYLPLHNTLLSTKLKELGYMNHAIGKWHLGFCNWKYTPLWRGF 225

Query: 179 ESHLGYWTGH-QDYFDHSAE-----EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           +S  G + G+  DY  H        E    GLD+R +       +G + T +FT  A  I
Sbjct: 226 DSFYGIFNGYLSDYSTHIVHSPFIGEPGASGLDLRDNTGVVAHENGTHVTYLFTERAERI 285

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I NH+   PLFLY A        P+ PL AP  + N    + D +R ++A ++  +DE+V
Sbjct: 286 IRNHNPAAPLFLYTALLV-----PHGPLDAPPGFEN-SVEVNDTRRREYAGMISAMDEAV 339

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G +  AL+ + M ++++ +F+SDNGG        A +N+P RG K +LWEG V+  G I+
Sbjct: 340 GNITNALKDKGMWNDTLFIFLSDNGGDM----FYAGNNYPFRGNKASLWEGSVKIPGFIY 395

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
             +LE +G    + +H +D   TLLSAA  +
Sbjct: 396 GNMLEKKGYSNNELIHFTDLFATLLSAAGGT 426


>gi|190891646|ref|YP_001978188.1| sulfatase [Rhizobium etli CIAT 652]
 gi|190696925|gb|ACE91010.1| putative sulfatase protein [Rhizobium etli CIAT 652]
          Length = 498

 Score =  239 bits (611), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 143/340 (42%), Positives = 188/340 (55%), Gaps = 24/340 (7%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           +A S  P+I+FI+ADDLGW DVGFHG D I TPNID LA  G  L+ +Y   +CTP+R+A
Sbjct: 50  LAQSTKPNIVFIVADDLGWKDVGFHGSD-IKTPNIDQLAEKGGRLEQFYAQPMCTPTRAA 108

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           +MTG++P   GMQ  V+      GL L + +LP+ LKE GY T   GKWHLG     + P
Sbjct: 109 LMTGRYPFRYGMQTAVIPQGGTYGLALDDYLLPEMLKEAGYATAASGKWHLGHADTAFWP 168

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
             RGF+S  G   G  D+F H SA     W  +     E  +D        +F  EAV +
Sbjct: 169 RQRGFDSFYGALLGEIDHFTHKSANGNADWYRNNEAIEEAGFD------NILFATEAVRV 222

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I+ H   +PLFLYLA       +P+ P QAP  YL+ + HI D  R  +AA++  +D+ +
Sbjct: 223 INEHDQSKPLFLYLAF-----TSPHTPFQAPKEYLDRNSHIADESRRAYAAMISVMDDGI 277

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN--------AASNWPLRGVKNTLWEGG 344
           G+VVEALE R M  N++I+F SDNGG    F            A N P R  K T +EGG
Sbjct: 278 GRVVEALETRGMRENTLILFHSDNGGVVDAFFAGESDVKGKLPADNGPYRQGKGTTYEGG 337

Query: 345 VRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            R  GLI W   +E     A+  +HV D  PTL + A  +
Sbjct: 338 TRVVGLINWPGKIEPGS--ADGMMHVVDMYPTLAALAGAT 375



 Score = 43.1 bits (100), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 47/112 (41%), Gaps = 27/112 (24%)

Query: 464 NGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQ---------------- 507
           NGP  +     YE GT      R+   IN  G  E  S D                    
Sbjct: 323 NGPYRQGKGTTYEGGT------RVVGLINWPGKIEPGSADGMMHVVDMYPTLAALAGATP 376

Query: 508 ---NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSI 556
                +DG+D+W +L   +PS R  I++N+  E   +A+ +G WKLV + ++
Sbjct: 377 GKNKPLDGMDMWPMLGEGKPSPRTEIVYNV--EPMTAAVRQGDWKLVWKTTL 426



 Score = 40.4 bits (93), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 28/42 (66%), Gaps = 2/42 (4%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
           +DG+D+W +L   +PS R  I++N+  E   +A+ +G WKLV
Sbjct: 382 LDGMDMWPMLGEGKPSPRTEIVYNV--EPMTAAVRQGDWKLV 421


>gi|429206655|ref|ZP_19197919.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
           AKP1]
 gi|428190241|gb|EKX58789.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
           AKP1]
          Length = 498

 Score =  239 bits (609), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 139/334 (41%), Positives = 194/334 (58%), Gaps = 29/334 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI++IL DDLG+ DVG+HG D + TPN+D LA  G  L  +YT  LCTP+R+A+MTG +
Sbjct: 53  PHILYILVDDLGYADVGYHGSD-VKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGCY 111

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P+  G+Q  V+    R GL  +E +LPQ LKE GY+T +VGKWHLG   ++Y P  RGF+
Sbjct: 112 PMRYGLQTGVIPSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGFD 171

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
              G   G  D+F H A  +  W  D     EP +D      T++F A+A+ +I  H + 
Sbjct: 172 YFYGPLVGEIDHFKHEAHGITDWYRDNEMVKEPGYD------TELFGADAIRLIEEHDSA 225

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PL++YL+  A H+     P QAPD Y +++  I D  R+ +AA++  +D+ VG V++AL
Sbjct: 226 TPLYMYLSFTAPHT-----PYQAPDKYKDLYPDIADEGRNAYAAMISCMDDQVGPVLQAL 280

Query: 300 EQRRMLSNSIIVFVSDNGGA----AAGFNLNAAS----NWPLRGVKNTLWEGGVRGAGLI 351
           E+R M  +++++F SDNGG      AG    A      N PLR  K TL+EGG R   L 
Sbjct: 281 ERRGMREDTLVIFHSDNGGTRSKMFAGEGAVAGELPPRNDPLREGKGTLYEGGTRVVALA 340

Query: 352 -WS---PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            W    P  E+ G+     +HV D LPTL   A 
Sbjct: 341 NWPGRIPAGETHGM-----MHVVDMLPTLAGLAQ 369



 Score = 39.7 bits (91), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 18/46 (39%), Positives = 28/46 (60%), Gaps = 2/46 (4%)

Query: 570 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKL 615
           ++  ++DG+DVW  +S  + S R  +++NI  E    AL  GKWKL
Sbjct: 373 AHAGQLDGMDVWQAISAGKASPREEVVYNI--EPTQGALRDGKWKL 416


>gi|426233825|ref|XP_004023235.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Ovis aries]
          Length = 475

 Score =  239 bits (609), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 119/262 (45%), Positives = 161/262 (61%), Gaps = 15/262 (5%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT RGF++
Sbjct: 52  IHTGLQHQIIWPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDT 111

Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           + GY  G +DY+ H       A  +    LD R   E A      YST+VFT  A  +I 
Sbjct: 112 YFGYLLGSEDYYSHERCTVIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTERATTLIT 171

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
            H  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A +   +DE+VG 
Sbjct: 172 THPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRRHYAGMASLMDEAVGN 226

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V  ALE R + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG G + SP
Sbjct: 227 VTAALESRGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASP 282

Query: 355 LLESRGIVAEQYVHVSDWLPTL 376
           LL+ +G+   + +H+SDWLP L
Sbjct: 283 LLKRKGVKTRELIHISDWLPPL 304



 Score = 43.1 bits (100), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1  MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
          +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 56 LQHQIIWPCQPSCIPLDEKLLPQLLKEAGYATHMVG 91



 Score = 39.3 bits (90), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 29/110 (26%), Positives = 49/110 (44%), Gaps = 17/110 (15%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEW-QIS-------ALTRGKWKLVKENSINGNGT 561
           +DG DVW+ +S   PS R  +LHNID ++  IS       AL +    L++ ++ N +  
Sbjct: 313 LDGFDVWNTISEGSPSPRMELLHNIDPDFVDISLCPGNSMALAKDDSSLLEYSAFNTSIH 372

Query: 562 SENRSNDNSYQNEIDGIDVW---------SVLSRNEPSKRNTILHNIDDE 602
           +  R  +        G   W         SV+  ++P  +   L +ID +
Sbjct: 373 AAIRHQNWKLLTGYPGCGHWFPPPSQHNVSVIPSSDPPAKTLWLFDIDQD 422


>gi|390341601|ref|XP_796347.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 497

 Score =  238 bits (608), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 117/277 (42%), Positives = 174/277 (62%), Gaps = 12/277 (4%)

Query: 57  SGPPHIIFILADDLGWNDVGFHGLDQ---IPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           +G P+++FILADD G+NDVG+HG +    + TPN+D LA  G+ L+ YY   +C+P+RS 
Sbjct: 32  TGRPNVVFILADDFGYNDVGYHGREHGSMVLTPNLDGLAGEGVKLEKYYVQPICSPTRSQ 91

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           +++G++ IHTG+Q+ V+   +   LPL E  LPQ LKE  Y T +VGKWH+GFYK   TP
Sbjct: 92  LLSGRYQIHTGLQYGVIRPAQPHCLPLDEVTLPQKLKERDYATHMVGKWHIGFYKDACTP 151

Query: 174 TFRGFESHLGYWTGHQDYFDHSAE----EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           T RGF+S+ GY +G +DY+ HS         + GLD+  +  PA+   G+YST +FT++A
Sbjct: 152 TERGFDSYFGYLSGAEDYYSHSRSFQIGSKTLKGLDLMANKTPAFQYKGQYSTHLFTSKA 211

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
           +D+I+NH   +PLFLYLA+ A HS     PLQ P  Y   + +I    R  +A ++  +D
Sbjct: 212 IDVINNHERSKPLFLYLAYQAVHS-----PLQVPSKYEEPYANITSSARRAYAGMVSCMD 266

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN 326
           E +G V  AL    + +N+II+F +    +  G N+ 
Sbjct: 267 EGIGNVTRALVDAGLYNNTIIIFSTGMSSSLVGKNMT 303


>gi|221640917|ref|YP_002527179.1| twin-arginine translocation pathway signal protein [Rhodobacter
           sphaeroides KD131]
 gi|221161698|gb|ACM02678.1| Twin-arginine translocation pathway signal [Rhodobacter sphaeroides
           KD131]
          Length = 509

 Score =  238 bits (606), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 138/334 (41%), Positives = 193/334 (57%), Gaps = 29/334 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI++IL DDLG+ DVG+HG D + TPN+D LA  G  L  +YT  LCTP+R+A+MTG++
Sbjct: 64  PHILYILVDDLGYADVGYHGSD-VKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGRY 122

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P+  G+Q  V+    R GL  +E +LPQ LKE GY+T +VGKWHLG   ++Y P  RG +
Sbjct: 123 PMRYGLQTGVIPSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGVD 182

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
              G   G  D+F H A  +  W  D     EP +D      T++F A+A+ +I  H + 
Sbjct: 183 YFYGPLVGEIDHFKHEAHGITDWYRDNEMVKEPGYD------TELFGADAIRLIEEHDSA 236

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PL++YL+  A H+     P QAPD Y +++  I D  R  +AA++  +D+ VG V++AL
Sbjct: 237 TPLYMYLSFTAPHT-----PYQAPDKYKDLYPDIADEGRKAYAAMISCMDDQVGLVLQAL 291

Query: 300 EQRRMLSNSIIVFVSDNGGA----AAGFNLNAAS----NWPLRGVKNTLWEGGVRGAGLI 351
           E+R M  +++++F SDNGG      AG    A      N PLR  K TL+EGG R   L 
Sbjct: 292 ERRGMREDTLVIFHSDNGGTRSKMFAGEGAVAGELPPRNDPLREGKGTLYEGGTRVVALA 351

Query: 352 -WS---PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            W    P  E+ G+     +HV D LPTL   A 
Sbjct: 352 NWPGRIPAGETHGM-----MHVVDMLPTLAGLAQ 380



 Score = 39.7 bits (91), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 18/46 (39%), Positives = 28/46 (60%), Gaps = 2/46 (4%)

Query: 570 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKL 615
           ++  ++DG+DVW  +S  + S R  +++NI  E    AL  GKWKL
Sbjct: 384 AHAGQLDGMDVWQAISAGKASPREEVVYNI--EPTQGALRDGKWKL 427


>gi|355750020|gb|EHH54358.1| Arylsulfatase B, partial [Macaca fascicularis]
          Length = 430

 Score =  237 bits (604), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 118/267 (44%), Positives = 164/267 (61%), Gaps = 15/267 (5%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT RGF++
Sbjct: 2   IRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 61

Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           + GY  G +DY+ H       A  +    LD R   E A      YST++FT  A  +I 
Sbjct: 62  YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALIT 121

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
           NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  +DE+VG 
Sbjct: 122 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGN 176

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG G + SP
Sbjct: 177 VTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASP 232

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAAN 381
           LL+ +G+   + +H+SDWLPTL+  A 
Sbjct: 233 LLKQKGVKNRELIHISDWLPTLVKLAG 259



 Score = 42.7 bits (99), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1  MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
          +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 6  LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 41


>gi|355691424|gb|EHH26609.1| Arylsulfatase B, partial [Macaca mulatta]
          Length = 430

 Score =  237 bits (604), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 118/267 (44%), Positives = 164/267 (61%), Gaps = 15/267 (5%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT RGF++
Sbjct: 2   IRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 61

Query: 181 HLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           + GY  G +DY+ H       A  +    LD R   E A      YST++FT  A  +I 
Sbjct: 62  YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALIT 121

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
           NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  +DE+VG 
Sbjct: 122 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGN 176

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG G + SP
Sbjct: 177 VTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRGVGFVASP 232

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAAN 381
           LL+ +G+   + +H+SDWLPTL+  A 
Sbjct: 233 LLKQKGVKNRELIHISDWLPTLVKLAG 259



 Score = 42.7 bits (99), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1  MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
          +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 6  LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 41


>gi|219110117|ref|XP_002176810.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411345|gb|EEC51273.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 564

 Score =  237 bits (604), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 141/345 (40%), Positives = 190/345 (55%), Gaps = 37/345 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+ I+ DDLG +D+G H    I TP+ D LA  G+ L  YY +  C+P+R+++++G++
Sbjct: 64  PHILMIIMDDLGSHDLGIHENSGIQTPHADQLARDGLYLDQYYVLPYCSPTRASLLSGRY 123

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P+HTG  H ++   E  GLPL E+ LPQ L+  GY+   VGKWH+G  +   TPTFRGF+
Sbjct: 124 PLHTGC-HTIVNDWETQGLPLDEETLPQVLRRAGYQAHAVGKWHVGHSRWTQTPTFRGFQ 182

Query: 180 SHLGYWTGHQDYFDH-------SAEEMKMWGLDMR----RDLEPAWDLHGKYSTDVFTAE 228
           S  G++ G QDY  H       +A EM     D R    RD     D  G YST VFT E
Sbjct: 183 SFFGFYLGAQDYNTHIKQGERGNAYEMHW---DARGKCGRDCSRLVDERGNYSTHVFTRE 239

Query: 229 AVDIIHNHST--DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE----DFKRSKFA 282
           A+ +I NH     EPLFLYLAH A H      P Q P+ Y   +          +R  +A
Sbjct: 240 AIRVIENHPQRPHEPLFLYLAHQAVH-----WPDQVPETYRKFYEGATYSNWTDQRKTYA 294

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
            +L   DES+G V +AL+   M  N+++VF +DNGG  A      +SN+P RG K T++E
Sbjct: 295 GMLSAADESIGNVTKALQDAGMWENTLVVFTTDNGGPTAVCAAQGSSNYPKRGGKCTVYE 354

Query: 343 GGVRGAGLI----WSPLLESR----GIVAEQY---VHVSDWLPTL 376
           GG  G G +    W+ +  SR        E Y    HV DWLPTL
Sbjct: 355 GGTTGDGFVSGPAWNKVARSRKKEYSETLELYSKVFHVVDWLPTL 399


>gi|241834402|ref|XP_002414990.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215509202|gb|EEC18655.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 425

 Score =  236 bits (603), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 122/279 (43%), Positives = 163/279 (58%), Gaps = 9/279 (3%)

Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
           + LCTPSRSA+MTG +PIH GMQ  V+      GLPL  KILPQ+ K+LGY   ++GKWH
Sbjct: 1   MPLCTPSRSALMTGLYPIHNGMQSTVILPAAPWGLPLENKILPQHFKDLGYDVNMIGKWH 60

Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM--KMWGLDMRRDLEPAWDLHGKYS 221
           LGF++  Y P  RGF++  G +TG  DY++H++  +  K   +     L      HG   
Sbjct: 61  LGFFQTPYVPMKRGFDTFFGLYTGSNDYYNHTSGSVSKKQEQITGALCLTVGGITHG--V 118

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
             + T   ++        +P F Y +H A H A   E  QAP   +    +IE+  R+ +
Sbjct: 119 EPLLTDSPLNFY-----PQPFFCYFSHQAVHRALDSETFQAPARNVLKFPYIEESNRTIY 173

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
           A +L  LDES+G+VVEAL    ML ++IIVF SDNGG   G   N   NWPLRG K T+W
Sbjct: 174 AGMLDALDESMGRVVEALSSAGMLEDTIIVFSSDNGGGPYGIESNRGFNWPLRGAKFTVW 233

Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           EGGVR    +WSP    +  V+ Q +H+SDWLPTL +AA
Sbjct: 234 EGGVRVPAFVWSPKFLKKSRVSNQLMHISDWLPTLYTAA 272



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 45/148 (30%), Positives = 61/148 (41%), Gaps = 30/148 (20%)

Query: 568 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVD 627
           D S    IDG+D+W+ L+    SKR  IL N +  + IS L   ++KLV V      ++D
Sbjct: 275 DASALGTIDGVDMWNSLTHRSASKRKEILLNSNSLYNISGLRYKQYKLV-VGGGFDGELD 333

Query: 628 LTGGPDQVYLSG----------LSDREWLALAMRKLRDAASIQCGPVK-----EVPCE-- 670
                D  Y  G          L +    A  +RK  +       P+       V C   
Sbjct: 334 -----DHYYFPGGTRPTNDIDQLRNESTTARVLRKFYEKQDRSWSPLAWSNNVAVDCRRN 388

Query: 671 -------PQIAPCLFDIKNDPCEKNNLA 691
                  P   P LF+I+ DPCE NNLA
Sbjct: 389 KLTENFVPMQPPYLFNIEKDPCELNNLA 416



 Score = 40.4 bits (93), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 30/56 (53%)

Query: 503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
           D S    IDG+D+W+ L+    SKR  IL N +  + IS L   ++KLV     +G
Sbjct: 275 DASALGTIDGVDMWNSLTHRSASKRKEILLNSNSLYNISGLRYKQYKLVVGGGFDG 330


>gi|405379584|ref|ZP_11033433.1| arylsulfatase A family protein [Rhizobium sp. CF142]
 gi|397323967|gb|EJJ28356.1| arylsulfatase A family protein [Rhizobium sp. CF142]
          Length = 502

 Score =  236 bits (603), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 143/332 (43%), Positives = 184/332 (55%), Gaps = 24/332 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I++ILADDLGW DVGFHG D I TPNID LA  G  L+ YY   +CTP+R+A MTG++
Sbjct: 60  PNIVYILADDLGWKDVGFHGSD-IKTPNIDELAEKGARLEQYYVQPMCTPTRAAFMTGRY 118

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P   GMQ  V+      GL L + +LP+ LKE GY T   GKWHLG  K  + P  RGF+
Sbjct: 119 PFRYGMQTAVIPQGGTYGLALDDHLLPELLKEAGYATAASGKWHLGHAKTAFWPRQRGFD 178

Query: 180 SHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           S  G   G  D+F H SA     W  +     E  +D        +F  EAV +I+ H  
Sbjct: 179 SFYGALLGEIDHFTHKSANGNADWYRNNDALEEEGFD------NVLFADEAVRVINEHDQ 232

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
            +PLFLYLA       +P+ P QAP  +L  + HI D  R  +AA++  +D+ +G+VVEA
Sbjct: 233 AKPLFLYLAF-----TSPHTPFQAPKEFLERNSHIADESRRNYAAMISVMDDGIGRVVEA 287

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGF-----NLNA---ASNWPLRGVKNTLWEGGVRGAGL 350
           LE R M  N++IVF SDNGG    F     N+     A N P R  K T +EGG R  GL
Sbjct: 288 LEMRGMRENTLIVFHSDNGGVVDAFFAGESNVKGKLPADNGPYREGKGTTYEGGTRVVGL 347

Query: 351 I-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           + W   +E     A   +HV D  PTL + A 
Sbjct: 348 VNWPGKIEPG--TANGMMHVVDMYPTLAALAG 377



 Score = 43.1 bits (100), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 15/106 (14%)

Query: 464 NGPKNENTNPRYENGTHEYNIPRLENSI---NGNG---------TSENRSNDNSYQNE-I 510
           NGP  E     YE GT    +      I     NG         T    +     +N+ +
Sbjct: 327 NGPYREGKGTTYEGGTRVVGLVNWPGKIEPGTANGMMHVVDMYPTLAALAGAELGKNKPL 386

Query: 511 DGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSI 556
           DG+D+W VL   +PS R  I++NI  E   +A+ +G WKLV + ++
Sbjct: 387 DGMDMWPVLGEGKPSPRTEIVYNI--EPMTAAVRQGDWKLVWKTTL 430



 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 44/94 (46%), Gaps = 23/94 (24%)

Query: 523 EPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWS 582
           EP   N ++H +D    ++AL              G    +N+         +DG+D+W 
Sbjct: 355 EPGTANGMMHVVDMYPTLAALA-------------GAELGKNKP--------LDGMDMWP 393

Query: 583 VLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
           VL   +PS R  I++NI  E   +A+ +G WKLV
Sbjct: 394 VLGEGKPSPRTEIVYNI--EPMTAAVRQGDWKLV 425


>gi|313236221|emb|CBY11544.1| unnamed protein product [Oikopleura dioica]
          Length = 511

 Score =  236 bits (602), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 128/336 (38%), Positives = 189/336 (56%), Gaps = 22/336 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+ I+ADDLG +DV  HG D I TPN+D LA  G++L NYY   +C+P+R ++MTG++
Sbjct: 20  PNIVVIVADDLGKHDVSMHGAD-IYTPNLDMLARDGVLLNNYYVQPVCSPTRGSLMTGRY 78

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P   G+QH  L G    GLPL E I+PQY+KE GY+T +VGKW LGF+K  Y P  RGF+
Sbjct: 79  PYRLGLQHENLVGYRPAGLPLDEYIMPQYMKECGYKTYMVGKWQLGFFKDNYLPWKRGFD 138

Query: 180 SHLGYWTGHQDYFDHSA-----EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
              G   G QDY+             + G D+R +  P  D   KY   ++  +A +   
Sbjct: 139 EFFGQLLGGQDYYSRRKCLKLRNYGNLCGYDLRTEQGPVRDTSMKYQPFLYADKAREKFF 198

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK--FAAILHKLDESV 292
            H+  +PL++Y+A  + H      PLQAP  Y++++         +  FA ++  LD +V
Sbjct: 199 AHNKTDPLYMYVAFQSVH-----RPLQAPQQYVSLYNDNVQMSNGQRVFAGMVTALDAAV 253

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G++V+ L+   + +N++I F +DNG           S++ L+G KN  W+GG+ G G   
Sbjct: 254 GEIVDGLKDSGLWNNTLIYFSNDNG-------REDDSDFVLKGGKNDFWQGGITGYGFFS 306

Query: 353 SPLL--ESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
            P +  + RG       HVSD LPT+LSAA    +P
Sbjct: 307 GPAIPRKMRGKKMNDLTHVSDVLPTMLSAAKCHTLP 342



 Score = 42.7 bits (99), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH  L G    GLPL E I+PQY+KE GY+T ++ 
Sbjct: 84  LQHENLVGYRPAGLPLDEYIMPQYMKECGYKTYMVG 119


>gi|323452769|gb|EGB08642.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 1517

 Score =  236 bits (602), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 140/353 (39%), Positives = 194/353 (54%), Gaps = 44/353 (12%)

Query: 60  PHIIFILADDLGWNDVGFHGLD-QIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PH++F L DDLG++DVG+   D  + TP+IDALA  G+ L  YY+   CTP+R A++TG 
Sbjct: 71  PHVVFALMDDLGFDDVGYGNADGAVATPHIDALAKEGVTLSRYYSAFSCTPARGALLTGL 130

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
            P   G+QH  ++  +  GLP    ILPQ+L +LGYR+ +VGKWHLG +  E  PT RGF
Sbjct: 131 SPHRLGLQHGQVFPEQPWGLPSKFSILPQHLAKLGYRSHLVGKWHLGHFSAERLPTARGF 190

Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWG-------------------------LDMRRDLEP 212
           +S  G   G Q Y  H  A + K+ G                          D+R + + 
Sbjct: 191 DSFFGGLDGAQYYATHIDAMDCKLPGDVLYRGFEVGDYDSLKAVTAEHGCYFDLRENNDR 250

Query: 213 AWDLHGKYSTDVFTAEAVDIIHNHS-----TDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
             DL G YST +F  +A ++I  HS      ++PLFL L+  A H+     P+ AP+   
Sbjct: 251 VEDLFGSYSTQLFGRKAEELIDAHSKRADAAEKPLFLLLSFNAVHA-----PVWAPEDTY 305

Query: 268 NIH---RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN 324
             H    ++ +  R KFAA L  +D++VG VV ALE+ RM +NSI++F SDNG       
Sbjct: 306 ETHPDLLNVTNGNRRKFAAALRLVDDAVGNVVAALEEARMDANSILIFASDNGANPE--- 362

Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
            +  SN PLRG K  L+EGGVR    + +P    RG   +   HV+DW+PTLL
Sbjct: 363 -HGGSNAPLRGSKGYLFEGGVRVPAFVRAPKYLPRGATYDHPFHVTDWVPTLL 414


>gi|313242955|emb|CBY39683.1| unnamed protein product [Oikopleura dioica]
          Length = 581

 Score =  236 bits (601), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 131/336 (38%), Positives = 194/336 (57%), Gaps = 21/336 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PH+IF+L DDLG++D+G+   D I +PNIDALA   + LK +Y    CTPSR+A +TG++
Sbjct: 24  PHVIFVLVDDLGFDDLGYVNRDVI-SPNIDALAKDALHLKKHYVQPSCTPSRAAFLTGRY 82

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            I  GMQ  V+   E  G+PL E +L +  K+ GYRT + GKWHLGFY  ++ P  RGF+
Sbjct: 83  NIRMGMQSGVIRATEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQIRGFD 142

Query: 180 SHLGYWTGHQDYFDHSAEEMKMW--GLDMRRDLEPAWDLH--GKYSTDVFTAEAVDIIHN 235
              G++ G QD++ H +  +K +    D+  D     DLH  G YST +F  + ++ +  
Sbjct: 143 RFYGFYLGSQDFYFHDSGRLKAYPGNGDVENDTILD-DLHTNGTYSTKLFVDDFINDLAK 201

Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED---FKRSKFAAILHKLDESV 292
           H    PLF Y++    H      PLQ  + +   + +  D    +R   +  +  LD  +
Sbjct: 202 HDPAVPLFNYVSFQDVHG-----PLQTVNKFKKHYENKTDTWTHERILISTKITTLDHHI 256

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           GK+V AL+++   +N+++VF SDNGG         ASNWPLRG K T+++GG++    I 
Sbjct: 257 GKMVTALKEKNYWNNTVLVFTSDNGGQPK----EGASNWPLRGSKGTIYDGGLKSRAFIA 312

Query: 353 SPLLESR--GIVAEQYVHVSDWLPTLLSAANKSDIP 386
           SPLL++R  G       HVSDW PTLL+ +   D+P
Sbjct: 313 SPLLQNRMKGQNYNYLFHVSDWFPTLLTLSG-CDVP 347


>gi|260788444|ref|XP_002589260.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
 gi|229274435|gb|EEN45271.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
          Length = 455

 Score =  236 bits (601), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 122/260 (46%), Positives = 169/260 (65%), Gaps = 14/260 (5%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+FILADD GWND+G+H    I TPN+D LA  G+ L+NYY   +C+PSR  +MTG++
Sbjct: 1   PNIVFILADDYGWNDIGYHN-SFIRTPNLDRLASEGVKLENYYVQPICSPSREQLMTGRY 59

Query: 120 PIHTGMQHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
            IH G+QH+V+  C+R  GLPL E  LPQ LKE GY T +VGKWHLGF++KEY P  RGF
Sbjct: 60  QIHYGLQHSVIM-CDRPHGLPLDEVTLPQRLKENGYSTYMVGKWHLGFFRKEYMPLQRGF 118

Query: 179 ESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           E   GY TG +DY+ H      S +  +  GLD+R   +P  D +G YST +F  +A+++
Sbjct: 119 ERFFGYLTGGEDYWTHRKPSQFSKDPSEFHGLDLRDQDKPVLDQNGTYSTHLFARKAIEM 178

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I NH   +P+FLYL   A H      PL+AP+ Y  I+  I++     +AA++  +DE+V
Sbjct: 179 ILNHDQSKPMFLYLPFQAVHG-----PLEAPEEYKRIYEDIDNSLVRTYAAMVTVMDEAV 233

Query: 293 GKVVEALEQRRMLSNSIIVF 312
           G V  AL+ R M  N+I++F
Sbjct: 234 GNVTGALKHRGMWRNTILIF 253


>gi|168701793|ref|ZP_02734070.1| twin-arginine translocation pathway signal precursor [Gemmata
           obscuriglobus UQM 2246]
          Length = 459

 Score =  236 bits (601), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 131/336 (38%), Positives = 189/336 (56%), Gaps = 15/336 (4%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L F L+   V    +   P ++F+LADDLG  D GF G  +I TPNID +A +G  L  +
Sbjct: 7   LLFLLAPASVARAEAPAQPSVVFLLADDLGREDCGFMGGKEIKTPNIDKIAAAGATLDAF 66

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y   +C+P+R+A MTG++P+  G+Q  V+    + GLPL E+ + Q LK+ GY T ++GK
Sbjct: 67  YAQPVCSPTRAAFMTGRYPMRHGLQVGVVRPWAQYGLPLDERTVAQGLKDAGYTTAVIGK 126

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           WHLG +  EY PT RGF+   G++ G  DYF H    ++  G D  RD +   D    YS
Sbjct: 127 WHLGHFAPEYLPTKRGFDHQYGHYNGALDYFTH----IRDGGFDWHRDDKVNSDE--GYS 180

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
           T +   +AV  +  H+  +P FLY+   A H+     P Q P+ YL  + H++  +R K+
Sbjct: 181 THLVAKDAVQFVQTHAGKKPFFLYVPFNAVHA-----PHQVPEEYLKPYGHLKG-ERQKY 234

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
           A +L  +DE+VG++V A+E+  +  N++ VF+SDNGG   G      SN   R  K TL+
Sbjct: 235 AGMLAAMDEAVGQIVAAVEKAGVRKNTLFVFLSDNGGPQPGV---VTSNGKFRAGKGTLY 291

Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
           EGGVR A           G    + +HV DW PTLL
Sbjct: 292 EGGVRVAACAAFDGHIRPGTTITEPLHVVDWYPTLL 327


>gi|313215020|emb|CBY41206.1| unnamed protein product [Oikopleura dioica]
          Length = 427

 Score =  235 bits (600), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 128/336 (38%), Positives = 189/336 (56%), Gaps = 22/336 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+ I+ADDLG +DV  HG D I TPN+D LA  G++L NYY   +C+P+R ++MTG++
Sbjct: 20  PNIVVIVADDLGKHDVSMHGAD-IYTPNLDMLARDGVLLNNYYVQPVCSPTRGSLMTGRY 78

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P   G+QH  L G    GLPL E I+PQY+KE GY+T +VGKW LGF+K  Y P  RGF+
Sbjct: 79  PYRLGLQHENLVGYRPAGLPLDEYIMPQYMKECGYKTYMVGKWQLGFFKDNYLPWKRGFD 138

Query: 180 SHLGYWTGHQDYFDHSA-----EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
              G   G QDY+             + G D+R +  P  D   KY   ++  +A +   
Sbjct: 139 EFFGQLLGGQDYYSRRKCLKLRNYGNLCGYDLRTEQGPVRDTSMKYQPFLYADKAREKFF 198

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK--FAAILHKLDESV 292
            H+  +PL++Y+A  + H      PLQAP  Y++++         +  FA ++  LD +V
Sbjct: 199 AHNKTDPLYMYVAFQSVH-----RPLQAPQQYVSLYNDNVQMSNGQRVFAGMVTALDAAV 253

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G++V+ L+   + +N++I F +DNG           S++ L+G KN  W+GG+ G G   
Sbjct: 254 GEIVDGLKDSGLWNNTLIYFSNDNG-------REDDSDFVLKGGKNDFWQGGITGYGFFS 306

Query: 353 SPLL--ESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
            P +  + RG       HVSD LPT+LSAA    +P
Sbjct: 307 GPAIPRKMRGKKMNDLTHVSDVLPTMLSAAKCHTLP 342



 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH  L G    GLPL E I+PQY+KE GY+T ++ 
Sbjct: 84  LQHENLVGYRPAGLPLDEYIMPQYMKECGYKTYMVG 119


>gi|291236518|ref|XP_002738186.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 473

 Score =  235 bits (600), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 137/354 (38%), Positives = 199/354 (56%), Gaps = 43/354 (12%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           M   V      +S+V  DL   +  PHI+FILADDLGW+DVG+H   +I TPNID LA  
Sbjct: 1   MYVCVFLGILAISVVHADL--GTLRPHILFILADDLGWHDVGYHD-SEIQTPNIDMLAAE 57

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           G+ L+NYY   LCTPSR+ +MTG++ IH+GMQH VL       LP  E +LPQ LK+ GY
Sbjct: 58  GVKLENYYVTPLCTPSRAVLMTGRYLIHSGMQHGVLVAQNPRCLPTDEILLPQMLKDSGY 117

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
            T +VGKWHLGF K + TP  RGF++  G+         ++A  +  +    RR      
Sbjct: 118 STHMVGKWHLGFCKFQCTPNHRGFDTFFGW---------YNAAVISPFISCRRR------ 162

Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-I 273
                       +   +   +H+T  P+F+YL   A H+     PLQ PD Y ++++  I
Sbjct: 163 ------------SAGCEFRVSHTT-VPMFIYLPFQAVHA-----PLQVPDKYSDMYKDTI 204

Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
            D  R  +A +   +DE++G +   L+ + + +N++IVF +DNGGA        ASN+PL
Sbjct: 205 HDESRRTYAGMTSCMDEAIGNITRTLKDKGIWNNTVIVFSTDNGGART----FGASNYPL 260

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKSDI 385
           RG K + +EGG+RG   + SPLL+   RG +  + +++ DW PT ++ A    I
Sbjct: 261 RGQKASNFEGGIRGPAFVSSPLLDPLVRGTINNELMYIGDWFPTFMNLAGGHVI 314


>gi|421593685|ref|ZP_16038213.1| sulfatase [Rhizobium sp. Pop5]
 gi|403700318|gb|EJZ17522.1| sulfatase [Rhizobium sp. Pop5]
          Length = 497

 Score =  235 bits (599), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 142/336 (42%), Positives = 183/336 (54%), Gaps = 32/336 (9%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           A    P+I++I++DD GW DVG+HG D I TPNID LA  G  L+ +Y   +CTPSR+A 
Sbjct: 50  AQGAAPNIVYIISDDSGWKDVGYHGSD-IRTPNIDRLAAEGARLEQFYVQPMCTPSRAAF 108

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           MTG++P   G+Q  V+      GL L E  LPQ LK+ GY T + GKWHLG  K  Y P 
Sbjct: 109 MTGRYPFRYGLQTAVIPQSGTYGLALDEYPLPQVLKDAGYYTAMSGKWHLGHSKTAYWPR 168

Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK------YSTDVFTAE 228
            RGF+S  G   G  D+F H A            +  P W  + K      Y   +  AE
Sbjct: 169 QRGFDSFYGALLGEIDHFTHKAA-----------NGNPDWYRNNKALKEEGYDNILIGAE 217

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
           AV +I+ H   +PLFLYLA  A H+     P QAP  YL+ + HI D  R K+AA++  +
Sbjct: 218 AVRVINKHDQQKPLFLYLAFTAPHT-----PYQAPKEYLDRNSHIADESRRKYAAMISVM 272

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN--------AASNWPLRGVKNTL 340
           D+ +GKVV ALE+R M  N++IVF SDNGG    F            A N P R  K T 
Sbjct: 273 DDEIGKVVAALEKRGMRDNTLIVFHSDNGGVVNAFFAGESEVKGKLPADNGPYREGKGTT 332

Query: 341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           +EGG R AGL+  P     G + +  +HV D  PTL
Sbjct: 333 YEGGTRVAGLVNWPGKIKPGTI-DGPMHVVDMYPTL 367



 Score = 41.2 bits (95), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 44/107 (41%), Gaps = 27/107 (25%)

Query: 464 NGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQ---------------- 507
           NGP  E     YE GT      R+   +N  G  +  + D                    
Sbjct: 322 NGPYREGKGTTYEGGT------RVAGLVNWPGKIKPGTIDGPMHVVDMYPTLAGLVGATL 375

Query: 508 ---NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLV 551
                +DG+D+W  L+  +PS R  I++NI  E   +A+ +G WKLV
Sbjct: 376 GKNKPLDGMDMWPTLAEGKPSPRTEIVYNI--EPMTAAVRQGDWKLV 420



 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 18/42 (42%), Positives = 28/42 (66%), Gaps = 2/42 (4%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
           +DG+D+W  L+  +PS R  I++NI  E   +A+ +G WKLV
Sbjct: 381 LDGMDMWPTLAEGKPSPRTEIVYNI--EPMTAAVRQGDWKLV 420


>gi|354473078|ref|XP_003498763.1| PREDICTED: arylsulfatase B [Cricetulus griseus]
          Length = 460

 Score =  234 bits (597), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 116/269 (43%), Positives = 167/269 (62%), Gaps = 15/269 (5%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH ++  C+   +PL EK+LPQ LKE GY T +VGKWHLG  +KE  PT RGF++
Sbjct: 32  IHTGLQHLLIRVCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMCQKECLPTRRGFDT 91

Query: 181 HLGYWTGHQDYFDHSAEEM------KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           + GY  G ++Y+ H    +          LD R   EPA   +  YST +F+  A  +I 
Sbjct: 92  YFGYLLGSENYYTHEVCALIEPLNVTRCALDFRDGEEPATKYNNTYSTHIFSQRATTLIA 151

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
           NH  ++PLFLYLA  + H     +PLQ P+ Y+  +  I+D  R  +A ++  +DE+VG 
Sbjct: 152 NHPPEKPLFLYLALQSVH-----DPLQVPEEYMKPYDFIQDKHRRIYAGMVSLMDEAVGN 206

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           V  AL+   + +N++ VF +DNGG      L+  +NWPLRG K TLWEGG+RG G + SP
Sbjct: 207 VTAALKSHGLWNNTVFVFSTDNGGQ----TLSGGNNWPLRGRKWTLWEGGIRGVGFVASP 262

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           LL+ +G+ + + +H+SDWLPTL++ A  S
Sbjct: 263 LLKQKGVKSRELIHISDWLPTLVNLAGGS 291


>gi|443706067|gb|ELU02328.1| hypothetical protein CAPTEDRAFT_179702 [Capitella teleta]
          Length = 501

 Score =  234 bits (596), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 129/346 (37%), Positives = 199/346 (57%), Gaps = 19/346 (5%)

Query: 40  LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           LP+   L + F   + +S  P+ ++I ADDLG++D+G    D + TP +D LA  G+  K
Sbjct: 3   LPILIAL-VGFFQGIHTSKQPNFVWIFADDLGYHDIGLRNPD-LHTPTLDKLATKGVQFK 60

Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
           N Y +  C+PSR + M+G++P  + MQ +V++       PL  K LP+YLKELGY T  V
Sbjct: 61  NNYVMHACSPSRHSFMSGRYPFTSQMQKDVIFPVSPDCSPLKLKFLPEYLKELGYGTHAV 120

Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
           GKWHLG+ +++  PT RGF+S  G   G  DY  H +     W  +   D   +    G+
Sbjct: 121 GKWHLGYCREDCMPTSRGFDSFYGTLDGEGDYLTHMSAGFYDWHTNGTVDRSKS----GQ 176

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA--PDHYLNIHRHIEDFK 277
           +S D+ TA   DII   + ++P FLY A     + NP+ PLQA   D     ++++ D  
Sbjct: 177 HSQDLHTAALADIIERQTEEKPFFLYFA-----AQNPHTPLQAKKKDMKKYKNQNLSDTL 231

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           + K+  ++  LD  V  + + LE++ +L+N+ I+F++DNG      ++   SN P RG K
Sbjct: 232 K-KYYGLITGLDGMVKTIKDTLEKKNLLNNTYIIFLADNGA-----DVGIGSNEPFRGTK 285

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           +TLWEGGV+    I+SPLL+  G   +  +H++DW+PTL+S A  S
Sbjct: 286 STLWEGGVKSLSFIYSPLLKKTGYENDGLIHITDWMPTLVSLAGGS 331


>gi|344250866|gb|EGW06970.1| Arylsulfatase I [Cricetulus griseus]
          Length = 484

 Score =  234 bits (596), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 119/264 (45%), Positives = 172/264 (65%), Gaps = 7/264 (2%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
            A S PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 43  AAPSQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQ 101

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGF++KE  P
Sbjct: 102 LLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKECLP 161

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           + RGF++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I
Sbjct: 162 SCRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGETVAWGLSGQYSTMLYAQRASHI 221

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           + +HS   PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V
Sbjct: 222 LASHSPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAV 276

Query: 293 GKVVEALEQRRMLSNSIIVFVSDN 316
             +  AL++    +NS+I+F SDN
Sbjct: 277 RNITWALKRYGFYNNSVIIFSSDN 300


>gi|323454531|gb|EGB10401.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 530

 Score =  234 bits (596), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 132/347 (38%), Positives = 193/347 (55%), Gaps = 29/347 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIP-TPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PHI+++L DD+GWND+G+   D    TP +  LA +G+ L  YYT   CT SR+A+++G 
Sbjct: 27  PHIVYVLFDDMGWNDIGYQSTDMAALTPVLSDLAENGVKLTQYYTQSTCTVSRAALLSGV 86

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
            P+H G+ H  +      GLPL  K+LPQYL+E GYRT +VGKW +G + +EY P  RGF
Sbjct: 87  LPMHNGISHGTIVMDSPIGLPLKYKLLPQYLQESGYRTYMVGKWDIGHFNEEYLPHNRGF 146

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWG----LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           +   G++     YF H +            D+R + E   +   +Y+TD+F   AV  + 
Sbjct: 147 DHFFGFYGADITYFSHISSRGYCANPNCFPDLRNEDETMANASMRYTTDLFRERAVGFVE 206

Query: 235 ----NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL-NIHRHIEDF---KRSKFAAILH 286
               NH+TD PLFLYL+  A     P+ P  AP  ++ N    +  F   +R  FAA+++
Sbjct: 207 GHAANHATD-PLFLYLSFNA-----PHYPTSAPQEFMRNEAELLAPFTNRERRVFAAVVN 260

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL-------NAASNWPLRGVKNT 339
             D SVG+VVEAL+     ++S+I+F SDNG      N        N  SNWPLRG+K T
Sbjct: 261 YADRSVGRVVEALKATGAYNDSVILFASDNGAVPTTCNQSHCTSESNTGSNWPLRGMKAT 320

Query: 340 LWEGGVRGAGLIWSPLL---ESRGIVAEQYVHVSDWLPTLLSAANKS 383
            WEGG R    + +P      + G + +  VH++DW+PT++    +S
Sbjct: 321 YWEGGCRVPAFVHAPKYLGDRASGSLYQGIVHITDWIPTIVRGVLRS 367


>gi|313219945|emb|CBY43646.1| unnamed protein product [Oikopleura dioica]
          Length = 1683

 Score =  233 bits (595), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 113/283 (39%), Positives = 172/283 (60%), Gaps = 17/283 (6%)

Query: 98   LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
            L++YY   +CTP+RS +M+G++ IHTG+QH V+ G +  GLPL++ + P+  K+ GYRT 
Sbjct: 1233 LQSYYVNPICTPTRSVLMSGRYQIHTGLQHAVILGAQPNGLPLTDPVQPEIFKDCGYRTH 1292

Query: 158  IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH 217
            +VGKWHLGFY ++  P  RGFESH GY  G + +++HS       G+D R          
Sbjct: 1293 MVGKWHLGFYDEKMVPENRGFESHYGYLIGAEGHYNHSQFMQGQHGVDFRDGGASTNSSW 1352

Query: 218  GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
            G+YS D+F     D++  H  +E L++Y+     H      PL+AP HY++    I+D  
Sbjct: 1353 GQYSADLFVKRVEDLVEAHDVEESLYMYVGLQNVHY-----PLEAPQHYVDQFSWIKDRD 1407

Query: 278  RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
            R  +AA+   +D++VG+++ + + + +  ++I+ F +DNGG+     L   +NWPLRG+K
Sbjct: 1408 RRVYAAMTKSMDDTVGRIINSFKNKGIWEDTIVYFTTDNGGST----LYGGNNWPLRGLK 1463

Query: 338  NTLWEGGVRGAGLIWSPLLESRGIVA---EQYVHVSDWLPTLL 377
            NTLWEGG+RG G I  P     G+ A   +Q +HV D  PTLL
Sbjct: 1464 NTLWEGGIRGIGSIKIP-----GVSADKRDQLMHVVDMFPTLL 1501


>gi|322778941|gb|EFZ09355.1| hypothetical protein SINV_05168 [Solenopsis invicta]
          Length = 775

 Score =  233 bits (594), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 118/272 (43%), Positives = 179/272 (65%), Gaps = 14/272 (5%)

Query: 124 GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG 183
           GMQ   L G ER G+PL+  +LP+YL+ LGY T +VGKWH+G++ K + PT RGF++ LG
Sbjct: 300 GMQGYPLRGAERRGIPLNNTLLPEYLRRLGYTTHLVGKWHVGYHTKNFGPTRRGFDTFLG 359

Query: 184 YWTGHQDYFDHSAEEMKMWGLDMRR------DLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           Y+TG   YF+H+  E    G D+ R       +E  +D    Y TD+ T E   II +H+
Sbjct: 360 YYTGMIQYFNHTLYESGQLGYDLHRIVGENHTVEYRYD----YMTDLLTDEVESIISSHN 415

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPD--HYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
           T++P++L L+H A H+++  E ++  D     +   +I+D  R K+A+++  LD+SVG+V
Sbjct: 416 TEKPMYLQLSHLAPHASDAEEVMEVRDWKETNDTFGYIKDLNRRKYASVVATLDKSVGRV 475

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
           V+AL+++ +L+N+II+F++DNG    G+  N  SN+PLRGVK TL+EG VRG   I+SPL
Sbjct: 476 VDALKRKDILNNTIIIFMADNGAQTEGYLANYGSNYPLRGVKFTLFEGAVRGVACIYSPL 535

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA--NKSDI 385
           ++    V+ +  H++DWLPTL SAA  N SD+
Sbjct: 536 IDRSSRVSTKLFHITDWLPTLYSAAGGNPSDL 567



 Score =  219 bits (557), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 113/257 (43%), Positives = 159/257 (61%), Gaps = 9/257 (3%)

Query: 125 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY 184
           MQ   + G E  GLPL+ KILP++L+ LGY T ++GKWHLG++  ++TP++RGF+   G+
Sbjct: 1   MQGEDIQGGEPRGLPLNVKILPEHLQGLGYTTNLIGKWHLGYHTLQHTPSYRGFDYFCGF 60

Query: 185 WTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFL 244
           +  H  Y D+      M G DM     PA+ L+ KY TD+FT +AV II NH +  PL+L
Sbjct: 61  YNSHVSYHDYKYSYQNMSGYDMHCGDAPAYGLNDKYVTDLFTDKAVKIIENHDSFRPLYL 120

Query: 245 YLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRM 304
            ++H A H+  P E  Q  DH      HI +  R K+A ++ KLD+SVG++V  L  R M
Sbjct: 121 QISHLAVHA--PLENPQDYDHSDRRFIHIREQHRRKYARMVSKLDDSVGRIVHTLGNRGM 178

Query: 305 LSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAE 364
           L +S+I+F++DNG A  G   N  SN+PLRG       GGVRG  ++WSP L     V +
Sbjct: 179 LRDSLILFLTDNGAAPIGKFRNFGSNYPLRG-------GGVRGVSVLWSPRLNKPARVCD 231

Query: 365 QYVHVSDWLPTLLSAAN 381
             +H+SDWLPTL +AA+
Sbjct: 232 NLIHISDWLPTLYAAAD 248



 Score = 43.1 bits (100), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 35/138 (25%), Positives = 66/138 (47%), Gaps = 19/138 (13%)

Query: 573 NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGP 632
           +++DGI+ WS + + +  +R +IL NI ++ + + +  GK+KLV+     +   D   G 
Sbjct: 568 DQLDGINQWSTIKKVKIDERRSILVNIQEDNEAALM--GKYKLVRDRSDNQKHYDYYSGN 625

Query: 633 DQVY----------------LSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPC 676
           +  Y                ++ +S     A  ++ LR  A+I C  +   P   +   C
Sbjct: 626 NVSYPKYNVTNVLTSPAASAIASVSAHVLTAAKIKALRKKATIVCKNLTN-PSSCKNRTC 684

Query: 677 LFDIKNDPCEKNNLADRS 694
           LF+I  DPCE  +L+ ++
Sbjct: 685 LFNIYKDPCEFTDLSSKN 702



 Score = 40.0 bits (92), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQ   L G ER G+PL+  +LP+YL+ LGY T ++ 
Sbjct: 301 MQGYPLRGAERRGIPLNNTLLPEYLRRLGYTTHLVG 336


>gi|298706923|emb|CBJ29750.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 706

 Score =  233 bits (594), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 136/382 (35%), Positives = 208/382 (54%), Gaps = 61/382 (15%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLD-QIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           A +  P++ FI+ DD+GWND+G+  +D Q  TP++D LA  G+ + NYYT+ +CTP+R++
Sbjct: 126 AETAKPNVFFIMVDDMGWNDIGYQSVDLQGVTPHLDRLAAGGVKMTNYYTMSICTPARAS 185

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           +MTG++ +  G+Q++V+      GLPL+EKI P+Y+K+ GY T ++GKWH+G Y   + P
Sbjct: 186 LMTGRYVMRYGLQYSVIQPGAPWGLPLTEKIFPEYMKDAGYETHMIGKWHIGSYTSRHIP 245

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGL----------------------DMRRD-- 209
           + RGF+++LGY    + Y+ H +   K  G                       D  RD  
Sbjct: 246 SQRGFDTYLGYLNDEEMYWTHQSWTAKYNGRKFFDFGFGNATGYYDVIDREVPDQLRDGT 305

Query: 210 -----------------LEPAWDLHGKYSTDVFTAEAVDIIHNHS--TDEPLFLYLAHAA 250
                                    G+YST VF   A++IIH  +   DEPL++YLAH A
Sbjct: 306 ISDGDDDETMLSATSSYSSSGGFFKGRYSTQVFHNRALEIIHEKTPYDDEPLYMYLAHQA 365

Query: 251 THSANPYEPLQ--APDHYLNIHRHIE-------DFKRSKFAAILHKLDESVGKVVEALEQ 301
            H      P +   PD  L++   IE       +  R++FA +L  LD ++G++VE LE 
Sbjct: 366 VHDPLGLPPAEHFTPDQ-LDLLERIETGSEEGTEALRARFAKVLMYLDYTIGELVEYLET 424

Query: 302 RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP--LLESR 359
              + NSII+  SDNGG  +       SN+PLRG+K++ WEGG +    ++S   + E R
Sbjct: 425 NGWMENSIIIVASDNGGCPS----CGGSNYPLRGIKHSYWEGGTKVPSFVYSKSHIPEER 480

Query: 360 -GIVAEQYVHVSDWLPTLLSAA 380
            G   +  +HV+DWLPT+  AA
Sbjct: 481 WGTEYDGLMHVTDWLPTIAGAA 502


>gi|291241212|ref|XP_002740506.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 534

 Score =  233 bits (593), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 142/345 (41%), Positives = 189/345 (54%), Gaps = 39/345 (11%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+F+LADDLGW+DVG+H    I TPNID LA  G+ L+NYY    C PSR  +MTG++
Sbjct: 69  PNIVFVLADDLGWHDVGYHD-SVIKTPNIDQLAAEGVKLENYYVSSWCAPSRVNLMTGRY 127

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL-GFYKKEYTPTFRGF 178
            I TG+  +V   C+  G  + E  L   L E GY T +VGKWHL GF   E  PT RGF
Sbjct: 128 RIRTGLYGDV---CDFMG--IHETTLADKLYEAGYYTAMVGKWHLSGFEHAECYPTHRGF 182

Query: 179 ESHLGYWTGHQDYFDHSA----EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           ++ LGY  G Q+YF H           W      D   A    G+YST +F  EA  II 
Sbjct: 183 QTFLGYHGGSQNYFTHRRGGPHAPYDFWA----NDTSIAVKYEGQYSTMIFADEAQRIIR 238

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFAAILHKLDESVG 293
            H+T +PLFLYL+  A H      PL  P  Y + +R  IED KR  +AA+   +DE++G
Sbjct: 239 QHNTKQPLFLYLSFQAVHV-----PLLVPPSYEDQYRSLIEDDKRRVYAAMASCMDEAIG 293

Query: 294 KVVEALEQRRMLSNSIIVFVS-----------DNGGAAAGFNLNAASNWPLRGVKNTLWE 342
            V   L+++ + +N++ +F S           DNG        +  SNWPLRG K   +E
Sbjct: 294 NVTATLKEKGLWNNTVFIFSSARLNFKHDFCLDNGATPT----DGGSNWPLRGRKGLWYE 349

Query: 343 GGVRGAGLIWSPLLESR--GIVAEQYVHVSDWLPTLLSAANKSDI 385
           GG+R  G + S LL     G    + +HV+DW PTLL  A K+D+
Sbjct: 350 GGIRAVGFVNSHLLSDHVTGTTNHELIHVTDWFPTLLRLA-KADL 393


>gi|440894324|gb|ELR46807.1| Arylsulfatase J, partial [Bos grunniens mutus]
          Length = 466

 Score =  232 bits (592), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 117/265 (44%), Positives = 163/265 (61%), Gaps = 11/265 (4%)

Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
           ++ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  PT RG
Sbjct: 1   RYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRG 60

Query: 178 FESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHN 235
           F++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     I+ +
Sbjct: 61  FDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGVYSTQMYTQRVQQILAS 120

Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
           H   +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++  V
Sbjct: 121 HDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEAINNV 175

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
             AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G + SPL
Sbjct: 176 TLALKMYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAIGFVHSPL 231

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
           L+++G V ++ VH++DW PTL+S A
Sbjct: 232 LKNKGTVCKELVHITDWYPTLISLA 256


>gi|346472067|gb|AEO35878.1| hypothetical protein [Amblyomma maculatum]
          Length = 514

 Score =  232 bits (592), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 122/315 (38%), Positives = 176/315 (55%), Gaps = 14/315 (4%)

Query: 71  GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
           GWNDV +H  +++ +P ++ LA  G+IL  +Y +  CTP+R+A+MTG++P   G+Q + +
Sbjct: 36  GWNDVSWHN-ERMESPILEQLAKEGVILDQHYALPTCTPTRAALMTGRYPYKLGIQSHGI 94

Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
              E  GLPL    L + LK  GY T   GKWHLG+  +  TPT RGF++  G++ G QD
Sbjct: 95  RTLEPNGLPLGVTTLAEELKRTGYTTHAFGKWHLGYCNQSLTPTRRGFDTFRGFYVGGQD 154

Query: 191 YFDHSAEEMKMW----GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYL 246
           YF H+    K      G D R   E  +   G Y+T +     +  I     D+P+FLY+
Sbjct: 155 YFSHTLSGGKTSATAKGYDYRNGDEVDYSAKGVYTTTLIANHVLSAIEESQPDKPMFLYV 214

Query: 247 AHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLS 306
           A  A H+     PLQ P  Y  +     + KR     +L  +D +VG +V+ L+ + M  
Sbjct: 215 AFQAVHA-----PLQVPTQYRKMCSIYRNPKRKLLCEMLAVMDHAVGLIVKKLKMKDMWR 269

Query: 307 NSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
           N+++VF+SDNGG      L   +NWPLRG KNTL+EGG R    +  PL+ + G  +   
Sbjct: 270 NTLLVFISDNGGQI----LYGGNNWPLRGNKNTLFEGGTRVPAFVAGPLIRNGGRNSSSI 325

Query: 367 VHVSDWLPTLLSAAN 381
           +HV DW PTLLSAA 
Sbjct: 326 IHVVDWFPTLLSAAG 340


>gi|350587859|ref|XP_003129285.2| PREDICTED: arylsulfatase J [Sus scrofa]
          Length = 539

 Score =  231 bits (590), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 117/262 (44%), Positives = 160/262 (61%), Gaps = 11/262 (4%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 77  IHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDT 136

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHST 238
             G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     I+ +H  
Sbjct: 137 FFGSLLGSGDYYTHYKCDSPGMCGYDLYENENAAWDYDNGIYSTQMYTQRVQQILASHDP 196

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
             P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++  V  A
Sbjct: 197 KRPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLA 251

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G + SPLL++
Sbjct: 252 LKMYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 307

Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
           +G V ++ VH++DW PTL+S A
Sbjct: 308 KGTVCKELVHITDWYPTLISLA 329


>gi|443705385|gb|ELU01963.1| hypothetical protein CAPTEDRAFT_143986, partial [Capitella teleta]
          Length = 345

 Score =  231 bits (590), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 130/324 (40%), Positives = 185/324 (57%), Gaps = 20/324 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+ +FI+ADDLG++D+G    D I TPN+DALA  G+IL N Y   LCTPSR A+M+G++
Sbjct: 26  PNFVFIMADDLGYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCTPSRHALMSGRY 84

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P  + MQ  V+   E     L  K LPQYLKELGY+  +VGKWHLG+ + E  PT RGF+
Sbjct: 85  PSASAMQSMVIQPMEARCAGLEYKFLPQYLKELGYKNHMVGKWHLGYCRDECLPTSRGFD 144

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD-IIHNHST 238
           +  G + G  DY+ H       W ++     E     +G +S D+   E +D +   H +
Sbjct: 145 TFYGLYAGAGDYWSHEIFGKYDWHINGDIHFEA----NGTHSQDL-EMEGLDKVFDEHDS 199

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH--IEDFKRSKFAAILHKLDESVGKVV 296
            +PLFLY A       NP+ P Q  D +LN++     +D +R K+  +   LD  VGK+V
Sbjct: 200 KDPLFLYFA-----PQNPHTPSQPTDEFLNLYPEDKFDDIRR-KYLGLTSGLDAMVGKIV 253

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
           E LE   ++ N+ ++FVSDNG            N P RG K +L+EGG +    ++SPLL
Sbjct: 254 EKLENNGLMDNTYVIFVSDNGADPP-----EGLNTPFRGGKGSLFEGGTKSNSFVYSPLL 308

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA 380
              G   +  +H++DWLPTL+  A
Sbjct: 309 NKTGYENDGLMHITDWLPTLVKLA 332


>gi|87306948|ref|ZP_01089094.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
 gi|87290321|gb|EAQ82209.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
          Length = 455

 Score =  231 bits (589), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 133/348 (38%), Positives = 195/348 (56%), Gaps = 22/348 (6%)

Query: 39  VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           +L +A TL+ V       +  P+I+F+LADDLG  DV + G   I TP +DALA SG  L
Sbjct: 8   ILTIALTLASVATTFATDAPRPNIVFLLADDLGGADVSWRG-SPIKTPQLDALANSGAKL 66

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
           + +Y   +C+P+RSA++TG++P+  G+Q  V+      GLPL E+ L + L++ GY T I
Sbjct: 67  EQFYVQPVCSPTRSALLTGRYPMRYGLQVGVVRPWADYGLPLDERTLAEALQDAGYETAI 126

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD--MRRDLEPAWDL 216
           VGKWHLG     Y P  RGF+   G++ G  DYF H  +    W  D  + RD       
Sbjct: 127 VGKWHLGHVSPAYLPMARGFDHQYGHYNGALDYFTHDRDGGHDWHKDDHVNRD------- 179

Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
              Y+T +   EAV +I +    +PLFLY+   A HS     PLQ P+ Y   +  ++  
Sbjct: 180 -EGYATHLIAQEAVRVIQDRDKKKPLFLYVPFNAVHS-----PLQVPESYAAPYGDMKK- 232

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
           +R  +A ++  LDE+VG++V+ ++++ ML N++ +F SDNGG   G       N PLRG 
Sbjct: 233 RRQAYAGMVAALDEAVGQIVDEIQRQEMLDNTLFIFSSDNGGPEPG---KLTDNGPLRGG 289

Query: 337 KNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           K+TL+EGGVR      W   + + G   E  +H+ DW PTL+  A  S
Sbjct: 290 KHTLYEGGVRVCAFASWKGRI-APGSKVEAPLHIVDWYPTLIELAGGS 336


>gi|410446533|ref|ZP_11300636.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
 gi|409980205|gb|EKO36956.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
          Length = 517

 Score =  231 bits (588), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 139/348 (39%), Positives = 201/348 (57%), Gaps = 29/348 (8%)

Query: 48  MVFVDLVASSG---PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
           ++F+ L  S+     P++I IL DDLGW DV +HG   IPTPNIDALA +G+ L  +Y  
Sbjct: 4   LIFLILFVSASIYSKPNVIIILTDDLGWGDVSYHG-GHIPTPNIDALAKNGVELNRFYAS 62

Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLY-GCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
            +C+P+R++++TG H  + G+   +     E+ GLP+  KI+PQ+ KE GY+T + GKWH
Sbjct: 63  PVCSPTRASLLTGLHIFNHGIIRPLANPTAEQYGLPVDLKIMPQFFKEAGYQTALSGKWH 122

Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD 223
           LG + +EY PT RGF+   G+  G   YFDH         LD  R+ EP ++    YST+
Sbjct: 123 LGMHLEEYWPTNRGFDQSYGHMLGGIGYFDHVHSSR----LDWHRNEEPLFE--DGYSTE 176

Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
           +   EAV II     + PLFLY+A  A H+     P+QAPD  + +  +IED     +AA
Sbjct: 177 LIANEAVRIIETKDPNRPLFLYVAFNAPHT-----PIQAPDKNIELFSYIEDPLDRAYAA 231

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA----------AAGFNLNAASNWPL 333
            ++ LD  +GK++ A+E + +L  +IIVF SDNG            A G      S   L
Sbjct: 232 NVNALDAEIGKIIYAIEDQGILEETIIVFFSDNGPVFDINPIIAVLAPGLVEARGSTAGL 291

Query: 334 RGVKNTLWEGGVR-GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G K +  EGG+R  A ++W  +L++    ++QY+ V D LPTLLSAA
Sbjct: 292 PGSKTSAREGGIRVPASIMWKGVLDNSK--SDQYIFVQDLLPTLLSAA 337


>gi|327291280|ref|XP_003230349.1| PREDICTED: arylsulfatase J-like, partial [Anolis carolinensis]
          Length = 471

 Score =  231 bits (588), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 114/270 (42%), Positives = 169/270 (62%), Gaps = 11/270 (4%)

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
            +++ ++ IHTG+QH+V+   +   LPL    LPQ LKE GY T +VGKWHLGFY+KE  
Sbjct: 2   VLVSSRYQIHTGLQHSVIRPTQPNCLPLDNATLPQKLKEAGYSTHMVGKWHLGFYRKECM 61

Query: 173 PTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAV 230
           PT RGF++  G   G  DY+ H   +  +M G D+  +   AWD  +G YST ++T +  
Sbjct: 62  PTQRGFDTFFGSLLGSGDYYTHYKCDSPRMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQ 121

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
            I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE
Sbjct: 122 QILASHNPRKPIFLYIAYQAVHS-----PLQAPGTYYERYRSINNINRRRYAAMLACLDE 176

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
           ++  V  AL++    +NS+I++ SDNGG          +NWPLRG K T WEGG+R  G 
Sbjct: 177 AINNVTLALKRYGYYNNSVIIYSSDNGGQPTA----GGNNWPLRGSKGTYWEGGIRAVGF 232

Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           + SPLL+++G V ++ +H++DW PTL++ A
Sbjct: 233 VHSPLLKNKGCVCKELIHITDWFPTLITLA 262


>gi|348564635|ref|XP_003468110.1| PREDICTED: arylsulfatase J-like [Cavia porcellus]
          Length = 543

 Score =  230 bits (587), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 125/295 (42%), Positives = 180/295 (61%), Gaps = 15/295 (5%)

Query: 91  LAYSGIIL-KNYYTVQLCTPS--RSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQ 147
           +A +GI L +++ ++    PS  R  +++  + IHTG+QH+++   +   LPL    LPQ
Sbjct: 48  MASAGITLFRSWLSLASSHPSDVRCGLLS-LYQIHTGLQHSIIRPTQPNCLPLDNATLPQ 106

Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDM 206
            LKE+GY T +VGKWHLGFYKKE  PT RGF++  G   G  DY+ H   +   M G D+
Sbjct: 107 KLKEVGYSTHMVGKWHLGFYKKECMPTKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDL 166

Query: 207 RRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
             +   AWD  +G YST ++T     I+ +H+  +P+FLY+A+ A HS     PLQAP  
Sbjct: 167 YENDNAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGR 221

Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
           Y   +R I +  R ++AA+L  LDE++  V  AL+     +NSII++ SDNGG       
Sbjct: 222 YFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKAYGFYNNSIIIYSSDNGGQPTA--- 278

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
              SNWPLRG K T WEGG+R  G + SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 279 -GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA 332


>gi|345326826|ref|XP_003431088.1| PREDICTED: arylsulfatase J [Ornithorhynchus anatinus]
          Length = 504

 Score =  230 bits (587), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 117/276 (42%), Positives = 165/276 (59%), Gaps = 11/276 (3%)

Query: 107 CTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
           C  +  +    K+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGF
Sbjct: 24  CPQNEGSCFQQKYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGF 83

Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDV 224
           Y+KE  PT RGF++  G   G  DY+ H   +   M G D+  + + AW+  +G YST +
Sbjct: 84  YRKECMPTQRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDDAAWEYDNGIYSTQM 143

Query: 225 FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAI 284
           +T     I+ +H    PLFLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+
Sbjct: 144 YTQRVQQILASHDPRRPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAM 198

Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
           L  LDE++  V  AL++     NSI+++ SDNGG          SNWPLRG K T WEGG
Sbjct: 199 LSCLDEAIHNVTLALKRYGFYDNSILIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGG 254

Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           +R  G + SPLL+++G V  + VH++DW PTL++ A
Sbjct: 255 IRAVGFVHSPLLKNKGTVCRELVHITDWYPTLITLA 290


>gi|313212736|emb|CBY36668.1| unnamed protein product [Oikopleura dioica]
          Length = 602

 Score =  230 bits (587), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 125/335 (37%), Positives = 194/335 (57%), Gaps = 19/335 (5%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PH+IF+L DDLG++D+G+   D I +PNIDALA   + LK +Y    CTPSR+A +TG++
Sbjct: 45  PHVIFVLVDDLGFDDLGYVNRDVI-SPNIDALAKDALHLKKHYVQPSCTPSRAAFLTGRY 103

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            I  GMQ  V+   E  G+PL E +L +  K+ GYRT + GKWHLGFY  ++ P  RGF+
Sbjct: 104 NIRMGMQSGVIRAPEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQNRGFD 163

Query: 180 SHLGYWTGHQDYFDHSAEEMKMW--GLDMRRD-LEPAWDLHGKYSTDVFTAEAVDIIHNH 236
              G++ G QD++ H +  ++ +    D+  D +   +  +G YST +F  + ++ +  H
Sbjct: 164 RFYGFYLGSQDFYFHDSGRLEAYPGNGDVENDTILDDFHTNGTYSTKLFVDDFINDLAKH 223

Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED---FKRSKFAAILHKLDESVG 293
               PLF Y++    H      PLQ  + +   + +  D    +R   +  +  LD  +G
Sbjct: 224 DPAVPLFNYVSFQDVHG-----PLQTVNKFKKHYENKTDTWTHERILISTKITTLDHHIG 278

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
           ++V AL+++   +N+++VF SDNGG         ASNWPLRG K T+++GG++    + S
Sbjct: 279 RMVTALKEKNYWNNTVLVFTSDNGGQPN----EGASNWPLRGSKGTIYDGGLKSRAFVAS 334

Query: 354 PLLESR--GIVAEQYVHVSDWLPTLLSAANKSDIP 386
           PLL++R  G       HVSDW PTLL+ +   D+P
Sbjct: 335 PLLQNRMKGQNYNYLFHVSDWFPTLLTLSG-CDVP 368


>gi|241789348|ref|XP_002400616.1| sulfatase, putative [Ixodes scapularis]
 gi|215510801|gb|EEC20254.1| sulfatase, putative [Ixodes scapularis]
          Length = 224

 Score =  230 bits (586), Expect = 3e-57,   Method: Composition-based stats.
 Identities = 111/225 (49%), Positives = 149/225 (66%), Gaps = 1/225 (0%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PP+IIF+LADDLGW DV F G  QIPTPN+D LA  GIIL NYY + LCTPSR A+M+G 
Sbjct: 1   PPNIIFLLADDLGWADVSFRGDPQIPTPNLDVLASQGIILNNYYVLHLCTPSRGALMSGL 60

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +PIHTG+QH V    E  GLPL+  I+P++LK LGY T ++GKW+LG+YK+ YTPT RGF
Sbjct: 61  YPIHTGLQHYVQLPAEPHGLPLNVTIMPEHLKNLGYTTHMIGKWNLGYYKESYTPTRRGF 120

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           +S  G+  G +DY+DH+   +   GLD      P  +    YSTD+FT +A+ +I +H  
Sbjct: 121 DSFYGFLNGGEDYYDHTILFVST-GLDFWDGTTPVRNQSHHYSTDLFTKKALALIKDHDQ 179

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
            +P+FLY +H A HS +    L+AP   +    +I++  RS  A 
Sbjct: 180 AKPMFLYFSHQAVHSGDFKVELEAPASAIAHFPYIKEQNRSIIAG 224


>gi|431918040|gb|ELK17268.1| Arylsulfatase I [Pteropus alecto]
          Length = 547

 Score =  229 bits (585), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 126/324 (38%), Positives = 185/324 (57%), Gaps = 36/324 (11%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 50  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 108

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +       +                         RGF++
Sbjct: 109 IHTGLQHSIIRPRQPNCLPLDQSXXXXXXR-------------------------RGFDT 143

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++      I+ +HS  
Sbjct: 144 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASHSPR 203

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           +PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 204 QPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 258

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 259 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 314

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   VH++DW PTL+  A  +
Sbjct: 315 RRTSRALVHITDWYPTLVGLAGGT 338


>gi|390336076|ref|XP_784530.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 545

 Score =  229 bits (584), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 140/393 (35%), Positives = 201/393 (51%), Gaps = 63/393 (16%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPP 60
           +QH V+   +   LP+ E  LPQ LKE GY T ++                      G  
Sbjct: 42  LQHGVIRHAQPNCLPMDEVTLPQKLKENGYTTNMV----------------------GKW 79

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HI F L D     + GF     +    I  L    + L       LC           + 
Sbjct: 80  HIGFYL-DACTPTERGFDSFFDV----IMVLWSRELCLVYLLRASLC-----------YW 123

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH V+   +   LP+ E  LPQ LKE GY T +VGKWH+GFY    TPT RGF+S
Sbjct: 124 IHTGLQHGVIRHAQPNCLPVDEVTLPQKLKENGYTTNMVGKWHIGFYLDACTPTERGFDS 183

Query: 181 HLGYWTGHQDYF------------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
             G+  G +D++            D S + +   G D+  +  P +D  G+YST +FT +
Sbjct: 184 FFGFLIGAEDHYTRMLSCEAYNANDGSTKTLS--GHDLHANKTPVFDYEGQYSTHLFTNK 241

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
            VD+I  H   +PLF+YLA+ A H+     PLQ PD Y++ +++I D  R  +A ++  +
Sbjct: 242 TVDVIERHDKTKPLFIYLAYQAVHA-----PLQVPDSYMDPYKNIADKTRRTYAGMVSCM 296

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           DE +G V +AL+   +  N+II+F +DNGG          +NWPLRG K ++WEGG+ G 
Sbjct: 297 DEGIGNVTQALKDAGLYDNTIIIFSTDNGGHID----RGGNNWPLRGSKGSMWEGGIHGV 352

Query: 349 GLIWSPLLES--RGIVAEQYVHVSDWLPTLLSA 379
           G + SPLL    +G V  + +HVSDWLPT+++ 
Sbjct: 353 GFVHSPLLPDAVKGTVNHELMHVSDWLPTIVAG 385



 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 49/82 (59%), Gaps = 3/82 (3%)

Query: 104 VQLCTPSRSAIMT---GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
            Q C  + + + T   G + IHTG+QH V+   +   LP+ E  LPQ LKE GY T +VG
Sbjct: 18  AQRCILAAAIVGTPDFGHNMIHTGLQHGVIRHAQPNCLPMDEVTLPQKLKENGYTTNMVG 77

Query: 161 KWHLGFYKKEYTPTFRGFESHL 182
           KWH+GFY    TPT RGF+S  
Sbjct: 78  KWHIGFYLDACTPTERGFDSFF 99


>gi|308512479|ref|XP_003118422.1| CRE-SUL-3 protein [Caenorhabditis remanei]
 gi|308239068|gb|EFO83020.1| CRE-SUL-3 protein [Caenorhabditis remanei]
          Length = 500

 Score =  229 bits (584), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 135/375 (36%), Positives = 208/375 (55%), Gaps = 36/375 (9%)

Query: 31  RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
           RT +  F +L L     +  VD    +  P+++FI+ADDLG+ND+ +     + TPN+  
Sbjct: 3   RTTLPFFLLLVLQ-HYGIADVDGQTLTQKPNVLFIMADDLGFNDLDWKD-STLHTPNLRN 60

Query: 91  LAY--SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQY 148
           LA+  +  +L N Y  QLCTP+RSA MTG +P   G Q+ V    E  G+P     L + 
Sbjct: 61  LAFHKNTALLTNSYVNQLCTPTRSAFMTGYYPFRVGTQNGVFLHMEPAGVPTMFPFLSEN 120

Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE----EMK--MW 202
           +++L Y T +VGKWHLG+ KKE+ PT RGF+   G++     YF+HSA+    E+K  + 
Sbjct: 121 MRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQTGYFNHSADQYHRELKRVVK 180

Query: 203 GLDMRRDL-----EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS---- 253
           GLD+  ++      P +  +G YSTD+FT  A+ ++ NH+T +P F++L++ A H     
Sbjct: 181 GLDLFEEVGNGKSVPDFSQNGVYSTDLFTDVAMSVLDNHNTTKPFFMFLSYQAVHPPLQI 240

Query: 254 ----------ANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRR 303
                     A  +E  ++ D Y +    ++D  R  +  +L  +D ++G++VE L+   
Sbjct: 241 KSEKKLTPKIATHFEVPRSFDKYCS---RVKDRYRRIYCGMLTAMDFAIGRLVEYLKASN 297

Query: 304 MLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVA 363
           +  N++IVF SDNGG +       ASN PLRG K+T+WEGG +    + SP+    G   
Sbjct: 298 LYENTVIVFTSDNGGTSN----FGASNAPLRGEKDTIWEGGTKTTTFVHSPMYIEEGGTR 353

Query: 364 EQYVHVSDWLPTLLS 378
           E   HV DW  T+LS
Sbjct: 354 EMMFHVVDWHATILS 368


>gi|395545568|ref|XP_003774672.1| PREDICTED: arylsulfatase J-like, partial [Sarcophilus harrisii]
          Length = 466

 Score =  229 bits (584), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 115/262 (43%), Positives = 164/262 (62%), Gaps = 11/262 (4%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 3   IHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDT 62

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHST 238
             G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T +   I+ +H+ 
Sbjct: 63  FFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNP 122

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
            +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++  V  A
Sbjct: 123 RKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAINNVTLA 177

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L++     NSII++ SDNGG      +   SNWPLRG K T WEGG+R  G + SPLL++
Sbjct: 178 LKRYGFYDNSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 233

Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
           +G + ++ VH++DW PTL++ A
Sbjct: 234 KGTICKELVHITDWYPTLITLA 255


>gi|323452295|gb|EGB08169.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 614

 Score =  229 bits (583), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 132/340 (38%), Positives = 187/340 (55%), Gaps = 20/340 (5%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIP-TPNIDALAYSGIILKNYYTVQLCTPSRS 112
            A+   PH++F++ DD+G+NDVG+   D    TP +D L   G+ +   Y  Q+CTPSR+
Sbjct: 57  AAAETRPHVVFVMLDDVGFNDVGYASSDLGEMTPFLDGLMADGVRVDRLYGQQVCTPSRA 116

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
           A++TGK PIH  +QH  +   E  GLP  E  L QYLK LGY T +VGKWHLG Y    T
Sbjct: 117 AMLTGKLPIHLELQHWQVAPSEPWGLPTREATLAQYLKALGYSTHMVGKWHLGHYNNAST 176

Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGL--DMRRDLEPAWDLHGKYSTDVFTAEAV 230
           P  RGF+S  G+++G  DY  H      +W    D+  D  P  D HG++ T +    A+
Sbjct: 177 PLNRGFDSFYGFYSGGVDYLTHDPSTGYVWRCYRDLWDDERPVTDAHGQHQTSLMNERAI 236

Query: 231 DIIHNHSTD---EPLFLYLAHAATHSANPYEPLQAPDHYLN----IHRHIEDFKRSKFAA 283
            ++  H+ +   EP+F Y+++      N + PLQ P   L         I +  R  FAA
Sbjct: 237 AVLERHAVEKKSEPVFAYVSY-----PNAHLPLQPPTELLERRNATLLDIPNHDRKNFAA 291

Query: 284 ILHKLDESVGKVV-EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
           ++   DES+  +    L+   +   ++IV  SDNGG  +   +   SN+PLRG K  L+E
Sbjct: 292 LMMNADESLANLTNRGLKATGLYDTAVIVIASDNGGQVSA--MGGGSNYPLRGEKKYLFE 349

Query: 343 GGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
           GGVR   +I SPLL   +RG    + +H+SDWLPTLL  A
Sbjct: 350 GGVRAHAVIHSPLLPRAARGSSYGKLMHMSDWLPTLLQGA 389


>gi|323452003|gb|EGB07878.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 1818

 Score =  229 bits (583), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 136/356 (38%), Positives = 189/356 (53%), Gaps = 51/356 (14%)

Query: 59  PPHIIFILADDLGWNDVGFHG----LDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           PPH++F LADD+G+ DVG++G     +++ TP IDALA +G+ L  YYT   CTPSR+A+
Sbjct: 43  PPHVVFFLADDVGFGDVGYNGDPTLTNRVSTPVIDALADAGVKLSRYYTQPDCTPSRAAL 102

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           ++GK+P  TG  H VL      GLPL   +LP+ L    YR+  VGKW +G    +  P 
Sbjct: 103 LSGKYPATTGTYHGVLNPQSTWGLPLEHALLPEALPG-AYRSHAVGKWDVGHSSAKRLPE 161

Query: 175 FRGFESHLGYWTG-HQDYFDHSAEEM---------------------------KMWGLDM 206
            RGF+S LG++   +    D+S  E+                              GL  
Sbjct: 162 ARGFDSFLGFYLCFYGPMIDYSTHEIHDHDLACAGDACAAALAKCQVRGSTVADFSGLGG 221

Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
           +R      D  G Y+TDVF   AVD+I   + D PLFLY+A  A H      PL+A +  
Sbjct: 222 QRR-----DYDGMYTTDVFADRAVDLIEAEAADHPLFLYVAFNAVHG-----PLEADEAD 271

Query: 267 LNIHRHI----EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
           +           D KR+ FAA+   +D ++G+VV+AL+      N+++VF SDNG     
Sbjct: 272 VAAFEATFLPESDSKRATFAAVALAMDRAIGRVVDALQATGAYENALVVFASDNGAIPG- 330

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTL 376
             +   SNWPLRG K + WEGGVR    + SPLL +  RG V +   HV+DWLPT+
Sbjct: 331 -QMGGGSNWPLRGGKFSAWEGGVRVPAFVHSPLLPAHMRGAVYDGLFHVADWLPTI 385



 Score =  219 bits (559), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 143/399 (35%), Positives = 207/399 (51%), Gaps = 54/399 (13%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHG----LDQIPTPNIDALAYSGIILKNYYTVQLCTPS 110
           A+S PPH++  LADD G++DVG++      +Q+ TP +D+LA  G+ L  YYT   CTPS
Sbjct: 614 AASNPPHLLLFLADDAGFDDVGYNSDPSKTNQVQTPFLDSLAAGGVKLARYYTQPDCTPS 673

Query: 111 RSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           R+A+++G +P  +GM H ++      GL L  +++PQ L    YR+  VGKW +G Y   
Sbjct: 674 RAALLSGMYPASSGMYHKMITAQSNWGLDLDLELIPQRLPA-AYRSHAVGKWDVGHYTWS 732

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSA-------EEMKMWGLDMRR-------------DL 210
           + P FRGF S LG+++   DY+ H         EEM++   + R              D 
Sbjct: 733 HVPQFRGFRSFLGFYSPIIDYYTHETFDTLQCLEEMELTECEARLASECSSSIKDFNFDG 792

Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-- 268
           +P     G YSTDVF A A D+I   +   PLFLY+A  A H+     PL+A +  +   
Sbjct: 793 DPLPLADGTYSTDVFAARARDLIRKEAPKHPLFLYVAFNAVHA-----PLEADEDVIESF 847

Query: 269 IHRHIEDFK--RSKFAAILHKLDESVGKVVEALEQ-RRMLSNSIIVFVSDNGGAAAGFNL 325
               +E+ +  R+ FAA+   +D ++  VV+ L+       NS+I+F SDNGG      +
Sbjct: 848 ASSFLEEARDARASFAAVALGMDRAMEAVVDELKAVDGAYENSVIMFASDNGGLPG--QI 905

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKS 383
              SNWPLRG K + +EGGVR    + SPLL +  RG       HV+DWLPTL+  A  +
Sbjct: 906 AGGSNWPLRGQKFSPFEGGVRVPAFVHSPLLPAARRGGAYGGLFHVTDWLPTLVRLAGGA 965

Query: 384 DIPNYVNSTVENIIPRYENSILRYEN--GTHEYNSPRIE 420
                         PR  + + +YE   G H    PR E
Sbjct: 966 P-------------PRNVDGVDQYEAIFGEHPAEHPRTE 991


>gi|395504882|ref|XP_003756775.1| PREDICTED: arylsulfatase I [Sarcophilus harrisii]
          Length = 598

 Score =  228 bits (582), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 119/285 (41%), Positives = 180/285 (63%), Gaps = 14/285 (4%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 215 HIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 273

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E+GY T +VGKWHLGFYKK   PT RGF++
Sbjct: 274 IHTGLQHSIIRPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDT 333

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW+  G+YST ++   A  I+ +H+  
Sbjct: 334 FLGSLTGNVDYYTYDNCDGPGVCGYDLHEGESVAWEQSGQYSTLLYAQRASQILASHNPR 393

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           +PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 394 QPLFLYVAFQAVHT-----PLQSPREYLYRYRSMGNVARRKYAAMVTCMDEAVRNITWAL 448

Query: 300 EQRRMLSNSIIVFVSDNGGA---AAGFNLNAASNWPLRGVKNTLW 341
           ++     NS+I+F +DN  A    + +NL   ++    G + ++W
Sbjct: 449 KRYGYYDNSVIIFSTDNXLANFPGSWWNLERLAD----GTRQSVW 489


>gi|449265842|gb|EMC76972.1| Arylsulfatase J, partial [Columba livia]
          Length = 467

 Score =  228 bits (580), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 114/265 (43%), Positives = 166/265 (62%), Gaps = 11/265 (4%)

Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
           ++ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY++E  PT RG
Sbjct: 1   RYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRG 60

Query: 178 FESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHN 235
           F++  G   G  DY+ H   +   + G D+  +   AWD  +G YST ++T +   I+ +
Sbjct: 61  FDTFFGSLLGSGDYYTHFKCDSPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILAS 120

Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
           H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++  V
Sbjct: 121 HNPSKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEAINNV 175

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
             AL++     NSII++ SDNGG      +   SNWPLRG K T WEGG+R  G + SPL
Sbjct: 176 TLALKKYGYYDNSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVHSPL 231

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
           L+++G V ++ VH++DW PTL++ A
Sbjct: 232 LKNKGSVCKELVHITDWFPTLITLA 256


>gi|443698985|gb|ELT98690.1| hypothetical protein CAPTEDRAFT_103525, partial [Capitella teleta]
 gi|443734460|gb|ELU18442.1| hypothetical protein CAPTEDRAFT_129771, partial [Capitella teleta]
          Length = 333

 Score =  227 bits (579), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 129/331 (38%), Positives = 184/331 (55%), Gaps = 23/331 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+ +FILADDLG++D+G    D I TPN+DALA  G+IL N Y   LCTPSR A+MTG++
Sbjct: 3   PNFVFILADDLGYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCTPSRHALMTGRY 61

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P  + MQ +V+         L  K+LPQYLK+LGY+  +VGKWHLG+ + E  PT RGF+
Sbjct: 62  PSASAMQTSVILPMRAKCTGLEYKLLPQYLKDLGYKNHMVGKWHLGYCRDECLPTSRGFD 121

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD-------- 231
           +  G + G  DY+ H+      W  +   D E A   H +  +        D        
Sbjct: 122 TFYGLYAGTGDYWSHTFFGKYDWHTNADIDFE-ANSTHSQVRSSYMNFVLQDLEMERLDK 180

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH--IEDFKRSKFAAILHKLD 289
           +   H + +PLFLY A       NP+ P Q  D +LN++      D +R K+  +   LD
Sbjct: 181 VFDEHDSKDPLFLYFA-----PQNPHTPSQPTDEFLNLYPEDKFNDIRR-KYLGLTSGLD 234

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
             VGK+VE L +  M++N+ ++FVSDNG            N P RG K++L+EGG +   
Sbjct: 235 AMVGKIVEKLIENGMMNNTYVIFVSDNGADPP-----EGLNTPFRGGKSSLFEGGTKSNS 289

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            I+SPLL+      +  +H++DWLPTL+  A
Sbjct: 290 FIYSPLLKKTEYENDGLMHITDWLPTLVRLA 320


>gi|291225017|ref|XP_002732505.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 497

 Score =  227 bits (579), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 132/341 (38%), Positives = 187/341 (54%), Gaps = 19/341 (5%)

Query: 48  MVFVDLVASSGP-------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
            VF+ L   S P       PH+I I  DD+GWNDV +H  D I  PN+  LA  G+I + 
Sbjct: 8   FVFLALAVVSKPSIADEDQPHVILIFVDDMGWNDVHWHNPD-IAMPNLMDLADDGVIFEQ 66

Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
            YT   CTPSR+A+MTG +P  TG QH +++     G+PL+ K+LP+ LKE+GY T +VG
Sbjct: 67  SYTHPTCTPSRAAMMTGLYPFKTGNQHQMVFNLHPSGVPLNFKLLPEKLKEVGYATHMVG 126

Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWT-GHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
           KWHLGF K EY PT RGF+SH G WT G  DY   +       G D R +LE      G 
Sbjct: 127 KWHLGFCKDEYLPTNRGFDSHYGLWTLGVGDYDKLNGVLSPSAGYDFRDNLEVVPKSDG- 185

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
           Y   +    A +I++NHS + PLFL           P + L+ P+ Y  ++  IED +  
Sbjct: 186 YLALMLGDRAEEIVNNHSPETPLFLVFTLDI-----PAKHLEIPEEYEELYADIEDDRTR 240

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
           +F   L  +D+ +G VV+AL+ R M  +S+ +F+ DNG  ++     A SN+P RG+  T
Sbjct: 241 QFYGKLTMMDDIIGGVVDALKDRDMYDDSLFIFIGDNGALSS----QAGSNYPFRGIAGT 296

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           L+EG  R   ++    +E  G  + +   + D   T+L  A
Sbjct: 297 LFEGATRVPSIVSGKGIEKTGYKSNELYSIVDIHRTILDVA 337



 Score = 42.7 bits (99), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 37/133 (27%), Positives = 63/133 (47%), Gaps = 31/133 (23%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTKGKWKLVKVVKVMRYQVDLTGG 631
           E+DG+ +W   S +EPS RN I++NIDD+     +A+  G +KL+            TG 
Sbjct: 344 ELDGLSMWPTWSEDEPSPRNEIIYNIDDDPVSPGAAIRIGDYKLI------------TGH 391

Query: 632 PDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPC--------EPQIAPCLFDIKND 683
           PD +Y      R+     +R   +  +    P   +P          PQ    LF+I +D
Sbjct: 392 PDLLY----PYRD-----IRLTDNYYNYGDPPASGLPTTDQEANAPAPQNVEYLFNIVDD 442

Query: 684 PCEKNNLADRSED 696
           P E+++++  ++D
Sbjct: 443 PEERHDISADNQD 455



 Score = 40.0 bits (92), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 30/45 (66%), Gaps = 2/45 (4%)

Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTRGKWKLV 551
           E+DG+ +W   S +EPS RN I++NIDD+     +A+  G +KL+
Sbjct: 344 ELDGLSMWPTWSEDEPSPRNEIIYNIDDDPVSPGAAIRIGDYKLI 388


>gi|313235834|emb|CBY19819.1| unnamed protein product [Oikopleura dioica]
          Length = 518

 Score =  227 bits (578), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 124/334 (37%), Positives = 195/334 (58%), Gaps = 20/334 (5%)

Query: 62  IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
           +++++ DD GW D G HG  ++ TPN+DA+A  GI+L+ YYT Q+C+P+RS+++TG++PI
Sbjct: 20  VVYLMIDDFGWGDFGVHG-SKLETPNLDAIARDGILLEKYYTQQVCSPTRSSLLTGRYPI 78

Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
             GMQHNV+   +  G+P    +LPQ LK  G+ T +VGKWH G   +   P  RGF+S+
Sbjct: 79  RYGMQHNVILAGQTTGIPKEYALLPQDLKSCGFATHMVGKWHCGHSHEYMLPFNRGFDSY 138

Query: 182 LGYWTGHQDYFDHSAEEMKMW-GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE 240
            GY  G +D++     + K W G+D   +  P     G Y T++++A+   ++   S +E
Sbjct: 139 YGYLQGAEDHYSRIQCQAKEWCGVDFCTENGPTNSTWGTYGTEIYSAQVAQVLDKVSKEE 198

Query: 241 -PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD-ESVGKVVEA 298
            P FLY A    H     +PLQAP+HY      IED  R  +AA++  +D E+VG  +  
Sbjct: 199 KPFFLYYAMQNVH-----DPLQAPEHYKIKFDWIEDPDRRTYAAMVSIMDEETVGDFMTM 253

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L +R +  +++I+  +DNGG       +  +N+PLR  K +L+EGGV    ++   L++ 
Sbjct: 254 LHERNLYDDTLIIITADNGGETR----SGGNNFPLRSQKWSLYEGGVHVNAMLSGGLIKD 309

Query: 359 RG-IVAEQYVHVSDWLPTLLSA------ANKSDI 385
           +     +  VHVSDW  T+L A      A+KSD+
Sbjct: 310 QSKSTFDGLVHVSDWRETILEAMECKPPADKSDL 343


>gi|443693750|gb|ELT95037.1| hypothetical protein CAPTEDRAFT_126817, partial [Capitella teleta]
          Length = 318

 Score =  227 bits (578), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 126/323 (39%), Positives = 185/323 (57%), Gaps = 18/323 (5%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+ +FI+ADDLG++D+G    D I TPN+DALA  G+IL N Y   LC+PSR A+MTG++
Sbjct: 3   PNFVFIMADDLGYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCSPSRHALMTGRY 61

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P  + MQ  V+   E     L  K+LPQYLKELGY   ++GKWHLG+ ++E  PT RGF+
Sbjct: 62  PYKSAMQSFVVLPFEAKCTGLEYKLLPQYLKELGYENHLIGKWHLGYCREECLPTSRGFD 121

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           S  G   G  DY++H+   +  W L+     E   + +G +S D+       +   H   
Sbjct: 122 SFYGLLDGAGDYWEHTTSGVYDWHLND----EVFHEAYGNHSQDLELDRLDKLFAEHDNK 177

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK--RSKFAAILHKLDESVGKVVE 297
           +PLFLY A       NP+ P Q  + YL+++   +DF   R ++ A+   LD  VGK+V+
Sbjct: 178 DPLFLYFA-----PQNPHVPSQPTEEYLSMYS-ADDFSDVRRRYLALTSALDSMVGKIVQ 231

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
            L+   M+ N+ ++F+SDNG            N P RG K +L+EGG +    I+S LL+
Sbjct: 232 HLKDNCMMDNTYLIFMSDNGADPG-----EGQNTPFRGGKTSLFEGGTKSNSFIYSQLLK 286

Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
                 +  +H++DWLPTL+  A
Sbjct: 287 KTEYENDGLMHITDWLPTLVKLA 309


>gi|443702858|gb|ELU00682.1| hypothetical protein CAPTEDRAFT_125641 [Capitella teleta]
          Length = 370

 Score =  227 bits (578), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 134/326 (41%), Positives = 190/326 (58%), Gaps = 20/326 (6%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PP+IIFILADDLG+NDVGF   D I TPNID LA  G+++ N Y+  +CTPSR A+MTG+
Sbjct: 10  PPNIIFILADDLGYNDVGFRNPDII-TPNIDKLARKGVVMTNSYSTHVCTPSRHALMTGR 68

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +P  TGMQ+ V+ G       L  K LPQYLK LGY T  VGKWHLG  + E  PT RGF
Sbjct: 69  YPYKTGMQNFVIPGDAPVCSGLEYKFLPQYLKSLGYNTHAVGKWHLGDCRDECLPTERGF 128

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           +S  G   G   Y++H+      +     +DL+ +   +G +S D+       +  +H+ 
Sbjct: 129 DSFYGLLLGGGGYWNHTYTLFGAYDWFNNKDLDLS--ANGTHSQDLMVDRLSAVFASHNR 186

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK----RSKFAAILHKLDESVGK 294
           +EP+FLY A       NP+ P +A +++L   RH         R ++  ++  LD  VG+
Sbjct: 187 EEPMFLYFA-----PQNPHTPSEATENFL---RHYPSSMYTEIRRQYLGLVSGLDWMVGE 238

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           ++E +    ML N+ I+F SDNGG A       A N P RG K +L+EGG +    I+SP
Sbjct: 239 LLELVIANNMLDNTYIIFQSDNGGDAV-----EALNGPYRGGKGSLFEGGSKVVNFIYSP 293

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
           LL+  G     ++H++DW+PT++S A
Sbjct: 294 LLKKVGYENNGWMHITDWMPTIISLA 319


>gi|449499600|ref|XP_002193905.2| PREDICTED: arylsulfatase J [Taeniopygia guttata]
          Length = 488

 Score =  226 bits (577), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 115/262 (43%), Positives = 163/262 (62%), Gaps = 11/262 (4%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY++E  PT RGF+S
Sbjct: 25  IHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGFDS 84

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHST 238
             G   G  DY+ H   +   + G D+  +   AWD  +G YST ++T +   I+ +H+ 
Sbjct: 85  FFGSLLGSGDYYTHFKCDSPGICGYDLYENNNAAWDHDNGIYSTQMYTQKVQQILASHNP 144

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
            +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++  V  A
Sbjct: 145 RKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSINNINRRRYAAMLACLDEAINNVTLA 199

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L +     NSII++ SDNGG      +   SNWPLRG K T WEGG+R  G + SPLL++
Sbjct: 200 LRKYGYYENSIIIYSSDNGGQP----MAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 255

Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
           +G V ++ VH++DW PTL++ A
Sbjct: 256 KGSVCKELVHITDWFPTLITLA 277


>gi|355669602|gb|AER94582.1| arylsulfatase B [Mustela putorius furo]
          Length = 418

 Score =  226 bits (576), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 113/260 (43%), Positives = 160/260 (61%), Gaps = 15/260 (5%)

Query: 130 LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQ 189
           ++ C+   +PL EK+LPQ LKE GY T +VGKWHLG ++KE  PT RGF+++ GY  G +
Sbjct: 1   IWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMFRKECLPTRRGFDTYFGYLLGSE 60

Query: 190 DYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLF 243
           DY+ H       A  +    LD R   E A      YST++FT  A  +I NH  ++PLF
Sbjct: 61  DYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIANHPPEKPLF 120

Query: 244 LYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRR 303
           LYLA  + H     EPLQ P+ YL  ++ I+D  R  +A ++  +DE+VG V  AL+   
Sbjct: 121 LYLALQSVH-----EPLQVPEEYLKPYKFIQDKNRHHYAGMVSLMDEAVGNVTAALKSHG 175

Query: 304 MLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVA 363
           + +N++ +F +DNGG      L   +NWPLRG K +LWEGG+RG G + SPLL+ +GI  
Sbjct: 176 LWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGIRGVGFVASPLLKQKGIKN 231

Query: 364 EQYVHVSDWLPTLLSAANKS 383
            + +H+SDWLPTL++ A  S
Sbjct: 232 RELIHISDWLPTLVTLAGGS 251


>gi|323454261|gb|EGB10131.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 635

 Score =  226 bits (576), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 136/352 (38%), Positives = 188/352 (53%), Gaps = 37/352 (10%)

Query: 59  PPHIIFILADDLGWNDVGF----HGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           PPH++ ILADD+G+ND+G+    H  +Q+ TP +D LA  G+ L  YYT   C+PSR A+
Sbjct: 117 PPHLVLILADDMGYNDIGYNRAPHQTNQVSTPFLDELASEGVTLTRYYTQCDCSPSRGAL 176

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TG +P  TG+ H V+      GLPL   ++PQ+L    YR+  +GKW +G Y   + PT
Sbjct: 177 LTGLYPASTGLYHGVIVTQSHWGLPLEYHLIPQFLPSR-YRSHAIGKWDVGHYTWNHVPT 235

Query: 175 FRGFESHLGYWTGHQDYFDH----------SAEEMKMWGLDMRRDL-----EPAWDLHGK 219
            RGF S++G++    DY+ H           +  +K    D   DL         + + +
Sbjct: 236 GRGFHSYVGFYGTDIDYYTHEIGAGCNSYNCSSAIKRCMNDSITDLNYDGAATGDEYYNR 295

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF--- 276
           YSTD+FT  AV+++   S   PLFLY+A  A H      PL+A D    I R    F   
Sbjct: 296 YSTDIFTDRAVELLRTESARNPLFLYVAFNAVHG-----PLEADD--ATIDRFNASFLYE 348

Query: 277 ---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
              +R+ FAA    +D +V  +V+ +       NSI++F SDNG   A       SNWPL
Sbjct: 349 ARTERATFAAAALLMDRAVESIVKTMHAVGAYDNSIVLFASDNGATLA--QTGGGSNWPL 406

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKS 383
           RG K T +EGGVR    + SPLL S  RGI      HV+D LPTL+ AA  S
Sbjct: 407 RGSKFTPYEGGVRVPAFLHSPLLGSGRRGITHAGLFHVTDILPTLVHAAGGS 458


>gi|171910063|ref|ZP_02925533.1| twin-arginine translocation pathway signal precursor
           [Verrucomicrobium spinosum DSM 4136]
          Length = 496

 Score =  225 bits (574), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 125/324 (38%), Positives = 179/324 (55%), Gaps = 16/324 (4%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+FILADDLG +DVG+ G  +I TP++D LA +G  L  +Y   +C+P+R+A++TG++
Sbjct: 53  PNIVFILADDLGSHDVGWRG-SEIKTPHLDELARAGATLDQFYVQPVCSPTRAALLTGRY 111

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P   G Q  V+      GLPL E+ LPQ LKE GY T I GKWHLG ++  Y PT RGF+
Sbjct: 112 PFRYGFQTGVVRPWAEYGLPLEERTLPQALKEAGYETAITGKWHLGHFQPAYLPTKRGFD 171

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
              G++ G  DY+ H    ++  G D  R+ +   D    YST++   EA   +      
Sbjct: 172 HQYGHYNGMLDYYTH----IRHGGFDWHRNDQENHDE--GYSTELVGKEAARRVRERDKS 225

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLY+     HS     P Q P  Y   +   E  KR  +A ++  LD +VG++V+A+
Sbjct: 226 RPLFLYVPFNGVHS-----PHQVPVEYETPYTQFEG-KRKTYAGMVTALDAAVGRIVQAV 279

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
              +++ N++ +F SDNGG          SN PLR  K T++EGGVR       P     
Sbjct: 280 RDEKLVENTLFIFSSDNGGPNPK---QLTSNGPLRAGKGTVYEGGVRVCAFATWPGKIKP 336

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
           G   +  +H++DW PTLL  A  S
Sbjct: 337 GTSVQAPIHIADWFPTLLGLAGAS 360


>gi|390362801|ref|XP_793927.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 739

 Score =  225 bits (573), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 113/271 (41%), Positives = 160/271 (59%), Gaps = 21/271 (7%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH V+   +   LPL E  LPQ LKE GY T +VGKWH+GFY    TPT RGF+S
Sbjct: 318 IHTGLQHGVIRHAQPNCLPLDEVTLPQKLKESGYATNMVGKWHIGFYLDACTPTERGFDS 377

Query: 181 HLGYWTGHQDYFDHS----------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
             G+  G +D++ HS               + G D+  +  P +D  G+YST +FT + +
Sbjct: 378 FFGFLVGGEDHYTHSGHCEVYNADDGSTKTLSGHDLHANKTPVFDYEGQYSTHLFTNKTI 437

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
           D I  H   +PLF+YLA+ A H      PLQ PD Y++ +++I D  R  +A ++  +DE
Sbjct: 438 DAIERHDKTKPLFMYLAYQAVHL-----PLQVPDRYMDPYKNIADENRRTYAGMVSCMDE 492

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
            +G V  AL+   +  N+II+F +DNGG       + A+NWPLRG K  +WEGG+ G G 
Sbjct: 493 GIGNVTRALKDAGLYDNTIIIFSTDNGGPIQ----SGANNWPLRGSKGNMWEGGIHGVGF 548

Query: 351 IWSPLLES--RGIVAEQYVHVSDWLPTLLSA 379
           + SPLL +  +G V    +HVSDWLPT+++ 
Sbjct: 549 VHSPLLPTSVKGTVNHDLMHVSDWLPTIVAG 579


>gi|291221683|ref|XP_002730830.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 499

 Score =  225 bits (573), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 136/351 (38%), Positives = 190/351 (54%), Gaps = 29/351 (8%)

Query: 44  FTLSMVFVDL---------VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           + L  VFV L         +A +  PHI+ I  DDLGWNDV +H  D I  P +  LA  
Sbjct: 2   YCLYFVFVVLALPYISKPCIAKAEQPHIVLIFVDDLGWNDVEWHNPD-IKMPVLSKLAAD 60

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           G+I    YT   CTPSR+A+MTG +P  TG QH +++     G+PL  K+LP+ LKE+GY
Sbjct: 61  GVIFNQSYTHPTCTPSRAAMMTGMYPFRTGNQHQMIFNLHPSGVPLEFKLLPEKLKEVGY 120

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT---GHQDYFDHSAEEMKMWGLDMRRDL- 210
            T +VGKWHLGF K+EY PT RGF+SH G WT   GH D  +      +  G D R ++ 
Sbjct: 121 FTHMVGKWHLGFCKEEYLPTSRGFDSHYGLWTLGVGHYDKMNGVLSPSE--GYDFRDNIG 178

Query: 211 -EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
             P  D   +Y T +    A  II+ H    PLFL           P + L+ PD +  +
Sbjct: 179 VVPKSD---EYLTLMLAERAEHIINGHYNKHPLFLQFTMDI-----PAKHLEIPDTFEAL 230

Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
           +  IED +  KF   L  +D  +G+VV+AL+ R M  +++++F+ DNG  A+     A S
Sbjct: 231 YCDIEDERTRKFYGKLSLMDHVIGRVVDALKAREMWDDTVLMFIGDNGALAS----QAGS 286

Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           N+P RG+  TL+EG  R   LI   +L   G V  +  ++ D   T+L+ A
Sbjct: 287 NYPFRGIAGTLFEGATRVPSLIHGSMLRETGYVNNELWNLVDLHRTILALA 337


>gi|432104048|gb|ELK30879.1| Arylsulfatase J [Myotis davidii]
          Length = 496

 Score =  225 bits (573), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 117/262 (44%), Positives = 156/262 (59%), Gaps = 11/262 (4%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+V+   +   LPL    LPQ L+E GY T +VGKWHLGFY++E  PT RGF++
Sbjct: 33  IHTGLQHSVIRPAQPNCLPLGHATLPQKLREAGYATHMVGKWHLGFYRRECMPTHRGFDT 92

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHST 238
             G   G  DY+ H   +     G D+  +   AWD   G YST ++T     I+  H  
Sbjct: 93  FFGSLLGSGDYYTHYKCDGPGACGYDLYENDRAAWDHDDGVYSTHMYTRRVQQILAAHDP 152

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
            +PLFLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+VG V  A
Sbjct: 153 GKPLFLYIAYQAVHS-----PLQAPSRYFRHYRSIANTHRRRYAAMLSCLDEAVGNVTLA 207

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L    + +NSI+++ SDNGG  +       SNWPLRG K T WEGG+R  G + SPLL  
Sbjct: 208 LRAHGLYNNSILIYSSDNGGQPSA----GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLRQ 263

Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
           RG V  + VH++DW PTL++ A
Sbjct: 264 RGTVCRELVHITDWYPTLVALA 285


>gi|171909641|ref|ZP_02925111.1| twin-arginine translocation pathway signal precursor
           [Verrucomicrobium spinosum DSM 4136]
          Length = 486

 Score =  225 bits (573), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 134/356 (37%), Positives = 194/356 (54%), Gaps = 27/356 (7%)

Query: 42  LAFTLSMVFVD---LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           LA TL+   +    L A+   P+II I+ADDLGW DVGF+G   I TP++DALA  G   
Sbjct: 9   LAGTLAASLLPQLPLAAAPSAPNIIHIVADDLGWQDVGFNGCKDIQTPHLDALAKGGARF 68

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
             +Y   +CTP+R+A+MTG++P   G+Q  V+      GL   E +LPQ L++ GY T I
Sbjct: 69  TQFYVQPMCTPTRAALMTGRYPFRYGLQTAVIPSVSTYGLDTGEYLLPQCLQDAGYTTAI 128

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
           +GKWHLG   K++ P  RGFE   G   G  DY+ HS   +    LD  RD EP  +   
Sbjct: 129 IGKWHLGHADKKFWPKQRGFEYQYGAMIGELDYYTHSEHGV----LDWFRDNEPVHE--E 182

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
            Y+T++  A+AV  +     D P +LYLA  A H+     P QAP  Y++ + HI D  R
Sbjct: 183 GYTTNLLGADAVKYLEKQKADRPFYLYLAFNAPHT-----PYQAPQEYIDRYTHIADPTR 237

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA----AAGFNLNAA------ 328
             +A ++  LD+++G+VV AL+++ +  N++I+F SDNGG      AG   + +      
Sbjct: 238 RTYAGMVACLDDNIGRVVAALDKKGLRENTLILFHSDNGGTHNPMFAGVMADVSKIKIPC 297

Query: 329 SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            N P R  K +L+EGGVR   L  W   ++   +  +  +H +D   T    A  S
Sbjct: 298 DNSPYRDGKGSLFEGGVRVCALANWPGKIKPATV--DGLIHATDIFTTFTKLAGAS 351


>gi|298706912|emb|CBJ29739.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 781

 Score =  225 bits (573), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 132/348 (37%), Positives = 187/348 (53%), Gaps = 53/348 (15%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIP-TPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           S+  P++ F L DD+G+ D+G+   D    TPN+DALA  G+ L NYYT+ LCTP+R++I
Sbjct: 213 STSKPNVFFFLIDDMGFGDMGYQSTDLSEITPNLDALAAGGVKLSNYYTMTLCTPARASI 272

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           MTG++P+  GMQ++V+      GLP SEKILP+Y+ E GY + +VGKWHLG Y+ E  P+
Sbjct: 273 MTGRYPVRYGMQYSVIMPGSPWGLPTSEKILPEYMNEAGYESHMVGKWHLGSYRDESLPS 332

Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKM--------------------------------- 201
            RGF++ LGY  G + Y+ H   E  +                                 
Sbjct: 333 QRGFKTFLGYLNGIETYYSHKNPEASVDGQYFFDFGYGNATGYHDVTLQNHDENVGGPCT 392

Query: 202 -----WGLDMRRDLEPA-WDLHGKYSTDVFTAEAVDIIHNHST--DEPLFLYLAHAATHS 253
                WG D+  + +PA     G YSTD F   A  I+ + +   ++PLF+Y+AH + HS
Sbjct: 393 DGGPRWG-DVMENEDPADVCFTGTYSTDAFVGRAKQIVKSKAPFDEDPLFMYIAHQSVHS 451

Query: 254 AN---PYEPLQAPDHYL--NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNS 308
                PYE     +  L   +    E  KR+ FA +L  LD+ +G+ V+ LEQ   L NS
Sbjct: 452 PTGPAPYEEFSPEELALLEGVKDATESEKRTAFAGVLLYLDKRIGEFVDMLEQEGWLENS 511

Query: 309 IIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
           +IV  SDNG        +  SN+PLRG K +++EGG R  G   SP L
Sbjct: 512 VIVVSSDNGACPD----DGGSNYPLRGSKQSVFEGGSR-TGYQPSPAL 554


>gi|443692244|gb|ELT93884.1| hypothetical protein CAPTEDRAFT_107177, partial [Capitella teleta]
          Length = 328

 Score =  224 bits (570), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 129/331 (38%), Positives = 184/331 (55%), Gaps = 22/331 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+ +FILADDLG+ND+GF   D + TPN+D LA  G+IL N Y   +CTPSR A+MTG++
Sbjct: 1   PNFVFILADDLGYNDIGFRNPD-VQTPNLDYLANKGVILTNNYVQAVCTPSRHALMTGRY 59

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P  + MQ+ V+   +     L  K LPQYLKELGY+  ++GKW+LG+ ++E  PT RGF+
Sbjct: 60  PYRSAMQNFVINPDQAKCTALEYKFLPQYLKELGYQNHLIGKWNLGYCREECLPTSRGFD 119

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS----TDVFTAEAVD---- 231
           S  G   G  DY++H+   +          +    +   K S       F    +D    
Sbjct: 120 SFFGLLDGAGDYWEHTTYGLYDCTGQSLAGMACLCEFTQKISILMPVICFQDLELDRLDK 179

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK--RSKFAAILHKLD 289
           I   H   +PLFLY A       NP+ P Q  D +LN +   +DF+  R ++ A+   LD
Sbjct: 180 IFTEHDNKQPLFLYFA-----PQNPHTPSQPTDEFLNKYS-ADDFEDVRRRYLALTSGLD 233

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
             VGKVVE L    M+ N+ ++F+SDNG   A        N P RG K +L+EGG + + 
Sbjct: 234 SMVGKVVELLNDNHMMDNTYLIFLSDNGADPA-----EGLNAPFRGGKGSLFEGGTKSSS 288

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            I+SPLL+      +  +H++DWLPTL+  A
Sbjct: 289 FIYSPLLKKTEYENDGLMHITDWLPTLVKLA 319


>gi|443692243|gb|ELT93883.1| hypothetical protein CAPTEDRAFT_107171, partial [Capitella teleta]
          Length = 330

 Score =  224 bits (570), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 125/329 (37%), Positives = 184/329 (55%), Gaps = 21/329 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+ +FILADDLG+ND+G    D I TPN+DALA  G+IL N Y   +C+PSR A+MTG++
Sbjct: 3   PNFVFILADDLGYNDLGLRDPDVI-TPNMDALASKGVILTNNYVQAVCSPSRHALMTGRY 61

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P  + MQ  V+   E     L  K LPQYLK+LGY+  ++GKWHLG+ ++E  PT RGF+
Sbjct: 62  PSASAMQSIVIQPMEAKCSGLKYKFLPQYLKDLGYKNHMIGKWHLGYCREECLPTSRGFD 121

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYST------DVFTAEAVDII 233
           +  G +    DY++H    M  W  +   D E A   H +  +      D+       + 
Sbjct: 122 TFYGLYASSGDYWEHGIMGMYDWHTEAGVDFE-ARGTHAQVGSRYWHIYDLEMERLDKVF 180

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH--IEDFKRSKFAAILHKLDES 291
             H   +PLFLY A   +H+     P Q  D ++N++      D +R K+  +   LD  
Sbjct: 181 DEHDNKDPLFLYFAPQNSHT-----PHQPTDEFINLYSEDKFSDIRR-KYLGLTSGLDAM 234

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           VGK+V+ L +  M++N+ ++FVSDNG            N P RG K +L+EGG + +  I
Sbjct: 235 VGKIVDKLTENDMMNNTYVIFVSDNGADPI-----EGRNAPFRGGKGSLFEGGTKSSSFI 289

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           +SPLL+      +  +H++DWLPTL+  A
Sbjct: 290 YSPLLKKTEYENDGLMHITDWLPTLVKLA 318


>gi|291237236|ref|XP_002738543.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 514

 Score =  223 bits (568), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 128/329 (38%), Positives = 183/329 (55%), Gaps = 21/329 (6%)

Query: 62  IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
           II +     GWNDVG+H  D +PTPN++ LA  G+IL N Y+  +CTPSR A+MTGK+P 
Sbjct: 41  IIILTISASGWNDVGWHN-DFMPTPNLNTLAREGVILDNMYSQPICTPSRVALMTGKYPA 99

Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
             GMQH V+       LP +   L + LKE GY   IVGKWHLG    +YTP +RGF+SH
Sbjct: 100 KVGMQHFVVLPMRPYYLPGNYATLAEKLKEQGYTNHIVGKWHLGSCDWKYTPMWRGFDSH 159

Query: 182 LGYWTGHQDYFDHSAEEMKMW-------GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
            G   G    F+     M  W       G D+R +       +G ++T +F+  A  I+ 
Sbjct: 160 YGCHEGVTSNFE---THMLTWPPVVGVSGRDLRDNTGLVTHENGTHNTMLFSERAERIVK 216

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
           NH+ + PLFLY+ + A     P+ PLQAP  +      ++D  R  FA ++  +D+ VG 
Sbjct: 217 NHNPESPLFLYVPYMA-----PHFPLQAPQGFEEA-VQLDDTDRRTFAGMMAAMDQGVGN 270

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           + +AL+ + M  +++ VF+SDNGG      + A SN+P RG K T+WEGGV+    I   
Sbjct: 271 ITQALKDKGMWDDTLFVFLSDNGGD----KMFAGSNYPYRGNKATMWEGGVKVPAFIRGS 326

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           +L++ G V  +  H +D   TLL  A  +
Sbjct: 327 MLKTSGTVNNELYHFTDIFATLLKVAGGT 355


>gi|115533416|ref|NP_001041231.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
 gi|351060347|emb|CCD68015.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
          Length = 488

 Score =  223 bits (568), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 128/345 (37%), Positives = 194/345 (56%), Gaps = 27/345 (7%)

Query: 51  VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY--SGIILKNYYTVQLCT 108
           VD   ++  P+++FI+ADDLG++DV +     + TPN+  LA+  +  +L N Y  QLCT
Sbjct: 22  VDGQTATQKPNVLFIMADDLGFSDVDWKD-STLHTPNLRHLAFHKNTALLSNSYVNQLCT 80

Query: 109 PSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
           P+RSA MTG +P   G Q+ V    E  G+P     L + +++L Y T +VGKWHLG+ K
Sbjct: 81  PTRSAFMTGYYPFRVGTQNGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCK 140

Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAE----EMK--MWGLDMRRDLE-----PAWDLH 217
           KE+ PT RGF+   G++     YF+HSA+    E+K  + GLD+  ++      P +  +
Sbjct: 141 KEFLPTNRGFDYFYGFYGPQTGYFNHSADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQN 200

Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF- 276
           G YSTD+FT  A+ ++ NH+  +P F++L++ A H      PLQ       I +  E   
Sbjct: 201 GVYSTDLFTDVAMSVLDNHNNSKPFFMFLSYQAVH-----PPLQVSQQSKTIGQGKEATF 255

Query: 277 ---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
                +    +L  +D ++G++VE L+   +  N++IVF SDNGG A       ASN PL
Sbjct: 256 ILRSHAHSTRMLTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGGTAN----FGASNAPL 311

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
           RG K+T+WEGG +    + SP+    G   +   HV DW  T+LS
Sbjct: 312 RGEKDTIWEGGTKTTTFVHSPMYIEEGGTRDMMFHVVDWHATILS 356


>gi|291233691|ref|XP_002736785.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 499

 Score =  223 bits (567), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 124/322 (38%), Positives = 178/322 (55%), Gaps = 11/322 (3%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHI+ ILADDLGWNDVG+H   ++  P ++ LA  G+I    Y    CTP+R+A+M+G 
Sbjct: 30  PPHIVLILADDLGWNDVGWHN-PEVKMPVLNQLAADGVIFNQAYVQPTCTPTRAALMSGY 88

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +P  TG QH +L     GGLPL  K LPQ LK++GY T IVGKWHLGF K+ + PT RGF
Sbjct: 89  YPFKTGNQHQLLLNLHPGGLPLRFKTLPQRLKDVGYLTHIVGKWHLGFCKEAFLPTNRGF 148

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           +S  G  T    +F      +   G D   D          Y   +    AV II+ H  
Sbjct: 149 DSFYGGLTLGTSHFSKMNGILSTPGYDF-YDNSGVVPQTNDYLAFMLADRAVKIINGHYQ 207

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
           + PLF+Y +        P +  + P  Y  ++ +I D +  +F   L  +D++VG VV+A
Sbjct: 208 EYPLFMYFSMDV-----PAKSPEVPPEYEALYANITDDRTRRFYGKLSVMDDAVGSVVDA 262

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L++R M  +++++F+ DNG  A+    ++ SNWPLRG+  TL+EG  R    I   +L+ 
Sbjct: 263 LKKRDMWDDTLLIFIGDNGALAS----SSGSNWPLRGIAATLFEGATRVPAFIHGNMLKQ 318

Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
            G    +  H+ D   T+LS A
Sbjct: 319 TGYENNELWHIVDLHKTILSLA 340



 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/124 (33%), Positives = 61/124 (49%), Gaps = 22/124 (17%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS---ALTKGKWKLVKVVKVMRYQVDLTG 630
           EIDG+D+W   S+NE S RN I++NIDD+  IS   A+  G +KL+             G
Sbjct: 347 EIDGMDMWDTFSKNESSPRNEIVYNIDDD-PISPGAAIRVGDYKLI------------AG 393

Query: 631 GPDQVYLSGLSDRE--WLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKN 688
            PD  Y   L ++   W           + +   P  + P  P +   LF++ +DP E+N
Sbjct: 394 NPDLFYPLRLINQSDGWFNYGDVP---TSGLPVLPTGDEPPPPNVT-YLFNVIDDPEERN 449

Query: 689 NLAD 692
           N+AD
Sbjct: 450 NIAD 453



 Score = 45.4 bits (106), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 24/49 (48%), Positives = 33/49 (67%), Gaps = 4/49 (8%)

Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS---ALTRGKWKLVKEN 554
           EIDG+D+W   S+NE S RN I++NIDD+  IS   A+  G +KL+  N
Sbjct: 347 EIDGMDMWDTFSKNESSPRNEIVYNIDDD-PISPGAAIRVGDYKLIAGN 394


>gi|403049780|ref|ZP_10904264.1| sulfatase [SAR86 cluster bacterium SAR86D]
          Length = 515

 Score =  222 bits (566), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 134/361 (37%), Positives = 197/361 (54%), Gaps = 40/361 (11%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           M F  L L FTLS+           P++I IL DDLGW DV +H    IPTPNID+   +
Sbjct: 1   MKFYSLVLFFTLSL--------CSQPNVIIILTDDLGWGDVSYHN-GFIPTPNIDSFVSN 51

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLY-GCERGGLPLSEKILPQYLKELG 153
           GI L  +Y    C+P+R++++TG H  + G+    +    E+ GLP   KI+P+Y KE G
Sbjct: 52  GIELNRFYANPTCSPTRASLLTGLHIFNHGVIRPFMNPSAEQTGLPEHLKIMPEYFKEAG 111

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW---GLDMRRDL 210
           Y+T + GKWHLG +K+EY PT RGF+S  G+  G   Y+DH       W   G+ +  D 
Sbjct: 112 YQTALSGKWHLGMHKEEYLPTNRGFDSSYGHMLGGIGYYDHVHTNRMDWHRDGVSLNED- 170

Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
                    YST++   EA++II N   D PLFLY+A  A H+     P++AP+  +N  
Sbjct: 171 --------GYSTELIADEAINIIKNKDDDRPLFLYVAFNAPHT-----PIEAPEEDVNNF 217

Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA----------A 320
            +IED     +AA + KLD  +G+++ +++   +L  +II+F+SDNG            A
Sbjct: 218 LYIEDELDRNYAANISKLDIEIGRIINSIKDLGILEETIIIFLSDNGPVFDINPIVKTIA 277

Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVR-GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
            G      S   LRG K +  +GG+R  A + W  +LE     ++Q++ + D LPTLL+A
Sbjct: 278 PGLTKAKGSTAGLRGSKTSALDGGIRVPAAIWWKGVLEKSK--SDQFIFIQDLLPTLLTA 335

Query: 380 A 380
            
Sbjct: 336 T 336


>gi|390361328|ref|XP_780209.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 469

 Score =  222 bits (566), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 124/332 (37%), Positives = 179/332 (53%), Gaps = 55/332 (16%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHG-LDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           AS   PHI+F++ DDLG+NDVG+H     I T NIDALA  G+ L++YY   LCTPSRS 
Sbjct: 24  ASPPTPHIVFVMVDDLGYNDVGYHSDGSAIETDNIDALAAGGLKLESYYVAPLCTPSRSQ 83

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            ++GK+ IH GMQH V+       LPL +  +   L + GY T +VGKWHLGFYK+E  P
Sbjct: 84  FLSGKYLIHNGMQHLVIDPRVPRCLPLGDDTMANKLTDAGYATHLVGKWHLGFYKQECWP 143

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
             RGF+S  G                                L G+ + D          
Sbjct: 144 LNRGFQSFFGM-------------------------------LLGQSTVD---------- 162

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR-HIEDFKRSKFAAILHKLDESV 292
                 +P+F+YL++ A     P+ P + PD Y   +R  I +  R  +A ++  LDES+
Sbjct: 163 --QGGTKPMFMYLSYQA-----PHLPFEVPDEYFVSYRGKINNRNRRTYAGMVTMLDESI 215

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           GK+ + L++  + ++++ +F +DNGG       NA +NWPLRGVK   +EGG+RG G + 
Sbjct: 216 GKLTDTLKEEGLWNDTVFIFSTDNGGVG---KKNAGNNWPLRGVKGNYFEGGIRGVGFVA 272

Query: 353 SPLLES--RGIVAEQYVHVSDWLPTLLSAANK 382
            PLL +  +G ++   +H+SDW PTL+    K
Sbjct: 273 GPLLSTNVQGTISTDLMHISDWYPTLVEGVAK 304



 Score = 41.2 bits (95), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 28/127 (22%), Positives = 58/127 (45%), Gaps = 11/127 (8%)

Query: 575 IDGIDVWSVLSRNEPSKRN-TILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
           +DG+++W V+S+ E    +  I++NID + Q  A    K+     +  +     +  G  
Sbjct: 314 LDGVNMWDVISQGESGDPDREIVYNIDPDMQRPASIDSKYDSYSTIFDISIHASIRRGDM 373

Query: 634 QVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADR 693
           ++     +D +WL          A ++ G       +   A  L++I  DP E+++L+D 
Sbjct: 374 KLVTGYQADTDWL----------APVESGFANVYATDAGKAIWLYNITADPTEQDDLSDT 423

Query: 694 SEDQRIN 700
             D+ ++
Sbjct: 424 MADEVVD 430


>gi|424863174|ref|ZP_18287087.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
 gi|400757795|gb|EJP72006.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
          Length = 519

 Score =  222 bits (565), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 138/355 (38%), Positives = 202/355 (56%), Gaps = 32/355 (9%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L + L +VF   + SS  P+II I+ DDLGW DV ++G   I TPNID LA  G+ +  +
Sbjct: 2   LKYILILVFSISLLSSERPNIIIIITDDLGWGDVSYNG-GPINTPNIDKLADDGLQMNRF 60

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLY-GCERGGLPLSEKILPQYLKELGYRTRIVG 160
           Y+   C+P+R+A+ TG + +  G+   +     ER GLPL  KILP+YLKE+GY+T + G
Sbjct: 61  YSAPTCSPTRAALFTGINSLKNGIIRPLNNPTAERYGLPLKHKILPEYLKEIGYQTALSG 120

Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
           KWHLG +  EY P  RGFES  G+  G   YFDH+        LD  R+ E  ++    Y
Sbjct: 121 KWHLGMFSDEYLPRNRGFESTYGHLGGGIGYFDHALSGR----LDWHRNGEILYE--DGY 174

Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
           ST +   EA+ II N + + PLFLY+A  A H+     P+QA +  +N    I D K   
Sbjct: 175 STTLIADEAIRIIENKNNETPLFLYVAFNAPHT-----PIQAEEKIINNLSDISDKKERV 229

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN--AASNWP------ 332
           +AA +  LD  +GK+++A+E + +L+ +II+F SDNG     F+++  AA+  P      
Sbjct: 230 YAANIITLDREIGKIIDAIESKGILNETIIIFFSDNGPV---FDIDPIAATIAPDILDSK 286

Query: 333 -----LRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
                L+G K + +EGG+R   +I +  +LE    ++ Q+  V D LPTL  A N
Sbjct: 287 GNTLGLKGSKGSAYEGGIRVPAVIYYKGILEKS--LSNQFFFVDDILPTLFGALN 339



 Score = 40.4 bits (93), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 17/26 (65%), Positives = 21/26 (80%)

Query: 9   CERGGLPLSEKILPQYLKELGYRTRI 34
            ER GLPL  KILP+YLKE+GY+T +
Sbjct: 93  AERYGLPLKHKILPEYLKEIGYQTAL 118


>gi|149726732|ref|XP_001501521.1| PREDICTED: arylsulfatase I-like [Equus caballus]
          Length = 476

 Score =  221 bits (563), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 113/267 (42%), Positives = 165/267 (61%), Gaps = 10/267 (3%)

Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
           ++ IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RG
Sbjct: 8   RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRG 67

Query: 178 FESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           F++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++      I+ +H
Sbjct: 68  FDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASH 127

Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
           S  +PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  + 
Sbjct: 128 SPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNIT 182

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
            AL++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL
Sbjct: 183 WALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLL 238

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAANKS 383
           + +   +   VH++DW PTL+  A  +
Sbjct: 239 KRKRRTSRALVHITDWYPTLVGLAGGT 265


>gi|241680395|ref|XP_002412684.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215506486|gb|EEC15980.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 461

 Score =  220 bits (561), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 130/329 (39%), Positives = 181/329 (55%), Gaps = 15/329 (4%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+ I+ADDLGW DV FHG  QIPTPNID LA  G+ L +YY       +RSA+MTG +
Sbjct: 9   PHIVVIVADDLGWGDVSFHGSRQIPTPNIDTLAGDGLALHHYYVQSSGAATRSALMTGLY 68

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTG++H          +PLS  I+  ++        +V   H+GF  +  +       
Sbjct: 69  PIHTGIKHASHICISARLIPLSGGIVRCHIVLTIKYAFMVNIPHIGFLFEHISKGAHAVN 128

Query: 180 SHL---GYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           +     GYW+     F H+      + L +  + E  +    ++      +    I   H
Sbjct: 129 ASWWTNGYWSKR---FIHA------FFLQLFMNAE-IFFFFNQWPCREMRSPKWKIAPLH 178

Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
           +   PL LYLAH A H  N Y PL+AP  Y+   R+I+D +R  FA ++  LD+SVG V 
Sbjct: 179 NCSVPLLLYLAHQAVHVGNAYHPLEAPSSYVKKFRYIKDLRRRIFAGMVAALDQSVGDVF 238

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
            AL+++ +L ++I++  SD+GGAA G      SNWPLRG K TLWEGGVR  GL+WS  L
Sbjct: 239 RALQRKGILEDTIVLLTSDSGGAAGGVEQGVGSNWPLRGTKKTLWEGGVRAVGLLWSAKL 298

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
            +RG VA Q +H S + P L +A +  DI
Sbjct: 299 -ARGRVASQLIHRSAYFP-LTNADSCGDI 325



 Score = 49.3 bits (116), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 35/56 (62%)

Query: 561 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
           T+ +   D +    IDG D+W+VL  + PS R  +L+NID  W+++A+  G +KLV
Sbjct: 317 TNADSCGDITDLGSIDGKDLWTVLVDDRPSPRTELLYNIDYRWKLAAIRVGPYKLV 372



 Score = 49.3 bits (116), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 7/77 (9%)

Query: 496 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENS 555
           T+ +   D +    IDG D+W+VL  + PS R  +L+NID  W+++A+  G +KLV    
Sbjct: 317 TNADSCGDITDLGSIDGKDLWTVLVDDRPSPRTELLYNIDYRWKLAAIRVGPYKLVL--- 373

Query: 556 INGNGTSENRSNDNSYQ 572
               G S +   D+ Y+
Sbjct: 374 ----GASSSDQFDDWYE 386


>gi|313213139|emb|CBY36997.1| unnamed protein product [Oikopleura dioica]
          Length = 532

 Score =  220 bits (560), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 132/346 (38%), Positives = 189/346 (54%), Gaps = 24/346 (6%)

Query: 46  LSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ 105
           L       V ++   H++ I+ADD GW DV FH    I TPNID L   G+ L NYYT  
Sbjct: 9   LFWYLFSFVYATQAKHVLVIIADDFGWADVSFHNTGGIQTPNIDRLVGGGLELTNYYTQH 68

Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR-IVGKWHL 164
           +CTP+RSA++TG++PIHTG+Q NV+   +  GL   E +LP+YL+    + R +VGKWH+
Sbjct: 69  ICTPTRSALLTGRYPIHTGLQTNVIAISQSSGLQRDEMLLPEYLESCDIKQRHMVGKWHV 128

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYF--DHSAEEMKMWGLDMRRDLE----PAWDLHG 218
           G      TP  RGFE+  GY  G +D++  +   E+    G+D          P     G
Sbjct: 129 GHGHSWMTPWKRGFETFSGYLAGAEDHYTREWCMEQTDWCGVDYSEHSASLSGPTNSSWG 188

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
           KYS D++  +  +I+ +    +  F+Y A    H      PLQA + +L  +  I+D  R
Sbjct: 189 KYSGDLYLQKMSEIVKSIDPTKDSFIYFAPQHVHY-----PLQALERHLVKYSWIKDTNR 243

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
             +AA++  LDE VG VV+   + R+L N+I  FV+DNGG       +  +NWPL G K 
Sbjct: 244 RNYAAMVASLDEIVGDVVDLYHKNRLLKNTIGFFVADNGGETR----DGGNNWPLSGQKW 299

Query: 339 TLWEGGVRGAGLIWS---PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           TL EGGV+  G ++    PL +  G+      HVSD LPT++ A N
Sbjct: 300 TLLEGGVKATGFMFGGLVPLGKYNGL-----FHVSDILPTVMEAFN 340


>gi|313234414|emb|CBY24613.1| unnamed protein product [Oikopleura dioica]
          Length = 532

 Score =  220 bits (560), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 131/346 (37%), Positives = 189/346 (54%), Gaps = 24/346 (6%)

Query: 46  LSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ 105
           L       V ++   H++ I+ADD GW DV FH    I TPNID L   G+ L NYYT  
Sbjct: 9   LFWYLFSFVYATQAKHVLVIIADDFGWADVSFHNTGGIQTPNIDRLVGGGLELTNYYTQH 68

Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR-IVGKWHL 164
           +CTP+RSA++TG++PIHTG+Q NV+   +  GL   E +LP+YL+    + R +VGKWH+
Sbjct: 69  ICTPTRSALLTGRYPIHTGLQTNVIAISQSSGLQRDEMLLPEYLESCDIKQRHMVGKWHV 128

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYF--DHSAEEMKMWGLDMRRDLE----PAWDLHG 218
           G      TP  RGFE+  GY  G +D++  +   E+    G+D          P     G
Sbjct: 129 GHGHSWMTPWKRGFETFSGYLAGAEDHYTREWCMEQTDWCGVDYSEHSASLSGPTNSSWG 188

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
           KYS D++  +  +I+ +    +  F+Y A    H      PLQA + +L  +  I+D  R
Sbjct: 189 KYSGDLYLQKMSEIVKSIDPTKDSFIYFAPQHVHY-----PLQALERHLVKYSWIKDTNR 243

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
             +AA++  LDE VG V++   + R+L N+I  FV+DNGG       +  +NWPL G K 
Sbjct: 244 RNYAAMVASLDEIVGDVIDLYHKNRLLKNTIGFFVADNGGETR----DGGNNWPLSGQKW 299

Query: 339 TLWEGGVRGAGLIWS---PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           TL EGGV+  G ++    PL +  G+      HVSD LPT++ A N
Sbjct: 300 TLLEGGVKATGFMFGGLVPLGKYNGL-----FHVSDILPTVMEAFN 340


>gi|432963754|ref|XP_004086820.1| PREDICTED: uncharacterized protein LOC101172478 [Oryzias latipes]
          Length = 873

 Score =  219 bits (559), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 116/269 (43%), Positives = 160/269 (59%), Gaps = 12/269 (4%)

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
           ++    + IHTG+QH+V+   +   LPL    LPQ L++ GY T +VGKWHLGFYK+   
Sbjct: 415 SVFHSGYQIHTGLQHSVIRATQPNCLPLENVTLPQKLRDAGYSTHMVGKWHLGFYKRGCL 474

Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMK-MWGLDMRRDLEPAWDL-HGKYSTDVFTAEAV 230
           PT RGF+S  G   G  DY+ H   E+  M G D+    E AW+   G YST +FT +AV
Sbjct: 475 PTQRGFDSFFGSLLGSGDYYSHYKCEVPGMCGYDLYEGEEAAWEQDRGLYSTVMFTRKAV 534

Query: 231 DII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
            I+ H++    PLFLYLA+ A HS     PLQ P  YL  ++ I +  R K+AA++  LD
Sbjct: 535 SILAHHNPRAHPLFLYLAYQAVHS-----PLQVPARYLERYKGISNLSRRKYAAMVSCLD 589

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           E++  +  AL++     N++IV+ SDNGG      L   SNWPLRG K T WEGG+R  G
Sbjct: 590 EAIHNLTLALKRYGYYDNTVIVYSSDNGGQP----LAGGSNWPLRGSKATYWEGGIRAVG 645

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
            + SPLL  +G      +H++DW PTL++
Sbjct: 646 FVHSPLLMKKGTKCRSLIHITDWFPTLVT 674


>gi|443703066|gb|ELU00815.1| hypothetical protein CAPTEDRAFT_95989 [Capitella teleta]
          Length = 382

 Score =  219 bits (559), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 122/328 (37%), Positives = 184/328 (56%), Gaps = 16/328 (4%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PP+II+I  DD G+NDVGF   + I +PN+DALA SGIIL N YT   C+PSR + ++G+
Sbjct: 20  PPNIIWIFGDDYGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFLSGR 78

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +   + MQH V+   +   L L   ILP YLKELGY T   GKWHLG+ + E TPT RGF
Sbjct: 79  YSYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGF 138

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           +S  G ++G  ++++H+      +  D     E  +D  GK+S D+        +  +  
Sbjct: 139 DSFSGGFSGEGEFYEHTTATGGYY--DWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEYDQ 196

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE---DFKRSKFAAILHKLDESVGKV 295
             PLF+Y+A    HS     PL     +L ++  ++   D+    F+ ++  +D  +G +
Sbjct: 197 SSPLFMYVAFHNVHS-----PLDPKPEFLALYDDLDASTDYSGFFFSGLVSGMDYIIGGI 251

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
           V+ +++  +  N+ I+F SDNGG     ++    N P RG K+TLWEGG +      SPL
Sbjct: 252 VDKIKEIGIYENTYILFSSDNGG-----DVGEGDNSPHRGGKSTLWEGGCKANSWTHSPL 306

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAANKS 383
           L + G+     +HV+DWLPT++  A  S
Sbjct: 307 LGASGVENNGLMHVTDWLPTIVDLAGGS 334


>gi|291243527|ref|XP_002741646.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 506

 Score =  219 bits (559), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 134/366 (36%), Positives = 191/366 (52%), Gaps = 33/366 (9%)

Query: 40  LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           L  +  +++V V +V S   PHI+ ILA+DLGWNDVG+H  D +  P ++ LA  G+I  
Sbjct: 3   LACSILVALVCVSVVKSDKQPHIVLILAEDLGWNDVGWHNPD-LKMPILNQLAADGVIFN 61

Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
             Y    CTPSRSA+ TG +P     QH +L   E  GL L  K LP+ LK++GY T +V
Sbjct: 62  QSYVQPACTPSRSALFTGYYPFKIKRQHQMLLNLEADGLSLDLKTLPEMLKDVGYLTHLV 121

Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
           GKWHLGF K+EY P  RGF+S  G+ T   D +  + E +   G D R +     +    
Sbjct: 122 GKWHLGFCKEEYLPNKRGFDSFYGWLTLGTDLY--TKENVLAPGYDFRDNTGVVQE-SDT 178

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
           Y   +    AVDI+  H  + PLFL  + A      P + L+ P  Y +++  I+D +  
Sbjct: 179 YLPFMLAERAVDIVMGHYKEYPLFLEFSMAL-----PGKFLEVPQDYEDLYSDIDDDRTR 233

Query: 280 KFAAI-----------------LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
           KF  I                 L  +D S+G +VEAL+ R M  +++ +F SDNG     
Sbjct: 234 KFYGICNLNIVVSKNTKLPNRKLSAMDNSIGLLVEALKTRGMWDDTLFIFTSDNG----A 289

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL---SA 379
           F   + SNWPLRG   TL+EGG R     +  +L+  G V  + +H+ D   T++    A
Sbjct: 290 FATESGSNWPLRGNIATLFEGGTRVPTFFYGSMLKKAGYVNNELIHIVDLHKTIIELAGA 349

Query: 380 ANKSDI 385
             +SDI
Sbjct: 350 TTESDI 355



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 65/123 (52%), Gaps = 16/123 (13%)

Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
           +++IDG+D+W  +S+ EPS R   ++NIDD    + L+ G       ++V  Y++ +TG 
Sbjct: 352 ESDIDGMDMWKTISKGEPSPRTEFVYNIDD----NELSPGA-----AIRVGDYKL-ITGH 401

Query: 632 PDQVYLSGLSDR--EWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNN 689
           PD +Y   L +R  +W       +   + I   P    P  P     LF+I +DP E+NN
Sbjct: 402 PDLMYPFRLLNRSDDWYNYGDEPI---SGILLPPTGNEP-PPANVTYLFNIIDDPEERNN 457

Query: 690 LAD 692
           LAD
Sbjct: 458 LAD 460



 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 37/102 (36%), Positives = 54/102 (52%), Gaps = 8/102 (7%)

Query: 507 QNEIDGIDVWSVLSRNEPSKRNTILHNIDD-EWQISALTR-GKWKLVKEN-SINGNGTSE 563
           +++IDG+D+W  +S+ EPS R   ++NIDD E    A  R G +KL+  +  +       
Sbjct: 352 ESDIDGMDMWKTISKGEPSPRTEFVYNIDDNELSPGAAIRVGDYKLITGHPDLMYPFRLL 411

Query: 564 NRSND--NSYQNEIDGIDVWSVLSRNEPSKRN-TILHNIDDE 602
           NRS+D  N     I GI +    + NEP   N T L NI D+
Sbjct: 412 NRSDDWYNYGDEPISGILLPP--TGNEPPPANVTYLFNIIDD 451


>gi|283778949|ref|YP_003369704.1| sulfatase [Pirellula staleyi DSM 6068]
 gi|283437402|gb|ADB15844.1| sulfatase [Pirellula staleyi DSM 6068]
          Length = 486

 Score =  219 bits (559), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 136/358 (37%), Positives = 195/358 (54%), Gaps = 30/358 (8%)

Query: 38  AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
           AVL +AFT     V   A+   P+I+ I+ADDLGW DVGF+G  +I TPNIDALA  G  
Sbjct: 11  AVLLVAFTA----VASQAADRQPNIVHIVADDLGWKDVGFNGCTEIKTPNIDALAKGGAK 66

Query: 98  LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
              +Y   +CTP+R+ +MTG+ P   G+Q  V+      GL  SE ++PQ L + GY+T 
Sbjct: 67  FSQFYVQNMCTPTRACLMTGRFPYRYGLQTIVIPTAAGYGLDTSEYLMPQCLGDAGYKTA 126

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH 217
           I+GKWHLG   ++Y P  RGF+   G   G  DYF H    +    LD  RD +P   +H
Sbjct: 127 IIGKWHLGHADQKYWPKQRGFDYQYGAMIGELDYFTHDEHGV----LDWFRDNKP---VH 179

Query: 218 GK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
            + Y+T +   +AV  IH     +P +LYL   A H+     P QAP  Y+  + +I + 
Sbjct: 180 EQGYTTTLIGDDAVKYIHGQDGKKPFYLYLTFNAPHT-----PYQAPKEYITKYLNIAEP 234

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA----AGFNLNAA---- 328
            R  +AA++  LDE++GKVV AL+Q+ +  N++I F SDNGG      AG   + +    
Sbjct: 235 TRRTYAAMVDCLDENIGKVVAALDQKGLRENTLIFFHSDNGGTKDKMFAGQMADMSKVVL 294

Query: 329 --SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
              N P R  K +L+EGG R   L  W   ++++ +  +  +H  D  PT  + A  S
Sbjct: 295 PCDNGPYRNGKGSLFEGGSRVCALANWPGKIKAQTV--DGMIHAVDLYPTFAALAGAS 350


>gi|291239589|ref|XP_002739705.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 489

 Score =  219 bits (558), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 127/335 (37%), Positives = 184/335 (54%), Gaps = 13/335 (3%)

Query: 46  LSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ 105
           +++V V +V S   PHI+ ILA+DLGWNDVG+H  D I  P ++ LA  G+I    Y   
Sbjct: 9   VALVCVIVVESGKQPHIVLILAEDLGWNDVGWHNAD-IKMPILNQLAADGVIFNQSYVQP 67

Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
            CTPSR+A++TG +P     QH +L   E  GL L  K LP+ LK++GY T +VGKWHLG
Sbjct: 68  ACTPSRAALLTGYYPFKIQRQHQMLLNLEADGLSLDLKTLPEMLKDVGYLTHLVGKWHLG 127

Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
           F K+EY P  RGF+S  G+ T     +  S E +   G D R +     +    Y   + 
Sbjct: 128 FCKEEYLPNKRGFDSFYGWLTLGTTLY--SKENIIAPGYDFRDNTGVVQE-SDTYLPFML 184

Query: 226 TAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
              AVDI+  H  + PLFL  + A +        L+ P  Y +++  IED ++ KF   L
Sbjct: 185 AERAVDIVMGHYKEYPLFLEFSMALSGKF-----LEVPQEYEDLYSDIEDDRQRKFYGKL 239

Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
             +D S+G +V+AL+ R M  +++ +F SDNG  A+     + SNWPLRG   TL+EG  
Sbjct: 240 SVMDNSIGLLVDALKTRGMWEDTLFIFTSDNGALAS----ESGSNWPLRGNIATLFEGAT 295

Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           R        +L++ G V  + +H+ D   T++  A
Sbjct: 296 RVPTFFHGSMLKNTGYVNNELMHIVDLHKTIIELA 330



 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 64/123 (52%), Gaps = 16/123 (13%)

Query: 572 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLTGG 631
           + +IDG+D+W  +S+ +PS R   ++NIDD    + L+ G       ++V  Y++ +TG 
Sbjct: 335 ECDIDGMDLWKTISKGKPSPRTEFVYNIDD----NELSPGA-----AIRVGDYKL-ITGH 384

Query: 632 PDQVYLSGLSDR--EWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNN 689
           PD +Y   + +R  +W       +   + I   P    P  P     LF+I +DP E+NN
Sbjct: 385 PDLLYPFRMLNRSDDWYNYGDEPI---SGIPIPPTGNEP-PPSNVTYLFNIIDDPEERNN 440

Query: 690 LAD 692
           LA+
Sbjct: 441 LAE 443



 Score = 40.8 bits (94), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 53/102 (51%), Gaps = 8/102 (7%)

Query: 507 QNEIDGIDVWSVLSRNEPSKRNTILHNIDD-EWQISALTR-GKWKLVKEN-SINGNGTSE 563
           + +IDG+D+W  +S+ +PS R   ++NIDD E    A  R G +KL+  +  +       
Sbjct: 335 ECDIDGMDLWKTISKGKPSPRTEFVYNIDDNELSPGAAIRVGDYKLITGHPDLLYPFRML 394

Query: 564 NRSND--NSYQNEIDGIDVWSVLSRNEPSKRN-TILHNIDDE 602
           NRS+D  N     I GI +    + NEP   N T L NI D+
Sbjct: 395 NRSDDWYNYGDEPISGIPIPP--TGNEPPPSNVTYLFNIIDD 434


>gi|390356461|ref|XP_793612.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 181

 Score =  219 bits (558), Expect = 6e-54,   Method: Composition-based stats.
 Identities = 94/159 (59%), Positives = 122/159 (76%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L F L    ++ V ++ PP++IF++ADDLGWNDV FHG  QIPTP+IDALA  G+IL NY
Sbjct: 11  LTFFLLGCMLECVYTAQPPNVIFVVADDLGWNDVSFHGSSQIPTPHIDALAQEGVILTNY 70

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y   +CTP+RSAIMTGKHPIHTG+Q++V+   E  GL  +E I+PQYL+ LGYRT +VGK
Sbjct: 71  YVSPICTPTRSAIMTGKHPIHTGLQYSVIIADEPYGLGTNETIMPQYLRSLGYRTHMVGK 130

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
           WHLGF+K+  TP+ RGFES+ GY+ G QDYF H + E++
Sbjct: 131 WHLGFFKESLTPSHRGFESYYGYYGGMQDYFTHESTEVR 169


>gi|410932865|ref|XP_003979813.1| PREDICTED: arylsulfatase I-like, partial [Takifugu rubripes]
          Length = 316

 Score =  218 bits (556), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 113/263 (42%), Positives = 158/263 (60%), Gaps = 11/263 (4%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LP  +  LPQ L+ELGY T +VGKWHLGFYKKE  PT RGF++
Sbjct: 2   IHTGLQHSIIRPRQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDT 61

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS-T 238
           + G  TG  +Y+ + S +   M G D+      AW   GKYST ++T     I+  H   
Sbjct: 62  YFGSLTGSVNYYTYDSCDGPGMCGFDLHEGESVAWSQKGKYSTHLYTQRVRKILATHDPR 121

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
            +PLF++L+  A H+     PLQ P  Y+  +R +E+  R K+AA++  +DE+V  +   
Sbjct: 122 SQPLFIFLSFQAVHT-----PLQCPREYIYPYRGLENIARRKYAAMVSAVDEAVRNITYG 176

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L +     NSI++F +DNGG      L+  SNWPLRG K T WEGGVRG G I SPLL  
Sbjct: 177 LRKYGYYENSIMIFSTDNGGQP----LSGGSNWPLRGRKGTYWEGGVRGLGFIHSPLLRK 232

Query: 359 RGIVAEQYVHVSDWLPTLLSAAN 381
           +  V++  VH++DW PTL+  A 
Sbjct: 233 KKRVSKALVHITDWYPTLVGLAG 255


>gi|443705042|gb|ELU01787.1| hypothetical protein CAPTEDRAFT_153777 [Capitella teleta]
          Length = 551

 Score =  218 bits (556), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 122/325 (37%), Positives = 182/325 (56%), Gaps = 14/325 (4%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II+I  DD G+NDVGF   + I +PN+DALA SGIIL N YT   C+PSR + M+G++
Sbjct: 26  PNIIWIFGDDYGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFMSGRY 84

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
              + MQH V+   +   L L   ILP YLKELGY T   GKWHLG+ + E TPT RGF+
Sbjct: 85  SYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFD 144

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           S  G ++G  ++++H+      +  D     E  +D  GK+S D+        +  +   
Sbjct: 145 SFSGGFSGEGEFYEHTTATGGYY--DWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEYDQS 202

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE-DFKRSKFAAILHKLDESVGKVVEA 298
            PLF+Y+A    HS     PL     +L ++  ++    R K+  ++  +D  +G +V+ 
Sbjct: 203 SPLFMYVAFHNVHS-----PLDPKPEFLALYDDLDVTDDRKKYLGLVSGMDYIIGGIVDK 257

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           +++  +  N+ I+F SDNGG     ++    N P RG K+TLWEGG +      SPLL +
Sbjct: 258 IKEIGIYENTYILFSSDNGG-----DVGEGDNSPHRGGKSTLWEGGCKANSWTHSPLLGA 312

Query: 359 RGIVAEQYVHVSDWLPTLLSAANKS 383
            G+     +HV+DWLPT++  A  S
Sbjct: 313 TGVENNGLMHVTDWLPTIVDLAGGS 337


>gi|291225021|ref|XP_002732506.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 497

 Score =  218 bits (554), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 133/343 (38%), Positives = 182/343 (53%), Gaps = 19/343 (5%)

Query: 46  LSMVFVDLVASSGP-------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           L ++F+ L   S P       PHII I  DDLGWNDV +H  D I  PN+  LA  G+I 
Sbjct: 6   LIVIFIALAVVSKPCIADAHQPHIILIFVDDLGWNDVHWHNPD-IAMPNLMDLAADGVIF 64

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
              YT   CTPSR+A+MTG +P  TG QH +L+     GLPL  K+LP+ LKE+GY T +
Sbjct: 65  NQSYTHPTCTPSRAAMMTGLYPFKTGNQHQMLFNLHPSGLPLEFKLLPEKLKEIGYSTHM 124

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWT-GHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH 217
           VGKWHLGF K EY PT RGF+SH G WT G  DY   +       G D R +       +
Sbjct: 125 VGKWHLGFCKDEYLPTNRGFDSHYGIWTLGVGDYDKMNGVLSPSKGYDFRDNTGVVQKSN 184

Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
           G Y   +    A  I++ H    PLFL     A     P + L  P+ Y N +  IED +
Sbjct: 185 G-YLALMLGDRAEHIVNTHYPGTPLFL-----AFTLDIPAKHLAIPEEYENKYADIEDSR 238

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
              F   L  +D+ +G VV+AL+ R +  +S+I+F+ DNG  ++     A SN+P RG+ 
Sbjct: 239 TRHFYGKLTMMDDIIGGVVDALKNRGIYDDSLIIFIGDNGALSS----QAGSNYPFRGIA 294

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            TL+EG  R   ++    ++  G  + +   + D   T+L  A
Sbjct: 295 GTLFEGATRVPSIVSGKGIKKTGYESNELYSIVDIHRTILDVA 337



 Score = 41.6 bits (96), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 37/133 (27%), Positives = 61/133 (45%), Gaps = 31/133 (23%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTKGKWKLVKVVKVMRYQVDLTGG 631
           E+DG+ +W   S+ +PS RN I++NIDD+     +A+  G +KL+            TG 
Sbjct: 344 ELDGLSMWPTWSKGKPSPRNEIIYNIDDDPVSPGAAIRIGDYKLI------------TGH 391

Query: 632 PDQVYLS---GLSDREW-----LALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKND 683
           PD +Y     G++D  +     LA  +     AA         +   P     LF++  D
Sbjct: 392 PDLLYPYRDIGITDNYYNYGDPLASGLPTADQAA---------MAPPPLDVEYLFNVIVD 442

Query: 684 PCEKNNLADRSED 696
           P E+N+++    D
Sbjct: 443 PEERNDISADHPD 455


>gi|410642189|ref|ZP_11352707.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
 gi|410648635|ref|ZP_11359039.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
 gi|410131832|dbj|GAC07438.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
 gi|410138506|dbj|GAC10894.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
          Length = 473

 Score =  218 bits (554), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 132/338 (39%), Positives = 203/338 (60%), Gaps = 32/338 (9%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           ++G P+I+FIL DDLG+ DVG+ G  QI TPNID LA  G+ LK+ Y   +C+P+R+A+M
Sbjct: 39  ATGAPNILFILLDDLGYGDVGYLG-SQIQTPNIDNLASQGVTLKHGYAYPICSPTRAALM 97

Query: 116 TGKHPIHTG----MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
           TGK+P++ G    M+++ +       LP     +P+  +E GY+T +VGKWHLG  K+  
Sbjct: 98  TGKNPLNFGIDGPMENDAM-------LPEDLTTMPERFQEAGYQTWMVGKWHLGMAKRSA 150

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
            P  RGF+   G+  G  DY+ H    +   GLD + + + +    G + T++ TA+A+D
Sbjct: 151 MPHNRGFDDFYGFLGGFVDYYTH----VYFGGLDWQNN-DTSLREEG-FVTELLTAKAID 204

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
            I +   D+P F+YL+++A H+     PLQ   + +N +   E   R  FA +   +D+ 
Sbjct: 205 KITHFKGDKPFFMYLSYSAPHT-----PLQYVPNAINDYSGFESKDRQVFADMTSDVDQD 259

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA-ASNWPLRGVKNTLWEGGVRGAGL 350
           +G+V+ AL++R +  N+I+VF+SDNGG     NL A A+N  L+G K++++EGGVR   L
Sbjct: 260 IGEVLAALDKRGIAENTIVVFMSDNGG-----NLEAGANNGSLQGGKSSVYEGGVRVPVL 314

Query: 351 I-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           I W   L+S+ +   Q +   DW PTLL AAN    PN
Sbjct: 315 IRWPAQLQSKTMT--QPMFTQDWAPTLLEAANIDYQPN 350


>gi|443691100|gb|ELT93060.1| hypothetical protein CAPTEDRAFT_21969 [Capitella teleta]
          Length = 529

 Score =  218 bits (554), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 121/325 (37%), Positives = 182/325 (56%), Gaps = 14/325 (4%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II+I  DD G+NDVGF   + I +PN+DALA SGIIL N YT   C+PSR + ++G++
Sbjct: 4   PNIIWIFGDDYGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFLSGRY 62

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
              + MQH V+   +   L L   ILP YLKELGY T   GKWHLG+ + E TPT RGF+
Sbjct: 63  SYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFD 122

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
           S  G ++G  ++++H+      +  D     E  +D  GK+S D+        +  +   
Sbjct: 123 SFSGGFSGEGEFYEHTTATGGYY--DWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEYDQS 180

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE-DFKRSKFAAILHKLDESVGKVVEA 298
            PLF+Y+A    HS     PL     +L ++  ++    R K+  ++  +D  +G +V+ 
Sbjct: 181 SPLFMYVAFHNVHS-----PLDPKPEFLALYDDLDVTDDRKKYLGLVSGMDYIIGGIVDK 235

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           +++  +  N+ I+F SDNGG     ++    N P RG K+TLWEGG +      SPLL +
Sbjct: 236 IKEIGIYENTYILFSSDNGG-----DVGEGDNSPHRGGKSTLWEGGCKANSWTHSPLLGA 290

Query: 359 RGIVAEQYVHVSDWLPTLLSAANKS 383
            G+     +HV+DWLPT++  A  S
Sbjct: 291 TGVENNGLMHVTDWLPTIVDLAGGS 315


>gi|402821074|ref|ZP_10870630.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
           IMCC14465]
 gi|402510105|gb|EJW20378.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
           IMCC14465]
          Length = 526

 Score =  217 bits (553), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 132/359 (36%), Positives = 195/359 (54%), Gaps = 32/359 (8%)

Query: 40  LPLAFTLSMVF--VDLVASSGP-----------PHIIFILADDLGWNDVGFHGLDQIPTP 86
           L ++ +L MVF    L A+              P+I+ ILADDLGW DVG+HG D I TP
Sbjct: 18  LSVSLSLLMVFWIAGLTAAQASEMASNNKSDERPNIVIILADDLGWGDVGYHGSD-IQTP 76

Query: 87  NIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILP 146
           +ID LA  G  L  +Y    C+P+R+A+MTG+ P+  G+ ++VL   E GG+ L E  LP
Sbjct: 77  HIDRLAKEGAKLNRFYATPFCSPTRAALMTGRDPLKLGVAYSVLMPWENGGVSLDEHFLP 136

Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDM 206
           Q  +  GY T +VGKWHLG   +++TP  RGF+   G+      YFDH        G D 
Sbjct: 137 QSFQAAGYNTAMVGKWHLGHTIEQHTPNARGFDLFYGHMHTQVSYFDHQIAN----GHDF 192

Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
           + + +P  D +G+Y+TDV  A+A   I +     +P  LY+   A HS     P++AP+ 
Sbjct: 193 QENGKPV-DHNGEYATDVHGAQAARFITDLRDKTKPFLLYVPFLAPHS-----PMEAPEA 246

Query: 266 YLNIHRHIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
             + +    D     R  +AA++  +D+++GKV+ AL++  +  N+I+ F SDNG    G
Sbjct: 247 LKDKYSSRLDLPGSTRKTYAAMVDSMDQAIGKVLTALDEEGVADNTIVFFFSDNG----G 302

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           F    + N P RG K  ++EGG+R   ++  P   + G   ++ V V D LPTL  AA 
Sbjct: 303 FENYGSDNGPYRGGKLEVYEGGIRVTAVMRWPEKLAAGSEVDEIVSVMDLLPTLTHAAG 361


>gi|313232487|emb|CBY24155.1| unnamed protein product [Oikopleura dioica]
          Length = 481

 Score =  217 bits (552), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 126/328 (38%), Positives = 184/328 (56%), Gaps = 22/328 (6%)

Query: 57  SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
           S  PHIIF++ DDLG++D+G+   D + +PNID LA + + ++NYY    CTPSR+A MT
Sbjct: 21  SKKPHIIFVMIDDLGFDDLGYVN-DDVISPNIDFLAKNALHIENYYNQPSCTPSRAAFMT 79

Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           G++ I  GMQ  V+   E   +PLSE +LPQ  K+ GY T + GKWHLGFY +++ P  R
Sbjct: 80  GRYNIRYGMQSGVIKPDEPEAIPLSETLLPQAFKKCGYNTSMHGKWHLGFYTEKHCPQNR 139

Query: 177 GFESHLGYWTGHQDYFDH-SAEEMKMW---GLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           GF+   G++ G QDYF H S     ++   G +  R      DL+G YST     + +  
Sbjct: 140 GFDRFFGFYLGSQDYFYHDSGNNCYLYEPNGTEKVR-----LDLNGTYSTKAIAEDFIAK 194

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQ-APDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           +  +  + PLF +L+    H      PLQ  P   L +    E ++R   +  +  +D  
Sbjct: 195 LDEYDPETPLFEFLSFQEVHG-----PLQWLPAEVLKMAGGWEGWERKMLSRKIVIVDHF 249

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +G+VV+ L+++    ++I+V  SDNGG         ASNWPLRG K  ++EGG+R    I
Sbjct: 250 IGEVVKKLKEKGFWKDTILVITSDNGGQTR----EGASNWPLRGRKGDVFEGGIRSRAFI 305

Query: 352 WSPLLES--RGIVAEQYVHVSDWLPTLL 377
            SP L    +G   +   HV+DW PTLL
Sbjct: 306 HSPKLPDSLKGSSFQHVFHVTDWFPTLL 333


>gi|300433302|gb|ADK13094.1| arylsulfatase [Dicathais orbita]
          Length = 571

 Score =  217 bits (552), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 126/339 (37%), Positives = 190/339 (56%), Gaps = 33/339 (9%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PP+ ++I+ADDLG+NDV +H   Q+ TPN+  +A +G+IL   Y+   CTPSR++ MTG 
Sbjct: 31  PPNFLYIMADDLGYNDVSWHN-PQVLTPNLGKMAKNGVILTESYSQAACTPSRASYMTGY 89

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +P   G+Q++V+       +PL    LP+ LKE GY + +VGKWHLG  +++ TP  RGF
Sbjct: 90  YPFRIGVQNSVVREGMEDYVPLDVDFLPKRLKEAGYVSHLVGKWHLGHCRRDVTPPGRGF 149

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP-----AWDLHGK----------YSTD 223
           ++ LG   G+ DY+       K+  +    D +P      +D              Y+TD
Sbjct: 150 DTFLGLLNGYNDYY-----TKKIRAIASHEDFDPNAPGTIYDFFSNYTLQPSPETDYTTD 204

Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA-PDHYLNIHRHIEDFKRSKFA 282
           +FT  A+++I   S D P FL     A H   P+ PLQ  P      +  ++D  R  + 
Sbjct: 205 IFTNRAIELIQ-QSKDTPFFL-----ALHYTAPHWPLQKHPGFSDKDYPDVKDLNRRIYL 258

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
           + +  +D  +GKVV+AL+Q  +L N++IVF SDNGG        +A+N PLR  K TL+E
Sbjct: 259 SSITAMDTGIGKVVDALKQHNLLDNTLIVFQSDNGGDTHF----SANNGPLRDRKTTLYE 314

Query: 343 GGVRGAGLIWSP-LLESRGIVAEQYVHVSDWLPTLLSAA 380
           GGV+   + + P LL +    ++   H+SDW  T+LSAA
Sbjct: 315 GGVKVPSMAYGPGLLTNTPRTSDDLFHISDWFTTILSAA 353


>gi|221119831|ref|XP_002168522.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
          Length = 223

 Score =  217 bits (552), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 105/223 (47%), Positives = 148/223 (66%), Gaps = 5/223 (2%)

Query: 125 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY 184
           MQ + ++     G+ L EK LPQYLK +GY+T  +GKWHLGF+ KEYTPT+RGF+S  GY
Sbjct: 1   MQSDTIFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTPTYRGFDSFYGY 60

Query: 185 WTGHQDYFDHSAEEMKMWGLDMRRDLEPA----WDLHGKYSTDVFTAEAVDIIHNHSTDE 240
           + G  DY+DHS      WGLD+  D   +    ++  G YST +++ EA+D I NH++ +
Sbjct: 61  YGGQADYWDHSLASNGWWGLDLHYDTPSSSKNIFNQWGNYSTAMYSMEAIDRIRNHNSTQ 120

Query: 241 PLFLYLAHAATHSANPYE-PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           P+FLYLA+ A HSAN  E PLQAP  +++   HI+   R  +AA+L  +D  +G++   L
Sbjct: 121 PMFLYLAYQAVHSANLREYPLQAPQEWVDKFSHIKHKGRQNYAAMLGVMDHWIGQITSEL 180

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
            ++ ML +SII+F SDNGG A G N N A+N+PLRG K T++E
Sbjct: 181 LKQNMLDDSIIIFTSDNGGPANGLNNNWATNYPLRGTKTTVYE 223


>gi|406832341|ref|ZP_11091935.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 490

 Score =  216 bits (550), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 138/377 (36%), Positives = 195/377 (51%), Gaps = 40/377 (10%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           M F    L   ++ +     +++  P+I+ I+ADDLG  D+G+ G  +I TP+IDALA  
Sbjct: 1   MTFIKWFLPIFVTTLSFGAASAAERPNIVVIVADDLGNADLGYRG-SKIKTPHIDALAKG 59

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           G+ L++YY + LCTP+R+A+MTG++P+  G+Q  V++   R GLP  EK LPQ LKE+GY
Sbjct: 60  GVRLESYYGLPLCTPARAALMTGRYPMRQGLQTLVIFPSHRYGLPTDEKTLPQALKEVGY 119

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW---GLDMRRDLE 211
            T +VGKWHLG   K++ P  RGF+   G   G  DYF      +  W   G  +R D  
Sbjct: 120 HTAMVGKWHLGHADKKFWPQNRGFDHFYGNVVGEVDYFTRERGGVVDWQRNGEFLRED-- 177

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
                   Y  D+   EAV +I  H   +PLFLY A  A H+     P QAP   ++ + 
Sbjct: 178 -------GYYVDLIGTEAVKLIAGHDKAKPLFLYFASLAPHA-----PYQAPKADIDAYN 225

Query: 272 HIEDFKRSK-FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA-- 328
            I D +  + +A ++  LD  VG+VV ALE   M  N+II F +DNGGA +      A  
Sbjct: 226 DIFDNEMHRTYAGMISNLDRQVGRVVAALETNGMRENTIIFFTTDNGGATSALFATGARA 285

Query: 329 ------------------SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
                             SN P  G K +L +GGV    ++  P      +V E   HV 
Sbjct: 286 PEEREASGGVALGSKPPCSNLPFSGGKGSLKDGGVHLPAIVNWPGRLKPSVVNEPLHHV- 344

Query: 371 DWLPTLLSAANKSDIPN 387
           D +PTLL+ A     P+
Sbjct: 345 DLMPTLLALAGGKGDPS 361


>gi|313219878|emb|CBY30794.1| unnamed protein product [Oikopleura dioica]
          Length = 481

 Score =  216 bits (549), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 128/343 (37%), Positives = 187/343 (54%), Gaps = 19/343 (5%)

Query: 42  LAFTLSMVFVDLVAS---SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           L   L  VF +       S  PHIIF++ DDLG++D+G+   D + +PNID LA + + +
Sbjct: 3   LIIFLLFVFRECFGHGRFSKKPHIIFVMIDDLGFDDLGYVN-DDVISPNIDFLAKNALHI 61

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
           +NYY    CTPSR+A MTG++ I  GMQ  V+   E   +PLSE +LPQ LK+ GY T +
Sbjct: 62  ENYYNQPSCTPSRAAFMTGRYNIRYGMQSGVIKPDEPEAIPLSETLLPQALKKCGYNTSM 121

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLH 217
            GKWHLGFY +++ P  RGF+   G++ G QDYF H S     ++    R  +    DL+
Sbjct: 122 HGKWHLGFYTEKHCPQNRGFDRFFGFYLGSQDYFYHDSGNNCYLYEPCYREKVR--LDLN 179

Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE-DF 276
           G YST     + +  +  +  + PLF +L+    H      PLQ    Y  + +  +   
Sbjct: 180 GTYSTKAIAEDFIAKLDEYDPETPLFEFLSFQEVHG-----PLQWLSEYSTLFKEKDWTH 234

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
            R   +  +  +D  +G+VV+ L+++    ++I+V  SDNGG         ASNWPLRG 
Sbjct: 235 YRKMLSRKIVIVDHFIGEVVKKLKEKGFWKDTILVITSDNGGQTR----EGASNWPLRGR 290

Query: 337 KNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLL 377
           K  ++EGG+R    I SP L +  +G       HV+DW PTLL
Sbjct: 291 KGDVFEGGIRSRAFIHSPKLPNSLKGSSFPHVFHVTDWFPTLL 333


>gi|326431091|gb|EGD76661.1| hypothetical protein PTSG_08011 [Salpingoeca sp. ATCC 50818]
          Length = 511

 Score =  214 bits (546), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 126/343 (36%), Positives = 185/343 (53%), Gaps = 38/343 (11%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+F++ DD GWND GF G  ++ TP ID L   GI L  +Y  ++C+P+R+A+MTG++
Sbjct: 14  PHILFVVVDDHGWNDCGFAGT-RVKTPTIDTLRSEGIALNQHYVQKVCSPTRAALMTGRY 72

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P   G+Q     G     L  +E +LPQY+K  GY TR VGKWHLGF + ++TPTFRGF+
Sbjct: 73  PHRYGLQFPFCGGAAMA-LNSNETLLPQYMKSAGYTTRAVGKWHLGFTEWQFTPTFRGFD 131

Query: 180 SHLGYWTGHQDYFDHS-----AEEMKMWGLDMRRDLEP---------AWDLHGK----YS 221
           S  G+++  +DYF H      +    +  LD   D  P         A++  G     YS
Sbjct: 132 SFYGFYSCAEDYFFHGLGFKNSSGAHVKSLDFHDDARPSCGADCSKAAFEAVGTDWQHYS 191

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDFKRSK 280
           T +F    VDI+  H   +PLFLY A   TH+     P + P  Y +  +  + D  R K
Sbjct: 192 TTLFAGRIVDIVDGHDPSQPLFLYFASQDTHN-----PGEVPQCYKDRYNTTVPDQVRRK 246

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG---AAAGFNLN------AASNW 331
            AA L  +D ++  + EA ++  M  N +IV+ +DNGG     +G N         +SN+
Sbjct: 247 LAAKLTTVDSALANITEAYKRNGMYDNLLIVYTADNGGPITVQSGPNKGKQEDSIGSSNY 306

Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY---VHVSD 371
           PLRG K+  +EGGVR    +  P L++   +  +Y   +H+ D
Sbjct: 307 PLRGGKHNAYEGGVRSTAFLSGPALKTLARLVRKYQYKLHIGD 349


>gi|431796835|ref|YP_007223739.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
 gi|430787600|gb|AGA77729.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
          Length = 470

 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 135/362 (37%), Positives = 203/362 (56%), Gaps = 24/362 (6%)

Query: 30  YRTRIMAFAVLPLAFTLSMVFVDLVASSGP---PHIIFILADDLGWNDVGFHGLDQIPTP 86
           Y+     F    L F+LS   +   ++SG    P+ + I+ADDLG+ D+ F G  Q+ TP
Sbjct: 2   YKIGCFLFLQSLLIFSLSAQIISTPSTSGRNSHPNFVLIVADDLGYGDLSFTGSTQVKTP 61

Query: 87  NIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERG------GLP 139
           +ID LA SG+   + Y +  +C+PSR+ ++TG++ +  G  +N L   + G      GLP
Sbjct: 62  HIDELAASGVFFPEGYVSSAVCSPSRAGLLTGRNQVSFGYDNN-LANSQPGFDPAFLGLP 120

Query: 140 LSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM 199
           ++ K +  +LK+LGY T +VGKWHLG Y+ +++P  RGF+   GY  G  DYF+ S E  
Sbjct: 121 VNVKTVGDHLKKLGYVTGLVGKWHLG-YEDQFSPLNRGFDEFWGYLGGGHDYFEAS-EAK 178

Query: 200 KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
           + +   ++ + +   ++   Y TD    E ++ I  H  DEP FLY +  A H+     P
Sbjct: 179 RGYKAKIKCNYKTPQEI--TYITDDKGDECINFIQRHK-DEPFFLYASFNAPHT-----P 230

Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           +QA    L I++HIED KR  +AA++H+LD +VG++VEA+E+  +L N+I VF+SDNGG 
Sbjct: 231 MQATAEDLAIYQHIEDRKRRTYAAMVHRLDINVGRIVEAVEEADLLENTIFVFISDNGGP 290

Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
                 N + N P  G K TL EGG+R    I  P     G V E  V   D  PT ++ 
Sbjct: 291 VM---TNGSINAPYNGKKGTLLEGGIRVPFFISWPGHLEPGKVFEHPVSSLDLTPTFVAL 347

Query: 380 AN 381
           A 
Sbjct: 348 AG 349


>gi|291225029|ref|XP_002732496.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 497

 Score =  213 bits (543), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 125/342 (36%), Positives = 184/342 (53%), Gaps = 12/342 (3%)

Query: 40  LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           + +   L++V    +A +  PH+I I  DD+GWNDV +H  D I  PN+  LA  G+I  
Sbjct: 7   IAVFLALAVVSKPSIAEADQPHVILIFVDDMGWNDVHWHNPD-IAMPNLMDLAADGVIFN 65

Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
             YT   CTPSR+A+MTG +P  TG QH +++     G+PL  K+LP+ LKE+GY T +V
Sbjct: 66  QSYTHPTCTPSRAAMMTGLYPFKTGNQHQMVFNLHPSGVPLEFKLLPEKLKEVGYSTHMV 125

Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWT-GHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
           GKWHLGF K EY PT RGF+SH G WT G  DY   +       G D R ++       G
Sbjct: 126 GKWHLGFCKDEYLPTNRGFDSHYGLWTLGVGDYDKMNGVLSPSAGYDFRDNMGVVPKSDG 185

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
            Y   +    A  I++ H    PLFL     A     P + L+ P+ Y   +  IED + 
Sbjct: 186 -YLALMLGDRAEHIVNTHYPGTPLFL-----AFTLDIPAKHLEIPEEYEEKYSDIEDDRT 239

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
            +F   L  +D+ +G VV+AL+ R M  +S+I+F+ DNG  ++     + SN+P RG+  
Sbjct: 240 RQFYGKLTMMDDIIGGVVDALKNRGMYDDSLIIFIGDNGALSS----QSGSNYPFRGIAG 295

Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           T++EG  R   ++    ++  G  + +   + D   T+L  A
Sbjct: 296 TMFEGATRVPSIVSGKGIKKTGYESNELYSIVDIHRTILDVA 337



 Score = 40.8 bits (94), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 36/65 (55%), Gaps = 14/65 (21%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTKGKWKLVKVVKVMRYQVDLTGG 631
           E+DG+ +W   S ++PS RN IL+NIDD+     +A+  G +KL+            TG 
Sbjct: 344 ELDGLSMWPTWSEDKPSPRNEILYNIDDDPVSPGAAIRIGDYKLI------------TGH 391

Query: 632 PDQVY 636
           PD +Y
Sbjct: 392 PDLLY 396



 Score = 39.3 bits (90), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 30/45 (66%), Gaps = 2/45 (4%)

Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTRGKWKLV 551
           E+DG+ +W   S ++PS RN IL+NIDD+     +A+  G +KL+
Sbjct: 344 ELDGLSMWPTWSEDKPSPRNEILYNIDDDPVSPGAAIRIGDYKLI 388


>gi|115533418|ref|NP_001041232.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
 gi|351060348|emb|CCD68016.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
          Length = 452

 Score =  212 bits (540), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 125/334 (37%), Positives = 186/334 (55%), Gaps = 35/334 (10%)

Query: 66  LADDLGWNDVGFHGLDQIPTPNIDALAY--SGIILKNYYTVQLCTPSRSAIMTGKHPIHT 123
           +ADDLG++DV +     + TPN+  LA+  +  +L N Y  QLCTP+RSA MTG +P   
Sbjct: 1   MADDLGFSDVDWKD-STLHTPNLRHLAFHKNTALLSNSYVNQLCTPTRSAFMTGYYPFRV 59

Query: 124 GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG 183
           G Q+ V    E  G+P     L + +++L Y T +VGKWHLG+ KKE+ PT RGF+   G
Sbjct: 60  GTQNGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYG 119

Query: 184 YWTGHQDYFDHSAE----EMK--MWGLDMRRDLE-----PAWDLHGKYSTDVFTAEAVDI 232
           ++     YF+HSA+    E+K  + GLD+  ++      P +  +G YSTD+FT  A+ +
Sbjct: 120 FYGPQTGYFNHSADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSV 179

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF--------AAI 284
           + NH+  +P F++L++ A H      PLQ         + I   K + F          +
Sbjct: 180 LDNHNNSKPFFMFLSYQAVH-----PPLQVSQQ----SKTIGQGKEATFILRSHAHSTRM 230

Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
           L  +D ++G++VE L+   +  N++IVF SDNGG A   N   ASN PLRG K+T+WEGG
Sbjct: 231 LTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGGTA---NF-GASNAPLRGEKDTIWEGG 286

Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
            +    + SP+    G   +   HV DW  T+LS
Sbjct: 287 TKTTTFVHSPMYIEEGGTRDMMFHVVDWHATILS 320


>gi|405964717|gb|EKC30169.1| Arylsulfatase B [Crassostrea gigas]
          Length = 898

 Score =  212 bits (539), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 100/222 (45%), Positives = 148/222 (66%), Gaps = 10/222 (4%)

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
           VGKWH+GFYKKEY PT RGF+S+ GY TG +DY+ H     KM G+D+R +  PA +  G
Sbjct: 526 VGKWHVGFYKKEYLPTSRGFDSYFGYLTGSEDYYTHHRCYGKMCGMDLRENAVPA-NYSG 584

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
            YST +F  +A DI+  H +D+PLFLYL   + HS     PLQ P+ Y+  + HI+D KR
Sbjct: 585 IYSTYLFAQKAADIVQKHDSDQPLFLYLPFQSVHS-----PLQVPEKYIQPYLHIKDKKR 639

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
            K+A ++  +DE++G + E  +Q+ + +N+++VF +DNGG      L   +N+PLRG K 
Sbjct: 640 RKYAGMVAAMDEAIGNLTEVFKQKGIWNNTLMVFSTDNGGQI----LEGGNNYPLRGWKA 695

Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           +LWEGG+ G G +   +L+ +G+V++  +HV+DW PTL+S A
Sbjct: 696 SLWEGGMHGVGFVHGQMLKRKGMVSKALMHVTDWFPTLVSLA 737



 Score = 82.0 bits (201), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 56/82 (68%), Gaps = 3/82 (3%)

Query: 42  LAFTLSMVF--VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           L F + + F    +   S  PHI+F+LADD G+ND+G+HG  +I TPN+D LA  GI L+
Sbjct: 7   LTFLIVLFFNTFPVQGKSSQPHILFVLADDFGYNDIGYHG-SEIKTPNLDKLAGDGIRLE 65

Query: 100 NYYTVQLCTPSRSAIMTGKHPI 121
           NYY   +CTP+RS +++G++ +
Sbjct: 66  NYYVQPICTPTRSQLLSGRYQV 87


>gi|325109241|ref|YP_004270309.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969509|gb|ADY60287.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 485

 Score =  211 bits (537), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 140/376 (37%), Positives = 200/376 (53%), Gaps = 51/376 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK-NYYTVQLCTPSRSAIMTGK 118
           P+I+ +LADDLG+ ++G  G  QIPTP+ID+LA +G+  +  Y T   C+PSR+ ++TG+
Sbjct: 42  PNIVLLLADDLGYGELGCQGNPQIPTPHIDSLAANGVRFRCGYVTAAYCSPSRAGLLTGR 101

Query: 119 HPIHTGMQHNVLYGCERG---GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           +    G + N           GLPL EK L + L + GY T +VGKWHLG   + + P  
Sbjct: 102 YQSRFGYEQNPTGARNEDPELGLPLEEKTLARRLHDAGYATGLVGKWHLGGTAR-FHPLR 160

Query: 176 RGFESHLGY-----------WTGHQDYFDHSA----------EEMKMWGLDMRRDLEPAW 214
           RGF+   G+           + G   +    A          +E  M    M  D EPA+
Sbjct: 161 RGFDEFFGFLHEGHFFVPPPYEGVSTFLRRRALPNGKTGRWGDERLMLSTHMGHD-EPAY 219

Query: 215 D-----LHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA 262
           D     L G        Y TD FT EA D I  +  D P FLYLA+ A HS     PLQ 
Sbjct: 220 DANNPILRGGQPVEEAAYLTDAFTREACDFIARNQ-DRPFFLYLAYNAVHS-----PLQG 273

Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
            D Y+    HI D +R  FAA+L  LD+SVG+V++ LEQ ++ SN+I++F+SDNGG    
Sbjct: 274 ADAYMQQFAHIADQQRRIFAAMLANLDDSVGEVLQTLEQHQLTSNTIVIFLSDNGGPTRE 333

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
                +SN PLRG K  ++EGG+R   L+  P     G V E+ +   D  PT L+AA +
Sbjct: 334 LT---SSNAPLRGEKGDMYEGGIRVPVLMQWPGQIPAGRVEERPIISLDLFPTCLAAA-E 389

Query: 383 SDIPNYVNSTVENIIP 398
           +++P  ++    N++P
Sbjct: 390 AELPTELDGV--NLLP 403


>gi|340619607|ref|YP_004738060.1| sulfatase [Zobellia galactanivorans]
 gi|339734404|emb|CAZ97781.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
          Length = 463

 Score =  209 bits (531), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 135/344 (39%), Positives = 183/344 (53%), Gaps = 21/344 (6%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           LA  L   F  L A    P+II +L DD GW DVGF+G   IPTPN+D LA  GI+  N 
Sbjct: 9   LALALMTAFSSL-AQEKKPNIIVVLTDDQGWADVGFNGATDIPTPNLDRLASEGIVFDNA 67

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERG---GLPLSEKILPQYLKELGYRTR 157
           Y     C+PSR+ ++TG++    G   N+ Y  E     G PLSEK++P+ LKE GYRT 
Sbjct: 68  YVSHPYCSPSRAGLLTGRYQARFGHDCNMPYDSENDDTVGTPLSEKMIPEALKEHGYRTS 127

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH 217
            +GKWHLG +   + P  +GF+   G+  G  +Y+      +K     + R+ EP     
Sbjct: 128 AIGKWHLGDHPSLH-PIHQGFDHWFGFAGGGMNYWGIPDGPIKT----IVRNGEPVPQNE 182

Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
            +Y TD FT EA+D I     D+P F+YLA+ A     P+ P QA   YL   +HIE   
Sbjct: 183 LRYLTDDFTDEAIDFI-TKKDDKPFFMYLAYNA-----PHAPDQATKQYLEQTKHIEYGG 236

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           RS +AA+++ +D +VG++   L    M  N+I+VF+SDNGG         A N P RG K
Sbjct: 237 RSIYAAMVNAVDANVGRIDSTLVANGMKENTILVFLSDNGGRG-----EHADNRPYRGHK 291

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             L+EGG++    I  P     G   E+ V   D  PT L AA 
Sbjct: 292 GMLFEGGIKVPFFITWPKKLKGGQHYEKMVSSLDLFPTFLEAAG 335


>gi|260819479|ref|XP_002605064.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
 gi|229290394|gb|EEN61074.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
          Length = 485

 Score =  209 bits (531), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 113/324 (34%), Positives = 164/324 (50%), Gaps = 56/324 (17%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+FI+ADDLGWNDVG+H  D + TP +D LA  G+IL   Y   +CTPSR+A MTG  
Sbjct: 28  PNILFIVADDLGWNDVGWHNPD-VKTPVLDQLANEGVILNQCYVNYVCTPSRTAFMTGYF 86

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P H G QH V +  +  G+P     LP+ LK+LGY T +VG                   
Sbjct: 87  PYHVGSQHTVFFPQQAQGIPSKFPFLPEKLKQLGYATHMVG------------------- 127

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
                                      + D +      G+Y T  FT   VDI+  H  D
Sbjct: 128 ---------------------------KDDKDVVKTRSGQYGTYFFTDRVVDIVEKHPAD 160

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLF+YL +   H     +PLQ P  + +++  ++D  R K   ++  LDE+VG V  A+
Sbjct: 161 TPLFMYLPYQNVH-----QPLQVPQRFEDMYASVQDENRRKLLGMVSALDEAVGNVTMAM 215

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++  +  N++++F +DNGG  +     + +N+PLRG K TLWEGG RG       +L+  
Sbjct: 216 KKAGLWDNTLVIFTTDNGGWISA----SGNNYPLRGGKVTLWEGGTRGVAFAHGKMLQKT 271

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
           G    + +H  DW PT+L+AA  +
Sbjct: 272 GYTNNEMIHAVDWFPTILAAAGGT 295


>gi|241601772|ref|XP_002405049.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215500554|gb|EEC10048.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 451

 Score =  208 bits (530), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 115/269 (42%), Positives = 158/269 (58%), Gaps = 5/269 (1%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           I +GMQ  V+      GLPL  KI+PQ+ K+LGY+  ++GKWHLGF++  YTP  RGF+S
Sbjct: 1   ILSGMQAEVIRDAAPWGLPLELKIMPQHFKDLGYQVNMIGKWHLGFFQTPYTPPKRGFDS 60

Query: 181 HLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF-TAEAVDIIHN-HST 238
             GY+TG  DY++H+A  +    L      +    +  KYST  +   EA  ++ N HS 
Sbjct: 61  FFGYYTGENDYYNHTAGTLNRPPLYGTFKTKQTRCVLAKYSTHTWGLLEAFPLLLNLHS- 119

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
            +P F Y ++ A H+A    P QAP+  +    +I +  R+ +A +L  LDES+G+VVE 
Sbjct: 120 -QPFFCYFSYQAVHAALMNVPFQAPERNVLKFPYIGETNRTLYAGMLDSLDESIGRVVEI 178

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L    ML ++II F SDNGG       N   NWPLRG K TLWEGGVR    +WS     
Sbjct: 179 LGNAGMLEDTIIAFSSDNGGLPYNAFSNRGFNWPLRGAKGTLWEGGVRVPAFVWSSKFLK 238

Query: 359 RGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           +  V++Q +H+SDWLPTL SAA   D+ N
Sbjct: 239 KPRVSDQLMHISDWLPTLYSAAG-GDVSN 266


>gi|442749327|gb|JAA66823.1| Putative arylsulfatase b precursor [Ixodes ricinus]
          Length = 274

 Score =  208 bits (530), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 103/227 (45%), Positives = 149/227 (65%)

Query: 57  SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
           S  P+IIFILADDLGW+DV FHG  QIPTPN+DALA  GIIL  +Y   LCTPSR+A+MT
Sbjct: 45  SSMPNIIFILADDLGWDDVSFHGSPQIPTPNMDALAADGIILNQHYAQALCTPSRAALMT 104

Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           G++PI+TGMQH V+   E  GLPL  +++P++  +LGY+T +VGKWHLG +KK++ P  R
Sbjct: 105 GRYPIYTGMQHFVIQPGEPWGLPLEYRLMPEFFSDLGYKTHMVGKWHLGSFKKDFIPVRR 164

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           GF+S  G++   QDY++ +  E +  G D   + +     + +YST  +T  AV +I +H
Sbjct: 165 GFDSFYGFYNADQDYYNKTLTEGEHTGYDFWLNEDIHIYPNNRYSTHHYTERAVSLIRSH 224

Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
           +  +PLFLYL++           L+AP+  +N   +I +  R+ +A 
Sbjct: 225 NPSQPLFLYLSYXXXXVGTGPSLLEAPEENVNKFLYIPEKNRTIYAG 271


>gi|291230930|ref|XP_002735418.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
           partial [Saccoglossus kowalevskii]
          Length = 480

 Score =  207 bits (527), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 125/322 (38%), Positives = 176/322 (54%), Gaps = 12/322 (3%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHII I  DD+GWNDV +H  D I  PN+  LA  G+I    YT   CTPSR+A+MTG +
Sbjct: 10  PHIILIFVDDMGWNDVHWHNPD-IAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLY 68

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P  TG QH +++     G+PL  K+LP+  KE+GY T +VGKWHLGF K EY PT RGF+
Sbjct: 69  PFKTGNQHQMVFNLHPSGVPLEFKLLPEKFKEVGYSTHMVGKWHLGFCKDEYLPTNRGFD 128

Query: 180 SHLGYWT-GHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           SH G WT G  DY   +       G D R ++      +G Y   +    A  I++NH  
Sbjct: 129 SHYGIWTLGVGDYDKMNGVLSPSAGYDFRDNMGVVPKSNG-YLALMLGDRAEHIVNNHYP 187

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
             PLFL     A     P + L+ P+ Y   +  IED +  +F   L  +D+ +G VV+A
Sbjct: 188 GTPLFL-----AFTLDIPAKHLEIPEEYEEPYADIEDDRTRQFYGKLTMMDDIIGGVVDA 242

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L+ R M  +S+I+F+ DNG  ++     A SN+P RG+  TL+EG  R   ++    ++ 
Sbjct: 243 LKNRGMYDDSLIIFIGDNGALSS----QAGSNYPFRGIAGTLFEGATRVPSIVSGKGIKK 298

Query: 359 RGIVAEQYVHVSDWLPTLLSAA 380
            G  + +   + D   T+L  A
Sbjct: 299 TGYESNELYSIVDIHRTILDVA 320



 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 61/128 (47%), Gaps = 22/128 (17%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTKGKWKLVKVVKVMRYQVDLTGG 631
           E+DG+ +W   S ++PS RN IL+NIDD+     +A+  G +KL+            TG 
Sbjct: 327 ELDGLSMWPTWSEDKPSPRNEILYNIDDDPVSPGAAIRIGDYKLI------------TGH 374

Query: 632 PDQVYLS---GLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKN 688
           PD +Y      L+D  +          A+ +     +     PQ    LF++  DP E+N
Sbjct: 375 PDLLYPYRDISLTDNYY----NYGDPPASGLPSTDQEANAPAPQNVEYLFNVVVDPEERN 430

Query: 689 NL-ADRSE 695
           ++ ADR +
Sbjct: 431 DISADRPD 438



 Score = 39.3 bits (90), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 30/45 (66%), Gaps = 2/45 (4%)

Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTRGKWKLV 551
           E+DG+ +W   S ++PS RN IL+NIDD+     +A+  G +KL+
Sbjct: 327 ELDGLSMWPTWSEDKPSPRNEILYNIDDDPVSPGAAIRIGDYKLI 371


>gi|117558328|gb|AAI27451.1| Arsb protein [Rattus norvegicus]
          Length = 523

 Score =  207 bits (527), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 102/236 (43%), Positives = 149/236 (63%), Gaps = 15/236 (6%)

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM------KMWGLDMR 207
           Y T +VGKWHLG Y+KE  PT RGF+++ GY  G +DY+ H A             LD+R
Sbjct: 128 YATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLR 187

Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
              EPA +    YST++FT  A  +I NH  ++PLFLYLA  + H     +PLQ P+ Y+
Sbjct: 188 DGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYM 242

Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
             +  I+D  R  +A ++  LDE+VG V +AL+ R + +N++++F +DNGG       + 
Sbjct: 243 EPYDFIQDKHRRIYAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQTR----SG 298

Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            +NWPLRG K TLWEGG+RGAG + SPLL+ +G+ + + +H++DWLPTL++ A  S
Sbjct: 299 GNNWPLRGRKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAGGS 354


>gi|1030057|emb|CAA63067.1| murine arylsulfatase B [Mus musculus]
          Length = 252

 Score =  207 bits (526), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 102/237 (43%), Positives = 149/237 (62%), Gaps = 15/237 (6%)

Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDM 206
           GY T +VGKWHLG Y+KE  PT RGF+++ GY  G +DY+ H A    E +      LD+
Sbjct: 2   GYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDL 61

Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
           R   EPA + +  YST++FT  A  +I NH  ++PLFLYLA  + H     +PLQ P+ Y
Sbjct: 62  RDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEY 116

Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN 326
           +  +  I+D  R  +A ++  +DE+VG V +AL+   + +N++ +F +DNGG       +
Sbjct: 117 MEPYGFIQDKHRRIYAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----S 172

Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
             +NWPLRG K TLWEGG+RG G + SPLL+ +G+ + + +H+SDWLPTL+  A  S
Sbjct: 173 GGNNWPLRGRKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGS 229


>gi|323453557|gb|EGB09428.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 1605

 Score =  206 bits (525), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 127/351 (36%), Positives = 191/351 (54%), Gaps = 27/351 (7%)

Query: 57  SGPPHIIFILADDLGWNDVGFHGLDQIP-TPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           S  PH++F+L DD+G NDVG+   D +  TP ID LA  G+ LK YY++ +CTP+R+A+M
Sbjct: 89  SAKPHVLFVLVDDMGSNDVGYQSHDMVGVTPFIDGLAEQGVRLKEYYSMHMCTPARAALM 148

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           +G +P+  GMQ   +      G+P     + + LK LGY T  VGKW LG  +  + P  
Sbjct: 149 SGHYPMRIGMQLENIKPDSPWGMPRELTTMAETLKNLGYNTHGVGKWGLGHSQHGFLPVN 208

Query: 176 RGFESHLGYWTGHQDYFDH-----------SAEEMKMWG--LDMRR----DLEPAWDLHG 218
           RGF++  GY +   DY+ H            A  M  +   + M R    DL+   DL+G
Sbjct: 209 RGFDTWYGYLSDEIDYYSHEYPAPFETVEDGATVMASFTDYVFMERSRPYDLQYMPDLNG 268

Query: 219 KYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQA--PDHYLNIHRHIED 275
            +S++++T     I+ + +++ EPLF+Y A   TH      P  A     +  I   I+ 
Sbjct: 269 THSSELYTQRVQQIVKSANASREPLFVYYASQMTHGPLDVPPRSAFSDAEWARIGADIKG 328

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
             R KFA++L  LD+SVG+VVE +++  +  +++ VF SDNGG  A       +N+PLRG
Sbjct: 329 TVRRKFASLLLALDQSVGRVVEEMKRSDVWRDTLFVFASDNGGCWA----QGGTNYPLRG 384

Query: 336 VKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKSD 384
            K+ LWEGGV+    ++S L+ S  R        HV+DW PTL+     S+
Sbjct: 385 GKHFLWEGGVKVPAFVYSELIPSSYRNRAHRGLFHVTDWYPTLVKLVASSN 435


>gi|443734654|gb|ELU18562.1| hypothetical protein CAPTEDRAFT_195389, partial [Capitella teleta]
          Length = 330

 Score =  206 bits (524), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 117/318 (36%), Positives = 171/318 (53%), Gaps = 36/318 (11%)

Query: 60  PHIIFILADDLG------------------WNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           P+ +FILADDLG                  +ND+G    D I TPN+DALA  G+IL N 
Sbjct: 21  PNFVFILADDLGGMLASKLKNFQACSFSVGYNDLGLRDPDVI-TPNMDALASKGVILTNN 79

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y   +C+PSR A+MTG++P  + MQ  V+   E     L  K LPQYLK+LGY+  ++GK
Sbjct: 80  YVQAVCSPSRHALMTGRYPSASAMQSIVIQPMEAKCSGLKYKFLPQYLKDLGYKNHMIGK 139

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           WHLG+ ++E  PT RGF++  G +    DY++H    M  W  +   D E      G ++
Sbjct: 140 WHLGYCREECLPTSRGFDTFYGLYASSGDYWEHGIMGMYDWHTEAGVDFEA----RGTHA 195

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH--IEDFKRS 279
            D+       +   H   +PLFLY A   +H+     PLQ  D ++N++      D +R 
Sbjct: 196 QDLEIERLDKVFDEHDNKDPLFLYFAPQNSHT-----PLQPTDEFINLYPEDKFSDIRR- 249

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
           K+  +   LD  VGK+V+ L +  M++N+ ++FVSDNG            N P RG K +
Sbjct: 250 KYLGLTSGLDAMVGKIVDKLTENDMMNNTYVIFVSDNGADPI-----EGRNAPFRGGKGS 304

Query: 340 LWEGGVRGAGLIWSPLLE 357
           L+EGG + +  I+SPLL+
Sbjct: 305 LFEGGTKSSSFIYSPLLK 322


>gi|291513548|emb|CBK62758.1| Arylsulfatase A and related enzymes [Alistipes shahii WAL 8301]
          Length = 467

 Score =  205 bits (522), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 129/349 (36%), Positives = 188/349 (53%), Gaps = 27/349 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+FI+ADDLGW DVG+HG   IPTPNIDALA  GI +  +YT  + +P+R+ +MTG++
Sbjct: 32  PNILFIVADDLGWGDVGYHG-SVIPTPNIDALAARGIEMNRFYTAPVSSPTRAGLMTGRY 90

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR-IVGKWHLGFYKKEYTPTFRGF 178
           P   G++  V+      GL   E+ L   L   GY  R IVGKWHLG  ++ Y P  RGF
Sbjct: 91  PSRFGIRKTVIPPWRDYGLDPEEQTLADMLAANGYAHRAIVGKWHLGHGRRAYYPLNRGF 150

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
               G   G  DYF H  E      LD   D E   D    YSTD+   EAV  I  +++
Sbjct: 151 THFYGCLNGALDYFTHEREGE----LDWHNDWESCRDE--GYSTDLIADEAVRCIGGYAS 204

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK----RSK----FAAILHKLDE 290
           + P FLY+A  A H+     P QAP+  +  H   E F     R K    + A++ ++D+
Sbjct: 205 EGPFFLYVAFNAPHT-----PFQAPEDEIAEHISPEKFAALTPREKDAYTYRAMVTRMDK 259

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW-PLRGVKNTLWEGGVRGAG 349
            VG+++E LE+  ++ N++++F+SDNGG     NL   S+  PLRG K   W+GG+R   
Sbjct: 260 GVGRILEKLERSGLVENTLVLFMSDNGGIP---NLPGGSSCAPLRGHKGREWDGGLRVPA 316

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
           ++      + G+ +EQ     D  PT+ +     + P   +    N++P
Sbjct: 317 VLCRKGRFAEGVRSEQLAAFVDMAPTIAALVGAGEPPRPYDGV--NLLP 363


>gi|443706557|gb|ELU02545.1| hypothetical protein CAPTEDRAFT_109345 [Capitella teleta]
          Length = 370

 Score =  205 bits (521), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 115/315 (36%), Positives = 174/315 (55%), Gaps = 15/315 (4%)

Query: 71  GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
           G++D+GF   D I TPNIDALA  G+I  N Y   +CTPSR A++TG++P  + MQ+ V+
Sbjct: 10  GYHDLGFRNPDVI-TPNIDALATEGVIFTNNYVQSVCTPSRHALLTGRYPHRSAMQNLVI 68

Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
              +     L  K LP+YLK+LGY T  VGKWH+G+ ++E  PT RGF+S  G + G   
Sbjct: 69  MSNQARCTGLGYKFLPEYLKDLGYSTHAVGKWHVGYCREECLPTHRGFDSFFGLYDGDGY 128

Query: 191 YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAA 250
           Y++H++  +     D         +  G +S D+       I+   +  EP FLY +   
Sbjct: 129 YWNHTSTVIP-GAFDWNNSTGVYLEARGIHSEDLGAERLTAILDGQNAKEPFFLYFS--- 184

Query: 251 THSANPYEPLQAPDHYLNIH--RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNS 308
               NP+ P Q    +LN++      D +R K+  ++  LD  VG +V+ L++  +++N+
Sbjct: 185 --PQNPHTPSQPQAEFLNLYPENRYPDIRR-KYLGLVSGLDRMVGGIVDGLKRNGLMNNT 241

Query: 309 IIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVH 368
            I+FVSDNG      +     N P RG K +L+EGG + A  I+SPLL   G      +H
Sbjct: 242 YIIFVSDNGA-----DPEEGLNDPFRGGKGSLFEGGTKSASFIYSPLLNKAGYENNGLMH 296

Query: 369 VSDWLPTLLSAANKS 383
           ++DW+PT+L  A  S
Sbjct: 297 ITDWMPTILKLAGGS 311


>gi|149197407|ref|ZP_01874458.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149139425|gb|EDM27827.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 454

 Score =  204 bits (520), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 130/352 (36%), Positives = 183/352 (51%), Gaps = 36/352 (10%)

Query: 44  FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK-NYY 102
           F   +    L+ ++  P+I+ ILADDLG+ DVG+HGL++IPTPNID +A  G+     Y 
Sbjct: 4   FITLLFSCSLLWATDKPNILIILADDLGYADVGYHGLEEIPTPNIDRIANEGVQFSAGYS 63

Query: 103 TVQLCTPSRSAIMTGKHPIHTGMQ----------HNVLYGCERGGLPLSEKILPQYLKEL 152
              +C P+R+A+M+G +    G +          H V+      G+P   K L QY +E 
Sbjct: 64  NGSICGPTRAALMSGVYQQRIGCEGICGGRKLNEHVVV------GMPREVKTLAQYFQEA 117

Query: 153 GYRTRIVGKWHLG---FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD 209
           GY T + GKWHLG    + K   PT RGF+   G   G   Y D    E K     +R+D
Sbjct: 118 GYATGLFGKWHLGGERLFDKTLMPTSRGFDEFFGILEGASLYDDTVNRERKY----IRQD 173

Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
                D  G+Y TD    EAV  I     D+P FLYL   A H+     P+QA + Y+  
Sbjct: 174 T--VIDYEGEYFTDAIGREAVSFI-TRKGDKPFFLYLPFTAVHA-----PMQASEKYMQR 225

Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
             HI D  R  FAA+L  +D+++G+V +ALE + +L N++IVF SDNGG     + N + 
Sbjct: 226 FAHIADPNRRVFAAMLSAMDDNIGRVFDALEHQGILDNTLIVFWSDNGGKP---DNNYSL 282

Query: 330 NWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           N PL+G K   +EGG+R    + W       G   +Q V + D  P+ L AA
Sbjct: 283 NHPLKGQKTQFYEGGIRVPACVRWPKGQIPAGKTLDQPVFLMDIFPSALEAA 334


>gi|311748319|ref|ZP_07722104.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
 gi|126576822|gb|EAZ81070.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
          Length = 472

 Score =  204 bits (520), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 126/344 (36%), Positives = 199/344 (57%), Gaps = 22/344 (6%)

Query: 62  IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGKHP 120
           ++ I+ADDLG+ D+GF G  QI TP++D LA +G+   + Y +  +C+PSR+  +TG + 
Sbjct: 37  LVLIVADDLGYGDLGFTGSTQIKTPHLDQLATNGVTFTQGYVSSAVCSPSRAGFITGINQ 96

Query: 121 IHTGMQHNVLYGCERG------GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +  G  +N L G E G      G+PLS+K +  +L +LGY   ++GKWHLG  + ++ P 
Sbjct: 97  VEFGHDNN-LAGVEPGFDIAYNGMPLSQKTIADHLNKLGYVNGLIGKWHLG-KEPQFHPL 154

Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
            RGF+   GY  G  DYF+ S    K +   +  + +    +   Y TD    E+VD I 
Sbjct: 155 KRGFDEFWGYTGGGHDYFE-SLPNGKGYKEPLESNFKTPDPI--TYITDDVGNESVDFIE 211

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
            H  DEP FL+ A  A H+     P+QA +  L +++HIED KR  +AA++H+LD +VGK
Sbjct: 212 RHK-DEPFFLFAAFNAPHT-----PMQALEEDLALYQHIEDKKRRTYAAMVHRLDLNVGK 265

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
           ++ +LE++ +  N+++VF SDNGG     + NA+ N P RG K  L EGG+    ++  P
Sbjct: 266 IMTSLEEQGLSENTLVVFFSDNGGPT---DSNASLNAPYRGQKGILLEGGIHVPFVMNLP 322

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
            L   G++ ++ V   D +PT L+ A  ++    + S V+ +IP
Sbjct: 323 GLLPEGLIYQEQVTSLDVVPTFLALAGDTETSMDMFSGVD-LIP 365


>gi|298706913|emb|CBJ29740.1| Formylglycine-dependent sulfatase, C-terminal fragment [Ectocarpus
           siliculosus]
          Length = 597

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 130/366 (35%), Positives = 190/366 (51%), Gaps = 53/366 (14%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLD--QIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
           +SSG P++IFIL DD+G NDVG+   D  Q+ TP ++ LA  G+IL +YY+ ++CTPSR+
Sbjct: 69  SSSGQPNVIFILVDDVGTNDVGYESTDLWQL-TPFMNTLAAEGVILDDYYSNEICTPSRA 127

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
           ++MTG+    TGMQ  V+      GLP+ E  L +  +  GY T + GKWHLG Y     
Sbjct: 128 SLMTGRDSFRTGMQFGVVEDSAAWGLPIDEVTLAERFQAAGYSTHMTGKWHLGVYSNANY 187

Query: 173 PTFRGFESHL----------------------GYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
           P  RGF++ L                      G +T +QD+     +    +  +  R  
Sbjct: 188 PFSRGFDTFLGYTGGGEGYYTHRECVTPEFEGGQYTCYQDFGYGDKDGYINFTTNTTRK- 246

Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNH---STDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
               ++ GKYST V T  A+++   H   S  +PLFLY+AH A H     EPL +P    
Sbjct: 247 --GPEMTGKYSTTVITDRAIEVAREHVEKSPSDPLFLYVAHQAVH-----EPLDSPPPGS 299

Query: 268 NIHRHIE-----DF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
                +E     D+   +R   A+I++ LD+ V ++  AL+   +L N++IVF SDNG  
Sbjct: 300 FTDEEVELLEGLDYSHRERRALASIVYYLDKEVRRLHTALDALGVLDNAVIVFASDNGAC 359

Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL-----ESRGIVAEQYVHVSDWLP 374
            +       SN+PLRG K+T ++GGVR  G ++S        E+RG      +H  DW P
Sbjct: 360 PS----KGGSNYPLRGFKHTTFQGGVRVPGFVYSKSADLIPEEARGTRYSGMMHSMDWTP 415

Query: 375 TLLSAA 380
           TL S  
Sbjct: 416 TLGSVV 421


>gi|372210445|ref|ZP_09498247.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 474

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 130/380 (34%), Positives = 211/380 (55%), Gaps = 32/380 (8%)

Query: 26  KELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPT 85
           K  GY T ++AF    L              S  P+I+ IL+DD GW DVGF+G   IPT
Sbjct: 3   KYYGYLTLLLAFCCTTLQ----------AQKSKQPNILIILSDDQGWGDVGFNGATDIPT 52

Query: 86  PNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERG----GLPL 140
           PN++ALA  G+I    Y+    C+PSR+ ++TG++    G ++N     +      GLPL
Sbjct: 53  PNLNALAKDGVIFSQGYSSHPYCSPSRAGLLTGRYQQKFGHENNPENEKQNEDTVIGLPL 112

Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
           +E ++ + L++  Y T  +GKWHLG   K + P  RGF+   G+  G  +Y+  +  + K
Sbjct: 113 NELMISEVLQQNNYHTCAIGKWHLGNAHK-FLPNQRGFKDWFGFSGGGFNYWGKTTPKNK 171

Query: 201 MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEP 259
             G+   ++ +P  +    Y TD F+ +A++ I  +S T++P F+YLA+ A     P+ P
Sbjct: 172 ELGV--MKNGKPVPENTLTYLTDDFSNQAINYIDQYSKTEQPFFMYLAYNA-----PHAP 224

Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           +QA   Y N+  HIE+ +R+ +AA++  +D  +GKV++ L++  + +N++IVF SDNGG 
Sbjct: 225 IQATKEYTNLVTHIENGERAAYAAMVAGMDAGIGKVIQKLKETGVYNNTLIVFYSDNGGH 284

Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLS 378
           + G     AS+ P RG K  L+EGG+R   +I W   ++      E  + + D  PT+L+
Sbjct: 285 SHG-----ASSAPYRGHKGMLFEGGIREPFVISWPNKIKGNRTYKEPIIAL-DIFPTILA 338

Query: 379 AANKSDIPNYVNSTVENIIP 398
           A N + +P   N    N++P
Sbjct: 339 ATN-TQVPASKNIDGVNLLP 357


>gi|443704179|gb|ELU01354.1| hypothetical protein CAPTEDRAFT_182406 [Capitella teleta]
          Length = 548

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 117/323 (36%), Positives = 179/323 (55%), Gaps = 16/323 (4%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II+  ADD G++D G+   D I TPNID L   GI   N Y+ Q C+PSRS+ ++G++
Sbjct: 29  PNIIWFFADDYGYHDAGYRNSD-IHTPNIDQLVADGISFTNAYSAQQCSPSRSSFLSGRY 87

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
              +GMQH V+       + L    L  YLKEL Y T   GKWHLG+  KE TPT+RGF+
Sbjct: 88  AYTSGMQHGVISDTAAHCMDLKYNFLSDYLKELNYNTHASGKWHLGYCNKECTPTYRGFD 147

Query: 180 SHLGYWTGHQDYFDHSA-EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           +  G ++G   Y++H+  + M  W    + D   +    G +S D+  A+ ++ + + + 
Sbjct: 148 TFSGGYSGEGKYYEHTTFQGMYDWHNGTKLDRSAS----GIHSQDLIEADLLNQL-DEND 202

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
             PLF Y A   THS  P +P   P+H            R K+  ++  +D  +GKVV  
Sbjct: 203 GTPLFYYAAFHNTHS--PMQP--KPEHEALYDDLDAPSTRKKYLGLVSGMDAVIGKVVAK 258

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L+++ M  N+ I F SDNGG     +++   N P RG K++L++GG        SPLL+ 
Sbjct: 259 LKEKDMFDNTYIFFSSDNGG-----DVHEGDNSPKRGAKSSLFDGGCHAHSWASSPLLKK 313

Query: 359 RGIVAEQYVHVSDWLPTLLSAAN 381
           +G+ A+  +H++DWLPT++  A 
Sbjct: 314 KGVEADGLLHITDWLPTIVKLAG 336


>gi|406830958|ref|ZP_11090552.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 441

 Score =  204 bits (518), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 132/361 (36%), Positives = 194/361 (53%), Gaps = 24/361 (6%)

Query: 43  AFTLSMVFVDLVASSGP---PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
            F  +++++ L +++     P+I+ I+ADDLG+ DVGFHG   IPTP+ID+LA SG    
Sbjct: 9   GFFFAILWMGLASAADAAKLPNILVIVADDLGYADVGFHGGKDIPTPHIDSLAASGTRFS 68

Query: 100 N-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
           + Y +   C+P+R+ ++TG++    G + N   G    GLPL+E  +   L+  GY T +
Sbjct: 69  SGYVSGPYCSPTRAGLLTGRYQQRFGHEFNP--GGANKGLPLTETTIADRLQAAGYATGL 126

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
           VGKWHLG   K + P  RGF    G+  GH  YFD    +++       +  EP      
Sbjct: 127 VGKWHLGTDPK-FHPLKRGFGEFFGFLAGHHTYFDKQEADIQR---GTTKVTEPG----- 177

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
            Y TD F  EAV  I  H  + P FLYLA  A H+     PL+A D  LN    I+   R
Sbjct: 178 -YLTDAFGREAVSFIERHQ-NHPFFLYLAFNAVHT-----PLEADDDRLNAFSSIDATPR 230

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGFNLNAASNWPLRGVK 337
             +AA+L  +D+++GKV+ +L +     N++I F+SDNGG    G  +N++ N PLRG K
Sbjct: 231 RTYAAMLSAMDDAIGKVLASLRETAQDENTLIFFISDNGGPTMPGTTINSSINAPLRGSK 290

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENII 397
            T  EGG+R   ++  P     G V    V   D  PT L+AA  +  P++    V N++
Sbjct: 291 RTTLEGGIRVPFVVRWPGHVPAGAVYAHPVIQLDIQPTALAAAGVTIQPDWKLDGV-NLL 349

Query: 398 P 398
           P
Sbjct: 350 P 350


>gi|283779108|ref|YP_003369863.1| sulfatase [Pirellula staleyi DSM 6068]
 gi|283437561|gb|ADB16003.1| sulfatase [Pirellula staleyi DSM 6068]
          Length = 468

 Score =  203 bits (517), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 129/353 (36%), Positives = 192/353 (54%), Gaps = 29/353 (8%)

Query: 30  YRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNID 89
           +R  + A   L L   L+     + A +  P+I+ I+ DD+G++D+G HG   IPTP++D
Sbjct: 3   FRYSLRALVALGL---LTAATTSMAADASRPNIVVIVGDDMGYHDLGVHGCKDIPTPHLD 59

Query: 90  ALAYSGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNV-LYGCERGGLPLSEKILPQ 147
           ALA SG+     Y +   C+P+R+ ++TG++    G + N         GLPLSE  L  
Sbjct: 60  ALATSGVRCTSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNPGPTPTGEIGLPLSETTLAD 119

Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF----DHSAEEMKMWG 203
            LK++GY+T +VGKWHLG  +K + P  RGF+   G+  G + YF    + SA    + G
Sbjct: 120 RLKKVGYKTGMVGKWHLGNDEKRH-PLSRGFDEFFGFLGGARTYFATPGNASAGTKLLRG 178

Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
            ++  + E        Y TD F  EAV  I + S   P FLYL   A H+     P++A 
Sbjct: 179 REVVDEKE--------YLTDAFAREAVAYI-DRSKASPFFLYLTFNAVHT-----PMEAS 224

Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
             YL+    + D KR K+ A++  +D++VG+VV  LE+ ++L N++I FVSDNGG  A  
Sbjct: 225 QKYLDRFTAVSDPKRQKYCAMMSAMDDAVGQVVAKLEREKLLENTLIFFVSDNGGPTAA- 283

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPT 375
             N   N PLRG K T WEGG+R    + W   + + G   +Q V   D++PT
Sbjct: 284 --NTGDNTPLRGFKATTWEGGIRVPYFVSWKGKIPA-GKTYDQPVIQIDFVPT 333


>gi|403182689|gb|EJY57565.1| AAEL017303-PA [Aedes aegypti]
          Length = 176

 Score =  203 bits (517), Expect = 3e-49,   Method: Composition-based stats.
 Identities = 91/162 (56%), Positives = 121/162 (74%), Gaps = 8/162 (4%)

Query: 44  FTLSMVFVDLVASSGP--------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
           F L ++ + ++++SG         P+I+ I+ DDLGWNDV FHG  QIPTPNIDALAY G
Sbjct: 6   FGLVLLLLAVISASGDRSRGSNERPNIVVIVTDDLGWNDVSFHGSSQIPTPNIDALAYQG 65

Query: 96  IILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYR 155
           IIL  +YT  LCTPSR+++M+GKHPI+ GMQH+V+   E  GL L +K++P+Y +E GYR
Sbjct: 66  IILNRHYTPPLCTPSRASLMSGKHPINVGMQHHVIESNEPWGLGLDQKLMPEYFREAGYR 125

Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE 197
           TR+VGKWHLGF++K YTPT RGF+SH GY   + DY+DHS +
Sbjct: 126 TRLVGKWHLGFFRKAYTPTRRGFDSHFGYIGPYIDYWDHSLQ 167



 Score = 42.4 bits (98), Expect = 1.3,   Method: Composition-based stats.
 Identities = 17/36 (47%), Positives = 26/36 (72%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           MQH+V+   E  GL L +K++P+Y +E GYRTR++ 
Sbjct: 95  MQHHVIESNEPWGLGLDQKLMPEYFREAGYRTRLVG 130


>gi|298715187|emb|CBJ27859.1| Formylglycine-dependent sulfatase, C-terminal fragment
           Formylglycine-dependent sulfatase, N-terminal
           [Ectocarpus siliculosus]
          Length = 610

 Score =  203 bits (517), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 136/361 (37%), Positives = 194/361 (53%), Gaps = 50/361 (13%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIP-TPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           ++SG PH+IFIL DD+G ND+G+   D    TP +D+LA SG++L NYYT QLCTPSR++
Sbjct: 65  SASGQPHVIFILIDDVGTNDMGYRSTDLWELTPFLDSLASSGVVLDNYYTNQLCTPSRAS 124

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           +MTG+    TGMQH ++      GLP  E  L    K  GY T + GKWHLG +     P
Sbjct: 125 LMTGRDSFRTGMQHGIVDYSAPWGLPFEEVTLADRFKAAGYSTHMTGKWHLGVFSDASYP 184

Query: 174 TFRGFESHLGYWTGHQDYFDHSA-----------EEMKMWG-------LDMRRDLE---P 212
             RGF++ LGY  G + Y++HS              +K +G       +D   +     P
Sbjct: 185 FSRGFDTFLGYTGGGEGYYNHSTCFTPTFEGGEYSCLKDFGYGDEDGYIDYTTNTTKEGP 244

Query: 213 AWDLHGKYSTDVFTAEAVDIIHNH----STDEPLFLYLAHAATHSANPYEPLQAP----- 263
           A  +   YST + T  A+D+   H    S+D+PLFLY+A+ A H+     PL AP     
Sbjct: 245 A--MVDNYSTTIMTDRAIDVAREHTGTASSDDPLFLYVAYQAAHT-----PLDAPPPGTF 297

Query: 264 --DHYLNIHRHIEDF-KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
             +    + R + D   R  FA I++ LD+ V ++ +ALE   +L N++IV  SDNG  +
Sbjct: 298 TEEEEETLARVVSDEPDRHVFARIIYYLDKEVRRLHDALEALGVLDNAVIVLASDNGACS 357

Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL-----ESRGIVAEQYVHVSDWLPT 375
           +    +  SN PLRG KNT ++GGVR    ++S        E+RG      +H +DW  T
Sbjct: 358 S----SGGSNHPLRGFKNTTFQGGVRVPAFVYSKSTELIPEEARGTTYSGMMHSTDWTQT 413

Query: 376 L 376
           L
Sbjct: 414 L 414


>gi|406832516|ref|ZP_11092110.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 453

 Score =  203 bits (516), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 126/342 (36%), Positives = 188/342 (54%), Gaps = 24/342 (7%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN- 100
           L F+++ V      +S  P+I+ I+ADDLG+ D+GF G   IPTP ID LA SG+   + 
Sbjct: 15  LLFSMTAVVGWAADASSKPNIVLIVADDLGYGDLGFQGGRDIPTPRIDGLARSGVTCSSG 74

Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQH---NVLYGCERGGLPLSEKILPQYLKELGYRTR 157
           Y +   C+P+R+ ++TG++    G +    N     E  GL L E+ LPQ LK+ GY T 
Sbjct: 75  YVSGPYCSPTRAGLLTGRYQQRFGHEFNPGNAARVTETFGLSLEERTLPQRLKQAGYATG 134

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF-DHSAEEMKMWGLDMRRDLEPAWDL 216
           IVGKWHLGF   ++ P  RGF+   G+  G   YF D ++ +       +RR  E    +
Sbjct: 135 IVGKWHLGF-APQFQPLERGFDEFFGFLGGAHPYFPDANSND------PIRRGREAV--V 185

Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
             +Y TD F  E+V  I + + + P FLYLA  A H+     P  A   +L   + I D 
Sbjct: 186 ESEYLTDAFARESVAYI-DRNKNRPFFLYLAFNAVHN-----PQHAKPEHLERFKSIADE 239

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
           +R  +A +L  +D++VG+V++ L   ++   +++VF+SDNGG  A    N ++N PL G 
Sbjct: 240 RRRTYAGMLTAMDDAVGQVLDKLAAEKLEQKTLVVFISDNGGPPA----NGSTNTPLNGQ 295

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
           K T WEGGVR   ++  P   + G   E+ V   D LPT+L+
Sbjct: 296 KATTWEGGVRVPFVVSWPGQLAAGKKYEEPVIQLDLLPTILT 337


>gi|326433598|gb|EGD79168.1| hypothetical protein PTSG_09899 [Salpingoeca sp. ATCC 50818]
          Length = 515

 Score =  203 bits (516), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 120/343 (34%), Positives = 188/343 (54%), Gaps = 22/343 (6%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHG--------LDQIPTPNIDALAYSGIILKNYYTVQ 105
           VA +  P+I+ I+ DD+GW +   H            + TP +  L   G+ L  +Y  +
Sbjct: 22  VARTSQPNIVLIIGDDVGWANTELHCNISDPAQVCSDVKTPFLLQLKDEGVHLLRHYAYK 81

Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
            C+P+RSA+M+G+ PIH   + N  Y  + GG+ L+  +LP+ L  +GY T  VGKWHLG
Sbjct: 82  FCSPTRSALMSGRLPIHVN-EVNPTY-IKGGGVSLNFTMLPEKLSRVGYATHHVGKWHLG 139

Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
                  PT RGF+S LGY  G +D++  +  +    G+D+  +  PA+  + +Y   +F
Sbjct: 140 MSSHSRVPTGRGFDSSLGYLLGAEDHWKQTVTQGNATGVDLWLNEGPAYLRNNEYGDLLF 199

Query: 226 TAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
              AV++IH+H+    D PLFLY+     HS     P + P  Y + +  I+   R+   
Sbjct: 200 GRRAVEVIHDHAKNHRDSPLFLYVGLQVNHS-----PAEVPKEYQDRYPGIQ-VDRAIVN 253

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
            +   +D+++GK+  AL +  M  ++++V++SDNGG A+G N N  +N+PLRG K+T +E
Sbjct: 254 GMSSVMDDAIGKITTALWETTMWEDTLVVYISDNGG-ASGINANNGNNYPLRGGKHTDFE 312

Query: 343 GGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAANKS 383
           GGVR A L+    L  + RG   + Y+H  DW  TL   A  S
Sbjct: 313 GGVRTAALVSGGFLPDDQRGSTRDGYMHACDWYTTLAKLAGAS 355


>gi|323456975|gb|EGB12841.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 536

 Score =  203 bits (516), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 126/344 (36%), Positives = 179/344 (52%), Gaps = 21/344 (6%)

Query: 60  PHIIFILADDLGWNDVGFHG----LDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           PH+  I+ADDLG+ DV ++G     + + TP +D LA  GI L  +Y+   CTP+R+A++
Sbjct: 33  PHVFLIVADDLGYGDVSYNGDGSLTNAVATPYLDRLAADGITLTRFYSQCDCTPARAALL 92

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           TG++P +TGMQH V+    +  LP    +LP  L E GYR   +GKW +G  +   TPT 
Sbjct: 93  TGRYPSNTGMQHEVVTAQSQWSLPHEFALLPSALPE-GYRKHAIGKWDVGHARAADTPTA 151

Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGL--DMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
           RGF+SHLGY+     Y +H+A      G   DM  D         +YST +F   A+ ++
Sbjct: 152 RGFDSHLGYYGAEITYDEHAALRSCSNGTIRDMNHDGATLAATEDRYSTHLFADHAMALV 211

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPD----HYLNIHRHIEDFKRSKFAAILHKLD 289
              + +  LFLYL   A H     +PL A       + +      D  RS FAA+  +LD
Sbjct: 212 DREADEYKLFLYLCFQAVH-----QPLAADAALVKRFADAFLDESDAARSTFAAVALELD 266

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
            +V + VE  + R    +++  F+SDNG   A       SNWPLRG K T +EGGVR   
Sbjct: 267 RAVERFVEFTKARGAYDDAVFFFMSDNGATLA--QSGGGSNWPLRGSKFTAYEGGVRVPA 324

Query: 350 LIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAANKSD-IPNYVN 390
            + S  +    RG       HV D LPTL + A  +D +P  V+
Sbjct: 325 FLHSARVPPARRGAAHAGLFHVVDVLPTLAAFAESADRVPAGVD 368


>gi|325285341|ref|YP_004261131.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
 gi|324320795|gb|ADY28260.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
          Length = 460

 Score =  203 bits (516), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 125/345 (36%), Positives = 185/345 (53%), Gaps = 21/345 (6%)

Query: 40  LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           L LAF     F  L+ +   P+II +L DD GW DVGF+G   IPTPN+D +A  G+I  
Sbjct: 7   LILAFCF-FYFAQLITAQKQPNIIVVLTDDQGWADVGFNGATDIPTPNLDRIASEGVIFS 65

Query: 100 NYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERG---GLPLSEKILPQYLKELGYR 155
           N Y     C+PSR+ ++TG++    G   N+ Y  E     G PLSEK++ + LKE GYR
Sbjct: 66  NGYVSHPYCSPSRAGLLTGRYQARFGHDCNMPYEGENDATVGTPLSEKLISEALKEQGYR 125

Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           T  +GKWH+G +   + P  +GF+   G+  G  +Y+  +  +++     + R+ +P  +
Sbjct: 126 TSAIGKWHIGDHPNLHPPA-QGFDHWFGFPGGSMNYWGKATSKIQT----IYRNTKPVAE 180

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
               Y TD FT EA++ I N     P F+YLA+ A     P+ P  A   YL   +HIE 
Sbjct: 181 EELTYLTDDFTNEAINFI-NKKDKNPFFIYLAYNA-----PHAPDHATKKYLEQTKHIEY 234

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
             RS +AA+++ +D ++GK+   L    +  N+I+VF+SDNGG      +  A N P RG
Sbjct: 235 AGRSVYAAMVNAVDANIGKIDSTLVANGIKDNTILVFLSDNGG-----RIQHADNRPYRG 289

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            K  L+EGG++    I  P          + +   D  PTLL+AA
Sbjct: 290 HKGMLFEGGIKVPFFISWPSKIKEDSKYTKPISSLDLFPTLLNAA 334


>gi|402820941|ref|ZP_10870501.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
           IMCC14465]
 gi|402510173|gb|EJW20442.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
           IMCC14465]
          Length = 496

 Score =  202 bits (514), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 120/348 (34%), Positives = 191/348 (54%), Gaps = 25/348 (7%)

Query: 40  LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           L L   L + F+    +S  P+I+ ILADDLG+ D+GF G D I TPN+D LA  GI+L 
Sbjct: 3   LGLLILLFVAFISGAKASEKPNIVIILADDLGYADLGFRGSD-IQTPNLDRLAAEGIVLN 61

Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
            +Y++ +CTP+RSA+MT + PI  G  +  L   E GG+   E  +P+  K+ GY+T ++
Sbjct: 62  RFYSLPICTPTRSALMTARDPIKLGTAYAGLQPWENGGVSPDEHFMPESFKKAGYQTAMI 121

Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA---EEMKMWGLDMRRDLEPAWDL 216
           GKWH+G   +   P  RGF+   G+     DY+ H++    +++  G  ++RD       
Sbjct: 122 GKWHIGRQYESLVPHHRGFDHFFGHLNTQVDYYTHASAGGHDLQENGKSLKRD------- 174

Query: 217 HGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
              Y+TD+   E+V  +       +P  LY+   A HS     P+QAP+     ++   +
Sbjct: 175 --AYATDIHGDESVRYLKEIRDPSKPFLLYVPFLAPHS-----PMQAPEALEAKYKSRLN 227

Query: 276 FKRSK--FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
           F   K  +AA++  +D+++G++++ L+++ +  N+II+F SDNGG  AGF      N P 
Sbjct: 228 FPVPKRTYAAMVDSMDQAIGRIMDELDRQGVSDNTIILFFSDNGG-FAGF---GGVNKPY 283

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           RG K   +EGG+R   ++  P     G    Q + V D  PTL +AA+
Sbjct: 284 RGGKLETFEGGIRVNAIMRWPGAIPAGSKTAQAISVMDVFPTLAAAAD 331


>gi|296121469|ref|YP_003629247.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296013809|gb|ADG67048.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 487

 Score =  201 bits (511), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 118/317 (37%), Positives = 174/317 (54%), Gaps = 27/317 (8%)

Query: 32  TRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           ++I AFA   LA  ++ V      ++  P+I+ I+ DD+G+ DVGFHG   IPTPN+DAL
Sbjct: 14  SQISAFAFCMLALVITPVI-----AADRPNILLIVGDDMGYADVGFHGCKDIPTPNLDAL 68

Query: 92  AYSGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           A SG+     Y T   C+P+R+ ++TG++    G + N        GLPL+E  +   LK
Sbjct: 69  AKSGVQFTSGYVTGPYCSPTRAGLLTGRYQQRFGHEFNP--SGANTGLPLTEVTIADRLK 126

Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
           ++GY T +VGKWHLG  +    P  RGFE  +G+  G   +FD            + R  
Sbjct: 127 QVGYTTGLVGKWHLG-SQPAMHPQERGFEEFIGFLGGAHSFFDAQG---------ILRGH 176

Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
           EP   +   Y+TD+F  EAV  I  H  D+P FLYL+  A H+     P+ A +  +   
Sbjct: 177 EPVKTI--DYTTDLFGREAVSFIEKHR-DKPWFLYLSFNAVHT-----PMHATEDRMAKL 228

Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGFNLNAAS 329
             I D +R  +AA++  +DE++GKV+  LE       ++++F+SDNGG    G  +N + 
Sbjct: 229 ASISDQERRTYAAMMLAMDEAIGKVLTQLETTGQKQKTLVMFISDNGGPTMPGVTINGSI 288

Query: 330 NWPLRGVKNTLWEGGVR 346
           N PLRG K T  EGG+R
Sbjct: 289 NTPLRGSKRTTLEGGIR 305


>gi|323456753|gb|EGB12619.1| hypothetical protein AURANDRAFT_70521 [Aureococcus anophagefferens]
          Length = 913

 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 131/417 (31%), Positives = 209/417 (50%), Gaps = 44/417 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI++ L DDLGW+DV +H    + TP + ALA  G++L  +Y  + C+PSRS++++G++
Sbjct: 411 PHIVYFLVDDLGWHDVPWHN-PSLKTPTLAALAADGVVLDRFYAYRFCSPSRSSLLSGRY 469

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P+H   Q+N+      GG+ ++   + + LK  GY T  +GKWH G    +  P  RGF+
Sbjct: 470 PMHVN-QYNMAGDALGGGVHVNMTTIAKKLKGAGYATHQLGKWHAGQSSADLVPAARGFD 528

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWG-LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           + LGY  G +D++  +     +   +D+     PA+  +G Y   ++   A+DII +H  
Sbjct: 529 TSLGYLNGAEDHWTQARPACGVGNFVDLYATDGPAFGKNGTYGAQIYHDAALDIIADHDA 588

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK----RSKFAAILHKLDESVGK 294
             PLF+Y A    H+     P+Q PD Y  ++   +D K    RS + A+    D+ V  
Sbjct: 589 SVPLFVYFAFQINHA-----PMQVPDAYARVY-PCDDPKTCGVRSTYQAMTAMADDVVRN 642

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
              AL+ + M  +++IV  SDNGG  +G + ++++N+PLRG K +  EGGVR AGL+   
Sbjct: 643 ATAALKAKGMWDDTLIVLTSDNGG-PSGTDADSSNNFPLRGGKYSDLEGGVRAAGLVAGG 701

Query: 355 LL--ESRGIV---AEQYVHVSDWLPTL-------------------LSAANKSDIPNYVN 390
            L   +RG     A  YVH+ DW  T                    L A +  D+ +++ 
Sbjct: 702 FLPDAARGRTLDGAGAYVHICDWYATFARLAGVDAADDASDADGRPLPAVDSLDVWDFL- 760

Query: 391 STVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYEN--RYENGT 445
           S   +  PR E   +    G H+           RY+   +  +P   N   Y NGT
Sbjct: 761 SGAADASPRTE---IVLTAGRHQPGGSGGGYIGGRYKVVFNTQSPAIWNGPEYPNGT 814


>gi|323454930|gb|EGB10799.1| hypothetical protein AURANDRAFT_22180 [Aureococcus anophagefferens]
          Length = 530

 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 121/340 (35%), Positives = 171/340 (50%), Gaps = 25/340 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPT--PNIDALAYSGIILKNYYTVQLCTPSRSAIMTG 117
           PHI F+L DDLG+ND        +    P++ ALA   + +  YYT  LCTP+R A MTG
Sbjct: 20  PHIAFMLVDDLGFNDAWAVNTSDVAGAWPHVGALANESVRIATYYTTMLCTPTRGAFMTG 79

Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
           + P   G+ H V+ G +  GLP +E  +   LK  GY T  VGKWHLG +     PT RG
Sbjct: 80  RLPQRLGLHHGVIGGFQDYGLPANETTIADKLKGAGYATAHVGKWHLGNFDDASEPTRRG 139

Query: 178 FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD------LEPAWDLHGKYSTDVFTAEAVD 231
           F++  GY  G +D+F    + +  +      D      ++    ++G +++ +F   AV 
Sbjct: 140 FDASYGYQNGEEDHFTRILQGLGDFQASSLADGASYYTVDDLLKINGTHNSFLFVDRAVA 199

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY--------LNIHRHIEDFKRSKFAA 283
           ++  H    PLFLYLA   TH+     P+QAP  Y        +   RH  D  R +F  
Sbjct: 200 VVGAHDASIPLFLYLALQDTHA-----PIQAPAGYADDAACASIGSSRHYPDGARREFCG 254

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
           ++   DES+  V  A++      NS+IV   DNGG   G      +NWPLRG K   WEG
Sbjct: 255 MVRAADESIANVTAAIDANFGKRNSVIVIAGDNGGIVKG----GGNNWPLRGQKGQPWEG 310

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           GVR   LI +P   + G+  +  VHV D   TL++ A  +
Sbjct: 311 GVRNHALIRAPGRLAPGLYDKGMVHVVDLHATLVALAGGA 350


>gi|340367649|ref|XP_003382366.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 495

 Score =  200 bits (509), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 119/331 (35%), Positives = 180/331 (54%), Gaps = 21/331 (6%)

Query: 53  LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
            VA++  P+I+F+L DD G+ DVGF     I +PN D LA +G++L  +Y  + C PSR+
Sbjct: 17  FVATAKLPNIVFVLVDDWGFADVGFRN-PAISSPNFDQLAKTGLVLNRHYVFKYCAPSRA 75

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
           +++TG+ P H   Q N+       G  L+  + P  LK   Y T +VGKWH GF+   Y 
Sbjct: 76  SLLTGRWPHHV-YQWNLATD-ATAGTNLNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYL 133

Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           P  RGF++  G+  G +D+   +A    +  +D  ++  P    +G Y   ++  +  DI
Sbjct: 134 PINRGFDTSSGFLCGSEDHMTQNA----ICAIDYWKNNAPD-PRNGTYDAYIYRDDLTDI 188

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF-KRSKFAAILHKLDES 291
           I++H+TDEPLFLYL     H      PLQAP  +L+I+       KR    A++   D  
Sbjct: 189 INSHNTDEPLFLYLPLHNVH-----LPLQAPKEWLDIYPANSTCDKRHTLQAMVSVADNV 243

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
            G +VE L+++ +  N+I+V  +DNGGA       + SN+PLRG K TL+EGGVR    +
Sbjct: 244 TGHLVELLKKKGLWDNTIMVISADNGGAHC-----SGSNYPLRGCKETLFEGGVRSLAFV 298

Query: 352 WSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
              LL    RG   + ++H++DW  T    A
Sbjct: 299 NGGLLPESRRGQSTDGFIHIADWYTTFCKLA 329


>gi|443700441|gb|ELT99395.1| hypothetical protein CAPTEDRAFT_208054 [Capitella teleta]
          Length = 558

 Score =  200 bits (509), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 121/338 (35%), Positives = 183/338 (54%), Gaps = 24/338 (7%)

Query: 60  PHIIFILADDL-----GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           P++I+  ADD      G+ D G+     I TPNID L   GI   N Y+ Q C+PSRS+ 
Sbjct: 66  PNVIWFFADDYDLALPGFQDAGYRN-SAIHTPNIDKLVGDGISFTNAYSSQQCSPSRSSF 124

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           ++G++P  +GMQH V+       + L  K L  YLK+L Y T  VGKWHLG+  K+ TPT
Sbjct: 125 LSGRYPYKSGMQHGVISDEGPNCMDLKFKFLSDYLKDLNYNTHAVGKWHLGYCNKKCTPT 184

Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKM-WGLDMRRDLEPAWDLHGKYSTDVFTAEAV-DI 232
           +RGF++  G ++G  DY++H+       W      DL  A    G +S D+  A+ + ++
Sbjct: 185 YRGFDTFSGGYSGEGDYYEHTTHGGWYDWHNGTETDLTAA----GVHSQDLVEADMMKNL 240

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
             N  T  P F Y A   THS  P +P   P+H    +       R K+  ++  +D  +
Sbjct: 241 DENDGT--PFFYYTAFHNTHS--PLQP--KPEHKALYNDMDATPSRKKYLGLVSGMDAVI 294

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G VV  L+ + M  N+ I F SDNGG     ++N   N P RG K++L+EGG        
Sbjct: 295 GNVVAKLKAKGMFDNTYIFFSSDNGG-----DVNEGDNSPWRGAKSSLFEGGCHAHSWAS 349

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVN 390
           SPL+  +G+ A+  +H++DWLPT+L+ A  + +P+ ++
Sbjct: 350 SPLIRKKGVEADGLLHMTDWLPTILTLAGGT-VPDGID 386


>gi|32470862|ref|NP_863855.1| arylsulfatase B [Rhodopirellula baltica SH 1]
 gi|32443007|emb|CAD71528.1| arylsulfatase B [Rhodopirellula baltica SH 1]
          Length = 520

 Score =  200 bits (508), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 126/361 (34%), Positives = 190/361 (52%), Gaps = 30/361 (8%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  LPL F  +       A S PP+I+ ILADD+G+ D+G  G   + TPN+D LA SG+
Sbjct: 34  FVFLPLIFVFATE-ASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGV 92

Query: 97  ILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNV-------LYGCERGGLPLSEKILPQY 148
           +    Y    +C+PSR+ ++T + P   G + N+           E  GLP SEK L  +
Sbjct: 93  LCSQAYVASAVCSPSRAGLLTSRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTLADH 152

Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
           L   GY T ++GKWHLG  +  + P  RGF+   G  TG   YF  + + +      + R
Sbjct: 153 LGAAGYATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGSHHYFPATMKHV------IER 205

Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDH 265
           + +   D   +Y TD FT E +  I  H +   D+P F++ ++ A     P+ P+ A + 
Sbjct: 206 NGKRVDDFSSEYLTDFFTDEGLRFIDQHKSANPDQPWFVFFSYNA-----PHTPMHATEA 260

Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
            L    +I++ KR  +AA+++ LD  VG++ E LE+     N+++VF SDNGGA      
Sbjct: 261 DLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGATN---- 316

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
           N + N PLRGVK ++ EGG+R   +IW+ P     G++ +  V   D LPT  SAA    
Sbjct: 317 NGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVLYDGVVSSLDLLPTFCSAAGAEP 375

Query: 385 I 385
           +
Sbjct: 376 L 376


>gi|340367651|ref|XP_003382367.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 494

 Score =  200 bits (508), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 121/331 (36%), Positives = 182/331 (54%), Gaps = 21/331 (6%)

Query: 53  LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
            VA++  P+IIF+L DD G+ DVGF     I +PN D LA +G++L  +Y    C+PSR+
Sbjct: 16  FVATAELPNIIFVLVDDWGFADVGFKN-PAISSPNFDHLAKTGLVLNRHYVYMYCSPSRA 74

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
           +++TG+ P HT  Q N L      G  L+  ++P  LK   Y T +VGKWH GF+   Y 
Sbjct: 75  SLLTGRWPHHT-HQWN-LGNNSTAGTNLAMTMIPAKLKAANYATHMVGKWHQGFFDPRYL 132

Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           P  RGF++  G+  G +D+   +A    +  +D  ++  P    +G Y   ++  +  DI
Sbjct: 133 PINRGFDTSSGFLCGSEDHMTQNA----ICAIDYWKNNAPD-PRNGTYDAYIYRDDLTDI 187

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF-KRSKFAAILHKLDES 291
           I++H+T+EPLFLYL     H      PLQAP  +L+I+       KR    A++   D  
Sbjct: 188 INSHNTNEPLFLYLPLHNVH-----LPLQAPKEWLDIYPANSTCEKRHTLQAMISVADNV 242

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
            G +VE L+++ +  N+I+V  +DNGGA       + SN+PLRG K TL+EGGVR    +
Sbjct: 243 TGHLVELLKKKGIWDNTIMVISADNGGAHC-----SGSNYPLRGCKETLFEGGVRSLAFV 297

Query: 352 WSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
              LL    RG   + ++H++DW  T    A
Sbjct: 298 NGGLLPESRRGQSTDGFIHIADWYTTFCKLA 328


>gi|291244839|ref|XP_002742301.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 263

 Score =  200 bits (508), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 95/186 (51%), Positives = 127/186 (68%), Gaps = 2/186 (1%)

Query: 203 GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA 262
           G D+R ++   + ++G Y+T+VFT +A  II  H+   PLFLYLAH A HS NP  PL+A
Sbjct: 2   GYDLRDNVNVTYYMNGSYTTEVFTTKAEQIIKKHNKQNPLFLYLAHLAVHSGNPSNPLEA 61

Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
           P+ Y++   HI+D  R  FA ++  LDESVG + + L+   M +N+II+F +DNGG AAG
Sbjct: 62  PEKYMSRFPHIQDKNRKTFAGMVSALDESVGNLTKVLKDSGMYNNTIIIFTTDNGGPAAG 121

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL--LSAA 380
           FN N ASNWPLRG+K TLWEGGVRG G + SPLLE  G V+   +HV DWLPT+  ++  
Sbjct: 122 FNNNHASNWPLRGIKATLWEGGVRGDGFVHSPLLEKPGRVSTDMIHVCDWLPTIYHIAGG 181

Query: 381 NKSDIP 386
           N S +P
Sbjct: 182 NVSSLP 187



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/59 (44%), Positives = 33/59 (55%)

Query: 501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN 559
            N +S    +DG +VW  LSR   S R  ILHNID   QI+AL  GK+K++      GN
Sbjct: 181 GNVSSLPTNLDGFNVWDTLSRGSVSPRTEILHNIDPILQIAALRVGKYKIIMGECYRGN 239



 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 31/51 (60%)

Query: 566 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLV 616
            N +S    +DG +VW  LSR   S R  ILHNID   QI+AL  GK+K++
Sbjct: 181 GNVSSLPTNLDGFNVWDTLSRGSVSPRTEILHNIDPILQIAALRVGKYKII 231


>gi|313247306|emb|CBY15582.1| unnamed protein product [Oikopleura dioica]
          Length = 486

 Score =  200 bits (508), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 110/312 (35%), Positives = 171/312 (54%), Gaps = 20/312 (6%)

Query: 81  DQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL 140
           D I TPNIDA++ +G+ L+NYY   +CTPSRS +++G++ IHTG+QH +++      LPL
Sbjct: 9   DIIKTPNIDAISAAGVRLENYYVQPICTPSRSQLLSGRYQIHTGLQHQLIWMGMPSALPL 68

Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS-AEEM 199
             ++LP+ ++  GY T   GKWHLG+ K   TP  RGF +  GY  G +DY+  +   + 
Sbjct: 69  DTELLPETMRNCGYHTMAAGKWHLGYAKTANTPWGRGFHNFTGYLGGSEDYYKKTRCIDH 128

Query: 200 KMWGLDMRRDLEPAWDL-----HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA 254
              G+D   D E   +        +YS   +  +A + I     D+P FLYL   + H+ 
Sbjct: 129 HKCGIDQNTDGEIFGERVYNADASEYSAFKYIRQAKNYIDGRDKDKPFFLYLPMQSVHA- 187

Query: 255 NPYEPLQAPDHYLNIHRH-IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
               PL+AP  Y+  +   IED  R  +A ++  +DE +G +   L++  + +N+I +F 
Sbjct: 188 ----PLEAPAEYIAKYDGIIEDKPRKIYAGMVSVMDEGIGNITSHLKESGLWNNTIFIFS 243

Query: 314 SDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR----GIVAEQYVHV 369
           +DNGG A        +N PLRG K   WEGG+ G G +     E R     +V  + +H+
Sbjct: 244 TDNGGQA----YVGGNNLPLRGNKAGYWEGGIHGNGFVAGGYFEDRRKGSELVNNELIHI 299

Query: 370 SDWLPTLLSAAN 381
           SDW PT++ A +
Sbjct: 300 SDWYPTIMEATS 311


>gi|32473617|ref|NP_866611.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
 gi|32398297|emb|CAD78392.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
          Length = 543

 Score =  199 bits (507), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 129/339 (38%), Positives = 180/339 (53%), Gaps = 28/339 (8%)

Query: 53  LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSR 111
           +V +   P+I+ I+ADDLG++DVGF+G  +IPTP++D LA SG++  N Y     C+PSR
Sbjct: 38  VVGAKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSR 97

Query: 112 SAIMTGKHPIHTGMQHNV-----LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
           + ++TG+H    G   N       +G +  G+PLSE  L   LKE GY T  +GKWHLG 
Sbjct: 98  AGLLTGRHQQRFGHGSNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLG- 156

Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT 226
             K + P  RGF+   G+  G   Y+      MK   L + R  EP       + TD F+
Sbjct: 157 DAKPFWPNRRGFDEWFGFSGGGFSYWGDLG--MKDPLLGVHRGDEPVDPKTLTHLTDDFS 214

Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH----RHIEDFKRSKFA 282
            EAV  I  H T EP FLYLA+ A H         APDH    H     HIE   R+ + 
Sbjct: 215 TEAVKFIQRHET-EPFFLYLAYNAPH---------APDHATRAHLQKTAHIEYGGRAVYG 264

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
           A++  +DE +G+VV+ + +  +  N++I+F SDNGG         A N+P RG K  L+E
Sbjct: 265 AMVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG-----RREHAVNFPYRGHKGMLFE 319

Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           GG+R   L+  P     G+  E  +   D  PT L+AA 
Sbjct: 320 GGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAG 358


>gi|440715767|ref|ZP_20896296.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
 gi|436439253|gb|ELP32723.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
          Length = 826

 Score =  199 bits (507), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 125/339 (36%), Positives = 182/339 (53%), Gaps = 28/339 (8%)

Query: 53  LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSR 111
           +V +   P+I+ I+ADDLG++DVGF+G  +IPTP++D LA SG++  N Y     C+PSR
Sbjct: 38  VVGAKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSR 97

Query: 112 SAIMTGKHPIHTGMQHNV-----LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
           + ++TG+H    G + N       +G +  G+PLSE  L   LKE GY T  +GKWHLG 
Sbjct: 98  AGLLTGRHQQRFGHESNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLG- 156

Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT 226
             K + P  RGF+   G+  G   Y+    ++  + G+   R  EP       + TD F+
Sbjct: 157 DAKPFWPNHRGFDEWFGFSGGGFSYWGDLGKKDPLLGV--HRGDEPVDPKTLTHLTDDFS 214

Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH----RHIEDFKRSKFA 282
            EAV  I  H + EP FLYLA+ A H         APDH    H     HIE   R+ + 
Sbjct: 215 TEAVKFIQRHES-EPFFLYLAYNAPH---------APDHATRAHLQKTAHIEYGGRAVYG 264

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
           A++  +DE +G+VV+ + +  +  N++++F SDNGG         A N+P RG K  L+E
Sbjct: 265 AMVAGMDEGIGRVVDQIRESGLGENTMVIFYSDNGG-----RREHAVNFPYRGHKGMLFE 319

Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           GG+R   L+  P     G+  E  +   D  PT L+AA 
Sbjct: 320 GGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAG 358


>gi|326428223|gb|EGD73793.1| arylsulfatase [Salpingoeca sp. ATCC 50818]
          Length = 697

 Score =  199 bits (507), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 114/326 (34%), Positives = 175/326 (53%), Gaps = 18/326 (5%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHI++ L DDLG+ + G    D+  TP+ID LA +G +L+ +YT + C+P+RS++++G+
Sbjct: 74  PPHIVYFLVDDLGYANTGVTN-DEPLTPHIDELARNGTMLRRFYTYKYCSPTRSSLLSGR 132

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
            PIH   Q N       GG+  +   L + L+  GY T  VGKWH G    +  P  RGF
Sbjct: 133 LPIHVN-QENRPPPVPGGGIDQNMTTLAEVLQRAGYATHQVGKWHCGMSSPDRLPVHRGF 191

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
            S  GY +G +    H   +++   +D+ RD +PA+  +G Y T ++  EA  IIH H  
Sbjct: 192 NSSFGYLSGAE----HHYSQIRQGYVDLWRDTQPAYGENGTYGTFMYAREAERIIHAHDP 247

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
             PLFLYLA    HS     PL+ PD +L+ +  ++   R    A++  +DE++  V  A
Sbjct: 248 TTPLFLYLAFQNVHS-----PLEVPDRFLDKYPDVDYRPRKHCLAMVSAVDEAINNVTLA 302

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L +  +  +++I+F SDNGG       + A+N+P RG K   +EGGVR    +   ++  
Sbjct: 303 LRRAGLFDDTLIIFSSDNGGPH-----DHANNYPFRGAKTADFEGGVRAVAFVSGGVVPK 357

Query: 359 R--GIVAEQYVHVSDWLPTLLSAANK 382
              G      +HV DW  TL   A +
Sbjct: 358 NMVGTNVHGLMHVCDWYSTLARLAGE 383


>gi|417303628|ref|ZP_12090677.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
 gi|327540049|gb|EGF26644.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
          Length = 826

 Score =  199 bits (506), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 126/339 (37%), Positives = 182/339 (53%), Gaps = 28/339 (8%)

Query: 53  LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSR 111
           +V +   P+I+ I+ADDLG++DVGF+G  +IPTP++D LA SG++  N Y     C+PSR
Sbjct: 38  VVGAKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSR 97

Query: 112 SAIMTGKHPIHTGMQHNV-----LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
           + ++TG+H    G + N       +G +  G+PL+E  L   LKE GY T  +GKWHLG 
Sbjct: 98  AGLLTGRHQQRFGHESNPEPDTQWHGEDTPGMPLTETTLADALKEAGYVTGAIGKWHLG- 156

Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT 226
             K + P  RGF+   G+  G   Y+    ++  + G+   R  EP       + TD F+
Sbjct: 157 DAKPFWPNRRGFDEWFGFSGGGFSYWGDLGKKDPLLGV--HRGDEPVDPKTLTHLTDDFS 214

Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH----RHIEDFKRSKFA 282
            EAV  I  H T EP FLYLA+ A H         APDH    H     HIE   R+ + 
Sbjct: 215 TEAVKFIQRHET-EPFFLYLAYNAPH---------APDHATRAHLQKTAHIEYGGRAVYG 264

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
           A++  +DE +G+VV+ + +  +  N++I+F SDNGG         A N+P RG K  L+E
Sbjct: 265 AMVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG-----RREHAVNFPYRGHKGMLFE 319

Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           GG+R   L+  P     G+  E  +   D  PT L+AA 
Sbjct: 320 GGIRVPFLVSWPGTVRSGMKEELPITALDLFPTALAAAG 358


>gi|313228605|emb|CBY07397.1| unnamed protein product [Oikopleura dioica]
          Length = 492

 Score =  199 bits (506), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 180/365 (49%), Gaps = 67/365 (18%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PH+IF+L DDLG++D+G+   D I +PNIDALA   + LK +Y    CTPSR+A +TG++
Sbjct: 24  PHVIFVLVDDLGFDDLGYVNRDVI-SPNIDALAKDALHLKKHYVQPSCTPSRAAFLTGRY 82

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            I  GMQ  V+   E  G+PL E +L +  K+ GYRT + GKWHLGFY  ++ P  RGF+
Sbjct: 83  NIRMGMQSGVIRPSEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQNRGFD 142

Query: 180 SHLGYWTGHQDYFDHSAEEM-------------------------KMWGLDMRRDLEPAW 214
              G++ G QD++ H +  +                         + WG D R       
Sbjct: 143 RFYGFYLGSQDFYFHDSGRLSSHKFSYEDEVSGVNNKRWLQGPHGRPWGYDFREAYPANG 202

Query: 215 DL-----------HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
           D+           +G YST +F  + ++ +  H     LF Y++    H      PLQ  
Sbjct: 203 DVENDTILDDFHTNGTYSTKLFVNDFINDLAKHDPAVSLFNYVSFQDVHG-----PLQTV 257

Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
           + +   +++  D    +   I  K+                   S++VF SDNGG     
Sbjct: 258 NKFKKHYKNKTDTWTHERILISTKI------------------TSLLVFTSDNGGRPK-- 297

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR--GIVAEQYVHVSDWLPTLLSAAN 381
               ASNWPLRG K T+++GG++    + SPLL++R  G       HVSDW PTLL+ + 
Sbjct: 298 --EGASNWPLRGSKGTIYDGGLKSRAFVASPLLQNRMKGQNYNYLFHVSDWFPTLLTLSG 355

Query: 382 KSDIP 386
             D+P
Sbjct: 356 -CDVP 359


>gi|421613320|ref|ZP_16054406.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408495914|gb|EKK00487.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 826

 Score =  199 bits (505), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 125/332 (37%), Positives = 179/332 (53%), Gaps = 28/332 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+I+ I+ADDLG++DVGF+G  +IPTP++D LA SG++  N Y     C+PSR+ ++TG+
Sbjct: 45  PNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGR 104

Query: 119 HPIHTGMQHNV-----LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           H    G + N       +G +  G+PLSE  L   LKE GY T  +GKWHLG   K + P
Sbjct: 105 HQQRFGHESNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLG-DAKPFWP 163

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
             RGF+   G+  G   Y+    ++  + G+   R  EP       + TD F+ EAV  I
Sbjct: 164 NHRGFDEWFGFSGGGFSYWGDLGKKDPLLGV--HRGDEPVEPKTLTHLTDDFSTEAVKFI 221

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH----RHIEDFKRSKFAAILHKLD 289
             + T EP FLYLA+ A H         APDH    H     HIE   R+ + A++  +D
Sbjct: 222 QRNET-EPFFLYLAYNAPH---------APDHATRAHLQKTTHIEYGGRAVYGAMVAGMD 271

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           E +G+VV+ + +  +  N++I+F SDNGG         A N+P RG K  L+EGG+R   
Sbjct: 272 EGIGRVVDQIRESGLGENTMIIFYSDNGG-----RREHAVNFPYRGHKGMLFEGGIRVPF 326

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           L+  P     G+  E  +   D  PT L+AA 
Sbjct: 327 LVSWPGTVRSGMKEESPITALDLFPTALAAAG 358


>gi|313219585|emb|CBY30507.1| unnamed protein product [Oikopleura dioica]
          Length = 617

 Score =  199 bits (505), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 122/356 (34%), Positives = 189/356 (53%), Gaps = 42/356 (11%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+FI+ADDLG+ND+G++ ++   TPNI+ LA +GIIL ++Y+  +C+PSRS  +TG++
Sbjct: 158 PNIVFIMADDLGYNDIGYNSIEAF-TPNINYLAKNGIILDSHYSQPVCSPSRSQFLTGRY 216

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
               GMQH  +   +  G+PL+EK+LP+  KE GY T   GKWH G + + Y PT RGF+
Sbjct: 217 SFRYGMQHRNILPTQPHGVPLTEKMLPEVFKECGYSTFGTGKWHQGMFHESYLPTSRGFD 276

Query: 180 SHLGYWTG------HQDYFDHSAEE--MKMWGLDMRRD--LEPAWDLHGKYSTDVFTAEA 229
             +G ++G      H+  F+    +      G DMR    +E   +L G   +D    E 
Sbjct: 277 KFVGSYSGSSQHSTHEKCFNSPQRDSPFSFCGYDMREAEVVEARQNLTG---SDFLLREI 333

Query: 230 V--DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
                +H   T +P               YE L         HR+   +KR+K  A++  
Sbjct: 334 CIRITVHGPLTTDP----------RDEKIYEEL------YERHRNETIYKRTKMLAMVTG 377

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           LD ++G+++E+ ++     N+I  F SDNGG+        A+N P RG KN L EGGVR 
Sbjct: 378 LDRAIGRIIESFKKYGKYKNTIFAFASDNGGSLT----YRANNMPFRGGKNDLLEGGVRT 433

Query: 348 AGLIWSPLLESRGIVAE--QYVHVSDWLPTLLSA--ANKSDIPNYVN--STVENII 397
              + S L+ +  I +       ++DW PTLL A   ++ ++PN ++  S  EN  
Sbjct: 434 LSFLHSSLIPNDKIGSRYSNLFDITDWFPTLLDAGKCSQKNLPNQLDGKSHFENFF 489


>gi|119504674|ref|ZP_01626753.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
 gi|119459696|gb|EAW40792.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
          Length = 545

 Score =  199 bits (505), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 123/363 (33%), Positives = 193/363 (53%), Gaps = 36/363 (9%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L     ++ V   A S  P+I+ ++ADDLGW DVG+HG D I TP++D LA  G+ L  +
Sbjct: 15  LLMMCVILGVPASAQSQKPNILIMVADDLGWADVGYHGGD-IDTPSLDRLAQQGVRLNRF 73

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           YT  +C+P+R+A+MTG+ PI  G+ + V++  +  G+   E  +P+  +  GY+T I+GK
Sbjct: 74  YTTPICSPTRAALMTGRDPIRLGVTYGVIFPWDNIGVHPDEHFMPETFQAAGYQTAIIGK 133

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           WHLG  +  Y P  RGFE   G+      ++   + +    G D +R+   + D  G Y 
Sbjct: 134 WHLGHAQMTYHPNNRGFEHFYGHLHTEVGFYPPFSNQG---GKDFQRN-GVSIDDQG-YE 188

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP----DHYLNIH------- 270
           T +   E    I     D P  +Y+   A H+     PL AP    D Y +I        
Sbjct: 189 TYLLADEVSRYIRERDRDRPFLVYMPFIAPHT-----PLDAPVELQDKYKDIETDLPMAR 243

Query: 271 -RHIEDFK-----------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
            R  +D +           R  +AA++  +D+++G+V++ L+Q  +  N+I++F SDNGG
Sbjct: 244 SRQTDDTRLISRVMLQPSARPMYAAVVDAMDQAIGRVLDTLDQEGISDNTIVLFFSDNGG 303

Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
           AA  ++   A+N PLRG K   +EGG+R   L+  P +   G + EQ + V D  PTL+ 
Sbjct: 304 AA--YSYGGANNAPLRGGKGETFEGGIRVTSLMRWPAMLEPGQIFEQIMSVMDVFPTLVD 361

Query: 379 AAN 381
           AA+
Sbjct: 362 AAD 364


>gi|440716553|ref|ZP_20897058.1| arylsulfatase B [Rhodopirellula baltica SWK14]
 gi|436438412|gb|ELP31962.1| arylsulfatase B [Rhodopirellula baltica SWK14]
          Length = 498

 Score =  199 bits (505), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 126/361 (34%), Positives = 188/361 (52%), Gaps = 30/361 (8%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  LPL F  +       A S PP+I+ ILADD+G+ D+G  G   + TPN+D LA SG+
Sbjct: 12  FVFLPLIFVFATE-ASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGV 70

Query: 97  ILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNV-------LYGCERGGLPLSEKILPQY 148
           +    Y    +C+PSR+ ++TG+ P   G + N+           E  GLP SEK L  +
Sbjct: 71  LCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTLADH 130

Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
           L   GY T ++GKWHLG  +  + P  RGF+   G  TG   YF  +   +      + R
Sbjct: 131 LGAAGYATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGGHHYFPTTMNHV------IER 183

Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDH 265
           + +   +   +Y TD FT E +  I  H +   D+P F++ ++ A     P+ P+ A + 
Sbjct: 184 NGKRVENFSSEYLTDFFTDEGLRFIDQHESANPDQPWFVFFSYNA-----PHTPMHATEA 238

Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
            L    +I+  KR  +AA+++ LD  VG++ E LE+     N+++VF SDNGGA      
Sbjct: 239 DLARFANIQTQKRRTYAAMMYALDRGVGRIREHLEESGQWGNTLLVFFSDNGGATN---- 294

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
           N + N PLRGVK ++ EGG+R   +IW+ P     G+  +  V   D LPT  SAA    
Sbjct: 295 NGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVRYDGVVSSLDLLPTFCSAAGAEP 353

Query: 385 I 385
           +
Sbjct: 354 L 354


>gi|29348898|ref|NP_812401.1| arylsulfatase [Bacteroides thetaiotaomicron VPI-5482]
 gi|383125065|ref|ZP_09945724.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
 gi|29340804|gb|AAO78595.1| arylsulfatase B precursor [Bacteroides thetaiotaomicron VPI-5482]
 gi|251837419|gb|EES65514.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
          Length = 458

 Score =  198 bits (504), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 122/348 (35%), Positives = 188/348 (54%), Gaps = 28/348 (8%)

Query: 39  VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           +L  +  +S   +        P+I+ +LADDLGW DVGFHG  +I TP ++ALA  G++L
Sbjct: 5   ILGASMVVSATTLSFAQQVERPNIVIVLADDLGWGDVGFHG-SEIKTPCLNALAAEGVVL 63

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR- 157
             +YT  + TP+R+ +MTG++P   G++  V+      GL  +E+ L   L   GY  R 
Sbjct: 64  DRFYTAPISTPTRAGLMTGRYPNRFGIRTTVIPPWREDGLDENEETLADMLARNGYSNRA 123

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH 217
           I+GKWHLG  +K + P  RGF    G+  G  DYFDH    M+   LD   D E  +D  
Sbjct: 124 IIGKWHLGHTRKVHYPINRGFSHFYGHLNGAIDYFDH----MREGELDWHNDWETCYD-- 177

Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF- 276
             YST++ T EAV  I+ +  + P  LY+A+ A H+     PLQA +   +I  + +DF 
Sbjct: 178 KGYSTELITQEAVRCINTYEKEGPFLLYVAYNAPHT-----PLQAQEK--DIELYCDDFG 230

Query: 277 -------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
                  KR  + A++  +D  +G +V+AL+++ ++ N+ ++F SDNG A     +  +S
Sbjct: 231 SLTPKEQKRVTYQAMVSCMDRGIGTIVDALKKKGIMDNTFLIFFSDNGPAG----VPGSS 286

Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESR-GIVAEQYVHVSDWLPTL 376
           +  LRG K   W+GGVR   + +    ES    ++ Q     D +PTL
Sbjct: 287 SGKLRGRKFDEWDGGVRVPAVFYWKRAESNYKNLSSQVTGFVDIVPTL 334


>gi|392390175|ref|YP_006426778.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
           15997]
 gi|390521253|gb|AFL96984.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
           15997]
          Length = 467

 Score =  198 bits (503), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 124/371 (33%), Positives = 206/371 (55%), Gaps = 30/371 (8%)

Query: 40  LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL- 98
           L ++  + +++     ++  P+II IL+DD G+ D   +G  +IPTPNI+ LA  G +  
Sbjct: 8   LSISLLVVLIYTTQAQTTTKPNIIVILSDDAGYADFECYGNKEIPTPNINRLAKEGTLFS 67

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL------YGCERGGLPLSEKILPQYLKEL 152
           K Y +  +C PSR+ ++TG++    G ++N        +  E  GL LSEK +   +KE 
Sbjct: 68  KAYVSASVCAPSRAGLLTGRYQQRFGFENNPTGKPREGFKKEDMGLALSEKTIGDRMKEE 127

Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS---AEEMKMWGLDMRRD 209
           GYRT  VGKWHLG   K + P  RGF+   G+  GH+D+F      AE+  +W  D    
Sbjct: 128 GYRTLAVGKWHLGNDAK-FFPLKRGFDEFYGFQEGHRDFFSFKKKRAEKYALWDND---K 183

Query: 210 LEPAWDLHGKYSTDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
           + P  ++   Y TD+FT +A+  I  N    +P F+YLA+ A H+     PLQA  + L+
Sbjct: 184 IIPEEEI--TYLTDMFTDKALKFIDENADKKQPFFIYLAYNAVHT-----PLQAKKNDLD 236

Query: 269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
               +    R  + A+L  +D ++G+V++ L+ + +  N++++F++DNGGA      N +
Sbjct: 237 KFAQVGSEGRQTYDAMLSNMDYNIGRVMQELKNKGIDDNTLVIFLNDNGGATT----NYS 292

Query: 329 SNWPLRGVKNTLWEGGVR-GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
            N  LRG+K ++WEGGVR G  + W+  + +  +V ++ V   D LPT L+AA  +    
Sbjct: 293 DNGQLRGMKGSVWEGGVRVGYIMRWNGKIPAN-VVYDKAVSSLDILPTSLAAAGNTRKDK 351

Query: 388 YVNSTVENIIP 398
           +++    N++P
Sbjct: 352 HLDGV--NLLP 360


>gi|421611065|ref|ZP_16052220.1| arylsulfatase B [Rhodopirellula baltica SH28]
 gi|408498167|gb|EKK02671.1| arylsulfatase B [Rhodopirellula baltica SH28]
          Length = 498

 Score =  198 bits (503), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 125/361 (34%), Positives = 189/361 (52%), Gaps = 30/361 (8%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  LPL F  +       A S PP+I+ ILADD+G+ D+G  G   + TPN+D LA SG+
Sbjct: 12  FVFLPLIFVFATE-ASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGV 70

Query: 97  ILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNV-------LYGCERGGLPLSEKILPQY 148
           +    Y    +C+PSR+ ++TG+ P   G + N+           E  GLP SEK L  +
Sbjct: 71  LCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTLADH 130

Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
           L   GY T ++GKWHLG  +  + P  RGF+   G  TG   YF  + + +      + R
Sbjct: 131 LGAAGYATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGSHHYFPTTMKHV------IER 183

Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDH 265
           + +       +Y TD FT E +  I  H +   D+P F++ ++ A     P+ P+ A + 
Sbjct: 184 NGKRVDGFSSEYLTDFFTDEGLRFIDQHESANPDQPWFVFFSYNA-----PHTPMHATEA 238

Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
            L    +I++ KR  +AA+++ LD  VG++ E LE+     N+++VF SDNGGA      
Sbjct: 239 DLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGATN---- 294

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
           N + N PLRGVK ++ EGG+R   +IW+ P     G+  +  +   D LPT  SAA    
Sbjct: 295 NGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVRYDGVISSLDLLPTFCSAAGAEP 353

Query: 385 I 385
           +
Sbjct: 354 L 354


>gi|417301111|ref|ZP_12088281.1| arylsulfatase B [Rhodopirellula baltica WH47]
 gi|327542540|gb|EGF29014.1| arylsulfatase B [Rhodopirellula baltica WH47]
          Length = 498

 Score =  197 bits (502), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 125/361 (34%), Positives = 189/361 (52%), Gaps = 30/361 (8%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  LPL F  +       A S PP+I+ ILADD+G+ D+G  G   + TPN+D LA SG+
Sbjct: 12  FVFLPLIFVFATE-ASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGV 70

Query: 97  ILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNV-------LYGCERGGLPLSEKILPQY 148
           +    Y    +C+PSR+ ++TG+ P   G + N+           E  GLP SEK L  +
Sbjct: 71  LCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPKSEKTLADH 130

Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
           L   GY T ++GKWHLG  +  + P  RGF+   G  TG   YF  +   +      + R
Sbjct: 131 LGAAGYATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGSHHYFPTTMNHV------IER 183

Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDH 265
           + +   +   +Y TD FT E +  I  H +   D+P F++ ++ A     P+ P+ A + 
Sbjct: 184 NGKRVENFSSEYLTDFFTDEGLRFIDQHKSANPDQPWFVFFSYNA-----PHTPMHATEA 238

Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
            L    +I++ KR  +AA+++ LD  VG++ E LE+     N+++VF SDNGGA      
Sbjct: 239 DLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGATN---- 294

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
           N + N PLRGVK ++ EGG+R   +IW+ P     G+  +  +   D LPT  SAA    
Sbjct: 295 NGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVRYDGVISSLDLLPTFCSAAGAEP 353

Query: 385 I 385
           +
Sbjct: 354 L 354


>gi|449671825|ref|XP_002165184.2| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
          Length = 160

 Score =  197 bits (502), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 91/148 (61%), Positives = 115/148 (77%), Gaps = 1/148 (0%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHII I+ADDLGWND+GFHG  +I TPNID LA +G++L NYY + +CTPSRSAIMTG++
Sbjct: 10  PHIILIVADDLGWNDIGFHGSKEISTPNIDRLATNGVVLDNYYVLPICTPSRSAIMTGRY 69

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIHTGMQ + +YG    G+ L+EK LPQYLK+ GY+T  VGKWHLGF+ KEYTPT+RGF+
Sbjct: 70  PIHTGMQQDTIYGPNPYGVSLNEKFLPQYLKQQGYKTYGVGKWHLGFFAKEYTPTYRGFD 129

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMR 207
           S+ G + G  DY++HS  E    GLD+ 
Sbjct: 130 SYYGSYLGKGDYWNHSNTE-TFTGLDLH 156


>gi|340367645|ref|XP_003382364.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 493

 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 120/349 (34%), Positives = 185/349 (53%), Gaps = 24/349 (6%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           M+     L   L   FV   A++  P+I+F+L DDLG+ DVGF     I +PN D LA +
Sbjct: 1   MSILQCLLVCVLCSCFV---ATAKLPNIVFVLIDDLGFADVGFRN-PAISSPNFDQLAKT 56

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           G++L  +Y  + C+PSR++++TG+ P H   Q N L      G  ++  +LP  LK   Y
Sbjct: 57  GLVLNRHYVFKYCSPSRASLLTGRWP-HHAHQWNPLMD-STIGTNINMTMLPAKLKAANY 114

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
            T +VGKWHLGF+   Y P  RGF++  G++   +D+ +    E     +D  ++  P  
Sbjct: 115 STHMVGKWHLGFFDPRYLPINRGFDTSTGFFGCCEDHMN----EKSGCSIDYWKNNAPD- 169

Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
             +G Y    +  +  D++ N++T+ PLFLYL     H+     P QAP  +L+I+    
Sbjct: 170 PRNGTYDAYNYRDDLTDVMSNYNTENPLFLYLPLHNVHA-----PFQAPKEWLDIYPANS 224

Query: 275 DF-KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
              KR    A++   D   G +VE L+++ +  N+I+V  +DNGGA       A SN+PL
Sbjct: 225 TCEKRHTLQAMISVADNVTGHLVELLKKKGIWDNTIMVISADNGGAPC-----AGSNYPL 279

Query: 334 RGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
           +G K T +EGGVR    +   LL    RG   + ++H++DW PT    A
Sbjct: 280 KGCKTTFFEGGVRSVAFVNGGLLPESRRGQSTDGFIHIADWYPTFCKLA 328


>gi|291225027|ref|XP_002732497.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 461

 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 119/320 (37%), Positives = 170/320 (53%), Gaps = 16/320 (5%)

Query: 70  LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNV 129
           +GWNDV +H  D I  PN+  LA  G+I    YT   CTPSR+A+MTG +P  TG QH +
Sbjct: 1   MGWNDVHWHNSD-IAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59

Query: 130 LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT-GH 188
           ++     G+PL  K+LP+ LKE+GY T +VGKWHLGF K EY PT RGF+SH G WT G 
Sbjct: 60  VFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119

Query: 189 QDYFDHSAEEMKMWGLDMRRDLE--PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYL 246
            DY   +       G D R ++   P  D    Y   +    A  I++ H    PLFL  
Sbjct: 120 SDYDKMNGVLSPSAGYDFRDNMGVVPKSD---DYLALMLGDRAEHIVNTHYPGTPLFL-- 174

Query: 247 AHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLS 306
              A     P + L+ P+ Y   +  IED +  +F   L  +D+ +G VV+AL+ R M  
Sbjct: 175 ---AFTLDIPAKHLEIPEEYEEKYAEIEDDRTRQFYGKLTMMDDIIGGVVDALKNRGMYD 231

Query: 307 NSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
           +S+I+F+ DNG  ++     + SN+P RG+  TL+EG  R   ++    ++  G  + + 
Sbjct: 232 DSLIIFIGDNGALSS----QSGSNYPFRGIAGTLFEGATRVPSIVSGKGIKKTGYESNEL 287

Query: 367 VHVSDWLPTLLSAANKSDIP 386
             + D   T+L  A     P
Sbjct: 288 YSIVDIHRTILDVAGAEAEP 307



 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 21/123 (17%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTKGKWKLVKVVKVMRYQVDLTGG 631
           E+DG+ +W   S+ +PS RN IL+NIDD+     +A+  G +KL+            TG 
Sbjct: 308 ELDGLSMWPTWSKGKPSPRNEILYNIDDDPVSPGAAIRIGDYKLI------------TGH 355

Query: 632 PDQVYLS---GLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKN 688
           PD +Y      L+D  +          A+ +     +     PQ    LF+I  DP E+N
Sbjct: 356 PDLLYPYRDISLTDNYY----NYGDAPASGLPSTDQQSHAPAPQNVEYLFNIVVDPEERN 411

Query: 689 NLA 691
           +++
Sbjct: 412 DIS 414



 Score = 39.7 bits (91), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 30/45 (66%), Gaps = 2/45 (4%)

Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDE--WQISALTRGKWKLV 551
           E+DG+ +W   S+ +PS RN IL+NIDD+     +A+  G +KL+
Sbjct: 308 ELDGLSMWPTWSKGKPSPRNEILYNIDDDPVSPGAAIRIGDYKLI 352


>gi|326426859|gb|EGD72429.1| hypothetical protein PTSG_00448 [Salpingoeca sp. ATCC 50818]
          Length = 540

 Score =  197 bits (500), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 123/348 (35%), Positives = 186/348 (53%), Gaps = 34/348 (9%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHIIF L DDLGWN   FH  ++I TP +  L  +G+ L ++YT   C+P+R++ +TG+ 
Sbjct: 30  PHIIFFLTDDLGWNAPSFHN-NEIITPTLHHLHANGVELYSHYTYMFCSPTRASFLTGRF 88

Query: 120 PI-HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           P  H     N+L      GL LS   L   LK+  Y T  +GKWHLG YKKEYTP +RGF
Sbjct: 89  PYKHEMTNTNLLPPTRMLGLDLSYTTLADKLKQANYSTHHIGKWHLGMYKKEYTPRYRGF 148

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           ++  G+ TG ++++   A  +    +D+  +  PA+ ++G ++  +FT  A+DII N++ 
Sbjct: 149 DTTFGFLTGGENHYTQRA-FVSPPAVDLWDEEAPAYGMNGTWTGKMFTDAALDIIRNNAQ 207

Query: 239 -------DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK--RSKFAAILHKLD 289
                    PLF+Y A    H+     P Q+P     ++    D    +S F  ++  +D
Sbjct: 208 LRNATGDAPPLFIYFALHDVHA-----PTQSPVRLAQLYNFTGDDAGLKSTFYGMISGVD 262

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG-FN--------------LNAASNWPLR 334
           E+V  V +AL+   M +++++V+ SDNG    G FN                  SN+PLR
Sbjct: 263 EAVRNVTDALKAAGMWNDTLLVWTSDNGSPTMGSFNGGCKNVVPDSRGGACMCGSNYPLR 322

Query: 335 GVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAA 380
           G K+T +EGGVR    +  P+L +  RG   +  +HV DW  TL S A
Sbjct: 323 GSKHTNFEGGVRTPTFVSGPVLPAHMRGKRLDGLMHVCDWYSTLCSVA 370


>gi|449137658|ref|ZP_21772978.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
 gi|448883711|gb|EMB14224.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
          Length = 810

 Score =  196 bits (499), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 130/350 (37%), Positives = 184/350 (52%), Gaps = 26/350 (7%)

Query: 44  FTLSMVFVDLVASSGP------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
           F  SM+   LV ++        P+I+ I+ADDLG++DVGF+G  +IPTP +D LA  G++
Sbjct: 7   FLSSMLIGLLVLAASSVDAKERPNIVLIVADDLGYSDVGFNGCKEIPTPRLDELAGEGVV 66

Query: 98  LKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNV-----LYGCERGGLPLSEKILPQYLKE 151
             N Y     C+PSR+ ++TG++    G + N       +G +  G+PLSE  L   LKE
Sbjct: 67  FTNGYASHPYCSPSRAGLLTGRYQQRFGHEGNPEPDPQWHGDDTPGMPLSETTLADALKE 126

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
            GY T  +GKWHLG   K + P  RGF+   G+  G   Y+        + G+   R  E
Sbjct: 127 AGYVTGAIGKWHLG-DAKPFWPNRRGFDEWFGFSGGGLSYWGDLGRMDPLLGV--HRGDE 183

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
           P       Y TD F+ EAV  I  H TD P FLYLA+ A     P+ P QA   +L    
Sbjct: 184 PVDRKSLTYLTDDFSTEAVKFIQRHETD-PFFLYLAYNA-----PHAPDQATRAHLEKTA 237

Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
           HIE   R+ + A++  +DE +G+VV+ + +  +  N++IVF SDNGG         A N 
Sbjct: 238 HIEYGGRAVYGAMVAGMDEGIGRVVDQIRESGLDDNTLIVFYSDNGG-----RREHAMNL 292

Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           P RG K  L+EGG+R   L+  P     G+  +  V   D  PT+L+AA 
Sbjct: 293 PYRGHKGMLFEGGIRVPFLVSWPGTLPGGVREQSPVTALDVFPTVLAAAG 342


>gi|119626708|gb|EAX06303.1| hCG20717, isoform CRA_a [Homo sapiens]
          Length = 427

 Score =  196 bits (499), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 99/225 (44%), Positives = 138/225 (61%), Gaps = 11/225 (4%)

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL 216
           +VGKWHLGFY+KE  PT RGF++  G   G  DY+ H   +   M G D+  +   AWD 
Sbjct: 1   MVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDY 60

Query: 217 -HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
            +G YST ++T     I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +
Sbjct: 61  DNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIIN 115

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
             R ++AA+L  LDE++  V  AL+     +NSII++ SDNGG          SNWPLRG
Sbjct: 116 INRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRG 171

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            K T WEGG+R  G + SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 172 SKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA 216


>gi|254515652|ref|ZP_05127712.1| arylsulfatase B [gamma proteobacterium NOR5-3]
 gi|219675374|gb|EED31740.1| arylsulfatase B [gamma proteobacterium NOR5-3]
          Length = 507

 Score =  196 bits (498), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 141/399 (35%), Positives = 202/399 (50%), Gaps = 63/399 (15%)

Query: 25  LKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGP---PHIIFILADDLGWNDVGFHGLD 81
           +K  G R  ++  +V  LA   +       AS G    P+++ ILADD+GWNDVG+HG D
Sbjct: 3   IKIPGSRIALIFLSVTVLAAAQTGAQTRSSASPGAIVRPNVVIILADDMGWNDVGYHGSD 62

Query: 82  QIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLS 141
            I TP+ID LA  G+ L  +Y    C+P+R+A+++G+     G+ ++ L      GL L 
Sbjct: 63  -IHTPHIDQLAAEGLELDRFYAQTACSPTRAALLSGQSSQSLGI-YSPLSKLNPTGLALD 120

Query: 142 EKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKM 201
           +KI+P Y ++ GY+T +VGKWHLGFY+ EY P  RGF+   G  TG   Y++H    +  
Sbjct: 121 QKIMPAYFRDAGYQTFMVGKWHLGFYEPEYRPLARGFDHFYGNLTGGVGYWNH----VHG 176

Query: 202 WGLDMRRDLEPAWDLHGK------YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSAN 255
            GLD        W  +GK      YST + +AE   +I     ++PLFLY A  A     
Sbjct: 177 GGLD--------WQRNGKTLRQEGYSTHLQSAEITRLIQQRDPEKPLFLYAAFNA----- 223

Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
           P+ P +AP   L  + HIE+  R   AA++ +LD ++G+++E L    ML N++I F+SD
Sbjct: 224 PHLPNEAPADTLARYAHIENPNRRIHAAMVTELDSAIGQLMETLSTEGMLENTLIWFMSD 283

Query: 316 NGG---AAAGFNLNAAS-----------------------------NWPLRGVKNTLWEG 343
           NGG    A    L + S                             N P R  K +++EG
Sbjct: 284 NGGLNRTAMPSGLVSMSQRLEDWFGKPLFPKTLEFIRTNALDGGSDNSPHRKGKQSIYEG 343

Query: 344 GVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           G R    + W   L    I   Q V V D LPTLLSA +
Sbjct: 344 GARVPSFVYWKGRLSPERIT--QMVTVKDVLPTLLSATD 380


>gi|421614608|ref|ZP_16055661.1| arylsulfatase B [Rhodopirellula baltica SH28]
 gi|408494617|gb|EKJ99222.1| arylsulfatase B [Rhodopirellula baltica SH28]
          Length = 472

 Score =  196 bits (498), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 129/375 (34%), Positives = 189/375 (50%), Gaps = 47/375 (12%)

Query: 37  FAVLP-LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
           FA++   AF  + V  +    S PPHI+ I+ADDLGWNDVGFHG  +I TPNID LA   
Sbjct: 5   FAIMTCFAFVGANVSAETNGGSRPPHIVHIIADDLGWNDVGFHG-SEIRTPNIDRLASES 63

Query: 96  IILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYR 155
           + L  +Y   +C+P+R+ ++TG +P   G+   V+   ++ GLP   +  P++L +LGY 
Sbjct: 64  VTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVVSPTKKHGLPPQLETTPEHLSKLGYD 123

Query: 156 TR-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
            R + GKWHLG     + P   G     G++ G  DYF       +   LD  RD +   
Sbjct: 124 HRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAIDYFSRE----RFGQLDWHRDFD--- 176

Query: 215 DLHGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA----------- 262
            +H + YST++     VD I  ++   P++ Y+A  A HS     PLQA           
Sbjct: 177 SVHEEGYSTELVGNAVVDFIDRNANAGPVYAYVAFNAPHS-----PLQALRSDLDEYGFD 231

Query: 263 PDHYL--NIHRHIE------DFK--------RSKFAAILHKLDESVGKVVEALEQRRMLS 306
           P   L  N  R I       D+         R  FAA+   +D  +G++++A+++  M  
Sbjct: 232 PTGKLAPNTDRKIAKREKALDYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRE 291

Query: 307 NSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
           N+++VF SDNG        +  SN PLRG K T WEGGVR   ++  P     GI  +  
Sbjct: 292 NTLVVFHSDNGADPK----HGGSNEPLRGNKFTTWEGGVRVVAMMRWPRELPAGIAYDSV 347

Query: 367 VHVSDWLPTLLSAAN 381
               D LPT+++AA 
Sbjct: 348 TSYVDLLPTMVAAAG 362


>gi|323451693|gb|EGB07569.1| hypothetical protein AURANDRAFT_2707, partial [Aureococcus
           anophagefferens]
          Length = 351

 Score =  195 bits (496), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 129/358 (36%), Positives = 180/358 (50%), Gaps = 51/358 (14%)

Query: 59  PPHIIFILADDLGWNDVGFH----GLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           PP++I  LADDLG NDVG+     G  +I +P IDALA   ++L  +Y   +CTP+R+A+
Sbjct: 10  PPNLIVFLADDLGRNDVGYAHRDGGPGRIASPRIDALAAESLVLDRFYAQPMCTPTRAAL 69

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           MTG+H   TG+ + VL   +  GLP +E  + + L++ GY T ++GKWHLGF KK   PT
Sbjct: 70  MTGRHAYRTGLAYFVLLANQGTGLPAAEVTVAERLRDAGYSTHMIGKWHLGFAKKAMLPT 129

Query: 175 FRGFESHLGYWTGHQDYFDHSAEEM--KMWGLDMRRDLEPA---------WD-------- 215
            RGF+   GY  G  DY+ H + E    +   D     EP          WD        
Sbjct: 130 SRGFDRFFGYCLGSSDYWLHQSPEWVPGVPSTDRATGAEPPTTGGMGHDLWDGATPLPRT 189

Query: 216 --LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI-HRH 272
                 +S D+F A A +       D+PLFLY A  A H+           H++    RH
Sbjct: 190 PKTENVHSADLFAARATETFATAPRDKPLFLYYASQAPHA-----------HFVGAPPRH 238

Query: 273 IEDF---------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
           +             R+  AA++  LDE VG VV+ +   R   N+++ F+ DNG      
Sbjct: 239 VAAAAAMGFAAGDPRADVAALVAALDEQVGNVVDGVRASRGWENTVLWFMGDNGPEPG-- 296

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
               AS+WPLRG K TL+EGGVR    +++P   + G  +   VHV D  PTLL  A 
Sbjct: 297 --TGASSWPLRGAKRTLFEGGVREPSFVYAPGRVAPG-ASSAPVHVIDVAPTLLGLAG 351


>gi|319954018|ref|YP_004165285.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319422678|gb|ADV49787.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 434

 Score =  195 bits (496), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 122/321 (38%), Positives = 173/321 (53%), Gaps = 20/321 (6%)

Query: 64  FILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIH 122
            IL DD GW DVGF+G   IPTPN+D LA  G+I  N Y     C+PSR+ ++TG++   
Sbjct: 1   MILTDDQGWADVGFNGATDIPTPNLDRLASEGVIFSNGYVSHPYCSPSRAGLLTGRYQAR 60

Query: 123 TGMQHNVLYGCERG---GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            G   N+ Y  +     G PLSEK++ + LKE GYRT  +GKWHLG +   Y P  +GF+
Sbjct: 61  FGHDCNMPYDGKNDASVGTPLSEKMISEALKEQGYRTSAIGKWHLGDHPDLYPPA-QGFD 119

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
              G+  G  +Y+  S  E++   +   R + P  +L   Y TD FT EA+  I     +
Sbjct: 120 HWFGFPGGGMNYWGESKNEIQT--IYRNRKVVPEEEL--TYLTDDFTTEAIRFI-TQKDE 174

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           +P F+YLA+ A H+     P QA   YL   +HIE   RS +AA+++ +D +VGK+   L
Sbjct: 175 KPFFMYLAYNAPHA-----PDQATKEYLEKTKHIEYAGRSVYAAMVNAVDANVGKIDSTL 229

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
               +  N+I+VF+SDNGG      +  A N P RG K  L+EGG++    I  P     
Sbjct: 230 IANGLKENTILVFLSDNGG-----RIEHADNRPYRGHKGMLFEGGIKVPFFITWPTKIKE 284

Query: 360 GIVAEQYVHVSDWLPTLLSAA 380
                + +   D  PT L+AA
Sbjct: 285 KYAYNEPISSLDLFPTFLNAA 305


>gi|86141258|ref|ZP_01059804.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
 gi|85831817|gb|EAQ50272.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
          Length = 461

 Score =  195 bits (495), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 118/354 (33%), Positives = 187/354 (52%), Gaps = 41/354 (11%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+ + I+ADD GWND  FHG  +I TPN+D LA  G+ L  +YT   C+P+R++++TG+ 
Sbjct: 39  PNFLVIIADDAGWNDFSFHG-SEIQTPNLDQLAGKGLTLDRFYTYPTCSPARASLLTGRP 97

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
               G+   +  G     LP S   LPQ L +L Y+T ++GKWHLG  K E  P   GF+
Sbjct: 98  ASRMGIVAPI-SGRSELNLPDSITTLPQALSKLNYKTALMGKWHLGL-KPESGPEVYGFD 155

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY------STDVFTAEAVDII 233
              G+  G  D + H+            ++ +  W  +GK+       TD+ T  AV  I
Sbjct: 156 FSYGFLHGQLDQYAHT-----------YKNGDSTWYRNGKFISEKGHVTDLLTQSAVHYI 204

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
               TD+  +L +A++A     P+ PLQ P  +L  +  I+D  R  +AA +  +D  +G
Sbjct: 205 DTLQTDQNFYLQVAYSA-----PHIPLQEPQEWLEKYTGIKDSSRRAYAAAMTHMDAGIG 259

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGG-----------AAAGFNLNAASNWPLRGVKNTLWE 342
           ++++ L+ + +  N++++FVSDNG               G N +  SN PLR  K + +E
Sbjct: 260 EILQKLKDKDLEKNTVVLFVSDNGAQEKWVPNTQYDGKYGPNYSLGSNLPLRDFKTSNYE 319

Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENI 396
           G +R   +I  P   + G  +  Y++V+DW+PT L+ AN  ++P    STVE +
Sbjct: 320 GALRVPAIISWPENLNSG-TSTNYINVTDWMPTFLNWANAEELP----STVEGV 368


>gi|149197521|ref|ZP_01874572.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139539|gb|EDM27941.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 465

 Score =  195 bits (495), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 135/375 (36%), Positives = 201/375 (53%), Gaps = 29/375 (7%)

Query: 40  LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHG-LDQIPTPNIDALAYSGIIL 98
           L L F L  V   L AS+  P+II ILADDLG+ DV +HG L +  TP+ID++A SG   
Sbjct: 4   LALQF-LCFVLASLSASAAKPNIIVILADDLGYGDVSYHGTLKETTTPHIDSIAQSGAWF 62

Query: 99  KNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNV----LYGCERGGLPLSEKILPQYLKELG 153
           +N Y+   +C PSR+ +++G++    G   N+    L      GLPLS+K++P+ L + G
Sbjct: 63  QNGYSAAPVCGPSRAGLLSGRYQQRFGYYDNIGPFTLNKDVEAGLPLSQKLIPEILVKEG 122

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF-----DHSAEEMKMWGLDMRR 208
           Y T +VGKWH G   K + P  RGF+   G+  G  + +     +H+ +E   WG   R 
Sbjct: 123 YATGMVGKWHDGDQHK-FWPYNRGFQEFYGFNNGAINNWVLKGENHTVDE---WGAVHRE 178

Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
           +     +  G+Y T+ F  EAV+ I  H T EP FLYL+  A H      PLQAP  Y N
Sbjct: 179 NKRV--ENSGEYMTEAFGREAVEFIDRHKT-EPFFLYLSFNAVHG-----PLQAPKSYTN 230

Query: 269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
             +HI+   R+   A+L  +D+++G V+E L +  +  N+II F SDNGG   G   N +
Sbjct: 231 QFKHIKPENRALCLAMLKSMDDNIGLVLEKLRKEGLEENTIIFFTSDNGGKLKG---NYS 287

Query: 329 SNWPLRGVKNTLWEGGVRGA-GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
            N   RG KNT+++GG+     + W   + ++    E  VH  D   T+ +AA   +I +
Sbjct: 288 FNGKYRGEKNTVFDGGLHVPYAVQWKAQIPAQTKALEAPVHSIDLAHTIFAAAG-VEIKD 346

Query: 388 YVNSTVENIIPRYEN 402
                  N++P  +N
Sbjct: 347 EYKLDGRNLLPYLKN 361


>gi|449138178|ref|ZP_21773473.1| arylsulfatase B [Rhodopirellula europaea 6C]
 gi|448883202|gb|EMB13740.1| arylsulfatase B [Rhodopirellula europaea 6C]
          Length = 489

 Score =  194 bits (494), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 127/352 (36%), Positives = 181/352 (51%), Gaps = 46/352 (13%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHI+ I+ADDLGWNDVGFHG D I TPNID LA   ++L  +Y   +C+P+R+ ++TG 
Sbjct: 45  PPHIVHIIADDLGWNDVGFHGSD-IRTPNIDRLARESVVLDRFYVTPICSPTRAGVLTGL 103

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR-IVGKWHLGFYKKEYTPTFRG 177
           +P   G+   V+   ++ GLP   +  P++L +LGY  R + GKWHLG     + P   G
Sbjct: 104 YPFRFGIWGGVVSPTKKHGLPSELETTPEHLAKLGYDHRAMFGKWHLGLASTLFHPLRHG 163

Query: 178 FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTDVFTAEAVDIIHNH 236
                G++ G  DYF       +   LD  R+ +    +H + YST++     VD I  H
Sbjct: 164 MTEFYGHYNGAIDYFSRE----RFGQLDWHRNHD---SVHEEGYSTELVGNAVVDFIDRH 216

Query: 237 STDEPLFLYLAHAATHSANPYEPLQA-----------PDHYL--NIHRHI-------EDF 276
           +  +PL+ Y+A  A HS     PLQA           P   L  N  R I       E  
Sbjct: 217 AGQQPLYAYVAFNAPHS-----PLQALRSDLDDYGFDPTGKLAPNTDRKIAKRENALEYG 271

Query: 277 KRSK-------FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
           KR K       FAA+   +D  +G++++A+++  M  N+++VF SDNG        +  S
Sbjct: 272 KRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGADPK----HGGS 327

Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           N PLRG K T WEGGVR    +  P     G+  E      D LPT+++AA 
Sbjct: 328 NEPLRGNKFTTWEGGVRVVATMRWPREFPAGVTYESVASYVDLLPTMVAAAG 379


>gi|325109725|ref|YP_004270793.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969993|gb|ADY60771.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 471

 Score =  194 bits (494), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 126/354 (35%), Positives = 189/354 (53%), Gaps = 25/354 (7%)

Query: 30  YRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNID 89
           +R  ++   VL + F+L        A++G P++I I++DD GW+DVGF+G  +IPTP++D
Sbjct: 3   FRLVLVCLCVLGVLFSLLPE-----ANAGQPNVIVIVSDDQGWSDVGFNGCKEIPTPHLD 57

Query: 90  ALAYSGIILK-NYYTVQLCTPSRSAIMTGKHPIHTGMQ-----HNVLYGCERGGLPLSEK 143
           ALA SG+     Y +   C+PSR+ ++TG++    G +     H      E  GLPLSE 
Sbjct: 58  ALAKSGVAFDCGYASHPYCSPSRAGLLTGRYQQRFGHECNPGAHGNDDAIEMEGLPLSET 117

Query: 144 ILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWG 203
           +L    +  GYRT  +GKWHLG  + ++ PT RGFE   G+  G   Y+    ++  + G
Sbjct: 118 LLSTVFRNAGYRTGAIGKWHLG-DEPQFWPTERGFEEWFGFSGGGLSYWGDLGKKPPLHG 176

Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
           +    D+ P  +L   Y TD F+ EAV  +  +    P FLYLA+ A H+     P Q  
Sbjct: 177 VLRNGDVVPKDEL--TYLTDDFSTEAVKFVEENRA-RPFFLYLAYNAPHA-----PDQVT 228

Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
             +L    HIE   R+ + A++  +DE +G+VV+AL+ + +   ++I F SDNGG A   
Sbjct: 229 REHLKKTSHIEYGGRAVYGAMVAAMDEGIGRVVQALDDQNLRDETVIFFYSDNGGRA--- 285

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
               A N+P RG K  L+EGG+R    +  P     G   E  V   D  PT+L
Sbjct: 286 --TLARNFPYRGHKGMLFEGGIRVPFCVSWPEQIEGGRRYEPPVTALDIFPTVL 337


>gi|440713713|ref|ZP_20894310.1| arylsulfatase B [Rhodopirellula baltica SWK14]
 gi|436441429|gb|ELP34656.1| arylsulfatase B [Rhodopirellula baltica SWK14]
          Length = 472

 Score =  194 bits (494), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 130/378 (34%), Positives = 190/378 (50%), Gaps = 49/378 (12%)

Query: 35  MAFA---VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           MAF    V   AF  + V  +    S PPHI+ I+ADDLGWNDVGFHG  +I TPNID L
Sbjct: 1   MAFPFAIVACFAFVGANVSAEANGGSQPPHIVHIIADDLGWNDVGFHG-SEIRTPNIDRL 59

Query: 92  AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
           A   + L  +Y   +C+P+R+ ++TG +P   G    V+   ++ GLP   +  P++L +
Sbjct: 60  ANESVTLDRFYVTPICSPTRAGVLTGLYPFRFGFWGGVVSPTKKHGLPPQLETTPEHLSK 119

Query: 152 LGYRTR-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
           LGY  R + GKWHLG     + P   G     G++ G  DYF       +   LD  R+ 
Sbjct: 120 LGYDHRAMFGKWHLGLASTLFHPLQHGMTEFYGHYNGAIDYFSRE----RFGQLDWHRNF 175

Query: 211 EPAWDLHGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA------- 262
           +    +H + YST++     VD I  ++   P++ Y+A  A HS     PLQA       
Sbjct: 176 D---SVHEEGYSTELVGNAVVDFIDRNANAGPVYAYVAFNAPHS-----PLQALRSDLDE 227

Query: 263 ----PDHYL--NIHRHIE------DFK--------RSKFAAILHKLDESVGKVVEALEQR 302
               P++ L  N  R I       D+         R  FAA+   +D  +G++++A+++ 
Sbjct: 228 YGFDPNNKLAPNTDRKIAKREKALDYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRN 287

Query: 303 RMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIV 362
            M  N+++VF SDNG        +  SN PLRG K T WEGGVR   ++  P     GI 
Sbjct: 288 GMRENTLVVFHSDNGADPK----HGGSNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGIT 343

Query: 363 AEQYVHVSDWLPTLLSAA 380
            +      D LPT+++AA
Sbjct: 344 YDSVTSYVDLLPTMVAAA 361


>gi|345510992|ref|ZP_08790548.1| arylsulfatase B [Bacteroides sp. D1]
 gi|229442597|gb|EEO48388.1| arylsulfatase B [Bacteroides sp. D1]
          Length = 498

 Score =  194 bits (493), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 122/357 (34%), Positives = 192/357 (53%), Gaps = 28/357 (7%)

Query: 30  YRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNID 89
           Y++ IM      L    +++   L A +  P+I+ +LADDLGW DVGFHG  +I TP++D
Sbjct: 36  YKSYIMKEEFFGLLCGCTLLPAFLHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPSLD 94

Query: 90  ALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
           AL   G+ L+ +YT  + TP+R+ +MTG++P   G++  V+      GL  +E+ +   L
Sbjct: 95  ALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADML 154

Query: 150 KELGYRTR-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
              GY+ R I+GKWHLG  KK + P  RGF    G+  G  DYFD + E      LD   
Sbjct: 155 ARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHN 210

Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
           D E   D    YST++ T EA+  I  +  + P  LY+A+ A H+     PLQA +   +
Sbjct: 211 DWETCHD--KGYSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--D 261

Query: 269 IHRHIEDF--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
           I  + ++F        K++ ++A++  +D  +G +V+AL+++ ++ N+  +F SDNG A 
Sbjct: 262 IKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG 321

Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
               +  +S+ PLRG K   W+GG    A L W    +    ++ Q     D +PTL
Sbjct: 322 ----VPGSSSGPLRGHKFDEWDGGGHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 374


>gi|449138580|ref|ZP_21773837.1| arylsulfatase B [Rhodopirellula europaea 6C]
 gi|448882842|gb|EMB13399.1| arylsulfatase B [Rhodopirellula europaea 6C]
          Length = 498

 Score =  194 bits (492), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 128/364 (35%), Positives = 189/364 (51%), Gaps = 33/364 (9%)

Query: 38  AVLPLAFTLSMVFVDLVAS----SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           AV  L F    VF    AS    S PP+I+ ILADD+G+ D+G  G   + TPN+D LA 
Sbjct: 8   AVSRLVFLSLTVFFATKASRAAESAPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAE 67

Query: 94  SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKIL 145
           SG++    Y    +C+PSR+ ++TG+ P   G + N+    ER        GLP+SEK L
Sbjct: 68  SGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDERYATRPELLGLPVSEKTL 127

Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
             +L   GY T ++GKWHLG  +  + P  RGF+   G  TG   YF  +   +      
Sbjct: 128 GDHLGAAGYATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGSHHYFPTTMNHV------ 180

Query: 206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST---DEPLFLYLAHAATHSANPYEPLQA 262
           + R+ +   D   +Y TD FT E +  I  H     D+P F++ ++ A     P+ P+ A
Sbjct: 181 IERNGQRVEDFSNEYLTDFFTDEGLRFIDQHEAAKPDQPWFVFYSYNA-----PHTPMHA 235

Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
            +  L    +I++ KR  +AA++  LD  VG++ + LE+     N+++VF SDNGGA   
Sbjct: 236 TEADLARFANIQNKKRRTYAAMMFALDRGVGRIRKHLEETGQWKNTLLVFFSDNGGATN- 294

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWS-PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
              N + N PLRGVK ++ EGG+R   +IW+ P     G   +  +   D LPT  SAA 
Sbjct: 295 ---NGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGKRYDGVISSLDLLPTFCSAAG 350

Query: 382 KSDI 385
              +
Sbjct: 351 AQPL 354


>gi|340377481|ref|XP_003387258.1| PREDICTED: arylsulfatase I-like [Amphimedon queenslandica]
          Length = 507

 Score =  194 bits (492), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 121/362 (33%), Positives = 191/362 (52%), Gaps = 32/362 (8%)

Query: 42  LAFTLSMV-FVDLVAS---SGPPHIIFILADDLGWNDVGFH---GLDQIPTPNIDALAYS 94
            A TL+++  + LVA    +  PHI+ +L DD GW +VG+H      ++ TPNID L   
Sbjct: 2   FAVTLALLSMIGLVAGQPVTLKPHIVMMLVDDWGWANVGYHRNPPTREVVTPNIDDLVKQ 61

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQH---------NVLYGCERGGLPLSEKIL 145
           G+ L  +Y  + C+PSRS++++G+ PIH   Q+         + + G     +P +   +
Sbjct: 62  GLELNQHYAYRCCSPSRSSLISGRLPIHVSDQNIAPTNYNPNDPISGFS--AIPRNMTGI 119

Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGL- 204
            + +KE GY T  VGKW  G    ++TP  RGF++  GY+  + DY+    +     G+ 
Sbjct: 120 AEKMKEAGYATHQVGKWDAGMATPDHTPKGRGFDTSFGYFHHYNDYYTEVVDSCNGTGVV 179

Query: 205 DMRRDLEPAWDLHG----KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPL 260
           D+    +PA  ++G    KY   +F    +DI+  H    PLFLY A    H+     PL
Sbjct: 180 DLWNTDQPAHGINGTGPDKYEEALFRERLLDIVSKHDPSTPLFLYYAPHIVHT-----PL 234

Query: 261 QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
           Q PD YLN    I+D  R  + A+++ LD+ VG +V AL+++ M  N + V  SDNGG  
Sbjct: 235 QVPDEYLNKFSFIDDKDRMYYHAMVNYLDDVVGDLVSALKKKGMWDNLLYVTSSDNGGPV 294

Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLS 378
             +    A+N+PL+G K + W+GG+R    +    L  + RG   + Y+H++DW  T  +
Sbjct: 295 --YPFGGANNYPLKGGKLSDWQGGIRVNAFVSGGHLPEKMRGQKTDGYIHLADWYGTFCA 352

Query: 379 AA 380
            A
Sbjct: 353 IA 354


>gi|340367643|ref|XP_003382363.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 493

 Score =  194 bits (492), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 115/331 (34%), Positives = 177/331 (53%), Gaps = 21/331 (6%)

Query: 53  LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
             A++  P+I+F+L DD G+ DVGF     I +PN D LA +G++L  +Y  + C+PSR+
Sbjct: 16  FAATAKVPNIVFVLVDDWGFADVGFRN-PAISSPNFDQLAKTGLVLNRHYVFKYCSPSRA 74

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
           + +TG+ P H   Q N L      G  L+  +LP  LK   Y T +VGKWHLGF+   Y 
Sbjct: 75  SFLTGRWP-HHAHQWNPLMD-NMIGTNLNMTMLPAKLKAANYATHMVGKWHLGFFDPRYL 132

Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           P  RGF++  G+  G +D+ +    E     +D  ++  P    +G Y    +  +  D+
Sbjct: 133 PINRGFDTSTGFLGGGEDHMN----EKSGCSIDYWKNNAPD-PRNGTYDAYNYRDDLTDV 187

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF-KRSKFAAILHKLDES 291
           ++NH+ D PLF YL     H+     PLQAP  +L+I+       KR    A++   D  
Sbjct: 188 MNNHNADNPLFFYLPLHNVHT-----PLQAPKEWLDIYPANSTCEKRHILQAMVSVADNV 242

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
            G +VE L+++ M  N+I+V  +DNGGA       + SN+PL+G K+T +EGG+R    +
Sbjct: 243 TGHLVELLKKKGMWDNTIMVISADNGGAPC-----SGSNYPLKGCKSTFFEGGIRSLAFV 297

Query: 352 WSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
              LL    RG   + ++H++DW  T    A
Sbjct: 298 NGGLLPESRRGQSTDGFIHIADWYTTFCKLA 328


>gi|374619563|ref|ZP_09692097.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374302790|gb|EHQ56974.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 539

 Score =  194 bits (492), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 124/374 (33%), Positives = 188/374 (50%), Gaps = 41/374 (10%)

Query: 32  TRIMAFAVLPLAF-TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
            RI AF +  L F T S VF     +   P+II ++ADDLGW DVGFHG   I TP++D 
Sbjct: 2   VRIGAFCLSVLVFLTSSSVF-----AGDKPNIIVMVADDLGWADVGFHGNQIIETPSLDR 56

Query: 91  LAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           +A  G  L  +YT  +C+P+R+A+MTG+ PI  G+ ++ +      G+   E  LP+   
Sbjct: 57  IAAEGTQLNRFYTTPICSPTRAALMTGRDPIRLGVAYSTIMPWHNNGIHPEETFLPELFA 116

Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
             GY+T +VGKWHLG  ++ Y P  RGFE   G+      +F   A    + G D +R+ 
Sbjct: 117 GAGYQTAMVGKWHLGHAQQTYHPNARGFEHFYGHLHTEVGFFPPFA---SLGGKDFQRN- 172

Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
             + D  G Y + +   E    I      +P F+Y+   A H+     PL AP+     +
Sbjct: 173 GVSIDDQG-YESYLLADEVSRYIRERDAAKPFFIYMPFIAPHT-----PLDAPEDLKAKY 226

Query: 271 RHIEDFK-----------------------RSKFAAILHKLDESVGKVVEALEQRRMLSN 307
             +ED +                       R  +AA++  +D+S+G+V++ L+   +  N
Sbjct: 227 ADMEDDRGKSRSKMADNTRFMAKVTRRESARPMYAAVVDGMDQSIGRVLDTLDAEGLTDN 286

Query: 308 SIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYV 367
           +I++F SDNGGA   + +  A N PLRG K   +EGG+R    +  P   + G   +  +
Sbjct: 287 TIVLFFSDNGGAV--YAIGGADNAPLRGGKGDTFEGGIRVVATMRWPEKIAPGGKVDSIM 344

Query: 368 HVSDWLPTLLSAAN 381
            V D  PTLL+AA 
Sbjct: 345 SVMDVFPTLLAAAG 358


>gi|431899642|gb|ELK07596.1| Arylsulfatase J [Pteropus alecto]
          Length = 637

 Score =  194 bits (492), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 110/241 (45%), Positives = 141/241 (58%), Gaps = 11/241 (4%)

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+V+   +   LPL    LPQ L+E GY T +VGKWHLGFY+ E  PT RGF+S
Sbjct: 112 IHTGLQHSVIRPTQPNCLPLDAATLPQKLREAGYATHMVGKWHLGFYRTECLPTRRGFDS 171

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHST 238
             G   G  DY+ H   +   + G D+  +   AW    G YST ++T     I+ +H  
Sbjct: 172 FFGSLLGSGDYYTHYKCDSPGVCGYDLHENDRAAWGRGGGAYSTQMYTQRVQRILASHDP 231

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
            +PLFLY+A+ A HS     PLQAP  YL  +R I + +R ++AA+L  LDE+V  V  A
Sbjct: 232 GKPLFLYIAYQAVHS-----PLQAPGRYLGHYRSIANARRRRYAAMLSCLDEAVANVTLA 286

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L+      NS+IV+ SDNGG  A       SNWPLRG K T WEGGVR  G + SPLL  
Sbjct: 287 LKAHGFYDNSVIVYSSDNGGQPAA----GGSNWPLRGRKGTYWEGGVRAVGFVHSPLLRQ 342

Query: 359 R 359
           R
Sbjct: 343 R 343


>gi|340367647|ref|XP_003382365.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 490

 Score =  193 bits (491), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 115/331 (34%), Positives = 176/331 (53%), Gaps = 21/331 (6%)

Query: 53  LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
            VA++  P+I+F+L DD G+ DVGF     I +PN D LA +G++L  +Y  + C PSR+
Sbjct: 17  FVATAKLPNIVFVLVDDWGFADVGFRN-PAISSPNFDQLAKTGLVLNRHYVFKYCAPSRA 75

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
           +++TG+ P H   Q N+       G  L+  + P  LK   Y T +VGKWH GF+   Y 
Sbjct: 76  SLLTGRWPHHV-YQWNLATD-ATAGTNLNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYL 133

Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           P  RGF++  GY  G  D+F+    + +   +D  ++  P    +G Y +  +  +  +I
Sbjct: 134 PINRGFDTSSGYLCGWVDHFN----QKQGCAVDCWKNNAPD-PRNGTYDSYYYRDDLTNI 188

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK-RSKFAAILHKLDES 291
           ++NH  + PLF+YL     H+     PL+AP  +L+I+      K R    A++   D  
Sbjct: 189 VNNHDANNPLFIYLPLHNVHT-----PLEAPKEWLDIYPANSTCKTRHTLQAMVSVADNV 243

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
            G +VE L+++ M  N+I+V  +DNGG         ASN+PLRG K T +EGGVR    +
Sbjct: 244 TGHLVELLKEKEMWDNTIMVISADNGGDQC-----VASNYPLRGCKYTFFEGGVRSLAFV 298

Query: 352 WSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
              LL    RG   + ++H+SDW  T    A
Sbjct: 299 NGGLLPESRRGQSTDGFIHISDWYTTFCKLA 329


>gi|298710054|emb|CBJ31771.1| Formylglycine-dependent sulfatase, C-terminal fragment
           Formylglycine-dependent sulfatase, N-terminal
           [Ectocarpus siliculosus]
          Length = 588

 Score =  193 bits (490), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 130/391 (33%), Positives = 195/391 (49%), Gaps = 79/391 (20%)

Query: 66  LADDLGWNDVGFHGLD-QIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTG 124
           + DD+GW+D+G+  +D +  TP +D LA  G+ + NYYT+  CTP+R+++MTG++ +  G
Sbjct: 1   MIDDMGWDDIGYQSVDLKGVTPVLDKLAAGGVKITNYYTMNTCTPARASLMTGRYTVRYG 60

Query: 125 MQHNVLYG-CERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG 183
           MQ+NV     E  G+PLSEK+LP+Y KE GY T +VGKWHLG +  E+ P+ RGF++++G
Sbjct: 61  MQYNVAINPGEPWGVPLSEKMLPEYFKEAGYGTHLVGKWHLGSHSPEHIPSQRGFDTYMG 120

Query: 184 YWTGHQDYFDHSA----------------------------EEMKMWGLDMRRDLEPA-- 213
           Y  G + Y+ H                                  ++G D R D + A  
Sbjct: 121 YVGGFEAYWTHETVGVISDGRHVCDFGFGNASGYYDIIDRPSSQDIYGSDHREDQKEAEE 180

Query: 214 ----WDLHGKYST-------------------------DVFTAEAVDIIHNHS--TDEPL 242
                D  G +ST                          +F   A++I+   S   +EP+
Sbjct: 181 NRDDGDGEGVHSTTPTKTPWASSFRSNSGVMKGQQYSTQMFQERAIEILETKSPFDEEPI 240

Query: 243 FLYLAHAATHSANPYEPLQ--APDHYL---NIHRHIEDFK----RSKFAAILHKLDESVG 293
           +++LAH A HS     P    + D  L    I   I D +    R     +L  LD+SVG
Sbjct: 241 YMHLAHQAVHSPLGLPPADSFSEDETLLLDEIEGKIGDVQGAQTRRHLTEVLMFLDKSVG 300

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
            ++  LE    + NSI+V  SDNGG +        SN+PLRG K + WEGG +    ++S
Sbjct: 301 HLIHYLEDEGWMENSIVVVASDNGGDSN----EGGSNYPLRGKKASYWEGGSKVPAFVYS 356

Query: 354 P--LLES-RGIVAEQYVHVSDWLPTLLSAAN 381
              + E+ RG   +  +HV+DWLPTL + A 
Sbjct: 357 TSHIPEARRGSEYDGLMHVTDWLPTLAAGAG 387


>gi|32471439|ref|NP_864432.1| arylsulfatase B [precursor] [Rhodopirellula baltica SH 1]
 gi|32443280|emb|CAD72111.1| Arylsulfatase B [Precursor] [Rhodopirellula baltica SH 1]
          Length = 579

 Score =  193 bits (490), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 129/409 (31%), Positives = 199/409 (48%), Gaps = 47/409 (11%)

Query: 4   NVLYGCERGGLPLSEKILPQYLKELGYRTRIMAFAVLP-LAFTLSMVFVDLVASSGPPHI 62
            + +G    G+ + ++I   +            FA++   A   + V  +    S PPHI
Sbjct: 79  GICFGVVGDGIGMVDRIRSVFSHCACDLKMAFPFAIMTCFALVGANVSAETNGGSQPPHI 138

Query: 63  IFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIH 122
           + I+ADDLGWNDVGFHG  +I TPNID LA   + L  +Y   +C+P+R+ ++TG +P  
Sbjct: 139 VHIIADDLGWNDVGFHG-SEIRTPNIDRLASESVTLDRFYVTPICSPTRAGVLTGLYPFR 197

Query: 123 TGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR-IVGKWHLGFYKKEYTPTFRGFESH 181
            G+   V+   ++ GLP   +  P++L +LGY  R + GKWHLG     + P   G    
Sbjct: 198 FGIWGGVVSPSKKHGLPPQLETAPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEF 257

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTDVFTAEAVDIIHNHSTDE 240
            G++ G  DYF       +   LD  RD +    +H + YST++     VD I  ++   
Sbjct: 258 YGHYNGAIDYFSRE----RFGQLDWHRDFD---SVHEEGYSTELVGNAVVDFIDRNANAG 310

Query: 241 PLFLYLAHAATHSANPYEPLQA-----------PDHYL--NIHRHIE------DFK---- 277
           P++ Y+A  A HS     PLQA           P++ L  N  R I       D+     
Sbjct: 311 PVYAYVAFNAPHS-----PLQALRSDLDEYGFDPNNKLAPNTDRKIAKREKALDYGKRGK 365

Query: 278 ----RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
               R  FAA+   +D  +G++++A+++  M  N+++VF SDNG        +  +N PL
Sbjct: 366 GNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGADPK----HGGNNEPL 421

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
           RG K T WEGGVR   ++  P     GI  +      D LP+++ AA  
Sbjct: 422 RGNKFTTWEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPSMVGAAGS 470


>gi|291225025|ref|XP_002732499.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 307

 Score =  193 bits (490), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 118/320 (36%), Positives = 168/320 (52%), Gaps = 16/320 (5%)

Query: 70  LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNV 129
           +GWNDV +H  D I  PN+  LA  G+I    YT   CTPSR+A+MTG +P  TG QH +
Sbjct: 1   MGWNDVHWHNPD-IAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59

Query: 130 LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT-GH 188
            +     G+PL  K+LP+ LKE+GY T +VGKWHLGF K EY PT RGF+SH G WT G 
Sbjct: 60  AFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119

Query: 189 QDYFDHSAEEMKMWGLDMRRDLE--PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYL 246
            DY           G D R ++   P  D    Y   +    A  I++ H    PLFL  
Sbjct: 120 GDYDKMDGVLSPSAGYDFRDNMGVVPKSD---DYLALMLGDRAEHIVNTHYPGTPLFL-- 174

Query: 247 AHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLS 306
              A     P + L+ P+ Y   +  IED +  +F   L  +D+ +G VV+A++ R M  
Sbjct: 175 ---AFTLDIPAKHLEIPEEYEEKYAEIEDDRTRQFYGKLTMMDDIIGGVVDAMKNRGMYD 231

Query: 307 NSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
           +S+I+F+ DN G ++     + SN+P RG+  TL+EG  R   ++    ++  G  + + 
Sbjct: 232 DSLIIFIGDNEGLSS----QSGSNYPFRGIAGTLFEGATRVPSIVSGKGIKKTGYESNEL 287

Query: 367 VHVSDWLPTLLSAANKSDIP 386
             + D   T+L  A     P
Sbjct: 288 YSIVDIHRTILDVAGAKAEP 307


>gi|241634070|ref|XP_002410502.1| arylsulfatase J, putative [Ixodes scapularis]
 gi|215503435|gb|EEC12929.1| arylsulfatase J, putative [Ixodes scapularis]
          Length = 480

 Score =  192 bits (489), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 122/320 (38%), Positives = 170/320 (53%), Gaps = 11/320 (3%)

Query: 71  GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
           GW DV FHG  QIPTPNID LA  G+IL NYY + LCTPSR+A+MTG +PIHTGM   V+
Sbjct: 5   GWGDVSFHGSTQIPTPNIDVLAGDGVILDNYYALPLCTPSRAALMTGLYPIHTGMHAGVI 64

Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL-GFYKKEYTPTFRGFE-SHLGYWTGH 188
                 GL L  KI+PQ+ ++LGY   ++GK H  G +  + T +  G +  H    +  
Sbjct: 65  QDAAPWGLTLETKIMPQHFEDLGYEVNMIGKSHHDGCHNFDSTKS--GIDLLHTPLISSI 122

Query: 189 QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAH 248
               D++    K +     R +     L  +    V       ++  HS  +P F YL+H
Sbjct: 123 PGQHDNTWMYGKTYNFYRTRAIC---RLQKRVIQGVVVYLIRFLLDLHS--QPFFCYLSH 177

Query: 249 AATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES-VGKVVEALEQRRMLSN 307
            A HSA   +P QAP   L    +I +  R+ +A    K     +G  + +L + +   +
Sbjct: 178 QAVHSALMKDPFQAPARNLLKFPYIGETNRTVYAGETKKTHLPWLGLALVSLVKTKNYVD 237

Query: 308 SIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYV 367
            +++  SDNGG+  G   N   NWPLRG K  LWEGGVR    +WS     +  V++Q +
Sbjct: 238 ILLMGFSDNGGSPFGLFSNRGFNWPLRGAKANLWEGGVRVPAFVWSSKFLKKPRVSDQLM 297

Query: 368 HVSDWLPTLLSAANKSDIPN 387
           H+SDWLPTL SAA   D+ N
Sbjct: 298 HISDWLPTLYSAAG-GDVSN 316


>gi|325106503|ref|YP_004276157.1| sulfatase [Pedobacter saltans DSM 12145]
 gi|324975351|gb|ADY54335.1| sulfatase [Pedobacter saltans DSM 12145]
          Length = 470

 Score =  192 bits (489), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 123/371 (33%), Positives = 196/371 (52%), Gaps = 21/371 (5%)

Query: 43  AFTLSMVFVDLVASSGP--PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
           AF L +  V +VA +    P++I I++DD G+ D G +G  +IPTPNIDALA +G +  +
Sbjct: 8   AFLLCISCVTVVAQAQQKKPNVIVIVSDDAGYADFGCYGGKEIPTPNIDALAKNGTLFTD 67

Query: 101 -YYTVQLCTPSRSAIMTGKHPIHTGMQHNVL------YGCERGGLPLSEKILPQYLKELG 153
            Y +  +C PSR+ I+TG +    G +HN+       Y     G+    K +   +K  G
Sbjct: 68  AYVSASVCAPSRAGILTGMYQQRFGFEHNISELPVKPYTLNDVGMDPKIKTIGDQMKHNG 127

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           YRT  +GKWH G    +Y P  RGF+   G+  GH+ +F +   +     L +  + +  
Sbjct: 128 YRTIAIGKWHQGDL-PQYFPLKRGFDEFYGFVGGHRSFFGYPGGKAPSHELALFDNDKIV 186

Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
            +    Y TD+FT +A+  +  + + +P F+YLA+ A H      P+ A    ++   +I
Sbjct: 187 PENTIGYLTDMFTDKAISFVKENKS-KPFFMYLAYNAVHV-----PMNAKKELMDRFPNI 240

Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
            D  R  +AA++  LD+ VG++V+ L+++ +  N++IVFV+DNG A      N A N  L
Sbjct: 241 TDPGRKAYAAMMVSLDDGVGELVKTLKEQNLYDNTLIVFVNDNGAATG----NYADNGKL 296

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTV 393
           RG+K + WEGG+R A ++  P     G V  + V   D +PT + A     I   V    
Sbjct: 297 RGLKGSKWEGGIRVAYIMQYPDQIPAGKVYSKMVSGLDIMPTAVGAGKGELISGQVTDG- 355

Query: 394 ENIIPRYENSI 404
           +N+IP   N+I
Sbjct: 356 KNLIPYLNNTI 366


>gi|372209242|ref|ZP_09497044.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 479

 Score =  192 bits (488), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 130/356 (36%), Positives = 185/356 (51%), Gaps = 25/356 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+I+ IL DDLG+NDVGF+G   I TPN+D LA +G+I+   Y     C PSR++IMTGK
Sbjct: 28  PNIVIILCDDLGYNDVGFNGSKDIKTPNLDKLADNGMIMTAGYVAHPFCGPSRTSIMTGK 87

Query: 119 HPIHTGMQHNVLYGCERGG--LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +    G Q N+    E  G  +PL+ K + + L+E GY T   GKWHLG     + P  R
Sbjct: 88  YAHTMGAQFNIPSESEGTGYGIPLNNKFISKELQEAGYYTGAFGKWHLG-ADTPFHPNKR 146

Query: 177 GFESHLGYWTGHQDY----FDHSAEEMKMWGLDMRRDLEPAWDLHG------KYSTDVFT 226
           GF+   G+  G  DY    +    E +K  G    RD     + +G      +Y TD  +
Sbjct: 147 GFDEFYGFLGGGHDYIPEQYKPKYEFLKQRGSKNIRDYIKPLEHNGTEVDEKEYITDGLS 206

Query: 227 AEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
            EAV+ ++  S   +P F+YLA+ A     P+ PLQA    + + + I+D KR  +AA++
Sbjct: 207 REAVNFVYKASEKKQPFFMYLAYNA-----PHVPLQAKKEDMAVFKSIKDEKRRTYAAMV 261

Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
           + +D  VGK+VEAL+  + L N+++VF SDNGG         A+N+PL   K + +EGG 
Sbjct: 262 YAVDRGVGKLVEALKANQQLENTLLVFFSDNGGKLG----KGANNFPLTEGKGSAYEGGH 317

Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYE 401
           R       P     G      V   D  PT  S A K+   N  N    NI+P  +
Sbjct: 318 RVPMFFHWPNRIKAGQKFHNPVSALDLYPTFASLA-KTKPSNAKNLAGTNILPHLQ 372


>gi|374620849|ref|ZP_09693383.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374304076|gb|EHQ58260.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 551

 Score =  192 bits (488), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 122/384 (31%), Positives = 197/384 (51%), Gaps = 42/384 (10%)

Query: 29  GYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNI 88
           G  T ++A   +   FT ++   D V     P++I ++ADDLGWNDVG+HG   I TP++
Sbjct: 9   GLTTTLIALVFVQTVFTQAVYAEDGVR----PNVIVMVADDLGWNDVGYHG-GNIDTPSL 63

Query: 89  DALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQY 148
           D LA  G+ L  +YT  +C+P+R+A+MTG+ P+  G+ + V+   +  G+  +E  +PQ 
Sbjct: 64  DKLAEQGVQLNRFYTTPICSPTRAALMTGRDPMRLGIAYGVILPWDNIGVNPAEHFMPQS 123

Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
            +  GY+T +VGKWHLG  +  Y P  RGFE   G+      ++   A    + G D + 
Sbjct: 124 FQAAGYQTAMVGKWHLGHAQMTYHPNQRGFEHFYGHLHTEVGFYPPFA---NVGGKDFQE 180

Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
           +   + D  G Y T +   E    I +   ++P F+Y+   A H+     PL AP     
Sbjct: 181 N-GVSIDDEG-YETYLLADEVSRYIRDRDEEKPFFIYMPFIAPHT-----PLDAPQELQE 233

Query: 269 IHRHIED-----------------------FKRSKFAAILHKLDESVGKVVEALEQRRML 305
            ++ IE                          R  +AA++  +D+++G+V++ L++  + 
Sbjct: 234 KYKDIETDLAPARSNQTDSTRRMAKLMMQPSARPMYAAVVDAMDQAIGQVLDTLDEEGLA 293

Query: 306 SNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAE 364
            N+I++F SDNGGAA  ++   A N PLRG K   +EGG+R   L+ W  +LE      +
Sbjct: 294 DNTIVLFFSDNGGAA--YSYGGADNAPLRGGKGETFEGGIRVVSLMRWPGVLEPAQSF-D 350

Query: 365 QYVHVSDWLPTLLSAANKSDIPNY 388
           Q + V D  PTL  A   + + N+
Sbjct: 351 QIMTVMDVFPTLAEATGVTPLNNF 374


>gi|167519809|ref|XP_001744244.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777330|gb|EDQ90947.1| predicted protein [Monosiga brevicollis MX1]
          Length = 328

 Score =  192 bits (487), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 124/337 (36%), Positives = 172/337 (51%), Gaps = 27/337 (8%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS-GIILKNYYTVQLCTPSRSAIMTGKH 119
           H + I+ADDLG+ D+G+   D I TPNID LA   G+IL N Y  + C+PSR++ +TG+ 
Sbjct: 1   HHLQIMADDLGYYDLGYRNPDSI-TPNIDQLATQEGVILDNAYGYRYCSPSRASFLTGRV 59

Query: 120 PIHT--GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
           PIH   G       GC      L+  ++P  L+  GYRT +VGKWH G    E  P  RG
Sbjct: 60  PIHVHQGNPGLAAAGCTN----LNYTMIPAQLRRAGYRTAMVGKWHQGASLPECLPVNRG 115

Query: 178 FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           F++  GY +G +D+ D +    +    D   D  PA   +G YS+  +    VDII  H+
Sbjct: 116 FDTSFGYLSGEEDHMDQTTNGGQCNVTDFWLDSGPAIRRNGTYSSFQYNDAIVDIIQQHA 175

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYL-----------NIHRHIEDFKRSKFAAILH 286
            ++PL LY A    H   PYE    P+ Y                      R    A+++
Sbjct: 176 PEQPLMLYAALQNVHG--PYE---VPERYRAKFPADKNCTETGDNPTSQCTRDFMLAMVN 230

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA-SNWPLRGVKNTLWEGGV 345
            LD+ VG +  AL+   M +N+++VF SDNGGA  G   + + +N PLRG K   +EGGV
Sbjct: 231 VLDDIVGNITRALKANGMWNNTVLVFSSDNGGAVPGAPQHGSMNNHPLRGGKVAYFEGGV 290

Query: 346 RGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
           R A  + SPLL    RG      +H+SDW  T    A
Sbjct: 291 RTAAFVASPLLPKSIRGTELRGLIHISDWYATFCHLA 327


>gi|340367653|ref|XP_003382368.1| PREDICTED: arylsulfatase J-like [Amphimedon queenslandica]
          Length = 504

 Score =  191 bits (486), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 117/343 (34%), Positives = 182/343 (53%), Gaps = 33/343 (9%)

Query: 53  LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY---------- 102
            VA++  P+I+F+L DD G+ DVGF     I +PN D LA +G++L  +Y          
Sbjct: 17  FVATAKLPNIVFVLVDDWGFADVGFRN-PAISSPNFDQLAKTGLVLNRHYLAKTGLVLNR 75

Query: 103 --TVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
               + C+PSR++ +TG+ P H   Q N L   +  G  L+  ++P  LK   Y T +VG
Sbjct: 76  HYVFKYCSPSRASFLTGRWP-HHAHQWN-LPQSDMAGTNLNMTMIPAKLKAAKYSTHMVG 133

Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
           KWH GF+   Y P  RGF++  G+  G +D+    AE      +D  ++  P    +G Y
Sbjct: 134 KWHQGFFDPRYLPINRGFDTSSGFLGGGEDHMTQEAE----CAIDYWKNDAPD-PRNGTY 188

Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK-RS 279
               +  +  DI++NH+ ++PLF+YL     H+     PLQAPD +LNI+      + R 
Sbjct: 189 DAYNYGDDLTDIMNNHNVNDPLFIYLPLHNVHA-----PLQAPDEWLNIYSVNSTCENRR 243

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
            + A++   D   G +V+ L+++ M  N+II+  +DNGGAA G      SN+PL+G K +
Sbjct: 244 TYQAMVSVADNVTGYLVQLLKKKGMWDNTIIIISADNGGAACG-----GSNYPLKGCKGS 298

Query: 340 LWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
            +EGGVR    +   LL    RG   + ++H++DW  T    A
Sbjct: 299 FFEGGVRALAFVNGGLLPESRRGESTDGFIHIADWYTTFCKLA 341


>gi|345317091|ref|XP_001516952.2| PREDICTED: arylsulfatase I-like, partial [Ornithorhynchus anatinus]
          Length = 264

 Score =  191 bits (486), Expect = 1e-45,   Method: Composition-based stats.
 Identities = 103/243 (42%), Positives = 145/243 (59%), Gaps = 12/243 (4%)

Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
            ++ IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFYKKE  PT R
Sbjct: 5   ARYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRR 64

Query: 177 GFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
           GF++ LG  TG+ DY+ + + +   + G D+      AW+  GKYST ++   A  I+ +
Sbjct: 65  GFDTFLGSLTGNVDYYTYDNCDGPGVCGYDLHEGENVAWEQSGKYSTLLYAQRAGKILAS 124

Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
           H    PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +
Sbjct: 125 HDPRRPLFLYVAFQAVHT-----PLQSPREYLYRYRAMGNVARRKYAAMVTCMDEAVRNI 179

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPL 355
             AL++     NS+IVF +DNGG       +  SNWPLRG K T WEGG   A L  SP 
Sbjct: 180 TRALKKYGYYDNSVIVFSTDNGGQ----TFSGGSNWPLRGRKGTYWEGGTPPATL--SPS 233

Query: 356 LES 358
           L +
Sbjct: 234 LTA 236


>gi|149197772|ref|ZP_01874821.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149138993|gb|EDM27397.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 441

 Score =  191 bits (486), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 123/361 (34%), Positives = 192/361 (53%), Gaps = 47/361 (13%)

Query: 40  LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           + + F L  +    + ++ PP+II ILADD G +D   +G  Q+ TP+ID++A++GI   
Sbjct: 1   MKILFCLFSLLCTSLLANEPPNIIIILADDAGSSDFSCYGSKQLLTPHIDSIAHNGIKFT 60

Query: 100 NYYTV-QLCTPSRSAIMTGKHPIHTGMQHNVLYG------CERGGLPLSEKILPQYLKEL 152
             YT   +C+PSR+ ++TG++    G   N+ +        E  GLP++E  L   LKEL
Sbjct: 61  QAYTASSVCSPSRAGLLTGRYQQTFGHLANIPHSKHSANDPELLGLPVTEITLADSLKEL 120

Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL-- 210
           GY T  +GKWHLG     + P  RGF++  G+ +G + YF          G ++R D+  
Sbjct: 121 GYSTHCIGKWHLG-EADHFHPNARGFDNFYGFLSGARTYF---------LGGELRGDMDR 170

Query: 211 --------EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA 262
                   EP+      Y+T+VFT EA+ II     D+P F+YL+H A H      P+ A
Sbjct: 171 IMRNKEFAEPS----SGYTTEVFTQEAIRIIQEEQ-DKPFFIYLSHNAVHG-----PMDA 220

Query: 263 PDHYLNIHRHIEDFK---RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
            D  +  +    DFK   R K++ ++  LD+  G +++AL+  +   N++I F+SDNGG 
Sbjct: 221 KDEDIMSY----DFKNPLRKKYSGLMKNLDDQTGLLLQALKDSKQYENTLIFFMSDNGGP 276

Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
                 N +SNWPLRG K + +EGG R   L+  P   S G+ +++ +   D   T + A
Sbjct: 277 TTH---NGSSNWPLRGFKGSEFEGGNRTPFLLQWPEKISAGLSSDKPIIAYDVFATCIQA 333

Query: 380 A 380
           A
Sbjct: 334 A 334


>gi|430741545|ref|YP_007200674.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430013265|gb|AGA24979.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 474

 Score =  191 bits (485), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 128/352 (36%), Positives = 183/352 (51%), Gaps = 26/352 (7%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSA 113
           A +G P+++ ++ADDLG+ D+GF G   IPTP++DALA  G+     Y +   C+P+R+ 
Sbjct: 32  APAGRPNVVLLVADDLGYGDLGFQGARDIPTPHLDALAQGGVRCTSGYVSGPYCSPTRAG 91

Query: 114 IMTGKHPIHTGMQHN-----VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
           ++TG++    G + N          +  GLP++E  L   LK  GY T +VGKWHLG   
Sbjct: 92  LLTGRYQQRFGHEFNPGGGGGAAAAKNVGLPVTETTLADRLKAAGYATGLVGKWHLGSEA 151

Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
           K + P  RGF+   G+  G   YF   +        D+ R  E   +    Y TD F+ E
Sbjct: 152 K-FHPQKRGFDEFFGFLGGQHTYFASKSG-------DVYRGTEVVKEE--AYLTDAFSRE 201

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
           A+  I  H  D P FL L+  A H+     P+ A +  +     IED KR  +AA+L  L
Sbjct: 202 ALSFIDRHK-DHPFFLQLSFNAVHT-----PMDATEDRVARFSSIEDPKRRTYAAMLSAL 255

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           DE+ G V+E L    +   ++I+F SDNGG   AG   N + N PLRG K T  EGGVR 
Sbjct: 256 DEAAGNVIERLRAAGLEEKTLIIFFSDNGGPTMAGTTTNGSKNDPLRGSKRTTLEGGVRV 315

Query: 348 AGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
             ++ W   L + G V +Q +   D LPT L+AA     P +    V N++P
Sbjct: 316 PFVLSWKGTLPA-GKVYDQPIIQLDILPTALAAAGVDAEPEWKLDGV-NLLP 365


>gi|340619110|ref|YP_004737563.1| sulfatase [Zobellia galactanivorans]
 gi|339733907|emb|CAZ97284.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
          Length = 511

 Score =  191 bits (484), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 136/400 (34%), Positives = 202/400 (50%), Gaps = 37/400 (9%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F +   A  LS+  + L A    P+I+ +L DDLG+ DVGF+G   I TP +D LA +G 
Sbjct: 26  FKLFLYAAVLSVANIGL-AQETRPNILVVLCDDLGYADVGFNGSTDILTPELDNLAQNGS 84

Query: 97  ILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGC-----ERGGLPLSEKILPQYLK 150
           I  + Y     C PSRSAI+TG++P  TG  +N+ +       +  G+P+ E  + + L+
Sbjct: 85  IFTSAYVAHPFCGPSRSAILTGRYPHLTGTAYNLFHNSSEDDKDNMGVPVEETYMSKVLQ 144

Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF----DHSAEEMKMWGLDM 206
             GY T  +GKWHLG   K + P  RGF+   G+  G  DYF      + +  K  G   
Sbjct: 145 NAGYYTSAIGKWHLGAAPK-FHPNKRGFDDFYGFLGGGHDYFPSEYQKTYKAQKKAGNPN 203

Query: 207 RRDLEPAWDLHGK------YSTDVFTAEAV-DIIHNHSTDEPLFLYLAHAATHSANPYEP 259
            RD     + +GK      Y TD F+ EA+ +I    +  +P F+YLA+ A     P+ P
Sbjct: 204 IRDYVFPMEHNGKPANETEYITDGFSREAIKNIKIAAAKKQPFFIYLAYNA-----PHVP 258

Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           LQA    +    HI+D  R  +AA+++ +D  VGK+V+ L++ +   N++IVF+SDNGG 
Sbjct: 259 LQAKAEDVAKFAHIKDKDRRTYAAMVYAVDRGVGKIVQTLKETKQFDNTLIVFLSDNGG- 317

Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL--- 376
              FN + A+N+PL+G K   WEGG R       P    +    +  V   D  PT    
Sbjct: 318 --NFN-HGANNYPLKGTKGDTWEGGYRVPMFFHWPKKIKKDQRFDFPVSSLDLYPTFTGL 374

Query: 377 ----LSAANKSDIPNYVNSTVENIIPRYENSI--LRYENG 410
               L    + D  N ++  ++N  P  +  I  LRY  G
Sbjct: 375 AEAKLPKGKQLDGKNIMDDVLKNTEPYKDEMIYSLRYREG 414


>gi|291225031|ref|XP_002732500.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 286

 Score =  191 bits (484), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 110/275 (40%), Positives = 154/275 (56%), Gaps = 12/275 (4%)

Query: 70  LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNV 129
           +GWNDV +H  D I  PN+  LA  G+I    YT   CTPSR+A+MTG +P  TG QH +
Sbjct: 1   MGWNDVHWHNPD-IAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59

Query: 130 LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT-GH 188
           ++     G+PL+ K+LP+ LKE+GY T +VGKWHLGF K EY PT RGF+SH G WT G 
Sbjct: 60  VFNLHPSGVPLNFKLLPEKLKEVGYATHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119

Query: 189 QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAH 248
            DY   +       G D R ++       G Y   +    A  I++ H    PLFL    
Sbjct: 120 GDYDKLNGVLSPSAGYDFRDNMGVVPKSDG-YLALMLGDRAEHIVNTHYPGTPLFLTFTL 178

Query: 249 AATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNS 308
                  P + L+ P+ Y   +  IED +  +F   L  +D+ +G VV+AL+ R M  +S
Sbjct: 179 DI-----PAKHLEIPEEYEEAYADIEDDRTRQFYGKLTMMDDIIGGVVDALKNRGMYDDS 233

Query: 309 IIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
           +I+F+ DNG  ++     + SN+P RG+  TL+EG
Sbjct: 234 LIIFIGDNGALSS----QSGSNYPFRGIAGTLFEG 264


>gi|323451422|gb|EGB07299.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 512

 Score =  191 bits (484), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 117/338 (34%), Positives = 179/338 (52%), Gaps = 27/338 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PH+IF+LADD GWN+ GF+  D + TP+ DAL  +G+ L N+Y  + C+P+RS+ ++G+ 
Sbjct: 21  PHLIFLLADDEGWNN-GFYNEDVL-TPHTDALVAAGVKLTNHYVYKFCSPTRSSFLSGRL 78

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           PIH   Q N        G+PL    + + ++  GYRT  VGKWH G     +TP  RGF+
Sbjct: 79  PIHVN-QENSATEQAGAGVPLGMSTIGETMRAAGYRTAHVGKWHCGMATPAHTPYGRGFD 137

Query: 180 SHLGYWTGHQDYF---------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           S +G++   +D++         +      +  G+D+ RD  PA   +G Y    FT  AV
Sbjct: 138 SSIGFFNFGEDHYTQVRGGQALEEGGASGQCSGVDLWRDGAPARGENGTYGGYTFTRAAV 197

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
             +  + T +PLFL+ A    H      PL+ P  Y  + R+  +  R+    +   LD+
Sbjct: 198 AAVEANDT-KPLFLFAAFQNLH-----PPLEVPPEY--VARYANETLRTTINGMAAFLDD 249

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
           SVG V +AL+++ M  +S+IV+  DNG    G+      + P RG K + ++GGVR A  
Sbjct: 250 SVGNVTKALKKQNMYDSSLIVYTPDNG----GYLTQGGDDAPFRGGKFSDFQGGVRVAAF 305

Query: 351 IWSPLLES--RGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           I   L+ +  RG  +   VHV DW+ T  +A     IP
Sbjct: 306 ISGGLVPAGLRGTASAALVHVCDWMAT-FAALGGGAIP 342


>gi|383110963|ref|ZP_09931781.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
 gi|313694533|gb|EFS31368.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
          Length = 458

 Score =  190 bits (483), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 124/350 (35%), Positives = 187/350 (53%), Gaps = 30/350 (8%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  L    TL   F  L A +  P+I+ +LADDLGW DVGFHG  +I TP +DAL   G+
Sbjct: 5   FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
            L+ +YT  + TP+R+ +MTG++P   G++  V+      GL  +E+ +   L   GY+ 
Sbjct: 62  ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121

Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           R I+GKWHLG  KK + P  RGF    G+  G  DYFD + E      LD   D E   D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
               YST++ T EA+  I  +  + P  LY+A+ A H+     PLQA +   +I  + ++
Sbjct: 178 --KGYSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYCDN 228

Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
           F        K+  ++A++  +D  +G +V+AL+++ ++ N+  +F SDNG A     +  
Sbjct: 229 FDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGSAG----VPG 284

Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           +S+ PLRG K   W+GGV   A L W    +    ++ Q     D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLCWKKAEKQYKNLSSQVTGFVDLVPTL 334


>gi|423290535|ref|ZP_17269384.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
           CL02T12C04]
 gi|392665922|gb|EIY59445.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
           CL02T12C04]
          Length = 458

 Score =  190 bits (483), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 124/350 (35%), Positives = 187/350 (53%), Gaps = 30/350 (8%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  L    TL   F  L A +  P+I+ +LADDLGW DVGFHG  +I TP +DAL   G+
Sbjct: 5   FFGLLCGCTLLPAF--LYAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
            L+ +YT  + TP+R+ +MTG++P   G++  V+      GL  +E+ +   L   GY+ 
Sbjct: 62  ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121

Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           R I+GKWHLG  KK + P  RGF    G+  G  DYFD + E      LD   D E   D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
               YST++ T EA+  I  +  + P  LY+A+ A H+     PLQA +   +I  + ++
Sbjct: 178 --KGYSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228

Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
           F        K+  ++A++  +D  +G +V+AL+++ ++ N+  +F SDNG A     +  
Sbjct: 229 FDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284

Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           +S+ PLRG K   W+GGV   A L W    +    ++ Q     D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334


>gi|414070343|ref|ZP_11406329.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410807260|gb|EKS13240.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 469

 Score =  190 bits (482), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 116/308 (37%), Positives = 170/308 (55%), Gaps = 18/308 (5%)

Query: 46  LSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ 105
           +S+   ++ AS   P+I+ ILADDLG+ D+GF G  +I TPNIDALA +G   KN Y   
Sbjct: 3   MSINSFNVNASEQKPNIVVILADDLGYGDLGFTGSKEIKTPNIDALASNGTRFKNAYVTH 62

Query: 106 -LCTPSRSAIMTGKHPIHTGMQHNVLYGCERG--GLPLSEKILPQYLKELGYRTRIVGKW 162
             C PSR  ++TG++    GM++NV Y  +    GLPLSE      L+++GY T + GKW
Sbjct: 63  PYCGPSRVGLLTGRYQARLGMENNVSYMPQDKYMGLPLSENTFANRLQDVGYHTSVFGKW 122

Query: 163 HLGFYKKEYTPTFRGFESHLGYWTGHQDYF-DHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           HLG     + P  RGF+   G+  G  +Y  D        + L + R+ +       +Y 
Sbjct: 123 HLGG-APHFQPNKRGFDYFYGFLDGGHNYMPDQVTVGGDGYSLPLMRNTQVT--EFDEYL 179

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
           T   + +AV  IH    + P F+YL++ A H+     PLQAP  Y+  ++HIED  R  +
Sbjct: 180 TTALSRDAVKYIHRQQ-ESPFFMYLSYNAPHT-----PLQAPAEYIEKYKHIEDEDRRVY 233

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW----PLRGVK 337
           AA++  +DE +G VV+AL+    L N++I+F+SDNGG     +    S+W    P R  K
Sbjct: 234 AAMIDVMDEGIGMVVDALKDINQLDNTLIIFLSDNGGVFPE-SWQPTSDWADNSPFRRGK 292

Query: 338 NTLWEGGV 345
             L EGG+
Sbjct: 293 AALLEGGI 300


>gi|336415441|ref|ZP_08595781.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
           3_8_47FAA]
 gi|335941037|gb|EGN02899.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
           3_8_47FAA]
          Length = 458

 Score =  190 bits (482), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 124/350 (35%), Positives = 188/350 (53%), Gaps = 30/350 (8%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  L    TL   F  L A +  P+I+ +LADDLGW DVGFHG  +I TP +DAL   G+
Sbjct: 5   FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
            L+ +YT  + TP+R+ +MTG++P   G++  V+      GL  +E+ +   L   GY+ 
Sbjct: 62  ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121

Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           R I+GKWHLG  KK + P  RGF    G+  G  DYFD + E      LD   D E   D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
               YST++ T EA+  I  +  + P  LY+A+ A H+     PLQA +   +I  + ++
Sbjct: 178 --KGYSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228

Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
           F        K++ ++A++  +D  +G +V+AL+++ ++ N+  +F SDNG A     +  
Sbjct: 229 FDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284

Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           +S+ PLRG K   W+GGV   A L W    +    ++ Q     D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334


>gi|160885330|ref|ZP_02066333.1| hypothetical protein BACOVA_03329 [Bacteroides ovatus ATCC 8483]
 gi|156108952|gb|EDO10697.1| arylsulfatase [Bacteroides ovatus ATCC 8483]
          Length = 458

 Score =  190 bits (482), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 123/348 (35%), Positives = 187/348 (53%), Gaps = 26/348 (7%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  L    TL   F  L A +  P+I+ +LADDLGW DVGFHG  +I TP +DAL   G+
Sbjct: 5   FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
            L+ +YT  + TP+R+ +MTG++P   G++  V+      GL  +E+ +   L   GY+ 
Sbjct: 62  ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121

Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           R I+GKWHLG  KK + P  RGF    G+  G  DYFD + E      LD   D E   D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
               YST++ T EA+  I  +  + P  LY+A+ A H+     PLQA +  + ++ +  D
Sbjct: 178 --KGYSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIKLYTNNFD 230

Query: 276 F------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
                  K++ ++A++  +D  +G +V+AL+++ ++ N+  +F SDNG A     +  +S
Sbjct: 231 SLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPGSS 286

Query: 330 NWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           + PLRG K   W+GGV   A L W    +    ++ Q     D +PTL
Sbjct: 287 SGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334


>gi|299147176|ref|ZP_07040243.1| arylsulfatase B [Bacteroides sp. 3_1_23]
 gi|298515061|gb|EFI38943.1| arylsulfatase B [Bacteroides sp. 3_1_23]
          Length = 458

 Score =  190 bits (482), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 124/350 (35%), Positives = 188/350 (53%), Gaps = 30/350 (8%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  L    TL   F  L A +  P+I+ +LADDLGW DVGFHG  +I TP +DAL   G+
Sbjct: 5   FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
            L+ +YT  + TP+R+ +MTG++P   G++  V+      GL  +E+ +   L   GY+ 
Sbjct: 62  ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121

Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           R I+GKWHLG  KK + P  RGF    G+  G  DYFD + E      LD   D E   D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
               YST++ T EA+  I  +  + P  LY+A+ A H+     PLQA +   +I  + ++
Sbjct: 178 --KGYSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228

Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
           F        K++ ++A++  +D  +G +V+AL+++ ++ N+  +F SDNG A     +  
Sbjct: 229 FDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284

Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           +S+ PLRG K   W+GGV   A L W    +    ++ Q     D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334


>gi|423214938|ref|ZP_17201466.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692201|gb|EIY85439.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 458

 Score =  190 bits (482), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 124/350 (35%), Positives = 188/350 (53%), Gaps = 30/350 (8%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  L    TL   F  L A +  P+I+ +LADDLGW DVGFHG  +I TP +DAL   G+
Sbjct: 5   FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
            L+ +YT  + TP+R+ +MTG++P   G++  V+      GL  +E+ +   L   GY+ 
Sbjct: 62  ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121

Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           R I+GKWHLG  KK + P  RGF    G+  G  DYFD + E      LD   D E   D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
               YST++ T EA+  I  +  + P  LY+A+ A H+     PLQA +   +I  + ++
Sbjct: 178 --KGYSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228

Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
           F        K++ ++A++  +D  +G +V+AL+++ ++ N+  +F SDNG A     +  
Sbjct: 229 FDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284

Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           +S+ PLRG K   W+GGV   A L W    +    ++ Q     D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334


>gi|414072362|ref|ZP_11408307.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410805226|gb|EKS11247.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 473

 Score =  189 bits (481), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 127/345 (36%), Positives = 180/345 (52%), Gaps = 35/345 (10%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSA 113
           A +  P+I+ I ADD G+ D GF G  Q+ TPN+D LA SG+   + Y +   C PSR+ 
Sbjct: 24  AQTEKPNIVLIFADDAGFGDFGFQGSTQLKTPNLDKLAQSGVRFTQGYVSDSTCGPSRAG 83

Query: 114 IMTGKHPIHTG---------MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
           +MTGK+    G         M  N        GLPL +K +  YLKE GY+T + GKWHL
Sbjct: 84  LMTGKYQQRFGYEEINVPGFMSDNSALKGADMGLPLDQKTMGDYLKEQGYKTAVFGKWHL 143

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK-----MWGLDMRRDL----EPAWD 215
           G   + + P  RGF++ LG+  G + YF++S +EMK      +   + RD     EP   
Sbjct: 144 GDADR-FHPLKRGFDTFLGFRGGDRSYFNYSEQEMKNGNKHFFDKKLERDFGNYEEPK-- 200

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
              +Y TDV   EA   I   + DEP F+YLA  A H+     PL++    L    ++  
Sbjct: 201 ---EYLTDVLGKEAAKYIE-QNKDEPFFIYLAFNAVHT-----PLESDPKDLAKFPNLTG 251

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
            KR + AA+   LD + G V++ L++  +  N+I+VF +DNGG +   + NA++N PL G
Sbjct: 252 -KRKELAAMTLGLDRASGYVLDKLKELGLDDNTIVVFSNDNGGPS---DKNASNNAPLAG 307

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            K+   EGG+R   LI  P     G   +  V   D LPT  SAA
Sbjct: 308 TKSNQLEGGIRVPFLISWPKHIKPGSTYDYPVSTLDLLPTFYSAA 352


>gi|255530697|ref|YP_003091069.1| sulfatase [Pedobacter heparinus DSM 2366]
 gi|255343681|gb|ACU03007.1| sulfatase [Pedobacter heparinus DSM 2366]
          Length = 472

 Score =  189 bits (481), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 112/331 (33%), Positives = 183/331 (55%), Gaps = 20/331 (6%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRS 112
           V ++  P++I I++DD G+ D G +G  QIPTPNIDA+A  G    + Y +  +C PSR+
Sbjct: 23  VKTAAKPNVIVIVSDDAGYVDFGCYGGKQIPTPNIDAIAKQGTRFTDAYVSASVCAPSRA 82

Query: 113 AIMTGKHPIHTGMQHN---VL---YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
            I+TG++    G +HN   VL   Y     G+  SE+ +   ++  GY+T  +GKWH G 
Sbjct: 83  GILTGRYQQRFGFEHNTSNVLAPGYKITDVGMDPSEQTIGNEMQANGYKTIAIGKWHQGD 142

Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT 226
             K + P  RGF    G+  GH+D+F +  +      L   +++ P  ++   Y TD+FT
Sbjct: 143 EPKHF-PLNRGFNEFYGFTGGHRDFFAYKGKRTNEHALYNNKEIVPENEI--TYLTDMFT 199

Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
            +A   I   + D+P F+YL++ A H+     P+ A    +  +  I D  R  +AA++ 
Sbjct: 200 DKATSFI-TANKDKPFFMYLSYNAVHT-----PMNAKKDLMERYASIADTGRRAYAAMMT 253

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
            LD+ +GKV+  L+  ++  N++I+F++DNGGA     +N++ N PLRG+K + WEGG+R
Sbjct: 254 SLDDGIGKVMATLKANQLDKNTLIIFINDNGGA----TVNSSDNGPLRGMKGSKWEGGIR 309

Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
            A ++  P   +      + V   D LPT +
Sbjct: 310 VAMMMKWPGHIAANKTDSRPVSSLDILPTAI 340


>gi|294053963|ref|YP_003547621.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613296|gb|ADE53451.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 478

 Score =  189 bits (481), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 110/310 (35%), Positives = 170/310 (54%), Gaps = 22/310 (7%)

Query: 47  SMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQ 105
           S++   L + +  P+II I+ADD+G+ D GF G   I TPN+D LA SG+I  + Y T  
Sbjct: 9   SLLIASLASGADRPNIIVIMADDMGYADAGFTGATDILTPNLDKLAESGVIFNQGYVTHA 68

Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLY--GCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
            C PSR+  + G++    G +HN  Y       G+ + E + P  L+++GY T I+GKWH
Sbjct: 69  FCGPSRAGFLAGRYQHRFGFEHNTPYDPANPLAGIDVRETLFPARLQDVGYTTGIIGKWH 128

Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFD----HSAEEMKMWGLDMRRDLEPAWDLHGK 219
           LG     + P  RGF+   G+ TG  DYF+       +   + GL   + +    +  G 
Sbjct: 129 LG-ASSPFYPLNRGFDYFYGFLTGGHDYFEIDVTQPVKSAYLQGLFRNKRVA---NFEG- 183

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
           Y T   + +AV  + N + + P FL+L++ A     P++PLQAP   +  + HI+D KR 
Sbjct: 184 YLTTALSRDAVQFV-NDNKENPFFLFLSYNA-----PHQPLQAPQEDIARYAHIKDKKRR 237

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG----FNLNAASNWPLRG 335
            +AA++  +D  +G+VV AL++  +  N++I F+SDNGG  +        N +SN P RG
Sbjct: 238 VYAAMVDVMDRGIGEVVTALDRNGLRENTLIFFLSDNGGPVSSKKHPGKGNGSSNGPFRG 297

Query: 336 VKNTLWEGGV 345
            K   +EGGV
Sbjct: 298 GKTDFYEGGV 307


>gi|298706919|emb|CBJ29746.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 616

 Score =  189 bits (481), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 123/359 (34%), Positives = 174/359 (48%), Gaps = 51/359 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIP-TPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           P++IFIL DD+G ND+G+   D    TP +D+L+  G+ L  YYT QLCTPSR+++MTG+
Sbjct: 120 PNVIFILIDDVGTNDIGYQSTDLWELTPFMDSLSSEGVRLTKYYTNQLCTPSRASLMTGR 179

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
               TGMQ+ V+      GLPL E  L +  K         GKWHLG Y   + P  RGF
Sbjct: 180 DTFRTGMQYEVVEDSGAWGLPLEEVTLAERFK--------TGKWHLGMYSDAHYPFARGF 231

Query: 179 ESHLGYWTGHQDYFDHSA-----------EEMKMWGLD--------MRRDLEPAWDLHGK 219
           ++ LGY    + Y  H                K +G          +           G 
Sbjct: 232 DTFLGYMGAVRGYSSHEGCNTPTFEGGEYSCFKDFGYGDKDGYINHITNTTRQGPSFVGN 291

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI------ 273
           YST + T  A+++   H  D P FLY++H A HS  P +P   PD Y     H       
Sbjct: 292 YSTTIITDRAIEVAKEHGED-PFFLYVSHQAVHS--PVDP--PPDDYFTEEEHATLARVV 346

Query: 274 -EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
            ED  R +FA ++  LD+ + ++ + L+    L N+++V  SDNG           SN+P
Sbjct: 347 SEDAYRQRFARVILFLDKEMRRLHDELDALGALDNTVLVVASDNGACPTA----GGSNYP 402

Query: 333 LRGVKNTLWEGGVRGAGLIWSPLL-----ESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           LRG K+T++EGGVR    ++S        E+RG      +H +DW PT  S     D+P
Sbjct: 403 LRGYKHTIFEGGVRVPAFVYSKSTDLIPEEARGTRYSGMMHSTDWTPTFGSVV--PDLP 459


>gi|149196558|ref|ZP_01873612.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149140238|gb|EDM28637.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 443

 Score =  189 bits (480), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 118/291 (40%), Positives = 161/291 (55%), Gaps = 19/291 (6%)

Query: 62  IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGKHP 120
            + ILADDLG+ DVGF G  QI TP+ID LA  G+I  + Y +  +C PSR+ +MTGK+ 
Sbjct: 11  FVLILADDLGYGDVGFTGSSQIKTPHIDRLAKDGVIFSQGYVSSSVCGPSRAGLMTGKNQ 70

Query: 121 IHTGMQHNVL-----YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           +  G  +N+      +  E  GLP+SEK L   L E GY   +VGKWHLG  K++Y P  
Sbjct: 71  VRFGFDNNLTNYLPQFKDEFHGLPISEKTLATRLAEKGYVNGLVGKWHLG-DKEQYHPLK 129

Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
           RGF    GY  G   YF  S    K +   +  + +    +   Y TD    E VD I  
Sbjct: 130 RGFHEFWGYLGGGHHYF-RSKPNGKGYDCPIECNYKTPQPI--SYITDDKGDECVDFIRR 186

Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
           H  DEP FL+ +  A     P+ P+ A +  L ++ HIE  KR  + A+ H LD +VGK+
Sbjct: 187 HK-DEPFFLFASFNA-----PHAPMHAKEEDLKLYSHIEGEKRRAYCAMTHNLDLNVGKI 240

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
           V  L+ + +  N+++VF+SDNGG       NA+ N PLRG K  L EGG+R
Sbjct: 241 VAELKAQNIYDNTVVVFLSDNGGPTPN---NASLNAPLRGGKGILLEGGIR 288


>gi|417301514|ref|ZP_12088666.1| arylsulfatase B [Rhodopirellula baltica WH47]
 gi|327542201|gb|EGF28693.1| arylsulfatase B [Rhodopirellula baltica WH47]
          Length = 489

 Score =  189 bits (479), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 122/355 (34%), Positives = 180/355 (50%), Gaps = 46/355 (12%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
             S PPHI+ I+ADDLGWNDVGFHG  +I TPNID LA   + L  +Y   +C+P+R+ +
Sbjct: 41  GGSQPPHIVHIIADDLGWNDVGFHG-SEIRTPNIDRLASESVTLDRFYVTPICSPTRAGV 99

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR-IVGKWHLGFYKKEYTP 173
           +TG +P   G+   V+   ++ GLP   +  P++L +LGY  R + GKWHLG     + P
Sbjct: 100 LTGLYPFRFGIWGGVVSPTKKHGLPTLLETTPEHLSKLGYDHRAMFGKWHLGLASTLFHP 159

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTDVFTAEAVDI 232
              G     G++ G  DYF       +   LD  RD +    +H + YST++     VD 
Sbjct: 160 LHHGMTEFYGHYNGAIDYFSRE----RFGQLDWHRDFD---SVHEEGYSTELVGNAVVDF 212

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQA-----------PDHYL--NIHRHIE----- 274
           I  ++    ++ Y+A  A HS     PLQA           P++ L  N  R I      
Sbjct: 213 IDRNANAGAVYAYVAFNAPHS-----PLQALRSDLDEYGFDPNNKLSPNTDRKIAKREKA 267

Query: 275 -DFK--------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
            D+         R  FAA+   +D  +G++++A+++  M  N+++VF SDNG        
Sbjct: 268 LDYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGADPK---- 323

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           +  SN PLRG K T WEGGVR   ++  P     GI  +      D LP +++AA
Sbjct: 324 HGGSNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPIMVAAA 378


>gi|295086308|emb|CBK67831.1| Arylsulfatase A and related enzymes [Bacteroides xylanisolvens
           XB1A]
          Length = 458

 Score =  189 bits (479), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 123/350 (35%), Positives = 188/350 (53%), Gaps = 30/350 (8%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  L    TL   F  L A +  P+I+ +LADDLGW DVGFHG  +I TP++DAL   G+
Sbjct: 5   FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPSLDALVGEGV 61

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
            L+ +YT  + TP+R+ +MTG++P   G++  V+      GL  +E+ +   L   GY+ 
Sbjct: 62  ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121

Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           R I+GKWHLG  KK + P  RGF    G+  G  DYFD + E      LD   D E   D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
               YST++ T EA+  I  +  + P  LY+A+ A H+     PLQA +   +I  + ++
Sbjct: 178 --KGYSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228

Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
           F        K++ ++A++  +D  +G +V+AL+++ ++ N+  +F SDNG A     +  
Sbjct: 229 FDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284

Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           +S+ PLRG K   W+GGV   A L W    +    ++ Q     D +P L
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPML 334


>gi|340368073|ref|XP_003382577.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 507

 Score =  189 bits (479), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 121/368 (32%), Positives = 184/368 (50%), Gaps = 42/368 (11%)

Query: 42  LAFTLSMVFVDLVASSG---PPHIIFILADDLGWNDVGFH---GLDQIPTPNIDALAYSG 95
           +A  + +    LVA       PHI+ +L DD GW +VG+H      ++ TPNID L   G
Sbjct: 2   IAAVVLLCIAGLVAGQPVRQKPHIVLMLVDDWGWANVGYHRNPPTREVVTPNIDDLVKQG 61

Query: 96  IILKNYYTVQLCTPSRSAIMTGKHPIHTG---------MQHNVLYGCERGGLPLSEKILP 146
           + L  +Y  + C+PSRS +M+G+ PIH             ++ + G     +P +   + 
Sbjct: 62  LELDQHYAYKFCSPSRSCLMSGRLPIHVNDLNLAPTNYNPNDPVSGFS--AIPRNMTGIA 119

Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-------SAEEM 199
           + +KE GY T  VGKW  G    ++TP  RGF++  GY+    DY+          +  +
Sbjct: 120 EKMKEAGYATHQVGKWDAGMATPDHTPKGRGFDTSFGYYHHDNDYYTEVVGPQCSGSPIV 179

Query: 200 KMWGLDMRRDLEPAWDLHG----KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSAN 255
            +W  D      PA  ++G    KY   +F    +D++  H  + PLFLY A    H+  
Sbjct: 180 DLWDTD-----HPAHGINGTGPDKYEEGLFKERLMDVVSKHDPNTPLFLYYAPHIAHT-- 232

Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
              PLQ PD YLN    I+D  R  + A+++ LD+ VG +V AL+Q+ M  N + V  SD
Sbjct: 233 ---PLQVPDDYLNKFSFIDDSDRKYYHAMVNYLDDVVGDLVAALKQKGMWDNLLYVTSSD 289

Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWL 373
           NGG    +    A+N+PL+G K T W+GG+R    +    L  + RG   + YVH++DW 
Sbjct: 290 NGGPI--YAGGGANNYPLKGGKLTDWQGGIRVNAFVSGGYLPEKMRGQKTDGYVHLADWY 347

Query: 374 PTLLSAAN 381
            T  + A 
Sbjct: 348 GTFCAIAG 355


>gi|293372058|ref|ZP_06618453.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
 gi|292632962|gb|EFF51547.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
          Length = 458

 Score =  189 bits (479), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 124/350 (35%), Positives = 187/350 (53%), Gaps = 30/350 (8%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  L    TL   F  L A +  P+I+ +LADDLGW DVGFHG  +I TP +DAL   G+
Sbjct: 5   FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
            L+ +YT  + TP+R+ +MTG++P   G++  V+      GL  +E+ +   L   GY+ 
Sbjct: 62  ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121

Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           R I+GKWHLG  KK + P  RGF    G+  G  DYFD + E      LD   D E   D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
               YST++ T EA+  I  +  + P  LY+A+ A H+     PLQA +   +I  + ++
Sbjct: 178 --KGYSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228

Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
           F        K+  ++A++  +D  +G +V+AL+++ ++ N+  +F SDNG A     +  
Sbjct: 229 FDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284

Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           +S+ PLRG K   W+GGV   A L W    +    ++ Q     D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334


>gi|392966318|ref|ZP_10331737.1| sulfatase [Fibrisoma limi BUZ 3]
 gi|387845382|emb|CCH53783.1| sulfatase [Fibrisoma limi BUZ 3]
          Length = 461

 Score =  189 bits (479), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 124/345 (35%), Positives = 192/345 (55%), Gaps = 25/345 (7%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
           A++ PP+I+FILADDLG+ D+   G   + TP+ID+L  +G+   N+Y    +C+PSR+A
Sbjct: 31  AANRPPNIVFILADDLGYGDLSCFGSTDLKTPHIDSLIGAGMRFTNFYANSSVCSPSRAA 90

Query: 114 IMTGKHPIHTGMQHNVLYGCER--GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
           ++TG++P   G+   +    +   G L  S  +LP YL++ GY +  +GKWHLG  +   
Sbjct: 91  LLTGRYPERVGVPGVIRDEVQDSWGYLASSATLLPTYLRKQGYHSANIGKWHLGL-ESPN 149

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
            P  RGF+   G   G  D +          G +  R      D  G ++TDVFT  AV 
Sbjct: 150 LPNERGFQEFYGLLEGMMDDYVVKLRH----GQNFLRHNGQVIDPPG-HATDVFTDAAVR 204

Query: 232 IIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE---DFKRSKFAAILHK 287
            +++  +  +P FLYLA+ A     P++PLQ P  YLN  +  +   D +R+K  A++  
Sbjct: 205 YLNDRKAKKDPFFLYLAYTA-----PHDPLQPPVAYLNRVQKRQPGIDPQRAKLVALIEH 259

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR- 346
           LD+++G+V+  L     L+N+++VF SDNGG   G     A+N   RGVK   +EGG+R 
Sbjct: 260 LDDNIGRVLATLRANGQLANTLVVFTSDNGGWGPG----KANNGQFRGVKGQFYEGGIRI 315

Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNS 391
             G +WS  +E+ G  +   + + DW PTLL AA  S +P+ +N 
Sbjct: 316 PTGAMWSGHIEA-GSESGALLQLMDWFPTLLQAAGAS-VPDGING 358


>gi|196231680|ref|ZP_03130537.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196224152|gb|EDY18665.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 474

 Score =  188 bits (478), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 122/341 (35%), Positives = 173/341 (50%), Gaps = 54/341 (15%)

Query: 48  MVFVDLVASSGP--PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTV 104
            +F   +A + P  P+I+FI+ADDLG+ + G +G   IPTPNID L  SG+   + Y + 
Sbjct: 14  FLFCAQLAIAAPKRPNILFIVADDLGYGEPGCYGGKDIPTPNIDKLVASGVRFSSGYVSA 73

Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVG 160
             C  SR+A+MTG++    G ++N + G +      GLP++EK +   L+++GY T +VG
Sbjct: 74  PFCAASRAALMTGRYQTRFGFEYNPI-GAKNADPGTGLPVNEKTVADRLRDVGYATGLVG 132

Query: 161 KWHLGFYKKEYTPTFRGFESHLGY-----------WTGHQDYFDHSA------------E 197
           KWHLG     + P  RGF+   G+           W+G   +    A            +
Sbjct: 133 KWHLG-GTAPFHPQRRGFDEFFGFLHEGHFYLPPPWSGATTWLRRKALPDGSQGRWTSPD 191

Query: 198 EMKMWGLDMRRDLEPAWDLHGKYS------------TDVFTAEAVDIIHNHSTDEPLFLY 245
              +W  D+  + EPA+D                  TD FT EA   I  H   +P FLY
Sbjct: 192 GHTVWSTDLHEN-EPAYDADNPLLRNSQPVEEKANLTDAFTREACSFIDRHQA-QPWFLY 249

Query: 246 LAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRML 305
           LA+ A HS     PLQ  D Y+    HI D +R  FAA+L  LDE +GKV   L    + 
Sbjct: 250 LAYNAVHS-----PLQGEDTYMEKFSHIGDIQRRIFAAVLAHLDEDIGKVRAQLRADGLE 304

Query: 306 SNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
            N+++VF+SDNGG         +SN PLRG K  LW+GG+R
Sbjct: 305 ENTLVVFLSDNGGPTKELT---SSNLPLRGGKGDLWDGGIR 342


>gi|410617068|ref|ZP_11328044.1| sulfatase [Glaciecola polaris LMG 21857]
 gi|410163337|dbj|GAC32182.1| sulfatase [Glaciecola polaris LMG 21857]
          Length = 488

 Score =  188 bits (478), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 117/327 (35%), Positives = 182/327 (55%), Gaps = 26/327 (7%)

Query: 35  MAFAVLPLA---FTLSMVFVDLVASSGP------PHIIFILADDLGWNDVGFHGLDQIPT 85
           M F+ L LA     LS +    VA +G       P+II ILADDLG+ D+GF G  +I T
Sbjct: 1   MKFSALILAAKCLCLSTLLATSVAYAGQTKADSRPNIIVILADDLGYGDLGFTGSREIKT 60

Query: 86  PNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERG--GLPLSE 142
           PNIDALA+ G++  N Y     C PSR+ ++TG++    GM+ N  +  +    GLP+ E
Sbjct: 61  PNIDALAHKGVVFSNAYVTHPYCGPSRAGLLTGRYQARFGMEINAAHSPDDPFMGLPVDE 120

Query: 143 KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE-EMKM 201
               + +++ GY+T ++GKWH+G +   + P  RGF+   G+  G  DYF  S +   + 
Sbjct: 121 PTFAKRMQKAGYKTAVIGKWHMGSHPN-FHPNNRGFDYFYGFLGGGHDYFPESVKVSNEE 179

Query: 202 WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQ 261
           + + + R+ +PA     +Y T   + EA +     +T +P  +Y+A+ A     P++PL+
Sbjct: 180 YSIPLSRNGKPAQ--LNEYLTTAISKEAAEFA--MTTSQPFMMYVAYNA-----PHQPLE 230

Query: 262 APDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA-- 319
           A    L  ++HIED  R  +AA++  +D+ +G++V  L+Q     N++I F+SDNGG   
Sbjct: 231 ATQKDLAKYQHIEDINRRTYAAMVDSMDQGIGRLVNGLKQSGKFENTLIFFLSDNGGVYP 290

Query: 320 -AAGFNLNAASNWPLRGVKNTLWEGGV 345
            +   N N ASN P R  K +L EGGV
Sbjct: 291 ESWMPNSNWASNKPFRRGKVSLTEGGV 317


>gi|262405390|ref|ZP_06081940.1| arylsulfatase B [Bacteroides sp. 2_1_22]
 gi|262356265|gb|EEZ05355.1| arylsulfatase B [Bacteroides sp. 2_1_22]
          Length = 458

 Score =  188 bits (478), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 123/350 (35%), Positives = 188/350 (53%), Gaps = 30/350 (8%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  L    TL   F  L A +  P+I+ +LADDLGW DVGFHG  +I TP++DAL   G+
Sbjct: 5   FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPSLDALVGEGV 61

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
            L+ +YT  + TP+R+ +MTG++P   G++  V+      GL  +E+ +   L   GY+ 
Sbjct: 62  ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121

Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           R I+GKWHLG  KK + P  RGF    G+  G  DYFD + E      LD   D E   D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
               YST++ T EA+  I  +  + P  LY+A+ A H+     PLQA +   +I  + ++
Sbjct: 178 --KGYSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228

Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
           F        K++ ++A++  +D  +G +V+AL+++ ++ N+  +F SDNG A     +  
Sbjct: 229 FDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284

Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           +S+ PLRG K   W+GG    A L W    +    ++ Q     D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGGHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334


>gi|423294191|ref|ZP_17272318.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
           CL03T12C18]
 gi|392676448|gb|EIY69884.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
           CL03T12C18]
          Length = 458

 Score =  188 bits (478), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 124/350 (35%), Positives = 187/350 (53%), Gaps = 30/350 (8%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  L    TL   F  L A +  P+I+ +LADDLGW DVGFHG  +I TP +DAL   G+
Sbjct: 5   FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPCLDALVGEGV 61

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
            L+ +YT  + TP+R+ +MTG++P   G++  V+      GL  +E+ +   L   GY+ 
Sbjct: 62  ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETVADMLARNGYKN 121

Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           R I+GKWHLG  KK + P  RGF    G+  G  DYFD + E      LD   D E   D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
               YST++ T EA+  I  +  + P  LY+A+ A H+     PLQA +   +I  + ++
Sbjct: 178 --KGYSTELITKEAIHCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228

Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
           F        K+  ++A++  +D  +G +V+AL+++ ++ N+  +F SDNG A     +  
Sbjct: 229 FDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284

Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           +S+ PLRG K   W+GGV   A L W    +    ++ Q     D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334


>gi|196231892|ref|ZP_03130748.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196224014|gb|EDY18528.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 486

 Score =  188 bits (478), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 122/346 (35%), Positives = 189/346 (54%), Gaps = 41/346 (11%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+FILADD+GW+D+G +G D   TPNID  A   +   + Y + +C+PSRS +MTGK 
Sbjct: 26  PNILFILADDMGWSDLGCYGADLHETPNIDRFASGAVRFTSAYAMSVCSPSRSTLMTGK- 84

Query: 120 PIHTGMQHNVLY--GCERGG--------------LPLSEKILPQYLKELGYRTRIVGKWH 163
             H    H  ++  G + GG              LP SEK +  YLK  GY T ++GKWH
Sbjct: 85  --HAARLHFTIWAEGAQEGGAKNRELREAESIWNLPNSEKTIATYLKSAGYLTALIGKWH 142

Query: 164 LGFYKKEYTPTFRGFESHLG--YWTGHQDY-FDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
           LG +  E+ P   GF+ ++G   W   Q + + +S       G + R      +   G+Y
Sbjct: 143 LGDW--EHYPEAHGFDINIGGTNWGAPQTFWWPYSGSGTH--GPEFRYIPHLEYGHPGEY 198

Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP----DHYLNIHRHIEDF 276
            TD  T EA+ +I +H+ D+P F+YLAH A H+     P++A      H+   +R   + 
Sbjct: 199 LTDRLTDEAIKVI-DHAGDQPFFVYLAHHAVHT-----PIEAKADDIQHFDAKYRDGMNH 252

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-----AAAGFNLNAASNW 331
           + + +AA+  +LDE+VG+V+E L++R +  N++++F SDNGG       +G N+   +N 
Sbjct: 253 RHTIYAAMNKELDENVGRVLEHLKERGLDKNTVVIFASDNGGYIGVDKVSGKNMPVTNNA 312

Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
           PLR  K  L+EGG+R   +I  P +   G   ++ V ++D L T L
Sbjct: 313 PLRSGKGALYEGGIRVPLIIRWPGVTPNGATCDEPVILTDMLQTFL 358


>gi|196229618|ref|ZP_03128482.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196225944|gb|EDY20450.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 490

 Score =  188 bits (477), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 127/332 (38%), Positives = 172/332 (51%), Gaps = 20/332 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
           P+II I++DD G+ D  F G   I TPN+DALA SG+   + Y T  +C+PSR+ +MTG+
Sbjct: 39  PNIIVIVSDDQGYADASFQGSKDILTPNLDALAKSGVRCTRGYVTAPVCSPSRAGLMTGR 98

Query: 119 HPIHTGMQHNVL--YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +    G  +N++         LP +E +LPQ L + GY T +VGKWHLG  +    P  R
Sbjct: 99  YQERFGHHNNIVAEAALPIAHLPSNETLLPQVLAKAGYYTAMVGKWHLGL-QDGCRPYER 157

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE---PAWDLHGKYSTDVFTAEAVDII 233
           GF+   G  TG  DYF +  EE  +     +  +E   P  +    Y TD F A+AV II
Sbjct: 158 GFDEFFGIITGGHDYFVNHPEERAVGDQSYKARIERNGPVGEAVPGYLTDAFGADAVRII 217

Query: 234 ---HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDFKRSKFAAILHKLD 289
              H    D+PLFLYLA  A H+     P QAP   ++ +   +E   R  +AA +  +D
Sbjct: 218 RESHTKRPDQPLFLYLAFNAPHT-----PTQAPKDLVDTMPATLESKDRRTYAAQITSMD 272

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
            SVGKV  AL++  M  ++ IVF SDNGGA    N     N PLR  K +L+EGG+R   
Sbjct: 273 ASVGKVRAALKENGMEKDTFIVFFSDNGGA----NHPYYDNTPLRDHKGSLYEGGIRVPF 328

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
               P     G V E  V   D   T  + A 
Sbjct: 329 FAVYPGHIPAGSVCELPVTSLDVFATACALAG 360


>gi|410039868|ref|XP_003950702.1| PREDICTED: arylsulfatase I isoform 1 [Pan troglodytes]
          Length = 426

 Score =  188 bits (477), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 96/227 (42%), Positives = 139/227 (61%), Gaps = 10/227 (4%)

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL 216
           +VGKWHLGFY+KE  PT RGF++ LG  TG+ DY+ + + +   + G D+      AW L
Sbjct: 1   MVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGL 60

Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
            G+YST ++   A  I+ +HS   PLFLY+A  A H+     PLQ+P  YL  +R + + 
Sbjct: 61  SGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNV 115

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
            R K+AA++  +DE+V  +  AL++    +NS+I+F SDNGG       +  SNWPLRG 
Sbjct: 116 ARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGR 171

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           K T WEGGVRG G + SPLL+ +   +   +H++DW PTL+  A  +
Sbjct: 172 KGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGT 218


>gi|430741674|ref|YP_007200803.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430013394|gb|AGA25108.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 454

 Score =  188 bits (477), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 123/345 (35%), Positives = 189/345 (54%), Gaps = 39/345 (11%)

Query: 53  LVASSGP--PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTP 109
           L A+ G   P+++ +LADDLGW DVGF+G  +  TPN+D LA  G   K +YT   +C P
Sbjct: 31  LAAAEGAKRPNVLLLLADDLGWGDVGFNGRTEWATPNLDRLAARGTTFKRFYTAAVVCAP 90

Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK- 168
           SR+A+MTG++ IH G+  N         LP  E  L +  K  GY T + GKWH G  + 
Sbjct: 91  SRAALMTGRYTIHDGVSRN------NDDLPAREVTLAEAFKTHGYDTALFGKWHHGQPRD 144

Query: 169 --KEYT-PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
             K Y  P  +GF+   G+ T  +  ++   E+  +W     R+L+P       YS D+F
Sbjct: 145 GSKTYVHPMDQGFDEFFGF-TDAKHAWEKYPEQ--LW---HGRELKPV----SGYSDDMF 194

Query: 226 TAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR----HIEDFK--R 278
              A+D +  H     P FLY+    TH       ++AP   + +H+     ++  K  R
Sbjct: 195 ADHAIDFLKRHKEKPTPFFLYVPFINTHFN-----IEAPAEEVALHQGKFTEVDPSKPIR 249

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA----SNWPLR 334
           + +AA++ +LD++VG+++ ALE   + +++++VFVSD+G      NL A+    SN P R
Sbjct: 250 ATYAAMVTQLDKNVGRIMTALEGLGLAADTLVVFVSDHGATFESGNLGASDYHDSNRPFR 309

Query: 335 GVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
           G K TLWEGG+R  G+   P     G+++ + VH++D  PTLL+A
Sbjct: 310 GQKRTLWEGGIRVPGVACWPGHVPAGVISNEVVHMTDLFPTLLAA 354


>gi|443700719|gb|ELT99563.1| hypothetical protein CAPTEDRAFT_110993 [Capitella teleta]
          Length = 339

 Score =  187 bits (476), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 117/339 (34%), Positives = 182/339 (53%), Gaps = 31/339 (9%)

Query: 46  LSMVFVDLVASSGP-----------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           +S++ V ++A++G            P+II+  ADD G+ D G+   D I TPNID L   
Sbjct: 7   VSVLVVMVLATAGKLKQRKRQGESKPNIIWFFADDYGYQDAGYRNSD-IHTPNIDKLVAD 65

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           GI   N Y+ Q C+PSRSA ++G++   +GMQH V+   +   + L    +  YLKEL Y
Sbjct: 66  GISFTNAYSAQQCSPSRSAFLSGRYAYTSGMQHGVIGDTKAHCMDLKYNFISDYLKELKY 125

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA-EEMKMW--GLDMRRDLE 211
            T   GKWHLG+  KE TPT+RGF++  G ++G  +Y++H+  + M  W  G ++ R   
Sbjct: 126 NTHASGKWHLGYCNKECTPTYRGFDTFSGGYSGEGEYYEHTTFQGMYDWHNGTELDRSAS 185

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
                 G +S D+  A+ ++ + + +   PLF Y A   THS  P  P   P+H      
Sbjct: 186 ------GIHSQDMIEADLLNQL-DKNDGTPLFYYAAFHNTHS--PMHP--KPEHEALYDD 234

Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
                 R K+  ++  +D  +GKVV  L+++ M  N+ I F SDNGG     +++   N 
Sbjct: 235 LDAPPTRRKYLGLVSGMDAVIGKVVTKLKEKDMFDNTYIFFSSDNGG-----DVHEGDNS 289

Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
           P RG K++L++GG        SPLL+ +G+ A+ +V  S
Sbjct: 290 PKRGAKSSLFDGGCHAHSWASSPLLKKKGVEADGWVFAS 328


>gi|119582144|gb|EAW61740.1| arylsulfatase family, member I [Homo sapiens]
          Length = 426

 Score =  187 bits (475), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 96/227 (42%), Positives = 139/227 (61%), Gaps = 10/227 (4%)

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL 216
           +VGKWHLGFY+KE  PT RGF++ LG  TG+ DY+ + + +   + G D+      AW L
Sbjct: 1   MVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGL 60

Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
            G+YST ++   A  I+ +HS   PLFLY+A  A H+     PLQ+P  YL  +R + + 
Sbjct: 61  SGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNV 115

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
            R K+AA++  +DE+V  +  AL++    +NS+I+F SDNGG       +  SNWPLRG 
Sbjct: 116 ARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGR 171

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           K T WEGGVRG G + SPLL+ +   +   +H++DW PTL+  A  +
Sbjct: 172 KGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGT 218


>gi|193784090|dbj|BAG53634.1| unnamed protein product [Homo sapiens]
          Length = 426

 Score =  187 bits (475), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 96/227 (42%), Positives = 139/227 (61%), Gaps = 10/227 (4%)

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL 216
           +VGKWHLGFY+KE  PT RGF++ LG +TG+ DY+ + + +   + G D+      AW L
Sbjct: 1   MVGKWHLGFYRKECLPTRRGFDTFLGSFTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGL 60

Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
            G+YST  +   A  I+ +HS   PLFLY+A  A H+     PLQ+P  YL  +R + + 
Sbjct: 61  SGQYSTMPYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNV 115

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
            R K+AA++  +DE+V  +  AL++    +NS+I+F SDNGG       +  SNWPLRG 
Sbjct: 116 ARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGR 171

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           K T WEGGVRG G + SPLL+ +   +   +H++DW PTL+  A  +
Sbjct: 172 KGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGT 218


>gi|149177301|ref|ZP_01855906.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
 gi|148843826|gb|EDL58184.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
          Length = 501

 Score =  187 bits (475), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 130/400 (32%), Positives = 202/400 (50%), Gaps = 73/400 (18%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  L L F L   F  L A+  PP+II I++DD G+ D+G  G ++I TP++D LA  G 
Sbjct: 16  FISLALLFCLCSAF-GLKAAETPPNIIMIVSDDQGYRDLGSFGSEEIMTPHLDRLAKEGA 74

Query: 97  ILKNYY-TVQLCTPSRSAIMTGKHPIHTGM------------------QHNVLYGCER-G 136
            L ++Y T   CTPSR +++TG++P   G+                  ++ V +  ER G
Sbjct: 75  KLTSFYVTWPACTPSRGSLLTGRYPQRNGIYDMIRNEAPDFGHKYKPAEYEVTF--ERIG 132

Query: 137 GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA 196
           G+ + EK+LP  LK  GY + I GKW LG +K+ + P  RGF+   G+     DYF H  
Sbjct: 133 GMDVREKLLPALLKPAGYVSAIYGKWDLGIHKR-FLPLARGFDDFYGFTNTGIDYFTH-- 189

Query: 197 EEMKMWGL-DMRRDLEPAWDLHGKYSTDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSA 254
              + +G+  M R+ +P  +  G Y T +F  EAV  I  NH   +P FLYL   A H A
Sbjct: 190 ---ERYGVPSMYRNNQPTEEDKGTYCTYLFQREAVRFIKENHQ--KPFFLYLPFNAPHGA 244

Query: 255 NPYEP-----LQAPDHYLNIHRHIEDF-----------------------------KRSK 280
           +  +P      QAP+ Y N++ H++D                              +R +
Sbjct: 245 SSLDPRIRGGAQAPEKYKNMYPHLKDTLVTKKKTGRYEFRERPDGPVIHQGVSASKRRLE 304

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
           + A +  +D+++G+V+  L++ ++  N+I+VF SDNGG+        A N PL+G K  +
Sbjct: 305 YVASITCMDDAIGEVLGLLDEYQIADNTIVVFFSDNGGSG------GADNSPLKGKKGMM 358

Query: 341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           +EGG+R   L+  P     G V ++ +   + +PT L  A
Sbjct: 359 FEGGIRVPCLVRYPAKIKPGTVNDELLTSLELVPTFLKEA 398


>gi|313228866|emb|CBY18017.1| unnamed protein product [Oikopleura dioica]
          Length = 482

 Score =  187 bits (474), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 125/391 (31%), Positives = 195/391 (49%), Gaps = 48/391 (12%)

Query: 23  QYLKELGYRTRIMAFAVLPLAFTL--------SMVFVDLVA--SSGPPHIIFILADDLGW 72
           +YL E+ +RT+ +   ++   F +        S     LVA  SS  P++I ILADDLG+
Sbjct: 36  RYLYEIRFRTKSVFLFLVRNPFCVRNPSNSFSSFFAAFLVAKGSSSKPNVILILADDLGF 95

Query: 73  NDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG 132
           ND+ ++    I  PN+ +LA +G IL N+Y   +CTPSRSA+MT ++PI  G+Q +V+  
Sbjct: 96  NDMPWNN-PAIIAPNLHSLAKNGTILSNFYVQPVCTPSRSALMTSRYPIRLGLQTDVITA 154

Query: 133 CERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF 192
            +   LPL E  +    +  GY T IVGKWHLG Y  +  P  RGF++  GY TG +DY+
Sbjct: 155 PQPSCLPLDEVTIGNEFQSAGYTTHIVGKWHLGHYCPQCLPNNRGFDTFRGYLTGAEDYY 214

Query: 193 DHSA-------EEMKMWGLDMRRDLEPAWDLHGKYSTDV-------------FTAEAVDI 232
             +        +     G D   +       +G YST               F   + ++
Sbjct: 215 KKTFCIPLVPNQRPAACGFDFYDNENRMPKANGTYSTYQVLIYLFIHTIILKFADASREV 274

Query: 233 IHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I +H  +  P FLYL   + H      P+  P +Y +++ + ++  R ++  ++  +DE+
Sbjct: 275 IKSHEGSKTPFFLYLPFQSVH-----YPVMVPKNYSDLYPNEKNHTRKEYMGMVTAMDEA 329

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           VG +V+ L+      N++  + +DNGG       NA    P RG K TLW+GG+R    I
Sbjct: 330 VGHIVDELKAAGAYENTLFFYATDNGGLVGAGGRNA----PFRGQKATLWQGGLRVPAFI 385

Query: 352 WSPLLESRGIVAEQY---VHVSDWLPTLLSA 379
             P + S    A  Y   +H+ D   T+L A
Sbjct: 386 SGPGVPS----ASNYKNMMHIVDVQMTVLDA 412


>gi|220919599|ref|YP_002494902.1| sulfatase [Methylobacterium nodulans ORS 2060]
 gi|219952019|gb|ACL62410.1| sulfatase [Methylobacterium nodulans ORS 2060]
          Length = 482

 Score =  187 bits (474), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 118/351 (33%), Positives = 180/351 (51%), Gaps = 43/351 (12%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAI 114
           ++GPP+I+FI+ADDLG+ DVG +G   + TP+ID +A  G+   + Y    +C+ +R+A+
Sbjct: 33  AAGPPNIVFIMADDLGYADVGCYGRPDLKTPHIDRIAARGVRFTQGYANSAVCSATRTAL 92

Query: 115 MTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
           +TG++      Q+ +  G E        GGLP     LP  L+  GYRT +VGKWHLG  
Sbjct: 93  ITGRY------QYRLPVGLEEPVTTRFTGGLPAEHPTLPGLLRRAGYRTALVGKWHLGSV 146

Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK--MWGLDMRRDLEPAWDLHGKYSTDVF 225
            K + P  RG+++  G+  G  DYF H   + K  +W  D+R +          Y+T + 
Sbjct: 147 AK-FGPLQRGYDTFWGFRGGTIDYFSHRGTDDKPDLWDGDVRVE-------EVGYATHLI 198

Query: 226 TAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH-------RHIEDFK 277
              AV+ I        P FL L     H   P+ P + P+             RH +   
Sbjct: 199 GRRAVETIGTAVRAGRPFFLSL-----HFNAPHWPWETPEDEAESRRIAGSNLRHFDGGT 253

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           ++ +  I+ ++D  +G+V++ALEQ  +  ++I+VF SDNGG         +  WP  G K
Sbjct: 254 QATYHRIVAEMDAQIGRVLDALEQHGIAQDTIVVFTSDNGGE------RFSDTWPFTGRK 307

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
             L EGG+R   ++  P     G+V+ Q +   DWLPTLL+AA  S  P Y
Sbjct: 308 TELLEGGLRIPAIVSWPRRLPAGVVSAQPMITMDWLPTLLAAAGASPDPVY 358


>gi|410628682|ref|ZP_11339400.1| arylsulfatase B [Glaciecola mesophila KMM 241]
 gi|410151686|dbj|GAC26169.1| arylsulfatase B [Glaciecola mesophila KMM 241]
          Length = 510

 Score =  187 bits (474), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 119/331 (35%), Positives = 187/331 (56%), Gaps = 28/331 (8%)

Query: 33  RIMAFAVLPLAFTLS-MVFV-DLVASSGP---------PHIIFILADDLGWNDVGFHGLD 81
           R   F++L +A  L+ +VF  DL A +           P+I+ ILADDLG+ D+GF G  
Sbjct: 19  RFNLFSLLFVALCLAGIVFSSDLAAQTKKTHKLVAKERPNIVVILADDLGYADLGFTGSK 78

Query: 82  QIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCE--RGGL 138
           +I TPNIDALA +G++ KN Y T   C PSR+ ++TG++    GM+ N  +  +    GL
Sbjct: 79  EIFTPNIDALANNGVVFKNGYVTHPYCGPSRAGLLTGRYQARFGMEVNAAHSPDDPYMGL 138

Query: 139 PLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE 198
           P+ E    + +++ GY+T ++GKWH+G +   + P  RGF+   G+  G  DYF  S + 
Sbjct: 139 PVEELTFAKRMQQAGYKTAVMGKWHMGSHPN-FHPNNRGFDEFFGFLGGGHDYFPESVKV 197

Query: 199 MKM-WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
               + + + R+ +PA     +Y T   + EA   +   +T++P  +Y+A+ A HS    
Sbjct: 198 SSAEYSIALSRNGKPAQ--LNEYLTTAISKEAARFVS--ATEQPFMMYVAYNAPHS---- 249

Query: 258 EPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG 317
            PLQA +  L  ++HI D  R  +AA++  +D+ VG++V AL+Q   L N++I F+SDNG
Sbjct: 250 -PLQATEQDLAKYQHIADLDRRTYAAMIDSMDQGVGRIVAALKQSEKLHNTLIFFLSDNG 308

Query: 318 GAAA---GFNLNAASNWPLRGVKNTLWEGGV 345
           G        + + A+N P R  K +L EGGV
Sbjct: 309 GVYPEEWMPDSDWANNAPFRRGKVSLTEGGV 339


>gi|347528892|ref|YP_004835639.1| putative arylsulfatase [Sphingobium sp. SYK-6]
 gi|345137573|dbj|BAK67182.1| putative arylsulfatase [Sphingobium sp. SYK-6]
          Length = 458

 Score =  186 bits (473), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 129/377 (34%), Positives = 188/377 (49%), Gaps = 30/377 (7%)

Query: 31  RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
           R  +   A LPLA  +        A + PPHI+FI+ADDLG+ D+ F G  +  TP+ID+
Sbjct: 5   RAFLGTAAALPLAARVE------AAPARPPHIVFIMADDLGYADLSFTGRTEYATPHIDS 58

Query: 91  LAYSGIILKNYYT-VQLCTPSRSAIMTGKHP--IHTGMQHNVLYGCERGGLPLSEKILPQ 147
           +A  G    + Y    +C+P+R A+ TG++   +  G+   +  G E  GLP     +  
Sbjct: 59  IAEQGCFFSDAYANSSVCSPTRMALATGRYQYRLRGGLDEPIGGGPEPVGLPPEHPTIAS 118

Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR 207
            L+E GYRT +VGKWHLG     ++P   G++   G   G  DYF HS + M+  G    
Sbjct: 119 RLREAGYRTVLVGKWHLG-SPPRFSPLLSGYDRFFGIHGGGADYFTHS-QSMQGGGSSDL 176

Query: 208 RDLEPAWDLHGKYSTDVFTAEAV-DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP-DH 265
            D     D HG Y TD+ T  AV +I     T +PLFL L     H   P+ P + P D 
Sbjct: 177 YDGSSVTDRHG-YLTDILTERAVAEIDLAARTRKPLFLSL-----HYTAPHWPWEGPEDE 230

Query: 266 YLNIH----RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA 321
            L        + +    S FA ++  LD+ VGKV+ ALE + M  ++I+VF SDNGG   
Sbjct: 231 ALAAQIKSLLNYDGGSDSTFAKMVQALDKGVGKVLAALEAQGMTQDTIVVFTSDNGGE-- 288

Query: 322 GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
                 +  WP  G+K  L EGG+R   ++  P     G  + Q     D++PT L+ A 
Sbjct: 289 ----RFSKVWPFDGMKGDLLEGGIRVPMMMRWPGKIPAGSRSAQIAMTMDYMPTFLATAG 344

Query: 382 KSDIPNYVNSTVENIIP 398
            S  P + +  + N++P
Sbjct: 345 GSADPAFPSDGI-NLMP 360


>gi|109897214|ref|YP_660469.1| sulfatase [Pseudoalteromonas atlantica T6c]
 gi|109699495|gb|ABG39415.1| sulfatase [Pseudoalteromonas atlantica T6c]
          Length = 500

 Score =  186 bits (473), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 120/332 (36%), Positives = 171/332 (51%), Gaps = 28/332 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+I+F+LADDLG+NDVGF+G   I TPN+D LA +G+     Y     C PSR+AIMTG+
Sbjct: 40  PNILFVLADDLGYNDVGFNGSTDIKTPNLDGLAKNGMTFDAAYVAHPFCGPSRAAIMTGR 99

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +P   G Q N+       G+   E  + Q +K  GY T  +GKWHLG    EY P   GF
Sbjct: 100 YPHKIGAQFNLPEDNSNVGVSADELFIAQTMKSAGYFTGAMGKWHLG-EASEYHPNKHGF 158

Query: 179 ESHLGYWTGHQDYFDHSAEE-------MKMWGLDMRRDLEPAWDLHGK------YSTDVF 225
           +   G+  G  +YF    E          M  ++M   L P  + +GK      Y TD  
Sbjct: 159 DEFYGFLGGGHNYFPEQFEAAYNKRVAQGMTNINMY--LTPL-EHNGKEVRETEYITDGL 215

Query: 226 TAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAI 284
           + EAV+ +   +  + P FLYLA+ A     P+ PLQA +  + +   I+D KR  +A +
Sbjct: 216 SREAVNFVDKAAAKKKPFFLYLAYNA-----PHVPLQAKEEDMAMFSQIKDKKRRTYAGM 270

Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
           ++ +D  VG++VE L++     N++IVF SDNGG         A+N+PL+  K ++ EGG
Sbjct: 271 VYAVDRGVGRIVEQLKKNGQFDNTVIVFTSDNGGKLG----QGANNYPLKEGKGSVQEGG 326

Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
            R   L+  P     G      V   D  PT 
Sbjct: 327 FRTPMLVHWPKHMKAGSRFSHPVLALDLYPTF 358


>gi|149197396|ref|ZP_01874447.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139414|gb|EDM27816.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 465

 Score =  186 bits (472), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 122/355 (34%), Positives = 194/355 (54%), Gaps = 33/355 (9%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P++I I+ADDLG+NDVGF+G  +IPTP ID++A +G+   N YT   +C PSR+  +TG+
Sbjct: 21  PNLIVIMADDLGYNDVGFNGCTEIPTPGIDSIAQNGVKFTNGYTSYSVCGPSRAGFITGR 80

Query: 119 HPIHTGMQHNVLYGCE--RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +    G + N  +        LP SE  + + L ++GY   I+GKWHLG  +    P  R
Sbjct: 81  YQQRFGFERNPQWNLTDPNSALPKSEMTIAESLTQVGYHCGIIGKWHLG-AEPSLRPNKR 139

Query: 177 GFESHLGYWTGHQDYF------DHSAE---EMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           GF+   G+  G   +        H+ E   E+  +   + R+  P      KY T+ F+ 
Sbjct: 140 GFDEFFGHLGGGHRFMPEDLVIQHTEEVKNELDSYRSWITRNDTPVKTT--KYLTEEFSD 197

Query: 228 EAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
           EAV  I  NH   +P FL+L++ A     P+ PLQA + YL    HI+D KR  +AA++ 
Sbjct: 198 EAVSFIKRNHQ--KPFFLFLSYNA-----PHLPLQATEKYLARFPHIKDPKRKTYAAMVS 250

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
            +D+ V +V+++L++  +  N+I+ F+SDNGG +   + N + N+PL+G K+ +WEGG R
Sbjct: 251 AVDDGVSQVMQSLKETNIADNTIVFFLSDNGGPS---HKNKSDNFPLKGQKSDVWEGGFR 307

Query: 347 GAGLIWSPLLESRGIVAEQ-YVHVSDWLPTLLSAANKSDIPNYVNSTVE--NIIP 398
               +  P      I A+Q Y H    L    + A+ +  P + +  ++  N+IP
Sbjct: 308 VPFAMQYP----AAIQAKQVYDHPVSSLDIFATIASLAQSPTHADKPLDGVNLIP 358


>gi|294647729|ref|ZP_06725288.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294809280|ref|ZP_06767994.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
 gi|292636934|gb|EFF55393.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294443524|gb|EFG12277.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
          Length = 458

 Score =  186 bits (472), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 122/350 (34%), Positives = 187/350 (53%), Gaps = 30/350 (8%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           F  L    TL   F  L A +  P+I+ +LADDLGW DVGFHG  +I TP++DAL   G+
Sbjct: 5   FFGLLCGCTLLPAF--LHAQTERPNIVIVLADDLGWGDVGFHG-SEIKTPSLDALVGEGV 61

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
            L+ +YT  + TP+R+ +MTG++P   G++  V+      GL  +E+ +   L   GY+ 
Sbjct: 62  ELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLARNGYKN 121

Query: 157 R-IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           R I+GKWHLG  KK + P  RG     G+  G  DYFD + E      LD   D E   D
Sbjct: 122 RAIIGKWHLGHTKKVHYPMNRGLSHFYGHLNGAIDYFDLTREGE----LDWHNDWETCHD 177

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
               YST++ T EA+  I  +  + P  LY+A+ A H+     PLQA +   +I  + ++
Sbjct: 178 --KGYSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEK--DIKLYTDN 228

Query: 276 F--------KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
           F        K++ ++A++  +D  +G +V+AL+++ ++ N+  +F SDNG A     +  
Sbjct: 229 FDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNGTAG----VPG 284

Query: 328 ASNWPLRGVKNTLWEGGVRG-AGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           +S+ PLRG K   W+GG    A L W    +    ++ Q     D +PTL
Sbjct: 285 SSSGPLRGHKFDEWDGGGHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334


>gi|340369111|ref|XP_003383092.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 507

 Score =  186 bits (471), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 115/362 (31%), Positives = 186/362 (51%), Gaps = 32/362 (8%)

Query: 42  LAFTLSMVFV-DLVASSGP----PHIIFILADDLGWNDVGFH---GLDQIPTPNIDALAY 93
           LA  +  VF+  L + + P    PHI+ +L DD GW +VG+H      ++ TPNID L  
Sbjct: 2   LAVLVVGVFLLGLASGASPNANLPHIVMMLVDDWGWANVGYHRNPPTREVVTPNIDKLVK 61

Query: 94  SGIILKNYYTVQLCTPSRSAIMTGKHPIHTG---------MQHNVLYGCERGGLPLSEKI 144
            G+ L  +Y  + C+PSRS++++G+ PIH             ++ + G     +P +   
Sbjct: 62  EGLELNQHYAFRFCSPSRSSLISGRLPIHVNDLNLAPTNYNPNDPVSGFS--AIPRNMTG 119

Query: 145 LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGL 204
           + + ++  GY T  VGKW  G    ++TP  RGF++  GY+    DY+   A + K+  +
Sbjct: 120 IAEKMRGAGYATHQVGKWDAGMATSDHTPKGRGFQTSFGYFHHANDYYTEIAGKCKIPIV 179

Query: 205 DMRRDLEPAWDLHGK----YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPL 260
           D+    +PA  ++G     Y   +F    + ++ NH    PLFLY A    H+     P 
Sbjct: 180 DLWETDKPAHGINGTGPDNYEEGLFKERVLSVVSNHDPSTPLFLYYAPHIVHA-----PY 234

Query: 261 QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
           + PD YLN    I++  R  + A+++ LD+ VG++ +AL+ + M  N + V  SDNGG  
Sbjct: 235 EVPDRYLNKFSFIDNHLRQYYHAMVNYLDDVVGELTDALKNKGMWDNLLFVTSSDNGGPV 294

Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLS 378
                  A+N+PL+G K T W+GGVR    +    L  + RG   + Y+H++DW  T  +
Sbjct: 295 --HEGTGANNYPLKGGKTTDWQGGVRVNAFVSGGYLPEKMRGQKTDGYIHLADWYGTFCA 352

Query: 379 AA 380
            A
Sbjct: 353 IA 354


>gi|156368526|ref|XP_001627744.1| predicted protein [Nematostella vectensis]
 gi|156214663|gb|EDO35644.1| predicted protein [Nematostella vectensis]
          Length = 157

 Score =  185 bits (470), Expect = 8e-44,   Method: Composition-based stats.
 Identities = 78/157 (49%), Positives = 111/157 (70%), Gaps = 2/157 (1%)

Query: 38  AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
           AV    F  SM+      ++ PPHI+FI  DDLGW+DVG+H +  + TPNID LA  G++
Sbjct: 2   AVFFYVFVASMLTA--CHAAKPPHIVFIFVDDLGWSDVGYHNITDLKTPNIDRLAGEGVV 59

Query: 98  LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
           L+NYY   +CTP+R   ++G++PIHTG+QH+ ++  E  GLPL   +LPQ LK+ GY T 
Sbjct: 60  LENYYVQPICTPARGTFLSGRYPIHTGLQHSNIHETEPFGLPLDFTLLPQKLKKAGYSTH 119

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH 194
            VGKWHLGF++KEYTP +RGF++  GY++G +D++ H
Sbjct: 120 AVGKWHLGFFEKEYTPLYRGFDTFFGYYSGSEDHYTH 156


>gi|6863178|gb|AAF30403.1|AF109925_1 sulfatase 2 precursor [Helix pomatia]
          Length = 266

 Score =  185 bits (470), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 101/268 (37%), Positives = 151/268 (56%), Gaps = 42/268 (15%)

Query: 42  LAFTLSMVFVD-----LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           LA+  S + V+     + A    P+I+ ++ADD G+ D+G+HG  +  TPN+D LA  G+
Sbjct: 5   LAWLSSFLIVNCLRMVICAKQQQPNIVIVVADDYGYRDIGYHG-AEFATPNLDKLAAEGV 63

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
            L+NYY   +C+P+RS +MTG++ IHTG+QH++++  +  GLPL    +   LK +GY T
Sbjct: 64  KLENYYVQPICSPTRSQLMTGRYQIHTGLQHDIIWPSQPYGLPLQFPTIADMLKSVGYST 123

Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH---------------------- 194
             +GKWHLG YKKEYTP +RGF+S+ GY  G +DY+ +                      
Sbjct: 124 HAIGKWHLGLYKKEYTPLYRGFDSYYGYLEGGEDYYTYYNCDTFHNRTTPADTSILESYS 183

Query: 195 --------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST-DEPLFLY 245
                     +E K  G D+R   EP  D++G YST ++T +A+DII+  ST  +P  LY
Sbjct: 184 PKNILLGKHEDENKWCGYDLRDMNEPVTDMNGTYSTHLYTKKAIDIINGASTGGKPFLLY 243

Query: 246 LAHAATHSANPYEPLQAPDHYLNIHRHI 273
           LA+ A HS     P++ P  Y   +  I
Sbjct: 244 LAYQAVHS-----PMEVPAEYTKPYTFI 266


>gi|300771261|ref|ZP_07081137.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
 gi|300761931|gb|EFK58751.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
          Length = 466

 Score =  185 bits (469), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 113/354 (31%), Positives = 187/354 (52%), Gaps = 26/354 (7%)

Query: 40  LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           +   FTL M+    V++   P+II IL+DD G+ D G +G   IPTP+IDALA  G+   
Sbjct: 7   IVFLFTLFMLTTGSVSAQQRPNIILILSDDGGYEDFGCYGSQDIPTPHIDALAKGGVRFT 66

Query: 100 N-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVL------YGCERGGLPLSEKILPQYLKEL 152
           N Y T  +C PSR+ ++ G++   +G +HNV       Y  +  GL  + + +   ++  
Sbjct: 67  NSYVTASVCAPSRAGLLMGQYQQRSGFEHNVSDLPADGYQMQDIGLSDTVRTIADQMQSN 126

Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF---DHSAEEMKMWGLDMRRD 209
           GY T  +GKWH G   K + P  +GF    G+  GH+ +F       +E K+       D
Sbjct: 127 GYETMAIGKWHQGNETKHH-PLHKGFNHFFGFIGGHRSFFPIRTAIKQEEKILNDYTEVD 185

Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
            +  +     Y TD+FT +A+  +     D+P F+YL++ A H+     P++A    L  
Sbjct: 186 EKDVY-----YLTDMFTDKAISYM-RQKRDKPYFIYLSYNAVHT-----PVEATPQKLAQ 234

Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
             H++D +R  +A ++  +D+ +G++VE L++   L N++++F++DNG A      N A 
Sbjct: 235 FAHLKDAQRRSYAGLMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAATN----NGAD 290

Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           N PLRG+K + WEGG+R   ++  P     G   +  V   D +PT ++AA  +
Sbjct: 291 NGPLRGLKGSKWEGGIRVPMIMHWPKQLPAGKDYKGLVSSLDLVPTCVAAAQGT 344


>gi|388257121|ref|ZP_10134301.1| sulfatase [Cellvibrio sp. BR]
 gi|387939325|gb|EIK45876.1| sulfatase [Cellvibrio sp. BR]
          Length = 484

 Score =  185 bits (469), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 115/297 (38%), Positives = 171/297 (57%), Gaps = 21/297 (7%)

Query: 60  PHIIFILADDLGWNDVGF-HGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTG 117
           P+I+ ILADDLG+NDVGF +G  +I TP +DALA  G++ +N Y T   C PSR+ ++TG
Sbjct: 28  PNIVVILADDLGYNDVGFTNGQTEIKTPRLDALANEGVVFENGYVTHPYCGPSRAGLITG 87

Query: 118 KHPIHTGMQHNVLYGCE--RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           ++    GM++NV Y  +    GLPL+EK  P  L+E+GY+T I GKWHLG     + P  
Sbjct: 88  RYQARFGMENNVTYSPDDKYMGLPLTEKTFPARLQEVGYKTAIFGKWHLGG-APHFQPNE 146

Query: 176 RGFESHLGYWTGHQDYFD---HSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           RGF+   G+  G  +Y     H      +  + MR      +D   +Y T   + +A   
Sbjct: 147 RGFDYFYGFLDGGHNYMPGEVHLGAGGYLLPI-MRNKGVAEFD---EYLTTALSRDAARY 202

Query: 233 IHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I   S ++ P F+Y+++ A H+     PLQAP +YL  + HI+D KR  +AA++  +DE 
Sbjct: 203 IERTSKEQAPFFIYMSYNAPHA-----PLQAPQNYLEKYAHIKDEKRRTYAAMVDAMDEG 257

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGA---AAGFNLNAASNWPLRGVKNTLWEGGV 345
           +G +++AL++   L N++I F+SDNGG    +   +   A N P R  K  L EGGV
Sbjct: 258 IGMLIDALDKSGELDNTLIFFLSDNGGLYPYSWWPDYTWADNSPFRFGKVALTEGGV 314


>gi|449692603|ref|XP_004213101.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
          Length = 181

 Score =  184 bits (468), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 92/182 (50%), Positives = 131/182 (71%), Gaps = 2/182 (1%)

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           WHLGF+ K+YTPT+RGF+S+ G + G  DY++HS  E    GLD+  +    +   G YS
Sbjct: 1   WHLGFFAKQYTPTYRGFDSYYGSYLGKGDYWNHSNTETYS-GLDLHDNENGVFSQDGNYS 59

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE-PLQAPDHYLNIHRHIEDFKRSK 280
           T+++TAEA+  I+NH++ EPLFLYLA+ A HSAN  E PLQAP  +++   +I+  +R K
Sbjct: 60  TEMYTAEAISCINNHNSSEPLFLYLAYQAVHSANTEEDPLQAPQEWIDKFSYIKHEQRRK 119

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
           +AA+L  +D  VG+V +AL +++ML NSII+F +DNGG A GF+ N A+N+PLRGVK TL
Sbjct: 120 YAAMLGYMDYGVGRVHDALAEKKMLDNSIIIFTTDNGGPANGFDYNWANNFPLRGVKATL 179

Query: 341 WE 342
           +E
Sbjct: 180 FE 181


>gi|187735676|ref|YP_001877788.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
 gi|187425728|gb|ACD05007.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
          Length = 465

 Score =  184 bits (468), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 120/339 (35%), Positives = 183/339 (53%), Gaps = 22/339 (6%)

Query: 57  SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIM 115
           S PP++I ILADDLG+ D+G  G  QI TP++D LA  G+   + Y T  +C+PSR  ++
Sbjct: 26  SSPPNMIVILADDLGYGDLGCTGSKQIKTPSLDRLAREGVFCSRAYVTAPMCSPSRMGLL 85

Query: 116 TGKHPIHTGMQHN----VLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           TG+ P   G+  N    + Y  E   GLP +EK++P+YL   GYR+ + GKWHLG + K 
Sbjct: 86  TGRFPKRYGITTNPNIQMDYLPESHYGLPQTEKLIPEYLAPCGYRSAVFGKWHLG-HTKG 144

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           YTP  RGF    G+  G + YF    E   +    +  +     D+   Y TD  T  AV
Sbjct: 145 YTPPERGFTHWWGFLGGSRHYFPVKKEAEGLNPSMIVSNFTDKTDI--TYLTDDITDRAV 202

Query: 231 DIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
           + +     D +P F+++++ A H  N  +P       +   R++++ +R  + A+++ +D
Sbjct: 203 EFLQEAGKDKKPFFMFVSYNAPHWPNEAKP-----EDIAKFRNVQNGERRVYCAMVYAMD 257

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
             +G++++AL+   +  ++I+VF+SDNGGA    + NA    P RG K   +EGGVR   
Sbjct: 258 RGIGRILDALKADGLEKDTIVVFLSDNGGAPEASSCNA----PFRGAKRQHFEGGVRVPF 313

Query: 350 LIWSPLLESR--GIVAEQYVHVSDWLPTLLSAANKSDIP 386
           +I  P  +    G V  Q V   D LP LL  AN   IP
Sbjct: 314 IIRYPADKRLVPGSVCRQPVSSVDLLPALLK-ANGRHIP 351


>gi|418530519|ref|ZP_13096442.1| sulfatase [Comamonas testosteroni ATCC 11996]
 gi|371452238|gb|EHN65267.1| sulfatase [Comamonas testosteroni ATCC 11996]
          Length = 440

 Score =  184 bits (467), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 120/359 (33%), Positives = 179/359 (49%), Gaps = 27/359 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I+FILADDLGW D+G +G     TP++D LA  G+     Y    +C+ +R A++TG+
Sbjct: 4   PNIVFILADDLGWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGR 63

Query: 119 HP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +   +  G++  +     + GLP     LP  LK+ GY T ++GKWHLG     Y P   
Sbjct: 64  YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPAYGPQLS 122

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           G++   G  +G  DYF H     + +  D+ +  EP       Y T +   EA   +H  
Sbjct: 123 GYDYFFGNHSGAIDYFTHKPGVGEQFSSDLWQGNEPV--ERTGYYTYILGDEATRYVHER 180

Query: 237 -STDEPLFLYLAHAATHSANPYEPLQAPD-----HYLNIHRHIEDFKRSKFAAILHKLDE 290
              D+P FL L     H   P+ P + PD     H +    H +     K+  I+  LD+
Sbjct: 181 KGQDKPFFLSL-----HFTAPHWPWEGPDDEHVAHSIKDLFHYDGGSLKKYGEIVEALDK 235

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
           +VG+V++AL+      N+I++F SDNGG         +  WP  G K  L EGG+R   L
Sbjct: 236 AVGQVLQALDDSGQADNTIVIFTSDNGGE------RFSKTWPFTGQKTELLEGGIRVPTL 289

Query: 351 I-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYE 408
           + WS  ++ +  V EQ     DWLPTLL+AA     P++  S  +NI+P  E    + E
Sbjct: 290 LRWSARIQPQ--VQEQVTASFDWLPTLLAAAGARQHPDF-PSDGQNILPILEGHAAQTE 345


>gi|299530773|ref|ZP_07044188.1| sulfatase [Comamonas testosteroni S44]
 gi|298721289|gb|EFI62231.1| sulfatase [Comamonas testosteroni S44]
          Length = 440

 Score =  184 bits (467), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 120/359 (33%), Positives = 179/359 (49%), Gaps = 27/359 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I+FILADDLGW D+G +G     TP++D LA  G+     Y    +C+ +R A++TG+
Sbjct: 4   PNIVFILADDLGWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGR 63

Query: 119 HP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +   +  G++  +     + GLP     LP  LK+ GY T ++GKWHLG     Y P   
Sbjct: 64  YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPAYGPQLS 122

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           G++   G  +G  DYF H     + +  D+ +  EP       Y T +   EA   +H  
Sbjct: 123 GYDYFFGNHSGAIDYFTHKPGVGEQFSSDLWQGNEPV--ERTGYYTYILGDEATRYVHER 180

Query: 237 -STDEPLFLYLAHAATHSANPYEPLQAPD-----HYLNIHRHIEDFKRSKFAAILHKLDE 290
              D+P FL L     H   P+ P + PD     H +    H +     K+  I+  LD+
Sbjct: 181 KGQDKPFFLSL-----HFTAPHWPWEGPDDEHVAHSIKDLFHYDGGSLKKYGEIVEALDK 235

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
           +VG+V++AL+      N+I++F SDNGG         +  WP  G K  L EGG+R   L
Sbjct: 236 AVGQVLQALDDSGQADNTIVIFTSDNGGE------RFSKTWPFTGQKTELLEGGIRVPTL 289

Query: 351 I-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYE 408
           + WS  ++ +  V EQ     DWLPTLL+AA     P++  S  +NI+P  E    + E
Sbjct: 290 LRWSARIQPQ--VQEQITASFDWLPTLLAAAGARQHPDF-PSDGQNILPILEGHAAQTE 345


>gi|326428402|gb|EGD73972.1| arylsulfatase B [Salpingoeca sp. ATCC 50818]
          Length = 545

 Score =  184 bits (466), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 126/396 (31%), Positives = 192/396 (48%), Gaps = 47/396 (11%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PHI+FI+ DD G+++ G     +  TPN+D LA  G++L   Y+   CTPSRS+ ++G+ 
Sbjct: 41  PHIVFIMIDDWGFHNFGIRNQTEAKTPNMDKLARDGLLLDQAYSYFWCTPSRSSFLSGRL 100

Query: 120 PIHT-----------GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
           P+H              QH    G    G+P +   +P ++++ GY+T +VGKW  G   
Sbjct: 101 PLHVFHSNRVSSASWDSQHPDTAGV---GIPRNMTTIPAFMRKAGYKTHMVGKWDAGIAT 157

Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEM--------------KMWGLDMRRDLEPAW 214
            +++P  RGF+S L Y+    +Y+ ++  +                   +D+    EPA 
Sbjct: 158 PQHSPLGRGFDSSLHYFNHDNNYYAYNYTDTVSVQFPVKCQLLKYVTGFVDLWNSTEPAD 217

Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
              G Y   VF   A+D+I  H    PLFLY A    H+     PLQ P  YL+  +HI 
Sbjct: 218 APIGTYEEHVFRDHALDVISKHDASTPLFLYYASHIAHA-----PLQVPQAYLDRFQHIP 272

Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA----GFN---LNA 327
           D  R  + A++   D+ VG +  AL+ + M  N+++V  SDNGGA      G +      
Sbjct: 273 DPIRRTYQAMVLAADDVVGNITAALKAKGMWENTLLVVHSDNGGAIYQCLDGMDASLCGG 332

Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAAN-KSD 384
           ASNWPLRG K + ++GG+R    +    L S  RG V    V ++DW+ T    A    +
Sbjct: 333 ASNWPLRGGKLSPFQGGIRVNAFVNGGFLPSHRRGQVETGLVVLADWMATFCELAGVACE 392

Query: 385 IPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIE 420
            P    + +  I  +   S+L   N T    SPR+E
Sbjct: 393 DPIAKKANLPAIDSQSLWSLLSGANST----SPRVE 424


>gi|167523060|ref|XP_001745867.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775668|gb|EDQ89291.1| predicted protein [Monosiga brevicollis MX1]
          Length = 221

 Score =  183 bits (465), Expect = 3e-43,   Method: Composition-based stats.
 Identities = 94/207 (45%), Positives = 131/207 (63%), Gaps = 16/207 (7%)

Query: 51  VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPS 110
           V  VA+   PH++FI+ DDLG++D+ F G  QI TPNIDAL   G++    Y   +C+PS
Sbjct: 21  VHAVAAGTKPHVLFIVIDDLGYDDLYFRG-HQIRTPNIDALQEEGLLFTQMYMQDVCSPS 79

Query: 111 RSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           R++I++G++ +H G+  + +   +  GL L++  L   ++E GY TR VGKWH+GFYK  
Sbjct: 80  RASILSGRYAMHHGVT-DWIPPRDSYGLMLNDTTLADKMREAGYDTRAVGKWHMGFYKWA 138

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE---------PAWDLHGKYS 221
           YTPTFRGF S LGY++G +DYF H  +       DM RD           PAWDL G+YS
Sbjct: 139 YTPTFRGFNSFLGYYSGGEDYFTHETDN----AYDMHRDEGRHCGPNCSIPAWDLKGQYS 194

Query: 222 TDVFTAEAVDIIHN-HSTDEPLFLYLA 247
           T +F+ EA+ II+   + D PLFLYLA
Sbjct: 195 TTIFSEEAIRIINQRQAADPPLFLYLA 221


>gi|449677596|ref|XP_004208885.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
          Length = 193

 Score =  183 bits (465), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 82/144 (56%), Positives = 105/144 (72%), Gaps = 3/144 (2%)

Query: 69  DLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGM--- 125
           +LG+NDV FHG  QIPTPNID +A  G+IL NYY + +CTPSRSAIMTG++PIHTG+   
Sbjct: 44  NLGFNDVSFHGSKQIPTPNIDKIAKEGVILNNYYVLPICTPSRSAIMTGRYPIHTGIFYH 103

Query: 126 QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYW 185
            ++ ++     G+ L EK LPQYLK +GY+T  +GKWHLGF+ KEYTPT+RGF+S  GY+
Sbjct: 104 TYDTIFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTPTYRGFDSFYGYY 163

Query: 186 TGHQDYFDHSAEEMKMWGLDMRRD 209
            G  DY+DHS      WGLD+  D
Sbjct: 164 GGQADYWDHSLASNGWWGLDLHYD 187


>gi|242002216|ref|XP_002435751.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215499087|gb|EEC08581.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 433

 Score =  183 bits (465), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 107/267 (40%), Positives = 146/267 (54%), Gaps = 25/267 (9%)

Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
           +TGMQH VL   E  GLPL+ K+LP++L +LGY      K  L   +  +   F      
Sbjct: 1   YTGMQHFVLMEMEPRGLPLNFKLLPEWLGDLGYVHNQTTKQTLLHKELRFVYAFPYLRR- 59

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEP 241
                                G D RR L    +  G+Y T++ T EA+ II  H  ++P
Sbjct: 60  ---------------------GFDFRRGLLGGREFDGRYYTELMTEEAIRIIETHPLEKP 98

Query: 242 LFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQ 301
           LFLYL+H A HS N  +PLQ P  + +++  I    R+ +A ++  LDESVG VVEAL +
Sbjct: 99  LFLYLSHLAPHSGNLRDPLQVPKKFSDMYHDIGSRNRTIYAGMVSALDESVGAVVEALGR 158

Query: 302 RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE-SRG 360
           R ML ++++VF SDNG  + G N N AS WP +G K T WEGGVR   +IWSP L  ++G
Sbjct: 159 RGMLGDTVLVFSSDNGANSRGENPNYASAWPFKGQKITPWEGGVRAPAIIWSPSLSGTQG 218

Query: 361 IVAEQYVHVSDWLPTL--LSAANKSDI 385
                  H+SDWLPTL  L+  + SD+
Sbjct: 219 RDYSNIFHISDWLPTLYQLAGGDPSDL 245



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 64/145 (44%), Gaps = 17/145 (11%)

Query: 566 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQ 625
             D S   +IDGI     L     + R  +L NID    ++A+ +G +KLV         
Sbjct: 239 GGDPSDLGDIDGISQLEALRHGTEAPRKEVLINIDPIDNVTAIIEGHFKLVSGTVKGGAL 298

Query: 626 VDLTGGPDQVYLSGLSDREWL--ALAMRKLRDAA-SIQCG--------PVK------EVP 668
            D    P  V       R     ++A + LR++   + CG        PVK        P
Sbjct: 299 DDWFQIPGNVAWDTDRTRHECETSVAAQVLRNSGHDVTCGSGQGSYAIPVKCGRRDTSKP 358

Query: 669 CEPQIAPCLFDIKNDPCEKNNLADR 693
           C P +APCLFD+  DPCE NN+A++
Sbjct: 359 CTPTLAPCLFDLSEDPCEYNNVAEQ 383



 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 23/45 (51%), Positives = 30/45 (66%), Gaps = 2/45 (4%)

Query: 762 AASIQCGPVKEV-PCEPQIAPCLFDIKNDPCEKNNLADR-SEVQR 804
           A  ++CG      PC P +APCLFD+  DPCE NN+A++ SEV R
Sbjct: 345 AIPVKCGRRDTSKPCTPTLAPCLFDLSEDPCEYNNVAEQHSEVVR 389


>gi|251798133|ref|YP_003012864.1| sulfatase [Paenibacillus sp. JDR-2]
 gi|247545759|gb|ACT02778.1| sulfatase [Paenibacillus sp. JDR-2]
          Length = 434

 Score =  182 bits (462), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 118/339 (34%), Positives = 180/339 (53%), Gaps = 35/339 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+II    DDLG+ D+G +G D + TP++D LA  GI   N+Y+   +C+PSR++++TGK
Sbjct: 4   PNIIVFYCDDLGYGDLGCYGSDAMKTPHLDQLASEGIRFTNWYSNSPVCSPSRASLLTGK 63

Query: 119 HPIHTGMQHNVLYGCERG--GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +P   G+    + G +RG  GL L +  L   LKE GY T + GKWHLG    EY P   
Sbjct: 64  YPAKAGV--TSILGGKRGTKGLSLEQTTLASALKEHGYHTALFGKWHLG-ASAEYGPNAH 120

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGL--------DMRRDLEPAWDLHGKYSTDVFTAE 228
           GF+   G+  G  DY+ H    +  WG         D+ R+    W+ +G+Y T+  T E
Sbjct: 121 GFDQFYGFRAGCIDYYSH----IFYWGQGGGVNPVHDLWRNETEVWE-NGEYMTEAITRE 175

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
           A   I     DEP F+Y+A+ A     P+ P+ AP  YL+    +    R   AA++  +
Sbjct: 176 ATSYIDAAPDDEPYFMYVAYNA-----PHYPMHAPKAYLDRFPDLPP-DRRIMAAMIAAV 229

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--------LNAASNWPLRGVKNTL 340
           D+ VG++V+AL+Q+    ++II F SDNG +    N            S    RG K +L
Sbjct: 230 DDGVGEIVKALKQKGAYEDTIIFFSSDNGPSTESRNWLDGTEDLYYGGSAGRFRGHKASL 289

Query: 341 WEGGVRGAGLIWSP--LLESRGIVAEQYVHVSDWLPTLL 377
           +EGG+R   ++  P  L E +G ++++   + D  PT+L
Sbjct: 290 FEGGIREPAILSYPAGLAEQQGQISDEMFAMMDIFPTML 328


>gi|149199999|ref|ZP_01877025.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149136872|gb|EDM25299.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 512

 Score =  182 bits (462), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 121/368 (32%), Positives = 191/368 (51%), Gaps = 54/368 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMT-- 116
           P+II I ADD+G++DVG+HG  +I TPNID++A  G+   + Y +  +C PSR+ ++T  
Sbjct: 21  PNIILIFADDMGYDDVGYHGNKRIITPNIDSIAEQGVQFSQGYVSASVCGPSRAGLLTGV 80

Query: 117 -------GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
                  G++P  +G  + + Y     GLP S+ ++ + LK LGY   ++GKWH+GF   
Sbjct: 81  YQQRFGCGENPNGSGYPNQMKY--PMAGLPQSQSMISEELKTLGYTNGMIGKWHMGF-DM 137

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMK---MWGLDMRRDLEPA-----WDLHGK-- 219
              P  RG++   G+  G  DY + + E  K    W +    ++EPA      D+  +  
Sbjct: 138 SLRPNQRGYDFFYGFINGSHDYTEWTQEFAKGKSRWPIFRNEEMEPANKAQYIDVFKEKG 197

Query: 220 -------YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
                  Y TD+FT EAV+ I + + D+P FLYLA+ A H      P Q   H L+   H
Sbjct: 198 VKVVDENYLTDLFTDEAVNFI-DRNADKPFFLYLAYNAVH-----HPWQTTQHALDKTAH 251

Query: 273 IEDFKRSK-FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA----------- 320
           ++D K    FA++++ +DE +GKV++ L+++ +  N+II+F+SDNG              
Sbjct: 252 LKDDKNYHVFASMVYAMDEGIGKVMKKLKEKNIDDNTIIIFLSDNGSPQGQGIEHSPKDP 311

Query: 321 ----AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
                GF +  +S    RG K   +EGG+R    I  P    +G   +  +   D  PTL
Sbjct: 312 NRHRGGFTM--SSTGIFRGYKGDTYEGGIRVPFCIKWPQQIQKGTKYDMPISALDLQPTL 369

Query: 377 LSAANKSD 384
           + AA  +D
Sbjct: 370 VKAAGGND 377


>gi|296124181|ref|YP_003631959.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296016521|gb|ADG69760.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 470

 Score =  182 bits (462), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 124/347 (35%), Positives = 179/347 (51%), Gaps = 22/347 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
           P++I   ADDLGW + G  G  QIPTP+ID++A +G+   + +     C+PSR+ ++TG+
Sbjct: 41  PNVIIFYADDLGWGETGIQGNPQIPTPHIDSIAKNGVRCTQGFVAATYCSPSRAGLLTGR 100

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +P   G + N +      GL L E  L   L  LGY+T  VGKWHLG    EY PT RGF
Sbjct: 101 YPTRFGHEFNRIANVS--GLDLQETTLADRLHGLGYKTACVGKWHLG-DGPEYRPTKRGF 157

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           +   G  T     F H     K     +  D+    D    Y+TD +   +V+ I     
Sbjct: 158 DEFFG--TLANTPFFHPT---KFVDSRVSNDVAEVSD-ENFYTTDEYAKRSVEWI-GQQQ 210

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
             P FLYL   A H+     PLQAP  YL+    I D KR  FAA++  +D+++G+V+  
Sbjct: 211 QSPWFLYLPFNAQHA-----PLQAPQKYLDRFESIADPKRKLFAAMMSAMDDAIGQVLGK 265

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLE 357
           + +     N+++ F+SDNGG   G     + N PLRG K T +EGG R   L+ W   L 
Sbjct: 266 VRELGQEENTLVFFISDNGGPTQG---TTSQNGPLRGFKMTTFEGGTRVPFLVQWKGKLP 322

Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSI 404
           + G   +  V   D LPT+L+AA     P +    V+ ++P + +SI
Sbjct: 323 A-GKTYDNPVINLDVLPTVLTAAGSKIDPAWKLDGVD-LVPYFTSSI 367


>gi|449138314|ref|ZP_21773584.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
 gi|448883087|gb|EMB13631.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
          Length = 485

 Score =  182 bits (462), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 124/387 (32%), Positives = 199/387 (51%), Gaps = 61/387 (15%)

Query: 42  LAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
           L  TL+M+ V + AS+   P+I+ I++DD G+ND+G  G + I TP +D LA  G  L N
Sbjct: 11  LFTTLAMLLVAIKASAADRPNILLIVSDDQGYNDLGQLG-NGIITPALDRLANEGTRLTN 69

Query: 101 YYTV-QLCTPSRSAIMTGKHPIHTGM------------------QHNVLYGCER-GGLPL 140
           +Y     CTPSR++++TG++P   G+                  Q+ V +  ER GG+  
Sbjct: 70  FYVAWPACTPSRASLLTGRYPQRNGIYDMIRNEAPDYGHRYTPDQYAVTF--ERIGGMDE 127

Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
            E I+P  LK  GY++ I GKW LG  ++   P  RGF+   G+     DYF H    + 
Sbjct: 128 REVIIPAVLKRAGYKSGIYGKWDLGALRR-MLPNSRGFDDFYGFVNTGIDYFTHERYGVP 186

Query: 201 MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP- 259
                M R+LEP  +  G Y T +F  EA+  +  H+ +EP FLY+   A H+++  +P 
Sbjct: 187 C----MVRNLEPTEEDKGTYCTYLFQREALRFLDEHAGEEPFFLYVPFNAPHNSSSLDPK 242

Query: 260 ----LQAPDHYLNIHRHIED--------------------FKRSKFAAILHKLDESVGKV 295
               +QAP+ +  ++  +E                      +R  + A +  +D ++G++
Sbjct: 243 IRSSVQAPERFKAMYPPVESETRVTDRYRYGTPATVATPQARRRDYRAAVTCMDAAIGEM 302

Query: 296 VEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSP 354
           ++ LE ++ML ++I+VF SDNGG+        A N PLRG K   WEGG+R   ++ W+ 
Sbjct: 303 LDRLEAKQMLEDTIVVFFSDNGGSG------GADNSPLRGHKAQTWEGGIRVPCIVRWTA 356

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAAN 381
                G V ++++   + LP+  SAA 
Sbjct: 357 GKIPAGAVNDEFLTSLELLPSFASAAG 383


>gi|241244123|ref|XP_002402253.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215496292|gb|EEC05932.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 196

 Score =  182 bits (461), Expect = 9e-43,   Method: Composition-based stats.
 Identities = 96/223 (43%), Positives = 123/223 (55%), Gaps = 42/223 (18%)

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH 217
           I+G WHLG+YK+ YTPT RGFES  GY+   +DY+                         
Sbjct: 10  ILGAWHLGYYKEAYTPTERGFESFYGYYNFGEDYY------------------------- 44

Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
                            NH+ D PLFLYL+H A H+   Y P+QAP+  +    +I    
Sbjct: 45  -----------------NHTLDLPLFLYLSHLAVHAGTQYGPIQAPEENVQKFDYIGVKN 87

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           RS +A  +  LD+SVG V +ALE++ ML N+++VF+SDNG    G   NA SNWPLRG K
Sbjct: 88  RSLYAGAVDALDQSVGIVFQALEKKDMLQNTVLVFLSDNGAIPWGEWSNAGSNWPLRGAK 147

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            TLWEGGVR    +WSPLL     V+ Q +HVSDW+PTL S A
Sbjct: 148 FTLWEGGVRVPAFVWSPLLRKPQRVSNQLMHVSDWVPTLYSLA 190


>gi|296121201|ref|YP_003628979.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296013541|gb|ADG66780.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 479

 Score =  182 bits (461), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 129/365 (35%), Positives = 197/365 (53%), Gaps = 27/365 (7%)

Query: 51  VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTP 109
           V+  + SG P+I+ I+ADDLG+ D+G  G  +IPTP++D LA SGI   N Y +   C+P
Sbjct: 29  VNQTSKSGRPNILVIMADDLGYADLGVQGGCEIPTPHLDQLAASGIRCTNAYVSAPYCSP 88

Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
           SR+  +TGK+    G + N   G E + GLPL E  +   L+  GYRT ++GKWH GF  
Sbjct: 89  SRAGFLTGKYQTRFGHEFNPHVGEEAKLGLPLEEVTIANLLQTEGYRTALIGKWHQGF-S 147

Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGL---DM---RRDLEPAWDLHGKYST 222
           K++ P  RGF+   G+  G  +Y  H   + +       DM    R++EP       Y+T
Sbjct: 148 KDHHPQSRGFDEFFGFLVGGHNYLLHKEVKARFGTAHSHDMIYRGREVEP----QEGYAT 203

Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQ-APDHYLNIHRHIEDFKRSKF 281
           D+FT EA+  +     ++P FLYL++ A H+     PL+ AP     I   ++   R  +
Sbjct: 204 DLFTNEALRWMSG-PPNKPWFLYLSYNAVHT-----PLEIAPHLQKRIPESVKLPARRGY 257

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA--AAGFNLNAASNWPLRGVKNT 339
            ++L  LD+S+G++ + L Q  +   ++I+F+SDNGG+  A     N+  N PLRG K  
Sbjct: 258 LSLLAGLDDSIGRITQHLSQHGLREKTLIIFLSDNGGSGRAPILAYNSGLNHPLRGDKGQ 317

Query: 340 LWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLS-AANKSDIPNYVNSTVE--N 395
             EGG+R    + W   L +R I  EQ +   D LPT+   AAN    P  +   ++  N
Sbjct: 318 TLEGGIRVPFFVSWPGQLPARTIY-EQPIISLDLLPTVCQLAANNPAKPQPLPQGIDGVN 376

Query: 396 IIPRY 400
           ++P +
Sbjct: 377 LMPYW 381


>gi|167520280|ref|XP_001744479.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776810|gb|EDQ90428.1| predicted protein [Monosiga brevicollis MX1]
          Length = 314

 Score =  182 bits (461), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 111/309 (35%), Positives = 168/309 (54%), Gaps = 25/309 (8%)

Query: 74  DVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGC 133
           DV F  +  IPTP+ID LA SG+ L++YY  QLC+P+R+++++G++    GM   V+   
Sbjct: 21  DVAFGPM--IPTPSIDELALSGVRLESYYVNQLCSPTRTSLLSGRYAYTLGMNAEVIVDG 78

Query: 134 ERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFD 193
               LPL+ K + ++L+E G+ T   GKW  G      TPT RGF++  G++    DYF 
Sbjct: 79  HPDQLPLTVKTIGEHLQEGGWATFAGGKWDAGMTSWGCTPTCRGFDTFSGFYNAFNDYFT 138

Query: 194 HSAEEMKMWGLDMRRDLEP--AWDLH-GKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHA 249
           H         LD+R D  P  A D H G Y T++ T+     I   + T++  F Y+AH 
Sbjct: 139 HRVGNY----LDLRHDFAPDLADDNHTGVYMTELLTSRVQQFITTAAETEQSTFAYVAHQ 194

Query: 250 ATHSANPYEPLQAPDHYLN---IHRHIEDF-KRSKFAAILHKLDESVGKVVEALEQRRML 305
           A H+ N     Q P  YL    +    ED   R     ++  +DESV  +    +Q  + 
Sbjct: 195 AVHAPN-----QVPMSYLEGYCLDTIPEDRPTRRILCGMMRAVDESVRNITATYKQLGLW 249

Query: 306 SNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVA 363
           ++++++F +DNGG          SN+PLRG K T +EGG+RG G + SPLL    RG ++
Sbjct: 250 NDTVLIFTTDNGGNPE----TGGSNYPLRGQKATTFEGGMRGVGFVNSPLLNESQRGFIS 305

Query: 364 EQYVHVSDW 372
           ++ +HVSDW
Sbjct: 306 DELIHVSDW 314


>gi|241601300|ref|XP_002405277.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215502504|gb|EEC11998.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 234

 Score =  182 bits (461), Expect = 1e-42,   Method: Composition-based stats.
 Identities = 94/225 (41%), Positives = 136/225 (60%), Gaps = 2/225 (0%)

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW-DL 216
           I+ +WHLG+Y  +Y P  RGF++ LG+++   DY+  +       G D     +  W D 
Sbjct: 9   ILLQWHLGYYDSKYVPVNRGFDTFLGFYSPALDYYTQNFTFDNHTGHDFWNGDKIYWVDK 68

Query: 217 HGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
             K YST  +T + V +I  H+   PLFL L+H A H++     LQ P   +    +I++
Sbjct: 69  EKKQYSTHYYTRKTVQLIQVHNKSTPLFLLLSHQAPHTSGGPTLLQVPKKGIRDFSYIQE 128

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
             R+ FA ++  LD+SVG VV+AL++  ML+++I+VF +DNG    GF  N  SNWPLRG
Sbjct: 129 ENRTLFAGMVDALDQSVGSVVDALQKADMLTDTILVFCTDNGALPWGFKSNRGSNWPLRG 188

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            K TL+EGGVR    IWSPL+ +   V+ Q +H++DWLPTL SAA
Sbjct: 189 TKFTLYEGGVRSTAFIWSPLVNNSKRVSSQLMHITDWLPTLYSAA 233


>gi|343084004|ref|YP_004773299.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342352538|gb|AEL25068.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 445

 Score =  181 bits (460), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 117/332 (35%), Positives = 180/332 (54%), Gaps = 33/332 (9%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I+ I+ADDLG+ D+  +G + I TPN+D LA  G++  +Y++   +C+P+R+A+MTGK
Sbjct: 28  PNIVLIMADDLGYGDLSCYGNEYINTPNLDLLASEGVLFTDYHSNGSVCSPTRAALMTGK 87

Query: 119 HPIHTGMQHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
           +   TG++  V     R  GL L+E  L + LK+LGY T + GKWHLG Y K + PT +G
Sbjct: 88  YQQRTGVEGVVTAKSHRDVGLALAEVTLAEELKQLGYNTGMFGKWHLG-YDKAFNPTLQG 146

Query: 178 FESHLGYWTGHQDYFDHSAEE--MKMW-GLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           F+  +G+ +G+ DY  H  +E  +  W G+ ++ +          Y+TD+ +   V  I 
Sbjct: 147 FDEFVGFVSGNVDYHGHIDQEGYLDWWDGVKIKNE--------KGYTTDLISEYGVKFIQ 198

Query: 235 NHSTD---EPLFLYLAHAATHSANPYEPLQAPDHYL------NIHRHIEDFKRSKFAAIL 285
            H+ +    P FLYL H A HS  PY+  +  D  L           I+D   S +  ++
Sbjct: 199 EHNPEVKRAPFFLYLPHEAPHS--PYQ--RRIDKVLREIGTAGTQEVIQDSISSIYKEMV 254

Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
             +DE VG+++++L++     N+I++F+SDNG    G       N  LRG K   +EGG 
Sbjct: 255 EVMDEGVGRIMQSLKETGQYENTIVIFISDNGANHYG------DNGGLRGFKAGPYEGGS 308

Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
           R   +   P     G V  Q V   D LPTLL
Sbjct: 309 RVPAIFSFPKEVKGGSVNNQTVLSMDLLPTLL 340


>gi|445495948|ref|ZP_21462992.1| sulfatase [Janthinobacterium sp. HH01]
 gi|444792109|gb|ELX13656.1| sulfatase [Janthinobacterium sp. HH01]
          Length = 471

 Score =  181 bits (460), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 115/334 (34%), Positives = 168/334 (50%), Gaps = 24/334 (7%)

Query: 53  LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSR 111
           L A    P+IIFILADDLG  D+G +G   I TPN+D LA  GI   + Y    +C+ +R
Sbjct: 26  LAAGKEQPNIIFILADDLGHADLGVYGQTDIRTPNLDKLAGQGIRFTQAYANSAVCSATR 85

Query: 112 SAIMTGKHP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
            A++TG++   +  G++  +    +  GLP +   LP  LK+ GY T ++GKWHLG Y  
Sbjct: 86  FALITGRYQYRLRGGLEEPIAGASDTLGLPRTHPTLPSLLKKQGYGTALIGKWHLG-YLP 144

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
            + P   G++S  G + G  DYF H          D+     P   +   Y TD+  A A
Sbjct: 145 TFGPLKSGYDSFFGNYGGAIDYFTHKPGVGPQVKEDLYEGEVPVHQI--GYYTDLLGARA 202

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF------KRSKFAA 283
           VD +      +P  L L     H   P+ P + P     I R I++         + +  
Sbjct: 203 VDFVQKQQAGKPFLLSL-----HYTAPHWPWEGPGDE-AISRDIDNIFHYDGGDLATYGR 256

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
           ++  LD S+G+V++ALE + +  N+I++F SDNGG         +  WPL G K  L EG
Sbjct: 257 MVESLDASIGRVLKALEDKGLAENTIVIFTSDNGGE------RFSKTWPLTGQKTELLEG 310

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
           G+R   ++  P   + G V+EQ     DWLPTLL
Sbjct: 311 GIRVPAIVRWPAKIAAGQVSEQVAISMDWLPTLL 344


>gi|264677229|ref|YP_003277135.1| sulfatase [Comamonas testosteroni CNB-2]
 gi|262207741|gb|ACY31839.1| sulfatase [Comamonas testosteroni CNB-2]
          Length = 440

 Score =  181 bits (460), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 119/353 (33%), Positives = 177/353 (50%), Gaps = 27/353 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I+FILADDLGW D+G +G     TP++D LA  G+     Y    +C+ +R A++TG+
Sbjct: 4   PNIVFILADDLGWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGR 63

Query: 119 HP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +   +  G++  +     + GLP     LP  LK+ GY T ++GKWHLG     Y P   
Sbjct: 64  YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPSYGPQRS 122

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           G++   G  +G  DYF H     + +  D+ +  EP       Y T +   EA   +H  
Sbjct: 123 GYDYFFGNHSGAIDYFTHKPGVGEQFSPDLWQGSEPV--ERTGYYTYILGDEATRYVHER 180

Query: 237 -STDEPLFLYLAHAATHSANPYEPLQAPD-----HYLNIHRHIEDFKRSKFAAILHKLDE 290
              D+P FL L     H   P+ P + PD     H +    H +     K+  I+  LD+
Sbjct: 181 KGQDKPFFLSL-----HFTAPHWPWEGPDDEHVAHNIKDLFHYDGGSLKKYGEIVEALDK 235

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
           +VG+V++AL+      N+I++F SDNGG         +  WP  G K  L EGG+R   L
Sbjct: 236 AVGQVLQALDDTGQADNTIVIFTSDNGGE------RFSKTWPFTGQKTELLEGGIRVPTL 289

Query: 351 I-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
           + WS  ++ +  V EQ     DWLPTLL+AA     P++  S  +NI+P  E 
Sbjct: 290 LRWSARIQPQ--VQEQVTVSFDWLPTLLAAAGARPHPDF-PSDGQNILPILEG 339


>gi|241176208|ref|XP_002399501.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215495150|gb|EEC04791.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 418

 Score =  181 bits (459), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 104/260 (40%), Positives = 146/260 (56%), Gaps = 5/260 (1%)

Query: 124 GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG 183
           G+QH  +   E  GLPL   ++PQ+ K+LGY T +VGK  +G +    T   R  + H  
Sbjct: 1   GLQHLTIEEGEAWGLPLRFTLMPQHFKKLGYATHMVGK--VGQFIISDTSCRRYVKRHTQ 58

Query: 184 YWTGHQDYFDHSAEE---MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE 240
                +   +   +E       GLD+  +        G Y TD+ T +AV II +H T +
Sbjct: 59  KSCSQRCKRERKVKEPCGGDHIGLDLWDNEVAVRTECGNYDTDMLTDKAVSIIASHDTTK 118

Query: 241 PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALE 300
           PLFLYL+H + HSA    PLQAP+  +    +I +  R+ +A ++  +D++ G++VEAL 
Sbjct: 119 PLFLYLSHRSAHSAFTSAPLQAPEKNIAKFGYIGERNRTIYAGMVDSMDDAFGRLVEALS 178

Query: 301 QRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRG 360
              +L+N+I+VF SDNG        N   NWPLRG K TLWEG VR    IWSPLLES G
Sbjct: 179 GAGVLNNTIVVFSSDNGAVPIIEFPNRGFNWPLRGAKKTLWEGAVRVPAFIWSPLLESSG 238

Query: 361 IVAEQYVHVSDWLPTLLSAA 380
            V++Q +H+ DWLPT  S A
Sbjct: 239 RVSDQMMHIVDWLPTFYSVA 258



 Score = 49.3 bits (116), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 41/125 (32%), Positives = 54/125 (43%), Gaps = 19/125 (15%)

Query: 570 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMRYQVDLT 629
           S+  E DG D+W  LS    S R  +L NID     S+L   ++K  KVV          
Sbjct: 263 SHLGEQDGFDMWKALSEGTKSPREEMLLNIDPVLNTSSL---RYKNHKVVL--------- 310

Query: 630 GGPDQVYLSGLSDREWLALAMRK-LRDAASIQCGPVKEVP--CEPQIAPCLFDIKNDPCE 686
                 Y +G  D  +     R   R  A+++CG         +P   P LFD+  DPCE
Sbjct: 311 ----GTYHNGSFDHRFKTTGRRPGWRRRATVRCGECNLAKDNVKPSSPPYLFDVVADPCE 366

Query: 687 KNNLA 691
            NNLA
Sbjct: 367 LNNLA 371


>gi|440716877|ref|ZP_20897380.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
 gi|436438070|gb|ELP31646.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
          Length = 485

 Score =  181 bits (458), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 122/374 (32%), Positives = 187/374 (50%), Gaps = 56/374 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
           P+I+ I++DD G+ND+G  G + I TP +D LA  G  L N+Y     CTPSR++++TG+
Sbjct: 30  PNILLIVSDDQGYNDLGQLG-NGIITPALDRLAKEGTRLTNFYVAWPACTPSRASLLTGR 88

Query: 119 HPIHTGMQHNVL-----YG-----------CER-GGLPLSEKILPQYLKELGYRTRIVGK 161
           +P   G+   +      YG            ER GG+   E ILP  L+  GY++ I GK
Sbjct: 89  YPQRNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAVLRPAGYKSGIFGK 148

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           W LG  ++   PT RGF+   G+     DYF H    +      M R+LEP     G Y 
Sbjct: 149 WDLGALQR-MLPTSRGFDDFYGFVNTGIDYFTHERYGVPC----MVRNLEPTEADKGTYC 203

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP-----LQAPDHYLNIHRHIE-- 274
           T +F  EA+  +  H+ +EP FLY+   A H+++  +P     +QAPD +  ++  +E  
Sbjct: 204 TYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLDPTIRSSVQAPDQFKAMYPPVEVE 263

Query: 275 ------------------DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
                               +R  + A +  +D ++G++++ LE ++ML N+I+VF SDN
Sbjct: 264 TRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEMLDRLEAKQMLDNTIVVFFSDN 323

Query: 317 GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPT 375
           GG+        A N PLRG K   WEGG+R   L+ W       G+V  +++   + LP+
Sbjct: 324 GGSG------GADNSPLRGHKAQTWEGGIRVPCLVRWPAGKIPAGVVNNEFLTSLELLPS 377

Query: 376 LLSAANKSDIPNYV 389
              AA     P  V
Sbjct: 378 FAIAAGVEPPPGVV 391


>gi|319952005|ref|YP_004163272.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319420665|gb|ADV47774.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 484

 Score =  181 bits (458), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 122/356 (34%), Positives = 177/356 (49%), Gaps = 30/356 (8%)

Query: 38  AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
             L  A  L    ++  + +  P+I+ IL DDLG+NDVGF+G   I TPN+D LA  G I
Sbjct: 9   CALVSAVVLFSSCINAQSQTEQPNILLILCDDLGYNDVGFNGSTDITTPNLDQLAQDGTI 68

Query: 98  LKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNV-LYGCERG-GLPLSEKILPQYLKELGY 154
             + Y     C PSR+A++TG++P   G Q N+   G   G G+ + EK +   +++ GY
Sbjct: 69  FTSAYVAHPFCGPSRAALLTGRYPHTLGSQFNLPANGASTGKGISVEEKFMGVPMQKAGY 128

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKM-------WGLDMR 207
            T  +GKWHLG    EY P  RGF    G+  G   YF    EE K+        G    
Sbjct: 129 YTGAIGKWHLG-ETAEYHPNKRGFNDFYGFLGGGHKYF---PEEYKLQYKHQKEMGTKNI 184

Query: 208 RDLEPAWDLHGK------YSTDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPL 260
            D     + +G       Y TDV + E +      H   +P FLYLA+ A     P+ PL
Sbjct: 185 NDYVLPLEHNGAIVEENDYLTDVLSREGIRFTKEAHDKKKPFFLYLAYNA-----PHVPL 239

Query: 261 QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
           +A +  L   + IED  R  +AA+++ +D  VG++V +L++     N++I+F+SDNGG  
Sbjct: 240 EAKEKDLEKFKDIEDIDRRTYAAMVYAVDRGVGEIVASLKKTGQFDNTLIIFLSDNGGHT 299

Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
                + A+N+PL G K   WEGG R       P    +G   +  V   D  PT+
Sbjct: 300 G----HGANNYPLTGRKGDTWEGGFRVPMFFHWPKKIKKGQKFDYPVSALDLYPTI 351


>gi|32475143|ref|NP_868137.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH 1]
 gi|32445684|emb|CAD78415.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH 1]
          Length = 485

 Score =  181 bits (458), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 121/374 (32%), Positives = 187/374 (50%), Gaps = 56/374 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
           P+I+ I++DD G+ND+G  G + I TP +D LA  G  L N+Y     CTPSR++++TG+
Sbjct: 30  PNILLIVSDDQGYNDLGQLG-NGIITPALDRLAKEGTRLTNFYVAWPACTPSRASLLTGR 88

Query: 119 HPIHTGMQHNVL-----YG-----------CER-GGLPLSEKILPQYLKELGYRTRIVGK 161
           +P   G+   +      YG            ER GG+   E ILP  L+  GY++ I GK
Sbjct: 89  YPQRNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAVLRPAGYKSGIFGK 148

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           W LG  ++   PT RGF+   G+     DYF H    +      M R+LEP     G Y 
Sbjct: 149 WDLGALQR-MLPTSRGFDDFYGFVNTGIDYFTHERYGVPC----MVRNLEPTEADKGTYC 203

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP-----LQAPDHYLNIHRHIE-- 274
           T +F  EA+  +  H+ +EP FLY+   A H+++   P     +QAPD +  ++  +E  
Sbjct: 204 TYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLVPTIRSSVQAPDQFKAMYPPVEVE 263

Query: 275 ------------------DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
                               +R  + A +  +D ++G++++ LE ++ML  +I+VF SDN
Sbjct: 264 TRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEILDRLEAKQMLDETIVVFFSDN 323

Query: 317 GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPT 375
           GG+        A N PLRG K   WEGG+R   L+ W       G+V ++++   + LP+
Sbjct: 324 GGSG------GADNSPLRGHKAQTWEGGIRVPCLVRWPAGQIPAGVVNDEFLTSLELLPS 377

Query: 376 LLSAANKSDIPNYV 389
             +AA     P  V
Sbjct: 378 FAAAAGVEPPPGVV 391


>gi|227540472|ref|ZP_03970521.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
           spiritivorum ATCC 33300]
 gi|227239796|gb|EEI89811.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
           spiritivorum ATCC 33300]
          Length = 466

 Score =  181 bits (458), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 112/354 (31%), Positives = 185/354 (52%), Gaps = 26/354 (7%)

Query: 40  LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           +   FTL M+    V++   P+II IL+DD G+ D G +G   IPTP+IDALA  GI   
Sbjct: 7   IVFLFTLFMLTTGSVSAQQRPNIILILSDDGGYEDFGCYGSQDIPTPHIDALAKGGIRFT 66

Query: 100 N-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVL------YGCERGGLPLSEKILPQYLKEL 152
           N Y T  +C PSR+ ++ G++   +G +HNV       Y  +  GL  + + +   ++  
Sbjct: 67  NSYVTASVCAPSRAGLLMGQYQQRSGFEHNVSDLPADGYQIQDIGLSDTVRTIADQMQSN 126

Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF---DHSAEEMKMWGLDMRRD 209
           GY T  +GKWH G   K + P  +GF    G+  GH+ +F       +E K+       D
Sbjct: 127 GYETMAIGKWHQGNETKHH-PLHKGFNHFFGFIGGHRSFFPIRTAIKQEEKILNDYTEVD 185

Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
            +  +     Y TD+FT +A+  +     D+P F+YL++ A H+     P++A    L  
Sbjct: 186 EKDVY-----YLTDMFTDKAISYM-RQKRDKPYFIYLSYNAVHT-----PVEATPQKLAQ 234

Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
              +++  R  +A ++  +D+ +G++VE L++   L N++++F++DNG A      N A 
Sbjct: 235 FARLKNAHRRSYAGLMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAATN----NGAD 290

Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           N PLRG+K + WEGG+R   ++  P     G   +  V   D +PT ++AA  +
Sbjct: 291 NGPLRGLKGSKWEGGIRVPMIMHWPKQLQAGKDYKGLVSSLDLVPTCVAAAQGT 344


>gi|326433715|gb|EGD79285.1| hypothetical protein PTSG_12912 [Salpingoeca sp. ATCC 50818]
          Length = 562

 Score =  180 bits (457), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 123/405 (30%), Positives = 192/405 (47%), Gaps = 56/405 (13%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGL----DQIPTPNIDALAYSGIILKNYYTVQLCTPSR 111
           S+ PPHI+ +L DDLGW DVG+H        I TP ID L   GI L+ +Y  ++C+P+R
Sbjct: 49  SNQPPHILMVLVDDLGWADVGYHRSGPHKSDIQTPTIDKLVSQGIALERHYVHKVCSPTR 108

Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
           ++  +G+ P+H G+   V+    R G+P +   + Q+L + GY +  VGKW +G     +
Sbjct: 109 ASFQSGRLPVH-GIDGQVVLCAPRAGIPENMTTVAQHLNKAGYASHFVGKWDVGMATPSH 167

Query: 172 TPTFRGFESHLGY------------WTGHQDYFDH-----SAEEMK-MWGLDMRRDLEPA 213
           TP  RG+ + L Y            W G Q+   H     + +  K  W  D      PA
Sbjct: 168 TPHGRGYNTSLNYFGHANWMWNQDEWQGSQNNVSHRPPCKAPDCFKDFWDTD-----RPA 222

Query: 214 WDLHGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
            +L G  Y   +F     DII  H   +PLFL  A    H      PLQAP  Y     +
Sbjct: 223 HNLSGTLYEEQLFVQRITDIIEAHDPSQPLFLTYASKVAH-----YPLQAPIEYQQQFAN 277

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA----- 327
           IE   R  + A++  LD+++  +   ++++ M +N+++VF SDNGG    F+ +      
Sbjct: 278 IEPPSRRVYHAMVKYLDDNLYYITTLMQKKGMWNNTLMVFSSDNGGPVMAFDGDCDASHP 337

Query: 328 ------------ASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWL 373
                       A+N+PLRG K + +EGG+R    +   L+  + RG      +H++DW 
Sbjct: 338 SRGYMCYNGEAGANNYPLRGGKYSFFEGGIRVNAFVSGGLIPADQRGSNRTGIMHIADWY 397

Query: 374 PTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPR 418
            T  + A     P    +   N+ P   +++  Y  G   + SPR
Sbjct: 398 ATFCALAGVD--PTDHRAAAANLPPIDSHNMWPYLAGQARF-SPR 439


>gi|294053770|ref|YP_003547428.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613103|gb|ADE53258.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 491

 Score =  180 bits (457), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 115/334 (34%), Positives = 177/334 (52%), Gaps = 35/334 (10%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           M +  + L  T +     ++++   P+I+ ILADDLG++D+G+ G  +I +P ID LA +
Sbjct: 1   MTWFRIQLLATFACALTAVLSADERPNILIILADDLGYSDLGYTGSTEIESPVIDKLANN 60

Query: 95  GIILKN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGC--ERGGLPLSEKILPQYLKE 151
           G+I  N Y T   C PSR+ ++TG+H    GM+ N  Y    +  GLP+ E    + L+ 
Sbjct: 61  GVIFANGYVTHPYCGPSRAGLITGRHQARFGMEINATYSPFDQHMGLPVDEPTFAKRLQP 120

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF-----------------DH 194
            GYRT I+GKWHLG    ++ P  RGF+   G+ +G  DYF                 ++
Sbjct: 121 AGYRTGIIGKWHLG-AAPQFHPNNRGFDYFYGFLSGGHDYFPESVNTHLELVLPNGKPNY 179

Query: 195 SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA 254
            A E  +  L +R      +D    Y T   + +A   +   S+++P  LYLA+ A H+ 
Sbjct: 180 GANEGTLLPL-LRNKNAAEFD---DYLTTALSKDAARFVT--SSEQPFCLYLAYNAPHT- 232

Query: 255 NPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
               PLQAP   +  + HI+D KR  +AA++ ++D  +G VV+ALEQ   L N++I F+S
Sbjct: 233 ----PLQAPKETIAKYSHIKDPKRRIYAAMIDEMDAGIGLVVDALEQSGKLDNTLIFFLS 288

Query: 315 DNGGA---AAGFNLNAASNWPLRGVKNTLWEGGV 345
           DNGG    A     + A N P +  K + +EGG 
Sbjct: 289 DNGGCVPQAWDSYSDFADNGPFKDGKTSFYEGGT 322


>gi|355689580|gb|AER98880.1| galactosamine -6-sulfate sulfatase [Mustela putorius furo]
          Length = 503

 Score =  180 bits (457), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 114/351 (32%), Positives = 176/351 (50%), Gaps = 43/351 (12%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRS 112
           V +  PP+++ +L DD+GW D+G +G     TPN+D +A  G++  N+Y+   LC+PSR+
Sbjct: 6   VGAPKPPNLVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRA 65

Query: 113 AIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
           A++TG+ PI  G         N     E  GG+P  E++LP+ LK  GY ++IVGKWHLG
Sbjct: 66  ALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPAEERLLPELLKGAGYASKIVGKWHLG 125

Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS---- 221
            ++ ++ P  RGF+   G    H   +D+ A       + + RD    W++ G+Y     
Sbjct: 126 -HRPQFHPLKRGFDEWFGSPNCHFGPYDNRARP----NIPVYRD----WEMVGRYYEEFP 176

Query: 222 ----------TDVFTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
                     T ++T EA+D +   H+   P FLY A  ATH+     P+ A   +L   
Sbjct: 177 INLKTGEANLTQLYTQEALDFVQRQHAARRPFFLYWAIDATHA-----PVYASRPFLGTS 231

Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
           R      R ++   + ++D+SVGK++  L   R+  ++ + F SDNG A         SN
Sbjct: 232 R------RGRYGDAVREVDDSVGKILSLLRDLRIAEDTFVFFTSDNGAALISAPKQGGSN 285

Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            P    K T +EGG+R   + W P     G V+ Q   + D   T LS A 
Sbjct: 286 GPFLCGKQTTFEGGMREPAIAWWPGRVPAGQVSHQLGSIMDLFTTSLSLAG 336


>gi|325108643|ref|YP_004269711.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324968911|gb|ADY59689.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 484

 Score =  180 bits (456), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 122/388 (31%), Positives = 197/388 (50%), Gaps = 59/388 (15%)

Query: 42  LAFTLSMVFVDL-VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
           L  TL+   +    A + PP+I+ I++DD G+ND+G    D I TP++D LA  G  L +
Sbjct: 9   LCLTLAAFCLPFSCAEAAPPNILLIVSDDQGYNDLGVLNSDLI-TPHLDRLAAEGTRLTD 67

Query: 101 YYTV-QLCTPSRSAIMTGKHPIHTGMQHNVL-----YG-----------CER-GGLPLSE 142
           +Y     CTPSR++++TG++P   G+   +      YG            ER GG+   E
Sbjct: 68  FYVAWPACTPSRASLLTGRYPQRNGIYDMIRNEAPDYGYKYKPAEYEVSFERIGGMDQRE 127

Query: 143 KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW 202
           K+LP+YLK+LGY + I GKW LG  K+ + PT RGF+   G+     DYF H     + +
Sbjct: 128 KLLPEYLKKLGYTSAIFGKWDLGSLKR-FLPTNRGFDEFYGFVNTGIDYFTH-----ERY 181

Query: 203 GL-DMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP-- 259
           G+  M R      +  G+Y+T++F  EA+  +      EP  LYL   A H+++  +P  
Sbjct: 182 GVPSMFRQTSLTEEDRGEYATELFKREALAFLDRAEASEPFLLYLPFNAPHNSSSLDPRI 241

Query: 260 ---LQAPDHYLNIHRHIEDFKRS--------------------KFAAILHKLDESVGKVV 296
              +QAP+ Y  ++  ++   R+                     + A +  +D ++G+V+
Sbjct: 242 RSTVQAPEEYKAMYPAVDPEYRTVNRYRYGNPAQVVTPVARQRDYRAAVTCMDAAIGEVL 301

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPL 355
             LE + +L N+I++F SDNGG+        A N PLRG K  +WEGG+R   L+ W   
Sbjct: 302 GKLEDKGVLDNTIVIFFSDNGGSG------GADNSPLRGHKAQVWEGGIRVPCLVRWPDG 355

Query: 356 LESRGIVAEQYVHVSDWLPTLLSAANKS 383
               G   + ++   + +P+L+ AA  +
Sbjct: 356 GVPAGATNDSFLSSLELVPSLVKAAGGT 383


>gi|340367689|ref|XP_003382386.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 493

 Score =  180 bits (456), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 111/336 (33%), Positives = 170/336 (50%), Gaps = 35/336 (10%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           A S  P+ +F+L DD G+ DV F     I +PN + LA  G+IL  +Y  + C+PSR++ 
Sbjct: 17  AFSTKPNFVFVLVDDWGYADVSFRN-PAIHSPNFEKLAKEGLILDRHYVFKYCSPSRASF 75

Query: 115 MTGKHPIHTGMQH---NVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
           +TG+ P H    +   + L G       L   ++P  LK   Y+T ++GKWH G YK+ Y
Sbjct: 76  LTGRWPHHAHQWNPPEDALVGAN-----LKMTMIPAKLKLARYKTHMIGKWHEGLYKEAY 130

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAE-EMKMW---GLDMRRDLEPAWDLHGKYSTDVFTA 227
            P  RGF++  G+  G +++ +         W   G D R         +G Y    +  
Sbjct: 131 LPINRGFDTMSGFLGGGENHMNQQVGCATDFWKNDGPDSR---------NGSYDAYTYRD 181

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK-RSKFAAILH 286
           +  DII NH+  +P FLYL     H+     PLQAPD +LNI+      + R  + A++ 
Sbjct: 182 DLTDIITNHNPSDPFFLYLPLHNVHT-----PLQAPDEWLNIYPANSTCQNRRIYQAMVS 236

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
             D   G +V+ L+   M  N+++V  +DNGGA  G      SN+PL+G K T +EGGVR
Sbjct: 237 VADNVTGHLVKLLQDNGMWDNTLLVISADNGGAPCG-----GSNYPLKGSKGTFYEGGVR 291

Query: 347 GAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAA 380
               +   LL +  +G   + ++H++DW  T    A
Sbjct: 292 SLAFVNGGLLPTNRKGKSTQGFIHIADWYTTFCQLA 327


>gi|340373299|ref|XP_003385179.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 508

 Score =  179 bits (455), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 118/378 (31%), Positives = 191/378 (50%), Gaps = 47/378 (12%)

Query: 43  AFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFH---GLDQIPTPNIDALAYSGIIL 98
           AF L      L     P P+I+ +L DD GW +VG+H      ++ TPNID+L   G+ L
Sbjct: 7   AFILLCTTACLAVPPPPGPNIVMMLVDDWGWANVGYHRNPPTKEVVTPNIDSLVRQGLEL 66

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNV---LYGCER-----GGLPLSEKILPQYLK 150
             +Y   +C+PSRS++M+G+ PIH     N+    Y  +        +P +   + Q +K
Sbjct: 67  DQHYVFNVCSPSRSSLMSGRLPIHVN-DLNIEPDYYNPDDPVSGFSAIPRNMTGIAQKMK 125

Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFD-------HSAEEMKMWG 203
             GY T  VGKW  G     +TP  RGF+S  GY+    D++        +  + + +W 
Sbjct: 126 LGGYDTHQVGKWDAGMATHTHTPKGRGFDSSFGYFHHANDFYTEIDGKPCNKTKIVDIWV 185

Query: 204 LDMRRDLEPAWDLHGK----YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
            D     +P + L+G     Y   +F  + + +++ H T +PLFLY A    H+     P
Sbjct: 186 TD-----KPGYGLNGTGPDNYEEGLFKEQLLKVVNEHDTGKPLFLYYAPHIVHA-----P 235

Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           LQ P  Y +    I+D  R  + A+++ LD+ VG++V+AL+++ +  N + V  SDNGG 
Sbjct: 236 LQVPQRYQDKFSFIDDHDRQIYHAMVNYLDDVVGELVDALKKKGLWENLLFVTSSDNGGP 295

Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLL 377
              +    A+N+PL+G K + W+GG+R    +    L  + RG   + Y+H++DW  T  
Sbjct: 296 V--YPGGGANNYPLKGGKASDWQGGIRSNAFVSGGYLPEKMRGKKLDGYIHLADWYATFC 353

Query: 378 SAAN---------KSDIP 386
           + AN         K+D+P
Sbjct: 354 ALANVDPTDEAAAKADLP 371


>gi|149199924|ref|ZP_01876952.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149136993|gb|EDM25418.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 455

 Score =  179 bits (454), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 136/371 (36%), Positives = 197/371 (53%), Gaps = 32/371 (8%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGL--DQIPTPNIDAL 91
           I +F  L LA  L + FV L  S   P+II IL+DD G+ DV ++    D I TP+ DAL
Sbjct: 2   INSFTKLFLAL-LCVNFVALADSK--PNIIVILSDDQGYADVSYNPEHDDYISTPHTDAL 58

Query: 92  AYSGIIL-KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGL--PLSEKILPQY 148
           A SG+I  + Y +  +C+ +RS +MTG++      Q   +Y    GG    L+ K +P Y
Sbjct: 59  AKSGVIFHRGYTSGSVCSTTRSGLMTGRY-----QQRYGIYTAGEGGTGTDLNAKFIPNY 113

Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT-GHQDYFDHSAEEMKMWGLDMR 207
           LKE GY++   GKWHLG ++ +Y P  RGF+   G+   G  D+F    E    +G  + 
Sbjct: 114 LKEAGYKSMAFGKWHLG-HEMKYHPLHRGFDDFYGFMGRGAHDFFRLEKEYDGKFGGPIY 172

Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
           R LEP  D    Y T   T E V  I  +  D+P F Y+A+ A H+     P QAP    
Sbjct: 173 RGLEPIDD--KGYLTTRITEETVKFIEENK-DKPFFAYVAYNAVHT-----PAQAPAE-- 222

Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
           +I     D  R    A+L  LD  VG++V+ L++  +  N+II+++SDNGGA +      
Sbjct: 223 DIKAVSGDETRDILVAMLKHLDLGVGEIVKTLKKHDIYENTIIIYLSDNGGAKS----MV 278

Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           A+N PLRGVK+ +++GG+R   L+  P     G   +  V   D LPTLL AA    +P 
Sbjct: 279 ANNKPLRGVKHDIYDGGIRVPFLMSWPAQIKAGQDTQSPVISLDILPTLLDAAG---LPA 335

Query: 388 YVNSTVENIIP 398
             +   E+++P
Sbjct: 336 LSDIDGESMLP 346


>gi|340369113|ref|XP_003383093.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 374

 Score =  179 bits (454), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 108/324 (33%), Positives = 170/324 (52%), Gaps = 28/324 (8%)

Query: 43  AFTLSMVFVDLVASSGP----PHIIFILADDLGWNDVGFH---GLDQIPTPNIDALAYSG 95
              + +  + L + + P    PHI+ +L DD GW +VG+H      ++ TPNID L   G
Sbjct: 4   VLAVGVFLLGLASGASPNANLPHIVMMLVDDWGWANVGYHRNPPTCEVVTPNIDKLVKEG 63

Query: 96  IILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL---YGCER-----GGLPLSEKILPQ 147
           + L  +Y  + C+PSRS++++G+ PIH   Q N+L   Y  +        +P +   + +
Sbjct: 64  LELNQHYVYKFCSPSRSSLISGRLPIHVNDQ-NLLPTNYNPDDPVSGFSAIPRNMTGIAE 122

Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM-KMWGLDM 206
            ++  GY T  VGKW  G    ++TP  RGF++  GY+    DY+  +A    +   +D+
Sbjct: 123 KMRGAGYATHQVGKWDAGMATSDHTPKGRGFQTSFGYFHHANDYYKETASACNRTQIVDL 182

Query: 207 RRDLEPAWDLHGK----YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA 262
               +PA  ++G     Y   +F    +D++ NH    PLFLY A    H+     PLQ 
Sbjct: 183 WETDKPAHGINGTGPDNYEEGLFKERVLDVVSNHDPSTPLFLYYAPHIVHT-----PLQV 237

Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
           PD YL +   I D  R  + A+++ LD+ VG++ +AL+ + M  N + V  SDNGG    
Sbjct: 238 PDRYLKLFSFINDHDRQYYHAMVNYLDDVVGEITDALKNKGMWDNLLFVTSSDNGGPV-- 295

Query: 323 FNLNAASNWPLRGVKNTLWEGGVR 346
           +    A+N+PLRG K T W+GGVR
Sbjct: 296 YPGGGANNYPLRGGKVTDWQGGVR 319


>gi|388257120|ref|ZP_10134300.1| sulfatase [Cellvibrio sp. BR]
 gi|387939324|gb|EIK45875.1| sulfatase [Cellvibrio sp. BR]
          Length = 474

 Score =  179 bits (453), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 113/357 (31%), Positives = 186/357 (52%), Gaps = 28/357 (7%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
            + L LA  +   +    A    P+I+FIL+DD G+ D GF G  +IPTPN+D LA  G+
Sbjct: 13  LSYLGLALLICSAWACYGAELAKPNIVFILSDDAGYADFGFQGSTEIPTPNLDQLAQEGV 72

Query: 97  ILKN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG--------CERGGLPLSEKILPQ 147
           + K  Y +  +C PSR+ ++TGK+P   G + N + G         +  G+ L +  +  
Sbjct: 73  VFKQAYVSASVCGPSRAGLLTGKYPQRFGFEENNVPGYMSSSGATGDDMGMRLDQLTMAN 132

Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR 207
           YL E GYRT ++GKWH G  +  + P  RGF+   G+  G + YF  +           R
Sbjct: 133 YLAERGYRTSLIGKWHQG-NEDRFHPLKRGFDEFFGFRGGARSYFPFTQAHPS----SRR 187

Query: 208 RD-LEPAWDLHGK---YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
            D LE  ++ +G+   Y TD    E ++ I   +  +P F +L+ +A     P+ P++A 
Sbjct: 188 EDFLERNFNNYGESPLYLTDALANETIEFI-KRNKHQPFFTFLSLSA-----PHYPMEAE 241

Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
              L+   H+   +R  +AA++  +D ++G+V++ L+Q  +  N++++F +DNGG +   
Sbjct: 242 KSDLDQFPHLTG-ERKIYAAMMLNMDRAIGRVLQTLQQEGLSENTLVIFTNDNGGPS--- 297

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           + N + N PL G K  L EGG+R   ++  P +   G   +  V   D LPT  +AA
Sbjct: 298 DHNGSINLPLSGTKANLLEGGIRVPMIMRWPGVTKPGSSYDPMVSTLDLLPTFYAAA 354


>gi|119587161|gb|EAW66757.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
           mucopolysaccharidosis type IVA), isoform CRA_a [Homo
           sapiens]
          Length = 708

 Score =  179 bits (453), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 113/362 (31%), Positives = 179/362 (49%), Gaps = 42/362 (11%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS   +    +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  N+
Sbjct: 13  LLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNF 72

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELG 153
           Y+   LC+PSR+A++TG+ PI  G      +           GG+P SE++LP+ LK+ G
Sbjct: 73  YSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG 132

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + RD    
Sbjct: 133 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD---- 183

Query: 214 WDLHGKYS--------------TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
           W++ G+Y               T ++  EA+D I   +   P FLY A  ATH+     P
Sbjct: 184 WEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----P 238

Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           + A   +L         +R ++   + ++D+S+GK++E L+   +  N+ + F SDNG A
Sbjct: 239 VYASKPFLGTS------QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAA 292

Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
                    SN P    K T +EGG+R   L W P   + G V+ Q   + D   T L+ 
Sbjct: 293 LISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLAL 352

Query: 380 AN 381
           A 
Sbjct: 353 AG 354


>gi|160891516|ref|ZP_02072519.1| hypothetical protein BACUNI_03967 [Bacteroides uniformis ATCC 8492]
 gi|317478375|ref|ZP_07937539.1| sulfatase [Bacteroides sp. 4_1_36]
 gi|156858923|gb|EDO52354.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
 gi|316905534|gb|EFV27324.1| sulfatase [Bacteroides sp. 4_1_36]
          Length = 525

 Score =  179 bits (453), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 110/349 (31%), Positives = 177/349 (50%), Gaps = 30/349 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGK 118
           P+I+ ++ADD+GW DVG+ G   + TPNIDALA  G+   + Y +  +  PSR+ I+TG 
Sbjct: 32  PNIVLVIADDMGWGDVGYQGAVDVSTPNIDALARRGVQFSQGYVSCSISGPSRAGILTGV 91

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +    G  +N+    +   +P  +  L + +++ GY T  VGKWH+     E +P  RGF
Sbjct: 92  YQQRFGFYNNLHPWAK---IPEGQSTLGEMVRDCGYATGFVGKWHMA-DSPEQSPNRRGF 147

Query: 179 ESHLGYWTGHQDYFDHS----AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           +   G+W+   DY+  +     E      L    +++P     G+Y TD FT EAV+ I 
Sbjct: 148 DQFYGFWSDTHDYYRSTDKPGVELYDFCPLYRNGEIQPPLHESGEYITDCFTREAVEFID 207

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI--HRHIEDFKRSKFAAILHKLDESV 292
            H++  P  L L++ A HS     P Q P+HY+N    R      R  FAA++  LD+ +
Sbjct: 208 KHAS-SPFLLCLSYNAVHS-----PWQVPEHYVNRLEGRRFHHEDRKVFAAMVLALDDGI 261

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGG-------AAAGF------NLNAASNWPLRGVKNT 339
           G+V+E+L +  +  N++ + +SDNG         + G+      N   +S  P RG K  
Sbjct: 262 GRVMESLRKNGLEENTLFILISDNGSPRGQGIECSTGYEYKDRGNTTMSSPGPFRGYKAD 321

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
            +EGG+R   ++  P    +G+V +  V   D  PT++ A   +    Y
Sbjct: 322 TYEGGIRVPYIMSWPSELPQGMVYDNPVISLDIFPTVMQAVGGTSRQKY 370


>gi|4503899|ref|NP_000503.1| N-acetylgalactosamine-6-sulfatase precursor [Homo sapiens]
 gi|462148|sp|P34059.1|GALNS_HUMAN RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|618426|gb|AAC51350.1| N-acetylgalactosamine 6-sulphatase [Homo sapiens]
 gi|870751|dbj|BAA04535.1| N-acetylgalactosamine 6-sulfate sulfatase [Homo sapiens]
 gi|33440495|gb|AAH56151.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
 gi|37589093|gb|AAH50684.2| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
 gi|119587163|gb|EAW66759.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
           mucopolysaccharidosis type IVA), isoform CRA_c [Homo
           sapiens]
          Length = 522

 Score =  179 bits (453), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 113/362 (31%), Positives = 179/362 (49%), Gaps = 42/362 (11%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS   +    +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  N+
Sbjct: 13  LLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNF 72

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELG 153
           Y+   LC+PSR+A++TG+ PI  G      +           GG+P SE++LP+ LK+ G
Sbjct: 73  YSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG 132

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + RD    
Sbjct: 133 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD---- 183

Query: 214 WDLHGKYS--------------TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
           W++ G+Y               T ++  EA+D I   +   P FLY A  ATH+     P
Sbjct: 184 WEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----P 238

Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           + A   +L         +R ++   + ++D+S+GK++E L+   +  N+ + F SDNG A
Sbjct: 239 VYASKPFLGTS------QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAA 292

Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
                    SN P    K T +EGG+R   L W P   + G V+ Q   + D   T L+ 
Sbjct: 293 LISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLAL 352

Query: 380 AN 381
           A 
Sbjct: 353 AG 354


>gi|323448512|gb|EGB04410.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 572

 Score =  179 bits (453), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 119/362 (32%), Positives = 172/362 (47%), Gaps = 48/362 (13%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFH-------GLDQIPTPNIDALAYSGIILKNYYTVQLC 107
            ++ PPHII I+ADDLGW+ V +H         D   TP +DAL  +G+ L+ +Y  Q C
Sbjct: 31  GAAPPPHIIHIVADDLGWSYVDWHRDPRTADPPDTPLTPRLDALRRAGVTLERHYAFQYC 90

Query: 108 TPSRSAIMTGKHPIHTGMQHNV---------LYGCERGGLPLSEKILPQYLKELGYRTRI 158
           +PSRSA+++G++PI   +Q+ V         + G +  G+P +   L + L+  GYRT  
Sbjct: 91  SPSRSALLSGRNPISVNVQNVVPEVSNPEDPVGGWQ--GIPTAMTGLAEVLRRRGYRTEA 148

Query: 159 VGKWHLGFY-KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK------MWGLDMRRDLE 211
           VGKW +G      + P  RGF++  GYW    DY+  + E         +W  +   D  
Sbjct: 149 VGKWDVGMATAAHHHPRARGFDAWTGYWHHANDYWTFAEETCDGEAVQDLWHYNATFDGP 208

Query: 212 PAWDLHGK----------------YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSAN 255
             W  +G                 Y   +    AV  +  H    PLFLY A    H   
Sbjct: 209 ATWLRNGPSCTQGNQTHTAASPCVYEEQLLADHAVRAVEEHDDTAPLFLYWALHLVHM-- 266

Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
              PLQ PD YL     I D  R    A++   D+ VG++V+ALE+R ML+N+++ F SD
Sbjct: 267 ---PLQVPDAYLARFDGIPDEPRRFMTAMVRYADDEVGRLVDALERRGMLANAVVTFHSD 323

Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWL 373
           NGG   G      +NWPL G K + +EGG+R    +    +    RG    Q   + DW 
Sbjct: 324 NGGEIMGAGACGGNNWPLTGGKFSNFEGGIRVNAFVSGGAVPEARRGAALSQLSTLWDWY 383

Query: 374 PT 375
            T
Sbjct: 384 AT 385


>gi|189069200|dbj|BAG35538.1| unnamed protein product [Homo sapiens]
          Length = 522

 Score =  179 bits (453), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 113/362 (31%), Positives = 179/362 (49%), Gaps = 42/362 (11%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS   +    +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  N+
Sbjct: 13  LLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAGGLLFPNF 72

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELG 153
           Y+   LC+PSR+A++TG+ PI  G      +           GG+P SE++LP+ LK+ G
Sbjct: 73  YSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG 132

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + RD    
Sbjct: 133 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD---- 183

Query: 214 WDLHGKYS--------------TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
           W++ G+Y               T ++  EA+D I   +   P FLY A  ATH+     P
Sbjct: 184 WEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----P 238

Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           + A   +L         +R ++   + ++D+S+GK++E L+   +  N+ + F SDNG A
Sbjct: 239 VYASKPFLGTS------QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAA 292

Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
                    SN P    K T +EGG+R   L W P   + G V+ Q   + D   T L+ 
Sbjct: 293 LISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLAL 352

Query: 380 AN 381
           A 
Sbjct: 353 AG 354


>gi|149196020|ref|ZP_01873076.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149140867|gb|EDM29264.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 462

 Score =  179 bits (453), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 117/371 (31%), Positives = 189/371 (50%), Gaps = 29/371 (7%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L F LS+    +V +S  P++I I+ADDLG+ DVGF+G   IPTP ID++A +G+   + 
Sbjct: 5   LLFILSLGLSLIVQASSKPNLIVIMADDLGYADVGFNGCKDIPTPGIDSIANNGVKFSSG 64

Query: 102 YT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCE--RGGLPLSEKILPQYLKELGYRTRI 158
           YT   +C PSR+  +TG++    G + N  +        LP SE  + + L+++GY   I
Sbjct: 65  YTSYSVCGPSRAGFITGRYQQRFGFERNPQWNLTDPNSALPKSEMTIAESLQQVGYHCGI 124

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF---------DHSAEEMKMWGLDMRRD 209
           +GKWHLG  +    P  RGF    G+  G   YF         +    EM  +   + R+
Sbjct: 125 IGKWHLG-AEPSLRPNQRGFNEFFGHLGGGHAYFPEKLRIIKTEDVKNEMDSYASYITRN 183

Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
             P      KY TD F+ EA+  +  +  ++P FL+L++ A H+     PLQA   YL+ 
Sbjct: 184 DTPVKTT--KYLTDEFSDEAIRFVEKN-YEQPFFLFLSYNAPHT-----PLQATQKYLDR 235

Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
             HIED  R  + A++  +D+ + ++++ L+  ++  ++I+ F+SDNGG       N + 
Sbjct: 236 FPHIEDQNRRTYCAMVSAMDDGIVRLMKKLDALKIKDDTIVFFLSDNGGPTTK---NNSD 292

Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYV 389
           N  LRG K  ++EGG R    +  P     G   +  V   D   T+ S A     P + 
Sbjct: 293 NGDLRGKKGDVFEGGFRVPFAMQYPRELKPGQTYDLPVSSLDIFATIASLAQS---PTHA 349

Query: 390 NSTVE--NIIP 398
           +  ++  N++P
Sbjct: 350 DKPLDGVNLLP 360


>gi|421613760|ref|ZP_16054831.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408495346|gb|EKJ99933.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 485

 Score =  179 bits (453), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 121/375 (32%), Positives = 190/375 (50%), Gaps = 58/375 (15%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
           P+I+ I +DD G+ND+G  G + I TP +D LA  G  L N+Y     CTPSR++++TG+
Sbjct: 30  PNILLIASDDQGYNDLGQLG-NGIITPALDRLAKEGTRLTNFYVAWPACTPSRASLLTGR 88

Query: 119 HPIHTGMQHNVL-----YG-----------CER-GGLPLSEKILPQYLKELGYRTRIVGK 161
           +P   G+   +      YG            ER GG+   E ILP  L+  GY++ I GK
Sbjct: 89  YPQRNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAMLRPAGYKSGIYGK 148

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD-MRRDLEPAWDLHGKY 220
           W LG  ++   PT RGF+   G+     DYF H     + +G+  M R+LEP     G Y
Sbjct: 149 WDLGALQR-MLPTSRGFDDFYGFVNTGIDYFTH-----ERYGVPCMVRNLEPTEADKGTY 202

Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP-----LQAPDHYLNIHRHIE- 274
            T +F  EA+  +  H+ +EP FLY+   A H+++  +P     +QAPD +  ++  +E 
Sbjct: 203 CTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLDPTIRSSVQAPDQFKAMYPPVEV 262

Query: 275 -------------------DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
                                +R  + A +  +D ++G++++ LE ++ML  +++VF SD
Sbjct: 263 ETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEMLDRLEAKQMLDETMVVFFSD 322

Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLP 374
           NGG+        A N PLRG K   WEGG+R   L+ W       G+V ++++   + LP
Sbjct: 323 NGGSG------GADNSPLRGHKAQTWEGGIRVPCLVRWPAGKIPAGVVNDEFLTSLELLP 376

Query: 375 TLLSAANKSDIPNYV 389
           +  +AA     P  V
Sbjct: 377 SFAAAAGVEPPPGVV 391


>gi|149198444|ref|ZP_01875489.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149138450|gb|EDM26858.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 458

 Score =  178 bits (452), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 109/316 (34%), Positives = 176/316 (55%), Gaps = 24/316 (7%)

Query: 47  SMVFVDLVASS---GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYY 102
           + +F+ L+  +     P++I IL DD G+ DVGF+G   IPTP++D++A +G+  +  + 
Sbjct: 4   TFIFLYLILMNSIIAGPNLIIILTDDQGYQDVGFNGCKDIPTPHLDSIAQNGVNCIDAHV 63

Query: 103 TVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER--GGLPLSEKILPQYLKELGYRTRIVG 160
           +  +C PSR+ ++TG++    G   N     E    GLPL EK + + LKE+GY + I+G
Sbjct: 64  SYPVCGPSRAGLLTGRYQDRFGFTTNPTVNPENPIAGLPLEEKNIAEVLKEVGYSSSIIG 123

Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS------AEEMKMWG---LDMRRDLE 211
           KWH+G +   + P  RGF+   G+ +G  DYF         +E  ++W      + RD E
Sbjct: 124 KWHMGTHPIHH-PLNRGFDHFFGFLSGGHDYFPAKYNLKDLSEVKRIWDWYRTHLIRDRE 182

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
                 G Y TD+ T  AVD I   ++++ P  LYL++ A H+     PLQA + YL   
Sbjct: 183 RIQVSEG-YLTDILTDAAVDFIDKKASEKKPFMLYLSYNAPHT-----PLQASEKYLKRF 236

Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
            HI+D KR  +AA++  +D+ VG+V+  L++ ++  ++++ F+SDNGG   G  +    N
Sbjct: 237 THIKDSKRRTYAAMVSAVDDGVGRVMARLKKHKIEEDTLVFFLSDNGGPLKG-PMPFTDN 295

Query: 331 WPLRGVKNTLWEGGVR 346
            P    K +L EGG R
Sbjct: 296 GPFSKGKGSLHEGGTR 311


>gi|383114003|ref|ZP_09934770.1| hypothetical protein BSGG_4804 [Bacteroides sp. D2]
 gi|382948682|gb|EFS34104.2| hypothetical protein BSGG_4804 [Bacteroides sp. D2]
          Length = 496

 Score =  178 bits (452), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 120/335 (35%), Positives = 182/335 (54%), Gaps = 31/335 (9%)

Query: 60  PHIIFILADDLGWNDVG-FHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTG 117
           P+I+ ILADD G+  V  +  + +I TPNID LA SG+  ++ Y +  L +P+R+ +MTG
Sbjct: 37  PNIVIILADDQGYGGVNCYPHIKKIVTPNIDKLAASGVQCMQGYTSGHLSSPTRAGLMTG 96

Query: 118 KHPIHTGMQHNVLYGCER---GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           K+      Q    YG      GG+P  +K+L +YL E GY T  +GKWHLG Y + + P 
Sbjct: 97  KY-----QQSFGFYGLSTPHVGGIPQDQKLLSEYLVENGYNTACIGKWHLGDYIRSH-PN 150

Query: 175 FRGFESHLGYWTGHQDYFD---HSAEEMKMWGLDMRRD-LEPAWDLHGKYSTDVFTAEAV 230
            RGF++  G+  G  DY+D     + +    GL    D +EP  ++  +YST  +T  AV
Sbjct: 151 NRGFQTFFGFINGLHDYYDPLVGGSWDGVYNGLAFTLDNMEPVTEM--EYSTYEYTKRAV 208

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDFKRSKFA-AILHKL 288
           D I  ++ D P FLYL + A HS     PLQAP+  +  +  + ++  +   A A+   L
Sbjct: 209 DFIQKNA-DHPFFLYLPYNAIHS-----PLQAPEELIGELAINPQEIGKDDIARAMTFAL 262

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           D+ VGKVVE LEQ  +  N+II ++SDNG       +  +  W  RG K + +EGG+R  
Sbjct: 263 DQGVGKVVETLEQLGLRDNTIIFYLSDNGA------VEYSDKWEFRGRKGSYYEGGIRVP 316

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            ++  P   ++G +  + V   D  PT++  A  S
Sbjct: 317 FIVSYPAKLAKGTIYNKPVMSIDIAPTVMELAGLS 351


>gi|414068777|ref|ZP_11404774.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410808616|gb|EKS14585.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 480

 Score =  178 bits (452), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 119/338 (35%), Positives = 173/338 (51%), Gaps = 22/338 (6%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
           AS   P+II I+ADDLG+ DVGF+G   I TPNID LA SG    + Y     C PSR+A
Sbjct: 25  ASEQKPNIILIVADDLGYADVGFNGSKDIITPNIDDLAKSGTSFSDAYVAHPFCGPSRAA 84

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           +MTG++P   G Q N+       G+P   K + + L E  Y T  +GKWH+G    +Y P
Sbjct: 85  LMTGRYPHKIGSQFNLPTRGSNVGVPTDAKFISKLLNENNYFTGALGKWHMG-DAPQYHP 143

Query: 174 TFRGFESHLGYWTGHQDYF----DHSAEEMKMWGLDMRRDLEPAWDLHGK------YSTD 223
             RGF+ + G+  G  +YF        ++ +  GL    +     + +GK      Y TD
Sbjct: 144 NKRGFDEYYGFLGGGHNYFPDQYQPQYKKQQAQGLKNIFEYITPLEHNGKEVKETQYITD 203

Query: 224 VFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
             + EAV+ +      + P FLYLA+ A     P+ PLQA D  + +  +I++  R  +A
Sbjct: 204 ALSREAVNFVDKAVNKKNPFFLYLAYNA-----PHVPLQAKDEDMAMFPNIKNKDRKTYA 258

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
            +++ +D  VGK+VE L++     N++IVF+SDNGG  +      A+N+PL+  K +  E
Sbjct: 259 GMVYAVDRGVGKLVETLKKNNQYDNTLIVFMSDNGGKLS----KGANNFPLKAGKGSTQE 314

Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           GG R   L   P     G      V   D  PT  + A
Sbjct: 315 GGFRVPMLFHWPKHVPAGKRFSHPVSALDLYPTFAALA 352


>gi|410446790|ref|ZP_11300893.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
 gi|409980462|gb|EKO37213.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
          Length = 540

 Score =  178 bits (452), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 126/364 (34%), Positives = 187/364 (51%), Gaps = 38/364 (10%)

Query: 44  FTLSMV-FVDLV-ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           F +S+V F  L  A    P+II  L DDLGW D+   G   I TP ID+L   G+ L  +
Sbjct: 6   FIVSLVTFCALANAEEKRPNIIIFLVDDLGWADISLRGA-PIDTPAIDSLFSEGLTLDRF 64

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           YT  +C+P+R+A+MTG+ P+  G+ ++V+      G+   E  +P+  K  GY+T +VGK
Sbjct: 65  YTTPICSPTRAALMTGRDPLRLGISYSVVMPWMNNGVHPDEHFMPESFKAAGYQTAMVGK 124

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           WHLG  ++ + P  RGF+   G+      YF   A +    G+D +R+     D    Y 
Sbjct: 125 WHLGHSQEIFHPNARGFDDFYGHLHTEVGYFLPFANQG---GVDFQRNGVTIAD--EGYE 179

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK---- 277
           T +   EA   I     D+P FLY+   A HS     PL+APD  +  + ++ED +    
Sbjct: 180 TFLLADEASRWIKARDKDKPFFLYMPFIAPHS-----PLEAPDDLVKKYENLEDTRELTR 234

Query: 278 -----------------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
                            R  +AA++  LD+++G+V++ L +  +   +II+F SDNGGAA
Sbjct: 235 SAAIDRTRTLNGFSPSARPMYAAVVDGLDQAIGQVLDTLTKEGIEEETIILFSSDNGGAA 294

Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVR-GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
                  A N+PLRG K   +EGG+R  A + W   L++ G   E  + V D  PTL SA
Sbjct: 295 Y--AGGGADNFPLRGGKGDTYEGGIRVVAAMRWKGTLDA-GKSFESIMTVMDVFPTLASA 351

Query: 380 ANKS 383
           A  S
Sbjct: 352 AGVS 355


>gi|260816809|ref|XP_002603280.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
 gi|229288598|gb|EEN59291.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
          Length = 377

 Score =  178 bits (452), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 104/242 (42%), Positives = 136/242 (56%), Gaps = 13/242 (5%)

Query: 38  AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
            VL LA             +  PHI+ I+ADDLGW+DV ++    + TPN+  LA +G+I
Sbjct: 3   VVLFLAIVGCFFGHGTAQDTAKPHILLIVADDLGWSDVSWNN-PYVVTPNLHTLATTGVI 61

Query: 98  LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
               Y    C+PSR+A++TGK P   GMQ  V+   +  GLPL E++LPQ LK+LGY T 
Sbjct: 62  FNQTYAQPTCSPSRTALLTGKFPFRLGMQR-VMDSKKPHGLPLDEELLPQKLKKLGYATH 120

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL 216
           +VGKWHLG  K EYTPT RGF+S  GY  G QDY+ H SA  +  W  D +  +    D 
Sbjct: 121 MVGKWHLGSCKWEYTPTERGFDSFYGYHHGSQDYYTHKSARGLDFW--DGKTSIS---DQ 175

Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
           +G YST+ F   A +II  H  + PLFLYL   + H+     P Q P  YL     I+D 
Sbjct: 176 NGVYSTESFATRAENIISQHDPNTPLFLYLPFQSVHT-----PHQVPSSYLQTFSTIQDD 230

Query: 277 KR 278
            R
Sbjct: 231 NR 232


>gi|315644664|ref|ZP_07897795.1| sulfatase [Paenibacillus vortex V453]
 gi|315279923|gb|EFU43222.1| sulfatase [Paenibacillus vortex V453]
          Length = 439

 Score =  178 bits (451), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 110/338 (32%), Positives = 183/338 (54%), Gaps = 26/338 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+ + +  DDLG+ D+G +G D + TP++D LA  G+   N+Y+   +C+PSR++++TGK
Sbjct: 4   PNFVIVYCDDLGYGDLGCYGSDSVRTPHLDGLADEGVRFTNWYSNSPVCSPSRASLLTGK 63

Query: 119 HPIHTGMQHNVLYGCERG--GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +P+  G+    + G +RG  GLP +E  L + LK  GYRT + GKWHLG   KE +P   
Sbjct: 64  YPVRAGVGE--ILGAKRGSHGLPAAEVTLAKALKPAGYRTALYGKWHLGL-SKETSPNAH 120

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL----EPAWDLHGKYSTDVFTAEAVDI 232
           GF+   G+  G  D++ H     +  G++   DL       W+ +G+Y T++ T  +VD 
Sbjct: 121 GFDEFFGFKAGCVDFYSHIFYWGQGHGVNPLHDLWENETEVWE-NGRYMTELITERSVDF 179

Query: 233 I-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I  +   + P FL+ ++ A     P+ P+ AP  Y++   H+  + R   AA++  +D+ 
Sbjct: 180 IKRSREQEAPFFLFASYNA-----PHYPMHAPQEYMDRFAHLP-WDRQVMAAMIAAVDDG 233

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--------LNAASNWPLRGVKNTLWEG 343
           VG++VEAL++     +++I F SDNG ++   N            S    RG K +L+EG
Sbjct: 234 VGEIVEALKEAGCYEDTVIFFSSDNGPSSESRNWLDGTEDIYYGGSAGIFRGHKASLFEG 293

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           G+R   ++  P     G V ++   + D +PT L  A 
Sbjct: 294 GIREPAILNWPNGWEGGQVRDEVAAMMDIVPTFLDLAG 331


>gi|313216787|emb|CBY38029.1| unnamed protein product [Oikopleura dioica]
          Length = 383

 Score =  178 bits (451), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 88/208 (42%), Positives = 128/208 (61%), Gaps = 11/208 (5%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PH +FILADD+G++D+G+H   +I TP +D LA +G+ L+ YY   +CTP+R  +MTG++
Sbjct: 176 PHFLFILADDMGYHDIGYHQA-EILTPFMDKLATTGVRLEQYYVQPVCTPTRVQLMTGRY 234

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            I  GMQH V+   +  G+PL EK+LP+ L++ GY T ++GKWHLG + ++Y P  RGF+
Sbjct: 235 QIRYGMQHGVVRPPQPDGVPLDEKLLPEALRKCGYNTEMIGKWHLGMFTEDYLPQNRGFD 294

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL--EPA---WDLHGKYSTDVFTAEAVDIIH 234
             +G++TG QD++ H+     M G D R     +P    WDL+  YST VF  E    + 
Sbjct: 295 HFMGFYTGSQDFYSHNKCFSGMCGYDFREATAGQPEVIRWDLNNTYSTGVFADELEKRLS 354

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQA 262
             +  EP F YL+  A HS     PLQ 
Sbjct: 355 KMNPSEPSFTYLSFQAVHS-----PLQG 377


>gi|410215590|gb|JAA05014.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
 gi|410254514|gb|JAA15224.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
 gi|410288780|gb|JAA22990.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
 gi|410330541|gb|JAA34217.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
          Length = 522

 Score =  177 bits (450), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 113/362 (31%), Positives = 179/362 (49%), Gaps = 42/362 (11%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS   +    +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  N+
Sbjct: 13  LLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNF 72

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELG 153
           Y+   LC+PSR+A++TG+ PI  G      +           GG+P SE++LP+ LK+ G
Sbjct: 73  YSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG 132

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + RD    
Sbjct: 133 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD---- 183

Query: 214 WDLHGKYS--------------TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
           W++ G+Y               T ++  EA+D I   +   P FLY A  ATH+     P
Sbjct: 184 WEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----P 238

Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           + A   +L         +R ++   + ++D+S+GK++E L+   +  N+ + F SDNG A
Sbjct: 239 VYASKPFLGTS------QRGRYGDAVREIDDSIGKMLELLQDLHVADNTFVFFTSDNGAA 292

Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
                    SN P    K T +EGG+R   L W P   + G V+ Q   + D   T L+ 
Sbjct: 293 LISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVAAGQVSHQLGSIMDLFTTSLAL 352

Query: 380 AN 381
           A 
Sbjct: 353 AG 354


>gi|380796101|gb|AFE69926.1| N-acetylgalactosamine-6-sulfatase precursor, partial [Macaca
           mulatta]
          Length = 503

 Score =  177 bits (450), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 112/345 (32%), Positives = 174/345 (50%), Gaps = 42/345 (12%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
           PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  N+Y+   LC+PSR+A++TG
Sbjct: 11  PPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTG 70

Query: 118 KHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           + PI  G      +           GG+P SE++LP+ LK+ GY ++IVGKWHLG ++ +
Sbjct: 71  RLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQ 129

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
           + P   GF+   G    H   +D+ A       + + RD    W++ G+Y          
Sbjct: 130 FHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKT 181

Query: 222 -----TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
                T ++  EA+D I   +   P FLY A  ATH+     P+ A   +L         
Sbjct: 182 GEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----PVYASKPFLGTS------ 230

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
           +R ++   + ++D+SVGK++E L    +  N+ + F SDNG A       A SN P    
Sbjct: 231 QRGRYGDAVREIDDSVGKMLELLHDLHVADNTFVFFTSDNGAALISAPEQAGSNGPFLCG 290

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           K T +EGG+R   L W P   + G V+ Q   + D   T L+ A 
Sbjct: 291 KQTTFEGGMREPALAWWPGHIAAGQVSHQLGSIMDLFTTSLALAG 335


>gi|348550278|ref|XP_003460959.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cavia porcellus]
          Length = 502

 Score =  177 bits (449), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 114/346 (32%), Positives = 173/346 (50%), Gaps = 43/346 (12%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
           PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++Y+   LC+PSR+A++TG
Sbjct: 9   PPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTG 68

Query: 118 KHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           + PI  G         N     E  GG+P SE++LPQ LKE GY T+IVGKWHLG ++ +
Sbjct: 69  RLPIRNGFYTTNGHARNAYTPQEIVGGIPDSERLLPQLLKEAGYATKIVGKWHLG-HRPQ 127

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
           + P   GF+   G    H   +D+ A       + + R+    WD+ G++          
Sbjct: 128 FHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRN----WDMVGRFYEEFPINVKT 179

Query: 222 -----TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
                T ++  EA+D I    +   P FLY A  ATH+     P+ A   +L I      
Sbjct: 180 GESNLTQIYLQEALDFIRQQQAAQHPFFLYWAVDATHA-----PVYASRPFLGIS----- 229

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
            +R ++   + ++D+SVGK++  L    +  NS + F SDNG A         SN P   
Sbjct: 230 -QRGRYGDAVREIDDSVGKILSLLRTLSIAENSFVFFTSDNGAALISAPSQGGSNGPFLC 288

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            K T +EGG+R   + W P     G V+ Q   + D   T L+ A 
Sbjct: 289 GKQTTFEGGMREPAIAWWPKHIPAGQVSHQLGSIMDLFTTSLALAG 334


>gi|221068581|ref|ZP_03544686.1| sulfatase [Comamonas testosteroni KF-1]
 gi|220713604|gb|EED68972.1| sulfatase [Comamonas testosteroni KF-1]
          Length = 440

 Score =  177 bits (449), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 119/350 (34%), Positives = 176/350 (50%), Gaps = 29/350 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I+FILADDLGW D+G +G     TP++D LA  G+     Y    +C+ +R A++TG+
Sbjct: 4   PNIVFILADDLGWADLGVYGATDFKTPHLDHLASQGVRFNQAYANSAVCSATRIALITGR 63

Query: 119 HP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +   +  G++  +     + GLP     LP  LK+ GY T ++GKWHLG     Y P   
Sbjct: 64  YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPAYGPQRS 122

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           G++   G  +G  DYF H     + +  D+ +  EP       Y T +   EA   +H  
Sbjct: 123 GYDYFFGNHSGAIDYFTHKPGVGEQFSPDLWQGNEPV--ERTGYYTYILGDEATRYVHER 180

Query: 237 -STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF------KRSKFAAILHKLD 289
              D+P FL L     H   P+ P + P    ++ R I+D          K+  I+  LD
Sbjct: 181 KDQDKPFFLSL-----HFTAPHWPWEGPGDE-HVARSIKDLFHYDGGSLKKYGEIVEALD 234

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           ++VG+V++AL+      N+I++F SDNGG         +  WP  G K  L EGG+R   
Sbjct: 235 KAVGQVLQALDDTGQADNTIVIFTSDNGGE------RFSKTWPFTGQKTELLEGGIRVPT 288

Query: 350 LI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
           L+ WS  +  R  V EQ     DWLPTLL+AA     P++  S  +NI+P
Sbjct: 289 LLRWSARI--RPQVQEQVTASFDWLPTLLAAAGARPHPDF-PSDGQNILP 335


>gi|329928435|ref|ZP_08282305.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
 gi|328937871|gb|EGG34277.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
          Length = 443

 Score =  177 bits (449), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 113/338 (33%), Positives = 180/338 (53%), Gaps = 26/338 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+ I I  DDLG+ D+G +G D + TP++D LA  GI   N+Y+   +C+PSR++++TGK
Sbjct: 8   PNFIVIYCDDLGYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLTGK 67

Query: 119 HPIHTGMQHNVLYGCERG--GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +P   G+    + G +RG  GLP  E  L + LK  GYRT + GKWHLG   +E +P   
Sbjct: 68  YPARAGVGE--ILGAKRGSHGLPADEVTLAKALKPAGYRTALFGKWHLGL-SEETSPNAH 124

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL----EPAWDLHGKYSTDVFTAEAVDI 232
           GF+   G+  G  D++ H     +  G++   DL       W+ +G+Y T++ T  +VD 
Sbjct: 125 GFDEFFGFKAGCVDFYSHIFYWGQAHGVNPLHDLWENETEVWE-NGRYMTELITERSVDF 183

Query: 233 IHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I      E P FL+ ++ A     P+ P+ AP  Y++   H+  + R   AA++  +D+ 
Sbjct: 184 IQRSREQEAPFFLFASYNA-----PHYPMHAPQEYMDRFAHLP-WDRQVMAAMIAAVDDG 237

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--------LNAASNWPLRGVKNTLWEG 343
           VG++V+AL++     +++I F SDNG ++   N            S    RG K +L+EG
Sbjct: 238 VGEIVKALKEAGCYEDTVIFFSSDNGPSSESRNWLDGTEDVYYGGSAGIFRGHKASLFEG 297

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           G+R   ++  P     G V ++ V + D  PT L  A 
Sbjct: 298 GIREPAILSWPNGWEGGQVRDEVVAMMDIAPTFLDLAG 335


>gi|431907857|gb|ELK11464.1| Arylsulfatase B [Pteropus alecto]
          Length = 274

 Score =  177 bits (449), Expect = 2e-41,   Method: Composition-based stats.
 Identities = 88/209 (42%), Positives = 127/209 (60%), Gaps = 13/209 (6%)

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
           AI+T K  IHTG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  
Sbjct: 16  AILTLK--IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECL 73

Query: 173 PTFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT 226
           PT RGF+++ GY  G +DY+ H      +A  +    LD R   E A      YST++FT
Sbjct: 74  PTRRGFDTYFGYLLGSEDYYSHERCTLINALNVTRCALDFRDGEEVATGYENMYSTNIFT 133

Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
             A  +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++ 
Sbjct: 134 QRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVS 188

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSD 315
            +DE+VG V  AL+   + +N++ +F ++
Sbjct: 189 LMDEAVGNVTAALKSHGLWNNTVFIFSTE 217



 Score = 43.1 bits (100), Expect = 0.70,   Method: Composition-based stats.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1  MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
          +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 26 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 61


>gi|323456816|gb|EGB12682.1| hypothetical protein AURANDRAFT_60668 [Aureococcus anophagefferens]
          Length = 534

 Score =  177 bits (449), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 121/358 (33%), Positives = 169/358 (47%), Gaps = 46/358 (12%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           PH++F+L DD+G+ DVG+   D I +P +D LA  G+ L  +Y    C PSR+A++TG +
Sbjct: 27  PHLLFVLGDDVGFGDVGYSSPDVI-SPTLDRLAAEGLKLGRHYAYMWCAPSRAALLTGYY 85

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKE-LGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           P  TG+         +  LPL   +LP  L++  GYRT  VGKWHLGF  +   P  RGF
Sbjct: 86  PSTTGVYST---SGAQNALPLEFALLPGLLRDRAGYRTAAVGKWHLGFMSEADLPERRGF 142

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL------EPAWD---LHGKYSTDVFTAEA 229
           +   G+  G +D++          G D   DL       PA D     G+YS +++   A
Sbjct: 143 DGFFGFLDGGEDHYSRVGAGAP--GCDRVFDLWDSRRRGPATDDPSAFGRYSAELYGEAA 200

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH------------------- 270
            D I  H   EPLFLY A    HS     P +  D Y +                     
Sbjct: 201 ADAIRGHDAAEPLFLYAAFQVAHSPLEVPPAKYFDAYADAALCANASTKGFACARPGADA 260

Query: 271 -----RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
                 H     R   AA++  LD +V  +  AL  + M ++ ++V+  DNGG      +
Sbjct: 261 PYRAPNHDCGCDRLVVAAMVTALDAAVANMTTALASKNM-ADFVLVYSGDNGGP----EI 315

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
               N  LRG K T +EGGVR A  + SPLL  RG      +H+ DWLPT L+ A  +
Sbjct: 316 EGHWNGGLRGGKWTWFEGGVRPAAFVHSPLLTRRGW-HNGTLHLVDWLPTFLALAGAA 372


>gi|261404208|ref|YP_003240449.1| sulfatase [Paenibacillus sp. Y412MC10]
 gi|261280671|gb|ACX62642.1| sulfatase [Paenibacillus sp. Y412MC10]
          Length = 452

 Score =  177 bits (449), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 113/338 (33%), Positives = 179/338 (52%), Gaps = 26/338 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+ I I  DDLG+ D+G +G D + TP++D LA  GI   N+Y+   +C+PSR++++TGK
Sbjct: 17  PNFIVIYCDDLGYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLTGK 76

Query: 119 HPIHTGMQHNVLYGCERG--GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +P   G+    + G +RG  GLP  E  L + LK  GYRT + GKWHLG   +E +P   
Sbjct: 77  YPARAGVGE--ILGAKRGSHGLPADEVTLAKALKPAGYRTALYGKWHLGL-SEETSPNAH 133

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL----EPAWDLHGKYSTDVFTAEAVDI 232
           GF+   G+  G  D++ H     +  G++   DL       W+ +G+Y T++ T  +VD 
Sbjct: 134 GFDEFFGFKAGCVDFYSHIFYWGQAHGVNPLHDLWENETEVWE-NGRYMTELITERSVDF 192

Query: 233 IHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I      E P FL+ ++ A     P+ P+ AP  Y++   H+  + R   AA++  +D+ 
Sbjct: 193 IQRSREQEAPFFLFASYNA-----PHYPMHAPQKYMDRFAHLP-WDRQVMAAMIAAVDDG 246

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--------LNAASNWPLRGVKNTLWEG 343
           VGK+V+AL++     +++I F SDNG ++   N            S    RG K +L+EG
Sbjct: 247 VGKIVKALKEAGCYEDTVIFFSSDNGPSSESRNWLDGTEDVYYGGSAGIFRGHKASLFEG 306

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           G+R   ++  P     G V ++   + D  PT L  A 
Sbjct: 307 GIREPAILSWPNGWEGGQVRDEVAAMMDLAPTFLDLAG 344


>gi|340373733|ref|XP_003385394.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 389

 Score =  177 bits (449), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 117/348 (33%), Positives = 182/348 (52%), Gaps = 25/348 (7%)

Query: 38  AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY-SGI 96
           A+  L F LS   +     +  P+++F+L DD G+ DV F     I +P+ ++LA   G+
Sbjct: 2   ALFKLVF-LSFFAIAAATVNAKPNLVFVLVDDWGFADVSFRN-PAISSPHFESLATKEGL 59

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
           IL  +Y  + C+PSR++ +TG+ P H    +    G    G  ++  ++P  LK  GY+T
Sbjct: 60  ILDRHYVFKYCSPSRASFLTGRFPHHAHQWNPPQSGLV--GANINMTMIPAKLKTAGYKT 117

Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL 216
            +VGKWH GFY K++ P  RGF++  G+  G +   DH  +E+ +  +D  ++   A D 
Sbjct: 118 HMVGKWHEGFYLKKFLPINRGFDTMSGFLGGGE---DHMTQEI-ICAVDYWKN--DAIDT 171

Query: 217 -HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH-RHIE 274
            +G Y    +  +   I+ NH   +P FLYL     H+     PLQAPD +LN++  +  
Sbjct: 172 RNGTYDDYTYVEDLSKILTNHDPSDPFFLYLPLHNVHA-----PLQAPDEFLNLYPVNST 226

Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLR 334
              R    A++   D   G VV+ L+ + M  N++IV  +DNGGA         SN+PL+
Sbjct: 227 CSDRRTIQAMVSVADNVTGFVVKMLKDKGMWDNTLIVVSADNGGAPC-----EGSNYPLK 281

Query: 335 GVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAA 380
           G K T +EGGVR    +   LL S  RG   E ++H++DW  T    A
Sbjct: 282 GSKMTFYEGGVRSLAFVSGGLLPSDRRGKKTEGFIHIADWYTTFCQLA 329


>gi|390361685|ref|XP_789654.2| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like
           [Strongylocentrotus purpuratus]
          Length = 401

 Score =  176 bits (447), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 100/248 (40%), Positives = 140/248 (56%), Gaps = 20/248 (8%)

Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-------SAEEMKM 201
           L++ GY T  VGKWH+G YK    PT RGF+S  G  TG   ++ H          + K 
Sbjct: 5   LRKAGYATHAVGKWHMGLYKTCCWPTNRGFDSFFGLLTGRGGFYTHKHYGGHPGLVDSKN 64

Query: 202 W-GLDMRRDLEP-AWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
           W G D+R +LE  A D  G YST +FT ++ +II  H+  +PLFLY +  A H      P
Sbjct: 65  WSGYDLRDNLEQVAQDYQGVYSTHLFTQKSQNIIRRHNRSKPLFLYHSFQAVH-----YP 119

Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           L+ P  Y+    +I D +R  +A ++  +DE+VG +   L++  + +N+II+F SDNG  
Sbjct: 120 LEVPPRYMEDFNYIADERRRTYAGMVKCMDEAVGNLTRTLKKTGLWNNTIIIFSSDNG-- 177

Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLL 377
            A FN    SNWPLRG+K +LWEGG++  G I SPLL    RG V     HV+DW PTL+
Sbjct: 178 -ANFNY-GGSNWPLRGMKRSLWEGGIKSVGFIASPLLPKLVRGTVNNNLFHVTDWFPTLV 235

Query: 378 SAANKSDI 385
               +  +
Sbjct: 236 RGVARGSL 243


>gi|354581367|ref|ZP_09000271.1| sulfatase [Paenibacillus lactis 154]
 gi|353201695|gb|EHB67148.1| sulfatase [Paenibacillus lactis 154]
          Length = 446

 Score =  176 bits (446), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 112/338 (33%), Positives = 184/338 (54%), Gaps = 26/338 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+ I I  DDLG+ D+G +G D + TPN+D+LA  GI   N+Y+   +C+PSR++++TGK
Sbjct: 11  PNFIDIYCDDLGYGDLGCYGSDTVTTPNLDSLAGEGIRFTNWYSNSPVCSPSRASLLTGK 70

Query: 119 HPIHTGMQHNVLYGCERG--GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +P   G+    + G +RG  GLP +E  L + LK  GYRT + GKWHLG   +E +P   
Sbjct: 71  YPARAGVGE--ILGAKRGLDGLPSTEVTLAKALKPAGYRTALYGKWHLGV-SEETSPNAH 127

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL----EPAWDLHGKYSTDVFTAEAVDI 232
           GF+   G+  G  D++ H     +  G++   DL       W+ +G+Y T++ T  +VD 
Sbjct: 128 GFDEFFGFKAGCIDFYSHIFYWGQGHGVNPLHDLWENETEVWE-NGRYMTELITERSVDF 186

Query: 233 I-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I  +   ++P FL++++ A     P+ P+ AP  Y++   H+  + R   AA++  +D+ 
Sbjct: 187 IKRSREQEDPFFLFVSYNA-----PHYPMHAPKEYMDRFAHLP-WDRQVMAAMIAAVDDG 240

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--------LNAASNWPLRGVKNTLWEG 343
           VG++V+AL+      N++I F SDNG ++   N            +    RG K +L+EG
Sbjct: 241 VGEIVKALKAAGCYENTVIFFSSDNGPSSESRNWLDGTEDVYYGGTAGIFRGHKASLFEG 300

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           G+R   ++  P     G V ++   + D +PT L  A 
Sbjct: 301 GIREPAILSWPNGLEGGQVRDEVGVMMDIVPTFLDLAG 338


>gi|313236789|emb|CBY12041.1| unnamed protein product [Oikopleura dioica]
 gi|313242643|emb|CBY39450.1| unnamed protein product [Oikopleura dioica]
          Length = 622

 Score =  176 bits (446), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 117/384 (30%), Positives = 189/384 (49%), Gaps = 62/384 (16%)

Query: 40  LPLAFTLSMVFVDLVASSGP-----PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           L L F +  VF+   + +       P+I++IL DDLGW DVG++      TP +D L  +
Sbjct: 4   LCLCFAIVAVFLKRESFAQETEDLRPNIVYILVDDLGWADVGWNNKGLESTPFMDKLVKN 63

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           G      Y+   C+PSR+  +TG++   +GM    +      G+   +K LP+YLKE+GY
Sbjct: 64  GTQFTQMYSSHRCSPSRAMALTGRYAFRSGMGSFPIAREVPFGMNTQDKTLPEYLKEVGY 123

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTG---------------HQDYFDHSAEEM 199
            T  VGKWHLG     Y PT RGF++  G+++G               + D+FD++ E+ 
Sbjct: 124 DTHAVGKWHLGVCNSSYLPTSRGFDTFYGHYSGAVDYRGHFIKRSKNFYHDFFDNTIEQH 183

Query: 200 KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYE 258
           K   LD+  D        G+++TD+F    +DI+     +  P ++YLA  A     P+E
Sbjct: 184 K---LDLESD--------GQWTTDLFRDRTIDILKEAKRSKTPAYVYLAFNA-----PHE 227

Query: 259 PLQAPDHYLNIHRHIED-----FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
           P +AP     I R +E      + R++  A ++++D +V ++ +   ++ +   ++IVF+
Sbjct: 228 PTRAPADL--IARILEKHPNLPYTRAEHLAAVNQIDTAVRQIYQQ-AKKDVKRETLIVFM 284

Query: 314 SDNGGAAAGF-----------------NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
           SDNGGA   F                 +L  A N+P +G KN+L+EGG+     + S   
Sbjct: 285 SDNGGAVKKFGGGRSPDNTGHEYDDEESLPRACNYPFKGFKNSLFEGGILSPSFLISTAR 344

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAA 380
           +          H+ DW PT L  A
Sbjct: 345 KFSRTRFNFPFHIIDWAPTFLDFA 368


>gi|294053962|ref|YP_003547620.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613295|gb|ADE53450.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 494

 Score =  176 bits (446), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 112/308 (36%), Positives = 159/308 (51%), Gaps = 33/308 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P++I ILADDLG+ DVGF G  +I TP +D LA  G+I  N Y T   C PSR+ ++TG+
Sbjct: 23  PNLILILADDLGYADVGFTGSTEIQTPVLDRLAAGGVIFNNGYVTHAYCGPSRAGLITGR 82

Query: 119 HPIHTGMQHNVLYGC--ERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +    G++ N  Y       GLP  EK     LK+ GYRT ++GKWHLG     Y P  R
Sbjct: 83  YQARFGVEVNFPYAPFDPHSGLPTDEKTFATRLKQSGYRTAMIGKWHLG-AAYPYHPNNR 141

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLD----------------MRRDLEPAWDLHGKY 220
           GF+   G+  G  DY   +        L+                MR ++   +D   +Y
Sbjct: 142 GFDYFYGFLGGAHDYMPENTSTTVPLTLENGKVNHMANAGSYLPLMRNNVNAEFD---EY 198

Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
            T   + +A   I    T+ P  +YL++ A H+     PLQAP   +  + HIE  KR  
Sbjct: 199 LTTALSRDAARFIEK--TEGPFCVYLSYNAPHT-----PLQAPKALIEKYAHIESQKRRT 251

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG--AAAGF-NLNAASNWPLRGVK 337
           +AA++  +D+ +G+VV+ALE+   L N++I F SDNGG     G+   + A N P RG K
Sbjct: 252 YAAMIDSMDQGIGRVVDALEKSGKLENTLIFFFSDNGGPRPKPGYEGEDFADNTPYRGGK 311

Query: 338 NTLWEGGV 345
             + EGG 
Sbjct: 312 GAMLEGGT 319


>gi|325286699|ref|YP_004262489.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
 gi|324322153|gb|ADY29618.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
          Length = 494

 Score =  176 bits (445), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 116/342 (33%), Positives = 179/342 (52%), Gaps = 25/342 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+I+ +L DDLG++DVGF+G   I TP +D LA +G I  + Y     C PSR+A++TGK
Sbjct: 42  PNILLVLCDDLGYSDVGFNGSTDIKTPELDKLANAGTIFTSAYVAHPFCGPSRAALLTGK 101

Query: 119 HPIHTGMQHNVLYGCER--GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +P   G Q N+    E    G+  +E+ + + L+E GY T  +GKWHLG   +E+ P  R
Sbjct: 102 YPHTIGSQFNLPANGESLGKGIDTNEQFIAKTLQESGYYTGAIGKWHLG-ATEEFHPNQR 160

Query: 177 GFESHLGYWTGHQDYF----DHSAEEMKMWGLDMRRDLEPAWDLHGK------YSTDVFT 226
           GF    G+  G  +YF        ++ K     + RD     + +GK      Y TD F+
Sbjct: 161 GFTDFYGFLGGGHNYFPEQYQAQYQKQKKAKKKIIRDYILPLEHNGKEVKETEYLTDAFS 220

Query: 227 AEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
            EA   +   S  + P FLYLA+ A     P+ PL+A +  L     I+D  R  +AA++
Sbjct: 221 REASRFVKEASNKKKPFFLYLAYNA-----PHVPLEAKEEDLEKFSVIKDKDRRTYAAMV 275

Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
           + +D  + ++V  L++   L N++I+F+SDNGG       + A+N PL G K   WEGG 
Sbjct: 276 YAVDRGIKQIVNTLKETNQLENTLIIFLSDNGGNTD----HGATNNPLHGRKGDTWEGGY 331

Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           R    ++ P   ++G      +   D  PT L+   K+ +P+
Sbjct: 332 RVPMFMYCPKNIAKGHTNNFPITSLDLYPT-LAHIGKAKLPD 372


>gi|260062070|ref|YP_003195150.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
 gi|88783632|gb|EAR14803.1| arylsulphatase A [Robiginitalea biformata HTCC2501]
          Length = 459

 Score =  176 bits (445), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 123/362 (33%), Positives = 189/362 (52%), Gaps = 34/362 (9%)

Query: 39  VLPLAFTLSMVFVD-LVASSG---------PPHIIFILADDLGWNDVGFHGLDQIPTPNI 88
           + PL   L++  +  L A++G          P+I+ IL DDLG+ D+   G   + +PNI
Sbjct: 11  LCPLNAILALFSIGCLAAATGTCYAQERPDAPNILCILVDDLGYGDLSCQGATDLQSPNI 70

Query: 89  DALAYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER--GGLPLSEKIL 145
           DALA +G+   N+Y    +C+PSR+A++TG++P   G+   +    E   G L     ++
Sbjct: 71  DALAANGMRFTNFYANSTVCSPSRAALLTGRYPDLVGVPGVIRQNPENNWGNLADDAVLI 130

Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGL 204
           P  L   GY T I+GKWHLG  + + TP  RGF    G+     D Y+DH    +    L
Sbjct: 131 PSELNPAGYHTGIIGKWHLGLEEPD-TPNDRGFTYFKGFLGDMMDDYWDHRRGGINWMRL 189

Query: 205 DMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAP 263
           + R +++P       ++TD+FT   +D +     +E P FLYLA+ A     P+ P+Q P
Sbjct: 190 N-REEIDPK-----GHATDLFTDWTIDFLKERQGEEQPFFLYLAYNA-----PHFPIQPP 238

Query: 264 DHYLNIHRHIE---DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
             +L+  R  E     KR+K  A +  LD SVG+V+EAL+   +  N+++VFVSDNGGA 
Sbjct: 239 REWLDKVREREPNLTEKRAKNVAFVEHLDYSVGRVMEALKTTGLEENTLVVFVSDNGGAL 298

Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
                 A SN PLRG K  ++EGG+R   + +     + G  ++    + D  PT    A
Sbjct: 299 ----WYAQSNGPLRGGKQDMYEGGIRVPAIFYWKGKIAPGTTSDNTALLMDLFPTFCELA 354

Query: 381 NK 382
            +
Sbjct: 355 GR 356


>gi|410617069|ref|ZP_11328045.1| arylsulfatase B [Glaciecola polaris LMG 21857]
 gi|410163338|dbj|GAC32183.1| arylsulfatase B [Glaciecola polaris LMG 21857]
          Length = 482

 Score =  176 bits (445), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 111/334 (33%), Positives = 175/334 (52%), Gaps = 22/334 (6%)

Query: 57  SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIM 115
           + PP+I+FIL+DD G+ D GF G D + TPN+D LA  G +  + Y +  +C PSR+ I+
Sbjct: 35  TSPPNILFILSDDAGYADFGFQGSDVMRTPNLDKLASQGTVFTQAYVSAAVCGPSRAGIL 94

Query: 116 TGKHPIHTGMQHNVL--YGCERG------GLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
           TGK+    G + N +  Y  + G      GLPL +K +  YL+E GY+T ++GKWH G  
Sbjct: 95  TGKYQQRFGYEENNVPGYMSQSGLTGDDMGLPLDQKTMADYLRERGYKTALIGKWHQG-N 153

Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
              + PT RGF+   G+  G + YF   A+    +  D        +    +Y T+    
Sbjct: 154 ADRFHPTKRGFDEFYGFRGGARSYFGFGAQNPVSYPEDKLERGFAHFQESKRYLTEALAT 213

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
           E V+ I  +    P F++L+  A H+     P++A    L    +++  KR + AA+   
Sbjct: 214 ETVEFIKRNQK-HPFFVFLSFNAVHT-----PMEAKPADLAQFSNLKG-KRQQLAAMTLS 266

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +D  +GKV+  L++  +  N+++VF +DNGG +   + NA++N PL G K    EGG+R 
Sbjct: 267 MDREIGKVLNTLDELGLSDNTLVVFTNDNGGPS---DTNASNNGPLSGTKANHLEGGIRV 323

Query: 348 AGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             ++ W     S+G      +   D LPT  SAA
Sbjct: 324 PFIMRWPAGKVSQGEYPNP-ISTLDLLPTFFSAA 356


>gi|422293430|gb|EKU20730.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
 gi|422295486|gb|EKU22785.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
          Length = 703

 Score =  175 bits (443), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 125/349 (35%), Positives = 188/349 (53%), Gaps = 47/349 (13%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIP----TPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           PPH++ I+ DD G  DVG++   + P    TP +D+LA   + LK YY   +CTP+R+A+
Sbjct: 61  PPHVLMIVVDDAGVQDVGYNASPESPLRGKTPVLDSLAAESVRLKEYYVHPVCTPTRAAL 120

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK-ELGYRTRIVGKWHLGFYKKEYTP 173
           +TG++ ++ GM    L G    GL  S   LP+ LK E  Y T +VGKWHLG  K +  P
Sbjct: 121 LTGRYAVNVGMPFP-LIGDAISGLDGSIPTLPEMLKSEANYSTHLVGKWHLGAAKAKNRP 179

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMK-MWGLDMRRDLE-PAWDLHGK-YSTDVFTAEAV 230
             RGF+S  G      D++     E++ +W    + + E PA ++  K ++T +F+ EAV
Sbjct: 180 LARGFDSFYGLLGASFDHYTKKMGEVRDLW----KNEAEVPAKEVDEKEHATTLFSREAV 235

Query: 231 DIIHNHSTD----------EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
            +I  HS            +PLFLYLA++A     P+ PLQA + ++ +   + +  R  
Sbjct: 236 KVIEEHSARGHAGAKDGDMDPLFLYLAYSA-----PHAPLQADEKFMKLCSDVPNRHRRT 290

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG--AAAGFNLNAASNWPLRGVKN 338
           F A++ +LDE +G V  +L+   M  ++++VF SDNGG     GFN N       RG K+
Sbjct: 291 FCAMMAQLDEGIGNVTASLKAHGMWEDTVVVFTSDNGGYFLVGGFNYN------YRGGKS 344

Query: 339 TLWEGGVRGAGLIWSPL------LESRGIVAEQYVHVSDWLPTLLSAAN 381
             WEGGVR    I +P        + RG+      HV+DW+PTL+  A 
Sbjct: 345 GGWEGGVRVPAFIRAPKRYNFAPRDFRGM-----AHVADWMPTLMGMAT 388


>gi|285808548|gb|ADC36070.1| sulfatase [uncultured bacterium 213]
          Length = 478

 Score =  175 bits (443), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 110/334 (32%), Positives = 172/334 (51%), Gaps = 29/334 (8%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTG 117
           PP+I+FI+ADDLG+ DV  +G   + TPN+D +A  G+  L+ Y    +C+ +R+A++TG
Sbjct: 38  PPNIVFIMADDLGYADVSCYGRPDLNTPNVDRVALKGVRFLQAYANSAVCSATRTALITG 97

Query: 118 KHP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           ++   +  G++  +  G +  GLP     LP  L++ GYRT ++GKWHLG   K + P  
Sbjct: 98  RYQYRLPIGLEEPLGIGRDV-GLPPEHPTLPSLLRKAGYRTTLLGKWHLGALPK-FGPLQ 155

Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
            G++   G+  G  DY+ H+  + +    D    L         Y TD+  + AVD+I+ 
Sbjct: 156 SGYDHFYGFRGGSVDYYTHAGPDQRDDLWDDDVPLR-----QSGYLTDLLGSRAVDVING 210

Query: 236 HS-TDEPLFLYLAHAATHSANPYEPLQAPDHYL-------NIHRHIEDFKRSKFAAILHK 287
           +S +D P  + L     H + P+ P +AP              RH +   +  +  ++ +
Sbjct: 211 YSHSDRPFLVSL-----HFSAPHWPWEAPGDEAESNRIRSTTLRHYDGGTQKTYQRMIEQ 265

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +D  +G+V+ AL+      N+I+VF SDNGG         A  WP  G K  L EGG+R 
Sbjct: 266 MDLQIGRVLGALDANGATDNTIVVFTSDNGGE------RFADTWPFTGRKTELLEGGLRI 319

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             LI  P    +G   +Q     DWLPTLL+AA 
Sbjct: 320 PALISWPARIRQGATTDQVAMSMDWLPTLLAAAG 353


>gi|372210595|ref|ZP_09498397.1| sulfatase [Flavobacteriaceae bacterium S85]
          Length = 472

 Score =  175 bits (443), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 109/319 (34%), Positives = 170/319 (53%), Gaps = 28/319 (8%)

Query: 42  LAFTLSMVFVDL--VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           L FTL+++ +       +  P+II  +ADD+G+ D GF G   I TPN+D LA +G   K
Sbjct: 6   LLFTLTIIGITFGHAQKNKKPNIIVFMADDMGYADTGFTGATDIQTPNLDNLAKNGAFFK 65

Query: 100 NYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLY--GCERGGLPLSEKILPQYLKELGYRT 156
             Y     C PSR+A+++G++    G + N  Y       G+ + EK+ P+ L+E GY+T
Sbjct: 66  QGYANHAYCGPSRAALLSGRYQHRFGFETNPAYDPANPHMGIDVGEKLFPKRLQEAGYKT 125

Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP---- 212
             +GKWHLG    E+ P  RGF+   G+  G  DYF     + K+W       L+P    
Sbjct: 126 GAIGKWHLG-AAAEFHPLNRGFDYFYGFLGGGHDYFRIDGTK-KVW----EAYLQPLVRN 179

Query: 213 --AWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
             A +  G Y T   + +A   + ++  + P FLY+A+ A     P+ PLQAP   +  +
Sbjct: 180 KRADNFEG-YLTTALSNDAAQFVKDNK-ENPFFLYVAYNA-----PHMPLQAPKEDIARY 232

Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG----AAAGFNLN 326
            HI+D KR  +AA++  +D  +G V+++L+   +  N++I F+SDNGG      +    N
Sbjct: 233 AHIKDNKRRVYAAMVDVMDRGIGTVIQSLKDSGIYENTLIFFLSDNGGPQSKGGSTKGWN 292

Query: 327 AASNWPLRGVKNTLWEGGV 345
            + N P RG K  L++GGV
Sbjct: 293 GSDNQPFRGGKGNLYDGGV 311


>gi|126304968|ref|XP_001376926.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Monodelphis
           domestica]
          Length = 520

 Score =  175 bits (443), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 119/371 (32%), Positives = 187/371 (50%), Gaps = 47/371 (12%)

Query: 38  AVLPLAFTLSMVF----VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           A + L   +S++F    +   A+S PP+I+F+L DD+GW D+G  G     TP++D +A 
Sbjct: 2   ATVALVLGVSLLFGWAPLGAGATSQPPNIVFLLMDDMGWGDLGVFGEPSRETPHLDQMAA 61

Query: 94  SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKIL 145
            G++  N+YT   LC+PSR+A++TG+ PI  G         N     E  GG+P SE +L
Sbjct: 62  EGMLFPNFYTANPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDSEFLL 121

Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
           P+ LK+ GY  +IVGKWHLG ++ ++ P   GF+   G    H   +D+ A    M  + 
Sbjct: 122 PELLKKAGYVNKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKA----MPNIP 176

Query: 206 MRRDLEPAWDLHGKYS--------------TDVFTAEAVDIIH-NHSTDEPLFLYLAHAA 250
           + R+    W++ G++               T ++  EAVD I    +  +P FLY A  A
Sbjct: 177 VYRN----WEMVGRFYEDFPINHKTGEANLTQIYLKEAVDFIKKQQAHQQPFFLYWAIDA 232

Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
           TH+     P+ A   +L         +R ++   + ++D+S+GK+++ L+   +  N+ +
Sbjct: 233 THA-----PVYASKSFLGTS------QRGRYGDAVREIDDSIGKILKLLQDLGIKENTFV 281

Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
            F SDNG A         SN P    K T +EGG+R   + W P     G V+ Q  +V 
Sbjct: 282 FFTSDNGAALISAPKEGGSNGPFLCGKQTTFEGGLREPAIAWWPGHIPAGQVSHQLGNVM 341

Query: 371 DWLPTLLSAAN 381
           D   T LS A 
Sbjct: 342 DLFTTGLSLAG 352


>gi|301608268|ref|XP_002933701.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Xenopus
           (Silurana) tropicalis]
          Length = 520

 Score =  174 bits (442), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 117/371 (31%), Positives = 184/371 (49%), Gaps = 48/371 (12%)

Query: 39  VLPLAFTLSMVFVDLVASSGP-----PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
            L + + L++ F++ V   G      P+I+ +L DD+GW D+G  G     TPN+D +A 
Sbjct: 2   ALYIMYALALGFLNPVLVLGGMNGTRPNILLLLMDDMGWGDLGVFGEPSRETPNLDKMAS 61

Query: 94  SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKIL 145
            G++  N+YT   LC+PSR+A++TG+ PI  G         N     E  GG+P SE + 
Sbjct: 62  EGMLFPNFYTANPLCSPSRAALLTGRLPIRNGFYTTNSHARNAYTPQEIMGGIPDSEVLF 121

Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
           P+ LK+ GY  +I+GKWHLG ++  Y P  RGF+   G    H   +D+         + 
Sbjct: 122 PELLKKAGYVNKIIGKWHLG-HRPSYHPLRRGFDEWFGSPNCHFGPYDNKQRP----NIP 176

Query: 206 MRRDLEPAWDLHGKYS--------------TDVFTAEAVDIIHNHSTDE-PLFLYLAHAA 250
           + RD    WD+ G+Y               T ++  EA++ I+  + D+ P FLY +  A
Sbjct: 177 VYRD----WDMVGRYYEDFNIDHKTGESNLTQIYLQEALEFINRQAGDQQPFFLYWSIDA 232

Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
           TH+     P+ A   +L   R      R  +   + ++D S+GK++  L Q+ ++ ++ +
Sbjct: 233 THA-----PVYASRPFLGTSR------RGLYGDAVREIDFSIGKILGLLNQKDIIKDTFV 281

Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
            F SDNG A     +   SN P    K T +EGG+R  G+ W P   + G V+ Q   + 
Sbjct: 282 FFTSDNGAALISAPMQGGSNGPFLCGKQTTFEGGMRVPGIAWWPGHVAPGQVSYQLGSIM 341

Query: 371 DWLPTLLSAAN 381
           D   T LS A 
Sbjct: 342 DLFTTSLSLAG 352


>gi|254444367|ref|ZP_05057843.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
 gi|198258675|gb|EDY82983.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
          Length = 462

 Score =  174 bits (442), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 108/324 (33%), Positives = 179/324 (55%), Gaps = 26/324 (8%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
           PP+I+FI ADDLG+ND+  +G   I TP ID+L   GI   ++Y+   +C+PSR+A++TG
Sbjct: 34  PPNIVFIFADDLGYNDLSSYGATDIATPAIDSLGEQGIRFTDFYSASPVCSPSRAALLTG 93

Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
           ++PI  G+   V +     G+  +E  + + L+E GYRT +VGKWHLG ++K + P   G
Sbjct: 94  RYPIRQGIT-GVFWPQSFDGIDPAETTIAELLQENGYRTGLVGKWHLGHHQK-HLPLQNG 151

Query: 178 FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           F S+ G            + +M M       D+E ++++   Y+T  +T EAV  I  + 
Sbjct: 152 FHSYFGI---------PYSNDMDMVVYMRGNDVE-SYEVDQHYTTRRYTEEAVQFIEQNK 201

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
            D+P FLYLAH+      P+ P+ A ++++         KR  +  ++ +LD SV ++++
Sbjct: 202 -DQPFFLYLAHSM-----PHVPIYASENFVGTS------KRGLYGDVIQELDWSVAQILD 249

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
            L++ ++  N+++VF SDNG   A  +L  ++  PLR  K   ++GG+R   L+  P   
Sbjct: 250 TLDKHQLSENTLVVFTSDNGPWTALKHLGGSAA-PLREGKMFTFDGGMRVPCLVRWPAQI 308

Query: 358 SRGIVAEQYVHVSDWLPTLLSAAN 381
             G  +    ++ DW PT    AN
Sbjct: 309 PAGQTSHAMANMMDWFPTFSRIAN 332


>gi|423219918|ref|ZP_17206414.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
           CL03T12C61]
 gi|392624181|gb|EIY18274.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
           CL03T12C61]
          Length = 463

 Score =  174 bits (442), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 107/294 (36%), Positives = 156/294 (53%), Gaps = 31/294 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P+I+ IL DD G+ND GF G  ++ TPNIDAL   G++  + +    + +PSR+ ++TG+
Sbjct: 32  PNILVILIDDAGYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPSRACLITGR 91

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +    G + N+       GLPL E+ + +  K  GYRT  +GKWHLG  + E  P  RGF
Sbjct: 92  YGHRFGYECNL--SDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLG-SRDEQHPNNRGF 148

Query: 179 ESHLGYWTGHQDYF------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           +   G   G +DYF      D   +E  +   D +   E       KY TD F+ +AV+ 
Sbjct: 149 DLFYGMKAGGRDYFYNEKKSDRPGDERNLLLNDRQVKFE-------KYLTDAFSEKAVEF 201

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I+  S  +P  +YLA+ A H+     P+QA D  +      E   R K AA+ + LD  V
Sbjct: 202 INESS--QPFMMYLAYNAVHT-----PMQATDEDM---AKFEGHPRQKLAAMTYALDRGV 251

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
           G V+  L+      N++I F+SDNGGA      N +SN+PL+G K   +EGG R
Sbjct: 252 GTVIRGLKDSGKFDNTLIFFLSDNGGATT----NQSSNYPLKGFKGNKFEGGHR 301


>gi|414070344|ref|ZP_11406330.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410807261|gb|EKS13241.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 470

 Score =  174 bits (442), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 111/334 (33%), Positives = 175/334 (52%), Gaps = 23/334 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P+I+ ++ADD G++D GF G + + TP ID LA   ++ +  Y T  +C PSR+ + TGK
Sbjct: 27  PNILLVIADDAGYHDFGFQGSEVMQTPTIDKLASQSVVFEQAYVTAAVCGPSRAGLYTGK 86

Query: 119 HPIHTGMQHNVLYG--------CERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           +    G + N + G         ++ GLP ++  + ++LKELGY T + GKWH G +  +
Sbjct: 87  YQQRFGFEENNVPGYMSKSGFTGDKMGLPFTQVTMAEHLKELGYHTGLFGKWHQGNH-DD 145

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW-GLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           Y PT RGF++  G+  G + YF +S EE + +    M RD +   + H  Y TD    + 
Sbjct: 146 YHPTKRGFDNFYGFREGARGYFAYSNEEQQAYPSQKMERDFKHYIE-HEGYLTDALATQT 204

Query: 230 VDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
              I     + +P F  L+ +A H+     P+QA +  L    H+   KR   AA+   +
Sbjct: 205 SHFIGQSVVNKQPFFAVLSFSAVHA-----PMQATNADLAQFPHLTG-KRKILAAMNFAM 258

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           D ++  V++ L   ++  N+++VF++DNGG     + N A+N+PL G K    EGG+R  
Sbjct: 259 DRAIAVVLDKLNALKISDNTLVVFINDNGGPT---DQNYANNYPLSGTKANHLEGGIRVP 315

Query: 349 GLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            L+ W   L S        V   D LPT + AAN
Sbjct: 316 MLLKWPKRLSSNNTRYTYPVSSLDLLPTFIHAAN 349


>gi|153807102|ref|ZP_01959770.1| hypothetical protein BACCAC_01379 [Bacteroides caccae ATCC 43185]
 gi|149130222|gb|EDM21432.1| arylsulfatase [Bacteroides caccae ATCC 43185]
          Length = 463

 Score =  174 bits (441), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 107/294 (36%), Positives = 156/294 (53%), Gaps = 31/294 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P+I+ IL DD G+ND GF G  ++ TPNIDAL   G++  + +    + +PSR+ ++TG+
Sbjct: 32  PNILVILIDDAGYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPSRACLITGR 91

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +    G + N+       GLPL E+ + +  K  GYRT  +GKWHLG  + E  P  RGF
Sbjct: 92  YGHRFGYECNL--SDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLG-SRDEQHPNNRGF 148

Query: 179 ESHLGYWTGHQDYF------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           +   G   G +DYF      D   +E  +   D +   E       KY TD F+ +AV+ 
Sbjct: 149 DLFYGMKAGGRDYFYNEKKSDRPGDERNLLLNDRQVKFE-------KYLTDAFSEKAVEF 201

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I+  S  +P  +YLA+ A H+     P+QA D  +      E   R K AA+ + LD  V
Sbjct: 202 INESS--QPFMMYLAYNAVHT-----PMQATDEDM---AKFEGHPRQKLAAMTYALDRGV 251

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
           G V+  L+      N++I F+SDNGGA      N +SN+PL+G K   +EGG R
Sbjct: 252 GTVIRGLKDSGKFDNTLIFFLSDNGGATT----NQSSNYPLKGFKGNKFEGGHR 301


>gi|340380159|ref|XP_003388591.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 500

 Score =  174 bits (441), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 113/350 (32%), Positives = 177/350 (50%), Gaps = 24/350 (6%)

Query: 37  FAVLPLA-FTLSMVFVDLVASSGP--PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           FA L L  F+LS+  V + A++ P  PHI+F+L DD G+ D  F     I TP+   L  
Sbjct: 2   FAQLLLGLFSLSVCAVTVGAAAPPDKPHIVFVLVDDWGFADASFRN-PAIKTPSFQYLVD 60

Query: 94  SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
           +G+IL  +Y  + C+PSR++ +TG+ P H    +    G    G  ++  +LP  LK  G
Sbjct: 61  NGLILNRHYVFKYCSPSRASFLTGRFPHHVHQWNPTPPGMV--GTNINMTMLPAKLKTAG 118

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y T +VGKWH G Y   Y P  RGF++  G+    + +F+ +        +D  ++  P 
Sbjct: 119 YSTHMVGKWHQGLYDPAYLPVNRGFDTSSGFLQAGEGHFNQTIG----CAVDFWKNHAPD 174

Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH-RH 272
              +G Y + ++  +   I   H   +PLFLYL     H+     P++AP  +L+++  +
Sbjct: 175 -TRNGTYDSYIYNKDLTTIFSKHDASKPLFLYLPLHNVHA-----PIEAPQEWLDLYPVN 228

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
                R    A++   D   G VV  L++  M  N+I+V  +DNGGA         SN+P
Sbjct: 229 STCATRRVLQAMVSVADNVTGHVVNLLKENDMWDNTIMVISADNGGADC-----KGSNYP 283

Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAA 380
           L+G K+T +EGGVR        L+ +  RG   + ++HV+DW  T    A
Sbjct: 284 LKGSKHTFFEGGVRVIAFASGGLIPAARRGKSTDGFIHVADWYTTFCKMA 333


>gi|325286704|ref|YP_004262494.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
 gi|324322158|gb|ADY29623.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
          Length = 484

 Score =  174 bits (441), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 123/356 (34%), Positives = 181/356 (50%), Gaps = 36/356 (10%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKN 100
           L F L +     + +   P+IIF+ +DD G+ D GF G   + TPN+D LA SG    + 
Sbjct: 5   LIFILLLTVAINLNAQEKPNIIFLFSDDAGYADFGFQGSKIMKTPNLDKLAKSGAKFTQG 64

Query: 101 YYTVQLCTPSRSAIMTGKHPIHTG---------MQHNVLYGCERGGLPLSEKILPQYLKE 151
           Y T   C PSR+ ++TGK+    G         M  N  +  +  GLPL +  +  +LK+
Sbjct: 65  YVTDATCGPSRAGLITGKYQQRFGYEEINVPGYMSANSKFLADDMGLPLDQLTIADHLKK 124

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD-- 209
           LGY+T + GKWHLG   + Y PT RGF+   G+  G ++YF ++  ++    LD R +  
Sbjct: 125 LGYKTAMYGKWHLGDADR-YHPTKRGFDEFYGFRGGARNYFGYN--DVSKANLDNRMERG 181

Query: 210 ----LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
                EP      +Y TD    EAV  I  +  + P F+YLA  A H+     P+QA   
Sbjct: 182 FGNYQEPT-----EYVTDALAKEAVSFIEKNKGN-PFFIYLAFNAVHT-----PMQATKE 230

Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
            LN   ++   KR + AA+   LD + G V+  L++  +  N+I+VF +DNGG     + 
Sbjct: 231 DLNKFPNLTG-KRKELAAMTLALDRACGTVLNKLKELGLDKNTIVVFSNDNGGPT---DK 286

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           NA+ N PL G K+   EGG+R   LI W   ++S+  V    V   D LPT  +A 
Sbjct: 287 NASLNLPLSGTKSNHLEGGIRVPFLISWPKQIKSK-TVYNFPVSTLDLLPTFYAAG 341


>gi|149178145|ref|ZP_01856740.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
 gi|148843065|gb|EDL57433.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
          Length = 460

 Score =  174 bits (440), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 113/358 (31%), Positives = 181/358 (50%), Gaps = 32/358 (8%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L F     F   + ++  P+I+ I  DD G NDVG +G  +IPTP+ID LA  G++ + Y
Sbjct: 10  LMFLSLFAFCSQLQAAERPNILIIFTDDQGINDVGCYG-SEIPTPHIDQLAKEGLLFRQY 68

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGC--------ERGGLPLSEKILPQYLKEL 152
           Y+   +CTPSR  I+TG++P  +    + L G         +  G+   E  +   L++ 
Sbjct: 69  YSASAICTPSRFGILTGRNPTRS---QDQLLGALMFMSDIDQNRGIQPGETTIADVLQQN 125

Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP 212
           GY+T ++GKWHLG   + + PT  GF+   G+  G  DYF  +   +  W  + R   E 
Sbjct: 126 GYQTALLGKWHLGHGTESFLPTAHGFDLFRGHTGGCIDYFTMTYGNIPDWYHNQRHVSEN 185

Query: 213 AWDLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEP--------LQAP 263
                  Y+TD+ T EA   + +  +TD+P FL+L++ A H    + P        +QA 
Sbjct: 186 G------YATDLITEEAEHFLKDQQTTDKPFFLFLSYNAPHFGKGWSPGDQSPVNIMQAR 239

Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
              L     I+D  R +FAA+   LD+ +G+V+ +L+   +  N++++F++D+GG     
Sbjct: 240 GDDLKRVGTIKDKVRREFAAMTVSLDDGIGRVMSSLKNNGLDQNTLVIFMTDHGGDY--- 296

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            +   +N P RG K TL+EGG+R   +I  P     G    +     D  PT+   AN
Sbjct: 297 -VYGGNNQPFRGAKATLFEGGIRVPCIIRWPGKIKAGTETNEVAWALDLFPTICHFAN 353


>gi|319951998|ref|YP_004163265.1| n-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319420658|gb|ADV47767.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 471

 Score =  174 bits (440), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 117/353 (33%), Positives = 172/353 (48%), Gaps = 30/353 (8%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKN 100
           L   LS+    + ++   P+IIF+ +DD G+ D GF G  ++ TPN+D LA SG+   + 
Sbjct: 4   LILFLSICCFSISSAQEKPNIIFLFSDDAGFADFGFQGSTEMKTPNLDKLANSGVKFTQG 63

Query: 101 YYTVQLCTPSRSAIMTGKHPIHTG---------MQHNVLYGCERGGLPLSEKILPQYLKE 151
           Y T   C PSR+ ++TGK+    G         M  N  +  +  GLPL +  +  YLK+
Sbjct: 64  YVTDATCGPSRAGLITGKYQQRFGYEEINVPGYMSENSKFLADDMGLPLDQLTIGDYLKK 123

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL- 210
           LGY T + GKWHLG     + P  RGF+   G+  G + YF +           M R   
Sbjct: 124 LGYNTAMYGKWHLG-NADRFHPMNRGFDEFYGFRGGARSYFGYDVASSAHHDTKMERGFG 182

Query: 211 ---EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
              EP      +Y TD    EA+  I  +  + P F+YLA  A H+     P++A    L
Sbjct: 183 NFEEPQ-----EYVTDALADEAISFIEKNKKN-PFFIYLAFNAVHT-----PMEATKEDL 231

Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
           N   ++   KR + AA+   LD + G+V++ L+Q  +  N+IIVF +DNGG     + NA
Sbjct: 232 NQFPNLTG-KRKELAAMTLALDRACGRVLDKLKQLGLDKNTIIVFSNDNGGPT---DKNA 287

Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           + N PL G K+   EGG+R   L+  P            +   D LPT  +A 
Sbjct: 288 SLNLPLSGTKSNHLEGGIRVPFLMSWPKHIQPKTTYTFPISTMDLLPTFYAAG 340


>gi|87306602|ref|ZP_01088749.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
 gi|87290781|gb|EAQ82668.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
          Length = 468

 Score =  174 bits (440), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 128/381 (33%), Positives = 194/381 (50%), Gaps = 48/381 (12%)

Query: 30  YRTRIMAFAVLPLAFTLSMVFVDLVA---SSGPPHIIFILADDLGWNDVGFHGLDQIPTP 86
           + TR+M F     A   +++  + VA   S  PP I+ I++DD G+ D+   G +   TP
Sbjct: 2   FSTRLMTFVC---ALASALLVSNAVAAEKSKRPPSIVLIVSDDQGFADLSCIGDNGCRTP 58

Query: 87  NIDALAYSGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGM---------QHNVLYGCERG 136
            +D LA SG  L ++Y     CTPSR+++MTG++P   G           ++ LY  E  
Sbjct: 59  RLDQLAASGTRLTSFYVSWPACTPSRASLMTGRYPQRNGTYDMIRNEAPDYDYLYTPEEY 118

Query: 137 --------GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH 188
                   G  L E  L   LK+ GY + + GKW  G  K+ Y P  RGF+ + G+    
Sbjct: 119 AVTAERILGTDLQEVFLADVLKQAGYVSAVFGKWDGGQLKR-YLPLQRGFDQYYGFANTG 177

Query: 189 QDYFDHSAEEMKMWGL-DMRRDLEPAWDLHGKYSTDVFTAEAVDII-HNHSTDEPLFLYL 246
            DYF H     + +G+  M RD +P  +  G Y TD+F  EA+  I  NH  D P FLYL
Sbjct: 178 VDYFTH-----ERYGVPSMFRDNQPTEEDKGTYLTDLFEREAIRFIDENH--DRPFFLYL 230

Query: 247 AHAATHSANPYEP-----LQAPDHYLNIHRHIE---DFKRSKFAAILHKLDESVGKVVEA 298
              A HSA+  +       QAP  YL+     E   + +R  + A + ++DE++GKVV+ 
Sbjct: 231 PFNAPHSASNLDRSIRGFAQAPQEYLDHFPGGESKQEKRRQAYLAAVERMDEAIGKVVDQ 290

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L+Q ++  N++I+F+SDNG          A N PLRG K  ++EGG R   ++  P    
Sbjct: 291 LQQHQIADNTLIIFLSDNG------GGGGADNSPLRGGKAKMFEGGNRVPCIVHWPGKVP 344

Query: 359 RGIVAEQYVHVSDWLPTLLSA 379
            G V+ Q++   +  PT+++A
Sbjct: 345 AGKVSNQFLTSLEVFPTVIAA 365


>gi|403072042|pdb|4FDI|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A
 gi|403072043|pdb|4FDI|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A
 gi|403072044|pdb|4FDJ|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
           With Galnac
 gi|403072045|pdb|4FDJ|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
           With Galnac
          Length = 502

 Score =  173 bits (439), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 109/345 (31%), Positives = 173/345 (50%), Gaps = 42/345 (12%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
           PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  N+Y+   L +PSR+A++TG
Sbjct: 4   PPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLXSPSRAALLTG 63

Query: 118 KHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           + PI  G      +           GG+P SE++LP+ LK+ GY ++IVGKWHLG ++ +
Sbjct: 64  RLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQ 122

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
           + P   GF+   G    H   +D+ A       + + RD    W++ G+Y          
Sbjct: 123 FHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKT 174

Query: 222 -----TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
                T ++  EA+D I   +   P FLY A  ATH+     P+ A   +L         
Sbjct: 175 GEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----PVYASKPFLGTS------ 223

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
           +R ++   + ++D+S+GK++E L+   +  N+ + F SDNG A         SN P    
Sbjct: 224 QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCG 283

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           K T +EGG+R   L W P   + G V+ Q   + D   T L+ A 
Sbjct: 284 KQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAG 328


>gi|47522740|ref|NP_999120.1| N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
 gi|75054309|sp|Q8WNQ7.1|GALNS_PIG RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|18028088|gb|AAL55968.1|AF322917_1 N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
          Length = 522

 Score =  173 bits (439), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 115/363 (31%), Positives = 177/363 (48%), Gaps = 43/363 (11%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS   + +  +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++
Sbjct: 12  LLLVLSAAGLGVTGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSF 71

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
           Y    LC+PSR+A++TG+ PI TG         N     E  GG+P  E +LP+ LK  G
Sbjct: 72  YAANPLCSPSRAALLTGRLPIRTGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAG 131

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + RD    
Sbjct: 132 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARP----NIPVYRD---- 182

Query: 214 WDLHGKYS--------------TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYE 258
           W++ G++               T ++  EA+D I    +T  P FLY A  ATH+     
Sbjct: 183 WEMVGRFYEEFPINLKTGESNLTQIYLQEALDFIKRQQATHHPFFLYWAIDATHA----- 237

Query: 259 PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
           P+ A   +L         +R ++   + ++D+SVG++V  L   ++  N+ + F SDNG 
Sbjct: 238 PVYASRAFLGTS------QRGRYGDAVREIDDSVGRIVGLLRDLKIAGNTFVFFTSDNGA 291

Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
           A         SN P    K T +EGG+R   + W P     G V+ Q   V D   T LS
Sbjct: 292 ALVSAPKQGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSVMDLFTTSLS 351

Query: 379 AAN 381
            A 
Sbjct: 352 LAG 354


>gi|320102821|ref|YP_004178412.1| sulfatase [Isosphaera pallida ATCC 43644]
 gi|319750103|gb|ADV61863.1| sulfatase [Isosphaera pallida ATCC 43644]
          Length = 480

 Score =  173 bits (439), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 118/350 (33%), Positives = 180/350 (51%), Gaps = 31/350 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+I+ ILADDLGW D+ ++G  Q  TP++D LA  G     +YT   +C PSR+A++TGK
Sbjct: 36  PNIVLILADDLGWGDLSYNGRIQWSTPHLDDLAARGARFDRFYTAGVVCAPSRAALLTGK 95

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE----YTPT 174
           +PIH G   N         LP  E  + + LK  GY T + GKWH G   +E      P 
Sbjct: 96  NPIHCGTWRN------DHDLPAEEVTIAEALKAKGYHTALYGKWHQGRPTREGGTRTHPL 149

Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
            +GF+   G+ +       H+ E+      + R  +E      G+Y+ D FT + V+ I 
Sbjct: 150 DQGFDETFGFLSA-----THAWEKFPNHLFNGRDRVEVP---QGRYADDWFTDKGVEFIA 201

Query: 235 NHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH-IEDFKRSKFAAILHKLDESV 292
            H  D  P FLYL   +TH       ++AP+  +  HR    D  ++ +AAI+ ++D  V
Sbjct: 202 RHKDDPTPFFLYLPLTSTHFH-----IEAPEDEIAKHRSTFSDPIQTAYAAIVTRMDAQV 256

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA----SNWPLRGVKNTLWEGGVRGA 348
           G++V+A+    +  +++IVF SD+G    G +  A+    SN P RG K T++EGG+R  
Sbjct: 257 GRIVQAVRDAGLEDSTLIVFTSDHGATFEGGSKEASSSLDSNHPFRGQKRTVYEGGIRVP 316

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
           G++  P +   G +    V   D LPT L  A     P++    + NI+P
Sbjct: 317 GIMTYPGVIPAGRIVSTPVQTIDLLPTFLELAGGRPEPDWKVDGL-NILP 365


>gi|372210513|ref|ZP_09498315.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 465

 Score =  173 bits (438), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 121/372 (32%), Positives = 190/372 (51%), Gaps = 35/372 (9%)

Query: 42  LAFTLSMVFVDLVASS-GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-K 99
           + F +    + LV ++   P+I+ + ADD G+ D GF G   + TPN+D LA SG+   +
Sbjct: 4   ITFLICFFGIHLVVNAQNKPNIVLLFADDAGYMDFGFQGSKVMKTPNLDKLAKSGVTFTQ 63

Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQ----------HNVLYGCERGGLPLSEKILPQYL 149
            Y +   C PSR+ +MTGK+    G +          H+ L G E G LPL +K++  YL
Sbjct: 64  GYVSDPTCGPSRAGMMTGKYQARFGYEEINVPGYMSSHSALKGDEMG-LPLDQKLMSNYL 122

Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD 209
           K+LGY+T + GKWHLG     + P  RGF+   G+  G + YF +             R 
Sbjct: 123 KDLGYKTAVYGKWHLG-NADRFHPLNRGFDEFYGFRGGARSYFAYKNPTGD-------RK 174

Query: 210 LEPAWDLH---GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
           +E  +  +     Y+TDVF  +AV  I  +  + P F+YL+  A H+     P++A +  
Sbjct: 175 METNFGQYEEPNHYATDVFAEKAVHFIERNK-EHPFFIYLSFNAVHT-----PMEATEAD 228

Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN 326
           L    ++   KR + AA+   LD + GKV+  L++  +  N+I+VF +DNGG +   + N
Sbjct: 229 LAQFPNLTG-KRQQLAAMTLALDRACGKVLNKLKELGIDRNTIVVFSNDNGGPS---DKN 284

Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           A++N PL G K+   EGG+R   L+  P    +       V   D LPT  +AA   D+ 
Sbjct: 285 ASNNKPLSGTKSNHLEGGIRVPFLMSWPKEIKKKSRFNYPVVTFDLLPTFYAAAG-GDLK 343

Query: 387 NYVNSTVENIIP 398
           +  +    N++P
Sbjct: 344 DLKDINGVNVLP 355


>gi|75910438|ref|YP_324734.1| twin-arginine translocation pathway signal protein [Anabaena
           variabilis ATCC 29413]
 gi|75704163|gb|ABA23839.1| Twin-arginine translocation pathway signal [Anabaena variabilis
           ATCC 29413]
          Length = 457

 Score =  173 bits (438), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 119/378 (31%), Positives = 191/378 (50%), Gaps = 35/378 (9%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           M  A   +A   + +F    A S  P+++FIL DD+GW D+  +G     TPN+D LA  
Sbjct: 17  MTAAGTLMATASANLFSRATAQSSRPNVVFILVDDMGWGDLSIYGRTDYETPNLDRLARQ 76

Query: 95  GIILKNYYTVQ-LCTPSRSAIMTGKHP--IHTGMQHNVLYGCERG----GLPLSEKILPQ 147
           G+   N Y  Q +CTP+R A +TG++   +  G++  +    +      G+P ++  +  
Sbjct: 77  GVRFTNAYANQTVCTPTRIAFLTGRYQARLPVGLREPLGARSQPASNNIGIPANQPTIAS 136

Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR 207
            LK  GY T +VGKWH G Y   + P  +GF+ + G+ +G  +YF H+  +     LD+ 
Sbjct: 137 LLKANGYETALVGKWHAG-YPPNFGPLQKGFDEYFGHLSGGIEYFTHTGTDRI---LDLY 192

Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
            +  P       Y TD+FT  AV+ I   HS   P +L L     H   P+ P Q P+  
Sbjct: 193 ENDVPV--QRSGYVTDLFTDRAVEFIQRPHS--RPFYLSL-----HYNAPHWPWQGPNDQ 243

Query: 267 LNIHRHIED-----FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA 321
            +   ++ +       ++ +AA++  LD+ VG+V++ALE      N++++F SDNGG   
Sbjct: 244 ASTAFYLTNGYTVGGSQATYAAMVKSLDDGVGRVLDALEASGQADNTLVIFTSDNGG--- 300

Query: 322 GFNLNAASNW-PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
                  SN+ P RG K +L+EGG+R   +I  P +     V+ Q +   D   T+L+A 
Sbjct: 301 ----ERFSNFGPFRGQKASLYEGGIRVPAIIRYPGVTQANQVSNQVIITFDLTATILAAT 356

Query: 381 NKSDIPNYVNSTVENIIP 398
             S  PNY     +N++P
Sbjct: 357 GTSFHPNYPPDG-QNLLP 373


>gi|320105193|ref|YP_004180784.1| sulfatase [Isosphaera pallida ATCC 43644]
 gi|319752475|gb|ADV64235.1| sulfatase [Isosphaera pallida ATCC 43644]
          Length = 481

 Score =  173 bits (438), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 121/357 (33%), Positives = 184/357 (51%), Gaps = 31/357 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
           PHI+ I+ DDLG+ D+G +G   I TP ID+LA  G  L ++++   +CTP+R+A++TG+
Sbjct: 65  PHIVLIMTDDLGYADLGCYGAPDIATPRIDSLARDGARLSHFHSPGPVCTPTRAALLTGR 124

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
            P   G++  +       GLP+ E IL + LKE+GYRT +VGKWHLG Y+ E+ P   GF
Sbjct: 125 WPQRVGLEWALSASDTEPGLPVEEPILSRPLKEVGYRTVMVGKWHLG-YRPEFGPNAHGF 183

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           +   G  +G+ D++ H     K    D   + +P   + G YSTD+ +  AV  I   + 
Sbjct: 184 DEFFGLLSGNVDHYSHREINGKE---DWYENTKPV-RVEG-YSTDLLSDRAVAAIQKTAA 238

Query: 239 D-----EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF--KRSKFAAILHKLDES 291
                 +PL+LY+A+ A H      P Q P    +I      F   R+ +  ++  +D  
Sbjct: 239 QPPDQRQPLWLYVAYNAVH-----WPFQPPGRPQDIRDRSTWFNGTRADYVKMVESIDAG 293

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           VG++++ALE   M  +++++F +DNGG         + N P    K T+WEGG R   LI
Sbjct: 294 VGRILDALEAGGMADHTLVIFTNDNGGE------RLSDNGPFFHHKGTVWEGGHRVPCLI 347

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYE 408
             P   + G    Q     D   T+L+AA     P      V N++P     ILR E
Sbjct: 348 RWPGRIAAGTEFGQPTIGMDLTATILAAAKIPPNPERPLDGV-NLVP-----ILRGE 398


>gi|171184398|ref|NP_057931.3| N-acetylgalactosamine-6-sulfatase isoform 1 precursor [Mus
           musculus]
 gi|124007189|sp|Q571E4.2|GALNS_MOUSE RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|74198064|dbj|BAE35212.1| unnamed protein product [Mus musculus]
 gi|148679747|gb|EDL11694.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_b [Mus
           musculus]
          Length = 520

 Score =  173 bits (438), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 112/346 (32%), Positives = 174/346 (50%), Gaps = 43/346 (12%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
           PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++Y+   LC+PSR+A++TG
Sbjct: 27  PPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTG 86

Query: 118 KHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           + PI  G         N     E  GG+P SE +LP+ LK+ GY  +IVGKWHLG ++ +
Sbjct: 87  RLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQ 145

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
           + P   GF+   G    H   +D+ A+      + + RD    W++ G++          
Sbjct: 146 FHPLKHGFDEWFGSPNCHFGPYDNKAKP----NIPVYRD----WEMVGRFYEEFPINRKT 197

Query: 222 -----TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
                T ++T EA+D I   H+   P FLY A  ATH+     P+ A   +L        
Sbjct: 198 GEANLTQLYTQEALDFIQTQHARQSPFFLYWAIDATHA-----PVYASRQFLGTSL---- 248

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
             R ++   + ++D+SVGK++  L+   +  N+ + F SDNG A         SN P   
Sbjct: 249 --RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLC 306

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            K T +EGG+R   + W P   + G V+ Q   + D   T LS A 
Sbjct: 307 GKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 352


>gi|149175233|ref|ZP_01853855.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
 gi|148845842|gb|EDL60183.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
          Length = 459

 Score =  172 bits (437), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 125/373 (33%), Positives = 191/373 (51%), Gaps = 39/373 (10%)

Query: 33  RIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALA 92
           R + FA+      L + F    A+  PP+I+ I+ADDLG+ D+  +G  Q+ TP+ID LA
Sbjct: 8   RSVLFALFIAVSCLLIRFSAAEAAQQPPNIVLIMADDLGYGDLACYGNKQVKTPHIDRLA 67

Query: 93  YSGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERG---GLPLSEKILPQY 148
            S +   ++++   +CTP+R+A++TG++    G Q       +     GLP     + + 
Sbjct: 68  ASALKFTDFHSAGAMCTPTRAAMLTGQYQQRFGRQFESALSGKSNHDIGLPHQAVTMAEL 127

Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTG---HQDYFDHSAEEMKMWGLD 205
           LK+ GY T   GKWHLG Y+  + PT +GF+   G  +G   H  + D S  E   W  +
Sbjct: 128 LKQQGYATACFGKWHLG-YQPPWLPTNQGFDLFRGLTSGDGDHHTHVDRSGNE--DWWHN 184

Query: 206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
               +E        Y+ D+ +  +V  +  + T  P FLY+ H A H   P++  Q P H
Sbjct: 185 NEISMEKG------YTADLLSKYSVAFMEANRT-RPFFLYVPHLAIHF--PWQGPQDPPH 235

Query: 266 YLNIHRHIEDFKRSKFA-------------AILHKLDESVGKVVEALEQRRMLSNSIIVF 312
                +  +D+   K+              A++  LD+SVGK++ AL++  +  N++++F
Sbjct: 236 ----RKAGQDYHAGKWGIIPDPGNVSPHTTAMIESLDQSVGKILSALKRLDLEQNTLVIF 291

Query: 313 VSDNGG-AAAGFNL-NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
            SDNGG    G N  N +SN PLRG K TL+EGG R   LI  P + + G V +Q  H  
Sbjct: 292 TSDNGGYLTYGKNFQNISSNGPLRGQKATLYEGGHRVPCLISWPGVITAG-VTDQTAHSV 350

Query: 371 DWLPTLLSAANKS 383
           D LPTL  AA  S
Sbjct: 351 DLLPTLAQAAGIS 363


>gi|114326198|ref|NP_001041585.1| N-acetylgalactosamine-6-sulfatase precursor [Canis lupus
           familiaris]
 gi|122138594|sp|Q32KH5.1|GALNS_CANFA RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|81158068|tpe|CAI85008.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Canis lupus
           familiaris]
          Length = 522

 Score =  172 bits (437), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 116/368 (31%), Positives = 176/368 (47%), Gaps = 53/368 (14%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS   +    +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++
Sbjct: 12  LLLVLSAAGLGAAGAPQPPNILLLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSF 71

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER------------GGLPLSEKILPQY 148
           Y+   LC+PSR+A++TG+ PI  G      Y   R            GG+P  E +LP+ 
Sbjct: 72  YSANPLCSPSRAALLTGRLPIRNG-----FYTTNRHARNAYTPQEIVGGIPDQEHVLPEL 126

Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
           LKE GY ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + R
Sbjct: 127 LKEAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARP----NIPVYR 181

Query: 209 DLEPAWDLHGKYS--------------TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHS 253
           D    W++ G+Y               T V+  EA+D I    +   P FLY A  ATH+
Sbjct: 182 D----WEMVGRYYEEFPINLKTGEANLTQVYLQEALDFIKRQQAAQRPFFLYWAIDATHA 237

Query: 254 ANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
                P+ A   +L         +R ++   + ++D SVGK++  L+  R+  N+ + F 
Sbjct: 238 -----PVYASRPFLGTS------QRGRYGDAVREIDNSVGKILSLLQDLRISENTFVFFT 286

Query: 314 SDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWL 373
           SDNG A         SN P    K T +EGG+R   + W P     G V+ Q   + D  
Sbjct: 287 SDNGAALISAPNQGGSNGPFLCGKQTTFEGGMREPAIAWWPGRIPAGRVSHQLGSIMDLF 346

Query: 374 PTLLSAAN 381
            T LS A 
Sbjct: 347 TTSLSLAG 354


>gi|149177363|ref|ZP_01855968.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
 gi|148843888|gb|EDL58246.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
          Length = 466

 Score =  172 bits (436), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 118/335 (35%), Positives = 178/335 (53%), Gaps = 22/335 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+I+ I AD+LG+ D+G +G   + TP +D LA  G+ L ++YT    CT SR+ ++TG+
Sbjct: 35  PNILLITADNLGYGDLGCYGNPVMKTPMLDQLASEGVRLTDFYTASPTCTVSRATLLTGR 94

Query: 119 HPIHTGMQHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
           +P   G+ H +      G GL  SE ++P+YLK+ GYRT   GKW++GF      PT RG
Sbjct: 95  YPQRIGLNHQLSADENYGDGLRKSEVLIPEYLKQQGYRTACFGKWNVGFSPGS-RPTERG 153

Query: 178 FESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           F+   G+  G+ DY+ H  A    +W     R L+  + + G YSTD+F   A   I   
Sbjct: 154 FDEFFGFAAGNIDYYHHYYAGRHDLW-----RGLKEVF-VEG-YSTDLFADAACQYISAE 206

Query: 237 STDEPLFLYLAHAATH--SANPYEP-----LQAPDHYLNIHRHIEDFK--RSKFAAILHK 287
           S D+P F+YL   A H  S    +P      QAPD     + +    K  + ++ A++  
Sbjct: 207 S-DQPFFIYLPFNAPHFPSQRNKQPGQGNEWQAPDLAFEKYGYDPQTKNPQERYRAVVTA 265

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG-FNLNAASNWPLRGVKNTLWEGGVR 346
           LD ++G+V++ L+   +   +I+++ SDNG        L  ASN PLR    TLWEGG+R
Sbjct: 266 LDSAIGRVLKQLDTSGLRDQTIVIWYSDNGAFMLKERGLEVASNKPLRDGGVTLWEGGIR 325

Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
              +I  P     G V +  +   D LPTL++ A 
Sbjct: 326 VPAIIRYPGHLKAGTVNQSPLISLDILPTLITLAG 360


>gi|372221524|ref|ZP_09499945.1| sulfatase [Mesoflavibacter zeaxanthinifaciens S86]
          Length = 461

 Score =  172 bits (436), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 120/380 (31%), Positives = 191/380 (50%), Gaps = 51/380 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+I+ ++ADD GWNDVG+HG  +I TP +D+LA +G  L+ +Y    C+PSR+A++TG  
Sbjct: 36  PNILLVIADDAGWNDVGYHG-SKIKTPVLDSLANNGAKLERFYVAPTCSPSRAALLTGIP 94

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
               G+   +  G  +  LP S   LP+ +K+LGYRT + GKWHLG    E  P   GF+
Sbjct: 95  ASRLGIVAPIA-GKSKIALPDSLVTLPKAMKKLGYRTALFGKWHLGL-TPENGPQAYGFD 152

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY------STDVFTAEAVDII 233
           +  G+  G  D + H             ++ +P+W  +GK+       TD+ T  A+   
Sbjct: 153 TSYGFLHGQIDQYTHE-----------YKNGDPSWHKNGKFLKEDGHVTDLLTDAAIAYF 201

Query: 234 HNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI-EDFKRSKFAAILHKLDES 291
           +  + T+ P F+ LA++A     P+ PLQ  + Y   +R I  D  R  +AA +  LD +
Sbjct: 202 NQETKTETPSFVTLAYSA-----PHFPLQEEERYKRPYRTIFTDSSRVDYAAAMTHLDIA 256

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGG-----------AAAGFNLNAASNWPLRGVKNTL 340
            G+++ +L++   L N++I+F+SDNG               G N     N PL+  K T 
Sbjct: 257 FGRLLNSLKRSGKLDNTLILFMSDNGAMKNWYPKDQYQGKHGPNTTLGDNTPLKDYKTTN 316

Query: 341 WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPR 399
           +EG +R    + W      + +  + ++ V D LPTL+     S  PN    T+E     
Sbjct: 317 YEGAIRVPAFVYWKNHFNGKTL--DDFIGVIDVLPTLVELTGASVKPN----TIEG---- 366

Query: 400 YENSILRYENGTHEYNSPRI 419
             +S L   NG  +   P++
Sbjct: 367 --SSFLNLLNGASQTKIPQL 384


>gi|7527462|gb|AAF63155.1|AF111346_1 N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
 gi|7576473|gb|AAF63858.1| N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
          Length = 520

 Score =  172 bits (436), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 112/346 (32%), Positives = 173/346 (50%), Gaps = 43/346 (12%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
           PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++Y+   LC+PSR+A++TG
Sbjct: 27  PPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTG 86

Query: 118 KHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           + PI  G         N     E  GG+P SE +LP+ LK+ GY  +IVGKWHLG ++ +
Sbjct: 87  RLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQ 145

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
           + P   GF    G    H   +D+ A+      + + RD    W++ G++          
Sbjct: 146 FHPLKHGFNEWFGSPNCHFGPYDNKAKP----NIPVYRD----WEMVGRFYEEFPINRKT 197

Query: 222 -----TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
                T ++T EA+D I   H+   P FLY A  ATH+     P+ A   +L        
Sbjct: 198 GEANLTQLYTQEALDFIQTQHARQSPFFLYWAIDATHA-----PVYASRQFLGTSL---- 248

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
             R ++   + ++D+SVGK++  L+   +  N+ + F SDNG A         SN P   
Sbjct: 249 --RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLC 306

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            K T +EGG+R   + W P   + G V+ Q   + D   T LS A 
Sbjct: 307 GKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 352


>gi|60359902|dbj|BAD90170.1| mFLJ00319 protein [Mus musculus]
          Length = 537

 Score =  172 bits (436), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 112/346 (32%), Positives = 174/346 (50%), Gaps = 43/346 (12%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
           PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++Y+   LC+PSR+A++TG
Sbjct: 44  PPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTG 103

Query: 118 KHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           + PI  G         N     E  GG+P SE +LP+ LK+ GY  +IVGKWHLG ++ +
Sbjct: 104 RLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQ 162

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
           + P   GF+   G    H   +D+ A+      + + RD    W++ G++          
Sbjct: 163 FHPLKHGFDEWFGSPNCHFGPYDNKAKP----NIPVYRD----WEMVGRFYEEFPINRKT 214

Query: 222 -----TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
                T ++T EA+D I   H+   P FLY A  ATH+     P+ A   +L        
Sbjct: 215 GEANLTQLYTQEALDFIQTQHARQSPFFLYWAIDATHA-----PVYASRQFLGTSL---- 265

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
             R ++   + ++D+SVGK++  L+   +  N+ + F SDNG A         SN P   
Sbjct: 266 --RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLC 323

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            K T +EGG+R   + W P   + G V+ Q   + D   T LS A 
Sbjct: 324 GKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 369


>gi|346464573|gb|AEO32131.1| hypothetical protein [Amblyomma maculatum]
          Length = 418

 Score =  172 bits (435), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 94/213 (44%), Positives = 121/213 (56%), Gaps = 3/213 (1%)

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           Y PT RGF++  GY+   + YF        + G D   +     D  G Y T + T  AV
Sbjct: 1   YLPTRRGFDTFYGYYNTAKYYFTQKLNFYNICGRDYWDNETLVTDKDGTYDTHLITDRAV 60

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
           DII +H  D+PLF+Y+A  A H       + AP   L    +I D  R+  A  +  LDE
Sbjct: 61  DIISSHDPDKPLFIYMAPLAAHGQTTNLTVDAPQRNLAKFPYIGDHNRTLLAGAVDALDE 120

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
           SVG++V AL QRRML+NS++VF SDNGG   G   N  SNWPLRG K T+WEGGVR   +
Sbjct: 121 SVGRIVAALHQRRMLANSVLVFTSDNGGIPWGDFSNTGSNWPLRGTKGTMWEGGVRLPAV 180

Query: 351 IWSPLLE--SRGIVAEQYVHVSDWLPTLLSAAN 381
           +WS +L   SR +V    +H+ DWLPTL SAA 
Sbjct: 181 VWSSMLAPGSR-VVVPHPMHLVDWLPTLYSAAG 212


>gi|301788958|ref|XP_002929896.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like, partial
           [Ailuropoda melanoleuca]
          Length = 519

 Score =  172 bits (435), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 115/363 (31%), Positives = 174/363 (47%), Gaps = 43/363 (11%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS      + +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  N+
Sbjct: 9   LLLVLSAAGFGAMGTPKPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNF 68

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
           Y+   LC+PSR+A++TG+ PI  G         N     E  GG+P  E +LP+ LK  G
Sbjct: 69  YSANPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDGEHLLPELLKGAG 128

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + RD    
Sbjct: 129 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARP----NIPVYRD---- 179

Query: 214 WDLHGKYS--------------TDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYE 258
           W++ G+Y               T V+  EA+D +      + P FLY A   TH+     
Sbjct: 180 WEMVGRYYEEFPINLQTGEANLTQVYLQEALDFMKRQQVAQRPFFLYWAIDGTHA----- 234

Query: 259 PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
           P+ A   +L   R      R ++   + ++D+SVGK++  L   R+  N+ + F SDNG 
Sbjct: 235 PVYASRPFLGTSR------RGRYGDAVREIDDSVGKILSLLRDLRITENTFVFFTSDNGA 288

Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
           A         SN P    K T +EGG+R   + W P     G V+ Q   V D   T L+
Sbjct: 289 ALISAPKQGGSNGPFLCGKQTTFEGGMREPAIAWWPGRIPAGQVSHQLGSVMDLFTTSLA 348

Query: 379 AAN 381
            A 
Sbjct: 349 LAG 351


>gi|109897220|ref|YP_660475.1| sulfatase [Pseudoalteromonas atlantica T6c]
 gi|109699501|gb|ABG39421.1| sulfatase [Pseudoalteromonas atlantica T6c]
          Length = 471

 Score =  172 bits (435), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 114/342 (33%), Positives = 176/342 (51%), Gaps = 29/342 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
           P+I+F+ +DD G+ D GF G + + TPN+D LA  G+   + Y +   C PSR+ IMTG+
Sbjct: 27  PNIVFLFSDDAGYADFGFQGSETMKTPNLDQLASEGVRFTQGYVSDSTCGPSRAGIMTGR 86

Query: 119 HPIHTGM----------QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
           +    G           +H+ + G E  G+PL E  +  Y+K LGYRT   GKWHLG   
Sbjct: 87  YQQKFGYEEINVPGYMSEHSAIKGAEM-GIPLDEVTMGDYMKSLGYRTAFYGKWHLG-GT 144

Query: 169 KEYTPTFRGFESHLGYWTGHQDY--FDHSAEEMKMWGLDMRRDLEPAWDL---HGKYSTD 223
            E  P  RGF+   G+  G + Y  ++ +A E K   +   + LE   D    H  Y TD
Sbjct: 145 DELHPMHRGFDEFYGFRGGDRSYWAYEVNAPERKS-AVFTDKKLEHGIDQFQEHEGYLTD 203

Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
           V   +A   I   + D+P F++L+  A H+     P++A    L     ++  KR + AA
Sbjct: 204 VLAEKANQFIEK-APDKPFFIFLSFNAVHT-----PMEATPEDLAKFPQLKG-KRKEVAA 256

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
           +   LD + G V+  L++  +  ++++VF +DNGG     + NA+SN+PL G K+   EG
Sbjct: 257 MTLALDRASGAVLNKLKELGLEDDTLVVFSNDNGGPT---DKNASSNYPLAGTKSNFLEG 313

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
           G+R   L+  P   + G V ++ V   D LPT   A    ++
Sbjct: 314 GIRVPFLVKWPAKLAAGKVYDKPVSTLDLLPTFFKAGGGEEV 355


>gi|386821789|ref|ZP_10109005.1| arylsulfatase A family protein [Joostella marina DSM 19592]
 gi|386426895|gb|EIJ40725.1| arylsulfatase A family protein [Joostella marina DSM 19592]
          Length = 474

 Score =  171 bits (434), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 116/357 (32%), Positives = 181/357 (50%), Gaps = 27/357 (7%)

Query: 33  RIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALA 92
           ++M F++L L++ +S +      +   P+I+FILADD G+ D GF G  +  TP++D LA
Sbjct: 16  KVMLFSML-LSWCVSTL------AQKQPNIVFILADDAGYADFGFQGSSEFKTPHLDQLA 68

Query: 93  YSGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG--------CERGGLPLSEK 143
              I   + Y +  +C PSR+ I+TGK+    G + N + G         +  GLPL +K
Sbjct: 69  SQSIRFSQAYVSAAVCGPSRAGILTGKYQQRFGYEENNVPGYMSASATTGDEMGLPLDQK 128

Query: 144 ILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWG 203
           +LP+YLKE GY+T + GKWH+G   K + PT RGF++  G+  G + Y++ +        
Sbjct: 129 LLPEYLKEQGYKTALFGKWHMGNADK-FHPTKRGFDTFYGFRGGARSYYEFNENNKNNRQ 187

Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
            D        ++    Y TD       D I  +   +P F+YL+  A H+     P++A 
Sbjct: 188 EDRLERGFGNFEESKLYLTDALAEATTDFIEKNQ-KQPFFVYLSFNAVHT-----PMEAR 241

Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
              L    +++  KR   AA+   LD + G V+  L++  +  N++IVF +DNGG +   
Sbjct: 242 PDDLKQFPNLKG-KRKTLAAMTLGLDRACGNVLAKLDELGVRENTLIVFTNDNGGPSG-- 298

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             NA+SN PL G K    EGG+R   L+  P       V +  +   D LPT    A
Sbjct: 299 -TNASSNKPLGGTKANHLEGGIRVPFLMSWPKQLKENTVYDYPISTLDLLPTFYQIA 354


>gi|336424342|ref|ZP_08604383.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336003446|gb|EGN33530.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 460

 Score =  171 bits (433), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 110/333 (33%), Positives = 173/333 (51%), Gaps = 24/333 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P++I+ILADD+G+ D G        TPN+D L   G  + + Y    +C P+R+A++TG+
Sbjct: 8   PNVIYILADDMGYGDFGIFSDGSARTPNLDRLVRQGCAMSHCYAASPVCAPARAALLTGR 67

Query: 119 HPIHTGMQHNV-LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
           +P  TG        G +R  + L E  L    +  GYRT ++GKWHLG   KEY P  RG
Sbjct: 68  YPHRTGAVDTYEAIGGDR--MALREVTLADVYRANGYRTGLIGKWHLGLIGKEYHPCRRG 125

Query: 178 FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
           F++ +G+  G  DY+ +         LD    LE +    G Y TDV T E++  I  + 
Sbjct: 126 FDTFIGFRGGWSDYYQYK--------LDRNGILEAS---DGTYMTDVITEESIRFIRENR 174

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
            ++P FL+ A+ A     P+ P Q P+ Y    R   +   +    ++  +DE +GK+++
Sbjct: 175 -EQPFFLHAAYNA-----PHFPFQCPEEYAAPFRKRFNPTLATLYGMIACMDEGIGKLLD 228

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
            LE+  +  N+I++F SDNG    G + N  + + L G K   +EGG+R   +I  P   
Sbjct: 229 VLEETGLSDNTIVIFASDNGPQLYG-DTNRYNCY-LNGQKGEAFEGGIRVPAVIRWPGHI 286

Query: 358 SRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVN 390
           +       + H  DW PTLLSA  + ++P+ V+
Sbjct: 287 APDSRCHAFFHGVDWFPTLLSAC-RLELPDGVS 318


>gi|431797524|ref|YP_007224428.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
 gi|430788289|gb|AGA78418.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
          Length = 442

 Score =  171 bits (432), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 121/356 (33%), Positives = 177/356 (49%), Gaps = 36/356 (10%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           ++ F  LPLA         LV +   P+II I++DD G+ D  F       TPNID +A 
Sbjct: 7   LLLFIWLPLA---------LVVAQERPNIIVIVSDDAGYADFSFQSTKLTTTPNIDRIAQ 57

Query: 94  SGIILKN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERG------GLPLSEKILP 146
            G    N Y T  +C+PSR+ +++G +    G  +N + G E        G+P + K + 
Sbjct: 58  EGAKFTNAYVTASVCSPSRAGLLSGTNQPEFGHIYNYIKGVEYSIQQHEYGIPHNIKTIG 117

Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDM 206
           +YL+ LGY T   GKWH GF  +E+ P  +GF+   G+  G   Y  H+ +      +D+
Sbjct: 118 EYLQPLGYATAAFGKWHEGF-AEEFQPEQKGFDHFWGFLWGANHY--HTGK-----AIDV 169

Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
             D +P       Y TD  T EA+     H  D P FLY+A+ A HS     PL+A   +
Sbjct: 170 IDDGKPVDPTTIPYMTDAITDEALSFASAHEED-PFFLYIAYNAVHS-----PLEAKPEH 223

Query: 267 LNIH--RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN 324
           + ++  +  +D KR   AA+ H LDE+VGK+   LE    L N+II F++DNGG+     
Sbjct: 224 IALYEGKFPDDPKREILAAMTHSLDENVGKIFNRLEAMGELDNTIIFFINDNGGSKH--- 280

Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             +A N PL+G K  L+EGG+R    +  P       V        D LP++L A 
Sbjct: 281 -ISADNSPLKGYKGQLYEGGIRVPFAVRWPGKIKGKTVCHSITSSLDILPSILDAV 335


>gi|398828648|ref|ZP_10586848.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
 gi|398217506|gb|EJN04023.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
          Length = 470

 Score =  171 bits (432), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 111/331 (33%), Positives = 164/331 (49%), Gaps = 32/331 (9%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
           P+IIFILADDLG+ D+  +G   I TP ID +   G+ +L+ Y    +C+ +R+AIMTG+
Sbjct: 36  PNIIFILADDLGYADLSSYGHPTIRTPAIDKIGNDGVRLLQAYSNSPVCSATRTAIMTGQ 95

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +     +           GLP S+  LP  LK+ GY T ++GKWHLG Y K Y P   G+
Sbjct: 96  YQYRLALGLEEPLAGRDIGLPPSQTTLPSLLKQAGYETTLIGKWHLGAYPK-YGPLKSGY 154

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH-NHS 237
           +   G+      Y++H        G D   D  P       Y TD+   + V++I  + S
Sbjct: 155 DHFYGFRGSALSYYNH--------GKDFWEDDAPV--EKAGYFTDLLGDKTVELIQKSDS 204

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHY--------LNIHRHIEDFKRSKFAAILHKLD 289
            + P F     A+ H   P+ P +AP            N     +   +  +  I+ ++D
Sbjct: 205 CERPFF-----ASVHFNAPHWPWEAPGEQGKKEAARKKNSLAAFDAGTQDVYRQIVERMD 259

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
             VG++++AL  +++  N+I+VF SDNGG         ++ WP  G K+ L EGG+R   
Sbjct: 260 FQVGRILQALTDKKIRENTIVVFTSDNGGE------RFSNTWPFSGKKSELLEGGIRIPS 313

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           LI  P     G  +EQ     DWLPTLL AA
Sbjct: 314 LICWPAKIPSGQTSEQVTLSMDWLPTLLEAA 344


>gi|13278373|gb|AAH04002.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Mus musculus]
          Length = 520

 Score =  171 bits (432), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 112/351 (31%), Positives = 175/351 (49%), Gaps = 43/351 (12%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRS 112
           V +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++Y+   LC+PSR+
Sbjct: 22  VGAPQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRA 81

Query: 113 AIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
           A++TG+ PI  G         N     E  GG+P SE +LP+ LK+ GY  +IVGKWHLG
Sbjct: 82  ALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG 141

Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS---- 221
            ++ ++ P   GF+   G    H   +D+ A+      + + RD    W++ G++     
Sbjct: 142 -HRPQFHPLKHGFDEWFGSPNCHFGPYDNKAKP----NIPVYRD----WEMVGRFYEEFP 192

Query: 222 ----------TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
                     T ++  EA+D I   H+   P FLY A  ATH+     P+ A   +L   
Sbjct: 193 INRKTGEANLTQLYLQEALDFIRTQHARQGPFFLYWAIDATHA-----PVYASRQFLGTS 247

Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
                  R ++   + ++D+SVGK++  L+   +  N+ + F SDNG A         SN
Sbjct: 248 L------RGRYGDAVREIDDSVGKILNLLQNLGISKNTFVFFTSDNGAALISAPNEGGSN 301

Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            P    K T +EGG+R   + W P   + G V+ Q   + D   T LS A 
Sbjct: 302 GPFLCGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 352


>gi|298375849|ref|ZP_06985805.1| arylsulfatase [Bacteroides sp. 3_1_19]
 gi|423330336|ref|ZP_17308120.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
           CL03T12C09]
 gi|298266886|gb|EFI08543.1| arylsulfatase [Bacteroides sp. 3_1_19]
 gi|409231952|gb|EKN24800.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
           CL03T12C09]
          Length = 452

 Score =  170 bits (431), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 137/416 (32%), Positives = 213/416 (51%), Gaps = 41/416 (9%)

Query: 38  AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGI 96
           A   LAF+ +      +  S  P+I+ IL DDLG+ D+   G  + I TP+ID L   G+
Sbjct: 8   AACGLAFSAASAQTKALPDS--PNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGV 65

Query: 97  ILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELG 153
              N++    + +PSR++++TG++P   G+   +    E     LSE   +LPQ LK+ G
Sbjct: 66  RFTNFHANCPVSSPSRASLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRG 125

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEP 212
           Y   +VGKW+LG  +   TPT RGF+ + G+     D Y+ H     + +G +  R+   
Sbjct: 126 YHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH-----RRFGNNYMRENLK 179

Query: 213 AWDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
             D  G ++T++F+  A+  + +     EP FLYLA+ A H+     P+Q P  +L   +
Sbjct: 180 EIDPQG-HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVK 233

Query: 272 HIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
             E     KR+K  A++  LD +VG+V EALEQ   L N+II+F SDNGG         A
Sbjct: 234 KREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQDDA----GA 289

Query: 329 SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           +N P RG K  ++EGG+R AG I W    + R  V + +V +SD  PTL    + + +P 
Sbjct: 290 NNGPFRGAKQDMYEGGIRVAGGIYWKN--QIRPAVRDNFVMLSDMFPTL---CDLTAVP- 343

Query: 388 YVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYEN 443
            V+  ++ I      SIL    G  +    R+ +   R  N  +     Y +RY++
Sbjct: 344 -VSHEIDGI------SILPLLRGEEQDTGDRMVHWVRREGNSRYGGKAYYASRYKD 392


>gi|256841083|ref|ZP_05546590.1| arylsulphatase A [Parabacteroides sp. D13]
 gi|256736926|gb|EEU50253.1| arylsulphatase A [Parabacteroides sp. D13]
          Length = 452

 Score =  170 bits (431), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 138/416 (33%), Positives = 212/416 (50%), Gaps = 41/416 (9%)

Query: 38  AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGI 96
           A   LAF+ +      +  S  P+I+ IL DDLG+ D+   G  + I TP+ID L   G+
Sbjct: 8   AACGLAFSAASAQTKALPDS--PNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGV 65

Query: 97  ILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELG 153
              N++    + +PSR++++TG++P   G+   +    E     LSE   +LPQ LK+ G
Sbjct: 66  RFTNFHANCPVSSPSRASLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRG 125

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEP 212
           Y   +VGKW+LG  +   TPT RGF+ + G+     D Y+ H       +G +  R+   
Sbjct: 126 YHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTHCR-----FGNNYMRENLK 179

Query: 213 AWDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
             D  G ++T+VF+  A+  + +     EP FLYLA+ A H+     P+Q P  +L   +
Sbjct: 180 EIDPQG-HATEVFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVK 233

Query: 272 HIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
             E     KR+K  A++  LD +VG+V EALEQ   L N+II+F SDNGG         A
Sbjct: 234 KREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQDDA----GA 289

Query: 329 SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           +N P RG K  ++EGG+R AG I W    + R  V + +V +SD  PTL    + + +P 
Sbjct: 290 NNGPFRGAKQDMYEGGIRVAGGIYWKN--QIRPAVRDNFVMLSDMFPTL---CDLTAVP- 343

Query: 388 YVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYEN 443
            V+  ++ I      SIL    G  +    R+ +   R  N  +     Y +RY++
Sbjct: 344 -VSHEIDGI------SILPLLRGEEQDTGDRMVHWVRREGNSRYGGKAYYASRYKD 392


>gi|426242290|ref|XP_004015007.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Ovis aries]
          Length = 522

 Score =  170 bits (430), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 113/359 (31%), Positives = 175/359 (48%), Gaps = 35/359 (9%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS   + +  +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  N+
Sbjct: 12  LLLMLSAAELGVAPAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDQMATEGMLFPNF 71

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
           YT   LC+PSR+A++TG+ PI +G         N     E  GG+P SE +LP  LK  G
Sbjct: 72  YTANPLCSPSRAALLTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAG 131

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + RD E  
Sbjct: 132 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRDQEMV 186

Query: 214 WDLHGKYS----------TDVFTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQA 262
              + ++           T ++  EA++ I    +   P FLY A  ATH+     P+ A
Sbjct: 187 GRFYEEFPINLKTGEANLTQIYLQEALEFIQRQQAAHRPFFLYWAVDATHA-----PVYA 241

Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
              +L         +R ++   + +LD+SVG+++  L+   +  N+ + F SDNG A   
Sbjct: 242 SKPFLGTS------QRGRYGDAIRELDDSVGRILRLLQDLGIAENTFVFFTSDNGAALIS 295

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
                 SN P    K T +EGG+R   + W P     G V+ Q   + D   T LS A 
Sbjct: 296 APRQGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAG 354


>gi|114145565|ref|NP_001041316.1| N-acetylgalactosamine-6-sulfatase precursor [Rattus norvegicus]
 gi|123779981|sp|Q32KJ6.1|GALNS_RAT RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|81158026|tpe|CAI84987.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus
           norvegicus]
          Length = 524

 Score =  169 bits (429), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 112/360 (31%), Positives = 177/360 (49%), Gaps = 43/360 (11%)

Query: 45  TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
            LS + +    +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++Y+ 
Sbjct: 17  VLSALGLLAAGAPQPPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSA 76

Query: 105 Q-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRT 156
             LC+PSR+A++TG+ PI  G         N     E  GG+P SE +LP+ LK+ GY  
Sbjct: 77  NPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTN 136

Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL 216
           +IVGKWHLG ++ ++ P   GF+   G    H   +D+  +      + + RD    W++
Sbjct: 137 KIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKVKP----NIPVYRD----WEM 187

Query: 217 HGKYS--------------TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQ 261
            G++               T ++  EA+D I   H+   P FLY A  ATH+     P+ 
Sbjct: 188 VGRFYEEFPINLKTGEANLTQLYLQEALDFIRTQHARQSPFFLYWAIDATHA-----PVY 242

Query: 262 APDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA 321
           A   +L          R ++   + ++D+SVGK++  L+   +  N+ + F SDNG A  
Sbjct: 243 ASKQFLGTSL------RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI 296

Query: 322 GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
                  SN P    K T +EGG+R   + W P   + G V+ Q   + D   T LS A 
Sbjct: 297 SAPKEGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 356


>gi|329744562|ref|NP_001193258.1| N-acetylgalactosamine-6-sulfatase precursor [Bos taurus]
 gi|296478055|tpg|DAA20170.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Bos taurus]
          Length = 522

 Score =  169 bits (429), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 113/359 (31%), Positives = 174/359 (48%), Gaps = 35/359 (9%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS   + +  +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  N+
Sbjct: 12  LLLVLSAAELGVARALQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNF 71

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
           YT   LC+PSR+A++TG+ PI +G         N     E  GG+P SE +LP  LK  G
Sbjct: 72  YTANPLCSPSRAALLTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAG 131

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + RD E  
Sbjct: 132 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRDQEMV 186

Query: 214 WDLHGKYS----------TDVFTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQA 262
              + ++           T ++  EA++ I    +   P FLY A  ATH+     P+ A
Sbjct: 187 GRFYEEFPINLKTGEANLTQIYLQEALEFIQRQQAAHRPFFLYWAVDATHA-----PIYA 241

Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
              +L         +R ++   + +LD+SVG+++  L    +  N+ + F SDNG A   
Sbjct: 242 SKPFLGTS------QRGRYGDAIRELDDSVGRILRLLRDLSIAENTFVFFTSDNGAALIS 295

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
                 SN P    K T +EGG+R   + W P     G V+ Q   + D   T LS A 
Sbjct: 296 APRQGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAG 354


>gi|301311961|ref|ZP_07217883.1| putative arylsulfatase [Bacteroides sp. 20_3]
 gi|423339483|ref|ZP_17317224.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
           CL09T03C24]
 gi|300830063|gb|EFK60711.1| putative arylsulfatase [Bacteroides sp. 20_3]
 gi|409230864|gb|EKN23725.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
           CL09T03C24]
          Length = 452

 Score =  169 bits (428), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 124/349 (35%), Positives = 187/349 (53%), Gaps = 30/349 (8%)

Query: 38  AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGI 96
           A   LAF+ +      +  S  P+I+ IL DDLG+ D+   G  + I TP+ID L   G+
Sbjct: 8   AACGLAFSAASAQTKALPDS--PNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGV 65

Query: 97  ILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELG 153
              N++    + +PSR++++TG++P   G+   +    E     LSE   +LPQ LK+ G
Sbjct: 66  RFTNFHANCPVSSPSRASLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRG 125

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEP 212
           Y   +VGKW+LG  +   TPT RGF+ + G+     D Y+ H     + +G +  R+   
Sbjct: 126 YHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH-----RRFGNNYMRENLK 179

Query: 213 AWDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
             D  G ++T++F+  A+  + +     EP FLYLA+ A H+     P+Q P  +L   +
Sbjct: 180 EIDPQG-HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVK 233

Query: 272 HIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
             E     KR+K  A++  LD +VG+V EALEQ   L N+II+F SDNGG         A
Sbjct: 234 KREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQDDA----GA 289

Query: 329 SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTL 376
           +N P RG K  ++EGG+R AG I W    + R  V + +V +SD  PTL
Sbjct: 290 NNGPFRGAKQDMYEGGIRVAGGIYWKN--QIRPAVRDNFVMLSDMFPTL 336


>gi|255014751|ref|ZP_05286877.1| putative secreted sulfatase precursor [Bacteroides sp. 2_1_7]
 gi|410102840|ref|ZP_11297765.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
 gi|409237967|gb|EKN30762.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
          Length = 452

 Score =  169 bits (428), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 124/349 (35%), Positives = 187/349 (53%), Gaps = 30/349 (8%)

Query: 38  AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGI 96
           A   LAF+ +      +  S  P+I+ IL DDLG+ D+   G  + I TP+ID L   G+
Sbjct: 8   AACGLAFSAASAQTKALPDS--PNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGV 65

Query: 97  ILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELG 153
              N++    + +PSR++++TG++P   G+   +    E     LSE   +LPQ LK+ G
Sbjct: 66  RFTNFHANCPVSSPSRASLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRG 125

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEP 212
           Y   +VGKW+LG  +   TPT RGF+ + G+     D Y+ H     + +G +  R+   
Sbjct: 126 YHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH-----RRFGNNYMRENLK 179

Query: 213 AWDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
             D  G ++T++F+  A+  + +     EP FLYLA+ A H+     P+Q P  +L   +
Sbjct: 180 EIDPQG-HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVK 233

Query: 272 HIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
             E     KR+K  A++  LD +VG+V EALEQ   L N+II+F SDNGG         A
Sbjct: 234 KREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQDDA----GA 289

Query: 329 SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTL 376
           +N P RG K  ++EGG+R AG I W    + R  V + +V +SD  PTL
Sbjct: 290 NNGPFRGAKQDMYEGGIRVAGGIYWKN--QIRPAVRDNFVMLSDMFPTL 336


>gi|114798452|ref|YP_760375.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
 gi|114738626|gb|ABI76751.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
          Length = 508

 Score =  169 bits (428), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 127/396 (32%), Positives = 188/396 (47%), Gaps = 73/396 (18%)

Query: 59  PPHIIFILADDLGWNDVGFHGL----DQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
           PP++IFILADDLG ND+   G      ++ TPNID LA  G +    Y+    C PSR+ 
Sbjct: 18  PPNVIFILADDLGINDISTFGGGMADGRVQTPNIDRLAAEGALFSTAYSGTGTCAPSRAM 77

Query: 114 IMTGKHPIHTGMQHN--------------------------------VLYGCERGGLPLS 141
           +MTG++P  TG ++                                 ++      GLP  
Sbjct: 78  LMTGRYPTRTGFEYTPTPPGMSRIVPMFANDMKTGLPPTEQVKENEKLMPPFAEQGLPTE 137

Query: 142 EKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKM 201
           E  L + LK+ GY T  +GKWHLG     + P  +GF+  L   +G    F    +    
Sbjct: 138 EVTLAEVLKDRGYHTVHIGKWHLG-NTSPFRPNDQGFDESLDMASG---LFLPPGDPR-- 191

Query: 202 WGLDMRRDLEPA------------------WDLHGKYSTDVFTAEAVDIIHNHSTDEPLF 243
            G++ R D +P                   W   G Y TD +T E++ +I + + + P F
Sbjct: 192 -GVEARLDFDPIDKFLWARMDFAASYNGSDWFEPGGYLTDYWTDESLKVI-DANKNRPFF 249

Query: 244 LYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRR 303
           LYLAH   H+     PLQA          IE  +   +AA+L  +D SVG+++E LE+  
Sbjct: 250 LYLAHWGVHT-----PLQATKEDYEAVGDIEPHRLRVYAAMLRAVDRSVGRIMEKLEEEG 304

Query: 304 MLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIV 362
           +  N+IIVF SDNGG  AG+      N P RG K TL+EGG+R    + W   ++   ++
Sbjct: 305 LADNTIIVFSSDNGG--AGYIGIPEVNAPYRGWKITLFEGGIRVPLFMKWPGTIQPGTVI 362

Query: 363 AEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
            E   H+ D +PT L+AA  + +P+ V    +N++P
Sbjct: 363 DEPVAHI-DLMPT-LAAATGARLPSSVEIDGQNLLP 396


>gi|407938758|ref|YP_006854399.1| sulfatase [Acidovorax sp. KKS102]
 gi|407896552|gb|AFU45761.1| sulfatase [Acidovorax sp. KKS102]
          Length = 452

 Score =  169 bits (428), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 115/353 (32%), Positives = 177/353 (50%), Gaps = 26/353 (7%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSA 113
           AS+  P+I+FILADDLGW D+  +G     TP++DALA  G+   + Y    +C+ +R A
Sbjct: 11  ASTQAPNIVFILADDLGWADLSVYGQADFTTPHLDALAGEGVRFTQAYANSAVCSATRFA 70

Query: 114 IMTGKHP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
           ++TG++   +  G++  ++      GLP     LP  L++ GY T ++GKWHLG     +
Sbjct: 71  LITGRYQYRLRGGLEEPLVRKAHVHGLPPEHPTLPSLLRDAGYDTALIGKWHLGSLPT-F 129

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
            P   G++   G + G  DYF H          D+     P   +   Y T +   EA  
Sbjct: 130 GPLKSGYDRFFGNYGGAIDYFTHKPGVGDAVARDLYEGEVPVERV--GYYTQLLADEATR 187

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAP-DHYLNIHR----HIEDFKRSKFAAILH 286
            I   S  +P FL L     H   P+ P   P D  +++      H +    + +A ++ 
Sbjct: 188 WIGERSAAKPFFLSL-----HFTAPHWPWVGPEDEEISLGLKDLFHYDGGNLATYARMVR 242

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
            LD++VG+V+EAL+ + +  N+I+VF SDNGG         +  WP  G K  L EGG+R
Sbjct: 243 SLDKAVGQVLEALKAQGLADNTIVVFTSDNGGE------RFSKTWPFTGQKTELLEGGIR 296

Query: 347 GAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
              L+ W   +  +  V++Q     DWLPTLL+AA  +   +Y     ENI+P
Sbjct: 297 VPTLLRWPARIAPQ--VSDQVTATMDWLPTLLAAAGVAPHADYPPDG-ENILP 346


>gi|343086062|ref|YP_004775357.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354596|gb|AEL27126.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 444

 Score =  169 bits (428), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 112/335 (33%), Positives = 169/335 (50%), Gaps = 32/335 (9%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
           PP+II    DD G+ D+G +G +   TP++D LA  GI   N+Y    +CTPSR+ ++TG
Sbjct: 33  PPNIIVFFTDDQGYADLGVYGAEDFETPHLDQLASEGIRFTNFYVPATVCTPSRAGLLTG 92

Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
           ++P  + +   VL+    GGL      + + LK  GY T  +GKWHLG +K EY P  +G
Sbjct: 93  QYPKRSNLHEAVLFPYSEGGLSPQAFTMAELLKGAGYSTACIGKWHLG-HKDEYMPYNQG 151

Query: 178 FESHLG--YWTGHQDYF----DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
           F++  G  Y     +Y+    D  +  +  +  +  + +E   D   +Y T  +T E V 
Sbjct: 152 FDTFYGVPYSNDMDNYYYKNIDFQSPPLPFY--ENTKVIENGSD--QRYLTKRYTEETVK 207

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
            I N   ++P F+YLAH   H+     PL A   +          K   +  ++ +LD S
Sbjct: 208 RIKNRG-EKPFFIYLAHNMPHT-----PLFASPAFEGKS------KNGLYGDVIMELDWS 255

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
            G+++ AL++  +  N+I VF SDNG A         S  PLRG K   WEGG R  G+I
Sbjct: 256 AGEIINALKEEGIYENTIFVFTSDNGPA-------KGSAKPLRGKKAQTWEGGQRVPGII 308

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
             P    RG+V +++V   D  PT    A+ + IP
Sbjct: 309 TWPGSIPRGVVTDEFVSTLDLFPTFAKLAS-TKIP 342


>gi|281346853|gb|EFB22437.1| hypothetical protein PANDA_020197 [Ailuropoda melanoleuca]
          Length = 520

 Score =  169 bits (428), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 116/365 (31%), Positives = 176/365 (48%), Gaps = 45/365 (12%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS      + +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  N+
Sbjct: 8   LLLVLSAAGFGAMGTPKPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNF 67

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
           Y+   LC+PSR+A++TG+ PI  G         N     E  GG+P  E +LP+ LK  G
Sbjct: 68  YSANPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDGEHLLPELLKGAG 127

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + RD    
Sbjct: 128 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARP----NIPVYRD---- 178

Query: 214 WDLHGKYS--------------TDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYE 258
           W++ G+Y               T V+  EA+D +      + P FLY A   TH+     
Sbjct: 179 WEMVGRYYEEFPINLQTGEANLTQVYLQEALDFMKRQQVAQRPFFLYWAIDGTHA----- 233

Query: 259 PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
           P+ A   +L   R      R ++   + ++D+SVGK++  L   R+  N+ + F SDNG 
Sbjct: 234 PVYASRPFLGTSR------RGRYGDAVREIDDSVGKILSLLRDLRITENTFVFFTSDNGA 287

Query: 319 A--AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           A  +A       SN P    K T +EGG+R   + W P     G V+ Q   V D   T 
Sbjct: 288 ALISAPKQGQCGSNGPFLCGKQTTFEGGMREPAIAWWPGRIPAGQVSHQLGSVMDLFTTS 347

Query: 377 LSAAN 381
           L+ A 
Sbjct: 348 LALAG 352


>gi|423293952|ref|ZP_17272079.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
           CL03T12C18]
 gi|392677173|gb|EIY70592.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
           CL03T12C18]
          Length = 453

 Score =  169 bits (428), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 113/359 (31%), Positives = 187/359 (52%), Gaps = 26/359 (7%)

Query: 33  RIMAFAVLPLAFTLS--MVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
           +I   A+L L  T++              P+++ IL DDLG  D+       + TPNID 
Sbjct: 2   KIEKIALLALGTTMTSGAALASNTVDKATPNVLLILVDDLGLGDLSCQYAKDLSTPNIDR 61

Query: 91  LAYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER--GGLPLSEKILPQ 147
           +  +G+ L N+Y    + +PSR+A++TG+ P   G+   +    ++  G    S   +P+
Sbjct: 62  IFETGVRLDNFYANSSVSSPSRAALLTGRFPAMVGVPGVIRPSIDQNWGYFDPSAVTMPE 121

Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDM 206
            LK  GYRT ++GKWHLG +K    P  RGFE   G+     D Y+ H  +      L+ 
Sbjct: 122 VLKNGGYRTALIGKWHLG-WKSPNLPNERGFEHFHGFLADMMDDYYTHQRQGGNYMYLN- 179

Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDH 265
            +++ P       ++T++FT+ +VD I   + ++ P FLYLA+ A HS     PLQ P  
Sbjct: 180 DKEITPK-----GHATELFTSWSVDYIKKEAKEKNPFFLYLAYNAPHS-----PLQPPVE 229

Query: 266 YLNIHRHIED---FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
           ++N  +  +     KR++  A++  LD+++GKV++ALE+   LSN++++F SDNGG    
Sbjct: 230 WVNKVQERDKSLPVKRARLIALIEHLDDNIGKVMQALEESGQLSNTLVIFASDNGGDRG- 288

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
              + A+N P RG K  ++EGG+     +  P +   G     +V + D +PT+    N
Sbjct: 289 ---SMANNGPTRGAKGDMFEGGIHVPCALNMPGVFEGGRRDNHFVVMMDLMPTICDFVN 344


>gi|340620621|ref|YP_004739074.1| sulfatase [Zobellia galactanivorans]
 gi|339735418|emb|CAZ98795.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
          Length = 462

 Score =  169 bits (427), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 109/330 (33%), Positives = 166/330 (50%), Gaps = 20/330 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
           P+I+FIL+DD G+ D GFHG  +  TP +D  A + +   + Y +  +C PSR+ ++TGK
Sbjct: 26  PNIVFILSDDAGYADFGFHGSKEFKTPELDKFAKNAVRFSQAYVSAAVCGPSRAGLLTGK 85

Query: 119 HPIHTGMQHNVLYGC--------ERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           +    G + N + G         +  GLPL +K +  YLKE GYRT + GKWH G     
Sbjct: 86  YQQKFGFEENNVPGLMSKNGLTGDDMGLPLDQKTIADYLKEQGYRTALFGKWHQG-NADR 144

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           + PT RGF+   G+  G + Y    A+       D        +  H  Y T+    EA+
Sbjct: 145 FHPTKRGFDEFYGFRGGARSYMPFGADNELTRNEDRLERGFGGFQEHEGYLTEELADEAI 204

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
             I  +  + P F+YLA  A H+     P++A +  L   + +   KR   AA+   +D 
Sbjct: 205 AFIERNQKN-PFFVYLAFNAVHT-----PMEAKEEDLKKVKGLSG-KRKTLAAMTIAMDR 257

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
           + GKV++ ++   +  N++IVF +DNGG +   + N +SN PL G K    EGG+R   L
Sbjct: 258 ACGKVLDRIKALGLDKNTLIVFTNDNGGPS---DANESSNLPLSGTKANHLEGGIRVPFL 314

Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           +  P       V E  +   D LPT ++AA
Sbjct: 315 MSWPGKLKPNTVYENPISTLDLLPTFVTAA 344


>gi|149177395|ref|ZP_01855999.1| arylsulfatase A [Planctomyces maris DSM 8797]
 gi|148843728|gb|EDL58087.1| arylsulfatase A [Planctomyces maris DSM 8797]
          Length = 474

 Score =  168 bits (426), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 127/390 (32%), Positives = 197/390 (50%), Gaps = 40/390 (10%)

Query: 44  FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
           F L     DL A+ GP ++I I ADDLG+ D+G  G   I TP +D +A  G+    +Y+
Sbjct: 22  FLLLTSVADLNAAPGP-NVIVIYADDLGYGDLGCFGHPTIKTPALDQMAAEGMKFTQFYS 80

Query: 104 VQ-LCTPSRSAIMTGKHPIHTGM---QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
              +CTPSR+A++TG++PI +GM   +  VL+    GG+P SE  L + LK  GY+T  V
Sbjct: 81  AAPVCTPSRAALLTGRYPIRSGMCSDKRRVLFPNSGGGIPASEVTLAEALKAAGYKTACV 140

Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE-----------EMKMWGLDMRR 208
           GKWHLG +  ++ PT  GF+S+ G    + +  D  A+           E+K W + + R
Sbjct: 141 GKWHLG-HLPQFLPTNNGFDSYFG--IPYSNDMDRVADRKHGRSIFLKPEVKFWNVPLMR 197

Query: 209 DLEPAWDLHGKYSTDV--FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
           + E   +L    +T    +T EA+ +I  +   +P F+YLAH       P+ PL     +
Sbjct: 198 NTE-VVELPADQTTITKRYTEEAIKLIQQNK-QQPFFIYLAHNM-----PHVPLFRSPEF 250

Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN 326
            +  R      R  +  ++ ++D SVG+V++ L+ + +  N+I+ F SDNG     FN  
Sbjct: 251 QDKSR------RGLYGDVIEEIDWSVGEVLKELKTQGLDQNTIVWFCSDNGPWLI-FNEQ 303

Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
             S   LR  K + WEGG+R   + W P     G V+++     D   T +  A  S   
Sbjct: 304 GGSAGLLRDGKGSTWEGGMREPTIAWWPGHIPSGSVSQELGSTMDIYTTSIKLAGGSVPT 363

Query: 387 NYVNSTVE--NIIPRYENSI---LRYENGT 411
           + V   V+   ++ + ENS    L Y  GT
Sbjct: 364 DQVVDGVDLTPVLLKKENSPREELAYYRGT 393


>gi|338212321|ref|YP_004656376.1| N-acetylgalactosamine-6-sulfatase [Runella slithyformis DSM 19594]
 gi|336306142|gb|AEI49244.1| N-acetylgalactosamine-6-sulfatase [Runella slithyformis DSM 19594]
          Length = 446

 Score =  168 bits (426), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/306 (35%), Positives = 165/306 (53%), Gaps = 22/306 (7%)

Query: 48  MVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQL 106
           + F     S   P+I+ IL DD+GW +VG +G     TP++D LA  G+   N+Y    +
Sbjct: 15  LFFTSSAFSQQKPNIVMILVDDMGWGEVGIYGSHYCQTPHLDNLAKQGMRFTNFYANSTV 74

Query: 107 CTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELGYRTRIVGKWHL 164
           C+P+R+A+MTG++P   G+   +    E      S K   LPQ LK  GYRT +VGKWHL
Sbjct: 75  CSPTRAALMTGRYPDLVGVPGVIRGNVENSWGYFSPKAVTLPQVLKTAGYRTAMVGKWHL 134

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
           G  + E  P   GF    G+     D +     E K W     + ++P+      ++TD+
Sbjct: 135 GL-EPENHPNRHGFTHFHGFLEDMMDDYYTHLREGKNWMRLNGQVIDPS-----GHATDL 188

Query: 225 FTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDF--KRSK 280
           FT  A++ ++    + +P FLYLA+ A     P+ P+Q P  +L  + +   +   KR+K
Sbjct: 189 FTQWAIEYLNTQKKNPQPFFLYLAYNA-----PHFPVQPPREWLEKVKKRNPNLSEKRAK 243

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
             A++  LDES+GKV+EAL Q     N++IVF SDNGG  +    + A N   +G K T+
Sbjct: 244 LTALIEHLDESIGKVMEALRQNGQADNTLIVFSSDNGGLLS----DEADNGIWKGGKQTM 299

Query: 341 WEGGVR 346
           +EGG+R
Sbjct: 300 YEGGIR 305


>gi|449680632|ref|XP_004209636.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
          Length = 336

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 84/176 (47%), Positives = 119/176 (67%), Gaps = 1/176 (0%)

Query: 206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE-PLQAPD 264
           M ++ E  ++  G YST +++ EA+D I NH++ +P+FLYLA+ A HSAN  E PLQAP 
Sbjct: 1   MAQNQENIFNQWGNYSTAMYSMEAIDRIRNHNSTQPMFLYLAYQAVHSANLREYPLQAPQ 60

Query: 265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN 324
            +++   HI+   R  +AA+L  +D  +G++   L ++ ML +SII+F SDNGG A G N
Sbjct: 61  EWVDKFSHIKHKGRQNYAAMLGVMDHWIGQITSELLKQNMLDDSIIIFTSDNGGPANGLN 120

Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            N A+N+PLRG K T++EGGVRGA  IWS  L     V+   +H++DWLPTLLSAA
Sbjct: 121 NNWATNYPLRGTKTTVYEGGVRGAACIWSKELSKNPRVSNDLMHITDWLPTLLSAA 176



 Score = 39.7 bits (91), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 31/55 (56%), Gaps = 1/55 (1%)

Query: 503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTR-GKWKLVKENSI 556
           D +    +DG+++W  L    PS R  +L NID    ++A  R G WK+VK+N+ 
Sbjct: 179 DTTLLPNLDGLNLWDTLINQVPSPRTEVLINIDPLLYMNAALRVGDWKIVKQNNF 233


>gi|149038400|gb|EDL92760.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus norvegicus]
          Length = 466

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 112/360 (31%), Positives = 177/360 (49%), Gaps = 43/360 (11%)

Query: 45  TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
            LS + +    +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++Y+ 
Sbjct: 17  VLSALGLLAAGAPQPPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSA 76

Query: 105 Q-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRT 156
             LC+PSR+A++TG+ PI  G         N     E  GG+P SE +LP+ LK+ GY  
Sbjct: 77  NPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTN 136

Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL 216
           +IVGKWHLG ++ ++ P   GF+   G    H   +D+  +      + + RD    W++
Sbjct: 137 KIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKVKP----NIPVYRD----WEM 187

Query: 217 HGKYS--------------TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQ 261
            G++               T ++  EA+D I   H+   P FLY A  ATH+     P+ 
Sbjct: 188 VGRFYEEFPINLKTGEANLTQLYLQEALDFIRTQHARQSPFFLYWAIDATHA-----PVY 242

Query: 262 APDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA 321
           A   +L          R ++   + ++D+SVGK++  L+   +  N+ + F SDNG A  
Sbjct: 243 ASKQFLGTSL------RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI 296

Query: 322 GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
                  SN P    K T +EGG+R   + W P   + G V+ Q   + D   T LS A 
Sbjct: 297 SAPKEGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 356


>gi|323450479|gb|EGB06360.1| hypothetical protein AURANDRAFT_13369, partial [Aureococcus
           anophagefferens]
          Length = 304

 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 104/292 (35%), Positives = 153/292 (52%), Gaps = 27/292 (9%)

Query: 107 CTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
           CTPSR ++++G++P+  G+Q  V+    + GL  + ++LP+ L+  GYRT +VGKWHLG 
Sbjct: 1   CTPSRGSLLSGRYPVRLGLQRGVVSAASKWGLDPAIRLLPETLQTAGYRTALVGKWHLGH 60

Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK-----------MWGLDMRRDLEPAWD 215
           Y +  TP  RGF    G+  G  D+F H AE                 LD + +  P   
Sbjct: 61  YNESLTPARRGFGESYGFLGGFVDHFTHVAEAPPACASETPATDGTCALDWQVNGLPRPR 120

Query: 216 LHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDH---YLNIHR 271
             G Y+       A  ++ +H   +EPLFLY A        P+EP+  PD    Y  + +
Sbjct: 121 ATGVYAAYACRDAARRLVASHEGKEEPLFLYYA-----LPTPHEPITPPDDVETYDLLPK 175

Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
                +R  +AA++   D ++G +  AL++  +  N+++V +SDNG +AAG      SN 
Sbjct: 176 LDVAGRRRDYAAVVAATDGAIGDLQAALQKHALWPNTLLVLLSDNGASAAG-----GSNA 230

Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLES--RGIVAEQYVHVSDWLPTLLSAAN 381
           PLRG K   +EGGVR   L+   LL    RG   E  VHV+DW PTLL+AA 
Sbjct: 231 PLRGAKGGYFEGGVRTRALLAGSLLPQNVRGSSHEGLVHVADWAPTLLNAAG 282


>gi|395856891|ref|XP_003800850.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Otolemur garnettii]
          Length = 526

 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 111/358 (31%), Positives = 173/358 (48%), Gaps = 43/358 (12%)

Query: 47  SMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ- 105
           S V +  V +   P+I+ +L DD+GW D+G +G     TPN+D +A  G +  N+Y+   
Sbjct: 21  SSVGLGAVGAPQSPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANP 80

Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRI 158
           LC+PSR+A++TG+ PI  G      +           GG+P SE +LP+ LKE GY ++I
Sbjct: 81  LCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPRSEHLLPELLKEAGYISKI 140

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
           VGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + RD    W++ G
Sbjct: 141 VGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARP----NIPVYRD----WEMVG 191

Query: 219 KYS--------------TDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAP 263
           ++               T ++  EA+D I      + P FLY A  ATH+     P+ A 
Sbjct: 192 RFYEEFPINLKTGESNLTQIYLQEALDFIKRQQAQQHPFFLYWAIDATHA-----PVYAS 246

Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
             +L         +R ++   + ++D+SVGK+++ L    +  N+ + F SDNG A    
Sbjct: 247 KAFLGTS------QRGRYGDAVREIDDSVGKILDLLRGLSIGENTFVFFTSDNGAALISA 300

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
                SN P    K T +EGG+R   + W P       V+ Q   + D   T LS A 
Sbjct: 301 PRQGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAAQVSHQLGSIMDLFTTSLSLAG 358


>gi|332529144|ref|ZP_08405108.1| sulfatase [Hylemonella gracilis ATCC 19624]
 gi|332041367|gb|EGI77729.1| sulfatase [Hylemonella gracilis ATCC 19624]
          Length = 454

 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 122/368 (33%), Positives = 175/368 (47%), Gaps = 47/368 (12%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAI 114
           ++  P+I+FILADDLGW D+G +G     TPN+D LA  G+   + Y    +C+ +R A+
Sbjct: 8   TTAAPNILFILADDLGWADLGVYGASDFATPNLDRLAAQGVRFTQAYANSAVCSATRIAL 67

Query: 115 MTGKHPIHTGMQHNVLYGCERG------GLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
           +TG++      Q+ +  G E        GLP     LP  L+E GY T ++GKWHLG   
Sbjct: 68  ITGRY------QYRLPAGLEEPIARSDIGLPPEHPTLPSLLREAGYDTALIGKWHLG-KP 120

Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD-----LHGKYSTD 223
             Y P   G++   G   G  DYF H        G  + RDL   W+         Y T+
Sbjct: 121 PTYGPLKSGYDRFFGNIGGALDYFTHKPGV----GAQVPRDL---WEGDVPVERTGYYTN 173

Query: 224 VFTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF------ 276
           +   EA   +     + +P FL L     H   P+ P + P     + R I D       
Sbjct: 174 ILGDEASAYVRAREDEKKPFFLSL-----HFTAPHWPWEGPGDE-AVSREITDLFHYDGG 227

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
             +K+  ++  LD +VG+V+ AL++     N+++VF SDNGG         +  WP  G 
Sbjct: 228 NLAKYGELVEALDAAVGQVLAALDETGQADNTLVVFTSDNGGE------RFSKTWPFTGQ 281

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENI 396
           K  L EGG+R   L+  P    R  V  Q    +DWLPTLL+AA     P+Y  S  ENI
Sbjct: 282 KTELLEGGLRVPTLLRWP-ARVRPQVQAQVTITADWLPTLLAAAQVWPHPDY-PSDGENI 339

Query: 397 IPRYENSI 404
           +P  E  +
Sbjct: 340 LPVLEGQV 347


>gi|340378605|ref|XP_003387818.1| PREDICTED: hypothetical protein LOC100637044 [Amphimedon
            queenslandica]
          Length = 2318

 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 100/324 (30%), Positives = 159/324 (49%), Gaps = 21/324 (6%)

Query: 60   PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
            PHI+F+L DD G+ D  F     I TPN   L  +G+IL  +Y  + C+PSR++ +TG+ 
Sbjct: 1141 PHIVFVLVDDWGFADASFRN-PAIKTPNFQYLVDNGLILNRHYVFKYCSPSRASFLTGRF 1199

Query: 120  PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            P H    +    G    G  ++  +LP  LK  GY T +VGKWH G Y   Y P  RGF+
Sbjct: 1200 PHHVHQWNPTPLG--MVGTNINMTMLPAKLKNAGYATHMVGKWHQGLYDPAYLPINRGFD 1257

Query: 180  SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            +  G+    + +F+ +        +D  ++  P    +G   + ++  +   + + H   
Sbjct: 1258 TSSGFLQAEEGHFNQTIG----CAVDFWKNDAPD-TRNGTCDSYIYNKDLTTVFNEHDAS 1312

Query: 240  EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF-KRSKFAAILHKLDESVGKVVEA 298
            +PLFLYL     H+     P++AP  +L+++        R    A++   D   G VV  
Sbjct: 1313 KPLFLYLPLHNVHA-----PIEAPQEWLDLYPANSTCDTRRILQAMVSVADNVTGHVVNL 1367

Query: 299  LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
            L +  M  N+++V  +DNGGA         SN+PL+G K T +EGGVR        L+ +
Sbjct: 1368 LRENDMWDNTLMVVSADNGGADC-----KGSNYPLKGSKYTFFEGGVRVIAFASGGLIPA 1422

Query: 359  --RGIVAEQYVHVSDWLPTLLSAA 380
              RG   + ++H++DW  T    A
Sbjct: 1423 GRRGKSTDGFIHIADWYTTFCKMA 1446


>gi|374369744|ref|ZP_09627765.1| sulfatase [Cupriavidus basilensis OR16]
 gi|373098668|gb|EHP39768.1| sulfatase [Cupriavidus basilensis OR16]
          Length = 433

 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 183/366 (50%), Gaps = 31/366 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGK 118
           P++I+I+ADDLG+ D+G +G  +  +P +D+LA +G++L + Y    +C+P+R A+MT +
Sbjct: 3   PNLIYIVADDLGYADLGCYGGREPVSPVLDSLAANGLMLTQGYANSPVCSPTRFALMTAR 62

Query: 119 HPIHT-GMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           +     G     +    RG    GLP +   LP  L+  GYRT ++GKWHLGF    + P
Sbjct: 63  YQYRLRGAAEEPINSKSRGSSALGLPPAHPTLPSLLRAQGYRTALIGKWHLGF-APAFGP 121

Query: 174 TFRGFESHLGYWTGHQDYFDH--SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
              G+++  G   G  DYF H  S+    +W  D     EP       Y TD+ +  AVD
Sbjct: 122 LLSGYQAFFGPMAGGVDYFTHCDSSGRHDLWHNDAEHQ-EPG------YLTDLLSRRAVD 174

Query: 232 IIHNHSTD-EPLFLYLAHAATHSANPYE----PLQAPDHYLNIHRHIEDFKRSKFAAILH 286
            +   S +  P FL L + A H   P+E       AP    N+  H++      +  ++ 
Sbjct: 175 YVTRMSGEGAPFFLSLHYTAPHW--PWETRDDAALAPQVSNNLF-HLDGGNIHTYRRMIS 231

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
            +DE +G +V+AL Q   L N++IVF SDNGG         + NWPL G K  L EGG+R
Sbjct: 232 HMDEGIGWLVQALRQLGQLDNTLIVFTSDNGGE------RFSDNWPLVGGKMDLTEGGIR 285

Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILR 406
              +   P + + G  + Q+    DW  TLL AA     P Y    V +++P   ++  R
Sbjct: 286 VPWIAHWPAMIAPGGTSAQHCMTMDWSATLLDAAGVQADPGYPLDGV-SLMPLLRDAAHR 344

Query: 407 YENGTH 412
           +    H
Sbjct: 345 FHRPLH 350


>gi|262383795|ref|ZP_06076931.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
 gi|262294693|gb|EEY82625.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
          Length = 452

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 123/349 (35%), Positives = 187/349 (53%), Gaps = 30/349 (8%)

Query: 38  AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGI 96
           A   LAF+ +      +  S  P+I+ IL DDLG+ D+   G  + I TP+ID L   G+
Sbjct: 8   AACGLAFSAASAQTKALPDS--PNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGV 65

Query: 97  ILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELG 153
              N++    + +PSR++++TG++P   G+   +    E     LSE   +LPQ LK+ G
Sbjct: 66  RFTNFHANCPVSSPSRASLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRG 125

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEP 212
           Y   +VGKW+LG  +   TPT RGF+ + G+     D Y+ H     + +G +  R+   
Sbjct: 126 YHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH-----RRFGNNYMRENLK 179

Query: 213 AWDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
             D  G ++T++F+  A+  + +     EP FLYLA+ A H+     P+Q P  +L   +
Sbjct: 180 EIDPQG-HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVK 233

Query: 272 HIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
             E     KR+K  A++  LD +VG+V EAL++   L N+II+F SDNGG         A
Sbjct: 234 KREPSLPEKRAKIVALIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNGGQDDA----GA 289

Query: 329 SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTL 376
           +N P RG K  ++EGG+R AG I W   +  R  V E +V +SD  PTL
Sbjct: 290 NNGPFRGAKQDMYEGGIRVAGGIYWKNRI--RPAVRENFVMLSDMFPTL 336


>gi|374374259|ref|ZP_09631918.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
 gi|373233701|gb|EHP53495.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
          Length = 476

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 124/371 (33%), Positives = 183/371 (49%), Gaps = 35/371 (9%)

Query: 37  FAVLPLAF-TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
           FA+   +F TL +     +A+   P+I+ ILADDLG+ DVG +G  +I TP+IDALA  G
Sbjct: 4   FAIFFASFLTLLIRLPTRLAAQTKPNIVLILADDLGYGDVGVYGQQKIKTPHIDALAKEG 63

Query: 96  IILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG---LPLSEKILPQYLKE 151
           +    +Y    +C PSRS+++TG+H  HT ++ N+  G +  G   LP +   +   LK 
Sbjct: 64  LRFTRFYCGTAVCAPSRSSLLTGQHTGHTPIRGNI--GVKPEGQWPLPGNTVTIATLLKS 121

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY--WTGHQDYF-DHSAEEMKMWGLDMRR 208
            GY T   GKW LGF      P  +GF+   GY   T   DYF DH      +W  + R 
Sbjct: 122 AGYSTGCFGKWGLGFIGTSGDPLKQGFDRFYGYNCQTLAHDYFPDH------LWNNEQRV 175

Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA----------NPY- 257
           +L    +    YS  +   +A+  I  H   +P FL LA+   H+A          N Y 
Sbjct: 176 ELSNTHEKQTIYSAQLIHEQAMQFIRQHQR-QPFFLLLAYTLPHAALQLPKDDTLYNYYK 234

Query: 258 -----EPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVF 312
                +P+   +H+       + +  + +AA++ KLD  VG+V + L+   +  N++I+F
Sbjct: 235 KLFNEKPVPVAEHWNGKGYEPQAYPHAAYAAMVGKLDAYVGEVQQQLKVLGLDKNTMILF 294

Query: 313 VSDNGGAAAGFNLNAA--SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
            SDNG    G N  A   SN  LRG+K  L+EGG+R   + W P     G  + Q     
Sbjct: 295 ASDNGPHREGGNDPAFFNSNGGLRGIKRDLYEGGIRSPFIAWWPGTIQAGTQSNQVGAFW 354

Query: 371 DWLPTLLSAAN 381
           D LPT  + A 
Sbjct: 355 DLLPTFAALAG 365


>gi|50753991|ref|XP_414208.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Gallus gallus]
          Length = 513

 Score =  167 bits (423), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 113/368 (30%), Positives = 182/368 (49%), Gaps = 50/368 (13%)

Query: 35  MAFAVL-PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           MA AVL PL    ++        + PP+++ +L DD+GW D+G  G     TPN+D +A 
Sbjct: 1   MARAVLVPLWLCWAL------GCAAPPNVVLLLMDDMGWGDLGAFGEPSKETPNLDQMAS 54

Query: 94  SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKIL 145
            G++  ++Y    LC+PSR+A++TG+ P+  G      +           GG+  SE +L
Sbjct: 55  EGMLFLDFYAANPLCSPSRAALLTGRLPVRNGFYTTNAHARNAYTPQDIVGGIQDSEILL 114

Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
           P+ LK+ GY  +I+GKWHLG ++ ++ P   GF+   G    H   +D+ A    +  + 
Sbjct: 115 PELLKKAGYTNKIIGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRA----LPNIP 169

Query: 206 MRRDLEPAWDLHGKYS--------------TDVFTAEAVDIIH-NHSTDEPLFLYLAHAA 250
           + RD    W++ G+Y               T ++  EA+D I    ++ +P FLY A  A
Sbjct: 170 VYRD----WEMIGRYYEDFKIDLRTGEANLTQIYLQEALDFISKQQASQQPFFLYWAIDA 225

Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
           TH+     P+ A  H+L         +R ++   + ++D+SVGK+++ L++  +  N+ +
Sbjct: 226 THA-----PVYASKHFLGTS------QRGRYGDAVREIDDSVGKILKHLQKLGISENTFV 274

Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
            F SDNG A         SN P    K T +EGG+R   + W P     G V+ Q   V 
Sbjct: 275 FFTSDNGAALISAPKQGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGSVSRQLGSVM 334

Query: 371 DWLPTLLS 378
           D   T LS
Sbjct: 335 DLFTTSLS 342


>gi|330505678|ref|YP_004382547.1| putative sulfatase [Pseudomonas mendocina NK-01]
 gi|328919964|gb|AEB60795.1| probable sulfatase precursor [Pseudomonas mendocina NK-01]
          Length = 629

 Score =  167 bits (423), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 125/388 (32%), Positives = 192/388 (49%), Gaps = 46/388 (11%)

Query: 29  GYRTRIMAFAVLPLAFTLSMVFVDLVASS-GPPHIIFILADDLGWNDVGFHGLDQIPTPN 87
           G R R++   +L     L   +    A S   P+I+ +L DDLG ND+   G  + PTP 
Sbjct: 10  GARMRLVLLGIL-----LCTAYAGNAAESHAKPNILLVLVDDLGNNDIASWGDGRAPTPT 64

Query: 88  IDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQ 147
           +DAL+ S + L+ +YT   C+PSR+++++G+HP+  G Q + L      GL      LP+
Sbjct: 65  LDALSASAVRLRRHYTDSTCSPSRASLISGRHPVSVGFQADGL------GLSPDLVTLPK 118

Query: 148 YLKELGYRTRIVGKWHLG--FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
            L+ LGYRT  VGKWHLG     +E  P  +GF+    YW G  ++F          G  
Sbjct: 119 SLRSLGYRTLHVGKWHLGEALEYEEIQPGQQGFD----YWFGMLNHFVLQGPGPD--GRP 172

Query: 206 MRR----------DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSAN 255
           +RR          D   A   H  Y  D+ T +AV+++ +   ++P F+ L   + H+  
Sbjct: 173 VRRQPTHINPWLQDNGSAPAQHQGYLDDILTDKAVELVKSGVGEKPWFINLWLFSPHT-- 230

Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
           PY+P  A            D    K+ AIL +LD ++ ++++ALE+     N++++F SD
Sbjct: 231 PYQPSPA------FSTQFPDTPEGKYFAILSQLDHNMARLLKALEESGQADNTVVIFASD 284

Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
           NGGA    N+N  +N+PL G K T  EGGVR   L+  P       + E  V + D  PT
Sbjct: 285 NGGA----NINMDNNYPLIGKKATYTEGGVRTPLLLRWPGHYENQDIREPTVFM-DIFPT 339

Query: 376 LLSAANKSDIPNYVNSTVENIIPRYENS 403
           L+S A  +           N++P +E  
Sbjct: 340 LVSIAGGNPPKGLAG---RNLLPLFEGG 364


>gi|150008923|ref|YP_001303666.1| secreted sulfatase [Parabacteroides distasonis ATCC 8503]
 gi|149937347|gb|ABR44044.1| putative secreted sulfatase precursor [Parabacteroides distasonis
           ATCC 8503]
          Length = 452

 Score =  167 bits (422), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 135/416 (32%), Positives = 213/416 (51%), Gaps = 41/416 (9%)

Query: 38  AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGI 96
           A   LAF+ +      +  S  P+I+ IL DDLG+ D+   G  + I TP+ID L   G+
Sbjct: 8   AACGLAFSAASAQTKALPDS--PNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGV 65

Query: 97  ILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELG 153
              N++    + +PSR++++TG++P   G+   +    E     LSE   +LPQ LK+ G
Sbjct: 66  RFTNFHANCPVSSPSRASLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRG 125

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEP 212
           Y   +VGKW+LG  +   TPT RGF+ + G+     D Y+ H     + +G +  R+   
Sbjct: 126 YHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH-----RRFGNNYMRENLK 179

Query: 213 AWDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
             D  G ++T++F+  A+  + +     EP FLYLA+ A H+     P+Q P  +L   +
Sbjct: 180 EIDPQG-HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVK 233

Query: 272 HIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
             E     KR+K  A++  LD +VG+V EAL++   L N+II+F SDNGG         A
Sbjct: 234 KREPSLPEKRAKIVALIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNGGQDDA----GA 289

Query: 329 SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           +N P RG K  ++EGG+R AG I W    + R  V + +V +SD  PTL    + + +P 
Sbjct: 290 NNGPFRGAKQDMYEGGIRVAGGIYWKN--QIRPAVRDNFVMLSDMFPTL---CDLTAVP- 343

Query: 388 YVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYEN 443
            V+  ++ I      SIL    G  +    R+ +   R  N  +     Y +RY++
Sbjct: 344 -VSHEIDGI------SILPLLRGEEQDTGDRMVHWVRREGNSRYGGKAYYASRYKD 392


>gi|255690672|ref|ZP_05414347.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
 gi|260623696|gb|EEX46567.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
          Length = 459

 Score =  167 bits (422), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 111/337 (32%), Positives = 179/337 (53%), Gaps = 34/337 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+ + I+ADD+G+ DVG +G + I TPNID +A  G++  ++++   + +P+R  ++TG+
Sbjct: 29  PNFVIIVADDMGYGDVGIYGNEYIKTPNIDQIAREGMMFTDFHSNGSVSSPTRCGLLTGR 88

Query: 119 HPIHTGMQHNVLYGCE----RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +    G++  +L   +      GLP  E    + L + GYRT ++GKWHLG+ +K +   
Sbjct: 89  YQQRAGLEKVLLVPRDDKDKEVGLPSEEITFAKILGDNGYRTALIGKWHLGYLQKHHPMN 148

Query: 175 FRGFESHLGYWTGHQDYFDHSAE--EMKMW-GLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
           F GF+  +G+ +G+ DY  H     +M  W GL+M+       D+ G Y+T + T  + D
Sbjct: 149 F-GFQKFVGFKSGNVDYQSHRNRYGDMDWWDGLEMK-------DMSG-YTTTLLTTLSED 199

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH-YLNIHRHIEDFKRSK------FAAI 284
            I   + D+P  LY+AHAA HS     P+Q PD   +      E  K S       +  +
Sbjct: 200 YI-KENKDKPFCLYIAHAAPHS-----PMQGPDEKAVRTEATPEGDKNSDRSNKEIYKDM 253

Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
           + +LD SVG+++E L++ ++  N+ +VF SDNG        N  S    +G K + WEGG
Sbjct: 254 VEELDWSVGRILETLKKYKLDENTFVVFFSDNGPVIN----NGGSAGGYKGAKGSPWEGG 309

Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            R  G+ + P     G   EQ V   D  PT+L  A+
Sbjct: 310 HRVPGICYMPGTIKEGTTCEQTVMSFDLFPTMLDMAD 346


>gi|325109298|ref|YP_004270366.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969566|gb|ADY60344.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 463

 Score =  167 bits (422), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 124/411 (30%), Positives = 204/411 (49%), Gaps = 47/411 (11%)

Query: 36  AFAVLP-LAFTLSMVFVDLVASSGP------PHIIFILADDLGWNDVGFHGLDQIPTPNI 88
           + A +P L   +S+V V L   S        P+++ IL DDLG+ D+  +G   + +PNI
Sbjct: 3   SLATVPKLQLLMSLVAVVLFGVSSTAQAADKPNVLLILVDDLGYGDLSCYGATDLQSPNI 62

Query: 89  DALAYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER--GGLPLSEKIL 145
           D L   G+   N+Y    +C+P+R+AI++GK+P   G+   +    +   G L    ++L
Sbjct: 63  DKLVSRGLKFTNFYANCPVCSPTRAAILSGKYPDRVGVPGVIRTHADNSWGYLAPEAELL 122

Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQ-DYFDHSAEEMKMWGL 204
           P  L+  GY + I+GKWHLG  +    P  RGF+   GY      DY+DH     +  G+
Sbjct: 123 PSLLQPAGYHSAIIGKWHLGL-EAPNRPNDRGFDHFKGYLGDMMDDYYDH-----RRHGI 176

Query: 205 DMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAP 263
           +  R+ E   D  G ++TD+FT  + D +   + +E P FLYLA+ A H+     P+Q P
Sbjct: 177 NYMRENEQEIDPEG-HATDLFTEWSCDYLKEQADNEQPFFLYLAYNAPHT-----PIQPP 230

Query: 264 DHYLNIHRHIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
             +L   +  E     +R+K  A++  +D+ +G+V+  LEQ     +++++F SDNGG  
Sbjct: 231 QDWLEKVKKREAGIGEQRAKLVALIEHMDDGIGQVLTCLEQTGQADDTLVIFTSDNGG-- 288

Query: 321 AGFNLN-AASNWPLRGVKNTLWEGGVR-GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
               LN  A+N   R  K +++EGG++    ++W   +E+ G   E      D+  T+L 
Sbjct: 289 ---QLNVGANNGATRDGKQSMYEGGLKVPTAVVWPGKIEA-GQTTEFQALSMDFCSTILD 344

Query: 379 AA---------NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIE 420
            A          +S +P  + +  +   P  + S  R   G   YN   IE
Sbjct: 345 VAGVEQPSDCDGRSILPTLLGNEQK---PLRDLSFFRRREGGIRYNGKTIE 392


>gi|302370951|ref|NP_001180574.1| N-acetylgalactosamine-6-sulfatase isoform 2 precursor [Mus
           musculus]
 gi|26329565|dbj|BAC28521.1| unnamed protein product [Mus musculus]
          Length = 440

 Score =  166 bits (421), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 104/319 (32%), Positives = 163/319 (51%), Gaps = 43/319 (13%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
           PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++Y+   LC+PSR+A++TG
Sbjct: 27  PPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTG 86

Query: 118 KHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           + PI  G         N     E  GG+P SE +LP+ LK+ GY  +IVGKWHLG ++ +
Sbjct: 87  RLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQ 145

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
           + P   GF+   G    H   +D+ A+      + + RD    W++ G++          
Sbjct: 146 FHPLKHGFDEWFGSPNCHFGPYDNKAKP----NIPVYRD----WEMVGRFYEEFPINRKT 197

Query: 222 -----TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
                T ++T EA+D I   H+   P FLY A  ATH+     P+ A   +L        
Sbjct: 198 GEANLTQLYTQEALDFIQTQHARQSPFFLYWAIDATHA-----PVYASRQFLGTSL---- 248

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
             R ++   + ++D+SVGK++  L+   +  N+ + F SDNG A         SN P   
Sbjct: 249 --RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLC 306

Query: 336 VKNTLWEGGVRGAGLIWSP 354
            K T +EGG+R   + W P
Sbjct: 307 GKQTTFEGGMREPAIAWWP 325


>gi|359400556|ref|ZP_09193534.1| sulfatase [Novosphingobium pentaromativorans US6-1]
 gi|357597898|gb|EHJ59638.1| sulfatase [Novosphingobium pentaromativorans US6-1]
          Length = 493

 Score =  166 bits (421), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 116/346 (33%), Positives = 170/346 (49%), Gaps = 36/346 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P+++ IL DDL W DV  +G   I TPNIDA+  +G    + Y T  +C+ SR+ +++G+
Sbjct: 45  PNVVVILVDDLAWTDVSANGASTIQTPNIDAIGAAGATFSSGYVTASVCSVSRAGLLSGR 104

Query: 119 HPIHTGMQHNVLYGCERG------GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
           +P   G  +N+    ERG      GLP+    L + LK LGY T + GKWH G    E+ 
Sbjct: 105 NPTTFGFMYNI---TERGAVDKGAGLPVDVPTLAERLKPLGYDTAVFGKWHQG-STPEFY 160

Query: 173 PTFRGFESHLGYWTGHQDY--------------FD--HSAEEMKMWGLDMRRDLEPAWDL 216
           PT RGF+   G+  G   Y              FD     E      L    D+EP    
Sbjct: 161 PTRRGFDEFFGFLAGESIYATPDTPGIVTTPTSFDKYQPKEREGAQRLFQGPDMEPVAQP 220

Query: 217 HGKYSTDVFTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
           H KY T+  T  AVD I   +   +P FLY+A+ A     P+ PLQ P  Y + H  I D
Sbjct: 221 H-KYLTEEITEHAVDFIDRKADGKDPFFLYVAYNA-----PHWPLQVPRTYYDRHSEIAD 274

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
             R  + A++  LD  VG++++AL+ + +  ++++VF+SDN G    F     ++ P  G
Sbjct: 275 PARRTYVAMIDALDAGVGQILKALDDKGLREDTLVVFLSDN-GCPVQFGYCTQTH-PWGG 332

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            K T  EGG+R   +   P   + G      V   D +PT+L A +
Sbjct: 333 GKFTYLEGGMRVPFMASWPGHIAAGQRIATPVSSLDIVPTVLEAVS 378


>gi|198432447|ref|XP_002128343.1| PREDICTED: similar to galactosamine (N-acetyl)-6-sulfate sulfatase
           [Ciona intestinalis]
          Length = 513

 Score =  166 bits (421), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 119/390 (30%), Positives = 190/390 (48%), Gaps = 36/390 (9%)

Query: 43  AFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY 102
            F L ++    +A S  P+IIF+L DD+GW D+G +G     TPN+D +A  G +  ++Y
Sbjct: 4   CFLLLLMCGYSLAESSKPNIIFMLMDDMGWGDLGINGQPSKETPNLDNMAKEGTLFTDFY 63

Query: 103 TVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGY 154
           +   LC+PSR+A++TG+ PI  G   +  +G          GG+P  E ++ + L   GY
Sbjct: 64  SANPLCSPSRAALLTGRLPIRNGFYTSNYHGHNGYTPQHIVGGIPDHEILVSELLSSAGY 123

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWG---LDMRRDL 210
             +++GKWHLG  +++Y P   GF    G    H   Y D +   + ++    +  R   
Sbjct: 124 TNKLIGKWHLG-QQEQYLPLKHGFHEWFGSPNCHFGPYDDKTTPNIPVYNNTEMVGRYYE 182

Query: 211 EPAWDLHGKYS--TDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYL 267
           E A + H   S  T  +  EA+D I     +E P FLY A  ATHS     P+ +   + 
Sbjct: 183 EFAIESHKYLSNMTQYYIQEALDFIERMERNEKPFFLYWAPDATHS-----PVYSSPMFR 237

Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
              R      R  +   + +LD  VG +++ L+Q  +  N++++F SDNG A  G     
Sbjct: 238 GASR------RGPYGDAVMELDYGVGVIIQKLKQLGLDKNTLVLFSSDNGAAMIGSAFEN 291

Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
            SN P    K T +EGG+R   + W P    +G V+ Q   + DW  T L  A+ + +P 
Sbjct: 292 GSNGPFLCGKQTTFEGGIREPTIAWGPGFVQQGQVSHQVGSLMDWFATALDLASIA-VPK 350

Query: 388 YVNSTVENIIPRYENSILRYENGTHEYNSP 417
             +  ++ +      S+L   NG  E++ P
Sbjct: 351 --DRIIDGV------SLLPVLNGAAEFDRP 372


>gi|196232399|ref|ZP_03131252.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196223471|gb|EDY17988.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 452

 Score =  166 bits (420), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 127/372 (34%), Positives = 184/372 (49%), Gaps = 52/372 (13%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           M FAV   A T S   VD    +  P+II ILADDLG  +    G   I TPNID L + 
Sbjct: 10  MMFAV---ALTCSAFAVD----ASKPNIIVILADDLGRGEYSDFGTKDIQTPNIDRLCHE 62

Query: 95  GIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER---GGLPLSEKILPQYLK 150
           G+   N+Y    +C+PSR+A+MTG  P   G+   +    +    G L    ++ PQ LK
Sbjct: 63  GLTFDNFYANSCVCSPSRAALMTGCFPNRVGVPGVIREEDQDNNWGWLAQGVRLFPQLLK 122

Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG--------YWT---GHQDYFDHSAEEM 199
             GY++ IVGKWHLG      TPT RGF+   G        YWT     Q++  H+ E +
Sbjct: 123 PAGYQSAIVGKWHLGINSPN-TPTERGFDRFDGFLGDMMDDYWTHLRHGQNFMRHNQEVV 181

Query: 200 KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYE 258
              G                ++TDVFT  A+  +   S + EP FLYLA+ A     P++
Sbjct: 182 SPEG----------------HATDVFTDWALRYLDERSKSKEPFFLYLAYNA-----PHD 220

Query: 259 PLQA-PDHYLNIHRHIEDF--KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
           P+Q  PD    +         KR+K  A++  LD+ +GKV+  L++  +  N+I++F SD
Sbjct: 221 PIQPKPDWLAKVKAREPGLSEKRAKLVALIEHLDDGIGKVLAKLDETGLAKNTIVLFSSD 280

Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
           NGG  +    N A+N P R  K  ++EGG+R   ++  P   + G  + +   + D LPT
Sbjct: 281 NGGVLS----NEANNGPWRSGKTHMYEGGLRVPCMVRWPGHVAPGTHSNRVALLMDILPT 336

Query: 376 LLSAANKSDIPN 387
            L    ++  PN
Sbjct: 337 ALEIVGQTPPPN 348


>gi|410628681|ref|ZP_11339399.1| sulfatase [Glaciecola mesophila KMM 241]
 gi|410151685|dbj|GAC26168.1| sulfatase [Glaciecola mesophila KMM 241]
          Length = 502

 Score =  166 bits (420), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 109/330 (33%), Positives = 164/330 (49%), Gaps = 20/330 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY-SGIILKNYYTVQLCTPSRSAIMTGK 118
           P+I+ IL+DD G+ D GF G  QI TPN+D LA  S +  + Y +  +C PSR+ I+TGK
Sbjct: 60  PNIVLILSDDAGYGDFGFQGSSQIRTPNLDNLAVQSTVFTQAYVSAAVCGPSRAGILTGK 119

Query: 119 HPIHTGMQHNVL--YGCERG------GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           +    G + N +  Y  + G      GLPL+++ +  YL   GY T ++GKWH G   K 
Sbjct: 120 YQQRFGFEENNVPGYMSDSGLTGDDMGLPLNQRTIGDYLTHFGYSTALIGKWHQGNADK- 178

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           + PT RGFE   G+  G + YF+        +  D        +     Y T     EA+
Sbjct: 179 FHPTKRGFEHFYGFRGGARSYFEFGPNNPVSYPEDRLERGFAHYKESPHYLTQALADEAI 238

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
             I  +   EP FL+L+  A H+     P+ A    L     +   KR + AA+   +D 
Sbjct: 239 KFIKQNQR-EPFFLFLSFNAVHT-----PMDANKEDLAQFPQLSG-KRQRVAAMTLSMDR 291

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGL 350
            +GKV+ +L++  +  N+++VF +DNGG +   + NA+ N PL G K    EGG+R   L
Sbjct: 292 EIGKVLASLDELGLSENTLLVFTNDNGGPS---DTNASDNTPLSGTKANHLEGGIRVPFL 348

Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           +  P          + +   D LPT  +AA
Sbjct: 349 MRWPAGNIGPNTYHEPISTLDLLPTFFAAA 378


>gi|423214668|ref|ZP_17201196.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692574|gb|EIY85811.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 453

 Score =  166 bits (420), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 109/354 (30%), Positives = 183/354 (51%), Gaps = 24/354 (6%)

Query: 36  AFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
           A   L  A T              P+++ IL DDLG  D+       + TPNID +  +G
Sbjct: 7   ALLALGAAMTSGAALASNTVDKSTPNVLLILVDDLGLGDLSCQYAKDLSTPNIDRIFETG 66

Query: 96  IILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER--GGLPLSEKILPQYLKEL 152
           + L N+Y    + +PSR+A++TG+ P   G+   +    ++  G    S   +P+ LK  
Sbjct: 67  VRLDNFYANSSVSSPSRAALLTGRFPAMVGVPGVIRPSVDQNWGYFDPSAVTMPEVLKNG 126

Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLE 211
           GYRT ++GKWHLG ++    P  RGF+   G+     D Y+ H  +      L+  ++++
Sbjct: 127 GYRTALIGKWHLG-WESPNLPNERGFDHFHGFLADMMDDYYTHRRQGGNYMYLN-DKEID 184

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
           P       ++T++FT+ +VD I   + ++ P FLYLA+ A HS     PLQ P  ++N  
Sbjct: 185 PK-----GHATELFTSWSVDYIKKEAKEKNPFFLYLAYNAPHS-----PLQPPVEWVNKV 234

Query: 271 RHIED---FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
           +  +     KR++  A++  LD ++GKV+++LE+   L+N++++F SDNGG       + 
Sbjct: 235 QERDKSLPVKRARLIALIEHLDYNIGKVIQSLEESGQLNNTLVIFASDNGGDRG----SM 290

Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           A+N P RG K  ++EGG+  A  +  P +   G     +V + D +PT+    N
Sbjct: 291 ANNGPTRGAKGDMFEGGIHVACALNMPGVFEGGRRDNHFVVMMDLMPTICDFVN 344


>gi|148679746|gb|EDL11693.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_a [Mus
           musculus]
          Length = 462

 Score =  166 bits (420), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 104/319 (32%), Positives = 163/319 (51%), Gaps = 43/319 (13%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
           PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++Y+   LC+PSR+A++TG
Sbjct: 49  PPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTG 108

Query: 118 KHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           + PI  G         N     E  GG+P SE +LP+ LK+ GY  +IVGKWHLG ++ +
Sbjct: 109 RLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQ 167

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
           + P   GF+   G    H   +D+ A+      + + RD    W++ G++          
Sbjct: 168 FHPLKHGFDEWFGSPNCHFGPYDNKAKP----NIPVYRD----WEMVGRFYEEFPINRKT 219

Query: 222 -----TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
                T ++T EA+D I   H+   P FLY A  ATH+     P+ A   +L        
Sbjct: 220 GEANLTQLYTQEALDFIQTQHARQSPFFLYWAIDATHA-----PVYASRQFLGTSL---- 270

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
             R ++   + ++D+SVGK++  L+   +  N+ + F SDNG A         SN P   
Sbjct: 271 --RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLC 328

Query: 336 VKNTLWEGGVRGAGLIWSP 354
            K T +EGG+R   + W P
Sbjct: 329 GKQTTFEGGMREPAIAWWP 347


>gi|223936836|ref|ZP_03628745.1| sulfatase [bacterium Ellin514]
 gi|223894405|gb|EEF60857.1| sulfatase [bacterium Ellin514]
          Length = 477

 Score =  166 bits (419), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 125/365 (34%), Positives = 179/365 (49%), Gaps = 51/365 (13%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L+  L  VF     ++  P+I+FILADDLG+ DV  +G     TPNID LA  GI   + 
Sbjct: 5   LSLLLMAVFCLSTKAANKPNIVFILADDLGYTDVACYGSKYYETPNIDKLAKDGIKFTDG 64

Query: 102 YTVQL-CTPSRSAIMTGKHPIHTGM-----------QHNVLYGCER-GGLPLSEKILPQY 148
           +T    C P+R+++M+G++   TG+           Q   L+  E    LPL +  L Q 
Sbjct: 65  HTCGPNCQPTRASLMSGQYGPRTGVYTVGSIDRFAWQTRSLHPVENVTKLPLDKITLAQS 124

Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
           LK+ GY T + GKWHLG   KE+ P  RGF+  L     H D+  +         +D  +
Sbjct: 125 LKKAGYATGMFGKWHLG-EDKEHHPAQRGFDEALVSMGVHFDFVTNPK-------VDYPK 176

Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL- 267
           D         +Y  D  T +A+D I  H  DEP FLYL H A H     +PLQA    + 
Sbjct: 177 D---------EYLADFLTDKALDFIKRHK-DEPFFLYLPHYAVH-----KPLQAKKELIQ 221

Query: 268 --NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA----- 320
             +  + ++      +AA++  +DESVG+VV  L++ ++  N++++F SDNGG       
Sbjct: 222 KFSAKQGVDGHHNPTYAAMIASVDESVGRVVALLDELKLSDNTLVIFSSDNGGVGGYQRE 281

Query: 321 ----AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
               AG   +   N PLRG K  L+EGG R   +   P     G V +Q +   D  PTL
Sbjct: 282 GIKKAG---DVTDNNPLRGGKGMLYEGGHRVPYIFRWPGKIPAGKVCDQPIISIDLYPTL 338

Query: 377 LSAAN 381
           L  A 
Sbjct: 339 LELAG 343


>gi|294053769|ref|YP_003547427.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613102|gb|ADE53257.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 490

 Score =  166 bits (419), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 105/307 (34%), Positives = 162/307 (52%), Gaps = 32/307 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P+ IFI+ADDLG+ D+GF+G   + TP +D LA++G+I +N Y T   C PSR+ ++TG+
Sbjct: 23  PNFIFIVADDLGYGDLGFNGSKVVETPTLDRLAHNGVIFQNGYVTHPYCGPSRAGLITGR 82

Query: 119 HPIHTGMQHNVLYGC--ERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +    GM+ N+ Y       GLPL EK   + ++  GYRT ++GKWHLG   + + P  R
Sbjct: 83  YQARFGMEINLTYSPFDLHQGLPLDEKTFAERVRPAGYRTGMIGKWHLG-ASEPFHPNSR 141

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGL---------------DMRRDLEPAWDLHGKYS 221
           GF+   G+ +G  DYF           L                + R+   A     +Y 
Sbjct: 142 GFDYFYGFLSGGHDYFPEMVNTHPKLLLPNGKPHYSHNEGCTQPLLRNKNAA--EFNEYL 199

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE-DFKRSK 280
           T   + +A   + +  +++P  LYLA+ A H      PL AP   +  +  +E D +R+ 
Sbjct: 200 TTALSKDAARFVSD--SEKPFLLYLAYNAPHG-----PLHAPKELVEKYSKLEKDKRRAT 252

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA--GFNLNA-ASNWPLRGVK 337
           + A++  +D+ +G +VEALEQ   L N++I F+SDNGG  +  G      A N P R  K
Sbjct: 253 YLAMIDSMDQGIGLIVEALEQSGKLDNTLIFFMSDNGGVQSKPGHEYETWADNGPFRNGK 312

Query: 338 NTLWEGG 344
            ++ EGG
Sbjct: 313 GSMREGG 319


>gi|149197520|ref|ZP_01874571.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139538|gb|EDM27940.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 446

 Score =  166 bits (419), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 114/354 (32%), Positives = 181/354 (51%), Gaps = 26/354 (7%)

Query: 40  LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL- 98
           +   F+L M FV L+ ++  P+I+ + ADD+GW DV +HG++   TP IDA+A  G+   
Sbjct: 1   MKFLFSL-MGFVALLRAADKPNIVLVFADDMGWGDVAYHGVEDAQTPAIDAIAKGGVWFE 59

Query: 99  KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
           + Y    +C PSR+ I+TG++    G+  N   G    G+P S+K + + LK  GY++  
Sbjct: 60  QGYAAASVCGPSRAGILTGRYQQLFGVVTN---GDADKGIPKSQKNIAELLKPAGYKSGA 116

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYF--DHSAEEMKMWGLDMRRDLEPAWDL 216
            GKWHLG  K ++ P  RGF++  G+  G  DY+  D    + K     +  + +     
Sbjct: 117 FGKWHLGSKKGQF-PNDRGFDTFYGFHFGAHDYYRADKKLNKKKKGYAPIYFNQDIVDYK 175

Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL-NIHRHIED 275
            G Y T+  T  AV+ I  +  D+P F+Y+A+ + HS     P Q PD YL  I   +  
Sbjct: 176 EGDYLTEKITDHAVEFIEENK-DQPFFMYVAYNSVHS-----PWQVPDEYLARIPESVPA 229

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG--------FNLNA 327
           ++R  F A++  +D+ VG++   L++  +  N+I VF +DNG    G        + ++ 
Sbjct: 230 YRRL-FLAMVLAMDDGVGRIRAKLKELNLDENTIFVFTTDNGSPKIGNKKPNEGQYRMSM 288

Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           +     RG K   +EGG+R    +  P     G   E  V   D  PT LSAA+
Sbjct: 289 SQG--FRGYKGDTYEGGIRVPFCMSWPKKIKSGNKFEAPVIAYDLAPTFLSAAS 340


>gi|440908775|gb|ELR58760.1| N-acetylgalactosamine-6-sulfatase, partial [Bos grunniens mutus]
          Length = 525

 Score =  166 bits (419), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 114/359 (31%), Positives = 174/359 (48%), Gaps = 36/359 (10%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS   V +  +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  N+
Sbjct: 16  LLLVLSAAEVGVARALQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNF 75

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
           YT   LC+PSR+A++TG+ PI +G         N     E  GG+P SE +LP  LK  G
Sbjct: 76  YTANPLCSPSRAALLTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAG 135

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + RD E  
Sbjct: 136 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRDQEMV 190

Query: 214 WDLHGKYS----------TDVFTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQA 262
              + ++           T ++  EA++ I    +   P FLY A  ATH+     P+ A
Sbjct: 191 GRFYEEFPINLKTGEANLTQIYLQEALEFIQRQQAAHRPFFLYWAVDATHA-----PVYA 245

Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
              +L         +R ++   + +LD+SVG+++  L    +  N+ + F SDNG A   
Sbjct: 246 SKPFLGTS------QRGRYGDAIRELDDSVGRILRLLRDLSIAENTFVFFTSDNGAALIS 299

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
                 SN P    K T +EGG+R   + W P     G V+ Q   + D   T LS A 
Sbjct: 300 AP-RQGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAG 357


>gi|311746665|ref|ZP_07720450.1| sulfatase family protein [Algoriphagus sp. PR1]
 gi|311302556|gb|EAZ82500.2| sulfatase family protein [Algoriphagus sp. PR1]
          Length = 465

 Score =  166 bits (419), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 128/366 (34%), Positives = 188/366 (51%), Gaps = 43/366 (11%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
             FT  ++    +A S  P+ I I  DD G+ DVG  G   I TPN+D +A  G    N+
Sbjct: 11  FLFTFLLIANYSLAQSQKPNFIIIFTDDQGYGDVGTFGHPTIKTPNLDQMAMEGQKWTNF 70

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHN---VLYGCERGGLPLSEKILPQYLKELGYRTR 157
           Y    +CTPSRSAIMTG+ P+ TGM  N   VL+    GGLP +E  + + LK  GY T 
Sbjct: 71  YVAANVCTPSRSAIMTGRLPVRTGMYSNTRRVLFPDSGGGLPATENTIAKLLKTSGYSTA 130

Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGY-WTGHQDYF-DHSAEEM------KMWGLDMRRD 209
            +GKWHLG +  EY PT  GF+++ G  ++   D   D +A+E       + + + + RD
Sbjct: 131 AIGKWHLG-HLPEYLPTSHGFDTYFGIPYSNDMDRINDVTAQEAFASPKPEYFNVPLMRD 189

Query: 210 LE----PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
            E    PA        T  +T EAV  I  +  D+P F+YLAH+      P+ PL A + 
Sbjct: 190 KEIIERPA---DQTTITKRYTEEAVSYIKANK-DQPFFIYLAHSL-----PHIPLFASED 240

Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
           +L         +R  +  ++ ++D SVG+++  L+   +  N+ ++F SDNG     +N 
Sbjct: 241 FLTKS------ERGLYGDVIEEIDWSVGQILSTLKSEGIDKNTYVIFTSDNGPWLV-YNE 293

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYV----HVSDWLPTLLSAAN 381
              S+  L G K T +EGGVR   +IW P     G +  Q V       D LPT +S+ +
Sbjct: 294 QGGSSGGLFGGKGTSYEGGVRVPTIIWGP-----GNIKPQVVSKIGSTLDLLPT-ISSLS 347

Query: 382 KSDIPN 387
            ++IPN
Sbjct: 348 GTEIPN 353


>gi|340616348|ref|YP_004734801.1| sulfatase [Zobellia galactanivorans]
 gi|339731145|emb|CAZ94409.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
          Length = 489

 Score =  165 bits (418), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 121/339 (35%), Positives = 175/339 (51%), Gaps = 35/339 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+I+FI ADDLG+ D+GF G D   TPN+D LA   +     Y+    C PSR +IMTGK
Sbjct: 39  PNIVFIFADDLGFADLGFTGSDTHLTPNLDKLAKESVYFDRAYSSHPTCAPSRMSIMTGK 98

Query: 119 HPIHTG-MQHNVLYGCER-----GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
           +P   G + H  L G         GLP++E  + + LK+ GY T  +GKWH+G  K E  
Sbjct: 99  YPARLGAVSHGKLGGVAHPGPNDNGLPMTETTIGEALKKEGYTTAHIGKWHIG--KGENN 156

Query: 173 PTFRGFESHLG----YWTGHQDY-FDHSAEEMKMWG-LDMRRDLEPAWDLHGKYSTDVFT 226
           P  RGF+  +        G   Y F+ + E+ ++   +    D +P     G + TD   
Sbjct: 157 PGTRGFDVDIASNEFCCPGSYMYPFESNNEKQRVASKIPDLEDRKP-----GDFLTDALA 211

Query: 227 AEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQA-PDHYLNIHRHIEDFKRSK---F 281
            EAV  IH  STDE P FL ++  A H+     P+ A P+      R I    R K   +
Sbjct: 212 EEAVKFIH--STDEKPFFLNMSFYAVHT-----PITAIPEKVEKYKRLIGPDARQKNPTY 264

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
           A ++  LD++VG +++ALE++ ++ N+IIVF SDNGG           N+PLR  K + +
Sbjct: 265 AGLVEHLDDAVGAILKALEEKGIIDNTIIVFTSDNGGEILH---GITDNFPLRDGKGSSY 321

Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           EGG R   L+  P +     V+ + +   D+ PT LS A
Sbjct: 322 EGGTRVPLLVKWPGVTQVNTVSHERIIGFDYYPTFLSMA 360


>gi|319954036|ref|YP_004165303.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319422696|gb|ADV49805.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 467

 Score =  165 bits (418), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 110/345 (31%), Positives = 168/345 (48%), Gaps = 20/345 (5%)

Query: 46  LSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTV 104
           L    V  + +   P+I+FIL+DD G+ D GF G  +  TP +D LA   I   + Y + 
Sbjct: 13  LGFFKVSFLTAQQQPNIVFILSDDAGYADFGFQGSKEFKTPELDKLAKKSIKFSQAYVSA 72

Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYG--------CERGGLPLSEKILPQYLKELGYRT 156
            +C PSR+ I+TGK+    G + N + G         +  GLPL +  +  YL++LGY+T
Sbjct: 73  AVCGPSRAGILTGKYQQKFGFEENNVPGYMSTSGLVGDEMGLPLDQITIANYLQDLGYKT 132

Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL 216
            + GKWH G     + PT RGF+   G+  G + Y  +          D        +  
Sbjct: 133 ALFGKWHQG-NADRFHPTKRGFDEFYGFRGGARSYMPYDDSNPLSKNEDRLERGFGNFLE 191

Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
           H  Y TD    EA+  I N +   P F+YL+  A H+     P++A    L    H++  
Sbjct: 192 HEGYLTDELAHEAISFI-NRNKKHPFFIYLSFNAVHT-----PMEATAEDLEQFPHLKG- 244

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
           KR   AA+   +D ++G V++ LE + + +N++IVF +DNGG +   + N + N PL G 
Sbjct: 245 KRKTLAAMTLAMDRAIGTVLKELETQGLSNNTLIVFTNDNGGPS---DANDSDNSPLSGT 301

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           K    EGG+R   L+  P    +       +   D  PT LSAA+
Sbjct: 302 KANHLEGGIRVPFLMSWPKQLQKNKNYNYPISTLDLFPTFLSAAS 346


>gi|282900986|ref|ZP_06308919.1| Twin-arginine translocation pathway signal [Cylindrospermopsis
           raciborskii CS-505]
 gi|281194077|gb|EFA69041.1| Twin-arginine translocation pathway signal [Cylindrospermopsis
           raciborskii CS-505]
          Length = 434

 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 126/395 (31%), Positives = 185/395 (46%), Gaps = 48/395 (12%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I+FILADDLGW D+   G     TP +D LA  GI     Y+   +CTP+R    TG+
Sbjct: 12  PNIVFILADDLGWGDLSIDGQKNYQTPYLDQLARDGIRFNQAYSDSPVCTPTRIGFFTGR 71

Query: 119 HP--IHTGMQHNVLYGCERG---GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           +P  +  G    +L   + G   GLP     +   L++ GY T +VGKWH G Y   Y+P
Sbjct: 72  YPGRLAIGNYEPLLSFQQIGDSVGLPPEHPTIASLLRDNGYETVLVGKWHCG-YLPRYSP 130

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMK--MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
              GF    G ++G  DYF H     K  +W  D   +          Y TD+FT  AVD
Sbjct: 131 LKSGFNKFFGNFSGAIDYFRHVDTNGKPDLWEADTPIE-------KIGYVTDIFTERAVD 183

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL---------NIHRHIEDFKRSKFA 282
            I     + P +L L + A     P+ P Q P+            N+   I    +  + 
Sbjct: 184 FIKGPHVN-PFYLSLHYTA-----PHWPWQGPEDVELSNDLIGRDNLENWINTGTKESYK 237

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA-AAGFNLNAASNWPLRGVKNTLW 341
           A++  LD  VGKV++ALE   + + +I++F SDNGG   + F L        +G K  L+
Sbjct: 238 AVVQSLDNGVGKVLQALEDVGIANRTIVIFASDNGGERFSNFGL-------FQGKKGNLY 290

Query: 342 EGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY------VNSTVE 394
           EGG+R   LI WS ++    I ++Q +   D   T+L+A   +  P+Y      +   ++
Sbjct: 291 EGGIRVPTLIRWSGVIAPNQI-SDQVIITHDITATILAATKTNTNPDYPLDGEDLLPVIQ 349

Query: 395 NIIPRYENSIL-RYENGTHEYNSPRIENSNTRYEN 428
           N IP Y+  +  RY+        PR+     R EN
Sbjct: 350 NEIPTYDRVLFWRYKGNNRLGLPPRVRQGAVRSEN 384


>gi|225012438|ref|ZP_03702874.1| sulfatase [Flavobacteria bacterium MS024-2A]
 gi|225003415|gb|EEG41389.1| sulfatase [Flavobacteria bacterium MS024-2A]
          Length = 471

 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 118/346 (34%), Positives = 171/346 (49%), Gaps = 19/346 (5%)

Query: 40  LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           L L   +   F +      PP+IIFIL DD G+ D+G +G   I TPN+D LA  G    
Sbjct: 5   LHLLILVLFCFCNTQDKVTPPNIIFILTDDQGFGDLGVYGATDIKTPNLDRLAGEGARFT 64

Query: 100 NYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 158
           +YY  Q +C+ SR++I+TG +P   G+ HN      + GL   E  L + LKE GY T I
Sbjct: 65  SYYATQPVCSASRASILTGCYPDRIGI-HNAYSPGSKVGLNPEETTLAELLKEKGYATGI 123

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR--RDLEPAWDL 216
            GKWHLG    E+ P   GF+ + G    +  +  H  +       D++   +  P   L
Sbjct: 124 FGKWHLG-DAPEFQPRKHGFDEYYGILYSNDMWPKHPQQGAVFNFPDIKLYENETPLRVL 182

Query: 217 HGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
             + + T   T  A+D I  +  + P F+YL H       P+ PL A D   N       
Sbjct: 183 EDQTFLTGALTDRAIDFIKKNK-ENPFFVYLPH-----PQPHVPLFAIDSLKNTQ----- 231

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
            KR  +  ++H++D SVG+++  LEQ  ++ N+I++F SDNG   + +  ++ S    R 
Sbjct: 232 -KRGLYGDVIHEIDASVGRILITLEQEDLVENTIVIFTSDNGPWLS-YGGHSGSPGIFRE 289

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            K T WEGG R  G+I  P   +  +V E      DWLPTL   AN
Sbjct: 290 GKGTNWEGGHRVPGIIRFPNKIAANMVIEAPAMGIDWLPTLAEFAN 335


>gi|405952520|gb|EKC20320.1| Arylsulfatase B [Crassostrea gigas]
          Length = 500

 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 110/318 (34%), Positives = 166/318 (52%), Gaps = 54/318 (16%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           FAV  +   LS     L A++  P+IIFI  DD+G ND+G++   ++ TPN+D LA +G+
Sbjct: 12  FAVCQITIQLS-----LGAANQKPNIIFIAVDDMGNNDIGYNN-PEVDTPNLDNLANNGV 65

Query: 97  ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQ-----HNVLYGCERGGLPLSEKILPQYLKE 151
           IL++ Y   +C+PSR+A MTG++    G Q     H      E     ++EK+   Y   
Sbjct: 66  ILESNYVYPVCSPSRAAFMTGRYAHKIGFQRGPVEHKQPAYIESNYKTVAEKLTTNY--- 122

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
            GY   ++GKWHLG+ K   TPT RGF+S  G++ G ++Y+ +++   K    D R +L 
Sbjct: 123 -GYAAHMIGKWHLGYCKDAVTPTNRGFDSFYGFYGGQENYYTYTSARYK----DFRDNLT 177

Query: 212 ---------PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA 262
                    P  D+ G YST  +   A++I+ NH    PLFLYLA  A HS     PLQA
Sbjct: 178 AVTPQNPNYPREDVDG-YSTFEYKKRAIEIVGNHDKSVPLFLYLAFQAPHS-----PLQA 231

Query: 263 PDH----------YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIV- 311
           P            Y +     +   R+ F A++  +D++VG+V   +E + +L N+IIV 
Sbjct: 232 PQELKDKYPGDWTYTDWDGSTKTTSRATFYAMVTAVDQAVGEVYTEMETQGLLDNAIIVL 291

Query: 312 ---------FVSDNGGAA 320
                    F+S  GG+A
Sbjct: 292 LHASDWLPTFISLAGGSA 309


>gi|403260898|ref|XP_003922887.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Saimiri boliviensis
           boliviensis]
          Length = 482

 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 106/334 (31%), Positives = 165/334 (49%), Gaps = 42/334 (12%)

Query: 70  LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
           +GW D+G +G     TPN+D +A  G +  N+Y+   LC+PSR+A++TG+ PI  G    
Sbjct: 1   MGWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 60

Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
             +           GG+P SE++LP+ LKE GY T+IVGKWHLG ++ ++ P   GF+  
Sbjct: 61  NAHARNAYTPQEIVGGIPDSEQLLPELLKEAGYVTKIVGKWHLG-HRPQFHPLKHGFDEW 119

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
            G    H   +D+ A       + + RD    W++ G+Y               T ++  
Sbjct: 120 FGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKTGEANLTQIYLQ 171

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
           EA+D I   +   P FLY A  ATH+     P+ A   +L   R      R ++   + +
Sbjct: 172 EALDFIKRQARRRPFFLYWAVDATHA-----PVYASKPFLGTSR------RGRYGDAVRE 220

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +D+S+GK+++ L+  R+  ++ + F SDNG A         SN P    K T +EGG+R 
Sbjct: 221 MDDSIGKMLQLLQDLRVADSTFVFFTSDNGAALISAPEQGGSNSPFLCGKQTTFEGGMRE 280

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             L W P   + G V+ Q   + D   T L+ A 
Sbjct: 281 PALAWWPGHITPGQVSHQLGSIMDLFTTSLALAG 314


>gi|323452297|gb|EGB08171.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 889

 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 110/333 (33%), Positives = 168/333 (50%), Gaps = 49/333 (14%)

Query: 88  IDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQ 147
           ++ LA  G+ L ++Y    CT SR+A+ TGK+P+H G+Q +V++  E  G+PL E  L Q
Sbjct: 1   MNKLASGGVTLTSFYASPTCTASRAALFTGKYPMHLGLQDSVIHPSEPRGVPLKETFLSQ 60

Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDM 206
            LK+ GY T  VGKWHLGF++  YTP+ RGF+   G  TG  D+  H S E   M G D 
Sbjct: 61  KLKDAGYGTVFVGKWHLGFHQAAYTPSQRGFDEFFGILTGGGDHVAHASTESFAMRGPDY 120

Query: 207 R-------------------RDLEPAWDLHGKYSTDVFTAEAVDIIHNHS--TDEPLFLY 245
           +                    D E   D H   ST ++TA++   +  ++   D PLFL 
Sbjct: 121 KSKITTMTGSNLWHNGRPVGEDEEGVRDTH---STHLYTAKSRAYVAKYAEMADVPLFLT 177

Query: 246 LAHAATHSANPYEPLQAPDHYLNIHRH---IEDFK---------RSKFAAILHKLDESVG 293
           +++ A H+     P+Q PD Y++   +    E  +         R +   ++  +DES+ 
Sbjct: 178 VSYQAVHA-----PIQVPDEYVDGSAYANGCESLRAGSASATRLRRQLCGMVSMVDESLA 232

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-- 351
            + + L+   M  N+++ F+SDNG    G   + +SN  LRG K + +EGG+R    +  
Sbjct: 233 SIHDDLKAEAMWENTLLFFLSDNG----GILRHGSSNLDLRGEKGSYYEGGIRVPAFVSG 288

Query: 352 -WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            +   L S G V E  VH++D   T L  A  S
Sbjct: 289 GYLAKLLSAGRVFEGLVHITDLHATALRVAGVS 321


>gi|294011191|ref|YP_003544651.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
 gi|292674521|dbj|BAI96039.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
          Length = 468

 Score =  164 bits (416), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 116/365 (31%), Positives = 181/365 (49%), Gaps = 29/365 (7%)

Query: 29  GYRTRIMAFAVLPLAFTLSMVFVDLVASSGP--PHIIFILADDLGWNDVGFHGLDQIPTP 86
           G  TR  A A L  A     V       SGP  P+I++I+ADD+G  D+G +G   I TP
Sbjct: 7   GAMTRRTALAALGSAVAAPAVARG--GRSGPRRPNILYIMADDMGHADLGCYGSRDIRTP 64

Query: 87  NIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHP--IHTGMQHNVLYGCERGGLPLSEK 143
            IDA+A  G+   N Y    +C+P+R A++TG++      G++  + +  +   LP   +
Sbjct: 65  AIDAIAARGVKFGNAYANSCVCSPTRIALLTGRYQGRFRIGLEEPIAFNGDELSLPRGTR 124

Query: 144 ILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWG 203
            LP  L++LGY T +VGKWH+G      +P   G++   G  +G  DYF H+     + G
Sbjct: 125 TLPGLLRDLGYATSLVGKWHVGELPAS-SPLDHGYDYFFGIASGGTDYFAHAT---TING 180

Query: 204 LDMRRDLEPAWDLH-GKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQ 261
            +M +  E   ++    Y TD+  A+A+D +   +  D P F+ L     H   P+ P Q
Sbjct: 181 HEMGKLFENRTEIQRPGYLTDLLGAKAIDRMQQAARQDRPFFISL-----HFTAPHWPWQ 235

Query: 262 APDHYLNIH-----RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
            P            R ++      +AA++  +D +VG+V+  L    +  ++I++F SDN
Sbjct: 236 GPGDAAQGSAAADPRQMDGGNVRAYAAMMEGMDANVGRVLAELAALGLEEDTIVIFTSDN 295

Query: 317 GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           GG         + +WPL G+K  L EGG+R   ++  P    +G  + Q     D LPTL
Sbjct: 296 GGE------RFSDSWPLTGMKGELLEGGIRVPLIVSWPAHLPKGRDSAQVTMSMDALPTL 349

Query: 377 LSAAN 381
           LSAA 
Sbjct: 350 LSAAG 354


>gi|334145289|ref|YP_004538499.1| sulfatase [Novosphingobium sp. PP1Y]
 gi|333937173|emb|CCA90532.1| sulfatase [Novosphingobium sp. PP1Y]
          Length = 461

 Score =  164 bits (416), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 108/345 (31%), Positives = 171/345 (49%), Gaps = 37/345 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I+FILADD+G+ D+G +G     TP ID +A  G++ +  Y    +C+ SR+A++TG+
Sbjct: 21  PNILFILADDMGYADLGCYGRTDYATPAIDRIAREGMLFRQAYANSAVCSASRTALITGR 80

Query: 119 HP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +   +  G++  +  G    GLP S   +P  L+  GY T ++GKWHLG   K Y P   
Sbjct: 81  YQNRLPVGLEEPL--GRRVRGLPPSHPTMPSLLRSQGYETTLIGKWHLGQLPK-YGPLQS 137

Query: 177 GFESHLGYWTGHQDYFDHSAEEM-KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
           G++   G   G  DYF H    +  +W  D  R++E        Y TD+    A+  I +
Sbjct: 138 GYDHFWGLRGGGIDYFSHDFNGVPDLWEDD--REIE-----QPGYMTDLLAERAMTAIDD 190

Query: 236 HST-DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK-----------RSKFAA 283
           ++   +P F+ L     H   P+ P + PD      R     K           R  +A 
Sbjct: 191 YAARQQPFFMSL-----HFTAPHWPWEGPDDQTESQRLAASDKPFSMFDTDGGSRKTYAE 245

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
           ++ ++D  +G+++E L++  +  ++IIVF SDNGG         +  WP  G+K  L EG
Sbjct: 246 MVTRMDMEIGRILERLDRLGLAEDTIIVFTSDNGGE------RYSKTWPYNGLKTELLEG 299

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
           G+R   ++  P   + G  +EQ +   DW+PT LS    S  P Y
Sbjct: 300 GLRVPAVMRWPRRIAAGQESEQVIMSMDWMPTFLSLIQASPDPRY 344


>gi|284040883|ref|YP_003390813.1| sulfatase [Spirosoma linguale DSM 74]
 gi|283820176|gb|ADB42014.1| sulfatase [Spirosoma linguale DSM 74]
          Length = 453

 Score =  164 bits (415), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 124/386 (32%), Positives = 199/386 (51%), Gaps = 38/386 (9%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I+ ILADDLG+ D+  +G   I TP+ID+L  +G+   ++Y    +C+PSR+A+++G+
Sbjct: 29  PNILLILADDLGYGDLSSYGAPDIRTPHIDSLVRAGMRFSHFYANSSVCSPSRAALLSGR 88

Query: 119 HPIHTGMQHNVLYGCER--GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +P   G+   +    +   G L  S  +LP  LK+ GY T +VGKWHLG  +    P  R
Sbjct: 89  YPEQVGVPGVIRTMPDDNWGYLSPSAVLLPSILKKNGYYTALVGKWHLGL-EPPNLPNDR 147

Query: 177 GFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
           GF+   G+     D Y+ H   +     L+ R+ + P       ++TD+FT  A D +  
Sbjct: 148 GFDLFHGFEGDMMDDYYTHLRHDRNYMRLN-RQTINPQ-----GHATDLFTQWATDYLEQ 201

Query: 236 HSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE---DFKRSKFAAILHKLDES 291
            +    P FLYLA+ A     P++P+Q P  +L   +  +     KR+K   ++  +D+ 
Sbjct: 202 RAGQSNPFFLYLAYNA-----PHDPIQPPADWLAKVKARQPGISEKRAKLVGLIEHMDDG 256

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR-GAGL 350
           +GKV++ L  + +  N++IVFVSDNGG       + A+N PLR  K  ++EGG+R  A +
Sbjct: 257 IGKVIQTLRAKGLYENTLIVFVSDNGGKL----FDGATNGPLRSGKGHMYEGGIRIPACV 312

Query: 351 IWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENG 410
           +W   + ++   ++Q + + D  PTL  A         +N  ++    R   SILR E  
Sbjct: 313 VWPGKVAAQS-QSQQPLLLMDIFPTLAEATG-----TVINYPIDG---RSFLSILRGERQ 363

Query: 411 THEYNSPRIENSNTRYENGTHEYNPK 436
                 P       R E G+ EYN K
Sbjct: 364 LLAAERPLF---FIRREGGS-EYNGK 385


>gi|372210598|ref|ZP_09498400.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 468

 Score =  164 bits (415), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 113/355 (31%), Positives = 180/355 (50%), Gaps = 32/355 (9%)

Query: 46  LSMVFVDLVASSGPP-HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYT 103
           +++VF  L  S+    +I+F+ ADD G+ D GF G    PTPN+D LA  G++ K  Y T
Sbjct: 11  VALVFFSLSTSAQKKKNIVFLFADDAGYFDFGFQGSKTFPTPNLDQLAKEGMVFKQAYTT 70

Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYG---------CERGGLPLSEKILPQYLKELGY 154
             +C PSR+ ++TG++    G + N + G          +  GLPL EK +  YL +LGY
Sbjct: 71  AAVCGPSRAGLLTGRYQQRFGFEENNVPGYMSKSSKLLGDDMGLPLDEKTMADYLGKLGY 130

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD-LEPA 213
           ++ ++GKWH+G     Y P  RGF    G+  G + ++  + ++      D   D LE  
Sbjct: 131 QSIVLGKWHMG-NADRYHPLKRGFTEFYGFRGGARSFYPLTQKQ----AADKPEDRLEIG 185

Query: 214 WDLHG---KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
           +  +    KY T      A D I + +  +P F+Y++  A HS     P+QA    L   
Sbjct: 186 YKKYQEPKKYLTYDLADAACDFI-DRNKKKPFFMYVSFNAVHS-----PMQATQEDLKKI 239

Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
           + +   KR   AA+   LD++ G+++E L++  +  +++IVF +DNGG           N
Sbjct: 240 KGLTG-KRKVLAAMTLALDKACGQILEKLKEEGLEDDTLIVFSNDNGGPDG----TKTCN 294

Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS-AANKSD 384
           +PL G K+   EGG+R   ++  P +   G   E  V   D LPT ++ A  K+D
Sbjct: 295 YPLSGCKSNNLEGGIRVPFVMKLPKVIQPGSSYEYPVSTLDLLPTFVNIAGGKAD 349


>gi|323452121|gb|EGB07996.1| hypothetical protein AURANDRAFT_64538 [Aureococcus anophagefferens]
          Length = 1591

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 109/353 (30%), Positives = 172/353 (48%), Gaps = 33/353 (9%)

Query: 47   SMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPN-IDALAYSGIILKNYYTVQ 105
            S   V   A  G P I+ +  DD+G ND+ +   D +P P  +  L   G+ L +YY   
Sbjct: 1082 SAAHVARFADDGRPDIVLVFMDDVGLNDL-WRSTD-LPKPTEMSKLVRDGVELTSYYGQS 1139

Query: 106  LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGL-----------PLSEKILPQYLKELGY 154
            LC+P+R+ +MTGK        H + +  ++GG+           PL   +LPQ +K LGY
Sbjct: 1140 LCSPARATLMTGK------FAHKIGFSDQQGGVREVTAYSNFSVPLGHDMLPQGMKRLGY 1193

Query: 155  RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
            +T  +GKW++G    +Y P  RGF++ +GY+T    Y DH ++    + ++         
Sbjct: 1194 QTHAIGKWNIGHCNVKYMPWQRGFDTFVGYFTDGIGYTDHVSDTANTYTVNDGGLAFNGS 1253

Query: 215  DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP-------DHYL 267
            +  G Y+T +FT  A  ++H+   D PLF++LA+   H  +               D  +
Sbjct: 1254 EYEGTYTTALFTERAEKVLHDAPEDAPLFMWLAYHGMHDNDGVTDADTCAQTGDDDDDAI 1313

Query: 268  NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA 327
             + R    F R  F   L  +D  VG+V  AL+ R    + ++V  SDNGG A G++   
Sbjct: 1314 FMERGHLPFARFNFGCGLRAIDRGVGQVRTALDDRA--RDYVLVVTSDNGGFACGYHCQ- 1370

Query: 328  ASNWPLRGVKNTLWEGGVRGAGLIWSPL---LESRGIVAEQYVHVSDWLPTLL 377
             SN+P RG K   +EGG+R  G+++SP       RG   +  +H  DW  T +
Sbjct: 1371 GSNYPYRGAKFYEFEGGLRLPGVVYSPSRVPARRRGAAYDGLMHHVDWFGTFV 1423


>gi|239626021|ref|ZP_04669052.1| arylsulfatase E [Clostridiales bacterium 1_7_47_FAA]
 gi|239520251|gb|EEQ60117.1| arylsulfatase E [Clostridiales bacterium 1_7_47FAA]
          Length = 483

 Score =  164 bits (414), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 121/372 (32%), Positives = 184/372 (49%), Gaps = 29/372 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+II  L DD G+ D+   G   + TPN+D LA  G    ++Y    +C+PSR+ ++TG+
Sbjct: 17  PNIIVFLTDDQGYGDLSCMGSTDVCTPNLDILAAGGARFTDFYAGSAVCSPSRACLLTGR 76

Query: 119 HPIHTGMQHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
           +P  TG++ ++L G +   GL          LK+LGY T +VGKWHLG    E  PT  G
Sbjct: 77  YPYMTGVR-SILGGIKTTTGLNPGIPTFASALKDLGYTTGMVGKWHLGAV-PECRPTHMG 134

Query: 178 FESHLGYWTGHQDYFD--HSAEEMKMWGLDMRRDL----EPAWDLHGKYSTDVFTAEAVD 231
           F+   G+ +G  DYF   H  E     G++   DL    E      G+YST++F  + ++
Sbjct: 135 FDYFCGFLSGVNDYFSHIHYTEANSHPGINPNHDLWENDERCLKYTGEYSTELFARKGLE 194

Query: 232 IIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
            I      D P  LY A  A     P+ P+ AP  YL   +H+ +  R   AA+L  +D+
Sbjct: 195 FIREQVEKDMPFALYCAFNA-----PHYPMHAPYKYLERFKHLPE-DRQIMAAMLSAVDD 248

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--------LNAASNWPLRGVKNTLWE 342
            VG+++  L++R + +++II F SDNG +    N            S   L+G K +L++
Sbjct: 249 GVGEIMNYLKRRGIFNDTIIYFQSDNGPSKESRNWLDERKDYYYGGSTGGLKGHKFSLFD 308

Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA--NKSD--IPNYVNSTVENIIP 398
           GG+R   +   P +   G V  +    +D  PT ++AA  N SD  I       V  I  
Sbjct: 309 GGIRVPAIFSWPAMVPAGQVISEPCMGTDIFPTFINAAGGNASDYEISGCDILPVMTIGA 368

Query: 399 RYENSILRYENG 410
           R +   L +E G
Sbjct: 369 RRDKDCLYWEMG 380


>gi|223936859|ref|ZP_03628768.1| sulfatase [bacterium Ellin514]
 gi|223894428|gb|EEF60880.1| sulfatase [bacterium Ellin514]
          Length = 460

 Score =  164 bits (414), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 118/358 (32%), Positives = 174/358 (48%), Gaps = 70/358 (19%)

Query: 32  TRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           TRI  + +L +AF ++  F   V ++  P+I+ ILADDLG+ D+G +G   I TPN+D +
Sbjct: 2   TRISKY-LLVVAFLIAATFAPGVFAAKAPNIVIILADDLGYGDLGCYGHPSIRTPNLDRM 60

Query: 92  AYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQ---HNVLYGCERGGLPLSEKILPQ 147
           A  GI   ++Y    +CTPSR+ +MTG+ PI +GM    ++VL     GGLP +E  +  
Sbjct: 61  AAEGIRFTDFYVAANVCTPSRAGLMTGRWPIRSGMAGSTNDVLRVFSMGGLPTNEITIAA 120

Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR 207
            LK  GY T  +GKWHLG ++ ++ PT  GF           DYF         +GL   
Sbjct: 121 ALKSKGYATACIGKWHLG-HELQFLPTHHGF-----------DYF---------YGLRFS 159

Query: 208 RDLEP---------AWDLHGKYS----------------TDV------FTAEAVDIIHNH 236
            D+EP         +  LH K                  TD+      +T EA+  IH +
Sbjct: 160 NDMEPVRGKIPKNASSSLHPKLEWWNSALLQNDKILEQPTDLSTLTRRYTEEAIKFIHQN 219

Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
              +P FLY  H       P+ PL A D +       E   R  +  ++ +LD SVG+V+
Sbjct: 220 K-KQPFFLYFPHTF-----PHVPLFASDAF------KEHSARGLYGDVVEELDWSVGQVL 267

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP 354
             L +  +  N+++ F SDNG      +L   S  PL+  K + WEGG+R   + W P
Sbjct: 268 NTLRKEGLAENTLVFFTSDNGPWLIR-DLAGGSAGPLKDGKGSTWEGGMREPAIAWWP 324


>gi|325110321|ref|YP_004271389.1| arylsulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324970589|gb|ADY61367.1| Arylsulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 980

 Score =  163 bits (413), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 121/369 (32%), Positives = 186/369 (50%), Gaps = 40/369 (10%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHG------LDQIPTPNIDALAYSG 95
           + F L  +   +  ++  P+I+   ADD G  D GF G         + TP++DALA  G
Sbjct: 7   IVFRLITILPSVAGAADQPNILIFFADDAGNADFGFQGGGINGDFANL-TPHLDALAEGG 65

Query: 96  IILKN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG-CERGGLPLSEKILPQYLKELG 153
           +   N Y +  +CTPSR+ ++TG++    G++   +YG     GLP SE  +   L++ G
Sbjct: 66  VRFTNGYVSAAVCTPSRAGMLTGRYQHRFGVE--TVYGRIPEAGLPASEITMADTLRKAG 123

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD---- 209
           YRT  +GKWHLG +  E+ P  RGF+   G  TG + +F +        G  ++R+    
Sbjct: 124 YRTYAIGKWHLGEHLHEHLPNQRGFDEFYGALTGARTFFPYRGNNP---GSKLQRNGVFL 180

Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE---PLFLYLAHAATHSANPYEPLQAPDHY 266
            EP   L   Y TD+   + V  I +H  +    P FLYLA  A H+     PL+A    
Sbjct: 181 PEP---LDQPYFTDLLARQTVAYIDDHVANHANAPFFLYLAFTAVHT-----PLEADPKR 232

Query: 267 LNIHRHIEDF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
           L+  R I+D    +R   AA+   +D++VG V+  L +  +  N+I+VF+SDNGG     
Sbjct: 233 LD-DRRIQDISPPQRKTLAAMTIAMDDAVGTVMAKLRENNLTENTIVVFLSDNGGPEDNR 291

Query: 324 NLNA---ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY---VHVSDWLPTLL 377
           +L A   + N  L+G K+  +EGG+R   +I  P   +RG+  +     V   D  PT  
Sbjct: 292 SLRAPNWSDNGALQGNKSQNFEGGIRVPFVIHWPRGIARGLSGKSLPDVVTALDLFPTFA 351

Query: 378 SAANKSDIP 386
            AA K+ +P
Sbjct: 352 EAA-KATLP 359


>gi|86142609|ref|ZP_01061048.1| arylsulfatase A [Leeuwenhoekiella blandensis MED217]
 gi|85830641|gb|EAQ49099.1| arylsulfatase A [Leeuwenhoekiella blandensis MED217]
          Length = 477

 Score =  163 bits (413), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 115/337 (34%), Positives = 171/337 (50%), Gaps = 58/337 (17%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+ + I ADD+G+ D+  +G   I TPN+D +A  G    N+Y    +CTPSR+AIMTG+
Sbjct: 32  PNFVIIFADDMGYGDLSSYGHPTIKTPNLDKMAMEGQKWTNFYVGSSVCTPSRAAIMTGR 91

Query: 119 HPIHTGMQ---HNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
            P+  GM    H VL      GLP SE  + + LK+ GY T   GKWHLG  K+EY P  
Sbjct: 92  LPVRNGMTSKVHRVLQPWSTNGLPQSEITIAEQLKKGGYGTAAFGKWHLG-DKEEYLPVN 150

Query: 176 RGFESHLGYWTG---------------HQDYFD----HSAEEMKMWGLDMRRDLEPAWDL 216
            GF+    YW G                QDY++       +E+  + + + R+ E     
Sbjct: 151 NGFD----YWYGVPYSNDMDITLEMNEPQDYWNFWKSEERKEISSFNIPLMRNTEIIERP 206

Query: 217 HGKYS-TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
             +++ T  +T EA++ I  H  + P F+YLA+       P+ PL A + +         
Sbjct: 207 ANQHTITKRYTNEAIEYIKEHK-ENPFFIYLAYNL-----PHVPLFASEEFEG------K 254

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG--------GAAAGFNLNA 327
            KR  +  ++ ++D +VG+V+EAL+++ +  N+I+VF SDNG        G +AG     
Sbjct: 255 SKRGLYGDVVEEIDHNVGRVMEALKEQGIAENTIVVFTSDNGPWLVMDEEGGSAGL---- 310

Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAE 364
                LR  K + WEGG+R  G+ WSP     GIV E
Sbjct: 311 -----LRDGKGSTWEGGMREPGIFWSPGNLKPGIVTE 342


>gi|325108489|ref|YP_004269557.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324968757|gb|ADY59535.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 455

 Score =  163 bits (412), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 116/338 (34%), Positives = 170/338 (50%), Gaps = 27/338 (7%)

Query: 50  FVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCT 108
           +  L A +  P+++ I +DD G  D G +G D + TP ID LA +GI     Y    +C+
Sbjct: 22  WTSLEARAERPNVLVIYSDDQGTIDAGCYGSDDLTTPAIDHLAETGIRFTQMYAPSCICS 81

Query: 109 PSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
            SR+ ++TG+ P+  G+  NV       G+P SE  L    K+ GYRT  VGKWHLG Y 
Sbjct: 82  ASRAGLLTGRLPVRAGVPANVSSKKGAEGMPASEVTLGDVFKQAGYRTAHVGKWHLG-YT 140

Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD-----LH-GKYST 222
               P  +GF+S  G+  G  D + H       W    R DL   W       H G++  
Sbjct: 141 PATMPLGQGFDSSFGHMGGCIDNYSH----FFYWNGPNRHDL---WRDGKEVFHDGQFFP 193

Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
           D+   E   ++   S ++PLF+Y A        P+ P Q  D + N +  +E  +R ++A
Sbjct: 194 DLLVDEMQRLLTEKS-EQPLFVYWAINV-----PHYPYQGTDKWRNYYADLETPRR-EYA 246

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA---AGFNLNAASNWPLRGVKNT 339
           A +  LDE +G+V+  LE+     N+I+VF SD+G +    A F   +A   P RG K  
Sbjct: 247 AFVSTLDEKLGEVLATLEETGQRENTIVVFQSDHGHSTEERAFFGGGSAG--PYRGAKGC 304

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
           L+EGG+R   +I  P    +G V +Q     DWLPTL+
Sbjct: 305 LFEGGIRVPSVISWPAGLPQGEVRDQMAVGCDWLPTLV 342


>gi|410912979|ref|XP_003969966.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Takifugu
           rubripes]
          Length = 519

 Score =  163 bits (412), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 119/373 (31%), Positives = 181/373 (48%), Gaps = 44/373 (11%)

Query: 35  MAFAVLPLAFTLSM-VFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           M F  L L  T++         +   P+II +L DD+GW D+G  G     TPN+DA+A 
Sbjct: 1   MGFETLILLCTITCYCLAQKPGNVSSPNIIIMLMDDMGWGDLGAFGQPSKETPNLDAMAA 60

Query: 94  SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKIL 145
            G++L N+YT   LC+PSR+A++TG+ P+  G         N     E  GG+   E +L
Sbjct: 61  QGMLLLNFYTANPLCSPSRAALLTGRLPVRNGFYTTNGHARNAYTPQEIVGGISKDEILL 120

Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
           PQ LK+ GY  +IVGKWHLG ++ +Y P   GF+   G    H   +++S        + 
Sbjct: 121 PQMLKKRGYFNKIVGKWHLG-HRPQYLPLEHGFDEWFGAPNCHFGPYNNSVRP----NIP 175

Query: 206 MRRDLEPAWDLHGKYS--------------TDVFTAEAVDIIHNHS-TDEPLFLYLAHAA 250
           + R+   +W L G+Y               T ++  E +D I + +   +P FLY A  A
Sbjct: 176 VYRN---SWML-GRYYEEFKIDKKTGESNLTQMYLLEGLDFIQSQAEAQKPFFLYWAPDA 231

Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
           TH+     P+ A   +L         +R ++   + +LD SVG+++  L +  + SN+ +
Sbjct: 232 THA-----PVYASKDFLG------KSQRGRYGDAVMELDYSVGQILSLLRRLGIDSNTFV 280

Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
            F SDNG A       + SN P    K T +EGG+R   + W P     G V+ Q  +V 
Sbjct: 281 FFTSDNGAALMSGPNESGSNGPFLCGKETTFEGGMREPAIAWWPGHIREGTVSFQLANVM 340

Query: 371 DWLPTLLSAANKS 383
           D   T L+ A  S
Sbjct: 341 DLFTTSLALAGMS 353


>gi|390167238|ref|ZP_10219235.1| putative arylsulfatase A [Sphingobium indicum B90A]
 gi|389590183|gb|EIM68184.1| putative arylsulfatase A [Sphingobium indicum B90A]
          Length = 468

 Score =  163 bits (412), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 113/363 (31%), Positives = 180/363 (49%), Gaps = 25/363 (6%)

Query: 29  GYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNI 88
           G  TR  A A L  A     V     + +  P+I++I+ADD+G  D+G +G   I TP I
Sbjct: 7   GAMTRRTALAALGSAVAAPAVARGGRSGARRPNILYIMADDMGHADLGCYGSRDIRTPAI 66

Query: 89  DALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHP--IHTGMQHNVLYGCERGGLPLSEKIL 145
           DA+A  G+   N Y    +C+P+R A++TG++      G++  + +  +   LP   + L
Sbjct: 67  DAIAARGVKFGNAYANSCVCSPTRIALLTGRYQGRFRIGLEEPIAFNGDELSLPRGTRTL 126

Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
           P  L++LGY T +VGKWH+G      +P   G++   G  +G  DYF H+     + G +
Sbjct: 127 PGLLRDLGYATSLVGKWHVGELPAS-SPLDHGYDYFFGIASGGTDYFAHAT---TINGHE 182

Query: 206 MRRDLEPAWDLH-GKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAP 263
           M +  E   ++    Y TD+  A+A+D +   +  D P F+ L     H   P+ P Q P
Sbjct: 183 MGKLFENRTEIQRPGYLTDLLGAKAIDRMRLAARQDRPFFISL-----HFTAPHWPWQGP 237

Query: 264 DHYLNIH-----RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
                       R ++      +AA++  +D +VG+V+  L    +  ++I++F SDNGG
Sbjct: 238 GDAAQGSAAADPRQMDGGNVRAYAAMMEGMDANVGRVLAELAALGLQEDTIVIFTSDNGG 297

Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
                    + +WPL G+K  L EGG+R   ++  P    +G  + Q     D LPTLLS
Sbjct: 298 E------RFSDSWPLTGMKGELLEGGIRVPLIVSWPAHLPKGRDSAQVTMSMDALPTLLS 351

Query: 379 AAN 381
           AA 
Sbjct: 352 AAG 354


>gi|87198499|ref|YP_495756.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
 gi|87134180|gb|ABD24922.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
          Length = 491

 Score =  162 bits (411), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 111/357 (31%), Positives = 173/357 (48%), Gaps = 47/357 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I++I+ADDLG+ D+  +G     TP +D LA  G+   N Y    +CT +R  ++TG+
Sbjct: 56  PNILYIMADDLGYADLSCYGRRDFETPVLDKLAAQGLRFTNAYANSAVCTATRVGLITGR 115

Query: 119 HPIHTGMQHNVLYGCERG-------GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
           +      Q+ +  G E         GLP S   LP  L + GYRT ++GKWHLG    ++
Sbjct: 116 Y------QYRLPVGLEEPLAFRPNIGLPPSHPTLPSLLAKAGYRTSLIGKWHLGSLP-DF 168

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
            P   G+++  G  +G  DY+ H+    +  +W  D    +E A      Y TD+    A
Sbjct: 169 DPLKSGYQTFWGIRSGGVDYYTHATSNGQPDLW--DGPTPVERA-----GYLTDLLADRA 221

Query: 230 VDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHR-----------HIEDFK 277
           V  I   S+ E P F+ L     H   P+ P + PD      R           H +   
Sbjct: 222 VSEIREASSGEAPWFMSL-----HFTAPHWPWEGPDDASESARIAKLKDPSALFHFDGGS 276

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
            + +AA++ +LD  +G+V+EAL+  R   ++I+VF SDNGG         +  WP  G K
Sbjct: 277 AAIYAAMVRRLDYQIGRVLEALKANRAEQDTIVVFTSDNGGE------RFSDTWPFSGRK 330

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVE 394
             L EGG+R   ++  P +   G  ++  +   DWLPT L+AA  +  P + +  V+
Sbjct: 331 TELLEGGLRIPAIVRWPGVTRAGTTSDAQIISMDWLPTFLAAAGSAPDPGHPSDGVD 387


>gi|260060774|ref|YP_003193854.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
 gi|88784904|gb|EAR16073.1| arylsulfatase A (precursor) [Robiginitalea biformata HTCC2501]
          Length = 526

 Score =  162 bits (411), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 113/334 (33%), Positives = 170/334 (50%), Gaps = 31/334 (9%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
           PP+I+ I  DD G++DVG +G   IPTPN+DA+A  G++L N+Y  Q +C+ SR+ ++TG
Sbjct: 73  PPNIVIIFTDDQGYSDVGVYGARDIPTPNLDAMAADGLLLTNFYAAQPVCSASRAGLLTG 132

Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
            +P   G+ HN L      GL  +E+ L + L++ GYRT I GKWHLG +  ++ PT  G
Sbjct: 133 CYPNRVGI-HNALMPNSPVGLNPAEETLAELLRQQGYRTGIFGKWHLGDH-PDFLPTRHG 190

Query: 178 FESHLG------YWTGHQ---DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
           F+   G       W  H      FD     +     +  R ++   D   +  T   T  
Sbjct: 191 FDEFFGIPYSNDMWPLHPLQGPVFDFGPLPL----YEQERVVDTLEDQ--RLLTRQITER 244

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
           +VD I+ H  +EP FLY+ H       P+ PL   D +           R  +  ++ ++
Sbjct: 245 SVDFINRHK-EEPFFLYVPH-----PQPHVPLFVSDAFRGKS------GRGLYGDVIMEI 292

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           D SVG+V+ ALE   +  ++ ++F SDNG   A  N +  +  PLR  K T WEGGVR  
Sbjct: 293 DWSVGQVLGALEDNGLTDDTWVIFTSDNGPWLAYGNHSGRAE-PLREGKGTNWEGGVREP 351

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
            ++  P    RG V ++ +   D LPT+ S    
Sbjct: 352 CIMKFPGRLPRGKVLDEPLMAIDLLPTIASVTGS 385


>gi|343086301|ref|YP_004775596.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354835|gb|AEL27365.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 481

 Score =  162 bits (411), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 113/341 (33%), Positives = 174/341 (51%), Gaps = 29/341 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+II I ADDLG+ D+G  G   I TPN+D +A+ G    N+Y    +CTPSR+ ++TG+
Sbjct: 33  PNIIVIFADDLGYGDLGVFGHPSIKTPNLDKMAFEGQKWTNFYVAAPVCTPSRAGLLTGR 92

Query: 119 HPIHTGMQHN---VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
            PI +GM  +   VL+    GGLP SE  + + LK  GY+T  +GKWHLG +K  Y PT 
Sbjct: 93  LPIRSGMSSDDRRVLFPDSDGGLPQSEITIAKALKGNGYQTAAIGKWHLG-HKSPYLPTD 151

Query: 176 RGFESHLGY-WTGHQDYFDHSA-------EEMKMWGLDMRRDLEPA-WDLHGKYSTDVFT 226
            GF+S+ G  ++   D  D +        E+   + + + +D E A      +  T  +T
Sbjct: 152 HGFDSYFGIPYSNDMDKIDKTDHFTLTDNEKFDAYNVPLMKDAEIAERPTDQRTLTKRYT 211

Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
            EAV  I     DEP F+YLAH   H    +      D  L             +  ++ 
Sbjct: 212 EEAVSKIKAFK-DEPFFIYLAHNLPHIPL-FRSAAFKDQSLG----------GIYGDVIE 259

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
           ++D SVG+++  L++  +  N+++VF SDNG     F  +  +   LRG K   +EGG+R
Sbjct: 260 EIDWSVGQILSTLKEEGIAENTLVVFTSDNGPWHV-FKTHGGTAGLLRGAKGGTFEGGMR 318

Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
              + W P     G+V +    + D LPT   A + +++P+
Sbjct: 319 EPTVFWWPAQIKPGVVMDMGTTM-DLLPT-FCAISGTELPD 357


>gi|149197416|ref|ZP_01874467.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139434|gb|EDM27836.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 455

 Score =  162 bits (411), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 115/357 (32%), Positives = 181/357 (50%), Gaps = 35/357 (9%)

Query: 42  LAFTLSMVFVD---LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-I 97
           +    S++F     L  ++  P+II ILADDLG+ D+GF G   I TP+IDALA SG+  
Sbjct: 1   MKLIFSLIFFTYSTLALAAQKPNIILILADDLGYEDLGFLGAPDIKTPHIDALARSGMNF 60

Query: 98  LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER------GGLPLSEKILPQYLKE 151
            + Y +  +C PSR+ ++TG++    G   N     E        G+PL E+++   LK 
Sbjct: 61  TQGYQSASVCGPSRAGLLTGRYQQLFGSGENPPETGELSKRFPDAGIPLDEQMIFDLLKP 120

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR---- 207
             Y T ++GKWH+G    E  PT R  + + G+  G   Y      +M M G  M     
Sbjct: 121 AAYTTGVIGKWHMGL-SHEQRPTQRSVDYYYGFLNGAHSY---REAKMDMKGAPMTWPIF 176

Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
           R+ EP       Y+T+VF  E V+ I   + D+P FLY+++ + H      P +A    L
Sbjct: 177 RNNEPV--PFSGYTTEVFNDEGVNFI-KRNKDKPFFLYMSYNSVHG-----PWEAQPKDL 228

Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA-------A 320
               HI+   R  ++A+L  +D+ VG++++ L+   +  N++++F+SDNG          
Sbjct: 229 QRSDHIKKKWRRIYSAMLISMDDGVGRLIQTLKDEGIYENTLVIFMSDNGAPNNLHEAER 288

Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
           AG  L  ASN  LRG K   +EGG+R   ++  P +  +    +  V   D +PTL+
Sbjct: 289 AGDYL--ASNGSLRGRKGDTYEGGIRVPYIMSWPQVIPKQSTYQHPVSGLDIVPTLI 343


>gi|325285326|ref|YP_004261116.1| arylsulfatase [Cellulophaga lytica DSM 7489]
 gi|324320780|gb|ADY28245.1| Arylsulfatase [Cellulophaga lytica DSM 7489]
          Length = 466

 Score =  162 bits (411), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 103/340 (30%), Positives = 167/340 (49%), Gaps = 20/340 (5%)

Query: 51  VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTP 109
           V L+     P+I+FIL+DD G+ D GF G  +  TP +D LA   +   + Y +  +C P
Sbjct: 18  VSLLFGQQKPNIVFILSDDAGYADFGFQGSTEFKTPELDKLAKKSVKFTQAYVSAAVCGP 77

Query: 110 SRSAIMTGKHPIHTGMQHNVLYG--------CERGGLPLSEKILPQYLKELGYRTRIVGK 161
           SR+ ++TG++    G + N + G         ++ GLP ++  +  YLK+ GY T + GK
Sbjct: 78  SRAGMLTGQYQQKFGFEENNVPGYMSSSGLLGDQMGLPTNQLTIADYLKKQGYTTALFGK 137

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           WH+G     + PT RGF+   G+  G + Y  +    +     D        +  H  Y 
Sbjct: 138 WHMG-NADLFHPTKRGFDEFYGFRGGSRSYLPYDKNNVLTKNEDRLEKGFGNYLEHKGYL 196

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
           TD    EA+  I N +   P F+YL+  A H+     P++A    L    H+   KR   
Sbjct: 197 TDKLANEAISFI-NKNKKTPFFIYLSFNAVHT-----PMEATKEDLKYFSHLTG-KRKTL 249

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
           AA+   +D ++G V++ L+++ +  N+++VF +DNGG +   + N ++N PL G K    
Sbjct: 250 AAMTLAMDRAIGTVLDELKKQGLDKNTLVVFTNDNGGPS---DANQSNNNPLSGTKANHL 306

Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           EGG+R   L+  P    +       +   D  PT L+ AN
Sbjct: 307 EGGIRVPFLMSWPGKLKKNTTYNYPISTLDLFPTFLNVAN 346


>gi|297180066|gb|ADI16291.1| arylsulfatase a and related enzymes [uncultured bacterium
           HF0010_16H03]
          Length = 543

 Score =  162 bits (411), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 135/431 (31%), Positives = 199/431 (46%), Gaps = 76/431 (17%)

Query: 33  RIMAFAVLPLAFTLS----------MVFVDLVASSGPPHIIFILADDLGWNDVGFH---- 78
           R +A  ++P+A  LS             V+    +  P+II ILADDLG+NDV F+    
Sbjct: 21  RDIALTLIPIALKLSNPISENQEINWPIVESSVITTKPNIILILADDLGFNDVSFYNGGA 80

Query: 79  GLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQH---------- 127
               + TPNID LA  G+   N Y    +C+PSR+AIMTG++    G +           
Sbjct: 81  ADGSLMTPNIDNLAKEGVAFLNGYAASPVCSPSRAAIMTGRYSSRYGFEFTPYPAQAAKI 140

Query: 128 -NVL------------------YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
            N+L                   G   GGLP  E  + + L++ GY T  +GKWHLG + 
Sbjct: 141 MNLLREDDELETIYLEDVQLDEVGLTIGGLPDKEVTVAEMLQKNGYYTAHIGKWHLGGFT 200

Query: 169 KEYTPTFRGFE------SHLGYWTGHQDY----FDHSAEEMKMWG-------LDMRRDLE 211
               P  +GF+      S L     H D      D S E+M +W         +     +
Sbjct: 201 DGMKPNDQGFDDSLMLNSSLYLPKNHPDIVNAKIDSSIEDM-VWASSQFAASFNGSEPFK 259

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
           P     G Y TD +T EA+ +I+N+  D P FLYL H A     P+ PLQ+       H 
Sbjct: 260 P-----GGYITDYYTDEAIKVINNNK-DRPFFLYLGHFA-----PHNPLQSLRKDYEKHN 308

Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
           H+E+     ++ ++  LD S+GK+V  LE+  +  N++I+F SDNGG  AG+      N 
Sbjct: 309 HMENHTLQVYSGMIEALDRSIGKIVTTLEENGLTENTLIIFTSDNGG--AGYIGLDNINK 366

Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNS 391
           P RG K T +EGG+        P    + +  ++ +H +D   T+L AA+  + P  +  
Sbjct: 367 PYRGWKLTHFEGGMHIPFFAKWPAKIKKDMRYDKRIHHTDIFSTILGAAS-IEPPKEIKI 425

Query: 392 TVENIIPRYEN 402
              N+IP   N
Sbjct: 426 DGINLIPFLTN 436


>gi|255690671|ref|ZP_05414346.1| twin-arginine translocation pathway signal [Bacteroides finegoldii
           DSM 17565]
 gi|260623695|gb|EEX46566.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
          Length = 453

 Score =  162 bits (411), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 105/330 (31%), Positives = 178/330 (53%), Gaps = 24/330 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+++ IL DDLG  D+       + TPNID +  +G+ L N+Y    + +PSR+A++TG 
Sbjct: 31  PNVLLILVDDLGLGDLSCQYAKDLSTPNIDRIFETGVRLDNFYANSSVSSPSRAALLTGC 90

Query: 119 HPIHTGMQHNVLYGCER--GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
            P   G+   +    ++  G    S   +P+ LK  GYRT ++GKWHLG ++    P  R
Sbjct: 91  FPAMVGVPGVIRPSIDQNWGYFGPSAVTMPEVLKNGGYRTALIGKWHLG-WESPNLPNER 149

Query: 177 GFESHLGYWTGHQD-YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
           GF+   G+     D Y+ H  +      L+  ++++P       ++T++FT+ +VD I  
Sbjct: 150 GFDHFHGFLADMMDDYYTHRRQGGNYMYLN-DKEIDPK-----GHATELFTSWSVDYIKK 203

Query: 236 HSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED---FKRSKFAAILHKLDES 291
            + ++ P FLYLA+ A HS     PLQ P  ++N  +  +     KR++  A++  LD +
Sbjct: 204 EAKEKNPFFLYLAYNAPHS-----PLQPPVEWVNKVQERDKSLPVKRARLIALIEHLDYN 258

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +GKV+++LE+   L+N++++F SDNGG       + A+N P RG K  ++EGG+  A  +
Sbjct: 259 IGKVIQSLEESGQLNNTLVIFASDNGGDRG----SMANNGPTRGAKGDMFEGGIHVACAL 314

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             P +   G     +V + D +PT+    N
Sbjct: 315 NMPGVFEGGRRDNHFVVMMDLMPTICDFVN 344


>gi|344292944|ref|XP_003418184.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Loxodonta
           africana]
          Length = 513

 Score =  162 bits (411), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 116/365 (31%), Positives = 179/365 (49%), Gaps = 47/365 (12%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS   +  V +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++
Sbjct: 13  LLLVLSAAGLGAVGAPQPPNIVLMLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSF 72

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER---------GGLPLSEKILPQYLKE 151
           Y+   LC+PSR+A++TG+ PI  G       G  R         GG+P SE +LP+ LK+
Sbjct: 73  YSANPLCSPSRAALLTGRLPIRNGFY--TTNGHARNAYTPQDIVGGIPDSEHLLPELLKK 130

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
             Y T+IVGKWHLG ++ ++ P   GF+   G    H   +D+ A+      + + RD  
Sbjct: 131 ANYATKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRAKP----NIPVYRD-- 183

Query: 212 PAWDLHGKYS--------------TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANP 256
             W++ G++               T ++  EA+D I    S   P FLY A  ATH+   
Sbjct: 184 --WEMVGRFYEEFPINLKTGEANLTQIYLQEALDFIKRQQSQQRPFFLYWAIDATHA--- 238

Query: 257 YEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
             P+ A   +L         +R ++   + ++D+SVG++++ L    +  ++ I F SDN
Sbjct: 239 --PVYASKPFLGTS------QRGRYGDAVREVDDSVGQLLQLLRGLGIAEHTFIFFTSDN 290

Query: 317 GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           G A       A SN P    K T +EGGVR   + W P     G V+ Q   + D   T 
Sbjct: 291 GAALVSAPKEAGSNGPFLCGKQTTFEGGVREPAIAWWPGHIPAGQVSHQLGSLMDLFTTS 350

Query: 377 LSAAN 381
           LS A 
Sbjct: 351 LSLAG 355


>gi|355710478|gb|EHH31942.1| hypothetical protein EGK_13112 [Macaca mulatta]
          Length = 482

 Score =  162 bits (411), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 106/334 (31%), Positives = 165/334 (49%), Gaps = 42/334 (12%)

Query: 70  LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
           +GW D+G +G     TPN+D +A  G++  N+Y+   LC+PSR+A++TG+ PI  G    
Sbjct: 1   MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 60

Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
             +           GG+P SE++LP+ LK+ GY ++IVGKWHLG ++ ++ P   GF+  
Sbjct: 61  NAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEW 119

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
            G    H   +D+ A       + + RD    W++ G+Y               T ++  
Sbjct: 120 FGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKTGEANLTQIYLQ 171

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
           EA+D I   +   P FLY A  ATH+     P+ A   +L         +R ++   + +
Sbjct: 172 EALDFIKRQARHHPFFLYWAVDATHA-----PVYASKPFLGTS------QRGRYGDAVRE 220

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +D+SVGK++E L    +  N+ + F SDNG A       A SN P    K T +EGG+R 
Sbjct: 221 IDDSVGKMLELLHDLHVADNTFVFFTSDNGAALISAPEQAGSNGPFLCGKQTTFEGGMRE 280

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             L W P   + G V+ Q   + D   T L+ A 
Sbjct: 281 PALAWWPGHIAAGQVSHQLGSIMDLFTTSLALAG 314


>gi|198275209|ref|ZP_03207740.1| hypothetical protein BACPLE_01368 [Bacteroides plebeius DSM 17135]
 gi|198271792|gb|EDY96062.1| arylsulfatase [Bacteroides plebeius DSM 17135]
          Length = 509

 Score =  162 bits (410), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 118/394 (29%), Positives = 184/394 (46%), Gaps = 59/394 (14%)

Query: 39  VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
            L    TL+   +   + +  P+++FI+ DD GW DVG++G     TPNID LA  G+I 
Sbjct: 10  TLAGGVTLAANMLHAASDNRQPNVVFIMVDDYGWADVGYNGSRFYETPNIDRLASEGMIF 69

Query: 99  KN-YYTVQLCTPSRSAIMTGKHPIHTGM-----------------QHNVLYGCERGGLPL 140
            + Y    + +PSR ++MTGK+P  TG+                 Q+ +L       +PL
Sbjct: 70  TDGYAAASISSPSRVSLMTGKYPARTGITDWIPGYQYGLKPEQLKQYKMLAPEMPLNMPL 129

Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT-GHQDYFDHSAEEM 199
            E  + +  KE GY T  VGKWH       Y P ++GF+ ++G W  G  +    S    
Sbjct: 130 EEVTMAEAFKEHGYATYHVGKWHCA-EDSLYYPQYQGFDVNIGGWLKGSPNGIRRSQGGK 188

Query: 200 KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
             +    R    P     G++ TD    E++ +I N S D+P FLYLA  A H+     P
Sbjct: 189 GAYCSPYRNPYLPD-GPEGEFLTDRLGDESIKLIKNSSADKPFFLYLAFYAVHT-----P 242

Query: 260 LQAPDHYLN----------------IHRHIEDFKR----------------SKFAAILHK 287
           ++A   Y+                   R++E +K                 +++AA+++ 
Sbjct: 243 IEAKPEYVKYFKWKAQRMGLDTIVPFTRNLEWYKNAEYKAGHWKERTIQSDAEYAALIYS 302

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG+V++AL+   +  N+I+  +SDNGG +     +   N PLR  K  L+EGG+R 
Sbjct: 303 MDENVGRVMQALKDNGLDKNTIVCLLSDNGGLSTAEG-SPTCNAPLRAGKGWLYEGGIRE 361

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             +I  P +   G V    V   D+ PTLL  A 
Sbjct: 362 PFIIKYPQMVEAGSVCHTPVVAVDFYPTLLDMAG 395


>gi|426383228|ref|XP_004058189.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Gorilla gorilla
           gorilla]
          Length = 528

 Score =  162 bits (410), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 104/334 (31%), Positives = 165/334 (49%), Gaps = 42/334 (12%)

Query: 70  LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
           +GW D+G +G     TPN+D +A  G++  N+Y+   LC+PSR+A++TG+ PI  G    
Sbjct: 47  MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 106

Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
             +           GG+P SE++LP+ LK+ GY ++IVGKWHLG ++ ++ P   GF+  
Sbjct: 107 NAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEW 165

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
            G    H   +D+ A       + + RD    W++ G+Y               T ++  
Sbjct: 166 FGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKTGEANLTQIYLQ 217

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
           EA+D I   +   P FLY A  ATH+     P+ A   +L         +R ++   + +
Sbjct: 218 EALDFIKRQARHHPFFLYWAVDATHA-----PVYASKPFLGTS------QRGRYGDAVRE 266

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +D+S+GK++E L+   +  N+ + F SDNG A         SN P    K T +EGG+R 
Sbjct: 267 IDDSIGKMLELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMRE 326

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             L W P   + G V+ Q   + D   T L+ A 
Sbjct: 327 PALAWWPGHVAAGQVSHQLGSIMDLFTTSLALAG 360


>gi|410635995|ref|ZP_11346602.1| arylsulfatase [Glaciecola lipolytica E3]
 gi|410144672|dbj|GAC13807.1| arylsulfatase [Glaciecola lipolytica E3]
          Length = 499

 Score =  162 bits (410), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 112/349 (32%), Positives = 173/349 (49%), Gaps = 32/349 (9%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P++I I  DD G+ DVG  G D + TPN+D +A  G++L ++Y    LC+PSR+ +MTG 
Sbjct: 45  PNVIVIFTDDQGYEDVGVFGGDHVLTPNLDKMAEEGLMLTDFYVPSPLCSPSRAGLMTGS 104

Query: 119 HP----IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +P    + TG    VL   +  GL  +E  + + LK +GY T I GKWHLG  + E+ PT
Sbjct: 105 YPRRVDMATGSNFPVLLAADTKGLNPAEITIAEVLKSVGYATGIFGKWHLG-DQPEFLPT 163

Query: 175 FRGFESHLGYWTGHQDYFDHSAE-EMKMWGLDMRRDLEPAWDL--HGKYSTDVFTAEAVD 231
            +GF+   G    H     H  +   K   L +  + E   +L  + +Y T   T  A+D
Sbjct: 164 RQGFDEFFGLPYSHDIAPTHKRQAHFKFPDLPLMEN-ENVIELNPNPEYLTRRITERAID 222

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI-------HRHIEDF------KR 278
            I  H  D P FLYL H       P+ PL     +LN          +I D       K+
Sbjct: 223 FIERHQ-DAPFFLYLPHPM-----PHGPLHVSQEFLNTVDKATLARLNISDADIAANSKK 276

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
           + +  ++ +LD +VG +++ L++  + SN+I++F SDNG   +G   +  +   L G K 
Sbjct: 277 AMYPLVIGELDNAVGNILDTLKRLNLDSNTIVIFTSDNG--PSGRRNSDGTERYLSGHKT 334

Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
              EGG+R   ++W P       V++      D LPT+   A  +D+P 
Sbjct: 335 LTQEGGMRVPTVVWGPSQIPANTVSQALTTSMDILPTVAFLAG-ADVPT 382


>gi|449136530|ref|ZP_21771910.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
 gi|448884847|gb|EMB15319.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
          Length = 480

 Score =  162 bits (409), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 116/360 (32%), Positives = 178/360 (49%), Gaps = 56/360 (15%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           I+   ++ +      ++    A S  P+++ I+ADDLG+ + G  G  +IPTP IDALA 
Sbjct: 9   IVGRCIVCVTLATLTLYWQSTAVSSQPNLVVIIADDLGYGETGMMGNAEIPTPAIDALAR 68

Query: 94  SGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER-----GGLPLSEKILPQ 147
           SG+     Y T   C+PSR+  M+G++    G   N     ER      GLP  +K   +
Sbjct: 69  SGVRCTSGYVTSSYCSPSRAGFMSGRYQSRFGYDLNPT--GERNNHPNAGLPPQQKTFVE 126

Query: 148 YLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY----------------WTGHQD- 190
           +L+  GY T ++GKWHLG  +    PT +GF+   G+                WT  +D 
Sbjct: 127 HLQSAGYHTSLIGKWHLG-TRPPQVPTSKGFDRFFGFLHEGHFYVPGPPYENVWTMLRDN 185

Query: 191 -----YFDHSAEEMKMWGLDMRRDLEPAWD-----LHG-------KYSTDVFTAEAVDII 233
                 F+ +   ++    +  R  EPA+D     L G        Y TD  T +AVD I
Sbjct: 186 SLPAGQFETNQRTIRG---NYARINEPAYDTGNPVLDGGEPIDDWNYLTDTITDKAVDTI 242

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
            + +   P  + +++ A HS     P+QA         HI D +R  FA +L  LD  VG
Sbjct: 243 -SQAASNPFAMVVSYNAVHS-----PMQASLEDHAAMDHIADPQRRIFAGMLIALDRGVG 296

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
           ++++ L+++++   +++VF SDNGG  A      +SN PLRG K +L+EGGVR   +IWS
Sbjct: 297 RIIKKLDRQKLRQETLVVFFSDNGGPTAELT---SSNAPLRGGKGSLYEGGVR-IPMIWS 352


>gi|397468273|ref|XP_003805816.1| PREDICTED: N-acetylgalactosamine-6-sulfatase isoform 1 [Pan
           paniscus]
          Length = 482

 Score =  162 bits (409), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 104/334 (31%), Positives = 165/334 (49%), Gaps = 42/334 (12%)

Query: 70  LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
           +GW D+G +G     TPN+D +A  G++  N+Y+   LC+PSR+A++TG+ PI  G    
Sbjct: 1   MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 60

Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
             +           GG+P SE++LP+ LK+ GY ++IVGKWHLG ++ ++ P   GF+  
Sbjct: 61  NAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEW 119

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
            G    H   +D+ A       + + RD    W++ G+Y               T ++  
Sbjct: 120 FGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKTGEANLTQIYLQ 171

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
           EA+D I   +   P FLY A  ATH+     P+ A   +L         +R ++   + +
Sbjct: 172 EALDFIKRQARHHPFFLYWAVDATHA-----PVYASKPFLGTS------QRGRYGDAVRE 220

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +D+S+GK++E L+   +  N+ + F SDNG A         SN P    K T +EGG+R 
Sbjct: 221 IDDSIGKMLELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMRE 280

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             L W P   + G V+ Q   + D   T L+ A 
Sbjct: 281 PALAWWPGHVAAGQVSHQLGSIMDLFTTSLALAG 314


>gi|414344125|ref|YP_006985646.1| sulfatase [Gluconobacter oxydans H24]
 gi|411029460|gb|AFW02715.1| sulfatase [Gluconobacter oxydans H24]
          Length = 478

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 110/335 (32%), Positives = 170/335 (50%), Gaps = 29/335 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGK 118
           P+IIFILADDLG+ DV   G     TP+ID +A+ GI L + Y    +C+ +R+A++TG+
Sbjct: 45  PNIIFILADDLGYADVSCFGTPGFTTPSIDRIAHEGIKLTRAYANSAVCSATRTALITGR 104

Query: 119 HP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +   I  G++  +     + GLP     LP+ L+  GY+T +VGKWHLG    E+ P   
Sbjct: 105 YQDRIACGLEEPIASSSGKIGLPPGLPTLPEQLRRAGYKTALVGKWHLG-NPPEFGPLKS 163

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLD-MRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
           G++   G   G  DYF H  +     G D   RD     D HG Y T +   EAV ++  
Sbjct: 164 GYDHFYGISGGAADYFTHQGKP----GEDQFYRDDHLIHD-HG-YLTGLLGDEAVRLVET 217

Query: 236 HSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHR---HIEDF---KRSKFAAILHKL 288
            +++  P FL L     H   P+ P + P+      R    I D+    R  +  ++  +
Sbjct: 218 FASETSPFFLSL-----HFNAPHWPWEGPEDEAESKRLKGKIFDYDGGTRKTYGRMIKAM 272

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           D+ +G+V+  L+   +  N+I+VF SDNGG         +  WP  G+K  L EGG+R  
Sbjct: 273 DDQIGRVLATLDSYHLTENTIVVFTSDNGGE------RFSDTWPFTGMKGELLEGGLRVP 326

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            ++  P   + G  ++Q +   DWLP+ L+A   +
Sbjct: 327 AVLRWPARVAAGSESDQTLITMDWLPSFLAATGAT 361


>gi|116621986|ref|YP_824142.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
 gi|116225148|gb|ABJ83857.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
          Length = 461

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 118/324 (36%), Positives = 163/324 (50%), Gaps = 33/324 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+I+ ILADDLG+ D+G +G   I TPNID LA  G    ++Y+   +C+PSR+A+MTG+
Sbjct: 28  PNIVVILADDLGYGDLGCYG-SPIATPNIDRLAEEGARFTSFYSASPVCSPSRAALMTGR 86

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +P  T ++  V+ G    GLP SE  + Q LK  GYRT  +GKWH+G     Y PT RGF
Sbjct: 87  YP--TRVEVPVVLGPGDAGLPDSEITMAQVLKSAGYRTSCIGKWHIG-STPGYLPTNRGF 143

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           +   G          +SA+      +     + PA D     S+  FT EA+D +   + 
Sbjct: 144 DEFFG--------VPYSADITPCPLMRGSSVVAPAVDCSTLTSS--FTQEALDFMR-RAQ 192

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
           D P FLYLAH A     P+ PL A        R         +A ++ +LD S G+V+ A
Sbjct: 193 DNPFFLYLAHTA-----PHLPLAAS------PRFAGQSGLGMYADVVQELDWSTGQVMAA 241

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L+   + SN++++F SDNG    G      S   LRG K   +EGG+R   L   P +  
Sbjct: 242 LKATGLDSNTLVMFSSDNGPWYQG------SQGKLRGRKGETYEGGMREPFLARYPGVIP 295

Query: 359 RGIVAEQYVHVSDWLPTLLSAANK 382
            GI         D LPTL   A  
Sbjct: 296 SGIGCAGLATTMDLLPTLARLAGA 319


>gi|443716207|gb|ELU07840.1| hypothetical protein CAPTEDRAFT_136127, partial [Capitella teleta]
          Length = 311

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 92/275 (33%), Positives = 146/275 (53%), Gaps = 13/275 (4%)

Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
           SR + ++G++   + MQH V+   +   L L   ILP YLKELGY T   GKWHLG+ + 
Sbjct: 1   SRGSFLSGRYSYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRD 60

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           E TPT RGF+S  G ++G  ++++H+      +  D     E  +D  GK+S D+     
Sbjct: 61  ECTPTHRGFDSFSGGFSGEGEFYEHTTATGGYY--DWHLGTEVDYDAIGKHSEDLIGYYV 118

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE-DFKRSKFAAILHKL 288
              +  +    PLF+Y+A    HS     PL     +L ++  ++    R K+  ++  +
Sbjct: 119 NKTLDEYDQSSPLFMYVAFHNVHS-----PLDPKPEFLALYDDLDVTDDRKKYLGLVSGM 173

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           D  +G +V+ +++  +  N+ I+F SDNGG     ++    N P RG K+TLWEGG +  
Sbjct: 174 DYIIGGIVDKIKEIGIYENTYILFSSDNGG-----DVGEGDNSPHRGGKSTLWEGGCKAN 228

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
               SPLL + G+     +HV+DWLPT++  A  S
Sbjct: 229 SWTHSPLLGASGVENNGLMHVTDWLPTIVDLAGGS 263


>gi|311745970|ref|ZP_07719755.1| arylsulfatase [Algoriphagus sp. PR1]
 gi|311302452|gb|EAZ80458.2| arylsulfatase [Algoriphagus sp. PR1]
          Length = 476

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 115/365 (31%), Positives = 172/365 (47%), Gaps = 51/365 (13%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
           AS   P+IIFILADDLG+ D+GF G + I TPNID LA  G+   ++Y+   +C PSRS+
Sbjct: 14  ASPEKPNIIFILADDLGYGDLGFLGQEYIETPNIDRLAKEGMFFSDFYSGSTVCAPSRSS 73

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKIL--PQYLKELGYRTRIVGKWHLGFYKKEY 171
            +TG H  HT ++ N     E G  P+ + +L   + +K+ GY T   GKW LGF     
Sbjct: 74  FLTGMHTGHTPIRGNSEVQPE-GQFPMPDSVLTVAKVMKQAGYVTGAFGKWGLGFIGSTG 132

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA--WDLHGKYSTDVFTAEA 229
            P  +GFE   GY   +   + H      +W  D + DL P   W   G Y+ D+   + 
Sbjct: 133 EPASQGFEEFYGY---NCQRYAHRYYPAYLWHNDEKIDL-PGNDWTTKGDYAPDLIQEKT 188

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD-HYLNIHRHI--------------- 273
           ++ I + + + P FL++     H+      L APD   +  +R +               
Sbjct: 189 LEFI-DENKENPFFLFMPIVTPHAE-----LAAPDDEIMEKYRAMFPDEKPYVAGNGADY 242

Query: 274 -----------EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN-----G 317
                      + +  + FAA++ ++D  VG+V++ LE   +  N++I+F SDN     G
Sbjct: 243 GPDMRIPGYQSQPYPHATFAAMVERIDRYVGEVLQKLESNGLSENTVIIFTSDNGAHREG 302

Query: 318 GAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
           GA   F     SN   RG K  L+EGGVR   + W P     G   +      D LPT  
Sbjct: 303 GADPEF---FDSNGKFRGFKRDLYEGGVRAPMIAWWPNKIKAGSTTDHVSAFWDLLPTFA 359

Query: 378 SAANK 382
           +   +
Sbjct: 360 AIGGE 364


>gi|343082868|ref|YP_004772163.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342351402|gb|AEL23932.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 462

 Score =  161 bits (408), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 108/348 (31%), Positives = 184/348 (52%), Gaps = 38/348 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY-TVQLCTPSRSAIMTGK 118
           P+++ I++DD G++D+G  G   I TP +D LA  G+ L N+Y T   CTPSRS ++TG+
Sbjct: 25  PNVLLIISDDQGYHDLGSFGNKDILTPYLDQLAAEGVRLTNFYVTSSACTPSRSGLLTGR 84

Query: 119 HPIHTG----MQHNVL-YGCERG------------GLPLSEKILPQYLKELGYRTRIVGK 161
           +P   G     ++N++ YG +              G  L E  + + LKE GY    +GK
Sbjct: 85  YPQRNGTFELFRNNMVNYGHQYTDYEYSVSPERILGTDLREIFISELLKEGGYTNGYIGK 144

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           W LG  K+ Y P  +G++ + G+     DYF H     +     M    + + +  G+Y+
Sbjct: 145 WDLGQLKR-YLPMQQGYDKYYGFANTGIDYFTHE----RYGSPSMIDGNQLSLEDKGRYT 199

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP-----LQAPDHYLNIH---RHI 273
           T +F  EA++ I N S  +P FL LA  A HSA+  +P     +QA + YL ++      
Sbjct: 200 TALFEREALEFI-NGSQGKPFFLTLAFNAPHSASNLDPEIKGTVQATEKYLKMYPKSNTK 258

Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
              K+  + A + ++D+S+G ++E +++  +  N++++F+SDNGG      ++ A N+PL
Sbjct: 259 RGEKQRAYKAAVTQMDQSIGTILERIKEMGLEENTLVIFLSDNGG------MSLADNYPL 312

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           +G K   +EGGVR   L+  P     G   + ++   +  P +++AA 
Sbjct: 313 KGGKAQFFEGGVRVPCLVKWPEKIDAGQTNDAFLSSLELFPMIVAAAG 360


>gi|355757044|gb|EHH60652.1| hypothetical protein EGM_12064 [Macaca fascicularis]
          Length = 482

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 105/334 (31%), Positives = 164/334 (49%), Gaps = 42/334 (12%)

Query: 70  LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
           +GW D+G +G     TPN+D +A  G +  N+Y+   LC+PSR+A++TG+ PI  G    
Sbjct: 1   MGWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 60

Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
             +           GG+P SE++LP+ LK+ GY +++VGKWHLG ++ ++ P   GF+  
Sbjct: 61  NAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKVVGKWHLG-HRPQFHPLKHGFDEW 119

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
            G    H   +D+ A       + + RD    W++ G+Y               T ++  
Sbjct: 120 FGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKTGEANLTQIYLQ 171

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
           EA+D I   +   P FLY A  ATH+     P+ A   +L         +R ++   + +
Sbjct: 172 EALDFIKRQARHHPFFLYWAVDATHA-----PVYASKPFLGTS------QRGRYGDAVRE 220

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +D+SVGK++E L    +  N+ + F SDNG A       A SN P    K T +EGG+R 
Sbjct: 221 IDDSVGKMLELLHDLHIADNTFVFFTSDNGAALISAPEQAGSNGPFLCGKQTTFEGGMRE 280

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             L W P   + G V+ Q   + D   T L+ A 
Sbjct: 281 PALAWWPGHIAAGQVSHQLGSIMDLFTTSLALAG 314


>gi|47228126|emb|CAF97755.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 650

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 122/380 (32%), Positives = 170/380 (44%), Gaps = 68/380 (17%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLC------------ 107
           PHIIFIL DD G+ND+G+H    I TP +D LA  G+ L+NYY   +C            
Sbjct: 12  PHIIFILTDDQGFNDIGYHN-PTIKTPTLDKLAAEGVRLENYYVQPICTPSRSQLITGRY 70

Query: 108 -TPSRSAIMTGKH---------------PIHTGMQHNVLYGCERGGL-----------PL 140
            TPS SA++                    +H   Q   L    RG             P 
Sbjct: 71  ATPSTSAMLVFFFLFIFFLQIPDPHRPPALHHQAQPAQLSAFARGHPARAPPPGRILHPP 130

Query: 141 SEKILPQYLKE------------LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH 188
             ++ P  L+E            L    R  G   +G + +  +   R     L + +  
Sbjct: 131 GGQMAPGLLQEGVSSHPERLRHLLRLLDRQCGLLQVGLWPRPGS--VRRLSLGLVFLSPA 188

Query: 189 QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLA 247
             Y   S +   + G D+       W   GKYST +FT  A  I+ +H   E PLFL L+
Sbjct: 189 PSYL--SCDGPGVCGYDLHDGEGVVWGQEGKYSTALFTRRARQILESHDPAERPLFLLLS 246

Query: 248 HAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSN 307
             A H+     PLQ P  Y+  +R + +  R K AA++  +DE+V  V  AL +     N
Sbjct: 247 LQAVHT-----PLQTPKSYIYPYRDMTNVARRKLAAMVSTVDEAVRNVTYALRKYGYYRN 301

Query: 308 SIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYV 367
           S+I++ +DNG           SNWPLRG K T WEGG+RG   + SPLL  R  V++  +
Sbjct: 302 SVIIYSTDNGAQP----FTGGSNWPLRGRKGTYWEGGIRGVAFVHSPLLRRRRRVSKALL 357

Query: 368 HVSDWLPTL--LSAANKSDI 385
           H++DW PTL  L+  N S I
Sbjct: 358 HITDWFPTLVGLAGGNVSQI 377


>gi|423293951|ref|ZP_17272078.1| hypothetical protein HMPREF1070_00743 [Bacteroides ovatus
           CL03T12C18]
 gi|392677172|gb|EIY70591.1| hypothetical protein HMPREF1070_00743 [Bacteroides ovatus
           CL03T12C18]
          Length = 461

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 108/337 (32%), Positives = 179/337 (53%), Gaps = 34/337 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+ + I+ADD+G+ DVG +G + I TPNID +A  G++  ++++   + +P+R  ++TG+
Sbjct: 29  PNFVIIVADDMGYGDVGIYGNEYIKTPNIDQMAREGMMFTDFHSNGSVSSPTRCGLLTGR 88

Query: 119 HPIHTGMQHNVLYGCE----RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +    G++  +L   +      GL   E    + L + GYRT ++GKWHLG+ +K +   
Sbjct: 89  YQQRAGLEKVLLVPRDDKDKEVGLQSEEITFAKILGDNGYRTALIGKWHLGYLQKHHPMN 148

Query: 175 FRGFESHLGYWTGHQDYFDHSAE--EMKMW-GLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
           F GF+  +G+ +G+ DY  H     ++  W GL+M+       ++ G Y+T + T  + D
Sbjct: 149 F-GFQKFVGFKSGNVDYQSHRNRYGDVDWWDGLEMK-------NMPG-YTTTLLTTLSED 199

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH-YLNIHRHIEDFKRSK------FAAI 284
            I   + D+P  LY+AHAA HS     P+Q P+   +     +E  K S       +  +
Sbjct: 200 YI-KENKDKPFCLYIAHAAPHS-----PMQGPNEKAIRTEATLEGDKNSDRPNKEIYKDM 253

Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
           + +LD SVG+++E L++ ++  N+ +VF SDNG        N  S    RG K   WEGG
Sbjct: 254 VEELDWSVGRILETLKKYKLDKNTFVVFFSDNGPVIN----NGGSTGGYRGAKGAPWEGG 309

Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            R  G+ + P     G + +Q V   D  PT+L  AN
Sbjct: 310 HRVPGICYMPGTVKGGTMCKQTVMSFDLFPTMLDMAN 346


>gi|351729940|ref|ZP_08947631.1| sulfatase [Acidovorax radicis N35]
          Length = 446

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 113/354 (31%), Positives = 171/354 (48%), Gaps = 27/354 (7%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTG 117
           PP+I+FILADDLGW D+  +G     TP++DALA  G+   + Y    +C+ +R A++TG
Sbjct: 8   PPNIVFILADDLGWADLSVYGQADFSTPHLDALAGEGVRFTQAYANSAVCSATRFALITG 67

Query: 118 KHP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           ++   +  G++  ++      GLP     LP  L+  GY T ++GKWHLG     + P  
Sbjct: 68  RYQYRLRGGLEEPLVRKAHVHGLPPEHPTLPSLLRNAGYDTALIGKWHLGSLPT-FGPLK 126

Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
            G+    G + G  DYF H     +    D+     P   +   Y T +   EA   +  
Sbjct: 127 SGYNRFFGNYGGAIDYFTHKPGVGEAVPRDLYEGEVPVERV--GYYTQLLADEASGWLRG 184

Query: 236 HST-DEPLFLYLAHAATHSANPYEPLQAPDHY-----LNIHRHIEDFKRSKFAAILHKLD 289
            S   +P FL L     H   P+ P   P+       L    H +      +  ++  LD
Sbjct: 185 RSAGSKPFFLSL-----HFTAPHWPWVGPEDEEVSLGLKDLFHYDGGNLHTYGKMVRSLD 239

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
            +VG+V+ AL+++ +  N+I+VF SDNGG         +  WP  G K  L EGG+R   
Sbjct: 240 GAVGQVLHALQEQGLADNTIVVFTSDNGGE------RFSKTWPFTGQKTELLEGGIRVPT 293

Query: 350 LI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
           L+ W   +  +  V++Q     DWLPTLL+AA  +   +Y  S  ENI+P  E 
Sbjct: 294 LLRWPARIAPQ--VSDQVTATMDWLPTLLAAAGVAPDADY-PSDGENILPVLEG 344


>gi|182677550|ref|YP_001831696.1| sulfatase [Beijerinckia indica subsp. indica ATCC 9039]
 gi|182633433|gb|ACB94207.1| sulfatase [Beijerinckia indica subsp. indica ATCC 9039]
          Length = 486

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 109/355 (30%), Positives = 174/355 (49%), Gaps = 30/355 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
           P+IIFILADDLG+ DV  +G   + TPNID +   G  +L+ Y    +C+ +R+A++TG+
Sbjct: 37  PNIIFILADDLGYADVSIYGRPDLSTPNIDGIGLKGARLLQAYANSAVCSATRTALLTGR 96

Query: 119 HPI--HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +      G++  +  G    GLP     LP  LK+ GY T ++GKWHLG    ++ P   
Sbjct: 97  YQYRERVGLEEPIA-GNIHVGLPPQRPTLPSLLKKAGYTTTLIGKWHLGTL-PDFGPLQS 154

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
           G++   G+  G  DY+ H   + +    D    +         Y T++    A++ I+  
Sbjct: 155 GYDHFYGFRGGAVDYYSHKGTDDQDDLWDQDTKVH-----QTGYLTELLGDRAIETINAS 209

Query: 237 S-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR----HIEDF---KRSKFAAILHKL 288
           + T +P F+ L     H   P+ P +AP       R     + DF    ++ +  ++  +
Sbjct: 210 AKTGQPFFISL-----HFNAPHWPWEAPGDEAESARVAGTRLFDFDGGSQATYRGMIAAM 264

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           D  +G++V+AL+   +  N+I++F SDNGG         A  WP  G K  L EGG+R  
Sbjct: 265 DLQIGRIVQALQANGISENTIVIFTSDNGGE------RFADTWPFTGRKTELLEGGLRIP 318

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
            L+  P         +Q     DWLPTLL+AA     PN+ +  + N++P    S
Sbjct: 319 ALVSWPARIKADQTIDQVSISMDWLPTLLAAAGSEPDPNFPSDGI-NLLPFLSES 372


>gi|325107240|ref|YP_004268308.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324967508|gb|ADY58286.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 469

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 119/368 (32%), Positives = 185/368 (50%), Gaps = 48/368 (13%)

Query: 42  LAFTLSMVFVDL------VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
           LAF L  +F +L         S PP+ + I  DDLG+ D+G  G   I TP +D +A  G
Sbjct: 8   LAFGL--LFCNLPGDGQAAEKSTPPNFVVIFCDDLGYGDLGCFGHPSISTPQLDQMADEG 65

Query: 96  IILKNYYT-VQLCTPSRSAIMTGKHPIHTGM---QHNVLYGCERGGLPLSEKILPQYLKE 151
                +Y    +CTPSR+A++TG+ PI  GM   +  VL+   +GGLP  E  + + LK+
Sbjct: 66  QRWTQFYVGASVCTPSRAALLTGRLPIRNGMMSAKRRVLFPDSKGGLPAQELTIAELLKK 125

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY--------WTGHQDYFDHSAEEMKMWG 203
            GY T  +GKWHLG ++ E+ PT  GF+S+ G           GH +Y   +AE+   + 
Sbjct: 126 NGYATAAIGKWHLG-HRSEFLPTEHGFDSYWGIPYSNDMDREKGHPNYRQKAAEDPHYFS 184

Query: 204 ---------LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA 254
                    +D +  +E   + H    T  +T +A++ I  +  ++P FLYLAH      
Sbjct: 185 PIEQYQVPIIDNKEVVERPANQH--TITKRYTEKAIEFIKGNR-EQPFFLYLAHNL---- 237

Query: 255 NPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
            P+ PL A D +L      E   R  +  ++ ++D  VG++++ L++  +  N+++VF S
Sbjct: 238 -PHIPLYASDSFL------ETSPRGIYGDVVTEIDHGVGQILDTLKETGLAENTLVVFTS 290

Query: 315 DNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP-LLESRGIVAEQYVHVSDWL 373
           DNG     F+ +  S   LR  K   +EGG+R   ++W P  LE +  V  +     D L
Sbjct: 291 DNGPWLL-FDTHGGSAGLLREGKGATFEGGMRVPTVMWWPGTLEPK--VQTELGTTMDLL 347

Query: 374 PTLLSAAN 381
           PT  S A 
Sbjct: 348 PTFCSLAG 355


>gi|332263239|ref|XP_003280658.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Nomascus leucogenys]
          Length = 528

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 104/334 (31%), Positives = 164/334 (49%), Gaps = 42/334 (12%)

Query: 70  LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
           +GW D+G +G     TPN+D +A  G++  N+Y+   LC+PSR+A++TG+ PI  G    
Sbjct: 47  MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 106

Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
             +           GG+P SE++LP+ LK+ GY ++IVGKWHLG ++ ++ P   GF+  
Sbjct: 107 NAHARNAYTPQEIVGGVPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEW 165

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
            G    H   +D+ A       + + RD    W++ G+Y               T ++  
Sbjct: 166 FGSPNCHFGPYDNKARP----NIPVYRD----WEMVGRYYEEFPINLKTGEANLTQIYLQ 217

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
           EA+D I   +   P FLY A  ATH+     P+ A   +L         +R ++   + +
Sbjct: 218 EALDFIKRQARRHPFFLYWAVDATHA-----PVYASKPFLGTS------QRGRYGDAIRE 266

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +D+S+GK++E L    +  N+ + F SDNG A         SN P    K T +EGG+R 
Sbjct: 267 IDDSIGKMLELLHNLHIADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMRE 326

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             L W P   + G V+ Q   + D   T L+ A 
Sbjct: 327 PALAWWPGHVAAGQVSHQLGSIMDLFTTSLALAG 360


>gi|149174645|ref|ZP_01853271.1| sulfatase [Planctomyces maris DSM 8797]
 gi|148846755|gb|EDL61092.1| sulfatase [Planctomyces maris DSM 8797]
          Length = 405

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 105/339 (30%), Positives = 172/339 (50%), Gaps = 20/339 (5%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY-TVQLCTPSRSAI 114
           SS  P++I I  DD G  D+  +G   + TP++D++A  GI    +Y +  +C+PSR+ +
Sbjct: 5   SSEKPNVIIIFTDDQGSVDLNCYGAKDLITPHMDSIARRGIRFTQFYASAPVCSPSRAGM 64

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TG+ P   G+  NV     + G+P  +  + + +++ GY+T  +GKWHLG Y  E  P 
Sbjct: 65  LTGRFPARAGVPGNVSSHHGKSGMPTEQITIAEMMQQAGYQTAHIGKWHLG-YTPETMPH 123

Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL----EPAWDLHGKYSTDVFTAEAV 230
            +GFE+  G+  G  D + H       W    R DL    +  W   G +  D+   +  
Sbjct: 124 GQGFETSFGHMGGCIDNYSH----FFYWNGPNRHDLWENGKEVWR-DGAFFPDLMVEQCQ 178

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
           D I   + D+P FLY A        P+ PLQ  + +   + H+    R K+AA +  +D+
Sbjct: 179 DYIRK-AGDKPFFLYWAINV-----PHYPLQGKEKWRKTYAHLSS-PRDKYAAFVSTMDD 231

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL-NAASNWPLRGVKNTLWEGGVRGAG 349
            +G+V+  L+  ++   +II+F SD+G +          S  P RG K +L+EGG+R   
Sbjct: 232 CIGEVLATLDACQLREKTIIIFQSDHGHSHEERTFGGGGSAGPYRGAKFSLFEGGIRVPA 291

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
           +I  P   + G V +Q     DWLPT +SA   + +P +
Sbjct: 292 MISWPGTIAEGEVRDQLATGCDWLPT-ISALTGAPLPAH 329


>gi|343084600|ref|YP_004773895.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342353134|gb|AEL25664.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 472

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 116/359 (32%), Positives = 188/359 (52%), Gaps = 47/359 (13%)

Query: 52  DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPS 110
           +L A   PP+I++IL DDLGW D+G +G +   TPNID L   G+     Y+   +C+PS
Sbjct: 21  NLSAQERPPNIVYILIDDLGWKDLGCYGSEFYETPNIDKLRDQGMKFTAAYSASPVCSPS 80

Query: 111 RSAIMTGKHPIHTGMQHNVL------YGCERGGLP--------LSEKILPQYLKELGYRT 156
           R++I+TGK+P + G   ++       Y  E   +P        L EK++P+ L + GY +
Sbjct: 81  RASILTGKNPANIGFTGHITAIGKHRYPEEGRIIPPDDYMHVSLEEKMIPEILLQSGYTS 140

Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHL-GYWTGHQDYFDHSAEEMKMW-----GLDMRRDL 210
             +GKWH+G  +K + PT +GF  ++ GY  G    +    E  K W      LD R + 
Sbjct: 141 ASIGKWHVGEEEK-FFPTHQGFAINIAGYEHGSPPTYWGPFESEKSWNPVIKNLDNREE- 198

Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD-----H 265
                  G+Y T+  T EA++ I + + + P FLYL+H A H+     PL+APD     +
Sbjct: 199 -------GQYLTNRLTDEAINFI-DENKEGPFFLYLSHYAVHT-----PLEAPDSLIKKY 245

Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNL 325
            L +   +E  K + +AA++  +D +VG+++++L+   +  N+I++  SDNGG       
Sbjct: 246 ELKLEGQMEQ-KNAIYAAMIENMDWNVGRLLKSLDSLGLEGNTIVILGSDNGGEG----- 299

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
              +N PLR  K  ++EGG+R   +I  P        ++  V   D LPT++  A  S+
Sbjct: 300 RVTNNVPLREGKGYIYEGGIRVPLVIKWPGKVKPNSSSDVPVITDDMLPTIVEMAGLSN 358


>gi|87309613|ref|ZP_01091747.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
 gi|87287377|gb|EAQ79277.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
          Length = 496

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 132/469 (28%), Positives = 210/469 (44%), Gaps = 73/469 (15%)

Query: 22  PQYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGP---PHIIFILADDLGWNDVGFH 78
           P YL+      ++  F ++  A   S        S+ P   P+I+F L DDLGW D+G +
Sbjct: 7   PSYLQH-----KLTTFTLVIGAILFSFTLEKQANSAEPKRTPNIVFFLVDDLGWKDIGVY 61

Query: 79  GLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHNV-------- 129
           G     TPN+D LA SG+   N Y   Q+C+P+R++IMTGK+P   G+   +        
Sbjct: 62  GSSFYHTPNVDGLAASGMRFTNAYAACQVCSPTRASIMTGKYPQRVGITDYIGAAQPDKW 121

Query: 130 ------LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG 183
                 L    +  L L E  L + LK+ GY T   GKWHLG   +   P  +GF+ ++G
Sbjct: 122 KRNTPLLPAPYQTRLALEETTLAEALKQRGYATFFAGKWHLG--PEGNWPEDQGFDVNIG 179

Query: 184 YWT-----GHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
                   G + YF          G D            G++  D   +E V  I  H  
Sbjct: 180 GIDRGGPYGGKKYFSPYGNPRLTDGPD------------GEHLPDRLASETVKFIEQHQ- 226

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR-------------------- 278
           D+P   YL+  + H+     PL A +     +  I+   R                    
Sbjct: 227 DQPFLAYLSFYSVHT-----PLMAREDLKQKYDEIKQRIRFAGPIWGEEGKSKLRLVQEH 281

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
           S +A ++  +D +VGKV++AL++ ++  N++++F SDNGG +     +  SN PLRG K 
Sbjct: 282 SVYAGMVEAMDAAVGKVLDALDRLKLTDNTLVIFTSDNGGLSTSEG-HPTSNLPLRGGKG 340

Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIP 398
            ++EGG+R   ++  P + S G  ++  V   D+LPT+L+  +K    + +++   +II 
Sbjct: 341 WMYEGGIREPLVVRYPGVTSPGSESDALVTSPDFLPTILAVVDKPG--DKIDTDGVSIIS 398

Query: 399 RYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHE 447
             E   L  + G   ++ P   N          E + K    YE+G  E
Sbjct: 399 ALEGKPL--DRGPIFWHYPHYGNQGGSPTAAVREGDWKLIEWYEDGKVE 445


>gi|149175125|ref|ZP_01853748.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
           8797]
 gi|148846103|gb|EDL60443.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
           8797]
          Length = 413

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 108/334 (32%), Positives = 173/334 (51%), Gaps = 33/334 (9%)

Query: 66  LADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGKHPIHTG 124
           +ADDLG+ D+  +G     TP++D LA +GI   ++++   +C+P+R+ ++TG++    G
Sbjct: 1   MADDLGYGDLSCYGSQNCNTPHLDRLAANGIRFTDFHSSGAVCSPTRAGLLTGRYQQRAG 60

Query: 125 MQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           +   V+Y   +     GL  +E  L Q L++ GY+T + GKWHLG Y+++Y PTFRGF+ 
Sbjct: 61  ID-GVVYANPKKNRHHGLQKNEITLAQCLQDAGYQTGMFGKWHLG-YQRQYNPTFRGFQQ 118

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            +GY +G+ DYF H     +  W  +   + E        Y T +    A++ I     +
Sbjct: 119 FVGYVSGNVDYFAHLDGTGVFDWWHNAELNREEQ-----GYVTHLINDHALEFIRQQQ-E 172

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH-------KLDESV 292
           +P F+Y+AH A HS     P Q P           D K +K   I +       ++D+ +
Sbjct: 173 KPFFVYIAHEAVHS-----PYQGPHDQPMRKEGGGDIKSAKRKDIANAYREMNTEMDKGI 227

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G++V+ L++  +   + I F+SDNG    G      SN  LRG K +LWEGG R   +  
Sbjct: 228 GQIVDVLKEVNLTEKTFIFFLSDNGANKNG------SNGKLRGFKGSLWEGGHRVPAIAC 281

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
            P     G V ++ V   D +PT+L  AN + IP
Sbjct: 282 WPGRIPEGTVCDEPVISIDLMPTILELAN-AKIP 314


>gi|296124231|ref|YP_003632009.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296016571|gb|ADG69810.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 467

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 114/360 (31%), Positives = 171/360 (47%), Gaps = 32/360 (8%)

Query: 33  RIMAFAVLPLAFTLSMVFVDLVASSGP----PHIIFILADDLGWNDVGFHGLDQIPTPNI 88
           R+    +L   FT  +       +       P+I+ IL+DD G  +    G  +  TP+I
Sbjct: 8   RLWLVLILAFQFTSQLALAQRATTETTSERRPNILLILSDDCGHAEFSIQGHPRYKTPHI 67

Query: 89  DALAYSGIILK-NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG-CERGGLPLSEKILP 146
           D++  +G+  +  Y +  +C+PSR+ ++ G++    G + N+     E  GLP SE +LP
Sbjct: 68  DSIGKNGVHFRQGYVSGCVCSPSRAGLLAGRYQQRFGHEFNIPPAYSETNGLPRSETLLP 127

Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDM 206
           Q LKE GYRT  +GKWHLG Y  ++ P  RGF  + G+  G + YF             M
Sbjct: 128 QLLKEDGYRTIALGKWHLG-YAPQFHPMERGFTDYYGFLQGSRSYFPLKKPTRLN---QM 183

Query: 207 RRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP--LQAPD 264
            RD     +    Y TD    EA+  I    + +P  +YLA  ATHS N      LQA D
Sbjct: 184 LRDRTAIPEEQFGYMTDHLADEAIAYIKQWQS-QPWMMYLAFNATHSPNDATAVDLQAAD 242

Query: 265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN 324
                         +K  A+   LD +VGKV++AL++  +  +++++F++DNGGA     
Sbjct: 243 G-------------NKIYAMTIALDRAVGKVLDALKECGLSKDTLVIFINDNGGAG---- 285

Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
                N  L G K + WEGG R   L+  P     G V ++ V   D  PT+L  A   D
Sbjct: 286 --GHDNGSLHGKKGSTWEGGTRIPFLVQYPAKIPSGQVIDEPVIALDLFPTILDVAGLGD 343


>gi|149701806|ref|XP_001488119.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Equus caballus]
          Length = 491

 Score =  160 bits (405), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 107/338 (31%), Positives = 166/338 (49%), Gaps = 47/338 (13%)

Query: 69  DLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQH 127
           ++GW D+G +G     TPN+D +A  G++  N+YT   LC+PSR+A++TG+ PI  G   
Sbjct: 8   NMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYTANPLCSPSRAALLTGRLPIRNGFYT 67

Query: 128 NVLYGCER---------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
               G  R         GG+P SE++LP+ LKE GY ++IVGKWHLG ++ ++ P   GF
Sbjct: 68  TS--GHARNAYTPQEIVGGIPDSERLLPELLKEAGYVSKIVGKWHLG-HRPQFHPLKHGF 124

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDV 224
           +   G    H   +D+ A       + + RD    W++ G+Y               T +
Sbjct: 125 DEWFGSPNCHFGPYDNRARP----NIPVYRD----WEMVGRYYEEFPINLKTGEANLTQI 176

Query: 225 FTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
           +  EA+D I    +   P FLY A  ATH+     P+ A   +L         +R ++  
Sbjct: 177 YLQEALDFIRRQQAARRPFFLYWAVDATHA-----PVYASKPFLGTS------QRGRYGD 225

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
            + ++D+SVGK++ +L+   +  ++ + F SDNG A         SN P    K T +EG
Sbjct: 226 AVREIDDSVGKILHSLQDLGIAESTFVFFTSDNGAALISAPKQGGSNGPFLCGKQTTFEG 285

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           G+R   + W P     G V+ Q   + D   T LS A 
Sbjct: 286 GMREPAVAWWPGRIPAGQVSHQLGSIMDLFTTSLSLAG 323


>gi|430745365|ref|YP_007204494.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430017085|gb|AGA28799.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 476

 Score =  160 bits (405), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 133/422 (31%), Positives = 200/422 (47%), Gaps = 59/422 (13%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           ++  A +P AF  S         +G P+++ ILADDLG+ D+  +G   + TPNIDAL  
Sbjct: 12  VVVLAGVPEAFGAS-------PDAGRPNVLLILADDLGYGDLSSYGAADLKTPNIDALVA 64

Query: 94  SGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER--GGLPLSEKILPQYLK 150
           SG+    +Y    +C+P+R+A++TG +P   G+   +    +   G L     +LPQ LK
Sbjct: 65  SGVRFDRFYANSPVCSPTRAALLTGCYPDLVGVPGVIRTHPDDSWGVLSPQAVLLPQVLK 124

Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQ-DYFDHSAEEMKMWGLDMRRD 209
             GY T +VGKWHLG       P+ RGF+   G+      DY +H    +      MRRD
Sbjct: 125 GAGYHTALVGKWHLGLSGAS-LPSRRGFDLFHGFLGDMMDDYHNHRRHGINY----MRRD 179

Query: 210 ---LEPAWDLHGKYSTDVFTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
              ++P       ++TD+F+  A+D ++ +   D P FL LA+   H+     P+Q P  
Sbjct: 180 DREIDPK-----GHATDLFSQWAIDFLNESKGQDRPFFLELAYNVPHT-----PVQPPQE 229

Query: 266 YLNIHRHIE---DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
           +L   R      D KR++  A++  LD  +G+V+ AL        ++IVF SDNGG    
Sbjct: 230 WLEKVRRRAPNLDPKRARLVALIEHLDHGIGQVLAALRANGQAERTLIVFTSDNGG---- 285

Query: 323 FNLNAASNW-PLRGVKNTLWEGGVR-GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             L+A ++  P RG K  ++EGG+R     +W   +E  G  +++     D  PTL  AA
Sbjct: 286 -QLDAGAHCGPYRGGKQDMYEGGIRVPLCAVWPGRIEP-GTRSDRVAVSMDLFPTLCEAA 343

Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENS--NTRYENGTHEYNPKYE 438
                       V+ I      S+L    G H    P IE      R E G     PKY 
Sbjct: 344 GAR-----FEHPVDGI------SLLSTFQGHHNKAEPPIERDLIFVRREGG-----PKYH 387

Query: 439 NR 440
            +
Sbjct: 388 GQ 389


>gi|392969626|ref|ZP_10335041.1| sulfatase [Fibrisoma limi BUZ 3]
 gi|387841820|emb|CCH57099.1| sulfatase [Fibrisoma limi BUZ 3]
          Length = 477

 Score =  160 bits (405), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 123/411 (29%), Positives = 192/411 (46%), Gaps = 44/411 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
           P+I+FILADDLG+ D+  +G     TP++D+LA  GI   + Y    +C+PSR+AI+TGK
Sbjct: 37  PNIVFILADDLGYMDLRCYGNPYNETPHLDSLARRGIRFTQAYSACPVCSPSRAAILTGK 96

Query: 119 HPIHTGMQHNVLYGCERGG-------------LPLSEKILPQYLKELGYRTRIVGKWHLG 165
           HP    + + +  G ER               LP SE  L + LK+ GY T +VGKWHLG
Sbjct: 97  HPARLHLTNFI--GGERVDTTSSLLPAEWRRYLPASETTLAELLKQQGYVTGMVGKWHLG 154

Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
                 TPT +GF+          DY+++S         +         D   +Y TD  
Sbjct: 155 NTGDSLTPTAQGFDYERQISKNGLDYYNYSIASNNKTVFE---------DTGKEYLTDKL 205

Query: 226 TAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAP-----DHYLNIHRHIEDFKRS 279
           T  A++ I  N +  +PLFLYLA++A H       L  P       +L  +         
Sbjct: 206 TDYALEFIDQNKAGQKPLFLYLAYSAPHI------LLVPRADKLSKFLYRYPRFNGRYNP 259

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA---AGFNLNAASNWPLRGV 336
            +AA+L   DE VG+V++ L+   +  N+++VF SDNGG     AG       N PLR  
Sbjct: 260 SYAAMLLSFDEGVGRVLQKLKDTGLADNTLVVFTSDNGGLGMPEAG--PTPTDNSPLRKW 317

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENI 396
           K  ++EGG+R   ++    + +R    ++Y+  +D++PT +     +  P   +   ++ 
Sbjct: 318 KGHVYEGGIRVPLIMAGSGITARNATCDRYLTGTDFVPTFMEMLQTNWKPAVPDG--KSF 375

Query: 397 IPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHE 447
                N    ++ G   ++ P   N  +R      + + K    YE G  E
Sbjct: 376 YALLSNPAAAHKRGPVFWHYPHFSNQASRPSAAIRDGDYKLVEHYETGNTE 426


>gi|395537754|ref|XP_003770857.1| PREDICTED: arylsulfatase A [Sarcophilus harrisii]
          Length = 507

 Score =  160 bits (405), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 112/359 (31%), Positives = 171/359 (47%), Gaps = 34/359 (9%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRS 112
           +A + PP+II I ADDLG+ D+G +G     TPN+D +A  G+   ++Y  V LCTPSR+
Sbjct: 15  LAVARPPNIILIFADDLGYGDLGSYGHPSSSTPNLDRMAAYGLRFTDFYVPVSLCTPSRA 74

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF-YKKEY 171
           A++TG++P+ +G+   VLY   RGGLPL E  + + L + GY T + GKWHLG      +
Sbjct: 75  ALLTGRYPVRSGLYPGVLYPASRGGLPLDEVTIAEVLAKRGYLTGMAGKWHLGVGLNGTF 134

Query: 172 TPTFRGFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-------MRRDL----EPAWD 215
            P ++GF  +LG    H     Q       + +   G D       +  +L    +P W 
Sbjct: 135 LPPYQGFHRYLGIPYSHDQGPCQKLICFPPDTLCFGGCDQGNVPIVLMANLTVKEQPPW- 193

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
           L G     V  A+   ++     D P FLY A   TH               +    +E+
Sbjct: 194 LPGLEKQYVAFAQNF-MVDAARQDRPFFLYYASHHTHYPQ-----------FSGESFVEN 241

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
             R  F   L ++D +VG ++  +E   +   +++ F +DNG           S+  LR 
Sbjct: 242 SGRGPFGDALMEMDSAVGALLNTVESLGLTEETLVFFTADNGPETMRL-FRGGSSGILRC 300

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVE 394
            K T +EGGVR   L + P    R  V+ +     D LPT L+A  ++ +PN     V+
Sbjct: 301 GKGTTYEGGVRQPALAFWP-GHIRPGVSHELASTLDLLPT-LAALAQAQLPNVTLDGVD 357


>gi|340369799|ref|XP_003383435.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Amphimedon
           queenslandica]
          Length = 523

 Score =  160 bits (404), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 110/348 (31%), Positives = 171/348 (49%), Gaps = 44/348 (12%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P++IF+L DD+GW D+G +G     TPN+D +A  G++L ++Y+   LC+PSR+A++TG+
Sbjct: 33  PNMIFMLMDDMGWGDLGVYGHPVKETPNLDKMALEGMLLPDFYSANPLCSPSRAAMLTGR 92

Query: 119 HPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
            PI  G      +           GG+P SE + P+ L++ GY T I+GKWHLG  +  Y
Sbjct: 93  LPIRNGFYTTNAHARNAYTPQDIVGGIPDSEILYPELLQKNGYATMIIGKWHLG-QQTHY 151

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV------- 224
            P   GF+   G    H   FD   +      + + R+   A    G+Y  D        
Sbjct: 152 HPLKHGFDEFFGSTNCHFGPFDGKEQP----NMPVYRNATMA----GRYYQDFPINHKTG 203

Query: 225 -------FTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
                  +T EA+  I+ N +  +P FLY    ATH+     PL A   +L         
Sbjct: 204 ESNLTVEYTQEAIKFINKNAANKKPFFLYWTPDATHT-----PLFASKDFLGTS------ 252

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
           +R  +   + +LD SVG++++ L+   +  +++++F SDNGGA      +  SN P    
Sbjct: 253 QRGLYGDAVKELDYSVGQILDTLKTLGIDKDTLVIFSSDNGGATYA-KESGGSNSPFLCG 311

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
           K T +EGG+R   + W P     G V+ Q   + DW  T L  A  S+
Sbjct: 312 KETTFEGGMREPTIAWWPGTIQPGQVSHQLGSLMDWYSTALDLAGISE 359


>gi|296120840|ref|YP_003628618.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296013180|gb|ADG66419.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 517

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 109/340 (32%), Positives = 166/340 (48%), Gaps = 46/340 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P++IFI+ADDLG+ D+G  G     TP+ID LA  G+   N++  Q C P+R+AIMTG++
Sbjct: 52  PNVIFIMADDLGYTDLGCFGSRYYETPHIDQLARQGVRFLNHHHCQNCAPTRAAIMTGQY 111

Query: 120 PIHTGM-----------QHNVLYGCER-GGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
              TG+           Q   L   E    LPL    + Q ++  GY+T + GKWHLG  
Sbjct: 112 GPRTGVYTVGSIDRFDWQSRPLRPVENVEKLPLDPATIAQQVQRSGYQTGMFGKWHLGL- 170

Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
              Y P+ RGF   +     H ++    A++                D  GKY  D  T 
Sbjct: 171 NGSYHPSQRGFHEAVESSGQHFNFKTTPAQK----------------DTEGKYLADYLTD 214

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH---IEDFKRSKFAAI 284
            A+D I  H + EP FLYL H   HS     P QA   ++    +   +   +   +AA+
Sbjct: 215 RAIDFIERHQS-EPFFLYLPHFGVHS-----PFQAKKEWIEKFENKPGVGGHRNPVYAAM 268

Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA----GFNL--NAASNWPLRGVKN 338
           +  +DESVG++++ L++ ++   ++++F SDNGG       G +   +   N PLR  K 
Sbjct: 269 IASVDESVGRILDKLDELKLADKTVVIFASDNGGVGGYEREGLHKANDVTDNAPLRSGKG 328

Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQ-YVHVSDWLPTLL 377
           +L+EGG R   ++  P +   G+      +HV D  PT L
Sbjct: 329 SLYEGGTRVPLIVRWPGVAPAGVECRTPTIHV-DLYPTFL 367


>gi|170726755|ref|YP_001760781.1| sulfatase [Shewanella woodyi ATCC 51908]
 gi|169812102|gb|ACA86686.1| sulfatase [Shewanella woodyi ATCC 51908]
          Length = 548

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 128/377 (33%), Positives = 183/377 (48%), Gaps = 65/377 (17%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQL-CTPSRSAIMTGK 118
           P+I+ ILADD+G NDV   G   I TPNID LA  G +  N Y+    C PSR+A++TG+
Sbjct: 63  PNIVLILADDMGINDVSTFGGGMIETPNIDKLAAKGALFTNGYSGHANCAPSRAALLTGR 122

Query: 119 H---------PIHTGMQHNVL--------------YGCERG---------GLPLSEKILP 146
                     PI  GM   +               Y  E           GLP SE ++P
Sbjct: 123 DATRTGYDTTPIPDGMSRIIAAIENNEDNGRPEMSYSAEADATNPTYDNRGLPGSEILIP 182

Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE---EMKMWG 203
           + LKE GY T  +GKWHLG    E  P  +GF+  L   +G     DH       ++  G
Sbjct: 183 EILKESGYHTMHIGKWHLG-RSPEMMPNAQGFDESLMMDSGLYLPVDHPESVNAPVESSG 241

Query: 204 LD------MRRDLEPAWDLH-----GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATH 252
           LD      MR  +   W+         Y TD FT EA   I  ++ + P FLYLAH    
Sbjct: 242 LDRFIWATMRYSVN--WNGGEIFKPNGYLTDYFTEEAEKAIEANA-NRPFFLYLAHWG-- 296

Query: 253 SANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVF 312
              P+ P+QA          I+   +  +AA+L  +D SV +V+  LE++ +  N+I++ 
Sbjct: 297 ---PHNPVQAKRADYEAVGDIQPHNKRVYAAMLRSIDRSVERVMAKLEKQGIADNTIVIL 353

Query: 313 VSDNGGA--AAGFNLNAASNWPLRGVKNTLWEGGVRGA-GLIWSPLLESRGIVAEQYVHV 369
            SDNGGA   A  +LN     P RG KNT +EGG+R    + W  +++   ++ E   H+
Sbjct: 354 SSDNGGADYVAINDLNK----PYRGWKNTFFEGGIRVPFSVTWPNVIDESTVIEEPVNHI 409

Query: 370 SDWLPTLLSAANKSDIP 386
            D +PT+++ AN +D+P
Sbjct: 410 -DLMPTIINMAN-ADLP 424


>gi|196231555|ref|ZP_03130413.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196224408|gb|EDY18920.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 467

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 117/357 (32%), Positives = 172/357 (48%), Gaps = 42/357 (11%)

Query: 46  LSMVFVDLVASSGP--PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
             +V   L A + P  P+ IFILADDLGW DVGFH    +PTPN+D LA  G+ L  +Y 
Sbjct: 25  FGLVVSSLAADTAPLRPNFIFILADDLGWGDVGFH-HGNVPTPNLDHLAGEGLELMQHYV 83

Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
             +C+P+R A ++G++      + +V               L + LK +GY T + GKWH
Sbjct: 84  YPVCSPTRCAFLSGRY----ASRFSVTTPQNPRAFRWDTVTLARALKSVGYDTALCGKWH 139

Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS---AEEMKMWGLDMRRDLEPAWDLHGKY 220
           LG  K E+ P   GF+   G   G    +DH     E  + W  D +   E        +
Sbjct: 140 LG-SKPEWGPQKFGFDHSYGSLAGGVGPWDHHYKIGEFTQTWHRDGKLIEEQG------H 192

Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDFKRS 279
            TD+ T EAV+ + +  TD+P FLY+   A H      P++ PD  L  +   I      
Sbjct: 193 VTDLITKEAVEWLESR-TDKPFFLYVPFTAVHI-----PIREPDEILQRVPASITKPSLR 246

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN------------- 326
            + A +  LD+SVGK++ ALE+     N++++F SDN GA  G   N             
Sbjct: 247 HYGANVMHLDDSVGKILVALEKTGKAGNTLVIFGSDN-GAIPGVENNDPLYPPDHYPPGP 305

Query: 327 -AASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
              SN PL G+K  ++EGG+  A +  W   L+    +     H++DW+PT  + A 
Sbjct: 306 AGGSNEPLHGMKGEVYEGGIHTAAVARWPGQLKPGKFLG--LAHITDWMPTFCALAG 360


>gi|148223113|ref|NP_001088386.1| galactosamine (N-acetyl)-6-sulfate sulfatase precursor [Xenopus
           laevis]
 gi|54038351|gb|AAH84661.1| LOC495239 protein [Xenopus laevis]
          Length = 520

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 112/370 (30%), Positives = 181/370 (48%), Gaps = 48/370 (12%)

Query: 39  VLPLAFTLSMVFVDLVASSGP-----PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           VL +  +L++ F++ V   G      P+I+ +L DD+GW D+G  G     TPN+D +A 
Sbjct: 2   VLYIMCSLALAFLNPVLVLGGMNVTRPNILLLLMDDMGWGDLGVFGEPSRETPNLDKMAS 61

Query: 94  SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKIL 145
            G++  N+YT   LC+PSR+A++TG+ PI  G         N     E  GG+  SE + 
Sbjct: 62  EGMLFPNFYTANPLCSPSRAALLTGRLPIRNGFFTTNSHARNAYTPQEIVGGISDSEILF 121

Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
           P+ LK+ GY  +I+GKWHLG ++  Y P   GF+   G    H   +D+     ++  + 
Sbjct: 122 PELLKKAGYINKIIGKWHLG-HRPSYHPLRHGFDEWFGSPNCHFGPYDNK----QIPNIP 176

Query: 206 MRRDLEPAWDLHGKYS--------------TDVFTAEAVDIIHNHST-DEPLFLYLAHAA 250
           + RD    W++ G+Y               T ++  EA++ I++ +   +P FLY +  A
Sbjct: 177 LYRD----WNMVGRYYEEFNINHKTGESNLTQIYLQEALEFIYSQAVAQQPFFLYWSIDA 232

Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
           TH+     P+ A   +L          R  +   + ++D S+GK++  L    +  ++ +
Sbjct: 233 THA-----PVYASRPFLGTS------NRGLYGDAVREIDFSIGKILNLLNLTGIAKDTFV 281

Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
            F SDNG A     +   SN P    K T +EGG+R  G+ W P   + G V+ Q   + 
Sbjct: 282 FFTSDNGAALISAPMQGGSNGPFLCGKQTTFEGGMRVPGIAWWPGHVAPGQVSYQLGSIM 341

Query: 371 DWLPTLLSAA 380
           D   T LS A
Sbjct: 342 DLFTTSLSLA 351


>gi|119713178|gb|ABL97246.1| sulfatase [uncultured marine bacterium EB0_50A10]
          Length = 544

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 126/407 (30%), Positives = 190/407 (46%), Gaps = 66/407 (16%)

Query: 30  YRTRIMAFAVLPLAFTLSMVFVDLVASSGP--------PHIIFILADDLGWNDVGFH--- 78
           Y+  I+ +A+  ++       +    S GP        P+II +LADD+G+ND+  H   
Sbjct: 22  YKVNILVWAIPKISNVTVQENIPTTWSKGPDTPVDDNRPNIILVLADDMGYNDISIHNGG 81

Query: 79  -GLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERG 136
                + T NIDALA SGI+  + Y     C PSR++IMTGK+P   G +   +    R 
Sbjct: 82  AADGTLQTKNIDALAKSGILFTRGYAANATCAPSRASIMTGKYPTRFGYEFTPIPAFGRT 141

Query: 137 -------------------------------GLPLSEKILPQYLKELGYRTRIVGKWHLG 165
                                          G+P  +  + + L++ GY T  +GKWHLG
Sbjct: 142 VLGWLAEEDNFELKQRIDREVVSNMPPFMEQGMPTEQITIAEVLRDAGYYTAHIGKWHLG 201

Query: 166 FYKKEYTPTFRGFESHLGYW------TGHQDY----FDHSAEEMKMWGLDMRRDLEPAWD 215
            ++    P  +GF+  LG          H D     FD   ++M +WG+          D
Sbjct: 202 -HEYGMDPMSQGFQDSLGLVGPLYLPEDHPDVVNAKFDTRIDKM-IWGMGQYSANFNGGD 259

Query: 216 LHG--KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
           L    KY TD +T EA+ +I N+  + P FLYL+H A H+     PLQA         H+
Sbjct: 260 LFAPDKYVTDYYTDEALKVIENNK-NRPFFLYLSHWAIHN-----PLQALRSDFEQMSHM 313

Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
                  ++ +++ LD SVGK++E L++  +   ++I+F SDNGGA     LN   N P 
Sbjct: 314 HGHNLQVYSGMINSLDRSVGKIIEKLKELDIYGKTLIIFTSDNGGANY-IELNDI-NKPY 371

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           RG K + ++GG+R   +I  P   + G  +E  VH  D  PT+L AA
Sbjct: 372 RGWKISFFDGGIRVPYIISWPDEINPGKKSENAVHHFDIFPTILKAA 418


>gi|341889947|gb|EGT45882.1| CBN-SUL-3 protein [Caenorhabditis brenneri]
          Length = 432

 Score =  159 bits (403), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 88/228 (38%), Positives = 132/228 (57%), Gaps = 23/228 (10%)

Query: 51  VDLVASSGPP----HIIFILADDLGWNDVGFHGLDQIPTPNIDALAY--SGIILKNYYTV 104
            D+V    PP    +++FI+ADDLG+ND+ +     + TPN+  LA+  +  +L N Y  
Sbjct: 20  TDVVNGQSPPTQKPNVLFIMADDLGFNDLDWKD-STLHTPNLRNLAFHKNTALLTNSYVN 78

Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
           QLCTP+RSA MTG +P   G Q  V    E  G+P     L + +++L Y T +VGKWHL
Sbjct: 79  QLCTPTRSAFMTGYYPFRVGTQAGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHL 138

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMK------MWGLDMRRDLE-----PA 213
           G+ KKE+ PT RGF+   G++     YF+HSA++        + GLD+  ++      P 
Sbjct: 139 GYCKKEFLPTNRGFDYFYGFYGPQTGYFNHSADQYHRELRRVVKGLDLFEEVGNGKSVPD 198

Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQ 261
           +  +G YSTD+FT  A+ +I NH+T +P F++L++ A H      PLQ
Sbjct: 199 FSQNGVYSTDLFTDVAMSVIDNHNTTKPFFMFLSYQAVH-----PPLQ 241


>gi|260062069|ref|YP_003195149.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
 gi|88783631|gb|EAR14802.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
          Length = 492

 Score =  159 bits (403), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 106/327 (32%), Positives = 160/327 (48%), Gaps = 35/327 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
           P+ I + ADDLG+ D+   G   I T N+D +A  G    N+Y    +CTPSR+ ++TG+
Sbjct: 50  PNFIIVFADDLGYGDLSSFGHPTIHTKNLDRMAAEGQKWTNFYVAASVCTPSRAGLLTGR 109

Query: 119 HPIHTGMQHN---VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
            P+  G+  N   V +     G+P SE  L + LK+ GY T +VGKWHLG +K+EY P  
Sbjct: 110 LPVRNGLTSNEIGVFFPDSHNGMPASEITLAEQLKKAGYATGMVGKWHLG-HKEEYLPPN 168

Query: 176 RGFESHLGY-----------WTGHQDYFDHSAEEMKMWGLD------MRRDLEPAWDLHG 218
            GF+ + G            +T +QDYF    E  +    +      +R   E    ++ 
Sbjct: 169 HGFDDYFGIPYSNDMDFTGQFTSYQDYFGRYTERYESLKTEEYNVPLIRGTEEIERPVNQ 228

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
              T  +  EAV  I  H  DEP F+YLAH+      P+ PL   D +           R
Sbjct: 229 NTITKRYNDEAVKWIREHK-DEPFFMYLAHSL-----PHVPLFTSDEFRGTS------AR 276

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
             +  ++ ++D  VG+++E LE   +  N+I+VF SDNG       ++  S   LR  K 
Sbjct: 277 GLYGDVVEEIDHGVGQIMELLEAEGLAENTIVVFTSDNGPWLPT-GISGGSAGLLREGKG 335

Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQ 365
           T WEGG+R   + W+P +    +V + 
Sbjct: 336 TTWEGGMREPTIFWAPGMLPAKVVMDM 362


>gi|427388367|ref|ZP_18884250.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
           12058]
 gi|425724950|gb|EKU87824.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
           12058]
          Length = 457

 Score =  159 bits (403), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 124/374 (33%), Positives = 188/374 (50%), Gaps = 40/374 (10%)

Query: 40  LPLAFTLSMVFVDLVA-SSGP-------PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           +PL F +  V +  VA +S P       P+I+ I+ DDLG  D+       + TPNID L
Sbjct: 9   IPLLFAVGGVAISSVAHASSPQKLDEQKPNILLIMVDDLGLGDLSCQYARDLQTPNIDKL 68

Query: 92  AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLY--GCERGGLPLSEKILPQY 148
              G+ L N+Y    + +PSR+ ++TG +P   G+   +        G L     +LPQ 
Sbjct: 69  FSDGVRLNNFYANSSVSSPSRAGLLTGCYPDMVGVPGVIRTETTASWGYLSPEAILLPQM 128

Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD-YFDHSAEEMKMWGLDMR 207
           +K+  Y T I+GKWHLG  ++   P  RGF+   G+     D Y+ H         L+  
Sbjct: 129 MKKAEYNTAIIGKWHLGL-EEPNLPNSRGFDFFHGFLGDMMDNYYTHLRRGHNYMRLN-E 186

Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
             +EP       ++TD+F++ A+D       +  P FLYLA+ A     P+ PLQ P  +
Sbjct: 187 EVIEPT-----GHATDLFSSWALDYFDTMRKSKVPFFLYLAYNA-----PHVPLQPPTEW 236

Query: 267 LNIHRHIE---DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
           L   R  E     KR+K  A++  LD ++G+V +ALE+   L N++I+F SDNGG A   
Sbjct: 237 LERVREREPGISEKRAKLVALIEHLDYNIGRVYKALEESGQLDNTLIIFCSDNGGDAGA- 295

Query: 324 NLNAASNWPLRGVKNTLWEGGVR-GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
               A+N P+RG K  ++EGG++    + W   L+ R I  +  V +SD  PTL    + 
Sbjct: 296 ---EANNGPVRGSKGDMYEGGIKVPCAVYWRNHLKPRSI--DDLVIMSDIFPTLCDLVS- 349

Query: 383 SDIPNYVNSTVENI 396
             IP  VN  ++ I
Sbjct: 350 --IP--VNHRIDGI 359


>gi|298706368|emb|CBJ29377.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 653

 Score =  159 bits (402), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 110/350 (31%), Positives = 167/350 (47%), Gaps = 50/350 (14%)

Query: 71  GWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL 130
           GW DVGFH      TPN+DA+   G+ L  +YT   CTPSR+ +MTG++    GMQ +VL
Sbjct: 48  GWKDVGFHDT-TFSTPNLDAMVAEGVELSTFYTAPTCTPSRAQLMTGRYSYRIGMQDSVL 106

Query: 131 YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQD 190
           +  E  G+PL+E  + + L+  GY T  VGKWHLG +  ++ P  RGF+   G  TG   
Sbjct: 107 HTTEPRGVPLTETFVGEKLQAAGYSTAAVGKWHLGMHMPQFLPVERGFDDFYGILTGGGG 166

Query: 191 YFDH---SAEEMKMWGLDMRRDLEPAWDLHGK-----------YSTDVFTAEAVDIIHNH 236
           ++ H   S E     G    R       + G            +ST+++T +A + +   
Sbjct: 167 HYTHMSVSPEFTPRGGSGKTRTFSGPNIVEGSEVSDDNQDTRTHSTELYTKKAAEYVEAM 226

Query: 237 ST--DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE---------------DFKRS 279
           S   D+P FLYL++ A H     +P++  D ++N  R  E               D+   
Sbjct: 227 SAKEDQPWFLYLSYQAIH-----DPIETDDSWIN-GRSCESISAENSSIGETDETDYDNR 280

Query: 280 KFA-AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
           + A  ++ ++D  +G +   LE      N++++F SDNGG  +    + + N P RG K 
Sbjct: 281 RIACGMVAQMDNGLGALRGLLENLGEWDNTVVIFFSDNGGLPS----HGSVNRPFRGGKG 336

Query: 339 TLWEGGVR----GAGLIWSPLLESRGIVAEQY---VHVSDWLPTLLSAAN 381
             WEGGV      +G   S  L   G+   +Y    HV+D   T+L  A 
Sbjct: 337 DYWEGGVHVPAFVSGGFVSAALRRTGVEPYRYGHLTHVTDVHATILGLAG 386


>gi|375143552|ref|YP_005005993.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
 gi|361057598|gb|AEV96589.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
          Length = 482

 Score =  159 bits (402), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 116/354 (32%), Positives = 172/354 (48%), Gaps = 51/354 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+IIFILADDLG+ D+G +G   I TPN+DA+A  G     +Y    +C PSRS+ +TG+
Sbjct: 35  PNIIFILADDLGYGDLGCYGQKLIQTPNLDAMAKQGTRFTQFYAGTAVCAPSRSSFLTGQ 94

Query: 119 HPIHTGMQHNVLYGCERGG---LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           H  HT ++ N   G +  G   +P S   + + LK+ GY T   GKW LG       P  
Sbjct: 95  HTGHTPIRGNK--GVQPEGQWPIPDSAVTIAEVLKKAGYATGDFGKWGLGPVASSGDPIK 152

Query: 176 RGFESHLGY---WTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH-GKYSTDVFTAEAVD 231
           +GF++  GY      H  Y DH      +W  D R D       H   YS D+   +A+ 
Sbjct: 153 QGFDAFYGYNCQTLAHDYYPDH------LWENDTRIDFAANTPEHPTDYSADLIHQKALQ 206

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL-------------------NIHRH 272
            I  H  + P F++L++   H+A     LQ PD  L                     +  
Sbjct: 207 FIDQHKGN-PFFVFLSYTLPHAA-----LQVPDDSLFEKYKKLLNEKPIPVPKWNGKNYA 260

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG------FNLN 326
            + + R+ +AA++ +LD  VG+V++ L++  +  N++++F SDNG    G      FN  
Sbjct: 261 PQAYPRAAYAAMVSRLDVYVGQVLQKLKEAGIDKNTLVIFSSDNGPHKEGGYDPDYFN-- 318

Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             SN PLRG+K  L+EGG+R   + W P     G  ++      D+LPT    A
Sbjct: 319 --SNGPLRGIKRDLYEGGMREPMIAWWPGKIKAGATSDYVGAFWDFLPTFAELA 370


>gi|171910115|ref|ZP_02925585.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
          Length = 460

 Score =  159 bits (401), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 112/325 (34%), Positives = 168/325 (51%), Gaps = 51/325 (15%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSA 113
           A+  PP+I+ I ADDLG+ D+G +G   I TP++D +A  G+   ++Y   ++CTPSR+A
Sbjct: 25  AADRPPNIVIIFADDLGYGDLGCYGSPTIATPHLDQMAAEGLRFTDFYVASEVCTPSRAA 84

Query: 114 IMTGKHPIHTGM--QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
           ++TG++P+ +GM  +  VL+    GGLP  E  LP+ LK  GY T  VGKWHLG ++   
Sbjct: 85  LLTGRYPVRSGMYGKRRVLFPNSTGGLPAGEITLPEALKARGYATAHVGKWHLGIHEGS- 143

Query: 172 TPTFRGFESHLGY-WTGHQDY--------FDHSAEEMKMWGLDMRRDLE----PAWDLHG 218
            P  +GF+   G  ++   D                +  W + + R+ E    PA  +H 
Sbjct: 144 RPLDQGFDQSFGLPYSNDMDARPDLPKGSTGSPTPPIDGWNVPLLRNGEVVEKPADQVH- 202

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
              T  +T EAV  I    + +P FLY+AH+      P+ PL A           E+FK 
Sbjct: 203 --LTGHYTEEAVKFIQQKKS-QPFFLYMAHSF-----PHVPLFA----------SENFKG 244

Query: 279 SKFAAI----LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG-----GAAAGFNLNAAS 329
              A I    + +LD SVG+V++ L +  +  N+++ F SDNG     GA  G      S
Sbjct: 245 RSRAGIYGDTVEELDWSVGQVLQTLRKEGLSENTLVFFTSDNGPWLIMGAQGG------S 298

Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSP 354
             PL+  K + WEGG+R  G+ W P
Sbjct: 299 AGPLKDGKGSHWEGGMRVPGIAWWP 323


>gi|423214667|ref|ZP_17201195.1| hypothetical protein HMPREF1074_02727 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692573|gb|EIY85810.1| hypothetical protein HMPREF1074_02727 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 459

 Score =  159 bits (401), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 107/337 (31%), Positives = 178/337 (52%), Gaps = 34/337 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+ + I+ADD+G+ DVG +G + I TPNID +A  G++  ++++   + +P+R  ++TG+
Sbjct: 29  PNFVIIVADDMGYGDVGIYGNEYIRTPNIDQMARKGMMFTDFHSNGSVSSPTRCGLLTGR 88

Query: 119 HPIHTGMQHNVLY----GCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +    G++  +L       +  GLP  E    + + + GYRT ++GKWHLG+ +K +   
Sbjct: 89  YQQRAGLERVLLVPRNDKDKEVGLPSEEITFAKVMGDNGYRTALIGKWHLGYLQKHHPMN 148

Query: 175 FRGFESHLGYWTGHQDYFDHSAE--EMKMW-GLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
           F GF+  +G+ +G+ DY  H     +M  W GL+++       D+ G Y+T + T  + D
Sbjct: 149 F-GFQKFVGFKSGNVDYQSHRNRYGDMDWWDGLEVK-------DMPG-YTTTLLTTLSED 199

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAP-DHYLNIHRHIEDFKRSK------FAAI 284
            I   + D+P  LY+AHAA HS     P+Q P +  +      E  K S       +  +
Sbjct: 200 YI-KENKDKPFCLYIAHAAPHS-----PMQGPHEKAIRTEATPEGDKNSDRSNKEIYKDM 253

Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
           + +LD SVG+++E L++ ++  N+ +VF SDNG        N  S    +G K + WEG 
Sbjct: 254 VEELDWSVGRILETLKKYKLDENTFVVFFSDNGPVIN----NGGSAGGYKGAKGSPWEGA 309

Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            R  G+ + P     G   EQ V   D  PT+L  A+
Sbjct: 310 HRVPGICYMPGTIKEGTTCEQTVMSFDLFPTMLDMAD 346


>gi|72161757|ref|YP_289414.1| N-acetylgalactosamine-6-sulfatase [Thermobifida fusca YX]
 gi|71915489|gb|AAZ55391.1| n-acetylgalactosamine-6-sulfate sulfatase [Thermobifida fusca YX]
          Length = 471

 Score =  159 bits (401), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 109/348 (31%), Positives = 170/348 (48%), Gaps = 36/348 (10%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTG 117
           PP+I+FILADDLGW D+G +G   I TPN+D LA  GI     Y     C+ +R ++ TG
Sbjct: 20  PPNILFILADDLGWADLGCYGSTTIRTPNLDQLAAQGIRFTHGYAGSPWCSSTRISLYTG 79

Query: 118 KHP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           ++P  +  G++  ++      G+P     L   L E GY T + GKWH G+    Y+P  
Sbjct: 80  RYPGRLQAGLEEPLVTRSPENGIPEGHPTLSSLLVEAGYATAMFGKWHCGWLPW-YSPLR 138

Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
            GFE+  G + G  DYF+H     K    D+     P  ++   Y T++ +  A + I  
Sbjct: 139 IGFETFFGNFDGALDYFEHVDTLGKA---DLYEGETPVEEV--GYYTEIISERAAEYITA 193

Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHY---LNIHR------------HIEDFKRSK 280
           H  + P ++ L + A     P+ P + PD +     I R            H++    +K
Sbjct: 194 HR-NRPFYVQLNYTA-----PHWPWEGPDDHEVGQEIRRRYQQRWEHSPLMHLDGGSIAK 247

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
           +  ++  +D  +G+V+ AL++     N+I+VF SDNGG         + NWP  G K  L
Sbjct: 248 YGELVEAMDAGIGQVLAALDRAGAADNTIVVFSSDNGGE------RWSKNWPFVGEKGDL 301

Query: 341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
            EGG+R   ++  P   +   V++  V   DW  TLL+AA     P++
Sbjct: 302 TEGGIRVPLIVAWPEAIAGNQVSDHPVITMDWTATLLAAAGTEPHPDW 349


>gi|343087313|ref|YP_004776608.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342355847|gb|AEL28377.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 477

 Score =  159 bits (401), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 110/318 (34%), Positives = 171/318 (53%), Gaps = 30/318 (9%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
           SS PP+I+ I  D++G+ D  F   D  I TPN++ LA  G +L N+YT    CT SR++
Sbjct: 34  SSAPPNILIITVDNIGYGDFKFLNKDSPIITPNLNKLAKQGAVLTNFYTGSPTCTASRAS 93

Query: 114 IMTGKHPIHTGMQHNV--LYGCERGGLPLSEKILPQYLKEL--GYRTRIVGKWHLGFYKK 169
           ++TG+ P    + + +  L G    GLP +E I+P+ LK    GY+T   GKW++GF   
Sbjct: 94  LLTGRIPQRNKLDYQLGGLAGNYGIGLPQTEIIIPKVLKSSPHGYQTGAFGKWNIGFAPG 153

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL---EPAWDLHGKYSTDVFT 226
              PT RGF+  LG  +G+ D+F H            R DL     + + HG+YSTD+F 
Sbjct: 154 S-RPTERGFDEFLGIASGNVDHFSHVYAG--------RHDLYHNTTSINRHGEYSTDLFA 204

Query: 227 AEAVDIIHNHSTD-EPLFLYLAHAATH-----SANPYEP--LQAPDHYLNIHRHIEDFK- 277
             A+D I+  +++ +P  +YL   A H     +  P E    +APD+    +    D + 
Sbjct: 205 NAAIDFINEKTSENKPWMVYLPFDAPHHPGDRNIAPGEENIWKAPDYAFQPYGLSPDERD 264

Query: 278 -RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
            + +F AI+  +D +VG+++++L++  +  N+ + F SDNG       L+  SN PL G 
Sbjct: 265 PKKRFNAIVTAIDMAVGRLMDSLDELGIADNTFVFFYSDNGAFYP--YLDIQSNAPLNGA 322

Query: 337 KNTLWEGGVRGAGLIWSP 354
             TLWEGG+R   L+  P
Sbjct: 323 GVTLWEGGIRVPALVRWP 340


>gi|395508489|ref|XP_003758543.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Sarcophilus harrisii]
          Length = 492

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 103/334 (30%), Positives = 163/334 (48%), Gaps = 42/334 (12%)

Query: 70  LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
           +GW D+G  G     TP++D +A  G++  N+YT   LC+PSR+A++TG+ PI  G    
Sbjct: 33  MGWGDLGVFGEPSKETPHLDQMAAEGMLFPNFYTANPLCSPSRAALLTGRLPIRNGFYTT 92

Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
             +           GG+P SE +LP+ LK+ GY  +IVGKWHLG ++ ++ P   GF+  
Sbjct: 93  NAHARNAYTPQEIVGGIPDSEFLLPELLKKAGYVNKIVGKWHLG-HRPQFHPLKHGFDEW 151

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
            G    H   +D+ A       + + R+    W++ G++               T ++  
Sbjct: 152 FGAPNCHFGPYDNKARP----NIPVYRN----WEMVGRFFEDFPINLKTGEANLTQIYLQ 203

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
           EAVD I   +  +P FLY A  ATH+     P+ A   +L         +R ++   + +
Sbjct: 204 EAVDFIKQQAHQQPFFLYWAVDATHA-----PVYASKSFLGTS------QRGRYGDAVRE 252

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +D+ VGK+++ L+   +  N+ + F SDNG A     +   SN P    K T +EGG+R 
Sbjct: 253 IDDCVGKILKLLQDLGIQENTFVFFTSDNGAALISAPIQGGSNGPFLCGKQTTFEGGMRE 312

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             + W P     G V+ Q  +V D   T LS A 
Sbjct: 313 PAIAWWPGHIPAGRVSHQLGNVMDLFTTSLSLAG 346


>gi|399033016|ref|ZP_10732099.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
 gi|398068627|gb|EJL60037.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
          Length = 460

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 110/353 (31%), Positives = 179/353 (50%), Gaps = 47/353 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II I+ADD GWNDV +HG   I TPNID LA +G+ L  +Y    C+PSR+++ TG+ 
Sbjct: 35  PNIILIIADDAGWNDVEYHG-SVIQTPNIDFLAKNGVELNRFYANPTCSPSRASLFTGRP 93

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
               G+   +     +  LP S   LP+ L +  Y+T ++GKWHLG  ++   P   GF+
Sbjct: 94  ASRMGIVAPI-SDKSQFKLPDSIATLPKLLHQNNYQTALIGKWHLGL-QQSSGPKAYGFD 151

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY------STDVFTAEAVD-I 232
              G+  G  D + H           + ++ + +W  +G++      +TD+ T EA+  +
Sbjct: 152 YSYGFLHGQIDQYTH-----------LYKNGDKSWYRNGEFIDEQGHATDLITNEAIHWL 200

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAP----DHYLNIHRHIEDFKRSKFAAILHKL 288
                +++  FL +A++A     P+ PLQ      D Y+N    I+D  R  FAA +  L
Sbjct: 201 SEKRDSNKNFFLEVAYSA-----PHFPLQEEQKWKDPYMN---SIKDSSRRDFAAAMSHL 252

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGG-----------AAAGFNLNAASNWPLRGVK 337
           D S+G ++E L+Q  +  N++++F+SDNG                 N     N PLR  K
Sbjct: 253 DYSIGILLEKLKQENLDKNTLVIFMSDNGAMENWDSRNEYNGVHPSNTVLGDNSPLRDWK 312

Query: 338 NTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYV 389
            + +EGG+R   +  W   L  +   +E Y+ + D LP++L  A   ++P  +
Sbjct: 313 TSNYEGGIRVPCIFYWKDHL--KNYKSESYISIIDLLPSILFLAGDKNLPQSI 363


>gi|430741911|ref|YP_007201040.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430013631|gb|AGA25345.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 469

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 116/340 (34%), Positives = 166/340 (48%), Gaps = 39/340 (11%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQL-CTPSRSAIMTGK 118
           P+I+FILADDLG+ DV  +G     TP+ID LA  G+   + YT    C P+R+A+M+G+
Sbjct: 32  PNIVFILADDLGFTDVACYGSRYYETPHIDKLAAQGLKFTDGYTCGPNCQPTRAALMSGQ 91

Query: 119 HPIHTGM-----------QHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
           +   TG+           +   L   +   GLPL +  + Q L+  GY T + GKWHLG 
Sbjct: 92  YGPRTGIYTVGSIDRFDWKSRPLRPVDNAQGLPLGKVTIAQTLQAAGYATGMFGKWHLG- 150

Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT 226
            + +Y P  RGF+  +     H D+   +A E+            PA    G Y  D  T
Sbjct: 151 QQGDYHPRRRGFDEAIVSMGQHFDF--TTAPEVTY----------PA----GTYLADFLT 194

Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
            +AVD I  H  DEP FLYL H   HS  PY+  +                   +AA+L 
Sbjct: 195 DKAVDFITRHK-DEPFFLYLPHFGVHS--PYQAKKDLVKKFAEKAAAGGHHDPTYAAMLA 251

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA------GFNLNAASNWPLRGVKNTL 340
            +DESVG+VV+ L++  +  +++++F SDNGG         G       N PLRG K  L
Sbjct: 252 SVDESVGRVVKTLDELGLSDDTLVIFSSDNGGVGGYEREGIGKAGAVTDNAPLRGGKGML 311

Query: 341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           +EGG+R   L   P     G V ++ ++  D  PTL+  A
Sbjct: 312 YEGGIRAPYLFRWPGKIPAGTVCDRAINSVDLYPTLVELA 351


>gi|332666885|ref|YP_004449673.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
           1100]
 gi|332335699|gb|AEE52800.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
           1100]
          Length = 443

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 112/360 (31%), Positives = 178/360 (49%), Gaps = 26/360 (7%)

Query: 30  YRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNID 89
           Y T+      LPL     MV+  + A    P+II+I+ DDLG+ D+  +G     TPNID
Sbjct: 3   YLTQQYVKKCLPLFVLTFMVYSTVHAQH--PNIIYIMTDDLGYGDLSGYGRKDFLTPNID 60

Query: 90  ALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHT--GMQHNVLYGCERG---GLPLSEK 143
            LA  GI   N Y+   LCTP+R+A MTG++P  T  G+    L   +R    GL  +  
Sbjct: 61  KLAAQGIKFVNAYSAAPLCTPTRTAFMTGRYPARTPVGLMEP-LTPSKRDSTVGLTAAFP 119

Query: 144 ILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWG 203
            +   ++  GY T ++GKWHLGF  +  +P   GF+   G  +G  DY  H         
Sbjct: 120 SVATLMRASGYETALIGKWHLGFLPQN-SPVKNGFDYFFGIHSGAADYISHKTGPAGRRI 178

Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
            D+  + +  +     Y TD+FT +AV  +     ++P FL L + A H      P Q P
Sbjct: 179 HDLYENDQAVYP--EGYLTDLFTQKAVTFL-KQKHNKPFFLTLTYNAAH-----WPWQGP 230

Query: 264 DH--YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA 321
           +   Y++          + +AA++++LDE +G++++ L++ ++ + +I++F +DNGG   
Sbjct: 231 NDKPYVDSVDLRIGGSAAVYAAMMNRLDEGIGEIIKTLDEEQLSNQTIVIFTNDNGGE-- 288

Query: 322 GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
                 + N  L   K  LWEGG+R   ++  P   + G   +Q     DW  T+LSA  
Sbjct: 289 ----KYSDNGGLAKAKGALWEGGIRVPAIVKWPGKITAGSSTQQVAITMDWTKTILSAGG 344


>gi|410030097|ref|ZP_11279927.1| sulfatase [Marinilabilia sp. AK2]
          Length = 476

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 109/327 (33%), Positives = 172/327 (52%), Gaps = 32/327 (9%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+II I  DD G++DVG  G   I TP++D LA  G+   N+Y  Q +C+ SR+A++TG 
Sbjct: 30  PNIIIIFTDDQGYHDVGVFGASDIATPHLDQLAAEGVQFTNFYVAQAVCSASRAALLTGV 89

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +    G+ H  L    R GL   E  +   LK LGY T + GKWHLG Y  E+ PT +GF
Sbjct: 90  YSNRLGI-HGALDHMSRYGLHPEEATIADILKPLGYATAMFGKWHLGHY-PEFLPTNQGF 147

Query: 179 ESHLG------YWTGH---QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           + +LG       W  H   +DY+      + ++  D  + ++  W+    ++T +FT ++
Sbjct: 148 DEYLGIPYSNDMWPNHPQTKDYY----PPLPLYQND--KVIDTIWNDQSMFTT-LFTEKS 200

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
           +D I  +  D P FLYLAH       P+ PL   D +           R  +  ++ ++D
Sbjct: 201 IDFIERNK-DRPFFLYLAHPM-----PHVPLFVSDKFKG------KSNRGLYGDVIMEID 248

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
            SVG+++E L+   +  N+II+F+SDNG   + ++ ++ S +PLR  K T W+GGV+   
Sbjct: 249 WSVGQIMETLKHYDLDENTIIIFLSDNGPWLS-YSGHSGSAYPLREGKGTSWDGGVKVPV 307

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTL 376
           ++      + G+V        D LPTL
Sbjct: 308 IMRWKGTWASGLVQPNPAMSIDILPTL 334


>gi|410611985|ref|ZP_11323071.1| N-acetylgalactosamine-6-sulfatase [Glaciecola psychrophila 170]
 gi|410168398|dbj|GAC36960.1| N-acetylgalactosamine-6-sulfatase [Glaciecola psychrophila 170]
          Length = 508

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 106/335 (31%), Positives = 168/335 (50%), Gaps = 50/335 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+I+FI ADDLG+ D+G +G   I TPNID +A  G     +Y    +CTPSR+ ++TG+
Sbjct: 50  PNIVFIFADDLGYGDIGAYGSTTINTPNIDKMAAQGAKFDEFYAASPVCTPSRAGLLTGR 109

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +PI  G+ HNV +     G+   E  + + LK  GY T +VGKWHLG + ++Y P  +GF
Sbjct: 110 YPIRQGI-HNVFFPESFQGMDPEEITIAEVLKGAGYATGLVGKWHLG-HHEQYMPWNQGF 167

Query: 179 ESHLGYW----TGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           +   G       G   YF++   + +              ++  +Y T  +T +A+  I 
Sbjct: 168 DEFFGLPYSNDMGGLYYFNNKDIDFE--------------EVDQRYMTKTYTDQALQFID 213

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
            H  ++P FLYLAH       P+ P+ A   +              +  ++ +LD SVG+
Sbjct: 214 KHQ-EQPFFLYLAHNM-----PHVPIYASPEFEGQS------AGGAYGDVVEELDWSVGQ 261

Query: 295 VVEALEQRRMLSNSIIVFVSDNG--------GAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
           V++ L++  +  N++++F SDNG        G +AG         PLR  K   +EGG+R
Sbjct: 262 VLKRLDELNLSENTLVIFTSDNGPWLLMGDEGGSAG---------PLREGKQFTFEGGMR 312

Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
              + + P   S+G   +    + DWLPT  + AN
Sbjct: 313 VPTVAYWPGTISQGTKPQGLATMMDWLPTFANLAN 347


>gi|348516447|ref|XP_003445750.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Oreochromis
           niloticus]
          Length = 525

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 114/362 (31%), Positives = 172/362 (47%), Gaps = 35/362 (9%)

Query: 42  LAFTLSMVFVDLVASS-------GP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           +A TL   F  ++  S        P P+II +L DD+GW D+G  G     TPN+DA+A 
Sbjct: 5   VALTLITFFGAIICCSWTKSIPNAPTPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAA 64

Query: 94  SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKIL 145
            G++L N+YT   LC+PSR+A++TG+ PI  G      +           GG+   E +L
Sbjct: 65  EGMLLPNFYTANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGISKDEILL 124

Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGL 204
           PQ LK  GY ++IVGKWHLG ++ +Y P   GF+   G    H   Y D S   + ++  
Sbjct: 125 PQLLKTKGYVSKIVGKWHLG-HRPQYLPLKNGFDEWFGSPNCHFGPYNDQSKPNIPVYNN 183

Query: 205 D-----MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYE 258
                    D +   +      T ++  E +D I   +   +P FLY A  ATH+     
Sbjct: 184 SEMLGRFYEDFKIDRNTGESNLTQIYLMEGLDFILRQTKAQQPFFLYWAVDATHA----- 238

Query: 259 PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
           P+ A   +L         +R ++   + +LD S+G++++ L    + +N+ + F SDNG 
Sbjct: 239 PVYASKRFLG------KSQRGRYGDAVMELDYSIGQILKWLRTLGIDNNTFVFFTSDNGA 292

Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
           A       + SN P    K T +EGG+R   + W P     G V  Q  +V D   T L 
Sbjct: 293 AVMSGPNESGSNGPFLCGKETTFEGGMREPAIAWWPGHIKEGSVNFQLANVMDLFTTSLV 352

Query: 379 AA 380
            A
Sbjct: 353 LA 354


>gi|432851911|ref|XP_004067103.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 2
           [Oryzias latipes]
          Length = 523

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 116/374 (31%), Positives = 175/374 (46%), Gaps = 53/374 (14%)

Query: 38  AVLPLAFTLSMVFVDLVASSGP----PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           A +P    LS      ++ + P    P+II +L DD+GW D+G  G     TPN+DA+A 
Sbjct: 5   ASVPFLSVLSATICCFLSENTPNTSSPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAA 64

Query: 94  SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKIL 145
            GI+  ++YT   LC+PSR+A++TG+ PI  G         N     E  GG+   E +L
Sbjct: 65  QGILFPDFYTANPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGISKDEILL 124

Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
           PQ LKE GY  +IVGKWHLG ++ +Y P   GF+   G    H   ++++          
Sbjct: 125 PQLLKEKGYVNKIVGKWHLG-HRPQYLPLENGFDEWFGAPNCHFGPYNNTV--------- 174

Query: 206 MRRDLEPAW---DLHGKYS--------------TDVFTAEAVDIIHNHS-TDEPLFLYLA 247
             R   P +   ++ G+Y               T ++    +D I   +    P FLY A
Sbjct: 175 --RPNIPVYNNSEMLGRYFEEFKIDKKTGESNLTQMYLEAGLDFISRQAEAKRPFFLYWA 232

Query: 248 HAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSN 307
             ATHS     P+ A   +L         +R ++   + +LD SVG+++  L++  + +N
Sbjct: 233 ADATHS-----PVYASKPFLG------KSQRGRYGDAVMELDYSVGRILTKLQRLGIDNN 281

Query: 308 SIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYV 367
           + + F SDNG A         SN P    K T +EGG+R   + W P     G V+ Q  
Sbjct: 282 TFVFFTSDNGAALMSGPNEGGSNGPFLCGKETTFEGGMREPAIAWWPGRIKGGTVSFQPA 341

Query: 368 HVSDWLPTLLSAAN 381
           +V D   T L+ A 
Sbjct: 342 NVMDLFTTSLALAG 355


>gi|405970955|gb|EKC35816.1| N-acetylgalactosamine-6-sulfatase [Crassostrea gigas]
          Length = 511

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 106/344 (30%), Positives = 172/344 (50%), Gaps = 45/344 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+ I +L DD+GW D+G  G     TP +D +A  G++  ++Y+   LC+PSR+A++TG+
Sbjct: 20  PNFILMLMDDMGWGDLGVFGEPNKETPYLDQMAAEGMLFPDFYSANPLCSPSRAALLTGR 79

Query: 119 HPIHTGM------------QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
            PI  G               N++     GG+P  E +LP+ L++ GY++++VGKWHLG 
Sbjct: 80  LPIRNGFYTTNGHARNAYTPQNIV-----GGIPDEEILLPELLQKAGYKSKLVGKWHLG- 133

Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS----- 221
           ++ +Y P   GF+   G    H   +D+         + + R+ E A   +  +      
Sbjct: 134 HQAKYLPLKHGFDEWFGAPNCHFGPYDNVHTP----NIPVYRNEEMAGRYYQDFKIEKNG 189

Query: 222 ----TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
               T ++T EAV+ I   H+  +P FLY A  ATH     EPL A   +L         
Sbjct: 190 ESNLTQLYTKEAVEFITRMHNKSKPFFLYWAVDATH-----EPLYASKPFLGTSN----- 239

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
            R  +  ++ +LD +VGK++ +L   ++ +N+++ F SDNGGA      +  SN P    
Sbjct: 240 -RGLYGDVVRELDSAVGKILASLRSLKIDNNTLVFFSSDNGGATYA-KQHGGSNGPFLCG 297

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           K T +EGG+R   + W P    +G V+ Q   + D   T +  A
Sbjct: 298 KETTFEGGMREPTIAWWPGHIKKGQVSHQVGSLMDLYTTFVDFA 341


>gi|432342876|ref|ZP_19592103.1| arylsulfatase [Rhodococcus wratislaviensis IFP 2016]
 gi|430772100|gb|ELB87901.1| arylsulfatase [Rhodococcus wratislaviensis IFP 2016]
          Length = 459

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 110/346 (31%), Positives = 161/346 (46%), Gaps = 37/346 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+++FIL D++GW D G +G    PTP ID LA  GI   +Y     CTP+RSAIMTG+H
Sbjct: 7   PNVVFILVDNVGWGDFGVYG-GTTPTPRIDKLASEGIRFNDYNVEAQCTPTRSAIMTGRH 65

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
           P+ +G       G    G+   E  +   L E GY T + GKWHLG  +    PT +GF+
Sbjct: 66  PVRSGTYKLSWPGEGMSGMAPWEYTIADLLSEAGYATALYGKWHLGNTEGR-LPTDQGFD 124

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD-------- 231
                W G +D +D +      W L     L       G+       A  +D        
Sbjct: 125 E----WWGIKDSWDEAG--YTAWPLYQELGLPAPMIWEGRKGEQSTPAMPLDLDVRPIVD 178

Query: 232 -----------IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
                      I  N +  +P F+YL ++  H      P+ A  +Y N        +   
Sbjct: 179 EKYLVPKTVEFIKKNAAAKKPFFVYLGYSQMHP-----PMIANPNYANTSTA----RSGV 229

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG-VKNT 339
           +A  L ++D+ VG++V+A++   +  N+IIV  SDN G  A  ++   SN P RG   NT
Sbjct: 230 YADCLAEMDDRVGQIVDAVKAAGIEDNTIIVLSSDNAGGGALPHVGPGSNGPWRGNFFNT 289

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
            +EG +R + +I  P     G+V  Q +   DWLPTL      S++
Sbjct: 290 PFEGSMRVSAMIRWPGAIPAGVVTAQTLAAVDWLPTLAGMVGASEL 335


>gi|432851909|ref|XP_004067102.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 1
           [Oryzias latipes]
          Length = 525

 Score =  158 bits (399), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 116/374 (31%), Positives = 175/374 (46%), Gaps = 53/374 (14%)

Query: 38  AVLPLAFTLSMVFVDLVASSGP----PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           A +P    LS      ++ + P    P+II +L DD+GW D+G  G     TPN+DA+A 
Sbjct: 5   ASVPFLSVLSATICCFLSENTPNTSSPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAA 64

Query: 94  SGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKIL 145
            GI+  ++YT   LC+PSR+A++TG+ PI  G         N     E  GG+   E +L
Sbjct: 65  QGILFPDFYTANPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGISKDEILL 124

Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
           PQ LKE GY  +IVGKWHLG ++ +Y P   GF+   G    H   ++++          
Sbjct: 125 PQLLKEKGYVNKIVGKWHLG-HRPQYLPLENGFDEWFGAPNCHFGPYNNTV--------- 174

Query: 206 MRRDLEPAW---DLHGKYS--------------TDVFTAEAVDIIHNHS-TDEPLFLYLA 247
             R   P +   ++ G+Y               T ++    +D I   +    P FLY A
Sbjct: 175 --RPNIPVYNNSEMLGRYFEEFKIDKKTGESNLTQMYLEAGLDFISRQAEAKRPFFLYWA 232

Query: 248 HAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSN 307
             ATHS     P+ A   +L         +R ++   + +LD SVG+++  L++  + +N
Sbjct: 233 ADATHS-----PVYASKPFLG------KSQRGRYGDAVMELDYSVGRILTKLQRLGIDNN 281

Query: 308 SIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYV 367
           + + F SDNG A         SN P    K T +EGG+R   + W P     G V+ Q  
Sbjct: 282 TFVFFTSDNGAALMSGPNEGGSNGPFLCGKETTFEGGMREPAIAWWPGRIKGGTVSFQPA 341

Query: 368 HVSDWLPTLLSAAN 381
           +V D   T L+ A 
Sbjct: 342 NVMDLFTTSLALAG 355


>gi|406661802|ref|ZP_11069914.1| Arylsulfatase [Cecembia lonarensis LW9]
 gi|405554347|gb|EKB49446.1| Arylsulfatase [Cecembia lonarensis LW9]
          Length = 486

 Score =  158 bits (399), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 120/357 (33%), Positives = 177/357 (49%), Gaps = 56/357 (15%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+IIFILADDLG+ D+GF G   I TPN+D LA  GI   N+Y    +C PSRS +MTG 
Sbjct: 34  PNIIFILADDLGYGDLGFLGQKIIETPNLDRLAAEGIFFSNHYAGATVCAPSRSTLMTGL 93

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKIL--PQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           H  HT ++ N     E G  PL + IL  P+  KE GY T   GKW LGF   E  P  +
Sbjct: 94  HTGHTPVRGNFEIQPE-GQYPLPDTILTIPRLFKEAGYATGAFGKWGLGFVGTEGDPNHQ 152

Query: 177 GFESHLGY---WTGHQ---DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           GF+   GY      H+   +Y   +AE++ + G D        W     Y+ D+   E++
Sbjct: 153 GFDQFFGYNCQRIAHRYYPEYLWDNAEKVFLEGND--------WTQKITYAADIIHKESL 204

Query: 231 DIIHNHSTDEPLFLYLAHAATHS----------------ANPYEPLQAP------DHYLN 268
             I +++ + P F+++     H+                  P EP  AP      D  +N
Sbjct: 205 KFIRDNANN-PFFMFVTSVMPHAELAVPDGELLAYYREKIGPEEPHIAPRGSDYGDEPIN 263

Query: 269 IHRHIED-FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--- 324
           I  +  + + ++ +AA++  LD  VG++++ L++  +   +II+F SDNG    G N   
Sbjct: 264 IPGYQSNPYPKASYAAMVALLDRQVGEIMDKLDELGLTEQTIIIFTSDNGPHQEGGNDPD 323

Query: 325 -LNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVS---DWLPTL 376
             N  SN   RG K  L+EGG+R   ++ W  +++       Q  HVS   D+LPT 
Sbjct: 324 FFN--SNGIFRGYKRDLYEGGIRVPMILKWPGVIQG----GAQSGHVSAFWDFLPTF 374


>gi|346225028|ref|ZP_08846170.1| sulfatase [Anaerophaga thermohalophila DSM 12881]
 gi|346226998|ref|ZP_08848140.1| sulfatase [Anaerophaga thermohalophila DSM 12881]
          Length = 488

 Score =  157 bits (398), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 114/341 (33%), Positives = 169/341 (49%), Gaps = 43/341 (12%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+ I I ADDLG+ D+G  G   I TPN+D +A+ G    N+Y    +CTPSR+ ++TG+
Sbjct: 36  PNFIVIFADDLGYGDLGIFGHPTINTPNLDRMAFEGQKWTNFYVAAPVCTPSRAGLLTGR 95

Query: 119 HPIHTGM---QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
            PI +GM   +  VL+    GGLP SE  + + LKE GY+T  +GKWHLG +K  + PT 
Sbjct: 96  LPIRSGMCSDKRRVLFPDSNGGLPQSEITIARALKENGYQTAAIGKWHLG-HKSPFLPTD 154

Query: 176 RGFESHLGY----------WTGHQDYFDHS---AEEMKMWGLD---MRRDLEPAWDLHGK 219
            GF+S+ G            TG  +Y       AEE      +   +R D         +
Sbjct: 155 HGFDSYFGIPYSNDMDRVPGTGTGNYITSQIKLAEEENFHAYNVPILRNDKIIERPADQR 214

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
             T  +T EAV  I +   D P F+YLAH   H              + + R  E+FK  
Sbjct: 215 TITKRYTEEAVAKIKSMK-DGPFFIYLAHNLPH--------------IPLFRS-EEFKDV 258

Query: 280 KFAA----ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
             A     ++ ++D SVG++++ LE+  +  N+++VF SDNG     F  +  +   LRG
Sbjct: 259 SLAGFYGDVIEEIDWSVGQILKTLEEEGLDENTLVVFTSDNGPWHI-FETHGGTAGLLRG 317

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
            K   +EGG+R   + W P    +G+V +    + D  PT 
Sbjct: 318 AKGGTFEGGMREPTIFWWPGKLKQGVVMDMATTM-DLFPTF 357


>gi|32475083|ref|NP_868077.1| N-acetylgalactosamine-6-sulfate sulfatase [Rhodopirellula baltica
           SH 1]
 gi|32445623|emb|CAD75624.1| N-acetylgalactosamine-6-sulfate sulfatase [Rhodopirellula baltica
           SH 1]
          Length = 484

 Score =  157 bits (398), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 112/345 (32%), Positives = 182/345 (52%), Gaps = 40/345 (11%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
           P+I+ ILADDLG+ D+G +G D+  TP +D LA  G+   + Y     C+P+R+A++TG+
Sbjct: 39  PNIVLILADDLGYGDLGCYGNDEQATPVLDRLATQGVRWTQAYANGPECSPTRAALLTGR 98

Query: 119 HPIHTGMQH------NV--------LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
           +  H G         NV        L+     GLP +   L + L  +GY T + GKWHL
Sbjct: 99  YQQHVGGLECAIGVGNVGRYDDAIRLHLVNELGLPANRPTLAKRLSSVGYETALFGKWHL 158

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTD 223
           G Y+ +++P   GF+  L    G  DY+ H  + +  + L           + G+ Y TD
Sbjct: 159 G-YEAKFSPMMHGFDEALYCIGGAMDYY-HYLDSVATYNL-----FHNGRPISGEGYFTD 211

Query: 224 VFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYE-PLQAPDHYLNIHRHIEDFKRSK- 280
             T +AV  I + ++ D+P FLYL + A H+  PY+ P ++P   L I   +  +K++  
Sbjct: 212 TITDQAVRFIGDRNANDKPFFLYLPYTAPHT--PYQAPGESPVDPLPIDSPL--WKQNAD 267

Query: 281 ----FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
               + A++  +DE +GKV+ A+E+ +M   ++++F SDNGG +      A+ N PLRG 
Sbjct: 268 PPGVYRAMVRHMDEGIGKVLHAIEESKMTDRTLVIFASDNGGTS------ASRNEPLRGF 321

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           K   +EGG+R   +   P     G+V++Q     D   ++L+AA 
Sbjct: 322 KGQAFEGGIRVPLIARWPGHLPEGVVSDQVTITFDLTASMLAAAG 366


>gi|383780433|ref|YP_005464999.1| putative sulfatase [Actinoplanes missouriensis 431]
 gi|381373665|dbj|BAL90483.1| putative sulfatase [Actinoplanes missouriensis 431]
          Length = 464

 Score =  157 bits (398), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 117/364 (32%), Positives = 170/364 (46%), Gaps = 34/364 (9%)

Query: 43  AFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNY 101
           A T    F     SS  P+I+FILADDLGW D+  +G  +I TPN+D LA SG+   + Y
Sbjct: 31  AVTKETPFRARPGSSKRPNILFILADDLGWADLSSYGSPEIRTPNLDRLARSGVRFTQGY 90

Query: 102 YTVQLCTPSRSAIMTGKHPIHT--GMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
               +C+P+R A+ TG+ P  T  G+   +    ER G+P +   L   LK+ GY T ++
Sbjct: 91  SAGAVCSPTRVALYTGRLPGRTPGGLPEPIGSVNERDGIPATHPTLASLLKKTGYATALI 150

Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
           GKWH GF    ++P   G++   G ++G  DY+   +   K    D+        D  G+
Sbjct: 151 GKWHGGFLPN-FSPLRTGWDYFFGNYSGGVDYYSKYSHTGKY---DLFEGETEVPD-DGR 205

Query: 220 YSTDVFTAEAVD-IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
           Y TDV   +A D I       +P  L L     +  +P+ P +AP             K 
Sbjct: 206 YYTDVIADKAEDWITKAKKQGKPWLLNL-----NFTSPHWPWEAPGDRKISEEITAKIKA 260

Query: 279 SKFAAILH--------------KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN 324
               AI H               LD  +G+V+ AL +     N+++VF SDNGG    + 
Sbjct: 261 GNTQAIFHNDGGSLETYTKMVENLDARIGQVLNALRRSGQEENTLVVFASDNGGERFSY- 319

Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
                NWPL G K +L EGG+R   ++  P     G V+ + V   DW  TLL  A  + 
Sbjct: 320 -----NWPLTGNKGSLSEGGIRVPTILRWPARIRSGQVSHEPVVTHDWTATLLEIAGAAP 374

Query: 385 IPNY 388
            P++
Sbjct: 375 SPDH 378


>gi|406661522|ref|ZP_11069640.1| Arylsulfatase [Cecembia lonarensis LW9]
 gi|405554671|gb|EKB49747.1| Arylsulfatase [Cecembia lonarensis LW9]
          Length = 477

 Score =  157 bits (397), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 108/328 (32%), Positives = 174/328 (53%), Gaps = 34/328 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+II I  DD G++DVG +G   I TP++D LA  G+   N+Y  Q +C+ SR+A++TG 
Sbjct: 31  PNIILIFTDDQGYHDVGVYGASDIETPHLDQLASEGLQFTNFYVAQAVCSASRAALLTGT 90

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +P   G+ H  L    + GL   E  +   LK LGY T + GKWHLG +  E+ PT +GF
Sbjct: 91  YPNRLGI-HGALDHSSKHGLHPEEATIADLLKPLGYATAVFGKWHLG-HHPEFLPTNQGF 148

Query: 179 ESHLG------YWTGH---QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           + + G       W  H   +DY+      + ++  D  + ++  W+    ++T  FT ++
Sbjct: 149 DEYFGIPYSNDMWPNHPQTKDYY----PPLPIYQND--KVVDTIWNDQSMFTT-WFTEKS 201

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
           +D I  +  D P FLYLAH       P+ PL   D +           R  +  ++ ++D
Sbjct: 202 IDFIERNK-DRPFFLYLAHPM-----PHVPLFVSDKFKG------KSNRGLYGDVIMEID 249

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
            SVG++++ L+Q  +  N+II+F+SDNG   + ++ ++ S +PLR  K T W+GGV+   
Sbjct: 250 WSVGQIMQTLKQNDLDENTIIIFLSDNGPWLS-YSEHSGSAYPLREGKGTSWDGGVKVPA 308

Query: 350 LI-WSPLLESRGIVAEQYVHVSDWLPTL 376
           ++ W     S G+V +      D LP L
Sbjct: 309 IMRWKGTWPS-GLVQKNPAMTIDILPML 335


>gi|300773187|ref|ZP_07083056.1| N-acetylgalactosamine-6-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
 gi|300759358|gb|EFK56185.1| N-acetylgalactosamine-6-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
          Length = 443

 Score =  157 bits (397), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 110/353 (31%), Positives = 175/353 (49%), Gaps = 25/353 (7%)

Query: 46  LSMVFVDLVA-----SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
           L  +F+ L+A     +   P+ I I  DD+G+ DVG +G   I TPN+D +A  G+   N
Sbjct: 7   LITLFICLLAVFNSSAQTQPNFIIIYVDDMGYGDVGINGNPNIETPNLDRMAMEGMRFSN 66

Query: 101 YYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
           YY+    CT SR A++TGK+P   G +  VL   ++ G+   E  + + LKE GYRT I 
Sbjct: 67  YYSASPACTASRYALLTGKYPSRAGFRW-VLNPTDQIGIHQQESTIAERLKEKGYRTAIY 125

Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK 219
           GKWHLG  +KE+ P   GF+ ++G    +        +   + G D   +L P       
Sbjct: 126 GKWHLGSTRKEFLPLANGFDEYVGLPYSNDMIPPKYPDIALLSGYD-TLELNP----DQS 180

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
             T ++T +A+  I  ++  +P F+YL +A  H+     PL A + +L         KR 
Sbjct: 181 KLTRLYTEKAIAFITKNA-KQPFFIYLPYAMPHT-----PLHASEDFLGKS------KRG 228

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
            +  ++ +LD  +G+++  L++ ++   + +VF SDNG      N N  S    R  K +
Sbjct: 229 LYGDVVQELDHHIGRLLTFLKENKLDQQTYVVFTSDNGPWLIQ-NQNGGSAGLFRDGKGS 287

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNST 392
            WEGG+R    +W      +G V  +     D LPT+ + A  S  PN ++ T
Sbjct: 288 TWEGGMREPFFLWGHHTIPKGYVENEVFTALDMLPTITALAGISAGPNKIDGT 340


>gi|423348778|ref|ZP_17326457.1| hypothetical protein HMPREF1060_04129 [Parabacteroides merdae
           CL03T12C32]
 gi|409212738|gb|EKN05774.1| hypothetical protein HMPREF1060_04129 [Parabacteroides merdae
           CL03T12C32]
          Length = 461

 Score =  157 bits (397), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 108/337 (32%), Positives = 168/337 (49%), Gaps = 40/337 (11%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P+I+ +LADD G+ D GF G   I TPNID LA  G I  + +    + +PSRS ++TG+
Sbjct: 31  PNILILLADDAGYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLTGR 90

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +    G + N+    +  GLP  E++LP  LK   YRT  +GKWHLG  +    P  +GF
Sbjct: 91  YGQRYGYECNLDKPGD--GLPDDEELLPALLKRYDYRTGCIGKWHLG-SEPSQRPNAKGF 147

Query: 179 ESHLGYWTGHQDYF--------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           ++  G   GH+ YF        D + ++ +  G  +  D          Y TD   ++A 
Sbjct: 148 DTFYGLLAGHRSYFYDPETSDKDGNLQQYQYNGRKLSFD---------SYFTDELASKAQ 198

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
             +    +++P  LY++  A HS N     +A +  L      E   R K+AA+++ LD 
Sbjct: 199 QFVTE--SEQPFMLYMSFTAPHSPN-----EATEEDLA---RFEGQPRQKYAAMMYALDR 248

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA-G 349
            VGK+V+ L+      N+II F+SDNGG+      N +SN PL+G K   +EGG R    
Sbjct: 249 GVGKIVDELKAAGKFDNTIIFFLSDNGGSTT----NQSSNLPLKGFKGNKFEGGQRVPFF 304

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           ++W      R    +++  ++  L    +  +  DIP
Sbjct: 305 VVWG----DRFKRDQRFTGLTSSLDIFATVVDALDIP 337


>gi|354465430|ref|XP_003495183.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cricetulus
           griseus]
          Length = 493

 Score =  157 bits (397), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 106/338 (31%), Positives = 165/338 (48%), Gaps = 43/338 (12%)

Query: 67  ADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM 125
           A+ +GW D+G +G     TPN+D +A  G++  N+Y+   LC+PSR+A++TG+ PI  G 
Sbjct: 8   AEVMGWGDLGVYGEPSRETPNLDQMALEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGF 67

Query: 126 ------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
                   N     E  GG+P SE +LP+ LK+ GY  +IVGKWHLG ++ ++ P   GF
Sbjct: 68  YTSNGHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGF 126

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDV 224
           +   G    H   +D+  +      + + RD    W++ G++               T +
Sbjct: 127 DEWFGSPNCHFGPYDNKVKP----NIPVYRD----WEMVGRFYEEFPINLKTGEANLTQL 178

Query: 225 FTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
           +  EA+D I   H+   P FLY A  ATH+     P+ A   +L          R ++  
Sbjct: 179 YLQEALDFIRTQHARQSPFFLYWAIDATHA-----PVYASRPFLGTSH------RGRYGD 227

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
            + ++D+SVGK++  L+   +  N+ + F SDNG A         SN P    K T +EG
Sbjct: 228 AVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPKEGGSNGPFLCGKQTTFEG 287

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           G+R   + W P   + G V+ Q   + D   T LS A 
Sbjct: 288 GMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSIAG 325


>gi|87308809|ref|ZP_01090948.1| arylsulfatase A [Blastopirellula marina DSM 3645]
 gi|87288520|gb|EAQ80415.1| arylsulfatase A [Blastopirellula marina DSM 3645]
          Length = 474

 Score =  157 bits (396), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 112/355 (31%), Positives = 169/355 (47%), Gaps = 49/355 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P+ +F L DDLGW DVG +G     TPNID LA SG+   N Y    +C+P+R++IMTG+
Sbjct: 24  PNFVFFLVDDLGWADVGCNGSTFHETPNIDKLAGSGMRFTNGYAACPVCSPTRASIMTGR 83

Query: 119 HPIHT-------GMQ------HNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
           HP+         GM+      H   +  +R  L   E  + + L+  GY T   GKWHLG
Sbjct: 84  HPVRVDITDWIPGMKAKGAKNHRFQHIHDRDDLAPEETTIAESLRADGYSTFFAGKWHLG 143

Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWG---LDMRRDLEPAWDLHGKYST 222
              +   PT +GF+ ++G   GH D           W    L+ + D        G+Y T
Sbjct: 144 --DQGNWPTEQGFDINIG---GH-DRGSPPGGYYSPWNNPVLEAKHD--------GEYLT 189

Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA-PDHYLNIHRHIEDFK---- 277
           +  T E++  +      +P  LYLA+   H+  P +P Q    HY    +  E       
Sbjct: 190 ERLTEESIHFLETRDQAKPFLLYLAYYNVHT--PIQPYQKRVKHYQEKAKQFEGETPVQK 247

Query: 278 -----------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN 326
                        ++A+++  +D+SVG ++ AL++  +  N++++F SDNGG        
Sbjct: 248 EGGGKSRMRQDNPQYASMVAAVDDSVGAILAALDRLSLTDNTVVIFFSDNGGLCTTGGAG 307

Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             SN PLR  K  L+EGGVR   +I +P +   G V +  V   D+ PT+ S A 
Sbjct: 308 PTSNLPLRSGKGWLYEGGVREPMIIRAPGVTKAGSVCDTPVVSMDFFPTIRSLAG 362


>gi|255038246|ref|YP_003088867.1| sulfatase [Dyadobacter fermentans DSM 18053]
 gi|254951002|gb|ACT95702.1| sulfatase [Dyadobacter fermentans DSM 18053]
          Length = 481

 Score =  157 bits (396), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 117/357 (32%), Positives = 177/357 (49%), Gaps = 49/357 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I+FILADDLG+ DVGF+G   I TPNID LA  G+I   +Y    +C PSRS+++TG+
Sbjct: 30  PNIVFILADDLGYGDVGFNGQKLIKTPNIDKLAKEGMIFNQFYAGTSVCAPSRSSLLTGQ 89

Query: 119 HPIHTGMQHNVLYGCE-RGGLPLSEKI--LPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           H  HT ++ N   G E  G  P+++ +  L + LK+ GY T   GKW LG    E  P  
Sbjct: 90  HTGHTYIRGNK--GVEPEGQQPIADSVTTLAEVLKKSGYVTAAFGKWGLGPVGSEGDPNK 147

Query: 176 RGFESHLGY---WTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTDVFTAEAVD 231
           +GF+   GY      H+ Y +H  +  K   L+  + L     +H K Y+ D+   +A+ 
Sbjct: 148 QGFDRFYGYNCQSLAHRYYPEHLWDNSKKILLEGNKGL-----IHNKEYAPDLIQKKALS 202

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL----------NIHRHI-------- 273
            ++     +P FL+L +   H+      L  PD  L            H+          
Sbjct: 203 FVNAQDGKQPFFLFLPYILPHAE-----LVVPDDSLFRYYKGKFEEKPHKGADYGPGANG 257

Query: 274 -----EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG----GAAAGFN 324
                +DF  + FAA++ +LD  VG+V+ AL+++ +  N++++F SDNG    G A    
Sbjct: 258 GGYASQDFPHATFAAMVARLDLYVGQVMNALKKKGLDKNTLVIFTSDNGPHVEGGADPRF 317

Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            N+ + +  RGVK  L+EGG+R       P     G  ++      D LPT    AN
Sbjct: 318 FNSGAGF--RGVKRDLYEGGIREPFAARWPAAIKPGSKSDYIGAFWDILPTFAELAN 372


>gi|406834549|ref|ZP_11094143.1| Cerebroside-sulfatase [Schlesneria paludicola DSM 18645]
          Length = 470

 Score =  157 bits (396), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 104/344 (30%), Positives = 169/344 (49%), Gaps = 17/344 (4%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
             FT S +      ++G P++I IL DD G  D    G   + TP+ID LA  G+ L  +
Sbjct: 29  FCFTASGLASAADQATGKPNVIVILTDDQGTIDANCFGAKDLETPHIDGLAARGVRLTQF 88

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
           Y    +C+PSR+ ++TG++P+  G+  NV       G+P ++  + +  K  GY T  +G
Sbjct: 89  YAAAPVCSPSRAGLLTGRYPVRGGVPGNVSSTAGVAGMPTTQVTMAETFKAAGYATAHIG 148

Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP---AWDLH 217
           KWHLG Y ++  P  +GF++  G+  G  D F H       W    R DL          
Sbjct: 149 KWHLG-YSEDTMPLAQGFQTSFGHMGGCIDNFSH----FFYWQGPNRHDLHKDGREVQRP 203

Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
           G++  D+   EA   I  +  + P F+Y A        P+ P Q    +L  ++ +  + 
Sbjct: 204 GRFFPDLMVEEASQFIGTNR-ERPFFVYFA-----LNGPHYPYQGDPRWLERYKDLP-YP 256

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA-AGFNLNAASNWPLRGV 336
           R+ +AA +  +DE +G +++ +++  +   +I+VF SD+G +     +L   S    RG 
Sbjct: 257 RNLYAAFVSSIDERIGGLLKRVDELGLREKTIVVFQSDHGHSTEERAHLGGGSAGIYRGA 316

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           K +L+EGG+R   +I  P    +G    Q  H  DWLPTL + A
Sbjct: 317 KFSLFEGGIRVPAIISWPGRLPQGEARSQLAHGCDWLPTLAALA 360


>gi|255035933|ref|YP_003086554.1| sulfatase [Dyadobacter fermentans DSM 18053]
 gi|254948689|gb|ACT93389.1| sulfatase [Dyadobacter fermentans DSM 18053]
          Length = 497

 Score =  157 bits (396), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 119/374 (31%), Positives = 183/374 (48%), Gaps = 58/374 (15%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I++I ADDLG+ ++G +G  +I TPN+D LA  GI    +YT   +C P+R+ +MTGK
Sbjct: 27  PNIVYIYADDLGYGELGCYGQQKIKTPNLDRLAKEGIRFTQHYTGTPVCAPARAMLMTGK 86

Query: 119 HPIHTGMQHNVLYGC-----ERGGLPL--SEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
           H  H+ ++ N   G      ERG +PL  +E  + + LK+ GY T + GKW +G    E 
Sbjct: 87  HAGHSAIRGNFELGGFRDEEERGQMPLPANELTVAELLKQKGYATALTGKWGMGMNNTEG 146

Query: 172 TPTFRGFESHLGYW---TGHQDYFDHSAEEMKMWGL-----DMRRDLEPA-------WDL 216
           TPT +GF+ + GY      H  Y  H  E  +   L     D+ R L+PA          
Sbjct: 147 TPTRQGFDYYYGYLDQKQAHNLYPSHLWENDRWDTLAQPWQDIHRKLDPAKATDADFESF 206

Query: 217 HGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
            GK Y+    T +A+  I + S   P FLY+ +   H +     LQAPD Y  + ++I  
Sbjct: 207 KGKEYAPAKMTEKALAFI-DRSKAGPFFLYMPYTLPHVS-----LQAPDEY--VKKYIGQ 258

Query: 276 FKR----------------SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           F                  S +A+++  LD+ VG +++ L+   +  N+I++F SDNG  
Sbjct: 259 FDEKPYYGEKNYASTKYPLSTYASMITFLDDQVGIILDKLKALGLDDNTIVMFSSDNGAT 318

Query: 320 AAG------FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWL 373
             G      FN  A     LRG+K  ++EGG+R   ++  P     G V++      D +
Sbjct: 319 FNGGVNPQFFNSVAG----LRGLKMDVYEGGIREPFIVRWPGKIKPGRVSDHVSAQFDLM 374

Query: 374 PTLLSAANKSDIPN 387
           PTL     ++  P 
Sbjct: 375 PTLAELTGQASPPT 388


>gi|393784381|ref|ZP_10372546.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
           CL02T12C01]
 gi|392666157|gb|EIY59674.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
           CL02T12C01]
          Length = 456

 Score =  157 bits (396), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 112/329 (34%), Positives = 170/329 (51%), Gaps = 29/329 (8%)

Query: 58  GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMT 116
           G P+I+ I+ DDLG  D+       + TPNID L + G+ + N+Y    + +PSR+ ++T
Sbjct: 34  GNPNILLIMVDDLGLGDLSCQYAKDVQTPNIDRLFHQGVRMDNFYANSSVSSPSRAGLLT 93

Query: 117 GKHPIHTGMQHNVLYGCERGG---LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           G  P   G+   V+    +G    L     +LP+ +K+ GY+T I+GKWHLG  +    P
Sbjct: 94  GCFPDMVGVP-GVIRTDPKGSWGYLSPDATLLPEMMKKAGYQTAIIGKWHLGL-ESPNLP 151

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD-I 232
             RGF+   G+     D ++         G +  R      D  G ++TD+FT  A D I
Sbjct: 152 NERGFDFFHGFLGDMMDDYNTHLRR----GFNYMRLNTEEIDPQG-HATDLFTDWASDYI 206

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY----LNIHRHIEDFKRSKFAAILHKL 288
                 D+P FLYLA+ A HS     PLQ P  +       H +I +  R K  A++  +
Sbjct: 207 FKARKQDKPFFLYLAYNAPHS-----PLQPPVEWEKKVRKRHPNISE-TRGKLVALIEHM 260

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           D+ VG+V+ +LE+   L N++I+F SDNGG       + A+N P+RG K  +++GG++ A
Sbjct: 261 DDGVGRVINSLEKSGQLDNTLIIFCSDNGGDRK----SEANNGPVRGDKGDMYDGGIKVA 316

Query: 349 -GLIWSPLLESRGIVAEQYVHVSDWLPTL 376
             L W   LE R +     V +SD  PTL
Sbjct: 317 CSLYWKGHLEHRRV--NNLVMMSDIFPTL 343


>gi|406661473|ref|ZP_11069592.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
 gi|405554747|gb|EKB49822.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
          Length = 478

 Score =  157 bits (396), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 120/372 (32%), Positives = 182/372 (48%), Gaps = 51/372 (13%)

Query: 52  DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPS 110
           D    S  P+ +FI ADDLG+ D+G  G   I TPNID +A  GI   ++ +   +C+PS
Sbjct: 39  DQAIKSDLPNFVFIFADDLGYGDLGCFGASDIATPNIDRIAAEGIKFTSFLSASPVCSPS 98

Query: 111 RSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           R+ ++TG+ P   G+ + V +     G+   E  + + LK  GYRT IVGKWHLG  ++ 
Sbjct: 99  RAGLLTGRMPQRMGI-NTVFFPESLTGMDPEEITIAEILKTKGYRTGIVGKWHLGHLER- 156

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           + P  +GF  + G       Y +  A  + M G ++      A+ +  +Y T  +T E++
Sbjct: 157 FLPLNQGFYEYFGI-----PYSNDMASVVYMRGNEVE-----AYHVDQRYMTRTYTEESL 206

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
             I + S D+P FLYLAH       P+ P+ A   +           R  +  ++ +LD 
Sbjct: 207 KFI-DASGDQPFFLYLAHNM-----PHVPIYASPEFEGTS------NRGLYGDVIQELDW 254

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNG--------GAAAGFNLNAASNWPLRGVKNTLWE 342
           SVG+V+  LE++ +L N++IVF SDNG        G +AG          LR  K   +E
Sbjct: 255 SVGEVLRKLEEKGILENTLIVFSSDNGPWLVMEDHGGSAGH---------LREGKQFTFE 305

Query: 343 GGVRGAGL-IWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYE 401
           GGVR   + +W   +E  G V E      DW PT  S    ++IP       + II   +
Sbjct: 306 GGVRVPTVAMWKGKIEP-GQVYEDLATQMDWFPT-FSQLVGAEIPQ------DRIIDGKD 357

Query: 402 NSILRYENGTHE 413
              + +ENG  E
Sbjct: 358 LGPVLFENGNRE 369


>gi|154495003|ref|ZP_02034008.1| hypothetical protein PARMER_04049 [Parabacteroides merdae ATCC
           43184]
 gi|154085553|gb|EDN84598.1| arylsulfatase [Parabacteroides merdae ATCC 43184]
          Length = 464

 Score =  157 bits (396), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 108/337 (32%), Positives = 168/337 (49%), Gaps = 40/337 (11%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P+I+ +LADD G+ D GF G   I TPNID LA  G I  + +    + +PSRS ++TG+
Sbjct: 34  PNILILLADDAGYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLTGR 93

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +    G + N+    +  GLP  E++LP  LK   YRT  +GKWHLG  +    P  +GF
Sbjct: 94  YGQRYGYECNLDKPGD--GLPDDEELLPALLKRYDYRTGCIGKWHLG-SEPSQRPNAKGF 150

Query: 179 ESHLGYWTGHQDYF--------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           ++  G   GH+ YF        D + ++ +  G  +  D          Y TD   ++A 
Sbjct: 151 DTFYGLLAGHRSYFYDPETSDKDGNLQQYQYNGRKLSFD---------GYFTDELASKAQ 201

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
             +    +++P  LY++  A HS N     +A +  L      E   R K+AA+++ LD 
Sbjct: 202 QFVTE--SEQPFMLYMSFTAPHSPN-----EATEEDLA---RFEGQPRQKYAAMMYALDR 251

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA-G 349
            VGK+V+ L+      N+II F+SDNGG+      N +SN PL+G K   +EGG R    
Sbjct: 252 GVGKIVDELKAAGKFDNTIIFFLSDNGGSTT----NQSSNLPLKGFKGNKFEGGQRVPFF 307

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           ++W      R    +++  ++  L    +  +  DIP
Sbjct: 308 VVWG----DRFKRDQRFTGLTSSLDIFATVVDALDIP 340


>gi|311748006|ref|ZP_07721791.1| sulfatase family protein [Algoriphagus sp. PR1]
 gi|126574986|gb|EAZ79344.1| sulfatase family protein [Algoriphagus sp. PR1]
          Length = 529

 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 127/414 (30%), Positives = 194/414 (46%), Gaps = 72/414 (17%)

Query: 42  LAFTLSMVFVDLVASSGPP---HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           L +  S+  V   ++  PP   +I+FIL DDLG+NDV F G +   TPNID +A  G+I 
Sbjct: 8   LLYIFSLAMVSCKSNPEPPSNPNIVFILVDDLGYNDVSFMGSEFYETPNIDKVASKGMIF 67

Query: 99  KN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG-----------LP------L 140
            N Y    +C+PSR+++MTG+     G+   +  G  +G            LP      L
Sbjct: 68  TNGYANSAVCSPSRASLMTGRFTAVHGITDWI--GAPQGEAWRSYKRYSKLLPAEYNHQL 125

Query: 141 SEKI--LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE-----SHL------GYWTG 187
           S++I  LP+ LK+ GY+T   GKWHLG  ++   PT  GF+      H+      GY++ 
Sbjct: 126 SKEITTLPEALKQEGYKTFFAGKWHLGSAEENSLPTDHGFDINKGGYHVGGPYSGGYFSP 185

Query: 188 HQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLA 247
            ++ F    EE K   L M+                   A+  ++    + +E    YL+
Sbjct: 186 FENPFMEDFEEEKGISLSMK------------------LAKETNLFIEQNKEEKFLAYLS 227

Query: 248 HAATHSA------------NPYEPLQAPDHYLNIHRHIEDFKRSK---FAAILHKLDESV 292
             A H+             N  E +        + R +   K      +A ++ ++DE+V
Sbjct: 228 FYAVHAPIQTSKEKWSKYRNKAEAMGIDSTGFEMERVLPARKYQDNPVYAGLIEQMDEAV 287

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G VV+ LE+  +  N+IIVF SDNGG  +G N +   + PLRG K   WEGG R   +I+
Sbjct: 288 GAVVDQLEKLGLSENTIIVFTSDNGGVVSGDNYSTNLD-PLRGGKGYQWEGGTRIPYMIY 346

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILR 406
            P LE+ G   +  V   D+ PTLL  A  ++I   ++   ++I P  E   L 
Sbjct: 347 VPWLENNGQRNDTPVSGVDFYPTLLDFAGVNEISQPLDG--QSIRPVLEGGTLE 398


>gi|423725143|ref|ZP_17699283.1| hypothetical protein HMPREF1078_03177 [Parabacteroides merdae
           CL09T00C40]
 gi|409234771|gb|EKN27595.1| hypothetical protein HMPREF1078_03177 [Parabacteroides merdae
           CL09T00C40]
          Length = 461

 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 108/337 (32%), Positives = 168/337 (49%), Gaps = 40/337 (11%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P+I+ +LADD G+ D GF G   I TPNID LA  G I  + +    + +PSRS ++TG+
Sbjct: 31  PNILILLADDAGYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLTGR 90

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +    G + N+    +  GLP  E++LP  LK   YRT  +GKWHLG  +    P  +GF
Sbjct: 91  YGQRYGYECNLDKPGD--GLPDDEELLPALLKRYDYRTGCIGKWHLG-SEPSQRPNAKGF 147

Query: 179 ESHLGYWTGHQDYF--------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           ++  G   GH+ YF        D + ++ +  G  +  D          Y TD   ++A 
Sbjct: 148 DTFYGLLAGHRSYFYDPETSDKDGNLQQYQYNGRKLSFD---------GYFTDELASKAQ 198

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
             +    +++P  LY++  A HS N     +A +  L      E   R K+AA+++ LD 
Sbjct: 199 QFVTE--SEQPFMLYMSFTAPHSPN-----EATEEDLA---RFEGQPRQKYAAMMYALDR 248

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA-G 349
            VGK+V+ L+      N+II F+SDNGG+      N +SN PL+G K   +EGG R    
Sbjct: 249 GVGKIVDELKAAGKFDNTIIFFLSDNGGSTT----NQSSNLPLKGFKGNKFEGGQRVPFF 304

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           ++W      R    +++  ++  L    +  +  DIP
Sbjct: 305 VVWG----DRFKRDQRFTGLTSSLDIFATVVDALDIP 337


>gi|408371886|ref|ZP_11169642.1| arylsulfatase A family protein [Galbibacter sp. ck-I2-15]
 gi|407742679|gb|EKF54270.1| arylsulfatase A family protein [Galbibacter sp. ck-I2-15]
          Length = 480

 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 104/350 (29%), Positives = 178/350 (50%), Gaps = 44/350 (12%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I+F+LADDLG  D+G +G + I TPN+D +A  G++  ++Y+   +C PSR+++MTGK
Sbjct: 36  PNIVFVLADDLGIGDLGSYGQEYIKTPNLDRMAKEGMLFTDFYSGSTVCAPSRASLMTGK 95

Query: 119 HPIHTGMQHNVLYGCERGGLPL--SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           H  H  ++ N       G  PL  ++  + Q LK  GY T + GKW LG      +P  +
Sbjct: 96  HTGHLTIRGN-------GEFPLKSTDTTIAQILKSQGYHTAMFGKWGLGLQGTSGSPEKK 148

Query: 177 GFESHLGYWTGHQDYFD-HSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
           G++     +TGH  + D H  +   +W ++     + A +    ++ ++FT + ++ I  
Sbjct: 149 GWDE----FTGHLHHVDGHFQKPDSLWSIEKGGVKKHAIE-DDAFANEIFTQKGINFIDK 203

Query: 236 HSTDEPLFLYLAHAATHS--------ANPY-----EPLQAPDHYLNIHRHI--EDFKRSK 280
            S ++P FL++++   H+          PY     E +  P++     RH   + + ++ 
Sbjct: 204 QSDNQPFFLFMSYTVPHAELRVEDKYLAPYLNKHGESIFEPENPWPTGRHYGQQKYPKAA 263

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG---------FNLNAASNW 331
           +AA++  LD  VGK++E L+ + +  N++++F SDNG    G         FN  A    
Sbjct: 264 YAAMVTSLDSYVGKLLEKLKSKGLEENTLVIFTSDNGTHIEGGRTKEDVKFFNSTAG--- 320

Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            LRGVK  L+EGG+R   +   P    +G ++       D LPT    A 
Sbjct: 321 -LRGVKRDLYEGGIRTPFIAHWPGTIPKGSISSVTAAYWDLLPTFTELAG 369


>gi|325109705|ref|YP_004270773.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324969973|gb|ADY60751.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 443

 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 117/374 (31%), Positives = 187/374 (50%), Gaps = 41/374 (10%)

Query: 36  AFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
           +  VL LAF  S+           P+++ IL DDLG+ D+G +G + I TP +D +A SG
Sbjct: 10  SLQVLCLAFWCSIPTAQSAEIQQQPNVVLILVDDLGYGDLGCYGSESIRTPRLDRMAASG 69

Query: 96  IILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
           + L ++Y    +CTP+R+A+MTG +    G+    L+  +  G+  SE  L + +++ GY
Sbjct: 70  MKLTSFYAAAPICTPTRAALMTGCYATRVGLP-TPLHVYDEIGINESEFTLGEAMQQCGY 128

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKM-----WGLDMRRD 209
            T  VGKWHLG ++  + PT  GF  + G   GH   F+  A    +       LD  R+
Sbjct: 129 ETVCVGKWHLG-HQPRFYPTEHGFNHYWGTPLGHM--FNRPAVGKAIGDTSDLFLDDTRE 185

Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
           +    D      T+  T +AV+ I     D P FL+LAH       P+EPL A + + + 
Sbjct: 186 IPFPEDAD---LTERLTEKAVEFIEAKR-DRPFFLFLAHPM-----PHEPLAASEKFADR 236

Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG-----GAAAGFN 324
                  +   +  ++  LD S G+V++ALE+     N+++ F+SDNG     G  AG  
Sbjct: 237 S------QAGLYGDVVECLDWSTGEVLDALERTGQAENTLVAFLSDNGPKKGHGTTAG-- 288

Query: 325 LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
                   LRG K+  +EGG+R   +I +  L   G  +++  +V DW PTL++ A  + 
Sbjct: 289 --------LRGFKHDPYEGGMRVPCIIAAQGLIPAGQTSDEICNVMDWYPTLVNLAGGTT 340

Query: 385 IPNYVNSTVENIIP 398
           +   V    +N+ P
Sbjct: 341 VSEQVRDG-KNLTP 353


>gi|335058614|gb|AEH26503.1| arylsulphatase [uncultured Acidobacteria bacterium A11]
          Length = 491

 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 121/364 (33%), Positives = 168/364 (46%), Gaps = 59/364 (16%)

Query: 51  VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPS 110
             L A    P+I+FIL DD GW  +G +G   I TP++D LA  G      Y +  CTP+
Sbjct: 14  TSLFAGESKPNILFILTDDQGWPTLGCYGNKLIATPHLDRLATEGARFTAAYAMPQCTPT 73

Query: 111 RSAIMTGKHPIHTGMQH---NVLYG---------CERGGLPLSEKILPQYLKELGYRTRI 158
           R+A+ TG+H   TGM H   N  YG           R GLP     LP+ L+  GY T  
Sbjct: 74  RAALFTGQHTARTGMWHVLTNPWYGYPWAPMREPTWREGLPRDWFTLPKGLRAAGYVTGT 133

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG 218
            GKWHL     +    +R  +   G   G    FD  A          R D + A    G
Sbjct: 134 AGKWHL---TADADGDYRALKPEAGDAFG----FDFVAP---------RGDSQSA----G 173

Query: 219 KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR 278
               D  T  A+  I     ++P F YLAH   H      P+ AP+  +  HR     ++
Sbjct: 174 DKQVDFLTDAAIGFIREQR-EKPWFFYLAHHTVHG-----PVLAPEALIARHRARGAPEK 227

Query: 279 ----SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN---------- 324
               + F A L  LD SVG+++ AL++  +   +++VF+SDNGG +  ++          
Sbjct: 228 GSGNATFLAALEHLDNSVGRLMAALDELGLRERTLVVFMSDNGGVSRSYDPKPFTEGPGT 287

Query: 325 ---LNAAS----NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
              L+ A+    N PLRG K + +EGG+R   L+  P + + G V E   HV DWLPTLL
Sbjct: 288 DTQLHLANAEFPNAPLRGWKGSPYEGGIRVPCLVRWPGVVAAGRVVEAPAHVVDWLPTLL 347

Query: 378 SAAN 381
             A 
Sbjct: 348 EVAG 351


>gi|114799529|ref|YP_761144.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
 gi|114739703|gb|ABI77828.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
          Length = 459

 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 109/334 (32%), Positives = 165/334 (49%), Gaps = 39/334 (11%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
           A++ PP+II I+ADDLGW D+  +G   I TPNID +   GI L ++Y    +C+PSR+A
Sbjct: 34  AAAKPPNIIIIMADDLGWGDISLNGAALIETPNIDRIGQEGIQLTDFYAGSNVCSPSRAA 93

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++PI +GMQH V++   + GLP  E  + + LK  GYRT +VGKWHLG +++EY P
Sbjct: 94  LLTGRYPIRSGMQH-VIFPHSQDGLPAEEITISEMLKNAGYRTGMVGKWHLG-HQEEYWP 151

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
           T +GF           D+F        M   D+ R  E       +    +  A+A    
Sbjct: 152 TNQGF-----------DWFYGVPYSNDMAPFDLYRGKEIIESPADQSQLSLNYAKAAKEF 200

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
              S+D+P FLY A        P+ PL  P+                +  ++  +D  +G
Sbjct: 201 IEDSSDKPFFLYYAETF-----PHIPLFVPEDRSGTS------DAGLYGDVVETVDAGIG 249

Query: 294 KVVEALEQRRMLSNSIIVFVSDNG----GAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
            V++ L++  +  +++I+F SDNG    G+A  F          RG K    EGG R   
Sbjct: 250 IVLDTLDEAGVADDTLIIFTSDNGPWFEGSAGEF----------RGRKGETHEGGFRVPF 299

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           L   P    +G V+ +     D LPT  S +  +
Sbjct: 300 LARWPGHIPKGSVSHEMAMNIDLLPTAASLSGAT 333


>gi|254294928|ref|YP_003060951.1| sulfatase [Hirschia baltica ATCC 49814]
 gi|254043459|gb|ACT60254.1| sulfatase [Hirschia baltica ATCC 49814]
          Length = 487

 Score =  156 bits (394), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 102/352 (28%), Positives = 179/352 (50%), Gaps = 39/352 (11%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRS 112
           +A    P+IIFI+ADDLG+ DV  +G  +  TP ID+LA++G+     Y+   +C+ +R+
Sbjct: 49  IAQKKRPNIIFIMADDLGYADVSCYGRREYKTPAIDSLAHTGVKFTQAYSNSPVCSATRT 108

Query: 113 AIMTGKHP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           +++TG++   +  G++  + +     GLP +   LP  LK+ GY+T +VGKWHLG   K 
Sbjct: 109 SVITGRYQYRLPVGLEEPLTF--RDVGLPTTHPTLPGQLKKSGYQTALVGKWHLGALPK- 165

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHS--AEEMKMWGLDMRRDLEPAWDLH-GKYSTDVFTA 227
           Y P   G++   G   G  DYF H   A++  +W  ++        ++H  +Y T +   
Sbjct: 166 YGPLKSGYDHFWGVRGGSVDYFTHDSMAKQNDLWDGEV--------EIHETEYLTKLIGG 217

Query: 228 EAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR----------HIEDF 276
           +++++I   +  +EP F+ L     H   P+ P + P+      R          H++  
Sbjct: 218 KSIELIEKMNENEEPFFMSL-----HYTAPHWPWEGPEDKTESDRLKQGGPFSLLHLDGG 272

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
               +AA++  +D  +G+++  L++  +   ++IVF SDNGG         +  WP  G+
Sbjct: 273 SLETYAAMVTSMDNQIGRLLNKLDELGITEETLIVFTSDNGGE------RFSDTWPFSGI 326

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
           K  L EGG+R   +I  P +      ++Q     D +PTLL A   +  P++
Sbjct: 327 KTELLEGGLRVPLIIRWPGITKANTTSDQVSMSMDLMPTLLKAGQAAPHPDF 378


>gi|372210171|ref|ZP_09497973.1| sulfatase [Flavobacteriaceae bacterium S85]
          Length = 651

 Score =  156 bits (394), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 122/407 (29%), Positives = 190/407 (46%), Gaps = 89/407 (21%)

Query: 60  PHIIFILADDLGWNDVGFH---------GLDQIPTPNIDALAYSGIILKNYYTVQ-LCTP 109
           P+II I+ADDLG+ DVGF+         G+  IPTP +D LA +GII  N +     C P
Sbjct: 28  PNIIVIVADDLGYADVGFNRDANFPAEKGV--IPTPELDQLANNGIICTNGHVAHPFCGP 85

Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
           SR+A+MTG  P   G+Q+N+        G+PL E   P+ L++  Y T   GKWHLGF +
Sbjct: 86  SRAALMTGVQPSRIGVQYNLPNDINTSLGIPLEETYFPKILQQNNYHTAAFGKWHLGFTQ 145

Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEM---KMWG-----------LDMRRDLEPAW 214
            +Y P  RGF+   G+  G + YF+   E++   ++ G           L  +RD     
Sbjct: 146 GKYQPLDRGFDYFFGFLGGGKAYFEREYEDLYYRRLGGSNPVTNEYQDPLQRQRDYVAKD 205

Query: 215 DL-HGKYSTDVFTAEAVD-IIHNHSTDEPLFLYLAHAATHSA--------------NP-Y 257
           +    +Y TD+ T EA++ I  N +  +P F+Y+A+ A H+               NP +
Sbjct: 206 EFNQDEYLTDILTDEAINYIAENKTKSDPFFMYVAYNAPHTPLQAPAEEIAQFKEENPNF 265

Query: 258 EPLQAPDHYLN---------IHRHIEDFK------------------------------R 278
           E L     YL          + R ++  K                              R
Sbjct: 266 ESLVRNSDYLKESGAYELDKVERELKKVKDYDYNSLSSSEKEEKVADYREHLLDKFTESR 325

Query: 279 SKFAAILHKLDESVGKVVEALEQ-RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
             +A ++  +D+++G++V  L++ +++  N++IVF+SDNGG         A N PL  +K
Sbjct: 326 IVYATMVSNMDKNIGRIVAELKKDQKVFDNTMIVFLSDNGGYTYS---KGAVNHPLYALK 382

Query: 338 NTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            +++EGG R    I W   ++S      Q +   D  PTL+  A  S
Sbjct: 383 GSVFEGGHRVPFFIHWPNKIKSPKKYKYQ-ISSLDLYPTLVDLAGAS 428


>gi|149199736|ref|ZP_01876767.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149137141|gb|EDM25563.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 585

 Score =  156 bits (394), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 106/360 (29%), Positives = 178/360 (49%), Gaps = 55/360 (15%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           +A  L  +   + A +  P+II  L DD GW D+  +G   I TPNID+LA+ G + +N+
Sbjct: 4   IALVLIALIQFMAAENKKPNIIIFLTDDQGWGDLSINGNKDISTPNIDSLAHDGALFENF 63

Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           Y   +C+P+R+ ++TG++   +G++ +   G ER    L E+ +    K+ GY T   GK
Sbjct: 64  YVQPVCSPTRAELLTGRYAFRSGVR-STSEGGER--FNLDEQTIADVFKKAGYATGAFGK 120

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-- 219
           WH G  +  Y P  RGF+   G+ +GH             WG        P  + +GK  
Sbjct: 121 WHSGM-QYPYHPNGRGFDEFYGFCSGH-------------WG----NYYSPMLEHNGKIV 162

Query: 220 ----YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR---- 271
               +  D FT++A+D I  +  ++P  +YL +   HS     P+Q PD +   ++    
Sbjct: 163 KGQGFCVDDFTSKAMDFIEENK-EKPFLVYLPYNTPHS-----PMQVPDRFWEKYQDKTL 216

Query: 272 -----------HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
                      H EDF ++   A+   +D +VG+V+  L++ ++  N+I+++ SDNG  +
Sbjct: 217 TMKNTYNKKWGHGEDFVKAAL-AMCENIDWNVGRVLNKLDELQIADNTIVLYFSDNGPNS 275

Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             +N        ++G K ++ EGGVR    +  P   ++G V ++     D LPTL S A
Sbjct: 276 PRWNGE------MKGKKGSVDEGGVRSPLHLRWPQKVAQGQVIKEVTGAIDLLPTLSSMA 329


>gi|296122901|ref|YP_003630679.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296015241|gb|ADG68480.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 519

 Score =  156 bits (394), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 111/358 (31%), Positives = 174/358 (48%), Gaps = 35/358 (9%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           M   V+    T++ + V+   S   P+II ++ DD G+ D+  HG   + TP++D L   
Sbjct: 22  MIATVIGAGLTIARI-VEADESKTRPNIILMMTDDQGYGDLSLHGNPVVKTPHLDQLGRQ 80

Query: 95  GIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGY 154
            +  + ++    C P+R++IMT +H   +G+ H +L   ER  L L   ILPQ+LK  GY
Sbjct: 81  SVRFEQFHVSPTCAPTRASIMTSRHEFSSGVTHTIL---ERERLSLKATILPQFLKRAGY 137

Query: 155 RTRIVGKWHLGFYKKEYTPTFRGFES---HLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
            T I GKWHLG  +  Y P  RGF+    H G   G Q Y     +        + +   
Sbjct: 138 TTGIFGKWHLG-DEDAYQPGKRGFDEVFIHGGGGIG-QSYPGSCGDA------PLNKYFN 189

Query: 212 PAWDLHGK------YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
           P    +GK      Y T VF  +A+  I +   ++P F Y+   A H+     PL  P  
Sbjct: 190 PVIRHNGKFVATNGYCTKVFVDQAITWISSQPDNQPFFCYITPNAPHA-----PLDCPKE 244

Query: 266 YLNIH-RHI-EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
           Y   +  H+ ED  R  F  ++   D+ +G++++ALE R +  ++I++F++DNG A    
Sbjct: 245 YYEPYLEHVPEDVAR--FYGMITHWDDQLGRLLKALEDRDISKDTIVIFMTDNGSATGAK 302

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           + +A     +R  K T +EGG+R     WS     +  V ++     D LPTL   AN
Sbjct: 303 HFSAG----MRANKGTPYEGGIRVPAF-WSWAGHWQPQVRQEVTCHYDILPTLTELAN 355


>gi|325109525|ref|YP_004270593.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969793|gb|ADY60571.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 453

 Score =  156 bits (394), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 113/364 (31%), Positives = 183/364 (50%), Gaps = 35/364 (9%)

Query: 32  TRIMAFAVLPLAF---TLSMVFV---DLVASSGPPHIIFILADDLGWNDVGFHGLDQIPT 85
           T++  F+ +PL      LS+V      L A+   P+I+FIL DD G+ D+G +G  +I T
Sbjct: 2   TKLTLFSPIPLVLFETLLSLVLFAGGQLAAADQRPNIVFILTDDQGYGDLGCYGSTEIAT 61

Query: 86  PNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKIL 145
           P+ID LA  G+  +++Y    C+P+R+A MTG H    G+   V+Y  ++ GL   E  +
Sbjct: 62  PHIDRLAAEGMKFESFYVHNRCSPTRAAFMTGCHAQRVGLG-KVIYRKDKTGLSADEITV 120

Query: 146 PQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
            + L+  GYRT IVGKWHLG Y ++++P   GF++  G+     ++ D+    +      
Sbjct: 121 AELLQGAGYRTGIVGKWHLGEY-EQFSPVHHGFDTFYGF----MEWDDNRTTAIDNNRTI 175

Query: 206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSA-NPYEPLQAPD 264
           + RD+       G +S  +  A    I  N   D P FLY A    H+   P+E      
Sbjct: 176 VERDVSKT---DGVHSPKLLAAGIEFITENQ--DRPFFLYYASPLPHTRWKPHE------ 224

Query: 265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN 324
                 R     ++  +  ++ ++D  VG++++ L++  +  N+++VF SDNG      N
Sbjct: 225 ------RFAGSSEQGTYGDVVQEIDWQVGELMQTLDKLGLTDNTLVVFASDNG---PQLN 275

Query: 325 LNA-ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           +    S  PLR  K T +EGG+R   L+  P +   G V  + V + D LPT    A   
Sbjct: 276 VEGHGSAGPLRDGKWTNFEGGIRVPCLMRWPSVVPAGSVNNEIVGIIDMLPTFCEIAG-V 334

Query: 384 DIPN 387
           D+P+
Sbjct: 335 DVPS 338


>gi|168702101|ref|ZP_02734378.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Gemmata
           obscuriglobus UQM 2246]
          Length = 596

 Score =  155 bits (393), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 132/435 (30%), Positives = 202/435 (46%), Gaps = 47/435 (10%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           + AFAVL L F     F    +++G P+++ I+ DDLG  D+G +G     TPNID +A 
Sbjct: 2   LRAFAVLALGF-----FALPASAAGKPNVVLIVIDDLGQRDLGCYGSTFYKTPNIDRMAK 56

Query: 94  SGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG-----------LPLS 141
            G+   ++Y    +C+P+R++IMTGK+P   G+   +    +  G           L L 
Sbjct: 57  DGVRFTDFYAACPVCSPTRASIMTGKYPQRVGITDWLPGRKDLPGQRLKRPELKNELALE 116

Query: 142 EKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHL-GYWTGHQ-DYFDHSAEE- 198
           E  + + LK  GY T  +GKWHLG   K + P  +GF+ ++ G  TG    YF   A + 
Sbjct: 117 EVTVAETLKGHGYVTAHIGKWHLG--GKGFEPEKQGFDVNVAGDHTGTPLSYFAPFANKA 174

Query: 199 -MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
              M GL+     E        Y TD   AEA   I   + D+P FLYL H   H+    
Sbjct: 175 GATMPGLEKAAPDE--------YLTDRLAAEAETFI-TANKDKPFFLYLPHYGVHT---- 221

Query: 258 EPLQAPDHYLNIHRHIEDFKRSK---FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
            PL+AP   ++ ++      R     +AA++  +D +VG+V++ L+  ++  N++++F S
Sbjct: 222 -PLRAPQPLVDKYKTQAVHGRQSNPVYAAMVESMDAAVGRVLKRLDDLKLSDNTLVLFTS 280

Query: 315 DNGGAAA--GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDW 372
           DNGG A   G       N PLR  K  L+EGGVR   +   P     G V +Q     D+
Sbjct: 281 DNGGLATLEGMPFAPTINAPLREGKGYLYEGGVRVPLIAKWPGKVKPGTVMDQVACSIDF 340

Query: 373 LPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHE 432
             T+L A   +   +       +++P +    L+       Y  P   N  +R       
Sbjct: 341 FDTILEATGAT---SAARRDGVSLVPAFGGEKLKPRALYWHY--PHYANQGSRPGGAVRA 395

Query: 433 YNPKYENRYENGTHE 447
            N K    YE+G  E
Sbjct: 396 GNYKLVEYYEDGRRE 410


>gi|291232668|ref|XP_002736267.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 518

 Score =  155 bits (393), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 118/407 (28%), Positives = 192/407 (47%), Gaps = 57/407 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P++I +L DD+GW D+G  G     TPN+D +A  G ++ ++Y    LC+PSR++++TG+
Sbjct: 27  PNVILMLMDDMGWGDLGVLGNPAKETPNLDRMASEGALMTDFYAPNPLCSPSRASLLTGR 86

Query: 119 HPIHTGM-QHNVLYGCER------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
            PI  G    N    C        GG+P SE +LP+ L + GYR++I+GKWHLG ++ +Y
Sbjct: 87  LPIRNGFYTTNDHARCSYTPQYIVGGIPDSEIVLPELLNKAGYRSKIIGKWHLG-HQTQY 145

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS---------- 221
            P   GF+ + G    H   +D+  +      + + RD     D+ G+Y           
Sbjct: 146 HPLKHGFDEYFGAPNCHVGPYDNKKQP----NIPVYRDA----DMIGRYYEEFKIDKSGE 197

Query: 222 ---TDVFTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
              T +F  EA+  I   H T E  FLY    A+HS     P  A   +L         +
Sbjct: 198 SNLTQMFIEEAIAFIEKQHQTGEQFFLYWTPDASHS-----PHYASKSFLGTS------Q 246

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           R  +   + +LD  VG+++  L++ ++ +N+ + F SDNG A +G      SN P    K
Sbjct: 247 RELYGDAVMELDYGVGQILGKLKELQIENNTFVFFSSDNGAALSG-GAYVGSNGPFLCGK 305

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA-----------NKSDIP 386
           +T +EGG+R   + W P     G V+ Q   + D   T L  A             S +P
Sbjct: 306 HTTFEGGMREPSIAWWPGRIKPGKVSHQLGTIMDLFTTALDIAGVTLPTDRIIDGTSLLP 365

Query: 387 NYVNSTVEN---IIPRYENSILRYENGTHEYNSPRIENSNTRYENGT 430
             VN  +++   I     + ++   +G ++ +     NS   ++ GT
Sbjct: 366 ALVNGKIDSNKTIFFYRGDELMAARHGLYKAHFWTWTNSEEEFKRGT 412


>gi|146275662|ref|YP_001165822.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
 gi|145322353|gb|ABP64296.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
          Length = 462

 Score =  155 bits (393), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 109/332 (32%), Positives = 167/332 (50%), Gaps = 25/332 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK-NYYTVQLCTPSRSAIMTGK 118
           P+I+FI+ADDLG+ D    G   I TP ID++   G++L+  Y +  +C+P+R+A++TG 
Sbjct: 35  PNIVFIMADDLGYADTSATGSRHIRTPAIDSIGAGGVMLRQGYSSTPICSPTRTALLTGC 94

Query: 119 HP--IHTGMQHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           +      G++  +      G G+PL    +   +K LGYRT +VGKWHLG     + P  
Sbjct: 95  YAQRFAIGVEEPLGPNAPAGIGVPLDRPTIASVMKALGYRTSLVGKWHLG-EPPAHGPLK 153

Query: 176 RGFESHLGYWTGHQDYFDHS-AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
            G++  LG   G  DYF H      K  G+ +  D +   D  G Y TD+F  EAV +I 
Sbjct: 154 HGYDHFLGIVEGGADYFVHRMVMSGKPAGVGLAED-DAQTDRTG-YLTDIFGDEAVRVIE 211

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPD-----HYLNIHRHIEDFKRSKFAAILHKLD 289
               ++P FL L     H   P+ P +  +       L    H E    +K+  ++  +D
Sbjct: 212 -EGGNQPFFLSL-----HFTAPHWPWEGREDEKLARALPSSFHYEGGNLAKYREMVETMD 265

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           ++V KV+ A+++     N+++VF SDNGG         +  WP  G K  + EGGVR   
Sbjct: 266 QNVAKVLAAIDRSGKADNTVVVFTSDNGGE------RFSDTWPFVGHKGEVLEGGVRVPL 319

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           ++  P     G  +EQ +   D+LPTLL  A 
Sbjct: 320 MVRWPRRIKAGSRSEQVMVSMDFLPTLLGMAG 351


>gi|440714153|ref|ZP_20894738.1| N-acetylgalactosamine-6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
 gi|436440952|gb|ELP34233.1| N-acetylgalactosamine-6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
          Length = 484

 Score =  155 bits (393), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 112/349 (32%), Positives = 183/349 (52%), Gaps = 40/349 (11%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAI 114
           S+  P+I+ ILADDLG+ D+G +G D+  TP +D LA  G+   + Y     C+P+R+A+
Sbjct: 35  SNARPNIVLILADDLGYGDLGCYGNDEQATPVLDRLATQGVRWTQAYANGPECSPTRAAL 94

Query: 115 MTGKHPIHTGMQH------NV--------LYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
           +TG++  H G         NV        L+     GLP +   L + L  +GY T + G
Sbjct: 95  LTGRYQQHVGGLECAIGVGNVGRYDDAIRLHLVNELGLPANRPTLAKRLSSVGYETALFG 154

Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK- 219
           KWHLG Y+ +++P   GF+  L    G  DY+ H  + +  + L           + G+ 
Sbjct: 155 KWHLG-YEAKFSPMMHGFDEALYCIGGAMDYY-HYLDSVATYNL-----FHNGRPISGEG 207

Query: 220 YSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYE-PLQAPDHYLNIHRHIEDFK 277
           Y TD  T +AV  I + ++ D+P FLYL + A H+  PY+ P ++P   L +   +  +K
Sbjct: 208 YFTDTITDQAVRYIGDRNANDKPFFLYLPYTAPHT--PYQAPGESPVDPLPLDSPL--WK 263

Query: 278 RSK-----FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
           ++      + A++  +DE +GKV+ A+E+  M   ++++F SDNGG +      A+ N P
Sbjct: 264 QNADPPGVYRAMVRHMDEGIGKVLHAIEESEMTDRTLVIFASDNGGTS------ASRNEP 317

Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           LRG K   +EGG+R   +   P     G+V++Q     D   ++L+AA 
Sbjct: 318 LRGFKGQAFEGGIRVPLIARWPGHLPEGVVSDQVTITFDLTASMLAAAG 366


>gi|344237970|gb|EGV94073.1| N-acetylgalactosamine-6-sulfatase [Cricetulus griseus]
          Length = 483

 Score =  155 bits (393), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 105/335 (31%), Positives = 163/335 (48%), Gaps = 43/335 (12%)

Query: 70  LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM--- 125
           +GW D+G +G     TPN+D +A  G++  N+Y+   LC+PSR+A++TG+ PI  G    
Sbjct: 1   MGWGDLGVYGEPSRETPNLDQMALEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTS 60

Query: 126 ---QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
                N     E  GG+P SE +LP+ LK+ GY  +IVGKWHLG ++ ++ P   GF+  
Sbjct: 61  NGHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEW 119

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
            G    H   +D+  +      + + RD    W++ G++               T ++  
Sbjct: 120 FGSPNCHFGPYDNKVKP----NIPVYRD----WEMVGRFYEEFPINLKTGEANLTQLYLQ 171

Query: 228 EAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
           EA+D I   H+   P FLY A  ATH+     P+ A   +L          R ++   + 
Sbjct: 172 EALDFIRTQHARQSPFFLYWAIDATHA-----PVYASRPFLGTSH------RGRYGDAVR 220

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
           ++D+SVGK++  L+   +  N+ + F SDNG A         SN P    K T +EGG+R
Sbjct: 221 EIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPKEGGSNGPFLCGKQTTFEGGMR 280

Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
              + W P   + G V+ Q   + D   T LS A 
Sbjct: 281 EPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSIAG 315


>gi|404447701|ref|ZP_11012695.1| sulfatase [Indibacter alkaliphilus LW1]
 gi|403766287|gb|EJZ27159.1| sulfatase [Indibacter alkaliphilus LW1]
          Length = 486

 Score =  155 bits (393), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 106/337 (31%), Positives = 178/337 (52%), Gaps = 30/337 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+++ I  DD G+NDVG  G   I TPN+D +A  G+   N+Y  Q +C+ SR+A++TG 
Sbjct: 41  PNVVIIFTDDQGYNDVGVFGSKDISTPNLDNMAKEGVRFSNFYVAQAVCSASRAALLTGT 100

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +    G+ H  L      GL   E  + + LK LGY+T I GKWHLG +  E+ P  +GF
Sbjct: 101 YSNRLGI-HGALDHSSNHGLNPDETTIAEMLKPLGYQTAIFGKWHLG-HHPEFLPVQQGF 158

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--TDVFTAEAVDIIHNH 236
           + + G    +  + +H   +     L +  + E    LH   S  T V T +++D I  +
Sbjct: 159 DEYFGIPYSNDMWPNHPETKNYYPPLPLYENDEVVDTLHHDQSMLTTVLTEKSLDFIERN 218

Query: 237 STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVV 296
             + P FLYLAH       P+ PL   D +          +R  +  ++ ++D SVG+V+
Sbjct: 219 K-ETPFFLYLAHPM-----PHVPLFVSDKFEG------KSERGLYGDVIMEIDWSVGQVL 266

Query: 297 EALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
           E+L+ + +  N++++F+SDNG   + ++L+A S +PL+  K T W+GGV+        ++
Sbjct: 267 ESLKSKGLEENTLVIFLSDNGPWLS-YSLHAGSAYPLKEGKGTSWDGGVKIPS-----IM 320

Query: 357 ESRGIVAEQYVHVS-----DWLPTL--LSAANKSDIP 386
           + +G + E  V  +     D LPT+  ++ A   ++P
Sbjct: 321 QWKGKIPEGKVQTNPAMSIDILPTIAEITGAKLPELP 357


>gi|449138380|ref|ZP_21773646.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
 gi|448882925|gb|EMB13473.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
          Length = 467

 Score =  155 bits (392), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 110/343 (32%), Positives = 178/343 (51%), Gaps = 36/343 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
           P+I+ ILADDLG+ D+G +G D   TP +D LA  G+   + Y     C+P+R+A++TG+
Sbjct: 22  PNIVLILADDLGYGDLGCYGNDAQATPTLDRLARQGVRWTQAYANGPECSPTRAALLTGR 81

Query: 119 HPIHTGMQH------NV--------LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
           +  H G         NV        L+     GLP     L + L ++GY T + GKWHL
Sbjct: 82  YQQHVGGLECAIGVGNVGRYDDAIRLHLVSELGLPADRPTLAKRLSDVGYETALFGKWHL 141

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTD 223
           G Y+ +++P   GFE  L    G  DY+ H  + +  +  ++ R+  P   + G+ Y TD
Sbjct: 142 G-YEAQFSPMQHGFEEALYCIGGAMDYY-HYLDSVATY--NLFRNGRP---ISGEGYFTD 194

Query: 224 VFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPL-QAPDHYLNIHRHI---EDFKR 278
             T  AV  + + + +++P FLYL + A H+  PY+P  Q+P   L +   +    D   
Sbjct: 195 TITDHAVQYVRDRNDSEQPFFLYLPYTAPHT--PYQPPGQSPVDPLPLDSPLWKQSDDPP 252

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKN 338
             + +++  +DE +GKV+ A+E+  M   ++++F SDNGG +      A+ N PLRG K 
Sbjct: 253 GVYRSMVRHMDEGIGKVLHAIEESGMTDRTLVIFASDNGGTS------ASRNEPLRGFKG 306

Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             +EGG+R   +   P       V++Q     D   ++L+AA 
Sbjct: 307 QTFEGGIRVPLIARWPGHLPEDTVSQQVTITFDLTASMLAAAG 349


>gi|322434839|ref|YP_004217051.1| sulfatase [Granulicella tundricola MP5ACTX9]
 gi|321162566|gb|ADW68271.1| sulfatase [Granulicella tundricola MP5ACTX9]
          Length = 477

 Score =  155 bits (392), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 109/342 (31%), Positives = 164/342 (47%), Gaps = 29/342 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
           P+I+FILADD+G+ D   +G  +I TPNID +A  G+   + Y    +C PSR  +MTG 
Sbjct: 35  PNIVFILADDMGYGDPAVYGQTKIRTPNIDRMAAGGMRFTQGYAGAPVCAPSRCTLMTGM 94

Query: 119 HPIHTGMQHNVLYGC-----------ERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
           H  H  ++ N                 R  L   ++ +  YL+  GYRT ++GKWHL  Y
Sbjct: 95  HGGHARVRDNFALAAGHVGHKKKEEIRRASLTTEDRTVADYLRGAGYRTGLMGKWHLDGY 154

Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
             +  PT  GFE   G W    D       E +M G  +    E A    G+Y T + T 
Sbjct: 155 DPDAVPTKHGFEEFKG-WLTQIDETQGYWPEKRMHGETLIDIPENAGGKQGRYDTTMITE 213

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH--YLNIHRHIEDFKRSKFAAIL 285
           ++VD I  H  D P FLY+A+ + HS     P  APD   Y +     +D K   +AA++
Sbjct: 214 DSVDYIERHKAD-PFFLYVAYDSPHS-----PYTAPDFGPYKDHPTWADDEK--TYAAMI 265

Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA------SNWPLRGVKNT 339
             +D  +GK+++ L++ ++  ++++ F SDNG  +              SN  L G K  
Sbjct: 266 WYMDLGIGKILDTLKRLKLDEDTVVFFASDNGPRSEPTVQQTRVIDFFDSNGNLTGYKRD 325

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           ++EGG+R   +   P     G V +   +  D+LPT L  A 
Sbjct: 326 MYEGGIRDPLIARWPGHIPAGAVNQMSAYFPDFLPTALDLAG 367


>gi|255035424|ref|YP_003086045.1| sulfatase [Dyadobacter fermentans DSM 18053]
 gi|254948180|gb|ACT92880.1| sulfatase [Dyadobacter fermentans DSM 18053]
          Length = 457

 Score =  155 bits (392), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 108/343 (31%), Positives = 171/343 (49%), Gaps = 28/343 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I+FILADDLG+ D+G HG   + TPNIDALA  G+I  + +    +C+PSRS ++TG 
Sbjct: 23  PNIVFILADDLGYGDIGAHGQKLLRTPNIDALAKEGMIFTDIHAGAPVCSPSRSVLITGL 82

Query: 119 HPIHTGMQHNV-----LYG------CERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
           H  HT ++ N      + G        R  L   +  + + + + GY T + GKWHL  Y
Sbjct: 83  HTGHTTIRGNATIRGGIVGNKGKQTVRRANLAAGDFTVGKLMAQSGYTTALTGKWHLDGY 142

Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH-GKYSTDVFT 226
               TP  RGF+   G+   +   + +     K +   + +D+E   +   G Y+ D+ T
Sbjct: 143 DTLATPIHRGFDQFSGWLIAYPGTYANGYWPAKRYVNGVLKDVEQNENGRKGYYADDLTT 202

Query: 227 AEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI-EDFKRSKFAAI 284
            E++  +       +P  L + + + HS     PL A D      R   +D K   + A 
Sbjct: 203 DESLAFLAAQKDAKKPFVLMINYNSPHS-----PLDAADSSAYKDRDWPQDMK--IYGAQ 255

Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA------SNWPLRGVKN 338
           +H LDE+VGK+ + L +  +  N+I+ F SDNG  + G     A      SN  LRG K 
Sbjct: 256 VHHLDENVGKIKKYLTESGLAKNTIVFFCSDNGPRSEGTPQQTAIAEFFDSNGRLRGYKR 315

Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            ++EGG+R   ++W+P +   G V+ +  + +D +PT    A 
Sbjct: 316 DMYEGGIRVPMVVWAPGIVKPGSVSSEPAYFADIMPTFADIAG 358


>gi|443694992|gb|ELT96000.1| hypothetical protein CAPTEDRAFT_113428 [Capitella teleta]
          Length = 521

 Score =  155 bits (392), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 104/345 (30%), Positives = 170/345 (49%), Gaps = 49/345 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+II +L DD+GW D+G +G     TPN+D +A  G++  ++Y+   +C+PSR+++MTG+
Sbjct: 27  PNIIVMLMDDMGWGDLGSYGHPARETPNLDRMAAEGMLFTDFYSAAGICSPSRASVMTGR 86

Query: 119 HPIHTGM-------QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
            PI  G        +++ L     GG+  SE + P+ LK+ GYR +IVGKWHLG +++++
Sbjct: 87  LPIRNGFYSTNQFAKNSYLPQDVMGGISDSEVLFPELLKQNGYRNKIVGKWHLG-HREQF 145

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS---------- 221
            P   GF+   G    H  Y D     + ++   +         + G+Y           
Sbjct: 146 LPLQHGFDEWFGAPDCHYKYDDKVMPNIPVYKDSL---------MQGRYYDHFKIDIKTG 196

Query: 222 ----TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
               T ++  EA+D +     D P FLY A  ATH+     PL A + +          +
Sbjct: 197 LSNLTQLYIQEALDFLEKER-DRPFFLYWAPDATHT-----PLYASEKFRGKS------E 244

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN-----LNAASNWP 332
           R  +   + +LD  VGK++  L++  + +N+   F SDNG A   F      ++  SN P
Sbjct: 245 RGLYGDAVMELDSGVGKILNKLKELGIENNTFAFFSSDNGAATYAFTEVSIIISGGSNGP 304

Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
               K T +EGG+R   L W P     G +++Q  ++ D   T+L
Sbjct: 305 FLCGKQTTFEGGMREPALAWWPTKIRPGSISKQVGNLMDIFSTVL 349


>gi|218262868|ref|ZP_03477199.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
           DSM 18315]
 gi|218223078|gb|EEC95728.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
           DSM 18315]
          Length = 461

 Score =  155 bits (392), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 107/337 (31%), Positives = 169/337 (50%), Gaps = 40/337 (11%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P+I+ +LADD G+ D GF G   I TPNID LA  G I  + +    + +PSRS ++TG+
Sbjct: 31  PNILILLADDAGYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLTGR 90

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +    G + N+    +  G+P  E++LP  LK  GYRT  +GKWHLG  +    P  +GF
Sbjct: 91  YGQRYGYECNLDKPGD--GIPDDEELLPALLKRYGYRTGCIGKWHLG-SEPSQRPNAKGF 147

Query: 179 ESHLGYWTGHQDYF--------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           ++  G   GH+ YF        D + ++ +  G  +  D          Y TD   ++A 
Sbjct: 148 DTFYGLLAGHRSYFYDPETSDKDGNLQQYQYNGQKLSFD---------GYFTDELASKAR 198

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
             +    +++P  LY++  A HS N     +A +  L      E   R K+AA+++ LD 
Sbjct: 199 QFVAE--SEQPFMLYMSFTAPHSPN-----EAAEEDLA---RFEGQPRQKYAAMMYALDR 248

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA-G 349
            VGK+V+ L+      ++II F+SDNGG+      N +SN PL+G K   +EGG R    
Sbjct: 249 GVGKIVDELKAAGKFDDTIIFFLSDNGGSTT----NQSSNLPLKGFKGNKFEGGQRVPFF 304

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           ++W      R    +++  ++  L    +  +  DIP
Sbjct: 305 VVWG----DRFKRDQRFTGLTSSLDIFATVVDALDIP 337


>gi|319794759|ref|YP_004156399.1| sulfatase [Variovorax paradoxus EPS]
 gi|315597222|gb|ADU38288.1| sulfatase [Variovorax paradoxus EPS]
          Length = 439

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 112/347 (32%), Positives = 169/347 (48%), Gaps = 36/347 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIP----TPNIDALAYSGIIL-KNYYTVQLCTPSRSAI 114
           P+IIFI+ADDLG+ D+G +G         +P +D LA +G+ L + Y    +C+P+R A+
Sbjct: 5   PNIIFIVADDLGYADLGCYGGRDAAFGPVSPVLDGLAANGLKLTQGYANSPVCSPTRFAM 64

Query: 115 MTGKHPIHT-GMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
           +T ++     G     +    RG    GLP     LP  LK  GY+T ++GKWHLG Y  
Sbjct: 65  ITARYQYRLRGAAEEPIRSSSRGSTTLGLPTDHPTLPSLLKASGYQTALIGKWHLG-YPP 123

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDH--SAEEMKMWGL--DMRRDLEPAWDLHGKYSTDVF 225
            + P   G++   G  +G  DYF H  SA +  +W    D + D          Y TD+ 
Sbjct: 124 HFGPLRSGYDEFFGPMSGGVDYFTHCDSAGQHDLWFGEEDKKED---------GYLTDML 174

Query: 226 TAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK---F 281
           +  AVD +   +  E P FL L + A H   P+E     +    I  ++ D        +
Sbjct: 175 SKRAVDYVERMAKQEQPFFLSLHYTAPHW--PWETRDDAEKAPAIKDNLFDLAGGNIHVY 232

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
             ++H +DE +G ++ AL++  M  N+++VF SDNGG         + NWPL G K  L 
Sbjct: 233 RRMIHHMDEGIGWIMAALQKHGMSDNTLVVFTSDNGGE------RFSDNWPLVGGKMDLT 286

Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
           EGG+R   +   P + ++G  + Q     DW  T+L AA      NY
Sbjct: 287 EGGIRVPWIAHWPAVIAKGGESAQLCMTMDWSATMLEAAGVKADANY 333


>gi|384098332|ref|ZP_09999449.1| chondroitinsulfatase [Imtechella halotolerans K1]
 gi|383835828|gb|EID75248.1| chondroitinsulfatase [Imtechella halotolerans K1]
          Length = 464

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 113/350 (32%), Positives = 171/350 (48%), Gaps = 46/350 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+IIFILADDLG  D+G +G   I TP+ID LA  G+   N+Y    +C PSR+A++TG+
Sbjct: 25  PNIIFILADDLGIGDIGVYGQSIIQTPHIDRLAEEGMKFNNFYAGSTVCAPSRAALLTGQ 84

Query: 119 HPIHTGMQHNVLYGCERGGLPL--SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           H  HT ++ N       G  PL  ++KI+P+ LK  GY   I GKW +G      TP  R
Sbjct: 85  HTGHTQVRGN-------GEFPLDPAKKIIPEVLKNAGYTNAIFGKWGMGLNGSGSTPDKR 137

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLD--MRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
           GF+     + GH  +   SA   K   +D  +  DL+        Y  + FT +A+  I+
Sbjct: 138 GFD----VFAGHIHHV--SAHYQKPDSIDAIVHGDLKRIGLPKDTYVNEYFTEQALQFIN 191

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED---------------FKRS 279
           N   D+P FL++++   H+     P     H L+  +   D               F ++
Sbjct: 192 NQPKDKPFFLFMSYTIPHAELVVPPKYLQKHLLSAEQSKHDSEKEWPAGRHYGPQPFPKA 251

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA-----ASNWPLR 334
            +AA++  LD+  G+++ AL+ + +  N+I++F SDNG    G           S+   +
Sbjct: 252 AYAALVESLDDYTGQILAALKAKGIDENTIVIFTSDNGTHTEGGRTQEDVDYFGSSAHYQ 311

Query: 335 GVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVS---DWLPTLLSAA 380
           GVK  L+ GG++   LI W   + +R     Q  HVS   D  PT    A
Sbjct: 312 GVKRDLYTGGIKEPFLIRWPGTIPAR----TQSNHVSAFWDLYPTFAELA 357


>gi|332665095|ref|YP_004447883.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
 gi|332333909|gb|AEE51010.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
          Length = 531

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 112/395 (28%), Positives = 187/395 (47%), Gaps = 77/395 (19%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+IIFI+ADDLG++D+G +G  +  TPN+D LA  GI L+++Y    C P+R++++TG +
Sbjct: 43  PNIIFIMADDLGYSDIGCYG-GEAQTPNLDKLATKGIKLRSFYNAGRCCPTRASLLTGNY 101

Query: 120 PIHTGMQHNVLY-------GCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
               GM + V +       G  +G L  +   + ++L+++GY T + GKWH+G  + E+ 
Sbjct: 102 SHAAGMGNMVSFDDQKVTPGPYQGYLDPNTPTIAEHLRQVGYHTYMTGKWHVG-ERPEHW 160

Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           P  RGF+ + G  +G   +F+   E+ K + +   +D E      G Y+TD FT  A++ 
Sbjct: 161 PLKRGFDRYFGLISGASSFFEILQEKRKRYMV--LQDQEWVLPQEGFYATDAFTDRAIEF 218

Query: 233 IHNHS-TDEPLFLYLAHAATH-SANPYE-------------------------------- 258
           I   +   +P FLYLA+ A H   + YE                                
Sbjct: 219 IQGQAPQSKPFFLYLAYTAPHFPLHAYEADIAKYEALYLQGWDAIRQQRYEKMQGLGLVD 278

Query: 259 --------PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
                   P   PD      +     K + +AA++ ++D+++GK+++ LE R+ L N++I
Sbjct: 279 KRYQLSPRPANIPDWNSVQDKQTWARKMAVYAAMIDRMDQNIGKLIQTLEARQELDNTLI 338

Query: 311 VFVSDNGGAAAGFN-----------------------LNAASNWPLRGVKNTLWEGGVRG 347
           VF+SDNGG +   +                           SN P +  K+ + EGG+  
Sbjct: 339 VFLSDNGGTSENVDDRKLGLPGKRIGERGSYGTYDAPWANVSNTPFKLYKSYMHEGGIIS 398

Query: 348 AGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             ++ W   ++ +       VHV D LPT L  A 
Sbjct: 399 PCILHWPSKIKPKLGYLNTPVHVIDLLPTSLELAQ 433


>gi|323455471|gb|EGB11339.1| hypothetical protein AURANDRAFT_2359, partial [Aureococcus
           anophagefferens]
          Length = 367

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 117/348 (33%), Positives = 175/348 (50%), Gaps = 33/348 (9%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIP-----TPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           PHI+ +LAD++GW  V +H    +P     TP +DAL  SG+ L  +YT + C+PSRSA+
Sbjct: 1   PHIVIMLADNVGWAVVEWHRPPALPAREISTPKLDALRASGLELDRHYTYKFCSPSRSAL 60

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY-TP 173
           M+G+ P H  + +N        G P   +++P  L++ GYRT  VGKWHLG   +   TP
Sbjct: 61  MSGRLPAHVNV-YNDDPAMAGAGAPAHMRLMPAMLRDAGYRTHFVGKWHLGMASRSLQTP 119

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHG-------KYSTDVFT 226
             RGF++ LGY+    +Y+D +  E    G     DL   WD  G        Y   VF 
Sbjct: 120 AARGFDTSLGYFHSTNNYYDGTRAEGC--GDAAHADL---WDGDGPSHLNGTAYEELVFR 174

Query: 227 AEAVDIIHNHSTDEPLFLYLA-HAATHSANP-------YEPLQAPDHYLNIHRHIEDFKR 278
              VD++  H    PLF+Y A H +    +P        + LQ    Y +    I+D  R
Sbjct: 175 DRVVDVVEAHDAARPLFIYYALHTSCVGYDPDGTAGGERDSLQPDRAYYDKFSFIDDDDR 234

Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN-WPLRGVK 337
               A++  +D+  G VV AL    + + +++++ SDNGGA    +L   SN WPLRG  
Sbjct: 235 RANVAMVALMDDVAGDVVGALRTAGLWAKTLLLWSSDNGGA---VHLGGGSNTWPLRGGY 291

Query: 338 NTLWEGGVRGAGLIWSPLLE--SRGIVAEQYVHVSDWLPTLLSAANKS 383
              +EGGVR    +    L   +RG +  ++++++DW  T    A  S
Sbjct: 292 YNNFEGGVRAPAFLAGGALPEVARGKLLTEWIYIADWYATFARLAGAS 339


>gi|390944774|ref|YP_006408535.1| arylsulfatase A family protein [Belliella baltica DSM 15883]
 gi|390418202|gb|AFL85780.1| arylsulfatase A family protein [Belliella baltica DSM 15883]
          Length = 470

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 101/322 (31%), Positives = 160/322 (49%), Gaps = 44/322 (13%)

Query: 53  LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSR 111
           +   S  P+I+FI+ADDLG  D+  +G     TPN+  LA  GI+ K++Y    +C PSR
Sbjct: 25  VFGQSKKPNIVFIMADDLGVGDIEPYGQSIFETPNLSRLAQEGIMFKDFYAGSTVCAPSR 84

Query: 112 SAIMTGKHPIHTGMQHNVLYGCERGGLPLSE--KILPQYLKELGYRTRIVGKWHLGFYKK 169
           +++MTG+H     ++ N       G  PL    KILP+ LKE GY   + GKW LG  + 
Sbjct: 85  ASLMTGQHTGKIRIRGN-------GEFPLDPDVKILPEMLKENGYTNLMFGKWGLGLQES 137

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           E +P  RG++    ++ GH  +     ++     L +   L       G Y+ D FT  A
Sbjct: 138 ESSPEKRGWD----FFLGHLHHVSAHFQKPDSLDLFLNNTLGRVKVPEGTYANDWFTEGA 193

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL--------------------NI 269
           ++ I   S +EP F+YL++   H+      L  PD Y                       
Sbjct: 194 INYIETSSNEEPFFIYLSYTIPHAE-----LLVPDRYFKDFVNENGQSVFPNEIAWPDGK 248

Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA-- 327
           H   +++ ++ +AA++H LD+ VG+V++ALE + +  N+I++F SDNG    G       
Sbjct: 249 HYGQQNYPKAAYAALVHSLDDYVGQVMDALESKGIADNTILIFTSDNGTHIEGGRRMEDV 308

Query: 328 ---ASNWPLRGVKNTLWEGGVR 346
               S+   RGVK  L++GG++
Sbjct: 309 DYFQSSGQYRGVKRDLYDGGIK 330


>gi|149199845|ref|ZP_01876874.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
 gi|149137016|gb|EDM25440.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
          Length = 491

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 111/355 (31%), Positives = 179/355 (50%), Gaps = 41/355 (11%)

Query: 47  SMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQL 106
           SM    +   S  P+IIFIL DD G+ D+  HG   + TPN+D L    +    +Y    
Sbjct: 15  SMTLCSMAKQSKSPNIIFILTDDQGYGDMAVHGHPYLETPNMDRLHSESVRFDRFYVSPS 74

Query: 107 CTPSRSAIMTGKHPIHTGMQHNV-----LYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
           C+P+R+A+MTG H    G+ H V     LY   +G L +++      LKE GY+T  VGK
Sbjct: 75  CSPTRAALMTGMHEFRNGVTHTVQPREKLY---KGALTIAD-----ILKEGGYKTGFVGK 126

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS 221
           WHLG   K Y P +RGF+ +     G  ++FD          ++M R+ +  +   G + 
Sbjct: 127 WHLG-NDKGYAPQYRGFDWYAKNAKGPHNHFD----------VEMIRNGK-RFQTKG-FR 173

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH--IEDFKRS 279
            D F  EA+  +   + ++P FLYL      + +P+ PL AP+  L  ++   + D   +
Sbjct: 174 EDAFFDEAMTFM-KEAGEQPFFLYLC-----TYSPHTPLGAPEDLLKKYKAKGLND-NHA 226

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
            + A++  +D+++G++ + L++  +  ++I++F++DN G   G ++  A    +RG K T
Sbjct: 227 AYLAMIENIDDNLGRLDQFLKKENLYDDTILIFMNDN-GVTVGLDVYNAD---MRGPKCT 282

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVE 394
           +WEGG R   L   P       V     H+ D LPTL   A   D+P  V   +E
Sbjct: 283 IWEGGTRAFSLWRWPKKWQPKTVENLTAHL-DVLPTLCELAG-VDVPEKVQGELE 335


>gi|386819701|ref|ZP_10106917.1| arylsulfatase A family protein [Joostella marina DSM 19592]
 gi|386424807|gb|EIJ38637.1| arylsulfatase A family protein [Joostella marina DSM 19592]
          Length = 479

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 111/358 (31%), Positives = 175/358 (48%), Gaps = 52/358 (14%)

Query: 51  VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTP 109
           V +   S  P+I+FI+ADD+G  D+G +G   I TPNID LA +G+   N+Y    +C P
Sbjct: 32  VTISNHSQKPNIVFIIADDMGIGDLGCYGQQYIQTPNIDKLASTGMKFTNFYAGSTVCAP 91

Query: 110 SRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELGYRTRIVGKWHLGFY 167
           SR+++MTG+   HTG+     Y    G  PL  K  I+ Q LK+ GY T + GKW LG  
Sbjct: 92  SRASLMTGQ---HTGIS----YIRGNGEFPLRTKDTIISQILKKRGYTTAMFGKWGLGTE 144

Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFD-HSAEEMKMWGL---DMRRDLEPAWDLHGKYSTD 223
             E +P  +G++    Y+TGH  + D H  +   +W +   +M +   P       Y+ +
Sbjct: 145 TTEGSPEKKGWD----YFTGHLHHVDAHFQQPDSLWTIKKGEMAQIATPP----NSYANE 196

Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH------------- 270
           +FT +AV  I N + + P FLYL+        P+  L  P+ YL ++             
Sbjct: 197 IFTDKAVGFIENQTKENPFFLYLSFTV-----PHAELVVPNKYLKLYKNDQGQSIFMPEK 251

Query: 271 -----RHI--EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
                RH   + + ++ +AA++  +D  +G+V+E ++ +    N+++VF SDNG    G 
Sbjct: 252 AWPSGRHYGKQKYPKAAYAALVTSMDAYIGRVMEVIKSKGFDENTLVVFTSDNGTHIEGG 311

Query: 324 NL-----NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
                     S+   +GVK  L+EGG+R   +   P     G  +       D  PTL
Sbjct: 312 RTIEDVEFFKSSSIYKGVKRDLYEGGIREPFIASWPEKIKAGQTSNHIAAFWDITPTL 369


>gi|109094666|ref|XP_001113032.1| PREDICTED: arylsulfatase A isoform 2 [Macaca mulatta]
 gi|109094668|ref|XP_001113064.1| PREDICTED: arylsulfatase A isoform 3 [Macaca mulatta]
 gi|109094670|ref|XP_001113092.1| PREDICTED: arylsulfatase A isoform 4 [Macaca mulatta]
          Length = 507

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 114/350 (32%), Positives = 162/350 (46%), Gaps = 40/350 (11%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
           PP+I+ I ADDLG+ D+G +G     TPN+D LA  G+ L ++Y    LCTPSR+A++TG
Sbjct: 20  PPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRLTDFYVPASLCTPSRAALLTG 79

Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF-YKKEYTPTFR 176
           + P+ TGM   VL    RGGLPL E  L + L   GY T + GKWHLG   +  + P  +
Sbjct: 80  RLPVRTGMYPGVLVPSSRGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGSEGAFLPPHQ 139

Query: 177 GFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------MRRDLEPAW--DLHG 218
           GF   LG    H     Q+            G D           +  + +P W   L  
Sbjct: 140 GFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPRLEA 199

Query: 219 KYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
           +Y      A A D++ +    D P FLY A   TH               +     E   
Sbjct: 200 RY-----VAFARDLMADAQRQDRPFFLYYASHHTHYPQ-----------FSGQNFAERSG 243

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           R  F   L +LD +VG ++ A+    +L  ++++F +DNG      +    S   LR  K
Sbjct: 244 RGPFGDSLMELDAAVGTLMTAIRDLGLLEETLVIFTADNGPETMRMSRGGCSGL-LRCGK 302

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
            T +EGGVR   L + P   + G+  E    + D LPTL + A  + +PN
Sbjct: 303 GTTYEGGVREPALAFWPGHIAPGVTHELASSL-DLLPTLAALAG-APLPN 350


>gi|423288214|ref|ZP_17267065.1| hypothetical protein HMPREF1069_02108 [Bacteroides ovatus
           CL02T12C04]
 gi|392671103|gb|EIY64579.1| hypothetical protein HMPREF1069_02108 [Bacteroides ovatus
           CL02T12C04]
          Length = 473

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 106/352 (30%), Positives = 166/352 (47%), Gaps = 46/352 (13%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
           A S   +I+ +  DDLGW D+G        TPN+D L   G+  +  Y  Q   +PSR++
Sbjct: 33  AESKRMNIVMMFVDDLGWADLGLRN-PHYNTPNVDRLFADGMNFERAYVTQPASSPSRAS 91

Query: 114 IMTGKHPIHTGMQHNV----------------------LYGCERGGLPLSEKILPQYLKE 151
           +MTGK P+   +  ++                       Y   R  LP+ E I PQ +KE
Sbjct: 92  LMTGKQPVRFQLVRHIDNKVAGTTEPYSYWPENSPELNEYMPSRNWLPMEETIFPQLMKE 151

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
            GY    VGKWHLG    E  P  RG++   G  +   +  ++ A   +M G D  + + 
Sbjct: 152 CGYYNMFVGKWHLG--GPEVWPDKRGYDEVYG-LSDFGNVSNYYAPFFRMRGADTTKKIA 208

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
                +G Y TD  T +AV +I N+  D+P  L + +   HS N    +   D +     
Sbjct: 209 K----NGDYITDFLTDKAVSMIENYDRDQPFMLSMHYYNVHSPN----IGRKDLFDKYKA 260

Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
                 ++ +AA +  +DESVG++ +A++ R +  N+I+++ SD GG          SN+
Sbjct: 261 KGMTDIQANYAAQIEAVDESVGRIRKAIKDRGIEDNTIVIYFSDQGG--------LYSNY 312

Query: 332 PLRGVK---NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           PLRG K   +T+ EGG+R    I+ P +  +G V +  V   D+ PT +  A
Sbjct: 313 PLRGCKLIDDTMAEGGIRVPLAIYYPGVTKKGSVCQTPVQALDFFPTFMEIA 364


>gi|126663704|ref|ZP_01734700.1| putative arylsulfatase [Flavobacteria bacterium BAL38]
 gi|126624287|gb|EAZ94979.1| putative arylsulfatase [Flavobacteria bacterium BAL38]
          Length = 468

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 115/363 (31%), Positives = 178/363 (49%), Gaps = 40/363 (11%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           I+ FA+L   F +         ++  P+I+FILADD+G+N++G +G   I TPNID LA 
Sbjct: 6   ILTFAILIATFGIQ---AQETKNTKKPNIVFILADDMGYNELGSYGGKIIETPNIDQLAK 62

Query: 94  SGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERG-GLPLSEKILPQYLKE 151
            G+   N+Y    +C PSR  +MTGKH  H  ++ N     E    +P SE  + + LK 
Sbjct: 63  EGMKFSNHYCGSNICAPSRGTLMTGKHTGHAYIRDNKPLPYEGNEPIPASEITVAEILKT 122

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY--WTGHQDYFDHSAEEMKMWGLDMRRD 209
            GY T   GKW LG+   E +P  +GF+   GY       +YF     +  +  L+   D
Sbjct: 123 AGYTTGAFGKWGLGYPASEGSPNNQGFDQFYGYNGQIHAHNYFTSYLRKNDLVELNANID 182

Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
                  +  YS D+    A++ +  +  + P FLY      H  NPY   Q  D  L  
Sbjct: 183 AP-----YSVYSADIIKDRALEFVEVNKNN-PFFLYFCPTLPH--NPYH--QPDDKTLEY 232

Query: 270 H----------RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           +           H E+F   K+AA+  +LD+ VG+++  L++  +L N++I+F SDNG A
Sbjct: 233 YAKKTGFPIGDAHSEEFSVPKYAALSSRLDQQVGEIMAKLKELNLLDNTLIIFASDNGSA 292

Query: 320 AAG-FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL---ESRGIVAEQYVHVS---DW 372
                +    +   LRG K+ ++EGG++      SPL+   + + I      H+S   D+
Sbjct: 293 LTKEEDSYLRTGGDLRGRKSEVYEGGIK------SPLIAFWKGKIIPGSSSNHISAFWDF 346

Query: 373 LPT 375
           LPT
Sbjct: 347 LPT 349


>gi|402824519|ref|ZP_10873876.1| sulfatase family protein [Sphingomonas sp. LH128]
 gi|402261924|gb|EJU11930.1| sulfatase family protein [Sphingomonas sp. LH128]
          Length = 455

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 106/335 (31%), Positives = 161/335 (48%), Gaps = 48/335 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+ I IL DDLG+ D+   G   IPTPN++ +A  G++L +YY  Q +CTPSR+ ++TG+
Sbjct: 32  PNFIVILCDDLGYGDIEPTGGTAIPTPNLNRMAREGMVLTDYYAPQNICTPSRAGLLTGR 91

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +PI TG+    +   ++  LPL E  +P+ L+  GYR+ + GKWHLG    ++ PT  GF
Sbjct: 92  YPIRTGLAWGAILQPDKRILPLDEVTIPKALQPAGYRSGLFGKWHLGHLGPDWQPTRHGF 151

Query: 179 ESHLGYWTGH-----QDYFDHSA------EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           +   G    H     Q + D S       EE  M  L  R                 F  
Sbjct: 152 DRFYGIPYSHDMKPLQIFTDTSPDRRASEEEPVMAELQQR-----------------FYE 194

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
           EAV+ I  +  D+P F+ LA +A H AN   P           RH +D   S +  ++ +
Sbjct: 195 EAVNFIEANK-DQPFFVELALSAPHLANYPNP-----------RHRDDKMASAYGEVVEE 242

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL-WEGGVR 346
           +D  VG ++  L    +  +++++F SDNG    G      S+ PLR  K    ++GG R
Sbjct: 243 IDAIVGALMVKLRTLGLDRDTLVLFTSDNGPWYEG------SSGPLRDRKGGAGYDGGYR 296

Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
              + W P     G+  +      D+LPT  + A 
Sbjct: 297 VPCIAWQPGTVPAGLRCDSIAMGIDFLPTFCAMAG 331


>gi|423341845|ref|ZP_17319560.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409219938|gb|EKN12897.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 461

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 102/296 (34%), Positives = 154/296 (52%), Gaps = 35/296 (11%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P+I+ +LADD G+ D GF G   I TPNID LA  G I  + +    + +PSRS ++TG+
Sbjct: 31  PNILILLADDAGYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLTGR 90

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +    G + N+    +  G+P  E++LP  LK  GYRT  +GKWHLG  K    P  +GF
Sbjct: 91  YGQRYGYECNLDKPGD--GIPDDEELLPALLKRYGYRTGCIGKWHLG-SKPFQRPNAKGF 147

Query: 179 ESHLGYWTGHQDYF--------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           ++  G   GH+ YF        D + ++ +  G  +  D          Y TD   ++A 
Sbjct: 148 DTFYGLLAGHRSYFYDPETSDKDGNLQQYQYNGQKLSFD---------GYFTDELASKAR 198

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDE 290
             +    +++P  LY++  A HS N     +A +  L      E   R K+AA+++ LD 
Sbjct: 199 QFVAE--SEQPFMLYMSFTAPHSPN-----EATEEDLA---RFEGQPRQKYAAMMYALDR 248

Query: 291 SVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
            VGK+V+ L+      ++II F+SDNGG+      N +SN PL+G K   +EGG R
Sbjct: 249 GVGKIVDELKAVGKFDDTIIFFLSDNGGSTT----NQSSNLPLKGFKGNKFEGGQR 300


>gi|298377639|ref|ZP_06987590.1| arylsulfatase [Bacteroides sp. 3_1_19]
 gi|298265342|gb|EFI07004.1| arylsulfatase [Bacteroides sp. 3_1_19]
          Length = 589

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 106/344 (30%), Positives = 167/344 (48%), Gaps = 49/344 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II +L+DD GW D+GF G   + TPNID +A+ G IL+N+Y   + +P+R+  +TG++
Sbjct: 27  PNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYVCPVSSPTRAEFLTGRY 86

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            + +G+ ++   G ER    L EK + +Y +E GY T + GKWH G  +  Y P  RGFE
Sbjct: 87  HVRSGV-NSTTGGGER--FNLGEKTIAEYFREAGYATSLFGKWHSG-TQYPYHPNARGFE 142

Query: 180 SHLGYWTGH-----QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
              G+ +GH         +H+ E +   G                +  D  T +A+D I 
Sbjct: 143 EFYGFCSGHWGNYWNPVLEHNGEIISGEG----------------FIIDDLTDKALDYIR 186

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF------------A 282
           +H  + P F++L++   HS     P+Q PD + N  +     +R+ F             
Sbjct: 187 DHK-EHPFFMFLSYNTPHS-----PMQVPDSWWNRVKDRTLSQRATFPEQEDTTFTKAAL 240

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
           A+   LD ++G+V+  L    +   +I+++ SDNG  +  F  N      ++G K +  E
Sbjct: 241 ALAENLDWNIGRVLSLLHSLDLEQETIVIYFSDNGPNS--FRWNGG----MKGRKGSTDE 294

Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           GGVR    I  P    +G V  Q     D +PTLL  A     P
Sbjct: 295 GGVRSPFCIRWPGHIRKGAVETQLSGAIDLIPTLLGLAGIEYTP 338


>gi|417301368|ref|ZP_12088525.1| arylsulfatase A [Rhodopirellula baltica WH47]
 gi|327542298|gb|EGF28785.1| arylsulfatase A [Rhodopirellula baltica WH47]
          Length = 470

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 101/339 (29%), Positives = 167/339 (49%), Gaps = 33/339 (9%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P++I +  DD G+ND+G +G   I TPN+D LA  G    ++Y+   +C+PSR+A++TG 
Sbjct: 28  PNVIVVFTDDQGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGC 87

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +P   G+  +VL+   + GL   E  +  +LK  GY T  VGKWHLG + KE  PT  GF
Sbjct: 88  YPKRVGLHQHVLFPQSKHGLHPDEVTIADHLKSAGYATACVGKWHLG-HHKETLPTSNGF 146

Query: 179 ESH----------------LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA-WDLHGKYS 221
           +S+                LG  +    + D S+  + +W   + +D E     +  +  
Sbjct: 147 DSYYGIPYSNDMNHPDNKRLGKMSSDDRWTDQSS-AVTLWNTPLVQDEEIIELPVDQRTV 205

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
           T  +T  A++ +  +  D+P FLYL H+      P+ PL  P+   +         ++ +
Sbjct: 206 TRRYTDRAIEFVEANQ-DKPFFLYLPHSM-----PHIPLYVPEDVYDPD------PQNAY 253

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
             ++  +D  VG++V+ +    +   ++IV+ SDN G    F  +  S  PLR  K T +
Sbjct: 254 KCVIEHIDTEVGRLVQTVRDLGLSEKTLIVYTSDN-GPWLQFKNHGGSAGPLRAGKGTTF 312

Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           EGG R   ++W+P     G  +  +    D LPT+ S  
Sbjct: 313 EGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASVT 351


>gi|325110588|ref|YP_004271656.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324970856|gb|ADY61634.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 458

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 107/384 (27%), Positives = 189/384 (49%), Gaps = 42/384 (10%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSGPPH--IIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
            +  AV  +A T+S       A+  P H  ++ I+ADDLG+ D+  +G ++  TP +D +
Sbjct: 13  FLLLAVTTVALTVSQG----TAAEKPSHRNVVVIMADDLGYGDLSCYGCERYQTPFLDQM 68

Query: 92  AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERG-GLPLSEKILPQYL 149
           A  G+   ++++   +C+P+R+A++TG++    G+   +     R  GLPL      +  
Sbjct: 69  AAEGMKWSDFHSNGNVCSPTRAALLTGQYQQRWGVDRVITAAGHRDRGLPLEAVTFAETF 128

Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH--SAEEMKMWGLDMR 207
            + GY T I GKWH G Y+  + P  +GF+  +GY +G+ D+  H   A +   W  D  
Sbjct: 129 SQHGYATAIYGKWHQG-YQARFNPVRQGFDHFVGYVSGNVDFQSHIDQAGQADWWHNDEL 187

Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
            +          Y T + T  AV  I ++  ++P  LYL H       P+ P Q P+   
Sbjct: 188 TE-------EAGYVTHLITQYAVKFIEDNK-EQPFCLYLPHE-----TPHYPYQGPNDLA 234

Query: 268 N-------IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA 320
           +       ++    D K + +A ++ ++D  +G+V++ L++  +   ++++F SDNGG +
Sbjct: 235 DRTVGGKFVNHGSRDDKAAAYAEMMIEMDRGIGEVLQTLQKHGLAEQTLVLFFSDNGGTS 294

Query: 321 AGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G       N  LRG K + WEGG R   + W+P     G   +Q     D  PTL   A
Sbjct: 295 LG------DNGGLRGTKGSDWEGGHRVPMIAWNPETVPAGRDCDQLAITHDIFPTL---A 345

Query: 381 NKSDIPNYVNSTVENI--IPRYEN 402
           + +D+P   + +++ I  +P  E+
Sbjct: 346 DYADLPLPKDQSLDGISLLPYLED 369


>gi|227540350|ref|ZP_03970399.1| arylsulfatase [Sphingobacterium spiritivorum ATCC 33300]
 gi|227239674|gb|EEI89689.1| arylsulfatase [Sphingobacterium spiritivorum ATCC 33300]
          Length = 473

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 119/371 (32%), Positives = 177/371 (47%), Gaps = 46/371 (12%)

Query: 39  VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           +L  AF L +  + L A    P+I+FILADDLG+ D+G +G  +I TPNID LA  G+  
Sbjct: 9   LLTAAF-LCLSTIGLQAQERKPNIVFILADDLGYGDLGAYGQKKIQTPNIDRLAAEGMKF 67

Query: 99  KNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKI--LPQYLKELGYR 155
           + +Y    +C PSRS++MTG+H  HT ++ N     E G  PL++ +    Q L++ GY 
Sbjct: 68  EQFYAGTSVCAPSRSSLMTGQHTGHTYIRGNREIEPE-GQQPLADSVQTFVQLLQKAGYA 126

Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           T   GKW LG       P  +GF+   GY    Q    H      +W  D R +L+   D
Sbjct: 127 TGAFGKWGLGMVGTTGAPDQKGFDEFYGYNCQRQ---SHRYFPTHLWHNDQRVELQ-GND 182

Query: 216 LHGK--YSTDVFTAEAVDIIHNHSTDEPLFLY----LAHA-------------------A 250
           L  K  Y+  +   + +  I  H  DEP FL+    L HA                    
Sbjct: 183 LKNKVQYAPALIQEQTLRFIDAHK-DEPFFLFIPTVLPHAELSGIDDEHYQKYASRFEET 241

Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
            H  N Y P      Y ++ +      R+ FA+++ ++D  VG+V++ L+   +  N+I+
Sbjct: 242 PHKGNDYGPGATVTGYASVEK-----PRATFASMVSRMDAYVGQVLKKLDDLGLTENTIV 296

Query: 311 VFVSDNG----GAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
           +F SDNG    G A     N++    LRG K  L+EGG+R   ++  P        +E  
Sbjct: 297 IFSSDNGAHREGGADPVFFNSSGG--LRGYKRDLYEGGIRTPFIVKWPHKVKSHSTSEFT 354

Query: 367 VHVSDWLPTLL 377
               D +PTLL
Sbjct: 355 GAFWDVMPTLL 365


>gi|150010519|ref|YP_001305262.1| N-acetylgalactosamine 6-sulfatase [Parabacteroides distasonis ATCC
           8503]
 gi|149938943|gb|ABR45640.1| N-acetylgalactosamine 6-sulfatase [Parabacteroides distasonis ATCC
           8503]
          Length = 589

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 106/344 (30%), Positives = 167/344 (48%), Gaps = 49/344 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II +L+DD GW D+GF G   + TPNID +A+ G IL+N+Y   + +P+R+  +TG++
Sbjct: 27  PNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYVCPVSSPTRAEFLTGRY 86

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            + +G+ ++   G ER    L EK + +Y +E GY T + GKWH G  +  Y P  RGFE
Sbjct: 87  HVRSGV-NSTTGGGER--FNLGEKTIAEYFREAGYATSLFGKWHSG-TQYPYHPNARGFE 142

Query: 180 SHLGYWTGH-----QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
              G+ +GH         +H+ E +   G                +  D  T +A+D I 
Sbjct: 143 EFYGFCSGHWGNYWNPVLEHNGEIISGEG----------------FIIDDLTDKALDYIR 186

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF------------A 282
           +H  + P F++L++   HS     P+Q PD + N  +     +R+ F             
Sbjct: 187 DHK-EHPFFMFLSYNTPHS-----PMQVPDSWWNRVKDRTLSQRATFPEQEDTTFTKAAL 240

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
           A+   LD ++G+V+  L    +   +I+++ SDNG  +  F  N      ++G K +  E
Sbjct: 241 ALAENLDWNIGRVLSLLHSLDLEQETIVIYFSDNGPNS--FRWNGG----MKGRKGSTDE 294

Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           GGVR    I  P    +G V  Q     D +PTLL  A     P
Sbjct: 295 GGVRSPFCIRWPGHIRKGAVETQLSGAIDLIPTLLGLAGIEYTP 338


>gi|291225019|ref|XP_002732502.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 197

 Score =  154 bits (389), Expect = 2e-34,   Method: Composition-based stats.
 Identities = 72/143 (50%), Positives = 92/143 (64%), Gaps = 1/143 (0%)

Query: 44  FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
             L++V    +A S  PH+I I  DD+GWNDV +H  D I  PN+  LA  G+I    YT
Sbjct: 11  LALAVVSKPSIAESDQPHVILIFVDDMGWNDVHWHNPD-IAMPNLMDLAADGVIFNQSYT 69

Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
              CTPSR+A+M G +P  TG QH +++     G+PL  K+LP+ LKE+GY T +VGKWH
Sbjct: 70  HPTCTPSRAAMMKGLYPFKTGNQHQMVFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWH 129

Query: 164 LGFYKKEYTPTFRGFESHLGYWT 186
           LGF K EY PT RGF+SH G WT
Sbjct: 130 LGFCKDEYQPTNRGFDSHYGLWT 152


>gi|301308937|ref|ZP_07214882.1| arylsulfatase-like protein [Bacteroides sp. 20_3]
 gi|423338414|ref|ZP_17316156.1| hypothetical protein HMPREF1059_02081 [Parabacteroides distasonis
           CL09T03C24]
 gi|300832963|gb|EFK63588.1| arylsulfatase-like protein [Bacteroides sp. 20_3]
 gi|409233843|gb|EKN26675.1| hypothetical protein HMPREF1059_02081 [Parabacteroides distasonis
           CL09T03C24]
          Length = 589

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 106/344 (30%), Positives = 167/344 (48%), Gaps = 49/344 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II +L+DD GW D+GF G   + TPNID +A+ G IL+N+Y   + +P+R+  +TG++
Sbjct: 27  PNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYVCPVSSPTRAEFLTGRY 86

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            + +G+ ++   G ER    L EK + +Y +E GY T + GKWH G  +  Y P  RGFE
Sbjct: 87  HVRSGV-NSTTGGGER--FNLGEKTIAEYFREAGYATSLFGKWHSG-TQYPYHPNARGFE 142

Query: 180 SHLGYWTGH-----QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
              G+ +GH         +H+ E +   G                +  D  T +A+D I 
Sbjct: 143 EFYGFCSGHWGNYWNPVLEHNGEIISGEG----------------FIIDDLTDKALDYIR 186

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF------------A 282
           +H  + P F++L++   HS     P+Q PD + N  +     +R+ F             
Sbjct: 187 DHK-EHPFFMFLSYNTPHS-----PMQVPDSWWNRVKDRTLSQRATFPEQEDTTFTKAAL 240

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
           A+   LD ++G+V+  L    +   +I+++ SDNG  +  F  N      ++G K +  E
Sbjct: 241 ALAENLDWNIGRVLSLLHSLDLEQETIVIYFSDNGPNS--FRWNGG----MKGRKGSTDE 294

Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           GGVR    I  P    +G V  Q     D +PTLL  A     P
Sbjct: 295 GGVRSPFCIRWPGHIRKGAVETQLSGAIDLIPTLLGLAGIEYTP 338


>gi|404450774|ref|ZP_11015752.1| arylsulfatase A family protein [Indibacter alkaliphilus LW1]
 gi|403763554|gb|EJZ24502.1| arylsulfatase A family protein [Indibacter alkaliphilus LW1]
          Length = 469

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 109/368 (29%), Positives = 177/368 (48%), Gaps = 49/368 (13%)

Query: 42  LAFTLSMVFVDLVA-----SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
           L   L M+F           S  P+I+FI+ADDLG  D+G +G   IPTPN+D +A +G+
Sbjct: 8   LGVMLVMIFGPFAGLYAQHFSEKPNIVFIIADDLGIGDIGPYGQTIIPTPNLDKMASTGM 67

Query: 97  ILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYR 155
              ++Y    +C PSR+++MTG+H  HT ++ N  Y  E G      KILP+ LKE GY 
Sbjct: 68  KFNDFYAGSTVCAPSRASLMTGQHTGHTLVRGNGEYPLESG-----RKILPERLKEAGYT 122

Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD--MRRDLEPA 213
             + GKW LG      +P   G++    ++ GH  +   SA   K   LD  +  ++   
Sbjct: 123 NAMFGKWGLGLANTHSSPEKVGWD----HFVGHLHHV--SAHYQKPDSLDCIIEGEISRM 176

Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL------ 267
               GKY+ D+F   +++ I   + ++P FLYL+    H+      L  P+ Y       
Sbjct: 177 PVPKGKYANDLFVENSIEFIETQTDEQPFFLYLSLTVPHAE-----LLVPERYFQPFLDE 231

Query: 268 --------------NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
                          +H   + + ++ +AA++  +D+ VG+V+ ALE+  +  N++++F 
Sbjct: 232 SGKSIFPSEKAHPEGLHYGAQPYPKAAYAALVSSIDDYVGRVMSALEKIGIAENTLVIFT 291

Query: 314 SDNGGAAAGFNLNA-----ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVH 368
           SDNG    G           S+   RGVK  L++GG++   ++  P     G  ++    
Sbjct: 292 SDNGTHVEGGRRMEDVDYFKSSAHFRGVKRDLFDGGIKVPLMVNWPGKIKEGSTSDFRGA 351

Query: 369 VSDWLPTL 376
             D  PTL
Sbjct: 352 FWDLFPTL 359


>gi|223938010|ref|ZP_03629909.1| sulfatase [bacterium Ellin514]
 gi|223893411|gb|EEF59873.1| sulfatase [bacterium Ellin514]
          Length = 500

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 124/394 (31%), Positives = 186/394 (47%), Gaps = 64/394 (16%)

Query: 31  RTRIMAFAVLPLAFTLSMVFVDLVAS--------SGPPHIIFILADDLGWNDVGFHGLDQ 82
           R +IM  AV  + F  ++V+  L+ S        +  P+ +FILADDLGW DVGF+G   
Sbjct: 2   RMKIMKTAVERIVFGGNLVWALLLTSLCATRVHAADRPNFVFILADDLGWKDVGFNGSTF 61

Query: 83  IPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG---- 137
             TPN+D LA  G+   + Y    +C+P+R++IMTGK+P    +   +    ++      
Sbjct: 62  YETPNLDRLAREGMRFTDAYAACSVCSPTRASIMTGKYPARLHLTDWLPGRPDKPDQILK 121

Query: 138 -------LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHL-GYWTGH- 188
                  LP +E  L + L+E GY+T  +GKWHLG     + P   GF+ ++ G   GH 
Sbjct: 122 HPKIITELPAAEITLAKALQEGGYKTAFIGKWHLGGLG--HWPEQAGFDINIGGCGMGHP 179

Query: 189 QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAH 248
             YF          G        P     G+Y  D  T EAV  I N +   P  LYL+H
Sbjct: 180 SSYFSPYKNPTLKDG--------PV----GEYLADRLTDEAVKFIEN-TKGTPFLLYLSH 226

Query: 249 AATHSANPYEPLQAPDHYLN-IHRHIEDFKRSK--------------------FAAILHK 287
            + H+     PLQA    +    + +     +K                    +AA++  
Sbjct: 227 YSVHT-----PLQAKKGLIEKYQKKVMQLPPTKGPEFVTEGNTNARQVQNQPIYAAMMQS 281

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           LDESVG+V++ L++  +  N++I+F SDNGG +        SN PLR  K   +EGGVR 
Sbjct: 282 LDESVGRVLDKLKELGLDKNTVIIFTSDNGGLSTAEG-APTSNMPLRAGKGWPYEGGVRE 340

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             ++  P +     V++  V  +D+ PTLL  A 
Sbjct: 341 PLVVKWPGVTKAASVSDHQVMSTDYYPTLLEIAG 374


>gi|421612498|ref|ZP_16053605.1| arylsulfatase A, partial [Rhodopirellula baltica SH28]
 gi|408496794|gb|EKK01346.1| arylsulfatase A, partial [Rhodopirellula baltica SH28]
          Length = 487

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 102/343 (29%), Positives = 169/343 (49%), Gaps = 33/343 (9%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAI 114
           +S  P++I I  DD G+ND+G +G   I TPN+D LA  G    ++Y+   +C+PSR+A+
Sbjct: 43  TSEKPNVIVIFTDDQGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAAL 102

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TG +P   G+  +VL+   + GL   E  +  +LK  GY T  VGKWHLG + KE  PT
Sbjct: 103 LTGCYPKRVGLHQHVLFPQSKHGLHPDEVTIADHLKSAGYATACVGKWHLG-HHKETLPT 161

Query: 175 FRGFESH----------------LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA-WDLH 217
             GF+S+                LG  +    + D S+  + +W   + +D E     + 
Sbjct: 162 SNGFDSYYGIPYSNDMNHPDNKRLGKMSSDDRWTDQSS-AVTLWNTPLVQDEEIIELPVD 220

Query: 218 GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
            +  T  +T  A++ +  +  D+P FLYL H+      P+ PL  P+   +         
Sbjct: 221 QRTITRRYTDRAIEFVEANQ-DKPFFLYLPHSM-----PHIPLYVPEDVYDPD------P 268

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           ++ +  ++  +D  VG++++ +    +   ++IV+ SDN G    F  +  S  PLR  K
Sbjct: 269 QNAYKCVIEHVDTEVGRLIQTVRDLGLSEKTLIVYTSDN-GPWLQFKNHGGSAGPLRAGK 327

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            T +EGG R   ++W+P     G  +  +    D LPT+ S  
Sbjct: 328 GTTFEGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASVT 370


>gi|444722183|gb|ELW62881.1| N-acetylgalactosamine-6-sulfatase [Tupaia chinensis]
          Length = 764

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 109/351 (31%), Positives = 170/351 (48%), Gaps = 48/351 (13%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
           A  G P  +F +    GW D+G +G     TPN+D +A  G++  N+Y+   LC+PSR+A
Sbjct: 61  AGPGTPTALFFM----GWGDLGVYGEPSRETPNLDQMAAEGMLFPNFYSANPLCSPSRAA 116

Query: 114 IMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
           ++TG+ PI TG         N     E  GG+P SE +LP+ LK  GY ++IVGKWHLG 
Sbjct: 117 LLTGRLPIRTGFYTTNAHARNAYTPQEIVGGIPSSEHLLPELLKGAGYVSKIVGKWHLG- 175

Query: 167 YKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY------ 220
           ++ ++ P   GF+   G    H   +D+ A       + + RD    W++ G++      
Sbjct: 176 HRPQFHPLRHGFDEWFGAPNCHFGPYDNKARP----NIPVYRD----WEMVGRFYEEFPI 227

Query: 221 --------STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
                    T ++  EA+D I   +   P FL+ A  ATH+     P+ A   +L     
Sbjct: 228 SLKTGEANLTQIYLQEALDFIKRQAGRRPFFLHWAIDATHA-----PVYASQPFLGTS-- 280

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA--AAGFNLNAASN 330
               +R ++   + ++D+SVGK++  L    + +++ + F SDNG A  +A       SN
Sbjct: 281 ----QRGRYGDAVREIDDSVGKILALLRALGIGNSTFVFFTSDNGAALISAPRQGKCGSN 336

Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            P    K T +EGG+R   + W P     G V+ Q   + D   T LS A 
Sbjct: 337 GPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSLMDLFTTSLSLAG 387


>gi|241998988|ref|XP_002434137.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215495896|gb|EEC05537.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 167

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 76/161 (47%), Positives = 105/161 (65%)

Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
           +FT +A+ +I +H   +P+FLY +H A H  +    L+AP   +    +I++  RS  A 
Sbjct: 1   LFTKKALSLIKDHDPAKPMFLYFSHQAVHCGDYKVKLEAPASAIAHFPYIKEQNRSIHAG 60

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
            + +LD+SVG V+EAL +R ML NSIIVF +DNGG   G   N+  NWPLRG K TLWEG
Sbjct: 61  AVFELDKSVGLVMEALNKRGMLKNSIIVFSTDNGGLPWGKEPNSGYNWPLRGSKFTLWEG 120

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
           GVRGA  +WSPLL   G ++ Q +H++DWLPT  SAA + +
Sbjct: 121 GVRGAAFVWSPLLSKTGRLSNQLMHITDWLPTFYSAAGEKE 161


>gi|418528067|ref|ZP_13094017.1| sulfatase [Comamonas testosteroni ATCC 11996]
 gi|371454443|gb|EHN67445.1| sulfatase [Comamonas testosteroni ATCC 11996]
          Length = 442

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 105/341 (30%), Positives = 167/341 (48%), Gaps = 24/341 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQI------PTPNIDALAYSGI-ILKNYYTVQLCTPSRS 112
           P++IFILADDLG+ D+G  G           +P +DA+A  G+   + Y    +C+P+R 
Sbjct: 6   PNLIFILADDLGYADLGCTGARDAHNNATDVSPRLDAMAAQGLRFTRGYSNSSVCSPTRF 65

Query: 113 AIMTGK--HPIHTGMQHNV--LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
           A+ TG+  + +    +  +  ++G +  GLP     +   L++ GY T +VGKWHLG Y 
Sbjct: 66  ALATGRWQYRLRGAAEEPIASVHGDKVLGLPPDHPTVASLLRDAGYATALVGKWHLG-YP 124

Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
             + P   G+E   G+  G  DYF H     +    D   + +P       Y TD+ +  
Sbjct: 125 PHFGPRLSGYEEFYGFHAGGADYFAHCDPRGRP---DFWINEKP--HAEDGYLTDLLSRR 179

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL-NIHRHIEDFKRSKFAAILHK 287
           AVD +   +  +P +L L ++A H        +A    L  + +HI+      +  ++H 
Sbjct: 180 AVDFVKRQTAKQPFYLSLHYSAPHWPWLTREDRAESERLQGMGKHIDGGSIDTYQRMIHH 239

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE +G +++ALE++ M  N++IVF SDNGG         ++ WP  G K  L EGG+R 
Sbjct: 240 MDEGIGWLLDALEEKGMSENTLIVFTSDNGGE------RFSNTWPFVGQKMDLLEGGIRV 293

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
             L   P   + G V +      DW  T L+AA  S   +Y
Sbjct: 294 PLLARWPARMTAGAVCDTPSLTMDWSATFLAAAGVSADADY 334


>gi|355785121|gb|EHH65972.1| hypothetical protein EGM_02852 [Macaca fascicularis]
          Length = 507

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 113/350 (32%), Positives = 162/350 (46%), Gaps = 40/350 (11%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
           PP+I+ I ADDLG+ D+G +G     TPN+D LA  G+ L ++Y    LCTPSR+A++TG
Sbjct: 20  PPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRLTDFYVPAPLCTPSRAALLTG 79

Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF-YKKEYTPTFR 176
           + P+ TGM   VL    RGGLPL E  L + L   GY T + GKWHLG   +  + P  +
Sbjct: 80  RLPVRTGMYPGVLVPSSRGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGSEGAFLPPHQ 139

Query: 177 GFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------MRRDLEPAW--DLHG 218
           GF   LG    H     Q+            G D           +  + +P W   L  
Sbjct: 140 GFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPRLEA 199

Query: 219 KYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
           +Y      A A D++ +    D P FLY A   TH               +     +   
Sbjct: 200 RY-----VAFARDLMADAQRQDRPFFLYYASHPTHYPQ-----------FSGQNFAKRSG 243

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           R  F   L +LD +VG ++ A+    +L  ++++F +DNG      +    S   LR  K
Sbjct: 244 RGPFGDSLMELDAAVGTLMTAIRDLGLLEETLVIFTADNGPETMRMSRGGCSGL-LRCGK 302

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
            T +EGGVR   L + P   + G+  E    + D LPTL + A  + +PN
Sbjct: 303 GTTYEGGVREPALAFWPGHIAPGVTHELASSL-DLLPTLAALAG-APLPN 350


>gi|196234349|ref|ZP_03133178.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196221616|gb|EDY16157.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 483

 Score =  154 bits (388), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 110/354 (31%), Positives = 171/354 (48%), Gaps = 41/354 (11%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSA 113
           A+   P++IFILADDLG  D+G +G  +I TPNID LA  G+  L++Y    +C PSR A
Sbjct: 22  ATPAKPNVIFILADDLGIGDLGCYGQQKIRTPNIDHLAADGMRFLQHYTGCSVCAPSRCA 81

Query: 114 IMTGKHPIHTGMQHNVLYGCERGG---LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           +MTG+H  H  ++ N   G    G   +P     + + ++  GY T I+GKW LG  +  
Sbjct: 82  LMTGRHMGHAAIRDNAQRGPSEEGQRPMPQDTFTVARLMQNAGYYTGIIGKWGLGMPEDH 141

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE--PAWDLHGK--------- 219
            +P   GF    GY         H+     +W  + R  L   P++D+  K         
Sbjct: 142 SSPRDMGFNYSFGYLC---QSMAHTYYPPYLWRNNERETLAGNPSYDVSMKGVIEPKGEI 198

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATH--------SANPYE------PLQAPDH 265
           YS DV  ++A+  + +H  D+P FLYLA    H        S + Y       P +   H
Sbjct: 199 YSHDVMASDALKFVRDHH-DKPFFLYLAFTIPHLSLQVPEDSMSEYHGQWTETPFRNTKH 257

Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA--AAGF 323
           Y N      +  R+ +A ++ ++D  VG+++  L++  +  N+++ F SDNG     AG 
Sbjct: 258 YAN-----NETPRAAYAGMITRMDRDVGRLMALLKELGIDDNTLVFFSSDNGAVFPLAGT 312

Query: 324 N-LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           + +   S    RG K  L+EGG+R   +   P     G+  +Q     D+LPT+
Sbjct: 313 DPVFFQSTGGFRGYKQDLYEGGIRTPLIARWPGKIETGVTTDQASVFYDFLPTM 366


>gi|440714613|ref|ZP_20895192.1| arylsulfatase A [Rhodopirellula baltica SWK14]
 gi|436440809|gb|ELP34113.1| arylsulfatase A [Rhodopirellula baltica SWK14]
          Length = 470

 Score =  154 bits (388), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 109/363 (30%), Positives = 178/363 (49%), Gaps = 38/363 (10%)

Query: 39  VLPLAFTLSMVFVDLVAS-----SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           V+ LA  L++V    +A+     S  P++I I  DD G+ND+G +G   I TPN+D LA 
Sbjct: 2   VIGLAALLTLVGRPGLAAESNEKSDKPNVIVIFTDDQGYNDLGCYGSPNIKTPNLDRLAS 61

Query: 94  SGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKEL 152
            G    ++Y+   +C+PSR+A++TG +P   G+  +VL+     GL   E  +  +LK  
Sbjct: 62  EGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQHVLFPQSTYGLHPDEVTIADHLKSA 121

Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESH----------------LGYWTGHQDYFDHSA 196
           GY T  VGKWHLG + KE  PT  GF+S+                LG  +    + D S+
Sbjct: 122 GYATACVGKWHLG-HHKETLPTSNGFDSYYGIPYSNDMNHPDNKRLGKMSSDDRWTDQSS 180

Query: 197 EEMKMWGLDMRRDLEPA-WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSAN 255
             + +W   + +D E     +  +  T  +T  A++ +  +  D+P FLYL H+      
Sbjct: 181 -AVTLWNTPLVQDEEIIELPVDQRTVTRRYTDRAIEFVEANQ-DKPFFLYLPHSM----- 233

Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
           P+ PL  P+   +         ++ +  ++  +D  VG++V+ +    +   ++IV+ SD
Sbjct: 234 PHIPLYVPEDVYDPD------PQNAYKCVIEHIDTEVGRLVQTVRDLGLSEKTLIVYTSD 287

Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
           N G    F  +  S  PLR  K T +EGG R   ++W+P     G  +  +    D LPT
Sbjct: 288 N-GPWLQFKNHGGSAGPLRAGKGTTFEGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPT 346

Query: 376 LLS 378
           + S
Sbjct: 347 IAS 349


>gi|221069561|ref|ZP_03545666.1| sulfatase [Comamonas testosteroni KF-1]
 gi|220714584|gb|EED69952.1| sulfatase [Comamonas testosteroni KF-1]
          Length = 442

 Score =  154 bits (388), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 106/341 (31%), Positives = 169/341 (49%), Gaps = 24/341 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQI------PTPNIDALAYSGI-ILKNYYTVQLCTPSRS 112
           P++IFILADDLG+ D+G  G           +P +DA+A  G+   + Y    +C+P+R 
Sbjct: 6   PNLIFILADDLGYADLGCTGARDAHNNATDVSPRLDAMAAQGLRFTRGYSNSSVCSPTRF 65

Query: 113 AIMTGK--HPIHTGMQHNV--LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
           A+ TG+  + +    +  +  ++G +  GLP     +   L++ GY T +VGKWHLG Y 
Sbjct: 66  ALATGRWQYRLRGAAEEPIASVHGDKVLGLPPDHPTVASLLRDAGYATALVGKWHLG-YP 124

Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
             + P   G+E   G+  G  DYF H     +    D  R+ EP       Y TD+ +  
Sbjct: 125 PHFGPRLSGYEEFYGFHAGGADYFAHCDPRGRP---DFWRNEEP--HTEDGYLTDLLSRR 179

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL-NIHRHIEDFKRSKFAAILHK 287
           AVD +   + ++P +L L ++A H        +A    L  + +HI+      +  ++H 
Sbjct: 180 AVDFVKRQTAEQPFYLSLHYSAPHWPWLTREDRAESERLQGMGKHIDGGSIDTYQRMIHH 239

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE +G +++ALE++ M  N++IVF SDNGG         ++ WP  G K  L EGG+R 
Sbjct: 240 MDEGIGWLLDALEEKGMSENTLIVFTSDNGGE------RFSNTWPFVGQKMDLLEGGIRV 293

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
             L   P   + G V +      DW  T L+AA  +   +Y
Sbjct: 294 PLLARWPARMTAGAVCDTPSLTMDWSATFLAAAGVAAHADY 334


>gi|32476258|ref|NP_869252.1| arylsulfatase A [Rhodopirellula baltica SH 1]
 gi|32446802|emb|CAD76638.1| arylsulfatase A [Rhodopirellula baltica SH 1]
          Length = 489

 Score =  154 bits (388), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 102/337 (30%), Positives = 166/337 (49%), Gaps = 33/337 (9%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P++I I  DD G+ND+G +G   I TPN+D LA  G    ++Y+   +C+PSR+A++TG 
Sbjct: 47  PNVIVIFTDDQGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGC 106

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +P   G+  +VL+     GL   E  +  +LK  GY T  VGKWHLG + KE  PT  GF
Sbjct: 107 YPKRVGLHQHVLFPQSTYGLHPDEVTIADHLKSAGYATACVGKWHLG-HHKETLPTSNGF 165

Query: 179 ESH----------------LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA-WDLHGKYS 221
           +S+                LG  +    + D S+  + +W   + +D E     +  +  
Sbjct: 166 DSYYGIPYSNDMNHPDNKRLGKMSSDDRWTDQSS-AVTLWNTPLVQDEEIIELPVDQRTV 224

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
           T  +T  A++ +  +  D+P FLYL H+      P+ PL  P+   +         ++ +
Sbjct: 225 TRRYTDRAIEFVEANQ-DKPFFLYLPHSM-----PHIPLYVPEDVYDPD------PQNAY 272

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
             ++  +D  VG++V+ +    +   ++IV+ SDN G    F  +  S  PLR  K T +
Sbjct: 273 KCVIEHIDTEVGRLVQTVRDLGLSEKTLIVYTSDN-GPWLQFKNHGGSAGPLRAGKGTTF 331

Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
           EGG R   ++W+P     G  +  +    D LPT+ S
Sbjct: 332 EGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIAS 368


>gi|408674712|ref|YP_006874460.1| sulfatase [Emticicia oligotrophica DSM 17448]
 gi|387856336|gb|AFK04433.1| sulfatase [Emticicia oligotrophica DSM 17448]
          Length = 518

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 114/399 (28%), Positives = 188/399 (47%), Gaps = 87/399 (21%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II I+ADD+G++D+G +G  +I TPN+D LA +G+ L+N+Y    C P+R++++TGK+
Sbjct: 32  PNIIIIMADDMGFSDIGCYG-SEISTPNLDKLAANGLKLRNFYNAGRCCPTRASLLTGKY 90

Query: 120 PIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
               GM + V +  ++       G L  S   + + LK++GY T + GKWH+G    +Y 
Sbjct: 91  SHAVGMGNMVSFEDQKVPKDNYQGYLEPSVPTIAEDLKKVGYHTYMTGKWHVG-ESPDYW 149

Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           P  RGFE + G  +G   YF+   E+ K     ++ D E      G Y+TD FT +A++ 
Sbjct: 150 PLKRGFERYFGLISGASSYFEVLQEKRKR--FVVQDDKEYVLPKDGYYATDAFTDKAIEF 207

Query: 233 IHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDH----YLNIHRHIED------FKRSK- 280
           + +    + P FLYLA+ A     P+ PL A +     Y N +    D      FK+ K 
Sbjct: 208 LESSDKQNNPFFLYLAYTA-----PHFPLHAYEEDIAKYENFYLQGWDKTRTDRFKKMKQ 262

Query: 281 -----------------------------------FAAILHKLDESVGKVVEALEQRRML 305
                                              +AA++ ++D+++GK+VE L++ + +
Sbjct: 263 IGLLDKRYQLTERPNDIQPWEDVKEKKSWARKMAVYAAMIDRMDKNIGKLVEYLKKSQQI 322

Query: 306 SNSIIVFVSDNGGA-----------------------AAGFNLNAASNWPLRGVKNTLWE 342
            N+ I+F++DNG +                          F     SN P +  K  + E
Sbjct: 323 DNTFIIFLADNGASPENVDDRKLGEPNKRIGERGSYGTYDFQWANVSNTPFKLYKKFMHE 382

Query: 343 GGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           GG+    +I W   ++ +    +   HV D LPT L  A
Sbjct: 383 GGIISPCIIHWPAKIKPKKEFIDAPSHVIDLLPTSLELA 421


>gi|305665425|ref|YP_003861712.1| arylsulfatase A [Maribacter sp. HTCC2170]
 gi|88710180|gb|EAR02412.1| arylsulfatase A (precursor) [Maribacter sp. HTCC2170]
          Length = 489

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 116/358 (32%), Positives = 170/358 (47%), Gaps = 38/358 (10%)

Query: 42  LAFTLSMVFVDL---------VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALA 92
             F +S +F+ L            S PP+++ I  DD G+ DVG  G   I TP++D +A
Sbjct: 16  FLFVISFLFILLSCKDNDQKVTKKSPPPNVVLIFTDDQGYQDVGVFGSPNIKTPHLDRMA 75

Query: 93  YSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
             G+ L +YY+ Q +C+ SR+ ++TG +P   G+ HN L      G+  SE  + + LK 
Sbjct: 76  SEGVKLTSYYSAQAVCSASRAGLLTGCYPNRIGI-HNALGPGNTHGINSSETTMAEMLKN 134

Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG--YWTGHQDYFDHSAEEMKMWGLDMRRD 209
            GY+T I GKWHLG +  E+ P   GF+   G  Y      Y            L +  +
Sbjct: 135 NGYKTAIFGKWHLG-HHPEFMPNKHGFDEFFGIPYSNDMWPYHPQQGPIFNFPDLPLYEN 193

Query: 210 LEPAWDLHGKYS--TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
            E   D   + S  T   T  +VD I N + D P FLY+ H       P+ PL       
Sbjct: 194 -ETVIDTLTEQSNLTTQITERSVDFI-NRNKDNPFFLYVPH-----PQPHVPLFVS---- 242

Query: 268 NIHRHIEDFK----RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
                 E FK    R  +  ++ ++D SVG+++EAL++  +  N+I++F SDNG   +  
Sbjct: 243 ------EKFKGKSDRGLYGDVIMEIDWSVGQIIEALKKNGLEENTIVIFTSDNGPWLSYG 296

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           N +A S  P R  K T WEGG R   ++  P     G V +  +   D LPTL S  N
Sbjct: 297 N-HAGSALPFREGKGTAWEGGQREPFIMKYPNKLKAGEVIDVPMMAIDLLPTLASLTN 353


>gi|398990930|ref|ZP_10694093.1| arylsulfatase A family protein [Pseudomonas sp. GM24]
 gi|399012076|ref|ZP_10714404.1| arylsulfatase A family protein [Pseudomonas sp. GM16]
 gi|398116682|gb|EJM06441.1| arylsulfatase A family protein [Pseudomonas sp. GM16]
 gi|398142218|gb|EJM31121.1| arylsulfatase A family protein [Pseudomonas sp. GM24]
          Length = 568

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 125/435 (28%), Positives = 196/435 (45%), Gaps = 116/435 (26%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK- 118
           P+I+ I+ADD+G++D+   G  +I TPN+DA+A +G+   NY+T   C+P+RS +M+G  
Sbjct: 38  PNIVLIVADDMGYSDLSAFG-SEIQTPNLDAIARAGVTFSNYHTAPTCSPTRSMLMSGTD 96

Query: 119 -HPIHTGMQHNVLYGCERGGLP----LSEKIL--PQYLKELGYRTRIVGKWHLGFYKKEY 171
            H    G    ++   + G       L+++++  P+ L+  GYRT +VGKWHLG    E 
Sbjct: 97  NHIAGVGTMAELVLPEQAGKRGYEGYLTDRVVAFPKLLQANGYRTMMVGKWHLG-KTPEL 155

Query: 172 TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK---------YST 222
            P  RGF+       G  D+F          G    R+ + ++ L+G+         YS+
Sbjct: 156 GPKARGFDQSFALLEGSADHFTQ--------GGTTDRNPKASYRLNGEDIDLPKNFDYSS 207

Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED------- 275
           D FT  A++ I   +TD+P F YLA+ A     P+ PLQ P  YL  +R + D       
Sbjct: 208 DYFTRTAIEQI-GAATDKPFFAYLAYTA-----PHWPLQVPTAYLERYRGLYDKGYANIA 261

Query: 276 ---FKRSK--------------------------------------FAAILHKLDESVGK 294
               KR K                                      +AA++  LD++VGK
Sbjct: 262 EARLKRMKDKGLVAQDVQINPGPGVWPTWEKLSPRERALESRRMEVYAAMITNLDDNVGK 321

Query: 295 VVEALEQRRMLSNSIIVFVSDNG-------------------------------GAAAGF 323
           VV+ L+      N++ VF SDNG                               G+ +G+
Sbjct: 322 VVKHLKDTGQYDNTVFVFFSDNGAEGSNPDDVTPNNAKWIAENFDNSLDNIGKPGSFSGY 381

Query: 324 NLNAA--SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             N A  S+ P R  K   +EGG+R A  I  P +   G  ++ Y+ V D  PT L  A 
Sbjct: 382 GPNWARVSSTPFRMYKAFTYEGGIRTAAFIAGPGMR-HGEASDAYMSVKDLAPTFLELA- 439

Query: 382 KSDIPNYVNSTVENI 396
           + + P+  +++VE +
Sbjct: 440 RVEHPSTRDTSVEKM 454


>gi|403282838|ref|XP_003932845.1| PREDICTED: arylsulfatase A [Saimiri boliviensis boliviensis]
          Length = 509

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 113/350 (32%), Positives = 162/350 (46%), Gaps = 40/350 (11%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
           PP+I+ I ADDLG+ D+G +G     TPN+D LA  G+   ++Y  V LCTPSR+A++TG
Sbjct: 22  PPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLTG 81

Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK-EYTPTFR 176
           + P+  GM   VL    RGGLPL E  L + L   GY T + GKWHLG   +  + P  +
Sbjct: 82  RLPVRMGMYPGVLVPSSRGGLPLEEVTLAEVLAAQGYLTGMAGKWHLGVGPEGAFLPPHQ 141

Query: 177 GFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------MRRDLEPAW--DLHG 218
           GF   LG    H     Q+      +     G D           +  + +P W   L  
Sbjct: 142 GFHRFLGIPYSHDQGPCQNLTCFPPDTPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEA 201

Query: 219 KYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
           +Y      A A D++ +    D P FLY A   TH               +     E   
Sbjct: 202 RY-----IAFAHDLMADAQRQDRPFFLYYASHHTHYPQ-----------FSGQSFAERSG 245

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           R  F   L +LD +VG ++ A+    +L  ++++F +DNG      +    S   LR  K
Sbjct: 246 RGPFGDSLMELDAAVGALMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGL-LRCGK 304

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
            T +EGGVR   L + P   + G+  E    + D LPTL + A  + +PN
Sbjct: 305 GTTYEGGVREPALAFWPGHITPGVTHELSSSL-DLLPTLAALAG-APLPN 352


>gi|300771164|ref|ZP_07081040.1| N-acetylgalactosamine 6-sulfate sulfatase [Sphingobacterium
           spiritivorum ATCC 33861]
 gi|300761834|gb|EFK58654.1| N-acetylgalactosamine 6-sulfate sulfatase [Sphingobacterium
           spiritivorum ATCC 33861]
          Length = 473

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 120/371 (32%), Positives = 176/371 (47%), Gaps = 46/371 (12%)

Query: 39  VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
           +L  AF L +  + L A    P+I+FILADDLG+ D+G +G  QI TPNID LA  G+  
Sbjct: 9   LLTAAF-LCLSTIGLQAQERKPNIVFILADDLGYGDLGAYGQKQIQTPNIDRLAAEGMKF 67

Query: 99  KNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKI--LPQYLKELGYR 155
           + +Y    +C PSRS++MTG+H  HT ++ N     E G  PL++ +    Q L++ GY 
Sbjct: 68  EQFYAGASVCAPSRSSLMTGQHTGHTYIRGNREIEPE-GQQPLADSVQTFVQLLQKAGYA 126

Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           T   GKW LG       P  +GF+   GY    Q    H      +W  D R +L    D
Sbjct: 127 TGAFGKWGLGMVGTTGAPDQKGFDEFYGYNCQRQ---SHRYFPTHLWHNDQRVELH-GND 182

Query: 216 LHGK--YSTDVFTAEAVDIIHNHSTDEPLFLY----LAHA-------------------A 250
           L  K  Y+  +   + +  I  H  DEP FL+    L HA                    
Sbjct: 183 LKNKVQYAPALIQEQTLRFIDVHK-DEPFFLFIPTVLPHAELSGIDDEHYQKYASRFQET 241

Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
            H  N Y P      Y ++ +      R+ FA+++ ++D  VG+V++ L+   +  N+I+
Sbjct: 242 PHKGNDYGPGATVTGYASVEK-----PRATFASMVSRMDAYVGQVLKKLDDLGLTENTIV 296

Query: 311 VFVSDNG----GAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQY 366
           +F SDNG    G A     N++    LRG K  L+EGG+R   ++  P        +E  
Sbjct: 297 IFSSDNGAHREGGADPVFFNSSGG--LRGYKRDLYEGGIRTPFIVKWPHKVKPNSTSEFA 354

Query: 367 VHVSDWLPTLL 377
               D +PTLL
Sbjct: 355 GAFWDVMPTLL 365


>gi|154250816|ref|YP_001411640.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
 gi|154154766|gb|ABS61983.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
          Length = 553

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 121/371 (32%), Positives = 175/371 (47%), Gaps = 53/371 (14%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
           A + PP+I+ ILADDLG+ND+   G   +PTPNID++A  G    + Y+    C PSR+ 
Sbjct: 66  AGNRPPNIVVILADDLGFNDISHFGGGIVPTPNIDSIARGGANFTSAYSGTAACAPSRAM 125

Query: 114 IMTGKHPIHTGMQ----------------------HNVLYGCERG---------GLPLSE 142
           IMTG++   TG +                      H +L   E           GLP SE
Sbjct: 126 IMTGRYGTRTGFEFTPTPPGMTRIVDMFYNDGTRTHEMLVDREAAAKAPPFREQGLPGSE 185

Query: 143 KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTG---HQDYFDHSAEEM 199
             L + LK  GY    +GKWHLG    E+ P  +GF+  +   +G    +D  D    ++
Sbjct: 186 ITLAEALKPKGYHNIHIGKWHLG-NAPEFLPNAQGFDESVMLESGLFLPEDSPDVVNAKL 244

Query: 200 KMWGLD------MRRDLE---PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAA 250
               +D      M+        AW     Y TD +T EA+  I  +  + P FLYLAH  
Sbjct: 245 PFDPIDQFLWARMQYATSYNGSAWFEPKGYLTDFYTDEAIKAIEANR-NRPFFLYLAHWG 303

Query: 251 THSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
            H+     PLQA     +   HIED +   +AA++  LD SVG+V+++L++  +  N+++
Sbjct: 304 VHT-----PLQASKADYDALSHIEDERLRVYAAMIVALDRSVGRVLQSLKENGLEENTLV 358

Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
           +F SDNG  A G+      N P RG K T +EGG+R       P     G      V   
Sbjct: 359 IFSSDNG--APGYIGLPDVNKPYRGWKLTFFEGGIRVPFFAKWPARIPAGTERTTPVAHL 416

Query: 371 DWLPTLLSAAN 381
           D  PT+++AA 
Sbjct: 417 DMFPTIVAAAG 427


>gi|296121822|ref|YP_003629600.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296014162|gb|ADG67401.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 638

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 172/366 (46%), Gaps = 39/366 (10%)

Query: 25  LKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIP 84
           L   G  + +M  A   +  + +       A   PP+I+ +LADD GW D    G   + 
Sbjct: 15  LMGFGISSYVMLLACCVVLISPTSSEAQQSARPQPPNIVVLLADDAGWGDYSVSGNPYVK 74

Query: 85  TPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKI 144
           TP+ID++A  G+ L N+Y   +C+P+R+  +TG++ + TG++  V  G ER  L L E+ 
Sbjct: 75  TPHIDSIAQQGVSLTNFYVCPVCSPTRAEFLTGRYALRTGVR-GVSLGEER--LNLDEQS 131

Query: 145 LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWG 203
           + ++ ++ GYRT I GKWH G  +  Y P  RGF+  LGY  GH  +Y D   E     G
Sbjct: 132 IAEHFRKAGYRTGIFGKWHNG-SQGPYHPLARGFDVQLGYTAGHWSEYIDAPLESQ---G 187

Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
             +  +          Y  D     A+D I   S+  P F Y+     HS     P   P
Sbjct: 188 RPVTSE---------GYIVDTCMNAAIDFI--SSSQPPFFCYVPLTTPHS-----PWCVP 231

Query: 264 DHYLN--------IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             Y N        +     D  R  + A++ + D++VG+++  L+   + SN+I+++ SD
Sbjct: 232 QTYWNRWKDRDVGLTGKEADAVRCVY-AMMEQQDDAVGRLLARLDSLHLSSNTIVLYFSD 290

Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
           NG     +N +      +RG K T+ EGGVR  G +  P     G      +   D LPT
Sbjct: 291 NGPNTVRWNGD------MRGRKGTVDEGGVRSVGFLRWPGHIPPGSTQTGLIGAIDLLPT 344

Query: 376 LLSAAN 381
           L   AN
Sbjct: 345 LAGLAN 350


>gi|390958067|ref|YP_006421824.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
 gi|390958408|ref|YP_006422165.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
 gi|390412985|gb|AFL88489.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
 gi|390413326|gb|AFL88830.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
          Length = 470

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 119/368 (32%), Positives = 172/368 (46%), Gaps = 47/368 (12%)

Query: 43  AFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKN 100
           AF LS +     A +   P+IIFILADD+G+ D   +G  +  TP+ID LA  G+   + 
Sbjct: 21  AFALSNLTKSFAAQTKRRPNIIFILADDMGYADTSVYGQRKFKTPHIDRLAAEGVRFAQA 80

Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGC-----------ERGGLPLSEKILPQYL 149
           Y    +C PSRS +MTG +  HT ++ N                 R  L  ++  +  YL
Sbjct: 81  YAGAPVCAPSRSVLMTGLNTGHTRVRDNFALAAGHVGFKGKEEIRRASLTDADHTIADYL 140

Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT---GHQDYFDHSAEEMKMWGLDM 206
              GY+T + GKWHL  Y    TPT  GF++  G+ T     Q YF              
Sbjct: 141 ARSGYKTALFGKWHLDGYDPAATPTRHGFQTFKGWLTQTASTQGYFP-----------TQ 189

Query: 207 RRDLEPAWDL-------HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
           R D E   DL        G+Y T + T ++V+ I   S+D P FLY+A+ A HS     P
Sbjct: 190 RYDNERLIDLPENANGKQGRYDTTMITDDSVNYIQ-QSSDTPFFLYVAYDAPHS-----P 243

Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
             APD   +  +   D     +AA++  LD  VGK++ AL+Q+ +  ++I+ F SDNG  
Sbjct: 244 YIAPDQ-GSYAKEPWDEDEKIYAAMIGHLDNGVGKILAALKQKGIDEDTIVFFASDNGPR 302

Query: 320 AAGFNLNAA------SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWL 373
           +              SN  L+G K  ++EGG+R   +   P     G V++  +   D L
Sbjct: 303 SEPTPQQTKVVNFFDSNGALQGYKRDMYEGGIREPFIARWPGHIPAGKVSQVPIFFPDVL 362

Query: 374 PTLLSAAN 381
           PT L  A 
Sbjct: 363 PTALDLAG 370


>gi|431799329|ref|YP_007226233.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
 gi|430790094|gb|AGA80223.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
          Length = 454

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 110/350 (31%), Positives = 172/350 (49%), Gaps = 25/350 (7%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
           PP+II I ADDLG+ D+G +G   I TP +D +A  G+    +Y    +CTPSR+A++TG
Sbjct: 29  PPNIIVIFADDLGYGDLGCYGHPTIQTPYLDRMAKEGMRFTQFYVGADVCTPSRAALLTG 88

Query: 118 KHPIHTGM---QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           + PI  GM   +  VL+     GLP +E  +   LK  GYRT IVGKWHLG +  EY PT
Sbjct: 89  RLPIRYGMAGEERGVLFPDSSKGLPHTEATMASALKAAGYRTGIVGKWHLG-HLPEYLPT 147

Query: 175 FRGFESHLG--YWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
             GF+ + G  Y    +    +    + ++  +   DL    D+  +  T  +T EA+  
Sbjct: 148 THGFDFYFGIPYSNDMRPNPHNHVPPLPLYRNEEVIDL----DIDQRQLTKRYTKEAIRF 203

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I  +  + P FLY  +       P+ PL A   +          KR  +  ++ +LD SV
Sbjct: 204 IEENQ-ERPFFLYYPNNF-----PHVPLYASSDFEG------KSKRGIYGDVVSELDWSV 251

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G+++E L+   +  N++++F SDNG   +      ++     G K + +EGG+R   + W
Sbjct: 252 GRILEKLKALDLAENTLVIFTSDNGPWLSQKEKGGSAGLLFEG-KASTYEGGMRVPAIAW 310

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
            P       ++   V   D  PT L+ A   ++P       +NI+P   N
Sbjct: 311 WPGTIKPNQISTALVASMDLYPTFLTMAGL-NMPQDKTLDGDNILPILRN 359


>gi|449138001|ref|ZP_21773306.1| arylsulfatase A [Rhodopirellula europaea 6C]
 gi|448883380|gb|EMB13908.1| arylsulfatase A [Rhodopirellula europaea 6C]
          Length = 470

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 102/339 (30%), Positives = 166/339 (48%), Gaps = 33/339 (9%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
           P++I I  DD G+ND+G +G   I TPN+D LA  G    ++Y+   +C+PSR+A++TG 
Sbjct: 28  PNVIVIFTDDQGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGC 87

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +P   G+  +VL+     GL   E  +  +LK  GY T  VGKWHLG + KE  PT  GF
Sbjct: 88  YPKRVGLHQHVLFPQSNYGLHPEEVTIADHLKSAGYATACVGKWHLG-HHKETLPTSNGF 146

Query: 179 ESH----------------LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA-WDLHGKYS 221
           +S+                LG  +    + D S+  + +W   + +D E     +  +  
Sbjct: 147 DSYYGIPYSNDMNHPDNKRLGKMSSDDRWTDQSS-AVTLWNTPLVQDEEIIELPVDQRTI 205

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
           T  +T  A++ +  +  D+P FLYL H+      P+ PL  P+   +         ++ +
Sbjct: 206 TRRYTDRAIEFVEANQ-DKPFFLYLPHSM-----PHIPLYVPEDVYDPD------PQNAY 253

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
             ++  +D  VG++V+ +    +   ++IV+ SDN G    F  +  S  PLR  K T +
Sbjct: 254 KCVIEHIDTEVGRLVQTVRDLGLSKKTLIVYTSDN-GPWLQFKNHGGSAGPLRAGKGTTF 312

Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           EGG R   ++W+P     G  +  +    D LPT+ S  
Sbjct: 313 EGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASVT 351


>gi|351700549|gb|EHB03468.1| Arylsulfatase A [Heterocephalus glaber]
          Length = 507

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 111/372 (29%), Positives = 172/372 (46%), Gaps = 42/372 (11%)

Query: 45  TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT- 103
            L + +   +A++ PP+I+ I ADDLG+ D+G +G     TPN+D LA  G+   ++Y  
Sbjct: 6   ALCVAWAASLATTSPPNIVLIFADDLGYGDLGAYGHPSSTTPNLDRLAAGGLRFTDFYVP 65

Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
           V LCTPSR+A++TG+ P+ TGM   VL    +GGLPL E  L + L   GY T + GKWH
Sbjct: 66  VSLCTPSRAALLTGRLPVRTGMYPGVLGPTSQGGLPLEEVTLAEVLAARGYLTGMAGKWH 125

Query: 164 LGFYKK-EYTPTFRGFESHLGYWTGHQD-----------------YFDHSAEEMKMWGLD 205
           LG   +  + P  +GF   LG    H                     D     + +   +
Sbjct: 126 LGVGPRGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPAATRCDGVCDQGLVPIPLLA-N 184

Query: 206 MRRDLEPAW--DLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQA 262
           +  +++P W   L  +Y      A A D++ +      P FLY A   TH          
Sbjct: 185 LSVEVQPPWLPGLEARY-----VAFARDLMADAQHQGRPFFLYYASHHTHYPQ------- 232

Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
                +    +    R  F   L +LD +VG ++ A+    +L  ++++F +DNG     
Sbjct: 233 ----FSGQSFVGSSGRGPFGDSLMELDAAVGALMMAVGDLGLLGETLVIFTADNGPETMR 288

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
            +    S   LR  K T +EGGVR   + + P   + G+  E    + D LPT L+A + 
Sbjct: 289 MSRGGCSGL-LRCGKGTTFEGGVREPAVAFWPGHITPGVTHELASSL-DLLPT-LAALSS 345

Query: 383 SDIPNYVNSTVE 394
           + +PN     V+
Sbjct: 346 APLPNVTLDGVD 357


>gi|410965904|ref|XP_003989478.1| PREDICTED: arylsulfatase A [Felis catus]
          Length = 507

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 115/381 (30%), Positives = 171/381 (44%), Gaps = 60/381 (15%)

Query: 45  TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT- 103
            L++ F   +A++GPP+I+ I ADDLG+ D+G +G     TPN+D LA  G+   ++Y  
Sbjct: 6   ALALAFAIGLATAGPPNIMLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYVP 65

Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
           V LCTPSR+A++TG+ P+  G+   VL    RGGLPL E  L + L   GY T I GKWH
Sbjct: 66  VSLCTPSRAALLTGRLPVRMGLYPGVLEPSSRGGLPLEEVTLAEVLAARGYLTGIAGKWH 125

Query: 164 LGFYKK-EYTPTFRGFESHLGYWTGHQD-----------------YFDHSAEEMKMWGLD 205
           LG   +  + P  +GF   LG    H                     D     + +   +
Sbjct: 126 LGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPSTPCDGSCDQGLVPIPLLA-N 184

Query: 206 MRRDLEPAW--DLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQA 262
           +  + +P W   L  +Y      A A D++ +      P FLY                 
Sbjct: 185 LSVEAQPPWLPGLEARY-----VAFARDLMADAQRQGRPFFLY----------------- 222

Query: 263 PDHYLNIHRHIEDFK---------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
              Y + H H   F          R  F   L +LD +VG ++ A+    +L  ++++F 
Sbjct: 223 ---YASHHTHYPQFSGQSFSGRSGRGPFGDSLMELDAAVGALMTAVGDLGLLGETLVIFT 279

Query: 314 SDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWL 373
           +DNG      +    S   LR  K T +EGGVR   L + P   + G+  E    + D L
Sbjct: 280 ADNGPETMRMSHGGCSGL-LRCGKGTTFEGGVREPALAFWPGHIAPGVTHELASSL-DLL 337

Query: 374 PTLLSAANKSDIPNYVNSTVE 394
           PT L+A   + +PN     V+
Sbjct: 338 PT-LAALTGAPLPNVTLDGVD 357


>gi|121609253|ref|YP_997060.1| sulfatase [Verminephrobacter eiseniae EF01-2]
 gi|121553893|gb|ABM58042.1| sulfatase [Verminephrobacter eiseniae EF01-2]
          Length = 442

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 111/346 (32%), Positives = 166/346 (47%), Gaps = 31/346 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIP----TPNIDALAYSGIIL-KNYYTVQLCTPSRSAI 114
           P+IIFI+ADDLG+ D+G +G         +P +DALA  G+ L + Y    +C+P+R A+
Sbjct: 5   PNIIFIVADDLGYADLGCYGARAAGFGPVSPTLDALAAGGLRLTQGYSNSPVCSPTRFAL 64

Query: 115 MTGKHPIHT-GMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
           MT ++     G     +    RG    GLP     LP  L+  GYRT ++GKWHLG Y  
Sbjct: 65  MTARYQYRLRGAAEEPINSQSRGSTTLGLPPEHPTLPSLLRGAGYRTALMGKWHLG-YPP 123

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
            + P   G+E   G  +G  DYF H +   +   L +    +P       Y TD+ T  A
Sbjct: 124 AFGPLRSGYEEFFGPMSGGVDYFTHCSSSGQH-DLYLGAQEQP----QDGYLTDLITEHA 178

Query: 230 VDIIHNHS----TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS---KFA 282
           +D +   +       P FL L + A H   P+E          +HRH+          + 
Sbjct: 179 LDYVARMAPGAKAGTPFFLSLHYTAPHW--PWETRDDQALAPQLHRHLFHLHGGSIHSYR 236

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
            ++H +DE +G+++  L Q  +  ++++VF SDNGG         + NWPL G K  L E
Sbjct: 237 RMIHHMDEGIGRLMALLAQHGLTRDTLLVFTSDNGGE------RFSDNWPLVGGKMDLTE 290

Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
           GG+R   +   P + + G V+ Q     DW  TLL AA  +   +Y
Sbjct: 291 GGIRVPWIAHWPAVIAPGGVSAQTCMTMDWSATLLDAATVAPDADY 336


>gi|264676341|ref|YP_003276247.1| sulfatase [Comamonas testosteroni CNB-2]
 gi|262206853|gb|ACY30951.1| sulfatase [Comamonas testosteroni CNB-2]
          Length = 442

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 104/336 (30%), Positives = 165/336 (49%), Gaps = 28/336 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQI------PTPNIDALAYSGI-ILKNYYTVQLCTPSRS 112
           P++IFILADDLG+ D+G  G           +P +DA+A  G+   + Y    +C+P+R 
Sbjct: 6   PNLIFILADDLGYADLGCTGARDAHNNATDVSPRLDAMAAQGLRFTRGYSNSSVCSPTRF 65

Query: 113 AIMTGK--HPIHTGMQHNV--LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
           A+ TG+  + +    +  +  ++G +  GLP     +   L++ GY T +VGKWHLG Y 
Sbjct: 66  ALATGRWQYRLRGAAEEPIASVHGDKVLGLPPDHPTVASLLRDAGYATALVGKWHLG-YP 124

Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
             + P   G+E   G+  G  DYF H     +    D   + EP       Y TD+ +  
Sbjct: 125 PHFGPRLSGYEEFYGFHAGGADYFAHCDPRGRP---DFWINEEP--HTEDGYLTDLLSRR 179

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPY---EPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
           AVD +   S ++P +L L ++A H   P+   + L+         +HI+      +  ++
Sbjct: 180 AVDFVKRQSAEQPFYLSLHYSAPHW--PWLTRDDLEESRRIGGFGKHIDGGSIDTYQRMI 237

Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
           H +DE +G +++ALE+  +  N++IVF SDNGG         ++ WP  G K  L EGG+
Sbjct: 238 HHMDEGIGWLLDALEENGLSENTLIVFTSDNGGE------RFSNTWPFVGQKMDLLEGGI 291

Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           R   L   P   + G V +      DW  T L+AA 
Sbjct: 292 RVPLLARWPARMTAGAVCDTPSLTMDWSATFLAAAG 327


>gi|87309459|ref|ZP_01091594.1| arylsulphatase A [Blastopirellula marina DSM 3645]
 gi|87287767|gb|EAQ79666.1| arylsulphatase A [Blastopirellula marina DSM 3645]
          Length = 457

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 120/373 (32%), Positives = 180/373 (48%), Gaps = 42/373 (11%)

Query: 28  LGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPN 87
           + YR  I A  VL     L+   +   A+   P+I+FIL DD+G  D G +G     TP+
Sbjct: 5   MNYRQCIAAILVL-----LASGALHSDAAPTKPNIVFILIDDMGCKDAGCYGATNFSTPH 59

Query: 88  IDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER---GG------- 137
           ID LA  G+   + Y   +C+P+R+++MTGKHP    +  N +    R    G       
Sbjct: 60  IDRLANQGMRFTDAYAAPVCSPTRASLMTGKHPARLHLT-NFIPQIGRQLPAGKLIPPGF 118

Query: 138 ---LPLSEKILPQYLKELGYRTRIVGKWHLG-FYKKEYTPTFRGFES-HLGYWTGHQDYF 192
              LPL EK + Q L   GY+  ++GKWHLG  +  EY P  RGF+   L    G  +YF
Sbjct: 119 NHVLPLDEKTIAQELHADGYQCAMIGKWHLGEEHGPEYRPQNRGFDRVVLSEHHGIFNYF 178

Query: 193 DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATH 252
               ++ K W         P     G Y  D  T EA+D +  +  + P FLYL+H + H
Sbjct: 179 YPFVDQQK-WPYAGPLPGNP-----GDYLPDRLTDEAIDFVRENR-ERPFFLYLSHWSVH 231

Query: 253 SANPYEPLQAPDHYLNIHRH--IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
                    AP+  +  +R   +E+ + + +AA++  +D SVG+++  L++  +  N++ 
Sbjct: 232 GR-----YFAPESLIAKYRERGLEE-RPAIYAAMMETVDNSVGRLMATLDELNLADNTLF 285

Query: 311 VFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVS 370
           VF+SDNGG          S  PLRG K +L+EGGVR   ++  P +          V   
Sbjct: 286 VFMSDNGGE------RITSMAPLRGSKGSLYEGGVRVPLIVRYPGVVKPNTTCSVPVISH 339

Query: 371 DWLPTLLSAANKS 383
           D  PT L  A +S
Sbjct: 340 DLFPTFLDFAERS 352


>gi|430746414|ref|YP_007205543.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430018134|gb|AGA29848.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 590

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 116/349 (33%), Positives = 170/349 (48%), Gaps = 54/349 (15%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PP+++ ILADD GW D+  HG   + TPNID+LA  G + + +Y   +C P+R+  +TG+
Sbjct: 23  PPNVVVILADDQGWGDLSVHGNTNLKTPNIDSLARDGALFERFYVCPVCAPTRAEFLTGR 82

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +    G++  V  G ER  L L+EK + +  K  GY T   GKWH G  +  Y P  RGF
Sbjct: 83  YHPRGGVR-GVTSGGER--LDLNEKTIAETFKSAGYATGAFGKWHNG-TQFPYHPNARGF 138

Query: 179 ESHLGYWTGH-QDYFD----HSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
           + + G+ +GH  +YFD    H+   ++  G                + TD  T  A+  I
Sbjct: 139 DEYYGFTSGHWGEYFDPPLEHNGRPVQGNG----------------FITDDLTDHAISFI 182

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY--------LNIHRH---IEDFKRSKFA 282
              S D P F YL     HS     P+Q PD +        L +  H    ED   ++ A
Sbjct: 183 -KASKDRPFFCYLPFNTPHS-----PMQVPDRFYDKFKNAALKLRAHDAEREDLMMTRAA 236

Query: 283 -AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
            A+   +DE+VG+V++ LE   +   +I+++ SDNG  +  +N        +RG K +  
Sbjct: 237 LAMCENIDENVGRVLKTLEDLSLDQKTIVLYFSDNGPNSWRWNGG------MRGRKGSTD 290

Query: 342 EGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAAN---KSDIP 386
           EGGVR   LI W   +     VA+    + D LPTL   A    KSD P
Sbjct: 291 EGGVRSPLLIRWPKQIRPETRVAKISAAI-DLLPTLTDLAGIPVKSDKP 338


>gi|449680623|ref|XP_004209633.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
          Length = 327

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 84/165 (50%), Positives = 120/165 (72%), Gaps = 4/165 (2%)

Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANP-YEPLQAPDHYLNIHRHIEDFKRSKFA 282
           ++T++A+  I NH+  EPLFLYLA  A HS+N   EPLQAP  +++   +I+  +R K+A
Sbjct: 1   MYTSKAISYITNHNCSEPLFLYLAFQAVHSSNSEVEPLQAPQEWIDKFSYIKHEQRRKYA 60

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
           A+L  +D  +G+V EAL +++ML NS+I+F +DNGG + GFN N A+N+PLRGVK TL+E
Sbjct: 61  AMLAYMDYGIGRVHEALIEKKMLDNSVIIFTTDNGGPSNGFNNNWANNFPLRGVKATLFE 120

Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDI 385
           GGVRG G ++S L+E +  V+   +HV+DWLPTL+S A  N SDI
Sbjct: 121 GGVRGVGFVYSKLIE-KSRVSHDLIHVTDWLPTLVSLAGGNVSDI 164



 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 64/126 (50%), Gaps = 13/126 (10%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDE-WQISALTRGKWKLVKENSINGNG----TSEN 564
           +DG D W+ L   + S+RN IL NID+  W+  AL  G WKL+KE   N +G     S N
Sbjct: 167 LDGFDQWATLQNKQSSQRNEILLNIDEHVWKNEALRVGSWKLIKEGLKNWDGWYPPPSYN 226

Query: 565 RSNDNSYQNEI--DGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVM 622
            S D S  + I   G D   V++  +      + H ++D  +I  L+K   K   V+  M
Sbjct: 227 ESMDESLNSSILKCGHDFPVVINHCD---DYCLFHVVEDPCEIVDLSK---KFPDVLATM 280

Query: 623 RYQVDL 628
            Y++++
Sbjct: 281 MYRLNI 286



 Score = 48.1 bits (113), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 60/132 (45%), Gaps = 33/132 (25%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDE-WQISALTKGKWKLVKVVKVMRYQVDLTGGPD 633
           +DG D W+ L   + S+RN IL NID+  W+  AL  G WKL+K                
Sbjct: 167 LDGFDQWATLQNKQSSQRNEILLNIDEHVWKNEALRVGSWKLIK---------------- 210

Query: 634 QVYLSGLSDRE-WLAL-----AMRKLRDAASIQCG---PVKEVPCEPQIAPCLFDIKNDP 684
                GL + + W        +M +  +++ ++CG   PV    C+     CLF +  DP
Sbjct: 211 ----EGLKNWDGWYPPPSYNESMDESLNSSILKCGHDFPVVINHCDDY---CLFHVVEDP 263

Query: 685 CEKNNLADRSED 696
           CE  +L+ +  D
Sbjct: 264 CEIVDLSKKFPD 275


>gi|340622362|ref|YP_004740814.1| chondroitinsulfatase [Capnocytophaga canimorsus Cc5]
 gi|339902628|gb|AEK23707.1| Chondroitinsulfatase [Capnocytophaga canimorsus Cc5]
          Length = 456

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 102/323 (31%), Positives = 165/323 (51%), Gaps = 37/323 (11%)

Query: 48  MVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQL 106
           + F+ ++ +   P+IIFILADD+G  DV  +G   I TPNI  L+  G+   ++Y    +
Sbjct: 2   LFFLGVIQAQDKPNIIFILADDMGIGDVSPYGQKTIQTPNIQKLSDEGMTFSDFYAGSTV 61

Query: 107 CTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELGYRTRIVGKWHL 164
           C PSR++++TG+H  HT ++ N       G  PL +K  I P+ LK+ GY+  I GKW +
Sbjct: 62  CAPSRASMLTGQHTGHTKVRGN-------GEYPLDDKKQIFPEILKKAGYQNAIFGKWGM 114

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD--LEPAWDLHGKYST 222
           G      TP  RGF++  G+   H D    + +      LD+ R   LE      G Y+ 
Sbjct: 115 GLKNSPSTPLSRGFDAFAGFLH-HIDAHFQTPDS-----LDVIRQGRLERIALQQGAYAN 168

Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLA----HAATHSANPYEPLQAPDHYLNIHRH------ 272
           D F  + +D I  ++   P F+YL+    HA    A+ +   Q   +  +IH +      
Sbjct: 169 DYFLNQTLDFIDKNANKSPFFIYLSLTVPHAELSVADIHYEKQFDSNGTSIHPNEKAFKG 228

Query: 273 ----IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN-----GGAAAGF 323
                ++F ++ +AA++  +D  VG++++ +  + +  N+II+F SDN     GG  A  
Sbjct: 229 GHYGAQEFPKAAYAAMVSSIDYYVGQILQKVADKNIDDNTIIIFSSDNGTHVEGGRTAQD 288

Query: 324 NLNAASNWPLRGVKNTLWEGGVR 346
                S+   RGVK  L+EGG+R
Sbjct: 289 VAYFQSSGEYRGVKRDLYEGGIR 311


>gi|126339031|ref|XP_001366628.1| PREDICTED: arylsulfatase A [Monodelphis domestica]
          Length = 506

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 113/371 (30%), Positives = 176/371 (47%), Gaps = 34/371 (9%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           ++++L++ F   +A + PP+ I I ADDLG+ D+G +G     TPN++ +A  G+ L ++
Sbjct: 3   ISWSLTLAFTTSLAFARPPNFILIFADDLGYGDLGCYGHPSSSTPNLNRMAAKGLRLTDF 62

Query: 102 YT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
           Y  V LCTPSR+A++TG++P+ +G+   VLY    GGLPLSE  + + L E GY T + G
Sbjct: 63  YVPVPLCTPSRAALLTGRYPVRSGLYPGVLYPGSLGGLPLSEVTMAEVLAERGYLTGMAG 122

Query: 161 KWHLGFYK-KEYTPTFRGFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-------MR 207
           KWHLG      + P  +GF   LG    H     Q+        +   G D       + 
Sbjct: 123 KWHLGVGPNGSFLPPHQGFHRFLGVPYSHDQGPCQNLTCFPPATLCYGGCDQGVVPIPLM 182

Query: 208 RDL----EPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
            +L    +PAW L G     V  A+   I+      +P FLY A   TH           
Sbjct: 183 ANLSVKEQPAW-LPGLEKQYVAFAQNF-IVDAARQGQPFFLYYASHHTHYPQ-------- 232

Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
                    +    R  F   L +LD +VG +++ +E   +   +++ F +DNG      
Sbjct: 233 ---FGEETFVGRSGRGPFGDALMELDSAVGTLLDTVENLGLSEETLVFFTADNGPETKRM 289

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
               +S   LR  K T +EGG+R   L + P     G V+ +     D LPT+ + A+ +
Sbjct: 290 FRGGSSGL-LRCGKGTTYEGGIREPALAFWPGHIMPG-VSHELASSLDLLPTMAALAS-A 346

Query: 384 DIPNYVNSTVE 394
            +PN     V+
Sbjct: 347 HLPNVTLDGVD 357


>gi|444523022|gb|ELV13435.1| Arylsulfatase J [Tupaia chinensis]
          Length = 383

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 78/181 (43%), Positives = 109/181 (60%), Gaps = 10/181 (5%)

Query: 201 MWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
           M G D+  +   AWD  +G YST ++T     I+ +H+  +P+FLY+A+ A HS     P
Sbjct: 1   MCGYDLYENDHAAWDQDNGIYSTQMYTQRVQQILASHNPAKPIFLYIAYQAVHS-----P 55

Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           LQAP  Y   +R I +  R ++AA+L  LDE++  V  AL+      NSI+++ SDNGG 
Sbjct: 56  LQAPGRYFEHYRSIININRRRYAAMLACLDEAIHNVTLALKTYGFYDNSILIYSSDNGGQ 115

Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
                    SNWPLRG K T WEGG+R  G + SPLL++RG V  + VH++DW PTL+S 
Sbjct: 116 PTA----GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNRGTVCRELVHITDWYPTLISL 171

Query: 380 A 380
           A
Sbjct: 172 A 172



 Score = 40.8 bits (94), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 45/184 (24%), Positives = 76/184 (41%), Gaps = 28/184 (15%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-----ALTKGKW--KLVKVVKVMRYQV 626
           ++DG D+W  +S    S R  ILHNID  +  +     A   G W   +   ++V  +++
Sbjct: 181 QLDGYDIWETISEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKL 240

Query: 627 DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVK----EVPCEPQIAPCLFDIKN 682
            LTG P      G SD  W+          A    GP +     V      +  LF+I  
Sbjct: 241 -LTGNP------GYSD--WVP-------PQAFSNLGPTRWHNERVTLSTGKSVWLFNITA 284

Query: 683 DPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKK 742
           DP E+ +L+ R     +      + +FN+ A P +   ++ +   +          K++ 
Sbjct: 285 DPYERVDLSSRYPGI-VKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVWGPWYKEEY 343

Query: 743 KKKK 746
           KKK+
Sbjct: 344 KKKQ 347


>gi|171910116|ref|ZP_02925586.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
          Length = 480

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 112/358 (31%), Positives = 170/358 (47%), Gaps = 44/358 (12%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+++  L DDLG  D+G  G     TP IDALA SG+   ++Y+   +C+P+R+A+MTGK
Sbjct: 26  PNVLLFLVDDLGSQDLGVEGSKFYETPAIDALAASGVRFSSFYSAHPVCSPTRAALMTGK 85

Query: 119 HPIHTGMQHNVLYGCERGG--LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
            P   G+     Y   + G  LP +E  + +     GY+T  VGKWHLG    +  P   
Sbjct: 86  VPQRVGITD---YIKPKSGVALPTAETTIGEAFAAQGYQTGYVGKWHLGEADAD-QPAQH 141

Query: 177 GFESHLGYWTGHQD---YFDHSAEEMK--MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
           GF+       G Q    Y+ +  ++ K  +W +    DLEP  +  G Y TD  T ++++
Sbjct: 142 GFQWTAAVNRGGQPASYYYPYRKKDGKDTLWDVP---DLEPGTE--GDYLTDALTGKSLE 196

Query: 232 IIHNHSTDEPLFLYLAHAATHSA---------------------NPYEPLQAPDHYLNIH 270
            +    T +P FL  +H A H+                       P  P++AP    +  
Sbjct: 197 FLKQRDTTKPFFLCFSHYAVHTPIQPPQELVAKYQAKAKAMYGDAPAPPVEAPFGAQSRP 256

Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA--AGFNLNAA 328
           R  +      +AA+L  LD ++G+V++ALE++++   +I+VF SDNGG    A       
Sbjct: 257 RQDD----PAYAALLENLDTNIGRVLKALEEQKLREKTIVVFTSDNGGLCTLAKGRTGPT 312

Query: 329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           SN PLR  K   +EGG+R    I  P     G V     +  D  PTLL  +    +P
Sbjct: 313 SNLPLRCGKGWNYEGGIRTPCYISWPGHLKGGTVIGTPAYTPDLYPTLLELSGLPQLP 370


>gi|239816550|ref|YP_002945460.1| sulfatase [Variovorax paradoxus S110]
 gi|239803127|gb|ACS20194.1| sulfatase [Variovorax paradoxus S110]
          Length = 438

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 117/394 (29%), Positives = 184/394 (46%), Gaps = 40/394 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIP----TPNIDALAYSGIIL-KNYYTVQLCTPSRSAI 114
           P+IIFI+ADDLG+ D+G +G         +P +D LA +G+ L + Y    +C+P+R A+
Sbjct: 4   PNIIFIVADDLGYADLGCYGGRDAAFGPVSPVLDGLAANGLKLTQGYSNSPVCSPTRFAM 63

Query: 115 MTGKHPIHT-GMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
           +T ++     G     +    RG    GLP     LP  LK  GY+T ++GKWHLG Y  
Sbjct: 64  ITARYQYRLRGAAEEPINSKSRGSTTLGLPTEHPTLPSLLKAGGYQTALIGKWHLG-YPP 122

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDH--SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
            + P   G++   G  +G  DYF H  S     +W  +  +  E        Y TD+ + 
Sbjct: 123 TFGPLRSGYDEFFGPMSGGVDYFTHCDSTGRHDLWFGEEDKQEE-------GYLTDILSK 175

Query: 228 EAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK---FAA 283
            AVD +   +  D P FL L + A H   P+E     D    +  ++ D        +  
Sbjct: 176 RAVDYVERMAKQDAPFFLSLHYTAPHW--PWETRDDADKAPLVKDNLFDLAGGNIHVYRR 233

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
           ++H +DE +G ++ AL +  M  N+++VF SDNGG         + NWPL G K  L EG
Sbjct: 234 MIHHMDEGIGWIMAALRKHGMADNTLVVFTSDNGGE------RFSDNWPLVGGKMDLTEG 287

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
           G+R   +   P + ++G  + Q     DW  T+L AA      +Y    V +++P  +++
Sbjct: 288 GIRVPWIAHWPAVIAKGGESRQLCMTMDWSATMLDAAGVKADADYPLDGV-SLMPVLKDA 346

Query: 404 ILRYENGTHEYNSPRIENSNTRYENGTHEYNPKY 437
                   H +  P     N R +    + + KY
Sbjct: 347 -------RHSFRRPLHWRMNHRGQQAMRDGDWKY 373


>gi|340620655|ref|YP_004739108.1| sulfatase [Zobellia galactanivorans]
 gi|339735452|emb|CAZ98829.1| Sulfatase, family S1-17 [Zobellia galactanivorans]
          Length = 609

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 118/360 (32%), Positives = 177/360 (49%), Gaps = 50/360 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P++I + ADD G+ D+  HG   I TPN+DA A   + L N++    C PSR+ +MTG++
Sbjct: 40  PNVIIVFADDQGYGDLASHGNPYIKTPNLDAFAKESLELTNFHVGTTCAPSRAGLMTGRN 99

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
                  H +  GC    L   EK + +  KE GY T + GKWHLG     + P  RGFE
Sbjct: 100 GNRNNAWHTIA-GCSI--LLEDEKTMAEVFKEGGYNTAMFGKWHLG-DNYPFRPHDRGFE 155

Query: 180 SHL-----------GYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
             L            YW  +  YFD +            R+ EP   L G Y TDV+  E
Sbjct: 156 HALYNGGGGVQQTPDYW--NNTYFDDT----------YFRNGEPV-KLEG-YCTDVWFNE 201

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH--IEDFKRSKFAAILH 286
           A+  I   + DEP FLYL   A HS     P   P+ Y  ++ +  + D+++ +F  ++ 
Sbjct: 202 AIKHIQ-RTEDEPFFLYLPLNAAHS-----PFNVPESYAKMYENEPLADYQK-RFYGMIS 254

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN-----AASNWPLRGVKNTLW 341
            +DE+ GK+V  L+ R++  N+I+++ +DN G AAG   +        N  L+G K + +
Sbjct: 255 NIDENFGKLVRFLKDRKLFDNTIVIYTTDN-GTAAGIKTDKDGNVTGYNAGLKGTKGSHY 313

Query: 342 EGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVE--NIIP 398
           +GG +    I W      +G V  + V   D LPTL   A+ S IP  V+  ++  N+ P
Sbjct: 314 DGGHKVPFFISWPDGDIKKGSVNNELVANVDLLPTL---AHMSGIPFEVDKPLDGANVAP 370


>gi|406830821|ref|ZP_11090415.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 497

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 104/320 (32%), Positives = 155/320 (48%), Gaps = 18/320 (5%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           ++A ++  L FT       L A+  PP+II I+ DD G+ D   HG   + TP++D L  
Sbjct: 22  VLAASICLLLFTWGGRLPRLHAA--PPNIIVIITDDQGYGDFSCHGNPVLKTPHLDRLHD 79

Query: 94  SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
            G+   +++    C P+RSAIMTG+H    G+ H +L   ER  L L+   LPQ LK +G
Sbjct: 80  EGVRFLDFHVSPTCAPTRSAIMTGRHEFRNGVTHTIL---ERERLTLTATTLPQVLKTVG 136

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTG--HQDYFDHSAEEMKMWGLDMRRDLE 211
           Y T I GKWHLG  +  Y P  RGF+    +  G   Q Y     +       D      
Sbjct: 137 YTTGIFGKWHLG-DEPAYRPGQRGFDETFIHGGGGIGQTYPGSCGDAPGNTYFDPAIMHN 195

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA-PDHYLNIH 270
             ++    Y TDVF  +A+  I+     +P F Y+A  A     P+ PLQ  P+      
Sbjct: 196 GMFEKTQGYCTDVFFGQAMTWINTVKGRQPFFCYIATNA-----PHGPLQVRPEDEQRYR 250

Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASN 330
             + D   +KF  ++  +D++VG+++  L    +   +++VF++DNGG        A  N
Sbjct: 251 GKVADSNVAKFFGMISNIDDNVGRLLAKLTDWGIERETLVVFMNDNGGTVG----VAIHN 306

Query: 331 WPLRGVKNTLWEGGVRGAGL 350
             +RG K T W GG R A  
Sbjct: 307 AGMRGQKGTPWLGGTRAASF 326


>gi|256421633|ref|YP_003122286.1| sulfatase [Chitinophaga pinensis DSM 2588]
 gi|256036541|gb|ACU60085.1| sulfatase [Chitinophaga pinensis DSM 2588]
          Length = 452

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 97/321 (30%), Positives = 164/321 (51%), Gaps = 13/321 (4%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+++ I  DD G  DV  +G   + TPNID LA  G++   +Y    +C+PSR++++TG+
Sbjct: 28  PNVLIIYTDDQGTLDVNCYGAKDLHTPNIDRLAKEGVLFSQFYAAAPVCSPSRASLLTGR 87

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +P    + +N        G+P S+  + +  K+ GY T  +GKWH+G Y  E  P  +GF
Sbjct: 88  YPQRAQLDNNAPSEEGHAGMPGSQYTMAEMFKDGGYTTAHIGKWHIG-YSPETMPNQQGF 146

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH-NHS 237
           +   G+  G  D + H          D+ R+ +  W+  GK+  D+   E    +  N  
Sbjct: 147 DYSFGFMGGCIDNYSHYFYWAGPNRHDLWRNGQEIWE-DGKFFADLTVQEVNGFLEKNKR 205

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
            D+P FLY A        P+ PLQ  + +   ++ +   +R  +AA +  +DE +G+V++
Sbjct: 206 ADKPFFLYWAINM-----PHYPLQGQEKWRQYYKDLPAPRR-MYAAAVSTMDEKIGQVLQ 259

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS-NWPLRGVKNTLWEGGVRGAGLI-WSPL 355
            L++  +  N+I+VF SD G +    +        P RG K +L+EGG+R   +I W+  
Sbjct: 260 QLDRLGLAENTIVVFQSDQGHSTEDRSFGGGGFTGPYRGAKFSLFEGGIRVPAIIRWTGH 319

Query: 356 LESRGIVAEQYVHVSDWLPTL 376
           L    +  +  V++ DW PTL
Sbjct: 320 LPKNEVRDQLCVNI-DWYPTL 339


>gi|417303302|ref|ZP_12090360.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
 gi|327540274|gb|EGF26860.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
          Length = 419

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 102/338 (30%), Positives = 164/338 (48%), Gaps = 59/338 (17%)

Query: 98  LKNYYTV-QLCTPSRSAIMTGKHPIHTGM------------------QHNVLYGCER-GG 137
           + N+Y     CTPSR++++TG++P   G+                  ++ V +  ER GG
Sbjct: 1   MTNFYVAWPACTPSRASLLTGRYPQRNGIYDMIRNEAPDFGHRYTPEEYAVTF--ERIGG 58

Query: 138 LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE 197
           +   E ILP  L+  GY++ I GKW LG  ++   PT RGF+   G+     DYF H   
Sbjct: 59  MDEREVILPAVLRPAGYKSGIYGKWDLGALRR-MLPTSRGFDDFYGFVNTGIDYFTHERY 117

Query: 198 EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
            +      M R+LEP     G Y T +F  EA+  +  H+ +EP FLY+   A H+++  
Sbjct: 118 GVPC----MVRNLEPTEADKGTYCTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSL 173

Query: 258 EP-----LQAPDHYLNIHRHIE--------------------DFKRSKFAAILHKLDESV 292
           +P     +QAPD +  ++  +E                      +R  + A +  +D ++
Sbjct: 174 DPTIRSSVQAPDQFKAMYPTVEVETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAI 233

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI- 351
           G++++ LE ++ML  +I+VF SDNGG+        A N PLRG K   WEGG+R   L+ 
Sbjct: 234 GEMLDRLEAKQMLDETIVVFFSDNGGS------GGADNAPLRGHKAQTWEGGIRVPCLVR 287

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYV 389
           W       G+V ++++   + LP+  +AA     P  V
Sbjct: 288 WPAGQIPAGVVNDEFLTSLELLPSFAAAAGVEPPPGVV 325


>gi|443722642|gb|ELU11403.1| hypothetical protein CAPTEDRAFT_152949 [Capitella teleta]
          Length = 486

 Score =  152 bits (384), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 107/339 (31%), Positives = 167/339 (49%), Gaps = 44/339 (12%)

Query: 65  ILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHT 123
           +L DD+GW D+G  G     TPN+D +A  G++L ++Y+   LC+PSR+A++TG+ PI  
Sbjct: 1   MLMDDMGWGDLGVFGHPAKETPNLDKMAAEGLLLPDFYSANPLCSPSRAALLTGRLPIRN 60

Query: 124 GMQHNVLYG-------CERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           G   +  +           GG+P  E + P+ L++ GY  +I+GKWHLG  + +Y P   
Sbjct: 61  GFYSDNAHARNAYTPQTIVGGIPDEEILFPELLQKTGYTNKIIGKWHLG-QQAQYHPLLH 119

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS-------------TD 223
           GF+   G    H   FD  ++      + + R+ E    + G+Y              T 
Sbjct: 120 GFDEWFGAPNCHFGPFDDKSQP----NIPVYRNAE----MIGRYYEEFEITKNGESNLTQ 171

Query: 224 VFTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
           ++T EA D I+  S+D +P  LY A  ATH     +PL A   +L         +R  + 
Sbjct: 172 MYTQEATDFIYRLSSDKQPFLLYWAVDATH-----DPLYASKKFLGTS------QRGLYG 220

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
             + +LD  VG++++ALE  ++L N++++F SDN GAA        SN P    K T  E
Sbjct: 221 DAMRELDWGVGEILKALETAQVLDNTLVIFSSDN-GAATYAKTGGGSNGPFLCGKETTLE 279

Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           GG+R   + + P     G V+ Q   + D   T L  A 
Sbjct: 280 GGMREPTIAYWPGKIKPG-VSHQLGSLMDIFTTTLDLAG 317


>gi|326802120|ref|YP_004319939.1| sulfatase [Sphingobacterium sp. 21]
 gi|326552884|gb|ADZ81269.1| sulfatase [Sphingobacterium sp. 21]
          Length = 471

 Score =  152 bits (384), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 118/367 (32%), Positives = 175/367 (47%), Gaps = 45/367 (12%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+IIFILADDLG+ DVGF+G   I TPN+D LA  GI    +Y    +C PSRS++M+G+
Sbjct: 26  PNIIFILADDLGYGDVGFNGQRLINTPNLDRLAKEGIRFNQFYAGTAVCAPSRSSLMSGQ 85

Query: 119 HPIHTGMQHNVLYGCERGG---LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           H  HT ++ N+  G E  G   +P S + + + LK+ GY+T   GKW LG    E  P  
Sbjct: 86  HTGHTFIRGNL--GVEPEGQYPIPDSVETIAETLKKAGYQTGAFGKWGLGPVGSEGDPNK 143

Query: 176 RGFESHLGY---WTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           +GF+   GY      H+ Y  H  +  +   L    DLE     H +Y+ D+   +A+  
Sbjct: 144 QGFDRFYGYNCQSLAHRYYPTHLWDNSEKVVLTENGDLE----YHKQYAPDLIQEQALGF 199

Query: 233 IHNHSTDEPLFLYLAHAATHS------------------ANPYEPLQAPDHYLNIHRHIE 274
           I  H  D P FL+L +   H+                    P++         +     +
Sbjct: 200 IEEHK-DRPFFLFLPYILPHAELLVPDDSIFQYYKGKFEEKPFKGFDYGKGAKSGGYASQ 258

Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN-----GGAAAGFNLNAAS 329
            +  S FAA++ +LD  VG++ E L    +  N++IVF SDN     GGA   F  ++  
Sbjct: 259 QYPHSTFAAMVARLDLYVGQIEEKLRSLGLDENTLIVFTSDNGPHKEGGADPNFFQSSGG 318

Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL-----LSAANKSD 384
              LRG+K  L+EGG+R   +   P     G  +E      D  PT       + A  +D
Sbjct: 319 ---LRGIKRDLYEGGIRVPFVAKWPKAIRAGQQSEFIGAFWDIFPTFAELGGAATAASTD 375

Query: 385 IPNYVNS 391
             ++VN+
Sbjct: 376 GISFVNT 382


>gi|299533802|ref|ZP_07047173.1| sulfatase [Comamonas testosteroni S44]
 gi|298718218|gb|EFI59204.1| sulfatase [Comamonas testosteroni S44]
          Length = 442

 Score =  152 bits (384), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 106/341 (31%), Positives = 168/341 (49%), Gaps = 24/341 (7%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQI------PTPNIDALAYSGI-ILKNYYTVQLCTPSRS 112
           P++IFILADDLG+ D+G  G           +P +DA+A  G+   + Y    +C+P+R 
Sbjct: 6   PNLIFILADDLGYADLGCTGARDAHNNATDVSPRLDAMAAQGLRFTRGYSNSSVCSPTRF 65

Query: 113 AIMTGK--HPIHTGMQHNV--LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
           A+ TG+  + +    +  +  ++G +  GLP     +   L++ GY T +VGKWHLG Y 
Sbjct: 66  ALATGRWQYRLRGAAEEPIASVHGDKVLGLPPDHPTVASLLRDAGYATALVGKWHLG-YP 124

Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
             + P   G+E   G+  G  DYF H     +    D   + EP       Y TD+ +  
Sbjct: 125 PHFGPRLSGYEEFYGFHAGGADYFAHCDPRGRP---DFWINEEP--HTEDGYLTDLLSRR 179

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL-NIHRHIEDFKRSKFAAILHK 287
           AVD +   S ++P +L L ++A H        +A    L  + +HI+      +  ++H 
Sbjct: 180 AVDFVKRQSAEQPFYLSLHYSAPHWPWLTREDRAESERLQGMGKHIDGGSIDTYQRMIHH 239

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE +G +++ALE++ M  N++IVF SDNGG         ++ WP  G K  L EGG+R 
Sbjct: 240 MDEGIGWLLDALEEKGMSENTLIVFTSDNGGE------RFSNTWPFVGQKMDLLEGGIRV 293

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
             L   P   + G V +      DW  T L+AA  +   +Y
Sbjct: 294 PLLARWPARMTAGAVCDTPSLTMDWSATFLAAAGVAAHADY 334


>gi|196230145|ref|ZP_03129008.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196225742|gb|EDY20249.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 487

 Score =  152 bits (383), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 111/337 (32%), Positives = 167/337 (49%), Gaps = 38/337 (11%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRS 112
           +A+  PP++I I+ADDLG+ DVG +G     TPN D LA+ G    +++  Q +C+ SR+
Sbjct: 27  LAADRPPNVILIVADDLGYADVGVYGAKGFETPNFDRLAHEGRRFTDFHVAQAVCSASRA 86

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
           AIMTG +P   G++   +    + G+   E  +PQ  K  GY T +VGKWHLG    E+ 
Sbjct: 87  AIMTGCYPNRIGIE-GAMEPWYKFGISDQELTMPQMFKRKGYATGMVGKWHLG-TPTEFL 144

Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDM------------RRDLEPAWDLHGKY 220
           PT RGF+   G    +  +  H  +  K   L +             RD+E         
Sbjct: 145 PTHRGFDEWFGLPYSNDQWPLHPEKPGKFPPLPLYEGDKVINPGINHRDME--------Q 196

Query: 221 STDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
            T  +T  AV+ I  NH  D+P FLY+A        P+ PL   D +          KR 
Sbjct: 197 LTTQYTERAVNFIDRNH--DKPFFLYVAQTM-----PHVPLAVSDKFRGAT------KRG 243

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
            +   + ++D S G+++ AL++  +  ++II+F+SDNG     F  +A S +PLR  K T
Sbjct: 244 LYGDAVEEIDWSTGEILAALKKYGLEKDTIIMFLSDNGPWLI-FGNHAGSAYPLREGKTT 302

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
            W+GG R   +I  P     G V+ +     D +PTL
Sbjct: 303 TWDGGTRVPFIISWPGHIPAGTVSTEMACAIDLMPTL 339


>gi|443704175|gb|ELU01350.1| hypothetical protein CAPTEDRAFT_214223 [Capitella teleta]
          Length = 336

 Score =  152 bits (383), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 89/252 (35%), Positives = 136/252 (53%), Gaps = 11/252 (4%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II+  ADD G+ D G+   D I TPNID L   GI   N Y+ Q C+PSRS+ ++G++
Sbjct: 94  PNIIWFFADDYGYQDAGYRNSD-IHTPNIDQLVADGISFTNAYSAQQCSPSRSSFLSGRY 152

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
              +GMQH V+       + L    L  YLKEL Y T   GKWHLG+  KE TPT+RGF+
Sbjct: 153 AYTSGMQHGVISDTAAHCMDLKYNFLSDYLKELNYNTHASGKWHLGYCNKECTPTYRGFD 212

Query: 180 SHLGYWTGHQDYFDHSA-EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           +  G ++G   Y++H+  + M  W    + D   +    G +S D+  A+ ++ + + + 
Sbjct: 213 TFSGGYSGEGKYYEHTTFQGMYDWHNGTKLDRSAS----GIHSQDLIEADLLNQL-DEND 267

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
             PLF Y A   THS  P +P   P+H            R K+  ++  +D  +GKVV  
Sbjct: 268 GTPLFYYAAFQNTHS--PMQP--KPEHEALYDDLDAPSTRKKYLGLVSGMDAVIGKVVAK 323

Query: 299 LEQRRMLSNSII 310
           L+++ M  N+++
Sbjct: 324 LKEKDMFDNTVM 335


>gi|323449289|gb|EGB05178.1| hypothetical protein AURANDRAFT_72312 [Aureococcus anophagefferens]
          Length = 514

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 98/275 (35%), Positives = 134/275 (48%), Gaps = 41/275 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLD-QIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PHI+F+LADDLG+NDVG+  LD    TPN+DA+   G+ L + Y    CTPSR+A++T +
Sbjct: 103 PHIVFVLADDLGYNDVGYGSLDLHACTPNLDAMWRDGVELTSLYAAPTCTPSRAALLTAR 162

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           +P   GMQH  L      GL  S   L +  +  GY T  VGKWHLG + +   PT RGF
Sbjct: 163 YPTTLGMQHWQLEAAAPYGLDSSTATLGEVFQRQGYATVFVGKWHLGHFSEAVLPTRRGF 222

Query: 179 ESHLGYWTGHQDYFDHSAEEM---------------------KMWGL---DMRRDLEPAW 214
           E   G++TG ++YF   +E                         W     +   DL    
Sbjct: 223 EKFYGFYTGGENYFTRVSEGACAPVVNTSVPQVSEVGDIAKECFWDAIDNEAHEDLPAPE 282

Query: 215 DLHGK-YSTDVFTAEAVDIIHNHST------DEPLFLYLAHAATHSANPYEPLQAPDHYL 267
           +LHGK +ST  F   A D++  H +       +P+ L LA       NP+ PL AP    
Sbjct: 283 ELHGKSHSTYAFAMRASDLVARHGSFSGGGLAQPILLVLA-----LPNPHVPLLAPAAVF 337

Query: 268 NIH----RHIEDFKRSKFAAILHKLDESVGKVVEA 298
             H    R I + +R  FAA+    DE++G V  A
Sbjct: 338 RTHDAILRQITNNQRRTFAALTILWDEALGNVSAA 372


>gi|291231158|ref|XP_002735532.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 191

 Score =  152 bits (383), Expect = 1e-33,   Method: Composition-based stats.
 Identities = 74/175 (42%), Positives = 107/175 (61%), Gaps = 9/175 (5%)

Query: 35  MAFAVLPLAFTLSMVFVDLVAS-SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           M+ +VL    T+ ++F+   A     P+I+FI ADD GWND+G+H      TPN+++LA 
Sbjct: 1   MSISVL---LTVPVLFLSAFARCDSQPNIVFIFADDYGWNDIGYHN-PIFQTPNLNSLAA 56

Query: 94  SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
            GI L+NYY   +CTPSR  ++TG++ +  G+ H  +   +R  LPL E  LP+ +K+ G
Sbjct: 57  DGIKLENYYVAPVCTPSRGQLLTGRYAMRYGLVHRNIRPAQRMCLPLDEVTLPEKMKQAG 116

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
           Y T +VGKWH GFY     PT RGF+S  G++   +DYF HSA      G D++R
Sbjct: 117 YATHMVGKWHQGFYTPACIPTQRGFDSFFGFYICTEDYFTHSASG----GFDLKR 167


>gi|149196404|ref|ZP_01873459.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
           araneosa HTCC2155]
 gi|149140665|gb|EDM29063.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
           araneosa HTCC2155]
          Length = 456

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 110/350 (31%), Positives = 171/350 (48%), Gaps = 34/350 (9%)

Query: 42  LAFTLSMVFVDLVASSG-PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
           + +    VF+   A+S   P+IIFI+ DD+G+  +G +G   I TP +D +A  G+ L +
Sbjct: 1   MKYISVFVFLMFAANSADKPNIIFIMCDDMGYGQLGSYGQKMIKTPRLDQMAKEGLRLTD 60

Query: 101 YYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
           YY    +C PSR ++MTG+H  HT ++ N  Y   +  +P     + + +KE GY T ++
Sbjct: 61  YYAGTAVCAPSRCSLMTGQHVGHTYIRGNKEYPTGQEPIPAETITVAEKMKEAGYATALI 120

Query: 160 GKWHLGFYKKEYTPTFRGFESHLGY-----WTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
           GKW LG+   E  P  +GF+   GY        H   F    EE      +  +++E   
Sbjct: 121 GKWGLGYPGSEGEPNKQGFDYFFGYNDQKHAHNHFPKFLLRNEETLTLKNNSGKEIE--- 177

Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP---DHYLNIHR 271
                YS  + T EA   I  +  D P FLYLA+   HS      LQ P   + YL    
Sbjct: 178 -----YSQYMLTDEAKGFIKKNK-DNPFFLYLAYVIPHSR-----LQIPGDDECYLQYKD 226

Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG------FNL 325
                K+ K A ++ +LD+ VG +++ L++  +  N+++VF SDNG    G      FN 
Sbjct: 227 ESWPEKQKKHAGMISRLDKDVGSILDLLKEMNLAENTLVVFTSDNGAHREGGARPEFFN- 285

Query: 326 NAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
               + PL G+K +++EGGVR   +   P +   G V+       D +PT
Sbjct: 286 ---DSGPLSGIKRSMYEGGVRVPFIAHWPGVIKPGQVSNHIGAHWDLMPT 332


>gi|333912861|ref|YP_004486593.1| arylsulfatase [Delftia sp. Cs1-4]
 gi|333743061|gb|AEF88238.1| Arylsulfatase [Delftia sp. Cs1-4]
          Length = 441

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 111/346 (32%), Positives = 170/346 (49%), Gaps = 45/346 (13%)

Query: 60  PHIIFILADDLGWNDVGFHG---LDQIP-TPNIDALAYSGIIL-KNYYTVQLCTPSRSAI 114
           P+I+FI+ADDLG+ D+G +G    D  P +P +D LA  G+ L + Y    +C+P+R A+
Sbjct: 4   PNILFIVADDLGYADLGCYGGRAADFGPVSPMLDRLAAGGLRLTQGYANSPVCSPTRFAL 63

Query: 115 MTGKH----------PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
            T ++          PI++  +   L   E+ GLP     +   L++ GYRT ++GKWHL
Sbjct: 64  ATARYQYRLRGAAEEPINSKTRGTPL--GEKLGLPTDMPTVASMLRDAGYRTALIGKWHL 121

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDH--SAEEMKMWGLDMRRDLEPAWDLHGKYST 222
           G Y   + P   G+E + G  +G  DYF H  S+ +  +W  +     E        Y T
Sbjct: 122 G-YPPHFGPLRSGYEEYFGPMSGGVDYFTHLSSSGQHDLWVGEEEHHDE-------GYLT 173

Query: 223 DVFTAEAVDIIHNHST-DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR------HIED 275
           D+ +  +VD +H  +  D P FL L     H   P+ P +  D             H++ 
Sbjct: 174 DLLSQRSVDFVHRMAQGDAPFFLSL-----HYTAPHWPWETRDDRSTAEALGAGIAHLDG 228

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
               ++  ++H +DE +G +VEAL     L N++IVF SDNGG         + NWPL G
Sbjct: 229 GNIHQYRRMIHHMDEGIGWIVEALRANGQLDNTLIVFTSDNGGE------RFSDNWPLVG 282

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            K  L EGG+R   +   P + + G  + Q+    DW  T+L AA 
Sbjct: 283 GKMDLTEGGIRVPWIAHWPAVIAPGRSSPQHCMSMDWSATVLDAAG 328


>gi|332663784|ref|YP_004446572.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
 gi|332332598|gb|AEE49699.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
          Length = 580

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 137/473 (28%), Positives = 202/473 (42%), Gaps = 103/473 (21%)

Query: 31  RTRIMAFAVLPLAFTLSMVF---VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPN 87
           R  +  FA+  LAF+ S           +   P+II I+ DDLG++D G +G  +I TPN
Sbjct: 2   RPILFVFALF-LAFSASAQKKGKTTPAPTPSKPNIILIMVDDLGYSDFGAYG-SEIQTPN 59

Query: 88  IDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQH---NVLYGCERGGLPLSEKI 144
           ID LAY G+ LK +Y   +C P+R++++TG++P   G+ +   N+     +G L      
Sbjct: 60  IDKLAYGGLRLKEFYNNSICAPTRASLITGQYPHKAGLGYFNTNLGLPAYQGWLNQESLT 119

Query: 145 LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS-----AEEM 199
             + L++ GY T + GKWH+G     Y P  RGF    G+  G  +Y+D S     A  +
Sbjct: 120 FGEVLQQGGYNTYLTGKWHVG-NDSLYWPNQRGFNKFYGFIGGASNYYDISPYPEKAPPV 178

Query: 200 KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
           ++   + R +L P     GKY TD  T  A+  I N S D+P FLYLA  A     P+ P
Sbjct: 179 ELVENNQRINLAP-----GKYLTDEITNHALSYI-NESKDKPFFLYLAFNA-----PHWP 227

Query: 260 LQAPDHYLNIHRHIEDFK----------------------RSKFA--------------- 282
           LQAP    +I ++   +K                      R K A               
Sbjct: 228 LQAPAE--DIAKYKGKYKIGWDSLRAQRYRRQLALGIADPRQKVAERDPEVVAWESLTFD 285

Query: 283 -------------AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG------------ 317
                        A++  +D+SVG+VVE L++ +   N++IV +SDNG            
Sbjct: 286 EKELWQRKMEVYAAMVDHVDQSVGRVVEELKKLKKDDNTLIVLISDNGAQGGLNQLGQRR 345

Query: 318 --------GAAAGF-----NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAE 364
                   G+A  +     N    SN P R  K T  EGG+    + W P     G + +
Sbjct: 346 RQRNSGPIGSAGSYDYQEQNWAYVSNTPFRNYKATSHEGGISSPFIAWFPKKIKAGTITK 405

Query: 365 QYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSP 417
              H+ D  PT    A  +   NY      N +P      L +E      N P
Sbjct: 406 GTGHLIDLAPTFYDVAGVNYPTNY-RGVQTNTLPGVSLKNLLFEGNDLVRNEP 457


>gi|260824685|ref|XP_002607298.1| hypothetical protein BRAFLDRAFT_88247 [Branchiostoma floridae]
 gi|229292644|gb|EEN63308.1| hypothetical protein BRAFLDRAFT_88247 [Branchiostoma floridae]
          Length = 178

 Score =  152 bits (383), Expect = 1e-33,   Method: Composition-based stats.
 Identities = 72/142 (50%), Positives = 93/142 (65%), Gaps = 8/142 (5%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTP-------SRS 112
           P+I+FI+ADDLGWNDVG+H  D + TP +D LA  G+IL   Y   +CTP       SR+
Sbjct: 28  PNILFIVADDLGWNDVGWHNPD-VKTPVLDQLANEGVILNQSYVNYVCTPFPVVKSRSRT 86

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
           A MTG  P H G QH V +  +  G+P +   LP+ LK+LGY T +VGKWHLGF    YT
Sbjct: 87  AFMTGYFPYHVGTQHQVFFPFQAQGIPSNFSFLPEKLKDLGYATHMVGKWHLGFCNWNYT 146

Query: 173 PTFRGFESHLGYWTGHQDYFDH 194
           PT+RGF+S  GY+ G +DY+ H
Sbjct: 147 PTYRGFDSFFGYYNGAEDYYTH 168


>gi|325108958|ref|YP_004270026.1| sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324969226|gb|ADY60004.1| sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 499

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 115/385 (29%), Positives = 191/385 (49%), Gaps = 50/385 (12%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           ++A  +  + F       D   +S PP+++ I+ADDLG+ ++G +G   I TP +D LA 
Sbjct: 19  VVAMLIAIVMFAAESTAADKGEASPPPNVVLIMADDLGYQELGCYGQKWIRTPFVDQLAA 78

Query: 94  SGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHN----------VLYGCERGG---LP 139
            GI L  +Y+   +C PSR  +MTGKHP H  +++N            Y  E  G   LP
Sbjct: 79  EGIKLTQFYSGNAVCAPSRCCLMTGKHPGHAWVRNNGDPKLPEELREKYQWEFAGQFPLP 138

Query: 140 LSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEM 199
            SE  + + LK+ GY T  +GKW LG +     P  +G +   G+   +     H+    
Sbjct: 139 ESEVTVAELLKQKGYATAAIGKWGLGHFGTTGAPDRQGIDLFYGF---NSQRHAHNHYPR 195

Query: 200 KMWGLDMRRDLEPAWD--LHGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANP 256
            +W  D +++++P  D  L+G+ YS D FT  A++ I + + D+P FLYL  A  H +  
Sbjct: 196 FLWRND-QKEMQPGNDRTLNGETYSQDRFTEVALEFI-DQNKDQPFFLYLPFAIPHLS-- 251

Query: 257 YEPLQAPDHYLNIHRH---IEDFK----------RSKFAAILHKLDESVGKVVEALEQRR 303
              +Q P+  L  +      E++K          R+ +AA++  +D  +GK++  L +  
Sbjct: 252 ---IQVPESSLAEYTDEIPEEEYKHRGYLKHPKPRAGYAAMITHMDRDIGKIMARLNEYG 308

Query: 304 MLSNSIIVFVSDN-------GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL 356
           +  N++I+F SDN       GG+ + F  +A     +RG+K +L+EGG+R   +   P  
Sbjct: 309 LTDNTLIIFTSDNGPTYDRLGGSDSDFFESAGK---MRGLKGSLYEGGIRVPCVARWPGQ 365

Query: 357 ESRGIVAEQYVHVSDWLPTLLSAAN 381
              G  ++    + D  PT+  AA 
Sbjct: 366 IPAGNTSDLISALWDLFPTIAEAAG 390


>gi|262384881|ref|ZP_06078013.1| N-acetylgalactosamine 6-sulfatase [Bacteroides sp. 2_1_33B]
 gi|262293597|gb|EEY81533.1| N-acetylgalactosamine 6-sulfatase [Bacteroides sp. 2_1_33B]
          Length = 589

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 105/344 (30%), Positives = 166/344 (48%), Gaps = 49/344 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II +L+DD GW D+GF G   + TPNID +A+ G IL+N+Y   + +P+R+  +TG++
Sbjct: 27  PNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYVCPVSSPTRAEFLTGRY 86

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            + +G+ ++   G ER      EK + +Y +E GY T + GKWH G  +  Y P  RGFE
Sbjct: 87  HVRSGV-NSTTGGGER--FNQGEKTIAEYFREAGYATSLFGKWHSG-TQYPYHPNARGFE 142

Query: 180 SHLGYWTGH-----QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIH 234
              G+ +GH         +H+ E +   G                +  D  T +A+D I 
Sbjct: 143 EFYGFCSGHWGNYWNPVLEHNGEIISGEG----------------FIIDDLTDKALDYIR 186

Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF------------A 282
           +H  + P F++L++   HS     P+Q PD + N  +     +R+ F             
Sbjct: 187 DHK-EHPFFMFLSYNTPHS-----PMQVPDSWWNRVKDRTLSQRATFPEQEDTTFTKAAL 240

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
           A+   LD ++G+V+  L    +   +I+++ SDNG  +  F  N      ++G K +  E
Sbjct: 241 ALAENLDWNIGRVLSLLHSLDLEQETIVIYFSDNGPNS--FRWNGG----MKGRKGSTDE 294

Query: 343 GGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           GGVR    I  P    +G V  Q     D +PTLL  A     P
Sbjct: 295 GGVRSPFCIRWPGHIRKGAVETQLSGAIDLIPTLLGLAGIEYTP 338


>gi|326927530|ref|XP_003209945.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Meleagris
           gallopavo]
          Length = 485

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 101/332 (30%), Positives = 163/332 (49%), Gaps = 43/332 (12%)

Query: 70  LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
           +GW D+G  G     TPN+D +A  G++  ++Y    LC+PSR+A++TG+ P+  G    
Sbjct: 3   MGWGDLGAFGEPSKETPNLDQMASEGMLFLDFYAASPLCSPSRAALLTGRLPVRNGFYTT 62

Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
             +           GG+  SE +LP+ LK+ GY  +I+GKWHLG ++ ++ P   GF+  
Sbjct: 63  NAHARNAYTPQDIVGGIQDSEILLPELLKKAGYTNKIIGKWHLG-HRPQFHPLKHGFDEW 121

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
            G    H   +D+ A    +  + + RD    W++ G+Y               T ++  
Sbjct: 122 FGSPNCHFGPYDNRA----LPNIPVYRD----WEMIGRYYEDFKIDLKTGEANLTQIYLQ 173

Query: 228 EAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
           EA+D I    ++ +P FLY A  ATH+     P+ A  H+L         +R ++   + 
Sbjct: 174 EALDFISKQQASQQPFFLYWAIDATHA-----PVYASKHFLGTS------QRGRYGDAVR 222

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
           ++D+SVGK+++ L++  +  N+ + F SDNG A         SN P    K T +EGG+R
Sbjct: 223 EIDDSVGKILKHLQKLGISENTFVFFTSDNGAALISAPKQGGSNGPFLCGKQTTFEGGMR 282

Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
              + W P     G V+ Q   V D   T LS
Sbjct: 283 EPAIAWWPGHIPAGGVSRQLGSVMDLFTTSLS 314


>gi|294053911|ref|YP_003547569.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613244|gb|ADE53399.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 469

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 106/327 (32%), Positives = 163/327 (49%), Gaps = 19/327 (5%)

Query: 40  LPLAFTLSMVFVDLVASSGP-------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALA 92
           L    TLS+       ++ P       P++I I+ADDLG+ D+G+ G   IPTPNID LA
Sbjct: 5   LSTVLTLSIAACASTFAAHPTENRQDQPNVIVIMADDLGYGDLGYTGSKHIPTPNIDRLA 64

Query: 93  YSGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER--GGLPLSEKILPQYL 149
             G+     Y T Q C PSR+  +TG++    G + N  Y       G+P SE++  + L
Sbjct: 65  NEGVECTYGYVTHQYCGPSRAGFLTGRYQQRFGFETNPPYDRHNTIAGVPASERLFAERL 124

Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD 209
           + +GY+T IVGKWH+G +   +               GH  +   + E M    LD    
Sbjct: 125 QAVGYKTGIVGKWHIGSHSIHHPNNRGFDFFFGFLGGGHDFFRVDTREPMDEGYLDPMMR 184

Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
              + D+ G Y T   T EA+  I  +  D P FL+L++ A     P+ PLQAP+  +  
Sbjct: 185 NGSSVDVEG-YLTTQLTDEAIGFIERNEKD-PFFLFLSYNA-----PHAPLQAPEESIAK 237

Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA--GFNLNA 327
             H+E  +R  ++A+++++D  +G+++  LE R +   +II F+SDNGG         + 
Sbjct: 238 FSHVEGKERRVYSAMVYEMDLEIGRILSMLEDRDLAEETIIFFLSDNGGPPHWDKSQDSY 297

Query: 328 ASNWPLRGVKNTLWEGGVRGAGLIWSP 354
            SN   RG K   ++GGV    L + P
Sbjct: 298 TSNGAFRGYKGDTYDGGVHVPFLAYWP 324


>gi|333911967|ref|YP_004485699.1| N-acetylgalactosamine-6-sulfatase [Delftia sp. Cs1-4]
 gi|333742167|gb|AEF87344.1| N-acetylgalactosamine-6-sulfatase [Delftia sp. Cs1-4]
          Length = 442

 Score =  151 bits (382), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 105/336 (31%), Positives = 166/336 (49%), Gaps = 28/336 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQI------PTPNIDALAYSGI-ILKNYYTVQLCTPSRS 112
           P++IFILADDLG+ D+G  G           +P +DA+A  G+   + Y    +C+P+R 
Sbjct: 6   PNLIFILADDLGYADLGCTGARDAHSRATDVSPRLDAMAAQGMRFTRGYSNSAVCSPTRF 65

Query: 113 AIMTGK--HPIHTGMQHNV--LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
           A+ TG+  + +    +  +  ++G +  GLP     +   L++ GY T +VGKWHLG Y 
Sbjct: 66  ALATGRWQYRLRGAAEEPIASVHGDKVLGLPPDHPTVASLLRDAGYATALVGKWHLG-YP 124

Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
             + P   G+E   G+  G  DYF H     +    D   + EP       Y TD+ +  
Sbjct: 125 PHFGPRMSGYEEFYGFHAGGADYFAHCDPRGRP---DFWINEEP--HEEEGYLTDLLSRR 179

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPY---EPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
           AVD +   + D+P +L L ++A H   P+   + L+         +H +      +  ++
Sbjct: 180 AVDFVERQTADKPFYLSLHYSAPHW--PWLTRDDLEESRRIGGFGKHTDGGTIETYQRMI 237

Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
           H +DE +G V++ALE++ M  N++IVF SDNGG         ++NWP  G K  L EGG+
Sbjct: 238 HHMDEGIGWVLDALEKKGMAENTLIVFTSDNGGE------RFSNNWPFIGQKMDLLEGGI 291

Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           R   +   P     G V++      DW  T L+AA 
Sbjct: 292 RVPLVARWPDRMPAGAVSDTPNLTMDWSATFLAAAG 327


>gi|421613814|ref|ZP_16054885.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408495400|gb|EKJ99987.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 468

 Score =  151 bits (382), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 110/345 (31%), Positives = 178/345 (51%), Gaps = 40/345 (11%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
           P+I+ ILADDLG+ D+G +G D+  T  +D LA  G+   + Y     C+P+R+A++TG+
Sbjct: 23  PNIVLILADDLGYGDLGCYGNDEQATRVLDRLATQGVRWTQAYANGPECSPTRAALLTGR 82

Query: 119 HPIHTGMQH------NV--------LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
           +  H G         NV        L+     GLP     L + L  +GY T + GKWHL
Sbjct: 83  YQQHVGGLECAIGVGNVGRYDDAIRLHLVNELGLPADRPTLAKRLSSVGYETALFGKWHL 142

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTD 223
           G Y+ +++P   GF+  L    G  DY+ H  + +  + L           + G+ Y TD
Sbjct: 143 G-YEAKFSPMMHGFDEALYCIGGAMDYY-HYLDSVATYNL-----FHNGRPISGEGYFTD 195

Query: 224 VFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYE-PLQAPDHYLNIHRHIEDFKRSK- 280
             T +AV  I + ++ D+P FLYL + A H+  PY+ P ++P   L +   +  +K++  
Sbjct: 196 TITDQAVRYIGDRNANDKPFFLYLPYTAPHT--PYQAPGESPVDPLPLDSPL--WKQNAD 251

Query: 281 ----FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
               + A++  +DE +GKV+ A+E+  M   ++++F SDNGG +      A+ N PLRG 
Sbjct: 252 PPGVYRAMVRHMDEGIGKVLHAIEESEMTDRTLVIFASDNGGTS------ASRNEPLRGF 305

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           K   +EGG+R   +   P     G+V++Q     D    +L+AA 
Sbjct: 306 KGQAFEGGIRVPLIARWPGHLPEGVVSDQVTITFDLTALMLAAAG 350


>gi|334344099|ref|YP_004552651.1| N-acetylgalactosamine-6-sulfatase [Sphingobium chlorophenolicum
           L-1]
 gi|334100721|gb|AEG48145.1| N-acetylgalactosamine-6-sulfatase [Sphingobium chlorophenolicum
           L-1]
          Length = 487

 Score =  151 bits (382), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 118/392 (30%), Positives = 187/392 (47%), Gaps = 61/392 (15%)

Query: 55  ASSGP---PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPS 110
           A  GP   P+I+FI+ADDLG+ DV  +G     TP ID LA  G+  L+ Y    +C+ +
Sbjct: 40  APVGPARKPNILFIVADDLGYADVSCYGRRDYRTPAIDRLAAEGLRFLQGYANSSVCSAT 99

Query: 111 RSAIMTGKHPIHTGMQHNVLYGCERG------GLPLSEKILPQYLKELGYRTRIVGKWHL 164
           R+A++TG++      Q+ +  G E        GLP S   LP  L+  GY T ++GKWHL
Sbjct: 100 RTALITGRY------QYRLPLGLEEPLDERDVGLPPSHPTLPSILRRQGYGTTLIGKWHL 153

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
           G     Y+P   G++   G+  G  DYF H     K +G     D +   + +G Y TD+
Sbjct: 154 GALPN-YSPLKSGYDHFWGFRAGGVDYFTH-----KYFGKPDLWDGDVPIERNG-YLTDL 206

Query: 225 FTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR-----------H 272
               A++ I  ++  D P  + L     H   P+ P + P       R           H
Sbjct: 207 LAERAIESIDAYAKADRPFLISL-----HFNAPHWPWEGPADQAESVRLDESGKPEAMLH 261

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWP 332
            +   +  +A ++ ++D  +G++++ L+  ++  ++I++F SDNGG         A  WP
Sbjct: 262 FDGGSQRIYAEMVTRMDMQIGRILDRLDALKIADDTIVIFTSDNGGE------RFADTWP 315

Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK---------- 382
             G K  L EGG+R   +I  P   + G  ++Q +   DW+PTLL+AA            
Sbjct: 316 FTGRKTELLEGGLRVPAIIRWPGRVAAGEASQQVMMSMDWMPTLLAAAGASADPAYPLDG 375

Query: 383 -SDIPNYVNSTVENIIPRYENSILRYENGTHE 413
            S +P   N++  +I+PR      RY+N   E
Sbjct: 376 VSLLPALANTS--HIVPR--TLCWRYKNLDQE 403


>gi|441597518|ref|XP_003266414.2| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Nomascus
           leucogenys]
          Length = 431

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 72/134 (53%), Positives = 97/134 (72%), Gaps = 1/134 (0%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 48  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 181 HLGYWTGHQDYFDH 194
             G  TG+ DY+ +
Sbjct: 167 FXGSLTGNVDYYTY 180


>gi|337280754|ref|YP_004620226.1| arylsulfatase [Ramlibacter tataouinensis TTB310]
 gi|334731831|gb|AEG94207.1| arylsulfatase precursor-like protein [Ramlibacter tataouinensis
           TTB310]
          Length = 439

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 115/376 (30%), Positives = 184/376 (48%), Gaps = 43/376 (11%)

Query: 58  GPPHIIFILADDLGWNDVGFHGLDQIP----TPNIDALAYSGI-ILKNYYTVQLCTPSRS 112
           G P+++FI+ADDLG+ D+G +G  +      +P +D LA  G+   + Y    +C+P+R 
Sbjct: 3   GRPNVVFIVADDLGFADLGCYGGREAGFSRVSPVLDGLAAKGLRFTQGYANSPVCSPTRF 62

Query: 113 AIMTGKHPIHT-GMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
           A+MT ++     G     +    RG    GLP     +P  LK  GYRT ++GKWHLG Y
Sbjct: 63  ALMTARYQYRLRGAAEEPINSRSRGSTTLGLPPEHPTVPSLLKAAGYRTALIGKWHLG-Y 121

Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
              + P   G+E   G  +G  DYF H +   +  +W  +  R  E        Y TD+ 
Sbjct: 122 PPAFGPLRSGYEEFFGPMSGGVDYFSHCSSNGQHDLWLGEQERAEE-------GYLTDLL 174

Query: 226 TAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEP-------LQAPDHYLNIH-RHIEDF 276
           +  AVD +   +  D P FL L + A H   P+E         +  D+  ++H  ++E +
Sbjct: 175 SRRAVDYVARMARQDAPFFLSLHYTAPHW--PWETREDAGKVAEVKDNLFHLHGGNVETY 232

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
           +R     ++H +DE +G V++AL +  +  ++++VF SDNGG         + NWPL G 
Sbjct: 233 RR-----MIHHMDEGIGWVMDALRRHGLERDTLVVFTSDNGGE------RFSDNWPLVGG 281

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENI 396
           K  L EGG+R   +   P + + G  + Q     DW  T+L AA  +  P Y    V ++
Sbjct: 282 KMDLTEGGIRVPWIAHWPAVIAPGGASAQPCMTMDWSATVLDAAGAAADPAYPLDGV-SL 340

Query: 397 IPRYENSILRYENGTH 412
           +P   ++  R+    H
Sbjct: 341 LPVLRDASRRFARPMH 356


>gi|160895745|ref|YP_001561327.1| sulfatase [Delftia acidovorans SPH-1]
 gi|160361329|gb|ABX32942.1| sulfatase [Delftia acidovorans SPH-1]
          Length = 442

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 105/336 (31%), Positives = 166/336 (49%), Gaps = 28/336 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQI------PTPNIDALAYSGI-ILKNYYTVQLCTPSRS 112
           P++IFILADDLG+ D+G  G           +P +DA+A  G+   + Y    +C+P+R 
Sbjct: 6   PNLIFILADDLGYADLGCTGARDAHSRATDVSPRLDAMAAQGMRFTRGYSNSAVCSPTRF 65

Query: 113 AIMTGK--HPIHTGMQHNV--LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
           A+ TG+  + +    +  +  ++G +  GLP     +   L++ GY T +VGKWHLG Y 
Sbjct: 66  ALATGRWQYRLRGAAEEPIASVHGDKVLGLPPDHPTVASLLRDAGYATALVGKWHLG-YP 124

Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
             + P   G+E   G+  G  DYF H     +    D   + EP       Y TD+ +  
Sbjct: 125 PHFGPRMSGYEEFYGFHAGGADYFAHCDPRGRP---DFWINEEP--HEEEGYLTDLLSRR 179

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPY---EPLQAPDHYLNIHRHIEDFKRSKFAAIL 285
           AVD +   + D+P +L L ++A H   P+   + L+         +H +      +  ++
Sbjct: 180 AVDFVERQTADKPFYLSLHYSAPHW--PWLTRDDLEESRRIGGFGKHTDGGTIETYQRMI 237

Query: 286 HKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGV 345
           H +DE +G V++ALE++ M  N++IVF SDNGG         ++NWP  G K  L EGG+
Sbjct: 238 HHMDEGIGWVLDALEKKGMAENTLIVFTSDNGGE------RFSNNWPFIGQKMDLLEGGI 291

Query: 346 RGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           R   +   P     G V++      DW  T L+AA 
Sbjct: 292 RVPLVARWPDRMPAGAVSDTPNLTMDWSATFLAAAG 327


>gi|325298612|ref|YP_004258529.1| arylsulfatase [Bacteroides salanitronis DSM 18170]
 gi|324318165|gb|ADY36056.1| Arylsulfatase [Bacteroides salanitronis DSM 18170]
          Length = 465

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 125/404 (30%), Positives = 201/404 (49%), Gaps = 59/404 (14%)

Query: 28  LGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPN 87
           LG+ T + +F  LP A   +      V+    P+IIFILADD+G+ D+  +G   I TPN
Sbjct: 7   LGWGT-LASFFFLPPADASADAGQAGVSHPEKPNIIFILADDMGYADLSCYGSRYIETPN 65

Query: 88  IDALAYSGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG--------- 137
           ID LA SG    + Y    + +PSR A+MTG++  +T ++ N    C+ GG         
Sbjct: 66  IDKLATSGTRFTQCYAGSGISSPSRCALMTGRNTGNTTIRDNF---CKAGGIEGKKGKAT 122

Query: 138 ------LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTG---- 187
                 LP ++  +   L   GYRT +V KWHL  +  E TP  RGF+   G+       
Sbjct: 123 IRRMHLLP-NDTTIATVLSAAGYRTCLVNKWHLDGFNPEATPLNRGFDEFYGWLISTAYS 181

Query: 188 ----HQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLF 243
               +  Y+    EE+    ++++ + +     HG ++TD+ T  A+D I  +  + P F
Sbjct: 182 NDPYYYPYWRFENEEL----VNIQANADGK---HGIHNTDLSTRNAIDFIRRNR-ENPFF 233

Query: 244 LYLAHAATHSANPYEPLQAPD-HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQR 302
           LYLA+ A     P+EP    D H+ +      + KR  +A+++  +D+++G++++ LE+ 
Sbjct: 234 LYLAYDA-----PHEPYHIDDTHWYDAREWSMNTKR--YASLVTHMDQAIGRLLDELERL 286

Query: 303 RMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGI 361
            +  N++I+F SDNGGA          N   RG K  L+EGG+R   ++ W      RG 
Sbjct: 287 HLRENTLIIFASDNGGAVQAPLQQLGCNGVFRGRKGQLYEGGIRVPFIVNW------RGK 340

Query: 362 VAEQ----YVHVSDWLPTLLSAANKSD-IPNYVNSTVENIIPRY 400
           V  Q     ++  D +PTL   A  S  +P+ +N    +I P +
Sbjct: 341 VPVQTLDNLIYFPDVMPTLARLAGASGKLPSRLNGI--DITPLF 382


>gi|323450061|gb|EGB05945.1| hypothetical protein AURANDRAFT_66030 [Aureococcus anophagefferens]
          Length = 611

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 114/343 (33%), Positives = 166/343 (48%), Gaps = 32/343 (9%)

Query: 60  PHIIFILADDLGWNDVGFHGLD-QIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PH++  L DD+G+ND  +   D     P    LA  G++L NYY  Q CTPSR+A++TG+
Sbjct: 73  PHVVIALLDDMGYNDAPWTSTDLGAAMPFARRLAADGVVLSNYYADQSCTPSRAALLTGR 132

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKEL---GYRTRIVGKWHLGFYKKEYTPTF 175
           H +  G+   V++     GL  + K LP  +++     Y    VGKW +G     Y P  
Sbjct: 133 HAVELGVYSTVVWDSAW-GLDENAKTLPALIEDARPGAYHKACVGKWDVGHSTAAYLPLA 191

Query: 176 RGFESHLGYWTGHQ-DYFDHSAEEMKMWGL-DMRRDLEPAWDLHGKYSTDVF-TAEAVDI 232
           RGF++  G       DY  H AE      + D+     P++D    Y+TD++ +A    +
Sbjct: 192 RGFDTFTGMLGDMMTDYSTHMAEFTSSGPVYDLFEGDAPSYDYQNTYATDIWTSAATSAV 251

Query: 233 IHNHSTDEPLFLYLAHAATHSA----------NPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
              H   + L LYLA+ A H+A          + Y+ L AP   L+   + E  +R   A
Sbjct: 252 AAAHEAAKGLLLYLAYNAIHTAITLPKNFEEDDEYKALVAP---LDAGDYTE--QRQLAA 306

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
             L  +D S+  + + L+   M  ++++ F SDNGG   GF     SN+PLRG K TL+E
Sbjct: 307 GALLVVDRSLETLFQTLDAAGMTDDTLLFFGSDNGG-LTGFG---GSNYPLRGEKLTLFE 362

Query: 343 GGVRGAGLIWSP-----LLESRGIVAEQYVHVSDWLPTLLSAA 380
           GGVR     W+         + G      VHVSD  PTL  AA
Sbjct: 363 GGVRVPAFFWAGASFPRFRATAGAAYGGIVHVSDVAPTLFGAA 405


>gi|224536296|ref|ZP_03676835.1| hypothetical protein BACCELL_01168 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224522090|gb|EEF91195.1| hypothetical protein BACCELL_01168 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 465

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 110/363 (30%), Positives = 179/363 (49%), Gaps = 42/363 (11%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKH 119
           +IIFILADD+G+ D+  +G   I TPNID LA +G      Y    + +PSR A+MTG++
Sbjct: 37  NIIFILADDMGYCDLACYGNKYIETPNIDKLAATGTSFTQCYAGSGISSPSRCALMTGRN 96

Query: 120 PIHTGMQHNVLYGCERGGL-------PL-------SEKILPQYLKELGYRTRIVGKWHLG 165
             +T ++ N    C  GG+       P+       ++  +   L   GYRT +V KWHL 
Sbjct: 97  TGNTTIRDNF---CNAGGIEGMKGTKPIRRMHLLPNDTTIATVLSSAGYRTCLVNKWHLD 153

Query: 166 FYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD---MRRDLEPAWDLHGKYST 222
            +  E TP  RGF+   G+       + +       W  +   +    E   D H K++T
Sbjct: 154 GFNPEATPLNRGFDEFYGWLI--STVYSNDPYYYPYWRFNNEKLENIKENEGDKHIKHNT 211

Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFA 282
           D+ T +A+  I+ + T+ P FLYLA+ A H   PY      D  +       D    ++A
Sbjct: 212 DLSTEDAIKFINRNKTN-PFFLYLAYDAPHE--PY----IIDETVWYDDETWDMNTKRYA 264

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWE 342
           A++  +D ++G+++  L++  +  N++I+F SDNG A             L+G+K  L+E
Sbjct: 265 ALITHMDRAIGRLLAELDRLGLRENTMIIFASDNGAAKQAPLAELGCKGSLKGMKGQLYE 324

Query: 343 GGVRGAGLIWSPLLESRGIVAEQ----YVHVSDWLPTLLSAANKSD-IPNYVNSTVENII 397
           GG+R   ++  P     G V  Q     ++  D +PTL + AN +D +P  +N    N++
Sbjct: 325 GGIRVPFIVNQP-----GKVPVQKLNNIIYFPDVMPTLAALANATDKLPQNLNGI--NVL 377

Query: 398 PRY 400
           P +
Sbjct: 378 PLF 380


>gi|240977207|ref|XP_002402636.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215491217|gb|EEC00858.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 159

 Score =  151 bits (381), Expect = 2e-33,   Method: Composition-based stats.
 Identities = 70/159 (44%), Positives = 102/159 (64%)

Query: 225 FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAI 284
           +T  AV +I +H+  +PLFLYL++ A H       L+AP+  +    HI +  R+ +A +
Sbjct: 1   YTERAVSLIRSHNPSQPLFLYLSYQAPHVGTGPSLLEAPEENMKKFLHIPEKNRTTYAGM 60

Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGG 344
           +  LD+S+G ++EAL++  ++ N+I+VF SDNG    G   N   NWPLRG K TLWEGG
Sbjct: 61  VDALDQSIGTIMEALQEADLIGNTIVVFTSDNGALPWGVKSNRGYNWPLRGGKFTLWEGG 120

Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           +R    IWSPLLE    V+ Q +H++DWLPTL SAA ++
Sbjct: 121 LRTTTFIWSPLLEMSRRVSNQMMHITDWLPTLYSAAGET 159


>gi|417301182|ref|ZP_12088351.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
 gi|327542503|gb|EGF28978.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
          Length = 443

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 110/342 (32%), Positives = 177/342 (51%), Gaps = 40/342 (11%)

Query: 63  IFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGKHPI 121
           + ILADDLG+ D+G +G D+  TP +D LA  G+   + Y     C+P+R+A++TG++  
Sbjct: 1   MLILADDLGYGDLGCYGNDEQATPVLDRLATQGVRWTQAYANGPECSPTRAALLTGRYQQ 60

Query: 122 HTGMQH------NV--------LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
           H G         NV        L+     GLP     L + L  +GY T + GKWHLG Y
Sbjct: 61  HVGGLECAIGVGNVGRYDDAIRLHLVNELGLPADRPTLAKRLSSVGYETALFGKWHLG-Y 119

Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK-YSTDVFT 226
           + +++P   GF+  L    G  DY+ H  + +  + L           + G+ Y TD  T
Sbjct: 120 EAKFSPMMHGFDEALYCIGGAMDYY-HYLDSVATYNL-----FHNGRPISGEGYFTDTIT 173

Query: 227 AEAVDIIHN-HSTDEPLFLYLAHAATHSANPYE-PLQAPDHYLNIHRHIEDFKRSK---- 280
             AV  I + ++ D+PLFLYL + A H+  PY+ P ++P   L +   +  +K++     
Sbjct: 174 DHAVRYIGDRNANDKPLFLYLPYTAPHT--PYQAPGESPVDPLPLDSPL--WKQNADPPG 229

Query: 281 -FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
            + A++  +DE +GKV+ A+E+  M   ++++F SDNGG       +A+ N PLRG K  
Sbjct: 230 VYRAMVRHMDEGIGKVLHAIEESEMTDRTLVIFASDNGGT------SASRNEPLRGFKGQ 283

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            +EGG+R   +   P     G+V++Q     D   ++L+AA 
Sbjct: 284 AFEGGIRVPLIARWPGHLPEGVVSDQVTITFDLTASMLAAAG 325


>gi|340619489|ref|YP_004737942.1| sulfatase [Zobellia galactanivorans]
 gi|339734286|emb|CAZ97663.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
          Length = 483

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 118/411 (28%), Positives = 189/411 (45%), Gaps = 83/411 (20%)

Query: 23  QYLKELGYRTRIMAFAVL-PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLD 81
            +L +L YR   +   VL P  F           +   P+I+FIL DDLGW D+  +G  
Sbjct: 1   MHLHKLFYRNLWVLLTVLGPTVFA---------QTDKQPNIVFILVDDLGWMDIAANGST 51

Query: 82  QIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG--- 137
              TPNID LA  GI   K Y    +C+P+R++I+TGK+P    +   +      GG   
Sbjct: 52  FYETPNIDQLAKEGIRFTKAYAASPICSPTRASILTGKNPARIDLTQWI------GGPGN 105

Query: 138 ------LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE--YTPTFRGFESHL-GYWTGH 188
                 LPL E + P+ L++ GY+T  +GKWHL     E  + P  +GF+ ++ G++ G 
Sbjct: 106 PDYLRNLPLEEVLFPELLQDAGYKTAFMGKWHLNNVAGEEKFWPDKQGFDVNVAGHFRG- 164

Query: 189 QDYFDHSAEEMKMWGLDMRRDLEPAWDL-------HGKYSTDVFTAEAVDIIHNHSTDEP 241
                         GL ++      W++        G+Y TD    EA+D I  +S   P
Sbjct: 165 --------------GLYIKNKYFSPWNIPNLENGPEGEYMTDRLAKEAIDFIDENSK-AP 209

Query: 242 LFLYLAHAATHSANPYEPLQAPDHYLNIH-------------RHIED------FK----- 277
             LY +  + H+     P  AP   +  +             R  E+      FK     
Sbjct: 210 FLLYFSLYSVHA-----PFDAPADRVEKYEKKKKALALTDKERFAEEKNAQKPFKYRIKQ 264

Query: 278 -RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
               +AA++  +D +VGK++  L+++ +  N++++F SDNGG +    +  A N PLR  
Sbjct: 265 DHPTYAAMVESMDMAVGKILGKLQEKGIADNTVVIFFSDNGGLSTSEGIPTA-NTPLRAG 323

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           K  L+EGG+R   +I  P     G V++  +   D+ PT+L    ++  P+
Sbjct: 324 KGWLYEGGIREPAIIRWPGTIKPGTVSDAVITSMDFYPTILEMTGQALRPD 374


>gi|421610710|ref|ZP_16051876.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408498494|gb|EKK02987.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 606

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 105/340 (30%), Positives = 169/340 (49%), Gaps = 34/340 (10%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
            AS   P++I ++ DD G+ D+  HG   + TPN+D LA   + L+N++    CTP+R+A
Sbjct: 41  TASESRPNVILVVTDDQGYGDMSCHGNPWLDTPNLDRLATQSVRLENFHVDPFCTPTRAA 100

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           +MTG++    G    V  G  R  L   E  + +  +E GYRT + GKWHLG     + P
Sbjct: 101 LMTGRYCTRVGAW-AVTEG--RQLLDPDETTMAEIFRESGYRTGMFGKWHLG-DPPPFAP 156

Query: 174 TFRGFESHLGYWTGHQ---------DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
             RGFE+ + +  G           DYFD +            R+ +P  +    Y TD+
Sbjct: 157 RERGFETVVRHMAGGADEIGNPTGNDYFDDT----------YYRNGKP--ESFDGYCTDI 204

Query: 225 FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDFKRSKFAA 283
           +  EA+D I   S ++P F+Y+   A HS     P    D Y +   R   + +R+ F  
Sbjct: 205 WFDEAIDFIQKES-EQPFFVYIPTNAMHS-----PYLVADRYSDPFKRQGIEPQRAAFYG 258

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA--AAGFNLNAASNWPLRGVKNTLW 341
           ++   DE++G++++ L+Q  + +N++++F+SDNG A  A+  N     N  +RG K +++
Sbjct: 259 MIQNFDENLGRLLKRLDQDNLRNNTLLIFMSDNGTAQGASEQNRKVGFNAGMRGKKGSVY 318

Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           EGG R       P         +Q     DWLPTL+   +
Sbjct: 319 EGGHRVPCFASWPAKWDGNRPVDQLTCHRDWLPTLIELCD 358


>gi|386287174|ref|ZP_10064349.1| sulfatase family protein [gamma proteobacterium BDW918]
 gi|385279706|gb|EIF43643.1| sulfatase family protein [gamma proteobacterium BDW918]
          Length = 555

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 122/383 (31%), Positives = 180/383 (46%), Gaps = 63/383 (16%)

Query: 51  VDLVASSGPPHIIFILADDLGWNDVGFHGL----DQIPTPNIDALAYSGIILKNYYT-VQ 105
            D  +   PP++I ILADDLG ND+  +G      Q+ TPNID LA SG      Y+   
Sbjct: 56  ADAASQDRPPNVIVILADDLGINDISTYGGGLANGQVQTPNIDELAASGANFSQAYSGAG 115

Query: 106 LCTPSRSAIMTGKHPIHTGMQH--------------------------------NVLYGC 133
            C PSR+ ++TG++P  TG +                                 + +   
Sbjct: 116 TCAPSRAMLLTGRYPTRTGFEFTPTPDGMGAMATMIASSHDNGLPPSIYNKAGADKIPSF 175

Query: 134 ERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTG------ 187
           +  GLP SE  L + L++ GY T  +GKWHLG   K ++P  +GFE  L   +G      
Sbjct: 176 DEQGLPSSEITLAEILRDRGYYTAHIGKWHLG-RGKGFSPEEQGFEQSLLMHSGLYLPVD 234

Query: 188 HQDY----FDHSAEEMKMWGLDMR-----RDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           H D      D    +  +W   MR      D  P ++    Y TD +T E++ II  +  
Sbjct: 235 HPDVVNAKVDFDPIDQFLWA-KMRYAAAFNDGGPVFEPK-SYLTDYWTDESIKIIKANK- 291

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEA 298
           + P FLYLAH   H+     PLQA          I+  +   +AA++  +D SVG+++  
Sbjct: 292 NRPFFLYLAHWGAHT-----PLQATREDYEAVGDIKPHRARVYAAMVRAVDRSVGRILAT 346

Query: 299 LEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           L++  +  N+I+VF SDNGG  AG+      N P RG K T++EGG+R    +  P   +
Sbjct: 347 LKEEGLSDNTIVVFSSDNGG--AGYIGIPDINAPYRGWKLTMFEGGIRVPLFMRWPAHIA 404

Query: 359 RGIVAEQYVHVSDWLPTLLSAAN 381
           R    +      D+LPTL++AA 
Sbjct: 405 RNTSIDTPAAHIDFLPTLVAAAQ 427


>gi|343085667|ref|YP_004774962.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354201|gb|AEL26731.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 465

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 111/352 (31%), Positives = 174/352 (49%), Gaps = 32/352 (9%)

Query: 43  AFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY 102
           +     ++   + +   P+I+FI+ADDLG+ D+G +G + I TPNIDALA  G++    Y
Sbjct: 16  SMAFQSIYAQQINNGDRPNIVFIMADDLGYGDLGVYGQELIQTPNIDALAKEGMVFSQAY 75

Query: 103 TVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
               +CT SR  +MTG H  HT  + NV +      L   +  L   LKE GYRT  VGK
Sbjct: 76  AGGPVCTSSRYVLMTGLHNGHTVARDNVPHYPTY--LQEEDITLAMVLKEAGYRTGGVGK 133

Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGH--QD--------YFDHSAEEMKMWGLDMRRDLE 211
           W LG    E   T +GF++    WTG+  QD        Y DH   ++++    + R+  
Sbjct: 134 WSLGDANTEGRATNKGFDT----WTGYLNQDHAHYYYPSYLDHDDRKIQLTDNPILRN-- 187

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP---LQAPDHYLN 268
                   YS D+ T   ++ I  H    P F Y A+ + H +   E    L  P  Y  
Sbjct: 188 -------NYSHDILTNATLNFIRMHRK-VPFFFYAAYTSPHFSAAEEDEHGLTVPSTYPY 239

Query: 269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAA 328
             +   +  + K+AA++++LD+ VGK+V+ L +  +  N++I+F SDNGG +  ++    
Sbjct: 240 TEKDWPEAAK-KYAAMIYRLDKDVGKIVKLLNELGIRENTLIIFTSDNGGHSNVWD-KFR 297

Query: 329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           +N  L+G K  L+EGG+R   +   P +      ++  +   D LPT    A
Sbjct: 298 TNGKLKGHKRDLYEGGIRVPFIASWPGIIPASRKSDALIGFQDMLPTFAEIA 349


>gi|149174163|ref|ZP_01852791.1| putative secreted sulfatase ydeN precursor [Planctomyces maris DSM
           8797]
 gi|148847143|gb|EDL61478.1| putative secreted sulfatase ydeN precursor [Planctomyces maris DSM
           8797]
          Length = 470

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 111/363 (30%), Positives = 174/363 (47%), Gaps = 37/363 (10%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
           + + AV+ L F  S+      A   P +++F L DDLGW D+G +G D   +PNID LA 
Sbjct: 8   LFSQAVILLCFLSSITQPTHAADEKPWNVVFFLVDDLGWTDLGCYGSDFYQSPNIDQLAA 67

Query: 94  SGI-ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVL-YGCERGGLPLSE--------- 142
            G+   +NY     C+P+R A++TG +P  T +   +  +       PL           
Sbjct: 68  EGMKFTQNYSACNACSPTRGALLTGMYPARTHLTDWIPGWAKSYTDFPLKPPEWKKHLDQ 127

Query: 143 --KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHL-GYWTG--HQDYFDHSAE 197
               LP+ L+  GY+T  VGKWHLG   +   P   GF+ ++ G   G     +F +  +
Sbjct: 128 KYTTLPEALRTAGYQTFHVGKWHLG--GRGNLPQDHGFDVNISGTNRGLPRSYHFPYGGD 185

Query: 198 EMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
            MK W   +        +   +Y TD    EAV +I     D+P FLY +  + HS    
Sbjct: 186 AMK-WDSSLTEA-----ERQDRYLTDRMADEAVALIRQQQ-DKPFFLYCSFYSVHS---- 234

Query: 258 EPLQAPDHYLNIHRHIEDFKRSK---FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
            P+Q     +  ++ +   KR K   +AA++  +DE++G+V   L++  +   ++IVF S
Sbjct: 235 -PIQGRPDLVKKYKGLPAGKRHKNPEYAAMIQSVDEAIGRVRAQLKESGIADRTLIVFTS 293

Query: 315 DNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLP 374
           DNGG         ++N PLRG K   WEGG R   ++  P +   G V  + +   D+ P
Sbjct: 294 DNGGV----RRKTSNNDPLRGEKGQHWEGGTRVPAIVLWPGVTPAGSVCAEPIITMDFYP 349

Query: 375 TLL 377
           T+L
Sbjct: 350 TIL 352


>gi|436838037|ref|YP_007323253.1| sulfatase [Fibrella aestuarina BUZ 2]
 gi|384069450|emb|CCH02660.1| sulfatase [Fibrella aestuarina BUZ 2]
          Length = 478

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 172/356 (48%), Gaps = 50/356 (14%)

Query: 59  PPHIIFILADDLGWNDVGFH-----GLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRS 112
           PP++I ILADD+G+ DVG +     G   I TPN++ +A  G+   ++YT   +C PSR 
Sbjct: 34  PPNLIVILADDMGYGDVGCYRAASAGPALIQTPNLNRMAAQGMRFTDFYTGSTVCAPSRC 93

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELGYRTRIVGKWHLGFYKKE 170
           A+MTG+H  HT ++ N       G  PL ++  ++P+ LK+  Y T + GKW LG    E
Sbjct: 94  ALMTGQHTGHTQIRGN-------GEKPLRQEDVVIPELLKKANYTTGMFGKWGLGMPDTE 146

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
             P  +G+++  G+    + +F       ++      +  +PA    G Y+ D FT +A+
Sbjct: 147 GAPHRKGWDAFFGHVNHEEAHFQQHPFLWQISDGATVKVEQPA----GSYNNDAFTQQAL 202

Query: 231 DIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKR------------ 278
             +   +T +P  LYLA    H+      L  PD YLN +R      R            
Sbjct: 203 AFLDRQTT-KPFLLYLAFTLPHAE-----LHTPDTYLNQYRDATGKSRFQPEKPWPAGRH 256

Query: 279 --------SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLN---- 326
                   + +AA++ ++D  VG+V+  L+Q+ + +N++++F SDNG    G        
Sbjct: 257 YGEQPEPKAAYAAMVSQIDGYVGQVLAKLDQKGLANNTLVLFASDNGTHIEGGRTQDDVS 316

Query: 327 -AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
              S+ PLRGVK  L++GG+R   ++  P     G          D LPT    AN
Sbjct: 317 YMQSSGPLRGVKRDLFDGGIRTPFIVRWPGHVKAGSTTAFVGAFYDLLPTFCELAN 372


>gi|336413800|ref|ZP_08594149.1| hypothetical protein HMPREF1017_01257 [Bacteroides ovatus
           3_8_47FAA]
 gi|335934817|gb|EGM96800.1| hypothetical protein HMPREF1017_01257 [Bacteroides ovatus
           3_8_47FAA]
          Length = 465

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 114/368 (30%), Positives = 185/368 (50%), Gaps = 41/368 (11%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVA--SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           +M  ++L +   L   FV++ A  +S  P+IIF+LADD+G  D+G +G  +I TPNID L
Sbjct: 1   MMNKSILLMGTLLVSTFVEIAAQKNSENPNIIFVLADDMGIGDLGCYGQKKIKTPNIDKL 60

Query: 92  AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG----LPLSEKILP 146
           A  G++  N+Y+   +  PSR +++TGKH  H  ++ N     E G     LP  E  + 
Sbjct: 61  AEKGVLFTNHYSGSTVSAPSRCSLLTGKHTGHAYIRGNKGIKSEEGFFDLHLPSDEVTVA 120

Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWT---GHQDYFDHSAEEMKMWG 203
           + LK  GY T  VGKW LG      +P  +GF+   GY +    H+ Y ++  E      
Sbjct: 121 EILKRKGYATMCVGKWGLGGPHTTGSPVRKGFDYFFGYLSQGAAHRYYPEYLYENEDKVM 180

Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPL-QA 262
           L+ +            YS  +   + +D I  ++   P F Y A    H+   Y  L Q 
Sbjct: 181 LNKKV-----------YSHFLIMEKGLDFIRKNA-GHPFFAYFAITPPHADLDYPDLSQY 228

Query: 263 PDHYLN---IHRHIEDFK-----RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
            D +     I+   + FK     ++ +A+++ ++D++VG++++ L+++ +L N+II+F S
Sbjct: 229 EDAFPETPYINNKQKGFKTQMKPKAAYASMVSEIDKNVGQIIQLLKEKGILENTIIIFSS 288

Query: 315 DNGGAAAG------FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVH 368
           DNG    G      F+    SN P RG K  L+EGGVR   ++  P +  +    E    
Sbjct: 289 DNGVHCVGGHEPDFFD----SNGPYRGYKRDLYEGGVRAPFIVSWPKMIKKKRTVEHITT 344

Query: 369 VSDWLPTL 376
             D+LPT+
Sbjct: 345 FWDFLPTV 352


>gi|204927175|ref|ZP_03218377.1| sulfatase [Salmonella enterica subsp. enterica serovar Javiana str.
           GA_MM04042433]
 gi|374999764|ref|ZP_09724105.1| arylsulfatase [Salmonella enterica subsp. enterica serovar Infantis
           str. SARB27]
 gi|416423753|ref|ZP_11691142.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 315996572]
 gi|416429490|ref|ZP_11694552.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 495297-1]
 gi|416441871|ref|ZP_11701958.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 495297-3]
 gi|416446887|ref|ZP_11705399.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 495297-4]
 gi|416455076|ref|ZP_11710701.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 515920-1]
 gi|416458094|ref|ZP_11712696.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 515920-2]
 gi|416477886|ref|ZP_11721589.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. NC_MB110209-0054]
 gi|416487277|ref|ZP_11725587.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. OH_2009072675]
 gi|416495848|ref|ZP_11728755.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. CASC_09SCPH15965]
 gi|416506199|ref|ZP_11734417.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. SARB31]
 gi|416545192|ref|ZP_11753251.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 19N]
 gi|416558161|ref|ZP_11760062.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 42N]
 gi|416572463|ref|ZP_11767208.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 4441 H]
 gi|416580385|ref|ZP_11771776.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 81038-01]
 gi|416587997|ref|ZP_11776533.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. MD_MDA09249507]
 gi|416592560|ref|ZP_11779370.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 414877]
 gi|416600365|ref|ZP_11784312.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 366867]
 gi|416609010|ref|ZP_11789742.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 413180]
 gi|416615356|ref|ZP_11793268.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 446600]
 gi|416625575|ref|ZP_11798548.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 609458-1]
 gi|416629908|ref|ZP_11800425.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 556150-1]
 gi|416644798|ref|ZP_11807012.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 609460]
 gi|416652444|ref|ZP_11811765.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 507440-20]
 gi|416655742|ref|ZP_11812718.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 556152]
 gi|416667431|ref|ZP_11818234.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. MB101509-0077]
 gi|416683049|ref|ZP_11824165.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. MB102109-0047]
 gi|416694147|ref|ZP_11826960.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. MB110209-0055]
 gi|416708252|ref|ZP_11833114.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. MB111609-0052]
 gi|416710327|ref|ZP_11834432.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 2009083312]
 gi|416717531|ref|ZP_11839783.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 2009085258]
 gi|416726451|ref|ZP_11846512.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 315731156]
 gi|416732451|ref|ZP_11849752.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. IA_2009159199]
 gi|416742001|ref|ZP_11855518.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. IA_2010008282]
 gi|416742918|ref|ZP_11855868.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. IA_2010008283]
 gi|416752811|ref|ZP_11860623.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. IA_2010008284]
 gi|416763578|ref|ZP_11867252.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. IA_2010008285]
 gi|416770088|ref|ZP_11871440.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. IA_2010008287]
 gi|417472114|ref|ZP_12167921.1| Arylsulfatase [Salmonella enterica subsp. enterica serovar Rubislaw
           str. A4-653]
 gi|418483635|ref|ZP_13052642.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 80959-06]
 gi|418487815|ref|ZP_13056006.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. CT_02035278]
 gi|418494224|ref|ZP_13060680.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. CT_02035318]
 gi|418501265|ref|ZP_13067654.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. CT_02035320]
 gi|418503986|ref|ZP_13070345.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. CT_02035321]
 gi|418508644|ref|ZP_13074947.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. CT_02035327]
 gi|418513428|ref|ZP_13079658.1| sulfatase [Salmonella enterica subsp. enterica serovar Pomona str.
           ATCC 10729]
 gi|418524081|ref|ZP_13090069.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. IA_2010008286]
 gi|452121665|ref|YP_007471913.1| arylsulfatase [Salmonella enterica subsp. enterica serovar Javiana
           str. CFSAN001992]
 gi|204323840|gb|EDZ09035.1| sulfatase [Salmonella enterica subsp. enterica serovar Javiana str.
           GA_MM04042433]
 gi|322615792|gb|EFY12712.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 315996572]
 gi|322620640|gb|EFY17500.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 495297-1]
 gi|322621731|gb|EFY18581.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 495297-3]
 gi|322627457|gb|EFY24248.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 495297-4]
 gi|322630763|gb|EFY27527.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 515920-1]
 gi|322638017|gb|EFY34718.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 515920-2]
 gi|322645937|gb|EFY42455.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. NC_MB110209-0054]
 gi|322651173|gb|EFY47558.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. OH_2009072675]
 gi|322656570|gb|EFY52858.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. CASC_09SCPH15965]
 gi|322658771|gb|EFY55028.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 19N]
 gi|322661786|gb|EFY58002.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 81038-01]
 gi|322666467|gb|EFY62645.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. MD_MDA09249507]
 gi|322672374|gb|EFY68486.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 414877]
 gi|322676314|gb|EFY72385.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 366867]
 gi|322679593|gb|EFY75638.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 413180]
 gi|322684304|gb|EFY80308.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 446600]
 gi|323191787|gb|EFZ77036.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 609458-1]
 gi|323198944|gb|EFZ84042.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 556150-1]
 gi|323201010|gb|EFZ86079.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 609460]
 gi|323212904|gb|EFZ97706.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 556152]
 gi|323216648|gb|EGA01373.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. MB101509-0077]
 gi|323219854|gb|EGA04333.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. MB102109-0047]
 gi|323226080|gb|EGA10297.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. MB110209-0055]
 gi|323228733|gb|EGA12862.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. MB111609-0052]
 gi|323236653|gb|EGA20729.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 2009083312]
 gi|323239846|gb|EGA23893.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 2009085258]
 gi|323242106|gb|EGA26135.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 315731156]
 gi|323247453|gb|EGA31408.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. IA_2009159199]
 gi|323250554|gb|EGA34436.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. IA_2010008282]
 gi|323259284|gb|EGA42926.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. IA_2010008283]
 gi|323263837|gb|EGA47358.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. IA_2010008284]
 gi|323265567|gb|EGA49063.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. IA_2010008285]
 gi|323270013|gb|EGA53461.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. IA_2010008287]
 gi|353077982|gb|EHB43741.1| arylsulfatase [Salmonella enterica subsp. enterica serovar Infantis
           str. SARB27]
 gi|353655321|gb|EHC96360.1| Arylsulfatase [Salmonella enterica subsp. enterica serovar Rubislaw
           str. A4-653]
 gi|363554797|gb|EHL39029.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. SARB31]
 gi|363573015|gb|EHL56902.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 4441 H]
 gi|363577470|gb|EHL61293.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 42N]
 gi|366061128|gb|EHN25381.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 80959-06]
 gi|366062541|gb|EHN26772.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. CT_02035318]
 gi|366066779|gb|EHN30937.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. CT_02035320]
 gi|366070570|gb|EHN34679.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. CT_02035278]
 gi|366072314|gb|EHN36406.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. CT_02035321]
 gi|366079899|gb|EHN43881.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. CT_02035327]
 gi|366081789|gb|EHN45729.1| sulfatase [Salmonella enterica subsp. enterica serovar Pomona str.
           ATCC 10729]
 gi|366827048|gb|EHN53958.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 507440-20]
 gi|372208393|gb|EHP21889.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. IA_2010008286]
 gi|451910669|gb|AGF82475.1| arylsulfatase [Salmonella enterica subsp. enterica serovar Javiana
           str. CFSAN001992]
          Length = 497

 Score =  150 bits (380), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 123/415 (29%), Positives = 203/415 (48%), Gaps = 54/415 (13%)

Query: 31  RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
           +T I +   L L  T S + V   A+   P+++FI+ADDLG+ D+  +G   + TPNID 
Sbjct: 3   KTVIASMIGLALCAT-STISVVHAAAEKRPNLVFIVADDLGYGDLATYGHQIVKTPNIDR 61

Query: 91  LAYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
           LA  G+    YY    LC+PSR+ ++TG+ P  TG++  +  G     +  +E  +   L
Sbjct: 62  LAKEGVKFTEYYAPAPLCSPSRAGMLTGRMPFRTGVRSWIPEGTNV-SIGRNELTIANLL 120

Query: 150 KELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLD 205
           K+ GY T ++GK HL   G    +  P   GF+  L    G   D    +A+E   +G+ 
Sbjct: 121 KQQGYDTAMMGKLHLNAGGDRTDQPQPKELGFDYSLVNPAGFVTDATLDNAKERPRYGV- 179

Query: 206 MRRDLEP-AWDLHGKY-------STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
               + P  W  +GK+       S +  ++E V+ + N   D+P FLY+A    HS    
Sbjct: 180 ----VHPTGWMRNGKHIDRADSISGEFVSSEVVNWLDNKKDDKPFFLYVAFTEVHS---- 231

Query: 258 EPLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEA 298
            PL +P  YL+++ +++ ++++                   ++ A +  +D  VGKV++ 
Sbjct: 232 -PLASPKKYLDMYSQYMSEYQKQHPDLFYGDWADKPWRGTGEYYANISYMDAQVGKVLDK 290

Query: 299 LEQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWS 353
           ++      N+I++F SDNG     A   + LN A     LRG K+ LWEGG+R   +I  
Sbjct: 291 IKAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKY 350

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
                +G+V +  V+  DW+PTL   AN  D  +P       ++++P  E   L+
Sbjct: 351 GKHIPQGVVTDTPVYGLDWMPTL---ANMMDFKLPTDRTYDGQSLVPLLEQKTLK 402


>gi|320103547|ref|YP_004179138.1| sulfatase [Isosphaera pallida ATCC 43644]
 gi|319750829|gb|ADV62589.1| sulfatase [Isosphaera pallida ATCC 43644]
          Length = 502

 Score =  150 bits (380), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 115/344 (33%), Positives = 167/344 (48%), Gaps = 46/344 (13%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTG 117
           PP+ + ++ADDLGW+D+  +G     TPN+DALA  G      Y    +C+P+R++I TG
Sbjct: 49  PPNFVILVADDLGWSDLACYGHPFHETPNLDALAAGGARFTAAYAAAPVCSPTRASIQTG 108

Query: 118 KHPIHTGMQHNV----------LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
           +HP   G+  ++                  LPL  + L + LK  GYRT   GKWHLG  
Sbjct: 109 RHPARYGLTAHIPGHWRPFEKLAEPPPAQALPLEAETLGERLKAHGYRTGYFGKWHLG-- 166

Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
              + PT +G++  L       ++  H+    +    D  R   P          +V   
Sbjct: 167 GNGFGPTDQGYDEAL-------EFSSHNYPPGQQAKPDQPRRRGP----------EVLAD 209

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK-----RSKFA 282
           +AVD I N   D P  L + + A H      PLQA       +R     +     R  +A
Sbjct: 210 KAVDFI-NRQDDRPFLLQVHYFAVHI-----PLQADPRLEAKYRAKPQVEGRPPARPDYA 263

Query: 283 AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA---AAGFNLNAASNWPLRGVKNT 339
           A+L ++D S+G+++ AL QR + SN+ +VF+SDNGG    + G+     SN PLR  K T
Sbjct: 264 ALLEEMDTSIGRILNALNQRGLDSNTWVVFLSDNGGLERESGGW--PGTSNRPLRDQKGT 321

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           L+EGG+R   +   P L   G V E  V   D  PTLL+AA+ S
Sbjct: 322 LYEGGIRIPMIARIPGLTKPGTVVETPVVTHDLAPTLLAAAHPS 365


>gi|291414059|ref|XP_002723273.1| PREDICTED: arylsulfatase A [Oryctolagus cuniculus]
          Length = 507

 Score =  150 bits (380), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 113/350 (32%), Positives = 160/350 (45%), Gaps = 40/350 (11%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
           PP+I+ I ADDLG+ D+G +G     TPN+D LA  G+   ++Y  V LCTPSR+A++TG
Sbjct: 20  PPNIVLIFADDLGYGDLGSYGHPSSTTPNLDQLATGGLRFTDFYVPVSLCTPSRAALLTG 79

Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK-EYTPTFR 176
           + P+  GM   VL    RGGLPL E  L + L   GY T + GKWHLG   +  + P  +
Sbjct: 80  RLPVRMGMYPGVLEPSSRGGLPLEEVTLAEVLSARGYLTGMAGKWHLGVGPEGAFLPPHQ 139

Query: 177 GFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------MRRDLEPAW--DLHG 218
           GF   LG    H     Q+            G D           +  + +P W   L  
Sbjct: 140 GFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEA 199

Query: 219 KYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
           +Y      A A D++ +      P FLY A   TH               + H       
Sbjct: 200 RY-----VAFARDLMADAQRQGRPFFLYYASHHTHYPQ-----------FSGHSFAGHSG 243

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           R  F   L +LD +VG ++ A+    +L  ++++F +DNG      +    S   LR  K
Sbjct: 244 RGPFGDSLMELDAAVGALMTAVGDLGLLGETLVIFTADNGPETMRMSRGGCSGL-LRCGK 302

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
            T +EGGVR   L + P   + G+  E    + D LPTL S A  + +PN
Sbjct: 303 GTTFEGGVREPALAFWPGRITPGVTHELASSL-DLLPTLASLAG-APLPN 350


>gi|168230293|ref|ZP_02655351.1| sulfatase [Salmonella enterica subsp. enterica serovar Kentucky
           str. CDC 191]
 gi|194473008|ref|ZP_03078992.1| sulfatase [Salmonella enterica subsp. enterica serovar Kentucky
           str. CVM29188]
 gi|238910791|ref|ZP_04654628.1| sulfatase [Salmonella enterica subsp. enterica serovar Tennessee
           str. CDC07-0191]
 gi|194459372|gb|EDX48211.1| sulfatase [Salmonella enterica subsp. enterica serovar Kentucky
           str. CVM29188]
 gi|205334980|gb|EDZ21744.1| sulfatase [Salmonella enterica subsp. enterica serovar Kentucky
           str. CDC 191]
          Length = 497

 Score =  150 bits (380), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 122/415 (29%), Positives = 203/415 (48%), Gaps = 54/415 (13%)

Query: 31  RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
           +T I +   L L  T ++  V   A+   P+++FI+ADDLG+ D+  +G   + TPNID 
Sbjct: 3   KTVIASMIGLALCATSTLSVVH-AAAEKRPNLVFIVADDLGYGDLATYGHQIVKTPNIDR 61

Query: 91  LAYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
           LA  G+    YY    LC+PSR+ ++TG+ P  TG++  +  G     +  +E  +   L
Sbjct: 62  LAKEGVKFTEYYAPAPLCSPSRAGMLTGRMPFRTGVRSWIPEGTNV-SIGRNELTIANLL 120

Query: 150 KELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLD 205
           K+ GY T ++GK HL   G    +  P   GF+  L    G   D    +A+E   +G+ 
Sbjct: 121 KQQGYDTAMMGKLHLNAGGDRTDQPQPKELGFDYSLVNPAGFVTDATLDNAKERPRYGV- 179

Query: 206 MRRDLEP-AWDLHGKY-------STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
               + P  W  +GK+       S +  ++E V+ + N   D+P FLY+A    HS    
Sbjct: 180 ----VHPTGWMRNGKHIDRADSISGEFVSSEVVNWLDNKKDDKPFFLYVAFTEVHS---- 231

Query: 258 EPLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEA 298
            PL +P  YL+++ +++ ++++                   ++ A +  +D  VGKV++ 
Sbjct: 232 -PLASPKKYLDMYSQYMSEYQKQHPDLFYGDWADKPWRGTGEYYANISYMDAQVGKVLDK 290

Query: 299 LEQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWS 353
           ++      N+I++F SDNG     A   + LN A     LRG K+ LWEGG+R   +I  
Sbjct: 291 IKAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKY 350

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
                +G+V +  V+  DW+PTL   AN  D  +P       ++++P  E   L+
Sbjct: 351 GKHIPQGVVTDTPVYGLDWMPTL---ANMMDFKLPTDRTYDGQSLVPLLEQKTLK 402


>gi|161612357|ref|YP_001586322.1| hypothetical protein SPAB_00045 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|167552131|ref|ZP_02345884.1| sulfatase [Salmonella enterica subsp. enterica serovar Saintpaul
           str. SARA29]
 gi|168244457|ref|ZP_02669389.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
           str. SL486]
 gi|194451527|ref|YP_002044003.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
           str. SL476]
 gi|200388296|ref|ZP_03214908.1| sulfatase [Salmonella enterica subsp. enterica serovar Virchow str.
           SL491]
 gi|386589918|ref|YP_006086318.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. B182]
 gi|417536328|ref|ZP_12189519.1| Arylsulfatase [Salmonella enterica subsp. enterica serovar
           Wandsworth str. A4-580]
 gi|419731865|ref|ZP_14258774.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
           str. 41579]
 gi|419733203|ref|ZP_14260104.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
           str. 41563]
 gi|419741457|ref|ZP_14268151.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
           str. 41573]
 gi|419745011|ref|ZP_14271656.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
           str. 41566]
 gi|419750318|ref|ZP_14276781.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
           str. 41565]
 gi|421569549|ref|ZP_16015251.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00322]
 gi|421576762|ref|ZP_16022356.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00325]
 gi|421581248|ref|ZP_16026794.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00326]
 gi|421583838|ref|ZP_16029354.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00328]
 gi|161361721|gb|ABX65489.1| hypothetical protein SPAB_00045 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|194409831|gb|ACF70050.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
           str. SL476]
 gi|199605394|gb|EDZ03939.1| sulfatase [Salmonella enterica subsp. enterica serovar Virchow str.
           SL491]
 gi|205323154|gb|EDZ10993.1| sulfatase [Salmonella enterica subsp. enterica serovar Saintpaul
           str. SARA29]
 gi|205336651|gb|EDZ23415.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
           str. SL486]
 gi|353670566|gb|EHD07127.1| Arylsulfatase [Salmonella enterica subsp. enterica serovar
           Wandsworth str. A4-580]
 gi|381290892|gb|EIC32148.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
           str. 41579]
 gi|381291686|gb|EIC32922.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
           str. 41573]
 gi|381300912|gb|EIC41969.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
           str. 41563]
 gi|381306844|gb|EIC47712.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
           str. 41566]
 gi|381307271|gb|EIC48132.1| sulfatase [Salmonella enterica subsp. enterica serovar Heidelberg
           str. 41565]
 gi|383796962|gb|AFH44044.1| Arylsulfatase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. B182]
 gi|402517722|gb|EJW25120.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00325]
 gi|402517774|gb|EJW25169.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00326]
 gi|402526800|gb|EJW34068.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00322]
 gi|402531739|gb|EJW38944.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00328]
          Length = 497

 Score =  150 bits (380), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 122/415 (29%), Positives = 203/415 (48%), Gaps = 54/415 (13%)

Query: 31  RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
           +T I +   L L  T ++  V   A+   P+++FI+ADDLG+ D+  +G   + TPNID 
Sbjct: 3   KTVIASMIGLALCATSTLSVVH-AAAEKRPNLVFIVADDLGYGDLATYGHQIVKTPNIDR 61

Query: 91  LAYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
           LA  G+    YY    LC+PSR+ ++TG+ P  TG++  +  G     +  +E  +   L
Sbjct: 62  LAKEGVKFTEYYAPAPLCSPSRAGMLTGRMPFRTGVRSWIPEGTNV-SIGRNELTIANLL 120

Query: 150 KELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLD 205
           K+ GY T ++GK HL   G    +  P   GF+  L    G   D    +A+E   +G+ 
Sbjct: 121 KQQGYDTAMMGKLHLNAGGDRTDQPQPKELGFDYSLVNPAGFVTDATLDNAKERPRYGV- 179

Query: 206 MRRDLEP-AWDLHGKY-------STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
               + P  W  +GK+       S +  ++E V+ + N   D+P FLY+A    HS    
Sbjct: 180 ----VHPTGWMRNGKHIDRADSISGEFVSSEVVNWLDNKKDDKPFFLYVAFTEVHS---- 231

Query: 258 EPLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEA 298
            PL +P  YL+++ +++ ++++                   ++ A +  +D  VGKV++ 
Sbjct: 232 -PLASPKKYLDMYSQYMSEYQKQHPDLFYGDWADKPWRGTGEYYANISYMDAQVGKVLDK 290

Query: 299 LEQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWS 353
           ++      N+I++F SDNG     A   + LN A     LRG K+ LWEGG+R   +I  
Sbjct: 291 IKAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKY 350

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
                +G+V +  V+  DW+PTL   AN  D  +P       ++++P  E   L+
Sbjct: 351 GKHIPQGVVTDTPVYGLDWMPTL---ANMMDFKLPTDRTYDGQSLVPLLEQKTLK 402


>gi|167762012|ref|ZP_02434139.1| hypothetical protein BACSTE_00358 [Bacteroides stercoris ATCC
           43183]
 gi|167700104|gb|EDS16683.1| arylsulfatase [Bacteroides stercoris ATCC 43183]
          Length = 463

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 119/406 (29%), Positives = 190/406 (46%), Gaps = 45/406 (11%)

Query: 30  YRTRIMAFAVLPLAFTLSMVFV--DLVASSG-PPHIIFILADDLGWNDVGFHGLDQIPTP 86
           +R  I+   V    F +  +     LV   G  P+IIFILADD+G+ D+  +G   I TP
Sbjct: 2   FRKDIVLGGVCAGMFVMPQLMTARQLVTDGGDKPNIIFILADDMGYCDLSCYGNKYIETP 61

Query: 87  NIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGG-------- 137
           NID LA +G      Y    + +PSR A+MTGK+  +T ++ N    C  GG        
Sbjct: 62  NIDRLAATGTAFTQCYAGSGISSPSRCALMTGKNTGNTTIRDNF---CIAGGIEGLKGTK 118

Query: 138 ------LPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDY 191
                 L  ++  +   L   GYRT +V KWHL  +  E TP  RGF+   G+       
Sbjct: 119 TIRRMHLQPNDTTIATVLGAAGYRTCLVNKWHLDGFNPEATPLNRGFDEFYGWLISTA-- 176

Query: 192 FDHSAEEMKMWGLD---MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAH 248
           + +       W  +   +    E   D H K++TD+ T +A+  I N + + P FLYLA+
Sbjct: 177 YSNDPYYYPYWRFNNEKLENVKENEGDKHIKHNTDLSTEDAIKFI-NRNKNNPFFLYLAY 235

Query: 249 AATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNS 308
            A     P+EP    D          D    ++A+++  +D ++G+++  L++  +  N+
Sbjct: 236 DA-----PHEPYNI-DETTWYDDEAWDMNTKRYASLITHMDRAIGRLLAELDRLGLRENT 289

Query: 309 IIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQ--- 365
           +++F SDNG A             L+G+K  L+EGG+R   ++  P     G V  Q   
Sbjct: 290 LVIFASDNGAAKQAPLEELGCKGSLKGMKGQLYEGGIRVPFIVNQP-----GKVPVQKLN 344

Query: 366 -YVHVSDWLPTLLSAANKSD-IPNYVNSTVENIIPRYENSILRYEN 409
             ++  D +PTL + A  +D +P  +N    NI+P +    L  +N
Sbjct: 345 NIIYFPDVMPTLAALAGATDKLPQKLNGI--NILPLFYGQQLDTDN 388


>gi|351712929|gb|EHB15848.1| N-acetylgalactosamine-6-sulfatase [Heterocephalus glaber]
          Length = 482

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 101/335 (30%), Positives = 165/335 (49%), Gaps = 44/335 (13%)

Query: 70  LGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN 128
           +GW D+G +G     TPN+D +A  G++  N+Y+   LC+PSR+A+++G+ PI +G    
Sbjct: 1   MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLSGRPPIRSGFYTT 60

Query: 129 VLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
             +           GG+P SE++LP  LKE GY T+IVGKWHLG ++ ++ P   GF+  
Sbjct: 61  NAHARNAYTPQEIVGGIPDSERLLPSLLKEAGYATKIVGKWHLG-HRPQFHPLKHGFDEW 119

Query: 182 LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------------TDVFTA 227
            G    H   +D+ A+      + + +D    W++ G++               T ++  
Sbjct: 120 FGSPNCHFGPYDNKAKP----NIPVYKD----WEMVGRFYEEFPINVKTGESNLTQIYLQ 171

Query: 228 EAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH 286
           EA+D I    +   P FLY A  ATH+     P+ A   +L         +R ++   + 
Sbjct: 172 EALDFIKRQQAARRPFFLYWAVDATHA-----PVYASRPFLG------SSQRGRYGDAVR 220

Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVR 346
           ++D+SVGK++  L+   +  ++ + F SDNG A      +  SN P    K T +EGG+R
Sbjct: 221 EIDDSVGKMLSLLQVLGLAESTFVFFTSDNGAALVS-APSQGSNGPFLCGKQTTFEGGMR 279

Query: 347 GAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
              + W P     G V+ Q   + D   T L+ A 
Sbjct: 280 EPAIAWWPRHIPPGQVSHQLGSIMDLFTTSLALAG 314


>gi|326331907|ref|ZP_08198194.1| twin-arginine translocation pathway signal [Nocardioidaceae
           bacterium Broad-1]
 gi|325950404|gb|EGD42457.1| twin-arginine translocation pathway signal [Nocardioidaceae
           bacterium Broad-1]
          Length = 463

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 105/350 (30%), Positives = 171/350 (48%), Gaps = 43/350 (12%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
           P+++ IL DDLGW D+  +G   I TP++D LA SG+   + Y+   +C+P+R  + TG+
Sbjct: 49  PNLLVILGDDLGWADLSAYGSPDIKTPHLDRLAASGVRFTDAYSASSVCSPTRFGLYTGR 108

Query: 119 HP--IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           +P  +  G+   +    +  G+PL    L   LKE GY T + GKWH G Y   Y+PT  
Sbjct: 109 YPGRLAGGLPEPISAPNKLHGIPLDHPTLASLLKEQGYATALFGKWHCG-YLPWYSPTRV 167

Query: 177 GFESHLGYWTGHQDYF---DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
           G++   G ++G  DYF   +H+         D+  D     DL  +Y T++ T  AV  I
Sbjct: 168 GWDEFFGNFSGGLDYFSKINHNGAH------DLYEDEVEVEDL--RYYTEIVTERAVSFI 219

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL--------------NIHRHIEDFKRS 279
                D P  L L     +   P+ P + P+                  +  H +    +
Sbjct: 220 ERDH-DTPWLLNL-----NFTTPHWPWEGPEDKAVSDELTARIEGGEKGVLFHTDGGSLA 273

Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
            +  ++  LD+S+G+V++ALE+   L ++++ F SDNGG         ++ WP  G K+ 
Sbjct: 274 TYKTMVEDLDQSIGQVIKALERTGQLRDTVVFFASDNGGE------RFSNVWPFTGAKSG 327

Query: 340 LWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
           L EGG+R   ++ W  +L  R  V+++ V  +DW  T L  A  +   +Y
Sbjct: 328 LNEGGIRVPTILSWPGILRGRQ-VSDEPVVTTDWTATFLELAGATPAEDY 376


>gi|296122982|ref|YP_003630760.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296015322|gb|ADG68561.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 466

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 114/360 (31%), Positives = 173/360 (48%), Gaps = 56/360 (15%)

Query: 51  VDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTP 109
               A S  P+I+ I ADDLG+ D+G +G   I TP++D +A  G+   ++Y+  ++CTP
Sbjct: 27  ASFAADSTKPNIVVIFADDLGYGDLGCYGSPTIRTPHLDEMAAEGLRFTDFYSAAEVCTP 86

Query: 110 SRSAIMTGKHPIHTGM--QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
           SR+A++TG+ PI +GM     VL+   +GGLP +E  + + LKE GY T  +GKWHLG +
Sbjct: 87  SRAALLTGRLPIRSGMCGARRVLFPNSKGGLPPAEITIAEALKEKGYATAQIGKWHLGIH 146

Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAE-----------EMKMWGLDMRRDLEPAWDL 216
                P  +GF+   G    + +  D  A+            +  W + + R+ E     
Sbjct: 147 PGS-RPLDQGFDQSFGL--PYSNDMDARADLPKGSTGSPNPPLDGWNVALLRNGEVVEQP 203

Query: 217 HGKYS-TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
             + + T  +T EA+  I     + P FLY+ H       P+ P+ A           +D
Sbjct: 204 ANQTTLTKRYTEEAIKFI-TEKKNVPFFLYMPH-----TFPHVPMFAS----------QD 247

Query: 276 FKRSKFAAI----LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG--------GAAAGF 323
           FK    A I    + +LD SVG+V+ AL +  +  N+++ F SDNG        G +AG 
Sbjct: 248 FKGKSRAGIYGDAVEELDWSVGQVLGALCREGIAENTLVFFSSDNGPWLIMGDQGGSAGL 307

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
                    L+  K + WEGG+R  G+ W P     G V  Q     D  PT L+ A  S
Sbjct: 308 ---------LKDGKGSTWEGGMRVPGIAWMPSRIKPG-VTSQLASAMDVFPTALALAGAS 357


>gi|160900836|ref|YP_001566418.1| sulfatase [Delftia acidovorans SPH-1]
 gi|160366420|gb|ABX38033.1| sulfatase [Delftia acidovorans SPH-1]
          Length = 457

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 109/346 (31%), Positives = 168/346 (48%), Gaps = 45/346 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIP----TPNIDALAYSGIIL-KNYYTVQLCTPSRSAI 114
           P+I+FI+ADDLG+ D+G +G         +P +D LA  G+ L + Y    +C+P+R A+
Sbjct: 20  PNILFIVADDLGYADLGCYGGRAADFGAVSPVLDRLAAGGLRLTQGYANSPVCSPTRFAL 79

Query: 115 MTGKH----------PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
            T ++          PI++  +   L   E+ GLP     +   L++ GYRT ++GKWHL
Sbjct: 80  ATARYQYRLRGAAEEPINSKTRGTPL--GEKLGLPPDMPTVASMLRDAGYRTALIGKWHL 137

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQDYFDH--SAEEMKMWGLDMRRDLEPAWDLHGKYST 222
           G Y   + P   G+E + G  +G  DYF H  S+ +  +W  +     E        Y T
Sbjct: 138 G-YPPHFGPLRSGYEEYFGPMSGGVDYFTHLSSSGQHDLWVGEEEHHDE-------GYLT 189

Query: 223 DVFTAEAVDIIHNHST-DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR------HIED 275
           D+ +  +VD +H  +  D P FL L     H   P+ P +  D             H++ 
Sbjct: 190 DLLSQRSVDFVHRMAQGDAPFFLSL-----HYTAPHWPWETRDDRSTAEALGAGIAHLDG 244

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
               ++  ++H +DE +G +VEAL     L N++IVF SDNGG         + NWPL G
Sbjct: 245 GNIHQYRRMIHHMDEGIGWIVEALRANGQLDNTLIVFTSDNGGE------RFSDNWPLVG 298

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            K  L EGG+R   +   P + + G  + Q+    DW  T+L AA 
Sbjct: 299 GKMDLTEGGIRVPWIAHWPAVIAPGRSSPQHCMSMDWSATVLDAAG 344


>gi|339998071|ref|YP_004728954.1| secreted sulfatase [Salmonella bongori NCTC 12419]
 gi|339511432|emb|CCC29132.1| putative secreted sulfatase [Salmonella bongori NCTC 12419]
          Length = 497

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 123/415 (29%), Positives = 203/415 (48%), Gaps = 54/415 (13%)

Query: 31  RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
           +T I +   L L  T S + V   A+   P+++FI+ADDLG+ D+  +G   + TPNID 
Sbjct: 3   KTVIASMIGLALCAT-STLSVAHAAAEKRPNLVFIVADDLGYGDLATYGHQIVKTPNIDR 61

Query: 91  LAYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
           LA  G+    YY    LC+PSR+ ++TG+ P  TG++  +  G     +  +E  +   L
Sbjct: 62  LAKEGVKFTEYYAPAPLCSPSRAGMLTGRMPFRTGVRSWIPEGTNV-SIGRNELTIANLL 120

Query: 150 KELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLD 205
           K+ GY T ++GK HL   G    +  P   GF+  L    G   D    +A+E   +G+ 
Sbjct: 121 KQQGYDTAMMGKLHLNAGGDRTDQPQPKELGFDYSLVNPAGFVTDATLDNAKERPRYGV- 179

Query: 206 MRRDLEP-AWDLHGKY-------STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
               + P  W  +GK+       S +  ++E V+ + N   D+P FLY+A    HS    
Sbjct: 180 ----VHPTGWMRNGKHIDRADSISGEFVSSEVVNWLDNKKDDKPFFLYVAFTEVHS---- 231

Query: 258 EPLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEA 298
            PL +P  YL+++ +++ ++++                   ++ A +  +D  VGKV++ 
Sbjct: 232 -PLASPKKYLDMYSQYMSEYQKQHPDLFYGDWADKPWRGTGEYYANISYMDAQVGKVLDK 290

Query: 299 LEQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWS 353
           ++      N+I++F SDNG     A   + LN A     LRG K+ LWEGG+R   +I  
Sbjct: 291 IKAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKY 350

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
                +G+V +  V+  DW+PTL   AN  D  +P       ++++P  E   L+
Sbjct: 351 GKHIPQGVVTDTPVYGLDWMPTL---ANMMDFKLPTDRTYDGQSLVPLLEQKTLK 402


>gi|114798724|ref|YP_760874.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
 gi|114738898|gb|ABI77023.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
          Length = 505

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 98/325 (30%), Positives = 162/325 (49%), Gaps = 41/325 (12%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRS 112
            A+S  P+I+ I  DD+G+ D+G  G     TPN+D LA  G    ++Y    +CTPSR+
Sbjct: 40  AAASEQPNIVLIFVDDMGYADIGSFGSPIARTPNLDRLAMEGQKWTSFYAPAPVCTPSRA 99

Query: 113 AIMTGKHPIHTGM-----QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
            +MTG+  + +GM       +VL+    GGLP SE  + + L++ GY +   GKWH+G +
Sbjct: 100 GLMTGRLAVRSGMAGLVQARHVLFPTSTGGLPQSEVTIAELLQQEGYVSAAFGKWHMG-H 158

Query: 168 KKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA----WDLHGKYSTD 223
             E+ PT  GF+S+ G    + +  +        W +D+    EP     WD+      +
Sbjct: 159 LPEFLPTSHGFQSYFG--IPYSNDMNMPGGGETPWSIDLF--FEPPNIQNWDVPLMQDEE 214

Query: 224 V-------------FTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
           +             +T  A++ +  +H+  +P FLYLAH   H+     PL   + +  +
Sbjct: 215 IIERPADQFTLTQRYTERAIEFMETSHAEGQPFFLYLAHNMPHT-----PLFTSEGFTGV 269

Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
                      +  ++ +LD SVG++V+AL+  ++  N++++F SDNG   A    ++ S
Sbjct: 270 S------AGGAYGDVIEELDWSVGEIVDALKDMKIEKNTLVIFTSDNGPWLA-MKTHSGS 322

Query: 330 NWPLRGVKNTLWEGGVRGAGLIWSP 354
              LR  K T WEGG+R   + W P
Sbjct: 323 AGMLRDGKGTTWEGGMRVPAIFWWP 347


>gi|149177572|ref|ZP_01856174.1| hypothetical protein PM8797T_00157 [Planctomyces maris DSM 8797]
 gi|148843552|gb|EDL57913.1| hypothetical protein PM8797T_00157 [Planctomyces maris DSM 8797]
          Length = 599

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 105/352 (29%), Positives = 176/352 (50%), Gaps = 43/352 (12%)

Query: 45  TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
           TL +     + ++  P+++ I+ DD GW DV  H    I TP  D LA  G   + +Y  
Sbjct: 16  TLILSRGSFLQAAERPNVLLIMTDDQGWGDVRSHDNPLIETPQQDLLASQGARFERFYVS 75

Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
            +C P+RS+++TG++ + TG+ H V  G E   +   E  + +  K  GY+T   GKWH 
Sbjct: 76  PVCAPTRSSLLTGRYSLRTGV-HGVTRGFEN--MRAEETTIAEMFKAAGYKTGAFGKWHN 132

Query: 165 GFYKKEYTPTFRGFESHLGYWTGHQD-YFD----HSAEEMKMWGLDMRRDLEPAWDLHGK 219
           G +   + P  +GF+   G+  GH + YFD    H+ + +K  G                
Sbjct: 133 GRHYPMH-PNGQGFDEFFGFCGGHWNRYFDTNLEHNKQPVKTEG---------------- 175

Query: 220 YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN--IHRHIEDFK 277
           Y TDV T  A+D I   + D+P F Y+ + A HS     P   P+ Y +   ++ ++D  
Sbjct: 176 YITDVLTDRAIDFI-KQNKDQPFFCYVPYNAPHS-----PWIVPEKYWDKYANKGLDDKA 229

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           R  + A++  +D+++G++++ L+  ++  N+I++F++DNG  +  +N N      +RG K
Sbjct: 230 RCAY-AMVECVDDNLGRLMQTLDDLKLSDNTIVLFLTDNGPNSNRYNGN------MRGRK 282

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL---SAANKSDIP 386
            ++ EGG+R    +  P     G V +      D LPTLL   S  N +D P
Sbjct: 283 GSIHEGGIRVPLFVRYPGKIKAGTVVKPIAAHIDILPTLLELCSVENTADQP 334


>gi|449138176|ref|ZP_21773471.1| arylsulfatase [Rhodopirellula europaea 6C]
 gi|448883200|gb|EMB13738.1| arylsulfatase [Rhodopirellula europaea 6C]
          Length = 516

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 110/366 (30%), Positives = 186/366 (50%), Gaps = 50/366 (13%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSA 113
           A++  P+I+ I+ADDLG++++G +G  +I TP +D LA  GI L N+Y+   +C PSR  
Sbjct: 47  ATASRPNIVLIVADDLGYSELGCYGQTKIRTPRLDQLAAEGIKLTNFYSGNAVCAPSRCC 106

Query: 114 IMTGKHPIHTGMQHNV-------------LYGCERGGLPLSEKILPQYLKELGYRTRIVG 160
           +MTGKHP H  +++N              L    +  LP+ E  + +YLK +GYRT   G
Sbjct: 107 LMTGKHPGHAYVRNNGDPKIDPAVREELKLEFPGQYPLPVDEVTIAEYLKSVGYRTGAFG 166

Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD--LHG 218
           KW LG +     P  +GF+   G+   +     H+     +W  +  ++++P  D  LHG
Sbjct: 167 KWGLGHFGTTGDPNEQGFDLFYGF---NCQRHAHNHYPKFLW-RNRVKEVQPGNDRTLHG 222

Query: 219 K-YSTDVFTAEAVDIIHN---HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI- 273
           + YS D F  EA + I       T +P F YL  A  H +     +Q P+  ++ +  + 
Sbjct: 223 ETYSQDQFVNEACEFIRQSVAEDTTQPFFAYLPFAVPHLS-----IQVPEEEVDAYEGVI 277

Query: 274 --EDFK----------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG---- 317
              D++          R+ +AA++ ++DE VG++V+ ++   +  N++I+F SDNG    
Sbjct: 278 EEADYEHHGYLKHPRPRAGYAAMVTRMDEGVGQIVDLIDSMGLGENTLILFTSDNGPTYD 337

Query: 318 --GAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
             G +     ++AS   ++G+K  L EGG+R   +     +   G  ++      D+LPT
Sbjct: 338 RLGGSDSDYFDSASG--MKGLKGQLDEGGIRVPMIARQTGVVPAGRTSDWIGAWWDFLPT 395

Query: 376 LLSAAN 381
           +  AA 
Sbjct: 396 ITDAAG 401


>gi|116620007|ref|YP_822163.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
 gi|116223169|gb|ABJ81878.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
          Length = 443

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 113/371 (30%), Positives = 176/371 (47%), Gaps = 41/371 (11%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           +A +L+   +    ++  P+++ ++ DDLG +D+G+ G   + TP+IDALA  G+  +N+
Sbjct: 9   MAASLAAPLIRAAPAADRPNVLVVVLDDLGCHDLGYLGAADLKTPHIDALAARGLKFRNW 68

Query: 102 YT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKI--LPQYLKELGYRTRI 158
           Y+   +C P+RSAI+TG+ P   G+  N        G  L+  I  L   LK  GY+T  
Sbjct: 69  YSNAPVCAPARSAILTGRFPASAGVPDN--------GPALAHGIPTLASVLKGSGYQTGC 120

Query: 159 VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLH- 217
            GKWHLG    E  PT  GF+S  G+ +G  DY+ H       WG D   DL   W    
Sbjct: 121 FGKWHLG-STDETAPTGHGFDSFYGFHSGCVDYYSHR----FYWG-DNYHDL---WHNRT 171

Query: 218 -----GKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
                G+Y T+    EA   I     + P   Y+A  A     P+ P+ AP  Y     +
Sbjct: 172 EIFEDGRYLTERIADEAAGFI---GRNRPFLGYVAFNA-----PHYPMHAPAQYKARFPN 223

Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAA---AGFN---LN 326
           +   +R  +AA++  +D+ +G++  ALE      N+++ F+ DNG      AG N     
Sbjct: 224 LAP-ERQTYAAMIAAVDDGIGQIQRALETTGAAENTLMFFIGDNGATTEKRAGLNGDFAT 282

Query: 327 AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           A  N   +G K +L++GG+   G +  P    +G   ++     D LPT+  A      P
Sbjct: 283 AGDNGVFKGYKFSLFDGGMHVPGFVSWPAGIRKGGWTDELAMSMDILPTICRATGAPLPP 342

Query: 387 NYVNSTVENII 397
               S + N I
Sbjct: 343 RVDGSDLLNTI 353


>gi|167524284|ref|XP_001746478.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775240|gb|EDQ88865.1| predicted protein [Monosiga brevicollis MX1]
          Length = 520

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 108/360 (30%), Positives = 164/360 (45%), Gaps = 68/360 (18%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQ----IPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           PHI  ++ DD GW +VG+H  ++    + TP IDAL   GI L  +Y    CTP+     
Sbjct: 31  PHIFMVIVDDFGWAEVGYHRSEEYKQDVQTPTIDALVQEGIELNRHYVHYACTPNP---- 86

Query: 116 TGKHPIHTGMQHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
                            C+ G G+P +   + ++L+  GY T  VGKW +G    ++TP 
Sbjct: 87  -----------------CQPGCGMPQNMTGVAEHLQSAGYATHFVGKWDVGMTTPQHTPK 129

Query: 175 FRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL----------------EPAWDLHG 218
            RG+++ L Y+ GH ++    AE    +  + R DL                +PA +L+G
Sbjct: 130 GRGYDTSLNYF-GHGNWMWSEAEWQGSY--NHRSDLPPCEPPACFKDFWETDKPAAELNG 186

Query: 219 -KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
            +Y   +F      I+  H    PLFL  A    H      PLQAP  Y      IE+  
Sbjct: 187 TQYEEAIFRDRVYSILEQHDPSTPLFLTYASKVAHY-----PLQAPIEYQEKFSFIEEPH 241

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG---------------AAAG 322
           R  + A++  LD+++      L+ + M +N+++V  SDNGG                 A 
Sbjct: 242 RRVYHAMVAFLDDNLANFTSLLKSKGMWNNTLMVLTSDNGGYVKSFLGDCDNNTSHGYAC 301

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLL--ESRGIVAEQYVHVSDWLPTLLSAA 380
           FN  A +N+PL+G K +  EGGVR    +    +  E RG   ++ VHV+DW  T  + A
Sbjct: 302 FNGEAGANYPLQGGKYSTLEGGVRVNAFVSGGFVPEERRGTREDEMVHVADWYATFAALA 361


>gi|149198815|ref|ZP_01875857.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
 gi|149138013|gb|EDM26424.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
          Length = 598

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 110/376 (29%), Positives = 181/376 (48%), Gaps = 50/376 (13%)

Query: 44  FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
             L + ++ L A+   P+ I I  DD G+ D+G  G  +I TP ID +A  G    N+Y+
Sbjct: 8   LVLGLAYLHLQATDKKPNFIVIFTDDQGYQDLGCFGSPKIKTPEIDQMAKEGARYTNFYS 67

Query: 104 V-QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKW 162
              +C+ SR+A++TG++P   G+ H V Y     GL  SE  + + LK  GYRT I+GKW
Sbjct: 68  ANAICSASRAALLTGRYPSRNGVFH-VYYPGASQGLKPSEITIAEVLKTAGYRTSIIGKW 126

Query: 163 HLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS---AEEMKMWG---------------- 203
           HLG  + ++ PT +GF+S+ G    +  +       A+++K++G                
Sbjct: 127 HLG-DRNQFLPTNQGFDSYFGIPFSNDMWMSKDLALADDIKLFGGVTVEQIKSGEASKAV 185

Query: 204 --------LDMRRDLEPA-WDLHGKYSTDVFTAEAV-DIIHNHSTDEPLFLYLAHAATHS 253
                   + + RD E   + +   Y T  +T EA+  I  +    +P F+YLA+A    
Sbjct: 186 KGEKRGGKVPLMRDEEVVEYPVDQTYITQRYTDEALKIIKESEKKKQPYFIYLAYAM--- 242

Query: 254 ANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
             P+ PL A   +           R  +   + ++D  VG++++ L+      N++++F 
Sbjct: 243 --PHVPLYASPKFAGKS------ARGPYGDTVEEMDYHVGRILKHLKSSGADKNTLVIFT 294

Query: 314 SDNGGAAAGFNLN--AASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSD 371
           SDNG     +NL     S  PLRG K + +EGG R   ++W P     G  + +     D
Sbjct: 295 SDNG----PWNLGERGGSALPLRGAKFSTYEGGHRVPCVMWWPGTIPAGTDSAEIATTLD 350

Query: 372 WLPTLLSAANKSDIPN 387
           ++PT    AN + +PN
Sbjct: 351 FMPTFAKLAN-AQLPN 365


>gi|170727990|ref|YP_001762016.1| sulfatase [Shewanella woodyi ATCC 51908]
 gi|169813337|gb|ACA87921.1| sulfatase [Shewanella woodyi ATCC 51908]
          Length = 484

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 102/309 (33%), Positives = 157/309 (50%), Gaps = 52/309 (16%)

Query: 62  IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGKHP 120
           ++ I  DDLG  D G +G  Q PTPNID LA SG+   + Y     C PSR+++MTG  P
Sbjct: 40  VVIIYVDDLGIMDTGIYGSAQYPTPNIDKLANSGVRFTQAYANAANCAPSRASLMTGLTP 99

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILP---------------QYLKELGYRTRIVGKWHLG 165
              G+   +  G    G     K++P                  K+ GY T ++GKWHLG
Sbjct: 100 AEHGI---LTVGSSERGESQYRKLIPVTNNTELNPDLTTIADLFKQQGYATAVIGKWHLG 156

Query: 166 FYKKEYTPTFRGFE-----SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
               +  PT  GF+     SHLG+   +  ++ +S  + K+ GL+          L  +Y
Sbjct: 157 ----KTAPTEYGFDTAIAASHLGHPPSY--FYPYSKGKRKLIGLE-------EGGLKDEY 203

Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH---RHIEDFK 277
            ++  T EAV+ I   S  +P FLYL   A H+     P++AP  ++N H   +   + K
Sbjct: 204 LSNRITREAVNYI--SSQRQPFFLYLPFYAVHT-----PIEAPKEWVNQHNARQQAGEIK 256

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
            + +AA++  LD  VGK+++AL++     N+++VF SDNG         A S+ P RG K
Sbjct: 257 SAAYAAMIANLDRDVGKLLQALDKSGQRENTLVVFASDNGAYDP-----ATSSLPYRGYK 311

Query: 338 NTLWEGGVR 346
           ++L+EGG++
Sbjct: 312 SSLFEGGIK 320


>gi|392965484|ref|ZP_10330903.1| sulfatase [Fibrisoma limi BUZ 3]
 gi|387844548|emb|CCH52949.1| sulfatase [Fibrisoma limi BUZ 3]
          Length = 477

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 118/360 (32%), Positives = 177/360 (49%), Gaps = 46/360 (12%)

Query: 55  ASSGP--PHIIFILADDLGWNDVGFH-----GLDQIPTPNIDALAYSGIILKNYYT-VQL 106
           + +GP  P+IIFILADDLG+ D+G +     G   I TPN+D +A  G+    +Y    +
Sbjct: 30  SPAGPRKPNIIFILADDLGYGDLGCYRNTGSGQQTIRTPNLDRMAREGMRFTQHYAGSTV 89

Query: 107 CTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
           C PSRS +MTG   +HTG  H  + G  R  L   +K + +YLK  GY T ++GKW LG 
Sbjct: 90  CAPSRSTLMTG---LHTG--HTRIRGNARVPLQPDDKTVAEYLKGAGYTTALIGKWGLGE 144

Query: 167 YKKEYTPTFRGFESHLGYWT---GHQDYFDHSAEEMKMWGLDMR---------------R 208
                 P  +GF+   GY      H  Y D       +W  + +               +
Sbjct: 145 DGSTGIPRQQGFDYFYGYLNQTHAHNHYPDF------LWRNETKVKLPNEVIYVENGGVK 198

Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQA-----P 263
            +  A     +++ D+FT EA+  I   S  +P FLYL++ + H+ N ++ + A     P
Sbjct: 199 GIGSAATKRVEWTQDLFTNEALQFI-EQSKQQPFFLYLSYVSPHANNEHKLVAAHGIEVP 257

Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
           D+    +    D +++ FAA +  LD  VGKV+  L + ++  N++++F SDNG  A G 
Sbjct: 258 DYGPYANEEFPDVQKA-FAASITYLDAHVGKVLAKLHELKLDDNTLVIFSSDNGPHAEGG 316

Query: 324 NLNAA--SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           N  A   SN  LRG+K  L+EGG+R   L   P     G ++       D+LPT    A 
Sbjct: 317 NDPAYFNSNGGLRGIKRDLYEGGIRVPMLARWPGRIKAGTISNHPSAFWDFLPTACEVAG 376


>gi|440750638|ref|ZP_20929879.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
 gi|436480856|gb|ELP37068.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
          Length = 478

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 109/355 (30%), Positives = 180/355 (50%), Gaps = 26/355 (7%)

Query: 41  PLAFTLSMVFVDLVASSGP------PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           PL + + M+++ L A S P      P++I I  DD+G+ D+G +G + I TPN+D +A +
Sbjct: 9   PLKWGMLMLWISL-AISFPTLAQDRPNVILIFVDDMGYGDLGAYGAEDIATPNLDQMAEN 67

Query: 95  GIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
           G+   N+Y  Q +C+ SR++++TG +    G+ H  L    + GL   E  + + LKE G
Sbjct: 68  GVKFTNFYVSQAVCSASRASLLTGTYANRLGI-HGALDHTAKHGLNPEEITIAEMLKEQG 126

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y+T + GKWHLG +   + PT +GF+   G    +  +  H   +     L +  +    
Sbjct: 127 YQTAMFGKWHLG-HHPPFLPTKQGFDEFYGIPYSNDMWPQHPQTKNYYPPLPLFFNDNVV 185

Query: 214 WDLHGKYS--TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
             LH   S  T   T  AV+ I     D+P FLYLAH       P+ PL   + +     
Sbjct: 186 DTLHNDQSRLTTELTERAVNFIQRKK-DQPFFLYLAHPM-----PHVPLFVSEKFRG--- 236

Query: 272 HIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
                KR  +  +L ++D SVG++++ L    +   ++++F+SDNG   + F+ +A S +
Sbjct: 237 ---KSKRGLYGDVLMEIDWSVGQILKTLRDNGLEEKTLVIFISDNGPWLS-FSGHAGSAY 292

Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
           PLR  K T W+GGV+   L++       G V        D LPT ++   ++D+P
Sbjct: 293 PLREGKGTSWDGGVKVPALMYWKGKIPAGTVQTNPAMTIDMLPT-IAKLTRADLP 346


>gi|149174691|ref|ZP_01853316.1| hypothetical protein PM8797T_26185 [Planctomyces maris DSM 8797]
 gi|148846385|gb|EDL60723.1| hypothetical protein PM8797T_26185 [Planctomyces maris DSM 8797]
          Length = 600

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 97/350 (27%), Positives = 177/350 (50%), Gaps = 29/350 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+II ++ DD G+ D    G  +I TP I  LA  G+    +Y   +C P+R+ +MTG+H
Sbjct: 35  PNIILVMTDDQGYWDTEISGNPKIKTPTIKKLAAEGVTFTRFYANMVCAPTRAGLMTGRH 94

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
            + TG+ +N  +G +   L  +E  + Q L++ GY+T + GKWHLG Y  +Y P  RGF+
Sbjct: 95  YLRTGL-YNTRFGGDT--LGPNETTIAQVLQKAGYKTGLFGKWHLGRY-AQYQPQRRGFD 150

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
              G++ GH + +  + +++ + G  +             Y TD+FT  A+D I  +   
Sbjct: 151 HFFGHYHGHIERYT-NPDQVVVNGTPVETR---------GYVTDLFTDAAIDFIQRNQ-Q 199

Query: 240 EPLFLYLAHAATHSANPYEP--LQAPDHYLNIHRHIED---FKRSKFAAILHKLDESVGK 294
           +P F YLA+ A HS    +      P+    I +++      + ++  A++ ++D+++ +
Sbjct: 200 QPFFCYLAYNAPHSPFLLDTSHFGQPEGDKLIEKYLAKGLPLREARIYAMIERIDQNLSR 259

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI-WS 353
           +++ +   ++   ++++F SDNGG + GF         L+G K + +EGG R   ++ W+
Sbjct: 260 LLQTVHDLKLDQETVVIFTSDNGGVSRGFKAG------LKGSKASAYEGGTRVPFVVRWT 313

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
               + G   +  V  +D  PT    A    +P+ V    E+I+   E  
Sbjct: 314 DHFPA-GKTTDAMVAQTDLFPTFCQLAGVP-VPSNVKLDGESILSLMEQG 361


>gi|32475708|ref|NP_868702.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
 gi|32446250|emb|CAD76079.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
          Length = 667

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 161/337 (47%), Gaps = 53/337 (15%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
           P+++ +L DD GW D+  HG   + TP+ID+LA  G+ +KN+Y   +C+P+R+  +TG++
Sbjct: 93  PNVLVVLTDDQGWGDLSLHGNPNLQTPHIDSLARDGVQIKNFYVCAVCSPTRAEFLTGRY 152

Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE 179
              +G+  +   G ER    LSE+ +    +  GYRT   GKWH G  +  Y P  RGF+
Sbjct: 153 HTRSGV-FSTSAGGER--FDLSERTIGDAFQAAGYRTAAFGKWHSGM-QAPYHPNARGFD 208

Query: 180 SHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGK------YSTDVFTAEAVDII 233
              G+ +GH             WG        P  +L+G+      +  D  T  A+D +
Sbjct: 209 EFYGFCSGH-------------WG----NYFSPMLELNGEIVKGDGFIVDDLTQHAIDFM 251

Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDH-YLNIH-RHI-----------EDFKRSK 280
                + P F+YL     HS     P+Q PD  + N   + I           ED + ++
Sbjct: 252 E-RDRENPFFIYLPLNTPHS-----PMQVPDEDWQNFEGKEIVPDPRPENAKKEDVQHTR 305

Query: 281 FA-AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNT 339
            A A+   +D++VG++++ALE+  +  N+I+VF  DNG   + FN        LRG K  
Sbjct: 306 AALALCENIDDNVGQLLDALERLSLSENTIVVFFCDNGPNGSRFNGG------LRGRKGA 359

Query: 340 LWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           + EGG+R   LI  P     G          D  PTL
Sbjct: 360 VHEGGLRSPCLIRYPSKIPAGQTVGGIAGAIDLFPTL 396


>gi|417306372|ref|ZP_12093283.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
 gi|327537358|gb|EGF24091.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
          Length = 606

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 105/340 (30%), Positives = 167/340 (49%), Gaps = 34/340 (10%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
            AS   P++I ++ DD G+ D+  HG   + TPN+D LA   + L+N++    CTP+R+A
Sbjct: 41  TASESRPNVILVVTDDQGYGDMSCHGNPWLNTPNLDRLATQSVRLENFHVDPFCTPTRAA 100

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           +MTG++    G    V  G  R  L   E  + +  +E GYRT + GKWHLG     + P
Sbjct: 101 LMTGRYCTRVGAW-AVTEG--RQLLDPDETTMAEIFRESGYRTGMFGKWHLG-DPPPFAP 156

Query: 174 TFRGFESHLGYWTGHQ---------DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
             RGFE+ + +  G           DYFD +           R     ++D    Y TD+
Sbjct: 157 RDRGFETVVRHMAGGADEIGNPTGNDYFDDTY---------YRNGTPESFD---GYCTDI 204

Query: 225 FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN-IHRHIEDFKRSKFAA 283
           +  EA+D I   S ++P F+Y+   A HS     P    D Y +   R   + +R+ F  
Sbjct: 205 WFDEAIDFIQKES-EQPFFVYIPTNAMHS-----PYLVADRYSDPFKRQGIEPQRAAFYG 258

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA--AAGFNLNAASNWPLRGVKNTLW 341
           ++   DE++G++++ L+Q  +  N++++F+SDNG A  A+  N     N  +RG K +++
Sbjct: 259 MIQNFDENLGRLLKRLDQDNLRDNTLLIFMSDNGTAQGASEQNRKVGFNAGMRGKKGSVY 318

Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           EGG R       P         +Q     DWLPTL+   +
Sbjct: 319 EGGHRVPCFASWPAKWDGNRPVDQLTCHRDWLPTLIELCD 358


>gi|16763425|ref|NP_459040.1| arylsulfatase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. LT2]
 gi|167989888|ref|ZP_02570988.1| sulfatase [Salmonella enterica subsp. enterica serovar 4,[5],12:i:-
           str. CVM23701]
 gi|168262300|ref|ZP_02684273.1| sulfatase [Salmonella enterica subsp. enterica serovar Hadar str.
           RI_05P066]
 gi|168464434|ref|ZP_02698337.1| sulfatase [Salmonella enterica subsp. enterica serovar Newport str.
           SL317]
 gi|194443287|ref|YP_002039265.1| sulfatase [Salmonella enterica subsp. enterica serovar Newport str.
           SL254]
 gi|197265267|ref|ZP_03165341.1| sulfatase [Salmonella enterica subsp. enterica serovar Saintpaul
           str. SARA23]
 gi|205351379|ref|YP_002225180.1| sulfatase [Salmonella enterica subsp. enterica serovar Gallinarum
           str. 287/91]
 gi|207855551|ref|YP_002242202.1| secreted sulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. P125109]
 gi|374982441|ref|ZP_09723762.1| Arylsulfatase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. TN061786]
 gi|375122149|ref|ZP_09767313.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Gallinarum str. SG9]
 gi|378443490|ref|YP_005231122.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Typhimurium str. D23580]
 gi|378448318|ref|YP_005235677.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. 14028S]
 gi|378698019|ref|YP_005179976.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Typhimurium str. SL1344]
 gi|378982577|ref|YP_005245732.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Typhimurium str. T000240]
 gi|378987440|ref|YP_005250604.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. UK-1]
 gi|379699254|ref|YP_005240982.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. ST4/74]
 gi|418761586|ref|ZP_13317728.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35185]
 gi|418767351|ref|ZP_13323415.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35199]
 gi|418770519|ref|ZP_13326540.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21539]
 gi|418776596|ref|ZP_13332538.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 33953]
 gi|418778881|ref|ZP_13334789.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35188]
 gi|418785063|ref|ZP_13340896.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21559]
 gi|418803046|ref|ZP_13358671.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35202]
 gi|418805727|ref|ZP_13361305.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21550]
 gi|418810088|ref|ZP_13365629.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22513]
 gi|418816483|ref|ZP_13371975.1| secreted sulfatase [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 21538]
 gi|418820794|ref|ZP_13376225.1| secreted sulfatase [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 22425]
 gi|418830698|ref|ZP_13385659.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N18486]
 gi|418834225|ref|ZP_13389136.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N1543]
 gi|418842721|ref|ZP_13397530.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21554]
 gi|418843982|ref|ZP_13398777.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19443]
 gi|418849810|ref|ZP_13404532.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 37978]
 gi|418852784|ref|ZP_13407480.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19593]
 gi|418859340|ref|ZP_13413945.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19470]
 gi|418861375|ref|ZP_13415934.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19536]
 gi|419790263|ref|ZP_14315937.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. Levine 1]
 gi|419794733|ref|ZP_14320341.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. Levine 15]
 gi|421356939|ref|ZP_15807254.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 622731-39]
 gi|421363662|ref|ZP_15813903.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 639016-6]
 gi|421365681|ref|ZP_15815892.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 640631]
 gi|421371888|ref|ZP_15822045.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 77-0424]
 gi|421374966|ref|ZP_15825082.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 607307-6]
 gi|421379270|ref|ZP_15829341.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 485549-17]
 gi|421384252|ref|ZP_15834279.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 596866-22]
 gi|421391480|ref|ZP_15841447.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 596866-70]
 gi|421394207|ref|ZP_15844150.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 629164-26]
 gi|421399851|ref|ZP_15849744.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 629164-37]
 gi|421402263|ref|ZP_15852122.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 639672-46]
 gi|421406273|ref|ZP_15856089.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 639672-50]
 gi|421410732|ref|ZP_15860505.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 77-1427]
 gi|421415823|ref|ZP_15865545.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 77-2659]
 gi|421420220|ref|ZP_15869898.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 78-1757]
 gi|421424731|ref|ZP_15874371.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 22510-1]
 gi|421429251|ref|ZP_15878850.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 8b-1]
 gi|421437106|ref|ZP_15886631.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648905 5-18]
 gi|421438407|ref|ZP_15887905.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648901 6-18]
 gi|421442608|ref|ZP_15892056.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 50-3079]
 gi|422024148|ref|ZP_16370644.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm1]
 gi|422029150|ref|ZP_16375427.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm2]
 gi|427542722|ref|ZP_18925933.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm8]
 gi|427576467|ref|ZP_18935084.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm9]
 gi|427597934|ref|ZP_18940002.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm3]
 gi|427616845|ref|ZP_18943594.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm4]
 gi|427624819|ref|ZP_18945198.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm6]
 gi|427658888|ref|ZP_18954487.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm10]
 gi|427659234|ref|ZP_18954812.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm11]
 gi|427664472|ref|ZP_18959672.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm12]
 gi|427686227|ref|ZP_18964513.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm5]
 gi|436640158|ref|ZP_20516274.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 22704]
 gi|436650106|ref|ZP_20516727.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SE30663]
 gi|436794434|ref|ZP_20522137.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CHS44]
 gi|436805911|ref|ZP_20526431.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1882]
 gi|436816887|ref|ZP_20534074.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1884]
 gi|436846224|ref|ZP_20539155.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1594]
 gi|436848790|ref|ZP_20540259.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1566]
 gi|436856562|ref|ZP_20545656.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1580]
 gi|436863842|ref|ZP_20550061.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1543]
 gi|436873421|ref|ZP_20556178.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1441]
 gi|436880834|ref|ZP_20560453.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1810]
 gi|436890643|ref|ZP_20565921.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1558]
 gi|436892808|ref|ZP_20566935.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1018]
 gi|436900219|ref|ZP_20571299.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1010]
 gi|436908107|ref|ZP_20575654.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1729]
 gi|436916454|ref|ZP_20580301.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_0895]
 gi|436926456|ref|ZP_20586410.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_0899]
 gi|436935208|ref|ZP_20590758.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1457]
 gi|436939569|ref|ZP_20593875.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1747]
 gi|436954674|ref|ZP_20602038.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_0968]
 gi|436964862|ref|ZP_20606436.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1444]
 gi|436968632|ref|ZP_20607873.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1445]
 gi|436976668|ref|ZP_20611978.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1559]
 gi|436988995|ref|ZP_20616351.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1565]
 gi|437002748|ref|ZP_20621366.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1808]
 gi|437015410|ref|ZP_20625696.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1811]
 gi|437034454|ref|ZP_20632968.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_0956]
 gi|437040513|ref|ZP_20634762.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1455]
 gi|437052769|ref|ZP_20642171.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1575]
 gi|437057662|ref|ZP_20644584.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1725]
 gi|437063227|ref|ZP_20647902.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1745]
 gi|437072173|ref|ZP_20652345.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1791]
 gi|437084625|ref|ZP_20659790.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1795]
 gi|437092595|ref|ZP_20663791.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 576709]
 gi|437100632|ref|ZP_20666012.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 635290-58]
 gi|437123361|ref|ZP_20672971.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 607308-16]
 gi|437132890|ref|ZP_20678289.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 607308-19]
 gi|437136026|ref|ZP_20679576.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 607307-2]
 gi|437142727|ref|ZP_20683786.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 607308-9]
 gi|437154122|ref|ZP_20690948.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 629163]
 gi|437160648|ref|ZP_20694808.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SE15-1]
 gi|437172431|ref|ZP_20701043.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CVM_N202]
 gi|437177812|ref|ZP_20704256.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CVM_56-3991]
 gi|437183316|ref|ZP_20707668.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CVM_76-3618]
 gi|437197348|ref|ZP_20711192.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 13183-1]
 gi|437262676|ref|ZP_20719065.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CVM_81-2490]
 gi|437268696|ref|ZP_20722148.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SL909]
 gi|437280533|ref|ZP_20727963.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SL913]
 gi|437291929|ref|ZP_20731771.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CVM_69-4941]
 gi|437312720|ref|ZP_20736666.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 638970-15]
 gi|437323311|ref|ZP_20739275.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 17927]
 gi|437335206|ref|ZP_20742712.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CHS4]
 gi|437387395|ref|ZP_20750868.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 22558]
 gi|437410148|ref|ZP_20752711.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 543463 22-17]
 gi|437447830|ref|ZP_20759096.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 543463 40-18]
 gi|437465442|ref|ZP_20764008.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 561362 1-1]
 gi|437475322|ref|ZP_20766495.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 642044 4-1]
 gi|437489818|ref|ZP_20770601.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 642046 4-7]
 gi|437508578|ref|ZP_20776377.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648898 4-5]
 gi|437524653|ref|ZP_20779474.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648899 3-17]
 gi|437559350|ref|ZP_20785766.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648900 1-16]
 gi|437569968|ref|ZP_20788215.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648901 1-17]
 gi|437596182|ref|ZP_20796230.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648901 39-2]
 gi|437610477|ref|ZP_20800788.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648902 6-8]
 gi|437618677|ref|ZP_20803290.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648903 1-6]
 gi|437649574|ref|ZP_20809504.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648904 3-6]
 gi|437664336|ref|ZP_20814294.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 653049 13-19]
 gi|437678211|ref|ZP_20817621.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 642044 8-1]
 gi|437691463|ref|ZP_20820806.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 561362 9-7]
 gi|437706009|ref|ZP_20825211.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 543463 42-20]
 gi|437732765|ref|ZP_20831769.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648901 16-16]
 gi|437796549|ref|ZP_20837611.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 76-2651]
 gi|437803845|ref|ZP_20838659.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 33944]
 gi|437929892|ref|ZP_20851095.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 6.0562-1]
 gi|438018315|ref|ZP_20854794.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 50-5646]
 gi|438085126|ref|ZP_20858588.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 81-2625]
 gi|438103369|ref|ZP_20865284.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 62-1976]
 gi|438113035|ref|ZP_20869443.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 53-407]
 gi|445128077|ref|ZP_21380069.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Gallinarum str. 9184]
 gi|445167160|ref|ZP_21394296.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SE8a]
 gi|445182501|ref|ZP_21398565.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 20037]
 gi|445231003|ref|ZP_21405623.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SE10]
 gi|445246091|ref|ZP_21408258.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 436]
 gi|445339825|ref|ZP_21416472.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 18569]
 gi|445344607|ref|ZP_21417722.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 13-1]
 gi|445357353|ref|ZP_21422183.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. PT23]
 gi|16418530|gb|AAL18999.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. LT2]
 gi|194401950|gb|ACF62172.1| sulfatase [Salmonella enterica subsp. enterica serovar Newport str.
           SL254]
 gi|195632839|gb|EDX51293.1| sulfatase [Salmonella enterica subsp. enterica serovar Newport str.
           SL317]
 gi|197243522|gb|EDY26142.1| sulfatase [Salmonella enterica subsp. enterica serovar Saintpaul
           str. SARA23]
 gi|205271160|emb|CAR35946.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Gallinarum str. 287/91]
 gi|205331405|gb|EDZ18169.1| sulfatase [Salmonella enterica subsp. enterica serovar 4,[5],12:i:-
           str. CVM23701]
 gi|205348763|gb|EDZ35394.1| sulfatase [Salmonella enterica subsp. enterica serovar Hadar str.
           RI_05P066]
 gi|206707354|emb|CAR31626.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Enteritidis str. P125109]
 gi|261245269|emb|CBG23054.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Typhimurium str. D23580]
 gi|267991696|gb|ACY86581.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. 14028S]
 gi|301156667|emb|CBW16137.1| hypothetical secreted sulfatase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. SL1344]
 gi|312911005|dbj|BAJ34979.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Typhimurium str. T000240]
 gi|321222395|gb|EFX47467.1| Arylsulfatase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. TN061786]
 gi|323128353|gb|ADX15783.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. ST4/74]
 gi|326626399|gb|EGE32742.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Gallinarum str. SG9]
 gi|332986987|gb|AEF05970.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. UK-1]
 gi|392613900|gb|EIW96352.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. Levine 1]
 gi|392614000|gb|EIW96451.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. Levine 15]
 gi|392734643|gb|EIZ91824.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35199]
 gi|392737113|gb|EIZ94274.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21539]
 gi|392739935|gb|EIZ97063.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35185]
 gi|392745782|gb|EJA02805.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 33953]
 gi|392752160|gb|EJA09101.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21559]
 gi|392755088|gb|EJA12003.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35188]
 gi|392776443|gb|EJA33131.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35202]
 gi|392782199|gb|EJA38836.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22513]
 gi|392783954|gb|EJA40563.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21550]
 gi|392789844|gb|EJA46346.1| secreted sulfatase [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 21538]
 gi|392791532|gb|EJA48001.1| secreted sulfatase [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 22425]
 gi|392801388|gb|EJA57616.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N18486]
 gi|392806133|gb|EJA62248.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N1543]
 gi|392806434|gb|EJA62532.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21554]
 gi|392816031|gb|EJA71962.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19443]
 gi|392819884|gb|EJA75741.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 37978]
 gi|392828027|gb|EJA83724.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19593]
 gi|392830508|gb|EJA86157.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19470]
 gi|392838672|gb|EJA94226.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19536]
 gi|395985903|gb|EJH95068.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 639016-6]
 gi|395992836|gb|EJI01947.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 622731-39]
 gi|395992876|gb|EJI01986.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 640631]
 gi|396002412|gb|EJI11404.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 77-0424]
 gi|396006135|gb|EJI15105.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 607307-6]
 gi|396007260|gb|EJI16218.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 485549-17]
 gi|396011888|gb|EJI20794.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 596866-70]
 gi|396016552|gb|EJI25420.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 629164-26]
 gi|396019886|gb|EJI28736.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 596866-22]
 gi|396026440|gb|EJI35208.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 629164-37]
 gi|396032547|gb|EJI41269.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 639672-46]
 gi|396034345|gb|EJI43045.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 639672-50]
 gi|396045337|gb|EJI53930.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 78-1757]
 gi|396046434|gb|EJI55019.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 77-2659]
 gi|396047823|gb|EJI56391.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 77-1427]
 gi|396051564|gb|EJI60081.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648905 5-18]
 gi|396059533|gb|EJI67987.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 8b-1]
 gi|396059688|gb|EJI68141.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 22510-1]
 gi|396072743|gb|EJI81051.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 50-3079]
 gi|396073577|gb|EJI81878.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648901 6-18]
 gi|414024521|gb|EKT07894.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm1]
 gi|414024756|gb|EKT08114.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm8]
 gi|414026042|gb|EKT09326.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm9]
 gi|414026640|gb|EKT09905.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm3]
 gi|414026814|gb|EKT10072.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm2]
 gi|414033200|gb|EKT16164.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm4]
 gi|414040786|gb|EKT23388.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm10]
 gi|414054826|gb|EKT36758.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm6]
 gi|414058843|gb|EKT40475.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm11]
 gi|414062440|gb|EKT43756.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm12]
 gi|414067999|gb|EKT48231.1| sulfatase [Salmonella enterica subsp. enterica serovar Typhimurium
           str. STm5]
 gi|434956385|gb|ELL50116.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 22704]
 gi|434962770|gb|ELL55931.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CHS44]
 gi|434969924|gb|ELL62598.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1882]
 gi|434971942|gb|ELL64435.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1884]
 gi|434977871|gb|ELL69949.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1594]
 gi|434989218|gb|ELL80783.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1566]
 gi|434991336|gb|ELL82844.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1580]
 gi|434996839|gb|ELL88134.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1543]
 gi|434998963|gb|ELL90175.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1441]
 gi|435004802|gb|ELL95751.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1810]
 gi|435006463|gb|ELL97358.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1558]
 gi|435020507|gb|ELM10911.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1018]
 gi|435022546|gb|ELM12863.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1010]
 gi|435028094|gb|ELM18178.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SE30663]
 gi|435030435|gb|ELM20453.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1729]
 gi|435032943|gb|ELM22860.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_0895]
 gi|435037632|gb|ELM27436.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_0899]
 gi|435040243|gb|ELM30010.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1457]
 gi|435047063|gb|ELM36665.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_0968]
 gi|435047681|gb|ELM37255.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1444]
 gi|435047836|gb|ELM37409.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1747]
 gi|435060074|gb|ELM49346.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1445]
 gi|435071554|gb|ELM60496.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1559]
 gi|435071746|gb|ELM60685.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1565]
 gi|435073758|gb|ELM62614.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1808]
 gi|435075956|gb|ELM64753.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_0956]
 gi|435084088|gb|ELM72675.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1811]
 gi|435087491|gb|ELM75998.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1455]
 gi|435089940|gb|ELM78345.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1575]
 gi|435095344|gb|ELM83658.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1725]
 gi|435102209|gb|ELM90314.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1745]
 gi|435106059|gb|ELM94085.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1795]
 gi|435110360|gb|ELM98278.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CDC_2010K_1791]
 gi|435111566|gb|ELM99455.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 576709]
 gi|435121426|gb|ELN08966.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 607308-19]
 gi|435121651|gb|ELN09184.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 607308-16]
 gi|435124407|gb|ELN11864.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 635290-58]
 gi|435136762|gb|ELN23835.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 607307-2]
 gi|435140179|gb|ELN27143.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 629163]
 gi|435142032|gb|ELN28957.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 607308-9]
 gi|435144796|gb|ELN31626.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CVM_N202]
 gi|435147039|gb|ELN33820.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SE15-1]
 gi|435154272|gb|ELN40858.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CVM_56-3991]
 gi|435161856|gb|ELN48070.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CVM_81-2490]
 gi|435162865|gb|ELN49018.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CVM_76-3618]
 gi|435170680|gb|ELN56427.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SL913]
 gi|435173991|gb|ELN59458.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SL909]
 gi|435181325|gb|ELN66397.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CVM_69-4941]
 gi|435182734|gb|ELN67731.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 638970-15]
 gi|435195043|gb|ELN79457.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 17927]
 gi|435198125|gb|ELN82350.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. CHS4]
 gi|435200940|gb|ELN84899.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 22558]
 gi|435204656|gb|ELN88327.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 543463 22-17]
 gi|435209441|gb|ELN92763.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 543463 40-18]
 gi|435216872|gb|ELN99344.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 561362 1-1]
 gi|435223188|gb|ELO05222.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 642044 4-1]
 gi|435224957|gb|ELO06892.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 13183-1]
 gi|435231467|gb|ELO12719.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 642046 4-7]
 gi|435233327|gb|ELO14369.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648898 4-5]
 gi|435239771|gb|ELO20260.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648900 1-16]
 gi|435248791|gb|ELO28642.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648899 3-17]
 gi|435249895|gb|ELO29655.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648901 1-17]
 gi|435250092|gb|ELO29835.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648901 39-2]
 gi|435251907|gb|ELO31504.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648902 6-8]
 gi|435259296|gb|ELO38525.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648903 1-6]
 gi|435265810|gb|ELO44608.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 653049 13-19]
 gi|435268863|gb|ELO47438.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648904 3-6]
 gi|435273656|gb|ELO51918.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 642044 8-1]
 gi|435280834|gb|ELO58522.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 561362 9-7]
 gi|435287413|gb|ELO64612.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 648901 16-16]
 gi|435290524|gb|ELO67441.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 76-2651]
 gi|435291871|gb|ELO68661.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 543463 42-20]
 gi|435303211|gb|ELO79123.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 33944]
 gi|435312120|gb|ELO86107.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 6.0562-1]
 gi|435319123|gb|ELO91977.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 81-2625]
 gi|435323006|gb|ELO95175.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 62-1976]
 gi|435329881|gb|ELP01179.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 53-407]
 gi|435333700|gb|ELP04473.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 50-5646]
 gi|444855729|gb|ELX80774.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Gallinarum str. 9184]
 gi|444863770|gb|ELX88586.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SE10]
 gi|444865716|gb|ELX90481.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SE8a]
 gi|444871319|gb|ELX95760.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 20037]
 gi|444872305|gb|ELX96662.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 18569]
 gi|444880005|gb|ELY04090.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 13-1]
 gi|444886638|gb|ELY10387.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. PT23]
 gi|444890351|gb|ELY13694.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 436]
          Length = 497

 Score =  150 bits (378), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 122/414 (29%), Positives = 202/414 (48%), Gaps = 54/414 (13%)

Query: 33  RIMAFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           R +  +++ LA     V     A++   P+++ ILADDLG+ D+  +G   + TPNID L
Sbjct: 3   RTVVASMIGLALCAGCVLSTAQAATAKRPNLVIILADDLGYGDLATYGHRIVKTPNIDKL 62

Query: 92  AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           A  G+   +YY    LC+PSR+ ++TG+ P  TG++  +  G +   L  +E  +   LK
Sbjct: 63  AQEGVKFTDYYAPAPLCSPSRAGLLTGRMPFRTGIRSWIPEGKDV-ALGRNELTIANLLK 121

Query: 151 ELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDM 206
           + GY T ++GK HL   G    +      GF+  L    G   D    +A+E   +G+  
Sbjct: 122 QQGYDTAMMGKLHLNAGGDRTDQPQAKDMGFDYTLVNPAGFVTDATLDNAKERPRYGV-- 179

Query: 207 RRDLEP-AWDLHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
              + P  W  +G       K S +  ++E V+ + N   D P FLY+A    HS     
Sbjct: 180 ---VHPTGWIRNGQHIGRADKMSGEFVSSEVVNWLDNKKDDNPFFLYVAFTEVHS----- 231

Query: 259 PLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEAL 299
           PL +P  YL+++ +++ D+++                   ++ A +  +DE VGKV++ +
Sbjct: 232 PLASPKKYLDMYSQYMTDYQKQHPDLFYGDWADKPWRGTGEYYANISYMDEQVGKVLDKI 291

Query: 300 EQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSP 354
           +      N+I++F SDNG     A   + LN A     LRG K+ LWEGG+R   +I   
Sbjct: 292 KAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKYG 351

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
               +G+V +  V+  DWLPTL   AN  D  +P       ++++P  ++  L+
Sbjct: 352 KHIPQGMVTDTPVYGLDWLPTL---ANMMDFKLPTDRTYDGQSLVPLLKDKTLK 402


>gi|189230138|ref|NP_001121393.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Xenopus (Silurana)
           tropicalis]
 gi|158253505|gb|AAI54124.1| Galns protein [Danio rerio]
 gi|183986312|gb|AAI66093.1| LOC100158481 protein [Xenopus (Silurana) tropicalis]
          Length = 488

 Score =  150 bits (378), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 102/334 (30%), Positives = 165/334 (49%), Gaps = 27/334 (8%)

Query: 65  ILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHT 123
           +L DD+GW D+G  G     TP +D +A  G++  N+YT   LC+PSR+A++TG+ P+  
Sbjct: 1   MLMDDMGWGDLGVFGEPSKETPYLDLMAAQGMLFPNFYTANPLCSPSRAALLTGRLPVRN 60

Query: 124 GMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           G      +           GG+   E +LP++LK   Y ++IVGKWHLG ++ +Y P   
Sbjct: 61  GFYTTNAHARNAYTPQEIVGGISADEILLPEHLKNKHYISKIVGKWHLG-HRTQYLPLKH 119

Query: 177 GFESHLGYWTGH-QDYFDHSAEEMKMWG-LDMRRDLEPAWDLHGKYS----TDVFTAEAV 230
           GF+   G    H   Y D S   + ++   +M+      ++++ K      T ++  E +
Sbjct: 120 GFDEWFGAPNCHFGPYNDSSRPNIPVYNNSEMKGRYYEEFEINVKTGESNLTQLYLKEGL 179

Query: 231 DIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
           D I   +  + P FLY A  ATH+     P+ A   +L         +R ++   + +LD
Sbjct: 180 DFISQQAMAQRPFFLYWAPDATHA-----PVYASKPFLG------KSQRGRYGDAVMELD 228

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           +S+G+++  L    + +++++ F SDNG A     L + SN P    K T +EGG+R   
Sbjct: 229 DSIGQILAHLVSLGIQNDTLVFFTSDNGAALMSGPLQSGSNGPFLCGKETTFEGGMREPA 288

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           + W P     G V+ Q   V D   T LS A  S
Sbjct: 289 IAWWPGQIPAGTVSHQLASVMDLFSTSLSVAGVS 322


>gi|386381787|ref|ZP_10067486.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
 gi|385670744|gb|EIF93788.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
          Length = 426

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 108/347 (31%), Positives = 167/347 (48%), Gaps = 38/347 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+++FIL DDLGW D+  +G  +I TPN+D LA  G+   + Y+    C+P+R ++ TG+
Sbjct: 12  PNVLFILGDDLGWADLSSYGAPEIRTPNLDRLARQGVRFTDAYSGSATCSPTRFSLYTGR 71

Query: 119 HPIHT--GMQHNVLYGCERG-GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           +P  T  G+   +    +R  GL  +   L   L++ GYRT ++GKWH G Y  +Y+PT 
Sbjct: 72  YPGRTKGGLAEPI---ADRSVGLEPTHPTLASLLRDAGYRTALIGKWHCG-YLPDYSPTR 127

Query: 176 RGFESHLGYWTGHQDYFD--HSAEEMKMWGLDMR-RDLEPAWDLHGKYSTDVFTAEAVDI 232
            G+E   G + G  +Y+    S  E  ++  D   RDL        +Y T + T  AV+ 
Sbjct: 128 SGWEEFFGNFGGALEYYSKLSSGGEYDLYEGDAEYRDL--------RYYTRILTERAVEY 179

Query: 233 IHNHSTDEPLFLYLAHAATH-----------SANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
           +       P  L L     H           SA     ++A D     H+  +    +K+
Sbjct: 180 VGRDHGRTPWLLNLNFTTPHWPWIADGDEETSAEIVRRIKAGDRRALWHQ--DGGSLAKY 237

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLW 341
             ++  LD SVGKV+ AL++     ++++ F SDNGG    +      NWPL G K +L 
Sbjct: 238 RQLVEDLDRSVGKVLAALKRSGQEEDTLVFFASDNGGERFSY------NWPLSGNKGSLL 291

Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
           +GG+R   ++  P     G V+   V+  DW  TLL        P Y
Sbjct: 292 DGGIRVPSILRWPARIEGGQVSGVPVYSPDWTATLLDVGGARPHPAY 338


>gi|407940996|ref|YP_006856637.1| sulfatase [Acidovorax sp. KKS102]
 gi|407898790|gb|AFU47999.1| sulfatase [Acidovorax sp. KKS102]
          Length = 444

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 110/355 (30%), Positives = 175/355 (49%), Gaps = 49/355 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIP----TPNIDALAYSGIIL-KNYYTVQLCTPSRSAI 114
           P+IIFI+ADDLG+ D+G +G  +      +P +D LA +G+ L + Y    +C+P+R A+
Sbjct: 7   PNIIFIVADDLGYADLGCYGGREAAFGPVSPVLDQLAANGLKLTQGYSNSPVCSPTRFAL 66

Query: 115 MTGKHPIHT-GMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
           MT ++     G     +    RG    GLP     LP  L+  GYRT ++GKWHLG Y  
Sbjct: 67  MTARYQYRLRGAAEEPINSKSRGSTTLGLPPEHPTLPSLLRASGYRTALMGKWHLG-YPP 125

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDH----SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
            + P   G+E   G  +G  DYF H     A ++ +   + ++D          Y TD+ 
Sbjct: 126 AFGPLRSGYEEFFGPMSGGVDYFTHCSSNGAHDLYLGEEEKQQD---------GYLTDLI 176

Query: 226 TAEAVDIIHNHS----TDEPLFLYLAHAATHSANPYE-------PLQAPDHYLNIH-RHI 273
           T  A+D +   +       P FL L + A H   P+E         +   +  ++H  +I
Sbjct: 177 TDHALDYVQRMAPGAQAGTPFFLSLHYTAPHW--PWETRDDEALAQEVKTNLFHLHGGNI 234

Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
           E ++R     ++H +DE +G+++  LE + +  ++++VF SDNGG         + NWPL
Sbjct: 235 ETYRR-----MIHHMDEGIGRIMALLEAKGLARDTLVVFTSDNGGE------RFSDNWPL 283

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
            G K  L EGG+R   +   P + + G V+ Q     DW  T+L AA  +   +Y
Sbjct: 284 VGGKMDLTEGGIRVPWIAHWPAVIAPGGVSAQTCMTMDWSATMLDAAGAATHTDY 338


>gi|418787538|ref|ZP_13343339.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19447]
 gi|418792641|ref|ZP_13348381.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19449]
 gi|418796450|ref|ZP_13352142.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19567]
 gi|392765795|gb|EJA22579.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19447]
 gi|392766594|gb|EJA23367.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19449]
 gi|392771319|gb|EJA28040.1| putative secreted sulfatase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19567]
          Length = 497

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 122/414 (29%), Positives = 202/414 (48%), Gaps = 54/414 (13%)

Query: 33  RIMAFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           R +  +++ LA     V     A++   P+++ ILADDLG+ D+  +G   + TPNID L
Sbjct: 3   RTVVASMIGLALCAGCVLSTAQAATAKRPNLVIILADDLGYGDLATYGHRIVKTPNIDKL 62

Query: 92  AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           A  G+   +YY    LC+PSR+ ++TG+ P  TG++  +  G +   L  +E  +   LK
Sbjct: 63  AQEGVKFTDYYAPAPLCSPSRAGLLTGRMPFRTGIRSWIPEGKDV-ALGRNELTIANLLK 121

Query: 151 ELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDM 206
           + GY T ++GK HL   G    +      GF+  L    G   D    +A+E   +G+  
Sbjct: 122 QQGYDTAMMGKLHLNAGGDRTDQPQAKDMGFDYTLVNPAGFVTDATLDNAKERPRYGV-- 179

Query: 207 RRDLEP-AWDLHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
              + P  W  +G       K S +  ++E V+ + N   D P FLY+A    HS     
Sbjct: 180 ---VHPTGWIRNGQHIGRADKMSGEFVSSEVVNWLDNKKDDNPFFLYVAFTEVHS----- 231

Query: 259 PLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEAL 299
           PL +P  YL+++ +++ D+++                   ++ A +  +DE VGKV++ +
Sbjct: 232 PLASPKKYLDMYSQYMTDYQKQHPDLFYGDWADKPWRGTGEYYANISYMDEQVGKVLDKI 291

Query: 300 EQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSP 354
           +      N+I++F SDNG     A   + LN A     LRG K+ LWEGG+R   +I   
Sbjct: 292 KAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKYG 351

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
               +G+V +  V+  DWLPTL   AN  D  +P       ++++P  ++  L+
Sbjct: 352 KHIPQGMVTDTPVYGLDWLPTL---ANMMDFKLPTDRTYDGQSLVPLLKDKTLK 402


>gi|168701882|ref|ZP_02734159.1| Arylsulfatase [Gemmata obscuriglobus UQM 2246]
          Length = 467

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 118/370 (31%), Positives = 183/370 (49%), Gaps = 44/370 (11%)

Query: 44  FTLSMVF--VDLVASSG------PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSG 95
           F  + VF  V L+A SG       P+I+ I+ADDLG  ++G +G  +I TP+ID LA  G
Sbjct: 2   FRTAAVFLAVALLAPSGRAADAPKPNIVLIVADDLGCFELGCYGQTKIKTPHIDKLAQGG 61

Query: 96  IILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL--SEKILPQYLKEL 152
                +Y+   +C PSR  +MTGKH  H  +++NV    E G  P+   +  +   LK  
Sbjct: 62  AKFTRFYSGSPVCAPSRCVLMTGKHSGHATVRNNVEAKPE-GQFPIRAEDVTVADALKAH 120

Query: 153 GYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEP 212
           GY T  +GKW LG +    +P   GF+   GY   +     HS     ++  D R +L+ 
Sbjct: 121 GYATGAMGKWGLGMFDTAGSPLKHGFDLFFGY---NCQRHAHSHYPTYIYRNDKRVELKG 177

Query: 213 AWDLHGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
                GK ++ D+F  EA+  I  +   +P FLYL     H A     +Q P+  LN ++
Sbjct: 178 NDGKTGKQFTQDLFEEEALGFIEANKA-KPFFLYLPFTVPHVA-----VQVPEDSLNEYK 231

Query: 272 H------IEDFKR---------SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
                    D K+         + +AA++ ++D SVG+VVE L    +  N++++F SDN
Sbjct: 232 GQLGDDPAYDGKKGYQPHPAPHAGYAAMVTRMDRSVGRVVEKLNALGLEKNTLVLFTSDN 291

Query: 317 G-----GAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSD 371
           G     G A     N+A    LRG+K +++EGG+R   + + P     G  ++  ++  D
Sbjct: 292 GPTHNVGGADSSFFNSAGK--LRGLKGSVYEGGIRVPFIAYQPGTIKAGTESDAPLYFPD 349

Query: 372 WLPTLLSAAN 381
            LPTL + A 
Sbjct: 350 VLPTLCAFAG 359


>gi|421450242|ref|ZP_15899618.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 58-6482]
 gi|396067684|gb|EJI76042.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 58-6482]
          Length = 497

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 122/414 (29%), Positives = 202/414 (48%), Gaps = 54/414 (13%)

Query: 33  RIMAFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           R +  +++ LA     V     A++   P+++ ILADDLG+ D+  +G   + TPNID L
Sbjct: 3   RTVVASMIGLALCAGCVLSTAQAATAKRPNLVIILADDLGYGDLATYGHRIVKTPNIDKL 62

Query: 92  AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           A  G+   +YY    LC+PSR+ ++TG+ P  TG++  +  G +   L  +E  +   LK
Sbjct: 63  AQEGVKFTDYYAPAPLCSPSRAGLLTGRMPFRTGIRSWIPEGKDV-ALGRNELTIANLLK 121

Query: 151 ELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDM 206
           + GY T ++GK HL   G    +      GF+  L    G   D    +A+E   +G+  
Sbjct: 122 QQGYDTAMMGKLHLNAGGDRTDQPQAKDMGFDYTLVNPAGFVTDATLDNAKERPRYGV-- 179

Query: 207 RRDLEP-AWDLHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
              + P  W  +G       K S +  ++E V+ + N   D P FLY+A    HS     
Sbjct: 180 ---VHPMGWIRNGQHIGRADKMSGEFVSSEVVNWLDNKKDDNPFFLYVAFTEVHS----- 231

Query: 259 PLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEAL 299
           PL +P  YL+++ +++ D+++                   ++ A +  +DE VGKV++ +
Sbjct: 232 PLASPKKYLDMYSQYMTDYQKQHPDLFYGDWADKPWRGTGEYYANISYMDEQVGKVLDKI 291

Query: 300 EQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSP 354
           +      N+I++F SDNG     A   + LN A     LRG K+ LWEGG+R   +I   
Sbjct: 292 KAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKYG 351

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
               +G+V +  V+  DWLPTL   AN  D  +P       ++++P  ++  L+
Sbjct: 352 KHIPQGMVTDTPVYGLDWLPTL---ANMMDFKLPTDRTYDGQSLVPLLKDKTLK 402


>gi|198243820|ref|YP_002213990.1| sulfatase [Salmonella enterica subsp. enterica serovar Dublin str.
           CT_02021853]
 gi|445139185|ref|ZP_21384062.1| sulfatase [Salmonella enterica subsp. enterica serovar Dublin str.
           SL1438]
 gi|445149882|ref|ZP_21389433.1| sulfatase [Salmonella enterica subsp. enterica serovar Dublin str.
           HWS51]
 gi|197938336|gb|ACH75669.1| sulfatase [Salmonella enterica subsp. enterica serovar Dublin str.
           CT_02021853]
 gi|444853819|gb|ELX78885.1| sulfatase [Salmonella enterica subsp. enterica serovar Dublin str.
           SL1438]
 gi|444857455|gb|ELX82464.1| sulfatase [Salmonella enterica subsp. enterica serovar Dublin str.
           HWS51]
          Length = 497

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 122/414 (29%), Positives = 202/414 (48%), Gaps = 54/414 (13%)

Query: 33  RIMAFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           R +  +++ LA     V     A++   P+++ ILADDLG+ D+  +G   + TPNID L
Sbjct: 3   RTVVASMIGLALCAGCVLSTAQAATAKRPNLVIILADDLGYGDLATYGHRIVKTPNIDKL 62

Query: 92  AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           A  G+   +YY    LC+PSR+ ++TG+ P  TG++  +  G +   L  +E  +   LK
Sbjct: 63  AQEGVKFTDYYAPAPLCSPSRAGLLTGRMPFRTGIRSWIPEGKDV-ALGRNELTIANLLK 121

Query: 151 ELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDM 206
           + GY T ++GK HL   G    +      GF+  L    G   D    +A+E   +G+  
Sbjct: 122 QQGYDTAMMGKLHLNAGGDRTDQPQAKDMGFDYTLVNPAGFVTDATLDNAKERPRYGV-- 179

Query: 207 RRDLEP-AWDLHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
              + P  W  +G       K S +  ++E V+ + N   D P FLY+A    HS     
Sbjct: 180 ---VHPTGWIRNGQHIGRADKMSGEFVSSEVVNWLDNKKDDNPFFLYVAFTEVHS----- 231

Query: 259 PLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEAL 299
           PL +P  YL+++ +++ D+++                   ++ A +  +DE VGKV++ +
Sbjct: 232 PLASPKKYLDMYSQYMTDYQKQHPDLFYGDWADKPWRGTGEYYANISYMDEQVGKVLDKI 291

Query: 300 EQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSP 354
           +      N+I++F SDNG     A   + LN A     LRG K+ LWEGG+R   +I   
Sbjct: 292 KAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKYG 351

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
               +G+V +  V+  DWLPTL   AN  D  +P       ++++P  ++  L+
Sbjct: 352 KHIPQGMVTDTPVYGLDWLPTL---ANMMDFKLPTDRTYDGQSLVPLLKDKTLK 402


>gi|297284666|ref|XP_002802639.1| PREDICTED: hypothetical protein LOC697850 [Macaca mulatta]
          Length = 1113

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 97/300 (32%), Positives = 153/300 (51%), Gaps = 42/300 (14%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS   +    +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G +  N+
Sbjct: 13  LLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGTLFPNF 72

Query: 102 YTV-QLCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
           Y+   LC+PSR+A++TG+ PI  G         N     E  GG+P SE++LP+ LK+ G
Sbjct: 73  YSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG 132

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + RD    
Sbjct: 133 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD---- 183

Query: 214 WDLHGKY--------------STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
           W++ G+Y               T ++  EA+D I   +   P FLY A  ATH+     P
Sbjct: 184 WEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----P 238

Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           + A   +L         +R ++   + ++D+SVGK++E L    +  N+ + F SDNG A
Sbjct: 239 VYASKPFLGTS------QRGRYGDAVREIDDSVGKMLELLHDLHVADNTFVFFTSDNGAA 292


>gi|398810130|ref|ZP_10568960.1| arylsulfatase A family protein [Variovorax sp. CF313]
 gi|398083821|gb|EJL74525.1| arylsulfatase A family protein [Variovorax sp. CF313]
          Length = 438

 Score =  149 bits (377), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 108/345 (31%), Positives = 167/345 (48%), Gaps = 32/345 (9%)

Query: 60  PHIIFILADDLGWNDVG-FHGLDQI---PTPNIDALAYSGIIL-KNYYTVQLCTPSRSAI 114
           P+IIFI+ADDLG+ D+G + G D +    +P +D LA +G+ L + Y    +C+P+R  +
Sbjct: 4   PNIIFIVADDLGYADLGCYGGRDAVFGPVSPVLDGLAANGLKLTQGYANSPVCSPTRFGM 63

Query: 115 MTGKHPIHT-GMQHNVLYGCERG----GLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
           +TG++     G     +    RG    GLP     LP  LK  GY T ++GKWHLG Y  
Sbjct: 64  ITGRYQYRLRGAAEEPINSRSRGSATLGLPTDHPTLPSLLKASGYSTALIGKWHLG-YPP 122

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDH--SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
            + P   G++   G  +G  DYF H  S     +W  +  +  E        Y TD+ + 
Sbjct: 123 SFGPLRSGYDEFFGPMSGGVDYFTHCDSTGRHDLWSGEEDKVEE-------GYLTDILSK 175

Query: 228 EAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK---FAA 283
            AV+ +   +  E P FL L + A H   P+E          +  ++ D        +  
Sbjct: 176 RAVEYVERMAKQEAPFFLSLHYTAPHW--PWETRDDAAKAPAVKDNLFDLAGGNIHVYRR 233

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEG 343
           ++H +DE +G ++ AL++  M  N+++VF SDNGG         + NWPL G K  L EG
Sbjct: 234 MIHHMDEGIGWIMAALQKHGMADNTLVVFTSDNGGE------RFSDNWPLVGGKMDLTEG 287

Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
           G+R   +   P +  +G  + Q     DW  T+L AA  +   +Y
Sbjct: 288 GIRVPWIAHWPAVIGKGGESTQLCMTMDWSATMLDAAGVAAHADY 332


>gi|410029536|ref|ZP_11279368.1| sulfatase [Marinilabilia sp. AK2]
          Length = 486

 Score =  149 bits (377), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 116/353 (32%), Positives = 174/353 (49%), Gaps = 48/353 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+IIFILADDLG+ D+GF G   I TPN+D LA  G+   N+Y    +C PSRSA+MTG 
Sbjct: 34  PNIIFILADDLGYGDLGFLGQKIIETPNLDRLAAEGMFFSNHYAGATVCAPSRSALMTGL 93

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKIL--PQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           H  HT ++ N     E G  PL + IL  P+  KE GY T   GKW LGF   E  P  +
Sbjct: 94  HTGHTPVRGNFEIQPE-GQYPLPDTILTIPRLFKEAGYATGAFGKWGLGFVGTEGDPNHQ 152

Query: 177 GFESHLGY---WTGHQ---DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
           GF+   GY      H+   +Y   +AE++ + G D        W     Y+ D+   E++
Sbjct: 153 GFDQFFGYNCQRIAHRYYPEYLWENAEKVFLEGND--------WIQKITYAPDIIHQESL 204

Query: 231 DIIHNHSTDEPLFLYLAHAATHS--ANP--------------YEPLQAP------DHYLN 268
             I +++ + P  +++     H+  A P               +P  AP      D  +N
Sbjct: 205 KFIGDNANN-PFLMFIPSVMPHAELAVPDGELLAYYREKIGEEKPHIAPVGWDYGDEPIN 263

Query: 269 IHRHIED-FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFN--- 324
           I  +  + + ++ +AA++  LD  VG++++ L++  +   +II+F SDNG    G N   
Sbjct: 264 IPGYQSNPYPKASYAAMVALLDRQVGEIMDKLDELGLTEQTIIIFTSDNGPHQEGGNDPD 323

Query: 325 -LNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
             N  SN   RG K  L+EGG+R   ++  P +   G  +       D+LPT 
Sbjct: 324 FFN--SNGIFRGYKRDLYEGGIRVPMIVKWPGVIQGGAQSGHVSAFWDFLPTF 374


>gi|429093555|ref|ZP_19156139.1| Arylsulfatase [Cronobacter dublinensis 1210]
 gi|426741528|emb|CCJ82252.1| Arylsulfatase [Cronobacter dublinensis 1210]
          Length = 502

 Score =  149 bits (377), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 115/377 (30%), Positives = 184/377 (48%), Gaps = 53/377 (14%)

Query: 47  SMVFVDLVASSGP---------------PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           S+V + L AS+G                P+++ I+ADDLG+ D G +G   + TPNID+L
Sbjct: 8   SLVSLMLAASAGAQSPAPTPGSDTTAKRPNVVIIMADDLGYGDTGIYGHPIVKTPNIDSL 67

Query: 92  AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           A  G+    YY    LC+PSR+ ++TG+ P  TG++  +  G +   L  +EK +  YLK
Sbjct: 68  AQQGMRFTEYYAPAPLCSPSRAGLLTGRTPFRTGIRSWIPSGGKNVALGRNEKTIASYLK 127

Query: 151 ELGYRTRIVGKWHL-GFYKKEYTPTFR--GFESHL----GYWTGHQDYFDHSAEEMKMWG 203
           E GY T ++GK HL     +   P  +  GF+  L    G+ T   D          ++ 
Sbjct: 128 EQGYDTAMMGKLHLNAGADRTDQPQAKDMGFDYSLVNAAGFVTSDLDKVKTRPRYGVVYP 187

Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
               R+ +P   +  + S ++ ++EA+  + +   ++P FLY+A    H+     PL +P
Sbjct: 188 NGFYRNGQPIGTVK-QMSGELVSSEAIHWLDSRKDNKPFFLYVAFTEVHT-----PLASP 241

Query: 264 DHYLNIHR-HIEDFKR------------------SKFAAILHKLDESVGKVVEALEQRRM 304
             YL++++ ++ +++R                   ++ A +  LDE VGKVV  ++    
Sbjct: 242 QKYLDMYKSYMSEYERQHQDLFYADWADKPWRGTGEYYANISYLDEQVGKVVAKIKAMGE 301

Query: 305 LSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSPLLESR 359
             N+II+F SDNG     A   + LN A     LRG K+ LWEGG+R   +I        
Sbjct: 302 EDNTIILFTSDNGPVTREARKWYELNMAGETSGLRGRKDNLWEGGIRVPAIIKYGSHIKA 361

Query: 360 GIVAEQYVHVSDWLPTL 376
           G V++  V   D LPTL
Sbjct: 362 GTVSDSPVTGLDVLPTL 378


>gi|168822186|ref|ZP_02834186.1| sulfatase [Salmonella enterica subsp. enterica serovar Weltevreden
           str. HI_N05-537]
 gi|197249148|ref|YP_002145016.1| sulfatase [Salmonella enterica subsp. enterica serovar Agona str.
           SL483]
 gi|409248332|ref|YP_006889021.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar
           Weltevreden str. 2007-60-3289-1]
 gi|416526182|ref|ZP_11742236.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. LQC 10]
 gi|416533378|ref|ZP_11746346.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. SARB30]
 gi|416555343|ref|ZP_11758828.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 29N]
 gi|421887223|ref|ZP_16318385.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Senftenberg str. SS209]
 gi|437816210|ref|ZP_20842570.1| sulfatase [Salmonella enterica subsp. enterica serovar Enteritidis
           str. SARB17]
 gi|440765646|ref|ZP_20944661.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar Agona
           str. SH11G1113]
 gi|440769763|ref|ZP_20948718.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar Agona
           str. SH08SF124]
 gi|440772152|ref|ZP_20951060.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar Agona
           str. SH10GFN094]
 gi|197212851|gb|ACH50248.1| sulfatase [Salmonella enterica subsp. enterica serovar Agona str.
           SL483]
 gi|205341423|gb|EDZ28187.1| sulfatase [Salmonella enterica subsp. enterica serovar Weltevreden
           str. HI_N05-537]
 gi|320089066|emb|CBY98822.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar
           Weltevreden str. 2007-60-3289-1]
 gi|363558247|gb|EHL42440.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. LQC 10]
 gi|363560160|gb|EHL44307.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. 29N]
 gi|363568662|gb|EHL52640.1| sulfatase [Salmonella enterica subsp. enterica serovar Montevideo
           str. SARB30]
 gi|379983257|emb|CCF90658.1| putative arylsulfatase [Salmonella enterica subsp. enterica serovar
           Senftenberg str. SS209]
 gi|435308800|gb|ELO83718.1| sulfatase [Salmonella enterica subsp. enterica serovar Enteritidis
           str. SARB17]
 gi|436411975|gb|ELP09920.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar Agona
           str. SH11G1113]
 gi|436413160|gb|ELP11096.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar Agona
           str. SH08SF124]
 gi|436419847|gb|ELP17720.1| arylsulphatase A [Salmonella enterica subsp. enterica serovar Agona
           str. SH10GFN094]
          Length = 497

 Score =  149 bits (376), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 122/414 (29%), Positives = 202/414 (48%), Gaps = 54/414 (13%)

Query: 33  RIMAFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           R +  +++ LA     V     A++   P+++ ILADDLG+ D+  +G   + TPNID L
Sbjct: 3   RTVVASMIGLALCAGSVLSTAQAATAKRPNLVIILADDLGYGDLATYGHRIVKTPNIDKL 62

Query: 92  AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           A  G+   +YY    LC+PSR+ ++TG+ P  TG++  +  G +   L  +E  +   LK
Sbjct: 63  AQEGVKFTDYYAPAPLCSPSRAGLLTGRMPFRTGIRSWIPEGKDV-ALGRNELTIANLLK 121

Query: 151 ELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDM 206
           + GY T ++GK HL   G    +      GF+  L    G   D    +A+E   +G+  
Sbjct: 122 QQGYDTAMMGKLHLNAGGDRTDQPQAKDMGFDYTLVNPAGFVTDATLDNAKERPRYGV-- 179

Query: 207 RRDLEP-AWDLHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
              + P  W  +G       K S +  ++E V+ + N   D P FLY+A    HS     
Sbjct: 180 ---VHPTGWIRNGQHIGRADKMSGEFVSSEVVNWLDNKKDDNPFFLYVAFTEVHS----- 231

Query: 259 PLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEAL 299
           PL +P  YL+++ +++ D+++                   ++ A +  +DE VGKV++ +
Sbjct: 232 PLASPKKYLDMYSQYMTDYQKQHPDLFYGDWADKPWRGTGEYYANISYMDEQVGKVLDKI 291

Query: 300 EQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSP 354
           +      N+I++F SDNG     A   + LN A     LRG K+ LWEGG+R   +I   
Sbjct: 292 KAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKYG 351

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
               +G+V +  V+  DWLPTL   AN  D  +P       ++++P  ++  L+
Sbjct: 352 KHIPQGMVTDTPVYGLDWLPTL---ANMMDFKLPTDRTYDGQSLVPLLKDKTLK 402


>gi|115906036|ref|XP_797340.2| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 162

 Score =  149 bits (376), Expect = 6e-33,   Method: Composition-based stats.
 Identities = 69/117 (58%), Positives = 80/117 (68%), Gaps = 6/117 (5%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPHIIFI+ADDLGW+DV  HG  QI TPNID LA  G+ L NYY   +CTP+RSAIMTG+
Sbjct: 32  PPHIIFIVADDLGWDDVSLHGSSQILTPNIDTLAQEGVTLTNYYVSPICTPTRSAIMTGR 91

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           HPIHTGMQH+ +   E  GL L EK + Q+LK LGY T  VGK         +TP F
Sbjct: 92  HPIHTGMQHDTIGAAEPWGLGLDEKTMAQHLKSLGYSTHAVGK------SSNWTPQF 142


>gi|373956153|ref|ZP_09616113.1| sulfatase [Mucilaginibacter paludis DSM 18603]
 gi|373892753|gb|EHQ28650.1| sulfatase [Mucilaginibacter paludis DSM 18603]
          Length = 468

 Score =  149 bits (376), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 116/374 (31%), Positives = 183/374 (48%), Gaps = 40/374 (10%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           M    L L F L+ V      +   P+IIFILADDLG+ DVG +G  +I TPNID LA  
Sbjct: 10  MKIRSLFLLFCLTGVLSVKAQTRQQPNIIFILADDLGYGDVGVYGQTKILTPNIDLLARE 69

Query: 95  GIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNV-----LYG------CERGGLPLSE 142
           G  L ++Y    +C+PSR  ++TG +  H  ++ N+     L G        R GL   +
Sbjct: 70  GTKLTDFYAGAPVCSPSRGVLLTGLNTGHATIRGNMAIKGGLAGNKGSDKIRRAGLLPQD 129

Query: 143 KILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMW 202
             +   L+  GY T ++GKWH+  +    TP   GF+   G+   +   +  S      W
Sbjct: 130 STIGDLLQRSGYTTGLIGKWHVDGFDTLATPQRHGFDYFYGWLVSYPQTY-ASTYWPDTW 188

Query: 203 GLDMRRDLEPAWDLHGK---YSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYE 258
             D +    P  + +G+   Y++++ T +A+  +  H  + +P FL + H     +NP+ 
Sbjct: 189 YRDGKLVAVPQ-NQNGQKNYYTSEIITDDAIRYLAKHKKSKKPFFLMVNH-----SNPHS 242

Query: 259 PLQAPDHYLNIHRHIEDFK-----RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
           PL AP      H  I + K     +  +AA++  LD SVGK+ + L +  +  N+I++F 
Sbjct: 243 PLDAP------HNTIYEGKDWTSDQKTYAAMVTYLDSSVGKIRKYLIESGLDKNTIVIFT 296

Query: 314 SDNGG-AAAGFNLNAASNW-----PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYV 367
           SDNG  +    +L A S++     PLRG K  + EGG+R   ++W+        V+    
Sbjct: 297 SDNGPRSEPTTSLTAVSDFFHSSGPLRGYKRDVTEGGIRVPMIVWNKNRIKSDAVSHTPG 356

Query: 368 HVSDWLPTLLSAAN 381
           + +D LPTL   A 
Sbjct: 357 YFADILPTLAGIAG 370


>gi|343086294|ref|YP_004775589.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354828|gb|AEL27358.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 511

 Score =  149 bits (376), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 120/382 (31%), Positives = 183/382 (47%), Gaps = 64/382 (16%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
           PP+II+ILADDLG+ D+G +G ++I TPNIDALA SG++   +Y+   +C P+R+ ++TG
Sbjct: 28  PPNIIYILADDLGYGDLGCYGQEKIETPNIDALAASGMMFTQHYSGAPVCAPARAVLLTG 87

Query: 118 KHPIHTGMQHNVLYGCE---------------RGGLPLSEKI--LPQYLKELGYRTRIVG 160
           +H  H+ ++ N  +G                  G  P++     L + L+E GY T +VG
Sbjct: 88  QHSGHSQVRGNDEWGSRGDVWNYHAMLADSTLEGQRPIATGTYTLGRMLQEAGYTTAVVG 147

Query: 161 KWHLGFYKKEYTPTFRGFESHLGY---WTGHQDYFDHSAE-EMKMW-------------- 202
           KW LG    E  PT +GF+   GY      H  Y  H  E E +++              
Sbjct: 148 KWGLGAPHTEGIPTNQGFDYFFGYNCQRQAHTLYPVHLYENEHRVYLNNDTIAPNKKLAE 207

Query: 203 GLDMRRDLEPAWD-LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQ 261
           G D   +LE   D     Y+ ++   +  + +   S++EP FLY A     +  P+ PLQ
Sbjct: 208 GAD-PYNLENYGDYFLTDYAPELMFDKITNFVDEQSSEEPFFLYWA-----TPIPHVPLQ 261

Query: 262 AP----DHYLNIHRHIEDF-----------KRSKFAAILHKLDESVGKVVEALEQRRMLS 306
           AP    DHY+      E +            R+ +AA++  +DE VG++VE L+ + +  
Sbjct: 262 APTKWIDHYVEKFGEEEPYLGENSYYPSRYPRATYAAMVSYMDEQVGELVEQLKSKGLYE 321

Query: 307 NSIIVFVSDNGGAAAG------FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRG 360
           N++I+F SDNG    G      FN   A        K  L+EGG+R   +   P     G
Sbjct: 322 NTLIIFTSDNGPTFNGGTDSPWFNSGGAFREEKGFGKGFLYEGGIRVPMIASWPGKIKEG 381

Query: 361 IVAEQYVHVSDWLPTLLSAANK 382
            V++      D +PTL   A K
Sbjct: 382 SVSDHASVFYDVMPTLAEVAGK 403


>gi|410638700|ref|ZP_11349253.1| sulfatase [Glaciecola lipolytica E3]
 gi|410141228|dbj|GAC16458.1| sulfatase [Glaciecola lipolytica E3]
          Length = 522

 Score =  149 bits (376), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 115/361 (31%), Positives = 176/361 (48%), Gaps = 51/361 (14%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P++I+IL DDLG  D+  +G + I TPN+  +A  G++   +Y    +C PSR+A+MTG+
Sbjct: 44  PNVIYILVDDLGIGDIEPYGQELINTPNLQRMADQGMVFSQHYAGNPVCAPSRAALMTGQ 103

Query: 119 HPIHTGMQHNVLYGC-----ERGGLPL--SEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
           H  H  ++ N   G      E G +PL    + + + +K+ GY T ++GKW LG      
Sbjct: 104 HSGHNQIRGNFELGGFADEEEFGQMPLQPGTETVAKLMKKAGYTTALIGKWGLGGPGSYG 163

Query: 172 TPTFRGFESHLGYW---TGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL------------ 216
            PT +GF+   GY      H  Y  H  +    + L+    L+P   L            
Sbjct: 164 VPTKQGFDYFFGYLDQKQAHNHYPTHLWQNETWFPLNNEY-LDPHKGLGKHDDPNDEASY 222

Query: 217 ----HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
                G ++    T EA+  I N   DEP FLYL++AA H+A     LQAPD  +  + H
Sbjct: 223 EKFKRGDFAQQRLTDEALSYIDN-KKDEPFFLYLSYAAPHAA-----LQAPDEEIAKYNH 276

Query: 273 IED--------------FK-RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG 317
            ED              F+ R+  AA++  +DE VG V++ L++  +  N++++F SDNG
Sbjct: 277 FEDTPYDGVADGGYVPSFRPRATRAAMISHIDEGVGLVLDKLKKLGLDENTLVIFSSDNG 336

Query: 318 GA-AAGFNLNAA-SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
            +   G +L    SN   RG K  L++GG+R   +   P     G V++      D LPT
Sbjct: 337 PSFEGGADLEFFDSNAQYRGFKRDLYDGGIRMPTIARWPTKVPAGKVSDHVSAFWDVLPT 396

Query: 376 L 376
           L
Sbjct: 397 L 397


>gi|392391665|ref|YP_006428268.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
           15997]
 gi|390522743|gb|AFL98474.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
           15997]
          Length = 510

 Score =  149 bits (376), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 118/412 (28%), Positives = 190/412 (46%), Gaps = 64/412 (15%)

Query: 38  AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
           AVL  +FT+  V   L   S  P+IIFILADDLG+ D+  +G   + TPNID LA  G++
Sbjct: 8   AVLSASFTIPAV---LWGQSDKPNIIFILADDLGYGDLACYGQKHVETPNIDKLAKQGMM 64

Query: 98  LKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHN---------VLYGCER-----GGLPL-- 140
               Y    +C PSR+ IMTG+H  HT ++ N         ++YG        G  P   
Sbjct: 65  FTQAYAGAPVCAPSRATIMTGQHTGHTHVRGNKEYWQKSPKIMYGQNEEYSLVGQEPYDA 124

Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY---WTGHQDYFD---- 193
           ++ ILP+ +K+ GY T + GKW  G+      P  RG +   GY   +  H  Y +    
Sbjct: 125 NQVILPEVMKKNGYTTGMFGKWAGGYEGSHSVPEKRGVDEFYGYICQFQAHSYYPNFLNR 184

Query: 194 HSAEEMKMWGLDMRRDLEPAWDLHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYL 246
           +S +E     + +  D    + + G       +YS D+   +A++ +   +  +P + + 
Sbjct: 185 YSQKEGDTQTIRVVMDENVKYPMTGEDYYKRPQYSADMIHHKALEWLDEQNGKKPFYGFF 244

Query: 247 AHAATHSANPYEPLQAPDHYLNIHR-----------------HIEDFKRSKFAAILHKLD 289
            +   H+    E +Q  D  L  ++                 H      ++FAA++ +LD
Sbjct: 245 TYTLPHA----ELMQPNDSLLKKYKKQFFEDKTWGGSDGSRYHATTHTHAQFAAMITRLD 300

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDN-----GGAAAGFNLNAASNWPLRGVKNTLWEGG 344
             VG+++E L+++ +  N+I+VF SDN     GGA   F      +  LRG+K   +EGG
Sbjct: 301 SYVGEIMEKLKEKGLDKNTIVVFSSDNGPHEEGGADPEF---FGRDGKLRGIKRQCYEGG 357

Query: 345 VRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP-NYVNSTVEN 395
           +R   ++  P     G V++  +   D +PT    A     P  Y N   +N
Sbjct: 358 IRVPFIVKWPGKIKPGTVSDHQIAFYDIMPTFADIAGDKKFPKRYFNKKSQN 409


>gi|149198313|ref|ZP_01875359.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
           HTCC2155]
 gi|149138609|gb|EDM27016.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
           HTCC2155]
          Length = 476

 Score =  149 bits (376), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 123/382 (32%), Positives = 187/382 (48%), Gaps = 57/382 (14%)

Query: 34  IMAFAVLPLAFTLSMVFVDLVASSG--PPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           I+A A  PL+         L A  G   P+II I  DD G+ D+   G   + TP ID +
Sbjct: 12  IVALAQWPLS--------SLAAEQGEAQPNIIIIFTDDQGYADLSCFGGTHVSTPRIDQM 63

Query: 92  AYSGIILKNYYTVQ-LCTPSRSAIMTGKHP----IHTGMQHNVLYGCERGGLPLSEKILP 146
           A  G  L ++Y    +CTPSR+A+MTG +P    +  G    VL   ++ GL   E  + 
Sbjct: 64  AAEGAKLTSFYVAAPVCTPSRAALMTGTYPKRIDMARGSNFVVLLAGDKKGLNPKEITIA 123

Query: 147 QYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGH--------QDYFDHSAEE 198
           + LK +GY+T + GKWHLG  + E+ PT +GF+   G    H        Q +F   +  
Sbjct: 124 EVLKAVGYKTGMFGKWHLG-DQPEFLPTRQGFDEFFGLPYSHDIHPYHPQQSHFKFPSLP 182

Query: 199 MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
           +    LD    +E   D    Y T  FT  AV  I  +  D+P FLY+ H   H+     
Sbjct: 183 L----LDGEEVIE--MDPDADYLTKRFTERAVQFIEKNK-DQPFFLYMPHPIPHT----- 230

Query: 259 PLQAPDHYL-NIHRHIE----------DFK-RSKFAA-ILHKLDESVGKVVEALEQRRML 305
           PL A   Y+ ++   I+          D+K R++  +  + ++D SVG++++AL++  + 
Sbjct: 231 PLHASPAYMKDLPEEIKAKLSKENNYIDYKTRTQLLSRAIGEVDWSVGQILDALKKNGID 290

Query: 306 SNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQ 365
            N+I++F SDNG A     +  A+  PLRG KN   EGG+R   +I  P     GI  ++
Sbjct: 291 DNTIVIFSSDNGPA-----IGKAT--PLRGRKNQTLEGGMRVPTVIHWPGKIPAGIANDE 343

Query: 366 YVHVSDWLPTLLSAANKSDIPN 387
            +   D LPT    A  ++IP+
Sbjct: 344 IMTTMDLLPTFAKLAG-AEIPS 364


>gi|421614606|ref|ZP_16055659.1| arylsulfatase [Rhodopirellula baltica SH28]
 gi|408494615|gb|EKJ99220.1| arylsulfatase [Rhodopirellula baltica SH28]
          Length = 524

 Score =  149 bits (376), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 111/366 (30%), Positives = 185/366 (50%), Gaps = 50/366 (13%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSA 113
           A++  P+I+ I+ADDLG+ ++G +G  +I TP +D LA  GI L N+Y+   +C PSR  
Sbjct: 55  ATASRPNIVLIVADDLGYGELGCYGQTKIRTPRLDQLAAEGIKLTNFYSGNAVCAPSRCC 114

Query: 114 IMTGKHPIHTGMQHNV----------LYGCERGG---LPLSEKILPQYLKELGYRTRIVG 160
           +MTGKHP H  +++N               E  G   LP+ E  + +YLK +GYRT   G
Sbjct: 115 LMTGKHPGHAHVRNNGDPKIDPAVREALKLEFPGQYPLPVDEVTIAEYLKSVGYRTGAFG 174

Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD--LHG 218
           KW LG +     P  +GF+   G+   +     H+     +W  +  ++++P  D  LHG
Sbjct: 175 KWGLGHFGTTGDPNEQGFDLFYGF---NCQRHAHNHYPNFLW-RNRVKEVQPGNDRTLHG 230

Query: 219 K-YSTDVFTAEAVDIIHNHSTD---EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI- 273
           + YS D F  EA + I     +   +P F YL  A  H +     +Q P+  ++ +  + 
Sbjct: 231 ETYSQDQFVNEACEFIRQSVAEDKTQPFFAYLPFAVPHLS-----IQVPEEEVDAYEGVI 285

Query: 274 --EDFK----------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG---- 317
              D++          R+ +AA++ ++DE VG++V+ ++   +  N++I+F SDNG    
Sbjct: 286 EEADYEHHGYLKHPRPRAGYAAMVTRMDEGVGQIVDLVDSLGLGENTLIMFTSDNGPTYD 345

Query: 318 --GAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
             G +     N+AS   ++G+K  L EGG+R   +     +   G  ++      D+LPT
Sbjct: 346 RLGGSDSDYFNSASG--MKGLKGQLDEGGIRVPMIARQTGVVPAGRTSDWIGAWWDFLPT 403

Query: 376 LLSAAN 381
           +  AA 
Sbjct: 404 ITDAAG 409


>gi|387791821|ref|YP_006256886.1| arylsulfatase A family protein [Solitalea canadensis DSM 3403]
 gi|379654654|gb|AFD07710.1| arylsulfatase A family protein [Solitalea canadensis DSM 3403]
          Length = 469

 Score =  149 bits (376), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 104/336 (30%), Positives = 166/336 (49%), Gaps = 52/336 (15%)

Query: 43  AFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY 102
            F+   +F   + S   P+IIFILADD+G+ D+G +G   I TPNID LA  G+   ++Y
Sbjct: 21  CFSPFCLFSQDLRSKEKPNIIFILADDMGYGDLGCYGQQLIKTPNIDQLANQGMRFTDFY 80

Query: 103 T-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELGYRTRIV 159
               +C PSR+++MTG+H  H  ++ N         LPL ++   L QYLK+ GY   +V
Sbjct: 81  AGSTVCAPSRASLMTGQHTGHAFIKGN-------DELPLRQQDSTLSQYLKQAGYVNGMV 133

Query: 160 GKWHLGFYKKEYTPTFRG---FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL 216
           GKW LG       P  +G   F  HL +  GH    DH      +W ++     +   D 
Sbjct: 134 GKWGLGLQGTSGAPEKKGWDFFVGHLHHVEGHFQQTDH------LWKIENGESKKEMID- 186

Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL--------- 267
              Y  ++FT  A+  I  +    P FLY+++   H+      L+ P+ YL         
Sbjct: 187 STVYVNELFTQSALQFIQKNK-KHPFFLYVSYTLPHAE-----LKVPEKYLQLYLNKDGS 240

Query: 268 -----------NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
                       +H   + + ++ +AA++  +D+ VG++++ L+   +  N+I+VF SDN
Sbjct: 241 SKFAPEIAQPAGLHYGQQPYPKAAYAAMITSMDDYVGQIMQQLKAYGIDKNTIVVFTSDN 300

Query: 317 GG-AAAGFNLNAASNW-----PLRGVKNTLWEGGVR 346
           G     G  +  A+ +     PLRG+K  L+EGG+R
Sbjct: 301 GTHIEGGRKMTDATTFFKSSGPLRGIKRDLYEGGIR 336


>gi|390356764|ref|XP_792809.2| PREDICTED: N-acetylgalactosamine-6-sulfatase-like
           [Strongylocentrotus purpuratus]
          Length = 417

 Score =  149 bits (376), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 109/348 (31%), Positives = 173/348 (49%), Gaps = 38/348 (10%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRS 112
           V  + PP+II +L DD+GW D+G +G     TPN+D +A  GI+L ++Y+   LC+PSR+
Sbjct: 23  VVVAAPPNIIVMLMDDMGWGDLGIYGNPAKETPNLDQMAAEGILLPDFYSGNPLCSPSRA 82

Query: 113 AIMTGKHPIHTGMQHNVLYG-------CERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
           A++TG+ PI  G      +           GG+  +E +LP+ L++ GYR ++VGKWHLG
Sbjct: 83  ALLTGRLPIRNGFYTTNEHARNAYTPQIIVGGIQDNEILLPELLQKAGYRNKLVGKWHLG 142

Query: 166 FYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD- 223
            ++ ++ P   GF+   G    H   Y D +   + ++     +D E A   + +++ D 
Sbjct: 143 -HQPQFHPLKHGFDEWFGAPNCHFGPYNDKNTPNIPVY-----QDAEMAGRYYEEFAIDK 196

Query: 224 ---------VFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                    ++  E V  I    +  +P FLY    ATH     EPL A   +L      
Sbjct: 197 KTGESNLTQLYIQEGVSFIEKSVAAMQPFFLYWTPDATH-----EPLYASKPFLGRS--- 248

Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPL 333
              +R  +   + +LD  VG+V+  + +  + +N+ ++F SDN GAA   + +  SN P 
Sbjct: 249 ---QRGLYGDAVMELDYGVGQVLGKIRELGVANNTFVLFSSDN-GAALYASESGGSNGPF 304

Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
              K T +EGG+R   + W P     G V  Q   V D   T LS A+
Sbjct: 305 LCGKETTYEGGMREPTIAWWPNHIKPGRVTHQIGSVMDIFTTALSVAD 352


>gi|423138563|ref|ZP_17126201.1| arylsulfatase [Salmonella enterica subsp. houtenae str. ATCC
           BAA-1581]
 gi|379051117|gb|EHY69008.1| arylsulfatase [Salmonella enterica subsp. houtenae str. ATCC
           BAA-1581]
          Length = 497

 Score =  149 bits (376), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 122/414 (29%), Positives = 202/414 (48%), Gaps = 54/414 (13%)

Query: 33  RIMAFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           R +  +++ LA     V     A++   P+++ ILADDLG+ D+  +G   + TPNID L
Sbjct: 3   RTVVASMIGLALCAGSVLSTAQAATAKRPNLVIILADDLGYGDLATYGHRIVKTPNIDKL 62

Query: 92  AYSGIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           A  G+   +YY    LC+PSR+ ++TG+ P  TG++  +  G +   L  +E  +   LK
Sbjct: 63  AQEGVKFTDYYAPAPLCSPSRAGLLTGRMPFRTGIRSWIPEGKDV-ALGRNELTIANLLK 121

Query: 151 ELGYRTRIVGKWHL---GFYKKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDM 206
           + GY T ++GK HL   G    +      GF+  L    G   D    +A+E   +G+  
Sbjct: 122 QQGYDTAMMGKLHLNAGGDRTDQPQAKDMGFDYTLVNPAGFVTDATLDNAKERPRYGV-- 179

Query: 207 RRDLEP-AWDLHG-------KYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYE 258
              + P  W  +G       K S +  ++E V+ + N   D P FLY+A    HS     
Sbjct: 180 ---VHPTGWIRNGQHIGRADKMSGEFVSSEVVNWLDNKKDDNPFFLYVAFTEVHS----- 231

Query: 259 PLQAPDHYLNIH-RHIEDFKR------------------SKFAAILHKLDESVGKVVEAL 299
           PL +P  YL+++ +++ D+++                   ++ A +  +DE VGKV++ +
Sbjct: 232 PLASPKKYLDMYSQYMTDYQKQHPDLFYGDWADKPWRGTGEYYANISYMDEQVGKVLDKI 291

Query: 300 EQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSP 354
           +      N+I++F SDNG     A   + LN A     LRG K+ LWEGG+R   +I   
Sbjct: 292 KAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKYG 351

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSD--IPNYVNSTVENIIPRYENSILR 406
               +G+V +  V+  DWLPTL   AN  D  +P       ++++P  ++  L+
Sbjct: 352 KHIPQGMVTDTPVYGLDWLPTL---ANMMDFKLPTDRTYDGQSLVPLLKDKTLK 402


>gi|161504832|ref|YP_001571944.1| hypothetical protein SARI_02956 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:- str. RSK2980]
 gi|160866179|gb|ABX22802.1| hypothetical protein SARI_02956 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:-]
          Length = 497

 Score =  149 bits (375), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 119/392 (30%), Positives = 194/392 (49%), Gaps = 53/392 (13%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRS 112
            A++  P+++ ILADDLG+ D+  +G   + TPNID LA  G+   +YY    LC+PSR+
Sbjct: 25  AATAKRPNLVIILADDLGYGDLATYGHRIVKTPNIDKLAQEGVKFTDYYAPAPLCSPSRA 84

Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL---GFYKK 169
            ++TG+ P  TG++  +  G +   L  +E  +   LK+ GY T ++GK HL   G    
Sbjct: 85  GLLTGRMPFRTGIRSWIPEGKDV-ALGRNELTIANLLKQQGYDTAMMGKLHLNAGGDRTD 143

Query: 170 EYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWGLDMRRDLEP-AWDLHG-------KY 220
           +      GF+  L    G   D    +A+E   +G+     + P  W  +G       K 
Sbjct: 144 QPQAKDMGFDYTLVNPAGFVSDATLDNAKERPRYGM-----VHPTGWIRNGQHIGRADKM 198

Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH-RHIEDFKR- 278
           S +  ++E V+ + N   D P FLY+A    HS     PL +P  YL+++ +++ D+++ 
Sbjct: 199 SGEFVSSEVVNWLDNKKDDNPFFLYVAFTEVHS-----PLASPKKYLDMYSQYMTDYQKQ 253

Query: 279 -----------------SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG--- 318
                             ++ A +  +DE VGKV++ ++      N+II+F SDNG    
Sbjct: 254 HPDLFYGDWADKPWRGTGEYYANISYMDEQVGKVLDKIKAMGEEDNTIIIFTSDNGPVTR 313

Query: 319 -AAAGFNLNAASNWP-LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
            A   + LN A     LRG K+ LWEGG+R   +I       +G+V +  V+  DWLPTL
Sbjct: 314 EARKVYELNLAGETDGLRGRKDNLWEGGIRVPAIIKYGKHIPQGMVTDTPVYGLDWLPTL 373

Query: 377 LSAANKSD--IPNYVNSTVENIIPRYENSILR 406
              AN  D  +P       ++++P  ++  L+
Sbjct: 374 ---ANMMDFKLPTDRTYDGQSLVPLLKDKTLK 402


>gi|340617020|ref|YP_004735473.1| sulfatase [Zobellia galactanivorans]
 gi|339731817|emb|CAZ95082.1| Sulfatase, family S1-20 [Zobellia galactanivorans]
          Length = 452

 Score =  149 bits (375), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 108/340 (31%), Positives = 177/340 (52%), Gaps = 35/340 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+IIFILADDLG+ D+G +G + I TP+ID +A  G+    +Y    +C PSR+++MTG+
Sbjct: 22  PNIIFILADDLGYGDLGCYGQETISTPHIDRMASEGLRFTQHYAGATVCAPSRNSLMTGQ 81

Query: 119 HPIHTGMQHNVLYGCERGGLPL--SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           H     M +  +   E+   P+  S+  + + LK+ GYRT ++GKW LG          +
Sbjct: 82  H-----MGNTTIKSMEK---PIKDSDLTVAEVLKDAGYRTGVIGKWGLGNVGTSGYANAQ 133

Query: 177 GFESHLGYWT---GHQDYFDHSAEEMKMWGL-----------DMRRDLEPAWDLHGKYST 222
           GF+   GY+     H  Y D+  E  + + L           +    +  A     +YS 
Sbjct: 134 GFDYSFGYYDQIRAHNYYPDYLMENGEKFPLKNEVVYVSDSTNYAVGIGNAAIKKKEYSN 193

Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY-----EPLQAPDHYLNIHRHIEDFK 277
           D+FT +A+D I   +  EP FLYLA+   H+ N         ++ PD+ +         K
Sbjct: 194 DLFTEKALDFIKTDAK-EPFFLYLAYTIPHANNESFLINEHGMEVPDYGIYDKEPWPSEK 252

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNG-GAAAGFNLNAA-SNWPLRG 335
           ++  AA++ +LD  VG++++ L++  +  N++++F SDNG     G+ ++   SN PLRG
Sbjct: 253 KAG-AAMITRLDSYVGQILDLLKKNNLDGNTLVIFTSDNGPHQEGGWQVDYFDSNGPLRG 311

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
           +K  L+EGG+R   +   P    +G   +Q     D+LPT
Sbjct: 312 MKRDLYEGGIRVPFIAKWPGKIEKG-ETDQVATFWDFLPT 350


>gi|406836223|ref|ZP_11095817.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 456

 Score =  149 bits (375), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 112/328 (34%), Positives = 168/328 (51%), Gaps = 22/328 (6%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I++I+ADDLG +D+G +G   I TPNID LA  G      Y    +C PSRS +MTG 
Sbjct: 26  PNIVYIMADDLGPHDLGCYGQKLIQTPNIDKLAIEGTRYTQVYAGASVCAPSRSVLMTGL 85

Query: 119 HPIHTGMQHN-------VLYGCERGGLPL-SEKI-LPQYLKELGYRTRIVGKWHLGFYKK 169
           H  HT ++ N       +    E+G +PL SE + + + LK  GY T I GKW LG    
Sbjct: 86  HTGHTRVRGNHSKFGGVIGESGEKGRVPLRSEDVTVAEVLKRAGYSTGITGKWGLGEPHT 145

Query: 170 EYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD-LHGKYSTDVFTAE 228
              P  +GF+  LGY   +     HS     +W    R DL    D    +YS D+FT  
Sbjct: 146 NGVPNRKGFDQWLGYLNQN---VAHSYFPSALWLNGERFDLPGNSDGRRQQYSHDLFTGF 202

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
           A++ I  +  +EP FLY+     H+A      + PD      +      R ++A+++ ++
Sbjct: 203 ALNFIRANR-EEPFFLYVPFTIPHAA-----WEVPDLGPYADQTWPKGAR-EYASMVTRM 255

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           D  VG++V  L++ ++  N+I+ F SDNGG A  F     +N  LRG K  ++EGG+R  
Sbjct: 256 DADVGRIVALLDELKLRDNTIVFFCSDNGGPAP-FGETFQTNGALRGKKGQVYEGGLRVP 314

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTL 376
            +   P     G +++   + +D LPTL
Sbjct: 315 MIARWPGHVPAGKISDVPWYFADVLPTL 342


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.314    0.133    0.403 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 16,137,002,189
Number of Sequences: 23463169
Number of extensions: 785727486
Number of successful extensions: 11847664
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 16641
Number of HSP's successfully gapped in prelim test: 19134
Number of HSP's that attempted gapping in prelim test: 9335316
Number of HSP's gapped (non-prelim): 1141554
length of query: 905
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 753
effective length of database: 8,792,793,679
effective search space: 6620973640287
effective search space used: 6620973640287
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 82 (36.2 bits)